40
SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION USING QUANTUM D-WAVE 2X MACHINE Nga Nguyen 1 , Amy Larson 1 , Carleton Coffrin 2 , and Garrett Kenyon 1, 3 1 CCS-3, 2 A-1, Los Alamos National Laboratory & 3 New Mexico Consortium D-Wave Debrief, LANL, April 27, 2017

SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

  • Upload
    others

  • View
    14

  • Download
    0

Embed Size (px)

Citation preview

Page 1: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION USING

QUANTUM D-WAVE 2X MACHINE

Nga Nguyen1, Amy Larson1, Carleton Coffrin2, and Garrett Kenyon1,3 1CCS-3, 2A-1, Los Alamos National Laboratory & 3New Mexico Consortium

D-Wave Debrief, LANL, April 27, 2017

Page 2: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

OUTLINE

A. SPARSE CODING REPRESENTATIONS B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D. SUMMARY AND FUTURE WORK

Page 3: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

OUTLINE

A. SPARSE CODING REPRESENTATIONS B. IMPLEMENTATION ON A D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D. SUMMARY AND FUTURE WORK

Page 4: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

A. SPARSE CODING REPRESENTATIONS

Solving a sparse-coding (SC) problem

Lp-sparseness penalty

• non-convex problem • NP-hard class

reconstruction errorOlshausen and Field, Nature 381, 607 (1996)

Rozell, Johnson, Baraniuk, and Olshausen, Neur. Comp. 20, 2526 (2008)

p=0, the problem is called L0-norm

courtesy of D-Wave

Objective function is of the form:

Page 5: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

Solving a sparse-coding (SC) problem

Objective function is of the form:

Lp-sparseness penaltyreconstruction errorOlshausen and Field, Nature 381, 607 (1996)

Rozell, Johnson, Baraniuk, and Olshausen, Neur. Comp. 20, 2526 (2008)

courtesy of Xinhua Zhang

an example of SC reconstruction

A. SPARSE CODING REPRESENTATIONS

* =

Page 6: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

OUTLINE

A. SPARSE CODING REPRESENTATIONS B. IMPLEMENTATION ON A D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D. SUMMARY AND FUTURE WORK

Page 7: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

D-Wave Hamiltonian:

where

mapping the sparse-coding problem onto a Quantum Unconstrained Binary Optimization (QUBO):

SC ON A QUANTUM D-WAVE MACHINE

Page 8: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

This mapping is achieved by the relations:

analogous to L0-sparseness penalty [Nguyen and Kenyon, PMES-16 (2016)]

SC ON A QUANTUM D-WAVE MACHINE

mapping the sparse-coding problem onto a Quantum Unconstrained Binary Optimization (QUBO):

D-Wave Hamiltonian:

where

Page 9: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

OUTLINE

A. SPARSE CODING ON A QUANTUM D-WAVE B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D. SUMMARY AND FUTURE WORK

Page 10: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

DATASET32x32

24x24

airplane

automobile

ship

truck

CIF

AR

-10

edge

detection

Page 11: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

8 hand-designed features

“row” “column”

orthogonality!number of features

Features

Page 12: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

8 hand-designed features

“row” “column”

orig

reco

n

Features

orthogonality!number of features

Page 13: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

Apply Gram-Schmidt Algorithm:

Desire: Randomly generated :

• to fulfill the Chimera orthogonality

• the way is generated defines architecture of the mapping

Features

Page 14: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

Building features

Page 15: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

Building features

Page 16: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

24x24 patch images

orig

inal

reco

n8x

12x1

2re

con

32x6

x6

airplane automobile ship truck8 and 32 features

Page 17: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

24x24 patch images

orig

inal

reco

n8x

12x1

2re

con

32x6

x6

airplane automobile ship truck8 and 32 features

1100 active qubits 3068 coupling strengths

overcomplete order:

stride: 2, 4

Page 18: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

orig

inal

reco

n11

52X1

X1re

con

32x6

x6

airplane automobile ship truck

1100 active qubits 3068 coupling strengths

overcomplete order:

32 and 1152 features

stride: 24, 4

24x24 patch images

Page 19: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

1 2 3 4 5-2309.2

-2309.0

-2308.8

-2308.6

-2308.4

-2308.2

ensemble

energy

1 2 3 4 5-1738.6

-1738.4

-1738.2

-1738.0

-1737.8

ensemble

energy

image B

image C

0 20 40 60 80 100-2500-2000-1500-1000

0 200 400 600 800 1000

-3000

-2500

-2000

-1500

-1000

-500

0

images

GSEnergy

ensembles

ensembles

Energy

1 2 3 4 5-2073.0

-2072.8

-2072.6

-2072.4

-2072.2

-2072.0

ensemble

energy

image A

ensembles

24x24Nf = 288

Page 20: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

airplane automobile ship truckre

con

orig

inal

overcomplete order : 8

stride: 2

12x12 patch images

CLASSIFICATION RESULTS

Page 21: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

airplane automobile ship truckre

con

orig

inal

overcomplete order : 8

stride: 2

CLASSIFICATION RESULTS

classes air auto bird cat deer dog frog horse ship truck

accur. (binary)

89.21% 93.38% 90.87% 89.42% 94.71% 88.94% 87.98% 89.9% 89.9% 85.58%

Classification task: SVM (liblinear) 1042 training/208 test images

Nguyen and Kenyon, PMES-16 (2016)

12x12 patch images

Page 22: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

COMPARISON WITH A CLASSICAL SOLVER

So far, quantum computation (D-Wave 2X) has NOT outperformed its classical counterpart (GUROBI). Both are comparable.

We already made the problem hard. We need to make it harder.

How can we make the SC problem harder for both?

Page 23: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

From SC perspective: more overcomplete, harder to solve… Meanwhile: The full Chimera in D-Wave offers a certain set of (nearest-neighbor) connectivity…

COMPARISON WITH A CLASSICAL SOLVER

Page 24: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

EMBEDDING technique

COMPARISON WITH A CLASSICAL SOLVER

From SC perspective: more overcomplete, harder to solve… Meanwhile: The full Chimera in D-Wave offers a certain set of (nearest-neighbor) connectivity…

Page 25: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

EMBEDDING technique•Embedding exploits the ability to tie

qubits together •Employ all bipartite couplings•Small number of nodes (qubits) but

more couplings for neurons

COMPARISON WITH A CLASSICAL SOLVER

From SC perspective: more overcomplete, harder to solve… Meanwhile: The full Chimera in D-Wave offers a certain set of (nearest-neighbor) connectivity…

Page 26: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

EMBEDDING technique5x5

COMPARISON WITH A CLASSICAL SOLVER

From SC perspective: more overcomplete, harder to solve… Meanwhile: The full Chimera in D-Wave offers a certain set of (nearest-neighbor) connectivity…

Page 27: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

EMBEDDING technique5x5

In practice (D-Wave 2X): Fully connected: 48, 49 nodes on DW2X and DW2X_VFYC, respectively Partially orthogonal: 72 nodes Feature optimization!

COMPARISON WITH A CLASSICAL SOLVER

From SC perspective: more overcomplete, harder to solve… Meanwhile: The full Chimera in D-Wave offers a certain set of (nearest-neighbor) connectivity…

Page 28: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

STARTING TO SEE SOMETHING GOOD…

solver

problem

72 nodes: partially

Chimera-orthogonal

Energy Time

~ 300 seconds

Energy Time

< 60 seconds

Energy Time

-48.476 30 min

Energy Time

-51.294 few seconds

No. of Hamiltonians: 1

GUROBI (best classical solver)

D-Wave 2X (ISING)

-27.84 -27.84

COMPARISON WITH A CLASSICAL SOLVER

47 nodes: fully connected

Page 29: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

STARTING TO SEE SOMETHING GOOD…

47 nodes: fully connected

70 nodes: partially

Chimera-orthogonal

Energy Time

~ 300 seconds

Energy Time

< 60 seconds

Energy Time

~2000 seconds

Energy Time

< 60 seconds

No. of Hamiltonians: 1

solver

problem

GUROBI (best classical solver)

D-Wave 2X (ISING)

-27.84 -27.84

COMPARISON WITH A CLASSICAL SOLVER

-43.251 -43.251

Page 30: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

{ given a set of neuron activity generated by D-Wave 2X, do:

end }

5x5

Feature Learning (in progress)before… feature optimization

Stochastic gradient descent

for iteration for mini_batch %[1:size(sampling)] %update weights end end

Page 31: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

5x5

Feature Learning (in progress)before… …after

5x5

many “lazy” features

Page 32: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

…THE UNEXPECTEDImprinting technique

randomly sampled imprinting features

GENERATING FEATURES

Page 33: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

Does this enhance the “hardness”?

…THE UNEXPECTEDImprinting technique

randomly generated features randomly sampled imprinting features

GENERATING FEATURES

Page 34: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

…THE GREAT! UNEXPECTEDImprinting technique

Energy Time

solver

problem

GUROBI (best classical solver)

D-Wave 2X (ISING)

Energy Time

< 60 seconds

-129.533 -131.14(cutoff)

~ 9 hours

Sparse coding

Page 35: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

Feature learning

…THE GREAT! UNEXPECTEDImprinting technique

Page 36: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

Feature learning

…THE GREAT! UNEXPECTEDImprinting technique

100% adaptive features

Page 37: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

OUTLINE

A. SPARSE CODING ON A QUANTUM D-WAVE B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK

Page 38: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

D. SUMMARY

first demonstration of sparse coding using a quantum computermapping of visual features to D-Wave 2X Chimerabenchmark results on standard image classification taskcompare D-Wave 2X performance with GUROBI obtained solutions to the problems where D-Wave 2X significantly outperforms GUROBI

Page 39: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

CIF

AR

-10 airplane

automobile

ship

truck

32x32

30x30

edge color

work in progress…

Page 40: SPARSE REPRESENTATIONS FOR IMAGE CLASSIFICATION … · B. IMPLEMENTATION ON D-WAVE MACHINE C. SPARSE CODING FOR OBJECT DETECTION D.SUMMARY AND FUTURE WORK. D. SUMMARY first demonstration

D. (IN PROGRESS &) FUTURE WORK • optimize features • add colors • hierarchy model • TrueNorth comparison