33
by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya Sutskever, Pieter Abbeel UC Berkeley, Department of Electrical Engineering and Computer Sciences OpenAI

by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya Sutskever, Pieter Abbeel

UC Berkeley, Department of Electrical Engineering and Computer Sciences

OpenAI

Page 2: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

Unsupervised learning of disentangled representation

Usually, learned representation is entangled

(encoded in complicated manner)

When representation is disentangled,

it would be easier to apply to tasks

Page 3: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

Disentangling information

man with glasses man without glasses woman

{ { {

Page 4: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

Supervised Learning Unsupervised Learning

“to learn is

to recognize”

“to learn is to

replicate”

Page 5: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 6: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

max log𝐷(𝑥) + log(1 − 𝐷(𝐺(𝑧))max log𝐷(𝐺(𝑧)

Page 7: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

where

𝐷 𝑥 =)𝑃𝑑𝑎𝑡𝑎(𝑥

)𝑃𝑑𝑎𝑡𝑎(𝑥) + 𝑃𝐺(𝑥

Page 8: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 9: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 10: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 11: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 12: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 13: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 14: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

Mutual information between latent code c and

generator distribution G(z, c) should be high

Page 15: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

measures the “amount of information” learned from knowledge

of random variable Y about the other random variable X

is the reduction of

uncertainty in X

when Y is observed

Page 16: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

Given 𝑥 𝑃𝐺 𝑥𝑃𝐺 с|𝑥 should have

small entropyProblem!

𝐼(𝑐; 𝐺(𝑧, 𝑐) )is hard to minimize

directly because of access

to the posterior 𝑃(𝑐|𝑥).

Page 17: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

𝐼 𝑐; 𝐺 𝑧, 𝑐 = 𝐻 𝑐 − 𝐻 𝑐 𝐺 𝑧, 𝑐

= 𝜠𝑥𝐺 𝑧,𝑐 𝜠𝑐′𝑃 𝑐|𝑥 log 𝑃 𝑐′ 𝑥 + 𝐻(𝑐)

= 𝜠𝑥𝐺 𝑧,𝑐 [𝐷𝐾𝐿(𝑃( |𝑥))||𝑄 𝑥 ) + 𝜠𝑐′𝑃 𝑐|𝑥 log 𝑄 𝑐′ 𝑥 ] + 𝐻(𝑐)≥ 0

≥ 𝜠𝑥𝐺 𝑧,𝑐 [𝜠𝑐′𝑃 𝑐|𝑥 log𝑄 𝑐′ 𝑥 ] + 𝐻(𝑐)Treat as a

constant

Page 18: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 19: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

Remind the lemma: 𝛦𝑥𝑋,𝑦𝑌|𝑥 𝑓 𝑥, 𝑦 = 𝛦𝑥𝑋,𝑦𝑌|𝑥,𝑥′𝑋|𝑦 𝑓(𝑥′, 𝑦)

𝐿𝐼 𝐺, 𝑄 = 𝛦𝑐𝑃(𝑐),𝑥𝐺(𝑧;𝑐) log𝑄(𝑐|𝑥) + 𝐻(𝑐)

= 𝛦𝑥𝐺(𝑧;𝑐) 𝛦𝑐′𝑃(𝑐|𝑥)[log𝑄 𝑐′ 𝑥 ] + 𝐻(𝑐)

≤ 𝐼 𝑐; 𝐺 𝑧, 𝑐

Page 20: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

In particular, 𝐿𝐼 can be maximized w.r.t. 𝑄 directly and w.r.t. 𝐺 via the

reparametrization trick.

for discrete latent codes, the bound becomes tight and

the maximal mutual information is achieved.

Page 21: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 22: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 23: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 24: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 25: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 26: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

Manipulating latent codes on MNIST

(a) Digit type (b) No clear meaning

(c) Rotation (d) Width

Page 27: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

(a) Pose (angle) (b) Elevation

(c) Lighting (d) Width

Manipulating latent codes on 3D Faces

Page 28: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

Manipulating latent codes on 3D Chairs

(a) Rotation (b) Width

Page 29: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

Manipulating latent codes on SVHN

(a) Continuous code (b) Discrete code

Page 30: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 31: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 32: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 33: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation