DISENTANGLING FROM SEQUENTIAL DATA · Advances in Neural Information Processing Systems. 2015.,...

DISENTANGLING FROM SEQUENTIAL DATA

SNU Datamining Laboratory2018. 6. 18 Seminar

Sungwon, Lyulyusungwon@dm.snu.ac.kr

REPRESENTATION LEARNING

• Representation Learning• Learning representation of the data that make it easier to extract useful

information when building classifiers or other predictors

Source: Bengio, Yoshua, Aaron Courville, and Pascal Vincent. "Representation learning: A review and new perspectives." IEEE transactions on pattern analysis and machine intelligence 35.8 (2013): 1798-1828.

(110, 5, 225, 5, 210, 10)

WHAT MAKES A REPRESENTATION GOOD?

• Smoothness: • Multiple Explanatory factors: Generalize many configurations of factors• A hierarchical organization of explanatory factors• Semi-supervised learning• Shared factors across tasks• Manifolds: Low meaningful dimensions• Natural clustering: Named, categorized• Temporal and Spatial coherence• Sparsity: insensitive to small variations of x

Source: Bengio, Yoshua, Aaron Courville, and Pascal Vincent. "Representation learning: A review and new perspectives." IEEE transactions on pattern analysis and machine intelligence 35.8 (2013): 1798-1828.

x ≈ ythenf(x) ≈ f(y)

DISENTANGLING

• DC-IGN• Clamping a part of the hidden units for a pair of data points that are known to match in

all but one factors of variation

• InfoGAN• Maximize the mutual information between a latent code c and x through the use of an

auxiliary distribution Q(c|x)

• Beta-Vae• Encourages the latent representation to be factorised by adding beta to VAE objective

Source: Kulkarni, Tejas D., et al. "Deep convolutional inverse graphics network." Advances in Neural Information Processing Systems. 2015., Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." Advances in Neural Information Processing Systems. 2016., Kulkarni, Tejas D., et al. "Deep convolutional inverse graphics network." Advances in Neural Information Processing Systems. 2015.

MOTIVATION

https://www.youtube.com/watch?v=naJZITvCfI4&feature=youtu.behttps://www.youtube.com/watch?v=VMX3IZYWYdg&feature=youtu.be

Hsu, Wei-Ning, Yu Zhang, and James Glass. "Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data." Advances in neural information processing systems. 2017.

• Factorized Hierarchical VAE• Sequence-level attributes

• Speaker identity, Character style• Latent sequence variable ( )

• Segment-level attributes• Phonetic content, Action• Latent segment variable ( )

• Notation

𝒟 = {X(i)}Mi=1, X(i) = {x(i,n)}N(i)

Z1 = {z(n)1 }, Z2 = {z(n)

• Generative model

• Equation

• Inference model

• Equation

• Factorising variational lower bound

• Segment Variational lower bound

• Discriminative objective

• Objetive function

IMPLEMENTATION

• Sequence to sequence autoencoder model

EXPERIMENT

• Speaker Verification • Automatic Speech Recognition

RELATED WORKS

Source: Li, Yingzhen, and Stephan Mandt. "A Deep Generative Model for Disentangled Representations of Sequential Data." arXiv preprint arXiv:1803.02991 (2018).

• A Deep Generative Model for Disentangled Representations of Sequential Data• Applied almost the same architecture to video

RELATED WORKS

Huang, Xun, et al. "Multimodal Unsupervised Image-to-Image Translation." arXiv preprint arXiv:1804.04732 (2018).

• Multimodal Unsupervised Image-to-Image Translation• Used Adversarial training to disentangle style and content feature

DISENTANGLING FROM SEQUENTIAL DATA · Advances in Neural Information Processing Systems. 2015.,...

Documents

Neural Interaction Transparency (NIT): Disentangling ...papers.nips.cc/paper/...nit-disentangling-learned... · Neural Interaction Transparency (NIT): Disentangling Learned Interactions

Mixture Density Generative Adversarial Networks ...openaccess.thecvf.com/content_CVPR_2019/... · Infogan: Interpretable representation learning by information maximizing generative

Taskonomy: Disentangling Task Transfer Learning

Towards Interpretable Face Recognition

Reproducible and Interpretable Spiculation Quantification

by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

Disentangling India’s Investment Slowdown

O LEARNING REPRESENTATIONS WITH L -B SUPERVISION · Infogan: Interpretable representation learning by information maximizing generative adversarial nets. In Advances in Neural Information

Recommending Transit: Disentangling users’ willingness to ...tram.mcgill.ca/Research/Publications/Recommending_transit.pdf · 1 Recommending Transit: Disentangling users’ willingness

Interpretable Nonnegative Matrix Decompositions

Understanding Minds: Disentangling ToM

Detach and Adapt Learning Cross-Domain Disentangled Deep ...aliensunmin.github.io/aii_workshop/1st/AII_(Frank_Wang).pdf · InfoGAN: Interpretable representation learning by information

Interpretable and Globally Optimal Prediction for Textual ...papers.nips.cc/paper/6787-interpretable-and... · Interpretable and Globally Optimal Prediction for Textual Grounding

Disentangling Blurring - Duke University

Interpretable Intuitive Physics Model - openaccess.thecvf.comopenaccess.thecvf.com/content_ECCV_2018/papers/Tian_Ye_Interpretable... · Interpretable Intuitive Physics Model 3 unseen

Interpretable Linear Modeling - Piazza

InfoGAIL: Interpretable Imitation Learning from Visual ...papers.nips.cc/paper/6971-infogail-interpretable-imitation-learning... · InfoGAIL: Interpretable Imitation Learning from

HISTOGRAM POOLING OPERATORS AN INTERPRETABLE …

DISENTANGLING THE SOURCES OF GROWTH

GIF: Generative Interpretable Faces