
Generative Adversarial Networks (GANs)

Based on:

Generative Adversarial Networks (GANs), Ian Goodfellow. NIPS, 2016

04/12/2017 Anthony Ortiz

What are some recent and potentially upcoming breakthroughs in deep learning?


The most important one, in my opinion, is adversarial training (also called GAN for Generative Adversarial Networks)… This, and the variations that are now being proposed, is the most interesting idea in the last 10 years in ML, in my opinion.

Generative Modeling


Why is it important to study GANs?

• Excellent test of our ability to use high-dimensional, complicated probability distributions

• Simulate possible futures for planning or simulated RL

• Missing data

• Semi-supervised learning

• Multi-modal outputs

• Realistic generation tasks


Sample Generation


Next Frame Prediction


(Lotter et al 2016)

Next Frame Prediction


Super-Resolution


(Ledig et al 2016)

iGAN


(Zhu et al 2016)

Image to Image Translation


How do GANs work?


Maximum Likelihood
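For reference (the slide's own equation is not reproduced in this transcript), the maximum likelihood principle picks the model parameters that maximize the expected log-likelihood of the training data:

\theta^{*} = \arg\max_{\theta} \; \mathbb{E}_{x \sim p_{\text{data}}} \left[ \log p_{\text{model}}(x; \theta) \right]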


Generative Models’ Taxonomy


Fully Visible Belief Networks

• Explicit formula based on chain rule (written out just after this list):

Disadvantages:

• O(n) sample generation cost

• Generation not controlled by a latent code
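The chain-rule factorization the first bullet refers to, reconstructed here in its standard form rather than copied from the slide:

p_{\text{model}}(x) = \prod_{i=1}^{n} p_{\text{model}}\left(x_i \mid x_1, \dots, x_{i-1}\right)

Each factor conditions one dimension of x on all previous dimensions, so drawing a sample requires n sequential steps, which is the O(n) generation cost noted above.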


WaveNet


• Amazing quality
• Sample generation is slow: two minutes to synthesize one second of audio

GANs

• Use a latent code

• Asymptotically consistent (unlike variational methods)

• No Markov chains needed

• Often regarded as producing the best samples, though there is no good way to quantify this


Adversarial Nets Framework


Generator Network

• Must be differentiable

• No invertibility requirement

• Trainable for any size of z

• Some guarantees require z to have higher dimension than x

• Can make x conditionally Gaussian given z but need not do so
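In symbols, and stated here as the standard GAN setup rather than anything specific to this deck: draw a latent code z from a fixed prior (commonly uniform or Gaussian) and push it through the differentiable generator network,

z \sim p(z), \qquad x = G(z; \theta^{(G)})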


Training Procedure

• Use an SGD-like algorithm of choice (e.g. Adam) on two minibatches simultaneously:

• A minibatch of training examples

• A minibatch of generated samples

• Optional: run k steps of one player for every step of the other player.
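A sketch of the corresponding minibatch updates, following Algorithm 1 of Goodfellow et al. 2014 (m is the minibatch size; the gradients are applied with the chosen optimizer, e.g. Adam):

Discriminator step (gradient ascent):
\nabla_{\theta^{(D)}} \, \frac{1}{m} \sum_{i=1}^{m} \left[ \log D(x^{(i)}) + \log\left(1 - D(G(z^{(i)}))\right) \right]

Generator step (gradient descent):
\nabla_{\theta^{(G)}} \, \frac{1}{m} \sum_{i=1}^{m} \log\left(1 - D(G(z^{(i)}))\right)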


Minimax Game
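The value function of the minimax game, as given in Goodfellow et al. 2014 (the slide's equation is not reproduced in this transcript):

\min_{G} \max_{D} V(D, G) = \mathbb{E}_{x \sim p_{\text{data}}}\left[\log D(x)\right] + \mathbb{E}_{z \sim p_{z}}\left[\log\left(1 - D(G(z))\right)\right]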


Discriminator Strategy
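For a fixed generator, the optimal discriminator (the standard result this slide illustrates) is

D^{*}(x) = \frac{p_{\text{data}}(x)}{p_{\text{data}}(x) + p_{\text{model}}(x)}

so the discriminator effectively estimates the ratio between the data density and the model density at each point x.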


Non-Saturation Game
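In the non-saturating game (stated here following the tutorial, since the slide's equations are not in this transcript), the discriminator loss is unchanged, but the generator maximizes \log D(G(z)) instead of minimizing \log(1 - D(G(z))):

J^{(G)} = -\tfrac{1}{2} \, \mathbb{E}_{z}\left[\log D(G(z))\right]

This keeps the generator's gradient strong even when the discriminator confidently rejects its samples.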


DCGAN Architecture


Is the divergence important?


Modifying GANs to do Maximum Likelihood


Loss does not seem to explain why GAN samples are sharp


Hint: The approximation strategy matters more than the loss

Labels improve subjective sample quality


Implementation, tips and tricks


GAN for MNIST using TensorFlow

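The slide's code is not reproduced in this transcript; below is a minimal sketch of the kind of fully connected generator and discriminator such an MNIST example typically defines, in TensorFlow 1.x style. The layer sizes, variable scopes, and names (Z, X, G_sample, D_real, D_fake, and the logits) are assumptions for illustration, not copied from the slide.

import tensorflow as tf

# Latent codes and (flattened) 28x28 MNIST images.
Z = tf.placeholder(tf.float32, shape=[None, 100])
X = tf.placeholder(tf.float32, shape=[None, 784])

def generator(z):
    # Two-layer fully connected generator: latent code -> pixel intensities in [0, 1].
    with tf.variable_scope('generator'):
        h = tf.layers.dense(z, 128, activation=tf.nn.relu)
        return tf.layers.dense(h, 784, activation=tf.nn.sigmoid)

def discriminator(x, reuse=False):
    # Two-layer fully connected discriminator: image -> probability that it is real.
    with tf.variable_scope('discriminator', reuse=reuse):
        h = tf.layers.dense(x, 128, activation=tf.nn.relu)
        logit = tf.layers.dense(h, 1)
        return tf.nn.sigmoid(logit), logit

G_sample = generator(Z)
D_real, D_logit_real = discriminator(X)
D_fake, D_logit_fake = discriminator(G_sample, reuse=True)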

Training GAN


Training GAN

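The loss definitions that the next two bullets discuss, continuing the sketch above (reconstructed in the spirit of the deck rather than copied from the slide):

# Both objectives are to be maximized in the original formulation,
# so they are negated here because TensorFlow optimizers minimize.
D_loss = -tf.reduce_mean(tf.log(D_real) + tf.log(1. - D_fake))
G_loss = -tf.reduce_mean(tf.log(D_fake))  # non-saturating generator loss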

• Above, we use a negative sign for the loss functions because they need to be maximized, whereas TensorFlow's optimizers can only minimize.

• Also, as per the paper's suggestion, it's better to maximize tf.reduce_mean(tf.log(D_fake)) instead of minimizing tf.reduce_mean(tf.log(1. - D_fake)) in the algorithm above.

Training GAN

• Then we alternate training the two networks, one step at a time, using the adversarial loss functions defined above; a sketch of the loop follows.

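A sketch of the alternating updates, continuing the code above; the optimizer, batch size, number of steps, and the use of the TF 1.x MNIST helper are assumptions for illustration:

import numpy as np
from tensorflow.examples.tutorials.mnist import input_data

mnist = input_data.read_data_sets('MNIST_data')

# Each optimizer updates only its own player's variables.
D_vars = [v for v in tf.trainable_variables() if v.name.startswith('discriminator')]
G_vars = [v for v in tf.trainable_variables() if v.name.startswith('generator')]
D_solver = tf.train.AdamOptimizer().minimize(D_loss, var_list=D_vars)
G_solver = tf.train.AdamOptimizer().minimize(G_loss, var_list=G_vars)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    for step in range(100000):
        X_batch, _ = mnist.train.next_batch(128)
        Z_batch = np.random.uniform(-1., 1., size=[128, 100])
        # One discriminator step, then one generator step (k = 1).
        sess.run(D_solver, feed_dict={X: X_batch, Z: Z_batch})
        sess.run(G_solver, feed_dict={Z: Z_batch})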

Training process by sampling G(z)


We are done!

One-sided label smoothing
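Concretely (per the tutorial), the target for real examples is softened from 1 to a value such as 0.9, while the target for generated examples stays at 0. With a cross-entropy discriminator loss and the logits from the sketch above, this looks roughly like:

# Real examples get target 0.9 instead of 1.0; generated examples keep target 0.
D_loss_real = tf.reduce_mean(tf.nn.sigmoid_cross_entropy_with_logits(
    logits=D_logit_real, labels=0.9 * tf.ones_like(D_logit_real)))
D_loss_fake = tf.reduce_mean(tf.nn.sigmoid_cross_entropy_with_logits(
    logits=D_logit_fake, labels=tf.zeros_like(D_logit_fake)))
D_loss_smoothed = D_loss_real + D_loss_fake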


Benefits of label smoothing


Batch Norm


Batch norm in G can cause strong intra-batch correlation


Balancing G and D


Research Frontiers


Non-convergence


Non-convergence in GANs


Problems with Counting


Problems with Perspective


Problems with Global Structure


Evaluation


Plug and Play Generative Models

• New state of the art generative model (Nguyen et al 2016)

• Generates 227x227 realistic images from all ImageNet classes

• Combines adversarial training, moment matching, denoising autoencoders, and Langevin sampling


PPGN Models


(Nguyen et al 2016)

Conclusions

• GANs are generative models that use supervised learning to estimate an intractable cost function

• GANs allow a model to learn that there are many correct answers

• GANs can simulate many cost functions, including the one used for maximum likelihood

• Adversarial training can be useful for people as well as machine learning models
