
Lipschitz Generative Adversarial Nets

Zhiming Zhou1, Jiadong Liang2, Lantao Yu3, Yong Yu1, Zhihua Zhang2

1 Shanghai Jiao Tong University

2 Peking University

3 Stanford University

ICML, 2019


Generalized Formulation for GANs

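$$
\begin{aligned}
&\min_{f \in \mathcal{F}} \; \mathbb{E}_{z \sim P_z}[\phi(f(g(z)))] + \mathbb{E}_{x \sim P_r}[\psi(f(x))], \\
&\min_{g \in \mathcal{G}} \; \mathbb{E}_{z \sim P_z}[\varphi(f(g(z)))].
\end{aligned}
\tag{1}
$$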

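$P_z$: the source distribution in $\mathbb{R}^{n_z}$;

$P_r$: the target (real) distribution in $\mathbb{R}^{n_r}$;

$g$: the generative function $\mathbb{R}^{n_z} \to \mathbb{R}^{n_r}$;

$f$: the discriminative function $\mathbb{R}^{n_r} \to \mathbb{R}$;

$\mathcal{G}$: the generative function space;

$\mathcal{F}$: the discriminative function space;

$\phi, \psi, \varphi : \mathbb{R} \to \mathbb{R}$ are loss metrics.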

GAN: $\phi(x) = \psi(-x) = -\log(\sigma(-x))$

WGAN: $\phi(x) = \psi(-x) = x$

LSGAN: $\phi(x) = \psi(-x) = (x + \alpha)^2$
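The three loss metrics listed above can be written out directly. The following is a minimal sketch using only the Python standard library; the function names and the default `alpha = 1.0` are illustrative assumptions, not taken from the slides.

```python
import math

def sigmoid(x: float) -> float:
    # Numerically stable logistic function sigma(x).
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)

def phi_gan(x: float) -> float:
    # phi(x) = -log(sigma(-x)) = log(1 + e^x), i.e. the softplus function,
    # evaluated in a form that avoids overflow for large |x|.
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))

def phi_wgan(x: float) -> float:
    # phi(x) = x
    return x

def phi_lsgan(x: float, alpha: float = 1.0) -> float:
    # phi(x) = (x + alpha)^2; alpha = 1.0 is an assumed default.
    return (x + alpha) ** 2

def psi_from_phi(phi):
    # All three instances share the relation psi(x) = phi(-x).
    return lambda x: phi(-x)

psi_gan = psi_from_phi(phi_gan)
psi_wgan = psi_from_phi(phi_wgan)
psi_lsgan = psi_from_phi(phi_lsgan)
```

Deriving each `psi` from its `phi` keeps the relation between the two in one place instead of repeating it per loss.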

We denote the distribution of generated samples by $P_g$.

The Gradient Uninformativeness

Gradient uninformativeness is the problem that the gradient from the discriminator does not contain any information about the real distribution.

It offers a new perspective on the training instability and convergence issues of GANs.

For GANs with unrestricted $\mathcal{F}$:

$$
f^*(x) = \arg\min_{f(x) \in \mathbb{R}} \; P_g(x)\,\phi(f(x)) + P_r(x)\,\psi(f(x)), \quad \forall x. \tag{2}
$$

$f^*(x)$ is defined independently at each point $x$ and reflects only the local densities $P_r(x)$ and $P_g(x)$;

$\nabla_x f^*(x)$ does not reflect any information about the other distribution if the supports of the two distributions are disjoint.
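To make the pointwise nature of the optimum concrete, the sketch below works through the LSGAN case. Setting the derivative of P_g(x)(t + α)² + P_r(x)(t − α)² with respect to t to zero gives f*(x) = α (P_r(x) − P_g(x)) / (P_r(x) + P_g(x)). The uniform densities, the interval endpoints, and the function names are illustrative assumptions.

```python
def uniform_pdf(lo: float, hi: float):
    # Density of a uniform distribution on [lo, hi].
    return lambda x: 1.0 / (hi - lo) if lo <= x <= hi else 0.0

def f_star_lsgan(x, p_g, p_r, alpha=1.0):
    # Closed-form minimiser over t of p_g(x)*(t + alpha)^2 + p_r(x)*(t - alpha)^2.
    pg, pr = p_g(x), p_r(x)
    if pg + pr == 0.0:
        return None  # the pointwise objective is constant here, so f*(x) is arbitrary
    return alpha * (pr - pg) / (pr + pg)

# Generated samples live on [0, 1]; the real data sits on a disjoint interval.
p_g = uniform_pdf(0.0, 1.0)
p_r_near = uniform_pdf(2.0, 3.0)   # real data close to the generated data
p_r_far = uniform_pdf(5.0, 6.0)    # real data moved further away

xs = [0.1, 0.5, 0.9]               # points inside the support of P_g
near = [f_star_lsgan(x, p_g, p_r_near) for x in xs]
far = [f_star_lsgan(x, p_g, p_r_far) for x in xs]
```

On the support of P_g the optimum equals −α at every point in both settings, so it is flat there and is unchanged when the real data moves: its gradient carries no information about where the real distribution lies.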

The Gradient Uninformativeness

Unrestricted GANs MUST suffer from this problem

Restricted GANs May suffer from this problem

GANs with W-Distance May suffer from this problem

Lipschitz GANs DO NOT suffer from this problem

Lipschitz Generative Adversarial Nets (LGANs)

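$$
\begin{aligned}
&\min_{f \in \mathcal{F}} \; \mathbb{E}_{z \sim P_z}[\phi(f(g(z)))] + \mathbb{E}_{x \sim P_r}[\psi(f(x))] + \lambda \cdot k(f)^2, \\
&\min_{g \in \mathcal{G}} \; \mathbb{E}_{z \sim P_z}[\varphi(f(g(z)))].
\end{aligned}
\tag{3}
$$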

We require $\phi$ and $\psi$ to satisfy:

$$
\begin{cases} \phi'(x) > 0, \\ \phi''(x) \ge 0, \end{cases} \quad \text{and} \quad \phi(x) = \psi(-x). \tag{4}
$$

That is, $\phi$ can be any increasing function with a non-decreasing derivative.

$k(f)$: the Lipschitz constant of $f$.
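As a rough illustration of the discriminator objective with the Lipschitz penalty, the sketch below evaluates it for a one-dimensional f on finite samples. Several choices here are assumptions and not from the slides: softplus as the default φ, the default λ = 1, and above all the estimate of k(f) as the largest pairwise slope over the samples, which only lower-bounds the true Lipschitz constant.

```python
import math
from itertools import combinations

def softplus(x: float) -> float:
    # log(1 + e^x), computed stably; increasing and convex.
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))

def lipschitz_lower_bound(f, points):
    # Largest slope |f(a) - f(b)| / |a - b| over all pairs of sample points.
    # Assumption: this is only a sample-based lower bound on k(f).
    return max(
        (abs(f(a) - f(b)) / abs(a - b)
         for a, b in combinations(points, 2) if a != b),
        default=0.0,
    )

def lgan_discriminator_loss(f, fake, real, lam=1.0, phi=softplus):
    # Empirical version of the discriminator objective:
    # mean phi(f(fake)) + mean psi(f(real)) + lam * k(f)^2, with psi(t) = phi(-t).
    psi = lambda t: phi(-t)
    fake_term = sum(phi(f(x)) for x in fake) / len(fake)
    real_term = sum(psi(f(x)) for x in real) / len(real)
    k = lipschitz_lower_bound(f, list(fake) + list(real))
    return fake_term + real_term + lam * k ** 2

# Hypothetical usage: a linear f with slope 2 on toy samples.
loss = lgan_discriminator_loss(lambda x: 2.0 * x, fake=[0.0, 1.0], real=[2.0, 3.0])
```

For a network-valued f in higher dimensions the pairwise slope would typically be replaced by a gradient-norm estimate; that substitution is outside what this sketch shows.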

Lipschitz Generative Adversarial Nets (LGANs)

Theoretically guaranteed properties:

The optimal discriminative function f ∗ exists;

If φ is strictly convex, then f ∗ is unique;

There exists a unique Nash equilibrium where Pr = Pg and k(f ∗) = 0;

LGANs do not suffer from gradient uninformativeness;

For each generated sample, the gradient directly points towards a real sample.
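The uniqueness property in the list above depends on strict convexity of φ, which is stronger than the requirement φ''(x) ≥ 0. The sketch below checks both conditions on a grid for the three loss metrics from the generalized formulation, using their analytic derivatives. The grid range, α = 1, and the dictionary keys are assumptions, and a grid check is evidence on that interval only, not a proof.

```python
import math

def sigmoid(x: float) -> float:
    # Numerically stable logistic function.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)

# Each entry maps a loss to its analytic (phi', phi'').
CANDIDATES = {
    "wgan": (lambda x: 1.0, lambda x: 0.0),                           # phi(x) = x
    "gan": (sigmoid, lambda x: sigmoid(x) * (1.0 - sigmoid(x))),      # phi(x) = log(1 + e^x)
    "lsgan": (lambda x: 2.0 * (x + 1.0), lambda x: 2.0),              # phi(x) = (x + 1)^2
}

def check(d1, d2, grid):
    # First flag: phi' > 0 and phi'' >= 0 at every grid point.
    # Second flag: phi'' > 0 at every grid point (strict convexity on the grid).
    increasing_and_convex = all(d1(x) > 0 and d2(x) >= 0 for x in grid)
    strictly_convex = all(d2(x) > 0 for x in grid)
    return increasing_and_convex, strictly_convex

grid = [i / 10.0 for i in range(-50, 51)]   # points in [-5, 5]
results = {name: check(d1, d2, grid) for name, (d1, d2) in CANDIDATES.items()}
```

On this grid the linear loss is increasing and convex but not strictly convex, the softplus loss satisfies both flags, and the squared loss fails the monotonicity requirement because its derivative is negative for x < −1.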

Experiments: Gradient Uninformativeness

In practice, gradient uninformativeness manifests as noisy gradients.

[Figure: (a) disjoint case; (b) overlapping case.]

Experiments: ∇x f∗(x) in LGANs

$\nabla_x f^*(x)$ points directly towards real samples.

Experiments: f ∗ is Unique

Uniqueness of f ∗ leads to stabilized discriminative functions.

[Figure: (a) $f(x)$ in WGAN; (b) $f(x)$ in LGANs.]

Experiments: Unsupervised Image Generation

LGANs, with different choices of φ(x), consistently outperform WGAN.

[Figure: (a) training curves on CIFAR; (b) training curves on Tiny.]

Summary

Poster: this evening, #19

Gradient uninformativeness

Unrestricted GANs MUST suffer from this problem

Restricted GANs May suffer from this problem

GANs with W-Distance May suffer from this problem

Lipschitz GANs DO NOT suffer from this problem

Lipschitz GANs:

Penalize the Lipschitz constant of $f$;

Set $\phi(x)$ to be any increasing function with a non-decreasing derivative;

If $\phi$ is strictly convex, then $f^*$ is unique;

The gradients point directly towards real samples.
