discreteness in generative models 25... · multiple data inputs and multiple data outputs input...

49
Discreteness in Generative Models -- Discrete Sequence Generation Hao Dong Peking University 1

Upload: others

Post on 15-Oct-2020

11 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

Discreteness in Generative Models-- Discrete Sequence Generation

Hao Dong

Peking University

1

Page 2: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

• Introduction• RNN Language Modelling• Generating Sentences from a Continuous Space• Recap: Inverse RL vs. GAN

• GAN+RL• SeqGAN• RankGAN• MaskGAN

• GANs• Adversarial Text Generation without Reinforcement Learning• GANs for Sequences Generation with the Gumbel-softmax

2

Discreteness in Generative Models – Discrete Sequence Generation

Page 3: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

• Introduction• RNN Language Modelling• Generating Sentences from a Continuous Space• Recap: Inverse RL vs. GAN

• GAN+RL• SeqGAN• RankGAN• MaskGAN

• GANs• Adversarial Text Generation without Reinforcement Learning• GANs for Sequences Generation with the Gumbel-softmax

3

Page 4: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

4

RNN Language Modelling

• Recap: Recurrent Neural Network

Multiple inputs and multiple outputs

synchronous(simultaneous)

Input

Hiddenstate

Output

asynchronous(Seq2Seq)

Page 5: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

this is a bird

is a bird <EOS>

For testing, input “this,” output the entire sentence

• MLEtraining

• Theoutputofeachstepisequaltotheinputofitsnextstep.

RNN Language Modelling

• Synchronous Many-to-Many

Page 6: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

RNN Language Modelling

• Limitations

• 1. No latent space – Representation learning

• 2. Exposure bias – Long sentence generation problem

Page 7: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

• Introduction• RNN Language Modelling• Generating Sentences from a Continuous Space• Recap: Inversed RL vs. GAN

• GAN+RL• SeqGAN• RankGAN• MaskGAN

• GANs• Adversarial Text Generation without Reinforcement Learning• GANs for Sequences Generation with the Gumbel-softmax

7

Page 8: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

8

Generating Sentences from a Continuous Space

• Why Continuous Space

Generating Sentences from a Continuous Space. Bowman, Samuel R. Vilnis, Luke. Vinyals, Oriol. Dai, Andrew. Jozefowicz, Rafal. Bengio, Samy. Arxiv 2015.

Page 9: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

9

Generating Sentences from a Continuous Space

• VAE Approach

Generating Sentences from a Continuous Space. Bowman, Samuel R. Vilnis, Luke. Vinyals, Oriol. Dai, Andrew. Jozefowicz, Rafal. Bengio, Samy. Arxiv 2015.

Multiple data inputs and multiple data outputs

Input

Hiddenstate

Output

asynchronous(Seq2Seq)

VAE reparametric trick here

Page 10: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

10

Generating Sentences from a Continuous Space

• VAE Approach: Training

Generating Sentences from a Continuous Space. Bowman, Samuel R. Vilnis, Luke. Vinyals, Oriol. Dai, Andrew. Jozefowicz, Rafal. Bengio, Samy. Arxiv 2015.

p(z|X)

Page 11: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

11

Generating Sentences from a Continuous Space

• VAE Approach: Testing

Generating Sentences from a Continuous Space. Bowman, Samuel R. Vilnis, Luke. Vinyals, Oriol. Dai, Andrew. Jozefowicz, Rafal. Bengio, Samy. Arxiv 2015.

p(z)

Page 12: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

12

Generating Sentences from a Continuous Space

• Application

Impute missing words within sentences.

Generating Sentences from a Continuous Space. Bowman, Samuel R. Vilnis, Luke. Vinyals, Oriol. Dai, Andrew. Jozefowicz, Rafal. Bengio, Samy. Arxiv 2015.

Page 13: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

13

Generating Sentences from a Continuous Space

• Application

Results from the mean of the posterior distribution and samples from that distribution.

Generating Sentences from a Continuous Space. Bowman, Samuel R. Vilnis, Luke. Vinyals, Oriol. Dai, Andrew. Jozefowicz, Rafal. Bengio, Samy. Arxiv 2015.

Page 14: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

14

Generating Sentences from a Continuous Space

• Application

Interpolation between two points in the VAE manifold.

Generating Sentences from a Continuous Space. Bowman, Samuel R. Vilnis, Luke. Vinyals, Oriol. Dai, Andrew. Jozefowicz, Rafal. Bengio, Samy. Arxiv 2015.

Page 15: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

15

Generating Sentences from a Continuous Space

• Limitation

Difficult to generate long sentences

Generating Sentences from a Continuous Space. Bowman, Samuel R. Vilnis, Luke. Vinyals, Oriol. Dai, Andrew. Jozefowicz, Rafal. Bengio, Samy. Arxiv 2015.

Page 16: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

• Introduction• RNN Language Modelling• Generating Sentences from a Continuous Space• Recap: Inverse RL vs. GAN

• GAN+RL• SeqGAN• RankGAN• MaskGAN

• GANs• Adversarial Text Generation without Reinforcement Learning• GANs for Sequences Generation with the Gumbel-softmax

16

Page 17: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

17

Recap: Inverse RL vs. GAN

• Reinforcement Learning and Imitation Learning

Page 18: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

18

Recap: Inverse RL vs. GAN

• Reinforcement Learning and Imitation Learning

• Why?

Difficult and even impossible to define reward functions for many environments

Page 19: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

19

Recap: Inverse RL vs. GAN

• Reinforcement Learning and Imitation Learning

• Behaviour Cloning

• Inverse RL

Page 20: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

20

Recap: Inverse RL vs. GAN

• Reinforcement Learning and Imitation Learning

• Behaviour Cloning == Supervised Learning

• Given expert’s demonstrations: (𝑠-, 𝑎-), (𝑠., 𝑎.)….. (𝑠/, 𝑎/)

• Supervised training:

Agent/Actor𝑠; 𝑎;

Page 21: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

21

Recap: Inverse RL vs. GAN

• Reinforcement Learning and Imitation Learning

• Behaviour Cloning == Supervised Learning

• Problem

• Expert only samples from limited observation (states)(Solution: Dataset Aggregation)

• If machine has limited capacity, it may choose the wrong behavior to copy

• Assume the training and testing data distributions are the same

Page 22: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

22

Recap: Inverse RL vs. GAN

• Reinforcement Learning and Imitation Learning

• Inverse Reinforcement Learning

RewardFunction

“Common”Reinforcement

Learning

OptimalActor

Environment

Page 23: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

23

Recap: Inverse RL vs. GAN

• Reinforcement Learning and Imitation Learning

• Inverse Reinforcement Learning

RewardFunction

“Inverse”Reinforcement

Learning

ExpertDemonstration

Environment𝒯 =(𝑠!, 𝑎!,𝑠", 𝑎"….. 𝑠# , 𝑎#)

……

Page 24: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

24

Recap: Inverse RL vs. GAN

• Reinforcement Learning and Imitation Learning

• Inverse Reinforcement Learning

RewardFunction

“Inverse”Reinforcement

Learning

ExpertDemonstration

Environment𝒯 =(𝑠!, 𝑎!,𝑠", 𝑎"….. 𝑠$ , 𝑎$)

……

“Common”Reinforcement

Learning

OptimalActor

Page 25: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

25

Recap: Inverse RL vs. GAN

• Inverse RL: Training

Expert 𝜋 𝒯1 𝒯2 𝒯3…. 𝒯N

Actor $𝜋

=𝒯1 =𝒯2 =𝒯3…. =𝒯N

RewardFunction

Goal: The expert is always the best

“Common”Reinforcement

Learning

5%&!

#

𝑅(𝒯𝑛) > 5%&!

#

𝑅( :𝒯𝑛)

Updated Reward Function

Update ActorNeural Network

Neural Network

𝒯 =(𝑠!, 𝑎!,𝑠", 𝑎"….. 𝑠$ , 𝑎$)

Page 26: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

26

Recap: Inverse RL vs. GAN

• GAN

• Inverse RLExpert 𝜋 𝒯1 𝒯2 𝒯3 …. 𝒯N

Actor ?𝜋 "𝒯1 "𝒯2 "𝒯3 …. "𝒯N

RewardFunction

“Common”Reinforcement

Learning

Dataset 𝑥1 𝑥2 𝑥3 …. 𝑥N

Generator $𝑥1 $𝑥2 $𝑥3 …. $𝑥N

Discriminator

"𝒯 are not differentiable ..so update the actor via RL

real

fake

real images

expert demonstrations

real

fake

$𝑥 are differentiable ..so update the generator through the discriminator

Page 27: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

• Introduction• RNN Language Modelling• Generating Sentences from a Continuous Space• Recap: Inverse RL vs. GAN

• GAN+RL• SeqGAN• RankGAN• MaskGAN

• GANs• Adversarial Text Generation without Reinforcement Learning• GANs for Sequences Generation with the Gumbel-softmax

27

Page 28: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

28

SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient

• Motivation

MLE has exposure bias

MLE tends to move to the mean when there is uncertainty“this is a …” “this is one” ç “a” or “one” which one is correct?

MLE-free sentence generation using adversarial networks could be better

• Challenge

Sentences are discrete

Sentence generation is not differentiable

SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient. Lantao Yu, Weinan Zhang, Jun Wang, Yong Yu. AAAI. 2016.

Page 29: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

29

• Inverse RL

• SeqGAN

Expert 𝜋 𝒯1 𝒯2 𝒯3 …. 𝒯N

Actor ?𝜋 "𝒯1 "𝒯2 "𝒯3 …. "𝒯N

RewardFunction

“Common”Reinforcement

Learning

"𝒯 are not differentiable ..so update the actor via RL

expert demonstrations

real

fake

SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient

Dataset 𝑡1 𝑡2 𝑡3 …. 𝑡N

Generator �̂�1�̂�2�̂�3 ….�̂�N

Discriminator

Policy Gradient�̂� are not differentiable ..so update the generator via RL

real sentences

real

fake

Page 30: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

30

SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient

• Method Details

SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient. Lantao Yu, Weinan Zhang, Jun Wang, Yong Yu. AAAI. 2016.

this is a bird

is a bird <EOS>• The generator is a LSTM or GRU language model

• Pretrain the generator using MLE before RL trainingfor better initialization

• The discriminator is a CNN network

Page 31: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

31

SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient

• Method Details

SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient. Lantao Yu, Weinan Zhang, Jun Wang, Yong Yu. AAAI. 2016.

Page 32: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

32

SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient

• Results

SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient. Lantao Yu, Weinan Zhang, Jun Wang, Yong Yu. AAAI. 2016.

Page 33: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

• Introduction• RNN Language Modelling• Generating Sentences from a Continuous Space• Recap: Inverse RL vs. GAN

• GAN+RL• SeqGAN• RankGAN• MaskGAN

• GANs• Adversarial Text Generation without Reinforcement Learning• GANs for Sequences Generation with the Gumbel-softmax

33

Page 34: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

34

RankGAN: Adversarial Ranking for Language Generation

• Motivation

the existing GANs restrict the discriminator to be a binary classifier, limiting their learning capacity for tasks that need to synthesize output with rich structures such as natural language descriptions

so … we use a Ranker as the discriminator … improved SeqGAN

RankGAN: Adversarial Ranking for Language Generation. Kevin Lin, Dianqi Li, Xiaodong He, Zhengyou Zhang, Ming-Ting Sun. NeurIPS. 2017

Page 35: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

35

RankGAN: Adversarial Ranking for Language Generation

• Method

RankGAN: Adversarial Ranking for Language Generation. Kevin Lin, Dianqi Li, Xiaodong He, Zhengyou Zhang, Ming-Ting Sun. NeurIPS. 2017

Generator hopes the Ranker rank the fake sentence to the top… a better reward function for RL training

Page 36: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

36

RankGAN: Adversarial Ranking for Language Generation

• Method

RankGAN: Adversarial Ranking for Language Generation. Kevin Lin, Dianqi Li, Xiaodong He, Zhengyou Zhang, Ming-Ting Sun. NeurIPS. 2017

𝑈 is a set of real data for reference𝐶 −

is a set of randomly sampled fake data𝐶

+is a set of randomly sampled real data

𝑝( and 𝐺) are real and fake distributionThe “Rank” of a data is the similarity with the reference 𝑈𝑦* and 𝑦+ are the feature vector of 𝑈 and 𝑠 (𝑠 is a singe data point)The similarity score of the input sequence 𝑠 given a reference 𝑢:

The ranking score for a certain sequence 𝑠 given a comparison set 𝐶:

The ranking score for the input sentence 𝑠 :

Page 37: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

• Introduction• RNN Language Modelling• Generating Sentences from a Continuous Space• Recap: Inverse RL vs. GAN

• GAN+RL• SeqGAN• RankGAN• MaskGAN

• GANs• Adversarial Text Generation without Reinforcement Learning• GANs for Sequences Generation with the Gumbel-softmax

37

Page 38: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

38

MaskGAN: Better Text Generation via Filling in the ____

• Motivation

SeqGAN and RankGAN use Policy Gradient for RL training, … thus we can only have a single reward for the whole sentence

Could we have a reward for each word?

MaskGAN uses Actor Critic for RL training.

MaskGAN: Better Text Generation via Filling in the ____ . William Fedus, Ian Goodfellow, Andrew M. Dai. ICLR 2018

Page 39: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

39

MaskGAN: Better Text Generation via Filling in the ____

• Method

Both Generator and Discriminator are Seq2Seq

MaskGAN: Better Text Generation via Filling in the ____ . William Fedus, Ian Goodfellow, Andrew M. Dai. ICLR 2018

x y

real/fake real/fake no need no need no need

reward

Discriminator

Generator

Page 40: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

40

MaskGAN: Better Text Generation via Filling in the ____

• Results

• Conditional Sentence Generation == Inpainting

• Unconditional Sentence Generation == All words are masked

MaskGAN: Better Text Generation via Filling in the ____ . William Fedus, Ian Goodfellow, Andrew M. Dai. ICLR 2018

Page 41: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

• Introduction• RNN Language Modelling• Generating Sentences from a Continuous Space• Recap: Inverse RL vs. GAN

• GAN+RL• SeqGAN• RankGAN• MaskGAN

• GANs• Adversarial Text Generation without Reinforcement Learning• GANs for Sequences Generation with the Gumbel-softmax

41

Page 42: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

42

Adversarial Text Generation without Reinforcement Learning

Adversarial Text Generation without Reinforcement Learning. Donahue, David. Rumshisky, Anna. 2018

• Motivation

RL training is too slow …

• Method: Stage 1

The generator is a common Seq2Seq

Page 43: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

43

Adversarial Text Generation without Reinforcement Learning

• Method: Stage 2

Adversarial Text Generation without Reinforcement Learning. Donahue, David. Rumshisky, Anna. 2018

Page 44: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

• Introduction• RNN Language Modelling• Generating Sentences from a Continuous Space• Recap: Inverse RL vs. GAN

• GAN+RL• SeqGAN• RankGAN• MaskGAN

• GANs• Adversarial Text Generation without Reinforcement Learning• GANs for Sequences Generation with the Gumbel-softmax

44

Page 45: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

45

GANs for Sequence Generation with Gumbel-softmax

• MotivationTrain like a normal GAN

GANs for Sequences of Discrete Elements with the Gumbel-softmax Distribution. Matt Kusner, Jose Miguel Hernandez-Lobato. NeurIPS. 2016.

Page 46: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

46

GANs for Sequence Generation with Gumbel-softmax

• SeqGAN

• Gumbel-softmax GAN

GANs for Sequences of Discrete Elements with the Gumbel-softmax Distribution. Matt Kusner, Jose Miguel Hernandez-Lobato. NeurIPS. 2016.

Dataset 𝑡1 𝑡2 𝑡3 …. 𝑡N

Generator �̂�1�̂�2�̂�3 ….�̂�N

Discriminator

Policy Gradient

real sentences

real

fake

Dataset 𝑡1 𝑡2 𝑡3 …. 𝑡N

Generator �̂�1�̂�2�̂�3 ….�̂�N

Discriminator

real sentences

real

fake

Gumbel-softmax𝑧à

LSTM decoder

LSTM language model

LSTM encoder

LSTM encoder

Page 47: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

47

GANs for Sequence Generation with Gumbel-softmax

• Limitation

• Gumbel-softmax’s gradient is too large … poor performance

GANs for Sequences of Discrete Elements with the Gumbel-softmax Distribution. Matt Kusner, Jose Miguel Hernandez-Lobato. NeurIPS. 2016.

Page 48: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

• Introduction• RNN Language Modelling --• Generating Sentences from a Continuous Space 2016• Recap: Inverse RL vs. GAN

• GAN+RL• SeqGAN 2017• RankGAN 2017• MaskGAN 2018

• GANs• Adversarial Text Generation without Reinforcement Learning 2018• GANs for Sequences Generation with the Gumbel-softmax 2016

48

Discreteness in Generative Models – Discrete Sequence Generation

Page 49: Discreteness in Generative Models 25... · Multiple data inputs and multiple data outputs Input Hidden state Output asynchronous (Seq2Seq) ... Discreteness in Generative Models –Discrete

Thanks

49