This example implements a simplified variant of the ctrl-gen model from:

*Toward Controlled Generation of Text.* Zhiting Hu, Zichao Yang, Xiaodan Liang, Ruslan Salakhutdinov, Eric Xing. ICML 2017

The model has a rough architecture of `Encoder--Decoder--Classifier`. Compared to the paper, the following simplifications are made:

- The base Variational Autoencoder (VAE) model is replaced with an attentional Autoencoder (AE) -- a VAE is not necessary in the text style transfer setting, since we do not need to interpolate the latent space as in the paper.
- The attribute classifier (i.e., discriminator) is trained with real data only. Samples generated by the decoder are not used.
- The independence constraint is omitted.
Download the Yelp sentiment dataset with the following command:

```bash
python prepare_data.py
```

Train the model on the above data to do sentiment transfer:

```bash
python main.py --config config
```
`config.py` contains the data and model configurations.
- The model is first pre-trained for a few epochs (specified in `config.py`). During pre-training, the `Encoder-Decoder` part is trained as an autoencoder, while the `Classifier` part is trained with the classification labels.
- Full-training is then performed for another few epochs. During full-training, the `Classifier` part is fixed, and the `Encoder-Decoder` part is trained to fit the classifier, along with continuing to minimize the autoencoding loss. A minimal sketch of this two-phase schedule is shown after this list.
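The following sketch illustrates the two-phase schedule described above. It is not the repo's actual code (that lives in `main.py`): the `model.*` helpers, optimizer names, and config fields are all hypothetical placeholders.

```python
# Illustrative sketch only; all model.* helpers and config fields are
# hypothetical placeholders, not this repo's actual API.
def train_two_phase(model, data_iter, optimizer_d, optimizer_g, config):
    # Phase 1: pre-training.
    for _ in range(config.pretrain_epochs):              # hypothetical field
        for batch in data_iter():
            # The Classifier is trained on real labeled data only.
            loss_d = model.classifier_loss(batch)        # hypothetical helper
            optimizer_d.zero_grad(); loss_d.backward(); optimizer_d.step()
            # The Encoder-Decoder is trained purely as an autoencoder
            # (equivalent to lambda_g = 0 in the training log below).
            loss_g = model.autoencoding_loss(batch)      # hypothetical helper
            optimizer_g.zero_grad(); loss_g.backward(); optimizer_g.step()

    # Phase 2: full-training. The Classifier is fixed; only the
    # Encoder-Decoder parameters are updated through optimizer_g.
    for _ in range(config.fulltrain_epochs):             # hypothetical field
        for batch in data_iter():
            loss_g = (model.autoencoding_loss(batch)
                      + config.lambda_g * model.generated_clas_loss(batch))
            optimizer_g.zero_grad(); loss_g.backward(); optimizer_g.step()
```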
(Note: When using your own dataset, make sure to set `max_decoding_length_train` and `max_decoding_length_infer` in `config.py`.)
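These two settings might look like the following in `config.py` (the values here are illustrative; consult the shipped config for the actual values and structure):

```python
# Illustrative values only -- see the shipped config.py for the real ones.
# Maximum number of decoding steps during training and inference; set them
# large enough to cover the longest sentences in your own dataset.
max_decoding_length_train = 21
max_decoding_length_infer = 20
```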
The training log is printed as below:

```
gamma: 1.0, lambda_g: 0.0
step: 1, loss_d: 0.6903 accu_d: 0.5625
step: 1, loss_g_clas: 0.6991 loss_g: 9.1452 accu_g: 0.2812 loss_g_ae: 9.1452 accu_g_gdy: 0.2969
step: 500, loss_d: 0.0989 accu_d: 0.9688
step: 500, loss_g_clas: 0.2985 loss_g: 3.9696 accu_g: 0.8891 loss_g_ae: 3.9696 accu_g_gdy: 0.7734
...
step: 6500, loss_d: 0.0806 accu_d: 0.9703
step: 6500, loss_g_clas: 5.7137 loss_g: 0.2887 accu_g: 0.0844 loss_g_ae: 0.2887 accu_g_gdy: 0.0625
epoch: 1, loss_d: 0.0876 accu_d: 0.9719
epoch: 1, loss_g_clas: 6.7360 loss_g: 0.2195 accu_g: 0.0627 loss_g_ae: 0.2195 accu_g_gdy: 0.0642
val: accu_g: 0.0445 loss_g_ae: 0.1302 accu_d: 0.9774 bleu: 90.7896 loss_g: 0.1302 loss_d: 0.0666 loss_g_clas: 7.0310 accu_g_gdy: 0.0482
...
```
where:

- `loss_d` and `accu_d` are the classification loss/accuracy of the `Classifier` part.
- `loss_g_clas` is the classification loss of the generated sentences.
- `loss_g_ae` is the autoencoding loss.
- `loss_g` is the joint loss `= loss_g_ae + lambda_g * loss_g_clas` (see the snippet after this list).
- `accu_g` is the classification accuracy of the generated sentences with soft representations (i.e., Gumbel-softmax).
- `accu_g_gdy` is the classification accuracy of the generated sentences with greedy decoding.
- `bleu` is the BLEU score between the generated and input sentences.
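As a concrete illustration of how the joint loss combines the two terms, here is a small runnable sketch using PyTorch with random stand-in tensors (the shapes and the `lambda_g` value are arbitrary, not taken from this repo):

```python
# Minimal illustration of loss_g = loss_g_ae + lambda_g * loss_g_clas,
# computed on random stand-in tensors; not this repo's actual computation.
import torch
import torch.nn.functional as F

batch, seq_len, vocab, n_classes = 4, 16, 1000, 2
lambda_g = 0.1  # weight of the classification term (arbitrary value here;
                # it is 0 during pre-training, as in the log above)

# Autoencoding loss: token-level cross-entropy of the reconstruction.
dec_logits = torch.randn(batch, seq_len, vocab)
targets = torch.randint(vocab, (batch, seq_len))
loss_g_ae = F.cross_entropy(dec_logits.reshape(-1, vocab), targets.reshape(-1))

# Classification loss of the generated sentences w.r.t. the target attribute.
# (The "soft representations" behind accu_g would come from Gumbel-softmax
# samples of the decoder logits, e.g. F.gumbel_softmax(dec_logits, tau=gamma).)
clas_logits = torch.randn(batch, n_classes)
target_attr = torch.randint(n_classes, (batch,))
loss_g_clas = F.cross_entropy(clas_logits, target_attr)

loss_g = loss_g_ae + lambda_g * loss_g_clas
print(f"loss_g_ae={loss_g_ae:.4f} loss_g_clas={loss_g_clas:.4f} loss_g={loss_g:.4f}")
```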
Text style transfer has two primary goals:

1. The generated sentence should have the desired attribute (e.g., positive/negative sentiment).
2. The generated sentence should preserve the content of the original one.
We use automatic metrics to evaluate both:
- For (1), we can use a pre-trained classifier to classify the generated sentences and evaluate the accuracy (the higher the better). This code does not implement a stand-alone classifier for evaluation, though adding one would be straightforward. The `Classifier` part of the model gives a reasonably good estimate of this accuracy (i.e., `accu_g_gdy` above).
- For (2), we evaluate the BLEU score between the generated sentences and the original sentences, i.e., `bleu` above (the higher the better). See Yang et al., 2018 for more details. A sketch of this evaluation is shown after this list.
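For example, content preservation could be scored with a corpus-level BLEU between the generated and original sentences using NLTK. This helper is a sketch, not part of this repo; the example sentences are taken from the samples at the end of this README:

```python
# Sketch of BLEU-based content-preservation scoring with NLTK;
# this helper is not part of this repo.
from nltk.translate.bleu_score import corpus_bleu

def transfer_bleu(original_sents, generated_sents):
    # corpus_bleu expects a list of tokenized references per hypothesis.
    references = [[sent.split()] for sent in original_sents]
    hypotheses = [sent.split() for sent in generated_sents]
    return 100 * corpus_bleu(references, hypotheses)

originals = ["this was the best dining experience i have ever had ."]
generated = ["this was the worst dining experience i have ever had ."]
print(f"BLEU: {transfer_bleu(originals, generated):.2f}")
```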
The implementation here gives the following performance after 10 epochs of pre-training and 2 epochs of full-training:

| Accuracy (by the `Classifier` part) | BLEU (with the original sentences) |
|---|---|
| 0.92 | 54.0 |
Also refer to the following papers that used this code and compared to other text style transfer approaches:

- Unsupervised Text Style Transfer using Language Models as Discriminators. Zichao Yang, Zhiting Hu, Chris Dyer, Eric Xing, Taylor Berg-Kirkpatrick. NeurIPS 2018.
- Structured Content Preservation for Unsupervised Text Style Transfer. Youzhi Tian, Zhiting Hu, Zhou Yu. 2018.
Here are some randomly-picked samples. In each pair, the first sentence is the original and the second is the generated:

```
go to place for client visits with gorgeous views .
go to place for client visits with lacking views .

there was lots of people but they still managed to provide great service .
there was lots of people but they still managed to provide careless service .

this was the best dining experience i have ever had .
this was the worst dining experience i have ever had .

needless to say , we skipped desert .
gentle to say , we edgy desert .

the first time i was missing an entire sandwich and a side of fries .
the first time i was beautifully an entire sandwich and a side of fries .

her boutique has a fabulous selection of designer brands !
her annoying has a sketchy selection of bland warned !

service is pretty good .
service is trashy rude .

ok nothing new .
exceptional impressed new .
```