Emotional Chatting Machine

This reponsitory includes two models

a basic seq2seq model with attention and beamsearch
ECM model

Thanks to https://github.com/AaronYALai/Seq2seqAttn_ECM, I learn a lot from this reponsitory. However, I can't get a satisfactory results on the chinese corpus when I use AaronYALai's model. So, I implement a new ECM model based on the tensorflow seq2seq API.

Requirements

python 2.7
tensorflow == 1.4

Sample data

sample data here is only for showing the data format, not for training.

category: target sentence emotion category
choice: target sentence emotional word annotation
source: source sentence
target: target sentence

Just tell me how it works

Set up work space

Create a new folder by following the parameters "workspace" in the yaml configuration file. for example:

./works/example/

Prepare your data and configuration file

you can check the sample data folder for the data format.Then you need to put them under path like

./works/example/data/

Train the model

for training the basic model:

python train.py

for training ECM model:

python train_ECM.py

Infer

"infer_ECM.py" will first create a calculation graph of infer model then load the trained parameters, and finally perform the inference, which is not suitable for deployment.

python infer_ECM.py

"save_infer_model.py" will first create a calculation graph, load the training parameters, and then save the infer model as a model file. At this time, you can use different languages of tensorflow API(C++/Java) to load the infer model.

python save_infer_model.py
python infer_ECM_deploy.py

Model performance

The following result is based on the dataset I crawled from BaiDu Tieba, including 3 emotion types:

no emotion: 321052
pos emotion: 137086
neg emotion: 240233

Parameters:

embeddings:
   embed_size: 300
   vocab_size: 40000
encoder:
   bidirectional: True
   cell_type: LSTM
   num_layers: 2
   num_units: 512
decoder:
   attn_num_units: 512
   cell_type: LSTM
   num_layers: 2
   num_units: 512
   state_pass: True
   infer_max_iter: 25
   emo_cat_emb_size: 256
   emo_internal_memory_units: 256
   num_emotion: 3

Training perplexity

Extra dataset

NTCIR Short Text Conversation Task(STC-3) chinese Emotional Conversation Generation (CECG) Subtask http://coai.cs.tsinghua.edu.cn/hml/challenge/dataset_description/ You can also find the dataset in this project.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
configs		configs
model		model
sample_data		sample_data
.gitignore		.gitignore
README.md		README.md
infer.py		infer.py
infer_ECM.py		infer_ECM.py
infer_ECM_deploy.py		infer_ECM_deploy.py
save_infer_model.py		save_infer_model.py
stc-3_emotion_train.json.zip		stc-3_emotion_train.json.zip
train.py		train.py
train_ECM.py		train_ECM.py
training_perplexity.png		training_perplexity.png
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Emotional Chatting Machine

Requirements

Sample data

Just tell me how it works

Set up work space

Prepare your data and configuration file

Train the model

Infer

Model performance

Parameters:

Training perplexity

Extra dataset

About

Releases

Packages

Languages

1YCxZ/ECM-seq2seq

Folders and files

Latest commit

History

Repository files navigation

Emotional Chatting Machine

Requirements

Sample data

Just tell me how it works

Set up work space

Prepare your data and configuration file

Train the model

Infer

Model performance

Parameters:

Training perplexity

Extra dataset

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages