How to train the three models #5

tiancity-NJU · 2019-06-20T06:31:10Z

When I try to train the key_gen model, it appears to need 1-billion.txt, but I can't find that file anywhere. Can you tell me how to train the models and reproduce your results? Thanks.

J-zin · 2019-09-23T07:47:10Z

FileNotFoundError: [Errno 2] No such file or directory: '../data/1-billion/1-billion.txt'

NingMiao · 2019-10-11T03:27:09Z

Since I'm not authorized to release the 1-billion dataset with my code, please download it from its official website.
By the way, you can use any dataset to train the language model, or even replace the current model with a pre-trained GPT-2.
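
For anyone who wants to try a different corpus, here is a minimal sketch of putting plain-text files where the training script looks for the data. The target path comes from the error message earlier in this thread. The one-sentence-per-line layout, the whitespace cleanup, and the helper name `build_corpus` are assumptions for illustration, not something the repository confirms.

```python
import os


def build_corpus(src_paths, dst_path):
    """Concatenate plain-text files into one file, one non-empty line each.

    Hypothetical helper: the expected corpus format is an assumption.
    Returns the number of lines written.
    """
    dst_dir = os.path.dirname(dst_path)
    if dst_dir:
        # Create the target directory (e.g. ../data/1-billion) if it is missing.
        os.makedirs(dst_dir, exist_ok=True)
    written = 0
    with open(dst_path, "w", encoding="utf-8") as dst:
        for path in src_paths:
            with open(path, encoding="utf-8") as src:
                for line in src:
                    # Collapse runs of whitespace and skip empty lines.
                    line = " ".join(line.split())
                    if line:
                        dst.write(line + "\n")
                        written += 1
    return written
```

Calling `build_corpus(["my_corpus.txt"], "../data/1-billion/1-billion.txt")` from the directory the training script runs in would create the file named in the `FileNotFoundError`; `my_corpus.txt` is a placeholder for your own data.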

Milozms · 2019-10-14T19:49:46Z

> Since I'm not authorized to release the 1-billion dataset with my code, please download it from its official website.
> By the way, you can use any dataset to train the language model, or even replace the current model with a pre-trained GPT-2.

@NingMiao I'm also interested in replacing the current model with a pre-trained GPT-2. However, the released GPT-2 is unidirectional only, while your method requires a bidirectional (both forward and backward) pre-trained language model. Are there any possible solutions?
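
One possible workaround, not confirmed by the author in this thread, is to obtain the backward model by training or fine-tuning a second unidirectional model on text whose token order has been reversed. The sketch below only prepares such a reversed corpus. Whitespace tokenization and the function names are assumptions; a subword model such as GPT-2 would reverse subword ids instead.

```python
def reverse_tokens(line):
    """Return the whitespace-separated tokens of `line` in reverse order."""
    return " ".join(reversed(line.split()))


def reverse_corpus(src_path, dst_path):
    """Write a copy of a one-sentence-per-line corpus with each line reversed.

    Hypothetical helper: a left-to-right model trained on the output
    predicts each word from the words that originally followed it.
    """
    with open(src_path, encoding="utf-8") as src, \
            open(dst_path, "w", encoding="utf-8") as dst:
        for line in src:
            reversed_line = reverse_tokens(line)
            if reversed_line:
                dst.write(reversed_line + "\n")
```

For example, `reverse_tokens("the cat sat")` gives `"sat cat the"`, so a model trained on the reversed file scores "cat" given "sat" rather than given "the".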
