How to train the three models #5

Open
tiancity-NJU opened this issue Jun 20, 2019 · 3 comments

Comments

@tiancity-NJU

When I try to train the key_gen model, it seems to need 1-billion.txt, but I can't find that file anywhere. Can you tell me how to train the models and reproduce your results? Thanks.

@J-zin

J-zin commented Sep 23, 2019

FileNotFoundError: [Errno 2] No such file or directory: '../data/1-billion/1-billion.txt'

@NingMiao
Owner

Since I'm not authorized to release the 1-billion dataset with my code, please download it from its official website.
By the way, you can train the language model on any dataset, or even replace the current model with a pre-trained GPT-2.
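
For readers who want to try the "any dataset" route, here is a minimal sketch of writing a substitute corpus to the path from the error message above. It assumes the training script expects a plain-text file with one sentence per line, which this thread does not confirm; `build_corpus` and `my_corpus.txt` are hypothetical names used only for illustration.

```python
from pathlib import Path


def build_corpus(input_paths, output_path):
    """Concatenate plain-text files into one corpus file.

    Assumption: the trainer reads one sentence per line. Check the
    repository's data loader before relying on this format.
    """
    output_path = Path(output_path)
    # Create ../data/1-billion/ (or any other target directory) if missing.
    output_path.parent.mkdir(parents=True, exist_ok=True)
    n_written = 0
    with output_path.open("w", encoding="utf-8") as out:
        for path in input_paths:
            with Path(path).open(encoding="utf-8") as src:
                for line in src:
                    # Collapse runs of whitespace and skip empty lines.
                    sentence = " ".join(line.split())
                    if sentence:
                        out.write(sentence + "\n")
                        n_written += 1
    return n_written
```

Calling `build_corpus(["my_corpus.txt"], "../data/1-billion/1-billion.txt")` would create the file that the `FileNotFoundError` complains about. Whether the resulting model reproduces the reported results depends on the corpus used.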

@Milozms

Milozms commented Oct 14, 2019

> Since I'm not authorized to release the 1-billion dataset with my code, please download it from its official website.
> By the way, you can train the language model on any dataset, or even replace the current model with a pre-trained GPT-2.

@NingMiao I'm also interested in replacing the current model with a pre-trained GPT-2. However, the released GPT-2 is unidirectional only, while your method requires bidirectional (both forward and backward) pre-trained language models. Are there any possible solutions?
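
One common workaround, not confirmed by the maintainer in this thread, is to obtain the backward model by training or fine-tuning a second left-to-right model on sentences whose token order has been reversed. The sketch below covers only the data preparation step. It assumes whitespace tokenization, and the function names are hypothetical. With a subword tokenizer such as GPT-2's, the token IDs would need to be reversed instead of the words.

```python
def reverse_tokens(sentence):
    """Return the sentence with its whitespace-separated tokens reversed."""
    return " ".join(reversed(sentence.split()))


def make_backward_corpus(sentences):
    """Reverse every non-empty sentence.

    A left-to-right language model trained on this output predicts each
    original token from the tokens that followed it, so it can serve as
    a backward model.
    """
    return [reverse_tokens(s) for s in sentences if s.strip()]
```

Pre-trained GPT-2 weights were learned in the forward direction only, so a model fed reversed text would likely need substantial fine-tuning or training from scratch. That cost is not evaluated here.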
