-
Notifications
You must be signed in to change notification settings - Fork 19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
请教数据预处理部分 #1
Comments
另外词向量是要自己根据数据集训练吗? |
你好,1、运行本模型,需要对源数据分词(包括query、passage、alternatives)。由于本次以分享思路为主,并没有开源数据预处理部分的计划。2、基于数据集训练的词向量就好,如果追求更好的效果,可以使用外部词向量。 |
多谢了! |
您好! |
哇,不能直接跑的项目可真是让人一眼难尽呢 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
您好!
看了您的项目收获很大,非常感谢!请问您数据预处理部分有开源吗?我在尝试跑的时候发现 train.json 应该是处理过的,而且有些属性是原数据集中没有的。请问方便分享吗?
多谢了!
The text was updated successfully, but these errors were encountered: