vocabulary table #8

Open
zperfet opened this issue Oct 21, 2019 · 5 comments

Comments

@zperfet

zperfet commented Oct 21, 2019

Hello, could you provide your vocabulary table for further study?
I read your code and saw that your vocabulary table has 5000 entries, but it still remains a black box to us.
We would really appreciate it if you could provide your vocabulary or the raw data.

@Fatima-200159617

Hi,
Could you please advise on the following:
When creating the list of word indices and counts for each text, did you remove stopwords or apply stemming? Did you simply count word frequencies, or did you select words based on their TF-IDF scores?
I would appreciate any information on how we can create the same input from our own data.

@OwenLeng

Hello, could you provide your vocabulary table for further study?
I read your code and saw that your vocabulary table has 5000 entries, but it still remains a black box to us.
We would really appreciate it if you could provide your vocabulary or the raw data.

May I ask, did you crawl the dataset yourself?

@cxyccc

cxyccc commented Jul 11, 2022

Hello, could you provide your vocabulary table for further study? I read your code and saw that your vocabulary table has 5000 entries, but it still remains a black box to us. We would really appreciate it if you could provide your vocabulary or the raw data.

Hello, have you obtained the vocabulary table now? Many thanks!

@cxyccc

cxyccc commented Jul 11, 2022

Hi, could you please advise on the following: When creating the list of word indices and counts for each text, did you remove stopwords or apply stemming? Did you simply count word frequencies, or did you select words based on their TF-IDF scores? I would appreciate any information on how we can create the same input from our own data.

Hello, have you obtained the vocabulary table now? Many thanks!

@Gsuhy-L

Gsuhy-L commented Dec 2, 2023

Hi, could you please advise on the following: When creating the list of word indices and counts for each text, did you remove stopwords or apply stemming? Did you simply count word frequencies, or did you select words based on their TF-IDF scores? I would appreciate any information on how we can create the same input from our own data.

Hello, have you obtained the vocabulary table now? Many thanks!

Have you figured out how the author created this vocabulary? My impression is that it was built simply from word frequencies. What do you think?
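
For anyone who wants to try the word-frequency guess on their own data, here is a minimal sketch. It is not the author's confirmed method: the tokenizer, the function names `tokenize`, `build_vocab`, and `to_index_counts`, and the absence of stopword removal or stemming are all assumptions made for illustration. Only the vocabulary size of 5000 comes from this thread.

```python
import re
from collections import Counter


def tokenize(text):
    # Assumption: lowercase and keep runs of letters, digits, and apostrophes.
    # No stopword removal or stemming is applied here.
    return re.findall(r"[a-z0-9']+", text.lower())


def build_vocab(texts, max_size=5000):
    # Count every token across the corpus and keep the max_size most frequent.
    counts = Counter()
    for text in texts:
        counts.update(tokenize(text))
    # Index 0 is the most frequent word.
    return {word: idx for idx, (word, _) in enumerate(counts.most_common(max_size))}


def to_index_counts(text, vocab):
    # Convert one text into a sorted list of (word_index, count) pairs,
    # silently dropping words that are not in the vocabulary.
    counts = Counter(vocab[tok] for tok in tokenize(text) if tok in vocab)
    return sorted(counts.items())


if __name__ == "__main__":
    corpus = ["the cat sat on the mat", "the cat sat"]
    vocab = build_vocab(corpus, max_size=3)
    print(vocab)
    print(to_index_counts("the the dog cat", vocab))
```

If the original input turns out to use TF-IDF selection, stopword filtering, or stemming, only `tokenize` and `build_vocab` would need to change; the index-and-count conversion stays the same.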
