Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about scaling of data #1

Open
Perkins729 opened this issue Nov 6, 2024 · 1 comment
Open

Question about scaling of data #1

Perkins729 opened this issue Nov 6, 2024 · 1 comment

Comments

@Perkins729
Copy link

Hello,

I would like to ask whether a 6M parameter model would be too small for 4 million data samples, based on the details provided in the paper. Could you share any insights on the relative relationship between model size and data quantity?

Thank you!

@WhoKnowsssss
Copy link
Collaborator

Hi there,

Thanks for your question. We find that the model size scales more with the number of skills (i.e. diversity of the data), less with the data samples. For example, for the same 4 million data samples, if the data include 50 skills instead of 5, you would probably need a much larger model. If most data is repetitive (as in our case), a relatively small model is sufficient to capture the diversity within the dataset.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants