Question about GODEL_XL (GPT-J) model size #19
Comments
The parameter count isn't a very reliable statistic of a model's capability. With newer models that exploit sparsely connected networks and model distillation, one can drastically reduce the number of parameters while improving the model's speed and performance (i.e., faster and better, with fewer parameters).
Also, models that exploit knowledge retrieval can run circles around large language models.
@meatflavourdev thanks for your responses. While I don't dispute anything you said, it doesn't address my question. To be clear, here is my issue:
My guess is that the number of parameters listed in the table in the README of this repo for the GODEL_XL model is a typo and it should say "6B" instead of "2.7B". This is a relatively minor point, but I was hoping that one of the authors could confirm just to be sure.
First of all, thank you for making this work public!
I'm curious about the model size shown in the README for the released GODEL_XL model (based on GPT-J). The table in the README lists the model size as "2.7B", but my understanding is that GPT-J has 6B parameters.
Is the number of parameters for GODEL XL listed in the README correct?
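For reference, a back-of-the-envelope count from GPT-J's published dimensions (d_model=4096, 28 layers, vocabulary 50400, untied output head) lands near 6B rather than 2.7B. This is only a rough sketch: it ignores biases, layernorm weights, and other small terms, so it slightly undercounts.

```python
# Rough parameter estimate for GPT-J from its published config.
# Biases and layernorm weights are ignored (a small undercount).
d_model = 4096
n_layers = 28
vocab = 50400

attn = 4 * d_model ** 2                 # q, k, v, and output projections
mlp = 2 * d_model * (4 * d_model)       # up- and down-projections
embeddings = vocab * d_model            # token embedding matrix
lm_head = vocab * d_model               # GPT-J's output head is untied

total = n_layers * (attn + mlp) + embeddings + lm_head
print(f"{total / 1e9:.2f}B")            # prints "6.05B"
```

So a "2.7B" entry would be surprising for a GPT-J-based model; 2.7B is closer to the size of GPT-Neo-2.7B.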