Why use the 8-bit floating numbers to compute the original cost? #16

shiyuetianqiang · 2020-10-29T11:29:26Z

Hi,
The work is amazing.
When I looked through the code, I found that you used 8-bit floating-point numbers to compute the original cost and store
it as a lookup table. Why not use 32-bit floating point (without the "--half" flag in the pretraining process) or 16-bit floating point (with the "--half" flag in the pretraining process)? Could you please clarify that?
Thanks a lot!

shiyuetianqiang · 2020-10-29T12:20:38Z

Sorry, I got it.
It seems that you used 8-bit as the baseline.
