Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reproducing Figure 1 in the paper #3

Open
freesky01 opened this issue Aug 29, 2024 · 0 comments
Open

Reproducing Figure 1 in the paper #3

freesky01 opened this issue Aug 29, 2024 · 0 comments

Comments

@freesky01
Copy link

freesky01 commented Aug 29, 2024

Thank you for the great work, quite interesting view!

I'm trying to use DCT to analyze the power spectrum of input hidden states of currently popular LLMs, like you have shown in Figure 1 in the paper. But I'm not sure about the details to plot it (e.g. w/o normalization and how are amplitudes averaged over the entire validation set, sample-level or token level?). Most of the amplitudes of tokens by my side is about zero, which does not follow the pattern of Figure 1, so I think I am very likely to mess up some settings.

Could you provide the snippet of plotting Figure 1 or give me some further hints about it? I would really appreciate it if you would help me.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant