You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Given a set of reporters for each layer of a model and a fixed input, we can extract the model's "belief" at each layer and see how it evolves over time, similar to how the tuned lens works.
This is low-ish priority, but I think this should be done in time for the paper at least.
The text was updated successfully, but these errors were encountered:
Given a set of reporters for each layer of a model and a fixed input, we can extract the model's "belief" at each layer and see how it evolves over time, similar to how the tuned lens works.
This is low-ish priority, but I think this should be done in time for the paper at least.
The text was updated successfully, but these errors were encountered: