My prompt is "Which instrument does Henry Halstead mainly play? Please answer an instrument name. Answer: ", which is a question posed to the LLM.
I want to get the cached hidden states of the tokens the LLM produces while it is generating the response. How can I do this?
On one hand, the code logits, cache = model.run_with_cache(prompt, return_cache_object=True) only caches the hidden states of the prompt, because it doesn't run the generate function.
On the other hand, the code output = model.generate(prompt, do_sample=False, max_new_tokens=20) only returns the generated tokens or text; I can't get the ActivationCache of the generated answer.
So how can I obtain both the model's response and the ActivationCache of the response tokens in a single generation pass?
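For reference, here is roughly what I am running. This is only a minimal sketch: the "gpt2" model name is a placeholder for the model I actually load.

from transformer_lens import HookedTransformer

model = HookedTransformer.from_pretrained("gpt2")  # placeholder for my actual model
prompt = "Which instrument does Henry Halstead mainly play? Please answer an instrument name. Answer: "

# Attempt 1: returns an ActivationCache, but only for the prompt positions,
# because no new tokens are generated in this call.
logits, cache = model.run_with_cache(prompt, return_cache_object=True)

# Attempt 2: produces the answer text, but returns no ActivationCache at all.
output = model.generate(prompt, do_sample=False, max_new_tokens=20)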
Meehaohao changed the title from "How to get the hook while generating new tokens?" to "How to get the Activation cache while the LLM is generating new tokens?" on Aug 7, 2024.
Unfortunately, there is currently no activation-cache integration in the generate function. I don't see any reason why we couldn't add that as an option, but it would be a pretty low priority given some other projects that are currently being worked on, unless someone volunteers to do it.
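In the meantime, a minimal sketch of two possible workarounds with the existing API is shown below. Assumptions: a TransformerLens HookedTransformer is being used, "gpt2" stands in for the actual model, and the hook point / layer in the second option are just examples.

from collections import defaultdict
from transformer_lens import HookedTransformer, utils

model = HookedTransformer.from_pretrained("gpt2")  # placeholder model
prompt = "Which instrument does Henry Halstead mainly play? Please answer an instrument name. Answer: "

# Option 1: generate first, then re-run the full text (prompt + generated answer)
# through run_with_cache. The second forward pass recomputes the same activations,
# this time covering the answer tokens as well, at the cost of one extra pass.
answer_text = model.generate(prompt, do_sample=False, max_new_tokens=20)
logits, cache = model.run_with_cache(answer_text, return_cache_object=True)

# Option 2: capture activations while generate is running, via temporary hooks.
# With the default KV cache, the first forward pass covers the whole prompt and
# each subsequent pass covers only the newly generated token, so the saved
# tensors have to be concatenated along the sequence dimension afterwards.
captured = defaultdict(list)

def save_hook(tensor, hook):
    captured[hook.name].append(tensor.detach().cpu())

hook_name = utils.get_act_name("resid_post", 5)  # example hook point and layer
with model.hooks(fwd_hooks=[(hook_name, save_hook)]):
    answer_text = model.generate(prompt, do_sample=False, max_new_tokens=20)

Option 1 is the simplest way to end up with a full ActivationCache covering the response tokens; Option 2 avoids the extra forward pass but only captures the hook points you register.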