Is there code available, either here or elsewhere, that implements a PyTorch layer for streaming attention with sinks, independent of the various LLMs it goes into?
I can imagine turning this into a separate layer:
https://github.com/mit-han-lab/streaming-llm/blob/main/streaming_llm/pos_shift/modify_llama.py
But if that work has already been done somewhere else, that would be a great time saver.
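For reference, a standalone version of the idea could look roughly like the sketch below: a single-head attention module whose KV cache always retains the first `n_sink` tokens ("attention sinks") plus a rolling window of the most recent tokens. This is a hypothetical simplification, not the repo's implementation — the class name, the single-head/no-RoPE setup, and the one-token-per-step interface are all my assumptions; the real `modify_llama.py` additionally re-applies rotary embeddings to shifted cache positions.

```python
import torch
import torch.nn as nn


class StreamingSinkAttention(nn.Module):
    """Hypothetical standalone attention layer with attention sinks.

    The KV cache keeps the first `n_sink` tokens plus the most recent
    `window` tokens, evicting everything in between — a simplified,
    single-head, RoPE-free sketch of the streaming-llm idea.
    """

    def __init__(self, dim: int, n_sink: int = 4, window: int = 252):
        super().__init__()
        self.dim, self.n_sink, self.window = dim, n_sink, window
        self.q_proj = nn.Linear(dim, dim, bias=False)
        self.k_proj = nn.Linear(dim, dim, bias=False)
        self.v_proj = nn.Linear(dim, dim, bias=False)
        self.o_proj = nn.Linear(dim, dim, bias=False)
        self.k_cache = None  # (1, cache_len, dim)
        self.v_cache = None

    def _evict(self, cache: torch.Tensor) -> torch.Tensor:
        # Keep sink tokens + most recent `window` tokens; drop the middle.
        max_len = self.n_sink + self.window
        if cache.size(1) <= max_len:
            return cache
        return torch.cat(
            [cache[:, : self.n_sink], cache[:, -self.window:]], dim=1
        )

    @torch.no_grad()
    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (1, 1, dim) — one new token per decoding step.
        q = self.q_proj(x)
        k, v = self.k_proj(x), self.v_proj(x)
        if self.k_cache is None:
            self.k_cache, self.v_cache = k, v
        else:
            self.k_cache = self._evict(torch.cat([self.k_cache, k], dim=1))
            self.v_cache = self._evict(torch.cat([self.v_cache, v], dim=1))
        # Plain scaled dot-product attention over the bounded cache.
        attn = torch.softmax(
            q @ self.k_cache.transpose(1, 2) / self.dim ** 0.5, dim=-1
        )
        return self.o_proj(attn @ self.v_cache)
```

The point of the sketch is that cache memory stays bounded at `n_sink + window` entries no matter how many tokens are streamed in, while the sink tokens (which soak up disproportionate attention mass) are never evicted.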