optimize filling kv cache kernel in pytorch engine #1251

grimoire · 2024-03-06T02:10:10Z

RunningLeon · 2024-03-11T09:38:11Z

Tested OK on two models using OpenCompass.

lmdeploy/pytorch/models/deepseek.py

RunningLeon

LGTM

grimoire added 7 commits March 4, 2024 12:52

first version

917c2b1

Merge branch 'main' into torch-optimize-fill-kv-cache

461975c

fix models

b37f1ca

merge main

1491823

Merge branch 'main' into torch-optimize-fill-kv-cache

642fa4d

Merge branch 'main' into torch-optimize-fill-kv-cache

626d52a

remove history dependency

c171d1d

grimoire added the improvement label Mar 6, 2024

Merge branch 'main' into torch-optimize-fill-kv-cache

e8bc151

lvhan028 requested review from AllentDan and RunningLeon March 11, 2024 03:46

AllentDan approved these changes Mar 11, 2024

RunningLeon reviewed Mar 11, 2024

lmdeploy/pytorch/models/deepseek.py

Merge branch 'main' into torch-optimize-fill-kv-cache

7c796df

RunningLeon approved these changes Mar 11, 2024

lvhan028 changed the title ~~Torch optimize fill kv cache~~ optimize filling kv cache kernel in pytorch engine Mar 11, 2024

lvhan028 merged commit 1fcd6d3 into InternLM:main Mar 11, 2024
5 checks passed
