[Kernel] Fix input for flashinfer prefill wrapper. #7008

LiuXiaoxuanPKU · 2024-07-31T21:33:55Z

…-fix

…flashinfer-fix

github-actions · 2024-07-31T21:34:06Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which consists a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of default ones by unblocking the steps in your fast-check build on Buildkite UI.

Once the PR is approved and ready to go, please make sure to run full CI as it is required to merge (or just use auto-merge).

To run full CI, you can do one of these:

Comment /ready on the PR
Add ready label to the PR
Enable auto-merge.

🚀

LiuXiaoxuanPKU · 2024-07-31T21:45:06Z

/ready

comaniac

LGTM. Can you add a comment?
Also cc @SolitaryThinker

yzh119 · 2024-08-01T00:48:13Z

vllm/attention/backends/flashinfer.py

            assert self.paged_kv_indices is not None
            assert self.paged_kv_indptr is not None
            assert self.paged_kv_last_page_len is not None
-            self.paged_kv_indices = self.paged_kv_indices.to(self.device)


Per flashinfer-ai/flashinfer#362 (comment), perhaps we should keep this?

WoosukKwon · 2024-08-01T19:21:19Z

@LiuXiaoxuanPKU Just for a heads up: we just upgraded the PyTorch and FlashInfer versions in #6951

Signed-off-by: Alvant <[email protected]>

LiuXiaoxuanPKU added 7 commits July 11, 2024 11:19

remove flashinfer warning, add flashinfer to CI

f9c9ddc

update version

f9492f1

update version

0f686f5

fix input

056b388

Merge branch 'main' of github.com:LiuXiaoxuanPKU/vllm into flashinfer…

4c896c3

…-fix

Merge branch 'flashinfer-fix' of github.com:LiuXiaoxuanPKU/vllm into …

88fcc08

…flashinfer-fix

revert

619c554

LiuXiaoxuanPKU assigned comaniac and unassigned comaniac Jul 31, 2024

LiuXiaoxuanPKU requested a review from comaniac July 31, 2024 21:35

LiuXiaoxuanPKU mentioned this pull request Jul 31, 2024

RuntimeError: Out of workspace memory in AlignedAlloactor when there is a lot of GPU memory left flashinfer-ai/flashinfer#362

Closed

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Jul 31, 2024

comaniac approved these changes Jul 31, 2024

View reviewed changes

add comment

a197965

yzh119 reviewed Aug 1, 2024

View reviewed changes

ggbetz mentioned this pull request Aug 1, 2024

Evaluate: Gemma 2 logikon-ai/cot-eval#56

Closed

6 tasks

fix

0badabb

LiuXiaoxuanPKU enabled auto-merge (squash) August 1, 2024 15:34

WoosukKwon disabled auto-merge August 2, 2024 01:44

WoosukKwon merged commit 954f730 into main Aug 2, 2024
62 of 64 checks passed

WoosukKwon deleted the flashinfer-fix branch August 2, 2024 01:44

wwydmanski mentioned this pull request Aug 5, 2024

[Bug]: Unexpected Responses from Gemma-2-9b-it #7152

Closed

dtrifiro mentioned this pull request Aug 5, 2024

Sync with [email protected] opendatahub-io/vllm#120

Closed

kylesayrs pushed a commit to neuralmagic/vllm that referenced this pull request Aug 17, 2024

[Kernel] Fix input for flashinfer prefill wrapper. (vllm-project#7008)

7abe097

Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024

[Kernel] Fix input for flashinfer prefill wrapper. (vllm-project#7008)

3262279

Signed-off-by: Alvant <[email protected]>

KuntaiDu pushed a commit to KuntaiDu/vllm that referenced this pull request Nov 20, 2024

[Kernel] Fix input for flashinfer prefill wrapper. (vllm-project#7008)

5823be4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Kernel] Fix input for flashinfer prefill wrapper. #7008

[Kernel] Fix input for flashinfer prefill wrapper. #7008

LiuXiaoxuanPKU commented Jul 31, 2024

github-actions bot commented Jul 31, 2024

LiuXiaoxuanPKU commented Jul 31, 2024

comaniac left a comment

yzh119 Aug 1, 2024

WoosukKwon commented Aug 1, 2024

[Kernel] Fix input for flashinfer prefill wrapper. #7008

[Kernel] Fix input for flashinfer prefill wrapper. #7008

Conversation

LiuXiaoxuanPKU commented Jul 31, 2024

github-actions bot commented Jul 31, 2024

LiuXiaoxuanPKU commented Jul 31, 2024

comaniac left a comment

Choose a reason for hiding this comment

yzh119 Aug 1, 2024

Choose a reason for hiding this comment

WoosukKwon commented Aug 1, 2024