[BugFix] fix wrong output when using lora and num_scheduler_steps=8 #11161

sleepwalker2017 · 2024-12-13T06:28:13Z

github-actions · 2024-12-13T06:28:23Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

Add ready label to the PR
Enable auto-merge.

🚀

sleepwalker2017 · 2024-12-13T06:32:06Z

@jeejeelee Hi, I've sent a new pr, please help check it.

The dummy lora modules are not cleared when using multi-step runner. so the real loras are never loaded. that leads to the wrong result.

jeejeelee

Could you plz add unit test for this，maybe test_llama_tp is a good choice

jeejeelee · 2024-12-13T16:26:11Z

vllm/worker/worker.py

@@ -252,9 +252,6 @@ def determine_num_available_blocks(self) -> Tuple[int, int]:
            available_kv_cache_memory / (1024**3),
            self.cache_config.gpu_memory_utilization)

-        # Final cleanup
-        if self.model_runner.lora_manager:


We should keep this comment.

bug fix for issue 9688

348855f

sleepwalker2017 requested review from zhuohan123, youkaichao, alexm-redhat, comaniac and njhill as code owners December 13, 2024 06:28

fix typo

7711621

jeejeelee requested changes Dec 13, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BugFix] fix wrong output when using lora and num_scheduler_steps=8 #11161

[BugFix] fix wrong output when using lora and num_scheduler_steps=8 #11161

sleepwalker2017 commented Dec 13, 2024

github-actions bot commented Dec 13, 2024

sleepwalker2017 commented Dec 13, 2024 •

edited

Loading

jeejeelee left a comment

jeejeelee Dec 13, 2024

[BugFix] fix wrong output when using lora and num_scheduler_steps=8 #11161

Are you sure you want to change the base?

[BugFix] fix wrong output when using lora and num_scheduler_steps=8 #11161

Conversation

sleepwalker2017 commented Dec 13, 2024

github-actions bot commented Dec 13, 2024

sleepwalker2017 commented Dec 13, 2024 • edited Loading

jeejeelee left a comment

Choose a reason for hiding this comment

jeejeelee Dec 13, 2024

Choose a reason for hiding this comment

sleepwalker2017 commented Dec 13, 2024 •

edited

Loading