-
-
Notifications
You must be signed in to change notification settings - Fork 5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug]: AttributeError: 'NoneType' object has no attribute 'prefill_metadata' #5982
Comments
Please provide more information on your environment by running the command at the beginning of your post (under "Your current environment"). The issue seems to come from having incompatible packages installed so you might want to reset your Python environment. |
Hello, I've run into this same issue. Here's the the output of
FWIW, my deps are generated using pip compile with vllm pinned to |
How did you run vLLM? Can you show the command/code? |
Sure, here's the "model" code. It gets initialized in a webserver from which I want to serve the model:
|
I got a similar issue recently and it turns out that it's because vLLM cannot allocate blocks for the model. Here, I think you set Anyways, the |
Yeah I noticed this was removed - I wasn't able to build from source unfortunately so I'm stuck on the older version. Not a huge deal, I can wait for the next pypi release. |
The next release should be just around the corner! See #5806 for more details. |
Your current environment
🐛 Describe the bug
run LLaVA-NeXT | llava-hf/llava-v1.6-mistral-7b-hf
python -m vllm.entrypoints.openai.api_server --model /ai/LLaVA-NeXT --image-token-id 32000 --image-input-shape 1,3,336,336 --image-input-type pixel_values --image-feature-size 65856 --chat-template template_llava.jinja --host 19*** --port 10860 --trust-remote-code --tensor-parallel-size 1 --dtype=half --disable-custom-all-reduce
The text was updated successfully, but these errors were encountered: