v0.5.1 Release Tracker #5806

Closed
simon-mo opened this issue Jun 25, 2024 · 15 comments
Assignees
Labels
release Related to new version release

Comments

@simon-mo simon-mo added the misc label Jun 25, 2024
@simon-mo simon-mo self-assigned this Jun 25, 2024
@simon-mo simon-mo added release Related to new version release and removed misc labels Jun 25, 2024
@NiuBlibing
Contributor

Could you release nightly versions for easier testing?

@DarkLight1337
Member

DarkLight1337 commented Jun 26, 2024

For multi-modal support, we plan to only include new VLMs (#4986 is user-facing while #5591 is intended to be a component of other VLMs) and #5214 (which involves dev-facing changes) in this release. The other upcoming PRs such as #5276 introduce a sequence of breaking changes to users, so we will try to bundle them within a single major release (e.g. v0.6) to avoid continuously interrupting users.

@WangErXiao

Can Deepseek-V2 be merged in v0.5.1?

@simon-mo
Collaborator Author

Nightly is in the Q3 roadmap for CI/CD.

@sasha0552
Contributor

Hi. I hadn't seen this release tracker since it's not pinned, but could #4409 be included in the release? At the moment, at least ~10 users want Pascal support in vLLM.

#4409 (comment)
#5224 (comment)
https://github.com/sasha0552/vllm-ci/stargazers

@DarkLight1337
Member

DarkLight1337 commented Jul 2, 2024

> For multi-modal support, we plan to only include new VLMs (#4986 is user-facing while #5591 is intended to be a component of other VLMs) and #5214 (which involves dev-facing changes) in this release. The other upcoming PRs such as #5276 introduce a sequence of breaking changes to users, so we will try to bundle them within a single major release (e.g. v0.6) to avoid continuously interrupting users.

Since the release has been delayed, we have included those PRs in the release anyway, to avoid soft-blocking other PRs from getting merged. The expected user-facing breaking changes are:

  • Simplified engine args: Image-specific arguments have been removed from all entrypoints as we found them unnecessary.
  • Simplified interface for multimodal inputs: This affects usage of the LLM API class. On the other hand, the OpenAI-compatible server handles the conversion internally so end users remain unaffected.
    • No more repeating <image> tokens in the prompt - please follow the format documented on the HuggingFace repo ([Core] Dynamic image size support for VLMs #5276)
       # e.g. LLaVA-1.5 (llava-hf/llava-1.5-7b-hf)
       llm.generate({
      -    "prompt": "<image>" * 576 + "\nUSER: What is the content of this image?\nASSISTANT:",
      +    "prompt": "USER: <image>\nWhat is the content of this image?\nASSISTANT:",
           "multi_modal_data": multi_modal_data,
       })
    • Instead of passing ImagePixelData(pil_image), you should pass {"image": pil_image} to multimodal prompts ([VLM] Remove image_input_type from VLM config #5852)
       llm.generate({
           "prompt": prompt,
      -    "multi_modal_data": ImagePixelData(pil_image),
      +    "multi_modal_data": {"image": pil_image},
       })
    • ImagePixelData(tensor) and ImageFeatureData are no longer supported ([VLM] Remove image_input_type from VLM config #5852)
      • If you are currently using ImageFeatureData to represent multi-image inputs, please refrain from upgrading since we are going to replace it with embeddings soon (see below).
    • We will support multi-modal embeddings in an upcoming PR to be included in the next release. Expect the format to be along the lines of:
       llm.generate({
           "prompt": prompt,
      -    "multi_modal_data": ImageFeatureData(feature_tensor),
      +    "multi_modal_data": {"image": {"embeds": model.multi_modal_projector(feature_tensor)}},  # Or just pass the embeddings directly
       })
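
Putting the changes above together, here is a minimal end-to-end sketch of the updated LLM-class interface. It assumes vLLM v0.5.1 and the llava-hf/llava-1.5-7b-hf checkpoint from the diff above; the image path and sampling settings are illustrative placeholders, not part of the changes described here.

```python
# Minimal sketch of the post-#5276 / #5852 multimodal interface, assuming
# vLLM v0.5.1 and llava-hf/llava-1.5-7b-hf; "example.jpg" and the sampling
# settings are illustrative placeholders.
from PIL import Image

from vllm import LLM, SamplingParams

llm = LLM(model="llava-hf/llava-1.5-7b-hf")

# A single <image> placeholder, following the prompt format documented on the
# HuggingFace model card (no more repeating the token 576 times).
prompt = "USER: <image>\nWhat is the content of this image?\nASSISTANT:"
image = Image.open("example.jpg")

outputs = llm.generate(
    {
        "prompt": prompt,
        # A plain dict keyed by modality replaces ImagePixelData/ImageFeatureData.
        "multi_modal_data": {"image": image},
    },
    sampling_params=SamplingParams(temperature=0, max_tokens=64),
)
print(outputs[0].outputs[0].text)
```

The OpenAI-compatible server performs this conversion internally, so clients talking to the HTTP endpoints should not need any changes.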

@DarkLight1337
Member

@simon-mo btw this thread is not pinned

@WangErXiao

Can #5358 be merged in v0.5.1?

@DarkLight1337
Member

DarkLight1337 commented Jul 2, 2024

> Can #5358 be merged in v0.5.1?

Very unlikely, since the author of the PR has not resolved the merge conflicts yet. Not to mention that #5852 and #5276 (scheduled to merge before v0.5.1) will introduce further merge conflicts.

@WoosukKwon
Collaborator

Please add #6051 for Gemma 2

@njhill
Member

njhill commented Jul 2, 2024

Small fix #6079 is ready and would be good to include if possible.

@ywang96
Member

ywang96 commented Jul 3, 2024

Please also add #6089. I plan to merge it by noon, as it is the final piece of this cycle's milestone for multi-modality support refactoring and a user-facing change we need to include in this release.

Update: #6089 is merged!

@DarkLight1337
Member

It would be nice if we could get #5979 into this release; otherwise we won't see its effects until the release after this one...

@huangchen007

Will this release be cut today?

@simon-mo
Collaborator Author

simon-mo commented Jul 5, 2024

Cutting now, ETA today.

A GitHub incident (https://www.githubstatus.com/incidents/5yx1d67vq9hg) means we can't trigger CI :(. Will wait and retry.

@simon-mo simon-mo closed this as completed Jul 6, 2024