Upgrade to the latest vLLM version 09/18 (#4)
Merged
Jeffwan merged 247 commits into aibrix:main from vllm-project:main on Sep 19, 2024
+39,289 −10,397
Commits
Commits on Aug 26, 2024
Commits on Aug 27, 2024
Commits on Aug 28, 2024
- [Bugfix] Allow ScalarType to be compiled with pytorch 2.3 and add checks for registering FakeScalarType and dynamo support. (#7886)
- [Kernel] [Triton] [AMD] Adding Triton implementations awq_dequantize and awq_gemm to support AWQ (#7386)
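The AWQ commit above adds Triton kernels that dequantize 4-bit weights on the fly. As a rough, hedged illustration of what an `awq_dequantize`-style routine computes, here is a pure-Python sketch; it assumes a simplified sequential nibble packing, whereas the real AWQ layout is interleaved and the real kernel runs in Triton on GPU.

```python
def pack_int4(values):
    """Pack eight 4-bit unsigned values (0..15) into one 32-bit word.

    Simplified sequential packing; real AWQ uses an interleaved order.
    """
    assert len(values) == 8 and all(0 <= v <= 15 for v in values)
    word = 0
    for i, v in enumerate(values):
        word |= v << (4 * i)
    return word


def unpack_int4(word):
    """Recover the eight 4-bit values from a packed 32-bit word."""
    return [(word >> (4 * i)) & 0xF for i in range(8)]


def dequantize(packed_word, scale, zero):
    """AWQ-style affine dequantization: w = (q - zero) * scale."""
    return [(q - zero) * scale for q in unpack_int4(packed_word)]


packed = pack_int4([3, 7, 0, 15, 8, 1, 12, 5])
weights = dequantize(packed, scale=0.1, zero=8)
```

Packing eight weights per 32-bit word is what gives 4-bit quantization its 4x memory reduction over fp16; the per-group `scale` and `zero` are looked up per weight group in the real kernel.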
Commits on Aug 29, 2024
- [Core][Kernels] Enable FP8 KV Cache with Flashinfer backend. + BugFix for kv_cache_dtype=auto (#7985)
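An FP8 KV cache halves KV-cache memory by storing keys and values in 8 bits instead of 16. As a toy illustration only (not vLLM's actual FP8 path, which uses hardware formats and a scale factor), an e5m2-style value keeps two explicit mantissa bits, so cached values round to three significant binary digits:

```python
import math


def round_e5m2(x: float) -> float:
    """Round x to the nearest value with an e5m2-style mantissa
    (1 implicit + 2 explicit bits = 3 significant binary digits).

    Toy model: ignores e5m2's exponent range, subnormals, and
    saturation; real FP8 KV-cache code also applies scaling.
    """
    if x == 0.0:
        return 0.0
    m, e = math.frexp(x)            # x = m * 2**e, with 0.5 <= |m| < 1
    return math.ldexp(round(m * 8) / 8, e)
```

Nearby values collapse onto a coarse grid (1.0, 1.25, 1.5, 1.75, ...), which is the precision cost traded for the memory savings.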
Commits on Aug 30, 2024
Commits on Aug 31, 2024
Commits on Sep 1, 2024
Commits on Sep 2, 2024
Commits on Sep 3, 2024
Commits on Sep 4, 2024
Commits on Sep 5, 2024
- [Documentation][Spec Decode] Add documentation about lossless guarantees in Speculative Decoding in vLLM (#7962)
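The lossless guarantee in speculative decoding comes from modified rejection sampling: a draft token t is accepted with probability min(1, p(t)/q(t)), where p is the target model's distribution and q the draft model's, and on rejection a token is resampled from the normalized residual max(0, p − q). A hedged sketch of that rule (function names are illustrative, not vLLM's API):

```python
import random


def accept_draft_token(token: int, p: list, q: list, rng=random) -> bool:
    """Accept a draft token with probability min(1, p[token] / q[token]).

    p: target-model probabilities, q: draft-model probabilities.
    This acceptance rule makes accepted tokens distributed exactly
    as p, which is what makes speculative decoding lossless.
    """
    ratio = p[token] / q[token]
    return ratio >= 1.0 or rng.random() < ratio


def residual_distribution(p: list, q: list) -> list:
    """On rejection, resample from the normalized residual max(0, p - q)."""
    residual = [max(0.0, pi - qi) for pi, qi in zip(p, q)]
    total = sum(residual)
    return [r / total for r in residual]
```

Note that a draft token the target model likes at least as much as the draft model (ratio ≥ 1) is always accepted, so a well-matched draft model gives high acceptance rates at zero quality cost.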
Commits on Sep 6, 2024
Commits on Sep 7, 2024
Commits on Sep 8, 2024
Commits on Sep 9, 2024
Commits on Sep 10, 2024
- [MISC] Keep chunked prefill enabled by default with long context when prefix caching is enabled (#8342)
Commits on Sep 11, 2024
Commits on Sep 12, 2024
- [Hotfix][Core][VLM] Disable chunked prefill by default and prefix caching for multimodal models (#8425)
Commits on Sep 13, 2024
Commits on Sep 14, 2024
Commits on Sep 16, 2024
Commits on Sep 17, 2024
Commits on Sep 18, 2024