Skip to content

Pull requests: HabanaAI/vllm-fork

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Delayed cherrypick
#877 opened Feb 28, 2025 by kamil-kaczor Draft
[Gaudi][Model] Qwen2.5-vl New Model Issue o PR to enable a new model
#870 opened Feb 26, 2025 by malkomes Loading…
[CI] Add APC tests
#866 opened Feb 25, 2025 by kzawora-intel Loading…
Update Dockerfile.hpu
#864 opened Feb 25, 2025 by michalkuligowski Draft
Draft: Another attempt at v1 HPU integration
#831 opened Feb 14, 2025 by kzawora-intel Draft
19 of 23 tasks
Resolve Speculative Decode RTE
#823 opened Feb 13, 2025 by tannervoas742 Loading…
enable LoRA for embedding models
#821 opened Feb 12, 2025 by skaulintel Loading…
Support qwenvl model for HPU New Model Issue o PR to enable a new model
#793 opened Feb 7, 2025 by yingjie-han Loading…
Enable roberta embedding
#786 opened Feb 5, 2025 by yeonsily Loading…
Delayed sampling
#720 opened Jan 22, 2025 by mfylcek Draft
ProTip! Mix and match filters to narrow down what you’re looking for.