forked from vllm-project/vllm
Pull requests: HabanaAI/vllm-fork
Pull requests list
Added the logic to fix the warmup phase for spec decoding when enforce_eager is not used
#880
opened Feb 28, 2025 by
pallavijaini0525
[Gaudi][Model] Qwen2.5-vl
New Model
Issue or PR to enable a new model
#870
opened Feb 26, 2025 by
malkomes
Update requirements-hpu.txt for open telemetry tracing support
#857
opened Feb 21, 2025 by
louie-tsai
enable multi-modal embedding for TIGER-Lab/VLM2Vec-Full T+I on HPU
#854
opened Feb 20, 2025 by
libinta
Draft: Another attempt at v1 HPU integration
#831
opened Feb 14, 2025 by
kzawora-intel
Draft
19 of 23 tasks
Extend accuracy tests for models that we support
#824
opened Feb 13, 2025 by
AnetaKaczynska
Update documentation to reflect current bucket defaults
#817
opened Feb 12, 2025 by
nngokhale
Support qwenvl model for HPU
New Model
Issue or PR to enable a new model
#793
opened Feb 7, 2025 by
yingjie-han
[DEEPSEEK_V3/R1] includes features of fp8 dequant, MLA, Expert parallelism
#792
opened Feb 6, 2025 by
xuechendi
[DO NOT MERGE][PoC] Mark dynamic shapes in torch.compile mode
#755
opened Jan 29, 2025 by
kzawora-intel
Draft
make benchmark_throughput static support single image input
#718
opened Jan 22, 2025 by
yma11