Skip to content

Activity

Fix prefix caching abstract type

ShawnD200pushed 1 commit to cache-policy-framework • 4b1e9a5…d9b3979 • 
1 hour ago

Adapt to flash-attention and fashinfer backend (previously only xform…

ShawnD200pushed 2 commits to cache-policy-framework • 1ded7d1…4b1e9a5 • 
4 hours ago

Fix type

ShawnD200pushed 1 commit to fix-start-idx • 6a4f541…d67fd0b • 
19 hours ago

Fix rebase bug and static check error

ShawnD200pushed 1 commit to cache-policy-framework • de7c117…1ded7d1 • 
3 days ago

Delete block_table

ShawnD200created cache-policy-framework • de7c117 • 
4 days ago

Fix call instances and yapf warning

ShawnD200pushed 1 commit to fix-start-idx • b8cf94f…6a4f541 • 
5 days ago

[Bugfix] Fix start_idx for computing slot mapping to avoid uninitiali…

ShawnD200created fix-start-idx • b8cf94f • 
5 days ago

[Doc][4/N] Reorganize API Reference (vllm-project#11843)

ShawnD200pushed 156 commits to main • 3f3e92e…6cd40a5 • 
5 days ago

[Model] Automatic conversion of classification and reward models (vll…

ShawnD200pushed 314 commits to main • 1f6584e…3f3e92e • 
20 days ago

Address clang-format errors

ShawnD200pushed 1 commit to Refactor-CPU-Types-ISAs • cb4b451…b5bc729 • 
on Nov 30, 2024

[Hardware][CPU] Refactor CPU vector types for ISAs

ShawnD200created Refactor-CPU-Types-ISAs • cb4b451 • 
on Nov 30, 2024

[V1] Enable profile for LLMEngine (vllm-project#10665)

ShawnD200pushed 131 commits to main • c4e4643…1f6584e • 
on Nov 26, 2024

Break long lines and adjust imports

Force push
ShawnD200force pushed to Add-Arm-CPU-backend • 856d93d…8d554d6 • 
on Nov 19, 2024

Break long lines

Force push
ShawnD200force pushed to Add-Arm-CPU-backend • 1209280…856d93d • 
on Nov 19, 2024

Break long lines

ShawnD200pushed 1 commit to Add-Arm-CPU-backend • 4b2fdef…1209280 • 
on Nov 19, 2024

Merge remote-tracking branch 'origin/main' into Add-Arm-CPU-backend

ShawnD200pushed 53 commits to Add-Arm-CPU-backend • c45979f…4b2fdef • 
on Nov 18, 2024

[Misc] Add uninitialized params tracking for AutoWeightsLoader (vll…

ShawnD200pushed 51 commits to main • ac49b59…c4e4643 • 
on Nov 18, 2024

Resolve merge conflicts

ShawnD200pushed 215 commits to Add-Arm-CPU-backend • 0350a3d…c45979f • 
on Nov 14, 2024

[Bugfix] bitsandbytes models fail to run pipeline parallel (vllm-proj…

ShawnD200pushed 50 commits to main • ad9a78b…ac49b59 • 
on Nov 14, 2024

[Doc] Fix typo error in vllm/entrypoints/openai/cli_args.py (vllm-pro…

ShawnD200pushed 33 commits to main • f4c2187…ad9a78b • 
on Nov 11, 2024

remove files

youkaichaopushed 1 commit to Convert-traces-to-perfetto-events • b7f472d…0464d8c • 
on Nov 10, 2024

add username

youkaichaopushed 1 commit to Convert-traces-to-perfetto-events • a2d0348…b7f472d • 
on Nov 10, 2024

fix format

youkaichaopushed 1 commit to Convert-traces-to-perfetto-events • 97823b4…a2d0348 • 
on Nov 10, 2024

Merge branch 'main' into Convert-traces-to-perfetto-events

youkaichaopushed 31 commits to Convert-traces-to-perfetto-events • 6c97c34…97823b4 • 
on Nov 10, 2024

Resolved merge conflicts

ShawnD200pushed 125 commits to Convert-traces-to-perfetto-events • b663e8e…6c97c34 • 
on Nov 8, 2024
ShawnD200pushed 99 commits to main • 74b529c…f4c2187 • 
on Nov 8, 2024

[Misc] Address comments

ShawnD200pushed 1 commit to Convert-traces-to-perfetto-events • 7da9e77…b663e8e • 
on Nov 7, 2024

[Doc] Fix references

ShawnD200pushed 1 commit to Add-Arm-CPU-backend • 2789754…0350a3d • 
on Nov 3, 2024

[bugfix] fix chatglm dummy_data_for_glmv (vllm-project#9955)

ShawnD200pushed 32 commits to main • 3ea2dc2…74b529c • 
on Nov 2, 2024

[Hardward][CPU] Add ARM CPU backend

Force push
ShawnD200force pushed to Add-Arm-CPU-backend • 0b8a00f…2789754 • 
on Nov 2, 2024