Skip to content

Actions: leiwen83/vllm

mypy

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
19 workflow runs
19 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[ Misc ] Remove separate bias add (#6353)
mypy #19: Commit 6047187 pushed by leiwen83
July 12, 2024 07:16 34s main
July 12, 2024 07:16 34s
Fix w8a8 benchmark and add Llama-3-8B (#5562)
mypy #17: Commit e2b85cf pushed by leiwen83
June 17, 2024 10:30 34s main
June 17, 2024 10:30 34s
June 9, 2024 13:31 31s
June 7, 2024 02:24 5m 22s
[Misc] Make Serving Benchmark More User-friendly (#5044)
mypy #14: Commit f17a1a8 pushed by leiwen83
May 26, 2024 07:30 32s main
May 26, 2024 07:30 32s
May 11, 2024 08:28 34s
[Doc] Chunked Prefill Documentation (#4580)
mypy #11: Commit 36fb68f pushed by leiwen83
May 4, 2024 11:23 32s main
May 4, 2024 11:23 32s
[Core][Distributed] enable multiple tp group (#4512)
mypy #9: Commit 2a85f93 pushed by leiwen83
May 2, 2024 08:57 32s main
May 2, 2024 08:57 32s
[Misc]Add customized information for models (#4132)
mypy #8: Commit d6f4bd7 pushed by leiwen83
May 1, 2024 10:02 30s main
May 1, 2024 10:02 30s
[Misc] Upgrade to torch==2.3.0 (#4454)
mypy #7: Commit d627a3d pushed by leiwen83
April 30, 2024 02:54 30s main
April 30, 2024 02:54 30s
[Kernel] Full Tensor Parallelism for LoRA Layers (#3524)
mypy #6: Commit eefeb16 pushed by leiwen83
April 27, 2024 07:37 28s main
April 27, 2024 07:37 28s
[Misc] Use public API in benchmark_throughput (#4300)
mypy #5: Commit a395a63 pushed by leiwen83
April 25, 2024 02:29 34m 44s main
April 25, 2024 02:29 34m 44s
[CI][Build] change pynvml to nvidia-ml-py (#4302)
mypy #4: Commit e4bf860 pushed by leiwen83
April 24, 2024 01:41 33s main
April 24, 2024 01:41 33s
April 23, 2024 08:45 31s
April 20, 2024 14:24 29s
LM Format Enforcer Guided Decoding Support (#3868)
mypy #1: Commit 0543476 pushed by leiwen83
April 16, 2024 12:17 39s main
April 16, 2024 12:17 39s