[Core] Reduce TTFT with concurrent partial prefills #22784
Annotations
8 errors
Mypy:
vllm/core/scheduler.py#L718
"Scheduler" has no attribute "_get_num_new_uncached_and_cached_tokens" [attr-defined]
|
Mypy:
vllm/core/scheduler.py#L906
"Scheduler" has no attribute "_get_num_new_uncached_and_cached_tokens" [attr-defined]
|
Mypy:
vllm/core/scheduler.py#L1008
"Scheduler" has no attribute "_get_num_new_uncached_and_cached_tokens" [attr-defined]
|
Mypy:
vllm/core/scheduler.py#L1031
"Scheduler" has no attribute "_get_num_new_uncached_and_cached_tokens" [attr-defined]
|
Mypy:
vllm/core/scheduler.py#L1116
"Scheduler" has no attribute "_get_num_new_uncached_and_cached_tokens" [attr-defined]
|
Mypy:
vllm/core/scheduler.py#L2050
Name "self" is not defined [name-defined]
|
Mypy:
vllm/core/scheduler.py#L2050
Name "seq_group" is not defined [name-defined]
|
Mypy
Process completed with exit code 1.
|
Loading