
[Doc][CI/Build] Update docs and tests to use vllm serve #6431

Merged
12 commits merged into vllm-project:main from vllm-serve-docs on Jul 17, 2024

Conversation

@DarkLight1337 (Member) commented on Jul 15, 2024

Follow-up to #5090. As a sanity check, I have also updated the entrypoints tests to use the new CLI.

After this, we can update the Docker images and performance benchmarks to use the new CLI.

cc @EthanqX
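
For reference, a minimal sketch of the command change this PR documents (the model name below is only illustrative, not taken from this PR):

```bash
# Old invocation via the module entry point
python -m vllm.entrypoints.openai.api_server --model facebook/opt-125m

# New CLI invocation; the model is passed as a positional argument
vllm serve facebook/opt-125m
```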


👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, they only trigger the fastcheck CI, which consists of a small and essential subset of tests to quickly catch errors, with the flexibility to run extra individual tests on top (you can do this by unblocking test steps in the Buildkite run).

A full CI run is still required to merge this PR, so once the PR is ready to go, please make sure to run it. If you need all test signals in between PR commits, you can trigger a full CI run as well.

To run full CI, you can do one of these:

  • Comment `/ready` on the PR
  • Add the `ready` label to the PR
  • Enable auto-merge

🚀

@simon-mo (Collaborator)

I want to hold this off until the release, so that people visiting the nightly docs can directly use the CLI.

@simon-mo (Collaborator)

In the meantime, please feel free to start improving it!!!

@DarkLight1337 DarkLight1337 changed the title [Doc] Update docs to use vllm serve [Doc][CI/Build] Update docs and tests to use vllm serve Jul 15, 2024
@mgoin (Member) left a comment


Does `--model` still work, or will it cause issues with `vllm serve`? I'm curious whether it is sufficient to replace `python -m vllm.entrypoints.openai.api_server` with `vllm serve`, or whether it specifically replaces `python -m vllm.entrypoints.openai.api_server --model`.

@DarkLight1337 (Member, Author) commented on Jul 16, 2024

> Does `--model` still work, or will it cause issues with `vllm serve`?

`model_tag` is a required positional argument of `vllm serve`, so you have to pass it as `vllm serve <model_tag>` regardless of whether `--model` is passed. The value passed to `--model` is actually overwritten by `model_tag` (we should not allow both to be passed at the same time, to be honest).
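
A quick sketch of the behavior described above (the model names are illustrative, not from this PR):

```bash
# model_tag is a required positional argument
vllm serve facebook/opt-125m

# If --model is also supplied, its value is overwritten by the positional
# model_tag, so this still serves facebook/opt-125m
vllm serve facebook/opt-125m --model some-other-model
```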

@mgoin (Member) left a comment


Thanks for the clarification, this is a nice PR to update usage. LGTM!

@DarkLight1337 (Member, Author)

@simon-mo since v0.5.2 has been released, can we merge this?

@simon-mo simon-mo enabled auto-merge (squash) July 16, 2024 15:33
@github-actions github-actions bot added the `ready` label (ONLY add when PR is ready to merge/full CI is needed) Jul 16, 2024
@simon-mo simon-mo merged commit 5bf35a9 into vllm-project:main Jul 17, 2024
71 of 72 checks passed
@DarkLight1337 DarkLight1337 deleted the vllm-serve-docs branch July 17, 2024 07:43
dtrifiro pushed a commit to opendatahub-io/vllm that referenced this pull request Jul 17, 2024
fialhocoelho pushed a commit to opendatahub-io/vllm that referenced this pull request Jul 19, 2024
xjpang pushed a commit to xjpang/vllm that referenced this pull request Jul 24, 2024
gnpinkert pushed a commit to gnpinkert/vllm that referenced this pull request Jul 26, 2024
Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024