Compatibility with vLLM with `tensor_parallel_size` argument #805
Merged: gabrielmbmb merged 10 commits into develop from compatibility-vllm-tensor-parallel-size on Jul 23, 2024
Conversation
Documentation for this PR has been built. You can view it at: https://distilabel.argilla.io/pr-805/
CodSpeed Performance Report: Merging #805 will not alter performance.
plaguss approved these changes on Jul 23, 2024
Co-authored-by: Agus <[email protected]>
…hub.com/argilla-io/rlxf into compatibility-vllm-tensor-parallel-size
Description
This PR adds a few changes that enable using `vLLM` with the `tensor_parallel_size` argument. This argument enables the use of multiple GPUs with vLLM, and to do so vLLM uses `multiprocessing` or `ray`:
- `multiprocessing` approach: it was not working, as `vLLM` was trying to create new processes but was not able to, because the process that `distilabel` creates is a daemon process, which is not allowed to create child processes. To bypass this issue, a `_NoDaemonPool` class has been created that creates non-daemon processes, and it's used in `Pipeline`. A sketch of this pattern is shown after this list.
- `ray`: it works when installing the version on the `main` branch, which includes the changes of [Core] Introduce SPMD worker execution using Ray accelerated DAG vllm-project/vllm#6032. The `VLLM_USE_RAY_COMPILED_DAG=1` and `VLLM_USE_RAY_SPMD_WORKER=1` environment variables need to be set; see the usage sketch after this list. Update: `vllm==0.5.3` has been released, which includes the changes needed.
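
The daemon restriction mentioned in the first bullet comes from Python's `multiprocessing`: a daemonic process is not allowed to have children. Below is a minimal sketch of the well-known non-daemon pool pattern; the class names follow the PR description, but the actual implementation merged into `distilabel` may differ.

```python
import multiprocessing
import multiprocessing.pool


class _NoDaemonProcess(multiprocessing.Process):
    """Process that always reports itself as non-daemonic, so it is
    allowed to spawn its own children (e.g. vLLM worker processes)."""

    @property
    def daemon(self) -> bool:
        return False

    @daemon.setter
    def daemon(self, value: bool) -> None:
        # Ignore attempts to mark the process as a daemon.
        pass


class _NoDaemonContext(type(multiprocessing.get_context())):
    """Multiprocessing context whose processes are non-daemonic."""

    Process = _NoDaemonProcess


class _NoDaemonPool(multiprocessing.pool.Pool):
    """A `Pool` whose workers are non-daemon processes, so code running
    inside them (like vLLM) can create child processes."""

    def __init__(self, *args, **kwargs):
        kwargs["context"] = _NoDaemonContext()
        super().__init__(*args, **kwargs)
```

Overriding the `daemon` property (and silently ignoring the setter) is what lets the pool's workers spawn the child processes that vLLM needs for tensor parallelism.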
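
For the `ray` backend, the two environment variables can be exported before the model is loaded. The snippet below is a hypothetical usage sketch only: the `distilabel.llms.vLLM` import path, the `extra_kwargs` parameter (assumed to be forwarded to the underlying `vllm.LLM` constructor), and the model name are illustrative assumptions rather than details taken from this PR.

```python
import os

# Per the PR description, only needed for vLLM versions before 0.5.3:
os.environ["VLLM_USE_RAY_COMPILED_DAG"] = "1"
os.environ["VLLM_USE_RAY_SPMD_WORKER"] = "1"

from distilabel.llms import vLLM  # assumed import path

llm = vLLM(
    model="meta-llama/Meta-Llama-3-8B-Instruct",  # illustrative model
    extra_kwargs={"tensor_parallel_size": 2},  # shard the model across 2 GPUs
)
llm.load()
```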