Skip to content

Commit

Permalink
update Memory usage && rename image && pinned version for torch torch…
Browse files Browse the repository at this point in the history
…vision
  • Loading branch information
cennn committed Dec 16, 2024
1 parent d5d2911 commit 075ab9d
Show file tree
Hide file tree
Showing 2 changed files with 4 additions and 4 deletions.
4 changes: 2 additions & 2 deletions docs/source/serving/deploying_with_docker.rst
Original file line number Diff line number Diff line change
Expand Up @@ -51,11 +51,11 @@ of PyTorch Nightly and should be considered **experimental**. Using the flag `--

.. code-block:: console
# Example of building on Nvidia GH200 server. (Memory usage: ~180GB, Build time: ~2387s / ~40 min)
# Example of building on Nvidia GH200 server. (Memory usage: ~12GB, Build time: ~1475s / ~25 min, Image size: 7.26GB)
$ DOCKER_BUILDKIT=1 sudo docker build . \
--target vllm-openai \
-platform "linux/arm64" \
-t drikster80/vllm-gh200-openai:v0.6.4.post1 \
-t vllm/vllm-gh200-openai:latest \
--build-arg max_jobs=66 \
--build-arg nvcc_threads=2 \
--build-arg torch_cuda_arch_list="9.0+PTX" \
Expand Down
4 changes: 2 additions & 2 deletions requirements-cuda-arm64.txt
Original file line number Diff line number Diff line change
@@ -1,3 +1,3 @@
--index-url https://download.pytorch.org/whl/nightly/cu124
torchvision; platform_machine == 'aarch64'
torch; platform_machine == 'aarch64'
torchvision==0.22.0.dev20241215; platform_machine == 'aarch64'
torch==2.6.0.dev20241210+cu124; platform_machine == 'aarch64'

0 comments on commit 075ab9d

Please sign in to comment.