Venv creation and uv support #245

jarlsondre · 2024-11-13T13:08:05Z

Summary

This PR changes how we create venvs, reducing our reliance on large scripts such as generic_torch.sh. It also lets us use uv, which is considerably faster than plain pip. uv remains optional: it is still possible to use only pip.

I wrote a short tutorial on the uv workflow; see the uv-tutorial.md file added in this PR.

Motivation

Relying on the generic_torch.sh script has several disadvantages. First, we cannot declare all of our dependencies in pyproject.toml, so a simple pip install itwinai is not possible at the moment. Second, the script is messy: the same if statements, such as "if cuda" (but in shell syntax), are repeated throughout, which makes the script hard to build upon. Third, because we run a series of separate pip install x statements, the dependency resolver never sees the full dependency graph and cannot solve it in one pass. As a result, many packages are installed only to be uninstalled by the next pip install statement, which makes the script take much longer than needed.
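To illustrate the third point, here is a minimal Python sketch (not code from this PR) of passing every requirement to the installer in a single invocation, so the resolver can consider them together. The function name `build_install_command` and the package specifiers are hypothetical and chosen only for illustration.

```python
import shlex


def build_install_command(requirements, use_uv=False):
    """Return ONE installer invocation that covers every requirement.

    Passing all requirements at once lets the resolver see them together,
    instead of resolving each `pip install x` in isolation.
    """
    if not requirements:
        raise ValueError("no requirements given")
    # `uv pip install` is uv's pip-compatible interface; plain pip is the fallback.
    base = ["uv", "pip", "install"] if use_uv else ["python", "-m", "pip", "install"]
    return base + list(requirements)


# Hypothetical requirement specifiers, for illustration only.
reqs = ["torch", "lightning", "mlflow"]
print(shlex.join(build_install_command(reqs, use_uv=True)))
```

Declaring the dependencies in pyproject.toml has the same effect, since the installer then receives the whole set at once.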

Noteworthy

Because part of this PR is about transitioning to uv, I have renamed many of the venvs we use to just .venv, which seems to work better with uv in some cases. We could also keep our old venvs and symlink them to .venv, so that we don't have to come up with new names if we ever change systems again. This should hopefully streamline our naming convention.
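The symlink idea could look roughly like the following Python sketch. It is an assumption about how this might be done, not something this PR implements; the helper name `link_venv` is hypothetical.

```python
from pathlib import Path


def link_venv(existing_venv: Path, link: Path) -> Path:
    """Point `link` (for example .venv) at an already-created virtual environment."""
    if not existing_venv.is_dir():
        raise FileNotFoundError(f"{existing_venv} is not a directory")
    if link.is_symlink():
        # Replace a stale link left over from a previous system or venv.
        link.unlink()
    elif link.exists():
        # Refuse to clobber a real directory or file.
        raise FileExistsError(f"{link} exists and is not a symlink")
    link.symlink_to(existing_venv.resolve(), target_is_directory=True)
    return link
```

For instance, `link_venv(Path(".venv-old"), Path(".venv"))` would let tools that expect `.venv` reuse an existing environment, where `.venv-old` is a made-up directory name.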

Related issue: #244

* Refactor Dockerfiles * Refactor container gen script * ADD jlab dockerfile * First working version of jlab container * ADD CMCC requirements * update dockerfiles * ADD nvconda and refactor * Update containers * ADD containers * ADD simple plus dockerfile * Update NV deps * Update CUDA * Add comment * Cleanup * Cleanup * UPDATE README * Refactor * Fix linter * Refactor dockerfiles and improve tests * Refactor * Refactor * Fix * Add first tests for HPC * First broken tests for HPC * Update tests and strategy * UPDATE tests * Update horovod tests * Update tests and jlab deps * Add MLFLow tracking URI * ADD distributed trainer tests * mpirun container deepspeed * Fix distributed strategy tests on multi-node * ADD srun launcher * Refactor jobscript * Cleanup * isort tests * Refactor scripts * Minor fixes * Add logging to file for all workers * Add jupyter base files * Add jupyter base files * spelling * Update provenance deps * Update DS version * Update prov docs * Cleanup * add nvidia dep * Remove incomplete work * update pyproject * ADD hadolit config file * FIX flag * Fix linters * Refactor * Update prov4ml * Update pytest CI * Minor fix * Incorporate feedback * Update Dockerfiles * Incorporate feedback * Update comments * Refactor tests

matbun · 2024-11-25T15:51:52Z

If this PR is merged after #249, we need to update pyproject.toml to use the new-main version of the provenance logger. Other files may need to be updated accordingly.

jarlsondre added 8 commits November 11, 2024 15:36

add empty requirements file for cuda

fa3dc1f

add requirements files and update pyproject toml

e9babf9

update pyproject

e994bf4

update installation in pyproject.toml

4b32a05

update readme and horovod installation script

39e5801

update readme with horovod explanation

c9d786b

update horovod installation script

8932f36

update readme with -e flag

0906e33

jarlsondre added the enhancement (New feature or request) label Nov 13, 2024

jarlsondre requested review from matbun and annaelisalappe November 13, 2024 13:08

jarlsondre self-assigned this Nov 13, 2024

jarlsondre marked this pull request as draft November 13, 2024 13:08

jarlsondre added 4 commits November 13, 2024 14:12

fix linter readme errors

0d588ad

add more info to readme

750618f

trailing whitespace 🙃

00f4454

trailing whitespace 🙃 (again)

ae89e0c

matbun reviewed Nov 13, 2024

jarlsondre and others added 12 commits November 13, 2024 15:41

add draft of table of contents to readme

149a536

update readme toc

337ebd9

update readme toc again

7b1cff9

add section about uv lock to readme

2457826

update toc of readme

4940963

fix errors in readme

ddc7d13

add version numbers to packages in pyproject.toml

abff6c1

remove uv.lock (for now)

4eb5352

remove link from readme

c9cbcef

put toc in html comment

eb163ef

remove toc, remove ds and horovod from reqs, add docs comment to pyproj

a99a674

jarlsondre added 8 commits November 19, 2024 09:40

add uv installation command to readme

3ac9313

add requirements files and update pyproject toml

f751912

update pyproject

6f9c5c1

update installation in pyproject.toml

6e65624

add version numbers to packages in pyproject.toml

0a731ed

update horovod install script and add pip as dependency

def18fd

fix merge conflicts

7379659

formatting

6c8f4db

jarlsondre marked this pull request as ready for review November 19, 2024 16:28

jarlsondre added 4 commits November 19, 2024 17:30

fix linting

690bed3

trailing whitespace

9412e48

remove comment from readme

a23583a

remove comments and small formatting difference

60cbc6f

annaelisalappe reviewed Nov 20, 2024

matbun reviewed Nov 26, 2024

jarlsondre added 14 commits November 28, 2024 10:51

move uv tutorial under docs/

8f88c3e

merge with main

de202e2

update readme with nvidia and amd instead of linux

018cc47

remove duplicate entries in pyproject and reformat distributed file

6895472

update readmes

69e1dd2

separate horovod ds installation script into two files

bb815e6

fix linting errors and update dependencies

d06dfe9

fix tests and update lockfile

a368cc0

fix linting errors

166f1ec

update installation scripts for testing

59302f5

add local test command

402598c

add tf to installation in readme

93af263

add torch cuda to project dependencies

81fa4a3

remove index from tutorial

d02a9cf
