Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Scalability report update 2 #229

Closed
wants to merge 261 commits into from
Closed

Scalability report update 2 #229

wants to merge 261 commits into from
This pull request is big! We’re only showing the most recent 250 commits.

Commits on Aug 3, 2023

  1. update

    matbun committed Aug 3, 2023
    Configuration menu
    Copy the full SHA
    12a595c View commit details
    Browse the repository at this point in the history

Commits on Aug 9, 2023

  1. Configuration menu
    Copy the full SHA
    5fec89a View commit details
    Browse the repository at this point in the history

Commits on Aug 30, 2023

  1. Configuration menu
    Copy the full SHA
    d0bfbbd View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    f41fb6d View commit details
    Browse the repository at this point in the history

Commits on Sep 6, 2023

  1. Configuration menu
    Copy the full SHA
    d6359dc View commit details
    Browse the repository at this point in the history

Commits on Sep 8, 2023

  1. ADD: download on login node

    matbun committed Sep 8, 2023
    Configuration menu
    Copy the full SHA
    ea63b99 View commit details
    Browse the repository at this point in the history

Commits on Sep 12, 2023

  1. FIX: dataloaders in Trainer

    matbun committed Sep 12, 2023
    Configuration menu
    Copy the full SHA
    623e40f View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    9e5c0de View commit details
    Browse the repository at this point in the history

Commits on Sep 13, 2023

  1. FIX: clear load and save state

    matbun committed Sep 13, 2023
    Configuration menu
    Copy the full SHA
    a866338 View commit details
    Browse the repository at this point in the history

Commits on Sep 15, 2023

  1. ADD: Loggers

    matbun committed Sep 15, 2023
    Configuration menu
    Copy the full SHA
    6fc3c7e View commit details
    Browse the repository at this point in the history

Commits on Sep 20, 2023

  1. Configuration menu
    Copy the full SHA
    05df827 View commit details
    Browse the repository at this point in the history
  2. TensorFlow backend (#63)

    * UPDATE: Remove experimental distribution
    
    * ADD: Mnist distributed
    
    * ADD: Optional strategy
    
    * UPDATE: Conditional distribution
    
    * FIX: Dataloader for mnist
    
    * FIX: Model cloning lambda function for distributed scope
    
    * ADD: CycleGAN
    
    * UPDATE: Types
    
    * UPDATE: Types
    
    * ADD: Local distr
    
    * FIX: learning rates
    
    * ADD: CycleGAN distributed
    
    * FIX: Reduction
    
    * FIX: Distribution
    
    * ADD: tmp.py
    
    * FIX: Distribution
    
    * FIX: Distribution
    
    * FIX: Distribution
    
    * FIX: Distribution
    
    * FIX: Distribution
    
    * FIX: Distribution
    
    * FIX: Distribution
    
    * FIX: Distribution
    
    * UPDATE: Executors
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * ADD: Ray
    
    * ADD: Ray
    
    * ADD: Ray
    
    * ADD: Ray
    
    * ADD: Ray
    
    * ADD: Ray
    
    * ADD:Initial VIRGO
    
    * UPDATE: Optional distribution, tensorflow-gpu
    
    * UPDATE: tensorflow-gpu dependency
    
    * ADD: Unify branches
    
    ---------
    
    Co-authored-by: User3574 <[email protected]>
    matbun and User3574 authored Sep 20, 2023
    Configuration menu
    Copy the full SHA
    6c9bdc4 View commit details
    Browse the repository at this point in the history

Commits on Sep 26, 2023

  1. Refacto entire code base

    matbun committed Sep 26, 2023
    Configuration menu
    Copy the full SHA
    216537c View commit details
    Browse the repository at this point in the history

Commits on Sep 27, 2023

  1. ADD: workflows folder

    matbun committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    831cb47 View commit details
    Browse the repository at this point in the history
  2. FIX: refactor

    matbun committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    4f35d65 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    9af803c View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    5b2743b View commit details
    Browse the repository at this point in the history
  5. FIX: linting

    matbun committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    0f3cf6d View commit details
    Browse the repository at this point in the history

Commits on Sep 28, 2023

  1. ADD: how to run use case doc

    matbun committed Sep 28, 2023
    Configuration menu
    Copy the full SHA
    8ab4b73 View commit details
    Browse the repository at this point in the history
  2. ADD: workflows doc

    matbun committed Sep 28, 2023
    Configuration menu
    Copy the full SHA
    b3e7e68 View commit details
    Browse the repository at this point in the history
  3. FIX: MD linter

    matbun committed Sep 28, 2023
    Configuration menu
    Copy the full SHA
    a8afa6f View commit details
    Browse the repository at this point in the history

Commits on Oct 17, 2023

  1. Pipe MNIST lightning (#86)

    * ADD: lightning distributed + pipeline
    
    * UPDATE: jscpd threshold
    
    * UPDATE: super linter ignore use cases
    
    * ADD: jscpd ignore loggers
    matbun authored Oct 17, 2023
    Configuration menu
    Copy the full SHA
    4bdde4e View commit details
    Browse the repository at this point in the history

Commits on Oct 18, 2023

  1. Functional tests for MNIST (#87)

    * ADD: use case tests
    
    * FIX: move use case models out of itwinai
    
    * FIX: rearrange modules
    
    * ADD: ConsoleLogger and LoggersCollection
    
    * FIX: loggers filter
    
    * FIX: add TF env creation
    
    * UPDATE: test flag
    
    * ADD: early pytest on slurm
    
    * FIX: duplicated code in TF Trainer
    matbun authored Oct 18, 2023
    Configuration menu
    Copy the full SHA
    1cd81a6 View commit details
    Browse the repository at this point in the history

Commits on Oct 25, 2023

  1. Sqaaas code (#88)

    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    matbun and orviz authored Oct 25, 2023
    Configuration menu
    Copy the full SHA
    e99276f View commit details
    Browse the repository at this point in the history

Commits on Oct 27, 2023

  1. Sqaaas code (#89)

    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    * ADD: adaptive branch discovery for SQAaaS action
    
    * Update sqaaas.yml
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    matbun and orviz authored Oct 27, 2023
    Configuration menu
    Copy the full SHA
    1207e16 View commit details
    Browse the repository at this point in the history

Commits on Nov 9, 2023

  1. 3dgan use case (#94)

    * commiting integration of 3dgan scripts
    
    * ADD: Download dataset
    
    * FIX: DDP distributed training with manual optimization
    
    * ADD: log with MLFlow
    
    * Sqaaas code (#88)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Sqaaas code (#89)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    * ADD: adaptive branch discovery for SQAaaS action
    
    * Update sqaaas.yml
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD: draft predictor and saver
    
    * ADD: stub for inference pipeline
    
    * ADD: small docs
    
    * UPDATE: inference pipeline components
    
    * UPDATE: reorg
    
    * ADD: image generation for inference
    
    * update tag
    
    * ADD: threshold
    
    * ADD: draft inference
    
    * ADD: draft inference wf
    
    * ADD: working inference workflow
    
    * ADD: 3D scatter plots
    
    * ADD: Dockerfile + refactor
    
    * ADD: .dockerignore
    
    * Update .dockerignore
    
    * REMOVE: keras dependency
    
    * ADD: skip download option
    
    ---------
    
    Co-authored-by: Kalliopi Tsolaki <[email protected]>
    Co-authored-by: orviz <[email protected]>
    3 people authored Nov 9, 2023
    Configuration menu
    Copy the full SHA
    d5f78fd View commit details
    Browse the repository at this point in the history
  2. Sqaaas code (#96)

    * ADD: adaptive branch discovery for SQAaaS action
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * ADD: adaptive branch discovery for SQAaaS actin
    
    * Trigger only on main and dev branches
    
    * ADD: double quote
    
    * Trigger pytest only on main and dev PRs
    matbun authored Nov 9, 2023
    Configuration menu
    Copy the full SHA
    3000770 View commit details
    Browse the repository at this point in the history
  3. Torch mnist inference (#95)

    * ADD: draft predictor and saver
    
    * ADD: stub for inference pipeline
    
    * ADD: small docs
    
    * UPDATE: inference pipeline components
    
    * UPDATE: reorg
    
    * ADD: image generation for inference
    
    * update tag
    
    * ADD: threshold
    
    * Remove keras dependency
    matbun authored Nov 9, 2023
    Configuration menu
    Copy the full SHA
    39835ed View commit details
    Browse the repository at this point in the history
  4. 3dgan integration (#97)

    * commiting integration of 3dgan scripts
    
    * ADD: Download dataset
    
    * FIX: DDP distributed training with manual optimization
    
    * ADD: log with MLFlow
    
    * Sqaaas code (#88)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Sqaaas code (#89)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    * ADD: adaptive branch discovery for SQAaaS action
    
    * Update sqaaas.yml
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD: draft predictor and saver
    
    * ADD: stub for inference pipeline
    
    * ADD: small docs
    
    * UPDATE: inference pipeline components
    
    * UPDATE: reorg
    
    * ADD: image generation for inference
    
    * update tag
    
    * ADD: threshold
    
    * ADD: draft inference
    
    * ADD: draft inference wf
    
    * ADD: working inference workflow
    
    * ADD: 3D scatter plots
    
    * ADD: Dockerfile + refactor
    
    * ADD: .dockerignore
    
    * Update .dockerignore
    
    * REMOVE: keras dependency
    
    * ADD: skip download option
    
    * ADD: cern pipeline.yaml
    
    * UPDATE: dataset loading function
    
    * UPDATE: dataset loading function
    
    * UPDATE conf
    
    * UPDATE refactor
    
    * UPDATE refactor
    
    * UPDATE training docs
    
    ---------
    
    Co-authored-by: Kalliopi Tsolaki <[email protected]>
    Co-authored-by: orviz <[email protected]>
    3 people authored Nov 9, 2023
    Configuration menu
    Copy the full SHA
    1299b0c View commit details
    Browse the repository at this point in the history

Commits on Nov 29, 2023

  1. Add SQAaaS dynamic badge for dev branch (#104)

    * Add SQAaaS dynamic badge
    
    * Upgrade to sqaaas-assessment-action@v2
    orviz authored Nov 29, 2023
    Configuration menu
    Copy the full SHA
    c366e66 View commit details
    Browse the repository at this point in the history

Commits on Dec 13, 2023

  1. 3dgan integration (#98)

    * commiting integration of 3dgan scripts
    
    * ADD: Download dataset
    
    * FIX: DDP distributed training with manual optimization
    
    * ADD: log with MLFlow
    
    * Sqaaas code (#88)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Sqaaas code (#89)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    * ADD: adaptive branch discovery for SQAaaS action
    
    * Update sqaaas.yml
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD: draft predictor and saver
    
    * ADD: stub for inference pipeline
    
    * ADD: small docs
    
    * UPDATE: inference pipeline components
    
    * UPDATE: reorg
    
    * ADD: image generation for inference
    
    * update tag
    
    * ADD: threshold
    
    * ADD: draft inference
    
    * ADD: draft inference wf
    
    * ADD: working inference workflow
    
    * ADD: 3D scatter plots
    
    * ADD: Dockerfile + refactor
    
    * ADD: .dockerignore
    
    * Update .dockerignore
    
    * REMOVE: keras dependency
    
    * ADD: skip download option
    
    * ADD: cern pipeline.yaml
    
    * UPDATE: dataset loading function
    
    * UPDATE: dataset loading function
    
    * UPDATE conf
    
    * UPDATE refactor
    
    * UPDATE refactor
    
    * UPDATE training docs
    
    * Update readme
    
    * update README
    
    * FIX typo
    
    * Update README
    
    * Update mkdir
    
    * UPDATE data paths
    
    * UPDATE Dockerfile
    
    * UPDATE Dockerfiles
    
    * UPDATE for Singularity execution
    
    * FIX version mismatch
    
    * UPDATE Singularity docs
    
    * Named steps pipe (#100)
    
    * ADD: dict steps pipe
    
    * Relax dependency constraint
    
    * UPDATE Singularity exec command
    
    * UPDATE: Image version
    
    * UPDATE: load components from pipeline
    
    * ADD: docs
    
    * Simplify 3DGAN model config
    
    * ADD: mlflow autologging support for PL trainer
    
    * UPDATE container info
    
    * Refactor
    
    * UPDATE dependencies
    
    * FIX linter problem
    
    * Simplified workflow configuration (#108)
    
    * Add SQAaaS dynamic badge for dev branch (#104)
    
    * Add SQAaaS dynamic badge
    
    * Upgrade to sqaaas-assessment-action@v2
    
    * Add draft example
    
    * UPDATE credits field
    
    * ADD docs
    
    * REFACTOR components and pipeline code
    
    * UPDATE docstring
    
    * UPDATE mnist torch uc
    
    * ADD config file parser draft
    
    * ADD itwinaiCLI and ConfigParser
    
    * ADD docs
    
    * ADD pipeline parser and serializer plus tests
    
    * UPDATE docs
    
    * ADD adapter component and tests (incl parser)
    
    * ADD splitter component, improve pipeline, tests
    
    * UPDATE test
    
    * REMOVE todos
    
    * ADD component tests
    
    * ADD serializer tests
    
    * FIX linter
    
    * ADD basic workflow tutorial
    
    * ADD basic intermediate tutorial
    
    * ADD advanced tutorial
    
    * UPDATE advanced tutorial
    
    * UPDATE use cases
    
    * UPDATE save parameters
    
    * FIX linter
    
    * FIX cyclones use case workflow
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Simplified workflow configuration (#109)
    
    * Add SQAaaS dynamic badge for dev branch (#104)
    
    * Add SQAaaS dynamic badge
    
    * Upgrade to sqaaas-assessment-action@v2
    
    * Add draft example
    
    * UPDATE credits field
    
    * ADD docs
    
    * REFACTOR components and pipeline code
    
    * UPDATE docstring
    
    * UPDATE mnist torch uc
    
    * ADD config file parser draft
    
    * ADD itwinaiCLI and ConfigParser
    
    * ADD docs
    
    * ADD pipeline parser and serializer plus tests
    
    * UPDATE docs
    
    * ADD adapter component and tests (incl parser)
    
    * ADD splitter component, improve pipeline, tests
    
    * UPDATE test
    
    * REMOVE todos
    
    * ADD component tests
    
    * ADD serializer tests
    
    * FIX linter
    
    * ADD basic workflow tutorial
    
    * ADD basic intermediate tutorial
    
    * ADD advanced tutorial
    
    * UPDATE advanced tutorial
    
    * UPDATE use cases
    
    * UPDATE save parameters
    
    * FIX linter
    
    * FIX cyclones use case workflow
    
    * ADD slurm jobscript
    
    * FIX merge error
    
    * FIX components template
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD integration tests
    
    * FIX test
    
    * FIX 3dgan inference test
    
    ---------
    
    Co-authored-by: Kalliopi Tsolaki <[email protected]>
    Co-authored-by: orviz <[email protected]>
    3 people authored Dec 13, 2023
    Configuration menu
    Copy the full SHA
    bdc317d View commit details
    Browse the repository at this point in the history

Commits on Mar 20, 2024

  1. fixed distributed trainer in cyclones use case

    zoechbauer1 authored and matbun committed Mar 20, 2024
    Configuration menu
    Copy the full SHA
    fc45759 View commit details
    Browse the repository at this point in the history

Commits on Mar 21, 2024

  1. 3dgan integration (#118)

    * fixed distributed trainer in cyclones use case
    
    * commiting integration of 3dgan scripts
    
    * ADD: Download dataset
    
    * FIX: DDP distributed training with manual optimization
    
    * ADD: log with MLFlow
    
    * Sqaaas code (#88)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Sqaaas code (#89)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    * ADD: adaptive branch discovery for SQAaaS action
    
    * Update sqaaas.yml
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD: draft predictor and saver
    
    * ADD: stub for inference pipeline
    
    * ADD: small docs
    
    * UPDATE: inference pipeline components
    
    * UPDATE: reorg
    
    * ADD: image generation for inference
    
    * update tag
    
    * ADD: threshold
    
    * ADD: draft inference
    
    * ADD: draft inference wf
    
    * ADD: working inference workflow
    
    * ADD: 3D scatter plots
    
    * ADD: Dockerfile + refactor
    
    * ADD: .dockerignore
    
    * Update .dockerignore
    
    * ADD: skip download option
    
    * ADD: cern pipeline.yaml
    
    * UPDATE: dataset loading function
    
    * UPDATE: dataset loading function
    
    * UPDATE conf
    
    * UPDATE refactor
    
    * UPDATE refactor
    
    * UPDATE training docs
    
    * Update readme
    
    * update README
    
    * FIX typo
    
    * Update README
    
    * Update mkdir
    
    * UPDATE data paths
    
    * UPDATE Dockerfile
    
    * UPDATE Dockerfiles
    
    * UPDATE for Singularity execution
    
    * FIX version mismatch
    
    * UPDATE Singularity docs
    
    * Named steps pipe (#100)
    
    * ADD: dict steps pipe
    
    * Relax dependency constraint
    
    * UPDATE Singularity exec command
    
    * UPDATE: Image version
    
    * UPDATE: load components from pipeline
    
    * ADD: docs
    
    * Simplify 3DGAN model config
    
    * ADD: mlflow autologging support for PL trainer
    
    * UPDATE container info
    
    * Refactor
    
    * UPDATE dependencies
    
    * FIX linter problem
    
    * Simplified workflow configuration (#108)
    
    * Add SQAaaS dynamic badge for dev branch (#104)
    
    * Add SQAaaS dynamic badge
    
    * Upgrade to sqaaas-assessment-action@v2
    
    * Add draft example
    
    * UPDATE credits field
    
    * ADD docs
    
    * REFACTOR components and pipeline code
    
    * UPDATE docstring
    
    * UPDATE mnist torch uc
    
    * ADD config file parser draft
    
    * ADD itwinaiCLI and ConfigParser
    
    * ADD docs
    
    * ADD pipeline parser and serializer plus tests
    
    * UPDATE docs
    
    * ADD adapter component and tests (incl parser)
    
    * ADD splitter component, improve pipeline, tests
    
    * UPDATE test
    
    * REMOVE todos
    
    * ADD component tests
    
    * ADD serializer tests
    
    * FIX linter
    
    * ADD basic workflow tutorial
    
    * ADD basic intermediate tutorial
    
    * ADD advanced tutorial
    
    * UPDATE advanced tutorial
    
    * UPDATE use cases
    
    * UPDATE save parameters
    
    * FIX linter
    
    * FIX cyclones use case workflow
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Simplified workflow configuration (#109)
    
    * Add SQAaaS dynamic badge for dev branch (#104)
    
    * Add SQAaaS dynamic badge
    
    * Upgrade to sqaaas-assessment-action@v2
    
    * Add draft example
    
    * UPDATE credits field
    
    * ADD docs
    
    * REFACTOR components and pipeline code
    
    * UPDATE docstring
    
    * UPDATE mnist torch uc
    
    * ADD config file parser draft
    
    * ADD itwinaiCLI and ConfigParser
    
    * ADD docs
    
    * ADD pipeline parser and serializer plus tests
    
    * UPDATE docs
    
    * ADD adapter component and tests (incl parser)
    
    * ADD splitter component, improve pipeline, tests
    
    * UPDATE test
    
    * REMOVE todos
    
    * ADD component tests
    
    * ADD serializer tests
    
    * FIX linter
    
    * ADD basic workflow tutorial
    
    * ADD basic intermediate tutorial
    
    * ADD advanced tutorial
    
    * UPDATE advanced tutorial
    
    * UPDATE use cases
    
    * UPDATE save parameters
    
    * FIX linter
    
    * FIX cyclones use case workflow
    
    * ADD slurm jobscript
    
    * FIX merge error
    
    * FIX components template
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD integration tests
    
    * FIX test
    
    * FIX 3dgan inference test
    
    * ADD GPU support and update tag
    
    * FIX linter
    
    * ADD override example
    
    * UPDATE 3DGAN inference
    
    * UPDATE inference execution tutorials
    
    * UPDATE README
    
    * UPDATE saver saving sparse tensors
    
    * ADD interlink pods
    
    * UPDATE pod name
    
    * UPDATE annotations
    
    * FIX README
    
    * CLEANUP
    
    * Merge
    
    * update
    
    * ADD tf cpu env
    
    * U[date Makefile
    
    * FIX 3DGAN tests
    
    * FIX data folder path
    
    ---------
    
    Co-authored-by: zoechbauer1 <[email protected]>
    Co-authored-by: Kalliopi Tsolaki <[email protected]>
    Co-authored-by: orviz <[email protected]>
    4 people authored Mar 21, 2024
    Configuration menu
    Copy the full SHA
    61e742d View commit details
    Browse the repository at this point in the history

Commits on Mar 22, 2024

  1. Unit test 4 dev (#113)

    * Define a step for pytest execution
    
    * Fix: use v1 of step action
    
    * Print result of step composition
    
    * Rename step
    
    * Use step previous definition in the assessment
    
    * Rename input: workflow -> steps
    
    * Avoid caching by using 1.0.0
    
    * Set container image
    
    * Bump to v1
    
    * Bump to sqaaas-assessment-action@v2
    
    * Remove 'id' property
    
    * Adapt inputs to v2
    
    * Remove current branch
    
    * Disable test_cyclones_train_tf
    
    * ADD marker
    
    * ADD skip memory heavy
    
    * Disable for PRs
    
    ---------
    
    Co-authored-by: Matteo Bunino <[email protected]>
    orviz and matbun authored Mar 22, 2024
    Configuration menu
    Copy the full SHA
    150bcef View commit details
    Browse the repository at this point in the history

Commits on Apr 16, 2024

  1. Distributed strategy launcher (#117)

    * ADD: distrib launcher mockup
    
    * REFACTOR: cluster env, strategy and launcher
    
    * ADD: Torch Elastic Launcher
    
    * ADD: info on env vars
    
    * ADD: distributed tooling and examples
    
    * new folder
    
    * UPDATE: distributed strategy setup
    
    * generalized for DDP and DS
    
    * add config file
    
    * UPDATE: kwargs
    
    * Update general_trainer.py
    
    * Update general_startscript
    
    * Update general_trainer.py
    
    * UPDATE .gitignore
    
    * Update distrib strategy
    
    * UPDATE torch distributed strategy classes
    
    * Updated docstrings
    
    * Small fixes
    
    * UPDATE docstrings
    
    * ADD deepespeed config loader
    
    * ADD first deepspeed tutorial draft
    
    * UPDATE DDP Dp distrib strategy
    
    * UPDATE horovod strategy
    
    * UPDATE tutorial on torch distributed strategies
    
    * UPDATE torch strategies tutorial
    
    * Update createEnvJSC.sh
    
    * Update hvd_slurm.sh
    
    * Update README.md
    
    * UPDATE distributed tutorial
    
    * Delete tutorials/distributed-ml/torch-ddp-deepspeed-horovod/0
    
    * Fixes to deepspeed startscript
    
    * Update distributed.py
    
    * Update trainer.py
    
    * UPDATE tutorial
    
    * ADD draft MNIST tutorial
    
    * UPDATE DDP tutorial for MNIST
    
    * FIX small details
    
    * Update distributed.py
    
    * Added TF tutorials
    
    * Fixes to tutorials
    
    * Add files via upload
    
    * Update Makefile
    
    * Update README.md
    
    * UPDATE tutorials
    
    * UPDATE documentation and improve explainability
    
    * UPDATE SLURM scripts
    
    * FIX local rank mismatch
    
    * fixed distributed trainer in cyclones use case
    
    * UPDATE launcher
    
    * UPDATE linter
    
    * UPDATE format
    
    * FIX linter
    
    * FIX linter
    
    * Update workflow
    
    * UPDATE workflow
    
    * update
    
    * Update workflow
    
    * UPDATE super linter to v6
    
    * UPDATE super linter to v6.3.0
    
    * UPDATE super linter to slim
    
    * Cleanup
    
    * Update tfmirrored_slurm.sh
    
    * Update tfmirrored_slurm.sh
    
    * REMOVE workflows legacy
    
    * DELETE cyclegan use case
    
    * UPDATE dist training tutorials torch
    
    * RENAME folders with torch
    
    * DRAFT torch imagenet tutorial
    
    * UPDATE configuration
    
    * UPDATE imagenet tutorial
    
    * DRAFT scaling test
    
    * ADD scaling analysis report
    
    * FIX deepspeed micro batchsize
    
    * UPDATE data path
    
    * UPDATE checkpoint to avoid race conditions
    
    * UPDATE scalability report
    
    * UPDATE dataset path
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSCTF.sh
    
    * Update README.md
    
    * Update README.md
    
    * JUBE benchmarks
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSCTF.sh
    
    * ADD logy scale option
    
    * Extract JUBE tutorial
    
    * CLEANUP baselines
    
    * Log epoch time in real-time
    
    * FIX deepspeed dataloader for potential performances improvement
    
    * UPDATE SC bash severity
    
    * FIX deepspeed and horovod trainers
    
    * FIX some code checks
    
    * Unify redundant SLURM job scripts and configuration files
    
    * CLEANUP unused configuration
    
    * Reorg configurations
    
    * Refactor configurations and add documentation
    
    * Update README
    
    * ADD report image
    
    * Improve plot resolution
    
    * UPDATE scaling test
    
    * UPDATE  launcher scripts
    
    * FIX linter
    
    * REMOVE jube tutorial
    
    ---------
    
    Co-authored-by: Mario Rüttgers <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: zoechbauer1 <[email protected]>
    5 people authored Apr 16, 2024
    Configuration menu
    Copy the full SHA
    e8c1ed6 View commit details
    Browse the repository at this point in the history
  2. Distributed strategy launcher (#127)

    Update ParseConfig
    matbun authored Apr 16, 2024
    Configuration menu
    Copy the full SHA
    bb84d13 View commit details
    Browse the repository at this point in the history
  3. Distributed strategy launcher (#128)

    Remove experimental files
    matbun authored Apr 16, 2024
    Configuration menu
    Copy the full SHA
    6180b96 View commit details
    Browse the repository at this point in the history

Commits on Apr 29, 2024

  1. Docs dev (#132)

    * commiting docs functionality for testing deployment
    
    * adding documentation deployment relevant files
    
    * updating readthedocs.yaml
    
    * changing directory of requirements.txt
    
    * updating reqs file
    
    * commiting changes and adding pages for tutorials
    
    * fixed distributed trainer in cyclones use case
    
    * adding installation instructions in docs
    
    * adding latest changes to docs
    
    * adding new pages for itwinai modules and other modifications
    
    * modified src/itwinai/torch directory name to solve namespace conflict
    
    * fixing tutorial sections
    
    * fixes in pages appearance
    
    * fixing rendering bugs
    
    * fixing pages appearance bugs
    
    * adding latest modifications
    
    * Deleted duplicate folder after renaming src/itwinai/torch
    
    * adding documentation.yml file for automatic updating on github pages
    
    * modifying documentation.yml file
    
    * updating reqs file to solve bug in deployment
    
    * commiting docs functionality for testing deployment
    
    * adding documentation deployment relevant files
    
    * updating readthedocs.yaml
    
    * changing directory of requirements.txt
    
    * updating reqs file
    
    * commiting changes and adding pages for tutorials
    
    * adding installation instructions in docs
    
    * adding latest changes to docs
    
    * adding new pages for itwinai modules and other modifications
    
    * modified src/itwinai/torch directory name to solve namespace conflict
    
    * fixing tutorial sections
    
    * fixes in pages appearance
    
    * fixing rendering bugs
    
    * fixing pages appearance bugs
    
    * adding latest modifications
    
    * Deleted duplicate folder after renaming src/itwinai/torch
    
    * adding documentation.yml file for automatic updating on github pages
    
    * modifying documentation.yml file
    
    * updating reqs file to solve bug in deployment
    
    * testing automated docs update
    
    * updating getting started page
    
    * fixing pages and adding new content
    
    * bug fixes
    
    * fixing content rendering
    
    * latest fixes in rendering
    
    * Add version feature to docs
    
    * Update .readthedocs.yaml
    
    * fixing display structure in getting started page
    
    * new fixes similar to previous commit
    
    * Update index.rst
    
    * Update index.rst
    
    Text re-edit index
    
    * Update index.rst
    
    change 1 word
    
    * Update .readthedocs.yaml
    
    * Update .readthedocs.yaml
    
    * fixing getting started page
    
    * Text review getting_started_with_itwinai.rst
    
    * Update 3dgan_doc.rst
    
    * Update getting_started_with_itwinai.rst
    
    punctuation
    
    * Fix torch naming problem
    
    ---------
    
    Co-authored-by: KalliopiTsolaki <[email protected]>
    Co-authored-by: zoechbauer1 <[email protected]>
    Co-authored-by: VerderK <[email protected]>
    4 people authored Apr 29, 2024
    Configuration menu
    Copy the full SHA
    fb12fca View commit details
    Browse the repository at this point in the history

Commits on Apr 30, 2024

  1. Distributed strategy launcher (#131)

    * ADD: distrib launcher mockup
    
    * REFACTOR: cluster env, strategy and launcher
    
    * ADD: Torch Elastic Launcher
    
    * ADD: info on env vars
    
    * ADD: distributed tooling and examples
    
    * new folder
    
    * UPDATE: distributed strategy setup
    
    * generalized for DDP and DS
    
    * add config file
    
    * UPDATE: kwargs
    
    * Update general_trainer.py
    
    * Update general_startscript
    
    * Update general_trainer.py
    
    * UPDATE .gitignore
    
    * Update distrib strategy
    
    * UPDATE torch distributed strategy classes
    
    * Updated docstrings
    
    * Small fixes
    
    * UPDATE docstrings
    
    * ADD deepespeed config loader
    
    * ADD first deepspeed tutorial draft
    
    * UPDATE DDP Dp distrib strategy
    
    * UPDATE horovod strategy
    
    * UPDATE tutorial on torch distributed strategies
    
    * UPDATE torch strategies tutorial
    
    * Update createEnvJSC.sh
    
    * Update hvd_slurm.sh
    
    * Update README.md
    
    * UPDATE distributed tutorial
    
    * Delete tutorials/distributed-ml/torch-ddp-deepspeed-horovod/0
    
    * Fixes to deepspeed startscript
    
    * Update distributed.py
    
    * Update trainer.py
    
    * UPDATE tutorial
    
    * ADD draft MNIST tutorial
    
    * UPDATE DDP tutorial for MNIST
    
    * FIX small details
    
    * Update distributed.py
    
    * Added TF tutorials
    
    * Fixes to tutorials
    
    * Add files via upload
    
    * Update Makefile
    
    * Update README.md
    
    * UPDATE tutorials
    
    * UPDATE documentation and improve explainability
    
    * UPDATE SLURM scripts
    
    * FIX local rank mismatch
    
    * fixed distributed trainer in cyclones use case
    
    * UPDATE launcher
    
    * UPDATE linter
    
    * UPDATE format
    
    * FIX linter
    
    * FIX linter
    
    * Update workflow
    
    * UPDATE workflow
    
    * update
    
    * Update workflow
    
    * UPDATE super linter to v6
    
    * UPDATE super linter to v6.3.0
    
    * UPDATE super linter to slim
    
    * Cleanup
    
    * Update tfmirrored_slurm.sh
    
    * Update tfmirrored_slurm.sh
    
    * REMOVE workflows legacy
    
    * DELETE cyclegan use case
    
    * UPDATE dist training tutorials torch
    
    * RENAME folders with torch
    
    * DRAFT torch imagenet tutorial
    
    * UPDATE configuration
    
    * UPDATE imagenet tutorial
    
    * DRAFT scaling test
    
    * ADD scaling analysis report
    
    * FIX deepspeed micro batchsize
    
    * UPDATE data path
    
    * UPDATE checkpoint to avoid race conditions
    
    * UPDATE scalability report
    
    * UPDATE dataset path
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSCTF.sh
    
    * Update README.md
    
    * Update README.md
    
    * JUBE benchmarks
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSCTF.sh
    
    * ADD logy scale option
    
    * Extract JUBE tutorial
    
    * CLEANUP baselines
    
    * Log epoch time in real-time
    
    * FIX deepspeed dataloader for potential performances improvement
    
    * UPDATE SC bash severity
    
    * FIX deepspeed and horovod trainers
    
    * FIX some code checks
    
    * Unify redundant SLURM job scripts and configuration files
    
    * CLEANUP unused configuration
    
    * Reorg configurations
    
    * Refactor configurations and add documentation
    
    * Update README
    
    * ADD report image
    
    * Improve plot resolution
    
    * UPDATE scaling test
    
    * UPDATE  launcher scripts
    
    * FIX linter
    
    * REMOVE jube tutorial
    
    * Restore ConfigParser
    
    * FIX type hinting
    
    * ADD dev dependencies
    
    * REMOVE experimental scripts
    
    * UPDATE scaling report
    
    * Add SLURM logs
    
    * Refactor log scale
    
    * Update scalability report
    
    * Unify SLURM logs per job
    
    * Update README.md
    
    * Update README.md
    
    * Update README.md
    
    * ADD itwinai installation
    
    * UPDATE torch distributed tutorial 0
    
    * UPDATE torch distributed tutorials
    
    * REMOVE imagenet tutorial
    
    * ADD NonDistributedStrategy and create_dataloader method
    
    * CLEANUP older classes
    
    * Rename strategies
    
    * Simplify structure
    
    * ADD draft new torch trainer class
    
    * UPDATED torch trainer draft
    
    * UPDATE MNIST use case
    
    * INtegrate new trainer into MNIST use case
    
    * UPDATE structure: remove unused files and refactor tests
    
    * Tmp disable unused tests
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * FIX failing inference
    
    * Functiona tests (#133)
    
    * UPDATE tests
    
    * FIX errors
    
    * CLEANUP
    
    * Remove unused workflow
    
    ---------
    
    Co-authored-by: Mario Rüttgers <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: zoechbauer1 <[email protected]>
    5 people authored Apr 30, 2024
    Configuration menu
    Copy the full SHA
    2479d0d View commit details
    Browse the repository at this point in the history
  2. 3dgan integration (#134)

    * fixed distributed trainer in cyclones use case
    
    * commiting integration of 3dgan scripts
    
    * ADD: Download dataset
    
    * FIX: DDP distributed training with manual optimization
    
    * ADD: log with MLFlow
    
    * Sqaaas code (#88)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Sqaaas code (#89)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    * ADD: adaptive branch discovery for SQAaaS action
    
    * Update sqaaas.yml
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD: draft predictor and saver
    
    * ADD: stub for inference pipeline
    
    * ADD: small docs
    
    * UPDATE: inference pipeline components
    
    * UPDATE: reorg
    
    * ADD: image generation for inference
    
    * update tag
    
    * ADD: threshold
    
    * ADD: draft inference
    
    * ADD: draft inference wf
    
    * ADD: working inference workflow
    
    * ADD: 3D scatter plots
    
    * ADD: Dockerfile + refactor
    
    * ADD: .dockerignore
    
    * Update .dockerignore
    
    * ADD: skip download option
    
    * ADD: cern pipeline.yaml
    
    * UPDATE: dataset loading function
    
    * UPDATE: dataset loading function
    
    * UPDATE conf
    
    * UPDATE refactor
    
    * UPDATE refactor
    
    * UPDATE training docs
    
    * Update readme
    
    * update README
    
    * FIX typo
    
    * Update README
    
    * Update mkdir
    
    * UPDATE data paths
    
    * UPDATE Dockerfile
    
    * UPDATE Dockerfiles
    
    * UPDATE for Singularity execution
    
    * FIX version mismatch
    
    * UPDATE Singularity docs
    
    * Named steps pipe (#100)
    
    * ADD: dict steps pipe
    
    * Relax dependency constraint
    
    * UPDATE Singularity exec command
    
    * UPDATE: Image version
    
    * UPDATE: load components from pipeline
    
    * ADD: docs
    
    * Simplify 3DGAN model config
    
    * ADD: mlflow autologging support for PL trainer
    
    * UPDATE container info
    
    * Refactor
    
    * UPDATE dependencies
    
    * FIX linter problem
    
    * Simplified workflow configuration (#108)
    
    * Add SQAaaS dynamic badge for dev branch (#104)
    
    * Add SQAaaS dynamic badge
    
    * Upgrade to sqaaas-assessment-action@v2
    
    * Add draft example
    
    * UPDATE credits field
    
    * ADD docs
    
    * REFACTOR components and pipeline code
    
    * UPDATE docstring
    
    * UPDATE mnist torch uc
    
    * ADD config file parser draft
    
    * ADD itwinaiCLI and ConfigParser
    
    * ADD docs
    
    * ADD pipeline parser and serializer plus tests
    
    * UPDATE docs
    
    * ADD adapter component and tests (incl parser)
    
    * ADD splitter component, improve pipeline, tests
    
    * UPDATE test
    
    * REMOVE todos
    
    * ADD component tests
    
    * ADD serializer tests
    
    * FIX linter
    
    * ADD basic workflow tutorial
    
    * ADD basic intermediate tutorial
    
    * ADD advanced tutorial
    
    * UPDATE advanced tutorial
    
    * UPDATE use cases
    
    * UPDATE save parameters
    
    * FIX linter
    
    * FIX cyclones use case workflow
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Simplified workflow configuration (#109)
    
    * Add SQAaaS dynamic badge for dev branch (#104)
    
    * Add SQAaaS dynamic badge
    
    * Upgrade to sqaaas-assessment-action@v2
    
    * Add draft example
    
    * UPDATE credits field
    
    * ADD docs
    
    * REFACTOR components and pipeline code
    
    * UPDATE docstring
    
    * UPDATE mnist torch uc
    
    * ADD config file parser draft
    
    * ADD itwinaiCLI and ConfigParser
    
    * ADD docs
    
    * ADD pipeline parser and serializer plus tests
    
    * UPDATE docs
    
    * ADD adapter component and tests (incl parser)
    
    * ADD splitter component, improve pipeline, tests
    
    * UPDATE test
    
    * REMOVE todos
    
    * ADD component tests
    
    * ADD serializer tests
    
    * FIX linter
    
    * ADD basic workflow tutorial
    
    * ADD basic intermediate tutorial
    
    * ADD advanced tutorial
    
    * UPDATE advanced tutorial
    
    * UPDATE use cases
    
    * UPDATE save parameters
    
    * FIX linter
    
    * FIX cyclones use case workflow
    
    * ADD slurm jobscript
    
    * FIX merge error
    
    * FIX components template
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD integration tests
    
    * FIX test
    
    * FIX 3dgan inference test
    
    * ADD GPU support and update tag
    
    * FIX linter
    
    * ADD override example
    
    * UPDATE 3DGAN inference
    
    * UPDATE inference execution tutorials
    
    * UPDATE README
    
    * UPDATE saver saving sparse tensors
    
    * ADD interlink pods
    
    * UPDATE pod name
    
    * UPDATE annotations
    
    * FIX README
    
    * CLEANUP
    
    * Merge
    
    * update
    
    * ADD tf cpu env
    
    * U[date Makefile
    
    * FIX 3DGAN tests
    
    * FIX data folder path
    
    * ADD offloading of 3DGAN training
    
    * ADAPT 3DGAN training for singularity execution
    
    * UPDATE test and fix linter
    
    ---------
    
    Co-authored-by: zoechbauer1 <[email protected]>
    Co-authored-by: Kalliopi Tsolaki <[email protected]>
    Co-authored-by: orviz <[email protected]>
    4 people authored Apr 30, 2024
    Configuration menu
    Copy the full SHA
    e052d1c View commit details
    Browse the repository at this point in the history

Commits on May 2, 2024

  1. Docs dev (#135)

    * commiting docs functionality for testing deployment
    
    * adding documentation deployment relevant files
    
    * updating readthedocs.yaml
    
    * changing directory of requirements.txt
    
    * updating reqs file
    
    * commiting changes and adding pages for tutorials
    
    * fixed distributed trainer in cyclones use case
    
    * adding installation instructions in docs
    
    * adding latest changes to docs
    
    * adding new pages for itwinai modules and other modifications
    
    * modified src/itwinai/torch directory name to solve namespace conflict
    
    * fixing tutorial sections
    
    * fixes in pages appearance
    
    * fixing rendering bugs
    
    * fixing pages appearance bugs
    
    * adding latest modifications
    
    * Deleted duplicate folder after renaming src/itwinai/torch
    
    * adding documentation.yml file for automatic updating on github pages
    
    * modifying documentation.yml file
    
    * updating reqs file to solve bug in deployment
    
    * commiting docs functionality for testing deployment
    
    * adding documentation deployment relevant files
    
    * updating readthedocs.yaml
    
    * changing directory of requirements.txt
    
    * updating reqs file
    
    * commiting changes and adding pages for tutorials
    
    * adding installation instructions in docs
    
    * adding latest changes to docs
    
    * adding new pages for itwinai modules and other modifications
    
    * modified src/itwinai/torch directory name to solve namespace conflict
    
    * fixing tutorial sections
    
    * fixes in pages appearance
    
    * fixing rendering bugs
    
    * fixing pages appearance bugs
    
    * adding latest modifications
    
    * Deleted duplicate folder after renaming src/itwinai/torch
    
    * adding documentation.yml file for automatic updating on github pages
    
    * modifying documentation.yml file
    
    * updating reqs file to solve bug in deployment
    
    * testing automated docs update
    
    * updating getting started page
    
    * fixing pages and adding new content
    
    * bug fixes
    
    * fixing content rendering
    
    * latest fixes in rendering
    
    * Add version feature to docs
    
    * Update .readthedocs.yaml
    
    * fixing display structure in getting started page
    
    * new fixes similar to previous commit
    
    * Update index.rst
    
    * Update index.rst
    
    Text re-edit index
    
    * Update index.rst
    
    change 1 word
    
    * Update .readthedocs.yaml
    
    * Update .readthedocs.yaml
    
    * fixing getting started page
    
    * Text review getting_started_with_itwinai.rst
    
    * Update 3dgan_doc.rst
    
    * Update getting_started_with_itwinai.rst
    
    punctuation
    
    * Fix torch naming problem
    
    * UPDATE requirements
    
    ---------
    
    Co-authored-by: KalliopiTsolaki <[email protected]>
    Co-authored-by: zoechbauer1 <[email protected]>
    Co-authored-by: VerderK <[email protected]>
    4 people authored May 2, 2024
    Configuration menu
    Copy the full SHA
    a0064f4 View commit details
    Browse the repository at this point in the history

Commits on May 7, 2024

  1. Distributed strategy launcher (#137)

    * ADD: distrib launcher mockup
    
    * REFACTOR: cluster env, strategy and launcher
    
    * ADD: Torch Elastic Launcher
    
    * ADD: info on env vars
    
    * ADD: distributed tooling and examples
    
    * new folder
    
    * UPDATE: distributed strategy setup
    
    * generalized for DDP and DS
    
    * add config file
    
    * UPDATE: kwargs
    
    * Update general_trainer.py
    
    * Update general_startscript
    
    * Update general_trainer.py
    
    * UPDATE .gitignore
    
    * Update distrib strategy
    
    * UPDATE torch distributed strategy classes
    
    * Updated docstrings
    
    * Small fixes
    
    * UPDATE docstrings
    
    * ADD deepespeed config loader
    
    * ADD first deepspeed tutorial draft
    
    * UPDATE DDP Dp distrib strategy
    
    * UPDATE horovod strategy
    
    * UPDATE tutorial on torch distributed strategies
    
    * UPDATE torch strategies tutorial
    
    * Update createEnvJSC.sh
    
    * Update hvd_slurm.sh
    
    * Update README.md
    
    * UPDATE distributed tutorial
    
    * Delete tutorials/distributed-ml/torch-ddp-deepspeed-horovod/0
    
    * Fixes to deepspeed startscript
    
    * Update distributed.py
    
    * Update trainer.py
    
    * UPDATE tutorial
    
    * ADD draft MNIST tutorial
    
    * UPDATE DDP tutorial for MNIST
    
    * FIX small details
    
    * Update distributed.py
    
    * Added TF tutorials
    
    * Fixes to tutorials
    
    * Add files via upload
    
    * Update Makefile
    
    * Update README.md
    
    * UPDATE tutorials
    
    * UPDATE documentation and improve explainability
    
    * UPDATE SLURM scripts
    
    * FIX local rank mismatch
    
    * fixed distributed trainer in cyclones use case
    
    * UPDATE launcher
    
    * UPDATE linter
    
    * UPDATE format
    
    * FIX linter
    
    * FIX linter
    
    * Update workflow
    
    * UPDATE workflow
    
    * update
    
    * Update workflow
    
    * UPDATE super linter to v6
    
    * UPDATE super linter to v6.3.0
    
    * UPDATE super linter to slim
    
    * Cleanup
    
    * Update tfmirrored_slurm.sh
    
    * Update tfmirrored_slurm.sh
    
    * REMOVE workflows legacy
    
    * DELETE cyclegan use case
    
    * UPDATE dist training tutorials torch
    
    * RENAME folders with torch
    
    * DRAFT torch imagenet tutorial
    
    * UPDATE configuration
    
    * UPDATE imagenet tutorial
    
    * DRAFT scaling test
    
    * ADD scaling analysis report
    
    * FIX deepspeed micro batchsize
    
    * UPDATE data path
    
    * UPDATE checkpoint to avoid race conditions
    
    * UPDATE scalability report
    
    * UPDATE dataset path
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSCTF.sh
    
    * Update README.md
    
    * Update README.md
    
    * JUBE benchmarks
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSCTF.sh
    
    * ADD logy scale option
    
    * Extract JUBE tutorial
    
    * CLEANUP baselines
    
    * Log epoch time in real-time
    
    * FIX deepspeed dataloader for potential performances improvement
    
    * UPDATE SC bash severity
    
    * FIX deepspeed and horovod trainers
    
    * FIX some code checks
    
    * Unify redundant SLURM job scripts and configuration files
    
    * CLEANUP unused configuration
    
    * Reorg configurations
    
    * Refactor configurations and add documentation
    
    * Update README
    
    * ADD report image
    
    * Improve plot resolution
    
    * UPDATE scaling test
    
    * UPDATE  launcher scripts
    
    * FIX linter
    
    * REMOVE jube tutorial
    
    * Restore ConfigParser
    
    * FIX type hinting
    
    * ADD dev dependencies
    
    * REMOVE experimental scripts
    
    * UPDATE scaling report
    
    * Add SLURM logs
    
    * Refactor log scale
    
    * Update scalability report
    
    * Unify SLURM logs per job
    
    * Update README.md
    
    * Update README.md
    
    * Update README.md
    
    * ADD itwinai installation
    
    * UPDATE torch distributed tutorial 0
    
    * UPDATE torch distributed tutorials
    
    * REMOVE imagenet tutorial
    
    * ADD NonDistributedStrategy and create_dataloader method
    
    * CLEANUP older classes
    
    * Rename strategies
    
    * Simplify structure
    
    * ADD draft new torch trainer class
    
    * UPDATED torch trainer draft
    
    * UPDATE MNIST use case
    
    * INtegrate new trainer into MNIST use case
    
    * UPDATE structure: remove unused files and refactor tests
    
    * Tmp disable unused tests
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * FIX failing inference
    
    * Functiona tests (#133)
    
    * UPDATE tests
    
    * FIX errors
    
    * CLEANUP
    
    * Remove unused workflow
    
    * Fixes to TF new version errors
    
    * Fixes to TF new version errors
    
    * Fixes to TF new version errors
    
    * Fixes to TF new version errors
    
    * Update distributed.py
    
    * Update tfmirrored_slurm.sh
    
    * Update train.py
    
    * TF updates
    
    * Add README
    
    * Python venv (#136)
    
    * Move to python venv
    
    * Update Makefile
    
    * Add Horovod installation
    
    * Update env
    
    * FIX openmpi install
    
    * Add TF explicit version
    
    * UPDATE env creation
    
    * REMOVE constraint on torch 2.0.*
    
    * UPDATE installation
    
    * FIX test
    
    * REMOVE strict dependency on micromamba
    
    * FIX docs and debugging states
    
    * FIX cpu only installation
    
    * FIX deepspeed cpu installation
    
    * FIX tf env creation
    
    * FIX makefile
    
    * ADD pypi deployment
    
    * DISABLE push debug
    
    * UPDATE pypi
    
    * UPDATE classifiers
    
    * Update pyproject.toml
    
    ---------
    
    Co-authored-by: Mario Rüttgers <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: zoechbauer1 <[email protected]>
    5 people authored May 7, 2024
    Configuration menu
    Copy the full SHA
    62f01cf View commit details
    Browse the repository at this point in the history

Commits on May 8, 2024

  1. Update README.md

    matbun authored May 8, 2024
    Configuration menu
    Copy the full SHA
    814e755 View commit details
    Browse the repository at this point in the history

Commits on May 28, 2024

  1. Distributed strategy launcher (#141)

    * ADD: distrib launcher mockup
    
    * REFACTOR: cluster env, strategy and launcher
    
    * ADD: Torch Elastic Launcher
    
    * ADD: info on env vars
    
    * ADD: distributed tooling and examples
    
    * new folder
    
    * UPDATE: distributed strategy setup
    
    * generalized for DDP and DS
    
    * add config file
    
    * UPDATE: kwargs
    
    * Update general_trainer.py
    
    * Update general_startscript
    
    * Update general_trainer.py
    
    * UPDATE .gitignore
    
    * Update distrib strategy
    
    * UPDATE torch distributed strategy classes
    
    * Updated docstrings
    
    * Small fixes
    
    * UPDATE docstrings
    
    * ADD deepespeed config loader
    
    * ADD first deepspeed tutorial draft
    
    * UPDATE DDP Dp distrib strategy
    
    * UPDATE horovod strategy
    
    * UPDATE tutorial on torch distributed strategies
    
    * UPDATE torch strategies tutorial
    
    * Update createEnvJSC.sh
    
    * Update hvd_slurm.sh
    
    * Update README.md
    
    * UPDATE distributed tutorial
    
    * Delete tutorials/distributed-ml/torch-ddp-deepspeed-horovod/0
    
    * Fixes to deepspeed startscript
    
    * Update distributed.py
    
    * Update trainer.py
    
    * UPDATE tutorial
    
    * ADD draft MNIST tutorial
    
    * UPDATE DDP tutorial for MNIST
    
    * FIX small details
    
    * Update distributed.py
    
    * Added TF tutorials
    
    * Fixes to tutorials
    
    * Add files via upload
    
    * Update Makefile
    
    * Update README.md
    
    * UPDATE tutorials
    
    * UPDATE documentation and improve explainability
    
    * UPDATE SLURM scripts
    
    * FIX local rank mismatch
    
    * fixed distributed trainer in cyclones use case
    
    * UPDATE launcher
    
    * UPDATE linter
    
    * UPDATE format
    
    * FIX linter
    
    * FIX linter
    
    * Update workflow
    
    * UPDATE workflow
    
    * update
    
    * Update workflow
    
    * UPDATE super linter to v6
    
    * UPDATE super linter to v6.3.0
    
    * UPDATE super linter to slim
    
    * Cleanup
    
    * Update tfmirrored_slurm.sh
    
    * Update tfmirrored_slurm.sh
    
    * REMOVE workflows legacy
    
    * DELETE cyclegan use case
    
    * UPDATE dist training tutorials torch
    
    * RENAME folders with torch
    
    * DRAFT torch imagenet tutorial
    
    * UPDATE configuration
    
    * UPDATE imagenet tutorial
    
    * DRAFT scaling test
    
    * ADD scaling analysis report
    
    * FIX deepspeed micro batchsize
    
    * UPDATE data path
    
    * UPDATE checkpoint to avoid race conditions
    
    * UPDATE scalability report
    
    * UPDATE dataset path
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSCTF.sh
    
    * Update README.md
    
    * Update README.md
    
    * JUBE benchmarks
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSCTF.sh
    
    * ADD logy scale option
    
    * Extract JUBE tutorial
    
    * CLEANUP baselines
    
    * Log epoch time in real-time
    
    * FIX deepspeed dataloader for potential performances improvement
    
    * UPDATE SC bash severity
    
    * FIX deepspeed and horovod trainers
    
    * FIX some code checks
    
    * Unify redundant SLURM job scripts and configuration files
    
    * CLEANUP unused configuration
    
    * Reorg configurations
    
    * Refactor configurations and add documentation
    
    * Update README
    
    * ADD report image
    
    * Improve plot resolution
    
    * UPDATE scaling test
    
    * UPDATE  launcher scripts
    
    * FIX linter
    
    * REMOVE jube tutorial
    
    * Restore ConfigParser
    
    * FIX type hinting
    
    * ADD dev dependencies
    
    * REMOVE experimental scripts
    
    * UPDATE scaling report
    
    * Add SLURM logs
    
    * Refactor log scale
    
    * Update scalability report
    
    * Unify SLURM logs per job
    
    * Update README.md
    
    * Update README.md
    
    * Update README.md
    
    * ADD itwinai installation
    
    * UPDATE torch distributed tutorial 0
    
    * UPDATE torch distributed tutorials
    
    * REMOVE imagenet tutorial
    
    * ADD NonDistributedStrategy and create_dataloader method
    
    * CLEANUP older classes
    
    * Rename strategies
    
    * Simplify structure
    
    * ADD draft new torch trainer class
    
    * UPDATED torch trainer draft
    
    * UPDATE MNIST use case
    
    * INtegrate new trainer into MNIST use case
    
    * UPDATE structure: remove unused files and refactor tests
    
    * Tmp disable unused tests
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * FIX failing inference
    
    * Functiona tests (#133)
    
    * UPDATE tests
    
    * FIX errors
    
    * CLEANUP
    
    * Remove unused workflow
    
    * Fixes to TF new version errors
    
    * Fixes to TF new version errors
    
    * Fixes to TF new version errors
    
    * Fixes to TF new version errors
    
    * Update distributed.py
    
    * Update tfmirrored_slurm.sh
    
    * Update train.py
    
    * TF updates
    
    * Add README
    
    * Python venv (#136)
    
    * Move to python venv
    
    * Update Makefile
    
    * Add Horovod installation
    
    * Update env
    
    * FIX openmpi install
    
    * Add TF explicit version
    
    * UPDATE env creation
    
    * REMOVE constraint on torch 2.0.*
    
    * UPDATE installation
    
    * FIX test
    
    * REMOVE strict dependency on micromamba
    
    * FIX docs and debugging states
    
    * FIX cpu only installation
    
    * FIX deepspeed cpu installation
    
    * FIX tf env creation
    
    * FIX makefile
    
    * ADD pypi deployment
    
    * DISABLE push debug
    
    * UPDATE pypi
    
    * UPDATE classifiers
    
    * Update pyproject.toml
    
    * Update README.md
    
    * Cyclone tf dist (#130)
    
    * get_stretegy
    
    * UPDATE distributed strategy
    
    * change req file
    
    * cycline tf dist
    
    * small bugs
    
    * fix bug in train.py
    
    * REFACTOR cyclones use case
    
    * Activate pytest
    
    * NEW TensorFlow trainer
    
    * ADD user information
    
    ---------
    
    Co-authored-by: ruettgers1 <[email protected]>
    Co-authored-by: Matteo Bunino <[email protected]>
    
    * Interactive distrib ml (#139)
    
    Add examples for distributed ml in interactive mode
    
    * Interactive distrib ml (#140)
    
    Update tutorial
    
    * Disable documentation GH action
    
    * Remove action
    
    ---------
    
    Co-authored-by: Mario Rüttgers <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: zoechbauer1 <[email protected]>
    Co-authored-by: MarioRuettgers <[email protected]>
    6 people authored May 28, 2024
    Configuration menu
    Copy the full SHA
    a6abe91 View commit details
    Browse the repository at this point in the history

Commits on May 29, 2024

  1. Merge main (#142)

    Bring changes on main into dev
    matbun authored May 29, 2024
    Configuration menu
    Copy the full SHA
    87caba1 View commit details
    Browse the repository at this point in the history
  2. Merge branch 'main' into dev

    matbun committed May 29, 2024
    Configuration menu
    Copy the full SHA
    040bbff View commit details
    Browse the repository at this point in the history

Commits on May 30, 2024

  1. Virgo integration (#143)

    * ADD Virgo data pipeline and some refactoring
    
    * FIX typo
    
    * UPDATE README
    
    * ADD training
    
    * ADD TrainingConfiguration
    
    * ADD distributed training and refactor
    
    * update readme
    
    * UPDATE loggers and add tests
    
    * Refactor
    
    * FIX typo
    
    * UPDATE use cases instructions
    
    * ADD checkpointing and refactor.
    
    * FIX linter
    
    * FIX jscpd
    
    * FIX jscpd
    
    * Disable jscpd
    
    * Refactor loggers
    
    * ADD loggers to Virgo use case
    matbun authored May 30, 2024
    Configuration menu
    Copy the full SHA
    8ad45d6 View commit details
    Browse the repository at this point in the history
  2. Update AUTHORS.md

    matbun authored May 30, 2024
    Configuration menu
    Copy the full SHA
    5f43ad2 View commit details
    Browse the repository at this point in the history

Commits on May 31, 2024

  1. Update AUTHORS.md

    matbun authored May 31, 2024
    Configuration menu
    Copy the full SHA
    e7fd283 View commit details
    Browse the repository at this point in the history
  2. Docs dev (#144)

    * commiting docs functionality for testing deployment
    
    * adding documentation deployment relevant files
    
    * updating readthedocs.yaml
    
    * changing directory of requirements.txt
    
    * updating reqs file
    
    * commiting changes and adding pages for tutorials
    
    * fixed distributed trainer in cyclones use case
    
    * adding installation instructions in docs
    
    * adding latest changes to docs
    
    * adding new pages for itwinai modules and other modifications
    
    * modified src/itwinai/torch directory name to solve namespace conflict
    
    * fixing tutorial sections
    
    * fixes in pages appearance
    
    * fixing rendering bugs
    
    * fixing pages appearance bugs
    
    * adding latest modifications
    
    * Deleted duplicate folder after renaming src/itwinai/torch
    
    * adding documentation.yml file for automatic updating on github pages
    
    * modifying documentation.yml file
    
    * updating reqs file to solve bug in deployment
    
    * commiting docs functionality for testing deployment
    
    * adding documentation deployment relevant files
    
    * updating readthedocs.yaml
    
    * changing directory of requirements.txt
    
    * updating reqs file
    
    * commiting changes and adding pages for tutorials
    
    * adding installation instructions in docs
    
    * adding latest changes to docs
    
    * adding new pages for itwinai modules and other modifications
    
    * modified src/itwinai/torch directory name to solve namespace conflict
    
    * fixing tutorial sections
    
    * fixes in pages appearance
    
    * fixing rendering bugs
    
    * fixing pages appearance bugs
    
    * adding latest modifications
    
    * Deleted duplicate folder after renaming src/itwinai/torch
    
    * adding documentation.yml file for automatic updating on github pages
    
    * modifying documentation.yml file
    
    * updating reqs file to solve bug in deployment
    
    * testing automated docs update
    
    * updating getting started page
    
    * fixing pages and adding new content
    
    * bug fixes
    
    * fixing content rendering
    
    * latest fixes in rendering
    
    * Add version feature to docs
    
    * Update .readthedocs.yaml
    
    * fixing display structure in getting started page
    
    * new fixes similar to previous commit
    
    * Update index.rst
    
    * Update index.rst
    
    Text re-edit index
    
    * Update index.rst
    
    change 1 word
    
    * Update .readthedocs.yaml
    
    * Update .readthedocs.yaml
    
    * fixing getting started page
    
    * Text review getting_started_with_itwinai.rst
    
    * Update 3dgan_doc.rst
    
    * Update getting_started_with_itwinai.rst
    
    punctuation
    
    * Fix torch naming problem
    
    * UPDATE requirements
    
    * Remove unnecessary dependencies
    
    * Add docstring
    
    * adding latest changes from dev
    
    * new content and changes
    
    * Update index.rst
    
    toctree revise
    
    * adding pages for distributed ml tutorials
    
    * new shpinx reqs to solve build failing
    
    * Docs update:
    - python code format fixed
    - added brief explanation on ddp in new section
    
    * requirements changed
    
    * UPDATE requirements
    
    * UPDATE requirements and itwinai.types
    
    * ADD CMake and GCC installation
    
    * UPDATE CMake and GCC installation
    
    * UPDATE CMake and GCC installation
    
    * ADD notebooks
    
    * Disable notebooks section
    
    * FIX TOC
    
    * Saving local changes before pulling from remote
    
    * saving updates before pull from origin
    
    * Update itwinai.torch.modules.rst
    
    * Update itwinai.torch.modules.rst
    
    * Update itwinai.torch.modules.rst
    
    * Update itwinai.torch.modules.rst
    
    * adding cyclones and virgo use cases pages
    
    * FIX build errors
    
    * Update TOC
    
    * Update TOC
    
    ---------
    
    Co-authored-by: KalliopiTsolaki <[email protected]>
    Co-authored-by: zoechbauer1 <[email protected]>
    Co-authored-by: VerderK <[email protected]>
    Co-authored-by: Killian Verder <[email protected]>
    5 people authored May 31, 2024
    Configuration menu
    Copy the full SHA
    11f5a71 View commit details
    Browse the repository at this point in the history

Commits on Jun 6, 2024

  1. Merge branch 'main' into dev

    matbun committed Jun 6, 2024
    Configuration menu
    Copy the full SHA
    2ebac3b View commit details
    Browse the repository at this point in the history

Commits on Jun 11, 2024

  1. Update dev (#152)

    * Dev - itwinai 0.0.2 (#138)
    
    * Backend (#59)
    
    * WIP: Tensorflow MNIST use-case
    
    * UPDATE: Tensorflow MNIST version
    
    * ADD: Backend
    
    * ADD: Use-case init
    
    * FIX: Paths and downloading of the data
    
    * FIX: Paths and downloading of the data
    
    * ADD: Setup, Config update
    
    * ADD: Setup, Config update
    
    * UPDATE: File movement into itwinai
    
    * FIX: Move utils from tensorflow to global folder
    
    * FIX: Add setup into torch Executable
    
    * ADD: MNIST Torch Use-case
    
    * FIX: Formatting
    
    * ADD: Lib
    
    * ADD: Lib
    
    * ADD: Tests, Fix Loggers
    
    * Update README.md
    
    * ADD: Tests
    
    * ADD: MLCC
    
    * ADD: Cyclones, Cyclones-pipe
    
    * ADD: TensorflowTrainer
    
    * UPDATE: Move TensorflowTrainer into Backend
    
    * FIX: Dependencies
    
    * ADD: Number of devices
    
    * ADD: initial version of TorchTrainer
    
    * update
    
    * update
    
    * ADD: distributed torch Trainer and decorator
    
    * ADD: New version of torch distribtued trainer and tests
    
    * ADD: load torch dist trainer form config file
    
    * ADD: multi-gpu pytorch trainer
    
    * ADD: download on login node
    
    * FIX: dataloaders in Trainer
    
    * FIX: add dataloaders into trainer
    
    * FIX: clear load and save state
    
    * ADD: Loggers
    
    * FIX: Log in a distributed environment
    
    * TensorFlow backend (#63)
    
    * UPDATE: Remove experimental distribution
    
    * ADD: Mnist distributed
    
    * ADD: Optional strategy
    
    * UPDATE: Conditional distribution
    
    * FIX: Dataloader for mnist
    
    * FIX: Model cloning lambda function for distributed scope
    
    * ADD: CycleGAN
    
    * UPDATE: Types
    
    * UPDATE: Types
    
    * ADD: Local distr
    
    * FIX: learning rates
    
    * ADD: CycleGAN distributed
    
    * FIX: Reduction
    
    * FIX: Distribution
    
    * ADD: tmp.py
    
    * FIX: Distribution
    
    * FIX: Distribution
    
    * FIX: Distribution
    
    * FIX: Distribution
    
    * FIX: Distribution
    
    * FIX: Distribution
    
    * FIX: Distribution
    
    * FIX: Distribution
    
    * UPDATE: Executors
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * ADD: Ray
    
    * ADD: Ray
    
    * ADD: Ray
    
    * ADD: Ray
    
    * ADD: Ray
    
    * ADD: Ray
    
    * ADD:Initial VIRGO
    
    * UPDATE: Optional distribution, tensorflow-gpu
    
    * UPDATE: tensorflow-gpu dependency
    
    * ADD: Unify branches
    
    ---------
    
    Co-authored-by: User3574 <[email protected]>
    
    * Refacto entire code base
    
    * ADD: workflows folder
    
    * FIX: refactor
    
    * FIX: linting
    
    * ADD: how to run use case doc
    
    * ADD: workflows doc
    
    * FIX: MD linter
    
    * Pipe MNIST lightning (#86)
    
    * ADD: lightning distributed + pipeline
    
    * UPDATE: jscpd threshold
    
    * UPDATE: super linter ignore use cases
    
    * ADD: jscpd ignore loggers
    
    * Functional tests for MNIST (#87)
    
    * ADD: use case tests
    
    * FIX: move use case models out of itwinai
    
    * FIX: rearrange modules
    
    * ADD: ConsoleLogger and LoggersCollection
    
    * FIX: loggers filter
    
    * FIX: add TF env creation
    
    * UPDATE: test flag
    
    * ADD: early pytest on slurm
    
    * FIX: duplicated code in TF Trainer
    
    * Sqaaas code (#88)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Sqaaas code (#89)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    * ADD: adaptive branch discovery for SQAaaS action
    
    * Update sqaaas.yml
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * 3dgan use case (#94)
    
    * commiting integration of 3dgan scripts
    
    * ADD: Download dataset
    
    * FIX: DDP distributed training with manual optimization
    
    * ADD: log with MLFlow
    
    * Sqaaas code (#88)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Sqaaas code (#89)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    * ADD: adaptive branch discovery for SQAaaS action
    
    * Update sqaaas.yml
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD: draft predictor and saver
    
    * ADD: stub for inference pipeline
    
    * ADD: small docs
    
    * UPDATE: inference pipeline components
    
    * UPDATE: reorg
    
    * ADD: image generation for inference
    
    * update tag
    
    * ADD: threshold
    
    * ADD: draft inference
    
    * ADD: draft inference wf
    
    * ADD: working inference workflow
    
    * ADD: 3D scatter plots
    
    * ADD: Dockerfile + refactor
    
    * ADD: .dockerignore
    
    * Update .dockerignore
    
    * REMOVE: keras dependency
    
    * ADD: skip download option
    
    ---------
    
    Co-authored-by: Kalliopi Tsolaki <[email protected]>
    Co-authored-by: orviz <[email protected]>
    
    * Sqaaas code (#96)
    
    * ADD: adaptive branch discovery for SQAaaS action
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * ADD: adaptive branch discovery for SQAaaS actin
    
    * Trigger only on main and dev branches
    
    * ADD: double quote
    
    * Trigger pytest only on main and dev PRs
    
    * Torch mnist inference (#95)
    
    * ADD: draft predictor and saver
    
    * ADD: stub for inference pipeline
    
    * ADD: small docs
    
    * UPDATE: inference pipeline components
    
    * UPDATE: reorg
    
    * ADD: image generation for inference
    
    * update tag
    
    * ADD: threshold
    
    * Remove keras dependency
    
    * 3dgan integration (#97)
    
    * commiting integration of 3dgan scripts
    
    * ADD: Download dataset
    
    * FIX: DDP distributed training with manual optimization
    
    * ADD: log with MLFlow
    
    * Sqaaas code (#88)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Sqaaas code (#89)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    * ADD: adaptive branch discovery for SQAaaS action
    
    * Update sqaaas.yml
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD: draft predictor and saver
    
    * ADD: stub for inference pipeline
    
    * ADD: small docs
    
    * UPDATE: inference pipeline components
    
    * UPDATE: reorg
    
    * ADD: image generation for inference
    
    * update tag
    
    * ADD: threshold
    
    * ADD: draft inference
    
    * ADD: draft inference wf
    
    * ADD: working inference workflow
    
    * ADD: 3D scatter plots
    
    * ADD: Dockerfile + refactor
    
    * ADD: .dockerignore
    
    * Update .dockerignore
    
    * REMOVE: keras dependency
    
    * ADD: skip download option
    
    * ADD: cern pipeline.yaml
    
    * UPDATE: dataset loading function
    
    * UPDATE: dataset loading function
    
    * UPDATE conf
    
    * UPDATE refactor
    
    * UPDATE refactor
    
    * UPDATE training docs
    
    ---------
    
    Co-authored-by: Kalliopi Tsolaki <[email protected]>
    Co-authored-by: orviz <[email protected]>
    
    * Add SQAaaS dynamic badge for dev branch (#104)
    
    * Add SQAaaS dynamic badge
    
    * Upgrade to sqaaas-assessment-action@v2
    
    * 3dgan integration (#98)
    
    * commiting integration of 3dgan scripts
    
    * ADD: Download dataset
    
    * FIX: DDP distributed training with manual optimization
    
    * ADD: log with MLFlow
    
    * Sqaaas code (#88)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Sqaaas code (#89)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    * ADD: adaptive branch discovery for SQAaaS action
    
    * Update sqaaas.yml
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD: draft predictor and saver
    
    * ADD: stub for inference pipeline
    
    * ADD: small docs
    
    * UPDATE: inference pipeline components
    
    * UPDATE: reorg
    
    * ADD: image generation for inference
    
    * update tag
    
    * ADD: threshold
    
    * ADD: draft inference
    
    * ADD: draft inference wf
    
    * ADD: working inference workflow
    
    * ADD: 3D scatter plots
    
    * ADD: Dockerfile + refactor
    
    * ADD: .dockerignore
    
    * Update .dockerignore
    
    * REMOVE: keras dependency
    
    * ADD: skip download option
    
    * ADD: cern pipeline.yaml
    
    * UPDATE: dataset loading function
    
    * UPDATE: dataset loading function
    
    * UPDATE conf
    
    * UPDATE refactor
    
    * UPDATE refactor
    
    * UPDATE training docs
    
    * Update readme
    
    * update README
    
    * FIX typo
    
    * Update README
    
    * Update mkdir
    
    * UPDATE data paths
    
    * UPDATE Dockerfile
    
    * UPDATE Dockerfiles
    
    * UPDATE for Singularity execution
    
    * FIX version mismatch
    
    * UPDATE Singularity docs
    
    * Named steps pipe (#100)
    
    * ADD: dict steps pipe
    
    * Relax dependency constraint
    
    * UPDATE Singularity exec command
    
    * UPDATE: Image version
    
    * UPDATE: load components from pipeline
    
    * ADD: docs
    
    * Simplify 3DGAN model config
    
    * ADD: mlflow autologging support for PL trainer
    
    * UPDATE container info
    
    * Refactor
    
    * UPDATE dependencies
    
    * FIX linter problem
    
    * Simplified workflow configuration (#108)
    
    * Add SQAaaS dynamic badge for dev branch (#104)
    
    * Add SQAaaS dynamic badge
    
    * Upgrade to sqaaas-assessment-action@v2
    
    * Add draft example
    
    * UPDATE credits field
    
    * ADD docs
    
    * REFACTOR components and pipeline code
    
    * UPDATE docstring
    
    * UPDATE mnist torch uc
    
    * ADD config file parser draft
    
    * ADD itwinaiCLI and ConfigParser
    
    * ADD docs
    
    * ADD pipeline parser and serializer plus tests
    
    * UPDATE docs
    
    * ADD adapter component and tests (incl parser)
    
    * ADD splitter component, improve pipeline, tests
    
    * UPDATE test
    
    * REMOVE todos
    
    * ADD component tests
    
    * ADD serializer tests
    
    * FIX linter
    
    * ADD basic workflow tutorial
    
    * ADD basic intermediate tutorial
    
    * ADD advanced tutorial
    
    * UPDATE advanced tutorial
    
    * UPDATE use cases
    
    * UPDATE save parameters
    
    * FIX linter
    
    * FIX cyclones use case workflow
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Simplified workflow configuration (#109)
    
    * Add SQAaaS dynamic badge for dev branch (#104)
    
    * Add SQAaaS dynamic badge
    
    * Upgrade to sqaaas-assessment-action@v2
    
    * Add draft example
    
    * UPDATE credits field
    
    * ADD docs
    
    * REFACTOR components and pipeline code
    
    * UPDATE docstring
    
    * UPDATE mnist torch uc
    
    * ADD config file parser draft
    
    * ADD itwinaiCLI and ConfigParser
    
    * ADD docs
    
    * ADD pipeline parser and serializer plus tests
    
    * UPDATE docs
    
    * ADD adapter component and tests (incl parser)
    
    * ADD splitter component, improve pipeline, tests
    
    * UPDATE test
    
    * REMOVE todos
    
    * ADD component tests
    
    * ADD serializer tests
    
    * FIX linter
    
    * ADD basic workflow tutorial
    
    * ADD basic intermediate tutorial
    
    * ADD advanced tutorial
    
    * UPDATE advanced tutorial
    
    * UPDATE use cases
    
    * UPDATE save parameters
    
    * FIX linter
    
    * FIX cyclones use case workflow
    
    * ADD slurm jobscript
    
    * FIX merge error
    
    * FIX components template
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD integration tests
    
    * FIX test
    
    * FIX 3dgan inference test
    
    ---------
    
    Co-authored-by: Kalliopi Tsolaki <[email protected]>
    Co-authored-by: orviz <[email protected]>
    
    * fixed distributed trainer in cyclones use case
    
    * 3dgan integration (#118)
    
    * fixed distributed trainer in cyclones use case
    
    * commiting integration of 3dgan scripts
    
    * ADD: Download dataset
    
    * FIX: DDP distributed training with manual optimization
    
    * ADD: log with MLFlow
    
    * Sqaaas code (#88)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Sqaaas code (#89)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    * ADD: adaptive branch discovery for SQAaaS action
    
    * Update sqaaas.yml
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD: draft predictor and saver
    
    * ADD: stub for inference pipeline
    
    * ADD: small docs
    
    * UPDATE: inference pipeline components
    
    * UPDATE: reorg
    
    * ADD: image generation for inference
    
    * update tag
    
    * ADD: threshold
    
    * ADD: draft inference
    
    * ADD: draft inference wf
    
    * ADD: working inference workflow
    
    * ADD: 3D scatter plots
    
    * ADD: Dockerfile + refactor
    
    * ADD: .dockerignore
    
    * Update .dockerignore
    
    * ADD: skip download option
    
    * ADD: cern pipeline.yaml
    
    * UPDATE: dataset loading function
    
    * UPDATE: dataset loading function
    
    * UPDATE conf
    
    * UPDATE refactor
    
    * UPDATE refactor
    
    * UPDATE training docs
    
    * Update readme
    
    * update README
    
    * FIX typo
    
    * Update README
    
    * Update mkdir
    
    * UPDATE data paths
    
    * UPDATE Dockerfile
    
    * UPDATE Dockerfiles
    
    * UPDATE for Singularity execution
    
    * FIX version mismatch
    
    * UPDATE Singularity docs
    
    * Named steps pipe (#100)
    
    * ADD: dict steps pipe
    
    * Relax dependency constraint
    
    * UPDATE Singularity exec command
    
    * UPDATE: Image version
    
    * UPDATE: load components from pipeline
    
    * ADD: docs
    
    * Simplify 3DGAN model config
    
    * ADD: mlflow autologging support for PL trainer
    
    * UPDATE container info
    
    * Refactor
    
    * UPDATE dependencies
    
    * FIX linter problem
    
    * Simplified workflow configuration (#108)
    
    * Add SQAaaS dynamic badge for dev branch (#104)
    
    * Add SQAaaS dynamic badge
    
    * Upgrade to sqaaas-assessment-action@v2
    
    * Add draft example
    
    * UPDATE credits field
    
    * ADD docs
    
    * REFACTOR components and pipeline code
    
    * UPDATE docstring
    
    * UPDATE mnist torch uc
    
    * ADD config file parser draft
    
    * ADD itwinaiCLI and ConfigParser
    
    * ADD docs
    
    * ADD pipeline parser and serializer plus tests
    
    * UPDATE docs
    
    * ADD adapter component and tests (incl parser)
    
    * ADD splitter component, improve pipeline, tests
    
    * UPDATE test
    
    * REMOVE todos
    
    * ADD component tests
    
    * ADD serializer tests
    
    * FIX linter
    
    * ADD basic workflow tutorial
    
    * ADD basic intermediate tutorial
    
    * ADD advanced tutorial
    
    * UPDATE advanced tutorial
    
    * UPDATE use cases
    
    * UPDATE save parameters
    
    * FIX linter
    
    * FIX cyclones use case workflow
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Simplified workflow configuration (#109)
    
    * Add SQAaaS dynamic badge for dev branch (#104)
    
    * Add SQAaaS dynamic badge
    
    * Upgrade to sqaaas-assessment-action@v2
    
    * Add draft example
    
    * UPDATE credits field
    
    * ADD docs
    
    * REFACTOR components and pipeline code
    
    * UPDATE docstring
    
    * UPDATE mnist torch uc
    
    * ADD config file parser draft
    
    * ADD itwinaiCLI and ConfigParser
    
    * ADD docs
    
    * ADD pipeline parser and serializer plus tests
    
    * UPDATE docs
    
    * ADD adapter component and tests (incl parser)
    
    * ADD splitter component, improve pipeline, tests
    
    * UPDATE test
    
    * REMOVE todos
    
    * ADD component tests
    
    * ADD serializer tests
    
    * FIX linter
    
    * ADD basic workflow tutorial
    
    * ADD basic intermediate tutorial
    
    * ADD advanced tutorial
    
    * UPDATE advanced tutorial
    
    * UPDATE use cases
    
    * UPDATE save parameters
    
    * FIX linter
    
    * FIX cyclones use case workflow
    
    * ADD slurm jobscript
    
    * FIX merge error
    
    * FIX components template
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD integration tests
    
    * FIX test
    
    * FIX 3dgan inference test
    
    * ADD GPU support and update tag
    
    * FIX linter
    
    * ADD override example
    
    * UPDATE 3DGAN inference
    
    * UPDATE inference execution tutorials
    
    * UPDATE README
    
    * UPDATE saver saving sparse tensors
    
    * ADD interlink pods
    
    * UPDATE pod name
    
    * UPDATE annotations
    
    * FIX README
    
    * CLEANUP
    
    * Merge
    
    * update
    
    * ADD tf cpu env
    
    * U[date Makefile
    
    * FIX 3DGAN tests
    
    * FIX data folder path
    
    ---------
    
    Co-authored-by: zoechbauer1 <[email protected]>
    Co-authored-by: Kalliopi Tsolaki <[email protected]>
    Co-authored-by: orviz <[email protected]>
    
    * Unit test 4 dev (#113)
    
    * Define a step for pytest execution
    
    * Fix: use v1 of step action
    
    * Print result of step composition
    
    * Rename step
    
    * Use step previous definition in the assessment
    
    * Rename input: workflow -> steps
    
    * Avoid caching by using 1.0.0
    
    * Set container image
    
    * Bump to v1
    
    * Bump to sqaaas-assessment-action@v2
    
    * Remove 'id' property
    
    * Adapt inputs to v2
    
    * Remove current branch
    
    * Disable test_cyclones_train_tf
    
    * ADD marker
    
    * ADD skip memory heavy
    
    * Disable for PRs
    
    ---------
    
    Co-authored-by: Matteo Bunino <[email protected]>
    
    * Distributed strategy launcher (#117)
    
    * ADD: distrib launcher mockup
    
    * REFACTOR: cluster env, strategy and launcher
    
    * ADD: Torch Elastic Launcher
    
    * ADD: info on env vars
    
    * ADD: distributed tooling and examples
    
    * new folder
    
    * UPDATE: distributed strategy setup
    
    * generalized for DDP and DS
    
    * add config file
    
    * UPDATE: kwargs
    
    * Update general_trainer.py
    
    * Update general_startscript
    
    * Update general_trainer.py
    
    * UPDATE .gitignore
    
    * Update distrib strategy
    
    * UPDATE torch distributed strategy classes
    
    * Updated docstrings
    
    * Small fixes
    
    * UPDATE docstrings
    
    * ADD deepespeed config loader
    
    * ADD first deepspeed tutorial draft
    
    * UPDATE DDP Dp distrib strategy
    
    * UPDATE horovod strategy
    
    * UPDATE tutorial on torch distributed strategies
    
    * UPDATE torch strategies tutorial
    
    * Update createEnvJSC.sh
    
    * Update hvd_slurm.sh
    
    * Update README.md
    
    * UPDATE distributed tutorial
    
    * Delete tutorials/distributed-ml/torch-ddp-deepspeed-horovod/0
    
    * Fixes to deepspeed startscript
    
    * Update distributed.py
    
    * Update trainer.py
    
    * UPDATE tutorial
    
    * ADD draft MNIST tutorial
    
    * UPDATE DDP tutorial for MNIST
    
    * FIX small details
    
    * Update distributed.py
    
    * Added TF tutorials
    
    * Fixes to tutorials
    
    * Add files via upload
    
    * Update Makefile
    
    * Update README.md
    
    * UPDATE tutorials
    
    * UPDATE documentation and improve explainability
    
    * UPDATE SLURM scripts
    
    * FIX local rank mismatch
    
    * fixed distributed trainer in cyclones use case
    
    * UPDATE launcher
    
    * UPDATE linter
    
    * UPDATE format
    
    * FIX linter
    
    * FIX linter
    
    * Update workflow
    
    * UPDATE workflow
    
    * update
    
    * Update workflow
    
    * UPDATE super linter to v6
    
    * UPDATE super linter to v6.3.0
    
    * UPDATE super linter to slim
    
    * Cleanup
    
    * Update tfmirrored_slurm.sh
    
    * Update tfmirrored_slurm.sh
    
    * REMOVE workflows legacy
    
    * DELETE cyclegan use case
    
    * UPDATE dist training tutorials torch
    
    * RENAME folders with torch
    
    * DRAFT torch imagenet tutorial
    
    * UPDATE configuration
    
    * UPDATE imagenet tutorial
    
    * DRAFT scaling test
    
    * ADD scaling analysis report
    
    * FIX deepspeed micro batchsize
    
    * UPDATE data path
    
    * UPDATE checkpoint to avoid race conditions
    
    * UPDATE scalability report
    
    * UPDATE dataset path
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSCTF.sh
    
    * Update README.md
    
    * Update README.md
    
    * JUBE benchmarks
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSCTF.sh
    
    * ADD logy scale option
    
    * Extract JUBE tutorial
    
    * CLEANUP baselines
    
    * Log epoch time in real-time
    
    * FIX deepspeed dataloader for potential performances improvement
    
    * UPDATE SC bash severity
    
    * FIX deepspeed and horovod trainers
    
    * FIX some code checks
    
    * Unify redundant SLURM job scripts and configuration files
    
    * CLEANUP unused configuration
    
    * Reorg configurations
    
    * Refactor configurations and add documentation
    
    * Update README
    
    * ADD report image
    
    * Improve plot resolution
    
    * UPDATE scaling test
    
    * UPDATE  launcher scripts
    
    * FIX linter
    
    * REMOVE jube tutorial
    
    ---------
    
    Co-authored-by: Mario Rüttgers <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: zoechbauer1 <[email protected]>
    
    * Distributed strategy launcher (#127)
    
    Update ParseConfig
    
    * Distributed strategy launcher (#128)
    
    Remove experimental files
    
    * Docs dev (#132)
    
    * commiting docs functionality for testing deployment
    
    * adding documentation deployment relevant files
    
    * updating readthedocs.yaml
    
    * changing directory of requirements.txt
    
    * updating reqs file
    
    * commiting changes and adding pages for tutorials
    
    * fixed distributed trainer in cyclones use case
    
    * adding installation instructions in docs
    
    * adding latest changes to docs
    
    * adding new pages for itwinai modules and other modifications
    
    * modified src/itwinai/torch directory name to solve namespace conflict
    
    * fixing tutorial sections
    
    * fixes in pages appearance
    
    * fixing rendering bugs
    
    * fixing pages appearance bugs
    
    * adding latest modifications
    
    * Deleted duplicate folder after renaming src/itwinai/torch
    
    * adding documentation.yml file for automatic updating on github pages
    
    * modifying documentation.yml file
    
    * updating reqs file to solve bug in deployment
    
    * commiting docs functionality for testing deployment
    
    * adding documentation deployment relevant files
    
    * updating readthedocs.yaml
    
    * changing directory of requirements.txt
    
    * updating reqs file
    
    * commiting changes and adding pages for tutorials
    
    * adding installation instructions in docs
    
    * adding latest changes to docs
    
    * adding new pages for itwinai modules and other modifications
    
    * modified src/itwinai/torch directory name to solve namespace conflict
    
    * fixing tutorial sections
    
    * fixes in pages appearance
    
    * fixing rendering bugs
    
    * fixing pages appearance bugs
    
    * adding latest modifications
    
    * Deleted duplicate folder after renaming src/itwinai/torch
    
    * adding documentation.yml file for automatic updating on github pages
    
    * modifying documentation.yml file
    
    * updating reqs file to solve bug in deployment
    
    * testing automated docs update
    
    * updating getting started page
    
    * fixing pages and adding new content
    
    * bug fixes
    
    * fixing content rendering
    
    * latest fixes in rendering
    
    * Add version feature to docs
    
    * Update .readthedocs.yaml
    
    * fixing display structure in getting started page
    
    * new fixes similar to previous commit
    
    * Update index.rst
    
    * Update index.rst
    
    Text re-edit index
    
    * Update index.rst
    
    change 1 word
    
    * Update .readthedocs.yaml
    
    * Update .readthedocs.yaml
    
    * fixing getting started page
    
    * Text review getting_started_with_itwinai.rst
    
    * Update 3dgan_doc.rst
    
    * Update getting_started_with_itwinai.rst
    
    punctuation
    
    * Fix torch naming problem
    
    ---------
    
    Co-authored-by: KalliopiTsolaki <[email protected]>
    Co-authored-by: zoechbauer1 <[email protected]>
    Co-authored-by: VerderK <[email protected]>
    
    * Distributed strategy launcher (#131)
    
    * ADD: distrib launcher mockup
    
    * REFACTOR: cluster env, strategy and launcher
    
    * ADD: Torch Elastic Launcher
    
    * ADD: info on env vars
    
    * ADD: distributed tooling and examples
    
    * new folder
    
    * UPDATE: distributed strategy setup
    
    * generalized for DDP and DS
    
    * add config file
    
    * UPDATE: kwargs
    
    * Update general_trainer.py
    
    * Update general_startscript
    
    * Update general_trainer.py
    
    * UPDATE .gitignore
    
    * Update distrib strategy
    
    * UPDATE torch distributed strategy classes
    
    * Updated docstrings
    
    * Small fixes
    
    * UPDATE docstrings
    
    * ADD deepespeed config loader
    
    * ADD first deepspeed tutorial draft
    
    * UPDATE DDP Dp distrib strategy
    
    * UPDATE horovod strategy
    
    * UPDATE tutorial on torch distributed strategies
    
    * UPDATE torch strategies tutorial
    
    * Update createEnvJSC.sh
    
    * Update hvd_slurm.sh
    
    * Update README.md
    
    * UPDATE distributed tutorial
    
    * Delete tutorials/distributed-ml/torch-ddp-deepspeed-horovod/0
    
    * Fixes to deepspeed startscript
    
    * Update distributed.py
    
    * Update trainer.py
    
    * UPDATE tutorial
    
    * ADD draft MNIST tutorial
    
    * UPDATE DDP tutorial for MNIST
    
    * FIX small details
    
    * Update distributed.py
    
    * Added TF tutorials
    
    * Fixes to tutorials
    
    * Add files via upload
    
    * Update Makefile
    
    * Update README.md
    
    * UPDATE tutorials
    
    * UPDATE documentation and improve explainability
    
    * UPDATE SLURM scripts
    
    * FIX local rank mismatch
    
    * fixed distributed trainer in cyclones use case
    
    * UPDATE launcher
    
    * UPDATE linter
    
    * UPDATE format
    
    * FIX linter
    
    * FIX linter
    
    * Update workflow
    
    * UPDATE workflow
    
    * update
    
    * Update workflow
    
    * UPDATE super linter to v6
    
    * UPDATE super linter to v6.3.0
    
    * UPDATE super linter to slim
    
    * Cleanup
    
    * Update tfmirrored_slurm.sh
    
    * Update tfmirrored_slurm.sh
    
    * REMOVE workflows legacy
    
    * DELETE cyclegan use case
    
    * UPDATE dist training tutorials torch
    
    * RENAME folders with torch
    
    * DRAFT torch imagenet tutorial
    
    * UPDATE configuration
    
    * UPDATE imagenet tutorial
    
    * DRAFT scaling test
    
    * ADD scaling analysis report
    
    * FIX deepspeed micro batchsize
    
    * UPDATE data path
    
    * UPDATE checkpoint to avoid race conditions
    
    * UPDATE scalability report
    
    * UPDATE dataset path
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSCTF.sh
    
    * Update README.md
    
    * Update README.md
    
    * JUBE benchmarks
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSCTF.sh
    
    * ADD logy scale option
    
    * Extract JUBE tutorial
    
    * CLEANUP baselines
    
    * Log epoch time in real-time
    
    * FIX deepspeed dataloader for potential performances improvement
    
    * UPDATE SC bash severity
    
    * FIX deepspeed and horovod trainers
    
    * FIX some code checks
    
    * Unify redundant SLURM job scripts and configuration files
    
    * CLEANUP unused configuration
    
    * Reorg configurations
    
    * Refactor configurations and add documentation
    
    * Update README
    
    * ADD report image
    
    * Improve plot resolution
    
    * UPDATE scaling test
    
    * UPDATE  launcher scripts
    
    * FIX linter
    
    * REMOVE jube tutorial
    
    * Restore ConfigParser
    
    * FIX type hinting
    
    * ADD dev dependencies
    
    * REMOVE experimental scripts
    
    * UPDATE scaling report
    
    * Add SLURM logs
    
    * Refactor log scale
    
    * Update scalability report
    
    * Unify SLURM logs per job
    
    * Update README.md
    
    * Update README.md
    
    * Update README.md
    
    * ADD itwinai installation
    
    * UPDATE torch distributed tutorial 0
    
    * UPDATE torch distributed tutorials
    
    * REMOVE imagenet tutorial
    
    * ADD NonDistributedStrategy and create_dataloader method
    
    * CLEANUP older classes
    
    * Rename strategies
    
    * Simplify structure
    
    * ADD draft new torch trainer class
    
    * UPDATED torch trainer draft
    
    * UPDATE MNIST use case
    
    * INtegrate new trainer into MNIST use case
    
    * UPDATE structure: remove unused files and refactor tests
    
    * Tmp disable unused tests
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * FIX failing inference
    
    * Functiona tests (#133)
    
    * UPDATE tests
    
    * FIX errors
    
    * CLEANUP
    
    * Remove unused workflow
    
    ---------
    
    Co-authored-by: Mario Rüttgers <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: zoechbauer1 <[email protected]>
    
    * 3dgan integration (#134)
    
    * fixed distributed trainer in cyclones use case
    
    * commiting integration of 3dgan scripts
    
    * ADD: Download dataset
    
    * FIX: DDP distributed training with manual optimization
    
    * ADD: log with MLFlow
    
    * Sqaaas code (#88)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Sqaaas code (#89)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    * ADD: adaptive branch discovery for SQAaaS action
    
    * Update sqaaas.yml
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD: draft predictor and saver
    
    * ADD: stub for inference pipeline
    
    * ADD: small docs
    
    * UPDATE: inference pipeline components
    
    * UPDATE: reorg
    
    * ADD: image generation for inference
    
    * update tag
    
    * ADD: threshold
    
    * ADD: draft inference
    
    * ADD: draft inference wf
    
    * ADD: working inference workflow
    
    * ADD: 3D scatter plots
    
    * ADD: Dockerfile + refactor
    
    * ADD: .dockerignore
    
    * Update .dockerignore
    
    * ADD: skip download option
    
    * ADD: cern pipeline.yaml
    
    * UPDATE: dataset loading function
    
    * UPDATE: dataset loading function
    
    * UPDATE conf
    
    * UPDATE refactor
    
    * UPDATE refactor
    
    * UPDATE training docs
    
    * Update readme
    
    * update README
    
    * FIX typo
    
    * Update README
    
    * Update mkdir
    
    * UPDATE data paths
    
    * UPDATE Dockerfile
    
    * UPDATE Dockerfiles
    
    * UPDATE for Singularity execution
    
    * FIX version mismatch
    
    * UPDATE Singularity docs
    
    * Named steps pipe (#100)
    
    * ADD: dict steps pipe
    
    * Relax dependency constraint
    
    * UPDATE Singularity exec command
    
    * UPDATE: Image version
    
    * UPDATE: load components from pipeline
    
    * ADD: docs
    
    * Simplify 3DGAN model config
    
    * ADD: mlflow autologging support for PL trainer
    
    * UPDATE container info
    
    * Refactor
    
    * UPDATE dependencies
    
    * FIX linter problem
    
    * Simplified workflow configuration (#108)
    
    * Add SQAaaS dynamic badge for dev branch (#104)
    
    * Add SQAaaS dynamic badge
    
    * Upgrade to sqaaas-assessment-action@v2
    
    * Add draft example
    
    * UPDATE credits field
    
    * ADD docs
    
    * REFACTOR components and pipeline code
    
    * UPDATE docstring
    
    * UPDATE mnist torch uc
    
    * ADD config file parser draft
    
    * ADD itwinaiCLI and ConfigParser
    
    * ADD docs
    
    * ADD pipeline parser and serializer plus tests
    
    * UPDATE docs
    
    * ADD adapter component and tests (incl parser)
    
    * ADD splitter component, improve pipeline, tests
    
    * UPDATE test
    
    * REMOVE todos
    
    * ADD component tests
    
    * ADD serializer tests
    
    * FIX linter
    
    * ADD basic workflow tutorial
    
    * ADD basic intermediate tutorial
    
    * ADD advanced tutorial
    
    * UPDATE advanced tutorial
    
    * UPDATE use cases
    
    * UPDATE save parameters
    
    * FIX linter
    
    * FIX cyclones use case workflow
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Simplified workflow configuration (#109)
    
    * Add SQAaaS dynamic badge for dev branch (#104)
    
    * Add SQAaaS dynamic badge
    
    * Upgrade to sqaaas-assessment-action@v2
    
    * Add draft example
    
    * UPDATE credits field
    
    * ADD docs
    
    * REFACTOR components and pipeline code
    
    * UPDATE docstring
    
    * UPDATE mnist torch uc
    
    * ADD config file parser draft
    
    * ADD itwinaiCLI and ConfigParser
    
    * ADD docs
    
    * ADD pipeline parser and serializer plus tests
    
    * UPDATE docs
    
    * ADD adapter component and tests (incl parser)
    
    * ADD splitter component, improve pipeline, tests
    
    * UPDATE test
    
    * REMOVE todos
    
    * ADD component tests
    
    * ADD serializer tests
    
    * FIX linter
    
    * ADD basic workflow tutorial
    
    * ADD basic intermediate tutorial
    
    * ADD advanced tutorial
    
    * UPDATE advanced tutorial
    
    * UPDATE use cases
    
    * UPDATE save parameters
    
    * FIX linter
    
    * FIX cyclones use case workflow
    
    * ADD slurm jobscript
    
    * FIX merge error
    
    * FIX components template
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD integration tests
    
    * FIX test
    
    * FIX 3dgan inference test
    
    * ADD GPU support and update tag
    
    * FIX linter
    
    * ADD override example
    
    * UPDATE 3DGAN inference
    
    * UPDATE inference execution tutorials
    
    * UPDATE README
    
    * UPDATE saver saving sparse tensors
    
    * ADD interlink pods
    
    * UPDATE pod name
    
    * UPDATE annotations
    
    * FIX README
    
    * CLEANUP
    
    * Merge
    
    * update
    
    * ADD tf cpu env
    
    * U[date Makefile
    
    * FIX 3DGAN tests
    
    * FIX data folder path
    
    * ADD offloading of 3DGAN training
    
    * ADAPT 3DGAN training for singularity execution
    
    * UPDATE test and fix linter
    
    ---------
    
    Co-authored-by: zoechbauer1 <[email protected]>
    Co-authored-by: Kalliopi Tsolaki <[email protected]>
    Co-authored-by: orviz <[email protected]>
    
    * Docs dev (#135)
    
    * commiting docs functionality for testing deployment
    
    * adding documentation deployment relevant files
    
    * updating readthedocs.yaml
    
    * changing directory of requirements.txt
    
    * updating reqs file
    
    * commiting changes and adding pages for tutorials
    
    * fixed distributed trainer in cyclones use case
    
    * adding installation instructions in docs
    
    * adding latest changes to docs
    
    * adding new pages for itwinai modules and other modifications
    
    * modified src/itwinai/torch directory name to solve namespace conflict
    
    * fixing tutorial sections
    
    * fixes in pages appearance
    
    * fixing rendering bugs
    
    * fixing pages appearance bugs
    
    * adding latest modifications
    
    * Deleted duplicate folder after renaming src/itwinai/torch
    
    * adding documentation.yml file for automatic updating on github pages
    
    * modifying documentation.yml file
    
    * updating reqs file to solve bug in deployment
    
    * commiting docs functionality for testing deployment
    
    * adding documentation deployment relevant files
    
    * updating readthedocs.yaml
    
    * changing directory of requirements.txt
    
    * updating reqs file
    
    * commiting changes and adding pages for tutorials
    
    * adding installation instructions in docs
    
    * adding latest changes to docs
    
    * adding new pages for itwinai modules and other modifications
    
    * modified src/itwinai/torch directory name to solve namespace conflict
    
    * fixing tutorial sections
    
    * fixes in pages appearance
    
    * fixing rendering bugs
    
    * fixing pages appearance bugs
    
    * adding latest modifications
    
    * Deleted duplicate folder after renaming src/itwinai/torch
    
    * adding documentation.yml file for automatic updating on github pages
    
    * modifying documentation.yml file
    
    * updating reqs file to solve bug in deployment
    
    * testing automated docs update
    
    * updating getting started page
    
    * fixing pages and adding new content
    
    * bug fixes
    
    * fixing content rendering
    
    * latest fixes in rendering
    
    * Add version feature to docs
    
    * Update .readthedocs.yaml
    
    * fixing display structure in getting started page
    
    * new fixes similar to previous commit
    
    * Update index.rst
    
    * Update index.rst
    
    Text re-edit index
    
    * Update index.rst
    
    change 1 word
    
    * Update .readthedocs.yaml
    
    * Update .readthedocs.yaml
    
    * fixing getting started page
    
    * Text review getting_started_with_itwinai.rst
    
    * Update 3dgan_doc.rst
    
    * Update getting_started_with_itwinai.rst
    
    punctuation
    
    * Fix torch naming problem
    
    * UPDATE requirements
    
    ---------
    
    Co-authored-by: KalliopiTsolaki <[email protected]>
    Co-authored-by: zoechbauer1 <[email protected]>
    Co-authored-by: VerderK <[email protected]>
    
    * Distributed strategy launcher (#137)
    
    * ADD: distrib launcher mockup
    
    * REFACTOR: cluster env, strategy and launcher
    
    * ADD: Torch Elastic Launcher
    
    * ADD: info on env vars
    
    * ADD: distributed tooling and examples
    
    * new folder
    
    * UPDATE: distributed strategy setup
    
    * generalized for DDP and DS
    
    * add config file
    
    * UPDATE: kwargs
    
    * Update general_trainer.py
    
    * Update general_startscript
    
    * Update general_trainer.py
    
    * UPDATE .gitignore
    
    * Update distrib strategy
    
    * UPDATE torch distributed strategy classes
    
    * Updated docstrings
    
    * Small fixes
    
    * UPDATE docstrings
    
    * ADD deepespeed config loader
    
    * ADD first deepspeed tutorial draft
    
    * UPDATE DDP Dp distrib strategy
    
    * UPDATE horovod strategy
    
    * UPDATE tutorial on torch distributed strategies
    
    * UPDATE torch strategies tutorial
    
    * Update createEnvJSC.sh
    
    * Update hvd_slurm.sh
    
    * Update README.md
    
    * UPDATE distributed tutorial
    
    * Delete tutorials/distributed-ml/torch-ddp-deepspeed-horovod/0
    
    * Fixes to deepspeed startscript
    
    * Update distributed.py
    
    * Update trainer.py
    
    * UPDATE tutorial
    
    * ADD draft MNIST tutorial
    
    * UPDATE DDP tutorial for MNIST
    
    * FIX small details
    
    * Update distributed.py
    
    * Added TF tutorials
    
    * Fixes to tutorials
    
    * Add files via upload
    
    * Update Makefile
    
    * Update README.md
    
    * UPDATE tutorials
    
    * UPDATE documentation and improve explainability
    
    * UPDATE SLURM scripts
    
    * FIX local rank mismatch
    
    * fixed distributed trainer in cyclones use case
    
    * UPDATE launcher
    
    * UPDATE linter
    
    * UPDATE format
    
    * FIX linter
    
    * FIX linter
    
    * Update workflow
    
    * UPDATE workflow
    
    * update
    
    * Update workflow
    
    * UPDATE super linter to v6
    
    * UPDATE super linter to v6.3.0
    
    * UPDATE super linter to slim
    
    * Cleanup
    
    * Update tfmirrored_slurm.sh
    
    * Update tfmirrored_slurm.sh
    
    * REMOVE workflows legacy
    
    * DELETE cyclegan use case
    
    * UPDATE dist training tutorials torch
    
    * RENAME folders with torch
    
    * DRAFT torch imagenet tutorial
    
    * UPDATE configuration
    
    * UPDATE imagenet tutorial
    
    * DRAFT scaling test
    
    * ADD scaling analysis report
    
    * FIX deepspeed micro batchsize
    
    * UPDATE data path
    
    * UPDATE checkpoint to avoid race conditions
    
    * UPDATE scalability report
    
    * UPDATE dataset path
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSCTF.sh
    
    * Update README.md
    
    * Update README.md
    
    * JUBE benchmarks
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSCTF.sh
    
    * ADD logy scale option
    
    * Extract JUBE tutorial
    
    * CLEANUP baselines
    
    * Log epoch time in real-time
    
    * FIX deepspeed dataloader for potential performances improvement
    
    * UPDATE SC bash severity
    
    * FIX deepspeed and horovod trainers
    
    * FIX some code checks
    
    * Unify redundant SLURM job scripts and configuration files
    
    * CLEANUP unused configuration
    
    * Reorg configurations
    
    * Refactor configurations and add documentation
    
    * Update README
    
    * ADD report image
    
    * Improve plot resolution
    
    * UPDATE scaling test
    
    * UPDATE  launcher scripts
    
    * FIX linter
    
    * REMOVE jube tutorial
    
    * Restore ConfigParser
    
    * FIX type hinting
    
    * ADD dev dependencies
    
    * REMOVE experimental scripts
    
    * UPDATE scaling report
    
    * Add SLURM logs
    
    * Refactor log scale
    
    * Update scalability report
    
    * Unify SLURM logs per job
    
    * Update README.md
    
    * Update README.md
    
    * Update README.md
    
    * ADD itwinai installation
    
    * UPDATE torch distributed tutorial 0
    
    * UPDATE torch distributed tutorials
    
    * REMOVE imagenet tutorial
    
    * ADD NonDistributedStrategy and create_dataloader method
    
    * CLEANUP older classes
    
    * Rename strategies
    
    * Simplify structure
    
    * ADD draft new torch trainer class
    
    * UPDATED torch trainer draft
    
    * UPDATE MNIST use case
    
    * INtegrate new trainer into MNIST use case
    
    * UPDATE structure: remove unused files and refactor tests
    
    * Tmp disable unused tests
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * FIX failing inference
    
    * Functiona tests (#133)
    
    * UPDATE tests
    
    * FIX errors
    
    * CLEANUP
    
    * Remove unused workflow
    
    * Fixes to TF new version errors
    
    * Fixes to TF new version errors
    
    * Fixes to TF new version errors
    
    * Fixes to TF new version errors
    
    * Update distributed.py
    
    * Update tfmirrored_slurm.sh
    
    * Update train.py
    
    * TF updates
    
    * Add README
    
    * Python venv (#136)
    
    * Move to python venv
    
    * Update Makefile
    
    * Add Horovod installation
    
    * Update env
    
    * FIX openmpi install
    
    * Add TF explicit version
    
    * UPDATE env creation
    
    * REMOVE constraint on torch 2.0.*
    
    * UPDATE installation
    
    * FIX test
    
    * REMOVE strict dependency on micromamba
    
    * FIX docs and debugging states
    
    * FIX cpu only installation
    
    * FIX deepspeed cpu installation
    
    * FIX tf env creation
    
    * FIX makefile
    
    * ADD pypi deployment
    
    * DISABLE push debug
    
    * UPDATE pypi
    
    * UPDATE classifiers
    
    * Update pyproject.toml
    
    ---------
    
    Co-authored-by: Mario Rüttgers <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: zoechbauer1 <[email protected]>
    
    * Update README.md
    
    * Distributed strategy launcher (#141)
    
    * ADD: distrib launcher mockup
    
    * REFACTOR: cluster env, strategy and launcher
    
    * ADD: Torch Elastic Launcher
    
    * ADD: info on env vars
    
    * ADD: distributed tooling and examples
    
    * new folder
    
    * UPDATE: distributed strategy setup
    
    * generalized for DDP and DS
    
    * add config file
    
    * UPDATE: kwargs
    
    * Update general_trainer.py
    
    * Update general_startscript
    
    * Update general_trainer.py
    
    * UPDATE .gitignore
    
    * Update distrib strategy
    
    * UPDATE torch distributed strategy classes
    
    * Updated docstrings
    
    * Small fixes
    
    * UPDATE docstrings
    
    * ADD deepespeed config loader
    
    * ADD first deepspeed tutorial draft
    
    * UPDATE DDP Dp distrib strategy
    
    * UPDATE horovod strategy
    
    * UPDATE tutorial on torch distributed strategies
    
    * UPDATE torch strategies tutorial
    
    * Update createEnvJSC.sh
    
    * Update hvd_slurm.sh
    
    * Update README.md
    
    * UPDATE distributed tutorial
    
    * Delete tutorials/distributed-ml/torch-ddp-deepspeed-horovod/0
    
    * Fixes to deepspeed startscript
    
    * Update distributed.py
    
    * Update trainer.py
    
    * UPDATE tutorial
    
    * ADD draft MNIST tutorial
    
    * UPDATE DDP tutorial for MNIST
    
    * FIX small details
    
    * Update distributed.py
    
    * Added TF tutorials
    
    * Fixes to tutorials
    
    * Add files via upload
    
    * Update Makefile
    
    * Update README.md
    
    * UPDATE tutorials
    
    * UPDATE documentation and improve explainability
    
    * UPDATE SLURM scripts
    
    * FIX local rank mismatch
    
    * fixed distributed trainer in cyclones use case
    
    * UPDATE launcher
    
    * UPDATE linter
    
    * UPDATE format
    
    * FIX linter
    
    * FIX linter
    
    * Update workflow
    
    * UPDATE workflow
    
    * update
    
    * Update workflow
    
    * UPDATE super linter to v6
    
    * UPDATE super linter to v6.3.0
    
    * UPDATE super linter to slim
    
    * Cleanup
    
    * Update tfmirrored_slurm.sh
    
    * Update tfmirrored_slurm.sh
    
    * REMOVE workflows legacy
    
    * DELETE cyclegan use case
    
    * UPDATE dist training tutorials torch
    
    * RENAME folders with torch
    
    * DRAFT torch imagenet tutorial
    
    * UPDATE configuration
    
    * UPDATE imagenet tutorial
    
    * DRAFT scaling test
    
    * ADD scaling analysis report
    
    * FIX deepspeed micro batchsize
    
    * UPDATE data path
    
    * UPDATE checkpoint to avoid race conditions
    
    * UPDATE scalability report
    
    * UPDATE dataset path
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSCTF.sh
    
    * Update README.md
    
    * Update README.md
    
    * JUBE benchmarks
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSCTF.sh
    
    * ADD logy scale option
    
    * Extract JUBE tutorial
    
    * CLEANUP baselines
    
    * Log epoch time in real-time
    
    * FIX deepspeed dataloader for potential performances improvement
    
    * UPDATE SC bash severity
    
    * FIX deepspeed and horovod trainers
    
    * FIX some code checks
    
    * Unify redundant SLURM job scripts and configuration files
    
    * CLEANUP unused configuration
    
    * Reorg configurations
    
    * Refactor configurations and add documentation
    
    * Update README
    
    * ADD report image
    
    * Improve plot resolution
    
    * UPDATE scaling test
    
    * UPDATE  launcher scripts
    
    * FIX linter
    
    * REMOVE jube tutorial
    
    * Restore ConfigParser
    
    * FIX type hinting
    
    * ADD dev dependencies
    
    * REMOVE experimental scripts
    
    * UPDATE scaling report
    
    * Add SLURM logs
    
    * Refactor log scale
    
    * Update scalability report
    
    * Unify SLURM logs per job
    
    * Update README.md
    
    * Update README.md
    
    * Update README.md
    
    * ADD itwinai installation
    
    * UPDATE torch distributed tutorial 0
    
    * UPDATE torch distributed tutorials
    
    * REMOVE imagenet tutorial
    
    * ADD NonDistributedStrategy and create_dataloader method
    
    * CLEANUP older classes
    
    * Rename strategies
    
    * Simplify structure
    
    * ADD draft new torch trainer class
    
    * UPDATED torch trainer draft
    
    * UPDATE MNIST use case
    
    * INtegrate new trainer into MNIST use case
    
    * UPDATE structure: remove unused files and refactor tests
    
    * Tmp disable unused tests
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * FIX failing inference
    
    * Functiona tests (#133)
    
    * UPDATE tests
    
    * FIX errors
    
    * CLEANUP
    
    * Remove unused workflow
    
    * Fixes to TF new version errors
    
    * Fixes to TF new version errors
    
    * Fixes to TF new version errors
    
    * Fixes to TF new version errors
    
    * Update distributed.py
    
    * Update tfmirrored_slurm.sh
    
    * Update train.py
    
    * TF updates
    
    * Add README
    
    * Python venv (#136)
    
    * Move to python venv
    
    * Update Makefile
    
    * Add Horovod installation
    
    * Update env
    
    * FIX openmpi install
    
    * Add TF explicit version
    
    * UPDATE env creation
    
    * REMOVE constraint on torch 2.0.*
    
    * UPDATE installation
    
    * FIX test
    
    * REMOVE strict dependency on micromamba
    
    * FIX docs and debugging states
    
    * FIX cpu only installation
    
    * FIX deepspeed cpu installation
    
    * FIX tf env creation
    
    * FIX makefile
    
    * ADD pypi deployment
    
    * DISABLE push debug
    
    * UPDATE pypi
    
    * UPDATE classifiers
    
    * Update pyproject.toml
    
    * Update README.md
    
    * Cyclone tf dist (#130)
    
    * get_stretegy
    
    * UPDATE distributed strategy
    
    * change req file
    
    * cycline tf dist
    
    * small bugs
    
    * fix bug in train.py
    
    * REFACTOR cyclones use case
    
    * Activate pytest
    
    * NEW TensorFlow trainer
    
    * ADD user information
    
    ---------
    
    Co-authored-by: ruettgers1 <[email protected]>
    Co-authored-by: Matteo Bunino <[email protected]>
    
    * Interactive distrib ml (#139)
    
    Add examples for distributed ml in interactive mode
    
    * Interactive distrib ml (#140)
    
    Update tutorial
    
    * Disable documentation GH action
    
    * Remove action
    
    ---------
    
    Co-authored-by: Mario Rüttgers <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: zoechbauer1 <[email protected]>
    Co-authored-by: MarioRuettgers <[email protected]>
    
    * Merge main (#142)
    
    Bring changes on main into dev
    
    * Virgo integration (#143)
    
    * ADD Virgo data pipeline and some refactoring
    
    * FIX typo
    
    * UPDATE README
    
    * ADD training
    
    * ADD TrainingConfiguration
    
    * ADD distributed training and refactor
    
    * update readme
    
    * UPDATE loggers and add tests
    
    * Refactor
    
    * FIX typo
    
    * UPDATE use cases instructions
    
    * ADD checkpointing and refactor.
    
    * FIX linter
    
    * FIX jscpd
    
    * FIX jscpd
    
    * Disable jscpd
    
    * Refactor loggers
    
    * ADD loggers to Virgo use case
    
    * Update AUTHORS.md
    
    * Update AUTHORS.md
    
    * Docs dev (#144)
    
    * commiting docs functionality for testing deployment
    
    * adding documentation deployment relevant files
    
    * updating readthedocs.yaml
    
    * changing directory of requirements.txt
    
    * updating reqs file
    
    * commiting changes and adding pages for tutorials
    
    * fixed distributed trainer in cyclones use case
    
    * adding installation instructions in docs
    
    * adding latest changes to docs
    
    * adding new pages for itwinai modules and other modifications
    
    * modified src/itwinai/torch directory name to solve namespace conflict
    
    * fixing tutorial sections
    
    * fixes in pages appearance
    
    * fixing rendering bugs
    
    * fixing pages appearance bugs
    
    * adding latest modifications
    
    * Deleted duplicate folder after renaming src/itwinai/torch
    
    * adding documentation.yml file for automatic updating on github pages
    
    * modifying documentation.yml file
    
    * updating reqs file to solve bug in deployment
    
    * commiting docs functionality for testing deployment
    
    * adding documentation deployment relevant files
    
    * updating readthedocs.yaml
    
    * changing directory of requirements.txt
    
    * updating reqs file
    
    * commiting changes and adding pages for tutorials
    
    * adding installation instructions in docs
    
    * adding latest changes to docs
    
    * adding new pages for itwinai modules and other modifications
    
    * modified src/itwinai/torch directory name to solve namespace conflict
    
    * fixing tutorial sections
    
    * fixes in pages appearance
    
    * fixing rendering bugs
    
    * fixing pages appearance bugs
    
    * adding latest modifications
    
    * Deleted duplicate folder after renaming src/itwinai/torch
    
    * adding documentation.yml file for automatic updating on github pages
    
    * modifying documentation.yml file
    
    * updating reqs file to solve bug in deployment
    
    * testing automated docs update
    
    * updating getting started page
    
    * fixing pages and adding new content
    
    * bug fixes
    
    * fixing content rendering
    
    * latest fixes in rendering
    
    * Add version feature to docs
    
    * Update .readthedocs.yaml
    
    * fixing display structure in getting started page
    
    * new fixes similar to previous commit
    
    * Update index.rst
    
    * Update index.rst
    
    Text re-edit index
    
    * Update index.rst
    
    change 1 word
    
    * Update .readthedocs.yaml
    
    * Update .readthedocs.yaml
    
    * fixing getting started page
    
    * Text review getting_started_with_itwinai.rst
    
    * Update 3dgan_doc.rst
    
    * Update getting_started_with_itwinai.rst
    
    punctuation
    
    * Fix torch naming problem
    
    * UPDATE requirements
    
    * Remove unnecessary dependencies
    
    * Add docstring
    
    * adding latest changes from dev
    
    * new content and changes
    
    * Update index.rst
    
    toctree revise
    
    * adding pages for distributed ml tutorials
    
    * new shpinx reqs to solve build failing
    
    * Docs update:
    - python code format fixed
    - added brief explanation on ddp in new section
    
    * requirements changed
    
    * UPDATE requirements
    
    * UPDATE requirements and itwinai.types
    
    * ADD CMake and GCC installation
    
    * UPDATE CMake and GCC installation
    
    * UPDATE CMake and GCC installation
    
    * ADD notebooks
    
    * Disable notebooks section
    
    * FIX TOC
    
    * Saving local changes before pulling from remote
    
    * saving updates before pull from origin
    
    * Update itwinai.torch.modules.rst
    
    * Update itwinai.torch.modules.rst
    
    * Update itwinai.torch.modules.rst
    
    * Update itwinai.torch.modules.rst
    
    * adding cyclones and virgo use cases pages
    
    * FIX build errors
    
    * Update TOC
    
    * Update TOC
    
    ---------
    
    Co-authored-by: KalliopiTsolaki <[email protected]>
    Co-authored-by: zoechbauer1 <[email protected]>
    Co-authored-by: VerderK <[email protected]>
    Co-authored-by: Killian Verder <[email protected]>
    
    ---------
    
    Co-authored-by: Roman Machacek <[email protected]>
    Co-authored-by: linxUser3574 <[email protected]>
    Co-authored-by: orviz <[email protected]>
    Co-authored-by: Kalliopi Tsolaki <[email protected]>
    Co-authored-by: zoechbauer1 <[email protected]>
    Co-authored-by: Mario Rüttgers <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: KalliopiTsolaki <[email protected]>
    Co-authored-by: VerderK <[email protected]>
    Co-authored-by: MarioRuettgers <[email protected]>
    Co-authored-by: Killian Verder <[email protected]>
    
    * Delete .github/workflows/pages.yml
    
    * ADD quick install for users (#145)
    
    * User install (#146)
    
    * ADD quick install for users
    
    * UPDATE installer
    
    * fix framework selection
    
    * UPDATE installer
    
    * Update README.md
    
    * Update README.md
    
    * Improve docstring parsing and refactor (#147)
    
    * UPDATE print patch and refactor
    
    * Cleanup
    
    * Cleanup
    
    * Cleanup
    
    * Cleanup
    
    * FIX broken import
    
    * UPDATE docs
    
    * FIX docstring parsing
    
    * Preserve ordering
    
    * Update cli.py
    
    * Update docs (#148)
    
    * Update README.md
    
    * ADD missing doctrings
    
    * Bump actions/setup-python from 4 to 5 (#149)
    
    Bumps [actions/setup-python](https://github.com/actions/setup-python) from 4 to 5.
    - [Release notes](https://github.com/actions/setup-python/releases)
    - [Commits](https://github.com/actions/setup-python/compare/v4...v5)
    
    ---
    updated-dependencies:
    - dependency-name: actions/setup-python
      dependency-type: direct:production
      update-type: version-update:semver-major
    ...
    
    Signed-off-by: dependabot[bot] <[email protected]>
    Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
    
    * Update README.md
    
    * Update README.md
    
    * Update README.md
    
    * updating doc pages (#150)
    
    Co-authored-by: KalliopiTsolaki <ktsolaki@LAPTOP-4683QBL6>
    
    * Update cyclones_doc.rst
    
    * Bug fixes and addition of CERFACS use-case (#151)
    
    * Update train.py
    
    * Update generic_tf.sh
    
    * Update pyproject.toml
    
    * Update train.py
    
    * Fix: head problems with MacOS
    
    * Fixes for MacOS support
    
    * Fix: Update basic_components.py
    
    * Addition of cerfacs use-case
    
    * Update README.md
    
    * Update train.py
    
    * Update cyclones_doc.rst
    
    * Update startscript.sh
    
    * Update pyproject.toml
    
    * Update mnist.py
    
    * Update mnist.py
    
    * Update generic_tf.sh
    
    * Update requirements.txt
    
    * Update requirements.txt
    
    * Docs changes (#153)
    
    * updating doc pages
    
    * testing if changing the GH edit url works
    
    * adding repo link in toc
    
    ---------
    
    Co-authored-by: KalliopiTsolaki <ktsolaki@LAPTOP-4683QBL6>
    
    * Update pyproject.toml
    
    ---------
    
    Signed-off-by: dependabot[bot] <[email protected]>
    Co-authored-by: Roman Machacek <[email protected]>
    Co-authored-by: linxUser3574 <[email protected]>
    Co-authored-by: orviz <[email protected]>
    Co-authored-by: Kalliopi Tsolaki <[email protected]>
    Co-authored-by: zoechbauer1 <[email protected]>
    Co-authored-by: Mario Rüttgers <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: KalliopiTsolaki <[email protected]>
    Co-authored-by: VerderK <[email protected]>
    Co-authored-by: MarioRuettgers <[email protected]>
    Co-authored-by: Killian Verder <[email protected]>
    Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
    Co-authored-by: KalliopiTsolaki <[email protected]>
    Co-authored-by: KalliopiTsolaki <ktsolaki@LAPTOP-4683QBL6>
    16 people authored Jun 11, 2024
    Configuration menu
    Copy the full SHA
    be3ec87 View commit details
    Browse the repository at this point in the history

Commits on Jun 13, 2024

  1. Update sqaaas.yml

    matbun authored Jun 13, 2024
    Configuration menu
    Copy the full SHA
    d5bcfb3 View commit details
    Browse the repository at this point in the history

Commits on Jun 25, 2024

  1. added train to start integration

    iferrario committed Jun 25, 2024
    Configuration menu
    Copy the full SHA
    742132c View commit details
    Browse the repository at this point in the history
  2. update requirements.txt

    iacopoff committed Jun 25, 2024
    Configuration menu
    Copy the full SHA
    1c0aff3 View commit details
    Browse the repository at this point in the history

Commits on Jun 26, 2024

  1. downsamplingo option

    iacopoff committed Jun 26, 2024
    Configuration menu
    Copy the full SHA
    c4abb26 View commit details
    Browse the repository at this point in the history
  2. fix plot

    iacopoff committed Jun 26, 2024
    Configuration menu
    Copy the full SHA
    eef0b39 View commit details
    Browse the repository at this point in the history

Commits on Jun 27, 2024

  1. Configuration menu
    Copy the full SHA
    079cdf5 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    053cd49 View commit details
    Browse the repository at this point in the history
  3. UPDATE comments

    matbun committed Jun 27, 2024
    Configuration menu
    Copy the full SHA
    93066b0 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    77c84d2 View commit details
    Browse the repository at this point in the history
  5. UPDATE docs

    matbun committed Jun 27, 2024
    Configuration menu
    Copy the full SHA
    b8065e3 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    3c47b4b View commit details
    Browse the repository at this point in the history

Commits on Jul 1, 2024

  1. prepare for distributed run

    iacopoff committed Jul 1, 2024
    Configuration menu
    Copy the full SHA
    5a36185 View commit details
    Browse the repository at this point in the history
  2. enable distributed sampler

    iacopoff committed Jul 1, 2024
    Configuration menu
    Copy the full SHA
    2cf5521 View commit details
    Browse the repository at this point in the history
  3. fix

    iacopoff committed Jul 1, 2024
    Configuration menu
    Copy the full SHA
    3e1c3f9 View commit details
    Browse the repository at this point in the history

Commits on Jul 9, 2024

  1. prepare to run on JSC

    iacopoff committed Jul 9, 2024
    Configuration menu
    Copy the full SHA
    6d4e8d3 View commit details
    Browse the repository at this point in the history
  2. update train

    ferrario2 committed Jul 9, 2024
    Configuration menu
    Copy the full SHA
    2661d3b View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    3e5b8de View commit details
    Browse the repository at this point in the history

Commits on Jul 18, 2024

  1. update convlstm

    iacopoff committed Jul 18, 2024
    Configuration menu
    Copy the full SHA
    2ef8f3d View commit details
    Browse the repository at this point in the history
  2. blacked

    iacopoff committed Jul 18, 2024
    Configuration menu
    Copy the full SHA
    a2154a8 View commit details
    Browse the repository at this point in the history

Commits on Jul 19, 2024

  1. Configuration menu
    Copy the full SHA
    46d7900 View commit details
    Browse the repository at this point in the history
  2. Update dist-train.py

    matbun authored Jul 19, 2024
    Configuration menu
    Copy the full SHA
    25bf5a5 View commit details
    Browse the repository at this point in the history
  3. Update dist-train.py

    correct tqdm import error
    iacopoff authored Jul 19, 2024
    Configuration menu
    Copy the full SHA
    cd54241 View commit details
    Browse the repository at this point in the history
  4. Update dist-train.py

    in train_val, strategy.device changed to strategy.device()
    iacopoff authored Jul 19, 2024
    Configuration menu
    Copy the full SHA
    304aa23 View commit details
    Browse the repository at this point in the history

Commits on Jul 24, 2024

  1. add distributed support

    MutegekiHenry committed Jul 24, 2024
    Configuration menu
    Copy the full SHA
    d4cbebb View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    99cbcfb View commit details
    Browse the repository at this point in the history
  3. add gather

    MutegekiHenry committed Jul 24, 2024
    Configuration menu
    Copy the full SHA
    f9b66ef View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    81f5739 View commit details
    Browse the repository at this point in the history
  5. add logging

    MutegekiHenry committed Jul 24, 2024
    Configuration menu
    Copy the full SHA
    210fc15 View commit details
    Browse the repository at this point in the history

Commits on Jul 25, 2024

  1. clean up script

    MutegekiHenry committed Jul 25, 2024
    Configuration menu
    Copy the full SHA
    bc5e0ac View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    4476930 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    26779e6 View commit details
    Browse the repository at this point in the history

Commits on Jul 26, 2024

  1. Update cli.py

    matbun authored Jul 26, 2024
    Configuration menu
    Copy the full SHA
    ccb00d4 View commit details
    Browse the repository at this point in the history

Commits on Aug 1, 2024

  1. update torch dist final

    iacopoff committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    5115752 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    7a9120f View commit details
    Browse the repository at this point in the history

Commits on Aug 9, 2024

  1. prepared slurm script

    iacopoff committed Aug 9, 2024
    Configuration menu
    Copy the full SHA
    fe26f3c View commit details
    Browse the repository at this point in the history

Commits on Aug 14, 2024

  1. Configuration menu
    Copy the full SHA
    353c158 View commit details
    Browse the repository at this point in the history

Commits on Aug 21, 2024

  1. add scaling tests

    MutegekiHenry committed Aug 21, 2024
    Configuration menu
    Copy the full SHA
    828624f View commit details
    Browse the repository at this point in the history
  2. add scaling tests

    MutegekiHenry committed Aug 21, 2024
    Configuration menu
    Copy the full SHA
    07223cf View commit details
    Browse the repository at this point in the history
  3. add plots

    MutegekiHenry committed Aug 21, 2024
    Configuration menu
    Copy the full SHA
    5c5bde9 View commit details
    Browse the repository at this point in the history

Commits on Aug 23, 2024

  1. fix env path

    MutegekiHenry committed Aug 23, 2024
    Configuration menu
    Copy the full SHA
    f236080 View commit details
    Browse the repository at this point in the history
  2. add hpo eurac

    MutegekiHenry committed Aug 23, 2024
    Configuration menu
    Copy the full SHA
    6b698ec View commit details
    Browse the repository at this point in the history
  3. correct start hpo cmd

    MutegekiHenry committed Aug 23, 2024
    Configuration menu
    Copy the full SHA
    f21de4d View commit details
    Browse the repository at this point in the history

Commits on Aug 26, 2024

  1. test distributed slurm

    iacopoff committed Aug 26, 2024
    Configuration menu
    Copy the full SHA
    a4db372 View commit details
    Browse the repository at this point in the history
  2. fix hpo functionality

    MutegekiHenry committed Aug 26, 2024
    Configuration menu
    Copy the full SHA
    8def582 View commit details
    Browse the repository at this point in the history

Commits on Aug 27, 2024

  1. working distributed version

    iacopoff committed Aug 27, 2024
    Configuration menu
    Copy the full SHA
    54a8d91 View commit details
    Browse the repository at this point in the history

Commits on Aug 29, 2024

  1. Configuration menu
    Copy the full SHA
    53373fb View commit details
    Browse the repository at this point in the history
  2. add hpo results vizualization script

    MutegekiHenry committed Aug 29, 2024
    Configuration menu
    Copy the full SHA
    cbdc819 View commit details
    Browse the repository at this point in the history
  3. update data path

    MutegekiHenry committed Aug 29, 2024
    Configuration menu
    Copy the full SHA
    d1ed009 View commit details
    Browse the repository at this point in the history

Commits on Sep 17, 2024

  1. Configuration menu
    Copy the full SHA
    4e5b1e9 View commit details
    Browse the repository at this point in the history

Commits on Sep 18, 2024

  1. convlstm

    iacopoff committed Sep 18, 2024
    Configuration menu
    Copy the full SHA
    08b31be View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    e917a14 View commit details
    Browse the repository at this point in the history
  3. conv

    iacopoff committed Sep 18, 2024
    Configuration menu
    Copy the full SHA
    2194d3d View commit details
    Browse the repository at this point in the history
  4. Prov4ml integration (#192)

    * ADD prov4ml logger
    
    * UPDATE enum access fields
    
    * UPDATE loggers documentation and first integration attempt
    
    * ADD prov logger
    
    * format kinds table
    
    * MIGRATE to upstream prov4ml
    
    * ADD docs build on JSC
    
    * ADD RTD website
    
    * UPDATE docs creation
    
    * Refactor
    
    * UPDATE logger
    
    * Remove lightning callbacks and loggers
    
    * ADD checkpoints
    
    * UPDATE logger kind docs
    
    * Update README.md
    
    * ADD rank on loggers
    
    * Update loggers.py
    
    * Update loggers.py
    
    * Update loggers.py
    
    * Update loggers.py
    
    * Update loggers.py
    
    * FIX linter
    
    * REFACTOR loggers
    
    * Simplify prov4ml switch case
    
    * UPDATE loggers
    
    * FIX prov graph
    
    * REFACTOR itwinai logging
    
    * UPDATE SLURM jobscripts
    
    * REFACTOR
    
    * Update
    
    * ADD prov experiments
    
    * REFACTOR provenance logs and SLURM jobscripts
    
    * REMOVE duplication
    
    * FIX dataset name
    
    * UPDATE README
    
    * SKIP cyclones use case
    
    * UPDATE version
    
    * REMOVE redundant parameter
    
    * CLEANUP
    
    * ADD warning
    
    * ADD warning
    
    * UPDATE README
    
    * FIX errors
    
    * ADD docs
    
    * UPDATE scripts
    
    * UPDATE scripts
    matbun authored and jarlsondre committed Sep 18, 2024
    Configuration menu
    Copy the full SHA
    106663d View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    bd7c065 View commit details
    Browse the repository at this point in the history
  6. Update index.rst

    matbun authored and jarlsondre committed Sep 18, 2024
    Configuration menu
    Copy the full SHA
    661f1d8 View commit details
    Browse the repository at this point in the history
  7. Update index.rst

    matbun authored and jarlsondre committed Sep 18, 2024
    Configuration menu
    Copy the full SHA
    a05d2e4 View commit details
    Browse the repository at this point in the history
  8. Update index.rst

    matbun authored and jarlsondre committed Sep 18, 2024
    Configuration menu
    Copy the full SHA
    a29a2cd View commit details
    Browse the repository at this point in the history
  9. Update pyproject.toml

    matbun authored and jarlsondre committed Sep 18, 2024
    Configuration menu
    Copy the full SHA
    9e85ddb View commit details
    Browse the repository at this point in the history
  10. Update pyproject.toml

    matbun authored and jarlsondre committed Sep 18, 2024
    Configuration menu
    Copy the full SHA
    99f8086 View commit details
    Browse the repository at this point in the history
  11. Bump github/super-linter from 6 to 7 (#198)

    Bumps [github/super-linter](https://github.com/github/super-linter) from 6 to 7.
    - [Release notes](https://github.com/github/super-linter/releases)
    - [Changelog](https://github.com/github/super-linter/blob/main/CHANGELOG.md)
    - [Commits](github/super-linter@v6...v7)
    
    ---
    updated-dependencies:
    - dependency-name: github/super-linter
      dependency-type: direct:production
      update-type: version-update:semver-major
    ...
    
    Signed-off-by: dependabot[bot] <[email protected]>
    Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
    dependabot[bot] authored and jarlsondre committed Sep 18, 2024
    Configuration menu
    Copy the full SHA
    0797041 View commit details
    Browse the repository at this point in the history
  12. Itwinai container (#197)

    * Backend (#59)
    
    * WIP: Tensorflow MNIST use-case
    
    * UPDATE: Tensorflow MNIST version
    
    * ADD: Backend
    
    * ADD: Use-case init
    
    * FIX: Paths and downloading of the data
    
    * FIX: Paths and downloading of the data
    
    * ADD: Setup, Config update
    
    * ADD: Setup, Config update
    
    * UPDATE: File movement into itwinai
    
    * FIX: Move utils from tensorflow to global folder
    
    * FIX: Add setup into torch Executable
    
    * ADD: MNIST Torch Use-case
    
    * FIX: Formatting
    
    * ADD: Lib
    
    * ADD: Lib
    
    * ADD: Tests, Fix Loggers
    
    * Update README.md
    
    * ADD: Tests
    
    * ADD: MLCC
    
    * ADD: Cyclones, Cyclones-pipe
    
    * ADD: TensorflowTrainer
    
    * UPDATE: Move TensorflowTrainer into Backend
    
    * FIX: Dependencies
    
    * ADD: Number of devices
    
    * ADD: initial version of TorchTrainer
    
    * update
    
    * update
    
    * ADD: distributed torch Trainer and decorator
    
    * ADD: New version of torch distribtued trainer and tests
    
    * ADD: load torch dist trainer form config file
    
    * ADD: multi-gpu pytorch trainer
    
    * ADD: download on login node
    
    * FIX: dataloaders in Trainer
    
    * FIX: add dataloaders into trainer
    
    * FIX: clear load and save state
    
    * ADD: Loggers
    
    * FIX: Log in a distributed environment
    
    * TensorFlow backend (#63)
    
    * UPDATE: Remove experimental distribution
    
    * ADD: Mnist distributed
    
    * ADD: Optional strategy
    
    * UPDATE: Conditional distribution
    
    * FIX: Dataloader for mnist
    
    * FIX: Model cloning lambda function for distributed scope
    
    * ADD: CycleGAN
    
    * UPDATE: Types
    
    * UPDATE: Types
    
    * ADD: Local distr
    
    * FIX: learning rates
    
    * ADD: CycleGAN distributed
    
    * FIX: Reduction
    
    * FIX: Distribution
    
    * ADD: tmp.py
    
    * FIX: Distribution
    
    * FIX: Distribution
    
    * FIX: Distribution
    
    * FIX: Distribution
    
    * FIX: Distribution
    
    * FIX: Distribution
    
    * FIX: Distribution
    
    * FIX: Distribution
    
    * UPDATE: Executors
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * FIX: Distributed Dataset
    
    * ADD: Ray
    
    * ADD: Ray
    
    * ADD: Ray
    
    * ADD: Ray
    
    * ADD: Ray
    
    * ADD: Ray
    
    * ADD:Initial VIRGO
    
    * UPDATE: Optional distribution, tensorflow-gpu
    
    * UPDATE: tensorflow-gpu dependency
    
    * ADD: Unify branches
    
    ---------
    
    Co-authored-by: User3574 <[email protected]>
    
    * Refacto entire code base
    
    * ADD: workflows folder
    
    * FIX: refactor
    
    * FIX: linting
    
    * ADD: how to run use case doc
    
    * ADD: workflows doc
    
    * FIX: MD linter
    
    * Pipe MNIST lightning (#86)
    
    * ADD: lightning distributed + pipeline
    
    * UPDATE: jscpd threshold
    
    * UPDATE: super linter ignore use cases
    
    * ADD: jscpd ignore loggers
    
    * Functional tests for MNIST (#87)
    
    * ADD: use case tests
    
    * FIX: move use case models out of itwinai
    
    * FIX: rearrange modules
    
    * ADD: ConsoleLogger and LoggersCollection
    
    * FIX: loggers filter
    
    * FIX: add TF env creation
    
    * UPDATE: test flag
    
    * ADD: early pytest on slurm
    
    * FIX: duplicated code in TF Trainer
    
    * Sqaaas code (#88)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Sqaaas code (#89)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    * ADD: adaptive branch discovery for SQAaaS action
    
    * Update sqaaas.yml
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * 3dgan use case (#94)
    
    * commiting integration of 3dgan scripts
    
    * ADD: Download dataset
    
    * FIX: DDP distributed training with manual optimization
    
    * ADD: log with MLFlow
    
    * Sqaaas code (#88)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Sqaaas code (#89)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    * ADD: adaptive branch discovery for SQAaaS action
    
    * Update sqaaas.yml
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD: draft predictor and saver
    
    * ADD: stub for inference pipeline
    
    * ADD: small docs
    
    * UPDATE: inference pipeline components
    
    * UPDATE: reorg
    
    * ADD: image generation for inference
    
    * update tag
    
    * ADD: threshold
    
    * ADD: draft inference
    
    * ADD: draft inference wf
    
    * ADD: working inference workflow
    
    * ADD: 3D scatter plots
    
    * ADD: Dockerfile + refactor
    
    * ADD: .dockerignore
    
    * Update .dockerignore
    
    * REMOVE: keras dependency
    
    * ADD: skip download option
    
    ---------
    
    Co-authored-by: Kalliopi Tsolaki <[email protected]>
    Co-authored-by: orviz <[email protected]>
    
    * Sqaaas code (#96)
    
    * ADD: adaptive branch discovery for SQAaaS action
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * ADD: adaptive branch discovery for SQAaaS actin
    
    * Trigger only on main and dev branches
    
    * ADD: double quote
    
    * Trigger pytest only on main and dev PRs
    
    * Torch mnist inference (#95)
    
    * ADD: draft predictor and saver
    
    * ADD: stub for inference pipeline
    
    * ADD: small docs
    
    * UPDATE: inference pipeline components
    
    * UPDATE: reorg
    
    * ADD: image generation for inference
    
    * update tag
    
    * ADD: threshold
    
    * Remove keras dependency
    
    * 3dgan integration (#97)
    
    * commiting integration of 3dgan scripts
    
    * ADD: Download dataset
    
    * FIX: DDP distributed training with manual optimization
    
    * ADD: log with MLFlow
    
    * Sqaaas code (#88)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Sqaaas code (#89)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    * ADD: adaptive branch discovery for SQAaaS action
    
    * Update sqaaas.yml
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD: draft predictor and saver
    
    * ADD: stub for inference pipeline
    
    * ADD: small docs
    
    * UPDATE: inference pipeline components
    
    * UPDATE: reorg
    
    * ADD: image generation for inference
    
    * update tag
    
    * ADD: threshold
    
    * ADD: draft inference
    
    * ADD: draft inference wf
    
    * ADD: working inference workflow
    
    * ADD: 3D scatter plots
    
    * ADD: Dockerfile + refactor
    
    * ADD: .dockerignore
    
    * Update .dockerignore
    
    * REMOVE: keras dependency
    
    * ADD: skip download option
    
    * ADD: cern pipeline.yaml
    
    * UPDATE: dataset loading function
    
    * UPDATE: dataset loading function
    
    * UPDATE conf
    
    * UPDATE refactor
    
    * UPDATE refactor
    
    * UPDATE training docs
    
    ---------
    
    Co-authored-by: Kalliopi Tsolaki <[email protected]>
    Co-authored-by: orviz <[email protected]>
    
    * Add SQAaaS dynamic badge for dev branch (#104)
    
    * Add SQAaaS dynamic badge
    
    * Upgrade to sqaaas-assessment-action@v2
    
    * 3dgan integration (#98)
    
    * commiting integration of 3dgan scripts
    
    * ADD: Download dataset
    
    * FIX: DDP distributed training with manual optimization
    
    * ADD: log with MLFlow
    
    * Sqaaas code (#88)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Sqaaas code (#89)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    * ADD: adaptive branch discovery for SQAaaS action
    
    * Update sqaaas.yml
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD: draft predictor and saver
    
    * ADD: stub for inference pipeline
    
    * ADD: small docs
    
    * UPDATE: inference pipeline components
    
    * UPDATE: reorg
    
    * ADD: image generation for inference
    
    * update tag
    
    * ADD: threshold
    
    * ADD: draft inference
    
    * ADD: draft inference wf
    
    * ADD: working inference workflow
    
    * ADD: 3D scatter plots
    
    * ADD: Dockerfile + refactor
    
    * ADD: .dockerignore
    
    * Update .dockerignore
    
    * REMOVE: keras dependency
    
    * ADD: skip download option
    
    * ADD: cern pipeline.yaml
    
    * UPDATE: dataset loading function
    
    * UPDATE: dataset loading function
    
    * UPDATE conf
    
    * UPDATE refactor
    
    * UPDATE refactor
    
    * UPDATE training docs
    
    * Update readme
    
    * update README
    
    * FIX typo
    
    * Update README
    
    * Update mkdir
    
    * UPDATE data paths
    
    * UPDATE Dockerfile
    
    * UPDATE Dockerfiles
    
    * UPDATE for Singularity execution
    
    * FIX version mismatch
    
    * UPDATE Singularity docs
    
    * Named steps pipe (#100)
    
    * ADD: dict steps pipe
    
    * Relax dependency constraint
    
    * UPDATE Singularity exec command
    
    * UPDATE: Image version
    
    * UPDATE: load components from pipeline
    
    * ADD: docs
    
    * Simplify 3DGAN model config
    
    * ADD: mlflow autologging support for PL trainer
    
    * UPDATE container info
    
    * Refactor
    
    * UPDATE dependencies
    
    * FIX linter problem
    
    * Simplified workflow configuration (#108)
    
    * Add SQAaaS dynamic badge for dev branch (#104)
    
    * Add SQAaaS dynamic badge
    
    * Upgrade to sqaaas-assessment-action@v2
    
    * Add draft example
    
    * UPDATE credits field
    
    * ADD docs
    
    * REFACTOR components and pipeline code
    
    * UPDATE docstring
    
    * UPDATE mnist torch uc
    
    * ADD config file parser draft
    
    * ADD itwinaiCLI and ConfigParser
    
    * ADD docs
    
    * ADD pipeline parser and serializer plus tests
    
    * UPDATE docs
    
    * ADD adapter component and tests (incl parser)
    
    * ADD splitter component, improve pipeline, tests
    
    * UPDATE test
    
    * REMOVE todos
    
    * ADD component tests
    
    * ADD serializer tests
    
    * FIX linter
    
    * ADD basic workflow tutorial
    
    * ADD basic intermediate tutorial
    
    * ADD advanced tutorial
    
    * UPDATE advanced tutorial
    
    * UPDATE use cases
    
    * UPDATE save parameters
    
    * FIX linter
    
    * FIX cyclones use case workflow
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Simplified workflow configuration (#109)
    
    * Add SQAaaS dynamic badge for dev branch (#104)
    
    * Add SQAaaS dynamic badge
    
    * Upgrade to sqaaas-assessment-action@v2
    
    * Add draft example
    
    * UPDATE credits field
    
    * ADD docs
    
    * REFACTOR components and pipeline code
    
    * UPDATE docstring
    
    * UPDATE mnist torch uc
    
    * ADD config file parser draft
    
    * ADD itwinaiCLI and ConfigParser
    
    * ADD docs
    
    * ADD pipeline parser and serializer plus tests
    
    * UPDATE docs
    
    * ADD adapter component and tests (incl parser)
    
    * ADD splitter component, improve pipeline, tests
    
    * UPDATE test
    
    * REMOVE todos
    
    * ADD component tests
    
    * ADD serializer tests
    
    * FIX linter
    
    * ADD basic workflow tutorial
    
    * ADD basic intermediate tutorial
    
    * ADD advanced tutorial
    
    * UPDATE advanced tutorial
    
    * UPDATE use cases
    
    * UPDATE save parameters
    
    * FIX linter
    
    * FIX cyclones use case workflow
    
    * ADD slurm jobscript
    
    * FIX merge error
    
    * FIX components template
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD integration tests
    
    * FIX test
    
    * FIX 3dgan inference test
    
    ---------
    
    Co-authored-by: Kalliopi Tsolaki <[email protected]>
    Co-authored-by: orviz <[email protected]>
    
    * fixed distributed trainer in cyclones use case
    
    * 3dgan integration (#118)
    
    * fixed distributed trainer in cyclones use case
    
    * commiting integration of 3dgan scripts
    
    * ADD: Download dataset
    
    * FIX: DDP distributed training with manual optimization
    
    * ADD: log with MLFlow
    
    * Sqaaas code (#88)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Sqaaas code (#89)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    * ADD: adaptive branch discovery for SQAaaS action
    
    * Update sqaaas.yml
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD: draft predictor and saver
    
    * ADD: stub for inference pipeline
    
    * ADD: small docs
    
    * UPDATE: inference pipeline components
    
    * UPDATE: reorg
    
    * ADD: image generation for inference
    
    * update tag
    
    * ADD: threshold
    
    * ADD: draft inference
    
    * ADD: draft inference wf
    
    * ADD: working inference workflow
    
    * ADD: 3D scatter plots
    
    * ADD: Dockerfile + refactor
    
    * ADD: .dockerignore
    
    * Update .dockerignore
    
    * ADD: skip download option
    
    * ADD: cern pipeline.yaml
    
    * UPDATE: dataset loading function
    
    * UPDATE: dataset loading function
    
    * UPDATE conf
    
    * UPDATE refactor
    
    * UPDATE refactor
    
    * UPDATE training docs
    
    * Update readme
    
    * update README
    
    * FIX typo
    
    * Update README
    
    * Update mkdir
    
    * UPDATE data paths
    
    * UPDATE Dockerfile
    
    * UPDATE Dockerfiles
    
    * UPDATE for Singularity execution
    
    * FIX version mismatch
    
    * UPDATE Singularity docs
    
    * Named steps pipe (#100)
    
    * ADD: dict steps pipe
    
    * Relax dependency constraint
    
    * UPDATE Singularity exec command
    
    * UPDATE: Image version
    
    * UPDATE: load components from pipeline
    
    * ADD: docs
    
    * Simplify 3DGAN model config
    
    * ADD: mlflow autologging support for PL trainer
    
    * UPDATE container info
    
    * Refactor
    
    * UPDATE dependencies
    
    * FIX linter problem
    
    * Simplified workflow configuration (#108)
    
    * Add SQAaaS dynamic badge for dev branch (#104)
    
    * Add SQAaaS dynamic badge
    
    * Upgrade to sqaaas-assessment-action@v2
    
    * Add draft example
    
    * UPDATE credits field
    
    * ADD docs
    
    * REFACTOR components and pipeline code
    
    * UPDATE docstring
    
    * UPDATE mnist torch uc
    
    * ADD config file parser draft
    
    * ADD itwinaiCLI and ConfigParser
    
    * ADD docs
    
    * ADD pipeline parser and serializer plus tests
    
    * UPDATE docs
    
    * ADD adapter component and tests (incl parser)
    
    * ADD splitter component, improve pipeline, tests
    
    * UPDATE test
    
    * REMOVE todos
    
    * ADD component tests
    
    * ADD serializer tests
    
    * FIX linter
    
    * ADD basic workflow tutorial
    
    * ADD basic intermediate tutorial
    
    * ADD advanced tutorial
    
    * UPDATE advanced tutorial
    
    * UPDATE use cases
    
    * UPDATE save parameters
    
    * FIX linter
    
    * FIX cyclones use case workflow
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Simplified workflow configuration (#109)
    
    * Add SQAaaS dynamic badge for dev branch (#104)
    
    * Add SQAaaS dynamic badge
    
    * Upgrade to sqaaas-assessment-action@v2
    
    * Add draft example
    
    * UPDATE credits field
    
    * ADD docs
    
    * REFACTOR components and pipeline code
    
    * UPDATE docstring
    
    * UPDATE mnist torch uc
    
    * ADD config file parser draft
    
    * ADD itwinaiCLI and ConfigParser
    
    * ADD docs
    
    * ADD pipeline parser and serializer plus tests
    
    * UPDATE docs
    
    * ADD adapter component and tests (incl parser)
    
    * ADD splitter component, improve pipeline, tests
    
    * UPDATE test
    
    * REMOVE todos
    
    * ADD component tests
    
    * ADD serializer tests
    
    * FIX linter
    
    * ADD basic workflow tutorial
    
    * ADD basic intermediate tutorial
    
    * ADD advanced tutorial
    
    * UPDATE advanced tutorial
    
    * UPDATE use cases
    
    * UPDATE save parameters
    
    * FIX linter
    
    * FIX cyclones use case workflow
    
    * ADD slurm jobscript
    
    * FIX merge error
    
    * FIX components template
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD integration tests
    
    * FIX test
    
    * FIX 3dgan inference test
    
    * ADD GPU support and update tag
    
    * FIX linter
    
    * ADD override example
    
    * UPDATE 3DGAN inference
    
    * UPDATE inference execution tutorials
    
    * UPDATE README
    
    * UPDATE saver saving sparse tensors
    
    * ADD interlink pods
    
    * UPDATE pod name
    
    * UPDATE annotations
    
    * FIX README
    
    * CLEANUP
    
    * Merge
    
    * update
    
    * ADD tf cpu env
    
    * U[date Makefile
    
    * FIX 3DGAN tests
    
    * FIX data folder path
    
    ---------
    
    Co-authored-by: zoechbauer1 <[email protected]>
    Co-authored-by: Kalliopi Tsolaki <[email protected]>
    Co-authored-by: orviz <[email protected]>
    
    * Unit test 4 dev (#113)
    
    * Define a step for pytest execution
    
    * Fix: use v1 of step action
    
    * Print result of step composition
    
    * Rename step
    
    * Use step previous definition in the assessment
    
    * Rename input: workflow -> steps
    
    * Avoid caching by using 1.0.0
    
    * Set container image
    
    * Bump to v1
    
    * Bump to sqaaas-assessment-action@v2
    
    * Remove 'id' property
    
    * Adapt inputs to v2
    
    * Remove current branch
    
    * Disable test_cyclones_train_tf
    
    * ADD marker
    
    * ADD skip memory heavy
    
    * Disable for PRs
    
    ---------
    
    Co-authored-by: Matteo Bunino <[email protected]>
    
    * Distributed strategy launcher (#117)
    
    * ADD: distrib launcher mockup
    
    * REFACTOR: cluster env, strategy and launcher
    
    * ADD: Torch Elastic Launcher
    
    * ADD: info on env vars
    
    * ADD: distributed tooling and examples
    
    * new folder
    
    * UPDATE: distributed strategy setup
    
    * generalized for DDP and DS
    
    * add config file
    
    * UPDATE: kwargs
    
    * Update general_trainer.py
    
    * Update general_startscript
    
    * Update general_trainer.py
    
    * UPDATE .gitignore
    
    * Update distrib strategy
    
    * UPDATE torch distributed strategy classes
    
    * Updated docstrings
    
    * Small fixes
    
    * UPDATE docstrings
    
    * ADD deepespeed config loader
    
    * ADD first deepspeed tutorial draft
    
    * UPDATE DDP Dp distrib strategy
    
    * UPDATE horovod strategy
    
    * UPDATE tutorial on torch distributed strategies
    
    * UPDATE torch strategies tutorial
    
    * Update createEnvJSC.sh
    
    * Update hvd_slurm.sh
    
    * Update README.md
    
    * UPDATE distributed tutorial
    
    * Delete tutorials/distributed-ml/torch-ddp-deepspeed-horovod/0
    
    * Fixes to deepspeed startscript
    
    * Update distributed.py
    
    * Update trainer.py
    
    * UPDATE tutorial
    
    * ADD draft MNIST tutorial
    
    * UPDATE DDP tutorial for MNIST
    
    * FIX small details
    
    * Update distributed.py
    
    * Added TF tutorials
    
    * Fixes to tutorials
    
    * Add files via upload
    
    * Update Makefile
    
    * Update README.md
    
    * UPDATE tutorials
    
    * UPDATE documentation and improve explainability
    
    * UPDATE SLURM scripts
    
    * FIX local rank mismatch
    
    * fixed distributed trainer in cyclones use case
    
    * UPDATE launcher
    
    * UPDATE linter
    
    * UPDATE format
    
    * FIX linter
    
    * FIX linter
    
    * Update workflow
    
    * UPDATE workflow
    
    * update
    
    * Update workflow
    
    * UPDATE super linter to v6
    
    * UPDATE super linter to v6.3.0
    
    * UPDATE super linter to slim
    
    * Cleanup
    
    * Update tfmirrored_slurm.sh
    
    * Update tfmirrored_slurm.sh
    
    * REMOVE workflows legacy
    
    * DELETE cyclegan use case
    
    * UPDATE dist training tutorials torch
    
    * RENAME folders with torch
    
    * DRAFT torch imagenet tutorial
    
    * UPDATE configuration
    
    * UPDATE imagenet tutorial
    
    * DRAFT scaling test
    
    * ADD scaling analysis report
    
    * FIX deepspeed micro batchsize
    
    * UPDATE data path
    
    * UPDATE checkpoint to avoid race conditions
    
    * UPDATE scalability report
    
    * UPDATE dataset path
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSCTF.sh
    
    * Update README.md
    
    * Update README.md
    
    * JUBE benchmarks
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSCTF.sh
    
    * ADD logy scale option
    
    * Extract JUBE tutorial
    
    * CLEANUP baselines
    
    * Log epoch time in real-time
    
    * FIX deepspeed dataloader for potential performances improvement
    
    * UPDATE SC bash severity
    
    * FIX deepspeed and horovod trainers
    
    * FIX some code checks
    
    * Unify redundant SLURM job scripts and configuration files
    
    * CLEANUP unused configuration
    
    * Reorg configurations
    
    * Refactor configurations and add documentation
    
    * Update README
    
    * ADD report image
    
    * Improve plot resolution
    
    * UPDATE scaling test
    
    * UPDATE  launcher scripts
    
    * FIX linter
    
    * REMOVE jube tutorial
    
    ---------
    
    Co-authored-by: Mario Rüttgers <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: zoechbauer1 <[email protected]>
    
    * Distributed strategy launcher (#127)
    
    Update ParseConfig
    
    * Distributed strategy launcher (#128)
    
    Remove experimental files
    
    * Docs dev (#132)
    
    * commiting docs functionality for testing deployment
    
    * adding documentation deployment relevant files
    
    * updating readthedocs.yaml
    
    * changing directory of requirements.txt
    
    * updating reqs file
    
    * commiting changes and adding pages for tutorials
    
    * fixed distributed trainer in cyclones use case
    
    * adding installation instructions in docs
    
    * adding latest changes to docs
    
    * adding new pages for itwinai modules and other modifications
    
    * modified src/itwinai/torch directory name to solve namespace conflict
    
    * fixing tutorial sections
    
    * fixes in pages appearance
    
    * fixing rendering bugs
    
    * fixing pages appearance bugs
    
    * adding latest modifications
    
    * Deleted duplicate folder after renaming src/itwinai/torch
    
    * adding documentation.yml file for automatic updating on github pages
    
    * modifying documentation.yml file
    
    * updating reqs file to solve bug in deployment
    
    * commiting docs functionality for testing deployment
    
    * adding documentation deployment relevant files
    
    * updating readthedocs.yaml
    
    * changing directory of requirements.txt
    
    * updating reqs file
    
    * commiting changes and adding pages for tutorials
    
    * adding installation instructions in docs
    
    * adding latest changes to docs
    
    * adding new pages for itwinai modules and other modifications
    
    * modified src/itwinai/torch directory name to solve namespace conflict
    
    * fixing tutorial sections
    
    * fixes in pages appearance
    
    * fixing rendering bugs
    
    * fixing pages appearance bugs
    
    * adding latest modifications
    
    * Deleted duplicate folder after renaming src/itwinai/torch
    
    * adding documentation.yml file for automatic updating on github pages
    
    * modifying documentation.yml file
    
    * updating reqs file to solve bug in deployment
    
    * testing automated docs update
    
    * updating getting started page
    
    * fixing pages and adding new content
    
    * bug fixes
    
    * fixing content rendering
    
    * latest fixes in rendering
    
    * Add version feature to docs
    
    * Update .readthedocs.yaml
    
    * fixing display structure in getting started page
    
    * new fixes similar to previous commit
    
    * Update index.rst
    
    * Update index.rst
    
    Text re-edit index
    
    * Update index.rst
    
    change 1 word
    
    * Update .readthedocs.yaml
    
    * Update .readthedocs.yaml
    
    * fixing getting started page
    
    * Text review getting_started_with_itwinai.rst
    
    * Update 3dgan_doc.rst
    
    * Update getting_started_with_itwinai.rst
    
    punctuation
    
    * Fix torch naming problem
    
    ---------
    
    Co-authored-by: KalliopiTsolaki <[email protected]>
    Co-authored-by: zoechbauer1 <[email protected]>
    Co-authored-by: VerderK <[email protected]>
    
    * Distributed strategy launcher (#131)
    
    * ADD: distrib launcher mockup
    
    * REFACTOR: cluster env, strategy and launcher
    
    * ADD: Torch Elastic Launcher
    
    * ADD: info on env vars
    
    * ADD: distributed tooling and examples
    
    * new folder
    
    * UPDATE: distributed strategy setup
    
    * generalized for DDP and DS
    
    * add config file
    
    * UPDATE: kwargs
    
    * Update general_trainer.py
    
    * Update general_startscript
    
    * Update general_trainer.py
    
    * UPDATE .gitignore
    
    * Update distrib strategy
    
    * UPDATE torch distributed strategy classes
    
    * Updated docstrings
    
    * Small fixes
    
    * UPDATE docstrings
    
    * ADD deepespeed config loader
    
    * ADD first deepspeed tutorial draft
    
    * UPDATE DDP Dp distrib strategy
    
    * UPDATE horovod strategy
    
    * UPDATE tutorial on torch distributed strategies
    
    * UPDATE torch strategies tutorial
    
    * Update createEnvJSC.sh
    
    * Update hvd_slurm.sh
    
    * Update README.md
    
    * UPDATE distributed tutorial
    
    * Delete tutorials/distributed-ml/torch-ddp-deepspeed-horovod/0
    
    * Fixes to deepspeed startscript
    
    * Update distributed.py
    
    * Update trainer.py
    
    * UPDATE tutorial
    
    * ADD draft MNIST tutorial
    
    * UPDATE DDP tutorial for MNIST
    
    * FIX small details
    
    * Update distributed.py
    
    * Added TF tutorials
    
    * Fixes to tutorials
    
    * Add files via upload
    
    * Update Makefile
    
    * Update README.md
    
    * UPDATE tutorials
    
    * UPDATE documentation and improve explainability
    
    * UPDATE SLURM scripts
    
    * FIX local rank mismatch
    
    * fixed distributed trainer in cyclones use case
    
    * UPDATE launcher
    
    * UPDATE linter
    
    * UPDATE format
    
    * FIX linter
    
    * FIX linter
    
    * Update workflow
    
    * UPDATE workflow
    
    * update
    
    * Update workflow
    
    * UPDATE super linter to v6
    
    * UPDATE super linter to v6.3.0
    
    * UPDATE super linter to slim
    
    * Cleanup
    
    * Update tfmirrored_slurm.sh
    
    * Update tfmirrored_slurm.sh
    
    * REMOVE workflows legacy
    
    * DELETE cyclegan use case
    
    * UPDATE dist training tutorials torch
    
    * RENAME folders with torch
    
    * DRAFT torch imagenet tutorial
    
    * UPDATE configuration
    
    * UPDATE imagenet tutorial
    
    * DRAFT scaling test
    
    * ADD scaling analysis report
    
    * FIX deepspeed micro batchsize
    
    * UPDATE data path
    
    * UPDATE checkpoint to avoid race conditions
    
    * UPDATE scalability report
    
    * UPDATE dataset path
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSCTF.sh
    
    * Update README.md
    
    * Update README.md
    
    * JUBE benchmarks
    
    * Update createEnvJSC.sh
    
    * Update createEnvJSCTF.sh
    
    * ADD logy scale option
    
    * Extract JUBE tutorial
    
    * CLEANUP baselines
    
    * Log epoch time in real-time
    
    * FIX deepspeed dataloader for potential performances improvement
    
    * UPDATE SC bash severity
    
    * FIX deepspeed and horovod trainers
    
    * FIX some code checks
    
    * Unify redundant SLURM job scripts and configuration files
    
    * CLEANUP unused configuration
    
    * Reorg configurations
    
    * Refactor configurations and add documentation
    
    * Update README
    
    * ADD report image
    
    * Improve plot resolution
    
    * UPDATE scaling test
    
    * UPDATE  launcher scripts
    
    * FIX linter
    
    * REMOVE jube tutorial
    
    * Restore ConfigParser
    
    * FIX type hinting
    
    * ADD dev dependencies
    
    * REMOVE experimental scripts
    
    * UPDATE scaling report
    
    * Add SLURM logs
    
    * Refactor log scale
    
    * Update scalability report
    
    * Unify SLURM logs per job
    
    * Update README.md
    
    * Update README.md
    
    * Update README.md
    
    * ADD itwinai installation
    
    * UPDATE torch distributed tutorial 0
    
    * UPDATE torch distributed tutorials
    
    * REMOVE imagenet tutorial
    
    * ADD NonDistributedStrategy and create_dataloader method
    
    * CLEANUP older classes
    
    * Rename strategies
    
    * Simplify structure
    
    * ADD draft new torch trainer class
    
    * UPDATED torch trainer draft
    
    * UPDATE MNIST use case
    
    * INtegrate new trainer into MNIST use case
    
    * UPDATE structure: remove unused files and refactor tests
    
    * Tmp disable unused tests
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * Update action
    
    * FIX failing inference
    
    * Functiona tests (#133)
    
    * UPDATE tests
    
    * FIX errors
    
    * CLEANUP
    
    * Remove unused workflow
    
    ---------
    
    Co-authored-by: Mario Rüttgers <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: zoechbauer1 <[email protected]>
    
    * 3dgan integration (#134)
    
    * fixed distributed trainer in cyclones use case
    
    * commiting integration of 3dgan scripts
    
    * ADD: Download dataset
    
    * FIX: DDP distributed training with manual optimization
    
    * ADD: log with MLFlow
    
    * Sqaaas code (#88)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Sqaaas code (#89)
    
    * Create sqaaas.yml
    
    * Update sqaaas.yml
    
    * Update sqaaas.yml
    
    * Point to the current repo
    
    * Remove unnecessary checkout step
    
    * Rename step
    
    * ADD: adaptive branch discovery for SQAaaS action
    
    * Update sqaaas.yml
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD: draft predictor and saver
    
    * ADD: stub for inference pipeline
    
    * ADD: small docs
    
    * UPDATE: inference pipeline components
    
    * UPDATE: reorg
    
    * ADD: image generation for inference
    
    * update tag
    
    * ADD: threshold
    
    * ADD: draft inference
    
    * ADD: draft inference wf
    
    * ADD: working inference workflow
    
    * ADD: 3D scatter plots
    
    * ADD: Dockerfile + refactor
    
    * ADD: .dockerignore
    
    * Update .dockerignore
    
    * ADD: skip download option
    
    * ADD: cern pipeline.yaml
    
    * UPDATE: dataset loading function
    
    * UPDATE: dataset loading function
    
    * UPDATE conf
    
    * UPDATE refactor
    
    * UPDATE refactor
    
    * UPDATE training docs
    
    * Update readme
    
    * update README
    
    * FIX typo
    
    * Update README
    
    * Update mkdir
    
    * UPDATE data paths
    
    * UPDATE Dockerfile
    
    * UPDATE Dockerfiles
    
    * UPDATE for Singularity execution
    
    * FIX version mismatch
    
    * UPDATE Singularity docs
    
    * Named steps pipe (#100)
    
    * ADD: dict steps pipe
    
    * Relax dependency constraint
    
    * UPDATE Singularity exec command
    
    * UPDATE: Image version
    
    * UPDATE: load components from pipeline
    
    * ADD: docs
    
    * Simplify 3DGAN model config
    
    * ADD: mlflow autologging support for PL trainer
    
    * UPDATE container info
    
    * Refactor
    
    * UPDATE dependencies
    
    * FIX linter problem
    
    * Simplified workflow configuration (#108)
    
    * Add SQAaaS dynamic badge for dev branch (#104)
    
    * Add SQAaaS dynamic badge
    
    * Upgrade to sqaaas-assessment-action@v2
    
    * Add draft example
    
    * UPDATE credits field
    
    * ADD docs
    
    * REFACTOR components and pipeline code
    
    * UPDATE docstring
    
    * UPDATE mnist torch uc
    
    * ADD config file parser draft
    
    * ADD itwinaiCLI and ConfigParser
    
    * ADD docs
    
    * ADD pipeline parser and serializer plus tests
    
    * UPDATE docs
    
    * ADD adapter component and tests (incl parser)
    
    * ADD splitter component, improve pipeline, tests
    
    * UPDATE test
    
    * REMOVE todos
    
    * ADD component tests
    
    * ADD serializer tests
    
    * FIX linter
    
    * ADD basic workflow tutorial
    
    * ADD basic intermediate tutorial
    
    * ADD advanced tutorial
    
    * UPDATE advanced tutorial
    
    * UPDATE use cases
    
    * UPDATE save parameters
    
    * FIX linter
    
    * FIX cyclones use case workflow
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * Simplified workflow configuration (#109)
    
    * Add SQAaaS dynamic badge for dev branch (#104)
    
    * Add SQAaaS dynamic badge
    
    * Upgrade to sqaaas-assessment-action@v2
    
    * Add draft example
    
    * UPDATE credits field
    
    * ADD docs
    
    * REFACTOR components and pipeline code
    
    * UPDATE docstring
    
    * UPDATE mnist torch uc
    
    * ADD config file parser draft
    
    * ADD itwinaiCLI and ConfigParser
    
    * ADD docs
    
    * ADD pipeline parser and serializer plus tests
    
    * UPDATE docs
    
    * ADD adapter component and tests (incl parser)
    
    * ADD splitter component, improve pipeline, tests
    
    * UPDATE test
    
    * REMOVE todos
    
    * ADD component tests
    
    * ADD serializer tests
    
    * FIX linter
    
    * ADD basic workflow tutorial
    
    * ADD basic intermediate tutorial
    
    * ADD advanced tutorial
    
    * UPDATE advanced tutorial
    
    * UPDATE use cases
    
    * UPDATE save parameters
    
    * FIX linter
    
    * FIX cyclones use case workflow
    
    * ADD slurm jobscript
    
    * FIX merge error
    
    * FIX components template
    
    ---------
    
    Co-authored-by: orviz <[email protected]>
    
    * ADD integration tests
    
    * FIX test
    
    * FIX 3dgan inference test
    
    * ADD GPU support and update tag
    
    * FIX linter
    
    * ADD override example
    
    * UPDATE 3DGAN inference
    
    * UPDATE inference execution tutorials
    
    * UPDATE README
    
    * UPDATE saver saving sparse tensors
    
    * ADD interlink pods
    
    * UPDATE pod name
    
    * UPDATE annotations
    
    * FIX README
    
    * CLEANUP
    
    * Merge
    
    * update
    
    * ADD tf cpu env
    
    * U[date Makefile
    
    * FIX 3DGAN tests
    
    * FIX data folder path
    
    * ADD offloading of 3DGAN training
    
    * ADAPT 3DGAN training for singularity execution
    
    * UPDATE test and fix linter
    
    ---------
    
    Co-authored-by: zoechbauer1 <[email protected]>
    Co-authored-by: Kalliopi Tsolaki <[email protected]>
    Co-authored-by: orviz <[email protected]>
    
    * Move to python venv
    
    * Update Makefile
    
    * Add Horovod installation
    
    * Update env
    
    * FIX openmpi install
    
    * Add TF explicit version
    
    * UPDATE env creation
    
    * REMOVE constraint on torch 2.0.*
    
    * UPDATE installation
    
    * FIX test
    
    * REMOVE strict dependency on micromamba
    
    * FIX docs and debugging states
    
    * FIX cpu only installation
    
    * FIX deepspeed cpu installation
    
    * FIX tf env creation
    
    * FIX makefile
    
    * ADD torch and tensorflow Docker containers
    
    * Working DDP
    
    * REFACTOR torch container build scripts
    
    * FIX MPI env var set
    
    * Incomplete containers
    
    * UPDATE Dockerfiles
    
    * REFACTOR Dockerfiles
    
    * Rename
    
    * UPDATE containers files and tutorial
    
    * CLEANUP old doc pages
    
    * ADD containers tutorials
    
    * ADD containers tutorials
    
    * UPDATE deps
    
    * UPDATE deps
    
    * UPDATE deps
    
    * UPDATE docs and tutorials
    
    * CLEANUP duplicates
    
    * Update tests and scripts
    
    * ADD labels
    
    * CLEANUP
    
    * Add docs and fix deepspeed launcher
    
    * UPDATE linter settings
    
    * FIX slow unit test on 3DGAN train
    
    * ADD 3dgan sample dataset
    
    ---------
    
    Co-authored-by: Roman Machacek <[email protected]>
    Co-authored-by: linxUser3574 <[email protected]>
    Co-authored-by: orviz <[email protected]>
    Co-authored-by: Kalliopi Tsolaki <[email protected]>
    Co-authored-by: zoechbauer1 <[email protected]>
    Co-authored-by: Mario Rüttgers <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: r-sarma <[email protected]>
    Co-authored-by: KalliopiTsolaki <[email protected]>
    Co-authored-by: VerderK <[email protected]>
    11 people authored and jarlsondre committed Sep 18, 2024
    Configuration menu
    Copy the full SHA
    7266008 View commit details
    Browse the repository at this point in the history
  13. Update config.yaml

    matbun authored and jarlsondre committed Sep 18, 2024
    Configuration menu
    Copy the full SHA
    d104eff View commit details
    Browse the repository at this point in the history
  14. Update run_docker.sh

    matbun authored and jarlsondre committed Sep 18, 2024
    Configuration menu
    Copy the full SHA
    9bbf974 View commit details
    Browse the repository at this point in the history

Commits on Sep 19, 2024

  1. Configuration menu
    Copy the full SHA
    d8506b7 View commit details
    Browse the repository at this point in the history
  2. fixing linting errors

    jarlsondre committed Sep 19, 2024
    Configuration menu
    Copy the full SHA
    43b5069 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    3d58822 View commit details
    Browse the repository at this point in the history
  4. run isort on eurac files

    jarlsondre committed Sep 19, 2024
    Configuration menu
    Copy the full SHA
    8399e54 View commit details
    Browse the repository at this point in the history
  5. Update distributed.py

    matbun authored Sep 19, 2024
    Configuration menu
    Copy the full SHA
    47826b9 View commit details
    Browse the repository at this point in the history
  6. Update trainer.py

    matbun authored Sep 19, 2024
    Configuration menu
    Copy the full SHA
    3237dc2 View commit details
    Browse the repository at this point in the history
  7. Update trainer.py

    matbun authored Sep 19, 2024
    Configuration menu
    Copy the full SHA
    9ee32d8 View commit details
    Browse the repository at this point in the history

Commits on Sep 20, 2024

  1. Configuration menu
    Copy the full SHA
    a4ead7c View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    94138ec View commit details
    Browse the repository at this point in the history

Commits on Sep 23, 2024

  1. Configuration menu
    Copy the full SHA
    6cdb0f6 View commit details
    Browse the repository at this point in the history
  2. Fixed deepspeed launcher for scaling test, added option to set checkp…

    …oint frequency in NoiseGeneratorTrainer
    annaelisalappe committed Sep 23, 2024
    Configuration menu
    Copy the full SHA
    acef68f View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    5b54c81 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    2a39041 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    5b0137d View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    ad25942 View commit details
    Browse the repository at this point in the history

Commits on Sep 24, 2024

  1. config

    iacopoff committed Sep 24, 2024
    Configuration menu
    Copy the full SHA
    cbe7261 View commit details
    Browse the repository at this point in the history
  2. update mse metric

    iacopoff committed Sep 24, 2024
    Configuration menu
    Copy the full SHA
    926cc41 View commit details
    Browse the repository at this point in the history

Commits on Sep 25, 2024

  1. Configuration menu
    Copy the full SHA
    d293675 View commit details
    Browse the repository at this point in the history
  2. small cleanup

    jarlsondre committed Sep 25, 2024
    Configuration menu
    Copy the full SHA
    01d42e8 View commit details
    Browse the repository at this point in the history
  3. match newest config file

    jarlsondre committed Sep 25, 2024
    Configuration menu
    Copy the full SHA
    9f63b42 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    b7f97f3 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    13e4b20 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    73644a0 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    2ec41dd View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    8b7cd2c View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    e9e28e5 View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    9007447 View commit details
    Browse the repository at this point in the history

Commits on Sep 26, 2024

  1. Configuration menu
    Copy the full SHA
    d355489 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    c04f1f3 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    1f50030 View commit details
    Browse the repository at this point in the history
  4. run isort

    jarlsondre committed Sep 26, 2024
    Configuration menu
    Copy the full SHA
    3cda5ba View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    dd46530 View commit details
    Browse the repository at this point in the history
  6. fix linting errors

    jarlsondre committed Sep 26, 2024
    Configuration menu
    Copy the full SHA
    2369641 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    4a42a23 View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    352977a View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    44a7966 View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    5e3fa31 View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    956b516 View commit details
    Browse the repository at this point in the history
  12. logging model

    iacopoff committed Sep 26, 2024
    Configuration menu
    Copy the full SHA
    dbec263 View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    36e4b65 View commit details
    Browse the repository at this point in the history
  14. Unused imports

    annaelisalappe committed Sep 26, 2024
    Configuration menu
    Copy the full SHA
    4a866c2 View commit details
    Browse the repository at this point in the history
  15. Configuration menu
    Copy the full SHA
    3d69e1c View commit details
    Browse the repository at this point in the history
  16. Configuration menu
    Copy the full SHA
    a174c78 View commit details
    Browse the repository at this point in the history
  17. Configuration menu
    Copy the full SHA
    f77b1b7 View commit details
    Browse the repository at this point in the history
  18. Configuration menu
    Copy the full SHA
    be0fa8a View commit details
    Browse the repository at this point in the history

Commits on Sep 27, 2024

  1. Configuration menu
    Copy the full SHA
    5eb23ce View commit details
    Browse the repository at this point in the history
  2. Added override for loggers field, so that the config.yaml does not ha…

    …ve to be changed for hpo to work
    annaelisalappe committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    8dce5f2 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    cdc1222 View commit details
    Browse the repository at this point in the history
  4. isort

    annaelisalappe committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    6f7a6fd View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    ee48fbf View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    e19802c View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    3458640 View commit details
    Browse the repository at this point in the history

Commits on Oct 1, 2024

  1. Configuration menu
    Copy the full SHA
    ca6b504 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    f97eccd View commit details
    Browse the repository at this point in the history

Commits on Oct 2, 2024

  1. Configuration menu
    Copy the full SHA
    899f5d0 View commit details
    Browse the repository at this point in the history

Commits on Oct 5, 2024

  1. Configuration menu
    Copy the full SHA
    7c8d88a View commit details
    Browse the repository at this point in the history

Commits on Oct 8, 2024

  1. Configuration menu
    Copy the full SHA
    6715247 View commit details
    Browse the repository at this point in the history

Commits on Oct 9, 2024

  1. update trainer imports

    iacopoff committed Oct 9, 2024
    Configuration menu
    Copy the full SHA
    041c71e View commit details
    Browse the repository at this point in the history
  2. update config

    iacopoff committed Oct 9, 2024
    Configuration menu
    Copy the full SHA
    a849c91 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    87572c9 View commit details
    Browse the repository at this point in the history
  4. add comm vs comp analysis

    jarlsondre committed Oct 9, 2024
    Configuration menu
    Copy the full SHA
    eb13482 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    7dadcb0 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    9de6abe View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    c27367d View commit details
    Browse the repository at this point in the history
  8. Merge branch 'scalability-report-update' of github.com:interTwin-eu/i…

    …twinai into scalability-report-update
    jarlsondre committed Oct 9, 2024
    Configuration menu
    Copy the full SHA
    e2c728a View commit details
    Browse the repository at this point in the history

Commits on Oct 10, 2024

  1. Configuration menu
    Copy the full SHA
    f5f1495 View commit details
    Browse the repository at this point in the history
  2. update jupyter notebook

    jarlsondre committed Oct 10, 2024
    Configuration menu
    Copy the full SHA
    eed58c2 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    d95a62e View commit details
    Browse the repository at this point in the history
  4. Do data analysis

    jarlsondre committed Oct 10, 2024
    Configuration menu
    Copy the full SHA
    2816ee0 View commit details
    Browse the repository at this point in the history
  5. add docstrings etc.

    jarlsondre committed Oct 10, 2024
    Configuration menu
    Copy the full SHA
    383a4d4 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    2c2aaaf View commit details
    Browse the repository at this point in the history
  7. run isort on folder

    jarlsondre committed Oct 10, 2024
    Configuration menu
    Copy the full SHA
    4c450a8 View commit details
    Browse the repository at this point in the history
  8. remove unused file

    jarlsondre committed Oct 10, 2024
    Configuration menu
    Copy the full SHA
    21f8002 View commit details
    Browse the repository at this point in the history

Commits on Oct 11, 2024

  1. Configuration menu
    Copy the full SHA
    c5a1296 View commit details
    Browse the repository at this point in the history
  2. Update README.md

    jarlsondre authored Oct 11, 2024
    Configuration menu
    Copy the full SHA
    295e3bf View commit details
    Browse the repository at this point in the history
  3. Update README.md

    jarlsondre authored Oct 11, 2024
    Configuration menu
    Copy the full SHA
    b9ddad4 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    1a9a12a View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    368552f View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    6daa42a View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    82fc9db View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    d0a3dbe View commit details
    Browse the repository at this point in the history
  9. code cleanup for PR

    jarlsondre committed Oct 11, 2024
    Configuration menu
    Copy the full SHA
    eb527b0 View commit details
    Browse the repository at this point in the history
  10. merge files

    jarlsondre committed Oct 11, 2024
    Configuration menu
    Copy the full SHA
    ff94b29 View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    680c225 View commit details
    Browse the repository at this point in the history
  12. update slurm script

    jarlsondre committed Oct 11, 2024
    Configuration menu
    Copy the full SHA
    2f4f95a View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    e32c4ae View commit details
    Browse the repository at this point in the history
  14. Configuration menu
    Copy the full SHA
    24b0196 View commit details
    Browse the repository at this point in the history
  15. Configuration menu
    Copy the full SHA
    fba749e View commit details
    Browse the repository at this point in the history
  16. Configuration menu
    Copy the full SHA
    430164d View commit details
    Browse the repository at this point in the history
  17. Spelling errors

    annaelisalappe committed Oct 11, 2024
    Configuration menu
    Copy the full SHA
    cc0deed View commit details
    Browse the repository at this point in the history
  18. Isort

    annaelisalappe committed Oct 11, 2024
    Configuration menu
    Copy the full SHA
    2b90e4e View commit details
    Browse the repository at this point in the history

Commits on Oct 14, 2024

  1. Update file_gen.py

    matbun authored Oct 14, 2024
    Configuration menu
    Copy the full SHA
    4850926 View commit details
    Browse the repository at this point in the history
  2. Update file_gen.py

    matbun authored Oct 14, 2024
    Configuration menu
    Copy the full SHA
    3f03844 View commit details
    Browse the repository at this point in the history
  3. Update README.md

    matbun authored Oct 14, 2024
    Configuration menu
    Copy the full SHA
    3e40dbe View commit details
    Browse the repository at this point in the history
  4. Update README.md

    matbun authored Oct 14, 2024
    Configuration menu
    Copy the full SHA
    16efe26 View commit details
    Browse the repository at this point in the history
  5. update readme

    jarlsondre committed Oct 14, 2024
    Configuration menu
    Copy the full SHA
    771ba43 View commit details
    Browse the repository at this point in the history
  6. remove run.sh

    jarlsondre committed Oct 14, 2024
    Configuration menu
    Copy the full SHA
    325399f View commit details
    Browse the repository at this point in the history
  7. fix typo in readme

    jarlsondre committed Oct 14, 2024
    Configuration menu
    Copy the full SHA
    f397976 View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    444325c View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    129afb3 View commit details
    Browse the repository at this point in the history

Commits on Oct 15, 2024

  1. Configuration menu
    Copy the full SHA
    bb66604 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    500f6fa View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    6dcdafd View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    6edbd4c View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    f57f7ed View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    7c8107b View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    a6b5c3d View commit details
    Browse the repository at this point in the history
  8. add comm vs comp analysis

    jarlsondre committed Oct 15, 2024
    Configuration menu
    Copy the full SHA
    88aa13d View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    15b0ca5 View commit details
    Browse the repository at this point in the history
  10. Update README.md

    matbun authored and jarlsondre committed Oct 15, 2024
    Configuration menu
    Copy the full SHA
    ab758f6 View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    372b0f8 View commit details
    Browse the repository at this point in the history
  12. Configuration menu
    Copy the full SHA
    4496ab3 View commit details
    Browse the repository at this point in the history
  13. update jupyter notebook

    jarlsondre committed Oct 15, 2024
    Configuration menu
    Copy the full SHA
    691bd11 View commit details
    Browse the repository at this point in the history
  14. Configuration menu
    Copy the full SHA
    7d92902 View commit details
    Browse the repository at this point in the history
  15. Do data analysis

    jarlsondre committed Oct 15, 2024
    Configuration menu
    Copy the full SHA
    4e6e6f4 View commit details
    Browse the repository at this point in the history
  16. add docstrings etc.

    jarlsondre committed Oct 15, 2024
    Configuration menu
    Copy the full SHA
    e508fae View commit details
    Browse the repository at this point in the history
  17. update slurm script

    jarlsondre committed Oct 15, 2024
    Configuration menu
    Copy the full SHA
    b9b8307 View commit details
    Browse the repository at this point in the history
  18. Configuration menu
    Copy the full SHA
    8e747bc View commit details
    Browse the repository at this point in the history
  19. Configuration menu
    Copy the full SHA
    08d5b06 View commit details
    Browse the repository at this point in the history
  20. Merge branch 'scalability-report-update' of github.com:interTwin-eu/i…

    …twinai into scalability-report-update
    jarlsondre committed Oct 15, 2024
    Configuration menu
    Copy the full SHA
    a4441a5 View commit details
    Browse the repository at this point in the history
  21. add scheduler to profiler

    jarlsondre committed Oct 15, 2024
    Configuration menu
    Copy the full SHA
    7d2fc84 View commit details
    Browse the repository at this point in the history
  22. merge

    jarlsondre committed Oct 15, 2024
    Configuration menu
    Copy the full SHA
    a2a739a View commit details
    Browse the repository at this point in the history

Commits on Oct 16, 2024

  1. Configuration menu
    Copy the full SHA
    896193e View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    3ed873b View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    944f4b1 View commit details
    Browse the repository at this point in the history
  4. format code

    jarlsondre committed Oct 16, 2024
    Configuration menu
    Copy the full SHA
    afd155b View commit details
    Browse the repository at this point in the history
  5. fix linting errors

    jarlsondre committed Oct 16, 2024
    Configuration menu
    Copy the full SHA
    e639cd4 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    b341d97 View commit details
    Browse the repository at this point in the history
  7. update docstrings

    jarlsondre committed Oct 16, 2024
    Configuration menu
    Copy the full SHA
    f022d59 View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    e38d555 View commit details
    Browse the repository at this point in the history
  9. move profiler to own file

    jarlsondre committed Oct 16, 2024
    Configuration menu
    Copy the full SHA
    ceb17d3 View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    f1e36b1 View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    c35eafb View commit details
    Browse the repository at this point in the history
  12. fix linting errors

    jarlsondre committed Oct 16, 2024
    Configuration menu
    Copy the full SHA
    b12c2f9 View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    3a76b66 View commit details
    Browse the repository at this point in the history