Connect AWS/P5 (H100) instances to CI #3490
Triggered via pull request
October 31, 2024 13:47
Status: Startup failure
Total duration: –
Artifacts: –
ci.yaml
on: pull_request
metadata
Matrix: amd64 / test-distribution
Waiting for pending jobs
Matrix: arm64 / test-distribution
Waiting for pending jobs
Matrix: amd64 / test-jax / run-unit-test
Waiting for pending jobs
Matrix: amd64 / test-nsys-jax / run-unit-test
Waiting for pending jobs
Matrix: amd64 / test-pallas / run-unit-test
Waiting for pending jobs
amd64 / ... / launch-slurm-runner
amd64 / ... / launch-slurm-runner
amd64 / ... / launch-slurm-runner
Matrix: arm64 / test-jax / run-unit-test
Waiting for pending jobs
Matrix: arm64 / test-nsys-jax / run-unit-test
Waiting for pending jobs
Matrix: arm64 / test-pallas / run-unit-test
Waiting for pending jobs
arm64 / ... / launch-slurm-runner
arm64 / ... / launch-slurm-runner
arm64 / ... / launch-slurm-runner
Matrix: amd64 / test-te / run-unit-test
Waiting for pending jobs
Matrix: amd64 / test-upstream-pax / pax-multi-node
Waiting for pending jobs
Matrix: amd64 / test-upstream-pax / single-process-evaluation
Waiting for pending jobs
Matrix: amd64 / test-upstream-pax / single-process-multi-device
Waiting for pending jobs
Matrix: amd64 / test-te-multigpu / te-multi-gpu
Waiting for pending jobs
Matrix: amd64 / test-upstream-t5x / t5x-multi-gpu
Waiting for pending jobs
Matrix: amd64 / test-upstream-t5x / t5x-multi-node
Waiting for pending jobs
Matrix: amd64 / test-gemma / run-unit-test
Waiting for pending jobs
Matrix: amd64 / test-levanter / run-unit-test
Waiting for pending jobs
Matrix: amd64 / test-maxtext / maxtext-multinode
Waiting for pending jobs
Matrix: amd64 / test-maxtext / single-process-multi-device
Waiting for pending jobs
Matrix: amd64 / test-triton / run-unit-test
Waiting for pending jobs
Matrix: arm64 / test-te / run-unit-test
Waiting for pending jobs
Matrix: arm64 / test-upstream-pax / pax-multi-node
Waiting for pending jobs
Matrix: arm64 / test-upstream-pax / single-process-evaluation
Waiting for pending jobs
Matrix: arm64 / test-upstream-pax / single-process-multi-device
Waiting for pending jobs
Matrix: arm64 / test-te-multigpu / te-multi-gpu
Waiting for pending jobs
Matrix: arm64 / test-upstream-t5x / t5x-multi-gpu
Waiting for pending jobs
Matrix: arm64 / test-upstream-t5x / t5x-multi-node
Waiting for pending jobs
Matrix: arm64 / test-gemma / run-unit-test
Waiting for pending jobs
Matrix: arm64 / test-levanter / run-unit-test
Waiting for pending jobs
Matrix: arm64 / test-maxtext / maxtext-multinode
Waiting for pending jobs
Matrix: arm64 / test-maxtext / single-process-multi-device
Waiting for pending jobs
Matrix: arm64 / test-triton / run-unit-test
Waiting for pending jobs
Matrix: amd64 / test-rosetta-pax / rosetta-pax-multi-node-te
Waiting for pending jobs
Matrix: amd64 / test-rosetta-pax / rosetta-pax-multi-node
Waiting for pending jobs
Matrix: amd64 / test-rosetta-pax / rosetta-pax-single-node-dropout-te
Waiting for pending jobs
Matrix: amd64 / test-rosetta-pax / single-process-evaluation-te
Waiting for pending jobs
Matrix: amd64 / test-rosetta-pax / single-process-multi-device-te
Waiting for pending jobs
Matrix: amd64 / test-rosetta-t5x / multi-gpu-multi-node
Waiting for pending jobs
Matrix: amd64 / test-rosetta-t5x / single-process-multi-device
Waiting for pending jobs
Matrix: amd64 / test-rosetta-t5x / vit-multi-gpu-multi-node
Waiting for pending jobs
Matrix: amd64 / test-rosetta-t5x / vit-single-process-multi-device
Waiting for pending jobs
Matrix: arm64 / test-rosetta-pax / rosetta-pax-multi-node-te
Waiting for pending jobs
Matrix: arm64 / test-rosetta-pax / rosetta-pax-multi-node
Waiting for pending jobs
Matrix: arm64 / test-rosetta-pax / rosetta-pax-single-node-dropout-te
Waiting for pending jobs
Matrix: arm64 / test-rosetta-pax / single-process-evaluation-te
Waiting for pending jobs
Matrix: arm64 / test-rosetta-pax / single-process-multi-device-te
Waiting for pending jobs
Matrix: arm64 / test-rosetta-t5x / multi-gpu-multi-node
Waiting for pending jobs
Matrix: arm64 / test-rosetta-t5x / single-process-multi-device
Waiting for pending jobs
Matrix: arm64 / test-rosetta-t5x / vit-multi-gpu-multi-node
Waiting for pending jobs
Matrix: arm64 / test-rosetta-t5x / vit-single-process-multi-device
Waiting for pending jobs
Matrix: publish-containers
Waiting for pending jobs
finalize / publish-badge
Annotations
1 error
Invalid workflow file: .github/workflows/ci.yaml#L28
The workflow is not valid. NVIDIA/JAX-Toolbox/.github/workflows/_test_te.yaml@1869da1c411d3268c5133a795e1cbeaa533a7ffd (Line: 28, Col: 29): Invalid input, SLURM_LOGIN_HOSTNAME is not defined in the referenced workflow.
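
GitHub Actions reports this kind of startup failure when a job passes an input under `with:` to a reusable workflow that does not declare that input under `on.workflow_call.inputs`. The fragment below is a minimal sketch of the relationship, not the contents of the actual JAX-Toolbox files: the callee file name `_callee.yaml`, the job names, the `string` type, and the example hostname are all assumptions for illustration. Only the input name `SLURM_LOGIN_HOSTNAME` comes from the error message.

```yaml
# --- Callee: hypothetical reusable workflow, .github/workflows/_callee.yaml ---
on:
  workflow_call:
    inputs:
      SLURM_LOGIN_HOSTNAME:      # must be declared here, or callers that pass it are rejected
        type: string             # assumption: the hostname is passed as a plain string
        required: false

jobs:
  show-host:
    runs-on: ubuntu-latest
    steps:
      - name: Print the login host
        env:
          LOGIN_HOST: ${{ inputs.SLURM_LOGIN_HOSTNAME }}
        run: echo "login host is $LOGIN_HOST"
---
# --- Caller: a job in another workflow file that uses the callee ---
on: pull_request

jobs:
  call-callee:
    uses: ./.github/workflows/_callee.yaml
    with:
      # Every key under `with:` must match an input declared by the callee.
      SLURM_LOGIN_HOSTNAME: login.example.com   # hypothetical value
```

Under these assumptions there are two ways to clear the error: declare the input in the referenced workflow, as in the first document, or remove the key from the caller's `with:` block. Because the whole workflow fails validation before any job starts, every job in the run stays at "Waiting for pending jobs" until the mismatch is resolved.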