Compute Clusters
We have three clusters available to us: our own lab cluster, CAMH's Specialised Computing Cluster (SCC), and SciNet.
All three clusters use Slurm. A quick overview of Slurm commands can be found here. The official Slurm documentation is here.
Commands available:
- sbatch. Built in to Slurm and available on all three clusters. Docs: man sbatch or the official docs here.
- qbatch. A Python package installed on our cluster and the SCC. It makes submitting jobs a bit easier by automatically setting certain settings. Docs: official docs here.
- sacct. Without any options it shows a list of your submitted jobs only. sacct -a shows all jobs currently submitted, so you can get an overview of how busy the queue is.
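As a rough illustration of the sbatch workflow described above, the sketch below writes a minimal job script and submits it only if sbatch is on the PATH. The file name hello_job.sh, the job name, and the resource values are placeholders chosen for this example, not settings taken from any of the three clusters; check each cluster's documentation for its actual limits.

```shell
#!/bin/sh
# Write a minimal Slurm job script. All #SBATCH values are illustrative
# placeholders; adjust them to the cluster you are submitting to.
cat > hello_job.sh <<'EOF'
#!/bin/bash
#SBATCH --job-name=hello
#SBATCH --time=00:10:00
#SBATCH --cpus-per-task=1
#SBATCH --output=hello_%j.out

echo "Running on $(hostname)"
EOF

# Submit only where Slurm is installed; otherwise just report the file.
if command -v sbatch >/dev/null 2>&1; then
    sbatch hello_job.sh
else
    echo "sbatch not found; wrote hello_job.sh without submitting"
fi
```

Once the job is submitted, running sacct should list it along with its state. The %j in the output file name is replaced by Slurm with the job ID.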
Cancel a specific job:
scancel <job id>
Cancel all of your jobs:
scancel -u <username>
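Because cancelling every job is hard to undo, it can help to preview the command first. The function below is a hypothetical convenience wrapper, not part of Slurm: the name cancel_all_jobs and the DRY_RUN variable are assumptions made for this sketch. It only calls scancel -u, which is the command shown above.

```shell
# Hypothetical helper (not part of Slurm): cancel all jobs for a user.
# With DRY_RUN=1, or where scancel is not installed, it only prints the
# command it would run instead of running it.
cancel_all_jobs() {
    # Default to the current login name when no username is given.
    user="${1:-$(id -un)}"
    if [ "${DRY_RUN:-0}" = "1" ] || ! command -v scancel >/dev/null 2>&1; then
        echo "scancel -u $user"
    else
        scancel -u "$user"
    fi
}
```

Setting DRY_RUN=1 before calling cancel_all_jobs prints the command without cancelling anything; unset it to run the cancellation for real.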
You don't need to do anything special to access our cluster: all our computers are set up to submit jobs to it. It runs on our workstations, three dedicated compute nodes (40 cores, 128MB each), and a dedicated GPU computing node (4x GeForce Titan GPUs). It is the smallest of the three clusters available.
Anyone with a lab account can get access to CAMH's Specialised Computing Cluster (SCC) by submitting a request here. After you get an account you can log in with ssh <username>@scclogin.camhres.ca. Generally the username and password will be the same as the ones you use to log in to our workstations. The SCC's full documentation is available here. This cluster is larger than ours and is entirely dedicated to running jobs, so it can churn through a lot more work. It is also shared by multiple labs at CAMH, so it can be busier than ours.
Our instructions for registering for and using SciNet can be found here. SciNet is the largest of the three clusters, but it is widely (and heavily) used, so you're more likely to have to wait. You'll also have to be more careful about how you structure your jobs and what resources you request, because SciNet's rules are stricter.
Once you have an account you can log in with ssh <scinet username>@niagara.scinet.utoronto.ca
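If you log in to both remote clusters often, an SSH client configuration can shorten the commands above. The fragment below is a sketch for ~/.ssh/config. The aliases scc and niagara are arbitrary names chosen for this example, and both usernames are placeholders you would need to replace with your own.

```text
# ~/.ssh/config (sketch; aliases are illustrative, usernames are placeholders)
Host scc
    HostName scclogin.camhres.ca
    User your_username

Host niagara
    HostName niagara.scinet.utoronto.ca
    User your_scinet_username
```

With entries like these in place, ssh scc and ssh niagara should behave like the full commands.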