Dawn E. Smith edited this page Aug 6, 2019 · 16 revisions

Available Clusters

We have three clusters available to us: our local cluster, the SCC, and Scinet.

All three clusters use Slurm. A quick overview of Slurm commands can be found here. The official Slurm documentation is here.

Submitting jobs

Commands available:

  • sbatch. Built into Slurm and available on all three clusters. Docs: man sbatch or the official docs here.
  • qbatch. A Python package installed on our cluster and the SCC. It makes submitting jobs a bit easier by filling in some settings automatically. Docs: official docs here.
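As a rough illustration of submitting with sbatch, the sketch below writes a minimal batch script and submits it. The file name example_job.sh, the job name and every resource value are assumptions chosen for illustration, not settings taken from this wiki, and each cluster may expect different resource requests.

```shell
#!/bin/bash
# Write a minimal batch script. The job name, time limit, CPU count and
# memory below are illustrative assumptions; adjust them for your cluster.
cat > example_job.sh <<'EOF'
#!/bin/bash
#SBATCH --job-name=example
#SBATCH --time=00:10:00
#SBATCH --cpus-per-task=1
#SBATCH --mem=1G
#SBATCH --output=example_%j.out

echo "Running on $(hostname)"
EOF

# Submit only if sbatch is on the PATH, so this is safe to run anywhere.
if command -v sbatch >/dev/null 2>&1; then
    sbatch example_job.sh
else
    echo "sbatch not found; wrote example_job.sh but did not submit it"
fi
```

In the --output pattern, %j is replaced by the job id, so each run writes to its own log file.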

Checking on your jobs

sacct. With no options it lists only your own jobs (by default, those since the start of the current day). 'sacct -a' lists jobs from all users, which gives you an overview of how busy the queue is.
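The default sacct output can be wide, so one option is to wrap it in a small shell function that selects a few columns. The function name my_jobs is a hypothetical helper, not part of Slurm, and the column list is just one reasonable choice.

```shell
# Hypothetical helper (not part of Slurm): list jobs with a compact set of
# columns. Any extra arguments are passed straight through to sacct.
my_jobs() {
    if ! command -v sacct >/dev/null 2>&1; then
        echo "sacct not found on this machine" >&2
        return 1
    fi
    sacct --format=JobID,JobName,Partition,State,Elapsed "$@"
}
```

For example, `my_jobs` shows your own jobs and `my_jobs -a` shows jobs from all users.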

Cancelling jobs

  • Cancel a specific job: scancel <job id>
  • Cancel all of your jobs: scancel -u <username>
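Sometimes you may want to cancel only the jobs that have not started yet. A minimal sketch, assuming your cluster username matches $USER; cancel_pending is a hypothetical helper name, not a Slurm command.

```shell
# Hypothetical helper (not part of Slurm): cancel only your jobs that are
# still waiting in the queue, leaving running jobs untouched.
# Assumes your cluster username is the same as $USER.
cancel_pending() {
    if ! command -v scancel >/dev/null 2>&1; then
        echo "scancel not found on this machine" >&2
        return 1
    fi
    scancel --user="${USER}" --state=PENDING
}
```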

Accessing the clusters

Local Cluster

You don't need to do anything special to access our cluster. All our computers are set up to submit jobs to it. It is the smallest of the three clusters and runs on our workstations, three dedicated compute nodes (40 cores and 128GB of memory each) and a dedicated GPU compute node (4x GeForce Titan GPUs).

The SCC

Anyone with a lab account can get access to CAMH's Specialised Computing Cluster by submitting a request here. Once you have an account you can log in with ssh <username>@scclogin.camhres.ca. Their full documentation is available here. This cluster is larger than ours and is entirely dedicated to running jobs, so it can churn through a lot more work. However, it is shared by multiple labs at CAMH, so it can be busier than ours.

Scinet

To get access to Scinet you have to register with Compute Canada and then apply for Scinet access. Their website has complete instructions here. To use our allocation, ask a lab member who already has access which RAC to apply under. Scinet is the largest of the three clusters, but it is widely (and heavily) used, so you're more likely to have to wait. You'll also have to be more careful about how you structure your jobs and what resources you request, because their rules are stricter.

Once you have an account you can log in with ssh <scinet username>@niagara.scinet.utoronto.ca
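To avoid retyping the login hostnames, one option is to keep them in an SSH config file. The sketch below writes a standalone file rather than touching ~/.ssh/config. The file name cluster_ssh_config, the aliases scc and niagara, and the placeholder usernames are all assumptions; only the two hostnames come from this page.

```shell
# Placeholder usernames (assumptions): set SCC_USER and SCINET_USER in your
# environment, or replace the defaults with your real account names.
SCC_USER="${SCC_USER:-your_scc_username}"
SCINET_USER="${SCINET_USER:-your_scinet_username}"

# Write a standalone SSH config with one short alias per remote cluster.
cat > cluster_ssh_config <<EOF
Host scc
    HostName scclogin.camhres.ca
    User ${SCC_USER}

Host niagara
    HostName niagara.scinet.utoronto.ca
    User ${SCINET_USER}
EOF

echo "Wrote cluster_ssh_config; connect with: ssh -F cluster_ssh_config scc"
```

If the aliases work for you, the same Host entries could be copied into ~/.ssh/config so that a plain `ssh scc` or `ssh niagara` is enough.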
