The datasets required for a module's tutorial exercises are described in a data.yaml
file. As well as describing where to source the dataset files from, we also need to describe
where on the filesystem the files should reside and who should own them.
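
As a rough illustration only, a data.yaml entry covering those three pieces of information (source, location on the filesystem, and ownership) might look something like the sketch below. The key names (`data`, `url`, `dest`, `owner`), the URL, the path, and the user name are all hypothetical placeholders chosen for this example, not the actual schema, so check an existing module's data.yaml for the real field names.

```yaml
# Hypothetical sketch: key names and values are assumptions, not the real schema.
data:
  - url: https://example.org/datasets/sample_reads.fastq.gz  # where to source the file from (placeholder URL)
    dest: /mnt/workshop/data/module_name                      # where the file should reside (placeholder path)
    owner: trainee                                            # which user should own the file (placeholder user)
```

Each list item in this sketch describes one dataset file, so a module that needs several files would simply repeat the entry with different values.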