Test datasets for local MSK modules or subworkflows added to the modules repo
- dmp_demux (raw fastq files and metadata from sequencing for demultiplexing used in DMP)
- argos (test datasets for Argos Pipeline)
- neoantigen (test datasets for Neoantigen)
- hg37 (downsized reference files for hg37, and downsized chromosome 22)
- hla (test datasets for hla)
- chr22 (downsized chromosome 22)?
- The master branch should stay empty. Actual test datasets are organized into separate branches.
- Each test file should be less than 50 MB, to stay within GitHub's file size limits.
- Each branch contains datasets specifically for a pipeline (e.g. argos), project (e.g. dmp_demux), module/sub-workflow (e.g. neoantigen), or references (e.g. hg37)
- Each branch contains a README file with a brief introduction to the test datasets, preferably with a link to the corresponding workflow repo (if applicable)
- If a new test dataset does not belong in any existing branch, create a `feature/<test-dataset>` branch based on the `master` branch. Once the test dataset is ready to publish, contact our Review Team to do the final review and create a new official branch.
- If the new test dataset fits in an existing branch, please make a PR to the corresponding branch.
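
The 50 MB per-file limit can be checked locally before opening a PR. The following is a minimal sketch, not part of this repo: the function name `find_oversized_files` is hypothetical, and it assumes "50 MB" means 50 × 1024 × 1024 bytes and that anything under a `.git` directory should be skipped.

```python
from pathlib import Path

# Assumption: "50 MB" is interpreted as 50 MiB (50 * 1024 * 1024 bytes).
MAX_BYTES = 50 * 1024 * 1024


def find_oversized_files(root, max_bytes=MAX_BYTES):
    """Return (path, size_in_bytes) pairs for files that are NOT smaller than max_bytes.

    Files inside any `.git` directory are ignored.
    """
    root = Path(root)
    oversized = []
    for path in sorted(root.rglob("*")):
        # Skip Git's internal object store and metadata.
        if ".git" in path.relative_to(root).parts:
            continue
        if path.is_file():
            size = path.stat().st_size
            # The rule is "less than 50 MB", so a file of exactly max_bytes fails.
            if size >= max_bytes:
                oversized.append((path, size))
    return oversized
```

Calling `find_oversized_files(".")` from the root of a checked-out branch returns an empty list when every file is within the limit; otherwise each returned pair names a file that should be downsized before the PR.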