Skip to content

Latest commit

 

History

History
28 lines (16 loc) · 1.58 KB

Submissions.md

File metadata and controls

28 lines (16 loc) · 1.58 KB

Contributing a dataset to IHEC

Overall Process

Submission to IHEC is a multistage process which involves 3 submissions:

  1. submission of the raw data to a public archive (e.g. EGA, DDBJ or dbGaP)
  2. submission of the metadata to the EpiRR registry
  3. submission of analysis files to the IHEC Data Portal

To avoid too many iterations, IHEC provides automatic validation software to verify compliance at the start of the process.

Submission Process

1. Metadata preparation

The metadata is stored and transmitted as an XML file. Here are instructions on metadata preparation.

2. Raw data submission

Having prepared the metadata, proceed to submit the raw experimental data to the archive of your choice. For sequencing raw data, the preferred format is FASTQ (over BAM and others).

3. EpiRR submission

Having submitted data into the archive, you should now have global identifiers for your dataset, samples and experiments. You can now prepare an EpiRR JSON submission file

4. IHEC Data Portal submission

Having submitted metadata into EpiRR, you should now have EpiRR identifiers to each epigenome. You can now use this information to link publicly visible analysis files (in BigBed and BigWig format) to the epigenome using an IHEC Data Hub JSON file