Submission to IHEC is a multistage process which involves 3 submissions:
- submission of the raw data to a public archive (e.g. EGA, DDBJ or dbGaP)
- submission of the metadata to the EpiRR registry
- submission of analysis files to the IHEC Data Portal
To avoid too many iterations, IHEC provides automatic validation software to verify compliance at the start of the process.
The metadata is stored and transmitted as an XML file. Here are instructions on metadata preparation.
Having prepared the metadata, proceed to submit the raw experimental data to the archive of your choice. For sequencing raw data, the preferred format is FASTQ (over BAM and others).
Having submitted data into the archive, you should now have global identifiers for your dataset, samples and experiments. You can now prepare an EpiRR JSON submission file
Having submitted metadata into EpiRR, you should now have EpiRR identifiers to each epigenome. You can now use this information to link publicly visible analysis files (in BigBed and BigWig format) to the epigenome using an IHEC Data Hub JSON file