Chromosomal Conformation Course

Course description

3C-based methods, such as Hi-C, produce a huge amount of raw data as pairs of DNA reads that are in close spatial proximity in the cell nucleus. Overall, those interaction matrices have been used to study how the genome folds within the nucleus, which is one of the most fascinating problems in modern biology. The rigorous analysis of those paired-reads using computational tools has been essential to fully exploit the experimental technique, and to study how the genome is folded in the space. Currently, there is a clear expansion on the wealth of data on genome structure with the availability of many dataset of Hi-C experiments down to 1Kb resolution (see for example: http://hic.umassmed.edu/welcome/welcome.php ; http://promoter.bx.psu.edu/hi-c/view.php or http://www.aidenlab.org/data.html ). In this course, participants will learn to use TADbit, a software designed and developed to manage all dimensionalities of the Hi-C data:

1D - Map paired-end sequences to generate Hi-C interaction matrices
2D - Normalize matrices and identify constitutive domains (TADs, compartments)
3D - Generate populations of structures which satisfy the Hi-C interaction matrices
4D - Compare samples at different time points

Participants can bring- specific biological questions and/or their own 3C-based data to analyze during the course. At the end of the course, participants will be familiar with the TADbit software and will be able to fully analyze Hi-C data. Note: Although the TADbit software is central in this course, alternative software will be discussed for each part of the analysis.

Why TADbit?

This course uses exclusively mostly TADbit to analyze HiC data, however many other tools are available ( https://www.multiscalegenomics.eu/MuGVRE/3c-tools-comparison/ ). The advantage of using TADbit is that it covers all the step of the analysis, from the quality check of the FASTQ reads to the 3D modeling, and is thus the perfect backbone for any HiC analysis pipeline. Moreover, as it is a python library, it is relatively easy to plugin analysis from other tools at any step.

Target Audience

The course design is oriented towards experimental researchers and bioinformaticians at the graduate and post-graduate levels. The last edition of this course was attended by people with different backgrounds and interested in the genome organization. Moreover, Hi-C data have recently been used in metagenomics studies to accurately cluster metagenome assembly contigs into groups that contain nearly complete genomes of each species. It is likely that the participants to this course aim at getting involved in generating Hi-C data for chromosome structure determination or that they just want to be able to correctly interpret and analyse publicly available data.

Course Pre-requisites

Recommended Linux and basic Python programming skills, graduate level in Life Sciences.

Content

	Lectures (pdf)	Core pipeline (notebooks)	Annex (notebooks)
Day1	Intro UNIX Intro Python		Software installation
Day2	Intro TADbit NGS in HiC	Hi-C Quality check Mapping	Prepare reference genome Download Hi-C experiment
Day3	From HiC to 3D models
Day4	Normalization of HiC data and DryHiC CSnorm	Parsing mapped reads-MboI Parsing mapped reads-HindIII Filterind reads Normalization	Compare/merge experiments
Day5		Compartments and TADs Parameter optimization Model optimization	Align and compare TADs Analysis of 3D models

TADbit tools

Most of the tasks of the "core pipeline" can be tunned directly from command line (without any python), using TADbit tool. Have a look to the commands, and the metadata of the results.

For now TADbit tool is not incuded in the general documetation, as it is still "under construction". Use it carefully, and don't hesitate to repport anyweird behaviour you observe.

TADbit version

This tutorial is associated with a specific version of TADbit, if you wish to reproduce exactly the results in the notebooks you should use the version of TADbit tagged CRG_CCc_2017.

To install this version do:

git clone https://github.com/3dgenomes/tadbit
cd tadbit
git checkout tags/CRG_CCc_2017
sudo python setup.py install

http://www.crg.eu/en/event/coursescrg-chromosomal-conformation-0

Name		Name	Last commit message	Last commit date
Latest commit History 115 Commits
Documents/Logo		Documents/Logo
Notebooks		Notebooks
Papers		Papers
Participants		Participants
Presentations		Presentations
TADbit_tools		TADbit_tools
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README.md.orig		README.md.orig

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Chromosomal Conformation Course

Course description

Target Audience

Course Pre-requisites

Content

TADbit tools

TADbit version

About

Releases

Packages

Languages

License

deboramarks/Chromosomal-Conformation-Course

Folders and files

Latest commit

History

Repository files navigation

Chromosomal Conformation Course

Course description

Target Audience

Course Pre-requisites

Content

TADbit tools

TADbit version

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages