Segmental duplications #1458

Open
jonahcullen opened this issue Aug 7, 2024 · 1 comment
Comments

@jonahcullen

Hello, I am trying to reproduce the Year1 centromeric satellite and segmental duplication annotations that are described for use with contig.inclusion.stats.R. If I understand correctly, the centromeric annotations are produced with dna-brnn as part of cactus-preprocess (--maskMode brnn). Which mask action was chosen? I am probably missing it because I am unfamiliar with the pipeline, but where and how are the segmental duplications in the sedef.bedpe files marked? Was that done with sedef, or now with biser? And what, if anything, was done after sedef/biser to generate, for example, HG00438.maternal.sedef.bedpe?

Thanks for your time,
Jonah.

@glennhickey
Collaborator

dna-brnn was run with its default settings. From the HPRC paper (https://www.nature.com/articles/s41586-023-05896-x#Sec120):

SD annotation

SDs were annotated using sedef (ref. 85) after masking repeats in each assembly. Repeats annotated with more than 20 copies corresponded to unannotated mobile elements and were excluded from the analysis. The pipeline for annotating SDs is available at GitHub (https://github.com/ChaissonLab/SegDupAnnotation/releases/tag/vHPRC).

I think the segdup data may live here:

https://s3-us-west-2.amazonaws.com/human-pangenomics/index.html?prefix=submissions/0175F9C0-83B5-4CA3-9256-EC0593490EE7--repeats-and-segdups/
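
For anyone who wants to experiment with the ">20 copies" exclusion quoted above on a sedef-style BEDPE file, here is a minimal Python sketch. It is not the SegDupAnnotation pipeline. It assumes only the standard BEDPE layout (chrom1, start1, end1, chrom2, start2, end2 in the first six tab-separated columns), and it assumes that "copies" can be approximated by the number of pairwise alignments an interval takes part in. The function name `filter_high_copy` and the `max_copies` parameter are hypothetical, and the actual pipeline may count copies differently.

```python
import csv
import io
from collections import Counter


def filter_high_copy(bedpe_text, max_copies=20):
    """Keep BEDPE pairs whose two intervals each appear in at most max_copies pairs.

    Assumption: copy number is approximated by how many pairwise
    alignments an exact interval (chrom, start, end) takes part in.
    """
    rows = [
        r for r in csv.reader(io.StringIO(bedpe_text), delimiter="\t")
        if r and not r[0].startswith("#")  # skip blank lines and header/comment lines
    ]

    def ends(r):
        # The two intervals of a BEDPE record (first six columns).
        return (r[0], int(r[1]), int(r[2])), (r[3], int(r[4]), int(r[5]))

    # Count how many pairs each interval takes part in.
    counts = Counter()
    for r in rows:
        a, b = ends(r)
        counts[a] += 1
        counts[b] += 1

    # Drop every pair that touches an over-represented interval.
    kept = []
    for r in rows:
        a, b = ends(r)
        if counts[a] <= max_copies and counts[b] <= max_copies:
            kept.append(r)
    return kept


if __name__ == "__main__":
    # Tiny made-up example; the threshold is lowered to 1 so the effect is visible.
    demo = (
        "#chr1\tstart1\tend1\tchr2\tstart2\tend2\n"
        "chr1\t100\t200\tchr1\t500\t600\n"
        "chr1\t100\t200\tchr1\t900\t1000\n"
        "chr2\t10\t50\tchr3\t10\t50\n"
    )
    for row in filter_high_copy(demo, max_copies=1):
        print("\t".join(row))
```

Exact-coordinate matching is the simplest possible grouping. Overlapping but non-identical intervals would need to be merged first (for example with an interval tree) before counting.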
