Skip to content

Use a new training set #81

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
kantLeroy opened this issue Dec 11, 2024 · 1 comment
Open

Use a new training set #81

kantLeroy opened this issue Dec 11, 2024 · 1 comment

Comments

@kantLeroy
Copy link

As mentionned in Frequently asked questions section, ClinSV is optimized using a dataset from HiSeq X (and potentially Mechanical Fragmentation library prep). In our lab we move to NovaSeq 6000 / X+ coupled with Tagmentation library prep.

Is that possible to use a new dataset reference to optimize ClinSV with this new lab set up ?
Thanks in advance for any advise.
Q.

@drmjc
Copy link
Member

drmjc commented Jan 20, 2025

Hi @kantLeroy,
This is a good suggestion, as we too have moved on from HiSeqX to NovaSeq 6000 now to NovaSeq X. The biggest change i'd expect is the the insert size distribution, which is baked into the automated NA12878 validation report. Doing so would also alter the IGV tracks, perhaps smoothing out some SR and DP noise of the coverage stdev tracks.

I think the changes would be pretty modest though, and if it helps alleviate your concerns, this does not stop us from using ClinSV, as is, on 30-40x depth Illumina germline WGS data.

We'd have to re-create the scripts to makes these data, as the developer has left academia, so I don't see it happening in the next quarter. in the backlog to discuss with @J-Bradlee...

cheers, M

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants