IUPAC symbol Y #1011

Closed
Liukvr opened this issue Sep 18, 2024 · 2 comments

Liukvr commented Sep 18, 2024

Dear developers,
Many thanks for this very useful tool. I have a question regarding DemuxFastqs. My barcode contains 4 IUPAC symbols (e.g. Y and R), but when demultiplexing they are counted as mismatches. Is there a way to handle this without increasing --max-mismatches to 4?
Thanks in advance,
Luca

nh13 (Member) commented Feb 6, 2025

@Liukvr

  1. See the fqtk demux tool, which we prefer because it is much more performant. DemuxFastqs was originally built to handle MiSeq-sized data. We would be more willing to add features to fqtk than to DemuxFastqs, so I've created an issue there: Support IUPAC bases in the expected barcode fqtk#52
  2. One temporary workaround would be to demultiplex using all the barcodes implied by the IUPAC bases. For example, AANAA would be expanded into AAAAA, AACAA, AAGAA, and AATAA. You would then demultiplex using the expanded list and merge the outputs afterwards. The unfortunate limitation of this workaround is that only exact matching would be supported, hence the need for new functionality, which we would be more inclined to work on in fqtk (and with sponsorship!).
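
The expansion step in the workaround could be sketched roughly as follows. This is only an illustration in Python, not part of fgbio or fqtk: the `expand_barcode` helper and the `IUPAC` table name are hypothetical, and writing the expanded barcodes into the sample metadata and merging the per-barcode outputs are left out.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes mapped to the bases they stand for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",
    "N": "ACGT",
}


def expand_barcode(barcode: str) -> list[str]:
    """Return every concrete A/C/G/T barcode implied by an IUPAC barcode.

    Hypothetical helper for illustration; raises KeyError on unknown symbols.
    """
    # One string of allowed bases per position in the barcode.
    choices = [IUPAC[base] for base in barcode.upper()]
    # The Cartesian product over positions enumerates all concrete barcodes.
    return ["".join(combo) for combo in product(*choices)]


if __name__ == "__main__":
    print(expand_barcode("AANAA"))  # ['AAAAA', 'AACAA', 'AAGAA', 'AATAA']
```

Note that the list grows multiplicatively: a barcode with four two-fold symbols such as Y and R expands to 2^4 = 16 concrete barcodes per sample.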

nh13 closed this as completed Feb 6, 2025

Liukvr (Author) commented Feb 6, 2025

Thanks @nh13, much appreciated.
