Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feature request paf filter #7

Open
colindaven opened this issue Apr 23, 2024 · 2 comments
Open

feature request paf filter #7

colindaven opened this issue Apr 23, 2024 · 2 comments

Comments

@colindaven
Copy link

colindaven commented Apr 23, 2024

Hi,

I couldn't find any utilities for filtering pafs and this tool seems to be the most capable and toolkit-like.

Maybe it would be an option to put in a filter to avoid users having to hack together one liners in awk, like I did here.

# this is nextflow so contains escaped $

   awk '{if ((P=\$11/\$2*100) >= $params.min_align_length) {print P"\t"\$_}}' $paf

Filters might include

  • Mapping Quality if available ?
  • NM number mismatches
  • alignment length

It might be easy to do this in awk, but it is a bit hacky and error prone.

Other repos do not seem to offer a paf filter either AFAIK

https://github.com/AndreaGuarracino/paf2chain

https://github.com/AndreaGuarracino/pafgnostic

https://github.com/ekg/pafplot

etc

Thanks

@glennhickey
Copy link
Contributor

I agree that these are useful features to add.

In the meantime, you can consider using gaffilter. It's a little specific to minigraph-cactus, but I think does some of what you want (and despite its name, works on PAF as well as GAF).

@colindaven
Copy link
Author

Hi @glennhickey thanks, that is useful.

I wrote something similar for gaf filtering once and noticed it at least partially worked on PAF too.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants