-
Notifications
You must be signed in to change notification settings - Fork 50
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature request: Compatibility with Starlong #218
Comments
To some extent, this is already possible. Technically, Arriba is compatible with STARlong. I just gave it a spin with PacBio long reads and it worked. The There weren't any false positives. I am not sure about the false negative rate. Looking at the discarded fusions file, I mainly saw false negatives that were discarded due to lack of support (they had only 1 supporting read). So Arriba reported pretty much everything that can be reported with confidence. So if anything was missed, it was mainly because STAR did not find an alignment. I believe STAR should be able to find most fusions with ease given the long read length. It may have trouble with reads spanning multiple breakpoints (=> unmapped, too short), which may be ameliorated through STAR parameter tweaking. However, STAR will only ever report one chimera per read. Multiple chimeric alignments of the same read are supported by neither STAR nor Arriba. In addition to disabling the mismappers filter, other Arriba parameters may improve the sensitivity some more. For example, one could reduce the minimum number of supporting reads to 1. I need to give this a try. What made you think that Arriba is not compatible with STARlong? Did it fail on a sample of yours? Or did it miss fusions in a sample? |
HI @suhrig - apologies after my post I managed to make it work fine with StarLong - sorry for wasting your time! |
No time wasted. I will make an enhancement that the If you can share any STAR or Arriba parameter optimizations that improve calling, let me know! |
Hi I got STARlong-arriba working on a sample. When I analyzed the results from arriba, I expected to find CIC-DUX4 fusion since that fusion was found while viewing the alignment in IGV viewer but this fusion was not found the fusions.tsv and the fusions.discarded. I used the Thanks! Karlijn |
In order to detect CIC-DUX4 fusions, it is important to enable multimapping chimeric reads. Did you use Would it be possible to share the BAM file with me or at least the reads mapping to the fusion breakpoints which you see in IGV? |
Thank you for the quick response! Here is the command I used for STARlong: I did use |
It took a while long than expected to get a reply and confirmation from my supervisor. I sent you an email about it and a filesurfsender with the BAM file and its BAI file. I sent it to [email protected] but that mail address does not exist. To what email address can I send the filesurfsender? |
I no longer work at the DKFZ. You can send the link to this address instead: sebastian [dot] uhrig [at] googlemail [dot] com. |
Thank you for your quick response! I send you the files via filesurfsender and sent an email. Let me know if you received both. |
@YinYangKarly I had a look at the BAM file. The problem is with STAR, not with Arriba. STAR fails to report a chimeric alignment for the supporting reads. It either aligns the part of the read that maps to CIC or it aligns the part of the read that maps to DUX4, but not both. I am not sure what the reason is. When I map the two parts of the reads individually, STAR finds an alignment. They are unique hits, too. So it has nothing to do with multimapping reads. What's even more strange is the fact that when I cut down the long read to ~60nt on both sides of the fusion junction, then STAR correctly reports a chimeric alignment! The problem must be a limitation of STARlong reporting chimeric alignments of long reads. This is not something I can fix. Maybe the alignment parameters can be tweaked in a way to make this work. I have tried a number of things, but nothing worked. I think it would be best to bring this up with the developer of STAR. He will know much better which parameters need tweaking or if this is a bug that needs fixing. You could send him an example read, such as read |
Thank you for looking into it! I will think about your advice and considerations. I have another question about arriba how it determines confidence in STAR short read alignment, because the CIC-DUX4 fusion we are looking for in the short read alignment was found in the discarded fusion file. While filtering out |
It would be best to open another issue about this if you don't mind. |
Hi
I am trying to do fusion neoantigen prediction using Arriba, however STAR can't cope with long reads and ideally STARlong should be used - do you think this will be able to be supported please?
THanks
Andrew
The text was updated successfully, but these errors were encountered: