fidibus with no sequence input #232

Open
vpbrendel opened this issue Jan 21, 2020 · 0 comments

Currently we require three inputs: a gDNA FASTA file, a GFF3 file, and a protein file. However, if we are only interested in genome organization, or just want a quick view, the only input that is strictly necessary is the GFF3 annotation file. After all, most of our calculations are done purely on the ranges given in the GFF3 file, independent of sequence content.
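To illustrate the point about ranges, here is a minimal Python sketch that derives feature lengths from GFF3 coordinates alone, with no FASTA input. It is not taken from the fidibus code base: the function name `feature_lengths` and the example records are hypothetical. It relies only on the GFF3 convention that start and end are 1-based and inclusive.

```python
def feature_lengths(gff3_text):
    """Return (seqid, type, length) for each feature line in GFF3 text.

    Hypothetical helper for illustration; not part of fidibus.
    """
    lengths = []
    for line in gff3_text.splitlines():
        # Skip blank lines, directives (##...), and comments (#...).
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.split("\t")
        if len(fields) != 9:
            continue  # not a well-formed feature line
        seqid, ftype = fields[0], fields[2]
        start, end = int(fields[3]), int(fields[4])
        # GFF3 coordinates are 1-based and inclusive.
        lengths.append((seqid, ftype, end - start + 1))
    return lengths


# Made-up example records, for illustration only.
example = (
    "##gff-version 3\n"
    "chr1\t.\tgene\t1001\t2000\t.\t+\t.\tID=g1\n"
    "chr1\t.\tgene\t3001\t3500\t.\t-\t.\tID=g2\n"
)

for record in feature_lengths(example):
    print(record)
```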

I suggest we add something like a "--terse" flag (possibly implied by which inputs are provided) that gives all the functionality that does not rely on sequence input. The necessary code changes do not seem extensive: largely, we would need a version of fidibus-stats.py that does everything as before except for the sequence-dependent calculations (GCcontent, GCskew, Ncontent; checks on sequence lengths); a few if statements should do. Before that, the iloci and breakdown steps would need to omit the sequences(db) calls from prepare(db).
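A rough Python sketch of what "a few if statements" could look like. Everything here is an assumption made for illustration: the function `locus_stats`, its signature, and the result keys are hypothetical and do not reflect the actual structure of fidibus-stats.py. GC skew is computed with the conventional (G - C) / (G + C) definition, which may differ from what the existing code does.

```python
def locus_stats(start, end, sequence=None):
    """Compute statistics for one locus; hypothetical, not fidibus code.

    With sequence=None (the proposed "terse" mode), only range-based
    values are returned and all sequence-dependent work is skipped.
    """
    stats = {"Length": end - start + 1}  # 1-based, inclusive range

    if sequence is None:
        # Terse mode: no GCcontent, GCskew, Ncontent, or length check.
        return stats

    seq = sequence.upper()
    if len(seq) != stats["Length"]:
        raise ValueError("sequence length does not match the annotated range")

    g, c, n = seq.count("G"), seq.count("C"), seq.count("N")
    stats["GCcontent"] = (g + c) / len(seq)
    # Conventional GC skew; defined as 0.0 when there are no G or C bases.
    stats["GCskew"] = (g - c) / (g + c) if (g + c) else 0.0
    stats["Ncontent"] = n / len(seq)
    return stats


print(locus_stats(1, 8))              # terse mode, GFF3 range only
print(locus_stats(1, 8, "GGGCATNN"))  # full mode, with sequence
```

Making the mode implicit in whether a sequence is passed mirrors the "possibly implicit by what is provided as input" idea: the caller never has to set a separate flag.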

A quick way to try this might be to test a code version in which we delete all the calls involving the sequence data. I think the code is so well organized that this change should not be difficult to implement (unless hidden dependencies show up ...).

Benefit: This change would hugely speed up sigmaphi determination, since we would skip downloading the sequence files to disk in the first place, along with the subsequent disk writes.
