Investigate alternatives to gffutils in-memory data #264
Labels: performance (improvements to performance), priority:medium (medium priority), technical debt (a feature/requirement implemented in a sub-optimal way that must be rewritten; contrast with "cleanup")
I've been seeing some pretty severe slowdown while trying to read the NCBI annotations into memory with `gffutils`. If this persists, there are at least two possible alternatives:

- `gffutils`, storing the data in a SQLite DB (see below). We could also save this DB in the data folder and check whether it already exists, so that we don't have to create it repeatedly. I think this will be pretty fast once the DB is created.
- `gffpandas` came up in a quick Google search, and it seems like a good fit. I bet we could roll our own as well (gffpandas is, like, < 100 lines of actual code).
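
To make the "build the DB once, then reuse it" idea concrete, here is a rough standard-library sketch. It is not `gffutils` or `gffpandas` code: the function name `load_annotations`, the `features` table layout, and the file paths are all hypothetical, and the parser only handles the nine tab-separated GFF columns without interpreting the attributes field. With `gffutils` itself, the same pattern would presumably be `gffutils.create_db(gff_path, db_path)` on the first run and `gffutils.FeatureDB(db_path)` afterwards.

```python
import os
import sqlite3
from pathlib import Path

# Hypothetical table layout: one row per GFF feature line.
CREATE_SQL = """
CREATE TABLE features (
    seqid TEXT, source TEXT, featuretype TEXT,
    start_pos INTEGER, end_pos INTEGER,
    score TEXT, strand TEXT, phase TEXT, attributes TEXT
)
"""
INSERT_SQL = "INSERT INTO features VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?)"


def _feature_rows(handle):
    """Yield one tuple per feature line, skipping comments and blank lines."""
    for line in handle:
        if line.startswith("##FASTA"):
            break  # an embedded sequence section follows; no more features
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) != 9:
            continue  # malformed line; a real loader might log or raise here
        fields[3], fields[4] = int(fields[3]), int(fields[4])
        yield tuple(fields)


def load_annotations(gff_path, db_path):
    """Return a connection to the cached DB, building it only if it is missing."""
    db_path = Path(db_path)
    if not db_path.exists():
        # Build into a temporary file so an interrupted run never leaves a
        # half-written DB that would later be mistaken for a valid cache.
        tmp_path = db_path.with_name(db_path.name + ".tmp")
        if tmp_path.exists():
            tmp_path.unlink()
        conn = sqlite3.connect(str(tmp_path))
        conn.execute(CREATE_SQL)
        with open(gff_path, encoding="utf-8") as handle:
            conn.executemany(INSERT_SQL, _feature_rows(handle))
        conn.execute("CREATE INDEX idx_type ON features (featuretype)")
        conn.commit()
        conn.close()
        os.replace(tmp_path, db_path)
    return sqlite3.connect(str(db_path))
```

The first call pays the parsing cost; later calls only open the existing file, so the GFF is never re-read unless the cached DB is deleted. One gap in this sketch is that it does not detect a GFF file that has changed since the DB was built.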