-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Uniprot #62
base: dev
Are you sure you want to change the base?
Conversation
…es classified to species level
… processing and file output format
…d cmsearch outputs
Update infernal container
helpers/database-import-scripts/uniprot/uniprot_sanity_check.py
Outdated
Show resolved
Hide resolved
Adjusted pytest to include the required folders in the pythonpath
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good stuff.. I think we should merge this one.
We should start migrating the UniProt, RNAcentral, and file_organizer bash scripts as subworkflows in the pipeline. They are becoming unwieldy. Or as prefect flows... not sure yet.
Another thing that is missing is running the linters, I ran the pre-commits and bumped into loads os warning and errors (we need to fix them). I would do this after the merge in another PR, and probably not now.
Cheers
…prot file generation process
Added scripts to generate import files for Uniprot. Scripts are executed outside of the main pipeline from file_organisher.sh (to be folded into a future pipeline version). Scripts generate individual Uniprot import files for each species representative genome and one Uniprot metadata file per catalogue.