Too many FASTA files with spacers #14

Open
aziele opened this issue May 6, 2024 · 2 comments
Comments

@aziele

aziele commented May 6, 2024

Hi,

I'm encountering an issue while attempting to run the command:

spacepharer easy-predict spacers/*.fna targetSetDB predictions.tsv temp/

I have approximately 35,000 FASTA files containing spacers, each corresponding to a different bacterial genome. However, when executing the command, I receive the error message:

sh: 1: spacepharer: Argument list too long

It seems the expanded list of file names exceeds the system's limit on argument length. Is there a workaround for this limitation? Any advice would be greatly appreciated.

@milot-mirdita
Member

I pushed a change that should allow you to pass easy-predict either a directory or a file listing the paths to your spacer files. The list file needs to have the file ending .tsv. For example, to pass the directory and include only the .fna files:

spacepharer easy-predict spacers targetSetDB predictions.tsv temp/ --file-include ".fna$"

You can download precompiled binaries here: https://mmseqs.com/spacepharer
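
For the list-file option, here is a minimal sketch of how such a list could be generated without hitting the shell's argument limit. It assumes the list holds one file path per line (the exact format is not spelled out above, so treat that as an assumption), and spacer_files.tsv is only a hypothetical name. Because find writes the paths itself, no glob is ever expanded on the command line.

```shell
# Assumptions: the FASTA files live under spacers/ and end in .fna;
# spacer_files.tsv is a hypothetical name for the list file;
# "one path per line" is an assumed format, not confirmed in this thread.
mkdir -p spacers  # no-op if the directory exists; only here so the sketch runs on its own

# find emits the paths directly, so the shell never builds a 35,000-item argument list
find spacers -type f -name '*.fna' | sort > spacer_files.tsv

# Sanity check: how many files were listed
wc -l < spacer_files.tsv

# Assumed invocation, with the list file in place of the directory:
# spacepharer easy-predict spacer_files.tsv targetSetDB predictions.tsv temp/
```

The easy-predict call is left commented out because it needs the pre-release binary and an existing targetSetDB.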

If you don't want to update to a pre-release version, you should be able to do the following:

tar -cvf spacers.tar spacers/
spacepharer tar2db spacers.tar spacers_fna_db
spacepharer createdb spacers_fna_db spacers_db
spacepharer createsetdb spacers_db spacers_set_db tmp --extractorf-spacer 1
spacepharer predictmatch spacers_set_db targetSetDB targetSetDB_rev predictions.tsv tmp

@aziele
Author

aziele commented May 9, 2024

Thank you for the update. Everything's working smoothly now.
