Clustering ProstT5 DB? #361

Open
HaimAshk opened this issue Oct 1, 2024 · 4 comments

Comments

@HaimAshk

HaimAshk commented Oct 1, 2024

Dear FoldSeek team,

Sorry for what may be a very basic question; I could not find the answer in the documentation.
I have a ProstT5-encoded DB for Foldseek. Is there an easy way to cluster all the proteins in the DB? It seems the easy-cluster workflow does not accept a DB as input (or I missed it).
How can I achieve it?

Thanks for any advice!
Haim

@milot-mirdita
Member

That should work. What did you execute? You might need to use cluster instead of easy-cluster, since the latter only accepts structures, not databases, as input.

@HaimAshk
Author

HaimAshk commented Oct 1, 2024

Thanks for the prompt response, Milot!
Yes, using just cluster seems to work. Thanks!
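
For readers landing here later, the approach confirmed above can be sketched roughly as follows. This is a minimal illustration, not the exact commands used in this thread: `proteinsDB`, `clusterDB`, and `tmp` are placeholder names, and the `createtsv` export step is an assumption based on the module Foldseek inherits from MMseqs2. The `run` helper is hypothetical and only prints commands unless `DRY_RUN=0` is set.

```shell
#!/bin/sh
# Sketch: cluster an existing ProstT5-encoded Foldseek database.
# proteinsDB, clusterDB and tmp are placeholder names (assumptions).

# Hypothetical helper: print each command; execute it only when DRY_RUN=0.
run() {
    echo "+ $*"
    if [ "${DRY_RUN:-1}" = "0" ]; then
        "$@"
    fi
}

cluster_prostt5_db() {
    db="$1"; clu="$2"; tmp="$3"
    # `cluster` takes a database as input, unlike `easy-cluster`.
    run foldseek cluster "$db" "$clu" "$tmp"
    # Assumed export step: write representative/member pairs as TSV.
    run foldseek createtsv "$db" "$db" "$clu" "$clu.tsv"
}

cluster_prostt5_db proteinsDB clusterDB tmp
```

Run it as-is to preview the commands, then with `DRY_RUN=0` once the names match your own database.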

@shiraz-shah

@HaimAshk, could you please share the commands you used to get this working? I have ProstT5 3Di FASTAs for my proteins and can't get cluster to work for the life of me. It just throws a database type error.

@milot-mirdita
Member

easy-cluster will call ProstT5 internally if you give it a FASTA file. cluster cannot do this.
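
A rough sketch of the FASTA route described above, under stated assumptions: the input is an amino acid FASTA, the ProstT5 weights are fetched with `foldseek databases ProstT5`, and `easy-cluster` accepts the same `--prostt5-model` flag that the Foldseek README documents for `createdb` (this flag on `easy-cluster` is an assumption, not confirmed in this thread). `proteins.fasta`, `result`, `tmp`, and `weights` are placeholder names, and the `run` helper is hypothetical; it only prints commands unless `DRY_RUN=0` is set.

```shell
#!/bin/sh
# Sketch: let easy-cluster run ProstT5 itself on a FASTA input.
# proteins.fasta, result, tmp and weights are placeholder names (assumptions).

# Hypothetical helper: print each command; execute it only when DRY_RUN=0.
run() {
    echo "+ $*"
    if [ "${DRY_RUN:-1}" = "0" ]; then
        "$@"
    fi
}

easy_cluster_fasta() {
    fasta="$1"; out="$2"; tmp="$3"; weights="$4"
    # Download the ProstT5 model weights.
    run foldseek databases ProstT5 "$weights" "$tmp"
    # Assumption: easy-cluster takes --prostt5-model like createdb does.
    run foldseek easy-cluster "$fasta" "$out" "$tmp" --prostt5-model "$weights"
}

easy_cluster_fasta proteins.fasta result tmp weights
```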
