Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Notebook AlphaLink2 producing corrupted output #6

Open
ggerlach1 opened this issue Jul 7, 2023 · 6 comments
Open

Notebook AlphaLink2 producing corrupted output #6

ggerlach1 opened this issue Jul 7, 2023 · 6 comments

Comments

@ggerlach1
Copy link

I am excited to see the ColabFold notebook for this method! I tired running it with a cystine rich peptide that binds to larger protein with a single cross-link. The structure produced has a reasonable looking structure for the protein, but the smaller peptide appears to be chopped up into several small pieces. I say appears because the .zip file that is produced is corrupted so I cannot look at the results outside of the embedded viewer. Additionally, in the original AlphaLink there is a method to specify distances other than the one that it was trained on is this a feature that is possible in AlphaLink2?

Any help is greatly appreciated. Very useful method and the notebooks are a great resource.

@lhatsk
Copy link
Collaborator

lhatsk commented Jul 8, 2023

Glad to hear! :-)

I'm curious, we have never tested AlphaLink on peptides, how well does AlphaFold-Multimer work in this case?

Unfortunately, I cannot reproduce the problem with the .zip file on Mac OS. You could try to replace the zipping with taring. Press "show code" in the last cell and replace the last lines with this:

!tar cvfz {target_id}.tar.gz {" ".join(file_lists)}
files.download(f'{target_id}.tar.gz')

Let me know if that works for you.

Regarding the distances, unfortunately, it's not possible at the moment. It is planned but I cannot give a time estimate. Do you have a specific distance in mind? We have another network trained on 10A (v2 weights) which we may be able to release in the meantime.

@ggerlach1
Copy link
Author

In my (limited) experience AlphaFold-Multimer works okay with peptides and specifically with the goal of generating a possible structure it can be helpful.

Thanks for the taring idea, the output is no longer corrupted.

The linker we have in this system is at max 11A between side chain nitrogens, so I think the 10A network would be much closer. If you can release that I would appreciate it.

@lhatsk
Copy link
Collaborator

lhatsk commented Jul 11, 2023

I added the 10A network as an option to the ColabFold.

Since you were able to look at the peptide now, was it reasonable or somehow broken up?

@ggerlach1
Copy link
Author

Thank you!
It was a reasonable output I believe there was an issue on my end with the fasta file (erroneous new line characters).

I look forward to trying the 10A network.

@ggerlach1
Copy link
Author

I ran the 10A network and unfortunately it produced the copped up output I had seen previously. This was with the same fasta and csv which had previous produced valid output, so I am confident it is not an issue with the input. Interestingly, this time the zip file was not corrupted, but the peptide chain has many residues too far apart and a collection of clashing atoms.

Do you have any thoughts on what could be causing this seemingly random occurrence?

@lhatsk
Copy link
Collaborator

lhatsk commented Jul 18, 2023

Strange. Sorry, I have no idea. I have never come across this. Would be curious to see what happens if you relax it. Maybe it's worth running this with your output? https://colab.research.google.com/github/sokrypton/ColabFold/blob/main/beta/relax_amber.ipynb Probably it just outright fails though.

If you are able to share your inputs (maybe confidentially by mail, I would be very curious).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants