-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Many clashes when aligning the decoupled structures of ligands in solvent into the apo state pocket #1
Comments
Hello! From the W in solvent graph on the left, it seems the coupled ligand had significant structural changes during the equilibrium simulation, hence the second peak and poor overlap between work distributions. The work values themselves also look off. Were you maybe using Would you mind sharing the input files and maybe the equilibrium trajectories that gave rise to these results? Also, which version of gromacs are you using? For comparison here are my results and input files for cdk2 with ligand 30: Cheers, |
Dear Yuriy, Thank you very much for your reply! I use the same mdp as downloaded from your github. So I used couple-intramol = no. The only different is that I changed "couple-moltype=MOL" to "couple-moltype = ligand" to follow my format of the force field. All my runs are based on single-precision GPU version of gromacs2019.06. I did energy minimization, nvt and npt following the method section in your paper. I noticed there is a large descrepency between my result and yours. I also did the second run for the ligand in solvent, and surprisely, this time the result is also different from my first run. Please download all my input and the procedure I used for every step from this link. Thanks again for your help! Yours Sincerely, Fred. |
Hi, Fred! I just uploaded the fix for this bug in gmx 2019.4 that I used for the simulations in the paper. If the goal is to reproduce my results, then use this fix. A more efficient and safer fix is included in gmx 2021 and newer. Safer because my 2019.4 fix does not catch the case where the excluded pair is separated by more than rlist. Actual rlist is larger than the 1.1 nm requested in the mdp file, because mdrun instead calculates it from verlet-buffer-tolerance and nstlist. When run with GPUs, the remaining error in free energy from this is smaller than uncertainty from repeating the measurement (tested on tyk2). 2021 versions also added a check & a crash for exclusion pairs beyond rlist, so even marginally incorrect results don't accidentally occur. To avoid this crash with 2021, one needs to make sure rlist is larger than the maximum distance within the ligand during the simulation, but still smaller than half the simulation box. Mdrun automatically tunes rlist at start, but if you choose to use gmx 2021, you can force a value with these mdp settings:
A much easier alternative is to use I hope this helps. If not, please let me know. |
Dear Yuriy,
|
Re procedure: I have written my own python script for for the alignment procedure. It may be helpful to illustrate those steps. It assumes particular file names though, so you'd need to edit it if you want to use it with your current folder structure. Plus it relies on a somewhat modified version of pmx ( Meanwhile, the procedure you described relaxes the aligned structures, bringing them closer to the equilibrium distribution of the apo protein + the decoupled ligand + restraints, and (probably) starts the non-equilibrium simulations from the relaxed structures. That would be similar to what was done for the central column of Figure S15 in the SI of the paper. Except there I retained water and ions from the apo trajectory as well as the protein and didn't bother with em/nvt or position restraints, just a short npt simulation to bring the decoupled ligand closer the its equilibrium as driven by the protein-ligand restraints. If you are solvating and equillibrating the apo protein + decoupled ligand system anyway, then you might as well only align the starting structures (eg prot_30.pdb and prot_apo.pdb, not each frame of previous npt) to get the ligand in the right spot in the starting apo structure, run em/nvt_soft/npt, and sample the 165 frames for starting the coupling non-equilibrium simulations from that. This would correspond to the procedure for Figure S14 in the SI. No need to resample the same equilibrium distribution from different starting points. This would also have shorter em than what you are currently doing, as position restraints for em/nvt could be targeted towards the crystal structure, instead of the 298 Kelvin aligned structure. Re stability: One more thing that can help with stability is using gmx genion from Gromacs 2020 and newer. Older versions sometimes inserted ions too close to the protein or even into the middle of one (if there was water it could replace there). When that happened, protein would swell with water the moment position restraints were removed. From 2020 and on genion has a -rmin parameter (default 0.6 nm) that prevents ion insertion too close to the solute. Finally, non-equilibrium simulations have been unstable at times for me. Increasing Cheers, Edit: wording |
Dear Yuriy, |
Dear Fred, Gromacs 2021 already has the bug-fix included in it, so you can use it or newer versions for everything. The only issue is that it will tell you to make the rlist larger to include the whole decoupled ligand in the neighborlist so the bug-fix in it can correctly handle ALL the excluded intramolecular interactions in it. See my post from April 1 for how to set rlist and not have Gromacs override it. The 2019.4-bug-fix didn't have a check for this, allowing for too short rlists and still slightly wrong results for larger ligands. The point I was trying to make yesterday is to use any version with the bug-fix for both equilibria and transitions. In principle small clashes shouldn't be an issue while the ligand is decoupled. Water will move out of the way of the ligand as it gets coupled and the ligand will rotate/deform to get out clashes with the protein unless something prevents it. This involves a lot of energy being dissipated and is responsible for the poor overlap between work distributions, but that isn't going to go away without prohibitively long transitions or manipulation of the potential in the mid-ranges of lambda. The systematic issue here may be the "prevents" bit. The dihedral might not be able to reliably flip back into its bound orientation during the coupling transition, leaving a portion of the transitions in a high energy state, leading to lower backward work values. That's why I'm hoping for the simple solution of the dihedral flip just being an artifact of the bug in the equilibrium simulation of the decoupled ligand in solvent. |
Dear Yuriy, Both equilibrium and transitions were performed using 2019.4-bug-fix. To check if it is the clash reason I did a test run using the ligands conformations directly from holo equilibrium after aligning holo trajectory onto apo trajectory. (files are in the following link)That shoud give a better alignment in the apo pocket. The dG of reverse was inproved by 7 kJ/mol, but still 10 kJ/mol lower than your result. |
Hey Fred, I see several issues here: the ensemble that needs to be superimposed into the active site of the protein must come from the decoupled ligand simulations, because it effectively replaces the actual decoupled+restrained ligand simulations in the protein. Another issue, you cannot just take arbitrary restraints for such a superimposed ensemble. The restraints need to be carefully selected such that an ensemble generated with the chosen restraints would not be distinguishable from the one that you have generated by fitting. For the restraint generation I suggest using our scripts: Yuriy (@zetadin) could you give a brief example how to run the script to create the restraints? Vytas |
Sure. The script is here: postHoc_restraining_python3.py usage:
I also checked on the dihedral flip in the decoupled ligand that is causing poor alignment. I never saw it in 5 repeats with 2019.4-bug-fix and only see it in 1 of 5 repeats with gmx 2019.6, suggesting it is just a rare event. Even with optimal restraints, this dihedral would not be able to reliably flip back into the correct orientation during transitions, and so would be lowering the backwards work values. I would suggest doing multiple repeats of the protocol (we used 5 in the paper) and using the mean as the final dG estimate. This helps averaging stochastic errors both from not fully sampling the equilibria and from the poor overlap in work distributions during transitions. Cheers, |
Dear Prof. Gapsys,
Thanks a lot for your help and explanation! I will try your script.
Best regards
Fred
…________________________________
发件人: Vytautas Gapsys ***@***.***>
发送时间: 2022年4月19日 17:38
收件人: deGrootLab/abs_dG_paper_ChemScience2021 ***@***.***>
抄送: #ZHOU FENG# ***@***.***>; Author ***@***.***>
主题: Re: [deGrootLab/abs_dG_paper_ChemScience2021] Many clashes when aligning the decoupled structures of ligands in solvent into the apo state pocket (Issue #1)
Hey Fred,
I see several issues here: the ensemble that needs to be superimposed into the active site of the protein must come from the decoupled ligand simulations, because it effectively replaces the actual decoupled+restrained ligand simulations in the protein.
Another issue, you cannot just take arbitrary restraints for such a superimposed ensemble. The restraints need to be carefully selected such that an ensemble generated with the chosen restraints would not be distinguishable from the one that you have generated by fitting. For the restraint generation I suggest using our scripts: Yuriy ***@***.***<https://github.com/zetadin>) could you give a brief example how to run the script to create the restraints?
Vytas
―
Reply to this email directly, view it on GitHub<#1 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AJT4DWJA4STAX3FUXZRM4ULVFZ5HXANCNFSM5SD5DF3Q>.
You are receiving this because you authored the thread.Message ID: ***@***.***>
|
Dear Yuriy,
Thank you very much! I wii have a try.
Best regards
Fred
…________________________________
发件人: Yuriy Khalak ***@***.***>
发送时间: 2022年4月19日 21:39
收件人: deGrootLab/abs_dG_paper_ChemScience2021 ***@***.***>
抄送: #ZHOU FENG# ***@***.***>; Author ***@***.***>
主题: Re: [deGrootLab/abs_dG_paper_ChemScience2021] Many clashes when aligning the decoupled structures of ligands in solvent into the apo state pocket (Issue #1)
Sure. The script is here: postHoc_restraining_python3.py<https://github.com/deGrootLab/pmx/blob/abs_dG_workflow/src/pmx/scripts/workflows/postHoc_restraining_python3.py>
usage:
python postHoc_restraining_python3.py -f frame*.gro -n index.ndx -oii ii.itp -odg dg.dat -T 298 -alpha 0.05
* f: aligned trajectory as separate files for each frame. The alignment script outputs them.
* n: index file with candidate anchors atoms. Needs separate groups for ligand and protein.
* oii: output restraints file
* odg: output file with the Boresch-Karplus correction for uncorrelated harmonic restraints.
* T: temperature in K
* alpha: p-value for determining if distributions of restrained degrees of freedom are not Gaussian. If they aren't, they can't be accurately recreated with harmonic B-K restraints and the script will look for other anchors.
I also checked on the dihedral flip in the decoupled ligand that is causing poor alignment. I never saw it in 5 repeats with 2019.4-bug-fix and only see it in 1 of 5 repeats with gmx 2019.6, suggesting it is just a rare event. Even with optimal restraints, this dihedral would not be able to reliably flip back into the correct orientation during transitions, and so would be lowering the backwards work values. I would suggest doing multiple repeats of the protocol (we used 5 in the paper) and using the mean as the final dG estimate. This helps averaging stochastic errors both from not fully sampling the equilibria and from the poor overlap in work distributions during transitions.
Cheers,
Yuriy.
―
Reply to this email directly, view it on GitHub<#1 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AJT4DWIW3YND3W4TIQJOS2LVF2ZSTANCNFSM5SD5DF3Q>.
You are receiving this because you authored the thread.Message ID: ***@***.***>
|
Dear Khalak,
I found there are many clashes when aligning the decoupled structures of ligands in solvent into the apo state pocket. And some of them cannot be solved by just do a minimization and the following calculation cannot continue. Can you please tell me how do you figure out such a problem? For example, in my 165 transition runs, just 60 succeed. I ran a example for cdk2 with ligand 30, but I got a very large dG result (-69.09 kJ/mol) compared to experiment (-42.3 kJ/mol).
The text was updated successfully, but these errors were encountered: