Should we provide all trajectories having starting timestep to be zero? #195

comcon1 · 2024-06-20T16:38:47Z

Now when we go through the trajectory using MDAnalysis, we use TIMELEFTOUT starting from 0 time. And trajectory can have starting time 120 ns for example. And TIMELEFTOUT 50 means that it will actually skip nothing.
Is that correct behavior?
Should we check then that TIMELEFTOUT is larger than zero timestep?

I offer just checking starting time to be zero inside AddFile script. That will fix at least future trajectories.

ohsOllila · 2024-06-27T18:45:10Z

In which code exactly is this problem?

comcon1 · 2024-06-27T21:48:31Z

I saw that in scattering code, but I suppose it could be in many parts. That is not a problem actually, it's more like an uncertainty in the behavior.

batukav · 2024-06-29T06:42:55Z

I think we can fix this issue by making use of the PREEQTIME. It looks like we're actually not using this variable.

For any trajectory, the first timestep in the analysis should be PREEQTIME + TIMELEFTOUT

In calc_FormFactors.py (actually in all calculation scripts)

Databank/Scripts/AnalyzeDatabank/calc_FormFactors.py

Line 171 in b2a656c

EQtime=float(system['TIMELEFTOUT'])*1000

setting

EQtime=float(system['TIMELEFTOUT'] + system['PREEQTIME])*1000 should do the trick. We'll need to rerun the entire databank, though.

comcon1 · 2024-06-29T12:08:58Z

PREEQTIME can not be the starting point of the trajectory. I can do the equilibration during 300 ns first and then make new MD run starting from zero. Then PREEQTIME I will anyway set to 300 to inform a user that I did equilibrate it for a long. There are three possibilities:

we ask users to always start trajectory from 0
we ask users to always start trajectory from PREEQTIME
we ignore starting time of the trajectory and calculate TIMELEFTOUT from starting trajectory time

That is the choice, I suppose.

batukav · 2024-06-29T12:23:48Z

Your example makes sense. I’d prefer your third suggestion. We could set the Eq time as the (first_time_stamp + TIMELEFTOUT).

…

On 29. Jun 2024, at 15:09, Alexey Nesterenko ***@***.***> wrote: PREEQTIME can not be the starting point of the trajectory. I can do the equilibration during 300 ns first and then make new MD run starting from zero. Then PREEQTIME I will anyway set to 300 to inform a user that I did equilibrate it for a long. There are three possibilities: we ask users to always start trajectory from 0 we ask users to always start trajectory from PREEQTIME we ignore starting time of the trajectory and calculate TIMELEFTOUT from starting trajectory time That is the choice, I suppose. — Reply to this email directly, view it on GitHub <#195 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AB47SNNPBHTAUHEQBTMFWNLZJ2PXDAVCNFSM6AAAAABJUKDKCSVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDCOJYGEZDSMZQG4>. You are receiving this because you commented.

comcon1 · 2024-06-29T12:26:48Z

Ok, let's do the third one. I like it better also. You can assign it to me.

comcon1 · 2024-08-08T07:35:11Z

I found the problem that some of systems cannot be recomputed. For example many of united-atom systems contain definition, which buildh doesn't know. Probably they were added to buildH locally but were never synchronized with buildh github. Probably, there exist other problems so I'm now quite skeptical regarding a possibility of recomputing the entire databank. Probably, I can do this issue without recomputing?

batukav · 2024-08-08T07:57:26Z

Could you please post the output for some of the united-atom systems that you couldn't recompute? As a test, can you recompute the same united-atom system with the code you haven't modified?

comcon1 · 2024-08-08T21:50:19Z

Could you please post the output for some of the united-atom systems that you couldn't recompute? As a test, can you recompute the same united-atom system with the code you haven't modified?

I answered but then I found my problem and deleted last post. I found that I accidentaly broke path to lipid_json_buildh folder which contains dictionaries locally in NMRlipids and that's why I experienced these problems. Now seems that everything is OK. So that was misreporing from my side..

batukav · 2024-08-09T06:40:46Z

okay, that's relieving. so you're done with implementing the change and now just recomputing the systems with the new code, right?

comcon1 · 2024-08-13T10:56:17Z

It's not a problem to fix this issue. It's a problem to recompute the databank. I would like to do it after I'm ready with my current code optimizing and testing. Once I'm ready to start recomputing, I will do this issue. Also, I would probably want to ensure everyone to separate the Data into a GitHub submodule to have an independent commit history for the code and the data. I had a talk about it with @ohsOllila, @fsuarezleston and @markussmiettinen when Samuli was visiting us in Bergen this summer. Note, that databank recomputing will touch huge amount of files. Now, for example, almost all functions writes JSON in more human-readable format (not in one-line), so even having the same numbers, JSONs will change. Probably it's a good time to split up the repository exactly at the point of recomputing.

Probably it's actually better for me to FIX this issue after I make tests, to cover this fix with tests and then just wait until everything will be ready for the recomputing...

batukav · 2024-08-16T06:44:12Z

It makes a lot of sense to separate data and scripts. I'll create a separate issue to discuss this.

comcon1 · 2024-12-03T16:14:36Z

Some story about magic 22000 number for gmx3. It is also related to this problem of zero time step.

Here is the list of the trajectories with GROMACS_VERSION: gromacs3

0a2/272/0a22727a0659852772b4a1193ced99c5981fb739/973cf49703e3217f44b5c18759fd8926e8f5d1f1
219/709/219709a1090be0c8ecc1fc3c6362cdf996c1c997/fdf916cac7bed9521a8d6acd71f12207aa12f6fd
52e/142/52e1425c86de88b82dc50fc6539269b4e73b3a61/cac0f8f72a05076af7c53c7a17bf05f20666c5d7
532/695/5326957d67eb0355d0eccf6aac9d9a44fd984777/48e776c91986eeb05d8d407567423c42bbc6126e
671/199/671199c998a93dcf594469ea33e3b25e73d08b9e/25bf45d5d477c8ae86de3b9c044ead9489d678f6
7ab/2b4/7ab2b45dcff6f8ff34f5f26d8d15fb94f7cf3393/1d42d856ac5622426ba19d7a73030b1a749f817a
7cb/7a3/7cb7a3281b5e79d8e0328d82cdebbcc11f1bab2a/f89b798e078bcc3f9793b20176dbe25ae42778ad
88c/02d/88c02d2e58d6c59918f230e060130570e398071e/32ba461564dfbf8212488be80ddf109e3de23c1f
a69/b1f/a69b1f89e3c9dd265a7ac3009eceb75695de582b/51b9a8cb6e4f8c4f17816bfcc025978b93276e5d
f78/e18/f78e188379aa6f6b1be9f2a8afc8bc2aac191ed0/443765a3de567fd6fe4e8666d29c63e8580e6306
f97/dff/f97dfff6e14f18c3d24bda5b8fdf9d4b4a7ec763/d9fa5dfeae8940ba2ee2faa7784bec5b2d44613a

Do you want me to open an issue about the discussion of supporting these records?

Originally posted by @comcon1 in #201 (comment)

I want to avoid using 22000 magic number and also the gromacs 3 dependency. So yes, maybe an issue can be worth.

I believe that 22000 is just a magic number that once was established for all this trajectories. We actually can't use MDAnalysis or other tools to extract the value of the first frame. What can be done, is application of gmx check on the trajectory and then parsing the output to understand what is the time of the first frame. Then using this time as a parameter for dump. It is not really complicated, but maybe it is too much work for supporting 10 trajectories.

Originally posted by @pbuslaev in #201 (comment)

batukav assigned comcon1 Jun 29, 2024

batukav mentioned this issue Aug 16, 2024

Restructuring the repo #200

Open

comcon1 mentioned this issue Dec 3, 2024

Large refactoring with minimum code changes #201

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Should we provide all trajectories having starting timestep to be zero? #195

Should we provide all trajectories having starting timestep to be zero? #195

comcon1 commented Jun 20, 2024

ohsOllila commented Jun 27, 2024

comcon1 commented Jun 27, 2024

batukav commented Jun 29, 2024 •

edited

Loading

comcon1 commented Jun 29, 2024

batukav commented Jun 29, 2024 via email

comcon1 commented Jun 29, 2024

comcon1 commented Aug 8, 2024

batukav commented Aug 8, 2024

comcon1 commented Aug 8, 2024

batukav commented Aug 9, 2024

comcon1 commented Aug 13, 2024 •

edited

Loading

batukav commented Aug 16, 2024

comcon1 commented Dec 3, 2024

Should we provide all trajectories having starting timestep to be zero? #195

Should we provide all trajectories having starting timestep to be zero? #195

Comments

comcon1 commented Jun 20, 2024

ohsOllila commented Jun 27, 2024

comcon1 commented Jun 27, 2024

batukav commented Jun 29, 2024 • edited Loading

comcon1 commented Jun 29, 2024

batukav commented Jun 29, 2024 via email

comcon1 commented Jun 29, 2024

comcon1 commented Aug 8, 2024

batukav commented Aug 8, 2024

comcon1 commented Aug 8, 2024

batukav commented Aug 9, 2024

comcon1 commented Aug 13, 2024 • edited Loading

batukav commented Aug 16, 2024

comcon1 commented Dec 3, 2024

batukav commented Jun 29, 2024 •

edited

Loading

comcon1 commented Aug 13, 2024 •

edited

Loading