-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
LammpsLibrary hangs on close #176
Comments
@pmrv It works for me, so I presume it is related to the setup on the cluster @niklassiemer can you comment on this? |
It also doesn't work on a local machine for me. I suppose it might be MPI related? |
I do not have a clue... |
So it does work on a local machine now. It seems the warning I posted above is just a red herring. |
@pmrv Can you try if interfacing with the MPI process directly fixes the issue? import os
import pylammpsmpi
from pympipool.shared import interface_bootup, MpiExecInterface
interface = interface_bootup(
command_lst=["python", os.path.join(os.path.dirname(pylammpsmpi.__file__), "mpi/lmpmpi.py")],
connections=MpiExecInterface(cwd=None, cores=2),
)
interface.send_and_receive_dict(input_dict={"command": "get_version", "args": []})
interface.shutdown(wait=True) |
It still hangs, but in a different location. I took this stack trace after ~30min
Compared to the original stack trace
|
This might also be fixed by pyiron/executorlib#279 |
Another indication that this was caused by the issue in |
So with the latest changes from pympipool it works on the cluster in a python shell, but not in a notebook/lab environment. |
As discussed, running LAMMPS with multiple cores requires LAMMPS with mpi support. You can check this using |
So with the correct lammps the simple examples above seem to work, but interactive pyiron jobs or calphy are still stuck. I have to double check all my versions and then update here. |
So here's a small data point: @srmnitc and I managed to make it work on the cluster with the following env
and |
@pmrv Did you try the new
|
With the latest version, 0.2.13, the interactive lammps sessions work, but don't properly clean up. Running the snippet below never finishes (on the cmti cluster)
I've watched the lmpmpi.py process with top, and it does disappear when
close
is called, but apparently that's not properly communicated back to the foreground process.When I run this snippet on my laptop in a fresh conda environment it hangs similarly, but also prints this warning
The text was updated successfully, but these errors were encountered: