Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

look into making pbsdsh default bootstrap server for recent Intel MPI #82

Closed
boegel opened this issue Oct 12, 2016 · 6 comments
Closed
Labels

Comments

@boegel
Copy link
Member

boegel commented Oct 12, 2016

advantages:

example (with Intel MPI)

mpirun -bootstrap pbsdsh -np 40 ./mpi_helloworld
@boegel boegel added the caylo label Oct 12, 2016
@boegel boegel changed the title look into making pbsdsh (default?) bootstrap server look into making pbsssh (default?) bootstrap server Oct 12, 2016
@boegel
Copy link
Member Author

boegel commented Oct 12, 2016

should actually use the pbsssh wrapper (see bin/pbsssh.sh)

@Caylo
Copy link
Contributor

Caylo commented Dec 14, 2016

the problem seems to be mostly the if/else under this:
https://github.com/hpcugent/vsc-mympirun/blob/develop/lib/vsc/mympirun/mpi/mpi.py#L820 where -rmk is prioritized over the launcher, which shouldn't happen.
(more info on that:
- http://www.failureasaservice.com/2014/04/mpich-and-mvapich-with-torque.html
- https://software.intel.com/en-us/forums/intel-clusters-and-hpc-technology/topic/278096)

HYDRA_LAUNCHER_NAME and HYDRA_LAUNCHER_EXEC are hardcoded into the schedulers, we should look into making this an option e.g. --launcher

also, the mympisanityscript needs some reviewing (see #80) so we can use it to debug/sanity check how processes are pinned

@boegel boegel mentioned this issue Feb 22, 2017
26 tasks
@boegel
Copy link
Member Author

boegel commented Mar 1, 2017

see #110

@boegel boegel changed the title look into making pbsssh (default?) bootstrap server look into making pbsdsh default bootstrap server for recent Intel MPI Mar 22, 2017
@boegel
Copy link
Member Author

boegel commented Mar 22, 2017

#110 is merged, but not perfect, see #114

@Caylo
Copy link
Contributor

Caylo commented May 17, 2017

#114 fixed in #115

@Caylo Caylo closed this as completed May 17, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants