Oss517 #356
Codecov Report
@@ Coverage Diff @@
## dashboard #356 +/- ##
=============================================
+ Coverage 90.00% 90.11% +0.11%
=============================================
Files 60 60
Lines 3710 3802 +92
=============================================
+ Hits 3339 3426 +87
- Misses 371 376 +5
51a2630 to edda5d7
9cf4e11 to 197e75e
remove demo exp creation
One last thing to consider, and for the sake of time I'm going to request this be kicked into a new ticket/PR! It looks like unmanaged tasks need to be launched through the

For testing, consider the following driver script:

from smartsim import Experiment


def main() -> int:
    slurm = "slurm"      # <-- I believe this problem exists for any WLM
    open_mpi = "mpirun"  #     and run command combo that produces an
                         #     unmanaged step
    exp = Experiment("my-exp", launcher=slurm)
    rs = exp.create_run_settings("echo", ["hello", "world"],
                                 run_command=open_mpi)
    rs.set_tasks(1)
    model = exp.create_model("my-model", run_settings=rs)
    exp.generate(model, overwrite=True)
    exp.start(model, block=True)  # <-- Does not look like any start or
                                  #     stop jsons are produced bc the
                                  #     unmanaged indirect is never invoked
    print("Done")
    return 0


if __name__ == "__main__":
    raise SystemExit(main())

While a
Added a follow-up ticket to give this the love it deserves.
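For whoever picks up the follow-up ticket, a quick way to confirm the symptom from the driver script above might be to scan the experiment output directory for start/stop JSON files after the run. This is only a rough sketch: the thread does not say where those files are written or what they are called, so matching on "start" or "stop" in the file name is an assumption, and `find_event_files` is a hypothetical helper, not part of SmartSim.

```python
import pathlib
import tempfile
import typing as t


def find_event_files(root: pathlib.Path) -> t.Dict[str, t.List[pathlib.Path]]:
    """Collect JSON files under `root` whose names mention "start" or "stop".

    Assumption: the telemetry events land as *.json files with "start" or
    "stop" somewhere in the file name. Adjust the match to the real layout.
    """
    found: t.Dict[str, t.List[pathlib.Path]] = {"start": [], "stop": []}
    for path in sorted(root.rglob("*.json")):
        for kind in found:
            if kind in path.stem.lower():
                found[kind].append(path)
    return found


if __name__ == "__main__":
    # Demo on a throwaway directory that stands in for an experiment folder.
    with tempfile.TemporaryDirectory() as tmp:
        root = pathlib.Path(tmp)
        (root / "my-model").mkdir()
        (root / "my-model" / "start.json").write_text("{}")
        events = find_event_files(root)
        print(len(events["start"]), len(events["stop"]))
```

Pointing the helper at the directory produced by the driver script and getting two empty lists back would match the behaviour described above, where no start or stop JSONs are written for the unmanaged step.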
Looks amazing and seems to be working great! I left a couple suuuupppeer pedantic change requests and some notes for either you/I to write up some tickets for some known defects that we want to address, but nothing worth holding up approval over!
Otherwise LGTM!! Thanks for all of the hard work on this!!
  def _create_batch_job_step(
-     self, entity_list: t.Union[Orchestrator, Ensemble, _AnonymousBatchJob]
+     self,
+     entity_list: t.Union[Orchestrator, Ensemble, _AnonymousBatchJob],
+     telemetry_dir: pathlib.Path,
Same "don't love, but good enough for now" concern with param diving the telemetry_dir: @MattToast and/or @ankona remember to ticket this as well!!
+1
No description provided.