v0.12.0-rc1
Pre-release
Pre-release
HyperQueue 0.12.0-rc1
New features
Automatic allocation
- #457 You can now specify the idle timeout
for workers started by the automatic allocator using the--idle-timeout
flag of thehq alloc add
command.
Resiliency
- #449 Tasks that were present during multiple
crashes of the workers will be canceled.
CLI
- #463 You can now wait until
N
workers
are connected to the clusters withhq worker wait N
.
Python API
- Resource requests improvements in Python API.
Changes
CLI
-
#477 Requested resources are now shown while
submitting anarray
and while viewing information about taskTASK_ID
of specified
jobJOB_ID
usinghq task info JOB_ID TASK_ID
-
#444 The
hq task list
command will now
hide some details by default, to conserve space in terminal output. To show all details, use the
-v
flag to enable verbose output. -
#455 Improve the quality of error messages
produced when parsing various CLI parameters, like resources.
Automatic allocation
- #448 The automatic allocator will now start
workers in multi-node Slurm allocations usingsrun --overlap
. This should avoid taking up Slurm
task resources by the started workers (if possible). If you run into any issues with usingsrun
inside HyperQueue tasks, please let us know.
Jobs
- #483 There is no longer a length limit
for job names.
Fixes
Job submission
- #450 Attempts to resubmit a job with zero
tasks will now result in an explicit error, rather than a crash of the client.
Artifact summary:
- hq-v0.12.0-rc1-*: Main HyperQueue build containing the
hq
binary. Download this archive to
use HyperQueue from the command line. - hyperqueue-0.12.0-rc1-*: Wheel containing the
hyperqueue
package with HyperQueue Python
bindings.