@@ -6,6 +6,7 @@ version 2.6.0:
6
6
7
7
- [oarsh_shell] Add support for cgroupv2/systemd and use hwloc
8
8
- [job_resource_manager] Add support for cgroupv2/systemd and use hwloc
9
+ - [oarcgde] Add the oarcgdev tool to manage devices blacklist in cgroupv2
9
10
- [suspend_resume_manager] Add support for cgroupv2/systemd
10
11
- [Monika] Fix display of wanted resources
11
12
- [all] Fix for Perl exporter (Perl 5.39.1)
@@ -14,27 +15,28 @@ version 2.6.0:
14
15
- [IO.pm] Fix issue with resource_log
15
16
- [IO.pm/MetaSched] Add functions to allow scheduling perf statistics
16
17
- [oaraccounting] Now use a systemd timer
17
- - [pam_adopt_oar] Add pam script to enable ssh connection to jobs
18
- - [oar_resources_init] Add support to use hwloc to list resources
18
+ - [pam_adopt_oar] Add PAM script to enable ssh connection to jobs
19
+ - [oar_resources_init] Now use hwloc to find resources on nodes
19
20
- [oar_resources_add] Add support for resources in the hwloc format
20
- - [man] Move administrator's commands to section 8
21
+ - [man] Move administrator's commands to section 8 and some rewrittings
21
22
- [cron] Provide both cron and systemd timers (oaraccounting, oarnodecheck)
22
23
- [oarnodecheck] Add support of cgroupv2/systemd, refactor
23
24
24
25
Starting from OAR 2.6, a new mechanism using cgroupv2/systemd and hwloc is in
25
- place to map OAR resources do physical compute resources (cores, GPUs) on
26
- nodes .
26
+ place to map OAR resources do physical compute resources (cores, GPUs) on nodes
27
+ and manage the processes of jobs .
27
28
28
29
As a consequence, on an existing installation, the cpuset property of the OAR
29
- resources will need some changes to adopt the hwloc description of the
30
- machines. The oar_resource_init script can be used to look at the expected
31
- values for the cpuset resource property.
30
+ resources may need some changes to adopt the hwloc description of the machines.
31
+ The oar_resource_init script can be used to look at the expected values for the
32
+ cpuset resource property.
32
33
33
34
Regarding the gpudevice resource property: it must now contain the list of the
34
35
special devices (e.g.: /dev/nvidia0) associated to a resource.
35
36
36
- Finally, cgroupv1 activation is not required anymore. Related directives in the
37
- kernel command line can now be removed (on OAR nodes).
37
+ Finally, cgroupv1 activation is not need anymore, only the unified cgroupv2
38
+ file hierarchy is required now. Thus, cgroupv1 related directives in the
39
+ kernel command line can be removed on OAR nodes.
38
40
39
41
version 2.5.10:
40
42
---------------
@@ -386,7 +388,7 @@ version 2.5.3:
386
388
--------------
387
389
- Add the "Name" field on the main Monika page. This is easier for the users
388
390
to find there jobs.
389
- - Add MAX_CONCURRENT_JOB_TERMINATIONS into the oar.conf ofthe master. This
391
+ - Add MAX_CONCURRENT_JOB_TERMINATIONS into the oar.conf of the master. This
390
392
limits the number of concurrent processes launched by the Almighty when the
391
393
the jobs finish.
392
394
- Bug fix in ssh key feature in oarsub.
@@ -395,7 +397,7 @@ version 2.5.3:
395
397
security fixes, add of new configuration options, listing of the scheduled
396
398
nodes into jobs, fixed bad reinitialization of the limit parameter,
397
399
stress_factor, accounting...
398
- See OAR-DOCUMENTATION-API-USER for more informations .
400
+ See OAR-DOCUMENTATION-API-USER for more information .
399
401
- CGROUP: handle cgroup hierarchy already mounted by the OS like in Fedora 18
400
402
(by systemd in /sys/fs/cgroup) in job_resource_manager_cgroups.pl.
401
403
- Bug fix oar-database: fix the reset function for mysql.
@@ -419,7 +421,7 @@ version 2.5.3:
419
421
420
422
$jobproperties_applied_after_validation = "maintenance='off'";
421
423
422
- So, even if all the ressources have "maintenance='on'", the new jobs will be
424
+ So, even if all the resources have "maintenance='on'", the new jobs will be
423
425
accepted but not scheduled now.
424
426
- Add the oardel option --force-terminate-finishing-job: to use when a job is
425
427
stuck in the Finishing state.
@@ -444,7 +446,7 @@ version 2.5.2:
444
446
- Bug #13939: the job_resource_manager.pl and job_resource_manager_cgroups.pl
445
447
now deletes the user files in /tmp, /var/tmp and /dev/shm at
446
448
the end of the jobs.
447
- - Bugfix: in oardodo.c, the preprocessed variables was not defined correclty .
449
+ - Bugfix: in oardodo.c, the preprocessed variables was not defined correctly .
448
450
- Finaud: fix race condition when there was a PINGCHECKER error jsut before
449
451
another problem. The node became Alive again when the PINGCHECKER said OK
450
452
BUT there was another error to resolve.
@@ -498,7 +500,7 @@ version 2.5.1:
498
500
if plenty of jobs have to be launched at the same time.
499
501
- Fix problem when deleting a job in the Suspended state (oarexec was stopped
500
502
by a SIGSTOP so it was not able to handle the delete operation)
501
- - Make the USER_SIGNAL feature of oardel multi job independant and remove the
503
+ - Make the USER_SIGNAL feature of oardel multi job independent and remove the
502
504
temporary file at the end of the job
503
505
- Monika: display if the job is of timesharing type or not
504
506
add in the job listing the initial_request (is there a reason to
@@ -535,7 +537,7 @@ version 2.5.1:
535
537
{"resource":"/cpu=1", "command":"sleep", "param_file":"60\n90\n30"}
536
538
537
539
This submits 3 sleep jobs with differents sleep values.
538
- - Remove any reference to gridlibs and gridapi as these components are obselete
540
+ - Remove any reference to gridlibs and gridapi as these components are obsolete
539
541
- Add stdout and stderr files of each job in oarstat output.
540
542
- API now supports fastcgi (big performance raise!)
541
543
- Add "-f" option to oarnodesetting to read hostnames from a file.
@@ -553,7 +555,7 @@ version 2.5.1:
553
555
- Implement a new script 'oar-database' to manage the oar database.
554
556
oar_mysql_init & oar_psql_init are dropped.
555
557
- Huge code reorganisation to allow a better packaging and system integration
556
- - Drop the oarsub/oarstat 2.3 version that was kept for compatiblity issues
558
+ - Drop the oarsub/oarstat 2.3 version that was kept for compatibility issues
557
559
during the 2.4.x branch.
558
560
- By default the oar scheduler is now
559
561
'oar_sched_gantt_with_timesharing_and_fairsharing' and the following values
@@ -912,7 +914,7 @@ version 2.2:
912
914
the whole node is assigned then it kills all jobs from the user if all cpus
913
915
are assigned.
914
916
- the same thing can be done with cpuset_manager_PAM.pl as the script used to
915
- configure the cpuset. More efficent if cpusets are configured.
917
+ configure the cpuset. More efficient if cpusets are configured.
916
918
- debug cm_availability feature to switch on and off nodes automatically
917
919
depending on waiting jobs.
918
920
- reservations now take care of cm_availability field
@@ -1035,13 +1037,13 @@ version 2.0.0:
1035
1037
- Add wait() in pingchecker to avoid zombies.
1036
1038
- Better code modularization.
1037
1039
- Remove node install part to launch jobs. So it is easier to upgrade from
1038
- one version to an other (oarnodesetting must already be installed on each
1040
+ one version to another (oarnodesetting must already be installed on each
1039
1041
nodes if we want to use it).
1040
1042
- Users can specify a method to be notified (mail or script).
1041
1043
- Add cpuset support
1042
1044
- Add prologue and epilogue script to be executed on the OAR server before
1043
1045
and after launching a job.
1044
- - Add dependancy support between jobs ("-a" option in oarsub).
1046
+ - Add dependency support between jobs ("-a" option in oarsub).
1045
1047
- In oarsub you can specify the launching directory ("-d" option).
1046
1048
- In oarsub you can specify a job name ("-n" option).
1047
1049
- In oarsub you can specify stdout and stderr file names.
0 commit comments