-
Notifications
You must be signed in to change notification settings - Fork 144
Conference call notes 20210217
Kenneth Hoste edited this page Mar 3, 2021
·
5 revisions
(back to Conference calls)
Notes on the 167th EasyBuild conference call, Wednesday February 17th 2021 (09:00 UTC)
Alphabetical list of attendees (12):
- Simon Branford (University of Birmingham, UK)
- Miguel Dias Costa (National University of Singapore)
- Alexander Grund (TU Dresden, Germany)
- Kenneth Hoste (HPC-UGent, Belgium)
- Adam Huffman (Big Data Institute, Oxford, UK)
- Terje Kvernes (University of Oslo, Norway)
- Kurt Lust (Univ. of Antwerp, Belgium + LUMI User Support Team)
- Robert Mijakovic (LuxProvide)
- Mikael Öhman (Chalmers University of Technology, Sweden)
- Alan O'Cais (JSC, Germany)
- Jurij Pečar (EMBL, Germany)
- Jörg Saßmannshausen (NIHR Biomedical Research Centre, UK)
- PyTorch benchmarking results (conda vs EasyBuild) at EMBL
- update on recent developments + outlook to next release
- Q&A
- https://github.com/constantinpape/3d-unet-benchmarks#embl-cluster-results
- improve speedups for PyTorch 1.7.1 installation with EasyBuild vs conda installations
- 4x-6x faster!
- across a range of GPUs (1080i, V100, A100, ...)
- smaller speedup for 1080i (no Tensor cores)
- is input data available so others can try to reproduce these results?
- Alexander: has startup overhead been taken into account?
- Does conda have binaries for the targeted GPUs?
- PTX intermediate format would imply large overhead for JiT compilation
- Mikael: workload could be relying heavily on CPU, so conda installing generic binaries could have big impact
- next release (v4.3.3):
before EUM'21?during EUM week?(still) hopefully next week...- project keeping track of "blockers" for EasyBuild v4.3.3: https://github.com/orgs/easybuilders/projects/11
- recent changes
-
framework
- bug fixes
-
enhancements
- use https://sources.easybuild.io as fallback source URL (PR #3572, PR #3576)
- add toolchain definition for
iibff
toolchain (PR #3574)- Intel compilers, Intel MPI, BLIS + libFLAME + FFTW
-
changes
- (none)
-
easyblocks
-
bug fixes
- pass down compilation flags from build environment for ESMF (PR #2325)
-
enhancements
- add support for skipping steps in Python packages installed as extension + print progress on individual steps for installing Python packages as extensions (PR #2290)
- updates for TensorFlow easyblock w.r.t. optional feature support & running CPU/GPU tests (PR #2314, PR #2312)
- update OpenFOAM easyblock for v2012 (PR #2321)
-
new software
- (none)
-
changes
- (none)
-
bug fixes
-
easyconfigs
- ~50 merged easyconfig PRs since last conf call!
-
bug fixes
- Remove duplicate extensions in R 3.5.x easyconfigs, and add test to detect such issues (PR #12059)
- add gnuplot dependency for OpenFOAM from v2.4.0 to v6 (PR #11801)
- fix source URL in UDUNITS easyconfigs + add sources.easybuild.io fallback source URL (PR #12156)
- add additional patches for PyTorch 1.7.1 to fix failing tests (PR #12147)
- enhancements
-
new software
- FIRESTARTER (PR #12160)
- software updates
- changes
-
framework
- to merge/fix/tackle soon
-
framework (v4.3.3 milestone)
-
bug fixes
- (BLOCKER for v4.3.3) Fix UTF-8 encoding errors when running EasyBuild with Python 3.0.x-3.6.x (PR #3565)
- Performance improvements for easyconfig parsing (PR #3555)
- Avoid module use in Lmod if possible to allow faster execution (PR #3557)
- Avoid metadata greedy behaviour (mostly relevant for Cray systems) (PR #3559)
- enhancements
-
bug fixes
-
easyblocks (v4.3.3 milestone)
-
bug fixes
- correctly determine path to active binutils in TensorFlow easyblock (PR #2218) [EESSI, RPATH]
- fix taking into account
--sysroot
when installing/using CMake [EESSI] (PR #2247 or PR #2248, latter is best option?) - treat files/directories of unpacked sources equally in
PackedBinary
generic easyblock (PR #2306) - make sure the installation of
libiberty.a
in the binutils easyblock goes into a populated directory (PR #2308) - Fix for building GCC with
--sysroot
on ppc64le (PR #2315) - RPackages: Change to using
R_LIBS_SITE
when installing RPackages (PR #2326)/pull/2325)) - added regex to replace
/lib/cpp
withcpp
in OpenFOAM's wmake rules file (PR #2331)
-
enhancements
- enhance OpenBLAS easyblock to make it aware of
optarch
(PR #1946)- needs testing...
- problem with numpy installed on Arm 64-bit with
--optarch=GENERIC
may be related, see https://github.com/EESSI/software-layer/issues/55
- (NICE TO HAVE for v4.3.3) update impi easyblock for impi 2021.x (oneAPI) (PR #2313)
- enhance test and install step of
CMakePythonPackage
easyblock (PR #2318) - pass
$CXXFLAGS
to PDT's configure script via '-useropt' (PR #2324) - Add sanity check commands to GCC (including LTO support) (PR #2322)
- Add support for including PTX code in PyTorch (PR #2328)
- enhance OpenBLAS easyblock to make it aware of
-
changes
- (nothing)
-
new software
- add custom easyblock for:
-
(NICE TO HAVE for v4.3.3) new generic easyblock for Intel oneAPI compilers (PR #2305)
- using
intel-compilers
as software name, includingicc
/icpc
/ifort
(classic compilers) +icx
/icpx
/ifx
/dpcpp
(oneAPI compilers)
- using
-
bug fixes
-
easyconfigs (v4.3.3 milestone)
- bug fixes
-
enhancements
- (nothing major?)
-
new software
-
(NICE TO HAVE FOR v4.3.3) Intel oneAPI
- there will be no further updates to Intel Parallel Studio (apparently)
-
intel-compilers
v2020.1 (PR #11982)
-
(NICE TO HAVE FOR v4.3.3) Intel oneAPI
- software updates
-
framework (v4.3.3 milestone)
- Mikael: Has somebody played with MLperf
- Robert: PRs coming in, will start slowly
- Jörg: looking into installing old software (EPACT), have to use foss/2016b
- need to get latest R installed as well
- is it worth the effort for contributing this back?
-
toolchainopts = {'cstd': 'c++98'}
, maybe also disable-Werror
?
- BLIS evaluation follow-up call tomorrow (Thu Feb 18th 2021 at 2.15 CET)
- Mikael: does BLIS at GCCcore imply we can move more stuff to GCCcore level?
- Mikael: NCCL is actually open-source?
- see https://github.com/NVIDIA/nccl
- we should be building it from source instead?
- prevents the need for registering to download the pre-built versions
- see also tests: https://github.com/nvidia/nccl-tests