Spack¶
Spack is a package manager that automates the download-build-install process for HPC software. It is especially useful for building and maintaining installations of many different versions of the same software. It also comes with a virtual environment feature that is useful when developing software.
LUMI provides a module to load a pre-configured Spack instance: module load
spack
. When you load this module, you will use a Spack instance configured to
compile software with the Cray programming environment. The software will be
installed in a location determined by you in $SPACK_USER_PREFIX
. This Spack
instance is "chained" to the upstream one in /appl/lumi/spack
, which means
that you can build new packages on top of the already installed ones by the
LUMI User Support Team (similar to how our EasyBuild setup works).
The software installed with Spack in /appl/lumi/spack/ is provided as is.
It may not have received any testing after installation! We also build the software there in a more fool-proof way with slightly less optimizations: Zen2 architecture instead of Zen3, OpenBLAS instead of Cray LibSci, and Netlib Scalapack. This may have a small impact on performance, but is usually fine.
Using Spack on LUMI¶
To install software with Spack, perform the following steps. In this
example, we will install kokkos, a C++ parallel
programming framework, into a hypothetical project storage folder
/project/project_465000XYZ/spack
. We want to configure this package with AMD
GPU support and activate extra array bounds checking for debugging.
-
Initialize Spack.
We recommend that you set
$SPACK_USER_PREFIX
in e.g. your.bash_profile
file to avoid having to set it every time you want to use Spack. -
Check the information Spack has about the package and especially the configuration options:
From reading the package information, it becomes clear that the install command should be:
The flag
+rocm
activates GPU support, and+debug_bounds_check
adds the array bounds checking. We also need to specify the type of GPU:amdgpu_target==gfx90a
(note the double equal signs which has the special meaning of propagating the GPU target to all dependencies). In this case, we give no explicit specification of a compiler, which means that Spack will choose gcc 11.2.0 for us when compiling. -
Before installing, it is good practice to check the dependencies that Spack will install. Sometimes this can be many, many, packages! Running this command can take some time (up to a few minutes):
$ spack spec -I kokkos+rocm+debug_bounds_check amdgpu_target==gfx90a %gcc@12.2.0 Input spec -------------------------------- - kokkos%gcc@12.2.0+debug_bounds_check+rocm amdgpu_target==gfx90a Concretized -------------------------------- - kokkos@4.0.01%gcc@12.2.0~aggressive_vectorization~compiler_warnings~cuda~cuda_constexpr~cuda_lambda~cuda_ldg_intrinsic~cuda_relocatable_device_code~cuda_uvm~debug+debug_bounds_check~debug_dualview_modify_check~deprecated_code~examples~explicit_instantiation~hpx~hpx_async_dispatch~hwloc~ipo~memkind~numactl~openmp~openmptarget~pic+profiling~profiling_load_print~pthread~qthread+rocm+serial+shared~sycl~tests~tuning~wrapper amdgpu_target=gfx90a build_system=cmake build_type=Release generator=make intel_gpu_arch=none std=17 arch=linux-sles15-zen2 [^] ^cmake@3.26.3%gcc@11.2.0~doc+ncurses~ownlibs~qt build_system=generic build_type=Release arch=linux-sles15-zen2 [^] ^curl@8.0.1%gcc@11.2.0~gssapi~ldap~libidn2~librtmp~libssh~libssh2~nghttp2 build_system=autotools libs=shared,static tls=mbedtls arch=linux-sles15-zen2 [^] ^mbedtls@2.28.2%gcc@11.2.0+pic build_system=makefile build_type=Release libs=static arch=linux-sles15-zen2 [^] ^expat@2.5.0%gcc@11.2.0+libbsd build_system=autotools arch=linux-sles15-zen2 [^] ^libbsd@0.11.7%gcc@11.2.0 build_system=autotools arch=linux-sles15-zen2 [^] ^libmd@1.0.4%gcc@11.2.0 build_system=autotools arch=linux-sles15-zen2 [^] ^libarchive@3.6.2%gcc@11.2.0+iconv build_system=autotools compression=bz2lib,lz4,lzma,lzo2,zlib,zstd crypto=mbedtls libs=shared,static programs=none xar=expat arch=linux-sles15-zen2 ...
The packages that are already installed in your own Spack instance will have a
[+]
in the first column, and the packages that Spack found installed upstream will have[^]
. A-
means Spack did not find the package and will build it. In this case, all dependencies are already installed so building a new kokkos package will be fast. -
When you're satisfied with what Spack plans to do, install it:
$ spack install kokkos+rocm+debug_bounds_check amdgpu_target==gfx90a %gcc@12.2.0 [+] /appl/lumi/spack/23.03/0.20.0/opt/spack/mbedtls-2.28.2-os2trz4 [+] /appl/lumi/spack/23.03/0.20.0/opt/spack/zlib-1.2.13-xnce3sf [+] /appl/lumi/spack/23.03/0.20.0/opt/spack/libmd-1.0.4-hihpx7d [+] /appl/lumi/spack/23.03/0.20.0/opt/spack/bzip2-1.0.8-xxbduxx [+] /appl/lumi/spack/23.03/0.20.0/opt/spack/libiconv-1.17-sfn5bie [+] /appl/lumi/spack/23.03/0.20.0/opt/spack/lz4-1.9.4-eelrsw7 [+] /appl/lumi/spack/23.03/0.20.0/opt/spack/lzo-2.10-52o446s [+] /appl/lumi/spack/23.03/0.20.0/opt/spack/xz-5.4.1-wsj4v26 [+] /appl/lumi/spack/23.03/0.20.0/opt/spack/zstd-1.5.5-s3ym3jd [+] /appl/lumi/spack/23.03/0.20.0/opt/spack/libuv-1.44.1-2m6sqkh ==> Installing kokkos-4.0.01-hcnhccbgkddtgeq7wyqvtpvpj6wz76vu ==> No binary for kokkos-4.0.01-hcnhccbgkddtgeq7wyqvtpvpj6wz76vu found: installing from source ==> Fetching https://mirror.spack.io/_source-cache/archive/bb/bb942de8afdd519fd6d5d3974706bfc22b6585a62dd565c12e53bdb82cd154f0.tar.gz ==> No patches needed for kokkos ==> kokkos: Executing phase: 'cmake' ==> kokkos: Executing phase: 'build' ==> kokkos: Executing phase: 'install' ==> kokkos: Successfully installed kokkos-4.0.01-hcnhccbgkddtgeq7wyqvtpvpj6wz76vu Stage: 2.03s. Cmake: 23.47s. Build: 17.93s. Install: 6.86s. Post-install: 2.54s. Total: 1m 1.47s [+] /project/project_465000XYZ/spack/23.03/0.20.0/kokkos-4.0.01-hcnhccb
The final line shows where the software is installed on disk. A module will also be generated automatically and added to your
$MODULEPATH
. The modules are generated with a short hash code (5 characters "nioel" here) to prevent naming collisions.
What to do when a Spack install fails¶
-
Check if the error displayed suggests an easy solution. If there is an error, Spack will point you to an installation log for the particular package. In the same directory, the full build directory can also be found in
/tmp
. Inspecting the output logs from configure or cmake can sometimes be fruitful.Some failures can be avoided by:
- building a different version of the packages
- building with a different compiler (try
%gcc
or%cce
) - disabling a variant of the package
- modifying which dependencies are used to build the target package (see Specs and Dependencies in the official Spack documentation)
In some cases, changes have to made to the
package.py
file of a package. Unfortunately, this is not straightforward as the package repository is located in/appl/lumi
, which is read-only. In such cases, you have to clone to your own Spack instance and configure it using our configuration files. -
Seek help: you can check the official Spack documentation, open a ticket at the LUMI Helpdesk or ask the Spack community via the Spack Slack, the Spack Slack community can be very helpful.
Description of the different Spack modules¶
- Module
spack/23.03-2
: This is Spack release version 0.20.0 based on the Cray Programming Environment 23.03. The ROCM packages are built from source by Spack and corresponds to ROCM release version 5.4.3. Testing has indicated that it should be possible to run ROCM 5.4.3-based software using the older drivers from ROCM 5.2.3, which is installed on the LUMI-G compute nodes. - Module
spack/23.03
: This is Spack release version 0.19.2 based on the Cray Programming Environment 23.03. The ROCM packages are external and comes from the HPE provided ROCM 5.2.3 in/opt/rocm
. - Module
spack/22.08-2
: This is Spack release version 0.19.0 based on the Cray Programming Environment 22.08. The ROCM packages are built from source by Spack and corresponds to ROCM release version 5.2.3. Testing has indicated that it should be possible to run ROCM 5.2.3-based software using the older drivers from ROCM 5.1.3, which is installed on the LUMI-G compute nodes. - Module
spack/22.08
: This is Spack release version 0.18.1 based on the Cray Programming Environment 22.08. The ROCM packages are external and comes from the HPE provided ROCM 5.0.2 in/opt/rocm
(which is rather old). This Spack module is deprecated and should not be used. MPI programs may not work and the linked ROCm is really old and does not exist on the system anymore.
Spack on LUMI (advanced)¶
This section further explains the Spack setup on LUMI.
The upstream Spack instances maintained by the User Support Team
are located in subdirectories of /appl/lumi/spack
, numbered according to the
Cray Programming Environment release version, and Spack release version. For
example:
This is a Spack instance based on Spack release 0.18.1 configured to use the
compilers and MPI libraries from the Cray Programming Environment release
22.08. In general, we will only install one or two versions of Spack per programming
environment. These instances are not meant to be used directly by users as
they are configured to install packages centrally, i.e. spack install
will
fail with permission errors, but you can copy the configuration files from
there if you want to make your own Spack instance.
Instead, for users, there are "fake" chained Spack instances installed
alongside these which are configured with install packages in a user-specified
location. These instances have names which end in -user
, e.g.:
This is what the spack
modules use, and what you should use as user on the
system if you do not want set up your own Spack instance. If you check the etc/spack/config.yaml
inside, you will see that they set the install_tree
property to a value provided by $SPACK_USER_PREFIX
.
config:
build_jobs: 32
install_path_scheme: '{name}-{version}-{hash:7}'
install_tree: $SPACK_USER_PREFIX/22.08/0.18.1
source_cache: $SPACK_USER_PREFIX/22.08/0.18.1/cache
misc_cache: $SPACK_USER_PREFIX/22.08/0.18.1/cache
install_missing_compilers: false
suppress_gpg_warnings: true
A similar trick is used in the modules.yaml
file to install modules in
$SPACK_USER_PREFIX/22.08/0.18.1/modules/tcl
.
modules:
default:
roots:
tcl: $SPACK_USER_PREFIX/22.08/0.18.1/modules/tcl
enable:
- tcl
tcl:
hash_length: 5
projections:
all: '{name}/{version}-{compiler.name}'
If you want to use Spack directly (without module load spack
), you can source
the Spack initialization scripts as usual. They can be found on disk in e.g.:
You just need to make sure that $SPACK_USER_PREFIX
is set and that the
cache
and modules/tcl
subdirectories exist within that directory.
Further reading¶
- Spack documentation
- Spack tutorial
- Spack Source code (especially the package definitions)
- Spack Slack