HPC/Module naming scheme 2016
Introduction
(*) Environment modules are the means by which software applications are made available to users. Using modules allows users and administrators to pick, by user or even by compute job, a desired or default version of an application from typically several current and historic versions persisting on the system. Older versions are kept to improve reproducibility of results, an important characteristic of scientific computing. |
On Carbon, the naming scheme for environment modules* and the set of default modules being loaded for users has changed. This step was necessary because of increasing diversity and interdependence of applications, their modules, and the underlying operating system.
The goal of the changed naming scheme has been to:
- expand the schemes for (a) installation directories and (b) module names to enable addressing and chosing versions by differing:
- OS release,
- compilers,
- MPI flavors, and,
- for future evolution: different aspects of the machine architecture like CPU generation, capabilities, and coprocessor facilities,
- accommodate and enable existing application versions to run on the newer OS release,
- install and make available new application versions where suitable, either on both old and newer OS releases, or only one of them.
Review the detailed changes in module conventions.
New-style module names
You may need to adapt the module names that you placed in your shell startup and job files to the new more hierarchical scheme. For most modules, with exceptions show below, the leading name component (the part before any "/") is the same in the old and new naming schemes. What always differs are the name parts after the first slash.
Name change rules
- To use the latest or automatically selected version of a package, remove version numbers from old-style module names of the form
packagename/version
, leaving onlypackagename
. This is the recommended approach, as you will automatically benefit from future updates and maintenance builds. - To insist on a specific version for a package in new style names:
- Inspect the available flavors and versions (some older modules were not migrated):
module avail packagename
.
- Choose the new-style name up to the desired specificity. You may leave out trailing name or directory parts.
- For instance, instead of
vasp5/5.3/openmpi-1.4/intel/5.3.3p3-mkl-3
you may writevasp5/5.3/openmpi-1.4
orvasp5/5.3
, letting the system choose the versions for MPI and compiler that are chosen as defaults at a given time.
- For instance, instead of
- Inspect the available flavors and versions (some older modules were not migrated):
Renamed modules
For the following modules the newer naming convention allowed for and thus uses more consistent names:
OLD NEW
-------------------------------------
asap3 asap/3
ase2 ase/2
ase3 ase/3
g09 gaussian/09
GaussView gaussview (lowercase)
- The modules
fftw3
andvasp5
did not change name due to more entrenched usage in the package itself, Unix group names, and compilation dependencies.
Migration guide
Or: How to customize your module selection by OS release.
Configure in .bashrc
To switch over to new-style module names entirely, on both CentOS releases, you could continue using only ~/.bashrc
.
To do this:
- Tell CentOS-5 nodes to offer you the new-style module catalog instead of the old one. To do so, simply create an empty customization file:
touch ~/.modules-2
- Then, apply the changes shown in the #Name change rules section above:
vi .bashrc
# or:
nano .bashrc
- Use a text editor of your choice, such as
nano
orvi
.
Configure in dedicated files
It is perhaps cleaner to perform the module selection in files dedicated for each CentOS release, so that you can make adjustments independently.
To migrate your existing configuration, manage the following files:
.bashrc .modules-1 .modules-2
Use a helper application to get you started:
modules-migrate
- Test – same as in previous section.
Minor changes for the module command
Name completion in command line
When working interactively in a terminal, you can use a "completion" feature of the Bash shell to complete a partially typed module name and show all names available for the name typed so far. For example:
At a shell prompt (shown as "$"), type:
$ module load fft
Press the <TAB>
key and the name will be expanded to fftw3/
and you'll see two possible completing names, with the cursor waiting at the end of the longest common substring:
$ module load fftw3/_ fftw3/3.3/impi-5/intel-16/3.3.4-10 fftw3/3.3/openmpi-1.10/intel-16/3.3.4-11 fftw3/3.3/intel/3.3.2-1 fftw3/3.3/openmpi-1.4/intel/3.3.2-4
Type the letter o
, hit the <TAB>
key again. The choices will be narrowed down to OpenMPI.
$ module load fftw3/3.3/openmpi-1.<TAB> fftw3/3.3/openmpi-1.10/intel-16/3.3.4-11 fftw3/3.3/openmpi-1.4/intel/3.3.2-4
Typing the digit 1
will pick the 1.10
MPI version, at which the then remaining single module name choice will be completed, with the cursor waiting after an additional space character:
$ module load fftw3/3.3/openmpi-1.10/intel-16/3.3.4-11 _
"module purge" command
Previously, the Carbon queueing system was made available to users by modules, which meant that it was difficult for a user to start with a clean slate, or to make and adjust a custom set of module choices. You may now safely use the module "purge" command for its intended purpose:
module purge
followed by module load …
to choose compilers, MPI flavors, and applications.
Determining default module versions
The module avail
command under CentOS-6 no longer includes the marker "(default)"
when one has been set in a .version
file.
I am not sure if this is a bug or by design, but the change certainly makes the output more consistent.
To determine which module will be loaded when an abbreviated name is used, I recommend to inspect the first relevant line in the output of one of these commands:
module show name module help name