openlava – Hot Resource Manager

HPC systems are designed to be shared by several users, and one way to share them is through a software tool called a resource manager. Openlava is an open source version of the commercial scheduler LSF. It shares the robustness of LSF while being freely available, very scalable, and easy to install and customize.

Grid Engine: Running on All Four Cylinders

Two years after the Oracle acquisition of Sun, Grid Engine is still alive and scheduling jobs.

Gathering Data on Environment Modules

Gathering data on various aspects of your HPC system is a key step toward understanding the system and one of the first steps toward tuning it for performance and reporting on its use. The data can tell you how users are using the system and, at a high level, what they are doing. In this article, I present a method for gathering data on how users are using Environment Modules, such as which modules are being used and how often.

Listing 6: Warewulf – Part 4

Listing 6 for Warewulf – Part 4

Listing 5: Warewulf – Part 4

Listing 5 for Warewulf – Part 4

Listing 4: Warewulf – Part 4

Listing 4 for Warewulf – Part 4

Listing 3: Warewulf – Part 4

Listing 3 for Warewulf – Part 4

Listing 2: Warewulf – Part 4

Listing 2 for Warewulf – Part 4

Listing 1: Warewulf – Part 4

Listing 1 for Warewulf – Part 4

Warewulf Cluster Manager – Part 4

In the last installment of this four-part series on using Warewulf to build an HPC cluster, I focus on administering a Warewulf cluster, particularly basic monitoring and the all-important resource manager.