ARROW Cluster

From TRACC Wiki
Revision as of 22:54, February 20, 2021 by Amiot (talk | contribs) (→‎ARROW Queues)

Introduction To ARROW

TRACC has combined the hardware from the Phoenix and Zephyr clusters into the ARROW cluster. This consolidation allows TRACC cluster services to be administered efficiently with limited staff. To avoid load-balancing problems, the different types of hardware nodes on ARROW are partitioned and made available through separate queues. When new hardware is installed to expand cluster resources, it will be made available via a new queue. The documentation at Using the Clusters describes procedures for using ARROW.

ARROW is arranged so that a single set of login nodes, a single file system, and a single user home directory serve all of the nodes in all of the queues.

ARROW Queues

Three queues are currently available. Some of them are restricted to particular users, as described below.

  • batch (default queue, with 94 nodes, each node with 16 floating point cores available for general use)
    • 92 nodes have 32 GB of RAM
    • 2 nodes (nodes 1 and 2) have 128 GB of RAM
    • 2 nodes (nodes 3 and 4) have 64 GB of RAM
  • nhtsa (with 12 nodes, each with 28 cores and 64 GB of RAM, only available to the NHTSA project)
  • arrow (one new EPYC server with 64 cores, reserved for testing by TRACC staff or for use by special permission of the TRACC Director)
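
The queue list above can be read as a small lookup table. The following Python sketch is illustrative only: it encodes the per-node figures listed above and filters queues by a job's per-node core and memory needs. The names Queue, QUEUES, and candidate_queues are hypothetical and are not part of any TRACC tool or scheduler interface. The RAM of the arrow server is not stated above, so it is left unknown.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence


@dataclass(frozen=True)
class Queue:
    name: str
    cores_per_node: int
    max_mem_gb: Optional[int]  # largest per-node RAM listed; None if not stated
    restricted: bool           # True if the queue is limited to certain users


# Figures taken from the queue list above (hypothetical encoding).
QUEUES = [
    Queue("batch", 16, 128, False),  # most nodes have 32 GB; a few have 64 or 128 GB
    Queue("nhtsa", 28, 64, True),    # NHTSA project only
    Queue("arrow", 64, None, True),  # TRACC staff testing or special permission
]


def candidate_queues(cores: int, mem_gb: int = 0,
                     allowed: Sequence[str] = ()) -> List[str]:
    """Return names of queues where one node could satisfy the request.

    `allowed` lists the restricted queues the user may use.
    A memory request cannot be confirmed for a queue whose RAM is unknown,
    so such a queue is skipped whenever mem_gb > 0.
    """
    result = []
    for q in QUEUES:
        if q.restricted and q.name not in allowed:
            continue
        if cores > q.cores_per_node:
            continue
        if mem_gb > 0 and (q.max_mem_gb is None or mem_gb > q.max_mem_gb):
            continue
        result.append(q.name)
    return result


if __name__ == "__main__":
    print(candidate_queues(16, 32))
    print(candidate_queues(28, 64, allowed=("nhtsa",)))
```

The memory check uses only the largest node size in each queue. A job in the batch queue that needs more than 32 GB per node would still have to be directed to the larger-memory nodes, using whatever procedure Using the Clusters describes.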