How to find available partitions on ARC

From RCSWiki
Revision as of 22:39, 19 January 2023 by Dmitri (talk | contribs) (Created page with "ARC is a relatively large and very heterogeneous cluster. It has lots of compute nodes and some of these nodes are very different in their hardware specifications and performance capabilities. On ARC, nodes with similar specifications are grouped into SLURM partitions. To use ARC effectively and to its full potential it is important that the researchers who use ARC can find available partitions and see important features of the nodes in those partitions. For this pur...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

ARC is a relatively large and very heterogeneous cluster. It has lots of compute nodes and some of these nodes are very different in their hardware specifications and performance capabilities. On ARC, nodes with similar specifications are grouped into SLURM partitions.

To use ARC effectively and to its full potential it is important that the researchers who use ARC can find available partitions and see important features of the nodes in those partitions.

For this purpose on ARC, a special command, arc.hardware is provided:

$ arc.hardware

Node specifications per partition:
      ================================================================================
           Partition |   Node    CPUs    Memory        GPUs  Node list
                     |  count   /node      (MB)       /node  
      --------------------------------------------------------------------------------
              bigmem |      2      80   3000000              fm[1-2]
                     |      1      40   4127000      a100:4  mmg1
                     |      1      40   8256000      a100:2  mmg2
             cpu2013 |     14      16    120000              h[1-14]
        cpu2017-bf05 |     16      28    245000              s[1-16]
                     |     20      28    188000              th[1-20]
             cpu2019 |     40      40    185000              fc[22-61]
        cpu2019-bf05 |     87      40    185000              fc[1-21,62-127]
             cpu2021 |     17      48    185000              mc[1-11,14-19]
        cpu2021-bf05 |     21      48    185000              mc[23-43]
        cpu2021-bf24 |      7      48    381000              mc[49-55]
             cpu2022 |     52      52    256000              mc[73-124]
        cpu2022-bf24 |     16      52    256000              mc[57-72]
            gpu-a100 |      6      40    515000      a100:2  mg[1-6]
            gpu-v100 |     13      40    753000      v100:2  fg[1-13]
             lattice |    196       8     11800              cn[169-364]
            parallel |    572      12     23000              cn[0513-0544,0557-1096]
                     |      4      12     23000     m2070:2  cn[0553-0556]
              single |    168       8     11800              cn[001-168]
      ================================================================================