List of Computational Nodes

NameCPUCores per socketSocketsCPU CoresReal MemoryGPU (gres)GPU Count
node[01-14]Intel Xeon Gold 6226R16232770000
node[15-32]Intel Xeon Gold 6226R16232385000
gpu01Intel Xeon Gold 6226R16232385000tesla_a1004
gpu02Intel Xeon Gold 6226R16232770000tesla_v100S4
gpu03Intel Xeon Gold 6226R16232770000tesla_v100S2
gpu04Intel Xeon Gold 6226R16232385000tesla_v100S2
gpu05Intel Xeon Gold 6226R16232385000tesla_v100S2
gpu06Intel Xeon Gold 622612224385000tesla_v1004
gpu07Intel Xeon Gold 622612224385000tesla_v1003
gpu08Intel Xeon Gold 622612224385000tesla_t42
gpu09Intel Xeon Gold 622612224386000tesla_v1004
highmem01Intel Xeon Gold 6240L182362984000
highmem02Intel Xeon Gold 6226124482984000
gpa[401-410]Intel Xeon Gold 63386421281024000tesla_a1004
gpua[801-805]Intel Xeon Gold 63386421281024000nvidia_a100_804
gpuh[801-802]AMD EPYC 93346421281024000nvidia_h100_804
gpugh01Grace Superchip721721200000nvidia_gh2001
(note: "tesla_a100" is used to denote A100-SXM-40 and A100-PCIe-40 cards)

See also the spreadsheetopen in new window

Login Nodes

login[1-2]

Summary: 2 x ACTserv x1210open in new window (32-cores, 192GB, 480GB SSD)

  • Processor: 2x Intel 16-Core Xeon Gold 6226R 2.9GHz – 150W
  • Memory: 192GB – 12x 16GB DDR4 2933MHz
  • Storage: 2x 240GB SATA 2.5″ solid state drives

top

CPU Nodes

node[01-14]

Summary: 14 x Dual Socket Xeon SP (32-cores, 768GB)

  • Processor: 2x Intel 16-Core Xeon Gold 6226R 2.9GHz – 150W
  • Memory: 768GB – 12x 64GB DDR4 2933MHz
  • Storage: 256GB M.2 NVMe solid state drives

node[15-32]

Summary: 18 x Dual Socket Xeon SP (32-cores, 384GB)

  • Processor: 2x Intel 16-Core Xeon Gold 6226R 2.9GHz – 150W
  • Memory: 384GB – 12x 32GB DDR4 2933MHz
  • Storage: 256GB M.2 NVMe solid state drives

highmem01

Summary: ACTserv x1210open in new window (36-cores, 3TB)

  • Processor: 2x Intel 18-Core Xeon Gold 6240L 2.6GHz – 150W
  • Memory: 3TB – 24x 128GB DDR4 2933MHz
  • Storage: 256GB M.2 NVMe solid state drives

highmem02

Summary: ACT SYS-2049U-TR4 (24-cores, 3TB)

  • Processor: 4x Intel 12-Core Xeon Gold 6226 2.7GHz – 125W
  • Memory: 3TB – 48x 64GB DDR4 2933MHz
  • Storage: 240GB solid state drives

top

GPU Nodes

GPU acceleration adds 432TF of SFLOP performance and 798GB of global memory from 27 GPUs.

gpu01

Summary: ACTserv x2280copen in new window (32-cores, 384GB)

  • GPU: 4x NVIDIA Tesla A100 PCI-E Passive Single GPU each with 40GB of GDDR5 memory
  • Processor: 2x Intel 16-Core Xeon Gold 6226R 2.9GHz – 150W
  • CPU Memory: 384GB – 12x 32GB DDR4 2933MHz
  • Storage: 6x 2.5″ SATA hotswap, 2x U.2 NVMe drive bays
  • Note: Each pair of GPUs are connected via NVLink

gpu02

Summary: ACTserv x2280copen in new window (32-cores, 768GB)

  • GPU: 4x NVIDIA Tesla V100S PCI-E Passive Single GPU each with 32GB of GDDR5 memory
  • Processor: 2x Intel 16-Core Xeon Gold 6226R 2.9GHz – 150W
  • CPU Memory: 768GB – 12x 64GB DDR4 2933MHz
  • Storage: 6x 2.5″ SATA hotswap, 2x U.2 NVMe drive bays

gpu03

Summary: ACTserv x2280copen in new window (32-cores, 768GB)

  • GPU: 2x NVIDIA Tesla V100S PCI-E Passive Single GPU each with 32GB of GDDR5 memory
  • Processor: 2x Intel 16-Core Xeon Gold 6226R 2.9GHz – 150W
  • CPU Memory: 768GB – 12x 64GB DDR4 2933MHz
  • Storage: 6x 2.5″ SATA hotswap, 2x U.2 NVMe drive bays

gpu[04-05]

Summary: 2 x ACTserv x2280copen in new window (32-cores, 384GB)

  • GPU: 2x NVIDIA Tesla V100S PCI-E Passive Single GPU each with 32GB of GDDR5 memory

  • Processor: 2x Intel 16-Core Xeon Gold 6226R 2.9GHz – 150W

  • CPU Memory: 384GB – 12x 32GB DDR4 2933MHz

  • Storage: 6x 2.5″ SATA hotswap, 2x U.2 NVMe drive bays

gpu06

Summary: ACTserv x4170copen in new window (24-cores, 384GB)

  • GPU: 4x NVIDIA Tesla V100 PCI-E Passive Single GPU each with 32GB of GDDR5 memory
  • Processor: 2x Intel 12-Core Xeon Gold 6216 2.6GHz – 125W
  • CPU Memory: 384GB – 12x 32GB DDR4 2666MHz

gpu07

Summary: ACTserv x4170copen in new window (24-cores, 384GB)

  • GPU: 3x NVIDIA Tesla V100 PCI-E Passive Single GPU each with 32GB of GDDR5 memory
  • Processor: 2x Intel 12-Core Xeon Gold 6216 2.6GHz – 125W
  • CPU Memory: 384GB – 12x 32GB DDR4 2666MHz

gpu08

Summary: ACTserv x4170copen in new window (24-cores, 384GB)

  • GPU: 2x NVIDIA Tesla T4 PCI-E Passive Single GPU each with 15GB of GDDR6 memory
  • Processor: 2x Intel 12-Core Xeon Gold 6216 2.6GHz – 125W
  • CPU Memory: 384GB – 12x 32GB DDR4 2666MHz

gpu09

Summary: ACTserv x4170copen in new window (24-cores, 384GB)

  • GPU: 4x NVIDIA Tesla V100 PCI-E Passive Single GPU each with 16GB of GDDR5 memory
  • Processor: 2x Intel 12-Core Xeon Gold 6216 2.6GHz – 125W
  • CPU Memory: 384GB – 12x 32GB DDR4 2666MHz

top

gpua[401-412]

  • GPU: 4x NVIDIA A100-SXM-40 each with 40GB of HBM memory - modular cards with NVLINK
  • Processor: 2x Intel 32-Core Xeon Gold 6338
  • CPU Memory: 1TB TruDDR4 3200MHz
  • Internal storage: 6.4TB NVMe
  • Chassis: Lenovo SR670 V2

top

gpua[801-805]

  • GPU: 4x NVIDIA A100-SXM-80 each with 80GB of HBM memory - modular cards with NVLINK
  • Processor: 2x Intel 32-Core Xeon Gold 6338
  • CPU Memory: 1TB TruDDR4 3200MHz
  • Internal storage: 6.4TB NVMe
  • Chassis: Lenovo SR670 V2

top

gpuh[801-802]

  • GPU: 4x NVIDIA H100-SXM-80 each with 80GB of HBM memory - modular cards with NVLINK
  • Processor: 2x Intel 32-Core AMD EPYC 9334
  • CPU Memory: 1TB TruDDR4 3200MHz
  • Internal storage: 6.4TB NVMe
  • Chassis: Lenovo SR675 V3

top

gpugh01

  • Chassis: QCT S74G GH200 Grace Hopper system
  • GPU: H100-SXM-96 with 96GB of HBM memory
  • Processor: Grace CPU with 72 Arm Neoverse V2 cores with an NVSWITCH connecting to the Hopper H100 GPU
  • Memory: 480GB LPDDRX memory 96GB HBM3 GPU accelerator

top

Gateway Nodes

Summary: 4 x ACTserv x1210open in new window (32-cores, 192GB)

  • Processor: 2x Intel 16-Core Xeon Gold 6226R 2.9GHz – 150W
  • Memory: 192GB – 12x 16GB DDR4 2933MHz
  • Storage: 2x 240GB SATA 2.5″ solid state drives

top

Storage

For architectural concepts and usage of storage, see here.

High-throughput file system (scratch)

BeeGFS storage with 1.3PB (i.e., 1,327,032 GiB) of total usable storage.

3 x BeeGFS HDD storage block

beegfs-oss01.cluster [ID: 1]

beegfs-oss02.cluster [ID: 2]

beegfs-oss03.cluster [ID: 3]

  • Enclosure: 4U, 60 drive JBOD with redundant SAS expanders
  • Storage: 480TB usable – 6GB/s (60x 10TB)
  • Fabric: ConnectX6, HDR-100 IB (100Gb/s) and 100GbE, single-port QSFP56, PCIe3/4 x16 Slot

3 x BeeGFS NVMe storage block

beegfs-oss04.cluster [ID: 4]

beegfs-oss05.cluster [ID: 5]

beegfs-oss06.cluster [ID: 6]

  • Storage: 300TB usable – 2x 48GB/s (8x 6GB/s per HDR200 IB card theoretical throughput) for 96GB/s
  • Fabric: ConnectX7, 2x HDR-200 IB (200Gb/s) for a total of 80GB/s theoretical throughput

2 x BeeGFS meta data server

beegfs-mds01.cluster [ID: 1]

beegfs-mds02.cluster [ID: 2]

  • Memory: 192GB – (12x 16GB DIMMs)
  • Metadata storage: 4x 3.2TB PCIe NVMe 2.5" solid state drives
  • Fabric: ConnectX-6 VPI adapter card, HDR-100 IB (100Gb/s) and 100GbE, single-port QSFP56, PCIe3/4 x16 Slot

Large-volume file system (persistent)

Ceph storage with 8.1PB of total usable storage.

33 x Ceph HDD storage block

3 x Ceph meta data server

croit-mds01

croit-mds02

croit-mds03

  • 32-core CPU
  • Memory: 267GB
  • Metadata storage: 6x PCIe NVMe 2.5″ solid state drives

top

Infiniband

1 x HDR Network

  • IB switch: 1x 40 port IB HDR (200Gb/s) – QSFP ports
  • Used IB ports: 38x HDR-100 ports
  • Unused ports: 42x HDR-100 available ports for expansion – 21x HDR-200 physical
  • Node cables: 19x 2 Meter HDR Infiniband Cable – 1x QSFP (200Gb/s) to 2x QSFP (100Gb/s)

top

Legacy

CHPC2 has 528 CPU cores in 23x CPU-based nodes and 11x GPU-accelerated nodes. GPU acceleration adds 122.5TF of SFLOP performance and 160GB of global memory from 31 GPUs.

  • Dual-socket Intel E5-2630 8-core CPUs
  • 128GB memory
  • GPU-acceleration: NVIDIA K20m and K20xm

top