Empire AI: Alpha Hardware Specifications
24 HGX Nodes
Nodes alphagpu01 - alphagpu18
8 H100 80GB GPUs per node
Nodes alphagpu19 - alphagpu24
8 H200 80GB GPUs per node
10 400Gb/s ConnectX-7 NIC Cards (8 for IB and 2 for Ethernet)
30TB NVMe caching space
2TB of system memory
23 of the nodes serve as the main computational platform
The 24th node is primarily to serve development, testing, etc.
Non-blocking NDR fabric cabled for rail configuration
8 network switches, 96 optical connections
4 service nodes
2 login nodes and 2 cluster management nodes (NVIDIA Base Command with licenses for all gear)
2 data transfer nodes
2PB of DDN Storage
4 x 720TB Flash storage (training data, snap shots)
2PB of VAST Storage
- home directories
Was this article helpful?
That’s Great!
Thank you for your feedback
Sorry! We couldn't be helpful
Thank you for your feedback
Feedback sent
We appreciate your effort and will try to fix the article