03. High-Performance Network

This chapter focuses on InfiniBand and RoCE performance for AI training and HPC communication.

Core Topics

  • topology and communication patterns
  • NCCL benchmarking and tuning
  • network reliability and troubleshooting

AI-HPC Organization · Contact: openaihpc@gmail.com