AI Systems Performance
A performance-focused guide to making AI systems faster, cheaper, and more reliable from model training through inference.

About the Book
This book connects hardware, software, and model-level decisions to real production performance.
What's inside
- GPU and CUDA optimization
- PyTorch performance techniques
- Distributed training and inference
- Scaling GPU clusters
- Production optimization checklists