AI Systems Performance

A performance-focused guide to making AI systems faster, cheaper, and more reliable from model training through inference.

About the Book

A performance-focused guide to making AI systems faster, cheaper, and more reliable from model training through inference. It connects hardware, software, and model-level decisions to real production performance.

What's inside

  • GPU and CUDA optimization
  • PyTorch performance techniques
  • distributed training and inference
  • scaling GPU clusters
  • production optimization checklists

Download Links

21.76 MB Total Size

Unlock this Bonus Material

Get the Package