Beyond nvidia-smi: Tools for Real GPU Performance Metrics
- Track: Software Performance
- Room: H.1301 (Cornil)
- Day: Sunday
- Start: 09:50
- End: 10:30
- Video only: h1301
- Chat: Join the conversation!
nvidia-smi reports 100% utilization, but your workload underperforms. What's missing?
Relying only on nvidia-smi is like measuring highway usage by checking if any car is present, not how many lanes are full.
This talk reveals the metrics nvidia-smi doesn't show and introduces open source tools that expose actual GPU efficiency metrics.
We'll cover:
- Why GPU Utilization is not same as GPU Efficiency.
- Deep dive into relevant key metrics: SM Active, SM Occupancy, and Tensor Core utilization explained.
- Steps for practical gpu profiling and active monitoring setup.
- Identifying bottlenecks in inference workloads.
Attendees will leave understanding how to identify underutilized GPU and discover real optimization opportunities across inference workloads.
Speakers
| YASH PANCHAL |