# Profiling Stack: torch.profiler + Nsight Systems + Nsight Compute + MFU Calculation

> Source: https://sukruyusufkaya.com/en/learn/fine-tuning-cookbook/ftc-profiling-stack-torch-profiler-nsys-ncu-mfu
> Updated: 2026-05-14T14:42:50.000Z
> Category: Fine-Tuning Cookbook (Model-by-Model)
> Module: Part I — Hardware & Memory Engineering
**TLDR:** Optimization without profiling is hot air. Python-level timing with torch.profiler, kernel-level timeline with Nsight Systems (nsys), kernel-internal metrics with Nsight Compute (ncu), MFU (Model FLOPs Utilization) calculation. Cookbook certification: each Lab MFU > 35%.

