Compute College

What is Model FLOPs Utilization (MFU)?

By ComputeTape Editorial

Model FLOPs Utilization (MFU) measures how much of a GPU's theoretical compute a job actually uses; goodput counts useful work delivered.

MFU compares useful model math to a GPU's theoretical peak; it is usually well below 100%.
Goodput counts useful work delivered, not raw throughput.
Higher MFU and goodput lower cost per result on the same hardware.

A move from 40% to 50% MFU is roughly 25% more useful work on the same hardware.
Bottlenecks are often memory, networking, or pipeline stalls, not raw compute.
Treat any MFU percentage as workload- and setup-specific.

Example figures are illustrative calculations, not current quoted market prices.

Efficiency gains can lower cost per result without new hardware.
Persistent low MFU points to memory, networking, or software bottlenecks.
Utilization metrics separate real efficiency from raw chip counts.

Market read: MFU and goodput reveal how much of paid compute is actually useful. Evidence discipline: state the workload and setup behind any utilization figure, and keep illustrative percentages separate from measured runs. Figures here are illustrative unless explicitly sourced and dated — see our methodology.

Operators and ML engineers: measure MFU and remove memory, networking, and pipeline bottlenecks before buying more GPUs.
Analysts: read efficiency gains as a lever on effective compute supply and cost.
Compare cost per useful result, not per raw GPU-hour.
Treat utilization percentages as workload- and setup-specific.
Keep measured runs separate from illustrative efficiency targets.

Decision check: before adding GPUs, check whether higher MFU or goodput on existing hardware would meet the need more cheaply.

Use the calculators

Compute College track

Model Costs

Step 5 of 7: What is model flops utilization MFU

What is Model FLOPs Utilization (MFU)?

Plain-English definition

Why it matters

Simple example

How to read the market signal

Common mistake

What you can do with this

Turn the lesson into a number

Model Costs

What is Model FLOPs Utilization (MFU)?

Plain-English definition

Why it matters

Simple example

How to read the market signal

Common mistake

What you can do with this

Turn the lesson into a number

Model Costs

Related lessons

What is GPU utilization?

GPU-Hour Cost Calculator

What is model training cost?

Why memory matters