Pure on-demand monthly
Expected GPU-hours × on-demand rate × (1 + overhead). Linear in usage.
Calculator
Compare a GPU reservation to pure on-demand at your expected usage.
Reservations trade a lower hourly rate for a committed monthly bill that you owe whether or not you actually use the GPUs. The calculator shows pure on-demand, reserved-plus-burst, the GPU-hours of stranded capacity if you under-use, and the monthly usage at which the reservation breaks even.
Interactive calculator
At 1,200 GPU-h/mo, pure on-demand is $10,560/mo vs reserved plus burst at $6,336/mo over a 12-month commit. Reserved plus burst looks cheaper at this usage. The commitment pays for itself with room to spare.
Starting values are illustrative defaults you can edit — not live ComputeTape benchmark prices. Replace them with a real quote.
How to read the result
Reservations are a fixed monthly bill; pure on-demand scales with what you run. The break-even number is the monthly GPU-hours at which the reservation matches the equivalent on-demand spend; above it, reserved-plus-burst is cheaper, below it on-demand wins.
Expected GPU-hours × on-demand rate × (1 + overhead). Linear in usage.
Reserved bill (always paid) plus any extra GPU-hours above committed capacity at the on-demand rate. Above break-even, this beats pure on-demand.
Committed hours that go unused at your expected usage. Stranded capacity is paid for but produces nothing.
Reserved hours × reserved rate / on-demand rate, in monthly GPU-hours. The cleanest single number for sizing a commitment against expected demand.
Why variance matters
Two clusters with the same average monthly GPU-hours can land in very different places. A workload that bursts hard then sits idle strands more capacity on a reservation than one that runs at a steady level. The sensitivity row shows what happens at 50%, 100%, and 150% of your expected usage; the wider that spread is in real life, the more cautious the reservation choice should be.
Access terms, savings, and interruption risk side by side.
How utilization is measured and why paid capacity costs more when it sits idle.