AI compute market signals and learning
← Back to Compute College

Compute College

GPU Cloud Quote Comparison Checklist

Eleven concrete fields to compare on every GPU cloud quote before signing.

Buyers & OperatorsLearning path

One concept connected to AI compute market decisions.

5-8 minutesRead time

A practical introduction designed to be completed in one sitting.

GPU Quotes / Procurement / Buyer ChecklistTags

Useful for procurement teams, founders, infrastructure leads, and finance teams evaluating multiple gpu cloud offers side by side.

Plain-English definition

Plain-English definition

A GPU cloud quote comparison checklist is a fixed list of fields recorded on every quote so two or more offers can be read against each other without confusion. The checklist used here is eleven fields, grouped into accelerator, commercial, capacity, and quote-quality categories. It is a rubric, not a score; no field is given a universal weight because the right weight depends on the workload.

Why it matters

Why it matters

Without a fixed comparison rubric, the lowest-rate quote tends to win attention even when its access terms, networking, or interruption risk would produce a higher completed-workload cost. A checklist forces every offer to expose the same eleven facts, which lets a buyer compare like-for-like and an analyst record offers consistently across providers.

  • Accelerator fields name what is being priced, so a buyer is not comparing an H100 SXM cluster against a single L40S box.
  • Commercial fields name what the buyer commits to, including minimum spend, term length, change rights, and the rate paid for capacity used outside the commitment.
  • Capacity and reliability fields expose access terms that determine whether the quoted hours can actually be delivered when the workload needs them.
  • Quote-quality fields make the comparison auditable: who quoted, on what date, and with what assumptions about region and configuration.

Simple example

Simple example

Quote A lists H100 SXM at $4 per GPU-hour with a 24-month commitment, US-East, 3,200 Gbps InfiniBand, no spot, an SLA on availability of 99.5%, included 50 TB storage and $0.05 per GB egress. Quote B lists H100 SXM at $5 per GPU-hour on a month-to-month basis, US-West, 200 Gbps Ethernet, spot at $2.50 with no SLA, and metered storage and egress. For a deadline-bound 600-hour training run on 128 connected GPUs, the comparable buyer cost is not the headline rate alone; it depends on whether B can deliver an undisrupted 128-GPU cluster on the required date and whether A is willing to redeem any of its commitment in a flex period.

  • Quote A is cheaper on rate but binds capacity and region; the commitment is only a saving if utilization stays above the break-even monthly hours.
  • Quote B is more flexible but spot risk and lower networking can slow or restart a 128-GPU job, raising effective completed-workload cost.
  • A side-by-side row that says "$4 vs $5" hides everything that actually differs; the eleven-field row does not.
  • Numbers in the example are illustrative; a real comparison requires verified quote text and a date for each row.

Example figures are illustrative calculations, not current quoted market prices.

Market signal

How to read the market signal

Comparable, eleven-field quote records make buyer pricing readable as a market signal rather than a rumor. If multiple comparable quotes in the same region show falling rates on shorter commitments and rising rates on longer ones, demand for committed capacity may be cooling. If access requires region substitution, lengthened commitment, or a quote-only conversation, public capacity in the headline region may be tight regardless of any list price.

  • A rate is only a market signal when the eleven fields beside it are recorded; a number without context is noise.
  • Average quotes across providers say little if the underlying capacity, term, and region differ.
  • Spread between commitment rates and on-demand rates communicates how providers value certainty of demand.
  • Quote date and region are required so older or distant quotes are not mistaken for current local market conditions.

Market read: comparable quote records turn private deals into public structure. Without the rubric, every quote is a one-off; with it, a buyer reads a market.

Common mistake

Common mistake

Do not compare quotes by rate alone or by a custom score that hides its weights. A single rate or a black-box score collapses eleven different facts into one number and lets the cheapest-looking row win regardless of whether it can do the job. If a score is used, the rubric and its weights must be visible alongside the result.

Practical takeaway

What you can do with this

Record the eleven fields on every quote. Use the checklist below as the column header for a side-by-side comparison sheet, and keep the source attribution and date on every row.

  • 1. GPU generation: chip model, board form factor (SXM, PCIe), memory capacity, and connectivity (NVLink, InfiniBand). H100 SXM is not the same offering as H100 PCIe.
  • 2. Hourly and effective monthly cost: published rate per GPU-hour and the implied monthly cost at full utilization, plus a row at the expected utilization the buyer can sustain.
  • 3. Minimum commitment: term length, minimum monthly spend, prepay structure, and whether unused capacity can be reclaimed or sold back.
  • 4. Region and latency: physical region, available zones, end-user latency budget, and whether the buyer can change region during the term.
  • 5. Capacity and quota: connected cluster size available now, lead time for the required block, soft quotas, and what triggers a manual approval.
  • 6. Networking: interconnect type and bandwidth between GPUs in a node and between nodes, plus any topology constraints on a multi-node job.
  • 7. Storage and egress: included high-speed storage, metered overflow, and per-GB egress charges for moving model weights, data, and checkpoints out of the region.
  • 8. SLA and support: availability target, remedies if missed, support tier included, response-time commitment, and named escalation path.
  • 9. Interruption risk: spot or interruptible eligibility, expected eviction rate by region, checkpointing assumptions, and the contract treatment of forced interrupts.
  • 10. Flexibility: rights to pause, change GPU type, change quantity, extend or shorten the term, transfer capacity to another project, and renegotiate before renewal.
  • 11. Quote source and date: the named source contact, quote document or page URL, the effective date, and any expiry on the quoted terms.

Decision check: a quote can only be chosen when the eleven-field row is filled in and the gaps named. An empty field is a question, not a permission.

Helpful memory trick

Helpful memory trick

Read every quote as eleven facts in a row, not as one number with a logo next to it. The number is the last column, not the first.

Compute College

Turn the lesson into a number

Use the GPU-Hour Cost Calculator, AI Training Cost Calculator, or Model Serving Cost Calculator.

Use the calculators

Compute College track

Buyers & Operators

Continue this Compute College lesson path

Previous lesson

How to compare GPU cloud quotes

Continue the Buyers & Operators track.

Next lesson

How to estimate monthly AI compute burn

Continue the Buyers & Operators track.