Previous lesson
B200 price per hour
Continue the GPU Pricing & Capacity track.
Compute College
GPUs, TPUs, and custom ASICs are different kinds of AI accelerator that trade flexibility for efficiency on targeted workloads.
One concept connected to AI compute market decisions.
A practical introduction designed to be completed in one sitting.
Useful for founders, ml engineers, analysts, and procurement buyers.
Plain-English definition
GPUs, TPUs, and custom ASICs are all AI accelerators — chips built to run the heavy math behind training and inference. GPUs (graphics processing units, the type most widely used for AI today) are flexible and broadly supported. TPUs (tensor processing units) and other custom ASICs (application-specific integrated circuits, including hyperscaler silicon and inference-focused startup chips) are purpose-built for particular AI workloads, trading general flexibility for efficiency on the work they target.
Why it matters
The accelerator a workload runs on shapes its cost, its availability, and how locked-in you are. The dominant GPUs have the deepest software ecosystem and the broadest availability; alternatives can offer better price-performance on specific workloads or relieve supply constraints, but usually come with smaller software ecosystems and a porting cost.
Simple example
Suppose a workload runs well on both a flexible GPU and a purpose-built accelerator, and the accelerator delivers the same useful output at an illustrative 30% lower cost per result. If porting and re-validating the software takes real engineering time, the decision weighs the recurring savings against that one-time switching cost — not the headline chip price alone.
Example figures are illustrative calculations, not current quoted market prices.
Market signal
A shift toward non-dominant accelerators — cloud TPUs, alternative merchant GPUs, and custom inference silicon — is a signal about supply, pricing power, and where efficiency gains are concentrating. Watch whether large buyers diversify accelerators to manage cost and availability, since broad adoption of alternatives can loosen scarcity and pressure pricing.
Market read: accelerator choice is a supply-and-lock-in signal as much as a price one. Evidence discipline: name the workload, the accelerator, and the date for any price-performance claim, and keep illustrative ratios separate from measured results.
Common mistake
Assuming a cheaper accelerator is automatically cheaper to use. Headline price-performance ignores software maturity, porting effort, real-workload utilization, and availability — a chip that looks cheaper per hour can cost more per completed result once those are included.
Practical takeaway
Choose accelerators by total cost per useful result on your own workload, including porting and software risk, rather than by headline specs.
Decision check: an alternative accelerator is worth adopting only when the recurring savings on your real workload outweigh the one-time porting and software risk.
Helpful memory trick
A GPU is the Swiss-army knife; a custom ASIC is the purpose-built tool — faster at one job, useless at the others.
Compute College
Use the GPU-Hour Cost Calculator, AI Training Cost Calculator, or Model Serving Cost Calculator.
Compute College track
Continue this Compute College lesson path
Previous lesson
Continue the GPU Pricing & Capacity track.
Next lesson
Continue the GPU Pricing & Capacity track.