Track 1
AI Compute 101
Start here. GPUs, GPU-hours, model costs, and why compute became a market.
0 of 7 complete
Free lessons, calculators, and explainers for understanding GPUs, model costs, cloud capacity, data centers, power constraints, and emerging AI compute infrastructure. Explore 60+ short lessons, follow a learning path, or use a calculator to estimate real compute costs.
A different lesson is highlighted in the Morning Brief each day.
Sponsor slot available
Reserved placement for infrastructure, data-center, energy, and AI compute sponsors.
View sponsor optionsThe curriculum
Pick a track and work through it in order, or jump to the topic you need. Track your progress in this browser — no account required. Every lesson is free.
Track 1
Start here. GPUs, GPU-hours, model costs, and why compute became a market.
0 of 7 complete
Track 2
What H100, H200, and B200 hours cost, and how on-demand, reserved, and spot pricing differ.
0 of 8 complete
Track 3
What it costs to train a model and to serve one, and why utilization drives the bill.
0 of 7 complete
Track 4
How benchmark scores connect to token pricing, latency, throughput, and real inference spend.
0 of 23 complete
Track 5
Power, cooling, networking, memory, and the physical sites where AI compute actually runs.
0 of 17 complete
Track 6
Neoclouds, capacity markets, and reservations: how compute supply gets priced.
0 of 8 complete
Track 7
Forward pricing for compute: futures, forward curves, and forward contracts.
0 of 5 complete
Track 8
How labs and teams buy, compare, and budget GPU capacity in practice.
0 of 8 complete
Compute College tracks
Once you know the basics, work through pricing, model costs, infrastructure and power, market structure, compute futures, and the buyer-and-operator playbook. Emerging topics and the calculators sit alongside them.
The basic resource behind training and running AI models.
Why chips, power, and capacity are becoming economic constraints.
The basic unit behind compute pricing.
How accelerator generations affect performance, supply, and cost.
What an H100 hourly quote includes and why offers vary.
When memory and workload fit can justify a premium.
How next-generation rates reveal scarcity and demand.
How accelerator types trade flexibility for efficiency.
Compare access terms, savings, and interruption risk.
How buyers rent accelerator capacity and what rental signals can reveal.
How short-term, interruptible capacity can become a pricing signal.
Why buyer-accessible GPU supply differs from headline inventory.
The cost of teaching a model before deployment.
How high-end AI usage becomes recurring compute demand.
How hosted APIs price inference, and why output drives the bill.
Why paid capacity can cost more when it sits idle.
Why useful life and obsolescence drive the real cost per GPU-hour.
How much of a paid GPU does useful work, and why it sets cost per result.
Turn training and serving usage into a recurring budget.
How "thinking" tokens turn one query into far more compute.
Why the highest score is not always the cheapest to run.
The coding benchmark that mirrors real agent debugging work.
Measuring how well agents operate in a real terminal.
Read a current model release through compute economics.
Why restricted frontier-model access can still move compute demand.
A graduate-level reasoning benchmark and what it signals.
A harder knowledge benchmark for comparing model quality.
A frontier-difficulty test of model reasoning limits.
How multi-step agent tasks drive repeated inference demand.
Read launches and benchmark jumps as demand signals.
Why electricity and site capacity shape AI compute markets.
The physical site where chips, power, cooling, networking, and operations come together.
Why heat limits how densely AI chips can be deployed and operated.
Why fast interconnects turn individual chips into useful AI clusters.
Why high-bandwidth memory can constrain accelerator supply and model performance.
How many GPUs become one useful compute system.
Why fast GPU-to-GPU links improve connected systems.
How large AI clusters move data between servers.
How rack-scale systems link many GPUs as one.
Why cooling becomes a constraint as AI racks grow denser.
How network access connects compute with data and buyers.
How facility overhead power affects AI economics.
How PPAs, on-site power, and nuclear/SMR deals gate AI capacity.
Translate power announcements into potential capacity.
Why waiting for grid connection delays compute supply.
How concentrated rack heat limits deployments.
Why memory supply and fit matter for accelerator economics.
Compute-first cloud operators and why they matter.
How compute access becomes a supply and pricing signal.
Why buyer-accessible GPU supply differs from headline inventory.
Why buyers commit to GPU access ahead of need.
How short-term, interruptible capacity can become a pricing signal.
Borrowing against GPUs and rental contracts to fund AI buildouts.
AI capacity a nation controls within its own borders.
Forward-looking pricing for compute capacity.
How curve shape becomes a market signal.
How future capacity could be priced before delivery.
How compute access becomes a supply and pricing signal.
Why buyers commit to GPU access ahead of need.
How major buyers secure flexible and committed capacity.
Compare rate, reliability, network, and completed-workload cost.
Match access terms to interruption risk and deadlines.
Understand the reliability terms behind GPU access.
Build a buyer scorecard for capacity and risk.
Estimate AI compute cost from GPU price, runtime, utilization, and overhead.
Estimate a training-run budget using GPU-hours and operating assumptions.
Estimate recurring inference cost from usage and capacity needs.
Compare committing to reserved GPU capacity against paying on-demand for a workload.
Compare paying per token for a hosted API against running the model on your own GPUs.
A proposed chip and compute-capacity project.
A mega-scale AI infrastructure buildout and what it says about future compute supply.
xAI’s large-scale compute buildout and why power can become the bottleneck after GPUs arrive.
AWS’s custom-silicon AI cluster and why proprietary chips matter to compute supply.
Meta’s multi-gigawatt AI campuses and what industrial-scale compute really means.
Put it to work
Use the GPU-Hour Cost Calculator, AI Training Cost Calculator, or Model Serving Cost Calculator to estimate real compute costs from your own inputs.
Keep up with the market
Get the ComputeTape Morning Brief for daily AI compute pricing, power, capacity, and infrastructure signals — plus a different Compute College lesson highlighted each day.
Contact Compute College
Send lesson ideas, corrections, source material, or questions about AI compute education to the editorial team.