Compute College

What is AI compute? Chips, power, and cost explained

By ComputeTape Editorial

The usable capacity stack behind AI: accelerators, memory, networking, power, data centers, and access.

Usable capacityMarket unit

Compute is useful only when chips, memory, networking, power, cooling, and access work together.

Physical + cloudCost stack

Prices reflect hardware, facilities, electricity, utilization, reliability, and the way capacity is sold.

Plain-English definition

AI compute is the usable capacity required to train and run artificial-intelligence models. It is not just GPUs: it includes accelerators, memory, networking, power, cooling, data-center space, software, and cloud or infrastructure access. Those inputs become a market because buyers compete for capacity that is expensive, scarce, and constrained by physical infrastructure.

Memory trick: AI compute is a working factory: chips are the machines, while memory, networking, power, cooling, and access keep production running.

Accelerators such as GPUs that perform the core calculations.
Memory and networking that move data fast enough to keep chips useful.
Data centers, power, and cooling that make large-scale deployments possible.
Cloud, rental, and ownership models that determine how buyers access capacity.
Utilization, reliability, and delivery timing that determine whether capacity is actually useful.

Inputs

Chips, power, memory, networking

Work

Model training or model serving

Output

A model learned or a model response served

Any figures shown are illustrative calculations, not current quoted market prices.

Training demand, model serving demand, and enterprise adoption all compete for finite accelerator capacity.
Power, interconnection, cooling, and data-center buildout can limit supply even when chips are announced.
GPU-hour pricing, reservations, spot capacity, and cloud contracts turn compute access into a measurable market.
The cost of AI products depends on both model efficiency and the price of the capacity needed to run them.

Market read: AI compute became a market because training demand, model serving demand, cloud contracts, power constraints, and data-center buildout all meet in the price and availability of usable capacity. A chip announcement affects supply only after the surrounding capacity is operating and accessible to buyers. Figures here are illustrative unless explicitly sourced and dated — see our methodology.

A GPU without enough power or cooling is not usable capacity.
Weak networking can limit how well many GPUs work together.
The same hardware can produce different value depending on the workload and operating environment.

Founders and product managers: identify whether training or serving drives the capacity need.
Analysts and buyers: compare accelerators together with power, networking, availability, and cost.
For training, ask how many accelerators can work together efficiently and for how long; for serving, ask what latency, uptime, and recurring throughput the capacity must support.
When an announcement describes chips or a data-center build, look for evidence of energization, cooling, network readiness, and customer access before treating it as supply.

Decision check: ask what usable output the announced or rented capacity can produce, not only how many chips it contains.

Use the calculators

Compute College track

AI Compute 101

Step 1 of 7: What is AI compute

What is AI compute? Chips, power, and cost explained

Plain-English definition

Why it matters

Simple example

Inputs

Work

Output

How to read the market signal

Common mistake

What you can do with this

Turn the lesson into a number

AI Compute 101

What is AI compute? Chips, power, and cost explained

Plain-English definition

Why it matters

Simple example

Inputs

Work

Output

How to read the market signal

Common mistake

What you can do with this

Turn the lesson into a number

AI Compute 101

Related lessons

Why compute matters

What is a GPU-hour?

H100 vs H200 vs B200