Fundamentals 12 min readJune 13, 2026

Inside a cloud data center (and what 'the cloud' really is)

The cloud is a building full of computers. Here is how one is built and how raw metal becomes an API call.

“The cloud” sounds soft and weightless. It is not. It is a few hundred enormous, windowless buildings, drawing as much power as a small city, full of computers that someone has to cool, wire, and keep alive. When you spin up a server “in the cloud,” this is what is actually on the other end.

The cloud is someone else's computer, at scale

At the bottom of every cloud is ordinary hardware: a metal server with CPUs, memory, and disks, no different in kind from a powerful PC. The magic is not the machine. It is the software and the scale wrapped around millions of them, so that you can rent a slice of one for a few cents an hour with a single command, and give it back when you are done.

How a provider is laid out, from the planet down to a chip

Cloud providers organize their hardware in a nested hierarchy. Each level exists to contain failures and reduce distance:

Regiona geographic area, e.g. "Europe (Frankfurt)"

Availability Zoneone or more nearby data centers, independent power + network

Data centerthe building

Hall / roomrows of racks

Racka tall cabinet of ~20-40 servers

ServerCPUs, memory, disks, network cards

The nesting of a cloud provider. You usually pick a region; the provider spreads your workload across availability zones for you.

The two levels worth burning into memory are region and availability zone (AZ). A region is a place on the map. Inside it are several AZs, each a physically separate facility with its own power and network, but close enough that they can talk in a millisecond. If you run copies of your app in two AZs, a fire, flood, or power failure in one does not take you down. This is the single most important reliability decision most teams make, and it costs almost nothing.

The unglamorous half: power and cooling

Roughly 40% of a data center's energy can go not to computing but to removing the heat that computing makes. A rack of servers is a space heater that never turns off. Providers obsess over a number called PUE (Power Usage Effectiveness): total power divided by power that actually reaches the computers. A perfect 1.0 is impossible; the best facilities reach ~1.1 with tricks like:

Building in cold climates and using outside air instead of air conditioning.
Running the rooms surprisingly warm (27°C is fine), so less cooling is needed.
Liquid cooling, piping coolant directly to the hottest chips, now standard for AI/GPU racks that run far hotter than normal servers.
Redundant power: two utility feeds, banks of batteries, and diesel generators that start in seconds if the grid drops.

Note

This is why data-center location is a strategic decision, not a detail. Cheap, clean, reliable power and a cool climate are worth more than almost anything else.

The network fabric: how thousands of servers talk

Inside the building, every server must be able to reach every other server, fast. The modern design is called spine-leaf. Each rack has a “leaf” switch; all leaf switches connect to a set of “spine” switches. Any server is then just two hops from any other, no matter how big the building gets. It is the same trick as the internet itself: lots of equal paths, so traffic spreads out and no single link is a bottleneck.

Server

in a rack

Leaf switch

top of rack

Spine switches

the fabric

Any other server

two hops away

Spine-leaf: every rack's leaf switch connects to every spine switch, so any two servers are always a short, equal-cost hop apart.

From metal to “an instance”: virtualization

You never rent a whole physical server (usually). You rent a virtual machine, a software-defined computer that behaves exactly like a real one but is really a slice of a bigger machine shared with other tenants. The thing that creates and isolates these slices is a hypervisor. It hands each VM its own pretend CPUs, memory, and disk, and makes sure no tenant can see or touch another's, even though they sit on the same metal.

Containers take this further and lighter: instead of a whole fake computer, a container packages just your app and what it needs, sharing the host's operating system. VMs give stronger isolation; containers start in milliseconds and pack more densely. Most modern platforms use both, containers running inside VMs.

The control plane: turning hardware into an API

Here is the real product. The provider runs a giant software brain, the control plane, whose entire job is to take a request like “give me 4 CPUs and 16 GB in Frankfurt” and, within seconds, find a physical machine with room, carve out a VM, attach storage, plug it into a virtual network, give it an address, and hand you the keys.

1
You ask
An API call (or a click) requests an instance of a certain size in a certain region.
2
Schedule
The scheduler finds a physical host with enough free capacity and the right hardware.
3
Provision
A hypervisor carves out a VM; storage and a virtual network interface are attached.
4
Network
The instance gets a private address inside your virtual network and, if you want, a public one.
5
Boot + hand over
It boots your chosen image and, seconds later, you can connect. Billing starts ticking.

What actually happens when you click 'create instance'. The control plane choreographs the physical world for you.

OpenStack: the open-source blueprint for a cloud

How do you build all of that? The big providers wrote their own private versions, but there is a famous open-source one you can read and run yourself: OpenStack. It is, in effect, a cloud in a box, and a great way to see the moving parts because each job is a named service:

Keystoneidentity: who are you, what may you do

Novacompute: create and manage virtual machines

Neutronnetworking: virtual networks, routers, addresses

Cinderblock storage: virtual disks for instances

Swiftobject storage: files and backups at scale

Glanceimages: the templates instances boot from

OpenStack's core services map one-to-one to what every cloud must do. AWS, Google Cloud, and Azure have their own equivalents of each.

Run OpenStack on your own racks and you have a private cloud with the same shape as a public one. Telcos, banks, and sovereign clouds (including providers like Nebius in the AI space) use exactly this pattern: open building blocks, their own hardware, full control over data and cost.

Why availability zones exist (a story in one paragraph)

Everything in a data center is doubled because everything eventually fails. Disks die weekly. A switch reboots. A backhoe cuts a fiber line. Once in a while a whole building loses power. Providers cannot prevent this, so they contain it: independent power and network per AZ, your data copied across AZs, traffic steered away from the broken one automatically. Reliability in the cloud is not the absence of failure, it is failure that no longer reaches the customer.

Why this still matters in ten years

Chips will change (GPUs and custom AI accelerators are reshaping the rack right now), and cooling is getting wetter and weirder to keep up. But the shape, regions and zones, a control plane turning hardware into an API, virtualization for isolation, redundancy for survival, is stable. Whether you are placing a database or a thousand-GPU training run, you are making the same handful of decisions: how close, how isolated, how redundant, how much it costs to keep the heat moving.

The one thing to remember

“Serverless” and “the cloud” are abstractions over a very physical reality. Knowing what is underneath, a building, power, heat, racks, a scheduler, makes you far better at choosing regions, designing for failure, and controlling cost.

Written by the Stratiflux engineering team

We build and run this kind of infrastructure and AI for companies, and train the engineers who do it. If a piece of this is on your plate, we can help.

See what we build Free tools & guides

Buy me a coffee

Everything here is free. If it saved you time or taught you something, a small tip keeps the work going.

Keep reading

AISix things every AI engineer should know (that get increasingly niche)FundamentalsHow the internet actually moves your data AIHow a frontier AI model really works, end to end