14 stages · 69 topics · 32 core
Roadmap

Become a DevOps engineer.

The full arc — from the shell up through pipelines, containers, clusters, cloud, and the reliability practices that keep it all standing. Every stage is on the critical path. Each topic links to a Semicolony deep dive or simulator where one exists, and to a curated external resource where it doesn't. Follow the arc in order, or jump to wherever you're stuck.

FOUNDATIONSBUILD & SHIPRUNSCALERELIABILITY 01 02 03 04 05 06 07 08 09 10 11 12 13 14startdevops engineer
Core (the spine) Recommended (strong upside) Optional (pick if relevant)

Path
Level

Core plus the recommended layer. The optional stops stay hidden until you have shipped a couple of real systems.


Jump to a stage

01
Stage

Linux & the shell

The system everything else runs on top of.

Almost every server you will ever touch is Linux. Get fluent in the filesystem, processes, signals, and the shell — these are the primitives you reach for at 3am when the dashboards are lying.

02
Stage

Networking for ops

Packets, ports, and why the request hung.

Distributed systems are networking with extra steps. You do not need to be a CCNA, but you must be able to reason about TCP, DNS, TLS, and the layers of indirection between a client and your pod.

Core

DNS & service resolution

The name-to-address layer that is somehow always the cause. Records, TTLs, caching, and why a stale entry takes you down for exactly the cache lifetime.

How DNS works
03
Stage

Version control & collaboration

Git as the source of truth for everything you ship.

In DevOps, Git is not just for code — it is the audit log for your infrastructure, your pipelines, and your deploys. GitOps lives or dies on understanding what a commit actually is.

Core

Git internals & the object model

Blobs, trees, commits, refs. Once you see Git as a content-addressed store, rebases and merges stop being scary incantations.

How Git works
04
Stage

CI/CD pipelines

From a commit to production, with no hands on it.

Continuous integration catches breakage early; continuous delivery makes shipping boring. The goal is a pipeline so trustworthy that deploying on a Friday afternoon is a non-event.

05
Stage

Containers (Docker)

Shipping the whole environment, not just the code.

Containers killed "works on my machine" by packaging the app and its world together. Under the hood they are just Linux namespaces and cgroups — once that clicks, the magic becomes mechanics.

06
Stage

Container orchestration (Kubernetes)

You declare the state; it keeps reality matching.

Kubernetes is a control loop that keeps your declared state and actual state in sync — and reschedules around failure so you do not have to. Steep curve, but it is the lingua franca of modern infra.

07
Stage

Infrastructure as Code (Terraform)

Your whole cloud, in a diff you can review.

Click-ops does not scale and cannot be reviewed. IaC turns infrastructure into versioned, planned, peer-reviewed code — so the thing you deploy is the thing you read.

08
Stage

Configuration & secrets

Getting config and secrets where they belong, safely.

Config and secrets are where deploys quietly go wrong. Separate config from code, keep secrets out of Git, and make rotation a routine rather than an emergency.

09
Stage

Cloud platforms

Renting compute, storage, and reliability by the API call.

The big three rent you compute, storage, networking, and a hundred managed services. The skill is not memorising one provider — it is the shared mental model that ports across all of them.

10
Stage

Observability

Metrics, logs, and traces, so you stop guessing.

You cannot operate what you cannot see. The three pillars — metrics, logs, traces — turn "users say it is slow" into "p99 on the checkout service regressed after the 14:02 deploy."

11
Stage

Scaling & load management

Holding up when the traffic actually shows up.

Scaling is not just adding boxes — it is autoscaling on the right signal, shedding load gracefully, and protecting downstreams from each other. The art is degrading instead of collapsing.

12
Stage

Reliability engineering

SLOs, error budgets, and what to do at 3am.

SRE is reliability treated as an engineering discipline, with a budget. Define what "working" means numerically, spend the error budget deliberately, and run incidents like the routine they should be.

13
Stage

Security & compliance (DevSecOps)

Building security into the pipeline, not bolting it on.

Security is not a gate at the end — it is a property you build in and continuously verify. Scan the supply chain, lock down the runtime, and assume breach so detection actually matters.

14
Stage

Platform & developer experience

Turning ops into a product other engineers self-serve.

Platform engineering treats your internal tooling as a product whose users are other engineers. The win is a golden path so smooth that doing the right thing is also the easy thing.