Skip to content

Appendix F — Concept map, tag index, and reading paths

115 chapters is a lot. This appendix is the index. It exists so a reader standing in front of the table of contents doesn't drown.

There are three different ways to navigate the guide and this appendix exposes all three:

  1. By concept — Section 2 is a single Mermaid concept map of "X builds on Y" relationships across the guide. If you've learned X, the arrows tell you what Y (and Z, and W) the rest of the guide says you can attack next.
  2. By tag — Section 3 is an alphabetical tag index. Each chapter carries a <!-- tags: ... --> HTML comment near its top; this index turns those tags inside out so you can ask "show me every chapter about argo-cd" or "everything tagged cost" and get an ordered chapter list.
  3. By goal — Section 4 is 10 topic-driven reading paths. Each path is an ordered, opinionated chapter list with a one-line "why this chapter, why here" plus an estimated time-to-complete and a one-sentence outcome statement at the end.

These three lenses complement Appendix E — Learning paths, which gives the four big role-based arcs (fast-track / zero-to-prod / exam-prep / SRE-platform). Appendix E is the high-level "which arc do I follow?"; Appendix F is the fine-grained "for this specific topic, in this order, here are the chapters."

A few practical notes before the rest of the appendix:

  • Tags are the source of truth. Every chapter that participates in the index has a single HTML comment in its header block of the form <!-- tags: tag1, tag2, tag3 -->. The tag list in Section 3 is generated from those comments — adding a new tag to a chapter is the entire workflow.
  • The concept map is curated, not generated. Section 2's diagram is hand- drawn. It shows the load-bearing edges only (~30 nodes, ~60 edges) — not every possible cross-reference. The goal is "what to learn next", not "every link that exists." A complete machine-readable cross-reference would be the full text of the guide.
  • Reading paths are opinionated. Section 4 lists 10 paths. Each is one reasonable order through a slice of the guide — not the only one. If a path says "skip chapters X and Y", that's a deliberate scoping decision for the named outcome, not a claim that X and Y are unimportant.
  • Single-chapter tags are intentionally omitted from Section 3. A tag that appears in only one chapter (like coredns, falco, karpenter, meilisearch) provides zero indexing value — it just points back to that one chapter, which the reader can already find through the tag's broader parent (networking, runtime-defense, autoscaling, bookstore-v2). Section 3 lists the 77 tags that span 2+ chapters; the 115 single-chapter tags remain in the chapter headers for future search/grouping but don't clutter the human-readable index.

If you only have time to read one section of this appendix, read Section 4 — Reading paths, find the path closest to your goal, and follow it. Sections 2 and 3 are tools for when a path doesn't quite fit and you need to assemble a custom route.


Section 2 — The big-picture concept map

The diagram below shows the major architectural relationships in the guide. Each node is a concept (not a chapter) and each edge means "X builds on Y" or "Y is the next stop after X". Nodes are clustered by Part to make it easier to trace which part of the guide owns each step.

The map is deliberately incomplete: only the load-bearing edges are drawn. For example, almost everything in Parts 10-15 implicitly depends on Pods and Services from Parts 01-02 — but only the new concepts that a later part introduces get their own edges. The rule is "if you don't know X, the chapter introducing Y will be hard," and the arrow points from X to Y.

graph LR
    %% Part 00 — foundations
    subgraph p00["Part 00 - Foundations"]
        containers["Containers / OCI images"]
        api["Declarative API model"]
        controlplane["Control plane / etcd"]
        nodeplane["Node plane / kubelet / containerd"]
    end

    %% Part 01 — workloads
    subgraph p01["Part 01 - Core workloads"]
        pod["Pods"]
        probes["Probes + lifecycle"]
        qos["Requests/limits + QoS"]
        rs["ReplicaSets / Deployments"]
        sts["StatefulSets"]
        ds["DaemonSets"]
        jobs["Jobs / CronJobs"]
        rollout["Rolling / blue-green / canary"]
    end

    %% Part 02 — networking
    subgraph p02["Part 02 - Networking"]
        svc["Services"]
        dns["DNS / service discovery"]
        ing["Ingress"]
        gw["Gateway API"]
        netpol["NetworkPolicies"]
    end

    %% Part 03 — config + storage
    subgraph p03["Part 03 - Config + storage"]
        cm["ConfigMaps"]
        sec["Secrets"]
        vol["Volumes"]
        pvc["PV / PVC / StorageClass"]
        statefuldata["Stateful data + operators"]
    end

    %% Part 04 — scheduling
    subgraph p04["Part 04 - Scheduling"]
        sched["Scheduler basics"]
        affinity["Affinity / taints / topology"]
        prio["Priority + preemption"]
    end

    %% Part 05 — security
    subgraph p05["Part 05 - Security"]
        rbac["RBAC"]
        psa["PodSecurity (PSA restricted)"]
        supply["Supply chain / SBOM / sign"]
        hard["Cluster hardening / KMS / etcd-enc"]
    end

    %% Part 06 — production readiness
    subgraph p06["Part 06 - Production readiness"]
        metrics["Metrics + Prometheus"]
        logs["Logs"]
        traces["Tracing / OTel"]
        autoscale["HPA / VPA / cluster-autoscaler"]
        pdb["PDB / disruption budgets"]
    end

    %% Part 07 — delivery
    subgraph p07["Part 07 - Delivery"]
        helm["Helm"]
        kust["Kustomize"]
        cicd["CI/CD pipeline"]
        argocd["Argo CD / GitOps"]
        progdel["Argo Rollouts / progressive delivery"]
    end

    %% Part 08 — day-2
    subgraph p08["Part 08 - Day-2 ops"]
        lifecycle["Cluster lifecycle"]
        backup["Backup / Velero"]
        tshoot["Troubleshooting playbook"]
        tenancy["Multi-tenancy / namespaces"]
        operators["Operators / CRDs"]
    end

    %% Part 09 — bookstore v1
    subgraph p09["Part 09 - Bookstore v1"]
        bookv1["End-to-end bookstore (v1)"]
    end

    %% Part 10 — cloud K8s
    subgraph p10["Part 10 - Cloud K8s"]
        managed["Managed K8s (EKS / GKE / AKS)"]
        iac["Terraform / IaC"]
        wkid["Workload identity / IRSA"]
        cloudnet["Cloud LB / CNI"]
        karp["Karpenter / cluster-autoscaler"]
    end

    %% Part 11 — advanced patterns
    subgraph p11["Part 11 - Advanced patterns"]
        admission["Admission webhooks"]
        opdev["Operator development"]
        apf["API Priority and Fairness"]
        mesh["Service mesh / Istio"]
        secretscale["Vault + ESO at scale"]
        fleet["Multi-cluster / fleet"]
        chaos["Chaos engineering"]
        hacp["HA control plane / etcd"]
        platfeng["Platform engineering"]
    end

    %% Part 12 — ML
    subgraph p12["Part 12 - ML on K8s"]
        gpus["GPUs / MIG / accelerators"]
        kueue["Kueue / gang scheduling"]
        dist["Distributed training"]
        kserve["KServe / model serving"]
        mlpipe["ML pipelines / Argo Workflows"]
    end

    %% Part 13 — bookstore v2
    subgraph p13["Part 13 - Bookstore platform v2"]
        cross["Crossplane tenancy"]
        cnpg["CNPG multi-region"]
        keycloak["Keycloak / Istio JWT"]
        kafka["Kafka + Debezium"]
        coraza["Coraza WAF + edge"]
        opencost["OpenCost per-tenant"]
        backstage["Backstage IDP"]
        runbook["Day-2 runbook + DR drill"]
    end

    %% Part 14 — EKS production
    subgraph p14["Part 14 - EKS A-to-Z"]
        tfstate["Terraform state in prod"]
        ekslife["EKS cluster lifecycle"]
        addons["EKS addon management"]
        graviton["ARM / Graviton"]
        cilium["Cilium / eBPF on EKS"]
        velero["Velero on EKS"]
        crossdr["Cross-region DR"]
    end

    %% Part 15 — daily ops
    subgraph p15["Part 15 - Day-to-day ops"]
        prtoprod["PR-to-prod lifecycle"]
        appcicd["App CI/CD"]
        cosign["Cosign / provenance"]
        promote["Multi-env promotion"]
        vault["Vault + ESO + rotation"]
        rollback["Rollback playbook"]
        flags["Feature flags"]
        hotfix["Hotfix + breakglass"]
        oncall["Incident response + on-call"]
        d2d["Day-to-day ops"]
        d90["First 90 days"]
    end

    %% Foundations -> core
    containers --> pod
    api --> pod
    controlplane --> api
    nodeplane --> pod

    %% Core workloads chain
    pod --> probes
    probes --> qos
    qos --> rs
    rs --> rollout
    pod --> sts
    pod --> ds
    pod --> jobs

    %% Workloads -> networking
    rs --> svc
    svc --> dns
    svc --> ing
    ing --> gw
    svc --> netpol

    %% Config + storage
    pod --> cm
    cm --> sec
    pod --> vol
    vol --> pvc
    pvc --> statefuldata
    sts --> statefuldata

    %% Scheduling
    pod --> sched
    sched --> affinity
    sched --> prio
    ds --> affinity

    %% Security
    api --> rbac
    pod --> psa
    cicd --> supply
    sec --> hard

    %% Production readiness
    pod --> metrics
    pod --> logs
    pod --> traces
    rs --> autoscale
    rs --> pdb

    %% Delivery chain
    rs --> helm
    rs --> kust
    helm --> cicd
    kust --> cicd
    cicd --> argocd
    argocd --> progdel
    rollout --> progdel

    %% Day-2 ops
    statefuldata --> backup
    metrics --> tshoot
    logs --> tshoot
    rbac --> tenancy
    tenancy --> operators

    %% Bookstore v1 = integration point of Parts 00-08
    rs --> bookv1
    svc --> bookv1
    ing --> bookv1
    cm --> bookv1
    metrics --> bookv1
    argocd --> bookv1

    %% Cloud
    bookv1 --> managed
    managed --> iac
    managed --> wkid
    managed --> cloudnet
    autoscale --> karp

    %% Advanced patterns
    operators --> opdev
    rbac --> admission
    sec --> secretscale
    secretscale --> vault
    managed --> fleet
    fleet --> argocd
    metrics --> chaos
    controlplane --> hacp
    platfeng --> backstage

    %% ML
    bookv1 --> gpus
    sched --> kueue
    gpus --> dist
    statefuldata --> dist
    rs --> kserve
    autoscale --> kserve
    argocd --> mlpipe

    %% Bookstore v2 (platform capstone)
    platfeng --> cross
    fleet --> cnpg
    statefuldata --> cnpg
    wkid --> keycloak
    mesh --> keycloak
    kserve --> kafka
    statefuldata --> kafka
    ing --> coraza
    mesh --> coraza
    metrics --> opencost
    platfeng --> backstage
    chaos --> runbook

    %% EKS A-to-Z
    iac --> tfstate
    managed --> ekslife
    ekslife --> addons
    karp --> graviton
    cloudnet --> cilium
    backup --> velero
    cnpg --> crossdr

    %% Day-to-day ops
    argocd --> prtoprod
    progdel --> prtoprod
    cicd --> appcicd
    supply --> cosign
    kust --> promote
    argocd --> promote
    vault --> secretscale
    progdel --> rollback
    progdel --> flags
    rollback --> hotfix
    tshoot --> oncall
    runbook --> oncall
    crossdr --> oncall
    oncall --> d2d
    d2d --> d90

How to read the map: pick a node you already know, follow the outgoing arrows. That's your shortlist for "what to learn next." If you want a curated ordering of those nodes, jump to Section 4.


Section 3 — Alphabetical tag index

Every chapter that participates in this index carries a <!-- tags: tag1, tag2, ... --> HTML comment near its title. The list below is the inverse map: for each multi-chapter tag, the chapters that carry it, sorted alphabetically.

Tags are shown grouped by first letter for navigation. The number in parentheses after each tag is the count of chapters that carry it. Single- chapter tags are omitted (115 of them); they remain in chapter headers for future tooling but don't help indexing.

A

aks (4) — 10.01 — The managed Kubernetes model · 10.02 — Provisioning and infrastructure-as-code · 10.03 — Cloud identity for workloads · 10.04 — Cloud networking and load balancing

api-server (2) — 00.03 — Architecture overview · 00.04 — Control plane deep dive

app-cicd (3) — 15.02 — Application CI/CD pipelines · 15.03 — Image signing and provenance in CI · 15.08 — Feature flags and dark launches

argo-cd (4) — 07.04 — GitOps with Argo CD · 11.06 — Multi-cluster and fleet · 14.10 — GitOps bootstrap on a fresh EKS cluster · 15.04 — Multi-environment promotion

argo-rollouts (3) — 07.05 — Progressive delivery · 15.06 — Progressive delivery in production · 15.07 — Rollback playbook

argo-workflows (2) — 12.07 — ML pipelines and workflows · 13.08 — Real ML loop: training -> registry -> serving -> drift -> retrain

autoscaling (5) — 06.04 — Autoscaling · 06.06 — Capacity and cost · 10.06 — Node autoscaling, cost & multi-cloud · 12.06 — Model serving and inference · 15.11 — Day-to-day production operations

B

backstage (2) — 11.10 — Platform engineering · 13.11 — Developer portal: Backstage scaffolder + software catalog + tech docs

batch (4) — 01.07 — Jobs and CronJobs · 12.03 — Batch and gang scheduling · 12.04 — Distributed training · 12.07 — ML pipelines and workflows

bookstore-v2 (12) — 13.01 — Bookstore 2.0: from toy to platform · 13.02 — Tenancy model and onboarding via Crossplane · 13.03 — Multi-region active-active · 13.04 — Real auth: Keycloak OIDC + IRSA + Istio JWT · 13.05 — Search and product discovery · 13.06 — Payments and event sourcing · 13.07 — Edge: Istio Gateway + Coraza WAF + per-tenant rate limiting · 13.08 — Real ML loop: training -> registry -> serving -> drift -> retrain · 13.09 — Observability: OpenTelemetry traces + Loki logs + Prometheus metrics + Grafana dashboards · 13.10 — Cost: OpenCost per-tenant, per-cluster, per-region · 13.11 — Developer portal: Backstage scaffolder + software catalog + tech docs · 13.12 — Day-2: runbook + on-call playbook + DR drill + chaos game-day

C

capstone (3) — 09.01 — Bookstore end-to-end · 13.01 — Bookstore 2.0: from toy to platform · 13.12 — Day-2: runbook + on-call playbook + DR drill + chaos game-day

ci-cd (13) — 05.03 — Supply chain security · 07.01 — Packaging with Helm · 07.02 — Packaging with Kustomize · 07.03 — CI/CD pipeline · 07.04 — GitOps with Argo CD · 07.05 — Progressive delivery · 14.07 — Infrastructure CI/CD + drift detection · 14.12 — Supply chain security in production · 14.16 — Developer experience for Kubernetes teams · 15.01 — The PR-to-production lifecycle · 15.02 — Application CI/CD pipelines · 15.03 — Image signing and provenance in CI · 15.09 — Hotfix workflow and breakglass

cilium (2) — 10.04 — Cloud networking and load balancing · 14.15 — Cilium / eBPF on EKS

cloud (22) — 08.01 — Cluster lifecycle · 10.01 — The managed Kubernetes model · 10.02 — Provisioning and infrastructure-as-code · 10.03 — Cloud identity for workloads · 10.04 — Cloud networking and load balancing · 10.05 — Cloud storage and data · 10.06 — Node autoscaling, cost & multi-cloud · 14.01 — Production-grade Terraform state · 14.02 — EKS cluster lifecycle · 14.03 — EKS add-on management discipline · 14.04 — Storage classes & EBS in production · 14.05 — Logging & metrics cost discipline · 14.06 — Cost guardrails · 14.07 — Infrastructure CI/CD + drift detection · 14.08 — VPC endpoints & egress economics · 14.09 — ARM/Graviton on EKS · 14.10 — GitOps bootstrap on a fresh EKS cluster · 14.11 — Multi-region active-active: cloud reality · 14.12 — Supply chain security in production · 14.13 — Runtime defense & container security · 14.15 — Cilium / eBPF on EKS · 14.16 — Developer experience for Kubernetes teams

cnpg (3) — 03.05 — Stateful data patterns · 08.05 — Operators and CRDs · 13.03 — Multi-region active-active

control-plane (2) — 00.03 — Architecture overview · 00.04 — Control plane deep dive

core-objects (14) — 00.01 — Why Kubernetes · 01.01 — Pods · 01.02 — Health and lifecycle · 01.03 — Resources and QoS · 01.04 — ReplicaSets and Deployments · 01.05 — StatefulSets · 01.06 — DaemonSets · 01.07 — Jobs and CronJobs · 01.08 — Deployment strategies · 03.01 — ConfigMaps · 05.01 — Authentication, authorization, RBAC · 05.02 — Pod security · 06.05 — Reliability and disruptions · 08.05 — Operators and CRDs

cosign (2) — 14.12 — Supply chain security in production · 15.03 — Image signing and provenance in CI

cost (11) — 06.06 — Capacity and cost · 10.06 — Node autoscaling, cost & multi-cloud · 12.02 — GPUs and accelerators · 12.05 — Notebooks and interactive ML · 12.08 — ML platform, cost, and MLOps capstone · 13.10 — Cost: OpenCost per-tenant, per-cluster, per-region · 14.05 — Logging & metrics cost discipline · 14.06 — Cost guardrails · 14.08 — VPC endpoints & egress economics · 14.09 — ARM/Graviton on EKS · 14.11 — Multi-region active-active: cloud reality

crossplane (2) — 11.10 — Platform engineering · 13.02 — Tenancy model and onboarding via Crossplane

D

day-2 (45) — 05.01 — Authentication, authorization, RBAC · 05.02 — Pod security · 05.03 — Supply chain security · 05.04 — Secrets and cluster hardening · 06.01 — Observability: metrics · 06.02 — Logging · 06.03 — Tracing · 06.04 — Autoscaling · 06.05 — Reliability and disruptions · 06.06 — Capacity and cost · 08.01 — Cluster lifecycle · 08.02 — Backup and disaster recovery · 08.03 — Troubleshooting playbook · 08.04 — Multi-tenancy and namespaces · 08.05 — Operators and CRDs · 09.01 — Bookstore end-to-end · 10.01 — The managed Kubernetes model · 10.02 — Provisioning and infrastructure-as-code · 10.05 — Cloud storage and data · 10.06 — Node autoscaling, cost & multi-cloud · 11.01 — Admission webhooks · 11.02 — Operator development · 11.03 — API Priority and Fairness · 11.07 — Chaos engineering · 11.08 — HA control plane and etcd · 11.09 — Performance and scalability · 13.03 — Multi-region active-active · 13.12 — Day-2: runbook + on-call playbook + DR drill + chaos game-day · 14.01 — Production-grade Terraform state · 14.02 — EKS cluster lifecycle · 14.04 — Storage classes & EBS in production · 14.05 — Logging & metrics cost discipline · 14.06 — Cost guardrails · 14.13 — Runtime defense & container security · 14.14 — Backup and restore with Velero · 14.17 — Cross-region DR + AWS account baseline + 90-day production-readiness runbook · 15.01 — The PR-to-production lifecycle · 15.05 — Production secrets: Vault + ESO + rotation · 15.06 — Progressive delivery in production · 15.07 — Rollback playbook · 15.08 — Feature flags and dark launches · 15.09 — Hotfix workflow and breakglass · 15.10 — Incident response & on-call · 15.11 — Day-to-day production operations · 15.12 — Capstone: the first 90 days running production

debezium (2) — 13.05 — Search and product discovery · 13.06 — Payments and event sourcing

deployments (2) — 01.04 — ReplicaSets and Deployments · 01.08 — Deployment strategies

dr (9) — 08.02 — Backup and disaster recovery · 11.06 — Multi-cluster and fleet · 11.08 — HA control plane and etcd · 13.03 — Multi-region active-active · 13.12 — Day-2: runbook + on-call playbook + DR drill + chaos game-day · 14.11 — Multi-region active-active: cloud reality · 14.14 — Backup and restore with Velero · 14.17 — Cross-region DR + AWS account baseline + 90-day production-readiness runbook · 15.07 — Rollback playbook

drift (3) — 07.04 — GitOps with Argo CD · 08.01 — Cluster lifecycle · 14.07 — Infrastructure CI/CD + drift detection

E

ebs-csi (2) — 10.05 — Cloud storage and data · 14.04 — Storage classes & EBS in production

eks (15) — 10.01 — The managed Kubernetes model · 10.02 — Provisioning and infrastructure-as-code · 10.03 — Cloud identity for workloads · 10.04 — Cloud networking and load balancing · 10.05 — Cloud storage and data · 14.02 — EKS cluster lifecycle · 14.03 — EKS add-on management discipline · 14.04 — Storage classes & EBS in production · 14.09 — ARM/Graviton on EKS · 14.10 — GitOps bootstrap on a fresh EKS cluster · 14.11 — Multi-region active-active: cloud reality · 14.14 — Backup and restore with Velero · 14.15 — Cilium / eBPF on EKS · 14.16 — Developer experience for Kubernetes teams · 15.02 — Application CI/CD pipelines

eso (3) — 03.02 — Secrets · 11.05 — Secrets at scale · 15.05 — Production secrets: Vault + ESO + rotation

etcd (2) — 00.03 — Architecture overview · 00.04 — Control plane deep dive

F

finops (12) — 06.04 — Autoscaling · 06.06 — Capacity and cost · 10.06 — Node autoscaling, cost & multi-cloud · 12.08 — ML platform, cost, and MLOps capstone · 13.10 — Cost: OpenCost per-tenant, per-cluster, per-region · 14.01 — Production-grade Terraform state · 14.05 — Logging & metrics cost discipline · 14.06 — Cost guardrails · 14.08 — VPC endpoints & egress economics · 14.09 — ARM/Graviton on EKS · 14.17 — Cross-region DR + AWS account baseline + 90-day production-readiness runbook · 15.11 — Day-to-day production operations

foundations (13) — 00.01 — Why Kubernetes · 00.02 — Containers and images · 00.03 — Architecture overview · 00.04 — Control plane deep dive · 00.05 — Node components · 00.06 — The declarative API model · 00.07 — Local cluster setup · 05.01 — Authentication, authorization, RBAC · 08.01 — Cluster lifecycle · 10.01 — The managed Kubernetes model · 11.08 — HA control plane and etcd · 12.01 — Why ML on Kubernetes · 14.01 — Production-grade Terraform state

G

gitops (18) — 07.01 — Packaging with Helm · 07.02 — Packaging with Kustomize · 07.03 — CI/CD pipeline · 07.04 — GitOps with Argo CD · 07.05 — Progressive delivery · 09.01 — Bookstore end-to-end · 10.02 — Provisioning and infrastructure-as-code · 11.06 — Multi-cluster and fleet · 11.10 — Platform engineering · 12.07 — ML pipelines and workflows · 13.11 — Developer portal: Backstage scaffolder + software catalog + tech docs · 14.07 — Infrastructure CI/CD + drift detection · 14.10 — GitOps bootstrap on a fresh EKS cluster · 15.01 — The PR-to-production lifecycle · 15.02 — Application CI/CD pipelines · 15.04 — Multi-environment promotion · 15.06 — Progressive delivery in production · 15.08 — Feature flags and dark launches

gke (2) — 10.01 — The managed Kubernetes model · 10.03 — Cloud identity for workloads

gpu (5) — 12.02 — GPUs and accelerators · 12.04 — Distributed training · 12.05 — Notebooks and interactive ML · 12.06 — Model serving and inference · 12.08 — ML platform, cost, and MLOps capstone

H

ha (2) — 00.04 — Control plane deep dive · 04.02 — Affinity, taints, and topology

I

incident-response (2) — 15.09 — Hotfix workflow and breakglass · 15.10 — Incident response & on-call

irsa (3) — 10.03 — Cloud identity for workloads · 13.04 — Real auth: Keycloak OIDC + IRSA + Istio JWT · 14.03 — EKS add-on management discipline

istio (3) — 11.04 — Service mesh · 13.04 — Real auth: Keycloak OIDC + IRSA + Istio JWT · 13.07 — Edge: Istio Gateway + Coraza WAF + per-tenant rate limiting

K

kafka (2) — 13.05 — Search and product discovery · 13.06 — Payments and event sourcing

kserve (2) — 12.06 — Model serving and inference · 13.08 — Real ML loop: training -> registry -> serving -> drift -> retrain

kube-proxy (2) — 00.05 — Node components · 02.02 — Services

kueue (2) — 12.03 — Batch and gang scheduling · 12.04 — Distributed training

kustomize (2) — 07.02 — Packaging with Kustomize · 15.04 — Multi-environment promotion

L

l7 (2) — 02.04 — Ingress · 02.05 — Gateway API

M

mental-model (3) — 00.01 — Why Kubernetes · 12.01 — Why ML on Kubernetes · 13.01 — Bookstore 2.0: from toy to platform

ml (9) — 12.01 — Why ML on Kubernetes · 12.02 — GPUs and accelerators · 12.03 — Batch and gang scheduling · 12.04 — Distributed training · 12.05 — Notebooks and interactive ML · 12.06 — Model serving and inference · 12.07 — ML pipelines and workflows · 12.08 — ML platform, cost, and MLOps capstone · 13.08 — Real ML loop: training -> registry -> serving -> drift -> retrain

mlops (3) — 12.07 — ML pipelines and workflows · 12.08 — ML platform, cost, and MLOps capstone · 13.08 — Real ML loop: training -> registry -> serving -> drift -> retrain

multi-cluster (2) — 11.06 — Multi-cluster and fleet · 13.03 — Multi-region active-active

multi-region (4) — 13.03 — Multi-region active-active · 13.10 — Cost: OpenCost per-tenant, per-cluster, per-region · 14.11 — Multi-region active-active: cloud reality · 14.17 — Cross-region DR + AWS account baseline + 90-day production-readiness runbook

multi-tenancy (11) — 08.04 — Multi-tenancy and namespaces · 11.10 — Platform engineering · 12.03 — Batch and gang scheduling · 12.05 — Notebooks and interactive ML · 13.02 — Tenancy model and onboarding via Crossplane · 13.04 — Real auth: Keycloak OIDC + IRSA + Istio JWT · 13.05 — Search and product discovery · 13.07 — Edge: Istio Gateway + Coraza WAF + per-tenant rate limiting · 13.09 — Observability: OpenTelemetry traces + Loki logs + Prometheus metrics + Grafana dashboards · 13.10 — Cost: OpenCost per-tenant, per-cluster, per-region · 13.11 — Developer portal: Backstage scaffolder + software catalog + tech docs

N

networking (11) — 02.01 — The networking model · 02.02 — Services · 02.03 — DNS and service discovery · 02.04 — Ingress · 02.05 — Gateway API · 02.06 — Network policies · 10.04 — Cloud networking and load balancing · 11.04 — Service mesh · 13.07 — Edge: Istio Gateway + Coraza WAF + per-tenant rate limiting · 14.08 — VPC endpoints & egress economics · 14.15 — Cilium / eBPF on EKS

O

observability (18) — 06.01 — Observability: metrics · 06.02 — Logging · 06.03 — Tracing · 06.04 — Autoscaling · 06.05 — Reliability and disruptions · 07.05 — Progressive delivery · 08.03 — Troubleshooting playbook · 09.01 — Bookstore end-to-end · 11.03 — API Priority and Fairness · 11.07 — Chaos engineering · 11.08 — HA control plane and etcd · 11.09 — Performance and scalability · 12.02 — GPUs and accelerators · 13.09 — Observability: OpenTelemetry traces + Loki logs + Prometheus metrics + Grafana dashboards · 14.05 — Logging & metrics cost discipline · 14.15 — Cilium / eBPF on EKS · 15.06 — Progressive delivery in production · 15.08 — Feature flags and dark launches

on-call (3) — 15.10 — Incident response & on-call · 15.11 — Day-to-day production operations · 15.12 — Capstone: the first 90 days running production

opentelemetry (3) — 06.02 — Logging · 06.03 — Tracing · 13.09 — Observability: OpenTelemetry traces + Loki logs + Prometheus metrics + Grafana dashboards

operators (2) — 03.05 — Stateful data patterns · 11.02 — Operator development

P

platform-engineering (29) — 07.01 — Packaging with Helm · 07.02 — Packaging with Kustomize · 07.03 — CI/CD pipeline · 07.04 — GitOps with Argo CD · 08.04 — Multi-tenancy and namespaces · 08.05 — Operators and CRDs · 11.01 — Admission webhooks · 11.02 — Operator development · 11.03 — API Priority and Fairness · 11.04 — Service mesh · 11.05 — Secrets at scale · 11.06 — Multi-cluster and fleet · 11.07 — Chaos engineering · 11.08 — HA control plane and etcd · 11.09 — Performance and scalability · 11.10 — Platform engineering · 12.08 — ML platform, cost, and MLOps capstone · 13.01 — Bookstore 2.0: from toy to platform · 13.02 — Tenancy model and onboarding via Crossplane · 13.11 — Developer portal: Backstage scaffolder + software catalog + tech docs · 14.02 — EKS cluster lifecycle · 14.03 — EKS add-on management discipline · 14.06 — Cost guardrails · 14.10 — GitOps bootstrap on a fresh EKS cluster · 14.16 — Developer experience for Kubernetes teams · 14.17 — Cross-region DR + AWS account baseline + 90-day production-readiness runbook · 15.01 — The PR-to-production lifecycle · 15.11 — Day-to-day production operations · 15.12 — Capstone: the first 90 days running production

postmortem (4) — 08.03 — Troubleshooting playbook · 13.12 — Day-2: runbook + on-call playbook + DR drill + chaos game-day · 15.10 — Incident response & on-call · 15.12 — Capstone: the first 90 days running production

pr-workflow (3) — 15.01 — The PR-to-production lifecycle · 15.04 — Multi-environment promotion · 15.12 — Capstone: the first 90 days running production

projected-volumes (2) — 03.01 — ConfigMaps · 03.03 — Volumes

prometheus (2) — 06.01 — Observability: metrics · 13.09 — Observability: OpenTelemetry traces + Loki logs + Prometheus metrics + Grafana dashboards

psa-restricted (3) — 05.02 — Pod security · 11.01 — Admission webhooks · 12.05 — Notebooks and interactive ML

R

rbac (2) — 03.02 — Secrets · 05.01 — Authentication, authorization, RBAC

rollback (2) — 01.04 — ReplicaSets and Deployments · 15.07 — Rollback playbook

rolling-update (2) — 01.04 — ReplicaSets and Deployments · 01.08 — Deployment strategies

S

scheduler (2) — 00.04 — Control plane deep dive · 04.01 — The scheduler and nodes

scheduling (5) — 04.01 — The scheduler and nodes · 04.02 — Affinity, taints, and topology · 04.03 — Priority and preemption · 12.02 — GPUs and accelerators · 12.03 — Batch and gang scheduling

secrets (2) — 03.02 — Secrets · 05.04 — Secrets and cluster hardening

secrets-rotation (3) — 05.04 — Secrets and cluster hardening · 11.05 — Secrets at scale · 15.05 — Production secrets: Vault + ESO + rotation

security (20) — 02.06 — Network policies · 03.02 — Secrets · 05.01 — Authentication, authorization, RBAC · 05.02 — Pod security · 05.03 — Supply chain security · 05.04 — Secrets and cluster hardening · 08.04 — Multi-tenancy and namespaces · 09.01 — Bookstore end-to-end · 10.03 — Cloud identity for workloads · 11.01 — Admission webhooks · 11.04 — Service mesh · 11.05 — Secrets at scale · 13.04 — Real auth: Keycloak OIDC + IRSA + Istio JWT · 13.06 — Payments and event sourcing · 13.07 — Edge: Istio Gateway + Coraza WAF + per-tenant rate limiting · 14.12 — Supply chain security in production · 14.13 — Runtime defense & container security · 15.03 — Image signing and provenance in CI · 15.05 — Production secrets: Vault + ESO + rotation · 15.09 — Hotfix workflow and breakglass

slo (7) — 06.01 — Observability: metrics · 06.05 — Reliability and disruptions · 11.07 — Chaos engineering · 11.09 — Performance and scalability · 13.09 — Observability: OpenTelemetry traces + Loki logs + Prometheus metrics + Grafana dashboards · 13.12 — Day-2: runbook + on-call playbook + DR drill + chaos game-day · 15.06 — Progressive delivery in production

stateful (8) — 01.05 — StatefulSets · 03.05 — Stateful data patterns · 08.02 — Backup and disaster recovery · 08.05 — Operators and CRDs · 10.05 — Cloud storage and data · 12.04 — Distributed training · 13.05 — Search and product discovery · 13.06 — Payments and event sourcing

storage (6) — 03.03 — Volumes · 03.04 — Persistent storage · 03.05 — Stateful data patterns · 10.05 — Cloud storage and data · 14.04 — Storage classes & EBS in production · 14.14 — Backup and restore with Velero

supply-chain (5) — 05.03 — Supply chain security · 07.03 — CI/CD pipeline · 14.12 — Supply chain security in production · 15.02 — Application CI/CD pipelines · 15.03 — Image signing and provenance in CI

T

terraform (5) — 10.02 — Provisioning and infrastructure-as-code · 14.01 — Production-grade Terraform state · 14.02 — EKS cluster lifecycle · 14.03 — EKS add-on management discipline · 14.07 — Infrastructure CI/CD + drift detection

troubleshooting (2) — 04.01 — The scheduler and nodes · 08.03 — Troubleshooting playbook

V

vault (2) — 11.05 — Secrets at scale · 15.05 — Production secrets: Vault + ESO + rotation

velero (3) — 08.02 — Backup and disaster recovery · 14.14 — Backup and restore with Velero · 15.07 — Rollback playbook


Section 4 — Topic-driven reading paths

Each path below is an ordered chapter list focused on one goal. The "why" column is the load-bearing reason for that chapter's place in the path; if you already know that material, skim and move on. The estimated time assumes a working developer with general programming experience reading carefully and trying the examples — not just reading prose.

These paths overlap. A chapter that appears in three different paths gets a different reason in each — read what's relevant to the path you're on.

Path 1 — Zero to Kubernetes (5-day intensive, ~25 hours)

The minimum coherent path that gets a working developer to "I can deploy and operate a real app on Kubernetes." Covers Parts 00-04 only; deliberately defers security depth, delivery, and day-2.

# Chapter Why here
1 00.01 — Why Kubernetes The problem K8s solves; declarative vs imperative
2 00.02 — Containers and images The unit underneath everything
3 00.03 — Architecture overview The components you'll be talking to
4 00.06 — The declarative API model Spec/status; the one mental model
5 00.07 — Local cluster setup kind cluster + kubectl basics
6 01.01 — Pods The unit you actually run
7 01.02 — Health and lifecycle Probes — without them, nothing is production-real
8 01.03 — Resources and QoS Requests/limits keep noisy neighbors off you
9 01.04 — ReplicaSets and Deployments Self-healing + rolling updates + rollback
10 01.07 — Jobs and CronJobs Batch and scheduled work
11 02.01 — The networking model IP-per-pod and where it comes from
12 02.02 — Services Stable in-cluster endpoints
13 02.03 — DNS and service discovery How service names actually resolve
14 02.04 — Ingress Reach the app from outside
15 03.01 — ConfigMaps Externalize configuration
16 03.02 — Secrets Externalize credentials (with caveats)
17 03.03 — Volumes Ephemeral data, downward API
18 03.04 — Persistent storage PV/PVC/StorageClass — surviving pod death
19 04.01 — The scheduler and nodes Why your pod landed where it did

You'll be ready to: deploy a multi-pod web app with persistent storage, expose it over HTTPS, and reason about why it's healthy or broken.

Path 2 — GitOps end-to-end (~12 hours)

How code reaches production via Git. Covers packaging, the GitOps loop, multi- environment promotion, progressive delivery, and the cloud-bootstrap variant.

# Chapter Why here
1 07.01 — Packaging with Helm Templated charts, the most common packaging
2 07.02 — Packaging with Kustomize Overlay-based packaging; pairs naturally with GitOps
3 07.03 — CI/CD pipeline The build/test/push side of the pipeline
4 07.04 — GitOps with Argo CD Argo CD pulls from Git, reconciles against cluster
5 07.05 — Progressive delivery Argo Rollouts: canary, blue-green, analysis
6 14.10 — GitOps bootstrap on a fresh EKS cluster The chicken-and-egg of bootstrapping Argo itself
7 15.01 — The PR-to-production lifecycle Where the GitOps loop fits in the daily PR cycle
8 15.04 — Multi-environment promotion Promoting one artifact across dev/staging/prod
9 15.06 — Progressive delivery in production Production-grade canaries with SLO-driven analysis
10 15.07 — Rollback playbook When the canary fails: undo via Git
11 13.11 — Backstage developer portal Self-service GitOps for an internal team

You'll be ready to: wire an app from git push to production with progressive rollout, automatic SLO-based rollback, and zero kubectl on the human side.

Path 3 — Cost optimization (~10 hours)

Where money leaks in a Kubernetes platform and how to plug each leak. Reading roughly cheapest-fix-first to hardest-fix-last.

# Chapter Why here
1 06.06 — Capacity and cost The model: requests vs usage vs allocatable
2 06.04 — Autoscaling HPA/VPA — right-sizing pods, not nodes
3 10.06 — Node autoscaling, cost & multi-cloud Karpenter vs cluster-autoscaler, spot, multi-cloud
4 14.05 — Logging & metrics cost discipline The observability bill is bigger than you think
5 14.06 — Cost guardrails ResourceQuotas, budget alerts, policy
6 14.08 — VPC endpoints & egress economics NAT and cross-AZ are silent line items
7 14.09 — ARM/Graviton on EKS 20-40% off node bill, with caveats
8 14.11 — Multi-region active-active: cloud reality Multi-region doubles much of the bill; here's how
9 12.02 — GPUs and accelerators GPU bin-packing, MIG, time-slicing
10 12.08 — ML platform, cost, and MLOps ML-specific cost lever: training vs serving
11 13.10 — OpenCost per-tenant FinOps The chargeback / showback story

You'll be ready to: know where every dollar in your K8s bill goes and have the levers to push at least three of them.

Path 4 — Security top-to-bottom (~14 hours)

From "what an authenticated request looks like" to "the cluster survives a compromised pod." Reading lowest-layer-first.

# Chapter Why here
1 05.01 — Authentication, authorization, RBAC The "who can do what" model
2 05.02 — Pod security PSA restricted, dropped capabilities, seccomp
3 05.03 — Supply chain security SBOM, vulnerability scanning in CI
4 05.04 — Secrets and cluster hardening KMS, etcd-at-rest, the audit log
5 02.06 — Network policies Default-deny, egress controls
6 11.01 — Admission webhooks Policy at admission: Kyverno/Gatekeeper patterns
7 11.04 — Service mesh mTLS, request-level authz, traffic policy
8 11.05 — Secrets at scale Vault, ESO, the rotation story
9 13.04 — Real auth: Keycloak + IRSA + Istio JWT End-user OIDC + workload identity together
10 14.12 — Supply chain security in production Cosign signing + verification webhook
11 14.13 — Runtime defense & container security Falco, eBPF-based runtime detection
12 15.03 — Image signing and provenance in CI The CI side of cosign
13 15.05 — Production secrets: Vault + ESO + rotation The day-2 ops on secret rotation

You'll be ready to: answer a CKS-level question on any of authn, authz, supply chain, runtime, or secrets — and explain why the controls compose.

Path 5 — Machine learning on Kubernetes (~16 hours)

The full ML-on-K8s arc. Read in order; later chapters assume the earlier ones.

# Chapter Why here
1 12.01 — Why ML on Kubernetes The case (and the limits) of K8s for ML
2 04.02 — Affinity, taints, and topology Required for GPU scheduling
3 12.02 — GPUs and accelerators Device plugin, MIG, time-slicing
4 12.03 — Batch and gang scheduling Kueue, JobSet, why default scheduler doesn't fit ML
5 12.04 — Distributed training Training-operator, PyTorch/TF jobs
6 12.05 — Notebooks and interactive ML JupyterHub, idle-shutdown, multi-tenancy
7 12.06 — Model serving and inference KServe, auto-scaling for inference
8 12.07 — ML pipelines and workflows Argo Workflows, Kubeflow Pipelines
9 12.08 — ML platform, cost, and MLOps The full MLOps loop on K8s
10 14.09 — ARM/Graviton on EKS Cost lever specific to large inference fleets
11 13.08 — Real ML loop on bookstore-v2 The end-to-end capstone: train→registry→serve→drift→retrain

You'll be ready to: stand up a Kubeflow-style ML platform on K8s — train, serve, monitor drift, and re-train — and explain to a finance team where the money goes.

Path 6 — Day-2 operations (~14 hours)

What an SRE or on-call engineer does once the platform is built. Reading is chronological — from "cluster exists" through "it caught fire at 3am."

# Chapter Why here
1 08.01 — Cluster lifecycle Cluster upgrades, drain, node lifecycle
2 08.02 — Backup and DR Velero, restore drills, RPO/RTO
3 08.03 — Troubleshooting playbook The structured debugging approach
4 08.04 — Multi-tenancy and namespaces Quota, isolation, the noisy-neighbor problem
5 08.05 — Operators and CRDs Operating operators, not just writing them
6 11.07 — Chaos engineering Practiced failure beats unpracticed
7 13.12 — Day-2 runbook + on-call + DR drill The bookstore-v2 runbook (concrete worked example)
8 14.14 — Backup and restore with Velero Cloud-specific Velero (S3 + EBS snapshots)
9 14.17 — Cross-region DR + 90-day runbook The big-picture DR + production-readiness checklist
10 15.09 — Hotfix workflow and breakglass When normal GitOps is too slow
11 15.10 — Incident response & on-call PagerDuty, severity, the IC role
12 15.11 — Day-to-day production operations The non-incident steady-state
13 15.12 — Capstone: the first 90 days The 90-day plan to inherit a running platform

You'll be ready to: be on-call for a production K8s platform, run a DR drill quarterly, and write postmortems that nobody hates.

Path 7 — Cloud (AWS) specifically (~18 hours)

The full EKS-and-AWS arc. Skip Part 10 if you only care about AWS; Part 14 is AWS-specific top to bottom.

# Chapter Why here
1 10.01 — The managed Kubernetes model What "managed" actually means; cross-cloud baseline
2 10.02 — Provisioning and infrastructure-as-code Terraform vs cdk8s vs Pulumi tradeoffs
3 10.03 — Cloud identity for workloads IRSA / Workload Identity / AAD-pod-identity
4 10.04 — Cloud networking and load balancing NLB vs ALB; VPC-CNI; Cilium on cloud
5 10.05 — Cloud storage and data EBS vs EFS vs S3 + CSI drivers
6 10.06 — Node autoscaling, cost & multi-cloud Karpenter on EKS specifically
7 14.01 — Production-grade Terraform state The state-management story for IaC at scale
8 14.02 — EKS cluster lifecycle EKS upgrade choreography (control plane + nodes)
9 14.03 — EKS add-on management discipline The matrix of EKS add-ons and how to version them
10 14.04 — Storage classes & EBS in production Per-AZ EBS gotchas, gp3 sizing
11 14.07 — Infrastructure CI/CD + drift detection Atlantis-style TF + drift alerts
12 14.08 — VPC endpoints & egress economics NAT is expensive; private endpoints aren't free
13 14.10 — GitOps bootstrap on a fresh EKS cluster Argo bootstrap pattern
14 14.11 — Multi-region active-active: cloud reality What multi-region actually costs and breaks
15 14.13 — Runtime defense & container security Falco + GuardDuty + the runtime story
16 14.15 — Cilium / eBPF on EKS Cilium chaining or replacing VPC-CNI
17 14.16 — Developer experience for K8s teams What "good DX" on EKS looks like
18 14.17 — Cross-region DR + 90-day runbook The full AWS production-readiness checklist

You'll be ready to: run a multi-region EKS platform end-to-end — Terraform → cluster → workloads → cost guardrails → DR — and explain every AWS line item on the bill.

Path 8 — Platform engineering for an internal team (~10 hours)

What "the platform team" actually builds for application teams. The golden-path / IDP story.

# Chapter Why here
1 08.04 — Multi-tenancy and namespaces The isolation primitive your platform leans on
2 11.01 — Admission webhooks Policy is how you make defaults stick
3 11.02 — Operator development When CRDs beat YAML templates
4 11.10 — Platform engineering The framing chapter: golden paths, paved roads
5 13.02 — Tenancy and Crossplane onboarding Self-service tenant provisioning via Crossplane
6 13.11 — Backstage developer portal The UI on top of the platform
7 14.06 — Cost guardrails Quotas and budget alerts are platform features
8 14.16 — Developer experience for K8s teams What "good DX" looks like; Skaffold/Tilt/Telepresence
9 15.01 — The PR-to-production lifecycle The user-visible interface of the platform
10 15.12 — Capstone: the first 90 days What "platform success" looks like 90 days in

You'll be ready to: define a golden path for an internal application team — onboarding, paved-road deploy, observability defaults, cost visibility — and ship it as a product, not a side-project.

Path 9 — CKAD exam preparation (~20 hours)

Chapters mapped to the CKAD curriculum domains. Read in order; the exam tests hands-on speed, so try every example.

# Chapter Why here (CKAD domain)
1 00.06 — The declarative API model Apply / labels / selectors (every domain)
2 00.07 — Local cluster setup kubectl context/imperative — exam speed
3 01.01 — Pods App design (~20%); multi-container, init
4 01.02 — Health and lifecycle App observability (~15%); probes
5 01.03 — Resources and QoS App design; resource requests/limits
6 01.04 — ReplicaSets and Deployments App deployment (~20%); rolling/rollback
7 01.07 — Jobs and CronJobs App deployment; batch
8 01.08 — Deployment strategies App deployment; blue-green/canary
9 02.02 — Services Services & networking (~20%)
10 02.03 — DNS and service discovery Services & networking
11 02.04 — Ingress Services & networking
12 02.06 — Network policies Services & networking
13 03.01 — ConfigMaps App environment, config (~25%)
14 03.02 — Secrets App environment, config
15 03.03 — Volumes App environment, config
16 03.04 — Persistent storage App environment, config
17 06.01 — Observability: metrics App observability
18 08.03 — Troubleshooting playbook Exam survival skill: kubectl describe/logs/events

You'll be ready to: sit the CKAD exam and finish in time. The hands-on speed comes from drilling — read this guide, then time yourself on Killer.sh.

Path 10 — CKS exam preparation (~18 hours)

Chapters mapped to the CKS curriculum. CKS is hands-on and heavy on Parts 05, 11, and 14. Read in order; do every lab.

# Chapter Why here (CKS domain)
1 00.04 — Control plane deep dive Cluster setup (~10%); apiserver flags
2 00.05 — Node components Cluster setup; kubelet, containerd
3 05.01 — Authn, authz, RBAC Cluster hardening (~15%); RBAC
4 05.02 — Pod security System hardening (~15%); PSA, capabilities
5 05.03 — Supply chain Supply chain security (~20%); SBOM, scanners
6 05.04 — Secrets & cluster hardening Cluster hardening; KMS, etcd-at-rest, audit
7 02.06 — Network policies Cluster hardening; default-deny
8 11.01 — Admission webhooks Cluster hardening; OPA/Kyverno/ImagePolicyWebhook
9 11.04 — Service mesh Microservice vulnerabilities (~20%); mTLS
10 11.05 — Secrets at scale Microservice vulnerabilities; Vault/ESO
11 14.12 — Supply chain security in production Supply chain; cosign + verification
12 14.13 — Runtime defense & container security Runtime security (~20%); Falco, sandboxing
13 15.03 — Image signing and provenance in CI Supply chain; CI side
14 15.05 — Production secrets: Vault + ESO + rotation Microservice vulnerabilities; rotation drills
15 08.03 — Troubleshooting playbook Required for exam-time debugging

You'll be ready to: sit the CKS exam. The exam tests speed and recall — drill the kubectl + auditctl + falco invocations until they're muscle memory.


Section 5 — How to extend this appendix

This appendix is currently maintained by hand, but the heavy lifting (the tag index in Section 3) is mechanical. The intent is to migrate Section 3 to a small generator script in a future phase.

Adding a tag to a new chapter

Each chapter file has an HTML comment block near the top with structured metadata, including a tags: line. The pattern is:

# 14.18 — Some new chapter title

<!-- chapter: 14.18 -->
<!-- part: 14-eks-in-production-a-to-z -->
<!-- tags: cloud, eks, day-2, your-new-tag -->
<!-- estimated_reading_minutes: 45 -->
<!-- prerequisites: 14.10, 14.17 -->

Rules:

  • One tags: comment per chapter.
  • Tags are comma-separated, lowercase, hyphen-not-underscore (secrets-rotation, not secrets_rotation).
  • Prefer reusing an existing tag from the list in Section 3 to keep the index dense. A new singleton tag is acceptable for genuinely novel material, but if a broader tag would also apply, add both.
  • Tags are descriptive, not aspirational — a chapter with one paragraph on Argo CD should not be tagged argo-cd.

Regenerating the tag index (manual recipe)

Until a generator script exists, the tag index in Section 3 is regenerated by:

  1. Walk every full-guide/**/*.md file.
  2. Extract the <!-- tags: ... --> comment.
  3. Build a tag -> [chapter, chapter, ...] map.
  4. For each tag with ≥ 2 chapters, emit a bullet line in the **\tag`** (N) — [PP.CC — Title]+() · ...` format.
  5. Group bullets by first letter; sort within each group alphabetically.

A 30-line Python script can do this. When CI gates the appendix (a future phase), the script would run pre-merge and fail the build on stale content.

Adding a new reading path

A path in Section 4 has four parts:

  1. Name### Path N — <one-line goal>
  2. Intro — 1-2 sentences setting the audience and scoping decision.
  3. Ordered chapter table| # | Chapter | Why here | with a one-line reason per chapter (load-bearing reason, not "because it exists").
  4. Outcome line — exactly one sentence beginning "You'll be ready to:".

Keep paths opinionated. A path that lists 50 chapters with no scoping decisions has no value over the table of contents.

Updating the concept map

The Mermaid diagram in Section 2 should be edited when:

  • A new Part adds genuinely new concepts (not just chapter-level additions).
  • An edge becomes wrong (a chapter is rewritten and a "builds on" relationship reverses).
  • The map grows past ~50 nodes — at which point split into per-Part submaps.

Do not try to make the map exhaustive. The diagram earns its place by being legible; cross-references that aren't load-bearing belong in chapter text, not in the map.