The Bookstore Guide — Kubernetes from Zero to Production
Bookstore
Initializing search
abd-ulbasit/bookstore-kubernetes-guide
The Bookstore Guide — Kubernetes from Zero to Production
About this guide
00 foundations
01 core workloads
02 networking
03 config and storage
04 scheduling
05 security
06 production readiness
07 delivery
08 day 2 operations
09 end to end bookstore
10 cloud and managed kubernetes
11 advanced production patterns
12 kubernetes for machine learning
13 grand capstone bookstore platform
14 eks in production a to z
15 day to day production ops
Appendix
Examples
The Bookstore Guide — Kubernetes from Zero to Production
abd-ulbasit/bookstore-kubernetes-guide
The Bookstore Guide — Kubernetes from Zero to Production
About this guide
00 foundations
00 foundations
01 — Why Kubernetes
02 — Containers and images
03 — Architecture overview
04 — Control plane deep dive
05 — Node components
06 — The declarative API model
07 — Local cluster setup
01 core workloads
01 core workloads
01 — Pods
02 — Health and lifecycle
03 — Resources and QoS
04 — ReplicaSets and Deployments
05 — StatefulSets
06 — DaemonSets
07 — Jobs and CronJobs
08 — Deployment strategies
02 networking
02 networking
01 — The networking model
02 — Services
03 — DNS and service discovery
04 — Ingress
05 — Gateway API
06 — Network policies
03 config and storage
03 config and storage
01 — ConfigMaps
02 — Secrets
03 — Volumes
04 — Persistent storage
05 — Stateful data patterns
04 scheduling
04 scheduling
01 — The scheduler and nodes
02 — Affinity, taints, and topology
03 — Priority and preemption
05 security
05 security
01 — Authentication, authorization, RBAC
02 — Pod security
03 — Supply chain security
04 — Secrets and cluster hardening
06 production readiness
06 production readiness
01 — Observability: metrics
02 — Logging
03 — Tracing
04 — Autoscaling
05 — Reliability and disruptions
06 — Capacity and cost
07 delivery
07 delivery
01 — Packaging with Helm
02 — Packaging with Kustomize
03 — CI/CD pipeline
04 — GitOps with Argo CD
05 — Progressive delivery
08 day 2 operations
08 day 2 operations
01 — Cluster lifecycle
02 — Backup and disaster recovery
03 — Troubleshooting playbook
04 — Multi-tenancy and namespaces
05 — Operators and CRDs
09 end to end bookstore
09 end to end bookstore
01 — Bookstore end-to-end
10 cloud and managed kubernetes
10 cloud and managed kubernetes
01 — The managed Kubernetes model
02 — Provisioning and infrastructure-as-code
03 — Cloud identity for workloads
04 — Cloud networking and load balancing
05 — Cloud storage and data
06 — Node autoscaling, cost & multi-cloud
11 advanced production patterns
11 advanced production patterns
01 — Admission webhooks
02 — Operator development
03 — API Priority and Fairness
04 — Service mesh
05 — Secrets at scale
06 — Multi-cluster and fleet
07 — Chaos engineering
08 — HA control plane and etcd
09 — Performance and scalability
10 — Platform engineering
12 kubernetes for machine learning
12 kubernetes for machine learning
01 — Why ML on Kubernetes
02 — GPUs and accelerators
03 — Batch and gang scheduling
04 — Distributed training
05 — Notebooks and interactive ML
06 — Model serving and inference
07 — ML pipelines and workflows
08 — ML platform, cost, and MLOps capstone
13 grand capstone bookstore platform
13 grand capstone bookstore platform
13.01 — Bookstore 2.0: from toy to platform
13.02 — Tenancy model and onboarding via Crossplane
13.03 — Multi-region active-active
13.04 — Real auth: Keycloak OIDC + IRSA + Istio JWT
13.05 — Search and product discovery
13.06 — Payments and event sourcing
13.07 — Edge: Istio Gateway + Coraza WAF + per-tenant rate limiting
13.08 — Real ML loop: training -> registry -> serving -> drift -> retrain
13.09 — Observability: OpenTelemetry traces + Loki logs + Prometheus metrics + Grafana dashboards
13.10 — Cost: OpenCost per-tenant, per-cluster, per-region
13.11 — Developer portal: Backstage scaffolder + software catalog + tech docs
13.12 — Day-2: runbook + on-call playbook + DR drill + chaos game-day
14 eks in production a to z
14 eks in production a to z
14.01 — Production-grade Terraform state
14.02 — EKS cluster lifecycle
14.03 — EKS add-on management discipline
14.04 — Storage classes & EBS in production
14.05 — Logging & metrics cost discipline
14.06 — Cost guardrails
14.07 — Infrastructure CI/CD + drift detection
14.08 — VPC endpoints & egress economics
14.09 — ARM/Graviton on EKS
14.10 — GitOps bootstrap on a fresh EKS cluster
14.11 — Multi-region active-active: cloud reality
14.12 — Supply chain security in production
14.13 — Runtime defense & container security
14.14 — Backup and restore with Velero
14.15 — Cilium / eBPF on EKS
14.16 — Developer experience for Kubernetes teams
14.17 — Cross-region DR + AWS account baseline + 90-day production-readiness runbook
15 day to day production ops
15 day to day production ops
15.01 — The PR-to-production lifecycle
15.02 — Application CI/CD pipelines
15.03 — Image signing and provenance in CI
15.04 — Multi-environment promotion
15.05 — Production secrets: Vault + ESO + rotation
15.06 — Progressive delivery in production
15.07 — Rollback playbook
15.08 — Feature flags and dark launches
15.09 — Hotfix workflow and breakglass
15.10 — Incident response & on-call
15.11 — Day-to-day production operations
15.12 — Capstone: the first 90 days running production
Appendix
Appendix
Appendix A — kubectl cheatsheet
Appendix B — Glossary
Appendix C — YAML and API conventions
Appendix D — Further reading
Appendix E — Learning paths
Appendix F — Concept map, tag index, and reading paths
Examples
Examples
Bookstore
Bookstore
Bookstore
App
App
Helm
Helm
Bookstore
Bookstore
Kustomize
Kustomize
Ml
Ml
Dataset
Dataset
Pipeline
Pipeline
Serve
Serve
Train
Train
Operator
Operator
Operators
Operators
Bookstore — Disaster Recovery Runbook
Bookstore platform
Bookstore platform
App
App
Auth
Auth
Catalog
Catalog
Events
Events
Orders
Orders
Payments gateway
Payments gateway
Payments worker
Payments worker
Recommendations
Recommendations
Search
Search
Argocd
Argocd
Auth
Auth
Backstage
Backstage
Ci
Ci
Clusters
Clusters
Cost
Cost
Crossplane
Crossplane
Edge
Edge
Feature flags
Feature flags
Integrating a feature-flag SDK in a Bookstore Go service
Helm
Helm
Hotfix
Hotfix
Runbook — hotfix workflow (emergency change procedure)
Runbook — post-incident cleanup after a breakglass session
Incident
Incident
Incident-channel automation — wiring PagerDuty → Slack → Zoom → Status page
On-call handoff — Week of YYYY-MM-DD
Postmortem — \<INCIDENT-TITLE>
Postmortem — Checkout 5xx Spike During Spring Sale (INC-2026-04-15-001)
Incident severity matrix — Bookstore Platform v2
Kafka
Kafka
Kustomize
Kustomize
Ml
Ml
Observability
Observability
Payments
Payments
Platform base
Platform base
Rollback
Rollback
Runbook — code rollback via Argo CD (CODE layer)
Runbook — code rollback via Helm (CODE layer)
Runbook — code rollback via Argo Rollouts (CODE layer)
Runbook — rolling back a config change via Argo CD (CONFIG layer)
Runbook — Postgres point-in-time recovery (DATA layer)
Runbook — restoring an S3 object version (DATA layer)
Runbook — restoring a workload via Velero (DATA layer)
Runbooks
Runbooks
Chaos Game-Day — Quarterly Playbook
On-Call Rotation — Bookstore Platform v2
Postmortem — \<INCIDENT-TITLE>
Runbook — BookstoreCatalogP99Latency (P1)
Runbook — BookstoreCNPGReplicationLag (P1)
Runbook — BookstorePaymentsFailureRate (P1)
Runbook — BookstoreTenantBudgetBreach (P2)
Search
Search
Terraform
Terraform
Drift check
Drift check
Drift Runbook
Multi region
Multi region
Terraform account baseline
Terraform account baseline
Vault
Vault
Bookstore
Back to top