Certification Courses
Want a structured paid program with certificate and roadmap?
Instrument services, collect metrics, and build actionable dashboards and alerting rules.
These are the tools you use in this track and what each tool does in production workflows.
Observability and SRE
Prometheus
Time-series metrics collection and alert query engine.
Observability and SRE
Grafana
Dashboarding and visualization for metrics and operations health.
Observability and SRE
Alertmanager
Routing, grouping, and escalation of monitoring alerts.
For each tool area: what it is, types, when to use, why it matters, and how to apply it.
Monitoring and Observability
What: Observability combines metrics, logs, and traces for system health and debugging.
Why: Good observability shortens MTTD/MTTR and improves service reliability.
Types
When to use
How to use
What is a Docker image?
Images package app and dependencies.
What is a Docker container?
Containers run isolated processes from images.
Why keep container images small?
Smaller images improve speed and security.
What does docker build do?
docker build packages instructions into image.
What is the purpose of .dockerignore?
It reduces context size and accidental leaks.
What is Kubernetes Deployment?
Deployment handles stateless rollout orchestration.
What is a Kubernetes Pod?
Pod groups tightly coupled containers.
What is a Kubernetes Service?
Services provide discovery and load-balancing.
What does kubectl get pods do?
kubectl queries cluster resources.
What is ConfigMap used for?
ConfigMaps externalize application config.
What is Secret in Kubernetes?
Secrets handle sensitive configuration material.
What is Horizontal Pod Autoscaler (HPA)?
HPA adjusts replica count by load.
What is a liveness probe?
Liveness detects deadlocked/unhealthy processes.
What is a readiness probe?
Readiness gates traffic until app is ready.
What is rolling update in Kubernetes?
Rolling updates minimize downtime.
What is DaemonSet used for?
DaemonSets deploy node-level services.
What is StatefulSet for?
StatefulSets support ordered, sticky identity.
Why use private container registry?
Private registries protect proprietary artifacts.
What does imagePullPolicy IfNotPresent do?
It reduces unnecessary image pulls.
What is kube namespace useful for?
Namespaces separate teams/environments.
What is Infrastructure as Code (IaC)?
IaC makes infra reproducible and reviewable.
Why use Terraform in DevOps?
Terraform is a widely used IaC tool.
What does terraform plan show?
Plan helps review infra delta safely.
What is remote state in Terraform used for?
Remote state prevents local drift/conflicts.
What is cloud autoscaling?
Autoscaling optimizes cost and performance.
What is a VPC?
VPC provides private networking boundaries.
Why use IAM roles/policies?
IAM controls access securely.
What is object storage commonly used for?
Object stores are durable and scalable.
What is cloud region selection based on?
Region impacts performance and regulations.
What is disaster recovery in cloud context?
DR includes backup, replication, and restore drills.