This project demonstrates a complete Kubernetes Monitoring and Alerting System built using industry-standard tools.
The system continuously monitors cluster resources such as CPU, memory, and pod metrics, visualizes them using Grafana dashboards, and triggers alerts when defined thresholds are exceeded.
Alerts are automatically sent to a Slack channel in real time using webhooks, enabling fast incident response.
Kubernetes Cluster (Minikube)
│
▼
Prometheus (Metrics Collection)
│
▼
Grafana (Visualization + Alerting)
│
▼
Alert Rule Evaluation (PromQL)
│
▼
Notification Policy
│
▼
Slack Webhook (HTTP POST)
│
▼
Slack Channel (#alerts)
- Kubernetes (Minikube) – Local cluster setup
- Prometheus – Metrics collection and storage
- Grafana – Visualization and alerting
- Helm – Package manager for Kubernetes
- Slack Webhooks – Real-time alert notifications
- Stress Tool – Load generation for testing alerts
minikube startsudo snap install helm --classichelm repo add prometheus-community https://prometheus-community.github.io/helm-charts
helm repo updatehelm install monitoring prometheus-community/kube-prometheus-stackkubectl get pods -Akubectl port-forward svc/monitoring-grafana 3000:80👉 Open in browser: http://localhost:3000
kubectl get secret monitoring-grafana -o jsonpath="{.data.admin-password}" | base64 --decode- Create Slack channel (#alerts)
- Create Slack App
- Enable Incoming Webhooks
- Generate Webhook URL
-
Go to: Alerting → Contact Points
-
Add new contact point:
- Type: Slack
- Paste Webhook URL
- Go to: Alerting → Notification Policies
- Attach Slack contact point
Example condition:
sum(rate(container_cpu_usage_seconds_total[1m])) > 0.05kubectl run stress --image=progrium/stress -- stress --cpu 2 --timeout 60Application Load Increases
│
▼
Prometheus Collects Metrics
│
▼
Grafana Evaluates PromQL Query
│
▼
Condition Becomes TRUE
│
▼
Alert State = FIRING
│
▼
Grafana Sends HTTP Request (Webhook)
│
▼
Slack Receives Request
│
▼
Message Displayed in Channel (#alerts)
- Implemented real-time monitoring in Kubernetes
- Understood Prometheus metrics and scraping
- Built dashboards using Grafana
- Designed alert rules using PromQL
- Integrated Slack for automated alerting
- Simulated real-world incident scenarios
- Email alert integration
- Deployment on cloud (AWS EKS / GKE)
- Custom dashboards
- Alert severity levels (warning, critical)
This project demonstrates a complete end-to-end monitoring and alerting pipeline, similar to real-world DevOps and SRE environments.
It provides a strong foundation in observability, alerting, and incident response systems.


