● Built and automated CI/CD pipelines, reducing deployment time by 40–60% and minimizing manual intervention.
● Implemented monitoring and alerting using Prometheus, Thanos, and Grafana, reducing incident detection time by ~50%.
● Deployed and maintained observability platforms (Grafana Mimir, Pyroscope), enabling better visibility into system performance.
● Automated infrastructure and operational tasks using Python and Bash, reducing repetitive manual work by ~70%.
● Integrated SonarQube into CI pipelines, broadening static code analysis coverage and reducing code issues.
● Optimized AWS resource usage, reducing infrastructure costs by 15–25%.
● Troubleshot production issues across distributed systems and improved system stability.