Hey There, I’m Zareen Khan, a passionate and experienced Site Reliability Engineer (SRE) with over 11 years in the software industry. I currently work at a leading technology company where I help build, manage, and optimize large-scale, resilient systems that support mission-critical applications.
My work spans across DevOps, cloud platforms (like AWS), observability, automation, and infrastructure as code. I’ve worked with a wide range of tools, including Terraform, Jenkins, Docker, Kubernetes, Prometheus, and Grafana, helping teams improve reliability, reduce incident response times, and scale with confidence.
I’m based in San Jose, California, and I’m deeply passionate about mentoring and teaching. That passion has led me to create beginner-friendly courses and content that empower aspiring engineers and career changers to confidently enter the world of DevOps and SRE.
In recent years, I’ve also developed a strong interest in AIOps (Artificial Intelligence for IT Operations). I believe that the future of infrastructure lies in combining automation with intelligent insights—using machine learning and GenAI to detect anomalies, predict failures, and accelerate root cause analysis. In my courses and projects, I actively explore how AIOps can transform traditional monitoring and operations workflows into proactive, self-healing systems.
Outside of work, I’m a lifelong learner, a mentor, a community contributor. I believe in building not only reliable systems—but also resilient careers and a strong, supportive learning community.
If you're curious about DevOps, SRE, Cloud, or AIOps—I’d love to help you get started. Let’s learn and grow together.