Ismail is a DevOps and Site Reliability Engineer with 10+ years of work experience automating and scaling cloud infrastructure. He started his career in networks and server management and has worked for well-established companies like Ericsson, Huawei, and McAfee. With a strong background in infrastructure engineering, Ismail is dedicated to optimizing software development processes, improving systems' reliability, and ensuring seamless deployment and operations. He is an AWS, Cisco, and Red Hat Certified professional keen on working with systems architecture and diverse DevOps and SRE tools.
Working with the development team to architect and deploy new services.
Collaborating with development teams to define and implement efficient CI/CD pipelines using Docker BuilderKit and dependency caching, reducing deployment time by 40%.
Collaborating with cross-functional teams enforcing resource tagging and leveraging Goldilocks and OpenCost to reduce cloud cost by 20%.
Helping teams adopt IaC framework for Prometheus/Datadog SLI and SLOs using Kubernetes Operators pattern.
Assisting developers with infrastructure-related issues and helping set up monitoring and alerting.
Building templates and patterns for new services using Helm charts, Terraform modules, and Python.
Leading incident response efforts, investigating critical issues, and implementing preventive measures to minimize system downtime.