Ravi V.

About Me

Senior Site Reliability Engineer with 9+ years of experience designing, automating, and managing highly available cloud infrastructures on AWS and Azure. R. V. is an AI-fluent SRE with expertise in end-to-end observability, Kubernetes orchestration, Infrastructure as Code, and CI/CD automation. Proven track record of delivering $1.5M+ in annual cost savings through infrastructure optimization and cloud migration, with demonstrated ability to reduce MTTR by 60% and eliminate 500+ hours per year of operational toil.

AI, ML & LLM

Backend

Database

DevOps

Workflow

GitOps Git GitHub Actions

Other

Work history

Ericsson
Senior Site Reliability Engineer
2024 - 2026 (2 years)
Remote
  • Architected end-to-end observability platform using Prometheus, Grafana, and Dynatrace (APM, Synthetic, RUM) for mission-critical telecom applications serving millions of subscribers

  • Managed and optimized complex AKS clusters through right-sizing, HPA tuning, pod disruption budgets, and resource quota enforcement with 99.95%+ SLA adherence

  • Implemented GitOps workflows with ArgoCD and Istio service mesh enabling fully automated, auditable Kubernetes deployments with zero-trust inter-service security

Claude API Claude code Google Gemini Prompt Engineering PrometheusGrafanadynatraceAKS TerraformGitLab CI/CD ArgoCD IstioKubernetes
Adobe Systems India
Site Reliability Engineer
2021 - 2024 (3 years)
Remote
  • Spearheaded on-premise to multi-cloud migration (AWS & Azure), achieving $1.4M in annual cost savings via right-sizing and load balancer consolidation

  • Led 6-member Escalation Response Team establishing runbooks and PagerDuty workflows, reducing MTTR by 60% and escalations by 45%

  • Automated PostgreSQL administration and containerized legacy monolith services with Docker and EKS, reducing infrastructure costs by 20%

AWSAzure PostgreSQLPythonBashAnsiblePagerDuty JenkinsGitHub Actions DockerEKSVPCIAMCloudFormation
Akal Information Systems (Client: CBIC)
Deployment Engineer
2020 - 2021 (1 year)
Remote
  • Administered 200+ RHEL servers supporting 45,000+ concurrent users for the national e-Office application

  • Developed Bash/KornShell automation for log rotation, backup and deployment, reducing manual operations by 60%

  • Configured Apache HTTP with reverse proxy and load balancing handling 10,000+ daily requests

Jump Box Pvt. Ltd.
System Administrator & Cloud Automation Engineer
2019 - 2020 (1 year)
Remote
  • Automated AWS provisioning (EC2, S3, RDS, IAM, VPC) with CloudFormation and Terraform, improving deployment speed by 70%

  • Developed Python/Bash scripts for security patching, health monitoring, and compliance reporting across multi-account AWS

  • Managed infrastructure automation and deployment optimization for cloud environments

Fujiyama Pvt. Ltd.
Engineer
2016 - 2019 (3 years)
Remote
  • Provided L1/L2 support for 50+ Linux servers in VMware environment, maintaining 99.9% uptime SLA

  • Developed Bash scripts for health monitoring, disk alerting, and log rotation, reducing manual checks by 50%

  • Managed server infrastructure and performance optimization

Education

Education
Bachelor of Technology (B.Tech)
Northern India Engineering College, GGSIPU Delhi
2012 - 2016 (4 years)
Education
AWS Fundamentals: Addressing Security Risk
Amazon Web Services
Education
Google IT Professional Certificate
Google
Education
AWS Certified Cloud Practitioner
Amazon Web Services
Education
Red Hat Certified System Administrator RHCSA
Education
Gemini Certified Educator
Google
Education
AI Fluency for Educators
Anthropic
2026
Education
Claude 101
Anthropic
2026
Education
Claude Code in Action
Anthropic
2026
Education
Claude with the Anthropic API
Anthropic
2026