Andrey is an experienced Engineer with 18+ years of industry expertise and know-how in the provision of diverse IT approaches, infrastructure and solutions for the cloud, containerization, Linux and DevOps. He has the technical wherewithal to manage large DevOp infrastructure; having provided 100% uptime for 300+ VMs in AWS, handled large sharded databases (20+ servers and 30+ Tb of Data), and ELK to inject over 1M of entries per hour.
Performed Kubernetes cluster management with 7 on-premises clusters on CoreOS (up to 200+ nodes), providing uptime and supporting k8s infrastructure, configuring and monitoring for k8s clusters.
Created a log management system based on FluentD and ElasticSearch with 4 ES Clusters (up to 25 nodes per cluster). Configured advanced logs parsing and deployed monitoring tools for logging.
Configured a monitoring system based on Prometheus, Grafana, NewRelic and PagerDuty. Managed AWS resources (EC2, VPC, S3, AWS ES, R53, Athena and etc.) and ES via Terraform, Ansible and Python
Synthesized and maintained the operational efficiency of the SaaS infrastructure keeping 100% uptime for 4 weeks in a row on release deployments for the client.
Optimized hardware infrastructure for efficiency; reducing monthly operational costs by $200K.
Implemented a logs management solution that injected 1 million records per hour and migrated large shared database (30+ Tb) on the infrastructure.
Chute is a holistic user-generated-content (UGC) artificial intelligence (AI) SaaS platform for digital marketers providing UGC management, social image search, social media rights management, contesting, and analytics. It can be custom-configured with various modules to meet specific requirements. The platform runs on AWS (300+ Linux Servers). Created the automatic deployment procedures for all components and made changes to the components’ code to support the non-production environment and monitoring processes. Developed a Logs Management system that collected all logs from all components (1M of entries/hour), migrated several databases (10+ Tb of data), updated several legacy databases (30+ Tb of data); covered all components with monitoring and auto backup, made all components fault-tolerant, built the staging environment to test all deployments before deploying to production. It is used by 100+ Customers like Coca-Cola, Apple (Apple’s marketing campaign “Shot on an iPhone” is running on that platform), Carlton Hotels and others. The platform collects around 0.5 Tb of data from social networks per month.
The Placeable solution includes two SaaS MarTech products: Placeable Workbench ™ and Placeable Pages ™. Placeable Workbench is a location data management and distribution platform, whereas Placeable Pages is a customizable locator for local landing pages. The solution runs on AWS (200+ Linux Servers). Migrated all the components from Apache Mesos to Kubernetes; without any serious service impact - dockerized all components and created the documentation. It is used by 100+ customers like Western Union, Nationwide bank, Biglots and others.
The MessageOne solution provides email management, archiving, and business continuity services and recovery only available from a managed service. The solution is running on AWS; with dockerized components. Fixed alarms on the solution to prevent service disruptions created the documentation on the project.
Education
AWS Certified Solution Architect
AWS
2017 - 2019 (2 years)
MBA, IT Management
The Higher School of Economics (HSE)
2013 - 2014 (1 year)
Research in Information Security
Moscow Engineering Physics Institute (MEPhI), Faculty of Cybernetics, Department of Computer Systems and Technologies
2003 - 2006 (3 years)
MSc. Computer Science
Moscow Engineering Physics Institute (MEPhI), Faculty of Cybernetics, Department of Computer Systems and Technologies