Swapnil M.

Swapnil M.

Senior Data Engineer

India
Hire Swapnil M. Hire Swapnil M. Hire Swapnil M.

About Me

Swapnil is an AWS Certified Senior Data Engineer with 8+ years of IT experience, proficient in both on-premise and cloud environments, with extensive knowledge of Big Data technologies. He designs and develops data pipelines, focusing on data migration to Databricks and optimization of data workflows, creates ETL pipelines and Python scripts for automating utilities, and manages AWS infrastructure. Swapnil has a strong background in AWS services (EC2, S3, Glue, EMR, Lambda, CloudFormation, Redshift) and batch processing via EMR scripts for large-scale data handling. He is also adept at leading teams, collaborating with stakeholders, and ensuring project delivery while maintaining a strong focus on quality, performance, and data integrity.

Work history

Contract
Contract
Data Warehouse Migration to Databricks
1 - Present (2024 years)
India
  • Led a successful migration and optimization of a data warehouse infrastructure to Databricks, significantly improving data accessibility, performance, and security.

  • Analyzed existing infrastructure and identified bottlenecks for optimization.

  • Designed and implemented a migration strategy from on-premise data sources to Databricks.

  • Developed ETL pipelines to ensure seamless data integration into Databricks.

  • Optimized data models to enhance performance and reduce costs.

  • Integrated Databricks with Jenkins for CI/CD automation.

Contract
Contract
End-to-end Data Cleaning and KPI Generation for Business Analytics
1 - Present (2024 years)
India
  • Developed an end-to-end architecture to clean, process, and aggregate data, generating KPI tables for business analytics and Tableau reporting.

  • Designed the project architecture and data flow for KPI generation.

  • Processed large datasets by cleaning and aggregating data as per business requirements.

  • Automated processes using Jenkins, managed Oozie jobs, and handled deployments.

  • Worked directly with stakeholders for project enhancements and requirements gathering.

Data CleaningKPIsBusiness Analytics Architecture Data ProcessingData Aggregation TableauDatasets JenkinsOozie
Contract
Contract
Centralized Data Hub Migration on AWS
1 - Present (2024 years)
India
  • Migrated multiple AWS-hosted applications to a centralized account and developed a modern data analytics lake, centralizing data from various sources.

  • Analyzed the existing architecture and migrated applications to a centralized AWS account.

  • Implemented and updated code in Scala and Python for data migration and ensured proper data permissions through Lake Formation.

  • Created and optimized AWS Glue jobs, Spark jobs on EMR, and automated infrastructure management via CloudFormation.

Data MigrationAWSData Analytics Data Lake Design Data CentreArchitecture ScalaPythonAWS Lake Formation AWS GlueSparkAWS EMRAWS CloudFormation
Contract
Contract
Big Data Engineering for a Credit Reporting Agency
1 - Present (2024 years)
India
  • Built a data lake for storing and processing credit reporting data, enabling large-scale analytics and migrating existing credit scoring modules from legacy systems to Hadoop.

  • Developed Apache Spark jobs to load and process data into Hive external tables.

  • Automated code releases and scheduled jobs using GoCD pipelines.

  • Implemented regression testing scripts and unit tests using Scala Test/Flat Spec.

  • Streamlined data ingestion processes from multiple data sources.

Education

Certified Hadoop Developer
Certified Hadoop Developer
Cloudera
Certified Data Engineer Associate
Certified Data Engineer Associate
Databricks
AWS Certified Solutions Architect
AWS Certified Solutions Architect
AWS