Kevin is a driven Data Engineer focused on building real-time ETL pipelines with Python, streaming data with Kafka, and processing and modeling it with Spark to power real-time interactive dashboards. He builds Machine Learning and Deep Learning models in Python for Time Series, Classification, and NLP problems, and has solid knowledge of AWS, GCP, and Azure for serverless applications involving data manipulation, storage, processing, and streaming.
Designed and built data pipelines for BI solutions, integrating REST API endpoints from applications such as Shopify and Rutter.
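A minimal sketch of the extraction pattern behind this kind of integration, assuming a hypothetical paginated endpoint and token (the real Shopify/Rutter URLs and auth schemes differ):

```python
import requests

# Hypothetical placeholders: base URL, bearer token, and page size.
BASE_URL = "https://api.example.com/v1/orders"
HEADERS = {"Authorization": "Bearer <token>"}

def fetch_all(url: str) -> list[dict]:
    """Walk a paginated REST endpoint until an empty page comes back."""
    records, page = [], 1
    while True:
        resp = requests.get(url, headers=HEADERS, params={"page": page, "limit": 250})
        resp.raise_for_status()
        batch = resp.json().get("data", [])
        if not batch:
            break
        records.extend(batch)
        page += 1
    return records
```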
Migrated cloud data warehouse data from Snowflake and AWS Redshift to Google BigQuery using Airflow. Used Machine Learning to predict the likelihood of an organic visit to one of the client's stores.
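The load side of such a migration can be sketched with the google-cloud-bigquery client; the project, dataset, table, and GCS URI below are hypothetical placeholders:

```python
from google.cloud import bigquery

# Hypothetical project, bucket, and table names.
client = bigquery.Client(project="my-project")

job_config = bigquery.LoadJobConfig(
    source_format=bigquery.SourceFormat.CSV,
    skip_leading_rows=1,
    autodetect=True,  # infer the schema from the exported file
)

load_job = client.load_table_from_uri(
    "gs://my-bucket/exports/redshift_orders.csv",
    "my-project.analytics.orders",
    job_config=job_config,
)
load_job.result()  # block until the load job completes
```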
Automated data upload/aggregation in Postgres using Python for easy extraction from the backend. Set up migration scripts with rollbacks in Knex for the frontend DB using JavaScript. Set up an ETL pipeline using AWS Transfer, S3, Lambda (Python), and Postgres RDS.
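A minimal sketch of the Lambda step in that pipeline, assuming a hypothetical staging table and a connection string supplied via an environment variable:

```python
import csv
import io
import os

import boto3
import psycopg2

s3 = boto3.client("s3")

def handler(event, context):
    # Triggered by an S3 event for a file landed via AWS Transfer.
    record = event["Records"][0]["s3"]
    obj = s3.get_object(Bucket=record["bucket"]["name"], Key=record["object"]["key"])
    rows = csv.reader(io.StringIO(obj["Body"].read().decode("utf-8")))
    next(rows)  # skip the header row

    # PG_DSN and staging.uploads are hypothetical placeholders.
    conn = psycopg2.connect(os.environ["PG_DSN"])
    with conn, conn.cursor() as cur:
        cur.executemany(
            "INSERT INTO staging.uploads (id, payload) VALUES (%s, %s)",
            list(rows),
        )
    conn.close()
```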
Worked on the creation of Digital Twins for Supply Chain procedures and presented the data architecture proposal to management.
Streamed real-time data from a MySQL source to GBQ and replicated it to Azure and AWS using a multi-node Kafka cluster.
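The producer side of such a flow can be sketched with kafka-python; the broker addresses and topic name are hypothetical placeholders:

```python
import json

from kafka import KafkaProducer

# Hypothetical multi-node broker list and topic.
producer = KafkaProducer(
    bootstrap_servers=["broker1:9092", "broker2:9092", "broker3:9092"],
    value_serializer=lambda v: json.dumps(v).encode("utf-8"),
)

def publish(row: dict) -> None:
    """Publish one MySQL change row for downstream replication."""
    producer.send("mysql.orders", value=row)

publish({"order_id": 42, "status": "shipped"})
producer.flush()  # make sure buffered messages reach the brokers
```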
Applied real-time processing with Spark to data streamed from the database to the cloud. Automated data extraction from the Cognos Framework Manager XML model using Python.
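A minimal Structured Streaming sketch of that Spark stage, reading the change stream from Kafka and landing it in cloud storage (broker, topic, and bucket paths are hypothetical):

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("stream-to-cloud").getOrCreate()

# Read the Kafka topic carrying database changes.
stream = (
    spark.readStream.format("kafka")
    .option("kafka.bootstrap.servers", "broker1:9092")
    .option("subscribe", "mysql.orders")
    .load()
)

# Write the raw payloads to cloud storage with checkpointing.
query = (
    stream.selectExpr("CAST(value AS STRING) AS payload")
    .writeStream.format("parquet")
    .option("path", "gs://my-bucket/orders/")
    .option("checkpointLocation", "gs://my-bucket/checkpoints/orders/")
    .start()
)
query.awaitTermination()
```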
Performed web scraping, mining, and validation of data relevant to client requests, ranging from commodity futures and general financial market data to geolocation.
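A scraping-and-validation sketch in the spirit of that work, using requests and BeautifulSoup; the URL and CSS selectors are hypothetical:

```python
import requests
from bs4 import BeautifulSoup

resp = requests.get("https://example.com/commodity-futures", timeout=30)
resp.raise_for_status()

soup = BeautifulSoup(resp.text, "html.parser")
rows = []
for tr in soup.select("table.quotes tr")[1:]:  # skip the header row
    cells = [td.get_text(strip=True) for td in tr.find_all("td")]
    if len(cells) < 2:
        continue  # drop malformed rows
    symbol, price = cells[0], cells[1]
    try:
        rows.append({"symbol": symbol, "price": float(price.replace(",", ""))})
    except ValueError:
        continue  # drop rows whose price field fails validation
```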
Worked on data ETL for customer insight, as well as feature engineering and data modeling, using supervised and unsupervised learning for forecasting and classification of multiple phenomena and events.
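A compact sketch of the supervised side of such modeling; synthetic data stands in here for the engineered customer features:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import classification_report
from sklearn.model_selection import train_test_split

# Synthetic stand-in for features produced by the ETL / feature engineering.
X, y = make_classification(n_samples=1000, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

model = RandomForestClassifier(n_estimators=200, random_state=42)
model.fit(X_train, y_train)
print(classification_report(y_test, model.predict(X_test)))
```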
Created a Python script to upload batches of data directly into Google BigQuery to test a serverless approach to data migration. Performed string matching and database merging by implementing NLP techniques such as edit-based and token-based similarity measures combined with machine learning.
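The two families of measures can be sketched in plain Python: Levenshtein distance as the edit-based measure and Jaccard similarity over token sets as the token-based one. Pairs passing both thresholds get merged; borderline pairs can be handed to a classifier.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit-based measure: minimum number of single-character edits."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(prev[j] + 1,                 # deletion
                            curr[j - 1] + 1,             # insertion
                            prev[j - 1] + (ca != cb)))   # substitution
        prev = curr
    return prev[-1]

def jaccard(a: str, b: str) -> float:
    """Token-based measure: overlap between the two token sets."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    return len(ta & tb) / len(ta | tb) if ta | tb else 0.0

print(levenshtein("ACME Corp", "ACME Corp."))     # 1
print(jaccard("ACME Corp Ltd", "ltd acme corp"))  # 1.0
```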
The project involved assisting the team in building the ETL pipeline that connected to different endpoints using TypeScript. The transformation and validation of the data were done using EMR, while the orchestration of the pipeline was handled with Kubernetes clusters. I also built most of the views for the dashboard used by the underwriting team, leveraging dbt Cloud.
The project involved building a product that would flag potentially fraudulent invoices. Here I leveraged my skills in Machine Learning and Feature Engineering to build the solution. Another of my tasks was to build a pipeline that converted geolocation data into human-readable addresses for our clients.
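The address-conversion step can be sketched with geopy's Nominatim reverse geocoder; the user agent string is a hypothetical placeholder, and a production pipeline would add rate limiting and caching:

```python
from geopy.geocoders import Nominatim

# Hypothetical user agent for the geocoding service.
geolocator = Nominatim(user_agent="invoice-pipeline-sketch")

def to_address(lat: float, lon: float) -> str:
    """Turn a (lat, lon) pair into a readable street address."""
    location = geolocator.reverse((lat, lon), exactly_one=True)
    return location.address if location else ""

print(to_address(41.3874, 2.1686))  # a point in central Barcelona
```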
The project involved creating an Airflow ETL pipeline. I connected it to different endpoints, be they data lakes, API endpoints, or SFTP servers, extracted the data, and pushed it to AWS S3, from which point a Glue job I created transformed and validated the data and pushed it to the warehouse in Redshift.
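A minimal sketch of that orchestration, assuming hypothetical DAG, task, job, and bucket names, with the Glue run triggered via boto3:

```python
from datetime import datetime

import boto3
from airflow import DAG
from airflow.operators.python import PythonOperator

def extract_to_s3(**_):
    ...  # pull from the data lake / API / SFTP source and write to S3

def start_glue_job(**_):
    # Glue job name is a hypothetical placeholder.
    glue = boto3.client("glue")
    glue.start_job_run(JobName="transform-and-load-redshift")

with DAG(
    dag_id="endpoint_to_redshift",
    start_date=datetime(2023, 1, 1),
    schedule_interval="@daily",
    catchup=False,
) as dag:
    extract = PythonOperator(task_id="extract_to_s3", python_callable=extract_to_s3)
    transform = PythonOperator(task_id="start_glue_job", python_callable=start_glue_job)
    extract >> transform  # Glue runs only after the extract lands in S3
```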
Education
Master's degree, Data Engineering
Jacobs University Bremen
2017 - 2019 (2 years)
Master's degree, International Business Management
Universitat Autònoma de Barcelona
2009 - 2010 (1 year)
Bachelor's degree, Economics and Business Administration