Felipe is a Senior Data Engineer and Machine Learning Engineer who designs, deploys, and scales sophisticated data solutions that empower strategic decision-making and drive business growth. He transforms raw data into actionable insights using a blend of advanced analytics, robust engineering practices, and scalable cloud infrastructure. Felipe is proficient in statistical analysis (descriptive, inferential, and causal inference) and in end-to-end pipeline development: designing, building, and implementing sophisticated data pipelines and enterprise-grade data processing, analytics, and reporting applications. With expertise in Python, Pandas, SciPy, Scikit-learn, PySpark, TensorFlow, and MLOps best practices (unit testing, integration testing, model monitoring, and API development with FastAPI/Flask), he maintains and enforces common conventions, standards, and technologies across database structures and applications to drive scalability and increase consistency.
Delivering data warehouse and ETL solutions as part of an Agile team using advanced ML techniques to improve performance and processes.
Helping build and improve infrastructure, applications, and performance, while ensuring tight security, including data encryption, security groups, and environment scanning.
Ensuring high-quality deliverables and implementing DevOps and security best practices in fast-paced environments.
Developing and maintaining web scraping scripts that support data pipelines using Python, SQL, MongoDB, Docker, and related technologies.
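As an illustrative sketch of this kind of scraping work (not the actual production script; the markup, tag names, and `class` attribute here are hypothetical), a minimal parser using only Python's standard library might look like:

```python
from html.parser import HTMLParser


class PriceParser(HTMLParser):
    """Collects the text inside <span class="price"> tags (hypothetical markup)."""

    def __init__(self):
        super().__init__()
        self._in_price = False
        self.prices = []

    def handle_starttag(self, tag, attrs):
        # Flag that the next text node belongs to a price span.
        if tag == "span" and ("class", "price") in attrs:
            self._in_price = True

    def handle_data(self, data):
        if self._in_price:
            self.prices.append(data.strip())
            self._in_price = False


sample = '<div><span class="price">19.90</span><span class="price">5.50</span></div>'
parser = PriceParser()
parser.feed(sample)
print(parser.prices)  # → ['19.90', '5.50']
```

In practice such a parser would sit behind a fetch-and-retry layer and feed its output into the downstream pipeline (e.g. MongoDB), as described above.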
Integrating multiple data sources and understanding best practices for merging complex datasets.
Monitoring and troubleshooting API endpoints to ensure data accuracy, completeness, and reliability.
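A hedged sketch of the completeness checks such monitoring involves: the required schema and field names below are hypothetical, but the pattern, validating each API record against an expected schema and reporting every problem found, is the general technique.

```python
REQUIRED_FIELDS = {"id", "timestamp", "price"}  # hypothetical schema


def validate_record(record: dict) -> list:
    """Return a list of problems found in one API record (empty list = valid)."""
    problems = [f"missing field: {f}" for f in sorted(REQUIRED_FIELDS - record.keys())]
    # Type check on a field we expect to be numeric.
    if "price" in record and not isinstance(record["price"], (int, float)):
        problems.append("price is not numeric")
    return problems


print(validate_record({"id": 1, "timestamp": "2024-01-01", "price": 9.9}))  # → []
print(validate_record({"id": 2, "price": "n/a"}))  # → ['missing field: timestamp', 'price is not numeric']
```

Aggregating these problem lists over time gives the accuracy and completeness signals used to alert on a failing endpoint.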
Understanding the full data lifecycle, from extraction to data modeling, including how data is incorporated into products or transformed for complex analysis using ML algorithms and AI models.
Contributing to optimization of data storage and retrieval processes.
Assisting in the development and maintenance of data warehouse architecture.
Participating in code reviews, documentation, and knowledge sharing sessions.
Led the development team in automating and optimizing production-level data pipelines, including data fetching, parsing, cleaning, model inference, and scaling, ensuring efficient and reliable data processing workflows.
Contributed to the strategic planning and evolution of data pipelines, aligning technical solutions with business objectives to drive innovation and efficiency.
Implemented code optimizations and parallel processing techniques, reducing model inference time from 5 seconds to 0.8 seconds, significantly improving user experience and increasing retention rates by 15%.
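The specific optimization is not detailed in the source; as a hedged sketch of the general technique, fanning independent inference calls out across workers with `concurrent.futures` instead of running them sequentially looks like this (the `infer` function is a hypothetical stand-in for the real model call):

```python
from concurrent.futures import ThreadPoolExecutor


def infer(features):
    """Stand-in for a single model inference call (hypothetical model)."""
    return sum(features) / len(features)


def infer_batch_parallel(batches, max_workers=4):
    """Run inference over many inputs concurrently instead of one at a time."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map preserves input order in its results.
        return list(pool.map(infer, batches))


print(infer_batch_parallel([[1, 2, 3], [4, 6]]))  # → [2.0, 5.0]
```

For CPU-bound models, a `ProcessPoolExecutor` (or vectorized batching inside the model itself) is the usual variant of the same idea.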
Redesigned data pipeline architecture and established real-time monitoring, reducing failure rates by 40% and enhancing operational efficiency by 25%, enabling faster and more informed business decisions.
Collaborated with cross-functional teams to maintain and enhance the quality of Machine Learning components, delivering innovative credit scoring solutions in the DeFi sector.
Acted as the primary liaison between development, data science, and product management teams, effectively communicating progress and aligning expectations to ensure project success.
Guided and mentored junior team members, fostering professional growth and enhancing the team’s technical capabilities.
Organized internal workshops and training sessions, elevating the team’s proficiency in advanced Machine Learning techniques, leading to the successful deployment of three new models into production within six months.
Deployed new solutions to bring the game's balance model into production on AWS using MLOps best practices.
Developed a probability model for the platform's land (NFT) sales, increasing revenue generation for Illuvium.
Built and executed several data analysis pipelines to improve multiple game initiatives and functionalities.
Developed and managed end-to-end data pipelines to automate and scale business-critical analysis, integrating data from diverse sources for enhanced decision-making across game design and marketing strategy.
Created and deployed dynamic dashboards using Streamlit, providing stakeholders with real-time insights and enabling data-driven decisions that support game balance, user engagement, and monetization efforts.
Designed data models to empower decision-making in game development and marketing, delivering valuable business insights and identifying trends for targeted actions.
Led a team of engineers in managing and refining risk models to help the company maintain its sales channels.
Built new reputation models on AWS and refactored notebooks into production-quality code following software engineering best practices.
Developed and deployed data pipelines to handle the running of models using DAGs and ECS.
Detected catalog errors among identical products with differing descriptions, deployed the resulting models with Docker, and created Airflow DAGs to schedule model updates.
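A minimal sketch of the duplicate-catalog idea, assuming a simple string-similarity approach (the real model, threshold, and sample catalog below are hypothetical): flag product pairs whose descriptions are nearly identical despite differing wording.

```python
from difflib import SequenceMatcher
from itertools import combinations


def likely_duplicates(descriptions, threshold=0.8):
    """Return index pairs whose descriptions are similar above the threshold."""
    pairs = []
    for (i, a), (j, b) in combinations(enumerate(descriptions), 2):
        # Case-insensitive character-level similarity ratio in [0, 1].
        if SequenceMatcher(None, a.lower(), b.lower()).ratio() >= threshold:
            pairs.append((i, j))
    return pairs


catalog = [
    "USB-C Cable 1m black",
    "usb-c cable 1 m (black)",
    "Wireless Mouse",
]
print(likely_duplicates(catalog))  # → [(0, 1)]
```

In production, candidate pairs like these would feed the model served via Docker, with the Airflow DAGs handling the scheduled refreshes.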
Illuvium develops and publishes AAA play-to-earn crypto games, removing the ownership gap between gamers and games to create a community-governed collaborative game development model. It offers collectible NFT assets and in-game functionalities that are playable across multiple games within the Illuvium metaverse. Managed data analysis tasks, built new data pipelines for automated analysis, and deployed new data models to help with decision-making.
Olist is an SMB commerce enabler ecosystem providing end-to-end solutions for customers to sell online. Provided Data Science expertise to enhance data analytics and risk management, taking over several data initiatives with a product-oriented mindset to deliver solutions.