Felipe L.

Senior ML/Data Engineer

Dubai, United Arab Emirates

About Me

Felipe is a Senior Data Engineer and Machine Learning Engineer who designs, deploys, and scales data solutions that support strategic decision-making and business growth. He turns raw data into actionable insights by combining advanced analytics, sound engineering practices, and scalable cloud infrastructure. Felipe is proficient in statistical analysis (descriptive, inferential, and causal inference) and in the end-to-end development of data pipelines and enterprise-grade data processing, analytics, and reporting applications. He works with Python, Pandas, SciPy, Scikit-learn, PySpark, and TensorFlow, and follows MLOps best practices (unit testing, integration testing, model monitoring, and API development with FastAPI/Flask). He also maintains and enforces common conventions, standards, and technologies across database structures and applications to improve scalability and consistency.

Work history

UpStack
Senior ML/Data Engineer
2022 - Present (3 years)
Remote
  • Delivering data warehouse and ETL solutions as part of an Agile team using advanced ML techniques to improve performance and processes.

  • Helping build and improve infrastructure, applications, and performance, and ensuring tight security through data encryption, security groups, and environment scanning.

  • Ensuring high-quality deliverables and implementing DevOps and security best practices in fast-paced environments.

Hero.io
Senior Data Engineer
2025 - Present
Dubai, United Arab Emirates
  • Developing and maintaining web scraping scripts that support data pipelines using Python, SQL, MongoDB, Docker, and related technologies.

  • Integrating multiple data sources and understanding best practices for merging complex datasets.

  • Monitoring and troubleshooting API endpoints to ensure data accuracy, completeness, and reliability.

  • Understanding the full data lifecycle, from extraction to data modeling, including how data is built into products or transformed for complex analysis with ML algorithms and AI models.

  • Contributing to optimization of data storage and retrieval processes.

  • Assisting with the development and maintenance of data warehouse architecture.

  • Participating in code reviews, documentation, and knowledge-sharing sessions.

Data Engineering, Python, SQL, MongoDB, Docker, Web Scraping, Data pipelines, Datasets, Data Source Types, Endpoint Management, API, Machine Learning, Data Modeling, Data Extraction, AI Modeling, Data Warehouse
Spectral Labs
Machine Learning Engineer
2022 - 2024 (2 years)
Remote
  • Led the development team in automating and optimizing production-level data pipelines, including data fetching, parsing, cleaning, model inference, and scaling, ensuring efficient and reliable data processing workflows.

  • Contributed to the strategic planning and evolution of data pipelines, aligning technical solutions with business objectives to drive innovation and efficiency.

  • Implemented code optimizations and parallel processing techniques, reducing model inference time from 5 seconds to 0.8 seconds, significantly improving user experience and increasing retention rates by 15%.

  • Redesigned data pipeline architecture and established real-time monitoring, reducing failure rates by 40% and enhancing operational efficiency by 25%, enabling faster and more informed business decisions.

  • Collaborated with cross-functional teams to maintain and enhance the quality of Machine Learning components, delivering innovative credit scoring solutions in the DeFi sector.

  • Acted as the primary liaison between development, data science, and product management teams, effectively communicating progress and aligning expectations to ensure project success.

  • Guided and mentored junior team members, fostering professional growth and enhancing the team’s technical capabilities.

  • Organized internal workshops and training sessions, elevating the team’s proficiency in advanced Machine Learning techniques, leading to the successful deployment of three new models into production within six months.

Machine Learning, Data pipelines, Data Fetching, Document Parsing, Data Cleaning, Data Inference, Scaling, Data Processing, Python, Decentralized Finance (DeFi), Solution Architecture, Cloud Computing, Predictive Modeling, Data Engineering, Data Management, Data Governance, MLOps, Data Architecture, ETL, Product Strategy, Product Management, Project Management, Kanban, Scrum, Credit Scores
Illuvium.io
Data Scientist
2022 - 2022
Remote
  • Deployed new solutions to implement the game's balance model into production on AWS using MLOps best practices.

  • Worked on a probability model for the platform's sale of land (NFTs), increasing revenue for Illuvium.

  • Built and executed several data analysis pipelines to improve multiple game initiatives and functionalities.

  • Developed and managed end-to-end data pipelines to automate and scale business-critical analysis, integrating data from diverse sources for enhanced decision-making across game design and marketing strategy.

  • Created and deployed dynamic dashboards using Streamlit, providing stakeholders with real-time insights and enabling data-driven decisions that support game balance, user engagement, and monetization efforts.

  • Designed data models to empower decision-making in game development and marketing, delivering valuable business insights and identifying trends for targeted actions.

Olist
Data Scientist
2021 - 2022 (1 year)
Minas Gerais, Brazil
  • Led a team of engineers to manage and manipulate risk models to help the company maintain its sales channels.

  • Built new reputation models on AWS and converted notebooks into high-quality code following software best practices.

  • Developed and deployed data pipelines to handle the running of models using DAGs and ECS.

  • Identified catalog errors where identical products had differing descriptions, putting models into production using Docker and creating Airflow DAGs to schedule model updates.

Numera
Data Analyst
2021 - 2021
Minas Gerais, Brazil
  • Utilized the latest NLP techniques and tools to extract insights from texts for Numera's reference platform.

  • Handled speech-to-text transcriptions, bringing value to solutions using AWS and Google tools.

  • Managed exploratory and explanatory analysis processes to ensure high-quality solutions for clients.

Python, Pandas, NumPy, Scikit-learn, Selenium, R, Robotic Process Automation (RPA), OCR, Data Analysis, Data Science, Natural Language Processing (NLP), Speech to Text, AWS, Google Speech-to-Text API, Exploratory Data Analysis
AI ROBOTS
Data Scientist
2020 - 2021 (1 year)
Minas Gerais, Brazil
  • Built and maintained data pipelines for generating information, creating resources to make the information relevant.

  • Developed and deployed data pipelines and models with high performance and availability.

  • Architected and executed Azure solutions for industrial robots using state-of-the-art ML and AI algorithms.

Freelance
Data Scientist
2018 - 2020 (2 years)
Belo Horizonte, Brazil
  • Designed, implemented, and supported an end-to-end data analysis platform for Machedo Law, gathering requirements and mapping processes.

  • Deployed OLAP database solutions, architecting and building data warehouses, cubes, and data lakes.

  • Worked on a BI solution that uses functional and non-functional requirements and business rules to improve KPIs.

IPM Sistemas
Web Developer (Intern)
2017 - 2018 (1 year)
Rio do Sul, Brazil
  • Designed and developed a web portal for city halls within Brazil.

  • Created and modelled data resources on the project.

  • Worked with senior devs to generate operation reports and analyses for clients from IPM's database.

  • Developed and maintained SQL queries using PostgreSQL.

Universidade do Estado de Santa Catarina (Udesc)
Software Developer (Scholarship)
2016 - 2017 (1 year)
Ibirama SC, Brazil
  • Designed and deployed a new visualization platform to pinpoint flood points within the Upper Itajaí Valley.

  • Manipulated maps and interpolated geometric figures to ensure consistency and accuracy on the platform.

  • Provided ongoing support for defects and bugs in the solution.

Portfolio

Data Scientist - Illuvium

Illuvium develops and publishes AAA play-to-earn crypto games, removing the ownership gap between gamers and games to create a community-governed collaborative game development model. It offers collectible NFT assets and in-game functionalities that are playable across multiple games within the Illuvium metaverse. Managed data analysis tasks, built new data pipelines for automated analysis, and deployed new data models to help with decision-making.

Data Scientist - Olist

Olist is an SMB commerce enabler ecosystem providing end-to-end solutions for customers to sell online. Provided Data Science expertise to enhance data analytics and risk management, taking over several data initiatives with a product-oriented mindset to deliver solutions.

Data Scientist - AI Robotics

Worked on a data pipeline that cleans and explores data on a robotic platform that uses AI for optimization and predictive/prescriptive maintenance.

Education

Machine Learning DevOps NanoDegree
Udacity
2022 - 2022
Machine Learning Engineer
Udacity
2019 - 2019
Data Science Fundamentals
Udacity
2018 - 2018
Bachelor's Degree, Software Engineering
Universidade do Estado de Santa Catarina (Udesc) - Brazil
2016 - 2019 (3 years)
Bachelor's Degree, Production Engineering
UFOP (Universidade Federal de Ouro Preto) - Brazil
2011 - 2015 (4 years)