Felipe C.

Felipe C.

New York, United States of America
Hire Felipe C. Hire Felipe C. Hire Felipe C.

About Me

Felipe is an expert Data Scientist empowered with the latest trends in big data and technology; in developing and delivering practical insights and actionable recommendations on projects. He employs a mix of dataset acquisition, statistical modelling, exploratory data analysis, and software engineering best practices in addressing a wide range of technical and identifying additional opportunities where data science can bring real business value.

Database

DevOps

Other

Work history

UpStack
UpStack
Senior Data Scientist
2020 - Present (5 years)
Remote
  • Creating and developing innovative software solutions for clients across a broad range of industries.

  • Participate in scrums consisting of cross-functional teams, both software and hardware.

  • Ensure that features are being delivered efficiently and on-time.

Intellinum Analytics Inc
Intellinum Analytics Inc
Lead Data Scientist
2019 - Present (6 years)
New York, United States of America
  • Built a home detection algorithm from GPS trace/geolocation data from 10,000 users throughout the US using listwise learning-to-rank (LTR) algorithms for Intellinum Analytics.

  • Analyzed and modelled spatiotemporal point processes to identify clusters of mobile users and create new advertising audiences for the client based on their digital behaviour or places visited.

  • Led Intellinum's data science team on projects; leading to a reduction in computing and storage costs by 52%, optimizing the performance and quality of data ingestion and processing from 15 hours to 2 hours.

SpicyMinds – Digital Business Lab
SpicyMinds – Digital Business Lab
Lead Data Scientist
2015 - 2018 (3 years)
Mexico City, Mexico
  • Designed, implemented and analyzed offline and online experiments using A/B Test, factorial experiments and Thompson sampling to inform content generation, advertising, and web development processes to increase traffic, conversion rates, reach, and ROI for a 50+ SME customer base.

  • Built Python-based machine learning models to ingest historic purchase data and multichannel interactions for efficient customer segmentation, lead qualification, cross-selling and up-selling, and time-series forecasts.

  • Developed a reporting tool that integrated data from several marketing platforms onto customized Tableau dashboards; providing real-time updates on KPIs for websites, eCommerce sites, search and social ad spending.

Ministry of Justice and Law (Colombia)
Ministry of Justice and Law (Colombia)
Data Analyst - Criminal and Prison Policy
2015 - 2017 (2 years)
Bogota, Colombia
  • Provided expertise in forecasting future crime rate trends in Python using Network time series analysis, deep learning models and clustered time series in identifying similarities between states.

  • Assessed the impact of inmates population on different legislative measures to reduce overcrowding in prison using Monte Carlo simulation and Variance Reduction Techniques.

  • Performed market basket analysis and clustering in R on historic arrests data; learning and understanding better criminal behaviour in Colombia and identifying spatial and temporal changes of how crimes are committed.

Showcase

Lead Data Scientist - GeoSpatial Analytics
Lead Data Scientist - GeoSpatial Analytics
  • Developed and maintained a pipeline to manage 2+ billion daily location data events using structured streaming.

  • Implemented an optimization and production algorithm for geospatial data processing and enrichment, reducing data volumes by 75% and storage costs by $500,000 annually.

  • Improved data structures, tuned parameters, and implemented QA systems to monitor data streams and algorithm output, resulting in a core component for new data products.

Lead Data Scientist - Fraud Detection in AdTech
Lead Data Scientist - Fraud Detection in AdTech
  • Developed a fraud detection solution for a client, increasing data volume received from data partners.

  • Implemented new fraud detection mechanisms utilizing different data providers to enhance data quality.

  • Utilized several machine learning models to identify and rectify fraudulent data, reducing storage and processing costs by 25% and improving data reliability.

Lead Data Scientist - Visualization Tool for Customer Success
Lead Data Scientist - Visualization Tool for Customer Success
  • Gathered requirements from stakeholders, creating technical specifications for data and software engineers.

  • Improved query efficiency by reducing query execution time from 30 min to 2-3 min, enabling independent data exploration and product innovation.

  • Implemented data aggregation and format changes to enhance query performance and scalability, and created a self-service tool for customer success representatives.

Education

MSc. Operations Research
MSc. Operations Research
Columbia University in the City of New York
2018 - 2019 (1 year)
Big Data: Data Analytics for Business and beyond 
(Joint summer progamme between London School of Economics and Peking University)
Big Data: Data Analytics for Business and beyond (Joint summer progamme between London School of Economics and Peking University)
Peking University
2016 - 2016
BEng. Chemical Engineering
BEng. Chemical Engineering
Universidad Nacional de Colombia
2007 - 2013 (6 years)