House Rental Price Prediction
An end-to-end cloud ML solution for highly accurate house rental price prediction and risk estimation. The solution was foundational to a Bay Area-based startup with the goal of uberization of the house rental market. It included raw data ingestion, a complex ETL pipeline, a suite of predictive models, MLOps processes including CI/CD, model, data versioning, and production model monitoring. I supervised a few other engineers who joined the project later to further improve the system.
Revenue Prediction for Retail Store Chain
Built a machine learning model that predicted revenues for a retail store chain based on store location, local demographic data, GIS features, seasonality, and other factors. I was the tech lead in a group of data scientists who ran the whole cycle from data extraction, web scrapping, ETL, exploratory analysis, data preprocessing, feature engineering, machine learning, packaging the model as a standalone service, and implementing a dashboard.
Payment Default Risk Scoring
Built and deployed an interpretable machine learning model that scored B2B customers for payment default risks and provided explanations for the scores. The model massively reduced workload for weekly risks assessment. I was the tech lead in a group of data scientists who ran the whole cycle from data extraction, merging several different data sources, ETL, exploratory analysis, data preprocessing, feature engineering, machine learning, packaging, and deploying the model to the client' premises.
Probabilistic Model for Building Commission Times
Built a probabilistic Bayesian machine learning model to predict which apartment buildings still under construction would fail to be commissioned in time. The model helped reduce the funds needed to hedge risks by two times. In addition to typical data science project activities, which included data exploration, ETL, and ML, this project also involved setting up machinery for the explicit Bayesian inference of structured models using GPUs.