Bhoumik S.

About Me

Bhoumik S. is a machine learning and operations research specialist with six years in Al, notably in ads moderation, linehaul cost optimization, and McKinsey's routing tool. He's excelled in cost reduction and system efficiency, adept with jsprit, OR-Tools, and ML model development. With both a bachelor's and master's degree in mechanical engineering, with a focus on operations research, Bhoumik also holds certifications in deep learning and related fields.

AI, ML & LLM

ChatGPT ChatGPT API Deep Learning Generative Pre-trained Transformer 3 (GPT-3) Generative Pre-trained Transformers (GPT) Machine Learning Machine Learning Operations (MLOps) Neural Networks OpenAI GPT-3 API Pytorch Retail & Wholesale Spark ML Supply Chain Supply Chain Management (SCM) Supply Chain Optimization

Backend

Database

Graph Databases PostgreSQL SQL Spatial Databases noSQL

DevOps

Amazon Web Services (AWS)

Workflow

Other

Advertising Technology (Adtech) Algorithms Analytics Artificial General Intelligence (AGI) Automation Beautiful Soup Bedrock Big Data C CPLEX CSV File Processing Chatbots Computer Vision Consulting Data Analysis Data Collection Data Engineering Data Modeling Data Processing Data Science Product Manager Document Parsing Finance Food Forecasting Full-stack Gradient Boosting Gurobi H20 Haystack Inventory Java Keras Large Data Sets Leadership Location Services and Maps Microsoft Excel Minimum Viable Product (MVP) Modeling Models N8n Natural Language Processing (NLP) Natural Language Understanding (NLU) Neo4j Network Optimization Numpy OpenAL OpenCV Operations Research Optimization Pandas Predictive Modeling Product Management Pyspark Regression Retrieval-augmented Generation (RAG) Scikit Learn Sentiment Analysis Spark Speech Recognition Supervised Learning Teamwork Technical Leadership Tensorflow Tesseract Time Series Trend Analysis Vehicle Routing Web Development eCommerce

Work history

Freelance
Freelance
AI/ML Engineer
2022 - Present (3 years)
Remote
  • Developed an end-to-end application for scraping scanned images of handwritten mails from an internal website and extracting data from it.

  • Used AWS Textract and OpenAl along with Donut model to do OCR of handwritten images and extract information in a structured format.

  • Implemented a web scraping solution using Selenium and Beautiful Soup.

  • Built a heuristic algorithm to identify and extract relevant information and amendments from policy documents published by US federal agencies.

  • Used LLMs to make amendments into federal regulations based on published policy by multiple federal agencies.

  • Built a front-end application using Streamlit to compare changes in regulation.

  • Developed an end-to-end recommendation engine for a platform that sells raw materials to restaurants.

  • Scraped restaurant menus and used items in the menu to generate personalized recommendations for raw materials.

  • Deployed the recommendation engine as a service on AWS using AWS Lambda, DynamoDB, and Step Functions.

Computer VisionPythonOpenCVTensorflowPandasMachine LearningDeep LearningWorkflow Automation & System Integration Natural Language Processing (NLP) Optical Character Recognition (OCR) KerasAl Prompts Large Data Sets FinanceCapital marketsOpenAL Natural Language Understanding (NLU) API IntegrationGoogle CloudClaude TesseractChatGPT API Al Data Classification Al Modeling Data ProcessingAutoGen Beautiful Soup Agentic Al Replit Agentic Frameworks PostgreSQLAWS Textract Optical Character Recognition (OCR) SeleniumHeuristics Large Language Models (LLMs) StreamlitRecommendation Systems Web ScrapingData ScrapingAWS Lambda DynamoDBAWS Step Functions
Amazon India
Amazon India
Applied & ML Research Scientist
2021 - 2024 (3 years)
Bangalore, India
  • Developed automated moderation systems to streamline the review of more than 5 billion ads annually.

  • Led a team of three in enhancing current Machine Learning systems, cutting down the number of ads requiring manual moderation by 60%.

  • Achieved a $500 million annual reduction in shipping invoice estimation errors by training a gradient boosting machine (GBM) model using H2O to handle a large training dataset of 250 million samples.

  • Constructed a regression model designed to predict invoice amounts for more than 50 million third-party shipments per week.

H20 Machine LearningGradient Boosting SparkPytorchGenerative Pre-trained Transformers (GPT) Natural Language Processing (NLP) Data ScienceDeep LearningLarge Language Models (LLMs) Amazon Web Services (AWS) Data AnalysisPredictive Modeling OpenAI GPT-3 API Technical Leadership Haystack Retrieval-augmented Generation (RAG) Trend Analysis Advertising Technology (Adtech) Machine Learning Operations (MLOps) Neural NetworksAlgorithmsRetail & Wholesale Inventory CSV File Processing Full-stack Web DevelopmentBedrock Generative Pre-trained Transformer 3 (GPT-3) ConsultingLeadershipSQLProduct ManagementPysparkData Science Product Manager GitlabAl Programming Al Model Training PandasNumpynoSQLFastAPINeo4jGraph Databases Python 3 Models Generative Artificial Intelligence (GenAl) Classifier Development Supervised Learning TeamworkRegressionData EngineeringWeb ScrapingSpark MLOptical Character Recognition (OCR) Workflow Automation & System Integration TensorfloweCommerce AnalyticsSentiment Analysis ModelingData Collection Big DataC#Document Parsing Minimum Viable Product (MVP) AutomationMicrosoft ExcelAl Prompts Large Data Sets
Amazon India
Amazon India
Operations Research Scientist
2019 - 2021 (2 years)
Bangalore, India
  • Built a dispatch time optimizer for the Amazon India network to deliver 10% of shipments one day faster.

  • Used concepts of vehicle routing optimization and built a line-haul route planner tool for the Amazon line-haul network.

  • Created a workforce scheduling tool using constraint programming to automate and optimize human resources deployment at the Amazon sort center, which resulted in an 8% reduction in the workforce.

  • Utilized MILP formulation and lifted cutting plane techniques to reduce the runtime of post and parcel network scheduling optimization from 74 hours to just 30 seconds.

  • Designed a network simulation tool for assessing the speed impact on the network due to configuration changes.

Operations Research Optimization Vehicle Routing Network Optimization PythonJavaGurobi CPLEX Amazon Web Services (AWS) Data AnalysisData ScienceTime Series Predictive Modeling Technical Leadership Machine LearningTrend Analysis Machine Learning Operations (MLOps) Neural NetworksAlgorithmsRetail & Wholesale Inventory CSV File Processing ConsultingSQLProduct ManagementPysparkData Science Product Manager GitlabAl Model Training PandasNumpynoSQLFastAPIPython 3 Models Location Services and Maps Spatial Databases Classifier Development TeamworkRegressionData EngineeringWeb ScrapingSoftware ArchitectureWorkflow Automation & System Integration eCommerce Generative Pre-trained Transformers (GPT) AnalyticsGrocery Delivery ModelingData Collection Big DataDocument Parsing Minimum Viable Product (MVP) AutomationMicrosoft ExcelAPI IntegrationData ProcessingPostgreSQLData pipelines
McKinsey & Company
McKinsey & Company
Knowledge Analyst
2017 - 2018 (1 year)
Gurgaon, India
  • Developed a state-of-the-art tool to solve vehicle routing problems with real-life constraints using heuristic algorithms.

  • Collaborated with global clients for topology and production planning optimization.

  • Developed a routing algorithm for solving ship container routing for a leading petrochemical producer.

  • Worked on network optimization problems to identify optimal supply chain network configuration for Indian and global clients using Llamasoft Supply Chain Guru and internally developed tools.

  • Performed oil vessel routing and scheduling optimization to identify opportunity to reduce logistic cost by 10%.

  • Helped an Indian chemical client reduce logistics cost by 11% through online bidding.

Supply Chain Supply Chain Optimization Supply Chain Management (SCM) Data AnalysisData ScienceTime Series ForecastingPredictive Modeling Technical Leadership Machine LearningTrend Analysis Statistical Modeling Data VisualizationAnalyticsData Analytics Heuristics Topology Network Optimization LLaMA

Education

Introduction to Data Science in Python | Neural Networks and Deep Learning | 
Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization | Structuring Machine Learning Projects | Sequence Models | 
Deep Learning Specialization | Convolutional Neural Networks
Introduction to Data Science in Python | Neural Networks and Deep Learning | Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization | Structuring Machine Learning Projects | Sequence Models | Deep Learning Specialization | Convolutional Neural Networks
Coursera
2018 - 2018
B.Tech/M.Tech Mechanical Engineering
B.Tech/M.Tech Mechanical Engineering
Indian Institute of Technology (IIT) Bombay - India
2012 - 2017 (5 years)