Rishab P.

About Me

Rishab is a Senior Data Scientist and ML expert who designs and develops end-to-end complex and impactful ML solutions deployed to production on cloud and edge platforms. As a Data Scientist proficient in both TensorFlow and PyTorch, he has spearheaded projects across diverse sectors, harnessing data to drive strategic decision-making and enhance user experiences, underscoring the power of data in transforming business landscapes. Rishab is also skilled in NLP, using both heuristic and Deep Learning-based methodologies in multi-agent chatbot development, multi-label sentiment analysis, sentence correction, RAG with custom tools, and optimizing LLMs for efficient deployment. Using his Computer Vision skills, he develops bespoke applications spanning Image Classification, Image Segmentation, Object Detection, Object Tracking, Optical Character Recognition (OCR), Face Recognition, and so on.

AI, ML & LLM

Machine Learning Deep Learning Pytorch AI Chatbots Large Language Models (LLMs)

Backend

Other

Data Science Artificial Intelligence Natural Language Processing (NLP) Computer Vision Tensorflow Predictive Modeling Predictive Analytics Numpy Natural Language Toolkit (NLTK) Data Analytics Pandas OpenCV Chatbot Development

Work history

Tomorrow People
Prompt Engineer
2025 - 2025
Remote
  • Developed and optimized enterprise-grade prompt engineering frameworks using LangChain and LangGraph, resulting in a 40% improvement in content generation quality for sales and marketing materials across multiple industry verticals.

  • Led knowledge management initiatives including fine-tuning Llama models on domain-specific data, building structured knowledge graphs, and designing information retrieval systems.

  • Collaborated with cross-functional teams to align prompt engineering strategies with business goals and improve overall content performance.

Invisible Technologies
Prompt Engineer
2024 - 2025 (1 year)
Remote
  • Developed hallucination detection and content verification systems using LangChain agents and LangGraph, creating ensemble techniques that reduced inaccuracies by 65% while maintaining strict data privacy protocols for enterprise clients.

  • Engineered prompt frameworks for claim extraction and verification using OpenAl Assistants and custom Python tooling, enabling automated quality assurance processes that scaled to process 30,000+ content pieces daily.

  • Implemented and fine-tuned Llama models as verification agents, developing novel techniques for cost-effective factuality assessment that maintained 92% accuracy while reducing API costs by 72%.

Prompt Engineering Natural Language Processing (NLP) Machine LearningData SciencePythonPytorchTensorflowCloud
Solstice Health Inc
Al Developer
2024 - 2024
Remote
  • Extracted key medical claims from research documents and materials published by U.S. pharmaceutical companies, meeting FDA requirements.

  • Used extracted data to create presentations for customers and clinicians, supporting sales pitches for new drugs.

  • Built the solution on LangChain, with fine-tuned models and prompt engineering.

PythonLarge Language Models (LLMs) Fine-tuning ChatGPT Mistral Al Llama 2 Llama 3 Retrieval-augmented Generation (RAG) LangChain Al Agents DeepSeek Claude Gemini
FLIRT TECHNOLOGIES, LLC
Al Developer
2024 - 2024
Remote
  • Developed Al-powered social media app features for personalized content creation using Stable Diffusion and Deepgram.

  • Enabled near real-time immersive experiences with professional model photos, videos, and voice cloning.

  • Used machine learning techniques like low-rank adaptation (LoRA) based on PEFT, model quantization, checkpoint merging, and other related approaches to achieve state-of-the-art (SOTA) optimization and performance.

Large Language Models (LLMs) Machine LearningStable Diffusion OpenAL Text to Image Claude Gemini
Revibe Al
LLM Chatbot Developer
2023 - 2023
Remote
  • Completed a POC of a mental health assistant built on top of Voiceflow. It understands the user's mind, responds with appropriate custom suggestions from the given knowledge base, or engages in a general conversation like a human.

  • Built the RAG-based chatbot that replicates the behavior of a neuroscientist and emphatically engages with the user during the conversation.

  • Used LangChain to enrich the vector store, adding proprietary neuroscience and psychological data to generate a more relevant response from the LLMs.

Generative Pre-trained Transformers (GPT) PythonChatbot Conversation Design API IntegrationNatural Language Understanding (NLU) Natural Language Processing (NLP) Language Models Voiceflow Cognitive Behavioral Therapy (CBT) ZapierMachine LearningChatbots Retrieval-augmented Generation (RAG) Generative Artificial Intelligence (GenAl) Azure Al Studio LangChain Integrated Development Environments (IDE) ChatGPT OpenAI API Al Agents Azure SQL Databases Claude Gemini
Zachary Gidwitz
ML Expert | Data Scientist
2023 - 2023
Remote
  • Built a recommendation engine for an addiction recovery app that incorporates user demographics, mental state, and other metadata from app interaction to suggest quotes and prayers relevant to the user's state of mind.

  • Added memory and linked the vector database quickly.

  • Provided several offline and online metrics to track the live performance of the model and the recommendation system.

Data ScienceMachine LearningPythonRecommendation Systems Data Strategy Data Chatbots OpenAI GPT-3 API JupyterCloudOpenAI GPT-4 API Chatbot Conversation Design IntegrationProgramming AI Programming Data CleaningUnstructured Data Analysis AirtableMachine Learning Automation Project ManagementStatistical Analysis Data Reporting Data Cleansing Prompt Engineering Retrieval-augmented Generation (RAG) Generative Artificial Intelligence (GenAI) LangChain ChatGPT OpenAI Assistants API OpenAI API Azure SQL Databases
Odem Global Pty Ltd
Machine Learning Engineer
2023 - 2023
Remote
  • Trained causal language model (CLM) with DeepSpeed on Bittensor, a peer-to peer open-source protocol powering a globally distributed decentralized neural network.

  • Used a mountain dataset (800+ GB) to train generative LLMs using DeepSpeed, which has lower network validation loss and thereby increases the model's reward, benefiting the client.

  • Contributed to hyper-parameter tuning and revamped the default PyTorch data loader for faster dataset loading. Also, validated the models' performance on the validation set to ensure it meets the desired performance metrics.

Language Models Machine LearningFine-tuning Causal Inference DeepSpeed OpenAI GPT-3 API JupyterCloudIntegrationProgramming AI Programming Data CleaningUnstructured Data Analysis Machine Learning Automation Large Language Models (LLMs) Project ManagementRegexStatistical Analysis Data Reporting Data Cleansing Prompt Engineering Chatbots Retrieval-augmented Generation (RAG) Azure SQL Databases
Rich Lemon Apps FZE LLC
AI Expert
2023 - 2023
Remote
  • Developed a user sticker model using Stable Diffusion and ControlNet for a given style.

  • Experimented with different configurations of parameters to understand the effect on the results and achieved state-of-the-art results.

  • Explored different methods like checkpoint merging, trained a stickers model, and then fine-tuned that model on Portrait Face, and combined sticker style and user image training.

Deep LearningImage ProcessingStable Diffusion Computer VisionControlNet 3DMicroservices Architecture Data Scientist JupyterCloudIntegrationProgramming AI Programming Data CleaningMachine Learning Automation Image AnalysisProject ManagementStatistical Analysis Data Reporting Data Cleansing Prompt Engineering Image Generation Text to Image Multimodal GenAl OpenAI API Azure SQL Databases
EagleView
Senior ML Expert | Data Scientist
2021 - 2023 (2 years)
Remote
  • Revamped roof measurement automation for US houses by engineering a high-performance Line Detection model on aerial imagery to detect roof, wall, and ground lines from 28% to 59%.

  • Developed segmentation models for roof and wall facet detection while handling obstruction, achieving 86% accuracy.

  • Orchestrated microservices using Step Functions, AWS Lambda, and Amazon SageMaker endpoints, improving response time by 25%. This was followed by simplifying maintenance and support, focusing on maximizing code reuse.

PythonMachine Learning Operations (MLOps) PytorchPython API DockerComputer VisionGPU Computing Generative Pre-trained Transformers (GPT) Natural Language Processing (NLP) Data ScienceSQLNeural NetworksData VisualizationUnsupervised Learning Code ReviewRemote Team Leadership Graphics Processing Unit (GPU) Transfer Learning TransformersSequence Models Image Retrieval Image Recognition ClassificationEntity Extraction Siamese Neural Networks Artificial Neural Networks (ANN) Convolutional Neural Networks (CNNs) TensorboardGenerative Adversarial Networks (GANs) LinuxLanguage Models Python 3 Recommendation Systems AlgorithmsXGBoostData Extraction Data Manipulation SnowflakeAnalyticsMySQLAmazon Web Services (AWS) Data AnalysisVersion Control Systems CommunicationData EngineeringRegressionRAPIDS APIsAmazon SageMaker noSQLImage Search Al Design Jupyter NotebookFlaskNVIDIA TensorRT Microservices Architecture Data Scientist JupyterCloudAnomaly Detection IntegrationProgramming BitbucketData CleaningUnstructured Data Analysis Machine Learning Automation KubernetesAmazon Elastic Container Service (ECS) Image AnalysisLarge Language Models (LLMs) Project ManagementStatistical Analysis Data Reporting Data Cleansing You Only Look Once (YOLO) Databricks Azure SQL Databases
Airtel Digital
Airtel Digital
Senior Software Engineer
2020 - 2021 (1 year)
Gurgaon (Hybrid), India
  • Designed and built a chatbot using Hugging Face transformers with DeepSpeed to resolve queries for over 400 million of active monthly users; deployed successfully, handling over 10 million daily conversations with 85% accuracy.

  • Improved the accuracy of face recognition, matching, and liveness-check models, added face segmentation, blur and forgery detection, real-time blacklisting, and face search, enhanced service efficiency, and reduced manual effort by over 25%.

  • Inculcated best MLOps practices on the team to lower technical debt and adapt workflow that requires little to no manual intervention for simulations, testing, deployment, and monitoring.

Data Analytics Statistical Modeling Natural Language Processing (NLP) Generative Pre-trained Transformers (GPT) Python 3 DockerBash Script Data ScienceSQLOptical Character Recognition (OCR) Neural NetworksOptimization ClusteringCode ReviewRemote Team Leadership Graphics Processing Unit (GPU) Transfer Learning TransformersSequence Models Image Retrieval Image Recognition ClassificationEntity Extraction Recurrent Neural Networks (RNNs) Artificial Neural Networks (ANN) Convolutional Neural Networks (CNNs) TensorboardGenerative Adversarial Networks (GANs) LinuxLanguage Models PostgreSQLRecommendation Systems AlgorithmsGenerative Pre-trained Transformer 3 (GPT-3) XGBoostData Extraction Data Manipulation SnowflakeAnalyticsMySQLAmazon Web Services (AWS) Handwriting Recognition Data AnalysisVersion Control Systems CommunicationData EngineeringRegressionRAPIDS APIsAmazon SageMaker noSQLImage Search Amazon Rekognition Al Design Jupyter NotebookFlaskNVIDIA TensorRT Microservices Architecture Data Scientist Chatbots JupyterCloudAnomaly Detection OpenAI GPT-4 API Chatbot Conversation Design IntegrationProgramming BitbucketData CleaningUnstructured Data Analysis Machine Learning Automation KubernetesAmazon Elastic Container Service (ECS) Image AnalysisLarge Language Models (LLMs) Project ManagementStatistical Analysis Data Reporting Data Cleansing Big DataDatabricks Azure SQL Databases
Trantor
Trantor
ML Engineer
2019 - 2020 (1 year)
Chandigarh, India
  • Designed and deployed an OCR solution that outperformed the accuracy of Google OCR by 12% as benchmarked on the ICDAR 2013 dataset.

  • Prototyped an NLP--based bot using recurrent neural networks to parse comments on the website, assess sentiments, and classify them into themes.

  • Worked on a trained convolutional neural networks model using INT8 fixed-point and trained a software categorization and brand recognition tool using word2vec, N-grams embeddings, and kNN classifier.

  • Reduced the overall cost from an estimated $1 million to less than $50,000 per year.

  • Enabled standard responses and activation of the needed systems to respond.

Machine LearningDeep LearningData ScienceSQLOptical Character Recognition (OCR) Data VisualizationOptimization Unsupervised Learning ClusteringCode ReviewRemote Team Leadership Graphics Processing Unit (GPU) Transfer Learning TransformersSequence Models Image Recognition ClassificationRecurrent Neural Networks (RNNs) Siamese Neural Networks Artificial Neural Networks (ANN) Convolutional Neural Networks (CNNs) TensorboardGenerative Adversarial Networks (GANs) LinuxLanguage Models Python 3 Recommendation Systems AlgorithmsGenerative Pre-trained Transformer 3 (GPT-3) XGBoostData Extraction Data Manipulation SnowflakeAnalyticsMySQLHandwriting Recognition Google Vision API Data AnalysisVersion Control Systems CommunicationData EngineeringRegressionRAPIDS APIsAmazon SageMaker noSQLImage Search Amazon Rekognition Al Design Jupyter NotebookFlaskMicroservices Architecture Data Scientist Chatbots JupyterCloudGoogle Cloud Platform (GCP) Anomaly Detection OpenAI GPT-4 API Chatbot Conversation Design IntegrationProgramming Data CleaningUnstructured Data Analysis Machine Learning Automation Image AnalysisLarge Language Models (LLMs) Project ManagementRegexStatistical Analysis Data Reporting Data Cleansing You Only Look Once (YOLO) Elasticsearch Azure SQL Databases
Yamaha Motor Solutions India
Yamaha Motor Solutions India
Software Engineer
2018 - 2019 (1 year)
Faridabad, India
  • Conceptualized and built a ConvLSTM model on TensorFlow for hand gesture recognition.

  • Worked on single-shot face recognition using SSD MobileNet for detection and Inception-ResNet v2 for feature extraction.

  • Led a team to propose and implement predictive analysis for retail sales forecasting using ARIMA, exponential smoothing, and Holt's Winter models, resulting in a 5.35% increase in quarterly sales.

  • Used Bayesian optimization techniques for hyperparameter tuning.

  • Introduced TP-GAN for photorealism and identity, preserving the frontal face-view synthesis from any pose.

Python 3 Machine LearningData ScienceSQLNeural NetworksData VisualizationOptimization Code ReviewGraphics Processing Unit (GPU) Transfer Learning Machine Vision Sequence Models ClassificationSiamese Neural Networks Convolutional Neural Networks (CNNs) Artificial Neural Networks (ANN) TensorboardGenerative Adversarial Networks (GANs) LinuxAlgorithmsXGBoostData Extraction Data Manipulation SnowflakeAnalyticsMySQLHandwriting Recognition Data AnalysisVersion Control Systems CommunicationData EngineeringRegressionAPIsAmazon SageMaker noSQLAl Design Statistical Analysis Data Reporting Data Cleansing You Only Look Once (YOLO) Point Cloud Data Point Clouds Azure SQL Databases

Showcase

Retail Sale Forecasting
Retail Sale Forecasting

Analyzed the sales trends and sizing curve distribution across categories and product lines to give recommendations on the localized sizing distribution for a demand forecast model. Proposed and implemented predictive analysis for biweekly retail sales forecasting using ARIMA, exponential smoothing, and Holt-Winters methods. Incorporated Bayesian hyperparameter tuning with hybrid modeling using principal component analysis, support vector machines, and ensemble learning methods with Random Forests and XGBoost. This led to a 10% year-on-year increase in key metrics such as sales revenue.

Callup AI | Chatbot Platform
Callup AI | Chatbot Platform

A generative chatbot for user queries using a Transformer-based Seq2Seq model from chat messages, currently deployed commercially, handling thousands of daily conversations in English or Hinglish with almost 85% accuracy. The chatbot is also comprised of the nearest neighborhood-based intent detection solution, which uses pre-tagged customer messages to tag new customer messages for the support team. The idea was to reduce customer service workload by automatically replying to user messages with pre-defined responses based on intent detection.

Price Optimization
Price Optimization

Optimized retail product prices for TIKI using demand prediction by ML models and time-series historical sales data. LightGBM model is trained to predict sales at daily and 3-day frequencies. The work included analyzing data, engineering features, managing stakeholders, designing A/B experiments, and performing statistical analysis for hypothesis testing.

Education

B.Tech Computer Science and Engineering
B.Tech Computer Science and Engineering
DIT University - India
2014 - 2018 (4 years)