Rishab is a Senior Data Scientist and ML expert who designs and develops end-to-end complex and impactful ML solutions deployed to production on cloud and edge platforms. As a Data Scientist proficient in both TensorFlow and PyTorch, he has spearheaded projects across diverse sectors, harnessing data to drive strategic decision-making and enhance user experiences, underscoring the power of data in transforming business landscapes. Rishab is also skilled in NLP, using both heuristic and Deep Learning-based methodologies in multi-agent chatbot development, multi-label sentiment analysis, sentence correction, RAG with custom tools, and optimizing LLMs for efficient deployment. Using his Computer Vision skills, he develops bespoke applications spanning Image Classification, Image Segmentation, Object Detection, Object Tracking, Optical Character Recognition (OCR), Face Recognition, and so on.
Developed and optimized enterprise-grade prompt engineering frameworks using LangChain and LangGraph, resulting in a 40% improvement in content generation quality for sales and marketing materials across multiple industry verticals.
Led knowledge management initiatives including fine-tuning Llama models on domain-specific data, building structured knowledge graphs, and designing information retrieval systems.
Collaborated with cross-functional teams to align prompt engineering strategies with business goals and improve overall content performance.
Invisible Technologies
Prompt Engineer
2024 - 2025 (1 year)
Remote
Developed hallucination detection and content verification systems using LangChain agents and LangGraph, creating ensemble techniques that reduced inaccuracies by 65% while maintaining strict data privacy protocols for enterprise clients.
Engineered prompt frameworks for claim extraction and verification using OpenAl Assistants and custom Python tooling, enabling automated quality assurance processes that scaled to process 30,000+ content pieces daily.
Implemented and fine-tuned Llama models as verification agents, developing novel techniques for cost-effective factuality assessment that maintained 92% accuracy while reducing API costs by 72%.
Extracted key medical claims from research documents and materials published by U.S. pharmaceutical companies, meeting FDA requirements.
Used extracted data to create presentations for customers and clinicians, supporting sales pitches for new drugs.
Built the solution on LangChain, with fine-tuned models and prompt engineering.
PythonLarge Language Models (LLMs)
Fine-tuning
ChatGPT
Mistral Al
Llama 2
Llama 3
Retrieval-augmented Generation (RAG)
LangChain
Al Agents
DeepSeek
Claude
Gemini
FLIRT TECHNOLOGIES, LLC
Al Developer
2024 - 2024
Remote
Developed Al-powered social media app features for personalized content creation using Stable Diffusion and Deepgram.
Enabled near real-time immersive experiences with professional model photos, videos, and voice cloning.
Used machine learning techniques like low-rank adaptation (LoRA) based on PEFT, model quantization, checkpoint merging, and other related approaches to achieve state-of-the-art (SOTA) optimization and performance.
Large Language Models (LLMs)
Machine LearningStable Diffusion
OpenAL
Text to Image
Claude
Gemini
Revibe Al
LLM Chatbot Developer
2023 - 2023
Remote
Completed a POC of a mental health assistant built on top of Voiceflow. It understands the user's mind, responds with appropriate custom suggestions from the given knowledge base, or engages in a general conversation like a human.
Built the RAG-based chatbot that replicates the behavior of a neuroscientist and emphatically engages with the user during the conversation.
Used LangChain to enrich the vector store, adding proprietary neuroscience and psychological data to generate a more relevant response from the LLMs.
Generative Pre-trained Transformers (GPT)
PythonChatbot Conversation Design
API IntegrationNatural Language Understanding (NLU)
Natural Language Processing (NLP)
Language Models
Voiceflow
Cognitive Behavioral Therapy (CBT)
ZapierMachine LearningChatbots
Retrieval-augmented Generation (RAG)
Generative Artificial Intelligence (GenAl)
Azure Al Studio
LangChain
Integrated Development Environments (IDE)
ChatGPT
OpenAI API
Al Agents
Azure SQL Databases
Claude
Gemini
Zachary Gidwitz
ML Expert | Data Scientist
2023 - 2023
Remote
Built a recommendation engine for an addiction recovery app that incorporates user demographics, mental state, and other metadata from app interaction to suggest quotes and prayers relevant to the user's state of mind.
Added memory and linked the vector database quickly.
Provided several offline and online metrics to track the live performance of the model and the recommendation system.
Data ScienceMachine LearningPythonRecommendation Systems
Data Strategy
Data
Chatbots
OpenAI GPT-3 API
JupyterCloudOpenAI GPT-4 API
Chatbot Conversation Design
IntegrationProgramming
AI Programming
Data CleaningUnstructured Data Analysis
AirtableMachine Learning Automation
Project ManagementStatistical Analysis
Data Reporting
Data Cleansing
Prompt Engineering
Retrieval-augmented Generation (RAG)
Generative Artificial Intelligence (GenAI)
LangChain
ChatGPT
OpenAI Assistants API
OpenAI API
Azure SQL Databases
Odem Global Pty Ltd
Machine Learning Engineer
2023 - 2023
Remote
Trained causal language model (CLM) with DeepSpeed on Bittensor, a peer-to peer open-source protocol powering a globally distributed decentralized neural network.
Used a mountain dataset (800+ GB) to train generative LLMs using DeepSpeed, which has lower network validation loss and thereby increases the model's reward, benefiting the client.
Contributed to hyper-parameter tuning and revamped the default PyTorch data loader for faster dataset loading. Also, validated the models' performance on the validation set to ensure it meets the desired performance metrics.
Language Models
Machine LearningFine-tuning
Causal Inference
DeepSpeed
OpenAI GPT-3 API
JupyterCloudIntegrationProgramming
AI Programming
Data CleaningUnstructured Data Analysis
Machine Learning Automation
Large Language Models (LLMs)
Project ManagementRegexStatistical Analysis
Data Reporting
Data Cleansing
Prompt Engineering
Chatbots
Retrieval-augmented Generation (RAG)
Azure SQL Databases
Rich Lemon Apps FZE LLC
AI Expert
2023 - 2023
Remote
Developed a user sticker model using Stable Diffusion and ControlNet for a given style.
Experimented with different configurations of parameters to understand the effect on the results and achieved state-of-the-art results.
Explored different methods like checkpoint merging, trained a stickers model, and then fine-tuned that model on Portrait Face, and combined sticker style and user image training.
Revamped roof measurement automation for US houses by engineering a high-performance Line Detection model on aerial imagery to detect roof, wall, and ground lines from 28% to 59%.
Developed segmentation models for roof and wall facet detection while handling obstruction, achieving 86% accuracy.
Orchestrated microservices using Step Functions, AWS Lambda, and Amazon SageMaker endpoints, improving response time by 25%. This was followed by simplifying maintenance and support, focusing on maximizing code reuse.
Designed and built a chatbot using Hugging Face transformers with DeepSpeed to resolve queries for over 400 million of active monthly users; deployed successfully, handling over 10 million daily conversations with 85% accuracy.
Improved the accuracy of face recognition, matching, and liveness-check models, added face segmentation, blur and forgery detection, real-time blacklisting, and face search, enhanced service efficiency, and reduced manual effort by over 25%.
Inculcated best MLOps practices on the team to lower technical debt and adapt workflow that requires little to no manual intervention for simulations, testing, deployment, and monitoring.
Data Analytics
Statistical Modeling
Natural Language Processing (NLP)
Generative Pre-trained Transformers (GPT)
Python 3
DockerBash Script
Data ScienceSQLOptical Character Recognition (OCR)
Neural NetworksOptimization
ClusteringCode ReviewRemote Team Leadership
Graphics Processing Unit (GPU)
Transfer Learning
TransformersSequence Models
Image Retrieval
Image Recognition
ClassificationEntity Extraction
Recurrent Neural Networks (RNNs)
Artificial Neural Networks (ANN)
Convolutional Neural Networks (CNNs)
TensorboardGenerative Adversarial Networks (GANs)
LinuxLanguage Models
PostgreSQLRecommendation Systems
AlgorithmsGenerative Pre-trained Transformer 3 (GPT-3)
XGBoostData Extraction
Data Manipulation
SnowflakeAnalyticsMySQLAmazon Web Services (AWS)
Handwriting Recognition
Data AnalysisVersion Control Systems
CommunicationData EngineeringRegressionRAPIDS
APIsAmazon SageMaker
noSQLImage Search
Amazon Rekognition
Al Design
Jupyter NotebookFlaskNVIDIA TensorRT
Microservices Architecture
Data Scientist
Chatbots
JupyterCloudAnomaly Detection
OpenAI GPT-4 API
Chatbot Conversation Design
IntegrationProgramming
BitbucketData CleaningUnstructured Data Analysis
Machine Learning Automation
KubernetesAmazon Elastic Container Service (ECS)
Image AnalysisLarge Language Models (LLMs)
Project ManagementStatistical Analysis
Data Reporting
Data Cleansing
Big DataDatabricks
Azure SQL Databases
Trantor
ML Engineer
2019 - 2020 (1 year)
Chandigarh, India
Designed and deployed an OCR solution that outperformed the accuracy of Google OCR by 12% as benchmarked on the ICDAR 2013 dataset.
Prototyped an NLP--based bot using recurrent neural networks to parse comments on the website, assess sentiments, and classify them into themes.
Worked on a trained convolutional neural networks model using INT8 fixed-point and trained a software categorization and brand recognition tool using word2vec, N-grams embeddings, and kNN classifier.
Reduced the overall cost from an estimated $1 million to less than $50,000 per year.
Enabled standard responses and activation of the needed systems to respond.
Conceptualized and built a ConvLSTM model on TensorFlow for hand gesture recognition.
Worked on single-shot face recognition using SSD MobileNet for detection and Inception-ResNet v2 for feature extraction.
Led a team to propose and implement predictive analysis for retail sales forecasting using ARIMA, exponential smoothing, and Holt's Winter models, resulting in a 5.35% increase in quarterly sales.
Used Bayesian optimization techniques for hyperparameter tuning.
Introduced TP-GAN for photorealism and identity, preserving the frontal face-view synthesis from any pose.
Analyzed the sales trends and sizing curve distribution across categories and product lines to give recommendations on the localized sizing distribution for a demand forecast model. Proposed and implemented predictive analysis for biweekly retail sales forecasting using ARIMA, exponential smoothing, and Holt-Winters methods. Incorporated Bayesian hyperparameter tuning with hybrid modeling using principal component analysis, support vector machines, and ensemble learning methods with Random Forests and XGBoost. This led to a 10% year-on-year increase in key metrics such as sales revenue.
A generative chatbot for user queries using a Transformer-based Seq2Seq model from chat messages, currently deployed commercially, handling thousands of daily conversations in English or Hinglish with almost 85% accuracy. The chatbot is also comprised of the nearest neighborhood-based intent detection solution, which uses pre-tagged customer messages to tag new customer messages for the support team. The idea was to reduce customer service workload by automatically replying to user messages with pre-defined responses based on intent detection.
Optimized retail product prices for TIKI using demand prediction by ML models and time-series historical sales data. LightGBM model is trained to predict sales at daily and 3-day frequencies. The work included analyzing data, engineering features, managing stakeholders, designing A/B experiments, and performing statistical analysis for hypothesis testing.