Kartheek K.

About Me

10+ years of experience delivering high-impact ML and GenAl solutions across domains like NLP, CV, scientific ML, and applied LLMs. Proven expertise in building and scaling production-grade systems, LLM applications (RAG, Agentic Al, LangGraph), and enterprise-grade ML pipelines. Strong track record of innovation, internal leadership, and hackathon success at Microsoft. Skilled in cross-functional collaboration, technical problem-solving, and mentoring.

AI, ML & LLM

BERT LangChain GPT Pytorch MLFlow Large Language Models (LLMs)

Backend

Database

DevOps

CI/CD Docker Azure

Workflow

GitHub Actions

Other

Prompt Engineering Time Series Optimization Databricks ETL Pipelines OCR RAG NLP Transformers Scikit Learn

Work history

Microsoft
Microsoft
Senior Data Scientist
2021 - Present (4 years)
Hyderabad, India
  • Co-led the design and development of a personalized GenAl CoPilot for Career Guidance using GPT-based LLMs and feedback loops, delivering 1M+ career sessions for blue-collar workers.

  • Built end-to-end GenAl applications using LangChain & LangGraph and delivered internal workshops and reusables to 50+ data scientists.

  • Led the 2023 winning team with an RAG-powered Agentic Al solution for workforce upskilling, built a GPT-based chatbot, and advocated for Responsible Al principles.

  • Replaced 6-hour PDE simulations with FNO-based Scientific Deep Learning surrogate, delivering <1 minute latency (99%+ improvement), trained using Distributed Data Parallel (DDP) on Azure ML.

  • Achieved <1% loss on 3D pressure/saturation field reconstruction using only 100 simulated scenarios.

  • Led Microsoft OSS contribution for GPU-accelerated credit risk modeling using RAPIDS and LightGBM.

GenAI GPT LLMs LangChain Langgraph RAG Agentic Al Azure ML NLPCV Scientific ML OpenAIPysparkPythonChatbot Development PytorchPredictive AnalyticsDeep LearningData SciencePandasMachine LearningTransformersMulti-agent Systems DDP LightGBMRAPIDS GPU Computing
iCube CSI (Intuceo)
iCube CSI (Intuceo)
Lead Data Scientist
2016 - 2021 (5 years)
Hyderabad, India
  • Built an NLP pipeline for a global lens brand with custom NER, BERT-based aspect sentiment, and semantic clustering.

  • Reduced manual document extraction costs by 70% with custom OCR + NLP system.

  • Delivered insights that directly drove 15-20% customer satisfaction improvement.

  • Optimized manufacturing line performance by applying evolutionary algorithms on a digital twin simulation, discovering optimal robotic travel time configurations and significantly improving throughput and efficiency.

  • Built a real-time correlation engine to map frequent industrial downtimes to alarm bursts from hundreds of hourly events, enabling predictive diagnostics and minimizing unplanned disruptions.

  • Designed and deployed a personalized health analytics system using Association Rule Mining, uncovering patterns in diabetes and blood pressure trends across individual customer journeys, influencing targeted health interventions.

NLPNER BERT OCRSemantic Clustering Sentiment Analysis Data ScienceArtificial IntelligenceMachine Learning Algorithms Data StructuresPysparkComputer VisionScikit LearnPythonData Analytics Artificial Neural Networks (ANN) KerasSciPyPytorchPredictive AnalyticsNumpyDeep LearningPandasMachine LearningObject-oriented Programming (OOP) Linear Algebra Predictive Modeling
Xion Multiventures
Xion Multiventures
Associate
2015 - 2015
Mumbai, India
  • Identified opportunities and developed trading strategies through research and back testing.

  • Worked with technical chart patterns and statistical methods.

Data Analytics AlgorithmsStatisticsTrading Research
Futures First
Futures First
Analyst
2011 - 2014 (3 years)
Hyderabad, India
  • Tracked the prevailing macroeconomic conditions, supply-demand scenario, central bank activities, geopolitical situations, and other markers like currencies and equities to predict the market movement.

  • Analyzed market trends and execute effective trading strategies with strict risk management considering macroeconomic factors, arbitrage with correlated entities, and analyzing FED and USDA reports.

AnalyticsAlgorithmsStatisticsData Analytics Trading Macroeconomics Banking & Finance Corporate Finance Market Assessment Risk Management

Education

Microsoft Certified: Azure Al Engineer Associate (Sep 2021 - Expired Sep 2023) | Microsoft Certified: Azure Data Scientist Associate (May 2022 - Expired May 2023)
Microsoft Certified: Azure Al Engineer Associate (Sep 2021 - Expired Sep 2023) | Microsoft Certified: Azure Data Scientist Associate (May 2022 - Expired May 2023)
Microsoft
2022 - 2022
CoRe (Credential of Readiness) Business Analytics, Economics for Managers, and Financial Accounting
CoRe (Credential of Readiness) Business Analytics, Economics for Managers, and Financial Accounting
Harvard Business School Online
2021 - 2021
Machine Learning Certificate
Machine Learning Certificate
Coursera Course Certificates
2015 - 2015
PGD Machine Learning & Al
PGD Machine Learning & Al
International Institute of Information Technology Bangalore - India
2013 - 2014 (1 year)
B.Tech Electronics & Communications Engineering
B.Tech Electronics & Communications Engineering
IIITH - India
2007 - 2011 (4 years)