Sakshi G.

About Me

Senior AI Engineer with 5+ years of experience spanning generative AI, agentic systems, NLP, computer vision, and MLOps across product companies, consulting engagements, and applied research in Germany and India. Sakshi has built and shipped on-device SLMs, LangGraph-based agentic RAG pipelines, fine-tuned diffusion models, sensor fusion systems, and scalable ETL and CI/CD infrastructure on AWS. Proficient in Python, PyTorch, TensorFlow, LangChain, LangGraph, and the OpenAI/Anthropic/Sarvam API ecosystems, with a consistent focus on latency, scalability, and measurable business impact.

AI, ML & LLM

PyTorch, XGBoost, OpenAI, Claude, Gemini, LangChain, LangGraph, Streamlit, BERT, RoBERTa, GPT, Airflow, MLflow, AI/ML

Backend

DevOps

Workflow

GitHub Actions

Other

C, TensorFlow, scikit-learn, Hugging Face Transformers, Stable Diffusion, LoRA, RAG, Vector Search, NER, YOLOv5, Pose Estimation, Sensor Fusion, PySpark, ETL, Tableau, Dash, OpenCV, CNN, Transformers, Text Generation, Sentiment Analysis

Work history

Lenskart
AI / ML Engineer
2026 - 2026
Remote
  • Replaced Whisper API-based intent inference with a fine-tuned, quantized SLM deployed on-device, cutting latency by ~3x and pushing intent classification accuracy beyond 98% while eliminating external API costs

  • Designed end-to-end system architecture for the SLM pipeline on AWS EC2 including data ingestion, fine-tuning, quantization, serving, and MLflow-based experiment tracking with shadow deployments

  • Led AI R&D for the voice intelligence initiative: defined the technical roadmap, set latency SLAs, designed for horizontal scaling, and mentored engineers across model development and evaluation workstreams

Python, AWS EC2, MLflow, Model quantization, Fine-tuning, Shadow deployments, Intent Classification
Dialog Matrix
Technical Consultant - Generative AI
2025 - 2026 (1 year)
Remote
  • Fine-tuned Stable Diffusion with LoRA adapters for few-shot visual concept learning, reducing required training samples from thousands to under 20 while preserving generation quality

  • Designed a Pipecat agentic pipeline to serve custom TTS models via API with real-time audio streaming, turn-taking logic, and fallback routing under latency constraints

  • Built an agentic customer support system, orchestrated via LangGraph, with dual-corpus RAG on ChromaDB, achieving a RAGAS score of 0.85 and a ~40% ticket deflection rate

Stable Diffusion, LoRA, Pipecat, FastAPI, TTS, ChromaDB, LangGraph, RAG, RAGAS
1&1 Telecommunication SE
Data Scientist - Machine Learning
2025 - 2025
Remote
  • Enhanced ML data pipelines for fraud-login detection using XGBoost, improving model precision and recall on production traffic

  • Integrated monitoring KPIs into Grafana dashboards for real-time visibility into model performance and data drift

  • Managed ML pipeline optimization and performance monitoring for production systems

XGBoost, Grafana, Data pipelines, Model Monitoring, Fraud detection
Bertrandt GmbH
Software Developer - AI & MLOps
2022 - 2025 (3 years)
Remote
  • Optimised 5+ ETL workflows using AWS Glue and PySpark, producing analytics-ready datasets at scale for downstream ML model training

  • Automated CI/CD pipelines with GitHub Actions and Jenkins, achieving 80% build/test coverage and boosting development efficiency by 25%

  • Built an object detection system with YOLOv8 achieving 92% accuracy on a custom automotive dataset and developed interactive Tableau dashboards integrated with AWS Data Catalog

AWS Glue, PySpark, GitHub Actions, Jenkins, Docker, Airflow, TorchServe, YOLOv8, Tableau, AWS Data Catalog, C++
STTech GmbH
Software Developer - Deep Learning & NLP
2020 - 2022 (2 years)
Remote
  • Researched and implemented CNN and Transformer architectures for out-of-distribution (OOD) detection in autonomous driving datasets using PyTorch, and co-authored a peer-reviewed publication

  • Built a custom NER pipeline using BERT and RoBERTa, improving entity extraction accuracy by 28%

  • Implemented 2D sensor fusion with a Python-based Kalman filter and autonomous path planning with CARLA and ROS

PyTorch, CNN, Transformers, BERT, RoBERTa, NER, Sensor Fusion, Kalman filter, CARLA, ROS
AGT Group (R&D) GmbH
Computer Vision Intern
2019 - 2019
Remote
  • Accelerated pose tracking by 40% via GPU-optimised optical flow techniques

  • Automated training and evaluation pipelines, reducing iteration cycles by 35%

  • Built stereo and IP camera calibration setups with OpenCV

Optical flow, OpenCV

Education

B.Tech. Electronics and Communication Engineering
The NorthCap University
2017
M.Sc. Information and Communication Engineering
TU Darmstadt
2020