Krishna S.

About Me

Krishna is a Senior Data Scientist passionate about analytics and Machine Learning to solve challenging problems in eCommerce, marketing, and NLP. With 4 years of experience in Machine Learning, including his tenure in the analytics division of JP Morgan Chase & Co., he has developed and deployed data-driven solutions for various clients across different domains and industries, building NLP pipelines in Python using Keras to extract and summarize opinions and feedback from customer product reviews, performing market mix modeling, demand forecasting, and A/B testing analytics to optimize spending, supply chain, and marketing funnel, and building data pipelines and dashboards in AWS, GCP, and Tableau to provide actionable insights and data-driven decision-making. Krishna holds certifications in Machine Learning, Python, and software testing and is keen on using data and ML to create value and impact for businesses and society.

AI, ML & LLM

Machine Learning Deep Neural Networks Deep Learning LLM Agents Large Language Models (LLMs) LLM Generative AI Generative Artificial Intelligence (GenAI)

Backend

Python 3 Python

Database

DevOps

CI/CD Jenkins Amazon Web Services (AWS)

Other

Analytics Code Review Data Analytics Statistical Data Analysis Algorithms Data Reporting Exploratory Data Analysis Statistical Learning Object Oriented Development (OOD) Object-oriented Analysis & Design (OOAD) Object-oriented Design (OOD) Object-oriented Programming (OOP) Pandas Numpy Multithreading Parallel Programming

Work history

The Weather Company
The Weather Company
Senior Data Scientist
2024 - Present (1 year)
Remote
  • Designed, developed, and deployed a temporal GNN (Graph-LSTM, contrastive learning) for creating dynamic user embeddings, enhancing downstream ML model performance, and eliminating feature engineering time.

  • Developed and deployed reusable SageMaker model explainability reporting pipelines (SHAP, PDP), standardizing validation processes and reducing model debugging time by 60%, increasing stakeholder trust.

  • Designed a modular, reusable self-supervised learning pipeline (self-distillation), improving user tagging performance in cold-start settings.

  • Built customized on-disk graph back end using custom Feature Store, GraphStore implementation in PyG to scale the GNN training on huge data (500M edges & 50M nodes).

  • Mentored data scientists through pair-programming, peer reviews, and hiring participation.

Graph Neural Networks Graph Algorithms Graph Databases Machine LearningDeep LearningPytorchAmazon SageMaker Pipelines Amazon S3Explainable Artificial Intelligence (XAI) AWS SagemakerGNN LSTMPyTorch Geometric (PyG)
Organifi
Organifi
Data Scientist
2020 - 2024 (4 years)
Remote
  • Developed and deployed an NLP pipeline to extract and summarize opinions and feedback from customer product reviews.

  • Built data pipelines in AWS and GCP for a reporting and analytics data warehouse.

  • Built numerous executive summary dashboards in Tableau by identifying key metrics to track goals specific to individual teams.

Python 3 GPT Natural Language Processing (NLP) Generative Pre-trained Transformers (GPT) Sentiment Analysis Data Analytics BigQuery Google Cloud FunctionsGoogle CloudTableauTableau Desktop Pro MySQLAmazon Web Services (AWS) AWS Lambda Amazon RDSData ScienceTechnical Hiring Code ReviewSQLMachine LearningPythonDeep LearningData pipelinesGCPData WarehouseDashboards
JPMorganChase
JPMorganChase
Analyst
2019 - 2019
Bangalore, India
  • Designed and built a next-gen merchant acquisition tool in R Shiny for a credit card business.

  • Conducted pricing analysis of credit card business.

  • Built and integrated a minimum revenue model based on customer demographics.

TableauPythonRCode ReviewSource Code Review SQLMachine LearningRStudio Shiny
Tredence
Tredence
Business Analyst
2018 - 2019 (1 year)
Bangalore, India
  • Segmented retail customers based on their shopping behavior by using Random Forest.

  • Designed, built, and deployed an end-to-end Machine Learning pipeline.

  • Conducted marketing analysis for a leading US retail company.

TableauRPythonInterviewing Source Code Review Task Analysis Data ScienceSQLMachine LearningBusiness AnalysisTensorflowRandom Forests Market Research & Analysis
Capgemini
Capgemini
Software Analyst
2017 - 2018 (1 year)
Pune, India
  • Scraped the web for collecting unstructured data present on a website.

  • Created and deployed various executive summary dashboards.

  • Automated data cleaning pipelines to save significant person-hours every week.

Showcase

Multi-modal Fully Convolutional Network for Semantic Segmentation
Multi-modal Fully Convolutional Network for Semantic Segmentation

A fully convolutional network (FCN-32s) trained to semantically segment forest scene images with RGB and nir_color input images. The project was developed to help unmanned drones in smooth navigation. The model is trained and tested on still images of forest scenes. Used Intel Edison and Microsoft Kinect for POC and prototype creation.

Smart Medical Network
Smart Medical Network

Worked on a smart medical network for Intel ESDC 2016, Shanghai. The project aimed to create an ecosystem of a medical network that stores patients' clinical and real-time data for smoother and quicker diagnosis in an emergency.

Education

B.Tech Electrical Engineering
B.Tech Electrical Engineering
Indian Institute of Technology Gandhinagar
2013 - 2017 (4 years)