Rajeev K.

About Me

Rajeev is passionate about data and machine learning and has more than five years of experience in data science projects across numerous industries and applications. He's currently focused on cutting-edge technologies such as TensorFlow, Keras, deep learning, and most of the Python data science stack. Rajeev has used these skills to solve many real business problems in NLP, image processing, and time series domains.

AI, ML & LLM

Machine Learning Deep Learning Deep Neural Networks

Backend

Workflow

Other

Statistical Learning Analytics Data Analysis Data Analytics Artificial Intelligence Data Engineering Natural Language Processing (NLP) Python Jupyter Python 3

Work history

Availyst LLC
Data Developer
2021 - Present (4 years)
Remote
  • Worked with a US-based food aggregator startup on data engineering and scraping, using the Python data science stack, Jupyter Notebook, and AWS services.

  • Handled the recommendation engine for the user, a food and restaurant recommendation.

  • Developed the scraping application using Python and deployed it using AWS services.

Forbes Media - Q.ai
Data Scientist – Fintech Project
2021 - 2022 (1 year)
Remote
  • Managed the business intelligence team, acting as a senior data scientist for the client.

  • Worked as a quant researcher, using advanced forms of quantitative techniques and artificial intelligence to generate investment recommendations across multiple asset classes, including stocks, ETFs, options, and cryptocurrencies.

  • Created a dashboard for the growth and marketing and leadership teams using Dash, Plotly, and Tableau.

JSS Information Technology Business Incubator
Independent Consultant — Data Scientist
2017 - Present (8 years)
Remote
  • Associated with JSS Information Technology Business Incubator as a data science mentor.

  • Helped small companies and startups take advantage of their data.

  • Created predictive models using machine learning.

  • Worked with natural language processing with neural networks.

  • Developed classification and regression algorithms.

  • Implemented time-series forecasting.

  • Developed image detection with deep learning.

Google Cloud Platform (GCP) GitJupyter NotebookKerasTensorflowPython
Newristics
Independent Consultant – Data Scientist
2017 - 2018 (1 year)
Remote
  • Developed a Python app which uses natural language processing with deep neural networks sequence to sequence learning to automate business process.

  • Reduced the cost of business operations.

Google Cloud Platform (GCP) GitJupyter NotebookKerasTensorflowNatural Language Toolkit (NLTK) spacyGloVeGensimLSTMPython
Sopra Steria Singapore
Data Scientist
2016 - 2017 (1 year)
Remote

Worked with the Land Transport Authority, Singapore to implement the vision to convert the city into a digital and intelligent one to improve the efficiency of services for the citizens, using machine learning, predictive modeling, and data mining.

Steria India
Data Scientist
2014 - 2015 (1 year)
Remote
  • Built a recommendation system for an eCommerce site; it recommended the best possible items to buy based on customer history and collaborative filtering.

  • Helped with customer churn prediction by developing a classification algorithm for a retail bank to identify customers likely to churn balances in the next quarter by at least 50% vis-a-vis current quarter.

  • Created a classification algorithm for a retail bank to improve sales from existing customers by cross-selling one of its product, the personal loan (customer cross-sales).

Steria India — Barclays Bank
Technical Program Manager
1997 - 2014 (17 years)
Remote
  • Set up business benefits of around £43 million over five years in customer retention, cost savings, and new business opportunities at an estimated cost of around £12 million.

  • Acted as a vital member of the steering committee that identified user needs and developed customized solutions for around 250,000 Barclaycard acquiring merchants.

  • Led a project team of 147 members including solution architects, designers, developers, and testers spread across multi-geographical locations through the entire project development life cycle.

  • Consistently stayed within around 5% of resource and budget forecast monthly.

  • Recognized as problem solver within a team of 22 project managers in the portfolio of annual spend over £70 million.

OracleContent Management Ab Initio WebSphereXMLJavaCobolJCLVirtual Storage Access Method (VSAM) IBM DB2CICS
Premier Global Management Consultancy
Senior Data Scientist and Data Analyst
Present (2025 years)
Remote
  • Worked as a data scientist and senior analyst with the client and its team.

  • Worked on demand space segmentation for a large US fashion retailer.

  • Mapped 6 million customer data to the demand space segment.

Python 3 Amazon Elastic MapReduce (EMR) Pyspark
A Telecommunications and Media Company in the US
Data Scientist
Present (2025 years)
Remote
  • Worked with a telecommunications and media company in the US on identifying fake news.

  • Developed two models to identify sarcasm and quantification fallacies in articles.

IBM
Independent Consultant – Data Scientist
Present (2025 years)
Remote
  • Worked for IBM US to optimize its US facility leases to run its operation.

  • Developed a Python model to improve facility utilization, reduce facility operations cost and reduce lease cost along with number of business constraints.

Linear Programming PlotlyPython
AbbVie, Inc.
Independent Consultant – Data Scientist
Present (2025 years)
Remote
  • Worked closely with the C-level executive and product management team to analyze the survey and produced data/reports.

  • Helped the product team and executive team to make more informed decisions—increasing market share through the identification of new opportunity, target segments and devising ingenious new ways of resolving constraints.

Association Rule Learning Cluster RegressionMatplotlibPlotlyRPython

Showcase

IBM
  • IBM US leases facilities across the US to improve facility utilization and reduce lease costs.

  • The project utilized Python integer programming and Package Pulp to solve the optimization problem.

  • The algorithm included a flexible optimization period, generating multiple solutions to address business constraints.

Newristics
  • Newristics is a US-based global leader applying decision-heuristic science to marketing.

  • It automates message scoring by comparing new messages against old ones and analyzing their adherence to heuristic psychology.

  • The project utilizes XGBoost and deep neural network seq-to-seq learning models, incorporating NLP features, word embeddings, graph analysis, and TF-IDF similarity.

AbbVie, Inc.
  • AbbVie, a pharmaceutical company, experienced a decline in market share from 65% to 49%.

  • A physician survey was conducted with 119 physicians focusing on HCV regiment attributes, patient treatment, and sales rep interactions.

  • Data analysis and reporting were produced by the product team and executive team to inform strategic decisions and increase market share.

Classify H&E Stained Histological Breast Cancer Images
  • Participated in a hackathon focused on classifying H&E-stained histological breast cancer images.

  • Utilized data augmentation and deep convolutional feature extraction with pre-trained CNNs on ImageNet to increase robustness.

  • Implemented a highly accurate gradient boosting algorithm to prevent suboptimal generalization with a limited training dataset.

Demand Forecast at an SKU-level for a Brewery Company
  • The company needs to predict demand for 34 SKUs across 60 agencies, with a required estimate of 34 unique SKU combinations.

  • The data includes historical sales data (hectoliters), weather data, industry sales data, event calendar, and demographic information.

  • A deep neural network sequence to sequence learning model is being used for demand prediction at SKU level.

Satellite Imagery Feature Detection Using Deep Learning
  • Developed a model for satellite imagery feature detection using deep learning.

  • The model processes 1km x 1km satellite images in both 3-band and 16-band formats.

  • The imagery covers the multispectral (400-1040NM) and short-wave infrared (SWIR) ranges.

Education

Education
Master's Degree in Computer Science
Jawaharlal Nehru University
1991 - 1994 (3 years)
Education
Bachelor's Degree in Mathematics
Delhi University
1987 - 1990 (3 years)