Jakob S.

About Me

Jakob is an Engineer turned Data Scientist and Machine Learning Engineer, with proficiency in Python and its frameworks. He has the ability to transform a sea of data into actionable insights that can have an impact on businesses and projects- from building predictive models for various types of cancer detection, to setting-up consumer facing models in the transportation and entertainment sectors, as well as the financial industry. In his role as an Educator in the Data Science and Data Analytics programs, he has shown great communication and mentorship skills, having encouragingly guided his students to achieve high-level technical skills and great professional performance in the industry.

AI, ML & LLM

Frontend

Backend

Database

DevOps

Other

Work history

UpStack
UpStack
Data Scientist/Machine Learning Engineer
2022 - Present (3 years)
Remote
  • Identifies relevant data sources and sets to mine for client business needs.

  • Uses algorithms and models to mine big data, perform data and error analysis to improve models; analyzes and interprets data for trends and patterns.

  • Collaborates with engineering and product development teams to propose solutions and strategies.

CoachHub
CoachHub
Data Scientist
2022 - Present (3 years)
Spain
  • Acted as a part of the Data & Insights team at this 300M+ funded hyper-growth scale-up, building out the AI system on the platform. Created a satisfaction model from scratch to identify unsatisfied users on the platform. The model was deployed to AWS and both Looker and Streamlit dashboards were used to communicate the output to various stakeholders.

  • Developed a Grafana dashboard with RedShift data source was used to visualize the AB tests on the productionalized model.

  • Created a model that significantly improved the company's capacity to enhance user experience while saving costs. Using Hugging Face, I built out an NLP model in record time to identify the Language, Sentiment and Topic of user feedback despite limited amount of data.

EvaluateML
EvaluateML
Founder/ Machine Learning Engineer
2021 - Present (4 years)
Madrid, Spain
  • Built, using Django and TensorFlow, a machine-learning grading system which automatically evaluates open-ended questions in schools.

  • Acquired data, through qualitative and quantitative analysis, as well as unbiased user interviews, to guide the engineering process.

  • Built scalable full stack Deep Learning system applied to qualitative data (text); lead the experimentation and development of scalable NLP models to deliver production-ready solutions.

Freelance
Freelance
Data Science & Machine Learning Consultant
2019 - Present (6 years)
Remote
  • Developed machine learning models for cancer detection, based on a novel cancer detection approach to improve model performance; the models showed improved sensitivity by 34%, 21% and 15% for breast, colorectal and pancreas cancers respectively.

  • Collaborated with engineers, designers and developers to build Web platforms for a range of a clients.

Thinkful
Thinkful
Data Scientist & Educator
2019 - 2021 (2 years)
Remote
  • Mentored junior engineers as an Educator in the Data Science and Data Analytics programs; guided his trainees through the industry's best practices by translating concepts and terminology from data science, machine learning and data analytics into easy-to-digest curriculum.

  • Strategically planned and delivered assigned projects and the curriculum learning progressions.

  • Effectively trained students which have managed to take on Director positions at start-ups, join top 3 world ranked Universities and work for the financial institutions of Wall Street.

Alto Analytics
Alto Analytics
Data Analyst/Scientist
2018 - 2019 (1 year)
Madrid, Spain
  • Collected and analyzed 2.5 million social media posts to identify key public opinion trends and patterns, while providing valuable analytical insight to help stakeholder’s decisions.

  • Conducted behavioral segmentation with quantitative and qualitative data using clustering techniques, by applying Unsupervised Machine Learning techniques; automated the data mining processes with Python for faster and more accurate results.

  • The present findings were published in several national Spanish newspapers.

Revoolt
Revoolt
Software Engineer - Machine Learning
2017 - 2018 (1 year)
Madrid, Spain
  • Built a machine-learning based solution to streamline the company's consumer payments.

  • Built custom Deep Learning OCR models which read the information stored on physical receipts; achieved 96% in character recognition accuracy.

  • Worked in an interdisciplinary team with strong engineering practices, which continuously tested the analytical models for error proof results.

Showcase

Thrive's CancerSEEK- Improving the machine learning model for a cancer detection startup
Thrive's CancerSEEK- Improving the machine learning model for a cancer detection startup
  • Jakob developed a machine learning model for early cancer detection.

  • The model was trained on a publicly available dataset and compared to research from Johns Hopkins University.

  • The goal was to sell the machine learning implementation to Thrive Earlier Detection.

Personal project- Predicting the Fare on a Billion Taxi Trips with BigQuery
Personal project- Predicting the Fare on a Billion Taxi Trips with BigQuery
  • Developed a predictive model for taxi fares using a large dataset of over 1.1 billion cab rides in New York City.

  • Implemented machine learning techniques to showcase the work and cost-effectiveness of the model.

  • Utilized BigQuery for data storage and analysis.

Master's Thesis - Estimating Stellar parameters to find Exoplanets using Deep Learning
Master's Thesis - Estimating Stellar parameters to find Exoplanets using Deep Learning
  • A deep learning system was developed to predict habitable planets orbiting distant stars.

  • The system utilizes Convolutional Neural Networks (CNNs) with a regression approach.

  • The objective is to evaluate the capabilities of Deep Learning algorithms on stellar parameter prediction from spectrograms.

Education

Master degree in Science, Industrial Engineering and Data Science
Master degree in Science, Industrial Engineering and Data Science
Universidad Politécnica de Madrid, Spain
2016 - 2018 (2 years)
Bachelor degree in Science, Mechanical Engineering
Bachelor degree in Science, Mechanical Engineering
Lund University, Sweden
2007 - 2010 (3 years)