Jaime L.

About Me

Jaime is a versatile data scientist and developer with 4+ years of experience working in all stages of the data science pipeline for different organizations. He is highly skilled in R and Python and has direct experience with Machine Learning, Deep Learning, NLP, and AWS. Achievements include developing several interactive RShiny dashboards and training speech synthesis models using automatic speech transcription. He is also a regular contributor to data science projects on GitHub.

Work history

UpStack
Data Scientist
2020 - Present (5 years)
Remote
  • Create and implement data analysis pipelines, including data access, ingestion, munging/manipulation/cleansing, analysis/modelling, testing, and deployment/integration into business applications and services.

  • Enhance operational aspects of businesses by increasing control of the company's data.

  • Work in cross-functional teams to provide data-driven solutions for increased efficiency and productivity.

Linksbridge
R Developer
2020 - 2020
Remote
  • Worked on the development of an RShiny application for vaccination campaigns in which users can create new entries and delete old ones.

  • Developed SQL queries that permit direct interaction between applications and the database.

  • Used Git for version control and wrote documentation for the projects.

Strong Analytics
Data Engineer
2019 - 2020 (1 year)
Remote
  • Created two R libraries for a company that designs, engineers, and deploys custom end-to-end machine learning and AI products and solutions.

  • Worked on the development of a dashboard in RShiny to interface with a data analytics engine.

  • Ensured that developed R packages worked as intended by performing unit testing.

Teranalytics
Data Scientist
2017 - 2020 (3 years)
Mexico
  • Worked with structured data on the development of a data science pipeline that included data cleaning, feature engineering, and model building/deployment.

  • Created RShiny dashboards and managed multiple Amazon Web Services (AWS) services, including S3, EC2, Lambda, API Gateway, Cognito, and RDS.

  • Developed an application that collected status updates and comments from Twitter and Facebook in order to perform sentiment and text analysis.

Showcase

Synthesizing David Attenborough Speech with Tacotron2 and Waveglow
  • A model was trained to mimic the voice of David Attenborough.

  • The model utilizes a dataset of thousands of audio clips paired with transcriptions.

  • The project leverages Tacotron2 and Waveglow for speech synthesis, with audio sourced from the audiobook 'Life on Earth' and transcriptions generated with Amazon Transcribe.

Text prediction app
  • Developed a text autocomplete application using a Markov Chain Model.

  • Implemented word prediction as an assistive technology.

  • Utilized R, R Shiny, and the Markov Chain Model for the project.

Movie recommendation app
  • Developed a Django movie recommendation application.

  • Uses a Content-Based Recommender System trained on movie descriptions.

  • The dataset for analysis is The Movies Dataset, containing 45,000 movies released before July 2017.

Education

Master's degree in Petroleum Engineering
Texas A&M University
2015 - 2016 (1 year)
Big Data Modeling and Management Systems; Introduction to Big Data; Deep Learning Specialization; Data Science
Certifications