Yoe H.

Yoe H.

Medellín, Colombia
Hire Yoe H. Hire Yoe H. Hire Yoe H.

About Me

Yoe is a Python Developer and Data Scientist with extensive experience working on Machine Learning/Python projects and transferring real-world problems into requirements and solution planning. With a solid background and practical knowledge in ML/AI, research, mathematics, and statistical analysis, he delivers solutions and helps businesses to achieve more.

AI, ML & LLM

Backend

Database

Other

Work history

UpStack
UpStack
Python Developer | Data Scientist
2022 - Present (3 years)
Remote
  • Delivering data warehouse and ETL solutions as part of an Agile team using advanced ML techniques to improve performance and processes.

  • Helping build and improve infrastructure, application, and performance development and ensuring tight security including data encryption, security groups, and environment scanning.

  • Ensuring high-quality deliverables and implementing DevOps and security best practices in fast-paced environments.

  • Building pipelines with transformation/aggregation phases using PySpark and working with Databricks for collaborative analytics.

Turing
Turing
Engineering Manager
2024 - Present (1 year)
Remote
  • Supervising and managing a team of 30+ business analysts, including 5+ team leads, ensuring output targets are met and guidelines followed.

  • Identifying training needs, conducting sessions, and ensuring high-quality training datasets.

  • Performing evaluations and implementing improvement plans.

  • Conducting regular QA checks, identifying process gaps, and improving workflows to enhance quality.

  • Overseeing daily operations, ensuring timely, within-budget project delivery.

  • Managing resources and monitoring performance metrics.

Engineering Management Training Evaluation Key Performance Metrics Resource Management Python 3 BigQuery AWSFlaskMongoDBMySQLSQLMachine Learning Algorithms
Mercor
Mercor
Mathematics Expert
2024 - 2024
Remote

Wrote solutions to advanced math problems to be fed into an LLM model.

Large Language Models (LLMs) Complex Problem Solving Mathematics
Turing
Turing
AI Trainer
2024 - 2024
Remote
  • Evaluated model responses.

  • Worked on prompt engineering.

  • Built applications using OpenAI’s GPT models for chat and RAG in Flask.

Python 3 Data ScienceMachine LearningTraining Prompt Engineering OpenAI GPT-3 API OpenAIGPT FlaskRetrieval-augmented Generation (RAG)
Darwin AI
Darwin AI
Senior Software Engineer
2023 - 2024 (1 year)
Remote
  • Created API for collecting and labelling visual assets from ad platforms such as Google Ads, Meta, TikTok.

  • Worked on CI/CD processes using GitHub Actions and Bitbucket Pipelines to automate testing/deployment for APIs and data pipelines.

  • Containerized apps with Docker Compose for local testing and deployment to AWS EC2.

  • Integrated PyTest into CI workflows for Python services, ensuring code reliability.

  • Deployed serverless apps via Lambda/API Gateway.

  • Ensured robustness by writing unit/integration tests (PyTest) and monitoring performance with CloudWatch metrics.

PythonAmazon S3 (AWS S3) AWS Lambda AWS CloudWatchMongoDBAPI Applications Google Ads APIGraph API Creative Problem Solving Query Optimization BitbucketGitHub Actions CI/CD Pipelines AWS EC2Docker ComposePyTestAmazon API Gateway Integration TestingUnit Testing
Meta4Capital
Meta4Capital
Data Scientist/Analyst
2022 - 2022
Remote
  • Worked for an NFT startup company and created an algorithmic trading strategy Flask web app.

  • Developed a credit risk model for NFTs using Python and Flask among other technologies and libraries.

  • Gathered data from primary or secondary data sources and maintained databases/data systems.

Technology Institute of Antioquia
Technology Institute of Antioquia
Python Developer | Research Professor
2021 - 2022 (1 year)
Medellin, Colombia
  • Worked on some pragmatic prevention guidelines regarding SARS-CoV-2 and COVID-19 in Latin-America inspired by Mixed Machine Learning Techniques and Artificial Mathematical Intelligence.

  • Used ML tools and Python to set up a sentiment analysis classifier of tweets with the TensorFlow module.

  • Used ML tools and Python to set up a Long-Short-Term Memory Neural Network with the TensorFlow module to forecast the Colombian coffee price.

Universidad Autónoma de Bucaramanga
Universidad Autónoma de Bucaramanga
Researcher
2015 - 2022 (7 years)
Bucaramanga, Colombia
  • Published numerous research papers including statistical mechanics in the portfolio optimization with Kusuoka’s representation and conceptual computation in artificial mathematical intelligence as a paradigm-shifting technique in physics and mathematics.

  • Reviewed methods and teaching materials and gave recommendations for improvement.

  • Worked on research, fieldwork, investigations, and writing up reports.

University of Oklahoma
University of Oklahoma
Graduate Research & Teaching Assistant
2008 - 2013 (5 years)
Oklahoma, United States of America
  • Conducted research, prepared new materials, and read scientific papers, deriving new data-driven algorithms and creating computational models.

  • Contributed to the development of research documentation for publications, presentations, and applications.

  • Worked on creating new concepts, techniques, and standards.

  • Fine-tuned models on code generation, logical reasoning, and domain-specific Q&A.

Showcase

Engineering Manager - SFT Advanced Reasoning
Engineering Manager - SFT Advanced Reasoning
  • Led a 40-person team on the SFT Advanced Reasoning project, designing architectures for handling complex training datasets and fine-tuning Language Models.

  • Created a Flask-based app using OpenAI APIs to test various model prompts, and monitored trainer performance using ETL processes with BigQuery.

  • Worked with tools like AWS Lambda, API Gateway, and EC2 in previous roles to automate data collection/labeling pipelines, and studied Docker configurations for containerized services.

Colombian Coffee Price Forecast via LSTM Neural Networks
Colombian Coffee Price Forecast via LSTM Neural Networks
  • Utilized Machine Learning tools and Python to establish a Long-Short-Term Memory Neural Network using TensorFlow for Colombian coffee price prediction

  • Performed LSTM time-series forecasting and classification tasks with TensorFlow and Keras

  • Experimented with MLFlow for model tracking in a BERT-based Q&A system, logging metrics and hyperparameters during A/B testing for different fine-tuning strategies

Data Scientist/Data Analyst - Meta4.Capital
Data Scientist/Data Analyst - Meta4.Capital
  • Meta4 Capital is a crypto-focused fund with investment in unique NFT projects, emphasizing on collectibles, art, gaming, and virtual land.

  • Worked on NFT trading methods and NFT credit risk modeling, implemented a web app for lending NFTs credit score using clustering algorithms in Flask.

  • Technologies used included Python, MongoDB, Pandas, NumPy, Sklearn, Data Analysis, NFT, and Flask.

Semantic and Morpho-Syntactic Prevention’s Guidelines for COVID-19 Based on Cognitively Inspired Artificial Intelligence and Data Mining
Semantic and Morpho-Syntactic Prevention’s Guidelines for COVID-19 Based on Cognitively Inspired Artificial Intelligence and Data Mining
  • Implemented sentiment analysis classifier of tweets using Machine Learning and Python via the TensorFlow module

  • Applied the model in various regions including Europe, North America, and South America

  • Project aims to provide Semantic and Morpho-Syntactic Prevention’s Guidelines for COVID-19 Based on AI and Data Mining

Some Pragmatic Prevention’s Guidelines Regarding SARS-CoV-2 and COVID-19 in Latin-America
Some Pragmatic Prevention’s Guidelines Regarding SARS-CoV-2 and COVID-19 in Latin-America
  • Inspired by Mixed Machine Learning Techniques and Artificial Mathematical Intelligence

  • Conducted a case study on Colombia

  • Used Machine Learning tools and Python to set up a sentiment analysis classifier of tweets with the TensorFlow module

API for Visual Assets Collection
API for Visual Assets Collection
  • Contributed to the development and upkeep of APIs

  • APIs designed to gather visual assets from various ad platforms

  • Incorporated platforms like Google Ads, Meta, and TikTok in the API

API for Labelling Visual Assets
API for Labelling Visual Assets
  • Contributed to the development and support of APIs

  • The APIs were designed for labeling visual assets

  • These APIs targeted ad platforms like Google Ads, Meta, and TikTok

Education

Education
MA Mathematics
University of Oklahoma
2008 - 2012 (4 years)
Education
PhD Mathematics
University of Oklahoma
2008 - 2013 (5 years)
Education
MSc Mathematics
Universidad Nacional de Colombia
2005 - 2007 (2 years)
Education
Bachelor’s Degree, Mathematics
Universidad Nacional de Colombia
1996 - 2004 (8 years)