Viacheslav Z.

Viacheslav Z.

Machine Learning Developer

San Francisco, CA
Hire Viacheslav Z. Hire Viacheslav Z. Hire Viacheslav Z.

About Me

Viacheslav has seven years of experience in data science and software engineering. He is passionate about the insights gained from raw data and enjoys converting them to create exceptional business value. Viacheslav's primary expertise is Python, with production experience in Java and C++. To solve data-heavy projects, he has applied advanced machine learning techniques, such as computer vision, NLP, product recommendation systems, networking data, and classical data science.

Work history

Botprise, Inc.
Lead Data Scientist
2020 - 2022 (2 years)
Remote
  • Developed the back end for the full ML cycle, ModelOps, and MLOps, on the platform. Added a wrapper on top of the AWS SageMaker.

  • Worked on dozens of automation workflows (use cases), including MLOps, analytics, DataOps, networks, ITOps, etc.

  • Created the back- and front-end elements using React for a drag-and-drop, chatbot-building application.

  • Implemented and deployed dozens of algorithms (classification, clustering, time series, NLP, and computer vision) for different use cases.

  • Led a small ML team, including planning, management, monitoring, and leadership.

Amazon Web Services (AWS) PythonFlaskMongoDBApache Kafka React Redux RESTDockerKubernetesAmazon SageMaker SciPyNumpyPandasPytorchTensorflowTransformersManagementPostgreSQLMachine Learning Operations (MLOps) React Cloud InfrastructureSQLComputer VisionDeep LearningKerasGitAgile software developmentLinuxObject DetectionConvolutional Neural Networks BashSublime TextVim Text Editor Anomaly Detection ClassificationData pipelinesAutoML TCP/IPNetworks Time Series AnalysisConcurrent ProgrammingPredictive AnalyticsJupyterOpenCVPredictive Modeling Machine Learning Automation GPT Natural Language Processing (NLP) Generative Pre-trained Transformers (GPT) PycharmData ScienceAlgorithmsPython 3 AWS CloudFormationDataOps Software DevelopmentNetworkingAWS Lambda Artificial Intelligence (AI) Programming DataDogNatural Language Toolkit (NLTK) Data EngineeringAnalyticsAPI IntegrationData Validation Data VisualizationNeural NetworksDashboards DatabasesSoftware EngineeringLinear Regression Microsoft ExcelSpreadsheets MatplotlibStatistical Modeling Unit TestingETLPlotlyAPIsJupyter NotebookAutomationWeb Frameworks
Toptal
Sr AI & Data Engineer
2019 - Present (5 years)
Remote
  • Sr. Data Engineer | Grata | Developed async ML based distributed pipeline for website pages classification, data scraping, locations extraction and normalization.

  • Sr. AI Engineer | Briefly |Owned BE, ML and DevOps parts of the project, applied LLMs to build productivity tool features (summarization, fine-tuning, RAG, recommendations, embeddings, inf. retrieval, etc.). Slack App, API integrations, data engineering.

  • Sr. AI Engineer | Wing | Technical lead of the team. Analytical engine implementation, agentic chat bot for financial advisory implementation, RAGBtesting.

Akcelita
Senior AI Developer
2019 - 2020 (1 year)
Remote
  • Developed an ingestion and processing pipeline on AWS for photos from surveillance cameras.

  • Experimented with various non-DL and DL approaches and tested them. Trained and used the Siamese network with an attention mechanism, achieving +95% accuracy.

  • Created and shared presentations with analytics to executives. Developed and maintained a wiki for the project.

The National Academy of Sciences of Ukraine
Team Lead
2018 - 2019 (1 year)
Remote
  • Led and mentored a team of students. I defined objectives and controlled the process using the Agile methodology.

  • Created a tool for crop classification and map creation.

  • Collected data manually and via web scraping using Mapillary and oversaw data labeling.

  • Implemented and tested DeblurGAN and several other classic deblurring methods.

  • Oversaw field localization (YOLO) and crop classification by fine-tuning a ResNet model.

Openwave Mobility
Senior Data Scientist
2017 - 2019 (2 years)
Remote
  • Created a multi-staged data pipeline from raw packet data (TCP/IP layer) to consumable inputs for machine learning models with multi-processing implementation in Python (CPython).

  • Trained, tuned, evaluated, and compared multiple machine learning models in Python (scikit-learn, Keras, XGBoost, CatBoost) and C++ (mlpack).

  • Oversaw the data analysis and communication with stakeholders. Created a reusable Python tool for generating rapid and externally configurable data analysis reports.

  • Implemented custom feature generation algorithms based on expert knowledge based on aggregation, derivatives, TCP/IP conversation delays, products, and fractions.

  • Implemented custom, multi-staged feature selection algorithms that were model-based.

  • Deployed and monitored the project in production in the network. If the tool detects congestion, optimization policies are applied. Customers reported up to a 20% increase in the quality of delivery for video content.

Octetis
Data Scientist
2015 - 2017 (2 years)
Remote
  • Developed, deployed, and evaluated a hybrid recommendation engine in Python for an online store.

  • Oversaw customer behavior analysis, visualization, and stakeholder communication.

  • Handled different scenarios of user engagement using a strategy pattern. Contextual recommendations were given based on popularity (general and category-based), item-to-item, and SVD. (Python, scikit-learn, SciPy).

  • Integrated recommendation engine into a Django back end.

  • Conducted multiple A/B tests with random sampling for evaluation of the system. Compared to the most popular items in the category baseline, we achieved up to a 150% boost in purchases per session and increased revenue.

  • Created an image super-resolution module for an online cloud site constructor with Keras.

  • Utilized middle-deep CNN, trained on several blur kernels, and deployed it as a service via REST.

  • Conducted surveys showing an increase of about 5% in satisfaction for users of the platform.

Samsung
Research Intern
2014 - 2015 (1 year)
Remote
  • Developed algorithms for smart keyboard functionality (word prediction and spelling correction).

  • Developed Naive Bayes for n-grams and K-nearest neighbors (KNN) for spelling corrections.

  • Created tweaks for better algorithm performance using Laplace smoothing and a custom keyboard distance for KNN.

  • Developed algorithms with C++. Integrated them with a Java to Android keyboard and published them to the App Store.

C++AndroidJavaMachine LearningArtificial Intelligence (AI) Natural Language Processing (NLP) GPT Generative Pre-trained Transformers (GPT) Android NDKGitAgile software developmentLinuxBashSublime TextVim Text Editor EclipseData ScienceAlgorithmsSoftware DevelopmentProgramming Software Engineering
Engage Point
Software Engineer Intern
2013 - 2014 (1 year)
Remote
  • Developed a Content Management Interoperability System in Jakarta EE. I used the Model-view-controller framework for the application.

  • Developed Enterprise JavaBeans for the business logic of the application.

  • Developed JavaServer Pages for the presentation level.

Briefly
AI Engineer
Present (2024 years)
Remote
  • Co-developed multiple Django-based services on the back end for data processing.

  • Utilized and adopted OpenAI models for multiple tasks, such as summarization, classification, and recommendations.

  • Performed APIs integrations of external services and tools.

  • Worked on AWS development using AWS Lambda, Amazon SQS, Amazon DynamoDB, and Amazon SES.

DjangoWeb Frameworks PostgreSQLSQLNatural Language Processing (NLP) GPT Generative Pre-trained Transformers (GPT) OpenAI Gym React Jupyter NotebookChatGPT Automation
Grata Inc.
Senior Data Engineer
Present (2024 years)
Remote
  • Developed end-to-end distributed NLP-based geocoding pipeline using Celery on Kubernetes.

  • Implemented scraping from company websites and aggregators.

  • Developed and deployed on the AWS SageMaker web page category classification model.

  • Implemented a hybrid geocoding model with query lookup, query relaxation, result validation, prioritization, and fallback mechanisms.

  • Used combinations of available geo databases and offline entity extractors like libpostal and third-party geocoding services to combine it in a single view.

  • Increased a fraction of parsed addresses, reduced incorrect addresses by 95%, and improved the overall data quality score by 20%.

PythonDockerElasticsearch CeleryKubernetesJenkinsPostgreSQLRESTGeocoding GrafanaDataOps DataDogPandasNumpyNatural Language Toolkit (NLTK) GISReact Cloud InfrastructureSQLDeep LearningAmazon Web Services (AWS) GitAgile software developmentLinuxBashSublime TextVim Text Editor ClassificationData pipelinesDjangoJupyterGPT Natural Language Processing (NLP) Generative Pre-trained Transformers (GPT) Parallel Programming PycharmData ScienceFlaskReact Redux AlgorithmsData AnalysisPython 3 SciPyDevOpsAmazon SageMaker Data ScrapingSoftware DevelopmentProgramming TransformersConcurrent ProgrammingData EngineeringAnalyticsAPI IntegrationData Validation Data VisualizationDashboards DatabasesSoftware EngineeringMatplotlibUnit TestingETLGeopandas Quality Assurance (QA) APIsJupyter NotebookWeb Frameworks
Spin (Tier Mobility)
Data Analyst
Present (2024 years)
Remote
  • Performed a time-series forecasting of the demand for e-scooters for a global e-scooter rental company (hundreds of cities) with models for auto selection and auto retraining.

  • Performed ad-hoc data analysis and built Looker dashboards.

  • Performed an intervention-effect analysis (for promotions and other events).

Data Validation Data AnalysisData Analytics PythonSQLRData ScienceData VisualizationGoogle CloudGoogle Cloud Platform (GCP) LookerBigQuery Google BigQuery PandasNumpyGitAgile software developmentLinuxCloud InfrastructureBashSublime TextVim Text Editor Data pipelinesTime Series AnalysisPredictive AnalyticsJupyterPredictive Modeling PycharmApache Airflow AlgorithmsDockerPython 3 SciPySoftware DevelopmentProgramming AnalyticsBusiness Intelligence (BI) Dashboards DatabasesSoftware EngineeringLinear Regression Spreadsheets MatplotlibStatistical Modeling ETLPlotlyJupyter NotebookInternet of Things (IoT)
Pro Football Focus, LLC
Senior MLOps Engineer
Present (2024 years)
Remote
  • Introduced and implemented MLOps techniques, tools, and approaches.

  • Rebuilt a dozen monolithic R pipelines into distributed, modular, and functional-styled Python pipelines.

  • Developed MLOps layer on top of Dagster, Seldon, Feast, and other tools.

  • Fine-tuned existing model hyperparameters both for speed and performance.

Data SciencePythonRMachine LearningReact Python 3 SeldonDagster RabbitMQPostgreSQLMachine Learning Operations (MLOps) Cloud InfrastructureSQLDeep LearningPandasNumpyAmazon Web Services (AWS) GitAgile software developmentLinuxBashSublime TextVim Text Editor ClassificationData pipelinesExplainable Artificial Intelligence (XAI) Predictive AnalyticsJupyterPredictive Modeling Parallel Programming PycharmVisual Studio Code (VS Code) Data Analytics Apache Airflow RESTAlgorithmsKubernetesDockerSciPyPrefectAWS CloudFormationDevOpsSoftware DevelopmentAWS Lambda Programming Dask Data EngineeringDatabasesSoftware EngineeringLinear Regression Sports MatplotlibStatistical Modeling Unit TestingETLPlotlyQuality Assurance (QA) Jupyter NotebookWeb Frameworks
Plutoshift, Inc.
Machine Learning Engineer
Present (2024 years)
Remote
  • Introduced MLOps tools to existing infrastructure (Seldon, Feast, and Great Expectations).

  • Migrated existing hardcoded models to introduce the MLOps infrastructure.

  • Developed back-end APIs using Django for ML-related services.

  • Implemented classification and time series forecasting models for manufacturing sensors.

Facebook
Software Engineer Intern
Present (2024 years)
Remote
  • Trained and evaluated AdaBoost models for customer churn prediction using FBLearner Flow.

  • Performed hyper-parameters tuning for optimization.

  • Data-engineered with Hive and processed data using Python.

Portfolio

Web Scraping Mapillary

This is a code sample for downloading images at specific locations using the Mapillary API. It searches for specific locations on the map, identifies the car's angle, and scrapes only side-view photos with nearly 90-degree angles of the road.

Education

Education
Master's Degree in Theoretical Physics (Quantum Field Theory)
Kyiv National University
2019 - 2021 (2 years)
Education
Master's Degree in Computer Mathematics and Algebra
Kyiv National University
2018 - 2020 (2 years)
Education
Bachelor's Degree in Computer Science and Applied Statistics
Kyiv National University
2010 - 2014 (4 years)