Work history

UpStack
Senior Data Scientist/Machine Learning Engineer
2022 - Present (3 years)
Remote
  • Delivering data warehouse and ETL solutions as part of an Agile team using advanced ML techniques to improve performance and processes.

  • Helping build and improve infrastructure, applications, and performance while ensuring tight security through data encryption, security groups, and environment scanning.

  • Ensuring high-quality deliverables and implementing DevOps and security best practices in fast-paced environments.

PD4 Solutions LLC
AI Expert
2023 - 2023
Remote
  • Developed an LLM-based solution to determine which scientific articles are related to user-inputted free-text criteria.

  • Evaluated the LLM solution performance and demonstrated metrics proving considerable improvement over the previously implemented solution.

  • Worked with ML engineers to deploy solutions and define an optimal architecture for applying the LLM solution.

Machine Learning, Python, Natural Language Processing (NLP), Language Models, Text Classification, Unsupervised Learning, LangChain, Amazon Web Services (AWS), Git, GPT, Text Generation, Large Language Models (LLMs)
BJS
Data Science Engineer
2022 - 2022
Remote
  • Developed prototype product recommenders showing customer purchasing patterns.

  • Built simple AWS Lambda functions to conduct an ETL workflow.

  • Worked with PySpark on large sets of data (>100GB of historical purchases).

Python, Machine Learning, Spark ML, PySpark, Amazon Web Services (AWS), Git, Data Science, ETL, AWS Lambda
Online Freelance Agency
Research Advisor
2022 - Present (3 years)
Remote
  • Investigating, researching, and documenting caching methods in software.

  • Reproducing the most popular caching methods for predicting time-to-live from research papers.

  • Building a simulator and reinforcement learning model trying to solve TTL prediction for object caching.

Machine Learning, Supervised Machine Learning, Reinforcement Learning, Deep Reinforcement Learning, Data Science, NumPy
Lionbridge (via Newfire Global Partners)
Senior Data Scientist
2021 - 2023 (2 years)
Remote
  • Developed a machine learning sequence-labeling model on text data that achieved an F1 score above 0.9.

  • Decreased inference time on a previously developed ML model without sacrificing its F1 score.

  • Used PySpark and Databricks to perform a large-scale data analysis that the company employed to drive future business decisions.

  • Developed multiple highly scalable Python web services that are currently serving production traffic.

Python, Agile, Scrum, Web Services, JSON, PyTorch, spaCy, Natural Language Toolkit (NLTK), PySpark, Jupyter, Databricks, Open Neural Network Exchange (ONNX), Neural Networks, LSTM, Pandas, Data Science, NumPy, Git, Natural Language Processing (NLP), Data Analysis, Azure Databricks, Machine Learning
Alchemy V Ltd
Machine Learning Engineer
2020 - 2021 (1 year)
Remote
  • Created a marketing slogan text generator using Hugging Face Transformers/text generation pipelines and customer-provided data.

  • Created a data ingestion and reporting process via multiple Google Cloud services: BigQuery, Cloud Functions, Cloud Endpoints, and Dataproc.

  • Ported existing R reporting code to a Python web service.

Google Cloud, Google Cloud API, Google BigQuery, R, Python, Text Generation, SQL, Git, Machine Learning, Hugging Face Transformers, Google Cloud Functions, Google Dataproc
Granville Knowledge Management
NLP Consultant
2020 - 2021 (1 year)
Remote
  • Developed a scraper to download a large (around 20,000 documents) and diverse collection of legal documents, dating from 1990 to the present, from a European public repository.

  • Used ML to build a text classification model to automatically classify categories based on document content.

  • Created a dataset of legal documents and used it to train and evaluate the ML text classification model.

  • Shared results via Google Colab so that customers could interactively test the model's performance on their own held-out data.

Python, Scrapy, Web Scraping, PyTorch, Jupyter, Google Colaboratory (Colab), Text Classification, Natural Language Processing (NLP), Machine Learning, Google Colab
TakeLab FER
Research Associate
2018 - 2019 (1 year)
Zagreb, Croatia
  • Developed a search engine for Croatian legal documents.

  • Built a named entity recognition model in PyTorch by combining LSTM with a CRF.

  • Mentored several interns and wrote my master's thesis on natural language processing.

PyTorch, Apache Solr, Django, Python, Torch, Pandas, Data Science, Git, Natural Language Processing (NLP)
Amazon
Software Development Engineer
2014 - 2017 (3 years)
Remote
  • Contributed to developing a scalable time-series database solution in Java and C++, which served around 1 million requests/second.

  • Served as the team scrum master and product owner.

  • Designed and implemented a network correlation engine microservice to handle networking events from the entire Amazon network (patent award https://patents.justia.com/inventor/filip-boltuzic).

Amazon Web Services (AWS), C++, Python, Java, Algorithms, Programming, Agile, Git, Web Services, Microservices
Zagrebačka banka
Business Intelligence Analyst
2012 - 2014 (2 years)
Zagreb, Croatia
  • Developed SQL reports on data warehouse data to identify promising retail strategies.

  • Built an interactive tool in Java to speed up the processes in Oracle Data Integrator.

  • Developed small web applications for the accounting department using PL/SQL and Oracle Apex.

Java, SQL, Business Intelligence (BI), Data Warehouse, Oracle, Oracle Data Integrator (ODI), Oracle PL/SQL

Showcase

Retail Sale Forecasting
  • The project focused on predicting sale amounts based on historical data, previous sales, and regions.

  • Time-series forecasting was performed at both regional and global levels.

  • Experimentation included ARIMA and SARIMA models.

Lead Developer - Search Engine for Croatian Legal Documents
  • Developed a Django and Apache Solr web application with a microservices architecture.

  • The application stores and indexes documents within Solr.

  • The Django front-end handles user requests and communicates with Solr.

Education

BSc/MSc/PhD Computer Science
University of Zagreb - Croatia
2007 - 2020 (13 years)