Fabio A.

Fabio A.

Senior Data Engineer

Brazil
Hire Fabio A. Hire Fabio A. Hire Fabio A.

About Me

Fabio is a driven Data Engineer with over 5 years of experience in wrangling big datasets for multiple projects. He also has extensive experience in business intelligence and data analysis for gathering game-changing insights for businesses, also focusing on scalability, data infrastructure, and efficient development.

Work history

UpStack
UpStack
Senior Data Engineer
2022 - Present (2 years)
Remote
  • Build and improve databases, acquire data, ETL/ELT, big data pipelines and deploy cloud services on projects.

  • Administer infrastructure solutions to improve data models, increase data accessibility and foster data-driven solutions for clients.

  • Implement monitoring solutions to ensure data integrity - working closely with engineers, product managers and other stakeholders.

BairesDev
BairesDev
Data Engineer
2020 - Present (4 years)
Remote
  • Worked on transitioning production ELT to a new architecture using new AWS stacks and Pyspark, also working with files versions.

  • Built data producers using Python and Flask with Kubernetes to connect to data sources and stream data.

  • Analyzed data in Snowflake for custom reports that are created using DBT views scheduled by Airflow.

Albert Einstein Hospital
Albert Einstein Hospital
Data Scientist/ Data Engineer
2020 - 2021 (1 year)
Brazil
  • Worked on developing an analytics environment with Impala and Hive, also using Spark/Python for Machine Learning.

  • Supported the architecture of environment in AWS with Elastic, MongoDB, Glue, Kafka, QuickSight, R, Jupyter Notebook, Pretos, Apache Hue.

  • Delivered insights on the heath public system, using PowerBI and Jupyter Notebook. Gathered datasets from many types of sources, building pipelines, performing exploratory data analysis.

ONNE EMPRESAS
ONNE EMPRESAS
Data Scientist
2019 - 2020 (1 year)
Brazil
  • Acted as a Data Scientist for a platform that that seeks more agile deliveries, reduction of operational costs, and improvement of processes.

  • Used multiple models such as SARIMAX, Decision Tree, LSTM and GMM, for the food and restaurant segment.

  • Analyzed large amounts of information to discover trends and patterns.

Caixa Economica Federal
Caixa Economica Federal
Data Scientist
2018 - 2020 (2 years)
Brazil
  • Worked on cloud and on-premises in financial fraud, financial default turn-over, IT capability and legal documents categorization.

  • Tuned models and integrated them with computational capabilities.

  • Performed anomaly detection for financial illegal operations like money laundering using IsolationForrest and NetworkX to map the relationship between transactions with Spark and PySpark.

OneWaySolution
OneWaySolution
Data Engineer
2018 - 2019 (1 year)
Brazil
  • Built a Big Data fast-lane architecture for a client in the Events and Productions sector.

  • Worked on Data Wrangling, Data Discovery, and combining legacy data with new business data.

  • Used Python and Scala to create Machine Learning algorithms for customer profile consumption, promotion directions, and event consumption in real-time.

Comp Line Services Solutions
Comp Line Services Solutions
Big Data Engineer
2018 - 2018
Brazil
  • Acted as a Big data architect, implementing data analytics, reporting using PowerBI, and also working on Data Warehouse architecture.

  • Gathered data from SQL and imported it into Azure blob storage, using Azure data factory. Used Hive and Pig for querying and generating data reports to Power BI.

  • Created a messenger service between MSSQL 2014 on Azure to AWS RDS and MongoDB using Kafka and Broker.

Autotrac Comércio e Telecomunicações S.A.
Autotrac Comércio e Telecomunicações S.A.
Data Analyst
2015 - 2018 (3 years)
Brazil
  • Acted as a Data Analyst for the major geolocation company in Brazil.

  • Worked on a billing automation system, T-SQL tuning, data consistency, new billing rules based on traffic signals, Client Attendance Program, and also performed lectures on T-SQL tunings.

  • Handled BI support, ETL with SSIS, and billing team support.

Portfolio

Data Engineer - Self-service data system
Data Engineer - Self-service data system

Worked on the creation of a self-service data system by building our own CDC tool, Kinesis, Glue, Trino, and Cube.js. I created the algorithm to flatten unstructured data like Json, creating relationships between the nodes. Used Python and kinesis stream.

Data Engineer - Pipeline creation
Data Engineer - Pipeline creation

Created the pipeline that consumed data from vaccines and built Covid's portal. I was a data engineer and we used Spark + Kafka and AWS tools such as Glue Catalog and QuickSight

Data Engineer - On-premises cluster
Data Engineer - On-premises cluster

Worked on creating an on-premises cluster using Hortonworks tools to process and analyze financial data from a bank to detect anomalies

Education

Bachelor of Computer Sciences
Bachelor of Computer Sciences
Brasilia University, UniCEUB, DF
2004 - 2010 (6 years)
Applied Machine Learning in Python
Applied Machine Learning in Python
Coursera
Postgraduate: IT Governance
Postgraduate: IT Governance
Brasilia Catholic University, Universidade Católica de Brasilia, DF