
Hi, I'm Victor Bastos
Data Engineer with over 4 years of experience in developing data pipelines, cloud data architectures, and process automation. Proven capability in optimizing complex workflows by up to 90%, integrating solutions using Azure Databricks, SQL, Python, PySpark, and Power BI to deliver real-time analytics.
Scroll Down
Featured Projects
Showcasing my expertise in data engineering through real-world solutions that deliver measurable results for businesses while leveraging modern data architecture patterns and technologies
View Project
Monthly Water Meter Closing System
An enterprise-scale data processing architecture handling 15M+ monthly meter readings with Delta Lake and Databricks Runtime 10.4, reducing processing latency from 168 hours to 2 hours while implementing Zero-ETL patterns that eliminated data inconsistencies and generated €13M in annual operational savings.
Azure Databricks
PySpark
SQL
Apache Airflow
Power BI
View Project
Recommendation System for Meter Replacement
ML-driven analytics system using Azure Databricks and PySpark to prioritize water meter replacements across a 10M+ device network. The solution achieved 92% prediction accuracy and generated €8.7M in annual incremental billing.
Azure Databricks
PySpark
SQL
Machine Learning
DataOps
Technical Skills
A comprehensive toolkit that enables me to design and implement robust data solutions
Programming & Query Languages
Python
SQL
JavaScript
Data Processing & ETL
Apache Spark
PySpark
Azure Databricks
ETL/ELT Processes
Data Pipeline Development
Data Automation
Apache Airflow
Cloud & Infrastructure
Azure Cloud
Data Architecture
Docker
Terraform
Cloud Migration
Jenkins
Analysis & Visualization
Power BI
Jupyter
DataOps
Advanced Analytics & AI
Machine Learning
NLP & Transformer Models
Scikit-learn
Methodologies & Practices
Agile/Scrum
Data Governance
Git
Get In Touch
Have a project in mind or want to discuss data engineering solutions? I'd love to hear from you.