gabrieldaes.com / Overview
8
Certifications
8
6+
Projects built
6
2
Live deployments
2
PT
Based in Portugal
PT
pipeline.py — gabriel_alves
01
Ingest
REST APIs
CSV / Batch
Paginated I/O
02
Transform
PySpark
Pandas · NumPy
SQL
03
Orchestrate
Apache Airflow
GitHub Actions
Logging
04
ML-Ready
Feature Eng.
Model Eval.
Prediction Svc
05
Deploy
Render
AWS Cognito
WAF
ML Pipeline · Churn
Customer Churn Pipeline
▶ Demo
ML-integrated ETL for churn prediction. Deployed on Render with AWS Cognito auth and WAF security layer.
Python AWS ML Render
NLP · Sentiment
NLP Sentiment Pipeline
Live
Text processing pipeline over product reviews driving a Streamlit dashboard with real-time filtering by category and time.
NLP Streamlit Neural nets
Analytics · SQL
Workforce SQL Analysis
Complete
Diagnostic analysis of employee data covering compensation equity, diversity metrics, and workforce stability insights.
SQL DataCamp Analytics
ML Pipeline · Churn
Customer Churn Pipeline
▶ Demo
End-to-end machine learning pipeline for customer churn prediction, covering data ingestion, preprocessing, feature engineering, model training, and deployment. Built reusable ETL components and integrated model evaluation with profit-based threshold optimization. Deployed on Render with AWS S3, Cognito, and WAF.
Python ETL AWS ML Render
NLP · Sentiment
NLP Sentiment Analysis Pipeline
Live
Text processing pipeline to clean, transform, and analyze large-scale product review data. Neural network for sentiment classification integrated into a Streamlit dashboard with dynamic filtering by product, category, and time. Production-ready workflow combining scalable text processing with ML-driven business insights.
NLP Streamlit Neural nets Python
Data Engineering & ETL
7 skills
ETL Pipeline Design Data Transformation Data Cleaning Batch Processing Schema Design Data Modeling Hive-style Partitioning
Programming & Processing
6 skills
Python SQL PySpark Pandas NumPy pytest
Orchestration & Monitoring
5 skills
Apache Airflow DAG Design Structured Logging Pipeline Debugging GitHub Actions (CI/CD)
Cloud & Deployment
5 skills
AWS S3 AWS Cognito AWS WAF Render Git
ML Integration & Analysis
7 skills
Supervised Learning Feature Engineering Model Evaluation ML-Ready Pipelines EDA Matplotlib Seaborn
Critical Thinking & Problem Solving
Analytical Mindset & Data-Driven Thinking
Attention to Detail
Curiosity & Learning Agility
Ability to Present Results & Insights
Persistence & Self-Discipline
01
Data Engineer Professional
DataCamp
View PDF
02
AI Engineer for Data Scientists Associate
DataCamp
View PDF
03
Data Scientist Professional
DataCamp
View PDF
04
Data Analyst
DataCamp
View PDF
05
Python Data Associate
DataCamp
View PDF
06
SQL Associate
DataCamp
View PDF
07
AI Fundamentals
DataCamp
View PDF
08
Data Literacy
DataCamp
View PDF
Building reliable, scalable data systems that bridge Engineering and Machine Learning.

I am a Data Engineer with a background in Informatics Engineering, focused on building scalable data pipelines and production-ready data systems.

I have hands-on experience designing ETL workflows, transforming large datasets, and preparing data for machine learning applications. I have built end-to-end pipelines covering data ingestion, transformation, modeling, and deployment.

Recently, I completed the Data Engineer Professional Certification, working with tools such as Airflow and logging systems, strengthening my understanding of workflow orchestration and pipeline monitoring.

Currently seeking a junior Data Engineer role to contribute to data infrastructure and grow in distributed systems and orchestration.

Education
Bachelor's in Informatics Engineering
ESTG-IPVC · Instituto Politécnico de Viana do Castelo
Location
Portugal
Open to remote · Hybrid (north of Portugal)
Contact
Response within 24h
Looking for
Junior Data Engineer
Data infrastructure · Distributed systems · Orchestration