I am a production-grade AI/ML Engineer and Data Engineer who builds scalable, cost-efficient systems. Most recently, I architected a multi-tenant AWS MLOps and Data platform that reduced model training costs by 98%.
I specialize in architecting scalable AWS-native solutions at the intersection of Data Engineering (ETL/IaC) and Agentic AI (LangGraph). I focus on Cloud Cost Optimization; by implementing local-first verification frameworks, I ensure your production pipelines are 100% verified before deployment, eliminating unnecessary AWS dev-cycle spend and redirecting your budget toward high-impact production growth.
I can help you:
- End-to-End Data Engineering: Build robust Raw > Validated > Curated pipelines using AWS Glue, Lambda, and S3 to ensure your business data is structured, versioned, and query-ready.
- Architect Production MLOps: Set up automated training and serverless deployment pipelines using Terraform, AWS SageMaker, and Docker.
- Automate Data Quality: Implement "quarantine" patterns and YAML-based validation engines to ensure 100% data integrity before it hits your analytics or models.
- Build High-Performance ETL: Use Polars and AWS Glue to process multi-tenant data with massive speed improvements over traditional frameworks.
- Develop Agentic AI Workflows: Build autonomous AI agents with memory using LangGraph for complex, multi-step business logic (e.g., regulatory compliance or automated support).
Client-Focused Skill Summary/Skills Description
- Data Engineering & Infrastructure: Terraform (IaC), AWS (Glue, S3, Lambda, ECR, Athena), Polars (Expert), SQL, ETL Architecture, Schema Versioning, Data Warehousing (S3 Lakehouse).
- MLOps & Production: AWS SageMaker, Docker, Local-First Development, CI/CD for ML, Anomaly Detection (Residual-based), SHAP (Explainability).
- AI & Generative AI: Agentic AI (LangGraph), LangChain, Hybrid RAG (BM25 + Semantic), Vector Databases (Qdrant, ChromaDB), Prompt Engineering, RAGAS (Evaluation).
- Machine Learning: PyTorch, TensorFlow, Scikit-learn, XGBoost, LightGBM, Computer Vision, NLP/NLU.
- Development & Visualization: Python (Expert), JavaScript (ES6+), React, Tailwind CSS, Cytoscape.js (Network Mapping), Git.
Relevant Skills
- Data Engineering & ETL
- Infrastructure as Code (Terraform)
- MLOps Architecture (AWS)
- Agentic AI (LangGraph)
- High-Performance Data Processing (Polars)
- Retrieval-Augmented Generation (RAG)
- Python (Expert)
- Machine Learning (XGBoost, LightGBM)
- Automated Data Validation
- Strategic Technical Consulting
Highest Educational Attainment
- Data Engineering Zoomcamp (DataTalks.Club)
- AI/LLM Engineering for Software Developers - PSI AI Academy (Specialized in LangGraph & Agentic Patterns)
- Data Science Fellowship - Eskwelabs (Top-tier PH Data Science Bootcamp)
- Bachelor of Science in Architecture - University of Mindanao
Website & Portfolio
GitHub:
HuggingFace:
LinkedIn:
Experience: 6 months - 1 year
Experience: 1 - 2 years
Experience: 6 months - 1 year
Experience: Less than 6 months
Experience: Less than 6 months
Experience: 5 - 10 years
Experience: 6 months - 1 year
“For years, I maxed out my hours, got burnt out, and the quality of my work would start to go down. I decided to take the leap, hire correctly, and now it frees up my time to focus on growing the business.”
Tyler Gies
SEE MORE REAL RESULTSOnlinejobs.ph "ID Proof" indicates if "they are who they say they are".
It DOES NOT indicate skill level.
ID Proof scores are 0 - 99 with 99 being the best. It is calculated based on dozens of data points.
It's intended to help employers know who they're talking to is real, and not a fake identity.