I am a production-grade AI/ML Engineer and Data Engineer who builds scalable, cost-efficient systems. Most recently, I architected a multi-tenant AWS MLOps and Data platform that reduced model training costs by 98%.
I specialize in architecting scalable AWS-native solutions at the intersection of Data Engineering (ETL/IaC) and Agentic AI (LangGraph). I focus on Cloud Cost Optimization; by implementing local-first verification frameworks, I ensure your production pipelines are 100% verified before deployment, eliminating unnecessary AWS dev-cycle spend and redirecting your budget toward high-impact production growth.
I can help you:
- End-to-End Data Engineering: Build robust Raw > Validated > Curated pipelines using AWS Glue, Lambda, and S3 to ensure your business data is structured, versioned, and query-ready.
- Architect Production MLOps: Set up automated training and serverless deployment pipelines using Terraform, AWS SageMaker, and Docker.
- Automate Data Quality: Implement "quarantine" patterns and YAML-based validation engines to ensure 100% data integrity before it hits your analytics or models.
- Build High-Performance ETL: Use Polars and AWS Glue to process multi-tenant data with massive speed improvements over traditional frameworks.
- Develop Agentic AI Workflows: Build autonomous AI agents with memory using LangGraph for complex, multi-step business logic (e.g., regulatory compliance or automated support).
Client-Focused Skill Summary/Skills Description
- Data Engineering & Infrastructure: Terraform (IaC), AWS (Glue, S3, Lambda, ECR, Athena), Polars (Expert), SQL, ETL Architecture, Schema Versioning, Data Warehousing (S3 Lakehouse).
- MLOps & Production: AWS SageMaker, Docker, Local-First Development, CI/CD for ML, Anomaly Detection (Residual-based), SHAP (Explainability).
- AI & Generative AI: Agentic AI (LangGraph), LangChain, Hybrid RAG (BM25 + Semantic), Vector Databases (Qdrant, ChromaDB), Prompt Engineering, RAGAS (Evaluation).
- Machine Learning: PyTorch, TensorFlow, Scikit-learn, XGBoost, LightGBM, Computer Vision, NLP/NLU.
- Development & Visualization: Python (Expert), JavaScript (ES6+), React, Tailwind CSS, Cytoscape.js (Network Mapping), Git.
Relevant Skills
- Data Engineering & ETL
- Infrastructure as Code (Terraform)
- MLOps Architecture (AWS)
- Agentic AI (LangGraph)
- High-Performance Data Processing (Polars)
- Retrieval-Augmented Generation (RAG)
- Python (Expert)
- Machine Learning (XGBoost, LightGBM)
- Automated Data Validation
- Strategic Technical Consulting
Highest Educational Attainment
- Data Engineering Zoomcamp (DataTalks.Club)
- AI/LLM Engineering for Software Developers - PSI AI Academy (Specialized in LangGraph & Agentic Patterns)
- Data Science Fellowship - Eskwelabs (Top-tier PH Data Science Bootcamp)
- Bachelor of Science in Architecture - University of Mindanao
Website & Portfolio
GitHub:
HuggingFace:
LinkedIn:
Experience: 6 months - 1 year
Experience: 1 - 2 years
Experience: 6 months - 1 year
Experience: Less than 6 months
Experience: Less than 6 months
Experience: 5 - 10 years
Experience: 6 months - 1 year
“We'll definitely continue to hire people using Onlinejobs because it has taken our agency to the next level”
- Marc Diez
Onlinejobs.ph "ID Proof" indicates if "they are who they say they are".
It DOES NOT indicate skill level.
ID Proof scores are 0 - 99 with 99 being the best. It is calculated based on dozens of data points.
It's intended to help employers know who they're talking to is real, and not a fake identity.