Reynald Ace Pilpil

Aspiring AI / ML Engineer · Agentic LLM Systems & Document AI

I build production-ready ML systems — from RAG assistants and agentic LLM workflows to multimodal document AI. Every project ships to Hugging Face Spaces with a live demo you can click.

7 deployed projects 3 LLM agents End-to-end RAG Document AI

About

I'm an aspiring AI/ML Engineer who ships production systems, not just experiments. My work spans agentic LLM pipelines, RAG architectures, and multimodal document AI — all deployed with live demos on Hugging Face Spaces.

I came up through fullstack web engineering — Django, MySQL, REST APIs at production scale (1000+ users on my thesis project) — and have spent the last year going deep on applied AI/ML, building systems that move beyond notebooks to real-world deployments.

Currently focused on: agentic LLM systems, RAG architecture, and multimodal document AI.

Cavite, Philippines TUP–Cavite · Computer Engineering Open to remote roles

Technical Skills

Machine Learning & AI

Python PyTorch Hugging Face Transformers BERT DistilBERT LayoutLMv3 Flan-T5 scikit-learn XGBoost SHAP sentence-transformers ChromaDB NumPy pandas

LLM & Agentic Systems

LangChain LangGraph RAG Architecture Multi-Agent Orchestration Groq · Llama 3.3 70B Tavily Search API Pydantic v2 Prompt Engineering LangSmith

MLOps & Deployment

FastAPI Docker Hugging Face Spaces Gradio Railway REST API Design Swagger / OpenAPI pdfplumber python-docx

Backend & Data

Django Flask MySQL SQL Query Optimization JavaScript (ES6+)

Tools & Practices

Git / GitHub Agile Code Review Testing Debugging

Projects

INPUT ORCHESTRATOR OUT
Agentic

Job Application Assistant Agent

Autonomous Multi-Step LLM Agent

A 9-step agent with integrity enforcement — architecturally prohibited from fabricating resume content. A post-tailoring diff check flags violations, with every change logged in a fully auditable changelog (reason + confidence score). Pipeline: resume parsing → JD scoring → company research → ATS-tailored resume + cover letter + outreach in under 30 seconds.

Python Groq (Llama 3.3 70B) Tavily Search API Pydantic v2 Gradio python-docx
INPUT ORCHESTRATOR OUT
Agentic

Market Intelligence Agent

Stateful Multi-Agent System with Self-Revision

An autonomous research agent with a self-evaluating revision loop using LangGraph — agent critiques its own draft, scores it, and re-searches targeted gaps until quality threshold is met. Researches a company across 4 parallel angles (overview, news, competitors, market trends) via live web search and delivers a structured intelligence report as a Word document.

Python LangGraph LangChain Groq (Llama 3.3 70B) Tavily Search API Pydantic v2 Gradio LangSmith
CORPUS CHUNKS VECTORS ANSWER + CITATIONS
RAG

Philippine Labor Law RAG Assistant

End-to-End Retrieval-Augmented Generation

End-to-end RAG pipeline over the Philippine Labor Code — document ingestion, semantic chunking, vector embeddings (all-MiniLM-L6-v2), ChromaDB retrieval, and Flan-T5 generation. Returns natural-language answers (leave entitlements, overtime rules, termination procedures) with source citations grounded in the actual law, reducing hallucination through retrieval-first design.

Python LangChain ChromaDB Hugging Face Transformers Flan-T5 sentence-transformers Gradio
key value merchant "Starbucks" total ₱342.00 items [3 found] confidence 93.96%
Document AI

Financial Document Intelligence API

Production REST API

Production-grade REST API that ingests bank statement PDFs and returns structured JSON — extracted transactions, auto-categorized spending, computed financial summaries (income, expenses, savings rate), anomaly flags, and actionable insights. Containerized with Docker, deployed to Railway with auto-generated Swagger / OpenAPI docs.

Python FastAPI pdfplumber pandas Docker Railway
key value merchant "Starbucks" total ₱342.00 items [3 found] confidence 93.96%
Document AI

Receipt Data Extractor (LayoutLMv3)

Multimodal Document AI

Extracts structured data (menu items, quantities, prices, totals) from receipt images using LayoutLMv3, a multimodal transformer that jointly reasons over text, layout, and visual features. Achieved 93.96% F1 on validation set, demonstrating document AI capability beyond plain-text NLP.

Python Hugging Face Transformers LayoutLMv3 PyTorch
SHAP IMPORTANCE CHURN PROBABILITY
Classical ML

BERT Sentiment Analyzer

Social Media Intelligence Tool

Multi-input bulk sentiment analysis tool that processes batches of reviews, tweets, or comments and returns an aggregate sentiment score, breakdown chart, and the top positive / negative phrases driving the score. Dashboard-style output (not single-label classification), making it usable as a social-media intelligence tool.

Python Hugging Face Transformers PyTorch DistilBERT Gradio
SHAP IMPORTANCE CHURN PROBABILITY
Classical ML

Customer Churn Predictor

Customer Retention Dashboard

Production-ready churn risk dashboard: ingests CSV customer data, predicts churn probability with XGBoost, and outputs a retention report with risk distribution, ranked high-risk customers, and SHAP-based feature explanations — translating raw model output into business-actionable retention recommendations using explainable AI (SHAP).

Python scikit-learn XGBoost SHAP pandas Gradio

Experience

Technical Sales Intern

3E HITECH SOLUTIONS, INC.

January 2026 – June 2026
  • Supported pre-sales activities for the company's solutions, focusing on projects with the National Grid Corporation of the Philippines (NGCP) — contributing to proposal preparation, scoping, and technical clarifications with stakeholders.
  • Assisted project management across NGCP deployments — coordinating timelines, tracking deliverables, and supporting communication between technical teams and clients.

Contact

Open to AI/ML engineering roles, internships, and interesting collaborations. Based in Cavite, Philippines — open to remote.

Phone 0968 619 1930 Location Cavite, Philippines