Projects

Live Building

RAG Document Intelligence Pipeline

Production-style retrieval-augmented generation system for document Q&A. Chunks and embeds PDFs using OpenAI embeddings, stores in FAISS, and serves answers via a FastAPI endpoint with citation tracking and query reranking.

LangChain
FAISS
OpenAI
FastAPI
Python

LLM Fine-Tuning Pipeline with QLoRA

Fine-tuned Llama 2 on domain-specific instruction data using QLoRA and PEFT. Reduced GPU memory footprint by 60% while matching full fine-tune performance. Tracked experiments with MLflow and deployed via SageMaker.

HuggingFace
QLoRA
PEFT
MLflow
SageMaker

SplitEase

Full-stack expense-splitting web app built with Next.js, Supabase, and Prisma. Supports equal splits, exact amounts, percentages, and shares — with real-time balance tracking.

Next.js
Supabase
Prisma
TypeScript

ML Serving API with MLflow & Docker

End-to-end ML serving system with experiment tracking, model registry, and versioned REST API endpoints. Containerized with Docker, orchestrated on Kubernetes, with automated retraining triggers on data drift.

MLflow
FastAPI
Docker
Kubernetes
Python

Credit Risk Analysis

Credit risk models using XGBoost and Neural Networks on the Amex dataset. Engineered temporal features, tuned hyperparameters, and used SHAP for model interpretability.

XGBoost
Neural Networks
SHAP
Python

NLP Entity Extraction API

BERT-based named entity recognition and sentiment analysis service deployed with FastAPI and Docker. Handles multi-label text classification, entity extraction, and aspect-based sentiment across customer feedback data.

BERT
spaCy
FastAPI
Docker
NLP