Featured Projects

TORCS RL Racing Agent

9.7M steps of reinforcement learning in TORCS simulator. PPO, reward shaping, catastrophic forgetting — the full journey.

Read Post

arXiv RAG System

End-to-end academic paper Q&A system built from scratch. FastAPI + ChromaDB + LLM, fully containerised with Docker.

Read Post

FinScope - Multi-Agent Financial Analyst

Multi-agent RAG system for SEC EDGAR & Companies House filings. LangGraph orchestration, parallel analysis, hallucination check with retry loop.

Read Post

DefectVision - Manufacturing Defect Detector

Real-time anomaly detection trained on normal images only. PatchCore, FastAPI inference API, Streamlit dashboard with webcam streaming.

Read Post

Recent Posts