LLM-as-Judge for Hallucination Detection: Does the Critic Agent Actually Work?
This is a follow-up to From arXiv to SEC: Building a Multi-Agent Financial Report Analyst with LangGraph. That post ended with: “The remaining question is...
9.7M steps of reinforcement learning in TORCS simulator. PPO, reward shaping, catastrophic forgetting — the full journey.
End-to-end academic paper Q&A system built from scratch. FastAPI + ChromaDB + LLM, fully containerised with Docker.
Multi-agent RAG system for SEC EDGAR & Companies House filings. LangGraph orchestration, parallel analysis, hallucination check with retry loop.
Real-time anomaly detection trained on normal images only. PatchCore, FastAPI inference API, Streamlit dashboard with webcam streaming.
This is a follow-up to From arXiv to SEC: Building a Multi-Agent Financial Report Analyst with LangGraph. That post ended with: “The remaining question is...
DefectVision: Building a Real-Time Manufacturing Defect Detector Trained on Normal Images Only
From arXiv to SEC: Building a Multi-Agent Financial Report Analyst with LangGraph
This is not a technical post. It is an account of how I ended up studying Data Science and AI at the University of Liverpool after four years of Naval Arc...
This post is a deep dive into the fine-tuning experiment from the arXiv RAG System. That post summarised it in a section - this one documents every detail...