arXiv RAG System: Async Refactoring and Bug Fixes
This is a follow-up to arXiv RAG System: Engineering an Academic Paper Q&A System from Scratch. The system was functionally complete after 7 days, but...
9.7M steps of reinforcement learning in TORCS simulator. PPO, reward shaping, catastrophic forgetting — the full journey.
End-to-end academic paper Q&A system built from scratch. FastAPI + ChromaDB + LLM, fully containerised with Docker.
Multi-agent RAG system for SEC EDGAR & Companies House filings. LangGraph orchestration, parallel analysis, hallucination check with retry loop.
Real-time anomaly detection trained on normal images only. PatchCore, FastAPI inference API, Streamlit dashboard with webcam streaming.
This is a follow-up to arXiv RAG System: Engineering an Academic Paper Q&A System from Scratch. The system was functionally complete after 7 days, but...
arXiv RAG System: Engineering an Academic Paper Q&A System from Scratch
TORCS Corkscrew Challenge: A Journey Through Reinforcement Learning Failures and Breakthroughs