Skip to content

AJAY VISHWANATHAN

Machine Learning Notes From Real Builds

Notes from building ML systems in production — the architecture decisions, the things that failed in ways benchmarks never predicted, and what actually held up.

Recent Writing

What I've been working through lately.

  • You Can't Improve What You Don't Measure: Evaluating Knowledge-Grounded Retrieval Systems

    Published on:February 24, 2026
    7 min read

    After shipping a RAG retrieval system, the hardest question is whether it actually works. This post covers RAGAS, LLM-as-judge calibration, the evaluation triad, and how to build an eval pipeline that gives you real signal.