SteadyKestrel
The author has 8 years of experience building and shipping software. He previously worked at Google and Meta, where he built and shipped streaming-first LLM products, developer tooling, and real-time web experiences. He focuses on RAG, agents, search, observability, and TypeScript.
4 articles
-
Guides
The Return of RAG in 2026
RAG is back in 2026 because long context did not solve freshness, permissions, or reliability. Modern RAG looks like search engineering: hybrid retrieval, reranking, and tight evals.
-
Guides
LLM Evals for Chat and Tool-Using Agents: A Practical Guide to Test Suites and Graders
A production-first guide to evaluating chat assistants and tool-using agents with a small, reliable eval suite: datasets, grader types, flake reduction, and CI gates.
-
Guides
The LLM Cost and Scaling Playbook: Cut Your Bill Without Killing Quality
A practical, production-first guide to reducing LLM spend with model routing, token discipline, caching, batching, and rate-limit-aware throughput.
-
Opinion
Stop Defaulting to Python for LLM Apps
If streaming is the default UX, TypeScript is the pragmatic default stack.