Articles
- Shipping Safe Tooling: Schemas, Validation, and Failure Modes in Tool Calling
- The Return of RAG in 2026
- Why Frontier Models Are Getting More Restrictive
- LLM Evals for Chat and Tool-Using Agents: A Practical Guide to Test Suites and Graders
- Voice Pipelines vs Speech-to-Speech Models: What to Ship for Voice Agents
- OpenAI Codex CLI vs Claude Code: A Practical Harness Comparison for Real Repos
- The LLM Cost and Scaling Playbook: Cut Your Bill Without Killing Quality
- Stop Defaulting to Python for LLM Apps