Agent Token Costs Are System Costs: Evaluate Your Agentic Tooling End-to-End
tl;dr: Evaluate agentic tooling in realistic end-to-end agentic tasks, not in isolation. At the task level, costs are driven more by the persistence of context through many turns than by the token count of any single turn. Intro Organizations are entering the tokenmaxxing hangover stage. And with that, lots of tooling is popping up claiming to reduce token usage. I’m not an organization, but I’d love to use fewer tokens as well!...