Discussion about this post

JP

The $63 figure for a single investigation made me wince. I've been there. When you're running multi-step agentic pipelines, tool calls stack up faster than you expect, and suddenly the bill is alarming.

Your point about provider flexibility hits close to home. I ended up solving the per-token anxiety problem entirely by switching my coding agent setup to a flat-rate proxy: $30/month, no per-token charges, access to the full model library. The practical difference is that you stop second-guessing whether to throw another LLM call at a problem. The full setup and the rate limit comparison are covered at https://reading.sh/how-to-get-3x-claude-rate-limits-for-30-a-month-1d3fdb8658df.

I reckon the real shift is when providers stop charging per token entirely for certain workflows. The subscription model removes the cost anxiety that makes you hesitate before adding another layer of LLM calls to your pipeline.
