Discussion about this post

User's avatar
Pawel Jozefiak's avatar

The ChatGPT routing strategy (GPT-5-main vs GPT-5-thinking) mirrors what I learned the hard way with my automation agent. Started using Opus for everything. Burned through API limits. Switched to Haiku by default. Quality actually improved because simpler tasks got clearer instructions instead of over-engineered prompts. Now I route based on task complexity: Haiku for email/scraping, Sonnet for content, Opus when multi-step reasoning required. The classifier layer matters more than the model itself. Cost dropped 70%, no quality loss. https://thoughts.jock.pl/p/claude-model-optimization-opus-haiku-ai-agent-costs-2026

Opinion AI's avatar

This is a super clean mental model: MCP = tool interface (plumbing), RAG = knowledge injection (memory), Agents = decision loop (manager). They’re not rivals, they sit at different layers of the stack.

The next real battleground is trust + security: once MCP connects models to real systems (files, Slack, GitHub, internal APIs), the cool demo becomes an attack surface, so teams will need least-privilege tool access, audit logs, and safety checks between agents and external MCP servers

9 more comments...

No posts

Ready for more?