Discussion about this post

User's avatar
Nicola's avatar

Great MCP explainer. One thing worth adding for the practical side: what you build with MCP matters as much as understanding the protocol.

The most impactful MCP server I've seen (biased, because I built it) solves a problem most Claude Code users don't realize they have: the agent reads your entire codebase for every prompt, consuming 180K tokens when only 50K are relevant. 70% waste rate.

vexp is an MCP server that sits in the architecture you described — Claude Code is the Host, the MCP Client sends a request like "give me the relevant code for this query with a 50K token budget," and the vexp MCP Server queries a local dependency graph (tree-sitter AST + SQLite) and returns only the relevant subgraph.

In your MCP diagram terms: the Resource is the pre-indexed codebase graph, the Tool is a run_pipeline function that accepts a query and returns ranked code, and the Prompt guides the agent to use vexp before doing its own file exploration.

Result: tool calls dropped from 23 (Read/Grep/Glob) to 2.3 (run_pipeline) per task. Cost from $0.78 to $0.33. On SWE-bench (100 real bugs): 73% pass rate at $0.67/task vs $1.98.

For anyone watching the "12 Claude Code Features" video: context pre-filtering via MCP is the 13th feature that isn't built in yet but probably should be. It works with Claude Code, Cursor, Copilot, and 9 other agents.

https://vexp.dev/benchmark

No posts

Ready for more?