The Cost Landscape
The March 23 Crisis
Hidden Token Taxes
CLAUDE.md Tax
~10K tokens loaded every single request. A 9-developer team found bills 3x higher than expected — CLAUDE.md alone accounted for the majority of the inflation.
Cache TTL Cut
Silently reduced from 1 hour to 5 minutes. The same prompt is now re-billed every turn instead of hitting cache. 10-20x cost inflation for some users.
Compaction Re-reads
Post-compaction, the model re-reads files it already processed. You pay for the same context twice — once when it was first loaded, again when compaction forces a re-read.
No Cost Dashboard
Cursor shows token consumption per minute. Claude Code shows nothing. Users have no visibility into what's consuming their budget until they hit the wall.
v2.1.100 Token Inflation
~20,000 invisible tokens per request in the v2.1.100 update. Limits burn ~40% faster with no visible change in behavior or output quality.
$200/mo Burns in 3 Days
Running 20+ agents on Opus 4.6 with high effort exhausts the Max 20x plan ($200/month) in 3 days. Documented by ICOR with Tom (73K views on YouTube).
"$1,619 in Claude Code API costs over 33 days."Future Stack Reviews · April 2026
"$1,892.38 over 13 months. I cancelled with receipts."Chandler Nguyen · Twitter/X · public cancellation
SubQ Code Answer
SubQ Code
Subquadratic Attention Economics
12M tokens at $1/$5 per MTok. Minimal compaction overhead — 12x more context means most sessions complete without triggering compaction at all. No cache TTL games — the context is persistent, not cached. Load everything once. The same work, a fraction of the cost.