
Users Canceling Claude Over Token Costs and Quality Concerns

A user detailed their decision to cancel Claude, citing token efficiency problems, perceived output quality decline, and inadequate customer support as key reasons for switching away from the subscription.

April 25, 2025


TL;DR

A detailed public post-mortem on canceling a Claude Pro subscription is circulating on Hacker News. The complaints center on token inefficiency in long contexts, perceived output quality regression, and support that does not respond to technical concerns. Here is what that means for anyone currently paying for Claude, and how Claude stacks up against the realistic alternatives.

A developer named Nicky Reinert published a detailed breakdown this week of why he canceled his Claude subscription, and the Hacker News thread that followed it has been running hot. The specific complaints are not vague vibes about AI getting worse. They are measurable: responses consuming tokens faster than equivalent tasks in earlier model versions, a perceived quality drop in technical outputs, and support interactions that produced no resolution. That combination (rising costs, declining quality, and no one picking up the phone) is the exact profile that pushes a paying user to churn.

The pattern from 2023 is repeating

This is not the first time a leading AI subscription has faced a public defection wave driven by these specific complaints. In mid-2023, a similar cycle played out with ChatGPT Plus after OpenAI updated GPT-4. Users on Reddit and Hacker News documented side-by-side comparisons showing shorter, less detailed responses than before the update. OpenAI did not formally acknowledge a quality change. The complaints faded, partly because GPT-4 Turbo arrived and reset expectations, and partly because people simply adjusted.

The historical pattern here is consistent. A frontier model ships. Early adopters pay. The model gets updated, quietly. Token behavior changes because the economics of inference push providers toward efficiency. Users notice. Some leave. The provider releases something new and the cycle restarts.

What is different now is that the alternatives are stronger. In 2023, if you left ChatGPT Plus, you were going back to GPT-3.5 or switching to a tool that was clearly worse. In 2025, leaving Claude Pro means you have Claude vs ChatGPT as a real choice, with Gemini Advanced and a growing set of capable open-source options sitting alongside both.

Where this goes over the next six months

Specific prediction: Anthropic will not respond publicly to this complaint pattern, and Claude Pro churn will not accelerate enough to force a pricing change before Q3 2025.

Here is the reasoning. Anthropic's revenue is still heavily weighted toward API and enterprise contracts, not individual Pro subscriptions. The $20/month tier is largely a brand and retention play. Losing a few hundred or a few thousand Pro subscribers to public posts does not move the needle on financials enough to trigger a response.

What will happen is that token limits will become a competitive battleground at the Pro tier. Claude versus Gemini is already partly a context window argument. If Google extends Gemini Advanced's effective context handling before Anthropic addresses the token efficiency complaints, some of the users currently sitting on the fence will move.

I expect that by September 2025, at least one of the major frontier AI providers will make a public, specific commitment about how Pro-tier token allocation works, either as a marketing move or in response to pressure. If that commitment comes from Anthropic, it will probably look like a dashboard showing usage. If it comes from Google or OpenAI first, Anthropic will follow within 90 days. That is the falsifiable version of this prediction.

How to audit your own Claude usage before canceling

If you are a Claude Pro subscriber wondering whether the complaints apply to you, here is how to check your situation before making a cancellation decision.

1. Open Claude and navigate to a task you run regularly. Run it now and note the response length and the time it takes.
2. Find a record of the same task from 60 or more days ago if you have one. Conversation history in Claude persists, so scroll back or export if you have the option.
3. Compare response length, specificity, and whether follow-up questions were needed. This is not a controlled test, but it is directionally useful.
4. Check your browser's developer tools network tab during a Claude session. You cannot see raw token counts, but you can see response sizes in bytes. A consistent reduction in response byte size across similar prompts is a real signal.
5. Run the same prompt in Claude and in ChatGPT with GPT-4o. You do not need to subscribe to both permanently; OpenAI's free tier gives you enough GPT-4o access for a direct comparison.
6. If you are using Claude for code, run the same coding task in Cursor with Claude as the backend, then with GPT-4o as the backend. Cursor makes it easy to switch models mid-session.

Verification checklist:

- Did the response length change meaningfully on comparable prompts?
- Did you need more follow-up turns to get the same output quality?
- Did the code or text produced require more editing than before?
- Is the price-per-useful-output ratio still better than the next best option?

If you answered yes to the first three and no to the last one, the cancellation math probably works in your favor.
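The before-and-after comparison in steps 1 through 4 can be sketched as a small script. This is a rough, offline heuristic, not Anthropic's tokenizer: the ~4 characters-per-token ratio is a common rule of thumb for English prose, and the sample responses are placeholders.

```python
# Minimal sketch of the audit: compare a saved response to the same prompt
# from two points in time. The ~4 chars/token ratio is a crude heuristic.

def rough_tokens(text: str) -> int:
    """Approximate token count: roughly 4 characters per token for English."""
    return max(1, len(text) // 4)

def compare_responses(old: str, new: str) -> dict:
    """Return length and estimated-token deltas between two responses."""
    old_t, new_t = rough_tokens(old), rough_tokens(new)
    return {
        "old_chars": len(old),
        "new_chars": len(new),
        "old_tokens_est": old_t,
        "new_tokens_est": new_t,
        "token_change_pct": round(100 * (new_t - old_t) / old_t, 1),
    }

# Placeholder transcripts standing in for a 60-day-old response and today's:
old_reply = "Here is a detailed, step-by-step walkthrough of the fix..." * 8
new_reply = "Here is the fix..." * 8
report = compare_responses(old_reply, new_reply)
print(report)  # a negative token_change_pct means responses got shorter
```

A single comparison proves nothing; the signal worth acting on is a consistent negative delta across several comparable prompts, which is exactly what the byte-size check in step 4 is approximating.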

On support quality

Reinert's complaint about support is harder to measure but easy to believe. Anthropic's support infrastructure for individual Pro subscribers has never been a strong point. Enterprise customers get account management. Pro subscribers get a help center and a ticket queue. If your issue is technical and nuanced, the queue tends not to produce useful answers.

Claude vs the field: where it wins and where it does not

| Tool | Best for | Context handling | Pro tier price/month | Support quality (individual) |
| --- | --- | --- | --- | --- |
| Claude Pro | Long-form writing, nuanced instruction-following | Strong up to ~100k tokens, degrades in very long sessions | $20 | Ticket queue, slow resolution |
| ChatGPT Plus | Short-turnaround tasks, image generation, plugins | Good; GPT-4o handles context efficiently at shorter windows | $20 | Ticket queue, similar speed |
| Gemini Advanced | Google Workspace integration, very long context | 1M-token window in Gemini 1.5 Pro, best in class for raw length | $19.99 (Google One AI Premium) | Google support, inconsistent |
| Perplexity Pro | Research, real-time web grounding | Limited compared to the above; not designed for long sessions | $20 | Community-focused, limited direct support |
The honest split: Claude is still the best single tool for writing tasks that require sustained instruction-following across a long document. If you are writing a 10,000-word technical spec and need the model to hold style and structure constraints throughout, Claude outperforms GPT-4o on that task in my testing.

But for shorter, faster tasks with code or data, GPT-4o is faster and the quality difference is negligible. See the Claude vs ChatGPT comparison for a longer breakdown.

Gemini wins on raw context window size. If your use case involves dropping in an entire codebase or a very long document, Gemini 1.5 Pro's 1M-token window is not matched by Claude or ChatGPT at the Pro tier.

What you are actually paying for at each tier

The pricing pages for these tools are optimistic about what $20/month buys you. Claude Pro at $20/month gives you priority access, "5x more usage than the free tier" (Anthropic's phrasing, not a hard number), and access to Claude 3.5 Sonnet and Opus. What it does not give you is a clear statement of how many messages or tokens you get per day. The soft limit kicks in during heavy usage and you get a notice to slow down or wait. There is no dashboard showing you where you are relative to the limit.

ChatGPT Plus has moved to a similar model. GPT-4o messages are no longer capped at a published number in most contexts, but heavy users still hit slowdowns and get dropped to GPT-4o mini during peak hours. OpenAI does not publish the threshold.

Gemini Advanced through Google One AI Premium includes 2TB of Google Drive storage, which complicates the price comparison. If you need that storage anyway, the effective cost of the AI capability is lower. If you do not, you are paying for storage you will not use.

The hidden cost that does not show up on any pricing page is migration time. If you have built workflows, system prompts, or integrations around Claude's specific behavior, switching to GPT-4o or Gemini means rewriting and retesting those. For a developer who has spent time building around Claude Code or a custom Claude Desktop setup, that cost is real. A quick comparison with Cursor vs Claude is worth reading if your use case is primarily code.
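The bundled-storage point is simple arithmetic worth making explicit. A sketch; the $9.99/month standalone price assumed for 2TB of Google One storage is an illustration, so check current pricing before relying on the number:

```python
# How much of a bundle's price is really buying AI capability?
# If the bundle includes storage you would buy anyway, subtract its
# standalone price; otherwise the full bundle price is the AI cost.

def effective_ai_cost(bundle_price: float, storage_value: float,
                      needs_storage: bool) -> float:
    """Monthly cost attributable to the AI capability alone, in dollars."""
    if needs_storage:
        return round(bundle_price - storage_value, 2)
    return bundle_price

# Gemini Advanced via Google One AI Premium at $19.99/month, with an
# assumed $9.99/month standalone price for the bundled 2TB of storage:
print(effective_ai_cost(19.99, 9.99, needs_storage=True))   # storage used anyway
print(effective_ai_cost(19.99, 9.99, needs_storage=False))  # storage goes unused
```

Under those assumptions, a user who needs the storage is paying about $10/month for the AI, roughly half what a Claude Pro or ChatGPT Plus subscriber pays, while a user who does not is paying the full $19.99.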

$240

annual cost of Claude Pro, before accounting for any productivity loss during model quality transitions

Overage fees do not apply at the individual Pro tier for any of these tools right now. The mechanism is rate limiting and degraded model access, not a credit card charge. That is actually the better outcome for users, but it means the product silently gets worse under heavy load rather than telling you what it costs to get full performance.

Reinert's original post is worth reading in full at his site. The Hacker News thread that followed it surfaces a number of users with similar experiences, and a roughly equal number who report no quality change.

The developer who opened this post noticed three specific things going wrong: token costs rising, output quality falling, and support providing no path forward. After working through the audit steps above and looking at the alternatives, the picture is clearer. Claude is still the right choice for specific tasks. It is not the only choice, the pricing page obscures how the limits actually work, and support for individual subscribers is notably limited. Whether that combination is worth $20/month depends on what you are building and how often you hit the walls.


Some links in this article are affiliate links. Learn more.