Claude vs ChatGPT: Which AI Refuses Fewer Legitimate Requests?

Key Takeaways

- Claude completed technical requests that ChatGPT refused, even when the prompts were legitimate
- ChatGPT's RLHF training penalizes anything that appears risky, leading to over-cautious refusals
- Claude's Constitutional AI lets the model evaluate its own outputs and reason toward better balance
The Refusal Problem Is Getting Worse
Most people don't realize how much time they lose to an AI that won't cooperate until they're mid-project. You phrase a request carefully. The model refuses. You rephrase. It lectures you about potential risks. You try again with more context. Still nothing useful.
This isn't about asking AI to do something harmful. It's about legitimate technical requests that trigger over-sensitive safety filters. MakeUseOf's Jorge Aguilar ran both Claude and ChatGPT through identical prompts that often cause refusals. The results show a clear gap between the two assistants.
Why AI Models Refuse So Often
When AI companies train large language models, they use processes like Reinforcement Learning from Human Feedback (RLHF) and Constitutional AI to keep systems safe. That's necessary. The problem is that chasing safety too aggressively makes models timid.
They start being apologetic. They refuse requests that touch on sensitive topics or use open-ended language, without stopping to think about what you actually need. A model that can't adapt to context will start refusing everything.

Aguilar puts it bluntly: the harder you optimize for safety, the more you sand down helpfulness and creativity. An over-constrained model acts less like an intelligent assistant and more like a liability-averse lawyer. It adds blanket caution regardless of context.
RLHF vs Constitutional AI: Two Different Approaches
ChatGPT relies heavily on RLHF, which penalizes anything that appears risky. The result: it sometimes refuses harmless creative or technical tasks just to stay on the safe side of ambiguity. The model would rather say no than risk being wrong.
Claude uses Anthropic's Constitutional AI. This lets the model evaluate its own outputs and reason toward a better balance between helpfulness and safety. Instead of pattern-matching on keywords and refusing anything that looks vaguely risky, Claude tries to understand what you're actually asking for.

Running both models through the same refusal-prone prompts reveals that one has contextual reasoning to distinguish between genuine threats and legitimate requests. The other is just playing it safe.
The Test Results
In Aguilar's testing, Claude completed tasks that ChatGPT flat-out refused. The prompts weren't unreasonable. They were edge cases that required the model to think rather than just pattern-match.
This matches what developers have been saying for months. One software engineer on X captured the frustration: the "safety tax" is getting too high. When you ask an AI for help debugging a system file, you don't need a summary of your potential moral failings.
“The safety tax is getting too high. When I ask an AI for help debugging a system file, I don't need a summary of my potential moral failings.”
— A prominent software engineer on X
The difference matters for anyone doing real work. ChatGPT's refusal loops, where it lectures instead of helping, waste time and break flow. Claude's willingness to engage with technical requests makes it more useful as a coding assistant.
Developer Migration Is Already Happening
On HackerNews and Reddit's r/LocalLLaMA and r/ChatGPT communities, the conversation is polarized. Many power users are migrating to Claude or local models because they find ChatGPT's persistent refusals insulting to their intelligence.
Community members frequently trade "jailbreak" prompts to bypass ChatGPT's lecture filters. Meanwhile, Claude users focus on sharing code snippets and impressive UI builds using Claude's Artifacts feature.
Claude 3.5 Sonnet has shifted expectations for what an AI coding assistant should do. It's not just the logic. It's the lack of friction. Users report roughly 2x performance improvements in coding tasks compared to previous Claude versions, with zero "preachy" warnings when using Artifacts for code rendering.
“Claude 3.5 Sonnet has successfully moved the goalposts for what we expect from an AI coding assistant. It's not just the logic; it's the lack of friction.”
— Lead Developer at a top-tier tech firm
More tools for developers who prefer direct solutions
Why People Test AI Limits
Testing edge cases isn't about trying to break rules. It's a real way to measure how well a model reasons. If an AI can only handle straightforward requests, it's not much better than a search engine with natural language processing.
Throwing difficult prompts at these systems tells you whether a model is doing real reasoning or just pattern-matching on keywords. ChatGPT's estimated 100+ million weekly active users feel the impact of its refusal policies every day. When the most popular AI assistant can't help with legitimate technical requests, it affects the entire developer ecosystem.
The Trade-Off Nobody Talks About
Both approaches have real costs. OpenAI's conservative safety protocols mean fewer problematic outputs but more frustrated users. Anthropic's utility-focused approach means more helpful responses but occasional strict adherence to Constitutional AI constraints.
Neither model is perfect. But for developers who need technical help without lectures, Claude currently offers less friction. That's why it's becoming the default choice for coding tasks, even among developers who started with ChatGPT.
Logicity's Take
Frequently Asked Questions
Why does ChatGPT refuse more requests than Claude?
ChatGPT uses RLHF training that penalizes anything appearing risky. This makes it refuse even harmless technical requests to stay safe. Claude's Constitutional AI lets it evaluate context before deciding.
Is Claude safer than ChatGPT?
Both have safety measures. ChatGPT is more restrictive, which prevents some harmful outputs but also blocks legitimate requests. Claude tries to balance safety with usefulness through self-evaluation.
Can I use Claude for coding instead of ChatGPT?
Yes. Many developers have switched to Claude 3.5 Sonnet specifically for coding because it completes technical requests that ChatGPT refuses. Its Artifacts feature lets you run code directly.
Why do AI assistants lecture instead of helping?
Over-aggressive safety training makes models add warnings and refusals even when requests are legitimate. The models pattern-match on keywords rather than understanding context.
Which AI is better for developers in 2024?
For coding tasks, Claude currently offers less friction. It completes more technical requests without lectures. ChatGPT still has more users but its refusal rate frustrates power users.
Need Help Implementing This?
Source: MakeUseOf
Huma Shazia
Senior AI & Tech Writer
Related Articles
Browse all
How to Jailbreak Your Kindle: Escape Amazon's Control Before They Brick Your E-Reader
Amazon is cutting off support for older Kindles starting May 2026, but you don't have to buy a new device. Jailbreaking your Kindle lets you install custom software like KOReader, read ePub files natively, and keep your e-reader alive for years to come.

X-Sense Smoke and CO Detectors at Home Depot: UL-Certified Alarms You Can Actually Trust
X-Sense just made their UL-certified smoke and carbon monoxide detectors available at Home Depot stores nationwide. The lineup includes wireless interconnected models that can link up to 24 units, 10-year sealed batteries, and smart features designed to cut down on those annoying false alarms that make people disable their detectors entirely.

How to Change Your Browser's DNS Settings for Faster, Private Browsing in 2026
Your browser's default DNS settings are probably slowing you down and leaking your browsing history to your ISP. Here's why changing this one setting should be the first thing you do on any new device, and how to pick the right DNS provider for your needs.

Raspberry Pi at 15: Why the King of Single-Board Computers Is Losing Its Crown
After 15 years of dominating the hobbyist computing scene, the Raspberry Pi faces serious competition from cheaper alternatives, supply chain headaches, and a market that's evolved past its original mission. Here's what's happening and what it means for your next project.
Also Read

Star Citizen Alpha 4.8.1 Adds 385 Missions, Breaks Escort AI
Cloud Imperium Games shipped 385 new asteroid defence missions in Star Citizen's latest alpha patch. Players are reporting that escort targets fly into closed hangar doors and explode, failing missions before they can be completed.

Navidrome: A Self-Hosted Spotify Alternative That Actually Works
An open-source music server running on hardware as modest as a Raspberry Pi Zero can replace your Spotify subscription. Navidrome uses just 48MB of RAM at idle and connects to dozens of client apps through the Subsonic API.

2 Linux Tools That Fix Storage Problems in Minutes
Running out of disk space on Linux doesn't require hours of manual file hunting. Two tools, Baobab and BleachBit, can identify space hogs and clear system junk in under five minutes. Here's how to use them effectively.