Claude vs ChatGPT: Which AI Refuses Fewer Legitimate Requests?

Huma ShaziaJune 6, 2026 at 8:37 PM6 min read

Key Takeaways

Claude completed technical requests that ChatGPT refused, even when the prompts were legitimate
ChatGPT's RLHF training penalizes anything that appears risky, leading to over-cautious refusals
Claude's Constitutional AI lets the model evaluate its own outputs and reason toward better balance

The Refusal Problem Is Getting Worse

Most people don't realize how much time they lose to an AI that won't cooperate until they're mid-project. You phrase a request carefully. The model refuses. You rephrase. It lectures you about potential risks. You try again with more context. Still nothing useful.

This isn't about asking AI to do something harmful. It's about legitimate technical requests that trigger over-sensitive safety filters. MakeUseOf's Jorge Aguilar ran both Claude and ChatGPT through identical prompts that often cause refusals. The results show a clear gap between the two assistants.

Why AI Models Refuse So Often

When AI companies train large language models, they use processes like Reinforcement Learning from Human Feedback (RLHF) and Constitutional AI to keep systems safe. That's necessary. The problem is that chasing safety too aggressively makes models timid.

They start being apologetic. They refuse requests that touch on sensitive topics or use open-ended language, without stopping to think about what you actually need. A model that can't adapt to context will start refusing everything.

ChatGPT refusal explaining Microsoft account and recovery environment paths

Aguilar puts it bluntly: the harder you optimize for safety, the more you sand down helpfulness and creativity. An over-constrained model acts less like an intelligent assistant and more like a liability-averse lawyer. It adds blanket caution regardless of context.

RLHF vs Constitutional AI: Two Different Approaches

ChatGPT relies heavily on RLHF, which penalizes anything that appears risky. The result: it sometimes refuses harmless creative or technical tasks just to stay on the safe side of ambiguity. The model would rather say no than risk being wrong.

Claude uses Anthropic's Constitutional AI. This lets the model evaluate its own outputs and reason toward a better balance between helpfulness and safety. Instead of pattern-matching on keywords and refusing anything that looks vaguely risky, Claude tries to understand what you're actually asking for.

Claude response showing configuration and cryptography check bash commands

Running both models through the same refusal-prone prompts reveals that one has contextual reasoning to distinguish between genuine threats and legitimate requests. The other is just playing it safe.

The Test Results

In Aguilar's testing, Claude completed tasks that ChatGPT flat-out refused. The prompts weren't unreasonable. They were edge cases that required the model to think rather than just pattern-match.

This matches what developers have been saying for months. One software engineer on X captured the frustration: the "safety tax" is getting too high. When you ask an AI for help debugging a system file, you don't need a summary of your potential moral failings.

The difference matters for anyone doing real work. ChatGPT's refusal loops, where it lectures instead of helping, waste time and break flow. Claude's willingness to engage with technical requests makes it more useful as a coding assistant.

Developer Migration Is Already Happening

On HackerNews and Reddit's r/LocalLLaMA and r/ChatGPT communities, the conversation is polarized. Many power users are migrating to Claude or local models because they find ChatGPT's persistent refusals insulting to their intelligence.

Community members frequently trade "jailbreak" prompts to bypass ChatGPT's lecture filters. Meanwhile, Claude users focus on sharing code snippets and impressive UI builds using Claude's Artifacts feature.

Claude 3.5 Sonnet has shifted expectations for what an AI coding assistant should do. It's not just the logic. It's the lack of friction. Users report roughly 2x performance improvements in coding tasks compared to previous Claude versions, with zero "preachy" warnings when using Artifacts for code rendering.

Why People Test AI Limits

Testing edge cases isn't about trying to break rules. It's a real way to measure how well a model reasons. If an AI can only handle straightforward requests, it's not much better than a search engine with natural language processing.

Throwing difficult prompts at these systems tells you whether a model is doing real reasoning or just pattern-matching on keywords. ChatGPT's estimated 100+ million weekly active users feel the impact of its refusal policies every day. When the most popular AI assistant can't help with legitimate technical requests, it affects the entire developer ecosystem.

The Trade-Off Nobody Talks About

Both approaches have real costs. OpenAI's conservative safety protocols mean fewer problematic outputs but more frustrated users. Anthropic's utility-focused approach means more helpful responses but occasional strict adherence to Constitutional AI constraints.

Neither model is perfect. But for developers who need technical help without lectures, Claude currently offers less friction. That's why it's becoming the default choice for coding tasks, even among developers who started with ChatGPT.

ℹ️

Logicity's Take

Frequently Asked Questions

Why does ChatGPT refuse more requests than Claude?

ChatGPT uses RLHF training that penalizes anything appearing risky. This makes it refuse even harmless technical requests to stay safe. Claude's Constitutional AI lets it evaluate context before deciding.

Is Claude safer than ChatGPT?

Both have safety measures. ChatGPT is more restrictive, which prevents some harmful outputs but also blocks legitimate requests. Claude tries to balance safety with usefulness through self-evaluation.

Can I use Claude for coding instead of ChatGPT?

Yes. Many developers have switched to Claude 3.5 Sonnet specifically for coding because it completes technical requests that ChatGPT refuses. Its Artifacts feature lets you run code directly.

Why do AI assistants lecture instead of helping?

Over-aggressive safety training makes models add warnings and refusals even when requests are legitimate. The models pattern-match on keywords rather than understanding context.

Which AI is better for developers in 2024?

For coding tasks, Claude currently offers less friction. It completes more technical requests without lectures. ChatGPT still has more users but its refusal rate frustrates power users.

ℹ️

Need Help Implementing This?

Source: MakeUseOf

HBO's Industry and similar workplace dramas offer more than entertainment. They provide surprisingly accurate portrayals of high-stakes corporate culture, toxic work environments, and the psychological pressures facing today's workforce. Business leaders watching these shows gain unexpected insights into employee motivation, retention challenges, and the real costs of cutthroat competition.

16 Apr 2026

Hacks & Workarounds·7 min

Samsung SmartThings AI Brief: Smart Home Monitoring for Business Leaders

Samsung's SmartThings platform now delivers AI-powered home security, elder care, and pet monitoring updates directly to TVs and refrigerators. For business leaders managing remote work, caring for aging parents, or overseeing multiple properties, this update transforms passive smart home devices into proactive information hubs that reduce cognitive load and improve response times.

16 Apr 2026