OpenAI Claims GPT-5.5 Instant Cuts Hallucinations by 52%

Key Takeaways

- OpenAI claims GPT-5.5 Instant produces 52.5% fewer hallucinated claims than GPT-5.3 Instant on medical, legal, and financial prompts
- The new model also reduced inaccurate claims by 37.3% on conversations users had flagged for errors
- GPT-5.3 Instant will remain available for three months before being retired
OpenAI announced Tuesday that its new default ChatGPT model hallucinates less than its predecessor. The company says GPT-5.5 Instant produces 52.5% fewer made-up claims compared to GPT-5.3 Instant when handling prompts about medicine, law, and finance.
The numbers come from OpenAI's internal evaluations, not independent testing. Still, the specific percentage marks a departure from the vague "improved accuracy" claims that typically accompany AI model updates.
What OpenAI Is Claiming
Beyond the headline number, OpenAI says GPT-5.5 Instant also reduced inaccurate claims by 37.3% on "especially challenging conversations users had flagged for factual errors." This suggests the company is using real user complaints to benchmark accuracy, not just synthetic test cases.
The company describes the new model as "more capable across everyday tasks." That includes better analysis of uploaded images and smarter decisions about when to search the web for answers instead of relying on training data.
OpenAI also addressed a complaint that has annoyed many ChatGPT users: excessive emoji. The company says GPT-5.5 Instant will avoid "gratuitous emojis" and deliver responses that are "tighter and more to-the-point."
Enhanced Personalization and Memory
GPT-5.5 Instant can now pull more context from previous chats and connected services like Gmail. This means responses can be tailored to your history and preferences. Google has been building similar features into Gemini, so personalization is becoming a competitive battleground.
A new "memory sources" feature will show users exactly what context ChatGPT used to personalize a response. You can delete or correct this information if it's wrong or outdated. This addresses a common concern: when AI remembers things about you, you should be able to see and control what it knows.
The enhanced personalization features roll out first to Plus and Pro subscribers on the web, with mobile apps coming later. Free, Go, Business, and Enterprise users will get access "soon," though OpenAI didn't specify a date.
Anthropic's specialized finance agents tackle similar accuracy concerns in high-stakes domains
Rollout and Transition Period
OpenAI starts rolling out GPT-5.5 Instant on Tuesday to all ChatGPT users. The company is keeping GPT-5.3 Instant available as an option for three months before retiring it.
The transition period acknowledges a real pattern: users sometimes prefer older models. When OpenAI has retired previous versions, some users publicly mourned the loss. Three months gives people time to adjust or find workarounds if they prefer the older model's behavior.
The memory sources feature is rolling out now to ChatGPT consumer plans on the web, with mobile support coming soon.
The Hallucination Problem
Hallucinations remain one of the biggest barriers to AI adoption in professional settings. A model that confidently states false medical information or cites non-existent legal precedents isn't just wrong. It's dangerous.
OpenAI's specific claims about improvement percentages are notable, but they come with caveats. "Internal evaluations" means we're taking OpenAI's word for it. Independent benchmarks would carry more weight, especially for organizations considering AI deployment in regulated industries.
The 52.5% reduction sounds impressive, but context matters. If the previous model hallucinated on 20% of high-stakes prompts, a 52.5% reduction means GPT-5.5 Instant still hallucinates on roughly 9.5% of them. That's better, but still far from reliable for unsupervised use in medicine, law, or finance.
Logicity's Take
Frequently Asked Questions
When does GPT-5.5 Instant become the default ChatGPT model?
OpenAI begins rolling it out on Tuesday, May 5, 2026, to all ChatGPT users.
How much less does GPT-5.5 Instant hallucinate compared to the previous model?
OpenAI claims 52.5% fewer hallucinated claims on high-stakes prompts covering medicine, law, and finance, and 37.3% fewer inaccurate claims on conversations users flagged for errors.
Can I still use GPT-5.3 Instant after the update?
Yes, GPT-5.3 Instant will remain available as an option for three months before OpenAI retires it.
What is the new memory sources feature in ChatGPT?
Memory sources shows you what context ChatGPT used to personalize a response, letting you delete or correct information if needed.
Who gets the enhanced personalization features first?
Plus and Pro subscribers on the web get access first, with mobile apps and other subscription tiers coming later.
Need Help Implementing This?
Huma Shazia
Senior AI & Tech Writer
اقرأ أيضاً

رأي مغاير: كيف يؤثر اختراق الأمن الداخلي الأميركي على شركاتنا الخاصة؟
في ظل اختراق عقود الأمن الداخلي الأميركي مع شركات خاصة، نناقش تأثير هذا الاختراق على مستقبل الأمن السيبراني. نستعرض الإحصاءات الموثوقة ونناقش كيف يمكن للشركات الخاصة أن تتعامل مع هذا التهديد. استمتع بقراءة هذا التحليل العميق

الإنسان في زمن ما بعد الوجود البشري: نحو نظام للتعايش بين الإنسان والروبوت - Centre for Arab Unity Studies
في هذا المقال، سنناقش كيف يمكن للبشر والروبوتات التعايش في نظام متكامل. سنستعرض التحديات والحلول المحتملة التي تضعها شركات مثل جوجل وأمازون. كما سنلقي نظرة على التوقعات المستقبلية وفقًا لتقرير ماكنزي

إطلاق ناسا لمهمة مأهولة إلى القمر: خطوة تاريخية نحو استكشاف الفضاء
تعتبر المهمة الجديدة خطوة هامة نحو استكشاف الفضاء وتطوير التكنولوجيا. سوف تشمل المهمة إرسال رواد فضاء إلى سطح القمر لconducting تجارب علمية. ستسهم هذه المهمة في تطوير فهمنا للفضاء وتحسين التكنولوجيا المستخدمة في استكشاف الفضاء.