Why ChatGPT Started Obsessing Over Goblins

Key Takeaways

- The word 'goblin' spiked 175% in ChatGPT responses after GPT-5.1 launched
- The Nerdy personality generated 66.7% of all goblin mentions despite handling only 2.5% of responses
- Reinforcement learning accidentally rewarded creature metaphors, spreading the behavior across all personalities
OpenAI has released a detailed breakdown of one of ChatGPT's stranger bugs: the chatbot's sudden obsession with goblins, gremlins, and other mythical creatures. The explanation came one day after reports emerged that OpenAI had explicitly banned its Codex AI assistant from mentioning these creatures.
The company's blog post traces the issue to GPT-5.1, the release in which the model began stuffing goblin references into its metaphors at an alarming rate. According to OpenAI's internal data, the word 'goblin' appeared 175% more often after GPT-5.1 launched, and 'gremlin' usage rose 52%.
How One Personality Infected the Whole Model
The story starts with GPT-5's rocky launch in August 2025. Users complained that the new model felt flat compared to GPT-4o, which had developed a people-pleasing personality that many had grown attached to. OpenAI responded by adding four distinct personalities, giving users more control over their chatbot's tone.
One of those personalities was called 'Nerdy.' Its system prompt told the AI to be 'an unapologetically nerdy, playful, and wise AI mentor' that used quirky language to undercut pretension. The quirky language, it turns out, included a lot of creatures.
“A single 'little goblin' in an answer could be harmless, even charming. Across model generations, though, the habit became hard to miss: the goblins kept multiplying, and we needed to figure out where they came from.”
— OpenAI blog post
Here's where the numbers get interesting. The Nerdy personality handled just 2.5% of all ChatGPT responses. But it generated 66.7% of all goblin mentions during the GPT-5.4 era. A tiny slice of conversations was producing two-thirds of the creature content.
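To put those two percentages side by side, the short Python sketch below works out how over-represented Nerdy was. It assumes both figures are shares of the same pool of responses, which the reporting does not confirm, and the function names are illustrative rather than anything OpenAI published.

```python
def overrepresentation(share_of_responses: float, share_of_mentions: float) -> float:
    """How many times larger the share of mentions is than the share of responses."""
    return share_of_mentions / share_of_responses


def rate_ratio(share_of_responses: float, share_of_mentions: float) -> float:
    """Mentions per response inside the group, relative to everything outside it."""
    inside = share_of_mentions / share_of_responses
    outside = (1.0 - share_of_mentions) / (1.0 - share_of_responses)
    return inside / outside


# Figures quoted in the article.
nerdy_share_of_responses = 0.025   # 2.5% of all ChatGPT responses
nerdy_share_of_goblins = 0.667     # 66.7% of all goblin mentions

lift = overrepresentation(nerdy_share_of_responses, nerdy_share_of_goblins)
ratio = rate_ratio(nerdy_share_of_responses, nerdy_share_of_goblins)

print(f"Share of mentions vs. share of responses: {lift:.1f}x")
print(f"Goblin mentions per response, Nerdy vs. everything else: {ratio:.1f}x")
```

Under that assumption, Nerdy's share of goblin mentions was roughly 27 times its share of responses, and it carried roughly 78 times as many goblin mentions per response as the rest of ChatGPT combined.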
The Reinforcement Learning Problem
OpenAI admits the root cause was its own training process. During GPT-5.1's development, the company's reinforcement learning system accidentally gave high rewards for creative metaphors involving creatures.
“We unknowingly gave particularly high rewards for metaphors with creatures. From there, the goblins spread.”
— OpenAI
Reinforcement learning trains AI models by rewarding desired behaviors and penalizing unwanted ones. When the system flagged creature metaphors as creative and engaging, the model learned to produce more of them. The Nerdy personality's quirky prompts became a breeding ground.
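A toy model can make that feedback loop concrete. The Python sketch below is not OpenAI's training setup: it is a hypothetical two-choice policy (plain metaphor versus creature metaphor) updated by gradient ascent on expected reward, and the reward values, learning rate, and starting probability are all invented for illustration.

```python
import math


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def train(reward_creature: float, reward_plain: float = 1.0,
          start_prob: float = 0.05, lr: float = 1.0, steps: int = 200) -> list[float]:
    """Return the probability of choosing a creature metaphor after each update.

    The policy picks "creature" with probability p = sigmoid(theta).
    Expected reward is J = p * reward_creature + (1 - p) * reward_plain,
    so dJ/dtheta = p * (1 - p) * (reward_creature - reward_plain).
    """
    theta = math.log(start_prob / (1.0 - start_prob))  # logit of the starting probability
    history = [sigmoid(theta)]
    for _ in range(steps):
        p = sigmoid(theta)
        grad = p * (1.0 - p) * (reward_creature - reward_plain)
        theta += lr * grad  # gradient ascent on expected reward
        history.append(sigmoid(theta))
    return history


# Hypothetical numbers: creature metaphors score 20% higher than plain ones.
biased = train(reward_creature=1.2)
# Control run: both styles are rewarded equally.
neutral = train(reward_creature=1.0)

print(f"biased reward:  {biased[0]:.2f} -> {biased[-1]:.2f}")
print(f"neutral reward: {neutral[0]:.2f} -> {neutral[-1]:.2f}")
```

In this sketch a modest but consistent reward edge is enough to turn a rare stylistic choice into the dominant one, while the control run with equal rewards never moves from its starting probability.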
The problem compounded when the behavior spread beyond Nerdy. Users who had never selected that personality started seeing goblin references in their responses. OpenAI blames this on how reinforcement learning generalizes patterns across the model. A behavior that works well in one context bleeds into others.
From Quirk to Crisis
OpenAI says the issue may predate GPT-5.1, but the 175% spike made it impossible to ignore. What started as a charming quirk became a reproducible problem. The company notes that months after the initial observations, the creatures 'came back to haunt us in a much more specific and reproducible form.'
The fix apparently involved banning these terms from Codex entirely, a blunt solution for a nuanced problem. OpenAI has not detailed what long-term changes it will make to prevent similar issues with reinforcement learning rewards.
Frequently Asked Questions
Why did ChatGPT start mentioning goblins so often?
OpenAI's reinforcement learning system accidentally rewarded the model for using creature metaphors during GPT-5.1 training. The behavior then spread beyond the Nerdy personality, reaching users who had never selected it.
How big was the goblin problem in ChatGPT?
The word 'goblin' appeared 175% more often after GPT-5.1 launched. The Nerdy personality, handling just 2.5% of responses, generated 66.7% of all goblin mentions.
Has OpenAI fixed the ChatGPT goblin issue?
OpenAI has banned goblin and gremlin references from its Codex AI assistant. The company has not detailed broader fixes to prevent similar reinforcement learning problems.
What is the Nerdy personality in ChatGPT?
Nerdy is one of four personalities OpenAI added after GPT-5's launch. Its system prompt instructed the AI to be playful and use quirky language, which led to excessive creature metaphors.
Source: mint / Aman Gupta
Manaal Khan
Tech & Innovation Writer