ChatGPT 5.5 Pro Solves Open Math Problem in 17 Minutes

Key Takeaways

- ChatGPT 5.5 Pro solved an open number theory problem in 17 minutes and 5 seconds
- Fields Medalist Timothy Gowers says his own mathematical contribution was zero
- The AI improved an exponential bound to a quadratic bound, the best possible construction
Fields Medalist Timothy Gowers fed ChatGPT 5.5 Pro a set of open problems in number theory. The model returned a complete solution in 17 minutes and 5 seconds. Then it rewrote the argument as a LaTeX preprint in 2 minutes and 23 seconds.
Gowers, who holds the Combinatorics Chair at the Collège de France and is a Fellow at Trinity College Cambridge, wrote in his blog that his own mathematical contribution to the work was zero. "I didn't even do anything clever with the prompts," he said.
The Problem and the Solution
The problems came from a paper by number theorist Mel Nathanson. The paper investigates possible sizes of certain sets of integer sums and how efficiently sets with prescribed properties can be constructed. Nathanson had proved an exponential bound for one problem and asked whether it could be improved.
ChatGPT 5.5 Pro delivered the best possible construction with a quadratic bound. That's a significant jump from exponential. The model's core insight: it swapped out a component in Nathanson's proof for a more efficient variant. The variant is well known in combinatorics, but applying it to this specific problem wasn't obvious.
Gowers checked the work for correctness, then asked the model to solve a related variant. It handled that without issues. Both results are now available as a preprint.
Tackling a Harder Problem
A generalized version of the problem proved more challenging. MIT student Isaac Rajagopal had prior work on this variant, having proven an exponential dependency. Gowers gave ChatGPT Rajagopal's paper and asked for an improvement.
After 16 minutes and 41 seconds, the model delivered a first improvement. Rajagopal judged this step correct but called it a routine modification of his own work. Gowers then pushed further, asking for additional improvements.
“I got greedy”
— Timothy Gowers, describing his escalating prompts to ChatGPT 5.5 Pro
The escalation yielded results. Rajagopal, evaluating the model's key idea on the harder problem, called it "completely original." He said it was an achievement a human mathematician would be proud of after weeks of deliberation.
What This Demonstrates
The experiment is notable for several reasons. First, the problems were genuinely open. These weren't textbook exercises or competition questions with known solutions. Second, the human mathematician deliberately stayed hands-off. Gowers didn't guide the model's reasoning or correct its approach. Third, the output was publishable. The model produced a complete LaTeX preprint that Gowers verified and posted.
Gowers described the AI's output as "PhD-level." That's a specific benchmark. It means the work would be acceptable as part of a doctoral thesis in mathematics. The model demonstrated independent mathematical reasoning, not just pattern matching or retrieval of memorized proofs.
The Speed Factor
The total time for the first problem: under 20 minutes of model thinking, plus about 2.5 minutes to write the LaTeX. The model then handled a variant problem. Both results came in under two hours of total work.
Compare that to typical mathematical research timelines. A PhD student might spend weeks or months on a problem of this complexity. Rajagopal explicitly said the "completely original" idea would have taken a human mathematician weeks of deliberation.
The speed advantage compounds. A researcher using ChatGPT 5.5 Pro could explore many more problem variants in the time it would take to solve one manually. Even if most attempts fail, the throughput matters.
Limitations Worth Noting
Gowers verified all the model's work. The AI didn't self-certify its proofs as correct. Mathematical verification remains a human task, at least for now. The model also worked on problems where Gowers could assess correctness. For cutting-edge problems where verification itself is hard, the workflow would be different.
The source article was truncated, so we don't have complete details on how the harder problem ultimately resolved. What's clear is that the model made substantive progress with original ideas, not just incremental tweaks.
Logicity's Take
Frequently Asked Questions
What math problem did ChatGPT 5.5 Pro solve?
The model solved an open problem in number theory from a paper by Mel Nathanson, improving an exponential bound to a quadratic bound. The problem involved the possible sizes of certain sets of integer sums.
How long did ChatGPT 5.5 Pro take to solve the math problem?
The model took 17 minutes and 5 seconds to solve the first problem, then 2 minutes and 23 seconds to write up the solution as a LaTeX preprint.
Who verified ChatGPT 5.5 Pro's mathematical work?
Fields Medalist Timothy Gowers checked the model's work for correctness. MIT student Isaac Rajagopal also evaluated the model's approach to a harder variant of the problem.
What does PhD-level math research mean in this context?
Gowers used the term to indicate the work would be acceptable as part of a doctoral thesis in mathematics. It demonstrated independent mathematical reasoning and original problem-solving, not just retrieval of known techniques.
Is ChatGPT 5.5 Pro available to the public?
The source doesn't specify availability details for ChatGPT 5.5 Pro. The experiment was conducted by Gowers using the model, with results posted as a preprint.
Need Help Implementing This?
Source: The Decoder / Matthias Bastian
Manaal Khan
Tech & Innovation Writer
اقرأ أيضاً

رأي مغاير: كيف يؤثر اختراق الأمن الداخلي الأميركي على شركاتنا الخاصة؟
في ظل اختراق عقود الأمن الداخلي الأميركي مع شركات خاصة، نناقش تأثير هذا الاختراق على مستقبل الأمن السيبراني. نستعرض الإحصاءات الموثوقة ونناقش كيف يمكن للشركات الخاصة أن تتعامل مع هذا التهديد. استمتع بقراءة هذا التحليل العميق

الإنسان في زمن ما بعد الوجود البشري: نحو نظام للتعايش بين الإنسان والروبوت - Centre for Arab Unity Studies
في هذا المقال، سنناقش كيف يمكن للبشر والروبوتات التعايش في نظام متكامل. سنستعرض التحديات والحلول المحتملة التي تضعها شركات مثل جوجل وأمازون. كما سنلقي نظرة على التوقعات المستقبلية وفقًا لتقرير ماكنزي

إطلاق ناسا لمهمة مأهولة إلى القمر: خطوة تاريخية نحو استكشاف الفضاء
تعتبر المهمة الجديدة خطوة هامة نحو استكشاف الفضاء وتطوير التكنولوجيا. سوف تشمل المهمة إرسال رواد فضاء إلى سطح القمر لconducting تجارب علمية. ستسهم هذه المهمة في تطوير فهمنا للفضاء وتحسين التكنولوجيا المستخدمة في استكشاف الفضاء.