ChatGPT Solves 60-Year-Old Math Problem: AI Breakthrough Validated by Experts

GPT-5.5 artificial intelligence ChatGPT AI breakthrough Erdős problems mathematics Terence Tao Liam Price Paul Erdős math conjectures

ChatGPT Solves 60-Year-Old Math Problem, Experts Validate Breakthrough

In a surprising turn, a 23-year-old named Liam Price appears to have solved one of the famously difficult Erdős problems—a set of abstruse math conjectures left by Hungarian mathematician Paul Erdős—using ChatGPT. The breakthrough, reported by Scientific American, marks a rare instance where AI may have genuinely outpaced human mathematicians in tackling an unsolved problem.

How a Non-Expert Solved an Erdős Problem with AI

Price, who does not hold an advanced math degree, reportedly prompted GPT-5.4 to generate a solution for one of the Erdős conjectures. While many AI-generated solutions to these problems have proven incorrect, experts reviewing Price’s submission—posted to erdosproblems.com—say his approach is valid.

The problem in question had stumped mathematicians for decades, with most experts following a predictable sequence of moves to approach it. However, ChatGPT took an unexpected path by applying a well-known formula in a novel way, bypassing the conventional approach that had led previous researchers astray.

Experts Praise AI’s Unconventional Solution

Terence Tao, a mathematician at the University of California, Los Angeles and a leading voice in evaluating AI’s role in mathematics, noted the significance of the AI’s approach:

“This one is a bit different because people did look at it, and the humans that looked at it just collectively made a slight wrong turn at move one.”

There was kind of a standard sequence of moves that everyone who worked on the problem previously started by doing, but the AI took an unexpected approach by using a well-known formula that no one had thought to apply to this type of question.”

Tao maintains a database of AI-assisted solutions to Erdős problems, but most have either rediscovered existing proofs or provided flawed reasoning. This latest case, however, suggests AI may have truly “thought” outside conventional boundaries.

Human Expertise Still Required to Refine AI Output

While the AI’s raw output was initially unclear, human mathematicians played a crucial role in refining and validating the solution. Jared Lichtman, a mathematician at Stanford University whose doctoral thesis focused on an Erdős conjecture, explained:

The raw output of ChatGPT’s proof was actually quite poor. So it required an expert to kind of sift through and actually understand what it was trying to say.”

Tao added that the breakthrough represents a new way to conceptualize large numbers and their properties:

“We have discovered a new way to think about large numbers and their anatomy. It’s a nice achievement. I think the jury is still out on the long-term significance.”

Caution Urged Despite Enthusiasm

Despite the excitement, experts urge caution. In October 2023, Kevin Weil, then a vice president at OpenAI, prematurely celebrated ChatGPT’s solution to another Erdős problem—only for the claim to collapse under scrutiny. Weil deleted his post after competitors exposed that the AI had merely regurgitated an existing proof.

The incident underscores the risks of overhyping AI-generated math solutions before rigorous human validation. Still, the latest breakthrough suggests that AI may occasionally offer genuinely novel insights—if not yet a fully autonomous path to mathematical discovery.

Key Takeaways

Liam Price, a 23-year-old without an advanced math degree, used ChatGPT to solve an Erdős problem.
Experts, including Terence Tao and Jared Lichtman, validated the AI’s unconventional approach.
The AI bypassed conventional methods by applying a well-known formula in a new way.
Human mathematicians were still required to refine and verify the AI’s output.
Past AI claims, such as one by Kevin Weil in October 2023, have later been debunked.
Experts remain cautiously optimistic about AI’s role in mathematical discovery.

Source: Futurism

← Previous

May 2, 1927: Landmark Supreme Court Ruling in Buck v. Bell Upholds For...

Revolutionary Infrasound Fire Suppression: Can AI-Powered Acoustic Waves Replace Traditional Sprinklers?

20:58 · 14 May 2026

Elon Musk Skips OpenAI Trial Amid Legal Pressure and International Travel to China with Trump

Elon Musk is locked in a heated trial in a lawsuit he lodged against his rival OpenAI and its CEO Sam Altman. Or at least, he’s supposed to be. Despit...

20:12 · 14 May 2026

Sam Altman’s Cross-Examination: Trustworthiness Under Fire in Musk v. Altman Trial

OpenAI CEO Sam Altman faced what sounds like a truly awful day on the stand this week during cross-examination in the ongoing Musk v. Altman court sag...

19:14 · 14 May 2026

NSF Ends Geoscience Postdoctoral Fellowships Amid Agency Reorganization

Research & Developments is a blog for brief updates that provide context for the flurry of news regarding law and policy changes that impact science a...

18:26 · 14 May 2026

Microsoft Study Reveals AI Workplace Failures: 25% Document Corruption Rate in Top Models

AI automation is typically exactly what it sounds like: automating tasks — many of which were previously carried out by humans — in an attempt to boos...

16:53 · 14 May 2026

OpenAI Faces Lawsuit Over Alleged Unauthorized Sharing of User Data with Meta and Google

A new class action lawsuit accuses OpenAI of sharing data including user chat queries and personal identifying information like emails and user IDs wi...

15:56 · 14 May 2026

Lake Tahoe Residents Face Power Shutoff as Data Centers Drain Nevada’s Grid

The data center scramble feeding off the AI boom is no longer just raising utility prices for nearby civilians — it’s rerouting their utilities entire...

15:07 · 14 May 2026

Sam Altman Testifies Musk Spent Critical OpenAI Meetings Showing Memes Instead of Discussing Tesla Merger

OpenAI CEO Sam Altman took the stand yesterday in Musk v. Altman, the chaotic, embarrassing, and yet deeply illuminating lawsuit — filed against Altma...

14:04 · 14 May 2026

Elon Musk's xAI Grok Struggles: Stagnant Growth and Poor Performance in AI Market

In the AI world, there are what the tech scholar Kate Crawford has called the “Great Houses of AI.” These are Microsoft, Amazon, Google, and Meta — gi...

Science

ChatGPT Solves Decades-Old Math Conjecture, Experts Confirm Breakthrough

ChatGPT Solves 60-Year-Old Math Problem, Experts Validate Breakthrough

How a Non-Expert Solved an Erdős Problem with AI

Experts Praise AI’s Unconventional Solution

Human Expertise Still Required to Refine AI Output

Caution Urged Despite Enthusiasm

Key Takeaways

May 2, 1927: Landmark Supreme Court Ruling in Buck v. Bell Upholds For...

Revolutionary Infrasound Fire Suppression: Can AI-Powered Acoustic Wav...

Science

ChatGPT Solves Decades-Old Math Conjecture, Experts Confirm Breakthrough

ChatGPT Solves 60-Year-Old Math Problem, Experts Validate Breakthrough

How a Non-Expert Solved an Erdős Problem with AI

Experts Praise AI’s Unconventional Solution

Human Expertise Still Required to Refine AI Output

Caution Urged Despite Enthusiasm

Key Takeaways

May 2, 1927: Landmark Supreme Court Ruling in Buck v. Bell Upholds For...

Revolutionary Infrasound Fire Suppression: Can AI-Powered Acoustic Wav...

Related articles

Elon Musk Skips OpenAI Trial Amid Legal Pressure and International Travel to China with Trump

Sam Altman’s Cross-Examination: Trustworthiness Under Fire in Musk v. Altman Trial

NSF Ends Geoscience Postdoctoral Fellowships Amid Agency Reorganization

Microsoft Study Reveals AI Workplace Failures: 25% Document Corruption Rate in Top Models

OpenAI Faces Lawsuit Over Alleged Unauthorized Sharing of User Data with Meta and Google

Lake Tahoe Residents Face Power Shutoff as Data Centers Drain Nevada’s Grid

Sam Altman Testifies Musk Spent Critical OpenAI Meetings Showing Memes Instead of Discussing Tesla Merger

Elon Musk's xAI Grok Struggles: Stagnant Growth and Poor Performance in AI Market