The most important part is not that AI solved a math problem. It is how it found the path humans missed.

For nearly 80 years, one of the most famous unsolved problems in combinatorial geometry looked almost childish on the surface.

Take a sheet of paper. Put dots on it. Now ask a simple question: how many pairs of dots can be exactly one inch apart?

With nine dots in a line, the answer is easy. You get eight one-inch connections. Put those same nine dots into a 3 by 3 grid, and suddenly you get twelve. Now scale that up. What if you have 1,000 dots? A million? A trillion? If you are allowed to place the dots anywhere you want, what is the maximum number of exact one-inch pairs you can force?

That is the planar unit distance problem, first posed by Paul Erdős in 1946. Erdős was one of the most prolific mathematicians in history, a legendary Hungarian problem-poser whose simple questions often turned into decades-long mathematical traps. This was one of his favorites.

For almost eight decades, the basic intuition was that the best strategy should look something like a grid. Maybe a very clever grid. Maybe a carefully rescaled grid. But still basically a grid.

Then OpenAI announced that one of its unreleased internal reasoning models had found a counterexample.

The model did not just beat a benchmark. It did not just solve a contest problem. According to OpenAI and a group of outside mathematicians, it disproved a central conjecture in discrete geometry by constructing an infinite family of point arrangements that create more unit-distance pairs than the long-believed grid-based optimum.

And the unsettling part is this: the AI did not invent alien mathematics. It connected pieces humans already had.

The knowledge was there. The tools were there. They were just sitting in the wrong mental drawer.

What the problem actually asks

The unit distance problem asks: if you place n points in the plane, what is the maximum number of pairs of points that can be exactly distance 1 apart?

Think of it like designing a city.

The dots are houses. You want as many pairs of houses as possible to be exactly one block apart. A normal street grid works well. A carefully designed grid works even better. Erdős believed that, even with sophisticated tricks, you could not do dramatically better than this grid-like approach.

Mathematically, Erdős conjectured that the answer should grow roughly like n^(1+o(1)).

That notation sounds intimidating, but the idea is simple. It means the number of one-inch pairs grows only slightly faster than the number of dots themselves. The extra advantage shrinks as n gets huge. So the growth is almost linear.

OpenAI's model showed that this belief was false.

It found point arrangements with at least n^(1+δ) unit-distance pairs, where δ is some fixed positive number. Princeton mathematician Will Sawin then refined the construction and showed that one can take δ = 0.014, meaning there are arrangements with more than n^1.014 unit-distance pairs for arbitrarily large n.

That exponent looks tiny. But conceptually, it is a huge shift.

Before, the belief was that the answer was barely above n. Now we know the answer is above n by a fixed power.

The known upper bound is still much higher, around n^(4/3), or n^1.333, from work by Spencer, Szemerédi, and Trotter in 1984. So the full problem is not solved. The true maximum is still somewhere between roughly n^1.014 and n^1.333.

But Erdős' conjectured ceiling is gone.

What OpenAI's model actually did

OpenAI says the proof came from a new general-purpose internal reasoning model. The company has not named the model.

That matters. OpenAI is not claiming this was a special-purpose math engine, a proof-search scaffold, or a system built only for this problem. Noam Brown, an OpenAI researcher known for work on reasoning systems, wrote on X: "This is a general-purpose LLM. It wasn't targeted at this problem or even at mathematics. Also, it's not a scaffold. We have not pushed this model to the limit on open problems."

According to Scientific American, OpenAI mathematicians Mehtaab Sawhney and Mark Sellke fed the conjecture to the model and asked whether Erdős was right. The model produced hundreds of pages of logic and calculations.

The companion paper says the proof was first mathematically generated "in one shot" by an internal OpenAI model, then expositionally refined through human interactions with Codex. In other words, the core mathematical idea appears to have come from the model, while humans and Codex helped clean up the presentation.

That distinction matters. This is not the same as saying humans invented the construction and the model merely assisted. But it is also not the same as saying the public has seen the raw model trace.

Scientific American notes that external experts did not see the original raw output, only an edited version of its chain of thought. OpenAI released a proof, a companion remarks paper, and an abridged GPT-rewritten chain-of-thought document, but it has not disclosed the exact prompt sequence, number of attempts, compute cost, or model name.

So the safest framing is: the mathematical result appears real and significant, but the exact process around the model is still partly opaque.

How it worked, without the hard math

The key idea is that the AI did not invent a new kind of geometry.

It imported machinery from a different field: algebraic number theory.

Erdős' original grid can be understood through Gaussian integers, numbers of the form a + bi. If ordinary grids are like graph paper, Gaussian integers are like graph paper with complex-number coordinates.

OpenAI's model generalized that idea to much richer number systems called CM number fields.

A simple way to picture it:

Instead of placing dots directly on a 2D sheet of paper, imagine designing the dot pattern inside a hidden high-dimensional spreadsheet. In that hidden space, relationships between points are easier to control. There are symmetries you cannot easily see on the page. Then you cast a 2D shadow of that structure onto ordinary paper.

The final points still live in the normal 2D plane. This is not a 3D or 10D drawing. But the structure that creates them comes from a higher-dimensional number-field construction.

The proof uses deep tools like infinite class field towers and the Golod-Shafarevich theorem to show that the needed number fields actually exist. The rough recipe is:

  • Build a highly structured lattice-like object inside a high-degree number field.

  • Make sure it has lots of hidden symmetry.

  • Use those symmetries to create many pairs of points whose differences have magnitude one.

  • Project the construction back into the ordinary plane.

  • Get more exact unit-distance pairs than the old square-grid construction could produce.

That is why this result is so interesting to AI people.

The tools were not brand new. Number theorists already knew much of this machinery. But combinatorial geometers did not realize it could break this geometry conjecture.

The model found the bridge.

The real AI point: models may find the paths experts dismiss

This is the part that should make researchers uncomfortable.

Human experts have taste. Usually, taste is useful. It tells you which paths are promising and which paths are probably a waste of time.

But taste can also become a trap.

For decades, many mathematicians believed Erdős was probably right. So much of the field's energy went toward trying to prove the conjecture, not break it. Even people who considered number-field approaches tended to explore them in the more natural human way. Will Sawin's reflection in the companion paper points to a subtle but important difference: humans tended to fix the field and vary the primes, while the AI's path effectively fixed primes and let the field degree grow.

That is not a flashy difference. But it was enough to matter.

Sébastien Bubeck, who leads OpenAI's math work, gave Scientific American the cleanest summary: "The model did not invent something fundamentally new that nobody saw coming. It just executed like an amazing mathematician."

That line is the whole story.

The model did not beat humans by having magic. It beat the old intuition by connecting tools from another field and following a path humans mostly did not pursue.

The knowledge was already there. The AI connected the dots.

Why this is more credible than OpenAI's October 2025 embarrassment

OpenAI has made bold claims about Erdős problems before, and one of them blew up in its face.

In October 2025, then-OpenAI VP Kevin Weil posted that GPT-5 had found solutions to 10 previously unsolved Erdős problems and made progress on 11 others. The claim quickly unraveled. GPT-5 had not produced original solutions. It had surfaced existing papers already in the literature.

Thomas Bloom, the mathematician who maintains erdosproblems.com, called that claim "a dramatic misrepresentation." Google DeepMind CEO Demis Hassabis replied "This is embarrassing." Yann LeCun mocked the episode with: "Hoisted by their own GPTards." Weil deleted the post.

That context is important because it makes the new announcement much more interesting.

This time, OpenAI published the proof. It published a companion paper. Outside mathematicians checked the result. And Bloom, one of the people who criticized the earlier OpenAI overclaim, is now a co-author on the companion remarks.

That does not mean every bit of OpenAI's framing should be accepted uncritically. But it does mean this is not just a tweet. It is a mathematical claim with serious external validation.

What mathematicians are saying

Tim Gowers is a Fields Medal-winning mathematician, one of the most respected voices in modern mathematics. He called the result "a milestone in AI mathematics." He also wrote that if a human had submitted the result to the Annals of Mathematics, he would recommend acceptance without hesitation. Scientific American quoted him saying: "No previous AI-generated proof has come close" to this level.

Noga Alon, a Princeton mathematician and major figure in combinatorics, described the solution as "an outstanding achievement, settling a long-standing open problem." He said the construction uses sophisticated tools "in an elegant and clever way."

Thomas Bloom said the AI succeeded partly by "persevering down paths that a human may have dismissed as not worth their time to explore." He also wrote: "The human still plays a vital role in discussing, digesting, and improving this proof, and exploring its consequences."

Daniel Litt, a University of Toronto mathematician consulted by OpenAI, told Scientific American: "This is the unique interesting result produced autonomously by AI so far."

Arul Shankar, a leading number theorist at the University of Toronto, said: "Current AI models go beyond just helpers to human mathematicians - they are capable of having original ingenious ideas, and then carrying them out to fruition."

Jacob Tsimerman, also at Toronto, gave one of the best descriptions of the AI advantage: "AIs have an edge: It's not just that they can try all known methods. They can play for longer and in more treacherous waters than mathematicians without getting overwhelmed."

Melanie Matchett Wood at Harvard added an important warning. AI-generated math can fail to properly credit related prior work. She told Scientific American: "We recognized that there were very similar ideas in the literature that weren't credited. If a human had been familiar with those results and not credited them, then that would be professional malpractice."

So the reaction is not mindless hype. The reaction is serious, impressed, and cautious.

What this does not mean

This is not a full solution to the unit distance problem.

OpenAI's model found a counterexample to Erdős' conjectured upper limit. It did not find the exact maximum number of unit-distance pairs. It did not close the gap between the new lower bound and the known upper bound. It did not prove its construction is optimal.

There is also an important distinction between proving a conjecture and disproving one.

Several mathematicians noted that proving Erdős right would likely have required a much deeper new geometric theory. Disproving him with a counterexample is still a major result, but it is a different kind of result.

That is why the strongest version of the story is not: AI has conquered mathematics.

The strongest version is: a general-purpose AI model may have found a real research path that humans missed because their intuition pointed elsewhere.

That is enough.

Why this matters beyond math

Mathematics is a clean testbed for AI reasoning because proofs can be checked. A long proof either holds together or it does not. That makes math one of the best places to see whether models can move beyond summarization, coding assistance, and benchmark performance into genuine research contribution.

If this result generalizes, the implications are bigger than geometry.

AI research acceleration may not look like a robot scientist inventing alien theories from scratch. It may look like models finding hidden connections between fields humans keep mentally separate.

That matters for:

  • Physics

  • Biology

  • Materials science

  • Medicine

  • Engineering

  • AI research itself

The uncomfortable possibility is that many fields contain overlooked bridges like this. The papers exist. The tools exist. The experts exist. But the connections are buried between disciplines, conventions, and human assumptions about what is worth trying.

This is the AI capability people underprice: not just raw IQ, not just benchmark scores, but the ability to search weird paths without embarrassment.

Humans avoid dead ends because time is scarce and reputation matters. Models can explore more low-probability paths, combine distant literatures, and return with something experts can verify.

That does not make human experts less important. It makes them more important. Humans still choose the problems, verify the proofs, interpret the results, assign credit, and decide what matters.

But the creative loop is changing.

The AI math timeline is accelerating

This result lands after a string of increasingly serious AI math milestones.

In July 2024, DeepMind's AlphaProof and AlphaGeometry 2 reached silver-medal level on the International Math Olympiad.

In July 2025, OpenAI said an experimental reasoning model reached IMO gold-medal level.

In October 2025, OpenAI had its Erdős hype failure, where GPT-5 found existing literature rather than original solutions.

By early 2026, multiple AI-assisted open-problem claims began circulating, including Terence Tao's comment that one result involving GPT-5.2 was "perhaps the most unambiguous instance" of AI solving an open math problem.

Then came the May 2026 unit-distance result.

The pattern is hard to ignore:

  • 2024: AI reaches elite contest math.

  • 2025: AI reaches gold-level contest performance, but still produces public hype failures.

  • 2026: AI starts producing externally verified open-problem results.

This is not a guarantee that every field is about to be automated. But it is a signal that the frontier has moved.

The clean takeaway

OpenAI's model did not solve all of the unit distance problem.

But it did something important: it disproved an 80-year-old belief about what the answer could look like.

It found a new family of point arrangements that beat the classic grid-based construction. It did this by connecting discrete geometry with deep algebraic number theory. And outside mathematicians say the result is real, publishable, and significant.

The real story is not that AI replaced mathematicians.

The real story is that AI may now be entering the creative discovery loop of mathematics.

Not by knowing magic. Not by inventing alien math. But by connecting tools humans already had, following paths humans had mostly dismissed, and exposing blind spots inside expert intuition.

That is the part worth paying attention to.

If this generalizes, AI will not just help researchers move faster.

It will show them where their taste failed.

Sources

Reply

Avatar

or to participate

Keep Reading