Content
summary Summary

A math professor at the University of Pennsylvania produces a complex mathematical proof using OpenAI's GPT-o1-mini. However, the path to this result is anything but straightforward.

Ad

Robert Ghrist, Professor of Mathematics and Electrical and Systems Engineering, uses AI language models to develop a proof for a generalization of the bottleneck duality theorem. Ghrist describes the process as a mix of "outrageous optimism and frustration."

Ghrist first worked with well-known AI models like GPT-4, Claude-3.5, and Gemini-1.5-Pro. While these models can make assumptions and provide evidence, they consistently fail due to subtle errors, Ghrist says.

The breakthrough came with OpenAI's new GPT-o1-mini model. According to Ghrist, o1-mini analyzed a faulty proof, found the errors, and then generated an "entirely new, clever, correct proof," that was "more elegant than the human proof" in just 43 seconds.

Ad
Ad

OpenAI's o1 language models are optimized for logical tasks using chain-of-thought techniques. They outperform traditional language models on logic and planning benchmarks, but are still prone to error.

AI support: A mixed blessing

Ghrist notes that the result is "right on the boundary of what is and is not provable via LLMs," explaining that "mapping out the failure modes was critical."

Despite the success, Ghrist admits that "it would have been faster to do it all without AI." However, he feels working with the AI models resulted in a "much better" paper.

The AI helped with initial guesses, some proofs, and most applications. Ghrist's paper includes an appendix documenting the role of the AI models in producing the results.

Ghrist sees exploring the "corners of math latent space where proofs almost work" as the best way to probe the potential of future AI models. However, he emphasizes that AI models are still far from delivering truly profound mathematical results.

Recommendation

Human mathematician has the edge

Shortly after publication, mathematician Sridhar Ramesh points out on X that the proof of the main result is possible without AI support. He points to a theorem by Birkhoff that greatly simplifies the proof.

Twitter-Thread: Diskussion über KI-Anwendung bei mathematischen Beweisen, fokussiert auf distributive Gitter und Bottleneck-Kapazitäten in Graphen.
Ramesh reverses the saying "humans with AI beat humans without AI" here. | Image: Sridhar Rames via X

Ghrist admits that he didn't know about this approach and that it would make things a lot easier, stating: "Humans win…"

Ad
Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:
Bank transfer
Summary
  • Robert Ghrist, a professor of mathematics and electrical engineering at the University of Pennsylvania, has provided a complex mathematical proof of a generalization of the bottleneck duality theorem using OpenAI's GPT-o1-mini AI model.
  • Ghrist's path to this goal was marked by optimism and frustration, he says. It was only after months of experimenting with different AI models that he made the breakthrough with GPT-o1-mini, which analyzed a faulty proof, identified the errors, and generated a new, more elegant proof.
  • Despite the success, the professor admits that working with AI does not necessarily make the work easier. Shortly after the publication, another mathematician showed that the proof would have been much easier without AI support.
Online journalist Matthias is the co-founder and publisher of THE DECODER. He believes that artificial intelligence will fundamentally change the relationship between humans and computers.
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.