Mathematical discoveries from program search with large language models

Bernardino Romera-Paredes, Mohammadamin Barekatain, Alexander Novikov, Matej Balog, M. Pawan Kumar, Emilien Dupont, Francisco J. R. Ruiz, Jordan S. Ellenberg, Pengming Wang, Omar Fawzi, Pushmeet Kohli and Alhussein Fawzi
Additional contact information
Bernardino Romera-Paredes: Google DeepMind
Mohammadamin Barekatain: Google DeepMind
Alexander Novikov: Google DeepMind
Matej Balog: Google DeepMind
M. Pawan Kumar: Google DeepMind
Emilien Dupont: Google DeepMind
Francisco J. R. Ruiz: Google DeepMind
Jordan S. Ellenberg: University of Wisconsin-Madison
Pengming Wang: Google DeepMind
Omar Fawzi: University of Lyon (Inria, ENS Lyon, UCBL, LIP)
Pushmeet Kohli: Google DeepMind
Alhussein Fawzi: Google DeepMind

Nature, 2024, vol. 625, issue 7995, 468-475

Abstract: Large language models (LLMs) have demonstrated tremendous capabilities in solving complex tasks, from quantitative reasoning to understanding natural language. However, LLMs sometimes suffer from confabulations (or hallucinations), which can result in them making plausible but incorrect statements [1,2]. This hinders the use of current large models in scientific discovery. Here we introduce FunSearch (short for searching in the function space), an evolutionary procedure based on pairing a pretrained LLM with a systematic evaluator. We demonstrate the effectiveness of this approach to surpass the best-known results in important problems, pushing the boundary of existing LLM-based approaches [3]. Applying FunSearch to a central problem in extremal combinatorics, the cap set problem, we discover new constructions of large cap sets going beyond the best-known ones, both in finite-dimensional and asymptotic cases. This shows that it is possible to make discoveries for established open problems using LLMs. We showcase the generality of FunSearch by applying it to an algorithmic problem, online bin packing, finding new heuristics that improve on widely used baselines. In contrast to most computer search approaches, FunSearch searches for programs that describe how to solve a problem, rather than what the solution is. Beyond being an effective and scalable strategy, discovered programs tend to be more interpretable than raw solutions, enabling feedback loops between domain experts and FunSearch, and the deployment of such programs in real-world applications.
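
Illustrative sketch (not taken from the article): the abstract describes pairing a program generator with a systematic evaluator, and searching for programs that say how to build a solution rather than for the solution itself. The Python below shows what such an evaluator might look like for the cap set problem, where a cap set is a set of vectors in Z_3^n with no three distinct elements summing to zero mod 3. The names `is_cap_set`, `greedy_cap_set`, `evaluate` and the idea of scoring a candidate `priority` function through a greedy construction are assumptions made for this example; the abstract does not specify how candidate programs are represented or scored.

```python
from itertools import product


def third_point(a, b):
    """Return the unique c in Z_3^n with a + b + c = 0 (mod 3)."""
    return tuple((-(x + y)) % 3 for x, y in zip(a, b))


def is_cap_set(vectors):
    """Check that no three distinct vectors sum to zero mod 3."""
    vs = set(vectors)
    if len(vs) != len(vectors):  # duplicates are not allowed
        return False
    ordered = list(vs)
    for i, a in enumerate(ordered):
        for b in ordered[i + 1:]:
            # For a != b the completing point differs from both a and b,
            # so finding it in the set means three distinct points are collinear.
            if third_point(a, b) in vs:
                return False
    return True


def greedy_cap_set(n, priority):
    """Hypothetical construction: add vectors in decreasing priority,
    skipping any vector that would complete a line with two chosen ones."""
    candidates = sorted(product(range(3), repeat=n), key=priority, reverse=True)
    chosen, blocked = [], set()
    for v in candidates:
        if v in blocked:
            continue
        for u in chosen:
            blocked.add(third_point(u, v))
        chosen.append(v)
    return chosen


def evaluate(n, priority):
    """Hypothetical score: size of the set built, or 0 if it is invalid."""
    cap = greedy_cap_set(n, priority)
    return len(cap) if is_cap_set(cap) else 0


if __name__ == "__main__":
    # A trivial constant priority stands in for a generated program.
    print(evaluate(2, lambda v: 0))
```

In this framing, a search procedure would propose different `priority` functions and keep those with higher `evaluate` scores; because the evaluator verifies every construction, an incorrect candidate program simply receives a low score instead of producing a false claim.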

Date: 2024

Downloads: (external link)
https://www.nature.com/articles/s41586-023-06924-6 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Persistent link: https://EconPapers.repec.org/RePEc:nat:nature:v:625:y:2024:i:7995:d:10.1038_s41586-023-06924-6

Ordering information: This journal article can be ordered from
https://www.nature.com/

DOI: 10.1038/s41586-023-06924-6

Nature is currently edited by Magdalena Skipper

Bibliographic data for series maintained by Sonal Shukla and Springer Nature Abstracting and Indexing.

 
Page updated 2025-03-22
Handle: RePEc:nat:nature:v:625:y:2024:i:7995:d:10.1038_s41586-023-06924-6