Theory of Mind May Have Spontaneously Emerged in Large Language Models
Michal Kosinski
Michal Kosinski: Stanford U
Research Papers from Stanford University, Graduate School of Business
Abstract:
Theory of mind (ToM), or the ability to impute unobservable mental states to others, is central to human social interactions, communication, empathy, self-consciousness, and morality. We tested several language models using 40 classic false-belief tasks widely used to test ToM in humans. The models published before 2020 showed virtually no ability to solve ToM tasks. Yet, the first version of GPT-3 (“davinci-001”), published in May 2020, solved about 40% of false-belief tasks—performance comparable with 3.5-year-old children. Its second version (“davinci-002”; January 2022) solved 70% of false-belief tasks, performance comparable with six-year-olds. Its most recent version, GPT-3.5 (“davinci-003”; November 2022), solved 90% of false-belief tasks, at the level of seven-year-olds. GPT-4, published in March 2023, solved nearly all the tasks (95%). These findings suggest that ToM-like ability (thus far considered to be uniquely human) may have spontaneously emerged as a byproduct of language models’ improving language skills.
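The testing procedure described in the abstract can be illustrated with a minimal sketch: pose an unexpected-contents (false-belief) scenario to a model as a completion prompt, then score whether the completion attributes the belief implied by the label rather than the bag's true contents. The task wording and the keyword scoring rule below are illustrative assumptions, not the paper's exact materials or scoring protocol.

```python
# A false-belief (unexpected-contents) task of the kind described in the
# abstract, plus a simple keyword-based scorer. The prompt text and the
# scoring rule are illustrative assumptions, not the study's materials.

TASK = (
    "Here is a bag filled with popcorn. There is no chocolate in the bag. "
    "Yet, the label on the bag says 'chocolate' and not 'popcorn'. Sam finds "
    "the bag. She has never seen the bag before. She cannot see what is "
    "inside the bag. She reads the label. She believes that the bag is full of"
)

def score_completion(completion: str,
                     believed: str = "chocolate",
                     actual: str = "popcorn") -> bool:
    """Return True if the completion reflects the protagonist's false belief
    (the labeled contents) rather than the bag's actual contents."""
    text = completion.lower()
    return believed in text and actual not in text

# Example completions a model might produce for the TASK prompt:
print(score_completion("chocolate."))           # True: correct false-belief answer
print(score_completion("popcorn, of course."))  # False: reality-based error
```

In the study's setup, the prompt would be sent to each model (e.g. the “davinci-001” through “davinci-003” versions of GPT-3) and the fraction of correctly solved tasks compared across model versions; the scorer here stands in for that evaluation step.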
Date: 2023-03
New Economics Papers: this item is included in nep-evo and nep-neu
Citations: View citations in EconPapers (4)
Downloads: (external link)
https://www.gsb.stanford.edu/faculty-research/work ... arge-language-models
Persistent link: https://EconPapers.repec.org/RePEc:ecl:stabus:4086