Human-centred mechanism design with Democratic AI
Raphael Koster,
Jan Balaguer,
Andrea Tacchetti,
Ari Weinstein,
Tina Zhu,
Oliver Hauser,
Duncan Williams,
Lucy Campbell-Gillingham,
Phoebe Thacker,
Matthew Botvinick and
Christopher Summerfield ()
Additional contact information
Raphael Koster: Deepmind
Jan Balaguer: Deepmind
Andrea Tacchetti: Deepmind
Ari Weinstein: Deepmind
Tina Zhu: Deepmind
Duncan Williams: Deepmind
Lucy Campbell-Gillingham: Deepmind
Phoebe Thacker: Deepmind
Matthew Botvinick: Deepmind
Christopher Summerfield: Deepmind
Nature Human Behaviour, 2022, vol. 6, issue 10, 1398-1407
Abstract:
Abstract Building artificial intelligence (AI) that aligns with human values is an unsolved problem. Here we developed a human-in-the-loop research pipeline called Democratic AI, in which reinforcement learning is used to design a social mechanism that humans prefer by majority. A large group of humans played an online investment game that involved deciding whether to keep a monetary endowment or to share it with others for collective benefit. Shared revenue was returned to players under two different redistribution mechanisms, one designed by the AI and the other by humans. The AI discovered a mechanism that redressed initial wealth imbalance, sanctioned free riders and successfully won the majority vote. By optimizing for human preferences, Democratic AI offers a proof of concept for value-aligned policy innovation.
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (7)
Downloads: (external link)
https://www.nature.com/articles/s41562-022-01383-x Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
Working Paper: Human-centered mechanism design with Democratic AI (2022) 
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:nat:nathum:v:6:y:2022:i:10:d:10.1038_s41562-022-01383-x
Ordering information: This journal article can be ordered from
https://www.nature.com/nathumbehav/
DOI: 10.1038/s41562-022-01383-x
Access Statistics for this article
Nature Human Behaviour is currently edited by Stavroula Kousta
More articles in Nature Human Behaviour from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().