Aligned with Whom? Direct and social goals for AI systems
Anton Korinek and
Avital Balwit
No 17298, CEPR Discussion Papers from C.E.P.R. Discussion Papers
Abstract:
As artificial intelligence (AI) becomes more powerful and widespread, the AI alignment problem - how to ensure that AI systems pursue the goals that we want them to pursue - has garnered growing attention. This article distinguishes two types of alignment problems depending on whose goals we consider, and analyzes the different solutions necessitated by each. The direct alignment problem considers whether an AI system accomplishes the goals of the entity operating it. In contrast, the social alignment problem considers the effects of an AI system on larger groups or on society more broadly. In particular, it also considers whether the system imposes externalities on others. Whereas solutions to the direct alignment problem center around more robust implementation, social alignment problems typically arise because of conflicts between individual and group-level goals, elevating the importance of AI governance to mediate such conflicts. Addressing the social alignment problem requires both enforcing existing norms on their developers and operators and designing new norms that apply directly to AI systems.
Keywords: Agency theory; Delegation; Direct alignment; Social alignment; Ai governance (search for similar items in EconPapers)
JEL-codes: D6 O3 (search for similar items in EconPapers)
Date: 2022-05
References: Add references at CitEc
Citations:
Downloads: (external link)
https://cepr.org/publications/DP17298 (application/pdf)
CEPR Discussion Papers are free to download for our researchers, subscribers and members. If you fall into one of these categories but have trouble downloading our papers, please contact us at subscribers@cepr.org
Related works:
Working Paper: Aligned with Whom? Direct and Social Goals for AI Systems (2022) 
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:cpr:ceprdp:17298
Ordering information: This working paper can be ordered from
https://cepr.org/publications/DP17298
Access Statistics for this paper
More papers in CEPR Discussion Papers from C.E.P.R. Discussion Papers Centre for Economic Policy Research, 33 Great Sutton Street, London EC1V 0DX.
Bibliographic data for series maintained by ().