ToyArchitecture: Unsupervised learning of interpretable models of the environment
Jaroslav Vítků,
Petr Dluhoš,
Joseph Davidson,
Matěj Nikl,
Simon Andersson,
Přemysl Paška,
Jan Šinkora,
Petr Hlubuček,
Martin Stránský,
Martin Hyben,
Martin Poliak,
Jan Feyereisl and
Marek Rosa
PLOS ONE, 2020, vol. 15, issue 5, 1-50
Abstract:
Research in Artificial Intelligence (AI) has focused mostly on two extremes: either on small improvements in narrow AI domains, or on universal theoretical frameworks which are often uncomputable, or lack practical implementations. In this paper we attempt to follow a big picture view while also providing a particular theory and its implementation to present a novel, purposely simple, and interpretable hierarchical architecture. This architecture incorporates the unsupervised learning of a model of the environment, learning the influence of one’s own actions, model-based reinforcement learning, hierarchical planning, and symbolic/sub-symbolic integration in general. The learned model is stored in the form of hierarchical representations which are increasingly more abstract, but can retain details when needed. We demonstrate the universality of the architecture by testing it on a series of diverse environments ranging from audio/visual compression to discrete and continuous action spaces, to learning disentangled representations.
Date: 2020
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0230432 (text/html)
https://journals.plos.org/plosone/article/file?id= ... 30432&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pone00:0230432
DOI: 10.1371/journal.pone.0230432
Access Statistics for this article
More articles in PLOS ONE from Public Library of Science
Bibliographic data for series maintained by plosone (plosone@plos.org).