EconPapers    
Economics at your fingertips  
 

Mastering diverse control tasks through world models

Danijar Hafner (), Jurgis Pasukonis, Jimmy Ba and Timothy Lillicrap
Additional contact information
Danijar Hafner: Google DeepMind
Jurgis Pasukonis: Google DeepMind
Jimmy Ba: University of Toronto
Timothy Lillicrap: Google DeepMind

Nature, 2025, vol. 640, issue 8059, 647-653

Abstract: Abstract Developing a general algorithm that learns to solve tasks across a wide range of applications has been a fundamental challenge in artificial intelligence. Although current reinforcement-learning algorithms can be readily applied to tasks similar to what they have been developed for, configuring them for new application domains requires substantial human expertise and experimentation1,2. Here we present the third generation of Dreamer, a general algorithm that outperforms specialized methods across over 150 diverse tasks, with a single configuration. Dreamer learns a model of the environment and improves its behaviour by imagining future scenarios. Robustness techniques based on normalization, balancing and transformations enable stable learning across domains. Applied out of the box, Dreamer is, to our knowledge, the first algorithm to collect diamonds in Minecraft from scratch without human data or curricula. This achievement has been posed as a substantial challenge in artificial intelligence that requires exploring farsighted strategies from pixels and sparse rewards in an open world3. Our work allows solving challenging control problems without extensive experimentation, making reinforcement learning broadly applicable.

Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
https://www.nature.com/articles/s41586-025-08744-2 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:nat:nature:v:640:y:2025:i:8059:d:10.1038_s41586-025-08744-2

Ordering information: This journal article can be ordered from
https://www.nature.com/

DOI: 10.1038/s41586-025-08744-2

Access Statistics for this article

Nature is currently edited by Magdalena Skipper

More articles in Nature from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-04-18
Handle: RePEc:nat:nature:v:640:y:2025:i:8059:d:10.1038_s41586-025-08744-2