EconPapers    
Economics at your fingertips  
 

Experimental quantum speed-up in reinforcement learning agents

V. Saggio (), B. E. Asenbeck, A. Hamann, T. Strömberg, P. Schiansky, V. Dunjko, N. Friis, N. C. Harris, M. Hochberg, D. Englund, S. Wölk, H. J. Briegel and P. Walther ()
Additional contact information
V. Saggio: University of Vienna, Faculty of Physics, Vienna Center for Quantum Science and Technology (VCQ)
B. E. Asenbeck: University of Vienna, Faculty of Physics, Vienna Center for Quantum Science and Technology (VCQ)
A. Hamann: Universität Innsbruck
T. Strömberg: University of Vienna, Faculty of Physics, Vienna Center for Quantum Science and Technology (VCQ)
P. Schiansky: University of Vienna, Faculty of Physics, Vienna Center for Quantum Science and Technology (VCQ)
V. Dunjko: Leiden University
N. Friis: Austrian Academy of Sciences
N. C. Harris: Massachusetts Institute of Technology
M. Hochberg: Nokia Corporation
D. Englund: Massachusetts Institute of Technology
S. Wölk: Universität Innsbruck
H. J. Briegel: Universität Innsbruck
P. Walther: University of Vienna, Faculty of Physics, Vienna Center for Quantum Science and Technology (VCQ)

Nature, 2021, vol. 591, issue 7849, 229-233

Abstract: Abstract As the field of artificial intelligence advances, the demand for algorithms that can learn quickly and efficiently increases. An important paradigm within artificial intelligence is reinforcement learning1, where decision-making entities called agents interact with environments and learn by updating their behaviour on the basis of the obtained feedback. The crucial question for practical applications is how fast agents learn2. Although various studies have made use of quantum mechanics to speed up the agent’s decision-making process3,4, a reduction in learning time has not yet been demonstrated. Here we present a reinforcement learning experiment in which the learning process of an agent is sped up by using a quantum communication channel with the environment. We further show that combining this scenario with classical communication enables the evaluation of this improvement and allows optimal control of the learning progress. We implement this learning protocol on a compact and fully tunable integrated nanophotonic processor. The device interfaces with telecommunication-wavelength photons and features a fast active-feedback mechanism, demonstrating the agent’s systematic quantum advantage in a setup that could readily be integrated within future large-scale quantum communication networks.

Date: 2021
References: Add references at CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
https://www.nature.com/articles/s41586-021-03242-7 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:nat:nature:v:591:y:2021:i:7849:d:10.1038_s41586-021-03242-7

Ordering information: This journal article can be ordered from
https://www.nature.com/

DOI: 10.1038/s41586-021-03242-7

Access Statistics for this article

Nature is currently edited by Magdalena Skipper

More articles in Nature from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-03-19
Handle: RePEc:nat:nature:v:591:y:2021:i:7849:d:10.1038_s41586-021-03242-7