EconPapers    
Economics at your fingertips  
 

DEEP-SARSA: A REINFORCEMENT LEARNING ALGORITHM FOR AUTONOMOUS NAVIGATION

M. Andrecut () and M. K. Ali ()
Additional contact information
M. Andrecut: Department of Physics, University of Lethbridge, 4401 University Drive, Lethbridge, Alberta, T1K 3M4, Canada
M. K. Ali: Department of Physics, University of Lethbridge, 4401 University Drive, Lethbridge, Alberta, T1K 3M4, Canada

International Journal of Modern Physics C (IJMPC), 2001, vol. 12, issue 10, 1513-1523

Abstract: In this paper we discuss the application of reinforcement learning algorithms to the problem of autonomous robot navigation. We show that the autonomous navigation using the standard delayed reinforcement learning algorithms is an ill posed problem and we present a more efficient algorithm for which the convergence speed is greatly improved. The proposed algorithm (Deep-Sarsa) is based on a combination between the Depth-First Search (a graph searching algorithm) and Sarsa (a delayed reinforcement learning algorithm).

Keywords: Reinforcement learning; autonomous navigation (search for similar items in EconPapers)
Date: 2001
References: Add references at CitEc
Citations:

Downloads: (external link)
http://www.worldscientific.com/doi/abs/10.1142/S0129183101002851
Access to full text is restricted to subscribers

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:wsi:ijmpcx:v:12:y:2001:i:10:n:s0129183101002851

Ordering information: This journal article can be ordered from

DOI: 10.1142/S0129183101002851

Access Statistics for this article

International Journal of Modern Physics C (IJMPC) is currently edited by H. J. Herrmann

More articles in International Journal of Modern Physics C (IJMPC) from World Scientific Publishing Co. Pte. Ltd.
Bibliographic data for series maintained by Tai Tone Lim ().

 
Page updated 2025-03-20
Handle: RePEc:wsi:ijmpcx:v:12:y:2001:i:10:n:s0129183101002851