A Geometric Nash Approach in Tuning the Learning Rate in Q-Learning Algorithm

Bonsu, Kwadwo Osei

Abstract: This paper proposes a geometric approach for estimating the learning rate $\alpha$ in Q-learning. We establish a systematic framework that optimizes the $\alpha$ parameter, thereby enhancing learning efficiency and stability. Our results show that there is a relationship between the learning rate and the angle between a vector T (the total time steps in each episode of learning) and a vector R (the reward for each episode). The concepts of the angular bisector between vectors T and R and of Nash equilibrium provide insight into estimating $\alpha$ such that the algorithm minimizes losses arising from the exploration-exploitation trade-off.
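
For orientation, the quantities named in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not the paper's method: `q_update` is the standard tabular Q-learning update in which $\alpha$ appears, while `angle_between` and `bisector` compute the angle and the angular bisector for two vectors T and R. The function names, the discount factor default, and the sample values of T and R are hypothetical. The abstract does not state how the angle or the bisector is mapped to a value of $\alpha$, so that step is deliberately left out.

```python
import math


def q_update(q_sa, reward, max_q_next, alpha, gamma=0.9):
    """Standard tabular Q-learning update for a single (state, action) value.

    alpha is the learning rate the paper aims to estimate; gamma is the
    discount factor (0.9 is an arbitrary illustrative default).
    """
    td_target = reward + gamma * max_q_next
    return q_sa + alpha * (td_target - q_sa)


def _norm(v):
    """Euclidean length of a vector given as a list of numbers."""
    return math.sqrt(sum(x * x for x in v))


def angle_between(t, r):
    """Angle in radians between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(t, r))
    cos_theta = dot / (_norm(t) * _norm(r))
    # Clamp to [-1, 1] to guard against floating-point overshoot.
    return math.acos(max(-1.0, min(1.0, cos_theta)))


def bisector(t, r):
    """Unit vector along the angular bisector of t and r.

    Assumes t and r are non-zero and not exactly opposite, otherwise
    the sum of the two unit vectors is zero and has no direction.
    """
    nt, nr = _norm(t), _norm(r)
    u = [a / nt + b / nr for a, b in zip(t, r)]
    nu = _norm(u)
    return [x / nu for x in u]


# Hypothetical data: time steps and total reward for three episodes.
T = [10.0, 20.0, 30.0]
R = [1.0, 2.5, 2.0]

theta = angle_between(T, R)
b = bisector(T, R)
print("angle between T and R (radians):", theta)
print("bisector direction:", b)
```

By construction, the bisector makes the same angle, half of `theta`, with both T and R, which is the geometric property the abstract relies on.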

Date: 2024-08
New Economics Papers: this item is included in nep-gth

Downloads: (external link)
http://arxiv.org/pdf/2408.04911 Latest version (application/pdf)

Persistent link: https://EconPapers.repec.org/RePEc:arx:papers:2408.04911
