Elite Episode Replay Memory for Polyphonic Piano Fingering Estimation

Ananda Phan Iman and Chang Wook Ahn
Additional contact information
Ananda Phan Iman: Department of AI Convergence, College of Information and Computing, Gwangju Institute of Science and Technology, Gwangju 61005, Republic of Korea
Chang Wook Ahn: Department of AI Convergence, College of Information and Computing, Gwangju Institute of Science and Technology, Gwangju 61005, Republic of Korea

Mathematics, 2025, vol. 13, issue 15, 1-18

Abstract: Piano fingering estimation remains a complex problem because hand movements are combinatorial in nature and no single fingering is best in every situation. A recent model-free reinforcement learning framework for piano fingering modeled each monophonic piece as an environment and demonstrated that value-based methods outperform probability-based approaches. Building on that finding, this paper addresses the more complex polyphonic fingering problem by formulating it as an online model-free reinforcement learning task. We introduce a novel Elite Episode Replay (EER) method that improves learning efficiency by prioritizing high-quality episodes during training. This strategy accelerates early reward acquisition and improves convergence without sacrificing fingering quality. The proposed architecture produces multiple-action outputs for polyphonic settings and is trained using both elite-guided and uniform sampling. Experimental results show that the EER strategy reduces training time per step by 21% and speeds up convergence by 18% while preserving the difficulty level and results of the generated fingerings. An empirical study of elite memory size further highlights its impact on training performance for piano fingering estimation.
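The abstract describes EER only at a high level, so the following Python sketch is an illustration under stated assumptions rather than the authors' implementation. It assumes that "elite" episodes are those with the highest total reward, that a fixed number of them is retained, and that each training batch is drawn either from the elite episodes or uniformly from all stored transitions. The class name `EliteEpisodeReplay` and the parameters `elite_size` and `elite_prob` are hypothetical.

```python
import random
from collections import deque


class EliteEpisodeReplay:
    """Illustrative replay memory mixing elite-guided and uniform sampling.

    Assumption: an episode's quality is its total reward. The paper may
    rank episodes and mix the two sampling modes differently.
    """

    def __init__(self, capacity=10000, elite_size=2, elite_prob=0.5, seed=0):
        self.uniform = deque(maxlen=capacity)  # all recent transitions
        self.elite = []                        # (episode return, transitions)
        self.elite_size = elite_size           # number of elite episodes kept
        self.elite_prob = elite_prob           # chance of an elite-guided batch
        self.rng = random.Random(seed)

    def add_episode(self, transitions):
        """Store a finished episode of (state, action, reward, next_state, done)."""
        episode_return = sum(t[2] for t in transitions)
        self.uniform.extend(transitions)
        self.elite.append((episode_return, list(transitions)))
        # Keep only the highest-return episodes in the elite memory.
        self.elite.sort(key=lambda entry: entry[0], reverse=True)
        del self.elite[self.elite_size:]

    def sample(self, batch_size):
        """Draw a batch from the elite episodes or uniformly from all transitions."""
        use_elite = bool(self.elite) and self.rng.random() < self.elite_prob
        if use_elite:
            pool = [t for _, episode in self.elite for t in episode]
        else:
            pool = list(self.uniform)
        return self.rng.sample(pool, min(batch_size, len(pool)))
```

In this sketch, `elite_size` plays the role of the elite memory size whose effect the abstract says was studied empirically; the value used here is arbitrary.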

Keywords: piano fingering estimation; experience replay; replay strategy; reinforcement learning; symbolic music processing
JEL-codes: C
Date: 2025

Downloads: (external link)
https://www.mdpi.com/2227-7390/13/15/2485/pdf (application/pdf)
https://www.mdpi.com/2227-7390/13/15/2485/ (text/html)

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:13:y:2025:i:15:p:2485-:d:1715722

Mathematics is currently edited by Ms. Emma He

Bibliographic data for series maintained by MDPI Indexing Manager.

Page updated 2025-08-02
Handle: RePEc:gam:jmathe:v:13:y:2025:i:15:p:2485-:d:1715722