To Exit or Not to Exit: Cost-Effective Early-Exit Architecture Based on Markov Decision Process

Kyu-Sik Kim and Hyun-Suk Lee
Additional contact information
Kyu-Sik Kim: Department of AI and Robotics, Sejong University, Seoul 05006, Republic of Korea
Hyun-Suk Lee: Department of AI and Robotics, Sejong University, Seoul 05006, Republic of Korea

Mathematics, 2024, vol. 12, issue 14, 1-16

Abstract: Recently, studies on early-exit mechanisms have emerged to reduce the computational cost of inference in deep learning models. However, most existing early-exit architectures decide whether to exit based only on a target confidence level for the prediction, without considering the computational cost. Such an early-exit criterion fails to balance accuracy and cost, making it difficult to use across diverse environments. To address this problem, we propose a novel, cost-effective early-exit architecture whose early-exit criterion is designed on the basis of a Markov decision process (MDP). Since the early-exit decisions within an early-exit model are sequential, we model them as an MDP problem that maximizes accuracy while minimizing computational cost. We then develop a cost-effective early-exit algorithm that solves the MDP problem using reinforcement learning. For each input sample, the algorithm makes early-exit decisions dynamically, accounting for the relative importance of accuracy and computational cost in a given environment, and thereby balances the accuracy-cost trade-off regardless of the environment. Consequently, it can be used in a wide range of environments, including resource-constrained ones. Through extensive experiments, we demonstrate that the proposed architecture effectively balances this trade-off across different environments, whereas existing architectures fail to do so because they focus solely on reducing cost while preventing accuracy degradation.
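
As a rough illustration of the approach summarized above (not the authors' published implementation), the Python sketch below casts the exit-or-continue decisions as a small MDP and trains a tabular Q-learning agent on a synthetic stand-in for an early-exit network. Everything here is an assumption made for illustration: the number of exit heads N_EXITS, the cost weight BETA, the per-block cost COST_PER_BLOCK, the simulate_sample generator, and the choice of tabular Q-learning itself as the reinforcement learning method.

import numpy as np

# Hypothetical setup: a backbone with N_EXITS early-exit heads. At head t the
# agent observes a discretized prediction confidence and chooses to exit or to
# continue to the next head. The terminal reward trades correctness against
# the cumulative compute cost, weighted by BETA (the environment's relative
# importance of cost). All constants below are illustrative assumptions.
N_EXITS = 4
N_CONF_BINS = 10        # discretized confidence states
BETA = 0.3              # assumed cost weight; larger values favor earlier exits
COST_PER_BLOCK = 0.25   # assumed normalized cost of one more backbone block

rng = np.random.default_rng(0)
Q = np.zeros((N_EXITS, N_CONF_BINS, 2))  # actions: 0 = continue, 1 = exit

def simulate_sample():
    """Synthetic stand-in for an early-exit network: confidence tends to rise
    with depth, and higher confidence means a higher chance of being correct."""
    conf = np.clip(rng.beta(2, 5) + 0.15 * np.arange(N_EXITS), 0.0, 0.999)
    correct = rng.random(N_EXITS) < conf
    return conf, correct

alpha, gamma, eps = 0.1, 1.0, 0.1   # learning rate, discount, exploration
for episode in range(20000):
    conf, correct = simulate_sample()
    for t in range(N_EXITS):
        s = int(conf[t] * N_CONF_BINS)
        if t == N_EXITS - 1:                 # the final head must exit
            a = 1
        elif rng.random() < eps:             # epsilon-greedy exploration
            a = int(rng.integers(2))
        else:
            a = int(np.argmax(Q[t, s]))
        if a == 1:
            # Exit now: terminal reward = correctness - BETA * cumulative cost.
            r = float(correct[t]) - BETA * COST_PER_BLOCK * (t + 1)
            Q[t, s, a] += alpha * (r - Q[t, s, a])
            break
        # Continue: no immediate reward; bootstrap from the next head's value.
        s_next = int(conf[t + 1] * N_CONF_BINS)
        Q[t, s, a] += alpha * (gamma * np.max(Q[t + 1, s_next]) - Q[t, s, a])

Under these assumptions, sweeping BETA should reproduce the qualitative behavior the abstract describes: a larger cost weight pushes the learned policy toward earlier exits, while a smaller one favors deeper, more accurate exits.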

Keywords: deep learning; early exit; Markov decision process; reinforcement learning
JEL-codes: C
Date: 2024

Downloads: (external link)
https://www.mdpi.com/2227-7390/12/14/2263/pdf (application/pdf)
https://www.mdpi.com/2227-7390/12/14/2263/ (text/html)

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:12:y:2024:i:14:p:2263-:d:1439023


 
Handle: RePEc:gam:jmathe:v:12:y:2024:i:14:p:2263-:d:1439023