Trust Region Policy Learning for Adaptive Drug Infusion with Communication Networks in Hypertensive Patients
Mai The Vu,
Seong Han Kim,
Ha Le Nhu Ngoc Thanh,
Majid Roohi and
Tuan Hai Nguyen ()
Additional contact information
Mai The Vu: Department of Artificial Intelligence and Robotics, Sejong University, Seoul 05006, Republic of Korea
Seong Han Kim: Department of Artificial Intelligence and Robotics, Sejong University, Seoul 05006, Republic of Korea
Ha Le Nhu Ngoc Thanh: Faculty of Mechanical Engineering, Ho Chi Minh City University of Technology and Education, Ho Chi Minh City 71307, Vietnam
Majid Roohi: Department of Mathematics, Aarhus University, 8000 Aarhus, Denmark
Tuan Hai Nguyen: Faculty of Engineering, Dong Nai Technology University, Bien Hoa City, Vietnam
Mathematics, 2025, vol. 13, issue 1, 1-21
Abstract:
In the field of biomedical engineering, the issue of drug delivery constitutes a multifaceted and demanding endeavor for healthcare professionals. The intravenous administration of pharmacological agents to patients and the normalization of average arterial blood pressure (AABP) to desired thresholds represents a prevalent approach employed within clinical settings. The automated closed-loop infusion of vasoactive drugs for the purpose of modulating blood pressure (BP) in patients suffering from acute hypertension has been the focus of rigorous investigation in recent years. In previous works where model-based and fuzzy controllers are used to control AABP, model-based controllers rely on the precise mathematical model, while fuzzy controllers entail complexity due to rule sets. To overcome these challenges, this paper presents an adaptive closed-loop drug delivery system to control AABP by adjusting the infusion rate, as well as a communication time delay (CTD) for analyzing the wireless connectivity and interruption in transferring feedback data as a new insight. Firstly, a nonlinear backstepping controller (NBC) is developed to control AABP by continuously adjusting vasoactive drugs using real-time feedback. Secondly, a model-free deep reinforcement learning (MF-DRL) algorithm is integrated into the NBC to adjust dynamically the coefficients of the controller. Besides the various analyses such as normal condition (without CTD strategy), stability, and hybrid noise, a CTD analysis is implemented to illustrate the functionality of the system in a wireless manner and interruption in real-time feedback data.
Keywords: average arterial blood pressure (AABP); vasoactive drugs; nonlinear backstepping controller (NBC); model-free reinforcement learning (MF-DRL); communication time delay (CTD) (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2025
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2227-7390/13/1/136/pdf (application/pdf)
https://www.mdpi.com/2227-7390/13/1/136/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:13:y:2025:i:1:p:136-:d:1558262
Access Statistics for this article
Mathematics is currently edited by Ms. Emma He
More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().