Adverse Drug Event Detection from Electronic Health Records Using Hierarchical Recurrent Neural Networks with Dual-Level Embedding
Susmitha Wunnava (),
Xiao Qin (),
Tabassum Kakar (),
Cansu Sen (),
Elke A. Rundensteiner () and
Xiangnan Kong ()
Additional contact information
Susmitha Wunnava: Worcester Polytechnic Institute
Xiao Qin: Worcester Polytechnic Institute
Tabassum Kakar: Worcester Polytechnic Institute
Cansu Sen: Worcester Polytechnic Institute
Elke A. Rundensteiner: Worcester Polytechnic Institute
Xiangnan Kong: Worcester Polytechnic Institute
Drug Safety, 2019, vol. 42, issue 1, No 12, 113-122
Abstract:
Abstract Introduction Adverse drug event (ADE) detection is a vital step towards effective pharmacovigilance and prevention of future incidents caused by potentially harmful ADEs. The electronic health records (EHRs) of patients in hospitals contain valuable information regarding ADEs and hence are an important source for detecting ADE signals. However, EHR texts tend to be noisy. Yet applying off-the-shelf tools for EHR text preprocessing jeopardizes the subsequent ADE detection performance, which depends on a well tokenized text input. Objective In this paper, we report our experience with the NLP Challenges for Detecting Medication and Adverse Drug Events from Electronic Health Records (MADE1.0), which aims to promote deep innovations on this subject. In particular, we have developed rule-based sentence and word tokenization techniques to deal with the noise in the EHR text. Methods We propose a detection methodology by adapting a three-layered, deep learning architecture of (1) recurrent neural network [bi-directional long short-term memory (Bi-LSTM)] for character-level word representation to encode the morphological features of the medical terminology, (2) Bi-LSTM for capturing the contextual information of each word within a sentence, and (3) conditional random fields for the final label prediction by also considering the surrounding words. We experiment with different word embedding methods commonly used in word-level classification tasks and demonstrate the impact of an integrated usage of both domain-specific and general-purpose pre-trained word embedding for detecting ADEs from EHRs. Results Our system was ranked first for the named entity recognition task in the MADE1.0 challenge, with a micro-averaged F1-score of 0.8290 (official score). Conclusion Our results indicate that the integration of two widely used sequence labeling techniques that complement each other along with dual-level embedding (character level and word level) to represent words in the input layer results in a deep learning architecture that achieves excellent information extraction accuracy for EHR notes.
Date: 2019
References: View complete reference list from CitEc
Citations: View citations in EconPapers (4)
Downloads: (external link)
http://link.springer.com/10.1007/s40264-018-0765-9 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:drugsa:v:42:y:2019:i:1:d:10.1007_s40264-018-0765-9
Ordering information: This journal article can be ordered from
http://www.springer.com/adis/journal/40264
DOI: 10.1007/s40264-018-0765-9
Access Statistics for this article
Drug Safety is currently edited by Nitin Joshi
More articles in Drug Safety from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().