Large Scale, Complex Processing of Health Data with MapReduce
Khanh Luan P. Nguyen and
Naveen Ashish
Additional contact information
Khanh Luan P. Nguyen: Cognie Inc., 365 San Juan Place, Pasadena, CA 91107, USA
Naveen Ashish: Cognie Inc., 365 San Juan Place, Pasadena, CA 91107, USA
Journal of Information & Knowledge Management (JIKM), 2014, vol. 13, issue 01, 1-6
Abstract:
This article describes a solution for processing large volumes of unstructured health social media data in a scalable fashion using the MapReduce framework. Our work is in the context of health informatics applications that involve complex text and language processing as well as large resources such as ontologies, which make processing even a single unit of text time-consuming. Even at a processing time on the order of one second per unit, it takes over a week to process a million units, which is unacceptable. We present a solution in which we move the processing to a MapReduce framework and achieve a significant improvement in processing performance by dividing the work across a cluster of processors. This paper describes the technical details of our work in terms of the design, modeling, and implementation of this approach. We also present experimental results demonstrating its effectiveness.
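The arithmetic behind the "over a week" figure, and the general idea of splitting per-unit work across partitions, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the word-count stand-in for the real text processing, and the assumption of a perfect split with no overhead are all hypothetical and chosen only for illustration.

```python
from collections import Counter
from functools import reduce

SECONDS_PER_DAY = 86_400


def sequential_days(units, seconds_per_unit=1.0):
    """Days needed to process all units one after another."""
    return units * seconds_per_unit / SECONDS_PER_DAY


def ideal_parallel_days(units, workers, seconds_per_unit=1.0):
    """Best-case days with `workers` processors (assumes a perfect split, no overhead)."""
    return sequential_days(units, seconds_per_unit) / workers


def map_unit(text):
    """Hypothetical stand-in for the expensive per-unit text processing."""
    return Counter(text.lower().split())


def reduce_counts(left, right):
    """Merge two partial results into one."""
    return left + right


def run_map_reduce(units, partitions):
    """Split units into partitions, map each unit, then reduce the partial results."""
    chunks = [units[i::partitions] for i in range(partitions)]
    partials = [
        reduce(reduce_counts, map(map_unit, chunk), Counter()) for chunk in chunks
    ]
    return reduce(reduce_counts, partials, Counter())


if __name__ == "__main__":
    print(f"sequential: {sequential_days(1_000_000):.2f} days")
    print(f"10 workers (ideal): {ideal_parallel_days(1_000_000, 10):.2f} days")
    print(run_map_reduce(["fever and cough", "cough again"], partitions=2))
```

In this sketch each partition is processed in an ordinary loop; in a MapReduce framework such as Hadoop, the partitions would instead be handled by separate nodes, which is where the speedup described in the abstract would come from.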
Keywords: Information extraction; MapReduce; Hadoop; parallelization
Date: 2014
Downloads:
http://www.worldscientific.com/doi/abs/10.1142/S0219649214500099
Access to full text is restricted to subscribers
Persistent link: https://EconPapers.repec.org/RePEc:wsi:jikmxx:v:13:y:2014:i:01:n:s0219649214500099
DOI: 10.1142/S0219649214500099
Journal of Information & Knowledge Management (JIKM) is currently edited by Professor Suliman Hawamdeh
More articles in Journal of Information & Knowledge Management (JIKM) from World Scientific Publishing Co. Pte. Ltd.
Bibliographic data for series maintained by Tai Tone Lim.