Large Scale, Complex Processing of Health Data with MapReduce

Khanh Luan P. Nguyen and Naveen Ashish
Additional contact information
Khanh Luan P. Nguyen: Cognie Inc., 365 San Juan Place, Pasadena, CA 91107, USA
Naveen Ashish: Cognie Inc., 365 San Juan Place, Pasadena, CA 91107, USA

Journal of Information & Knowledge Management (JIKM), 2014, vol. 13, issue 01, 1-6

Abstract: The article describes a solution for processing large volumes of unstructured health social media data in a scalable fashion using the MapReduce framework. Our work is in the context of health informatics applications that involve complex text and language processing as well as large resources such as ontologies, so processing a single unit of text takes time. Even at a processing time on the order of one second per unit, it takes over a week to process a million units, which is unacceptable. We present a solution in which we move the processing to a MapReduce framework and achieve a significant improvement in processing performance by dividing the work across a cluster of processors. This paper describes the technical details of our work in terms of the design, modeling, and implementation of such an approach. We also present experimental results demonstrating the effectiveness of our approach.
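
The arithmetic and the division of work described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the in-memory shuffle, and the vocabulary lookup (a stand-in for the ontology-based text processing) are all assumptions made for illustration, and the parallel estimate ignores cluster overhead.

```python
from collections import defaultdict

SECONDS_PER_DAY = 24 * 60 * 60


def sequential_days(units, seconds_per_unit=1.0):
    """Wall-clock days needed to process all units one after another."""
    return units * seconds_per_unit / SECONDS_PER_DAY


def ideal_parallel_days(units, workers, seconds_per_unit=1.0):
    """Best-case days when units are split evenly across workers (no overhead)."""
    return sequential_days(units, seconds_per_unit) / workers


def map_unit(text, vocabulary):
    """Map step (hypothetical): emit (term, 1) for each known term in one text unit."""
    tokens = text.lower().split()
    return [(tok, 1) for tok in tokens if tok in vocabulary]


def shuffle(pairs):
    """Group emitted values by key, as a MapReduce framework does between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_counts(groups):
    """Reduce step: sum the values collected for each key."""
    return {key: sum(values) for key, values in groups.items()}


def run_job(units, vocabulary):
    """Run map, shuffle, and reduce in a single process for illustration only."""
    pairs = [pair for text in units for pair in map_unit(text, vocabulary)]
    return reduce_counts(shuffle(pairs))


if __name__ == "__main__":
    print(f"sequential: {sequential_days(1_000_000):.2f} days")
    print(f"10 workers (ideal): {ideal_parallel_days(1_000_000, 10):.2f} days")
    posts = ["Took aspirin for migraine", "migraine again today"]
    print(run_job(posts, {"aspirin", "migraine"}))
```

Because each text unit is mapped independently of the others, the map step is the part a cluster can spread across processors; the single-process loop in `run_job` only mimics that flow.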

Keywords: Information extraction; MapReduce; Hadoop; parallelization
Date: 2014

Downloads: (external link)
http://www.worldscientific.com/doi/abs/10.1142/S0219649214500099
Access to full text is restricted to subscribers

Persistent link: https://EconPapers.repec.org/RePEc:wsi:jikmxx:v:13:y:2014:i:01:n:s0219649214500099

DOI: 10.1142/S0219649214500099

Journal of Information & Knowledge Management (JIKM) is currently edited by Professor Suliman Hawamdeh

More articles in Journal of Information & Knowledge Management (JIKM) from World Scientific Publishing Co. Pte. Ltd.
Bibliographic data for series maintained by Tai Tone Lim.

 
Page updated 2025-03-20
Handle: RePEc:wsi:jikmxx:v:13:y:2014:i:01:n:s0219649214500099