AN INTEGRATED DISTRIBUTED SYSTEM FOR WEB NEWS RETRIEVAL
Man-Chung Chan,
Wei-Dong Luo and
James N. K. Liu
Additional contact information
Man-Chung Chan: SPEED, Hong Kong Polytechnic University, Hong Kong, P.R.China
Wei-Dong Luo: Department of Computing, Hong Kong Polytechnic University, Hong Kong, P.R.China
James N. K. Liu: Department of Computing, Hong Kong Polytechnic University, Hong Kong, P.R.China
Chapter 22 in Challenges in Information Technology Management, 2008, pp 147-154 from World Scientific Publishing Co. Pte. Ltd.
Abstract:
AbstractThis paper highlights the problems of information explosion and the incapability of currently available search engines in finding what we mostly want. In particularly, these search engines cannot offer users the facility of specifying the categories and time frames they receive and cannot provide the online news information with the required frequency. To address these problems, we present the design and implementation of- “Ai-Times”, a distributed web news retrieval system which can accurately retrieve and organize the web news information. We describe the optimized crawler algorithm, the news extraction algorithm, and explain how MapReduce is used in “Ai-Times” and can be improved to get better performance.
Keywords: Information Technology; Knowledge Management; Computing (search for similar items in EconPapers)
Date: 2008
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.worldscientific.com/doi/pdf/10.1142/9789812819079_0022 (application/pdf)
https://www.worldscientific.com/doi/abs/10.1142/9789812819079_0022 (text/html)
Ebook Access is available upon purchase.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:wsi:wschap:9789812819079_0022
Ordering information: This item can be ordered from
Access Statistics for this chapter
More chapters in World Scientific Book Chapters from World Scientific Publishing Co. Pte. Ltd.
Bibliographic data for series maintained by Tai Tone Lim ().