Hierarchical concept indexing of full‐text documents in the Unified Medical Language System® Information Sources Map
Lawrence W. Wright,
Holly K. Grossetta Nardini,
Alan R. Aronson and
Thomas C. Rindflesch
Journal of the American Society for Information Science, 1999, vol. 50, issue 6, 514-523
Abstract:
Full‐text documents are a vital and rapidly growing part of online biomedical information. A single large document can contain as much information as a small database, but normally lacks the tight structure and consistent indexing of a database. Retrieval systems will often miss highly relevant parts of a document if the document as a whole appears irrelevant. Access to full‐text information is further complicated by the need to search separately many disparate information resources. This research explores how these problems can be addressed by the combined use of two techniques: 1) natural language processing for automatic concept‐based indexing of full text, and 2) methods for exploiting the structure and hierarchy of full‐text documents. We describe methods for applying these techniques to a large collection of full‐text documents drawn from the Health Services/Technology Assessment Text (HSTAT) database at the National Library of Medicine (NLM), and examine how this hierarchical concept indexing can assist both document‐ and source‐level retrieval in the context of NLM's Information Sources Map project.
Date: 1999
References: Add references at CitEc
Citations:
Downloads: (external link)
https://doi.org/10.1002/(SICI)1097-4571(1999)50:63.0.CO;2-Q
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bla:jamest:v:50:y:1999:i:6:p:514-523
Ordering information: This journal article can be ordered from
https://doi.org/10.1002/(ISSN)1097-4571
Access Statistics for this article
More articles in Journal of the American Society for Information Science from Association for Information Science & Technology
Bibliographic data for series maintained by Wiley Content Delivery ().