Data discretization for novel relationship discovery in information retrieval
G. Benoît
Journal of the American Society for Information Science and Technology, 2002, vol. 53, issue 9, 736-746
Abstract:
This article describes an information retrieval, visualization, and manipulation model. After term or phrases have been input for a query, the system designed on this model offers the user multiple ways to exploit the retrieval set via an interactive interface. The retrieved data are clustered into thematic concepts related to the query, represented on screen as a grid of nodes. Users of the system may manipulate the retrieval set to explore document–document, document–concept, concept–concept relationships in the retrieval set that might otherwise be masked by altering (a) the discrete grid size of the display, (b) the influence, or weight, of various document terms and properties, and (c) mixed levels of granularity. As these factors are reweighed, the display is updated in real‐time to expose unanticipated document relationships, and shifts in cluster membership. The article outlines the mathematical model and then describes an information‐retrieval application built on the model to search structured and full‐text files. The application, written in Java, uses a small test collection of Dialog and Swiss‐Prot documents.
Date: 2002
References: Add references at CitEc
Citations:
Downloads: (external link)
https://doi.org/10.1002/asi.10079
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bla:jamist:v:53:y:2002:i:9:p:736-746
Ordering information: This journal article can be ordered from
https://doi.org/10.1002/(ISSN)1532-2890
Access Statistics for this article
More articles in Journal of the American Society for Information Science and Technology from Association for Information Science & Technology
Bibliographic data for series maintained by Wiley Content Delivery ().