Discovering interesting statements from a database
F. Gebhardt
Applied Stochastic Models and Data Analysis, 1994, vol. 10, issue 1, 1-14
Abstract:
Knowledge discovery aims at extracting new knowledge from potentially large databases; this may be in the form of interesting statements about the data. Two interrelated classes of problem arise that are treated here: to put the subjective notion of ‘interesting’ into concrete terms and to deal with large numbers of statements that are related to one another (one rendering the other redundant or at least less interesting). Four increasingly subjective facets of ‘interestingness’ are identified: the subject field under consideration, the conspicuousness of a finding, its novelty, and its deviation from prior knowledge. A procedure is proposed, and tried out on two quite different data sets, that allows for specifying interestingness by various means and that ranks the results in a way that takes interestingness (relevance, evidence) as well as mutual relatedness (similarity, affinity) into account—manifestations of the second and third facets of interestingness in the given data environment.
Date: 1994
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://doi.org/10.1002/asm.3150100102
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:wly:apsmda:v:10:y:1994:i:1:p:1-14
Access Statistics for this article
More articles in Applied Stochastic Models and Data Analysis from John Wiley & Sons
Bibliographic data for series maintained by Wiley Content Delivery ().