Recall‐precision trade‐off: A derivation
Michael Gordon and Manfred Kochen
Journal of the American Society for Information Science, 1989, vol. 40, issue 3, 145-151
Abstract:
The inexact nature of document retrieval gives rise to a fundamental recall‐precision trade‐off: generally, recall improves at the expense of precision, or precision improves at the expense of recall. This trade‐off is borne out empirically and has qualitatively intuitive explanations. In this article, we explore this relationship mathematically to explain it further. We see that the recall‐precision trade‐off hinges on a deceleration in the proportion of relevant documents which are retrieved, successively, over time. Further, we examine several mathematical functions sharing this property and conclude that the equation that best models recall as a function of time is the logarithm of a quadratic function. Our conclusion meets the following requirements: the function we derive predicts non‐decreasing recall over time until the last relevant document is retrieved (regardless of the density of relevant documents in the collection) without imposing any artificial restrictions on either what percentage of the collection would need to be examined to achieve perfect recall or what the level of precision would be at that time. Other models examined fail to meet one or more of these criteria. © 1989 John Wiley & Sons, Inc.
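The trade‐off described in the abstract can be illustrated with a minimal sketch that uses only the standard definitions of recall and precision. It does not reproduce the article's logarithm-of-a-quadratic model, whose exact form is not given in this record. The function name `recall_precision_curve` and the sample ranking are hypothetical, and "time" is assumed here to mean the number of documents examined so far.

```python
def recall_precision_curve(relevance):
    """Return a (recall, precision) pair after each retrieved document.

    relevance: list of 0/1 flags in retrieval order (1 = relevant).
    """
    total_relevant = sum(relevance)
    if total_relevant == 0:
        raise ValueError("ranking contains no relevant documents")
    curve = []
    found = 0
    for examined, flag in enumerate(relevance, start=1):
        found += flag
        # recall: share of all relevant documents found so far
        # precision: share of examined documents that are relevant
        curve.append((found / total_relevant, found / examined))
    return curve


# Hypothetical ranking (an assumption for illustration only):
# relevant documents become sparser further down the list.
ranking = [1, 1, 0, 1, 0, 0, 1, 0, 0, 0]
curve = recall_precision_curve(ranking)
for examined, (recall, precision) in enumerate(curve, start=1):
    print(f"{examined:2d}  recall={recall:.2f}  precision={precision:.2f}")
```

In this made-up ranking, recall never decreases as more documents are examined, while precision falls as relevant documents thin out, which is the qualitative pattern the abstract describes.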
Date: 1989
Downloads: (external link)
https://doi.org/10.1002/(SICI)1097-4571(198905)40:33.0.CO;2-I
Persistent link: https://EconPapers.repec.org/RePEc:bla:jamest:v:40:y:1989:i:3:p:145-151