System design for detection and correction of spelling errors in scientific and scholarly text
Joseph J. Pollock and Antonio Zamora
Journal of the American Society for Information Science, 1984, vol. 35, issue 2, 104-109
Abstract:
The SPEEDCOP project recently completed at Chemical Abstracts Service (CAS) extracted over 50,000 misspellings from approximately 25,000,000 words of text from seven scientific and scholarly databases. The misspellings were automatically classified and analyzed and the results used to design and implement a program that proved capable of correcting most such errors. Analysis of the performance of the spelling error detection and correction programs highlighted the features that should be incorporated into a powerful and user‐friendly interactive system suitable for nonprogrammers. These include document level thresholds for misspelling detection, automatic reuse of user decisions, and user verification and control of correction. An advantage of the proposed design is that the system automatically customizes itself to its environment. This article is primarily concerned with system design, not implementation details.
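The three design features named in the abstract can be illustrated with a minimal sketch. It is not taken from the article, which states that it does not cover implementation details. The sketch assumes one plausible reading of a "document level threshold": an unknown word is flagged only if it occurs fewer than `threshold` times in the document, since a repeated unknown word is more likely a legitimate technical term. The names `flag_suspects`, `review`, `decisions`, and `ask` are hypothetical.

```python
from collections import Counter


def flag_suspects(words, dictionary, threshold=2):
    """Return unknown words that occur fewer than `threshold` times.

    Assumption (not from the article): unknown words repeated within one
    document are treated as valid terms and are not flagged.
    """
    counts = Counter(w.lower() for w in words)
    return sorted(
        w for w, n in counts.items() if w not in dictionary and n < threshold
    )


def review(suspects, decisions, ask):
    """Resolve each suspect, consulting the user only for unseen words.

    decisions: dict mapping a suspect word to the user's chosen replacement
               (or to itself if the user accepted the word as correct).
    ask:       hypothetical callback, ask(word) -> replacement; stands in
               for the interactive verification step.
    Returns the resolved mapping and the list of words the user was asked about.
    """
    asked = []
    for word in suspects:
        if word not in decisions:          # reuse earlier decisions automatically
            decisions[word] = ask(word)    # user verifies and controls the fix
            asked.append(word)
    return {w: decisions[w] for w in suspects}, asked
```

Because `decisions` persists between documents, the user is consulted about each word only once, and accepted words accumulate into a local vocabulary. That is one simple way a system could customize itself to its environment, in the sense the abstract describes.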
Date: 1984
Downloads: https://doi.org/10.1002/asi.4630350206
Persistent link: https://EconPapers.repec.org/RePEc:bla:jamest:v:35:y:1984:i:2:p:104-109
Ordering information: This journal article can be ordered from
https://doi.org/10.1002/(ISSN)1097-4571
More articles in Journal of the American Society for Information Science from Association for Information Science & Technology
Bibliographic data for series maintained by Wiley Content Delivery.