Generating Representative Phrase Sets for Text Entry Experiments by GA-Based Text Corpora Sampling
Sandi Ljubic () and
Alen Salkanovic
Additional contact information
Sandi Ljubic: University of Rijeka, Faculty of Engineering, Vukovarska 58, HR-51000 Rijeka, Croatia
Alen Salkanovic: University of Rijeka, Faculty of Engineering, Vukovarska 58, HR-51000 Rijeka, Croatia
Mathematics, 2023, vol. 11, issue 11, 1-26
Abstract:
In the field of human–computer interaction (HCI), text entry methods can be evaluated through controlled user experiments or predictive modeling techniques. While the modeling approach requires a language model, the empirical approach necessitates representative text phrases for the experimental stimuli. In this context, finding a phrase set with the best language representativeness belongs to the class of optimization problems in which a solution is sought in a large search space. We propose a genetic algorithm (GA)-based method for extracting a target phrase set from the available text corpus, optimizing its language representativeness. Kullback–Leibler divergence is utilized to evaluate candidates, considering the digram probability distributions of both the source corpus and the target sample. The proposed method is highly customizable, outperforms typical random sampling, and exhibits language independence. The representative phrase sets generated by the proposed solution facilitate a more valid comparison of the results from different text entry studies. The open source implementation enables the easy customization of the GA-based sampling method, promotes its immediate utilization, and facilitates the reproducibility of this study. In addition, we provide heuristic guidelines for preparing the text entry experiments, which consider the experiment’s intended design and the phrase set to be generated with the proposed solution.
Keywords: text entry; phrase sets; text corpus sampling; genetic algorithm; Kullback–Leibler divergence (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2023
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2227-7390/11/11/2550/pdf (application/pdf)
https://www.mdpi.com/2227-7390/11/11/2550/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:11:y:2023:i:11:p:2550-:d:1162100
Access Statistics for this article
Mathematics is currently edited by Ms. Emma He
More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().