Identifying and Managing Fraudulent Respondents in Online Stated Preferences Surveys: A Case Example from Best–Worst Scaling in Health Preferences Research

MacDonald, Karen V.; Nguyen, Geoffrey C.; Sewitch, Maida J.; Marshall, Deborah A.

Identifying and Managing Fraudulent Respondents in Online Stated Preferences Surveys: A Case Example from Best–Worst Scaling in Health Preferences Research

Karen V. MacDonald (), Geoffrey C. Nguyen (), Maida J. Sewitch () and Deborah A. Marshall ()
Additional contact information
Karen V. MacDonald: University of Calgary
Geoffrey C. Nguyen: University of Toronto
Maida J. Sewitch: Research Institute of the McGill University Health Centre
Deborah A. Marshall: University of Calgary

The Patient: Patient-Centered Outcomes Research, 2025, vol. 18, issue 4, No 7, 373-390

Abstract: Abstract Background There is limited evidence and guidance in health preferences research to prevent, identify, and manage fraudulent respondents and data fraud, especially for best–worst scaling (BWS) and discrete choice experiments with nonordered attributes. Using an example from a BWS survey in which we experienced data fraud, we aimed to: (1) develop an approach to identify, verify, and categorize fraudulent respondents; (2) assess the impact of fraudulent respondents on data and results; and (3) identify variables associated with fraudulent respondents. Methods An online BWS survey on healthcare services for inflammatory bowel disease (IBD) was administered to Canadian IBD patients. We used a three-step approach to identify, verify, and categorize respondents as likely fraudulent (LF), likely real (LR), and unsure. First, responses to 12 “red flag” variables (variables identified as indicators of fraud) were coded 0 (pass) or 1 (fail) then summed to generate a “fraudulent response score” (FRS; range: 0–12 (most likely fraudulent)) used to categorize respondents. Second, respondents categorized LR or unsure underwent age verification. Third, categorization was updated on the basis of age verification results. BWS data were analyzed using conditional logit and latent class analysis. Subgroup analysis was done by final categorization, FRS, and red flag variables. Results Overall, n = 4334 respondents underwent initial categorization resulting in 24% (n = 1019) LF and 76% (n = 3315) needing further review. After review, 75% (n = 3258) were categorized as LF and n = 484 underwent age verification. Respondent categorization was updated on the basis of age verification, with final categorization of 76% (n = 3297) LF, 14% (n = 592) unsure, 10% (n = 442) LR, and

Date: 2025
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
http://link.springer.com/10.1007/s40271-025-00740-y Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:patien:v:18:y:2025:i:4:d:10.1007_s40271-025-00740-y

Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/40271

DOI: 10.1007/s40271-025-00740-y

Access Statistics for this article

The Patient: Patient-Centered Outcomes Research is currently edited by Christopher I. Carswell

More articles in The Patient: Patient-Centered Outcomes Research from Springer, International Academy of Health Preference Research
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().