Modeling Radiologists’ Assessments to Explore Pairing Strategies for Optimized Double Reading of Screening Mammograms
Jessie J. J. Gommers,
Craig K. Abbey,
Fredrik Strand,
Sian Taylor-Phillips,
David J. Jenkinson,
Marthe Larsen,
Solveig Hofvind,
Mireille J. M. Broeders and
Ioannis Sechopoulos
Additional contact information
Jessie J. J. Gommers: Department of Medical Imaging, Radboud University Medical Center, Nijmegen, The Netherlands
Craig K. Abbey: Department of Psychological and Brain Sciences, University of California, Santa Barbara, CA, USA
Fredrik Strand: Department of Oncology-Pathology, Karolinska Institute, Stockholm, Sweden
Sian Taylor-Phillips: Warwick Medical School, University of Warwick, Coventry, UK
David J. Jenkinson: Warwick Medical School, University of Warwick, Coventry, UK
Marthe Larsen: Section for Breast Cancer Screening, Cancer Registry of Norway, Oslo, Norway
Solveig Hofvind: Section for Breast Cancer Screening, Cancer Registry of Norway, Oslo, Norway
Mireille J. M. Broeders: Dutch Expert Center for Screening (LRCB), Nijmegen, The Netherlands
Ioannis Sechopoulos: Department of Medical Imaging, Radboud University Medical Center, Nijmegen, The Netherlands
Medical Decision Making, 2024, vol. 44, issue 7, 828-842
Abstract:
Purpose To develop a model that simulates radiologist assessments and use it to explore whether pairing readers based on their individual performance characteristics could optimize screening performance. Methods Logistic regression models were designed and used to model individual radiologist assessments. For model evaluation, model-predicted individual performance metrics and paired disagreement rates were compared against the observed data using Pearson correlation coefficients. The logistic regression models were subsequently used to simulate different screening programs with reader pairing based on individual true-positive rates (TPR) and/or false-positive rates (FPR). For this, retrospective results from breast cancer screening programs employing double reading in Sweden, England, and Norway were used. Outcomes of random pairing were compared against those composed of readers with similar and opposite TPRs/FPRs, with positive assessments defined by either reader flagging an examination as abnormal. Results The analysis data sets consisted of 936,621 (Sweden), 435,281 (England), and 1,820,053 (Norway) examinations. There was good agreement between the model-predicted and observed radiologists’ TPR and FPR ( r  ≥ 0.969). Model-predicted negative-case disagreement rates showed high correlations ( r  ≥ 0.709), whereas positive-case disagreement rates had lower correlation levels due to sparse data ( r  ≥ 0.532). Pairing radiologists with similar FPR characteristics (Sweden: 4.50% [95% confidence interval: 4.46%–4.54%], England: 5.51% [5.47%–5.56%], Norway: 8.03% [7.99%–8.07%]) resulted in significantly lower FPR than with random pairing (Sweden: 4.74% [4.70%–4.78%], England: 5.76% [5.71%–5.80%], Norway: 8.30% [8.26%–8.34%]), reducing examinations sent to consensus/arbitration while the TPR did not change significantly. Other pairing strategies resulted in equal or worse performance than random pairing. Conclusions Logistic regression models accurately predicted screening mammography assessments and helped explore different radiologist pairing strategies. Pairing readers with similar modeled FPR characteristics reduced the number of examinations unnecessarily sent to consensus/arbitration without significantly compromising the TPR. Highlights A logistic-regression model can be derived that accurately predicts individual and paired reader performance during mammography screening reading. Pairing screening mammography radiologists with similar false-positive characteristics reduced false-positive rates with no significant loss in true positives and may reduce the number of examinations unnecessarily sent to consensus/arbitration.
Keywords: breast cancer; screening; double reading; modeling; optimization (search for similar items in EconPapers)
Date: 2024
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://journals.sagepub.com/doi/10.1177/0272989X241264572 (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:sae:medema:v:44:y:2024:i:7:p:828-842
DOI: 10.1177/0272989X241264572
Access Statistics for this article
More articles in Medical Decision Making
Bibliographic data for series maintained by SAGE Publications ().