Development and evaluation of a multimodal feature-based predictive model for radiotherapy-induced oral mucositis in nasopharyngeal carcinoma

Li, Ling; Li, Linke; Guo, Ruifeng; Fang, Shiting; Wang, Ke; Yuan, Ge; Jiang, Danxian; Huang, Jing

PLOS ONE, 2026, vol. 21, issue 4, 1-17

Abstract: Background: Accurate prediction of radiation-induced oral mucositis (RIOM) is crucial for personalized treatment in head and neck cancer. However, developing robust predictive models from high-dimensional multimodal data (CT imaging, dose distribution, and clinical features) remains challenging, particularly in cohorts with limited sample sizes.
Objective: This study aimed to rigorously evaluate and compare the multi-class predictive performance of traditional machine learning algorithms and deep learning architectures in a small-cohort setting.
Methods: Multimodal data from 108 patients were collected. An evaluation framework incorporating nine traditional machine learning algorithms and two deep learning models (a dimensionality-reduced 1D-CNN and a multimodal 3D-CNN) was established. To ensure robust evaluation, stratified 5-fold cross-validation was employed. Model performance was reported as mean ± standard deviation (SD) across folds for multiple metrics, including the Area Under the Curve (AUC), accuracy, and Matthews Correlation Coefficient (MCC).
Results: Inter-rater reliability for RIOM grading was excellent (Cohen’s kappa = 0.82, 95% CI: 0.73–0.91). Among traditional machine learning approaches, the Extra Trees (ET) algorithm achieved the highest discriminative capacity (AUC: 0.956 ± 0.046), while Logistic Regression (LR) demonstrated the best overall accuracy (0.832 ± 0.155) and stability. Among the deep learning models, the lightweight 1D-CNN using fused low-dimensional features showed competitive and robust performance (AUC: 0.900 ± 0.072; Accuracy: 0.732 ± 0.140). In stark contrast, the high-dimensional multimodal 3D-CNN suffered from severe overfitting and mode collapse, yielding significantly inferior results (AUC: 0.568 ± 0.090; MCC: −0.025 ± 0.031).
Conclusions: For small-cohort radiomics and dosimetric analyses, ensemble learning models (e.g., ET) and appropriately regularized linear models (e.g., LR) remain highly effective. While deep learning holds promise, high-dimensional architectures like 3D-CNNs are highly susceptible to mode collapse without massive datasets. Instead, employing feature dimensionality reduction combined with lightweight networks (1D-CNN) is a vastly superior strategy to extract reliable predictive patterns from limited clinical data.
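The evaluation protocol named in the abstract (stratified 5-fold cross-validation, with metrics reported as mean ± SD across folds) can be illustrated with a minimal standard-library sketch. It is not the authors' code. The three-grade label distribution, the helper names `stratified_folds`, `multiclass_mcc`, and `summarize`, the use of the sample SD, and the majority-class "collapsed" predictor are all assumptions made for illustration. The sketch shows why a model that predicts a single class for every patient scores an MCC of zero, which is consistent with the near-zero MCC reported for the 3D-CNN.

```python
import math
import random
import statistics
from collections import Counter, defaultdict


def stratified_folds(labels, k=5, seed=0):
    """Split sample indices into k folds with similar class proportions."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, y in enumerate(labels):
        by_class[y].append(idx)
    folds = [[] for _ in range(k)]
    offset = 0  # continue the round-robin across classes to balance fold sizes
    for y in sorted(by_class):
        idxs = by_class[y]
        rng.shuffle(idxs)
        for j, idx in enumerate(idxs):
            folds[(offset + j) % k].append(idx)
        offset += len(idxs)
    return folds


def multiclass_mcc(y_true, y_pred):
    """Multi-class Matthews correlation coefficient (Gorodkin's R_K form)."""
    s = len(y_true)                                   # total samples
    c = sum(t == p for t, p in zip(y_true, y_pred))   # correctly classified
    t_k = Counter(y_true)                             # true count per class
    p_k = Counter(y_pred)                             # predicted count per class
    classes = set(t_k) | set(p_k)
    num = c * s - sum(t_k[k] * p_k[k] for k in classes)
    den = math.sqrt(
        (s ** 2 - sum(v ** 2 for v in p_k.values()))
        * (s ** 2 - sum(v ** 2 for v in t_k.values()))
    )
    # Convention assumed here: return 0.0 when the denominator vanishes,
    # which happens when every prediction falls in a single class.
    return num / den if den else 0.0


def summarize(values):
    """Mean and sample standard deviation across folds."""
    return statistics.mean(values), statistics.stdev(values)


# Hypothetical labels: 108 patients, three grades (distribution is made up).
labels = [0] * 30 + [1] * 48 + [2] * 30
folds = stratified_folds(labels, k=5, seed=0)

acc_scores, mcc_scores = [], []
for test_idx in folds:
    held_out = set(test_idx)
    train_labels = [y for i, y in enumerate(labels) if i not in held_out]
    # A "collapsed" model: always predict the majority class of the training fold.
    majority = Counter(train_labels).most_common(1)[0][0]
    y_true = [labels[i] for i in test_idx]
    y_pred = [majority] * len(y_true)
    acc_scores.append(sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true))
    mcc_scores.append(multiclass_mcc(y_true, y_pred))

acc_mean, acc_sd = summarize(acc_scores)
mcc_mean, mcc_sd = summarize(mcc_scores)
print(f"Accuracy: {acc_mean:.3f} ± {acc_sd:.3f}")
print(f"MCC:      {mcc_mean:.3f} ± {mcc_sd:.3f}")
```

Under these assumptions the collapsed predictor still reaches a non-trivial accuracy, because the majority class is common, while its MCC is exactly zero in every fold. This is one reason to report MCC alongside accuracy in imbalanced multi-class settings.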

Date: 2026

Downloads: (external link)
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0346251 (text/html)
https://journals.plos.org/plosone/article/file?id= ... 46251&type=printable (application/pdf)

Persistent link: https://EconPapers.repec.org/RePEc:plo:pone00:0346251

DOI: 10.1371/journal.pone.0346251
