Predictive modeling of tax compliance risks: A comparative study of machine learning approaches
Lan Yang
PLOS ONE, 2025, vol. 20, issue 9, 1-19
Abstract:
Modern enterprises grapple with complex financial data and multidimensional risk interdependencies in their operations. Machine learning offers transformative potential for tax risk assessment and smart auditing solutions. This research analyzes 3,232 tax records from regional manufacturing and service sectors (2021–2023) to evaluate three predictive models: SVM, XGBoost, and Random Forest. Results demonstrate Random Forest’s superior performance, achieving 92.00% (manufacturing) and 93.39% (service) accuracy – substantially outperforming XGBoost and SVM (85–90%). Key manufacturing risk indicators follow a “high tax-high volatility-high scrutiny” pattern, with tax burden rate (0.129 weight), profit fluctuation (0.100), and audit frequency (0.091) being most predictive. Service sector risks manifest as “volatility-declaration-tax burden” dynamics, where profit volatility (0.142) emerges as the strongest predictor. These findings both validate machine learning’s efficacy in tax analysis and equip regulators with intelligent risk management tools.
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0331715 (text/html)
https://journals.plos.org/plosone/article/file?id= ... 31715&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pone00:0331715
DOI: 10.1371/journal.pone.0331715
Access Statistics for this article
More articles in PLOS ONE from Public Library of Science
Bibliographic data for series maintained by plosone ().