A three-stage fusion model for predicting financial distress considering semantic and sentiment information
Risk Staff,
Jiaming Liu and
Bo Yuan
Journal of Risk
Abstract:
In recent years the role of analyzing the management discussion and analysis (MD&A) text of listed companies in financial distress prediction models has gradually gained attention. This paper, by integrating text analysis and machine learning techniques, reveals the financial information hidden in MD&A text and accurately captures the emotional tendency of the text through a sentiment analysis lexicon, providing a more comprehensive and detailed method for predicting a company’s financial condition. This study explores the effect of integrating financial features, semantic features and sentiment features on the ability to predict the financial distress of listed companies. To do this we propose an innovative three-phase fusion model. First, semantic features are extracted from the MD&A sections of the annual reports of listed companies using deep learning techniques, and sentiment features are derived from the MD&A text content based on a sentiment dictionary. Then, initial prediction models are constructed separately based on financial, semantic and sentiment features. Finally, by introducing a stacking ensemble strategy, a heterogeneous stacking model is constructed by integrating these models to improve prediction accuracy. The research results indicate that financial features play a critical role in prediction models, having a decisive impact on prediction accuracy. The introduction of semantic and sentiment features significantly enhances the model’s predictive performance. Further, by comparing the application of different algorithms (naive Bayes, random forest, extreme gradient boosting, logistic regression and ridge regression) in the model, we find that the adoption of a heterogeneous stacking model not only enhances the overall prediction accuracy but also improves the model’s generalizability.
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.risk.net/node/7961700 (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:rsk:journ4:7961700
Access Statistics for this article
More articles in Journal of Risk from Journal of Risk
Bibliographic data for series maintained by Thomas Paine ().