EconPapers    
Economics at your fingertips  
 

A three-stage fusion model for predicting financial distress considering semantic and sentiment information

Risk Staff, Jiaming Liu and Bo Yuan

Journal of Risk

Abstract: In recent years the role of analyzing the management discussion and analysis (MD&A) text of listed companies in financial distress prediction models has gradually gained attention. This paper, by integrating text analysis and machine learning techniques, reveals the financial information hidden in MD&A text and accurately captures the emotional tendency of the text through a sentiment analysis lexicon, providing a more comprehensive and detailed method for predicting a company’s financial condition. This study explores the effect of integrating financial features, semantic features and sentiment features on the ability to predict the financial distress of listed companies. To do this we propose an innovative three-phase fusion model. First, semantic features are extracted from the MD&A sections of the annual reports of listed companies using deep learning techniques, and sentiment features are derived from the MD&A text content based on a sentiment dictionary. Then, initial prediction models are constructed separately based on financial, semantic and sentiment features. Finally, by introducing a stacking ensemble strategy, a heterogeneous stacking model is constructed by integrating these models to improve prediction accuracy. The research results indicate that financial features play a critical role in prediction models, having a decisive impact on prediction accuracy. The introduction of semantic and sentiment features significantly enhances the model’s predictive performance. Further, by comparing the application of different algorithms (naive Bayes, random forest, extreme gradient boosting, logistic regression and ridge regression) in the model, we find that the adoption of a heterogeneous stacking model not only enhances the overall prediction accuracy but also improves the model’s generalizability.

References: Add references at CitEc
Citations:

Downloads: (external link)
https://www.risk.net/node/7961700 (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:rsk:journ4:7961700

Access Statistics for this article

More articles in Journal of Risk from Journal of Risk
Bibliographic data for series maintained by Thomas Paine ().

 
Page updated 2025-07-02
Handle: RePEc:rsk:journ4:7961700