Quantification and identification of authorial writing style through higher-order text network modeling and analysis

Hongzhong Deng, Chengxing Wu, Bingfeng Ge and Hongqian Wu

Journal of Informetrics, 2025, vol. 19, issue 1

Abstract: Determining the true author of anonymized texts has important applications ranging from text classification and information extraction to forensic investigations. Despite substantial progress, current authorship identification methods extract only simple semantic relationships from writing style and do not consider higher-order features among multiple words, phrases, or sentences in language structure. Here, we propose a novel approach based on hypernetwork theory that encodes higher-order text features into a unified text hyper-network, and we investigate whether the higher-order topological features of this hyper-network help reveal an author's stylistic preferences. Our results indicate that metrics of the text hyper-network, such as hyperdegree, average shortest path length, and intermittency, can capture more information about an author's writing style. More importantly, in an author identification task on 170 novels, our method correctly identified the authorship of 81% of the novels, surpassing the accuracy of a method based on pairwise word relationships. This further highlights the importance of higher-order features in text analysis, beyond mere pairwise interactions of words.
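
To illustrate the contrast between higher-order and pairwise encodings that the abstract draws, here is a minimal Python sketch. The abstract does not say how the text hyper-network is constructed, so this sketch assumes, purely for illustration, that each sentence forms one hyperedge over the distinct words it contains. The function names (sentence_hyperedges, hyperdegree, pairwise_edges) and the toy text are hypothetical and do not come from the paper.

```python
import re
from collections import Counter
from itertools import combinations


def sentence_hyperedges(text):
    """Assumption: one hyperedge per sentence, holding its distinct lowercase words."""
    edges = []
    for sentence in re.split(r"[.!?]+", text):
        words = frozenset(re.findall(r"[a-z']+", sentence.lower()))
        if len(words) >= 2:  # skip empty or single-word fragments
            edges.append(words)
    return edges


def hyperdegree(edges):
    """Hyperdegree of a word = number of hyperedges that contain it."""
    degree = Counter()
    for edge in edges:
        degree.update(edge)
    return degree


def pairwise_edges(edges):
    """Pairwise baseline: expand every hyperedge into its unordered word pairs."""
    pairs = set()
    for edge in edges:
        pairs.update(frozenset(p) for p in combinations(sorted(edge), 2))
    return pairs


# Toy text, invented for this example only.
text = "The cat sat. The cat ran away. A dog ran."
edges = sentence_hyperedges(text)
degree = hyperdegree(edges)
pairs = pairwise_edges(edges)

print(len(edges), "hyperedges;", len(pairs), "pairwise edges")
print(degree.most_common(3))
```

In the pairwise expansion, the information that "the", "cat", "ran", and "away" occurred together in a single sentence is no longer recoverable from the edge set, whereas the hyperedge list keeps it. This is one simple way to see why group-level structure might carry additional stylistic signal.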

Keywords: Complex networks; Hypergraph; Text hyper-networks; Text analysis; Authorship identification
Date: 2025
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S1751157724001159
Full text for ScienceDirect subscribers only

Persistent link: https://EconPapers.repec.org/RePEc:eee:infome:v:19:y:2025:i:1:s1751157724001159

DOI: 10.1016/j.joi.2024.101603

Journal of Informetrics is currently edited by Leo Egghe

Bibliographic data for series maintained by Catherine Liu.

 
Page updated 2025-03-24
Handle: RePEc:eee:infome:v:19:y:2025:i:1:s1751157724001159