Quantification and identification of authorial writing style through higher-order text network modeling and analysis
Hongzhong Deng,
Chengxing Wu,
Bingfeng Ge and
Hongqian Wu
Journal of Informetrics, 2025, vol. 19, issue 1
Abstract:
Determining the true author of anonymized texts has important applications ranging from text classification and information extraction to forensic investigations. Despite substantial progress, current authorship identification solutions are limited to extracting straightforward semantic relationships in writing styles, lacking consideration for higher-order features among multiple vocabulary, phrases, or sentences in language structure. Here, we propose a novel approach based on hypernetwork theory to encode higher-order text features into a unified text hyper-network and investigate whether the hyper-order topological features of the text hyper-network contribute to revealing the author's stylistic preferences. Our results indicate that metrics of the text hyper-network, such as hyperdegree, average shortest path length, and intermittency, can capture more information about the author's writing styles. More importantly, in the author identification task of 170 novels, our method accurately distinguished the authorship of 81% of the novels, surpassing the accuracy of the method of using paired word relationships. This further highlights the importance of higher-order features in text analysis, beyond mere pairwise interactions of words.
Keywords: Complex networks; Hypergraph; Text hyper-networks; Text analysis; Authorship identification (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S1751157724001159
Full text for ScienceDirect subscribers only
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eee:infome:v:19:y:2025:i:1:s1751157724001159
DOI: 10.1016/j.joi.2024.101603
Access Statistics for this article
Journal of Informetrics is currently edited by Leo Egghe
More articles in Journal of Informetrics from Elsevier
Bibliographic data for series maintained by Catherine Liu ().