Author Identification Using Chaos Game Representation and Deep Learning
Catalin Stoean and
Daniel Lichtblau
Additional contact information
Catalin Stoean: Human Language Technology Research Center, University of Bucharest, 010014 Bucharest, Romania
Daniel Lichtblau: Wolfram Research, Champaign, IL 61820, USA
Mathematics, 2020, vol. 8, issue 11, 1-18
Abstract:
An author unconsciously encodes in the written text a certain style that is often difficult to recognize. Still, there are many computational means developed for this purpose that take into account various features, from lexical and character-based attributes to syntactic or semantic ones. We propose an approach that starts from the character level and uses chaos game representation to illustrate documents like images which are subsequently classified by a deep learning algorithm. The experiments are made on three data sets and the outputs are comparable to the results from the literature. The study also verifies the suitability of the method for small data sets and whether image augmentation can improve the classification efficiency.
Keywords: authorship attribution; chaos game representation; deep learning (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2020
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2227-7390/8/11/1933/pdf (application/pdf)
https://www.mdpi.com/2227-7390/8/11/1933/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:8:y:2020:i:11:p:1933-:d:438970
Access Statistics for this article
Mathematics is currently edited by Ms. Emma He
More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().