A3C: Albanian Authorship Attribution Corpus

Arta Misini, Arbana Kadriu and Ercan Canhasi
Additional contact information
Arta Misini: South East European University
Arbana Kadriu: South East European University
Ercan Canhasi: University “Ukshin Hoti” Prizren

A chapter in Economic Recovery, Consolidation, and Sustainable Growth, 2023, pp 755-763 from Springer

Abstract: Authorship attribution (AA) examines authors' previous works to identify the writer of a given text. The primary objective of this study is to compile an Albanian corpus that will aid AA research. The novel corpus consists of newsroom columns scraped from online sources. We conduct experiments using two machine learning (ML) algorithms: the multinomial naive Bayes (MNB) and support vector machine (SVM) classifiers. Both models take TF-IDF feature vectors as input. The SVM classifier outperformed the MNB classifier. The results indicate that the corpus is well suited to the AA task.
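
As a rough illustration of the pipeline the abstract describes (TF-IDF features fed to a classifier), the following is a minimal standard-library sketch of TF-IDF weighting combined with a multinomial naive Bayes classifier. It is not the authors' implementation. The toy English sentences, the author labels, the whitespace tokenizer, the smoothed IDF formula, and every function and class name are assumptions made for illustration. The SVM classifier mentioned in the abstract is omitted here.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Assumption: lowercase whitespace tokenization; the chapter's
    # actual preprocessing is not stated in the abstract.
    return text.lower().split()


def fit_idf(token_lists):
    # Smoothed inverse document frequency: ln((1 + n) / (1 + df)) + 1.
    n = len(token_lists)
    df = Counter()
    for tokens in token_lists:
        df.update(set(tokens))
    return {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}


def tfidf(tokens, idf):
    # Sparse TF-IDF vector; terms unseen during fitting are dropped.
    tf = Counter(t for t in tokens if t in idf)
    return {t: c * idf[t] for t, c in tf.items()}


class MultinomialNB:
    """Multinomial naive Bayes over sparse, non-negative feature weights."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha  # additive (Laplace) smoothing

    def fit(self, vectors, labels):
        vocab = sorted({t for v in vectors for t in v})
        self.classes = sorted(set(labels))
        counts = Counter(labels)
        self.log_prior = {
            c: math.log(counts[c] / len(labels)) for c in self.classes
        }
        # Sum feature weights per class.
        weight = {c: defaultdict(float) for c in self.classes}
        for v, y in zip(vectors, labels):
            for t, w in v.items():
                weight[y][t] += w
        # Smoothed per-class log-likelihood of each term.
        self.log_like = {}
        for c in self.classes:
            total = sum(weight[c].values()) + self.alpha * len(vocab)
            self.log_like[c] = {
                t: math.log((weight[c][t] + self.alpha) / total)
                for t in vocab
            }
        return self

    def predict(self, vector):
        def score(c):
            return self.log_prior[c] + sum(
                w * self.log_like[c][t] for t, w in vector.items()
            )
        return max(self.classes, key=score)


# Toy training data (hypothetical; NOT taken from the A3C corpus).
train = [
    ("the economy needs bold reform and lower taxes", "author_a"),
    ("taxes and reform dominate the economy debate", "author_a"),
    ("the football match ended with a late goal", "author_b"),
    ("a late goal decided the football derby", "author_b"),
]

token_lists = [tokenize(text) for text, _ in train]
idf = fit_idf(token_lists)
model = MultinomialNB().fit(
    [tfidf(tokens, idf) for tokens in token_lists],
    [label for _, label in train],
)


def predict_author(text):
    # Vectorize an unseen text with the fitted IDF, then classify it.
    return model.predict(tfidf(tokenize(text), idf))


if __name__ == "__main__":
    print(predict_author("reform of taxes will help the economy"))
    print(predict_author("the derby had a goal in the football match"))
```

In practice a library such as scikit-learn would typically replace the hand-written vectorizer and classifier; the sketch only makes the data flow from raw text to a predicted author explicit.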

Keywords: Authorship attribution; Corpus; Feature vector; ML classifier; Albanian
Date: 2023

Persistent link: https://EconPapers.repec.org/RePEc:spr:prbchp:978-3-031-42511-0_49

Ordering information: This item can be ordered from
http://www.springer.com/9783031425110

DOI: 10.1007/978-3-031-42511-0_49

More chapters in Springer Proceedings in Business and Economics from Springer
Bibliographic data for series maintained by Sonal Shukla and Springer Nature Abstracting and Indexing.

 
Page updated 2025-04-01
Handle: RePEc:spr:prbchp:978-3-031-42511-0_49