EconPapers    
Economics at your fingertips  
 

Topic Classification for Short Texts

Dan Claudiu Neagu (), Andrei Bogdan Rus (), Mihai Grec (), Mihai Boroianu () and Gheorghe Cosmin Silaghi ()
Additional contact information
Dan Claudiu Neagu: Cicada Technologies
Andrei Bogdan Rus: Cicada Technologies
Mihai Grec: Cicada Technologies
Mihai Boroianu: Cicada Technologies
Gheorghe Cosmin Silaghi: Babes-Bolyai University

A chapter in Advances in Information Systems Development, 2023, pp 207-222 from Springer

Abstract: Abstract In the context of TV and social media surveillance, constructing models to automate topic identification of short texts is a key task. This paper constructs worth-to-consider models for practical usage, employing Top-K multinomial classification methodology. We describe the full data processing pipeline, discussing about dataset selection, text preprocessing, feature extraction, model selection and learning, including hyperparameter optimization. We will test and compare popular methods including: standard machine learning, deep learning, and a fine-tuned BERT for topic classification.

Date: 2023
References: Add references at CitEc
Citations:

There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:lnichp:978-3-031-32418-5_12

Ordering information: This item can be ordered from
http://www.springer.com/9783031324185

DOI: 10.1007/978-3-031-32418-5_12

Access Statistics for this chapter

More chapters in Lecture Notes in Information Systems and Organization from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-04-01
Handle: RePEc:spr:lnichp:978-3-031-32418-5_12