EconPapers    
Economics at your fingertips  
 

Leveraging Text Classification by Co-training with Bidirectional Language Models – A Novel Hybrid Approach and Its Application for a German Bank

Roland Graef ()
Additional contact information
Roland Graef: University of Ulm

A chapter in Innovation Through Information Systems, 2021, pp 216-231 from Springer

Abstract: Abstract Labeling training data constitutes the largest bottleneck for machine learning projects. In particular, text classification via machine learning is widely applied and investigated. Hence, companies have to label a decent amount of texts manually in order to build appropriate text classifiers. Obviously, labeling texts manually is associated with time and expenses. Against this background, research started to develop approaches exploiting the knowledge contained in unlabeled texts by learning sophisticated text representations or labeling some of the texts in an automated manner. However, there is still a lack of integrated approaches, considering both types of approaches to further reduce time and expenses for labeling texts. To address this problem, we propose a new hybrid text classification approach combining recent text representations and automated labeling approaches in an integrated perspective. We demonstrate and evaluate our approach using the case of a German bank where the approach could be applied successfully.

Keywords: Machine learning; Text classification; Co-training; Bidirectional Long Short-Term Memory Networks (search for similar items in EconPapers)
Date: 2021
References: Add references at CitEc
Citations:

There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:lnichp:978-3-030-86797-3_15

Ordering information: This item can be ordered from
http://www.springer.com/9783030867973

DOI: 10.1007/978-3-030-86797-3_15

Access Statistics for this chapter

More chapters in Lecture Notes in Information Systems and Organization from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-06-15
Handle: RePEc:spr:lnichp:978-3-030-86797-3_15