EconPapers    
Economics at your fingertips  
 

Detecting Outliers with Semi-Supervised Machine Learning: A Fraud Prediction Application

Sebastián Palacio ()

No XREAP2018-2, Working Papers from Xarxa de Referència en Economia Aplicada (XREAP)

Abstract: Abnormal pattern prediction has received a great deal of attention from both academia and industry, with applications that range from fraud, terrorism and intrusion detection to sensor events, medical diagnoses, weather patterns, etc. In practice, most abnormal pattern prediction problems are characterized by the presence of a small number of labeled data and a huge number of unlabeled data. While this points most obviously to the adoption of a semi-supervised approach, most empirical studies have opted for a simplification and treated it as a supervised problem, resulting in a severe bias of false negatives. In this paper, we propose an innovative methodology based on semi-supervised techniques and introduce a new metric the Cluster-Score for abnormal homogeneity measurement. Specifically, the methodology involves transmuting unsupervised models to supervised models using the Cluster-Score metric, which defines the objective boundaries between clusters and evaluates the homogeneity of the abnormalities in the cluster construction. We apply this methodology to a problem of fraud detection among property insurance claims. The objectives are to increase the number of fraudulent claims detected and to reduce the proportion of claims investigated that are, in fact, non-fraudulent. The results from applying our methodology considerably improved these objectives.

Keywords: Outlier Detection; Semi-Supervised Models; Fraud; Cluster; Insurance (search for similar items in EconPapers)
Pages: 33 pages
New Economics Papers: this item is included in nep-big and nep-cmp
References: Add references at CitEc
Citations:

Downloads: (external link)
http://www.xreap.cat/RePEc/xrp/pdf/XREAP2018-02.pdf First version, 2018 (application/pdf)
http://www.xreap.cat/RePEc/xrp/pdf/XREAP2018-02.pdf Revised version, 2018 (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:xrp:wpaper:xreap2018-2

Access Statistics for this paper

More papers in Working Papers from Xarxa de Referència en Economia Aplicada (XREAP) Contact information at EDIRC.
Bibliographic data for series maintained by XREAP ( this e-mail address is bad, please contact ).

 
Page updated 2025-03-22
Handle: RePEc:xrp:wpaper:xreap2018-2