Exposing model bias in machine learning: revisiting the boy who cried wolf in the context of phishing detection

Duan C.J. (Chaojie) and Anuj Gaurav

Journal of Business Analytics, 2021, vol. 4, issue 2, 171-178

Abstract: Grown out of the quest for artificial intelligence (AI), machine learning (ML) is today’s most active field across disciplines, with a sharp increase in applications ranging from criminology to fraud detection and to biometrics. ML and statistics both emphasise model estimation/training and thus share the inescapable Type I and Type II errors. Extending the concept of statistical errors into the domain of ML, we devise a ground-breaking pH scale-like ratio and intend it as a litmus test indicator of ML model bias completely masked by the popular performance criterion of accuracy. Using a publicly available phishing dataset, we conduct experiments on a series of classification models and consequently unravel the significant cost implications of models with varying levels of bias. Based on these results, we recommend practitioners exercise human judgement and match their own risk tolerance profile with the bias ratio associated with each ML model in order to guard against potential unintended adverse effects.
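
Illustration (not taken from the article): the abstract does not define the authors' ratio, so the sketch below only demonstrates the general point that accuracy can hide the balance between Type I errors (false positives) and Type II errors (false negatives). The function `log_error_ratio` and the toy labels are hypothetical; a base-10 logarithm of FP/FN is assumed here purely because it gives a signed, pH-like scale centred on zero.

```python
import math


def confusion_counts(y_true, y_pred):
    """Return (tp, fp, tn, fn) for binary labels, where 1 means 'phishing'."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return tp, fp, tn, fn


def accuracy(tp, fp, tn, fn):
    """Share of all predictions that are correct."""
    return (tp + tn) / (tp + fp + tn + fn)


def log_error_ratio(fp, fn):
    """Hypothetical indicator (an assumption, not the paper's formula).

    log10(FP / FN): positive when the model raises too many false alarms,
    negative when it misses too many attacks, zero when the two balance.
    """
    if fp <= 0 or fn <= 0:
        raise ValueError("both error counts must be positive for a log ratio")
    return math.log10(fp / fn)


# Toy data: 50 phishing sites (1) followed by 50 legitimate sites (0).
y_true = [1] * 50 + [0] * 50
# Model A misses 2 attacks and raises 8 false alarms.
pred_a = [0] * 2 + [1] * 48 + [1] * 8 + [0] * 42
# Model B misses 8 attacks and raises 2 false alarms.
pred_b = [0] * 8 + [1] * 42 + [1] * 2 + [0] * 48

counts_a = confusion_counts(y_true, pred_a)
counts_b = confusion_counts(y_true, pred_b)

for name, (tp, fp, tn, fn) in (("A", counts_a), ("B", counts_b)):
    print(name, accuracy(tp, fp, tn, fn), round(log_error_ratio(fp, fn), 3))
```

Both toy models make the same number of mistakes and so have identical accuracy, yet the sign of the indicator differs: one errs toward false alarms, the other toward missed attacks. Which is preferable depends on the relative cost a practitioner assigns to each error type.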

Date: 2021

Downloads: (external link)
http://hdl.handle.net/10.1080/2573234X.2021.1934128 (text/html)
Access to full text is restricted to subscribers.

Persistent link: https://EconPapers.repec.org/RePEc:taf:tjbaxx:v:4:y:2021:i:2:p:171-178

Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/tjba20

DOI: 10.1080/2573234X.2021.1934128

Journal of Business Analytics is currently edited by Dursun Delen

Bibliographic data for series maintained by Chris Longhurst.

 
Page updated 2025-03-20
Handle: RePEc:taf:tjbaxx:v:4:y:2021:i:2:p:171-178