EconPapers    
Economics at your fingertips  
 

A Regression Model for Predicting the Likelihood of Reporting a Crime Based on the Victim’s Demographic Variables and Their Perceptions Towards the Police

Pulenyane Malebogo () and Montshiwa Tlhalitshi Volition ()
Additional contact information
Pulenyane Malebogo: Department of Business Statistics & Operations Research, North West University, Mahikeng, South Africa
Montshiwa Tlhalitshi Volition: Department of Business Statistics & Operations Research, North West University, Mahikeng, South Africa

Statistics, Politics and Policy, 2020, vol. 11, issue 2, 167-193

Abstract: Despite the growing criminal activities in South Africa, many victims still do not report the crimes, therefore there was a need to understand the determinants of the likelihood of reporting a crime in the country. Binary logistic regression is a supervised machine learning algorithm that can assist in predicting the likelihood of reporting a crime but the selection of relevant variables to add in the model varies from one author to the other. Selection of theoretically sound and statistically relevant independent variables is key to achieving parsimonious multivariate models. This study sought to test the efficiency of some commonly used variable selection methods for logistic regression models in order to identify the most relevant determinants of the likelihood of reporting a crime of housebreaking. The study used 17 candidate variables such as the victims’ demographic variables and their perceptions on the police. The multivariate model fitted using stepwise selection was found to be a best fit for the data based on the lowest AIC, the highest classification accuracy rate and the highest Area under the Receiver Operating Characteristic curve. The model fitted using the Hosmer-Lemeshow (H-L) algorithm was the worst fit for the data. The study revealed a limitation of the stepwise selection method which is that this method may select different independent variables for each unique set of randomly selected observations of the same dataset. The study established a multivariate logistic regression model to predict the likelihood of a victim reporting a crime of housebreaking and the determinants thereof.

Keywords: crime; housebreaking; logistic regression; reporting (search for similar items in EconPapers)
Date: 2020
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
https://doi.org/10.1515/spp-2020-0003 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:bpj:statpp:v:11:y:2020:i:2:p:167-193:n:4

Ordering information: This journal article can be ordered from
https://www.degruyter.com/journal/key/spp/html

DOI: 10.1515/spp-2020-0003

Access Statistics for this article

Statistics, Politics and Policy is currently edited by Joel A. Middleton

More articles in Statistics, Politics and Policy from De Gruyter
Bibliographic data for series maintained by Peter Golla ().

 
Page updated 2025-03-19
Handle: RePEc:bpj:statpp:v:11:y:2020:i:2:p:167-193:n:4