
Using Tree Ensembles to Analyze National Baseball Hall of Fame Voting Patterns: An Application to Discrimination in BBWAA Voting

Brian Mills and Steven Salaga
Additional contact information
Steven Salaga: University of Michigan

Journal of Quantitative Analysis in Sports, 2011, vol. 7, issue 4, 32

Abstract: We predict the induction of Major League Baseball hitters and pitchers into the National Baseball Hall of Fame by the Baseball Writers' Association of America (BBWAA). We employ a Random Forest algorithm for binary classification, improving upon past models with a simplistic input approach. Our results suggest that the random forest technique is a fruitful line of research for prediction in the sports world. We find an error rate as low as 0.91% in our most accurate forest, with no out-of-bag error higher than 2.6% in any tree ensemble. We extend the results to an examination of the possibility of discrimination with respect to BBWAA voting, finding little evidence for exclusions based on race.
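
For readers unfamiliar with the out-of-bag (OOB) error mentioned in the abstract, the following is a minimal sketch of how such an estimate can be computed. It is not the authors' model or data: the data are synthetic, the function names (`make_data`, `fit_stump`, `oob_error`) are hypothetical, and one-split decision stumps stand in for the full trees of a random forest so that the example needs only the Python standard library.

```python
import random

def make_data(n, rng):
    """Synthetic, made-up data: two numeric features and a binary label."""
    rows = []
    for _ in range(n):
        x = [rng.gauss(0, 1), rng.gauss(0, 1)]
        y = 1 if x[0] + 0.5 * x[1] + rng.gauss(0, 0.3) > 1.0 else 0
        rows.append((x, y))
    return rows

def fit_stump(sample, rng):
    """Fit a one-split tree on one randomly chosen feature."""
    f = rng.randrange(len(sample[0][0]))
    best = None
    for x, _ in sample:
        t = x[f]
        for sign in (1, -1):  # sign=1 predicts class 1 above the threshold
            errs = sum(
                (1 if sign * (xi[f] - t) > 0 else 0) != yi for xi, yi in sample
            )
            if best is None or errs < best[0]:
                best = (errs, f, t, sign)
    return best[1:]  # (feature index, threshold, sign)

def predict_stump(stump, x):
    f, t, sign = stump
    return 1 if sign * (x[f] - t) > 0 else 0

def oob_error(data, n_trees, rng):
    """Misclassification rate using, for each row, only trees that never saw it."""
    n = len(data)
    votes = [[0, 0] for _ in range(n)]  # per-row OOB votes for class 0 and 1
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]  # bootstrap sample
        in_bag = set(idx)
        stump = fit_stump([data[i] for i in idx], rng)
        for i in range(n):
            if i not in in_bag:  # row i is out-of-bag for this tree
                votes[i][predict_stump(stump, data[i][0])] += 1
    # Score only rows that received at least one OOB vote; ties go to class 0.
    scored = [(v, data[i][1]) for i, v in enumerate(votes) if sum(v) > 0]
    wrong = sum((1 if v[1] > v[0] else 0) != y for v, y in scored)
    return wrong / len(scored)

rng = random.Random(0)
data = make_data(200, rng)
err = oob_error(data, 50, rng)
print(f"OOB error: {err:.3f}")
```

Because each bootstrap sample leaves out roughly a third of the rows, every observation is scored by trees that were not trained on it, which is why an OOB error can serve as a built-in estimate of out-of-sample error without a separate test set.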

Keywords: hall of fame; random forest; classification; prediction; baseball
Date: 2011

Downloads: (external link)
https://doi.org/10.2202/1559-0410.1367 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.


Persistent link: https://EconPapers.repec.org/RePEc:bpj:jqsprt:v:7:y:2011:i:4:n:12

Ordering information: This journal article can be ordered from
https://www.degruyter.com/journal/key/jqas/html

DOI: 10.2202/1559-0410.1367


Journal of Quantitative Analysis in Sports is currently edited by Mark Glickman

Bibliographic data for series maintained by Peter Golla.

 
Page updated 2025-03-22
Handle: RePEc:bpj:jqsprt:v:7:y:2011:i:4:n:12