EconPapers    
Economics at your fingertips  
 

Clustering of football players based on performance data and aggregated clustering validity indexes

Akhanli Serhat Emre () and Christian Hennig
Additional contact information
Akhanli Serhat Emre: Department of Statistics, Muğla Sıtkı Koçman University, Muğla, Türkiye

Journal of Quantitative Analysis in Sports, 2023, vol. 19, issue 2, 103-123

Abstract: We analyse football (soccer) player performance data with mixed type variables from the 2014-15 season of eight European major leagues. We cluster these data based on a tailor-made dissimilarity measure. In order to decide between the many available clustering methods and to choose an appropriate number of clusters, we use the approach by Akhanli and Hennig (2020. “Comparing Clusterings and Numbers of Clusters by Aggregation of Calibrated Clustering Validity Indexes.” Statistics and Computing 30 (5): 1523–44). This is based on several validation criteria that refer to different desirable characteristics of a clustering. These characteristics are chosen based on the aim of clustering, and this allows to define a suitable validation index as weighted average of calibrated individual indexes measuring the desirable features. We derive two different clusterings. The first one is a partition of the data set into major groups of essentially different players, which can be used for the analysis of a team’s composition. The second one divides the data set into many small clusters (with 10 players on average), which can be used for finding players with a very similar profile to a given player. It is discussed in depth what characteristics are desirable for these clusterings. Weighting the criteria for the second clustering is informed by a survey of football experts.

Keywords: a large number of clusters; calibrated indexes; cluster analysis; clustering validity indexes; football data (search for similar items in EconPapers)
Date: 2023
References: Add references at CitEc
Citations:

Downloads: (external link)
https://doi.org/10.1515/jqas-2022-0037 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:bpj:jqsprt:v:19:y:2023:i:2:p:103-123:n:3

Ordering information: This journal article can be ordered from
https://www.degruyter.com/journal/key/jqas/html

DOI: 10.1515/jqas-2022-0037

Access Statistics for this article

Journal of Quantitative Analysis in Sports is currently edited by Mark Glickman

More articles in Journal of Quantitative Analysis in Sports from De Gruyter
Bibliographic data for series maintained by Peter Golla ().

 
Page updated 2025-04-09
Handle: RePEc:bpj:jqsprt:v:19:y:2023:i:2:p:103-123:n:3