EconPapers    
Economics at your fingertips  
 

An Open and Data-driven Taxonomy of Skills Extracted from Online Job Adverts

Jyldyz Djumalieva1 () and Cath Sleeman ()

Economic Statistics Centre of Excellence (ESCoE) Discussion Papers from Economic Statistics Centre of Excellence (ESCoE)

Abstract: In this work we offer an open and data-driven skills taxonomy, which is independent of ESCO and O*NET, two popular available taxonomies that are expert-derived. Since the taxonomy is created in an algorithmic way without expert elicitation, it can be quickly updated to reflect changes in labour demand and provide timely insights to support labour market decision-making. Our proposed taxonomy also captures links between skills, aggregated job titles, and the salaries mentioned in the millions of UK job adverts used in this analysis. To generate the taxonomy, we employ machine learning methods, such as word embeddings, network community detection algorithms and consensus clustering. We model skills as a graph with individual skills as vertices and their co-occurrences in job adverts as edges. The strength of the relationships between the skills is measured using both the frequency of actual co-occurrences of skills in the same advert as well as their shared context, based on a trained word embeddings model. Once skills are represented as a network, we hierarchically group them into clusters. To ensure the stability of the resulting clusters, we introduce bootstrapping and consensus clustering stages into the methodology. While we share initial results and describe the skill clusters, the main purpose of this paper is to outline the methodology for building the taxonomy.

Keywords: Skills; Skills taxonomy; Labour demand; Online job adverts; Big data; Machine learning; Word embeddings (search for similar items in EconPapers)
JEL-codes: C18 C38 J23 J24 (search for similar items in EconPapers)
Date: 2018-08
New Economics Papers: this item is included in nep-big, nep-lma and nep-pay
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (12)

Downloads: (external link)
https://escoe-website.s3.amazonaws.com/wp-content/ ... ESCoE-DP-2018-13.pdf

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:nsr:escoed:escoe-dp-2018-13

Access Statistics for this paper

More papers in Economic Statistics Centre of Excellence (ESCoE) Discussion Papers from Economic Statistics Centre of Excellence (ESCoE) King's College London Strand London WC2R 2LS. Contact information at EDIRC.
Bibliographic data for series maintained by ESCoE Centre Manager ().

 
Page updated 2025-04-10
Handle: RePEc:nsr:escoed:escoe-dp-2018-13