VEUCTOR: Training and Selecting Best Vector Space Models from Online Job Ads for European Countries

Emilio Colombo, Simone D'Amico, Fabio Mercorio and Mario Mezzanzanica

No dis2601, DISEIS - Quaderni del Dipartimento di Economia internazionale, delle istituzioni e dello sviluppo from Università Cattolica del Sacro Cuore, Dipartimento di Economia internazionale, delle istituzioni e dello sviluppo (DISEIS)

Abstract: Over the last decade, word embeddings have allowed machines to represent words and sentences as vectors, enabling researchers to reason over text in tasks such as semantic similarity, contextual understanding, and machine translation. However, the synthesis of embeddings involves domain-specific parameters that affect semantic accuracy and contextual relevance, often leading to unpredictable biases and inconsistent comparisons. This issue is particularly relevant in labor market analysis, where different embeddings yield different results, making the selection of the most appropriate model a key step. This paper addresses these challenges by (i) proposing a methodology to train, select, and align vector space models for a target taxonomy, ensuring comparability across dimensions and languages; (ii) applying this approach to 4.5 million job ads in 28 languages, aligning country-specific embeddings using the ESCO taxonomy; (iii) generating over 3,000 models in 142 machine days and making the best-performing ones publicly available via VEUCTOR; and (iv) showing that model choice significantly affects labor market analysis, revealing substantial variations in occupational skill bundles across embeddings.

JEL-codes: C55 J63
Date: 2026
Downloads: (external link)
http://dipartimenti.unicatt.it/diseis-wp_2601.pdf (application/pdf)

Persistent link: https://EconPapers.repec.org/RePEc:dis:wpaper:dis2601

Bibliographic data for series maintained by Emilio Colombo.

 
Page updated 2026-04-07
Handle: RePEc:dis:wpaper:dis2601