Sesgo de datos en aplicaciones de aprendizaje automático: un estudio de caso de un modelo no supervisado para identificar el riesgo de corrupción en la contratación pública colombiana
Kevin Mojica
Additional contact information
Kevin Mojica: School of Government, Universidad de los Andes
Documentos de trabajo from Escuela de Gobierno - Universidad de los Andes
Abstract:
This study analyzes data bias in an unsupervised learning algorithm designed to identify the risk of corruption in public procurement of Colombia. The employed algorithm is a two-stage clustering model used to segment electronic contracts based on variables indicating corruption risk. The objective was to develop an early warning tool for corruption in the Programa de Alimentación Escolar (PAE) procurement, utilizing data from the Sistema Electrónico para la Contratación Pública (SECOP). Although the results demonstrate the potential of artificial intelligence algorithms for detecting corruption risks, they also reveal significant limitations in their practical implementation, attributable to data availability and quality deficiencies. Specifically, biases of representation, measurement, and omitted variables were identified, affecting the algorithm's reliability. The study provides a detailed analysis of these biases, assessing their impact on the algorithm's performance, and emphasizes the importance of recognizing and addressing biases during the development of such initiatives. Finally, recommendations are presented to improve the quality of data in SECOP, aiming to enhance the reliability and accuracy of these algorithms in future developments.
Keywords: data bias; unsupervised learning; Artificial Intelligence; public procurement; AI, fairness. (search for similar items in EconPapers)
Pages: 45
Date: 2025-03-13
References: Add references at CitEc
Citations:
Downloads: (external link)
https://gobierno.uniandes.edu.co/documento-de-trabajo-no-120/
https://gobierno.uniandes.edu.co/wp-content/uploads/DT_120.pdf
None
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:col:000547:022180
Access Statistics for this paper
More papers in Documentos de trabajo from Escuela de Gobierno - Universidad de los Andes Contact information at EDIRC.
Bibliographic data for series maintained by Alejandra Rojas Forero ().