Use of classification trees and rule-based models to optimize the funding assignment to research projects: A case study of UTPL
Roberto Fernandez Martinez,
Ruben Lostado Lorza,
Ana Alexandra Santos Delgado and
Nelson Piedra
Journal of Informetrics, 2021, vol. 15, issue 1
Abstract:
In the process of funding research projects, two important factors must be studied. First, experts judges the potential value of a project. Secondly, the research ability is judged by the applicants previous research activity. The most appropriate way to assign the appropriate amount of money to project proposals is always a difficult decision. This work focuses on the second factor based on classifying the researchers previous research activity on an automated logical classification (accepted, rejected) resolving conflicts of interests between administration and applicants and helping in the decision-making process. As the class in these kinds of studies is usually unbalanced, because there are fewer accepted projects than rejected projects, how the use of an imbalanced dataset or a balanced dataset affects to the models is investigated by using several resampling methods. Later, several trees and rule-based machine learning techniques are used to create classification models. This is based on information from the faculty members information of the “Technical Particular University of Loja (UTPL),” in cases, with balanced datasets and those with unbalanced datasets. Multivariate analysis, feature selection, algorithm parameter tuning and validation methods are used to achieve robust classification models. The most accurate results are obtained with a rules-based model and use of the C5.0 algorithm. As the latter provides acceptable accuracy, close to 95 % when predicting both classes and to 99 % when predicting the accepted projects class, both the methodology and final model are validated.
Keywords: Automated logical classification; Resampling methods; Funding assignment; Unbalanced dataset; Model validation methodology (search for similar items in EconPapers)
Date: 2021
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (2)
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S1751157720306246
Full text for ScienceDirect subscribers only
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eee:infome:v:15:y:2021:i:1:s1751157720306246
DOI: 10.1016/j.joi.2020.101107
Access Statistics for this article
Journal of Informetrics is currently edited by Leo Egghe
More articles in Journal of Informetrics from Elsevier
Bibliographic data for series maintained by Catherine Liu ().