Acceleration of Large-Scale DEA Computations Using Random Forest Classification
Anyu Yu (),
Yu Shi () and
Joe Zhu ()
Additional contact information
Anyu Yu: Zhejiang Gongshang University
Yu Shi: Worcester Polytechnic Institute
Joe Zhu: Worcester Polytechnic Institute
A chapter in Data-Enabled Analytics, 2021, pp 31-49 from Springer
Abstract:
Abstract With the prevalence of big data, traditional data envelopment analysis (DEA) faces the challenge of handling large-scale computations. The DEA computation time can increase significantly as the number of decision-making units grows. In this study, we propose a novel approach to accelerate DEA computations involving voluminous data. The proposed method uses random forest classification to predict and search for the best-practice DMUs within the large-scale observations. Since best-practice DMUs are always of a smaller quantity, and they can determine the efficiency scores of all the remaining DMUs, identifying best-practice DMUs first reduces the programming size and the consequent computation time of the DEA model. The proposed method is termed as the DEA-RF method, which combines DEA and machine learning methods to reduce computational cost. Next, we test the effectiveness of the proposed method using numerical cases involving large-scale data. After computing the DEA scores of the DMUs in both the observed and simulated samples, we find that the proposed DEA-RF method can decrease computation time significantly, while ensuring an acceptable level of accuracy. Additionally, the larger the sample size is, the more time the model can save. The proposed DEA-RF method proves to be an effective solution to the long computation time problem of DEA models under big data contexts.
Keywords: Data envelopment analysis; Data enabled analytics; Large-scale computations; Random forest; Big data; Best-practice DMU classification (search for similar items in EconPapers)
Date: 2021
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:isochp:978-3-030-75162-3_2
Ordering information: This item can be ordered from
http://www.springer.com/9783030751623
DOI: 10.1007/978-3-030-75162-3_2
Access Statistics for this chapter
More chapters in International Series in Operations Research & Management Science from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().