Acceleration of Large-Scale DEA Computations Using Random Forest Classification

Anyu Yu, Yu Shi and Joe Zhu
Additional contact information
Anyu Yu: Zhejiang Gongshang University
Yu Shi: Worcester Polytechnic Institute
Joe Zhu: Worcester Polytechnic Institute

A chapter in Data-Enabled Analytics, 2021, pp 31-49 from Springer

Abstract: With the prevalence of big data, traditional data envelopment analysis (DEA) faces the challenge of handling large-scale computations. DEA computation time can increase significantly as the number of decision-making units (DMUs) grows. In this study, we propose a novel approach to accelerate DEA computations involving voluminous data. The proposed method uses random forest classification to predict and search for the best-practice DMUs among the large-scale observations. Because best-practice DMUs are relatively few in number and determine the efficiency scores of all the remaining DMUs, identifying them first reduces the size of the linear programs and, consequently, the computation time of the DEA model. The proposed method, termed the DEA-RF method, combines DEA with machine learning to reduce computational cost. We test the effectiveness of the proposed method on numerical cases involving large-scale data. After computing the DEA scores of the DMUs in both the observed and simulated samples, we find that the DEA-RF method decreases computation time significantly while maintaining an acceptable level of accuracy. Moreover, the larger the sample size, the more time the method saves. The DEA-RF method thus proves to be an effective solution to the long computation times of DEA models in big data contexts.
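The reduction step described in the abstract can be illustrated with a minimal sketch. It is not the chapter's implementation: it assumes the simplest DEA setting (one input, one output, constant returns to scale), where the linear program collapses to a productivity ratio, and it replaces the random forest with a hard-coded, hypothetical list of predicted best-practice DMUs. The toy data, the function `ccr_scores`, and the "score above 1" check for missed frontier units are all assumptions made for illustration.

```python
from typing import List, Sequence, Tuple

DMU = Tuple[float, float]  # (input, output), both assumed strictly positive


def ccr_scores(dmus: Sequence[DMU], reference: Sequence[DMU]) -> List[float]:
    """Efficiency of each DMU relative to a reference set.

    With one input and one output under constant returns to scale, the DEA
    linear program reduces to a ratio: a DMU's productivity (output / input)
    divided by the best productivity found in the reference set.
    """
    best = max(y / x for x, y in reference)
    return [(y / x) / best for x, y in dmus]


# Toy data, invented for illustration only.
dmus = [(2.0, 4.0), (4.0, 6.0), (5.0, 5.0), (10.0, 20.0), (8.0, 4.0)]

# Baseline: every DMU is a potential reference for every other DMU.
full = ccr_scores(dmus, dmus)

# Hypothetical classifier output: indices predicted to be best-practice.
# In the chapter's setting a random forest would supply this; here it is fixed.
predicted_best = [0, 3]
reduced = ccr_scores(dmus, [dmus[i] for i in predicted_best])

# If the prediction misses the true frontier, some DMU scores above 1
# against the reduced reference set, which flags the missed units.
bad_guess = ccr_scores(dmus, [dmus[1]])
missed = [i for i, s in enumerate(bad_guess) if s > 1 + 1e-9]
```

When the predicted set contains the true best-practice DMUs, `reduced` matches `full` even though each score was computed against two reference units instead of five. With multiple inputs and outputs the same idea applies, but each score requires solving a linear program, which is where a smaller reference set saves time.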

Keywords: Data envelopment analysis; Data enabled analytics; Large-scale computations; Random forest; Big data; Best-practice DMU classification
Date: 2021

Persistent link: https://EconPapers.repec.org/RePEc:spr:isochp:978-3-030-75162-3_2

Ordering information: This item can be ordered from
http://www.springer.com/9783030751623

DOI: 10.1007/978-3-030-75162-3_2

More chapters in International Series in Operations Research & Management Science from Springer
Bibliographic data for series maintained by Sonal Shukla and Springer Nature Abstracting and Indexing.

 
Page updated 2025-04-01
Handle: RePEc:spr:isochp:978-3-030-75162-3_2