Input Use Efficiency Management for Paddy Production Systems in India: A Machine Learning Approach
Priya Brata Bhoi,
Veeresh S. Wali,
Deepak Kumar Swain,
Kalpana Sharma,
Akash Kumar Bhoi,
Manlio Bacco and
Paolo Barsocchi
Additional contact information
Priya Brata Bhoi: Department of Economics and Sociology, Punjab Agricultural University, Ludhiana 141004, Punjab, India
Veeresh S. Wali: Indian Institute of Millets Research, Hyderabad 500030, Telangana, India
Deepak Kumar Swain: Department of Agricultural Statistics, Faculty of Agricultural Sciences, Siksha ‘O’ Anusandhan University, Bhubaneswar 751003, Odisha, India
Kalpana Sharma: Department of Computer Science and Engineering, Sikkim Manipal Institute of Technology, Sikkim Manipal University, Majitar 737136, Sikkim, India
Akash Kumar Bhoi: Department of Computer Science and Engineering, Sikkim Manipal Institute of Technology, Sikkim Manipal University, Majitar 737136, Sikkim, India
Manlio Bacco: Institute of Information Science and Technologies, National Research Council, 56124 Pisa, Italy
Paolo Barsocchi: Institute of Information Science and Technologies, National Research Council, 56124 Pisa, Italy
Agriculture, 2021, vol. 11, issue 9, 1-27
Abstract:
This research illustrates the technical efficiency of the pan-India paddy cultivation status obtained through a stochastic frontier approach. The results suggest that the mean technical efficiency varies from 0.64 in Gujarat to 0.95 in Odisha. Inputs like human labor, mechanical labor, fertilizer, irrigation and insecticide were found to determine the yield in paddy cultivation across India (except for Chhattisgarh). Inefficiency in the paddy production in Punjab, Bihar, West Bengal, Andhra Pradesh, Tamil Nadu, Kerala, Assam, Gujarat and Odisha in 2016–2017 was caused by technical inefficiency due to poor input management, as suggested by the significant σ 2 U and σ 2 v values of the stochastic frontier model. In addition, most of the farm groups in the study operated in the high-efficiency group (80–90% technical efficiency). No specific pattern of input use can be visualized through descriptive measures to give any specific policy implication. Thus, machine learning algorithms based on the input parameters were tested on the data in order to predict the farmers’ efficiency class for individual states. The highest mean accuracy of 0.80 for the models of all of the states was achieved in random forest models. Among the various states of India, the best random forest prediction model based on accuracy was fitted to the input data of Bihar (0.91), followed by Uttar Pradesh (0.89), Andhra Pradesh (0.88), Assam (0.88) and West Bengal (0.86). Thus, the study provides a technique for the classification and prediction of a farmer’s efficiency group from the levels of input use in paddy cultivation for each state in the study. The study uses the DES input dataset to classify and predict the efficiency group of the farmer, as other machine learning models in agriculture have used mostly satellite, spectral imaging and soil property data to detect disease, weeds and crops.
Keywords: paddy; stochastic frontier; machine learning; k -nearest neighbour (KNN); support vector machine (SVM); random forest (RF) (search for similar items in EconPapers)
JEL-codes: Q1 Q10 Q11 Q12 Q13 Q14 Q15 Q16 Q17 Q18 (search for similar items in EconPapers)
Date: 2021
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2077-0472/11/9/837/pdf (application/pdf)
https://www.mdpi.com/2077-0472/11/9/837/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jagris:v:11:y:2021:i:9:p:837-:d:626727
Access Statistics for this article
Agriculture is currently edited by Ms. Leda Xuan
More articles in Agriculture from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().