Applying Data Mining to China’s Swine Farming Industry: A Compromise Perspective of Economic, Environmental and Overall Performances
Diejun Huang,
Qiuzhuo Ma,
Liangyu Feng,
Xiaowei Wen and
Hua Li
Additional contact information
Diejun Huang: Institute of Geography and Tourism, Guangdong University of Finance & Economics, Guangzhou 510320, China
Qiuzhuo Ma: Business School, Guangdong University of Foreign Studies, 2 Baiyun Avenue, Baiyun District, Guangzhou 510420, China
Liangyu Feng: College of Economics and Management, South China Agricultural University, 483 Wushan Road, Tianhe District, Guangzhou 510642, China
Xiaowei Wen: College of Economics and Management, South China Agricultural University, 483 Wushan Road, Tianhe District, Guangzhou 510642, China
Hua Li: College of Economics and Management, South China Agricultural University, 483 Wushan Road, Tianhe District, Guangzhou 510642, China
Sustainability, 2018, vol. 10, issue 7, 1-26
Abstract:
The economic and environmental performance of the swine farming industry has long been a subject of heated discussion in developing countries. This paper explores the relationship between these features and producers’ overall performance. To construct multi-objective features that include them, a compromise approach to optimization is adopted. To classify overall performance into different levels and detect the effect of economic and environmental features on it, an iteration scheme is developed in which overall performance is treated as a target label. Setting this target label aside, a k-means clustering method is then used to help predict a producer’s overall performance from its economic and environmental features. In data pre-processing, correlation analysis for feature selection shows that a producer’s pollution emission and the regulation intensity it faces largely affect its overall performance, while profit is found to be negatively correlated with pollution emission when regulation intensity is neglected. The classification result derived from the Silhouette Coefficient shows that the data set can be efficiently split into different groups in terms of producers’ overall performance. The average distance between objects in the low-performance group is larger than that in the high-performance group. The threshold position between the two groups is found to depend largely on the features of pollution emission and regulation intensity. The clustering result obtained by the k-means method is effective and efficient in separating the objects into different groups based on features other than overall performance. In the 2- and 3-cluster cases, the result also shows evidence of the impact of economic and environmental features on the clustering. The cross-validation analysis under a set of randomly chosen splitting points shows that out-of-sample prediction quality increases with training sample size. As a by-product, the geographical distribution in the clustering result is found to be partially consistent with the official report from China’s central government regarding advantageous regions within the industry. Beyond the current research, the ease of using the knowledge obtained in this paper for transfer learning is discussed.
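The clustering step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the eight two-dimensional points are synthetic toy values (imagine standardized profit and pollution emission per producer), the function names `kmeans` and `silhouette` are hypothetical, and the farthest-first initialization is an assumption chosen here only to keep the run deterministic.

```python
import math

def dist(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def kmeans(points, k, max_iter=100):
    """Plain k-means; returns (labels, centers)."""
    # Farthest-first initialization: an assumption for determinism, not from the paper.
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(dist(p, c) for c in centers)))
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest center.
        new_labels = [min(range(k), key=lambda c: dist(p, centers[c])) for p in points]
        if new_labels == labels:
            break  # assignments are stable, so the centers will not move either
        labels = new_labels
        # Update step: each center moves to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return labels, centers

def silhouette(points, labels):
    """Mean silhouette coefficient over all points (needs at least two clusters)."""
    clusters = sorted(set(labels))
    scores = []
    for i, p in enumerate(points):
        mean_dist = {}
        for c in clusters:
            ds = [dist(p, q) for j, q in enumerate(points) if labels[j] == c and j != i]
            if ds:
                mean_dist[c] = sum(ds) / len(ds)
        a = mean_dist.get(labels[i])  # mean distance within the point's own cluster
        others = [d for c, d in mean_dist.items() if c != labels[i]]
        if a is None or not others:
            scores.append(0.0)  # singleton cluster or only one cluster present
            continue
        b = min(others)  # mean distance to the nearest other cluster
        denom = max(a, b)
        scores.append((b - a) / denom if denom > 0 else 0.0)
    return sum(scores) / len(scores)

# Synthetic toy data (not from the paper): two visibly separated groups.
points = [
    (0.10, 0.20), (0.20, 0.10), (0.15, 0.25), (0.05, 0.10),
    (0.90, 0.80), (0.80, 0.90), (0.85, 0.95), (1.00, 0.85),
]
labels, centers = kmeans(points, k=2)
score = silhouette(points, labels)
print(labels, round(score, 3))
```

A silhouette value close to 1 indicates compact, well-separated groups; comparing the score across several values of k is one common way to judge how many clusters a data set supports.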
Keywords: data mining; swine farming industry; compromise; multi-objective optimization
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56
Date: 2018
Downloads:
https://www.mdpi.com/2071-1050/10/7/2374/pdf (application/pdf)
https://www.mdpi.com/2071-1050/10/7/2374/ (text/html)
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:10:y:2018:i:7:p:2374-:d:156887
Sustainability is currently edited by Ms. Alexandra Wu
Bibliographic data for series maintained by MDPI Indexing Manager.