Exploring Patterns of Transportation-Related CO 2 Emissions Using Machine Learning Methods
Xiaodong Li,
Ai Ren and
Qi Li
Additional contact information
Xiaodong Li: School of Economics and Management, Anhui Polytechnic University, Wuhu 241000, China
Ai Ren: School of Business, State University of New York at New Paltz, New Paltz, NY 12561, USA
Qi Li: School of Business, State University of New York at New Paltz, New Paltz, NY 12561, USA
Sustainability, 2022, vol. 14, issue 8, 1-21
Abstract:
While the transportation sector is one of largest economic growth drivers for many countries, the adverse impacts of transportation on air quality are also well-noted, especially in developing countries. Carbon dioxide (CO 2 ) emissions are one of the direct results of a transportation sector powered by burning fossil-based fuels. Detailed knowledge of CO 2 emissions produced by the transportation sectors in various countries is essential for these countries to revise their future energy investments and policies. In this framework, three machine learning algorithms, ordinary least squares regression (OLS), support vector machine (SVM), and gradient boosting regression (GBR), are used to forecast transportation-based CO 2 emissions. Both socioeconomic factors and transportation factors are also included as features in the study. We study the top 30 CO 2 emissions-producing countries, including the Tier 1 group (the top five countries, accounting for 61% of global CO 2 emissions production) and the Tier 2 group (the next 25 countries, accounting for 35% of total CO 2 emissions production). We evaluate our model using four-fold cross-validation and report four frequently used statistical metrics ( R 2 , MAE, rRMSE, and MAPE). Of the three machine learning algorithms, the GBR model with features combining socioeconomic and transportation factors (GBR_ALL) has the best performance, with an R 2 value of 0.9943, rRMSE of 0.1165, and MAPE of 0.1408. We also find that both transportation features and socioeconomic features are important for transportation-based CO 2 emission prediction. Transportation features are more important in modeling for 30 countries, while socioeconomic features (especially GDP and population) are more important when modeling for Tier 1 and Tier 2 countries.
Keywords: carbon dioxide emission prediction; transportation sector; socioeconomic factors (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (4)
Downloads: (external link)
https://www.mdpi.com/2071-1050/14/8/4588/pdf (application/pdf)
https://www.mdpi.com/2071-1050/14/8/4588/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:14:y:2022:i:8:p:4588-:d:791939
Access Statistics for this article
Sustainability is currently edited by Ms. Alexandra Wu
More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().