EconPapers    
Economics at your fingertips  
 

Unsupervised Machine Learning to Detect Impending Anomalies in Testing of Fuel Economy and Emissions of Light-Duty Vehicles

Dhan Lord B. Fortela (), Ashton C. Fremin, Wayne Sharp, Ashley P. Mikolajczyk, Emmanuel Revellame, William Holmes, Rafael Hernandez and Mark Zappi
Additional contact information
Dhan Lord B. Fortela: Department of Chemical Engineering, University of Louisiana, Lafayette, LA 70504, USA
Ashton C. Fremin: Department of Chemical Engineering, University of Louisiana, Lafayette, LA 70504, USA
Wayne Sharp: Energy Institute of Louisiana, University of Louisiana, Lafayette, LA 70504, USA
Ashley P. Mikolajczyk: Department of Chemical Engineering, University of Louisiana, Lafayette, LA 70504, USA
Emmanuel Revellame: Energy Institute of Louisiana, University of Louisiana, Lafayette, LA 70504, USA
William Holmes: Department of Chemical Engineering, University of Louisiana, Lafayette, LA 70504, USA
Rafael Hernandez: Department of Chemical Engineering, University of Louisiana, Lafayette, LA 70504, USA
Mark Zappi: Department of Chemical Engineering, University of Louisiana, Lafayette, LA 70504, USA

Clean Technol., 2023, vol. 5, issue 1, 1-18

Abstract: This work focused on demonstrating the capability of unsupervised machine learning techniques in detecting impending anomalies by extracting hidden trends in the datasets of fuel economy and emissions of light-duty vehicles (LDVs), which consist of cars and light-duty trucks. This case study used the vehicles’ fuel economy and emissions testing datasets for vehicle model years 2015 to 2023 with a total of 34,602 data samples on LDVs of major vehicle manufacturers. Three unsupervised techniques were used: principal components analysis (PCA), K-Means clustering, and self-organizing maps (SOM). Results show that there are clusters of data that exhibit trends not represented by the dataset as a whole. Fuel CO vs. Fuel Economy has a negative correlation in the whole dataset (r = −0.355 for LDVs model year 2022), but it has positive correlations in certain sample clusters (e.g., LDVs model year 2022: r = +0.62 in a K-Means cluster where the slope is around 0.347 g−CO/mi/MPG). A time series analysis of the results of clustering indicates that Test Procedure and Fuel Type, specifically Test Procedure 11 and Fuel Type 26 as defined by the US EPA, could be the contributors to the positive correlation of CO and Fuel Economy. This detected peculiar trend of CO-vs.-Fuel Economy is an impending anomaly, as the use of Fuel 26 in emissions testing with Test Procedure 11 of US-EPA has been increasing through the years. With the finding that the clustered data samples with positive CO-vs.-Fuel Economy correlation all came from vehicle manufacturers that independently conduct the standard testing procedures and not data from US-EPA testing centers, it was concluded that the chemistry of using Fuel 26 in performing Test Procedure 11 should be re-evaluated by US-EPA.

Keywords: machine learning; fuel economy; vehicle emissions (search for similar items in EconPapers)
JEL-codes: Q2 Q3 Q4 Q5 (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2571-8797/5/1/21/pdf (application/pdf)
https://www.mdpi.com/2571-8797/5/1/21/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jcltec:v:5:y:2023:i:1:p:21-435:d:1095298

Access Statistics for this article

Clean Technol. is currently edited by Ms. Shary Song

More articles in Clean Technol. from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jcltec:v:5:y:2023:i:1:p:21-435:d:1095298