Explainable Machine-Learning Model Optimized Using Forward Feature Selection Algorithm for Prediction of Ground Ozone Concentration
DOI:
https://doi.org/10.7250/conect.2026.112Keywords:
Ground ozone, neural network, machine learning, forward feature selectionAbstract
An accurate model for prediction of air quality parameters such as ground ozone (o3) serves as policy and decision-making tool for providing healthy and friendly environment. in this study, six machine-learning models (neural network (nn), ensemble, kernel regression, regression tress, support vector regression (svr), and multilinear regression (mlr)) were developed for prediction of ground ozone concentration using 35 065 hourly data from January 2013 to february 2017. The selection of relevant input variables for the models was using forward feature selection algorithm. the forward feature selection reveals rain, day and wind speed as the least important variable for the prediction of ozone concentration. the models were evaluated using mean absolute error (mae), mean square error (mse), root mean square error (rmse) and nash sutcliffe efficiency (nse) in both training and testing stage. the nn model outperformed ensemble, kernel, tree, svr and mlr by 0.73 %, 4.74 %, 6.47 %, 11.41 % and 25.17 % respectively in the testing stage. copland algorithm inicates the nn model as the overall best model considering all evaluation metrics. the shapley analysis indicates temperature, nitrogen oxide and hour of the day as the major factors contributing to ground ozone concentration.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Ibrahim Khalil Umar, Samir Bashir (Author)

This work is licensed under a Creative Commons Attribution 4.0 International License.