Lag Variables in Air Pollution Modeling Based on Traffic Flow and Meteorological Factors
Joanna Kamińska , Guido Sciavicco , Estrella Lucena-Sánchez , Fernando Jimenez
AbstractIn order to refine the research on the impact of environmental factors on the concentration of pollutants in the air, in this paper, we present a mathematical model that allows the possibility of taking into account the past values of factors (explanatory variables) when modeling the current concentration of pollution. We conducted numerical analyzes based on hourly data from meteorological, traffic and air quality monitoring stations in Wrocław (Poland, Central Europe) from 2015–2017. In order to determine the optimal delay of each explanatory variable, we used a multi-objective optimization model (MO). It turned out that for the concentration of nitrogen oxides, delayed traffic flow, wind speed and sunshine duration time are more important than current ones. Then we built two random forest models: an actual model of current values of explanatory variables and a lag model with delayed variables determined by the MO method. Taking into account variables with an optimal delay (lag model) results in an increase in model accuracy for NO2 with R2 = 0.51 to 0.56 and for NOx from 0.46 to 0.52. We deduced that in pollutant concentrations modeling, the possibility of greater influence of variables with delay should always be considered because it can significantly increase the accuracy of the model and indicate additional relationships or dependencies.
|Journal series||Proceedings, ISSN , e-ISSN 2504-3900, (0 pkt)|
|Publication size in sheets||0.5|
|Conference||9th Innovations-Sustainability-Modernity-Openness Conference (ISMO’20) (ISMO 2020), 20-05-2020 - 21-05-2020, Częstochowa, Polska|
|Keywords in English||air pollution, nitrogen oxides, random forest, lag variables, multi-objective optimization, traffic flow, meteorological conditions|
|License||Journal (articles only); published final; ; with publication|
|Score||= 5.0, 15-09-2020, ArticleFromConference|
* presented citation count is obtained through Internet information analysis and it is close to the number calculated by the Publish or Perish system.