Please use this identifier to cite or link to this item:
http://hdl.handle.net/20.500.12188/30082
Title: | Machine learning model for air pollution prediction in Skopje, North Macedonia | Authors: | Andonovic, Viktor GJoreski, Hristijan Srbinovska, Mare Andova, Vesna Krkoleva, Aleksandra Celeska, Maja Todorov, Zdravko |
Keywords: | Machine learning, prediction model, air pollution, PM10, PM2.5 | Issue Date: | Jul-2020 | Conference: | 4th South East Europe Sustainable Development of Energy Water and Environment Systems Conference (SEE SDEWES) Sarajevo | Abstract: | The low quality of air, especially high concentration of particulate matter that have significant negative effect on human health and environment, is a global problem in urban areas. Thus, early air pollution prediction is an urgent need in Skopje, North Macedonia with highly increased concentration of particulate matter especially during the winter months. The objective of this paper is to develop machine learning model for predicting the air pollution in Skopje. The methods are based on processing the collected data from different measurement locations in Skopje, generating numerous weather and pollution features, and choosing the optimal parameters (hyperparameters) for the model. The information for the various pollutants were provided from the measurement stations located near the Faculty of Electrical Engineering and Information Technologies building. The measured data are gathered from the three sensor nodes that are collecting data for following parameters: particulate matter with 10 or less micrometres (PM10), particulate matter with 2.5 or less micrometres (PM2.5), CO and NO2, and sending these data to a server for online monitoring or off-line analysis. The pollution data, together with the weather information for temperature, humidity, wind speed, and wind direction were combined to train the prediction model. The results show that the weather information is correlated with the air pollution, which allows to train a model that predicts the air pollution based on the weather data and the historical data about the pollution. The experimental evaluation showed that the best performing model, XGBoost, achieves Mean Absolute Error for PM10 values of 6.8, 9.7, and 12.4 for the nodes 3, 2, and 1 respectively, and for PM2.5 values 6.36, 8.81 and 8 for nodes 3, 2 and 1 respectively. | URI: | http://hdl.handle.net/20.500.12188/30082 |
Appears in Collections: | Faculty of Electrical Engineering and Information Technologies: Conference Papers |
Show full item record
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.