Machine learning-based forecasting of air quality index under long-term environmental patterns: A comparative approach with XGBoost, LightGBM, and SVM

基于机器学习的长期环境模式下空气质量指数预测:与 XGBoost、LightGBM 和 SVM 的比较方法

阅读:1

Abstract

Air pollution is a global problem that threatens environmental sustainability and severely affects public health. Monitoring air quality and predicting future pollution levels are critical for creating effective environmental policies and enabling individuals to take precautions against air pollution. This study presents a long-term assessment of daily Air Quality Index (AQI) prediction using machine learning models based on meteorological and pollutant data collected in eastern Türkiye from 2016 to 2024. The dataset includes four major air pollutants (PM₁₀, SO₂, NO₂, O₃) and five meteorological variables (temperature, precipitation, relative humidity, wind direction, wind speed). Three models-eXtreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), and Support Vector Machine (SVM)-were evaluated using the coefficient of determination (R²), root mean square error (RMSE) and mean absolute error (MAE) as performance metrics. Among these, XGBoost achieved the highest prediction accuracy (R² = 0.999, RMSE = 0.234, MAE = 0.158). The results demonstrate that ensemble-based machine learning approaches, particularly XGBoost, can effectively model AQI fluctuations using environmental predictors. These results provide valuable insights for air quality forecasting systems and suggest practical implications for regional air pollution management and early warning systems, supporting public health protection and the development of environmental health policies.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。