Systematic Review of Machine Learning and Deep Learning Techniques for Spatiotemporal Air Quality Prediction
Abstract
:1. Introduction
2. Materials and Methods
Query 1: “Write the keywords in review of machine learning techniques for spatiotemporal air quality prediction”.
Query 2: “What are the keywords on review of machine learning techniques for spatiotemporal air quality prediction”.
Query 3: “Write all the keywords on review of machine learning techniques for spatiotemporal air quality prediction”.
Query 4: “Write all the keywords on review of machine learning models for spatiotemporal air quality prediction”.
Query 5: “What are all the keywords on review of machine learning models for spatiotemporal air quality prediction”.
Query 6: “Write keywords on review of machine learning techniques for spatiotemporal air quality prediction”.
3. Discussions
3.1. ChatGPT-Generated Keywords
3.2. Machine Learning Techniques for Spatiotemporal Air Quality Prediction
3.3. Air Quality Index (AQI) Models
3.4. Model Hybridization Using ML and DL for Air Quality Prediction
4. Conclusions and Future Directions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
AdaBoost | Adaptive Boosting |
AQI | Air quality index |
AQP | Air Quality Prediction |
BPNN | Back Propagation Neural Network |
MLP | Artificial Neural Network—Multilayer Perceptron |
ANN | Artificial neural networks |
ARIMA | Autoregressive Integrated Moving Average |
BiLSTM | Bi-directional long-short-term memory layer |
CR | Catboost Regression |
R2 | Coefficient of determination |
CAQI | Common Air Quality Index |
CMAQ | Community Multiscale Air Quality model |
CWT | Concentration weight trajectory |
CAAQMS | Continuous Ambient Air Quality Monitoring Stations |
CNN | Convolutional neural network |
DT | Decision Tree |
DTR | Decision Tree Regression |
DF | Deep Forest |
DRL | Deep Reinforcement Learning |
DTW | Dynamic Time Warping |
EBAT | Ensemble bagged tree |
EBOT | Ensemble boosted tree |
ETR | Extra Trees Regression |
XGBoost | eXtreme Gradient Boosting |
ERT | Extremely Randomized Tree |
ELM | Extreme learning machine |
FDT | Fine Decision Tree |
GEE | Generalized Estimating Equation |
GPSTI | Generalized Probabilistic Standardized Temperature Index |
GA | Genetic algorithm |
GEOS-CF | Goddard Earth Observing System Composition Forecasting |
GBTR | Gradient Boosted Tree Regression |
GAST | Graph Attention-based Spatial-Temporal model |
HGBoost | Histogram-based Gradient Boost |
k-NN | k-Nearest Neighbor |
LUR | Land Use Regression |
LGBM or LightGBM | Light Gradient Boosting Machine |
LR | Logistic Regression |
LSTM | Long short-term memory |
RNN | Recurrent neural network |
ML | Machine Learning |
MAE | Mean Absolute Error |
MSE | Mean Squared Error |
ME | Microenvironment Exposure |
MODIS | Moderate Resolution Imaging Spectro Radiometer |
MP | Multilayer Perceptron |
ML | Multilinear Regression |
MISR | Multiangle Imaging Spectro Radiometer |
MLR | Multiple Linear Regression |
NSE | Nash-Sutcliffe Efficiency coefficient |
NN | Neural Network |
OST | Optimal Stopping Theory |
PALM | Parallelized large-eddy simulation model |
PVE | Probabilistic Voting Ensemble |
RF | Random Forest |
RMSE | Root Mean Square Error |
SOM | Self-Organizing Map |
SKATER | Spatial K’luster Analysis by Tree Edge Removal |
SMLT | Supervised machine learning technique |
SVR | Support Vector Regression |
SLR | Simple Linear Regression |
GCNs | Transformers and Graph Convolutional Networks |
WLSTME | Weighted Long Short-Term Memory Extended model |
TBATS | Trigonometric seasonality, Box-Cox transformation, ARMA errors, Trend and Seasonal components |
TROPOMI | TROPOspheric Monitoring Instrument |
Appendix A
Queries | ChatGPT-Generated Keywords |
---|---|
Query 1 | “Machine Learning”, “Spatiotemporal”, “Air Quality Prediction”, “Atmospheric Modeling”, “Time Series Analysis”, “Spatial Interpolation”, “Feature Engineering”, “Ensemble Methods”, “Deep Learning”, “Convolutional Neural Networks (CNNs)”, “Recurrent Neural Networks (RNNs)”, “Long Short-Term Memory (LSTM)”, “Gated Recurrent Units (GRUs)”, “Support Vector Machines (SVMs)”, “Random Forests”, “Gradient Boosting Machines (GBMs)”, “Gaussian Processes”, “Kriging”, “Data Fusion”, “Model Evaluation” |
Query 2 | “Air quality prediction”, “Spatiotemporal modeling”, “Machine learning algorithms”, “Atmospheric data analysis”, “Time series forecasting”, “Spatial interpolation methods”, “Feature extraction and selection”, “Sensor networks”, “Data fusion techniques”, “Deep learning architectures”, “Convolutional neural networks (CNN)”, “Recurrent neural networks (RNN)”, “Long short-term memory (LSTM)”, “Support vector machines (SVM)”, “Random forests”, “Gradient boosting methods”, “Ensemble learning approaches”, “Model evaluation metrics”, “Uncertainty quantification”, “Urban air pollution” |
Query 3 | “Air quality prediction”, “Spatiotemporal modeling”, “Machine learning algorithms”, “Atmospheric data analysis”, “Time series forecasting”, “Spatial interpolation methods”, “Feature extraction”, “Feature selection”, “Sensor networks”, “Data fusion techniques”, “Deep learning architectures”, “Convolutional neural networks (CNN)”, “Recurrent neural networks (RNN)”, “Long short-term memory (LSTM)”, “Support vector machines (SVM)”, “Random forests”, “Gradient boosting methods”, “Ensemble learning approaches”, “Model evaluation metrics”, “Uncertainty quantification”, “Urban air pollution”, “Environmental monitoring”, “Satellite data integration”, “Hybrid models”, “Transfer learning” |
Query 4 | “Air quality prediction”, “Spatiotemporal modeling”, “Machine learning algorithms”, “Atmospheric data analysis”, “Time series forecasting”, “Spatial interpolation techniques”, “Feature engineering”, “Feature selection”, “Sensor networks”, “Data fusion methods”, “Deep learning architectures”, “Convolutional neural networks (CNN)”, “Recurrent neural networks (RNN)”, “Long short-term memory (LSTM)”, “Support vector machines (SVM)”, “Random forests”, “Gradient boosting methods”, “Ensemble learning techniques”, “Model evaluation metrics”, “Uncertainty estimation”, “Urban air pollution monitoring”, “Satellite data integration”, “Hybrid models”, “Transfer learning applications”, “Environmental health impacts” |
Query 5 | “Air quality prediction”, “Spatiotemporal modeling”, “Machine learning algorithms”, “Atmospheric data analysis”, “Time series forecasting”, “Spatial interpolation”, “Feature engineering”, “Feature selection”, “Sensor networks”, “Data fusion”, “Deep learning”, “Convolutional neural networks (CNN)”, “Recurrent neural networks (RNN)”, “Long short-term memory (LSTM)”, “Support vector machines (SVM)”, “Random forests”, “Gradient boosting”, “Ensemble learning”, “Model evaluation”, “Uncertainty quantification”, “Urban air pollution”, “Environmental monitoring”, “Satellite data integration”, “Hybrid models”, “Transfer learning” |
Query 6 |
|
References
- Zhu, S.; Xu, J.; Fan, M.; Yu, C.; Letu, H.; Zeng, Q.; Zhu, H.; Wang, H.; Wang, Y.; Shi, J. Estimating Near-Surface Concentrations of Major Air Pollutants From Space: A Universal Estimation Framework LAPSO. IEEE Trans. Geosci. Remote Sens. 2023, 61, 4101011. [Google Scholar] [CrossRef]
- Mumtaz, R.; Amin, A.; Khan, M.A.; Asif, M.D.A.; Anwar, Z.; Bashir, M.J. Impact of Green Energy Transportation Systems on Urban Air Quality: A Predictive Analysis Using Spatiotemporal Deep Learning Techniques. Energies 2023, 16, 6087. [Google Scholar] [CrossRef]
- Wang, J.; Lei, Y.; Chen, Y.; Wu, Y.; Ge, X.; Shen, F.; Zhang, J.; Ye, J.; Nie, D.; Zhao, X. Comparison of active and passive sampling methods for air pollutants in urban environments. Environ. Sci. Pollut. Res. Int. 2020, 27, 33173–33183. [Google Scholar]
- Khokhar, M.; Abid, M.; Zahid, M. Air pollution and its impact on agriculture: A review. Environ. Sci. Pollut. Res 2016, 23, 1703–1715. [Google Scholar]
- Ali, M.; Athar, M.A.; Ashfaq, M.; Tariq, M.A. Air Pollution and Human Health: A Review. Environ. Pollut 2015, 207, 427–438. [Google Scholar]
- Matooane, M.; John, J.; Oosthuizen, R.; Binedell, M. Vulnerability of South African communities to air pollution. In Proceedings of the 8th World Congress on Environmental Health; Document Transformation Technologies Organised by SB Conferences, Durban, South Africa, 22–27 February 2004. [Google Scholar]
- Zhan, Y.; Li, X.; Li, G.; Li, Z. Deep learning for source identification of ambient air pollutants using air quality and meteorological data. Environ. Sci. Pollut. Res 2021, 28, 14380–14391. [Google Scholar]
- Kaur, M.; Singh, D.; Jabarulla, M.Y.; Kumar, V.; Kang, J.; Lee, H.N. Computational deep air quality prediction techniques: A systematic review. Artif. Intell. Rev. 2023, 56, S2053–S2098. [Google Scholar] [CrossRef]
- Gugnani, V.; Singh, R.K. Analysis of deep learning approaches for air pollution prediction. Multimed. Tools Appl. 2022, 81, 6031–6049. [Google Scholar] [CrossRef]
- Doush, I.A.; Sultan, K.; Alsaber, A.; Alkandari, D.; Abdullah, A. Improving Neural Network Using Jaya Algorithm with Opposite Learning for Air Quality Prediction. In Lecture Notes in Networks and Systems; Springer: Cham, Switzerland, 2024. [Google Scholar]
- South Africa. Government Gazette: National Environment Management: Air Quality Act, 2004; Parliament of the Republic of South Africa: Cape Town, South Africa, 2005; Volume 476, pp. 1–29.
- Environment, A. Environmental Monitoring. 2024. Available online: https://apexenviro.co.za/services/environmental-monitoring/ (accessed on 20 August 2024).
- Wang, J.; Song, G. A Deep Spatial-Temporal Ensemble Model for Air Quality Prediction. Neurocomputing 2018, 314, 198–206. [Google Scholar] [CrossRef]
- Drewil, G.I.; Al-Bahadili, R.J. Air pollution prediction using LSTM deep learning and metaheuristics algorithms. Sensors 2022, 24, 100546. [Google Scholar] [CrossRef]
- Morapedi, T.D.; Obagbuwa, I.C. Air pollution particulate matter (PM2.5) prediction in South African cities using machine learning techniques. Front Artif Intell. 2023, 6, 1230087. [Google Scholar] [CrossRef]
- Cosemans, G.; Janssen, S.; Panis, L.I.; Mishra, V. A Comparison of Linear Regression, Regularization, and Machine Learning Algorithms to Develop Europe-wide Spatial Models of Fine Particles and Nitrogen Dioxide. Atmosphere 2021, 12, 798. [Google Scholar]
- Lin, G.Y.; Chen, H.W.; Chen, B.J.; Chen, S.C. A machine learning model for predicting PM2.5 and nitrate concentrations based on long-term water-soluble inorganic salts datasets at a road site station. Chemosphere 2022, 289, 133123. [Google Scholar] [CrossRef] [PubMed]
- Choi, S.; Lee, Y.; Kim, E.; Jeong, I.; Kim, H.; Kim, K.H. Mapping urban air quality using mobile sampling with low-cost sensors and machine learning in Seoul, South Korea. Environ. Pollut 2021, 275, 116586. [Google Scholar]
- Dragomir, E.G.; Oprea, M. Air Quality Forecasting by Using Nonlinear Modeling Methods. In International Conference on Nonlinear Dynamics of Electronic Systems; Springer International Publishing: Cham, Switzerland, 2014. [Google Scholar]
- Masinde, M. Enhancing Systematic Literature Reviews using LDA and ChatGPT: Case of Framework for Smart City Planning. In Proceedings of the 2024 IST-Africa Conference (IST-Africa), Dublin, Ireland, 20–24 May 2024; IEEE Explore: New York, NY, USA, 2024. [Google Scholar]
- Arunarani, A.R.; Selvanayaki, S.; Saleh Al Ansari, M.; Ala Walid, M.A.; Devireddy, N.; Keerthi, M.M. Crop Yield Prediction Using Spatio Temporal CNN and Multimodal Remote Sensing. In Proceedings of the 2nd International Conference on Edge Computing and Applications, ICECAA 2023, Namakkal, India, 19–21 July 2023; Institute of Electrical and Electronics Engineers Inc.: New York, NY, USA, 2023. [Google Scholar]
- Brayshaw, G.; Ward-Cherrier, B.; Pearson, M. Temporal and Spatio-temporal domains for Neuromorphic Tactile Texture Classification. In Proceedings of the ACM International Conference Proceeding Series, Online, 28 March–1 April 2022; Association for Computing Machinery: New York, NY, USA, 2022. [Google Scholar]
- Broumand, M.; Asgarian, A.; Bussmann, M.; Chattopadhyay, K.; Thomson, M.J. Spatio-temporal dynamics and disintegration of a fan liquid sheet. Phys. Fluids 2021, 33, 112109. [Google Scholar] [CrossRef]
- Mohammadi, F.; Teiri, H.; Hajizadeh, Y.; Abdolahnejad, A.; Ebrahimi, A. Prediction of atmospheric PM2.5 level by machine learning techniques in Isfahan, Iran. Sci. Rep. 2024, 14, 2109. [Google Scholar] [CrossRef] [PubMed]
- Kelly, J.T.; Jang, C.; Zhu, Y.; Long, S.; Xing, J.; Wang, S.; Murphy, B.N.; Pye, H.O.T. Predicting the nonlinear response of PM2.5 and ozone to precursor emission changes with a response surface model. Atmosphere 2021, 12, 1044. [Google Scholar] [CrossRef]
- Alsaedi, A.S.; Liyakathunisa, L. Spatial and temporal data analysis with deep learning for air quality prediction. In Proceedings of the—International Conference on Developments in eSystems Engineering, DeSE, Kazan, Russia, 7–10 October 2019; Institute of Electrical and Electronics Engineers Inc.: New York, NY, USA, 2019. [Google Scholar]
- Awan, W.B.; Batool, A.; Ali, Z.; Xu, Z.; Niaz, R.; Sammen, S.S. A Unified procedure for the probabilistic assessment and forecasting temperature characteristics under global climate change. Environ. Dev. Sustain. 2024, 1–25. [Google Scholar] [CrossRef]
- Hashemi, M.; Karimi, H.A. Weighted Machine Learning for Spatial-Temporal Data. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2020, 13, 3066–3082. [Google Scholar] [CrossRef]
- Shafi, M. Utilizing Spatio-temporal Satellite Images and Machine Learning to Examine the Impacts of Changing Land Use and Land Cover on Sohar City. In Proceedings of the 2023 24th International Arab Conference on Information Technology, ACIT, Ajman, United Arab Emirates, 6–8 December 2023. [Google Scholar]
- Yang, C.; Du, W.J.; He, R.L.; Hu, Y.R.; Liu, H.; Huang, T.; Li, W.W. Spatiotemporal Patterns of Methane and Nitrous Oxide Emissions in China’s Inland Waters Identified by Machine Learning Technique. ACS ES T Water 2024, 4, 936–947. [Google Scholar] [CrossRef]
- Iskandaryan, D.; Ramos, F.; Trilles, S. Air quality prediction in smart cities using machine learning technologies based on sensor data: A review. Appl. Sci. 2020, 10, 2401. [Google Scholar] [CrossRef]
- Soh, P.-W.; Chang, J.-W.; Huang, J.-W. Adaptive Deep Learning-Based Air Quality Prediction Model Using the Most Relevant Spatial-Temporal Relations. IEEE Access 2018, 6, 38186–38199. [Google Scholar] [CrossRef]
- Power, M.; Cascio, K.; Adgate, A. Air pollution and cardiovascular disease: A window of opportunity. Curr. Opin. Cardiol. 2018, 33, 578–584. [Google Scholar]
- Khan, J.; Ellermann, T.; Hertel, O. Predicting Hourly Street-Scale NO2 and PM2.5 Concentrations Using Machine Learning at One of the Danish Traffic Hotspots. In Springer Proceedings in Complexity; Springer International Publishing: Cham, Switzerland, 2022. [Google Scholar]
- Park, C.; Jeong, S.; Kim, C.; Shin, J.; Joo, J. Machine learning based estimation of urban on-road CO2 concentration in Seoul. Environ. Res. 2023, 231, 116256. [Google Scholar] [CrossRef] [PubMed]
- Tu, Q.; Hase, F.; Chen, Z.; Schneider, M.; García, O.; Khosrawi, F.; Chen, S.; Blumenstock, T.; Liu, F.; Qin, K.; et al. Estimation of NO2 emission strengths over Riyadh and Madrid from space from a combination of wind-Assigned anomalies and a machine learning technique. Atmos. Meas. Tech. 2023, 16, 2237–2262. [Google Scholar] [CrossRef]
- Meng, X.; Hang, Y.; Lin, X.; Li, T.; Wang, T.; Cao, J.; Fu, Q.; Dey, S.; Huang, K.; Liang, F.; et al. A satellite-driven model to estimate long-term particulate sulfate levels and attributable mortality burden in China. Environ. Int. 2023, 171, 107740. [Google Scholar] [CrossRef]
- Choi, H.; Park, S.; Kang, Y.; Im, J.; Song, S. Retrieval of hourly PM2.5 using top-of-atmosphere reflectance from geostationary ocean color imagers I and II. Environ. Pollut. 2023, 323, 121169. [Google Scholar] [CrossRef]
- Ezhilkumar, M.R.; Karthikeyan, S.; Chalishajar, D.N.; Ramesh, R. Air quality prediction using machine learning techniques for intelligent monitoring systems. In Industry Automation: The Technologies, Platforms and Use Cases; River Publishers: Gistrup, Denmark, 2024; pp. 223–235. [Google Scholar]
- Mathew, A.; Gokul, P.R.; Raja Shekar, P.; Arunab, K.S.; Ghassan Abdo, H.; Almohamad, H.; Abdullah Al Dughairi, A. Air quality analysis and PM2.5 modelling using machine learning techniques: A study of Hyderabad city in India. Cogent Eng. 2023, 10, 2243743. [Google Scholar] [CrossRef]
- Xiong, K.; Xie, X.; Huang, L.; Hu, J. Improved O3 predictions in China by combining chemical transport model and multi-source data with machining learning techniques. Atmos. Environ. 2024, 318, 120269. [Google Scholar] [CrossRef]
- Sharma, G.; Khurana, S.; Saina, N.; Gupta, G. Comparative Analysis of Machine Learning Techniques in Air Quality Index (AQI) prediction in smart cities. Int. J. Syst. Assur. Eng. Manag. 2024, 15, 3060–3075. [Google Scholar] [CrossRef]
- Pandithurai, O.; Bharathiraja, N.; Pradeepa, K.; Meenakshi, D.; Kathiravan, M.; Vinoth Kumar, M. Air Pollution Prediction using Supervised Machine Learning Technique. In Proceedings of the 3rd International Conference on Artificial Intelligence and Smart Energy, ICAIS, Coimbatore, India, 2–4 February 2023. [Google Scholar]
- Mahesh Babu, K.; Rene Beulah, J. Air quality prediction based on supervised machine learning methods. Int. J. Innov. Technol. Explor. Eng. 2019, 8, 206–212. [Google Scholar] [CrossRef]
- Niveshitha, N.; Amsaad, F.; Jhanjhi, N.Z. Air Quality Prediction in Smart Cities Using Cloud Machine Learning. In Proceedings of the 2023 2nd International Conference on Smart Technologies for Smart Nation, SmartTechCon, Singapore, 18–19 August 2023. [Google Scholar]
- William, P.; Paithankar, D.N.; Yawalkar, P.M.; Korde, S.K.; Pabale, A.R.; Rakshe, D.S. Divination of Air Quality Assessment using Ensembling Machine Learning Approach. In Proceedings of the International Conference on Artificial Intelligence and Knowledge Discovery in Concurrent Engineering, ICECONF, Chennai, India, 5–7 January 2023. [Google Scholar]
- Kunnathettu, A.J.; Varma, S.L. Comparative Analysis of Neural Network and Machine Learning Techniques for Air Quality Prediction. In Proceedings of the IEEE 2020 2nd International Conference on Advances in Computing, Communication Control and Networking, ICACCCN, Greater Noida, India, 18–19 December 2020. [Google Scholar]
- Sanjeev, D. Implementation of machine learning algorithms for analysis and prediction of air quality. Int. J. Eng. Res. Technol. 2021, 10, 533–538. [Google Scholar]
- Chang, F.J.; Chang, L.C.; Kang, C.C.; Wang, Y.S.; Huang, A. Explore spatio-temporal PM2.5 features in northern Taiwan using machine learning techniques. Sci. Total Environ. 2020, 736, 139656. [Google Scholar] [CrossRef]
- Ly, B.T.; Matsumi, Y.; Vu, T.V.; Sekiguchi, K.; Nguyen, T.T.; Pham, C.T.; Nghiem, T.D.; Ngo, I.H.; Kurotsuchi, Y.; Nguyen, T.H.; et al. The effects of meteorological conditions and long-range transport on PM2.5 levels in Hanoi revealed from multi-site measurement using compact sensors and machine learning approach. J. Aerosol Sci. 2021, 152, 105716. [Google Scholar] [CrossRef]
- Zareba, M.; Dlugosz, H.; Danek, T.; Weglinska, E. Big-Data-Driven Machine Learning for Enhancing Spatiotemporal Air Pollution Pattern Analysis. Atmosphere 2023, 14, 760. [Google Scholar] [CrossRef]
- Li, Y.; Zhang, M.; Ma, G.; Ren, H.; Yu, E. Analysis of Primary Air Pollutants’ Spatiotemporal Distributions Based on Satellite Imagery and Machine-Learning Techniques. Atmosphere 2024, 15, 287. [Google Scholar] [CrossRef]
- Chakradhar Reddy, K.; Nagarjuna Reddy, K.; Brahmaji Prasad, K.; Rajendran, P.S. The Prediction of Quality of the Air Using Supervised Learning. In Proceedings of the 6th International Conference on Communication and Electronics Systems, ICCES, Coimbatre, India, 8–10 July 2021; Institute of Electrical and Electronics Engineers Inc.: New York, NY, USA, 2021. [Google Scholar]
- Pant, A.; Sharma, S.; Bansal, M.; Narang, M. Comparative Analysis of Supervised Machine Learning Techniques for AQI Prediction. In Proceedings of the 2022 International Conference on Advanced Computing Technologies and Applications, ICACTA, Coimbatore, India, 4–5 March 2022. [Google Scholar]
- Difaizi, T.Z.; Camille, O.P.L.; Nadine, A.; Kalita, S.; Kumar, A.; Rakesh, N. Comparative analysis of machine learning techniques for air quality index prediction for Indian cities. AIP Conf. Proc. 2023, 2917, 050014. [Google Scholar]
- Alam, B.; Hussain, A.; Fayaz, M. An Effective Approach for Air Quality Prediction in Bishkek Based on Machine Learning techniques. In Proceedings of the ACM International Conference Proceeding Series, Istanbul, Turkey, 13–15 October 2023; Association for Computing Machinery: New York, NY, USA, 2023. [Google Scholar]
- Almaliki, A.H.; Derdour, A.; Ali, E. Air Quality Index (AQI) Prediction in Holy Makkah Based on Machine Learning Methods. Sustainability 2023, 15, 3168. [Google Scholar] [CrossRef]
- Xiang, X.; Fahad, S.; Han, M.S.; Naeem, M.R.; Room, S. Air quality index prediction via multi-task machine learning technique: Spatial analysis for human capital and intensive air quality monitoring stations. Air Qual. Atmos. Health 2023, 16, 85–97. [Google Scholar] [CrossRef]
- Džaferović, E.; Karaduzović-Hadžiabdić, K. Air Quality Prediction Using Machine Learning Methods: A Case Study of Bjelave Neighborhood, Sarajevo, BiH. In Lecture Notes in Networks and Systems; Springer International Publishing: Cham, Switzerland, 2021. [Google Scholar]
- Mahalingam, U.; Elangovan, K.; Dobhal, H.; Valliappa, C.; Shrestha, S.; Kedam, G. A machine learning model for air quality prediction for smart cities. In Proceedings of the 2019 International Conference on Wireless Communications, Signal Processing and Networking, WiSPNET, Chennai, India, 21–23 March 2019. [Google Scholar]
- Raut, A.R.; Kharade, H.B.; Nashikkar, R.N.; Padhye, Y.N. Harnessing Machine Learning for Predictive Analysis of Air Quality in Pune City: A Comparative Study. In Proceedings of the 2023 IEEE Pune Section International Conference, PuneCon, Pune, India, 14–16 December 2023. [Google Scholar]
- Hardini, M.; Chakim, M.H.R.; Magdalena, L.; Kenta, H.; Rafika, A.S.; Julianingsih, D. Image-based Air Quality Prediction using Convolutional Neural Networks and Machine Learning. APTISI Trans. Technopreneurship 2023, 5, 109–123. [Google Scholar] [CrossRef]
- Madan, T.; Sagar, S.; Virmani, D. Air Quality Prediction using Machine Learning Algorithms-A Review. In Proceedings of the IEEE 2020 2nd International Conference on Advances in Computing, Communication Control and Networking, ICACCCN, Greater Noida, India, 18–19 December 2020. [Google Scholar]
- Le, Q.V. A Tutorial on Deep Learning Part 1: Nonlinear Classifiers and the Backpropagation Algorithm. Standford University: Mountain View, CA, USA, 2015; p. 7. [Google Scholar]
- Aruna Kumari, N.S.; Ananda Kumar, K.S.; Hitesh Vardhan Raju, S.; Vasuki, H.R.; Nikesh, M.P. Prediction of Air Quality in Industrial Area. In Proceedings of the—5th IEEE International Conference on Recent Trends in Electronics, Information and Communication Technology, RTEICT, Bangalore, India, 12–13 November 2020. [Google Scholar]
- Zhang, B.; Duan, M.; Sun, Y.; Lyu, Y.; Hou, Y.; Tan, T. Air Quality Index Prediction in Six Major Chinese Urban Agglomerations: A Comparative Study of Single Machine Learning Model, Ensemble Model, and Hybrid Model. Atmosphere 2023, 14, 1478. [Google Scholar] [CrossRef]
- Shrivastava, A.L.; Dwivedi, R.K. Air Quality Prediction Using Supervised Machine Learning Techniques. In Smart Innovation, Systems and Technologies; Springer Nature Singapore: Singapore, 2023. [Google Scholar]
- Sun, C.; Li, J.; Sulaiman, R.; Alotaibi, B.S.; Elattar, S.; Abuhussain, M. Air Quality Prediction and Multi-Task Offloading based on Deep Learning Methods in Edge Computing. J. Grid Comput. 2023, 21, 32. [Google Scholar] [CrossRef]
- Lin, H.X.; Jin, J.; Van Den Herik, J. Air quality forecast through integrated data assimilation and machine learning. In Proceedings of the ICAART 2019—Proceedings of the 11th International Conference on Agents and Artificial Intelligence, Prague, Czech Republic, 19–21 February 2019. [Google Scholar]
- Wang, S.; McGibbon, J.; Zhang, Y. Predicting high-resolution air quality using machine learning: Integration of large eddy simulation and urban morphology data. Environ. Pollut. 2024, 344, 123371. [Google Scholar] [CrossRef] [PubMed]
- Zareba, M.; Cogiel, S.; Danek, T.; Weglinska, E. Machine Learning Techniques for Spatio-Temporal Air Pollution Prediction to Drive Sustainable Urban Development in the Era of Energy and Data Transformation. Energies 2024, 17, 2738. [Google Scholar] [CrossRef]
- Chang, Y.S.; Abimannan, S.; Chiao, H.T.; Lin, C.Y.; Huang, Y.P. An ensemble learning based hybrid model and framework for air pollution forecasting. Environ. Sci. Pollut. Res. 2020, 27, 38155–38168. [Google Scholar] [CrossRef]
- Karthikeyan, I.; Jenefa, A.; Edward Naveen, V.; Santhiya, P.; Sangeetha, R.; Lincy, A. EcoAI Forecast: Multi—Model Air Quality Prediction in Patancheru. In Proceedings of the 2024 International Conference on Recent Advances in Electrical, Electronics, Ubiquitous Communication, and Computational Intelligence, RAEEUCCI, Chennai, India, 17–18 April 2024. [Google Scholar]
- Zhang, Z.; Ren, J.; Chang, Y. Improving Intra-Urban Prediction of Atmospheric Fine Particles Using a Hybrid Deep Learning Approach. Atmosphere 2023, 14, 599. [Google Scholar] [CrossRef]
- Cican, G.; Buturache, A.N.; Mirea, R. Applying Machine Learning Techniques in Air Quality Prediction—A Bucharest City Case Study. Sustainability 2023, 15, 8445. [Google Scholar] [CrossRef]
- Sasaki, Y.; Harada, K.; Yamasaki, S.; Onizuka, M. AIREX: Neural Network-based Approach for Air Quality Inference in Unmonitored Cities. In Proceedings of the IEEE International Conference on Mobile Data Management, Paphos, Cyprus, 6–9 June 2022. [Google Scholar]
- Gladkova, E.; Saychenko, L. Applying machine learning techniques in air quality prediction. Transp. Res. Procedia 2022, 63, 1999–2006. [Google Scholar] [CrossRef]
- Neo, E.X.; Hasikin, K.; Lai, K.W.; Mokhtar, M.I.; Azizan, M.M.; Hizaddin, H.F.; Razak, S.A. Artificial intelligence-assisted air quality monitoring for smart city management. PeerJ Comput. Sci. 2023, 9, e1306. [Google Scholar] [CrossRef]
- Kumbalaparambi, T.S.; Menon, R.; Radhakrishnan, V.P.; Nair, V.P. Assessment of urban air quality from Twitter communication using self-attention network and a multilayer classification model. Environ. Sci. Pollut. Res. 2023, 30, 10414–10425. [Google Scholar] [CrossRef]
- Chiang, Y.L.; Wang, J.C.; Lee, M.H.; Liu, A.C.; Jiang, J.A. Deep-Learning-Based Multi-Timestamp Multi-Location PM2.5 Prediction: Verification by Using a Mobile Monitoring System with an IoT Framework Deployed in the Urban Zone of a Metropolitan Area. IEEE Internet Things J. 2024, 11, 8815–8837. [Google Scholar] [CrossRef]
- Wang, Y.; Yuan, Q.; Li, T.; Zhu, L. Global spatiotemporal estimation of daily high-resolution surface carbon monoxide concentrations using Deep Forest. J. Clean. Prod. 2022, 350, 131500. [Google Scholar] [CrossRef]
- Huang, G.; Ge, C.; Xiong, T.; Song, S.; Yang, L.; Liu, B.; Yin, W.; Wu, C. Large scale air pollution prediction with deep convolutional networks. Sci. China Inf. Sci. 2021, 64, 192107. [Google Scholar] [CrossRef]
- Rojas-Domínguez, A.; Padierna, L.C.; Valadez, J.M.C.; Puga-Soberanes, H.J.; Fraire, H.J. Optimal Hyper-Parameter Tuning of SVM Classifiers With Application to Medical Diagnosis. IEEE Access 2017, 6, 7164–7176. [Google Scholar] [CrossRef]
- Li, R.; Liu, M.; Wang, C.; Dong, L.; Song, Y.; Zhang, J. Assessing the impact of clean air action on air quality trends in Beijing using a machine learning technique. Atmos. Envrion. 2020, 231, 117463. [Google Scholar]
- Wu, L.; An, J. Quantitative impacts of meteorology and emissions on the long-term trend of O3 in the YRD, China from 2015 to 2022. J. Environ. Sci. 2025, 149, 314–329. [Google Scholar] [CrossRef]
- Wu, J.; Xu, L.; Zheng, H.; Cao, X.; Lu, H. Spatiotemporal assessment of evapotranspiration of desert steppe in northern china: A case of otog front banner. In Proceedings of the International Geoscience and Remote Sensing Symposium (IGARSS), Brussels, Belgium, 11–16 July 2021. [Google Scholar]
- Ahamad, F.; Griffiths, P.T.; Latif, M.T.; Juneng, L.; Xiang, C.J. Ozone trends from two decades of ground level observation in Malaysia. Atmosphere 2020, 11, 755. [Google Scholar] [CrossRef]
- Cui, Y.; Zha, H.; Dang, Y.; Qiu, L.; He, Q.; Jiang, L. Spatio-Temporal Heterogeneous Impacts of the Drivers of NO2 Pollution in Chinese Cities: Based on Satellite Observation Data. Remote Sens. 2022, 14, 3487. [Google Scholar] [CrossRef]
- Ivanova, N.S.; Kruchenitskii, G.M.; Kuznetsova, I.N.; Demin, V.I.; Lapchenko, V.A. Ozone Content over the Russian Federation in the First Quarter of 2020. Russ. Meteorol. Hydrol. 2020, 45, 447–454. [Google Scholar] [CrossRef]
- Liu, H.; Song, D.; Huang, F.; Lu, C.; Zhang, X. A Study on the Characteristics of Spatial and Temporal Evolution of Ozone Pollution in Chengdu (2014–2016). In Proceedings of the IOP Conference Series: Earth and Environmental Science, Shanghai, China, 14–16 March 2019. [Google Scholar]
- Maidanovych, N.; Khlobystov, D. Assessment of atmospheric CO vertical column density over the Crimean Peninsula (Ukraine) by the TROPOMI instrument on the Sentinel-5 Precursor satellite. In Proceedings of the 17th International Conference Monitoring of Geological Processes and Ecological Condition of the Environment, Monitoring 2023, Kyiv, Ukraine, 7–10 November 2023. [Google Scholar]
- Li, H.; Shi, R.; Jin, S.; Wang, W.; Fan, R.; Zhang, Y.; Liu, B.; Zhao, P.; Gong, W.; Zhao, Y. Study of persistent haze pollution in winter over jinan (China) based on ground-based and satellite observations. Remote Sens. 2021, 13, 4862. [Google Scholar] [CrossRef]
Category 1: Dissimilar Keywords | Category 2: Similar Keywords |
---|---|
“Deep learning”, “Gradient boosting”, “Machine Learning”, “Time Series Analysis”, “Uncertainty estimation”, “PM2.5”, “Nitrogen dioxide (NO2)”, “Supervised learning” | “Air Quality Prediction”, “Convolutional Neural Networks (CNNs)”, “Environmental monitoring”, “Feature extraction”, “Feature selection”, “Long Short-Term Memory (LSTM)”, “Machine learning algorithms”, “Random forests”, “Support vector machines (SVM)” |
Subject Matter Expert Keywords |
---|
“Air quality index”, “Air quality indices”, “Air pollution”, “Air pollutants”, “Machine learning approaches”, “Machine learning models”, “Machine learning techniques”, “Machine-learning”, “Particulate Matter”, “Supervised Machine learning”, “Spatiotemporal analysis”, “Time series”, “Air monitoring” |
Author | Challenges/Limitations | Pollutants | Best Model | Performance Metrics | Performance Evaluation Results |
---|---|---|---|---|---|
Mathew, Gokul [40] | Data limitations | PM2.5 | HGBoost | R2, MAE, RMSE | HGBoost: R2 (85.90%), MAE (5.717 μg/m3), and RMSE (7.647 μg/m3). Comparative models: MR and k-NN. |
Xiong, Xie [41] | Uncertainties in data collection from multiple sources | O3 | LGBR-CHAP model | Coefficient (R), RMSE | R value of LGBM was 0.84. Comparative models: RF and eXtreme Gradient Boosting. |
Pandithurai, Bharathiraja [43] | Air quality | - | DT | Precision, recall, F1 score | Accuracy of DT: 99.88% |
Chang, Chang [49] | Data limitations | PM2.5 | BPNN | R2, RMSE, NSE | - |
Ly, Matsumi [50] | Data limitations | PM2.5 | RF and CWT | Averaging hourly concentration | - |
Sharma, Khurana [42] | Source of air pollution determination | - | hybrid (RF and XGBoost) model | Coefficient of determination (R2) values and lower MAE, MSE, RMSE | Hybrid model has R2 value of 95.43%. Comparative models using R2 value: RF (87.18%), DT (76.39%), LR (83.84%), XgBoost (86.86%). |
Niveshitha, Amsaad [45] | -Execution time -AQI forecast -Air quality prediction | Particulate matter, O3, NO, NOx, CO, benzene, toluene, SO2, xylene, NH3, NO2 | RF Regression | Evaluation metrics (MAE, MSE, RMSE, R2 score) and execution time | RF model had R2 score of 91.43%. Comparative model was DT (R2 score of 83.89%). |
Sanjeev [48] | Air pollutants prediction | - | RF algorithm | Recall, precision, F1, specificity | Accuracy of the RF was 99.4%. Comparative models’ accuracy: SVM (93.5%) and ANN (90.4%). |
Khan, Ellermann [34] | Emission from vehicles | NO2, PM2.5 | RF | RMSE, R2 | RMSE of RF was between 11.2 and 13.5. Comparative model’s RMSE were ANN (between 13.5 and 14.1), SVM (between 14.7 and 15.1). |
Park, Jeong [35] | Traffic CO2 | CO2 | RF | R2, RMSE | RF model had R2 value of 0.8 and RMSE of 22.9 ppm |
Meng, Hang [37] | Monitoring ground-based sulfate | Sulfate | RF | R2 | RF model had R2 value of 0.68 at daily level and 0.93 at the monthly level |
Kunnathettu and Varma [47] | Air quality prediction with hyper-parameter tuning | PM2.5 | LR and SVM | Recall, precision, F1 score, support value | Accuracy of LR was 88.12% and SVM with hyper-parameter tuning was 87.56%. Comparative models’ accuracy was NN (83.70%), RF (83.42%), SVM (53.31%). |
Mahesh Babu and Rene Beulah [44] | Air quality | DT | F1 score, precision, and recall | - | |
William, Paithankar [46] | Processing time and error rate | PM2.5 | RF | MAE, RMSE | RMSE of RF varies between 0.05 and 0.18. MAE of RF varies betweeen 6% and 18%. Comparative model were DT (MAE was between 8% and 21%) and RMSE was between 0.06 and 0.24. |
Author | Focus | Best Model | Performance Metrics | Performance Evaluation Results |
---|---|---|---|---|
Chakradhar Reddy, Nagarjuna Reddy [53] |
| DT | Accuracy measurement: precision, recall, sensitivity, specificity, F1 score, and accuracy | Accuracy: DT (100%). Comparative models and accuracy: LR (98), RF (99%), SVM (70%), K-NN (97%), Naïve Bayen (95%) |
Pant, Sharma [54] | Predicted AQI of pollutants (PM10, PM2.5, SO2, NO2) | DT classifier | F1 score, precision, and recall | DT (91.78%), comparative model was Logistic Regression (91.78%) |
Difaizi, Camille [55] | Prediction of AQI | k-NN | Confusion matrix, F1 score, precision, and recall | Accuracy: K-NN (100%). Comparative model: LR (98.8%), Adaboost (83%). |
Alam, Hussain [56] | Air quality index prediction for pollutants (PM1, PM10, PM2.5) | LightGBM and CatBoost | Classification: accuracy, precision, recall and F1 score Regression metrics: MSE, MAE, and R2 | Regression value: CatBoost (R2-score 85% on PM1 and 95% on PM2.5). Classification accuracy of LightGBM (99.75%), CatBoost (99.5%), K-NN (97.2%), NN (96%) |
Xiang, Fahad [58] | AQI | RF and probabilistic voting ensemble | R2, RMSE, and MAE | Comparative model: SLR, SVR, RF, and probabilistic voting ensemble |
Hardini, Chakim [62] |
| CNN | Cronbach’s Alpha and composite reliability score | The average variance extracted (AVE) showed a 0.5 threshold, indicating strong validity. |
Džaferović and Karaduzović-Hadžiabdić [59] |
| Ensemble technique, RF | Regression metrics, such as R-squared and RMSE | Ensemble technique with R2 value of 99% and RMSE of 2.30. RF with R2 score of 99% and RMSE value of 2.58. Comparative models include SVR, MLR, and Multilayer Perceptron. |
Author | Best-Performing ML Technique | Research Focus | Pollutant | Performance Metrics | Performance Evaluation Results |
---|---|---|---|---|---|
Wang, Yuan [81] | Deep Forest (DF) | Complex correlation between surface CO and multi-source data | CO | R and RMSE | DF had R/RMSE as (0.73/0.273 ppm and 0.77/0.215 ppm) at daily and monthly scales. Comparative models: Light Gradient Boosting Machine and deep neural network. |
Drewil and Al-Bahadili [14] | LSTM + GA | Improper hyper-parameter settings | PM10, PM2.5, CO, NOX | RMSE and MAE | RMSE value of LSTM+GA was 9.58. Comparative models’ RMSE: Bi-LSTM (22.58), C-LSTM (13.97), WLSTEM (40.67). |
Lin, Jin [69] | RNN and LSTM | Data-driven for non-dust PM10. Real-time measurement of local emissions is challenging. | PM10 | Real observations | - |
Sun, Li [68] | Hybrid deep learning model: multi-factor LSTM + DRL | Monitoring air quality from multiple sensor data emanating from multiple dimensions and locations | Hourly PM2.5 | Latency and use of accuracy (that is, RMSE, MAE, MAPE, R2) | MAPE value of DRL-LSTM was 32.45. Comparative models’ MAPE values were CNN-LSTM (90.43), NLSTM (72.67), XGBoost (79.09), ANN (42.39). |
Wang, McGibbon [70] | CNN + PALM | Spatial distribution of CO concentration | CO concentration | R2 and RMSE | High precision accuracy of (R2 > 0.8) |
Zhang, Duan [66] | Hybrid model (LSTM-SVR) | AQI prediction Air quality prediction | - | R2, RMSE, MAE | Hybrid LSTM-SVR model achieved the best R2 and RMSE value. Comparative models were the ensemble model (RF, XGBT, LGBM) and single models (K-NN, LR, SVR, LSTM). |
Neo, Hasikin [78] | LSTM | Air quality prediction in four urban cities in Malaysia (that is, Petaling Jaya, Banting, Klang, Shah Alam) | CO, O3, PM2.5, PM10. Wind direction and speed, humidity. | RSME and R2 | In predicting PM10 and PM2.5, the R2 values for LSTM in four cities were Banting (0.998), Petaling (0.995), Klang (0.918), Shah Alam (0.993). Comparative models were Ada Boost, SVR, RF, KNN, MLP Regressor. |
Wang, Yuan [81] | DF | Surface CO and multi-source data | Surface CO | R and RMSE | R and RMSE values for DF are 0.73 and 0.273 ppm for daily. While for the monthly scale, R and RMSE are 0.77 and 0.215 ppm, respectively. Comparative models are XGBoost, Light-GBM, RF, ERT, DNN. |
Chiang, Wang [80] | Deep-learning-based multi-timestamp multi-location (based on LSTM+GRU) | Minute-by-minute air quality forecasts | PM2.5 concentration levels | RMSE and accuracy | The RMSE and accuracy are 0.922 µg/m3 and 100% for the LSTM-based prediction model, and GRU-based predictive model had RMSE of 0.940 µg/m3 and accuracy of 95.7%. |
Gladkova and Saychenko [77] | LSTM | Forecasting the time series of PM2.5 concentration | PM2.5 concentrations | МSЕ and RМSЕ | LSTM had RMSE of 7.86. Comparative models are Prophet with RMSE of 12.25, ARIMA (12.46). |
Chang, Abimannan [72] | Hybrid model that exploits stacking ensemble learning model | 1 to 8 h air pollution forecasting developed on cloudbased big data platform (comprising Spark+Hadoop machine learning and TensorFlow-based deep learning) | PM2.5 and PM10 | Use of MAE and RMSE. Pearson correlation coefficient to find the correlation between the four models (GBT, SVR, LSTM, LSTM2). | - |
Authors | Observation Data | Study Periods | Computation Domains | Study Area | Major Conclusions |
---|---|---|---|---|---|
Wu and An [85] | Surface ozone of coastal cities | Short term, seasonal, and long term | Statistical analysis based on Kolmogorov–Zurbenko (KZ) filter | Yangtze River Delta | A decreasing spatial pattern was observed from the coastal cities towards the northwest, which were influenced by synoptic and monsoon conditions. Again, cities located at the same latitudes were significantly impacted by atmospheric transmission. |
Wu, Xu [86] | MOD16 products with ground observation data from eight (8) weather stations | 2011–2014 | Regression analysis | Otog Front Banner | The evapotranspiration space showed a decreasing trend from the southeast to the northwest. |
Ahamad, Griffiths [87] | Change in surface ozone | 20-year period (1997–2016) at four locations in western Peninsular Malaysia. | Trend and correlation analyses | Western Peninsular Malaysia | The oxides of nitrogen ratios (NO/NO2) had a significant inverse relationship with O3 at all stations. |
Cui, Zha [88] | Atmospheric nitrogen dioxide (NO2) pollution | Chinese cities from 2005 to 2020 | Geographically and temporally weighted regression model | Chinese cities | The population density and the ambient air pressure positively correlate with NO2 pollution. |
Ivanova, Kruchenitskii [89] | Ozone Monitoring Instrument (OMI) satellite equipment to observe surface ozone | First quarter of 2020 | - | Commonwealth of Independent States (CIS) and Balticcountries | Generalization of total O3 observation for each month of the first quarter of 2020 |
Liu, Song [90] | Spatial and temporal evolution of ozone pollution | 8-h average O3 concentrations from 2014 to 2016 observation data | Unimodal distribution | Chengdu | Ozone pollution in the west of Chengdu is more serious than in the east of Chengdu. |
Maidanovych and Khlobystov [91] | Atmospheric carbon monoxide (CO) | Sentinel-5 Precursor satellite for the period from January 2019 to July 2023 | TROPOMI instrument | Crimean Peninsula (Ukraine) | There was an increase in O3 concentration in the atmosphere due to heavy enemy military equipment and fires. |
Li, Shi [92] | Ground-based and satellite observation data, PM2.5 concentrations | 1 December 2020 to 12 January 2021 | Photochemical processes | Jinan area of China | The local pollution is often accompanied by the regional pollution during haze pollution events. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Agbehadji, I.E.; Obagbuwa, I.C. Systematic Review of Machine Learning and Deep Learning Techniques for Spatiotemporal Air Quality Prediction. Atmosphere 2024, 15, 1352. https://doi.org/10.3390/atmos15111352
Agbehadji IE, Obagbuwa IC. Systematic Review of Machine Learning and Deep Learning Techniques for Spatiotemporal Air Quality Prediction. Atmosphere. 2024; 15(11):1352. https://doi.org/10.3390/atmos15111352
Chicago/Turabian StyleAgbehadji, Israel Edem, and Ibidun Christiana Obagbuwa. 2024. "Systematic Review of Machine Learning and Deep Learning Techniques for Spatiotemporal Air Quality Prediction" Atmosphere 15, no. 11: 1352. https://doi.org/10.3390/atmos15111352
APA StyleAgbehadji, I. E., & Obagbuwa, I. C. (2024). Systematic Review of Machine Learning and Deep Learning Techniques for Spatiotemporal Air Quality Prediction. Atmosphere, 15(11), 1352. https://doi.org/10.3390/atmos15111352