Water-Quality Assessment and Pollution-Risk Early-Warning System Based on Web Crawler Technology and LSTM
Abstract
:1. Introduction
2. Data and Methods
2.1. Data Sources
2.2. The Philosophy of This System
2.2.1. Data Crawling
2.2.2. Database
2.2.3. Model Service
2.2.4. Client
2.3. Process of Data Collecting by the Web Crawler
2.3.1. Website Monitoring and Crawler Starting
2.3.2. Data Crawling
2.3.3. Data Cleaning and Storing
2.4. Model Services Development for Water Quality Assessment and Early Warnings Based on the LSTM Neural Network
2.4.1. Dataset Organization and Analysis
2.4.2. Model Developed by LSTM
2.4.3. Model Training and Testing
2.4.4. Model Services Deployment
2.5. Client Structures
2.5.1. Data Visualization Module
2.5.2. Water-Quality-Assessment Module
2.5.3. Water-Quality-Prediction and Early-Warning Module
3. Results and Discussion
3.1. Datasets That Were Crawled
3.1.1. The Spatial Distribution of Monitoring Stations
3.1.2. The Temporal Distribution of Monitoring Stations
3.2. LSTM Model Training and Validation
3.3. System Developments and Implementation
3.3.1. System Development Environment
3.3.2. Functions of the System
- (1)
- Overview of water environment monitoring data
- (2)
- Water analysis with with water quality evaluation and prediction
3.4. Effect of a Case of Pollution Warning
4. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Yin, Y.; Wang, H.; Gao, Y.; Li, X. Real-time monitoring and early warning of landslides at relocated Wushan Town, the Three Gorges Reservoir, China. Landslides 2010, 7, 339–349. [Google Scholar] [CrossRef]
- Bownik, A.; Wlodkowic, D. Advances in real-time monitoring of water quality using automated analysis of animal behaviour. Sci. Total Environ. 2021, 789, 147796. [Google Scholar] [CrossRef] [PubMed]
- Pavai, G.; Geetha, T.V. Improving the freshness of the search engines by a probabilistic approach based incremental crawler. Inf. Syst. Front. 2017, 19, 1013–1028. [Google Scholar] [CrossRef]
- Kumar, N.; Aggarwal, D. LEARNING-based Focused WEB Crawler. IETE J. Res. 2021, 1–9. [Google Scholar] [CrossRef]
- Velkumar, K.; Thendral, P. Web Crawler and Web Crawler Algorithms: A Perspective. Int. J. Eng. Adv. Technol. 2020, 9, 203–205. [Google Scholar] [CrossRef]
- Hayes, D.R.; Cappa, F.; Cardon, J. A Framework for More Effective Dark Web Marketplace Investigations. Information 2018, 9, 186. [Google Scholar] [CrossRef]
- Kumar, M.; Bindal, A.; Gautam, R.; Bhatia, R. Keyword query based focused Web crawler. Procedia Comput. Sci. 2018, 125, 584–590. [Google Scholar] [CrossRef]
- Wang, Z.; Ren, H.; Zhang, A.; Zhuang, D. Spatiotemporal Hotspots of Study Areas in Research of Gastric Cancer in China Based on Web-Crawled Literature. Int. J. Environ. Res. Public Health 2021, 18, 3997. [Google Scholar] [CrossRef]
- Chen, Z.; Zhang, H.; Liao, M. Integration Multi-Model to Evaluate the Impact of Surface Water Quality on City Sustainability: A Case from Maanshan City in China. Processes 2019, 7, 25. [Google Scholar] [CrossRef]
- Zhao, Y.; Song, Y.; Cui, J.; Gan, S.; Yang, X.; Wu, R.; Guo, P. Assessment of Water Quality Evolution in the Pearl River Estuary (South Guangzhou) from 2008 to 2017. Water 2020, 12, 59. [Google Scholar] [CrossRef] [Green Version]
- Qin, G.; Liu, J.; Xu, S.; Wang, T. Water Quality Assessment and Pollution Source Apportionment in a Highly Regulated River of Northeast China. Environ. Monit. Assess. 2020, 192, 1–16. [Google Scholar] [CrossRef]
- Huang, J.; Zhang, Y.; Bing, H.; Peng, J.; Dong, F.; Gao, J.; Arhonditsis, G.B. Characterizing the river water quality in China: Recent progress and on-going challenges. Water Res. 2021, 201, 117309. [Google Scholar] [CrossRef] [PubMed]
- Gordillo, G.; Morales-Hernández, M.; García-Navarro, P. Finite volume model for the simulation of 1D unsteady river flow and water quality based on the WASP. J. Hydroinform. 2020, 22, 327–345. [Google Scholar] [CrossRef]
- Priya, S.K.; Shenbagalakshmi, G.; Revathi, T. Design of smart sensors for real time drinking water quality monitoring and contamination detection in water distributed mains. Int. J. Eng. Technol. 2018, 7, 47–51. [Google Scholar] [CrossRef]
- Arndt, J.; Kirchner, J.S.; Jewell, K.S.; Schluesener, M.P.; Wick, A.; Ternes, T.A.; Duester, L. Making waves: Time for chemical surface water quality monitoring to catch up with its technical potential. Water Res. 2022, 213, 118168. [Google Scholar] [CrossRef] [PubMed]
- Ding, X.; Dong, X.; Hou, B.; Fan, G.; Zhang, X. Visual platform for water quality prediction and pre-warning of drinking water source area in the Three Gorges Reservoir Area. J. Clean. Prod. 2021, 309, 127398. [Google Scholar] [CrossRef]
- Pyo, J.C.; Kwon, Y.S.; Ahn, J.-H.; Baek, S.-S.; Kwon, Y.-H.; Cho, K.H. Sensitivity Analysis and Optimization of a Radiative Transfer Numerical Model for Turbid Lake Water. Remote Sens. 2021, 13, 709. [Google Scholar] [CrossRef]
- Hochreiter, S.; Urgen Schmidhuber, J. Long Shortterm Memory. Neural Comput. 1997, 9, 17351780. [Google Scholar] [CrossRef] [PubMed]
- Chimmula, V.K.R.; Zhang, L. Time series forecasting of COVID-19 transmission in Canada using LSTM networks. Chaos Solitons Fractals 2020, 135, 109864. [Google Scholar] [CrossRef]
- Bukhari, A.H.; Raja, M.A.Z.; Sulaiman, M.; Islam, S.; Shoaib, M.; Kumam, P. Fractional Neuro-Sequential ARFIMA-LSTM for Financial Market Forecasting. IEEE Access 2020, 8, 71326–71338. [Google Scholar] [CrossRef]
- Liu, P.; Wang, J.; Sangaiah, A.K.; Xie, Y.; Yin, X. Analysis and Prediction of Water Quality Using LSTM Deep Neural Networks in IoT Environment. Sustainability 2019, 11, 2058. [Google Scholar] [CrossRef]
- Xia, F.; Jiang, D.; Kong, L.; Zhou, Y.; Wei, J.; Ding, D.; Chen, Y.; Wang, G.; Deng, S. Prediction of Dichloroethene Concentration in the Groundwater of a Contaminated Site Using XGBoost and LSTM. Int. J. Environ. Res. Public Health 2022, 19, 9374. [Google Scholar] [CrossRef] [PubMed]
- Barzegar, R.; Aalami, M.T.; Adamowski, J. Short-term water quality variable prediction using a hybrid CNN–LSTM deep learning model. Stoch. Environ. Res. Risk Assess. 2020, 34, 415–433. [Google Scholar] [CrossRef]
- Lee, S.; Lee, D. Improved Prediction of Harmful Algal Blooms in Four Major South Korea’s Rivers Using Deep Learning Models. Int. J. Environ. Res. Public Health 2018, 15, 1322. [Google Scholar] [CrossRef]
- Akhter, F.; Siddiquei, H.R.; Alahi, E.E.; Jayasundera, K.P.; Mukhopadhyay, S.C. An IoT-Enabled Portable Water Quality Monitoring System With MWCNT/PDMS Multifunctional Sensor for Agricultural Applications. IEEE Internet Things J. 2022, 9, 14307–14316. [Google Scholar] [CrossRef]
- Tung, T.M.; Yaseen, Z.M. A survey on river water quality modelling using artificial intelligence models: 2000–2020. J. Hydrol. 2020, 585, 124670. [Google Scholar] [CrossRef]
- Ban, J.; Ling, B.; Huang, W.; Liu, X.; Peng, W.; Zhang, J. Spatiotemporal Variations in Water Flow and Quality in the Sanyang Wetland, China: Implications for Environmental Restoration. Sustainability 2021, 13, 4637. [Google Scholar] [CrossRef]
- Jia, Z.; Chang, X.; Duan, T.; Wang, X.; Wei, T.; Li, Y. Water quality responses to rainfall and surrounding land uses in urban lakes. J. Environ. Manag. 2021, 298, 113514. [Google Scholar] [CrossRef]
- Wang, Y.; Guo, Y.; Zhao, Y.; Wang, L.; Chen, Y.; Yang, L. Spatiotemporal heterogeneities and driving factors of water quality and trophic state of a typical urban shallow lake (Taihu, China). Environ. Sci. Pollut. Res. 2022, 29, 53831–53843. [Google Scholar] [CrossRef]
- Li, L.; Jiang, P.; Xu, H.; Lin, G.; Guo, D.; Wu, H. Water quality prediction based on recurrent neural network and improved evidence theory: A case study of Qiantang River, China. Environ. Sci. Pollut. Res. 2019, 26, 19879–19896. [Google Scholar] [CrossRef]
- Li, R. Water quality forecasting of Haihe River based on improved fuzzy time series model. Desalination Water Treat. 2018, 106, 285–291. [Google Scholar] [CrossRef]
- Ritter, A.; Muñoz-Carpena, R. Performance evaluation of hydrological models: Statistical significance for reducing subjectivity in goodness-of-fit assessments. J. Hydrol. 2013, 480, 33–45. [Google Scholar] [CrossRef]
- Wang, X.; Yin, Z.-Y. Using GIS to assess the relationship between land use and water quality at a watershed level. Environ. Int. 1997, 23, 103–114. [Google Scholar] [CrossRef]
- Cherchi, C.; Badruzzaman, M.; Oppenheimer, J.; Bros, C.M.; Jacangelo, J.G. Energy and water quality management systems for water utility’s operations: A review. J. Environ. Manag. 2015, 153, 108–120. [Google Scholar] [CrossRef] [PubMed]
- Wang, Y.; Li, Q.; Zhang, W.; Hu, S.; Peng, H. The architecture and application of an automatic operational model system for basin scale water environment management and design making supporting. J. Environ. Manag. 2021, 290, 112577. [Google Scholar] [CrossRef] [PubMed]
ID | Tables in the Database | Functions |
---|---|---|
1 | Hydrology and water quality station information table | Store the names and geographical coordinates of hydrology and water quality stations in China |
2 | Hydrology station table | Store the names of the hydrology sites crawled |
3 | Water quality site table | Store the names of the water quality sites crawled |
4 | Hydrological data table | Store the hydrological data crawled |
5 | Water quality data table | Store the water quality data crawled |
Category | Variable | Indexes Name | Time Range | Units |
---|---|---|---|---|
Hydrological monitoring data | WL | Water level | Since 2005 | m |
WWL | Warning water level | m | ||
Q | Flow | m³/s | ||
Water-quality-monitoring data | Tub | Turbidity of water | Since 2000 | NTU |
CODMN | Permanganate index | mg/L | ||
Ammonia nitrogen index | mg/L | |||
TP | Total phosphorus index | mg/L | ||
TN | Total nitrogen index | mg/L | ||
PH | Pondus Hydrogenii | - | ||
CHL | Chlorophyll α content | mg/L | ||
CA | Algal density of water | cells/L | ||
BOD5 | Biochemical oxygen demand | mg/L | ||
DO | Dissolved oxygen | mg/L | ||
WT | Water temperature | °C | ||
WQC | Water quality classification | - |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Guan, G.; Wang, Y.; Yang, L.; Yue, J.; Li, Q.; Lin, J.; Liu, Q. Water-Quality Assessment and Pollution-Risk Early-Warning System Based on Web Crawler Technology and LSTM. Int. J. Environ. Res. Public Health 2022, 19, 11818. https://doi.org/10.3390/ijerph191811818
Guan G, Wang Y, Yang L, Yue J, Li Q, Lin J, Liu Q. Water-Quality Assessment and Pollution-Risk Early-Warning System Based on Web Crawler Technology and LSTM. International Journal of Environmental Research and Public Health. 2022; 19(18):11818. https://doi.org/10.3390/ijerph191811818
Chicago/Turabian StyleGuan, Guoliang, Yonggui Wang, Ling Yang, Jinzhao Yue, Qiang Li, Jianyun Lin, and Qiang Liu. 2022. "Water-Quality Assessment and Pollution-Risk Early-Warning System Based on Web Crawler Technology and LSTM" International Journal of Environmental Research and Public Health 19, no. 18: 11818. https://doi.org/10.3390/ijerph191811818
APA StyleGuan, G., Wang, Y., Yang, L., Yue, J., Li, Q., Lin, J., & Liu, Q. (2022). Water-Quality Assessment and Pollution-Risk Early-Warning System Based on Web Crawler Technology and LSTM. International Journal of Environmental Research and Public Health, 19(18), 11818. https://doi.org/10.3390/ijerph191811818