1. Introduction
Visibility measures the maximum horizontal distance at which a person with normal eyesight can recognize the outline of a target [
1], which is an important indicator that reflects the atmosphere’s transparency and air quality. It is a conventional element in meteorological observation. Low-visibility weather seriously affects the safe operation of aviation, transportation, and power systems, and threatens human health because of its association with undiluted pollutant particles and toxic impurities [
2,
3,
4,
5]. Accurate visibility forecasting can provide the public with better travel plans and effectively reduce property losses and human casualties [
6]. Therefore, improving the visibility forecasting capability is a significant technical challenge for many weather forecasters and scholars.
The application of deep learning methods is becoming increasingly widespread across various fields, including model design, data preprocessing, algorithm improvement, and theoretical exploration [
7,
8,
9,
10]. Previous works have applied deep learning methods to the field of visibility forecasting. For example, Tang et al. [
11] used the SARIMA and long short-term memory (LSTM) neural network models to predict visibility in China, both of which performed well, and the prediction projected better visibility in China in the future. Duddu et al. [
12] developed a back-propagation neural network model to predict fog or low-visibility weather, and the model showed a very high predictive capability. Chaabani et al. [
13] proposed a visibility distance prediction method based on shallow neural networks. For airport visibility classification and the analysis of factors affecting visibility, Liu et al. [
14] proposed a deep ensemble model containing two popular convolutional neural network (CNN) models and reported accuracy levels reaching 87.64%. Ortega et al. [
15] used the multi-layer perceptron (MLP), traditional CNN, fully CNN, multi-input CNN, and LSTM models to forecast visibility.
The above studies have shown that deep learning methods can be successfully applied in visibility forecasting. However, these models (i.e., LSTM, MLP, and CNN) cannot simultaneously provide good forecasts for the spatial and temporal variability of visibility. For example, LSTM can suffer from the problem of gradient vanishing or explosion and has a limited memory length [
16]. Meanwhile, MLP and CNN are not proficient in processing time series and cannot capture the temporal or spatial relationships between data points [
17,
18,
19].
To better apply deep learning technology to visibility forecasting, Peláez-Rodríguez et al. [
20] proposed and discussed different deep learning ensemble architectures for low-visibility forecasting. They found that the ensemble models and meteorological-based methods, which combined multiple deep learning architectures, achieved a better forecasting accuracy than the individual deep learning models. For the forecasting of low-visibility conditions, Peláez-Rodríguez et al. [
21] proposed an iterative forward selection algorithm based on evolutionary algorithms, which was applied to determine the optimal variables and nodes in a region for each regressor model. Differential evolution and particle swarm optimization have been used as optimization algorithms, producing an improvement of up to 17.3% concerning the baseline databases. Ortega et al. [
15] developed deep learning models based on climate series data for single-step visibility forecasting. Using data from two weather stations in Florida, USA, they developed, trained, and tested five deep learning models. However, previous studies on visibility forecasting based on deep learning mainly utilized meteorological data from stations or local video images. Whilst these models have shown good applicability for visibility at a single station or in a specific area where camera observations were available, they tended to be limited in capturing the spatial distribution and visibility patterns across larger regional scales.
The Unet architecture has been widely applied in medical image segmentation [
22,
23,
24] and has promising applications in meteorological research, such as for visibility, oceanic variables, and radar-based precipitation forecasting [
25,
26,
27,
28]. To further deepen the network depth and improve model performance, Qin et al. [
29] designed the U
2net architecture by incorporating a two-level nested U-structure, which deepened the overall depth of the network architecture without significantly increasing the memory and computational cost. Compared to Unet, U
2net has more hierarchical structures and parameters, which can help improve the feature representation capability and application accuracy [
30,
31].
Guizhou lies on a slope from the Tibetan Plateau to the hilly areas of eastern China, and the karsts within its borders make it one of the most prone provinces to meteorological disasters, owing to the significant climatic variations and complex weather changes in different parts of the country. Guizhou is the only province in China with no plains and its topography varies greatly, with a distribution characterized by a “high in the west and low in the east” pattern (
Figure 1). The conditions under which low visibility occurs in Guizhou vary, with frontal fog in the central–western and high-altitude regions and radiation fog in the eastern and northern parts of the province, which makes it challenging to forecast visibility in the province. Low-visibility weather is typical in Guizhou and has drawn considerable attention from the local public. Therefore, the region’s meteorological departments have placed particular emphasis on monitoring and forecasting low-visibility weather. However, Guizhou Province only has 84 national meteorological stations to monitor visibility, which is insufficient to comprehensively represent the visibility conditions across the province. In order to enable more detailed visibility forecasting for Guizhou, this study utilized the gridded observational data and the CMA-GD model output obtained from the China Meteorological Administration (CMA) Information Center to construct three visibility datasets to examine the impact of input data on the visibility forecasting capability. More specifically, to improve visibility forecasting in Guizhou, the Unet and U
2net architectures were applied to establish visibility forecasting models for Guizhou Province based on multi-variable datasets. The threat score (TS), bias score (BS), and the root-mean-square error (RMSE) were used to evaluate the performance of visibility forecasting models.
3. Datasets and Visibility Forecast Models
In order to increase the sample size, the missing data were estimated using neighboring points, which is a widely used approach in weather operations. The datasets were normalized by rescaling the data from 0 to 1 for the practical training of the deep learning models.
3.1. Unet-Based Visibility Forecasting Model Using Observational and Model Visibility Data
Based on the CLDAS and CMA-GD data, a visibility dataset (dataset I) was built using an input–output mapping method of “24 frames → 24 frames”, with data elements being visibility only. Then, a Unet-based visibility forecasting model (Unet_VToV) for Guizhou Province was trained and established, providing hourly forecasts at 0 to 72 h lead times with a horizontal resolution of 3 km.
3.2. Unet-Based Visibility Forecasting Model Using Multiple Observational Meteorological Variables
The CLDAS-merged multiple-meteorological-variable dataset (dataset II) included the following grid-based observational data variables: visibility, precipitation, 10 m wind direction and speed, and 2 m humidity and temperature. It is important to note that this dataset did not include any visibility forecast data from the CMA-GD model. We constructed a dataset using a “24 frames → 24 frames” input–output mapping approach. Then, the Unet architecture parameters were tuned and optimized to establish an hourly visibility forecasting model (Unet_NVToV) for Guizhou Province, covering a 0–72 h forecast range.
3.3. Unet- and U2net-Based Visibility Forecasting Model Using Multiple Observational Meteorological Variables and CMA-GD Visibility
Using the merged multiple meteorological variable dataset mentioned in
Section 3.2 and the visibility forecast data from the CMA-GD model (dataset III), a Unet-based visibility forecasting model (Unet_PVToV) was established using the “24 frames → 24 frames” input–output mapping approach to provide hourly forecasts for Guizhou Province, covering a 0–72 h forecast range. Furthermore, a nested network architecture was introduced to construct a visibility forecasting model using the U
2net architecture (U
2net_PVToV).
4. Evaluation of Visibility Forecasting Models
The Unet-based visibility forecasting models were established using the three datasets (datasets I–III) mentioned in
Section 3.1,
Section 3.2 and
Section 3.3, respectively. Each dataset was divided into training, validation, and testing with a 16:1:9 ratio. The training set covered the period from 1 January 2022 00:00 to 31 December 2022 23:00, resulting in 8784 samples. The testing set covered 1 January 2023 00:00 to 30 June 2023 23:00, generating 4344 samples for the model evaluation. The three datasets were input variables, and visibility was the output (Step1 and Step2 in
Figure 4). The Unet and U
2net architectures performed hourly visibility forecasts with 0–72 h lead times (Step3 in
Figure 4). In addition, each hourly forecast of the visibility corresponded to a separate forecast model. The TS and BS were used to comprehensively evaluate and compare the performance of the different forecasting models, considering both grid-based and station-based assessments (Step4 in
Figure 4). For the station-based evaluation of the models, we used the observed visibility values from the weather stations and the forecast values from the model grid closest to the station. The final scores for each model were calculated as the average of the 08:00 and 20:00 forecast performances. The visibility forecasting models were evaluated across multiple visibility classifications: ≤200 m, ≤500 m, ≤1000 m, and ≤3000 m. The verification was conducted for the 0–72 h forecasts with a 3 h interval.
4.1. Evaluation of the Unet-Based Visibility Forecasting Model
4.1.1. Grid-Based Assessments of Unet-Based Models
Figure 5 shows the gridded distributions of the TS of the Unet-based (Unet_VToV, Unet_NVToV, and Unet_PVToV) and CMA_GD visibility forecasting models. Unet_VToV had a higher TS (dark green lines in
Figure 5) than CMA_GD (red lines) in the visibility classifications of ≤200 m and ≤500 m for the predictions at 24–30 h and 54–63 h lead times. Conversely, for the ≤1000 m and ≤3000 m visibility classifications, the TS of Unet_VToV was generally lower than CMA_GD, with a maximum decrease of 0.02. The results indicated that the forecasting performance of Unet_VToV was inferior to the original CMA_GD model.
Unet_NVToV (light green lines) had a higher TS in the ≤200 m, ≤500 m, and ≤1000 m visibility classifications, outperforming Unet_VToV and surpassing CMA_GD in most of the predictions. In the ≤3000 m visibility classification, the TS of Unet_NVToV was higher than that of Unet_VToV, but 0.02 lower than that of CMA_GD.
For the Unet_PVToV model (yellow lines), there was a significant improvement in the TS for all visibility classifications. The TSs for nearly all forecast ranges were higher than those of Unet_VToV, Unet_NVToV, and CMA_GD, with the maximum TS reaching 0.25 for the ≤200 m visibility classification (
Figure 5a). Note that the TS improvements in Unet_PVToV were less stable for the ≤200 m visibility classification, with only a few forecasts showing substantial improvements, indicating a relatively lower model stability in heavy-fog forecasting.
Figure 6 shows the BSs of the visibility forecasting models. Unet_VToV and Unet_NVToV had much lower BSs than CMA_GD for all visibility classifications, with the best performance seen for visibility below 200 m. Again, Unet_PVToV had a lower BS than CMA_GD did, which indicated that Unet_PVToV had smaller forecast errors. Especially in the ≤200 m visibility classification, the average BS was reduced by 20. However, Unet_PVToV had some high BSs for predictions with a lead time of less than 27 h, which suggested some significant errors during this period. In general, the improvement in the BS of Unet_PVToV was not as substantial as the improvements found in the TS evaluation. While Unet_PVToV showed a significant improvement in forecast accuracy, as shown by the TS, the larger BS suggested that Unet_PVToV still struggled with specific forecast periods, with some more significant errors during these periods.
4.1.2. Station-Based Assessments of Unet-Based Models
Figure 7 shows the TSs of the visibility forecasting models at all the weather stations in Guizhou Province. Unet_VToV had higher TSs than CMA_GD in most of the predictions for the visibility classifications of ≤200 m, ≤500 m, and ≤1000 m. However, it had lower TSs than CMA_GD in the 27–33 h and 66–72 h forecasts. For the ≤3000 m visibility classification, the TSs of Unet_VToV were not uniformly larger than those of CMA_GD, with notable decreases compared to CMA_GD in the 6–15-h, 27–36-h, and 51–60 h forecasts. Unet_NVToV had larger TSs than Unet_VToV, with a maximum increase of 0.03, outperforming CMA_GD. However, within the first 15 h of the ≤3000 m visibility condition, Unet_NVToV had a lower TS than Unet_VToV and CMA_GD. Unet_PVToV had higher station-based TSs than Unet_NVToV (maximum increase of 0.06) and outperformed Unet_VToV_Sta. Unet_PVToV effectively enhanced the model performance and increased the TSs of forecasts with a lead time of less than 15 h.
Figure 8 shows the BSs of the visibility forecasting models. The BS evaluations of Unet_VToV and Unet_NVToV at weather stations were similar to those of the grid-based evaluation. The BSs of Unet_VToV and Unet_NVToV were significantly lower than those of CMA_GD for all visibility classifications, with a maximum reduction of approximately 3.5. The BS of Unet_PVToV at stations was similar to that of Unet_PVToV over the model grid, which had a notable reduction compared to CMA_GD. However, the improvement for Unet_PVToV was less substantial than that found in Unet_VToV and Unet_NVToV. Moreover, Unet_PVToV had a high BS for all visibility classifications for the forecasts with a lead time of less than 27 h, which indicated significant forecast errors during this period.
The evaluations on model grids and at weather stations showed that the TS of visibility forecasts increased by adopting the Unet-based model and including more meteorological variables. Unet_PVToV achieved considerably higher TSs than the other models for the ≤500 m, ≤1000 m, and ≤3000 m visibility classifications, which demonstrated good forecasting capabilities. However, the TS improvement for Unet_PVToV was less pronounced for the ≤200 m visibility classification, where the model was less robust. Moreover, the BS evaluation indicated significant forecast errors within the 72 h forecast range. Overall, the model incorporating the multi-variable physical quantities, including visibility, performed better.
4.2. Evaluation of the U2net-Based Visibility Forecasting Model
To further enhance the TS and decrease the BS of the Unet-based forecasting models, a model based on the U2net architecture using the multi-variable physical quantity dataset with CMA_GD visibility (dataset III) was constructed and evaluated.
4.2.1. Grid-Based Assessments
Figure 9 shows the TSs of the Unet- and U
2net-based visibility forecasting models and the CMA_GD model. By introducing the nested U
2net model architecture, the U
2net_PVToV model showed a 0.06 higher TS than Unet_PVToV in the ≤500 m, ≤1000 m, and ≤3000 m visibility classifications. U
2net_PVToV had a much higher TS than CMA_GD at various lead times. Furthermore, for the ≤200 m visibility classification, U
2net_PVToV_Grid had higher TSs than CMA_G and Unet_PVToV. U
2net_PVToV was more robust, with a substantial improvement in the TS. Compared to LSTM_Attention_Grid, the TS of LSTM_Attention_Grid was better than that of U
2net_PVTo_V_Grid at individual times, but, overall, the TS of U
2net_PVTo_V_Grid performed a little better, with better model stability.
As for the BS of the visibility forecast models, shown in
Figure 10, we could see that U
2net_PVToV significantly decreased the forecast errors within the 72 h prediction found in Unet_PVToV. U
2net_PVToV had an overall low BS (<0.2) for all visibility classifications, which constituted a notable reduction in the BS compared to Unet_PVToV and CMA_GD, with an average decrease of 0.3 to 1.0. The BS of LSTM_Attention_Grid was close to that of U
2net_PVToV, but the BS of U
2net_PVToV performed better in the ≤500 m and ≤1000 m visibility classifications. The RMSE results showed a significant improvement for U
2net_PVToV over Unet_PVToV and CMA_GD, and an overall reduction in RMSE over LSTM_Attention (
Figure 11). These results demonstrated that the U
2net-based model could increase the TS of visibility forecasts and reduce forecast errors, thereby strengthening the overall stability of the model.
To further illustrate the forecasting skill improvements by introducing the U
2net architecture, the spatial distributions of the TS for the models in Guizhou Province are presented in
Figure 12. As can be seen, CMA_GD had a TS ranging from 0.01 to 0.2, with the majority of the area having scores below 0.1, which indicated a poor forecasting performance of this model. Compared to CMA_GD, Unet_PVToV showed notable improvements in the ≤200 m visibility classification, with the TS reaching the range of 0.1 to 0.7. For the ≤500 m and ≤3000 m visibility classifications, Unet_PVToV produced an increase in the TS over some areas of Guizhou, reaching 0.1 to 0.2. There was no significant difference in the TS between CMA_GD and Unet_PVToV for the ≤1000 m visibility classification. In contrast, U
2net_PVToV demonstrated substantial TS improvements for all visibility classifications. For the ≤200 m visibility classification, U
2net_PVToV had a TS approximately three times higher than that of CMA_GD. The TS of U
2net_PVToV increased by a factor of three over CMA_GD, and its spatial coverage with a TS reaching 0.5 was wider than that of Unet_PVToV_Grid. For the other visibility classifications, U
2net_PVToV also had higher TSs than CMA_GD and Unet_PVToV, especially for the ≤1000 m visibility classification, where the forecast performed better than that of Unet_PVToV.
In summary, introducing the nested U2net architecture substantially improved the TS of the visibility forecasting model. The TS of U2net_PVToV was able to reach 0.5–0.7 for the ≤200 m visibility classification, which indicated that the U2net-based model could better forecast low-visibility weather conditions. U2net_PVToV also reduced the significant forecast errors within the 72 h prediction found in the Unet-based model. Moreover, the U2net-based model significantly reduced the BS, diminishing the overall forecast errors and improving the model’s stability. U2net_PVToV could provide more reliable visibility forecast products in Guizhou Province, especially in areas lacking ground-based visibility observations.
4.2.2. Station-Based Assessments
The evaluations of the models at stations were similar to those over model grids. The station-averaged TS (BS, RMSE) was higher (lower) at various prediction times in U
2net_PVToV than Unet_PVToV and CMA_GD. To assess the model performance over different areas, the TS values for the different models at each weather station are presented in
Figure 13. CMA_GD had a low TS, ranging from 0.01 to 0.05, across Guizhou Province, with most stations experiencing a TS below 0.03. This indicated a generally poor forecast performance of CMA_GD. For Unet_PVToV, the TS was higher in the eastern, southern, and northwestern parts of Guizhou for the ≤500 m, ≤1000 m, and ≤3000 m visibility classifications than for CMA_GD. However, in the ≤200 m visibility range, improvements were less pronounced. U
2net_PVToV had a significantly higher TS for all visibility classifications. Especially in the ≤3000 m visibility classification, the TS tripled compared to CMA_GD at most stations. The areas with a higher TS were in Guizhou’s southern, northern, and northwestern parts. It is worth noting that the improvements in TS for the low-visibility (≤200 m) classifications at stations were not as extensive as the evaluations over model grids (
Figure 12).
The visibility forecasting model based on the U2net network architecture had significantly higher TSs and lower forecast errors. Moreover, the U2net-based model outperformed the Unet-based and CMA_GD models at various lead times and over different areas of Guizhou. While the U2net-based model delivered substantial improvements in the station-based visibility forecast evaluations, there was still room for further enhancements, particularly under low-visibility conditions at the stations.
4.3. Overall Evaluation of TS and BS
An overall evaluation of the models’ TSs taken over model grids and at weather stations is summarized in
Table 1 and in
Table 2 for the BSs. These comprehensive TS and BS evaluations highlighted the stepwise advancements in the forecasting capabilities achieved through the progressive refinements in the dataset compositions and the model architectures. Unet_PVToV significantly improved the skill scores compared to CMA_GD, with an average tripling of the TS for different visibility classifications. By introducing the nested U
2net architecture and increasing the model depth, U
2net_PVToV further increased the TS, with a doubling over Unet_PVToV, and all classification intervals had higher TSs than LSTM_Attention. Moreover, the model evaluations over model grids were better than those at weather stations.
Unet_PVToV had higher TSs than Unet_VTOV and Unet_NVTOV. The higher BS of Unet_PVToV indicated that the model became less stable when the CMA_GD output was included. U2net_PVToV increased the TS and decreased the BS, which resulted in more robust and consistent visibility forecasting capabilities.
5. Conclusions
This study established visibility forecast models for Guizhou Province by utilizing the Unet and U2net architectures based on three datasets. Model performances were evaluated using the temporal evolution and spatial distribution of the TSs and BSs of forecasts with lead times of 0–72 h over model grids and at weather stations. The key findings were as follows:
The Unet-based visibility forecasting model using the multi-variable physical quantity dataset could significantly increase the TS compared to the CMA-GD model, with a more than threefold increase. This approach also enhanced the forecast stability. However, the Unet-based models had larger BSs than the CMA-GD model.
The nested U2net architecture, which deepened the neural network structure, could strengthen the model’s ability to extract physical field features and increase the TS, achieving a more than sixfold increase over the CMA-GD model. In addition, by introducing the U2net architecture, the model further improved the TS by approximately a factor of two compared to the Unet model, and significantly reduced the BS. In particular, in terms of the TS, the U2net-based model performed the best at the ≤200 m visibility threshold, with a more than eightfold increase over the CMA-GD model. The spatial distribution of the TS showed that the U2net-based model performed better at the model grid scale than at individual weather stations. Compared to the Unet-based and LSTM_Attention model, the U2net-based model lowered the overall BS and RMSE, reducing significant prediction errors. The U2net-based model could improve the accuracy and stability of the visibility forecast model. It was the best model we ever built for predicting visibility.