Next Article in Journal
Domain Adaptation for Satellite-Borne Multispectral Cloud Detection
Next Article in Special Issue
TE-LSTM: A Prediction Model for Temperature Based on Multivariate Time Series Data
Previous Article in Journal
Remotely Piloted Aircraft for Evaluating the Impact of Frost in Coffee Plants: Interactions between Plant Age and Topography
Previous Article in Special Issue
Advancing Regional–Scale Spatio–Temporal Dynamics of FFCO2 Emissions in Great Bay Area
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Enhancing Significant Wave Height Retrieval with FY-3E GNSS-R Data: A Comparative Analysis of Deep Learning Models

1
School of Computer Science and Technology, National University of Defense Technology, Changsha 410000, China
2
School of Meteorology and Oceanography, National University of Defense Technology, Changsha 410000, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2024, 16(18), 3468; https://doi.org/10.3390/rs16183468
Submission received: 19 July 2024 / Revised: 14 September 2024 / Accepted: 15 September 2024 / Published: 18 September 2024

Abstract

:
Significant Wave Height (SWH) is a crucial parameter in oceanographic research, essential for understanding various marine and atmospheric processes. Traditional methods for obtaining SWH, such as ship-based and buoy measurements, face limitations like limited spatial coverage and high operational costs. With the advancement of Global Navigation Satellite Systems reflectometry (GNSS-R) technology, a new method for retrieving SWH has emerged, demonstrating promising results. This study utilizes Radio occultation sounder (GNOS) data from the FY-3E satellite and incorporates the latest Vision Transformer (ViT) technology to investigate GNSS-R-based SWH retrieval. We designed and evaluated various deep learning models, including ANN-Wave, CNN-Wave, Hybrid-Wave, Trans-Wave, and ViT-Wave. Through comparative training using ERA5 data, the ViT-Wave model was identified as the optimal retrieval model. The ViT-Wave model achieved a Root Mean Square Error (RMSE) accuracy of 0.4052 m and Mean Absolute Error (MAE) accuracy of 0.2700 m, significantly outperforming both traditional methods and newer deep learning approaches utilizing Cyclone Global Navigation Satellite Systems (CYGNSS) data. These results underscore the potential of integrating GNSS-R technology with advanced deep-learning models to enhance SWH retrieval accuracy and reliability in oceanographic research.

1. Introduction

The retrieval of Significant Wave Height (SWH) is crucial in meteorology and oceanography as it serves as a fundamental parameter for assessing ocean wave conditions. Accurate SWH measurements are Vital for understanding the sea state, which directly influences marine navigation, offshore operations, and coastal management. Utilizing remote sensing techniques to retrieve SWH allows for high-resolution, wide-coverage wave data collection, essential for improving weather forecasts, monitoring marine environments, and studying climate change impacts. Moreover, SWH data play a significant role in validating and calibrating wave models, which are integral to predicting extreme weather events and managing coastal risks [1,2,3,4,5]. Traditional methods for retrieving SWH, such as using altimeter satellites, buoys, and ship-based observations, face several challenges, including computational complexity, data acquisition difficulties, and platform-specific limitations. Conventionally, the primary methodology for measuring Significant Wave Height (SWH) utilizes altimeter satellites, including TOPEX/Poseidon, Jason-1, Jason-2, and Sentinel-3. While these satellites provide extensive global coverage, their data collection is subject to temporal and spatial discontinuities, which stem from their specific orbital configurations. The advent of GNSS-R technology introduces a novel observational approach for SWH measurement, potentially mitigating some of the limitations faced by traditional altimetric methods [6,7,8]. Buoys, although highly accurate and capable of providing real-time data, are limited by geographic distribution and are susceptible to damage from severe weather, necessitating regular maintenance, which is costly and logistically challenging [9,10]. Ship-based observations, while valuable for direct measurements, are infrequent, geographically constrained, and costly, often restricted to specific routes or missions. In the context of Voluntary Observing Ship (VOS) data, the quality of data can be suboptimal, which may stem from the inherent limitations and challenges of the data collection process itself [11,12]. Furthermore, the calibration and validation of SWH data require continuous efforts and integration of various datasets, adding to the overall complexity and resource requirements [9,10].
GNSS-Reflectometry (GNSS-R) is an innovative remote sensing technique that leverages reflected signals from Global Navigation Satellite Systems (GNSS) to extract various environmental parameters, including surface wind speeds, ice extent, soil moisture, and SWH [13,14,15,16,17,18,19]. By processing Delay-Doppler Maps (DDMs) to correlate surface roughness with wave height, GNSS-R offers a compelling alternative to traditional SWH retrieval methods. Compared to traditional methods, GNSS-R has several advantages in SWH retrieval. It overcomes the limitations of buoys, which are restricted by geographic distribution and susceptibility to severe weather damage, and ship-based observations, which are infrequent and geographically constrained. Additionally, GNSS-R data collection is less affected by adverse weather conditions and does not require complex and resource-intensive data processing algorithms [13]. The spaceborne GNSS-R research commenced with the TechDemoSat-1 (TDS-1) mission, launched by the UK Space Agency (UKSA), which carried the SGR-ReSI payload to demonstrate GNSS-R’s feasibility for environmental monitoring [7]. This pioneering mission paved the way for subsequent advancements, notably the Cyclone Global Navigation Satellite System (CYGNSS), launched by NASA in December 2016. Comprising eight microsatellites, CYGNSS provides frequent and comprehensive measurements. Initially scheduled to terminate on 30 September 2023, the mission has been extended due to its excellent operational performance and high-quality data products. The extension of CYGNSS’s mission underscores the significant potential and broad applicability of GNSS-R technology. This technique not only enhances our ability to monitor and understand oceanic and atmospheric conditions but also promises a robust future for environmental remote sensing [19]. By integrating GNSS-R capabilities with advanced machine learning models, researchers can achieve unprecedented accuracy in SWH retrieval, contributing significantly to marine navigation, weather forecasting, and climate research. The continued operation of CYGNSS beyond its planned termination date highlights its importance and efficacy in environmental monitoring, affirming the broad and meaningful impact of GNSS-R technology.
Studies by Ruf et al. [20] have demonstrated strong correlations between CYGNSS-derived oceanographic parameters measurements and buoy data, validating the approach for SWH retrieval. Additionally, Clarizia et al. [21] have significantly improved noise reduction in DDMs, enhancing the precision of SWH measurements. In addition to these data analysis methods, machine learning techniques have also been widely applied to SWH retrieval. These techniques have notably enhanced the capabilities of CYGNSS in retrieving various environmental parameters. For instance, neural networks have been applied to retrieve sea surface wind speeds with high accuracy. Methods developed by Stopa and Cheung [7] and Quach et al. [22] utilize large datasets of co-located buoy and satellite data to train models that accurately retrieve wind speeds based on GNSS-R signal characteristics. Neural networks trained on extensive datasets of the co-located buoy and satellite data have modeled the relationship between reflected signal characteristics and SWH with high precision. For example, Morris et al. [23] and Gleason et al. [24] used neural networks to retrieve SWH from CYGNSS data, demonstrating significant improvements in accuracy. Furthermore, deep learning approaches, such as those explored by Li et al. [25], have shown promise in extracting SWH from GNSS-R data, leveraging the ability of deep neural networks to handle complex and non-linear relationships. Recent studies, such as those by Patanè et al., have proposed the use of LSTM-based estimation models [26], while Bu et al. combined ERA5 data with CNN networks for SWH retrieval research [27].
Despite these advancements, there are still limitations in using machine learning for SWH retrieval compared to wind speed retrieval. It has been noted in several studies that the models employed for SWH often exhibit less sophistication and the datasets used are relatively sparse [25,28]. Research on SWH retrieval lags behind that of wind speed, where more advanced models like transformers have been applied. For instance, the use of hybrid transformer networks and ConvLSTM models in wind speed forecasting has shown significant improvements in accuracy and prediction horizons [29,30]. In contrast, SWH models tend to rely on older, simpler architectures, which may not capture the complex dynamics as effectively. Overall, the integration of advanced machine learning techniques with GNSS-R data from CYGNSS not only improves the accuracy and precision of SWH retrievals but also broadens the scope of applications, making it a crucial tool for contemporary remote sensing. However, the field still faces challenges, particularly in the sophistication of models used for SWH retrieval compared to wind speed retrieval.
In China, the current approach to SWH retrieval predominantly utilizes data from the CFOSAT (China France Oceanography Satellite). Launched in 2018, CFOSAT employs GNSS-R to monitor ocean surface wind speeds and SWH [31,32]. However, its single-satellite design limits both data coverage and temporal resolution. With the launch of the FY-3E series satellites in 2021, equipped with GNOS (GNSS Occultation Sounder) payloads capable of GNSS-R data collection, China’s spaceborne GNSS-R technology has seen rapid development. Comparatively, CYGNSS, composed of eight microsatellites, offers high-frequency data that are particularly useful for monitoring extreme weather conditions. CYGNSS processes DDMs from reflected GPS signals to maintain data quality under adverse weather, while FY-3E’s GNOS leverages multi-frequency DDMs to improve measurement accuracy and reduce noise [20,21,33]. The GNOS sensor on FY-3E also demonstrates superior capabilities in monitoring polar ice changes and high-latitude ocean environments, complementing the strengths of CYGNSS [14,34,35,36,37,38,39,40]. Despite significant progress in sea surface wind speed retrieval [41,42,43], comprehensive research on SWH retrieval using FY-3E data has yet to be conducted. Developing this capability would significantly enhance the utility and accuracy of FY-3E products. By focusing on SWH retrieval, FY-3E can provide more reliable data, crucial for applications such as marine navigation, weather forecasting, and climate research [44,45,46]. This research would not only improve the precision of FY-3E’s measurements but also extend its applicability, thereby solidifying its role in global oceanographic monitoring [37,47,48].
Given the challenges in machine learning-based GNSS-R retrieval methods, this study focuses on using FY-3E GNOS payload data with ERA5 as the reference data for SWH retrieval. The contributions of this paper are as follows:
  • Application of FY-3E GNOS Data in SWH Retrieval: This study is the first to utilize FY-3E GNOS payload data for SWH retrieval, achieving promising accuracy.
  • Proposal of the ViT-Wave Model: Combining the latest transformer models with ViT models tailored for the task, we propose a specialized model, ViT-Wave, for SWH retrieval.
  • Global Ocean Analysis: The global ocean analysis demonstrates that the model significantly improves the retrieval accuracy of high wave heights and enhances the overall precision distribution across different sea states.
The structure of this paper is as follows: Section 2 introduces the data encountered in the experiment. Section 3 describes the experimental methods and models. Section 4 details the experimental process. Section 5 provides a summary and discussion of the experimental results. Section 6 concludes the paper with final remarks. By advancing the application of neural network models and integrating state-of-the-art transformer techniques, this research aims to significantly improve the accuracy and reliability of SWH retrieval using FY-3E GNOS data, contributing to the broader field of oceanographic monitoring and analysis.

2. Date Description

2.1. FY-3E Data

The FY-3E satellite is equipped with the GNOS payload, which enables the reception of GNSS-R signals, thereby facilitating research in SWH retrieval. Unlike CYGNSS, which can only receive GPS signals, FY-3E can simultaneously receive reflected signals from GPS, BeiDou, and Galileo satellites. This capability provides FY-3E with a broader observational range. Specifically, FY-3E’s observation coverage extends from 67°N to 67°S, compared to CYGNSS’s coverage from 34°N to 34°S. This wider observational range allows FY-3E to monitor a more extensive area, including high-latitude regions. Moreover, the effective scattering area for DDM data also differs between the two systems. FY-3E features an effective scattering area of 9 × 20 bins, while CYGNSS has an area of 17 × 11 bins. These distinctions enhance the observational capabilities of FY-3E, providing more detailed and accurate measurements. Currently, the FY-3E observation system consists of a single satellite. Plans are underway to expand this system with additional satellites, which, once operational, will facilitate multiple observations of the same location within a single day. This expanded capability mirrors the technological approach utilized by the CYGNSS satellite system, which boasts a revisit cycle of less than seven hours as detailed in the CYGNSS handbook [49]. The accompanying Figure 1 provides a schematic representation of the FY-3E satellite’s observation points over a single day. For this study, data from one month (August 2023 to September 2023) were selected.
In the experimental analysis, a total of thirty-two essential data parameters were utilized. The variables sp_lat and sp_lon denote the geographic coordinates of the specular reflection points. An additional twenty-nine variables comprehensively characterize various attributes of the DDM, all of which are detailed in Table 1. Notably, Ddm_effective_area is presented as a two-dimensional array measuring 9 × 20, while Sp_delay_doppler_flag is utilized as a quality assessment metric. Comprehensive descriptions of each variable are provided in Appendix A.

2.2. ERA5 SWH

ERA5 is a fifth-generation reanalysis dataset produced by the European Centre for Medium-Range Weather Forecasts (ECMWF). It provides comprehensive and high-resolution climate and weather data, including SWH, on a global scale. ERA5 offers detailed temporal and spatial resolution, with hourly data on a 0.25° × 0.25° grid. This fine resolution allows for precise monitoring and analysis of wave conditions across different regions and time periods. The dataset spans several decades, enabling long-term studies and trend analysis in wave dynamics and climate change.
In numerous research studies, ERA5 SWH data have been utilized as training labels for machine learning models due to its high accuracy and consistency, which make it an ideal reference dataset [50,51]. In prior initiatives to incorporate buoy data for machine learning applications, we gathered measurements from 255 buoys, ranging from ID 41001 to 46279. Regrettably, only 90 datapoints from the same month were successfully matched, an amount deemed insufficient for machine learning analysis. Consequently, we adopted ERA5 as the training label, a method commonly utilized in SWH retrieval research, as corroborated by multiple studies in the domain. Our experiments effectively matched nearly 100,000 data points with ERA5, facilitating thorough experimental analysis. Should adequate buoy data become available, we aim to conduct additional experiments to further validate and refine our models. Compared to other datasets, the integration of satellite observations, buoy measurements, and numerical weather prediction models in ERA5 provides a robust and reliable source of SWH data [52]. In this experiment, ERA5 SWH data corresponding to the same time period as the FY-3E observations were used for reference and validation.

3. Methodology

This study introduces five distinct neural network models tailored for SWH retrieval: the ANN-Wave model, based on ANN; the CNN-Wave model, utilizing CNN; the hybrid-Wave model, integrating both ANN and CNN architectures; the Trans-Wave model, built on the transformer framework; the ViT-Wave model, leveraging the ViT architecture. Each model comprises different network layers and weights, as indicated in Table 2. The models are structured into several parts: the first part consists of convolutional layers that employ CNN methods to extract features from the DDM; the second part includes transformer layers that analyze DDM features using transformer techniques; the third part consists of linear layers that process the features derived from various methods in a unified manner; the final part, also composed of linear layers, outputs the model’s final SWH retrieval results.
In the following sections, each model will be introduced and discussed in detail, highlighting their unique structures and the methodologies employed for SWH retrieval.

3.1. ANN-Wave

The ANN-Wave model is predominantly based on an Artificial Neural Network (ANN) architecture. The primary inputs to this model consist of spatial-temporal auxiliary information, including the latitude and longitude of the specular point (SP), one-dimensional feature data from DDMs, and quality control labels. In total, the model utilizes 31 one-dimensional features, as detailed in Appendix A, variables numbered 2–32. The network architecture is illustrated in Figure 2. The input features are concatenated and fed into the linear network structure. Initially, Batch Normalization (BatchNorm) is applied to normalize the inputs, which helps in accelerating the training process and improving the stability of the network. Following this, a Rectified Linear Unit (ReLU) activation function is employed to introduce non-linearity into the model, enabling it to learn complex patterns in the data. Subsequently, Dropout is utilized to prevent overfitting by randomly setting a fraction of the input units to zero during training, thereby promoting model generalization. The linear network consists of a total of eight layers, each incorporating BatchNorm, ReLU, and Dropout. The final layer is a fully connected layer that outputs the retrieved SWH.

3.2. CNN-Wave

The CNN-Wave model is primarily based on a Convolutional Neural Network (CNN) architecture, designed to process two-dimensional feature data from DDMs. The primary input feature for this model corresponds to the variable listed as number 1 in Appendix A. The network architecture is illustrated in Figure 3. Initially, the model employs convolutional layers with kernel sizes of 5, 4, and 2, respectively. These varying kernel sizes enable the extraction of features at different scales, capturing both broad and fine-grained information from the input DDM data. Each convolutional layer is followed by a ReLU activation function, which introduces non-linearity and aids in learning complex patterns within the data. After the convolutional layers, the extracted features are flattened and fed into a series of linear network layers. This transformation converts the two-dimensional feature maps into a one-dimensional vector, suitable for further processing by the fully connected layers. The final layer of the CNN-Wave model is a fully connected layer that outputs the retrieved SWH. This architecture ensures that the CNN-Wave model effectively leverages the spatial structure of the DDM data to accurately predict SWH.

3.3. Hybrid-Wave

The Hybrid-Wave model integrates both ANN and CNN, utilizing all 32 variables listed in Appendix A as inputs. The network architecture is illustrated in Figure 4. In this model, the two-dimensional DDM data are initially fed into a convolutional network, which extracts relevant features through a series of convolutional layers. These features are then flattened to form a one-dimensional vector. Simultaneously, the one-dimensional linear features are processed. The extracted features from the Conv network are concatenated with the one-dimensional linear features. This combined feature vector is then fed into a series of linear network layers. The final layer of the Hybrid-Wave model is a fully connected layer that outputs the retrieved SWH. This architecture leverages the strengths of both ANN and CNN, effectively capturing and processing the diverse input features to enhance the prediction accuracy of SWH.

3.4. Trans-Wave

The Trans-Wave model, as depicted in Figure 5, is predicated on the transformer architecture, uniquely tailored for analyzing the DDM features for SWH retrieval. This model simplistically preprocesses the DDM data before directly inputting them into the Transformer alongside auxiliary one-dimensional features, thus forming a comprehensive set of inputs. The transformer layers, consisting of an embedding layer and a TransformerEncoder layer, transform the raw input into a dense 64-dimensional representation suitable for processing by the transformer encoder. Configured with 64-dimensional inputs, eight attention heads, three encoder layers, and a feedforward dimension of 256, the encoder adeptly captures complex dependencies among features. Post-processing, the outputs are concatenated with additional auxiliary features and routed through a linear layer, concluding with a series of SWH retrieval layers. These layers comprise multiple linear layers enhanced with BatchNorm, ReLU, and Dropout, leading to the final SWH prediction. This architecture allows the Trans-Wave model to effectively utilize both the spatial-temporal structure of the input data and the sequential characteristics of the DDM features.

3.5. ViT-Wave

The ViT-Wave model, illustrated in Figure 6, utilizes the Vision Transformer (ViT) architecture to handle image-like input data from DDMs. Differing from the Trans-Wave model, it begins by segmenting the DDM into several small patches to enhance feature learning. The primary inputs include these DDM patches and auxiliary one-dimensional features. The ViT-Wave model’s embedding layer maps each patch into a 64-dimensional space, preparing it for the VisionTransformerEncoder. This encoder, with identical configuration settings as the Trans-Wave model, processes the embedded patches, meticulously capturing the relationships across the input field. Its self-attention mechanism focuses on relevant features while ignoring irrelevant ones, enhancing the model’s precision in predicting SWH. Following this, the outputs are concatenated with auxiliary features and passed through a linear layer. The final SWH retrieval layers, similar to those in the Trans-Wave model, consist of multiple linear stages equipped with BatchNorm, ReLU, and Dropout, ultimately delivering the SWH prediction. This model’s structure effectively leverages the spatial structure of the DDM data and the power of self-attention to achieve high accuracy in SWH retrieval.

4. Experiment

In this study, the experiment was conducted using the OpenI computing platform, which efficiently supported the needs of our neural network models. This platform is tailored for various scientific and engineering applications, ensuring adequate computational resources for the project.

4.1. Data Preprocessing

The data preprocessing stage involved meticulous steps to ensure the completeness and reliability of the dataset utilized in subsequent analyses. The process began with a thorough data cleaning operation, where outliers and missing values, such as NaN, were identified and removed. Common outliers included data points that exceeded typical geographic coordinates (latitude and longitude) and unusual fill values such as −9999 and infinity (inf). Moreover, entries associated with these outliers and missing values, including corresponding data points and labels, were systematically eliminated to prevent any distortion of results and degradation of model performance. Following this, temporal and spatial matching of the data was conducted to ensure alignment between different datasets. Specifically, the observed variables from FY-3E were interpolated onto a 0.25° × 0.25° grid. Temporally, the data were interpolated to the nearest hour using the nearest-neighbor method, ensuring consistency with the spatial and temporal resolution of the ERA5 data.
After the cleaning and matching processes, the resulting dataset comprised 390,261 entries. A significant portion of this data, 87.92%, corresponded to SWH ranging from 0 to 4 m. Entries with SWH between 4 and 8 m accounted for 47,148 records, representing 11.87% of the total dataset. Only 824 entries, or 0.21%, had SWH values exceeding 8 m. Thus, the vast majority of SWH measurements were below 8 m, making up 99.79% of the data. The maximum observed SWH was 14.80 m, while the minimum was 0.03 m. This distribution shown in Figure 7, highlights that extreme SWH values are relatively rare within the dataset.

4.2. Experimental Procedure

The experimental process commenced with the random shuffling of data, followed by its division into training, validation, and test sets in a 7:2:1 ratio. The training set was utilized to train the model, the validation set to verify the results of each training iteration, and the test set to evaluate the final performance of the model. To ensure reproducibility and robustness, five different random seeds were selected, and each of the five models was trained and evaluated five times under the same experimental configuration.
In the model training phase, the loss function used was Mean Squared Error Loss (MSELoss). To prevent overfitting, a regularization parameter was introduced. The total loss during training included both the prediction loss and an L2 regularization term:
total _ loss = loss + λ L2 _ reg
where λ is the regularization parameter, and L2 _ reg is the sum of the L2 norms of the model parameters. This regularization helps in mitigating overfitting by penalizing large weights in the model. Additional key parameters set during the experiment included the optimizer and learning rate. The Adam optimizer was chosen with a learning rate of 0.0005 to balance the speed and stability of convergence. The number of epochs was set to 500, providing ample iterations for the model to learn from the data. Data loaders were configured with a batch size of 512, ensuring efficient data handling and processing.
The training process involved several steps. To begin with, the dataset was randomly shuffled and divided into training, validation, and test sets. Throughout each epoch of model training, the model’s parameters were updated using the training set, and the loss was calculated for each batch. This loss, combined with the L2 regularization term, was used to adjust the model parameters via backpropagation. Subsequently, the model’s performance was evaluated using the validation set, and the validation loss was recorded after each epoch. This step ensured that the model’s learning was appropriately generalized and not overfitted to the training data. Finally, after the last epoch, the model’s performance was assessed using the test set, and the test loss was recorded to evaluate the model’s prediction accuracy on unseen data. As is shown in Table 3, after five rounds of training, the ViT-Wave model demonstrates the most superior performance across all metrics. Specifically, the ViT-Wave model achieved the lowest average training loss and validation loss, with the smallest standard deviations. This indicates that the ViT-Wave model not only converges quickly during training but also maintains high stability and consistency, highlighting its comprehensive superiority. In contrast, the CNN-Wave model with single input exhibits the poorest performance, characterized by the highest training and validation losses, along with significant error variability. Moreover, Hybrid-Wave and Trans-Wave models achieve moderate training and validation losses with relatively small standard deviations, indicating stable performance. Overall, the experimental results clearly delineate the performance differences among the models. The ViT-Wave model stands out as the best-performing model with significant advantages.

4.3. Evaluation Metrics

To comprehensively evaluate the performance of each neural network model in retrieving SWH, several key metrics were selected as evaluation standards. These metrics offer unique insights into different aspects of model accuracy and reliability. The selected evaluation metrics include Root Mean Square Error (RMSE), Bias, Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE), and the Coefficient of Determination (R²). The specific formulas and meanings of each metric are detailed below:
RMSE = 1 n i = 1 n ( y i y ^ i ) 2
MAE = 1 n i = 1 n | y i y ^ i |
Bias = 1 n i = 1 n ( y i y ^ i )
MAPE = 100 % n i = 1 n y i y ^ i y i
R 2 = 1 i = 1 n ( y i y ^ i ) 2 i = 1 n ( y i y ¯ ) 2
Parameters
-
y i : Observed SWH value (ERA5 data)
-
y ^ i : Predicted SWH value (model output)
-
n: Number of observations
-
y ¯ : Mean of observed SWH values
-
y ^ ¯ : Mean of predicted SWH values
These metrics collectively provide a robust framework for evaluating the performance of the neural network models, ensuring that various aspects of prediction accuracy and reliability are comprehensively assessed in the context of SWH retrieval using ERA5 data as the reference.

5. Result

Upon completing the experimental tests, we evaluated the finalized models using the test dataset. The performance of these models was assessed based on several predetermined evaluation metrics: RMSE, MAE, BIAS, MAPE, and R2. The test results are summarized in Table 4. Overall, it can be seen that the Vit-Wave model has achieved the best comprehensive effect. In addition to these evaluations, we conducted comparative studies with other models that utilize CYGNSS data for SWH retrieval. When compared with traditional data statistical analysis methods such as SNR [28] and NCDW LES [28], the ViT-Wave model exhibited improved performance in terms of RMSE and MAE. These improvements highlight the advantages of leveraging advanced neural network architectures over conventional statistical methods. Furthermore, the ViT-Wave model outperformed several other machine learning models, including ANN [25], BT [25], and DCNN [27]. The enhancements in retrieval accuracy with the ViT-Wave model underscore its potential for more accurate and reliable SWH retrievals compared to these existing methods.
To better illustrate the performance improvements of the ViT-Wave model, Table 5 presents the percentage enhancements of key parameters when compared to other models.
Scatter density plots were generated for each model to comprehensively evaluate their performance as shown in Figure 8. In these plots, the black solid line represents the ideal y = x line, indicating perfect agreement between the observed and retrieved values. Additionally, a red dashed line depicts the linear fit of the data, with the corresponding regression equation displayed in the bottom right corner of each plot. The scatter density plots reveal that the ViT-Wave model demonstrates the most favorable distribution of data points around the ideal y = x line, indicating a high degree of accuracy in its retrievals. The linear regression equation for the ViT-Wave model, y = 0.88 x + 0.32 , underscores the model’s superior fitting performance. This close alignment with the ideal line suggests that the ViT-Wave model effectively captures the underlying relationship between the observed and retrieved SWH values, resulting in minimal deviations and high fidelity in its predictions.
In comparison, other models show varying degrees of dispersion around the y = x line, reflecting differences in retrieval accuracy and consistency. The scatter density plots reveal that models such as the CNN-Wave and Trans-Wave have more scattered points, indicating higher retrieval errors and less reliable performance. The fitting equations for these models also exhibit greater deviations from the ideal line, further highlighting their comparative inferiority. Overall, the scatter density analysis reinforces the earlier findings from the quantitative metrics, solidifying the ViT-Wave model’s status as the most robust and accurate model for SWH retrieval among those evaluated. Its superior performance across both quantitative metrics and visual scatter plots underscores its potential for practical applications in wave height prediction and oceanographic research.
We also conducted a segmented error analysis based on the range of SWH values. The SWH data were divided into five segments: 0–2 m, 2–4 m, 4–6 m, 6–8 m, and >8 m. The blue column represents the ANN-Wave model; the orange column represents the CNN-Wave model; the blue column represents the ANN-Wave model; the gray column represents the Hybrid-Wave model; the red column represents the Tans-Wave model; the green column represents the ViT-Wave model. The errors for each model within these segments were calculated and presented in the form of bar charts (see Figure 9, Figure 10 and Figure 11).
From these bar charts, it is evident that the ViT-Wave model consistently demonstrates the lowest errors across the SWH range of 0–8 m, indicating its superior performance in this range. Notably, the error range of 0–4 m is where all models perform best across all metrics, which may be related to the data distribution, as shown in Figure 7. This suggests that the ViT-Wave model is highly effective in accurately retrieving SWH values, particularly within the common range encountered in oceanographic observations. Interestingly, for SWH values greater than 8 m, the Trans-Wave model shows the smallest errors, outperforming other models in this higher SWH segment. This indicates that while the ViT-Wave model excels in general SWH conditions, the Trans-Wave model has a notable advantage in extreme wave conditions, where its architecture might better capture the complex features associated with higher wave heights. The segmented error analysis highlights the strengths of both the ViT-Wave and Trans-Wave models, each excelling in different SWH ranges and collectively offering robust solutions for a wide spectrum of wave height retrieval scenarios.
To better understand the global distribution of SWH across the ocean surface, we utilized reference points based on ERA5 SWH data along with their corresponding latitudes and longitudes. Upon evaluating the test dataset, it was observed that 95% of the SWH values were concentrated in the 0–5 m range. To enhance the visibility of the overall SWH distribution, a customized color bar was employed, as shown in Figure 12, where darker colors indicate lower SWH values and lighter colors indicate higher SWH values. From Figure 12 and Figure 13, it can be seen that most of the colors are consistent, indicating that the ViT-Wave model’s retrieval results are close to the ERA5 SWH. However, in the central part of the Southern Hemisphere, the color in Figure 13 is darker than in Figure 12, indicating that the ViT-Wave model underestimates the SWH compared to the ERA5 data. This suggests that the ViT-Wave model requires further optimization in high-value ranges.
From Figure 12, which represents the ERA5 data, it is evident that SWH values tend to be higher in high-latitude regions, particularly in the Southern Hemisphere. This observation cannot be made using CYGNSS data due to their limited coverage between 33°N and 33°S latitude, highlighting the advantage of FY-3E data. Overall, the ViT-Wave model shows a distribution that closely resembles the ERA5 reference, indicating its superior performance in capturing the global SWH distribution.
To evaluate the error distribution of the ViT-Wave model across the global ocean surface, we generated residual and rmse 3° × 3° grid distribution maps (see Figure 14 and Figure 15). These maps visually represent the differences between the ViT-Wave model predictions and the ERA5 reference data. In these maps, negative biases (indicating model underestimation) are shown in red, with darker shades representing larger discrepancies. Conversely, positive biases (indicating overestimation) are shown in blue, with darker shades indicating greater deviations. Regions where the model predictions match the ERA5 data are displayed in white. From Figure 14 and Figure 15, it is evident that the ViT-Wave model tends to underestimate SWH values overall. The biases are more pronounced in high-latitude regions compared to areas near the equator, suggesting that the model’s performance is less accurate in these regions. This indicates a need for further refinement and optimization of the ViT-Wave model to improve its accuracy, particularly in high-latitude areas. Future research should focus on addressing these discrepancies to enhance the model’s reliability and performance in various oceanographic conditions.

6. Discussion

Among the evaluated models, as shown in Table 4, the ViT-Wave model demonstrated superior performance across all five metrics. The ViT-Wave model achieved an RMSE of 0.4052 m, indicating the lowest overall error distribution among the models assessed. This suggests its high efficacy in capturing data variability and minimizing retrieval errors, particularly in handling outliers effectively. Furthermore, its MAE of 0.27 m reflects the model’s ability to maintain low absolute errors, underscoring its robustness in providing accurate retrievals with minimal deviations from observed values, indicating overall good performance even without considering extreme values. The model’s near-zero bias, recorded at −0.0015 m, suggests that the ViT-Wave’s retrievals are almost unbiased, with negligible systematic errors, thereby enhancing its reliability for practical applications and indicating a slight tendency of the model to underestimate. The MAPE was 18.02%, showcasing its efficiency in minimizing percentage errors relative to observed values, a crucial metric for assessing performance where relative error measurements are important. However, the Hybrid-Wave model’s slightly lower MAPE of 17.71% suggests it may be more effective in applications requiring precise percentage error reduction, possibly due to less influence from extreme values. The R² value of 0.9117 indicates a high level of correlation between the retrieved and observed values, showing that the model explains a substantial proportion of the variance in the observed data, which further validates its effectiveness.
The superior performance of the ViT-Wave model can largely be attributed to the unique capabilities of its architecture. The ViT segments the input images into patches, a method that enables the model to capture detailed features within each segment effectively. This patch-based processing approach allows the ViT-Wave model to learn fine-grained details and spatial hierarchies in the data, which are crucial for accurately predicting SWH. By focusing on local and global patterns simultaneously, the model can better understand and integrate various data relationships, enhancing its precision and effectiveness in SWH retrieval. This capability is advantageous for SWH retrieval, where the complexity of ocean surface dynamics requires models to discern subtle features and patterns that significantly impact prediction accuracy. The ViT-Wave’s ability to handle non-linearities and dependencies in data more efficiently than conventional models suggests that its method of slicing the image data into manageable pieces before processing contributes significantly to its lower RMSE and MAE.
Furthermore, the slight underperformance in minimizing MAPE compared to the Hybrid-Wave model might be addressed by exploring hybrid architectures. Combining the detailed feature extraction capabilities of the Vision Transformer with the robustness against outliers of other models could lead to a more balanced approach, enhancing performance across all metrics. Moreover, the analysis indicates that while the ViT-Wave model demonstrates high effectiveness across many metrics, there is room for improvement, especially in regions with high latitude and extreme wave conditions. Investigating the training data diversity and distribution might reveal the need for more representative samples from these challenging conditions.
Future research could also explore modifications to the ViT architecture to further optimize its processing capabilities for SWH data. By enhancing the model’s ability to handle outliers and extreme values, and possibly integrating more diverse training scenarios, the overall accuracy and reliability of SWH retrievals can be significantly improved. This exploration into the causes of discrepancies and potential mitigation strategies will be crucial for advancing the model’s practical applications in oceanographic research.

7. Conclusions

This research capitalizes on GNSS-R data from the FY-3E satellite and introduces the Vision Transformer model to advance the field of SWH retrieval. The model has achieved the lowest recorded RMSE of 0.4 m, which underscores its superior accuracy over traditional methods. This high level of precision highlights the model’s capability to effectively harness advanced machine learning techniques for the retrieval of complex environmental data through GNSS-R data. However, the ViT-Wave model needs more matching and more accurate data such as buoy data and altimeter data to improve its retrieval accuracy. While the ViT-Wave model excels in providing enhanced spatial and temporal resolution and broad global coverage, it faces challenges in high wave conditions and at high latitudes it is notably reduced, suggesting areas where further model optimization is necessary. Despite these challenges, the integration of cutting-edge technology with satellite observations marks a significant step forward in the domain.
In general, the model has good retrieval effects in 0–8 m and low- and mid-latitude regions, and needs further optimization when used in areas above 8 m and high-latitude regions. For optimal use, the ViT-Wave model is suitable for environments with moderate to high wave conditions and where extensive area coverage is needed, such as in global marine environmental monitoring and climate studies. It should be used with caution in extreme sea conditions and areas that require real-time data processing because the model calculation data are missing under such conditions and the accuracy will be reduced. This foundation sets the stage for future enhancements and wider applications of the technology in oceanographic monitoring.

Author Contributions

Conceptualization, Z.Z. and B.D.; methodology, K.R. and B.D.; software, Z.Z.; validation, Z.Z. and W.N.; investigation, R.C.; resources, B.D.; data curation, Z.Z. and K.R.; writing—original draft preparation, Z.Z.; writing—review and editing, B.D. and W.N.; visualization, R.C.; supervision, K.R.; project administration, K.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the science and technology innovation program of Hunan Province grant number 2022RC3070 and Youth Independent Innovation Science Foundation grant number ZK23-53.

Data Availability Statement

The code has been uploaded to the Github platform, the URL is https://github.com/andyzhou4451/swh_retrieval_from_fy3e, accessed on 10 June 2024. The GNSS-R data of FY-3E can be obtained from the Fengyun Data Service Network, the URL is https://satellite.nsmc.org.cn/portalsite/default.aspx, accessed on 20 August 2023; The SWH data of ERA5 can be obtained from the Climate Data Store, the URL is https://cds-beta.climate.copernicus.eu/, accessed on 20 August 2023.

Acknowledgments

This article would like to thank the National Satellite Meteorological Center for data support and researcher Fangli Dou for her help.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Explanation of Variables.
Table A1. Explanation of Variables.
#AbbreviationFull NameExplanation
1Ddm_effective_areaEffective Scattering AreaThe effective scattering area of the 9 × 20 region of the DDM used to calculate DDM_NBRCS
2sp_latSpecular Point LatitudeLatitude of the specular reflection point
3sp_lonSpecular Point LongitudeLongitude of the specular reflection point
4Ddm_brcs_factorBRCS FactorFactor used to compute DDM BRCS (power/BRCS)
5Ddm_doppler_referDoppler ReferenceThe central doppler (at column 10) of the DDM
6Ddm_kurtosisKurtosisKurtosis of raw counts in the whole DDM
7Ddm_noise_mNoise M-valueThe ratio of the square of the mean of the noise floor and the variance of the noise floor
8Ddm_noise_rawNoise RawThe mean noise floor of the raw DDM
9Ddm_noise_sourceNoise SourceDDM noise floor source to calculate the mean noise
10Ddm_peak_columnPeak Bin ColumnThe zero-based Doppler column of the peak value in the DDM
11Ddm_peak_delayPeak Bin DelayDelay of the DDM peak bin, in corresponding GNSS system chip
12Ddm_peak_dopplerPeak Bin DopplerDoppler of the DDM peak bin
13Ddm_peak_power_ratioPeak Power RatioSum of centered 5 × 3 DDM power bin values around the specular point divided by the sum of the all DDM power bin values
14Ddm_peak_rawPeak RawPeak value in DDM raw counts
15Ddm_peak_rowPeak Bin RowThe zero-based delay row of the peak value in the DDM
16Ddm_peak_snrPeak SNR10lg(S_max/N_avg-1), where S_max is the maximum value (in raw counts) in a single DDM bin and N_avg is the average per-bin raw noise counts
17Ddm_power_factorPower FactorFactor used to compute DDM power (dBW) from DDM counts (counts/power)
18Ddm_quality_flagQuality FlagThe L1 DDM quality flag of processing, indicating various quality checks and conditions
19Ddm_range_referRange ReferenceThe central range (at column 10) of the DDM
20Ddm_raw_dataRaw Data122 × 20 array of DDM bin raw counts
21Ddm_skewnessSkewnessSkewness of raw counts in the whole DDM
22Ddm_sp_columnSpecular Point ColumnThe zero-based Doppler column of the specular point doppler in the DDM
23Ddm_sp_delaySpecular Point DelaySpecular point delay in the DDM
24Ddm_sp_dlesSpecular Point DLESThe slope of the second derivative of the DDM’s leading edge slope
25Ddm_sp_dopplerSpecular Point DopplerSpecular point Doppler in the DDM
26Ddm_sp_lesSpecular Point LESLeading edge slope of a 3 delay × 5 Doppler bin box centered at the specular point bin
27Ddm_sp_nbrcsSpecular Point NBRCSNormalized BRCS of a 3 delay × 5 Doppler bin box centered at the specular point bin
28Ddm_sp_normalized_snrNormalized SNR at Specular PointSNR at specular point normalized by bistatic radar equation
29Ddm_sp_rawSpecular Point RawValue of the specular point in the DDM raw counts
30Ddm_sp_reflectivitySpecular Point ReflectivitySignal reflectivity at the specular point assuming coherent scattering
31Ddm_sp_rowSpecular Point RowThe zero-based delay row of the specular point delay in the DDM
32Sp_delay_doppler_flagDelay-Doppler FlagThe method and quality flag to find specular position in DDM

References

  1. Young, I.R. Wind Generated Ocean Waves; Elsevier: Amsterdam, The Netherlands, 1999. [Google Scholar]
  2. Janssen, P.A.E.M. The Interaction of Ocean Waves and Wind; Cambridge University Press: Cambridge, MA, USA, 2004. [Google Scholar]
  3. Queffeulou, P. Long-term validation of wave height measurements from altimeters. Mar. Geod. 2004, 27, 495–510. [Google Scholar] [CrossRef]
  4. Ardhuin, F.; Chapron, B.; Collard, F. Observation of swell dissipation across oceans. Geophys. Res. Lett. 2009, 36. [Google Scholar] [CrossRef]
  5. Komar, P.D. Beach processes and erosion—An introduction. In Handbook of Coastal Processes and Erosion; CRC Press: Boca Raton, FL, USA, 2018; pp. 1–20. [Google Scholar]
  6. Chelton, D.B.; Wentz, F.J. Further development of an improved altimeter wind speed algorithm. J. Geophys. Res. Ocean. 1986, 91, 14250–14256. [Google Scholar] [CrossRef]
  7. Stopa, J.E.; Cheung, K.F. Intercomparison of Wind and Wave Data from the ECMWF Reanalysis Interim and the NCEP Climate Forecast System Reanalysis. Ocean Model. 2014, 75, 65–83. [Google Scholar] [CrossRef]
  8. Jiang, H. Evaluation of altimeter undersampling in estimating global wind and wave climate using virtual observation. Remote Sens. Environ. 2020, 245, 111840. [Google Scholar] [CrossRef]
  9. Gommenginger, C.P.; Srokosz, M.A.; Challenor, P.G.; Cotton, P.D. Measuring ocean wave period with satellite altimeters: A simple empirical model. Geophys. Res. Lett. 2003, 30. [Google Scholar] [CrossRef]
  10. Durrant, T.H.; Greenslade, D.J.; Simmonds, I. Validation of Jason-1 and Envisat Remotely Sensed Wave Heights. J. Atmos. Ocean. Technol. 2009, 26, 123–134. [Google Scholar] [CrossRef]
  11. Gemmrich, J.; Thomas, B.; Bouchard, R. Observational Changes and Trends in Northeast Pacific Wave Records. Geophys. Res. Lett. 2011, 38, L22601. [Google Scholar] [CrossRef]
  12. Ribal, A.; Young, I.R. 33 Years of Globally Calibrated Wave Height and Wind Speed Data Based on Altimeter Observations. Sci. Data 2019, 6, 77. [Google Scholar] [CrossRef]
  13. Yan, Q.; Huang, W.; Jin, S.; Jia, Y. Pan-tropical soil moisture mapping based on a three-layer model from CYGNSS GNSS-R data. Remote Sens. Environ. 2020, 247, 111944. [Google Scholar] [CrossRef]
  14. Li, W.; Cardellach, E.; Fabra, F.; Ribó, S.; Rius, A. Assessment of spaceborne GNSS-R ocean altimetry performance using CYGNSS mission raw data. IEEE Trans. Geosci. Remote Sens. 2019, 58, 238–250. [Google Scholar] [CrossRef]
  15. Zribi, M.; Motte, E.; Baghdadi, N.; Baup, F.; Dayau, S.; Fanise, P.; Guyon, D.; Huc, M.; Wigneron, J.P. Potential applications of GNSS-R observations over agricultural areas: Results from the GLORI airborne campaign. Remote Sens. 2018, 10, 1245. [Google Scholar] [CrossRef]
  16. Strandberg, J.; Hobiger, T.; Haas, R. Real-time sea-level monitoring using Kalman filtering of GNSS-R data. GPS Solut. 2019, 23, 61. [Google Scholar] [CrossRef]
  17. Cardellach, E.; Fabra, F.; Nogués-Correig, O.; Oliveras, S.; Ribó, S.; Rius, A. GNSS-R ground-based and airborne campaigns for ocean, land, ice, and snow techniques: Application to the GOLD-RTR data sets. Radio Sci. 2011, 46, 1–16. [Google Scholar] [CrossRef]
  18. Wang, W.; Ren, K.; Duan, B.; Zhu, J.; Li, X.; Ni, W.; Lu, J.; Yuan, T. A four-dimensional variational constrained neural network-based data assimilation method. J. Adv. Model. Earth Syst. 2024, 16, e2023MS003687. [Google Scholar] [CrossRef]
  19. Yuan, T.; Zhu, J.; Ren, K.; Wang, W.; Wang, X.; Li, X. Neural network driven by space-time partial differential equation for predicting sea surface temperature. In Proceedings of the 2022 IEEE International Conference on Data Mining (ICDM), Orlando, FL, USA, 28 November–1 December 2022; pp. 656–665. [Google Scholar]
  20. Ruf, C.; Lyons, A.; Unwin, M.; Dickinson, J. CYGNSS: Enabling the Future of Hurricane Forecasting. IEEE Geosci. Remote Sens. Mag. 2019, 7, 52–67. [Google Scholar]
  21. Clarizia, M.P.; Gommenginger, C.P.; Gleason, S.T.; Srokosz, M.A.; Galdi, C.; Bisceglie, M.D. Analysis of GNSS-R Delay-Doppler Maps from the UK-DMC Satellite Over the Ocean. Geophys. Res. Lett. 2014, 41, 5276–5283. [Google Scholar] [CrossRef]
  22. Quach, B.; Glaser, Y.; Stopa, J.E.; Mouche, A.A.; Sadowski, P. Deep learning for predicting significant wave height from synthetic aperture radar. IEEE Trans. Geosci. Remote Sens. 2020, 59, 1859–1867. [Google Scholar] [CrossRef]
  23. Li, Z.; Guo, F.; Zhang, X.; Guo, Y.; Zhang, Z. Analysis of factors influencing significant wave height retrieval and performance improvement in spaceborne GNSS-R. GPS Solut. 2024, 28, 64. [Google Scholar] [CrossRef]
  24. Yu, K.; Han, S.; Bu, J.; An, Y.; Zhou, Z.; Wang, C.; Tabibi, S.; Cheong, J.W. Spaceborne GNSS reflectometry. Remote Sens. 2022, 14, 1605. [Google Scholar] [CrossRef]
  25. Wang, F.; Yang, D.; Yang, L. Retrieval and assessment of significant wave height from CYGNSS mission using neural network. Remote Sens. 2022, 14, 3666. [Google Scholar] [CrossRef]
  26. Patanè, L.; Iuppa, C.; Faraci, C.; Xibilia, M.G. A deep hybrid network for significant wave height estimation. Ocean Model. 2024, 189, 102363. [Google Scholar] [CrossRef]
  27. Bu, J.; Yu, K.; Ni, J.; Huang, W. Combining ERA5 data and CYGNSS observations for the joint retrieval of global significant wave height of ocean swell and wind wave: A deep convolutional neural network approach. J. Geod. 2023, 97, 81. [Google Scholar] [CrossRef]
  28. Bu, J.; Yu, K. Significant wave height retrieval method based on spaceborne GNSS reflectometry. IEEE Geosci. Remote Sens. Lett. 2022, 19, 1–5. [Google Scholar] [CrossRef]
  29. Zhao, D.; Heidler, K.; Asgarimehr, M.; Arnold, C.; Xiao, T.; Wickert, J.; Zhu, X.X.; Mou, L. DDM-Former: Transformer networks for GNSS reflectometry global ocean wind speed estimation. Remote Sens. Environ. 2023, 294, 113629. [Google Scholar] [CrossRef]
  30. Madhiarasan, D.M.; Roy, P.; Pratim, P. Hybrid Transformer Network for Different Horizons-based Enriched Wind Speed Forecasting. arXiv 2022, arXiv:2204.09019. [Google Scholar]
  31. Li, X.; Xu, Y.; Liu, B.; Lin, W.; He, Y.; Liu, J. Validation and calibration of nadir SWH products from CFOSAT and HY-2B with satellites and in situ observations. J. Geophys. Res. Ocean. 2021, 126, e2020JC016689. [Google Scholar] [CrossRef]
  32. Hauser, D.; Xiaolong, D.; Aouf, L.; Tison, C.; Castillan, P. Overview of the CFOSAT mission. In Proceedings of the 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Beijing, China, 10–15 July 2016; pp. 5789–5792. [Google Scholar]
  33. Li, Z.; Guo, F.; Chen, F.; Zhang, Z.; Zhang, X. Wind speed retrieval using GNSS-R technique with geographic partitioning. Satell. Navig. 2023, 4, 4. [Google Scholar] [CrossRef]
  34. Eroglu, O.; Kurum, M.; Boyd, D.; Gurbuz, A.C. High spatio-temporal resolution CYGNSS soil moisture estimates using artificial neural networks. Remote Sens. 2019, 11, 2272. [Google Scholar] [CrossRef]
  35. Gleason, S.; Ruf, C.S.; O’Brien, A.J.; McKague, D.S. The CYGNSS level 1 calibration algorithm and error analysis based on on-orbit measurements. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2018, 12, 37–49. [Google Scholar] [CrossRef]
  36. Zhao, R.; Wang, K.; Xu, X. Global precipitation from FY-3 polar orbit satellites. Sci. Bull. 2024, 69, 2688–2692. [Google Scholar] [CrossRef] [PubMed]
  37. Shao, W.; Sheng, Y.; Sun, J. Preliminary assessment of wind and wave retrieval from Chinese Gaofen-3 SAR imagery. Sensors 2017, 17, 1705. [Google Scholar] [CrossRef]
  38. Li, Z.; Verhoef, A.; Stoffelen, A.; Shang, J.; Dou, F. First Results from the WindRAD Scatterometer on Board FY-3E: Data Analysis, Calibration and Wind Retrieval Evaluation. Remote Sens. 2023, 15, 2087. [Google Scholar] [CrossRef]
  39. Yan, Q.; Chen, Y.; Jin, S.; Liu, S.; Jia, Y.; Zhen, Y.; Chen, T.; Huang, W. Inland water mapping based on GA-LinkNet from CYGNSS data. IEEE Geosci. Remote Sens. Lett. 2023, 20, 1500305. [Google Scholar] [CrossRef]
  40. Chen, Y.; Yan, Q. Unlocking the Potential of CYGNSS for Pan-tropical Inland Water Mapping through Multi-Source Data and Transformer. Int. J. Appl. Earth Obs. Geoinf. 2024, 113, 104122. [Google Scholar] [CrossRef]
  41. Huang, F.; Xia, J.; Yin, C.; Zhai, X.; Xu, N.; Yang, G.; Bai, W.; Sun, Y.; Du, Q.; Liao, M.; et al. Assessment of FY-3E GNOS-II GNSS-R Global Wind Product. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2022, 15, 7899–7912. [Google Scholar] [CrossRef]
  42. Zhou, Z.; Duan, B.; Ren, K. Improving GNSS-R Sea Surface Wind Speed Retrieval from FY-3E Satellite Using Multi-task Learning and Physical Information. In Proceedings of the International Conference on Neural Information Processing, Changsha, China, 20–23 November 2023; Springer Nature: Singapore, 2023; pp. 357–369. [Google Scholar]
  43. Yuan, T.; Zhu, J.; Wang, W.; Lu, J.; Wang, X.; Li, X.; Ren, K. A Space-Time Partial Differential Equation Based Physics-Guided Neural Network for Sea Surface Temperature Prediction. Remote Sens. 2023, 15, 3498. [Google Scholar] [CrossRef]
  44. Yang, G.; Du, X.; Huang, L.; Wu, X.; Sun, L.; Qi, C.; Zhang, X.; Wang, J.; Song, S. An Illustration of FY-3E GNOS-R for Global Soil Moisture Monitoring. Sensors 2023, 23, 5825. [Google Scholar] [CrossRef] [PubMed]
  45. Yang, G.; Bai, W.; Wang, J.; Hu, X.; Zhang, P.; Sun, Y.; Xu, N.; Zhai, X.; Xiao, X.; Xia, J.; et al. FY-3E GNOS II GNSS Reflectometry: Mission Review and First Results. Remote Sens. 2022, 14, 988. [Google Scholar] [CrossRef]
  46. Huang, F.; Xia, J.; Yin, C.; Zhai, X.; Yang, G.; Bai, W.; Sun, Y.; Du, Q.; Wang, X.; Qiu, T.; et al. Spaceborne GNSS Reflectometry with Galileo Signals on FY-3E/GNOS-II: Measurements, Calibration, and Wind Speed Retrieval. IEEE Geosci. Remote Sens. Lett. 2023, 20, 3501505. [Google Scholar] [CrossRef]
  47. Xie, Y.; Yan, Q. Stand-alone retrieval of sea ice thickness from FY-3E GNOS-R data. IEEE Geosci. Remote Sens. Lett. 2024, 21, 2000305. [Google Scholar] [CrossRef]
  48. Sun, Y.; Wang, X.; Du, Q.; Bai, W.; Xia, J.; Cai, Y.; Wang, D.; Wu, C.; Meng, X.; Tian, Y.; et al. The status and progress of Fengyun-3E GNOS II mission for GNSS remote sensing. In Proceedings of the IGARSS 2019—2019 IEEE International Geoscience and Remote Sensing Symposium, Yokohama, Japan, 28 July–2 August 2019; IEEE: New York, NY, USA, 2019; pp. 5181–5184. [Google Scholar]
  49. Ruf, C. CYGNSS Handbook; Michigan Publishing Services: Ann Arbor, MI, USA, 2022. [Google Scholar]
  50. Wolf, J.; Woolf, D.K. Waves and climate change in the north-east Atlantic. Geophys. Res. Lett. 2006, 33. [Google Scholar] [CrossRef]
  51. Martinez, P.; Lopez, M.; Wang, S. Comparative Study of Significant Wave Height Datasets for Coastal Management. Mar. Data Sci. 2023, 10, 45–58. [Google Scholar]
  52. Zhang, W.; Sun, Y.; Wu, Y.; Dong, J.; Song, X.; Gao, Z.; Pang, R.; Guoan, B. A deep-learning real-time bias correction method for significant wave height forecasts in the Western North Pacific. Ocean Model. 2023, 187, 102289. [Google Scholar] [CrossRef]
Figure 1. FY-3E observation point distributions on 29 August 2023.
Figure 1. FY-3E observation point distributions on 29 August 2023.
Remotesensing 16 03468 g001
Figure 2. ANN-Wave model structure.
Figure 2. ANN-Wave model structure.
Remotesensing 16 03468 g002
Figure 3. CNN-Wave model structure.
Figure 3. CNN-Wave model structure.
Remotesensing 16 03468 g003
Figure 4. Hybrid-Wave model structure.
Figure 4. Hybrid-Wave model structure.
Remotesensing 16 03468 g004
Figure 5. Trans-Wave model structure.
Figure 5. Trans-Wave model structure.
Remotesensing 16 03468 g005
Figure 6. ViT-Wave model structure.
Figure 6. ViT-Wave model structure.
Remotesensing 16 03468 g006
Figure 7. Distribution of Significant Wave Height (SWH).
Figure 7. Distribution of Significant Wave Height (SWH).
Remotesensing 16 03468 g007
Figure 8. Scatter density plot of all models.
Figure 8. Scatter density plot of all models.
Remotesensing 16 03468 g008
Figure 9. RMSE error distribution histogram of each model.
Figure 9. RMSE error distribution histogram of each model.
Remotesensing 16 03468 g009
Figure 10. MAE error distribution histogram of each model.
Figure 10. MAE error distribution histogram of each model.
Remotesensing 16 03468 g010
Figure 11. Bias error distribution histogram of each model.
Figure 11. Bias error distribution histogram of each model.
Remotesensing 16 03468 g011
Figure 12. 3° × 3° Grid Era5 SWH Distribution (m).
Figure 12. 3° × 3° Grid Era5 SWH Distribution (m).
Remotesensing 16 03468 g012
Figure 13. 3° × 3° Grid Vit-wave model SWH Distribution (m).
Figure 13. 3° × 3° Grid Vit-wave model SWH Distribution (m).
Remotesensing 16 03468 g013
Figure 14. 3° × 3° Grid Residule Distribution (m).
Figure 14. 3° × 3° Grid Residule Distribution (m).
Remotesensing 16 03468 g014
Figure 15. 3° × 3° Grid RMSE Distribution (m).
Figure 15. 3° × 3° Grid RMSE Distribution (m).
Remotesensing 16 03468 g015
Table 1. Variables utilized in the experiment.
Table 1. Variables utilized in the experiment.
Variables
sp_latsp_lonDdm_brcs_factorDdm_effective_areaDdm_doppler_refer
Ddm_kurtosisDdm_noise_mDdm_noise_rawDdm_noise_sourceDdm_peak_column
Ddm_peak_delayDdm_peak_dopplerDdm_peak_power_ratioDdm_peak_rawDdm_peak_row
Ddm_peak_snrDdm_power_factorDdm_quality_flagDdm_range_referDdm_skewness
Ddm_sp_columnDdm_sp_delayDdm_sp_dlesDdm_sp_dopplerDdm_sp_les
Ddm_sp_nbrcsDdm_sp_normalized_snrDdm_sp_rawDdm_sp_reflectivityDdm_sp_row
Ddm_sp_snr
Table 2. Weights for Each Layer in the Models.
Table 2. Weights for Each Layer in the Models.
LayerANN-WaveCNN-WaveHybrid-WaveTrans-WaveViT-Wave
Convolutional Layers
Conv2d-[128, 1, 5, 5][128, 1, 5, 5]--
Conv2d-[32, 128, 4, 4][32, 128, 4, 4]--
Conv2d-[1, 32, 2, 2][1, 32, 2, 2]--
Transformer Layers
Embedding---[20, 64][4 * 4, 64]
TransformerEncoder---[64, 8, 3, 256][64, 8, 3, 256]
Linear Layer
Linear[31, 1000][12, 1000][43, 1000][(9 * 64) + 31, 1000][(45 * 64) + 31, 1000]
SWH Retrieval LayersWeight Dimensions
Linear[1000, 2000]
Linear[2000, 1500]
Linear[1500, 500]
Linear[500, 200]
Linear[200, 100]
Linear[100, 10]
Linear[10, 1]
Table 3. Average Train and Validation Loss with Standard Deviations.
Table 3. Average Train and Validation Loss with Standard Deviations.
ModelTrain Loss ± StdValidation Loss ± Std
ANN-Wave0.2135 ± 0.01460.2177 ± 0.0044
CNN-Wave1.1400 ± 0.01911.9116 ± 0.6267
Hybrid-Wave0.2065 ± 0.01650.1938 ± 0.0109
Trans-Wave0.2034 ± 0.01480.1955 ± 0.0086
ViT-Wave0.1816 ± 0.00400.1735 ± 0.0042
Table 4. Evaluation accuracy of all models.
Table 4. Evaluation accuracy of all models.
RMSEMAEBiasMAPER2
ANN-Wave0.45460.30480.004018.68140.8889
CNN-Wave1.23370.94470.271574.64400.1819
Hybrid-Wave0.42250.27990.014417.71260.9040
Trans-Wave0.43440.2931−0.001223.22380.8986
SNR [28]0.5340.421-21.52-
NCDW LES [28]0.5030.390-20.02-
ANN [25]0.59----
BT [25]0.48----
DCNN [27]0.422---0.89
ViT-Wave0.40520.2700−0.001518.02000.9117
Table 5. Percentage Improvement of ViT-Wave Model Over Other Models.
Table 5. Percentage Improvement of ViT-Wave Model Over Other Models.
RMSEMAEBiasMAPER2
ANN-Wave10.85%11.45%137.50%3.54%2.57%
CNN-Wave67.15%71.42%100.55%75.86%401.10%
Hybrid-Wave4.10%3.53%110.42%−1.74%0.83%
Trans-Wave6.72%7.88%−20.00%28.28%1.46%
SNR [28]24.07%35.87%-16.26%-
NCDW LES [28]19.43%30.77%-10.00%-
ANN [25]31.36%----
BT [25]15.62%----
DCNN [27]4.00%---2.44%
ViT-Wave-----
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhou, Z.; Duan, B.; Ren, K.; Ni, W.; Cao, R. Enhancing Significant Wave Height Retrieval with FY-3E GNSS-R Data: A Comparative Analysis of Deep Learning Models. Remote Sens. 2024, 16, 3468. https://doi.org/10.3390/rs16183468

AMA Style

Zhou Z, Duan B, Ren K, Ni W, Cao R. Enhancing Significant Wave Height Retrieval with FY-3E GNSS-R Data: A Comparative Analysis of Deep Learning Models. Remote Sensing. 2024; 16(18):3468. https://doi.org/10.3390/rs16183468

Chicago/Turabian Style

Zhou, Zhenxiong, Boheng Duan, Kaijun Ren, Weicheng Ni, and Ruixin Cao. 2024. "Enhancing Significant Wave Height Retrieval with FY-3E GNSS-R Data: A Comparative Analysis of Deep Learning Models" Remote Sensing 16, no. 18: 3468. https://doi.org/10.3390/rs16183468

APA Style

Zhou, Z., Duan, B., Ren, K., Ni, W., & Cao, R. (2024). Enhancing Significant Wave Height Retrieval with FY-3E GNSS-R Data: A Comparative Analysis of Deep Learning Models. Remote Sensing, 16(18), 3468. https://doi.org/10.3390/rs16183468

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop