1. Introduction
As a key thermodynamic variable in the atmosphere, the atmospheric temperature profiles (ATPs) play a vital role in improving numerical simulation, climate prediction, and atmospheric monitoring [
1,
2,
3]. Currently, radiosonde observations are the primary means of obtaining ATP data [
4]. Tibetan Plateau (TP), the highest and most topographically complex plateau on the earth, is the source of several rivers and has been referred to as the world’s third pole, affecting more than 20% of the global population [
5]. More importantly, the eastern TP is home to one of the world’s most critical biodiversity hotspots: the Hengduan Mountains Biodiversity Hotspot [
6,
7,
8]. Additionally, over 90% of the TP’s population resides in this region [
9]. Enhancing the capability to observe meteorological elements in this area is crucial for protecting biodiversity and ensuring the safety of lives and property.
Nonetheless, in contrast to other land areas globally, there is a shortage of observational data over the Tibetan Plateau due to its high altitude, extreme environmental conditions, and underdeveloped logistical support. Although several field experiments have been conducted in these data-deficient regions, including three Tibetan Plateau Atmospheric Scientific Experiments held in 1979, 1998, and from 2013 to 2021 [
10,
11,
12,
13], the observational data from these short-term experiments suffer from temporal and spatial resolution limitations. The intricate terrain and underlying surface characteristics further restrict the representativeness of the observation stations [
14]. Therefore, high-resolution satellite remote sensing has the potential to effectively bridge the gap left by conventional observations in the eastern TP and its adjacent areas.
Satellite observation data have the advantages of wide coverage, and their high temporal and spatial resolution is less susceptible to the geographical environment [
15,
16], which makes up for the lack of sparse conventional observations over the TP area. Thus, it serves as an essential part of the meteorological observation network [
17,
18]. Among satellite-based observation instruments, the infrared detector greatly contributes to the research [
19,
20,
21]. Hyperspectral infrared sounders (HIRs) feature thousands of channels with high spectral resolution, enabling precise measurement of the vertical profiles of atmospheric temperature and humidity [
22].
The geostationary-satellite-based infrared hyperspectral sounder offers a higher scanning frequency compared to its polar orbit counterparts, which improves the applications of the ATP, especially in severe weather monitoring and predictions on the TP. In 2016 and 2021, China launched two second-generation geostationary meteorological satellites, Fengyun-4A (FY-4A) and Fengyun-4B (FY-4B), both equipped with the Geostationary Interferometric Infrared Sounder (GIIRS). The GIIRS became the first hyperspectral infrared sounder aboard a geostationary meteorological satellite [
23]. It enabled high spatiotemporal resolution detection of weather systems in China and its surrounding regions, facilitating the acquisition of more detailed atmospheric vertical structures, including the retrieval of atmospheric temperature profiles at the 1 km layers [
24] (
http://fy4.nsmc.org.cn/nsmc/en/theme/FY4B.html, accessed on 5 November 2024). The development of the rapid radiative transfer model [
25], the evaluation of bias characteristics [
26], and advancements in retrieval algorithms [
27,
28,
29] have facilitated the application of FY-4A/GIIRS data in numerical weather prediction models [
30,
31,
32]. The atmospheric temperature and humidity profile products obtained from the onboard sounding instruments have significantly contributed to weather monitoring and forecasting [
33,
34].
The accuracy of satellite data is a crucial factor in ensuring the smooth progress of research and operations [
35]. Before utilizing retrieval products, it is essential to assess the accuracy of the data and select appropriate data for use [
2]. Scholars have conducted some research on FY-4A/GIIRS data quality evaluation. Yang et al. [
24] conducted a comparative analysis between the temperature and humidity profiles retrieved from FY-4A/GIIRS and GOES using one-dimensional variational retrieval [
36,
37]; their study delineated that while the vertical root mean square error (RMSE) characteristics of the temperature and humidity profiles retrieved from FY-4A and GOES exhibit similarity, FY-4A/GIIRS demonstrates a lower RMSE compared to GOES. He et al. [
38] and Du et al. [
39] found that clouds would affect the quality of FY-4A/GIIRS ATP and different types and heights of clouds have different impacts on them. Some studies indicate that ATP from FY-4A/GIIRS can fill the observation gap caused by sparse conventional observations and have considerable value in assimilation applications [
2,
40,
41].
Compared to FY-4A, the GIIRS on FY-4B features a broader spectral range, higher spectral resolution in the long-wave infrared band, and improved radiometric calibration accuracy and detection sensitivity [
42]. Niu et al. [
43] compared the quality of FY-4A/GIIRS and FY-4B/GIIRS long-wave infrared channels and pointed out that under clear sky conditions, the data error of FY-4B/GIIRS long-wave channels is smaller, and the daily variation in the error is also smaller. Benefiting from the increased spectral range, the ATP accuracy of FY-4B/GIIRS is higher than that of FY-4A/GIIRS and is more stable during the retrieving [
42]. Yang et al. [
44] used the fifth-generation European Centre for Medium-Range Weather Forecasts (ECMWF) atmospheric reanalysis (ERA5) data to evaluate the accuracy of FY-4B/GIIRS ATP, which showed that the FY-4B/GIIRS ATP has high overall accuracy, with the RMSE values within 1K. Nevertheless, research about the ATP of FY-4B/GIIRS remains relatively scarce, especially over TP, due to the short timeframe of the product release.
The traditional method for validating satellite ATP involves the use of RAOB data [
2,
39,
40]. The daily temporal resolution of RAOB data, which are available twice at 00 UTC and 12 UTC, is considered insufficient for a thorough evaluation and comparative analysis of the FY-4B/GIIRS ATP products over the Tibetan Plateau. This sparsity remains a major source of errors, and the cost and environmental impact of launching sufficient RAOBs to overcome this limitation hinder our ability to address the issue of data sparsity [
45,
46]. This issue motivates us to use ERA5 reanalysis data to test the performance of the FY-4B/GIIRS ATP products over TP. This study chose an entire year of ATP products from FY-4B/GIIRS to compare with RAOB, including nine routine sounding observations in the TP area from 17 January 2023 to 17 January 2024. Additionally, the ATP products were also compared with ERA5 reanalysis data, to compensate for RAOB’s lack of temporal and spatial resolution.
The subsequent sections of this paper are structured as follows.
Section 2 describes the dataset used in this study, as well as the matching strategy and some statistical criteria used for evaluation. In
Section 3,
Section 4 and
Section 5, the bias characteristics of FY-4B/GIIRS ATP over the eastern TP are analyzed in detail. Conclusions are given in
Section 6.
2. Materials and Methods
2.1. FY-4B/GIIRS ATP Products
The ATP products derived from FY-4B/GIIRS in this study were generated using a neural network algorithm developed by the National Satellite Meteorological Center (NSMC) [
47]. This approach does not depend on the precision of the prior information and could provide more accurate products than the Dual-Regression method. The full disk ATP products‘ spatial and temporal resolution is 12km and 2 h, respectively. Although the observation period is 2 h, to ensure data quality, the first 90 min are the actual observation time, and the last half an hour is the GIIRS’s self-check. FY-4B/GIIRS ATP products are stored in pressure vertical layers with 101 levels from 0.005 to 1100 hPa.
A QC Flag with four levels is used to mark the FY-4B/GIIRS ATP products (
Table 1). In this paper, the ATP products marked with QC Flags 2 and 3 were excluded during the precision assessment. FY-4B/GIIRS ATP products have a Cloud Mask (CLM) in each detector; CLM equal to 0 means clear sky in this site while 1 indicates cloudy weather. Considering that satellite detection data in the infrared band are susceptible to cloud pollution [
48], only products with CLM equal to 0, i.e., completely clear sky, are used.
The FY-4B/GIIRS ATP products were officially released for download on 17 January 2023. In this study, ATP products from 17 January 2023 to 17 January 2024, totaling 366 days, were selected for analysis and assessment.
2.2. RAOB ATP
The RAOB data come from the Department of Atmospheric Sciences of the University of Wyoming (
https://weather.uwyo.edu/upperair/sounding.html, accessed on 5 November 2024). The detection range extends from the surface to 100 hPa in the upper troposphere, and meteorological elements include air pressure, temperature, mixing ratio, wind direction, wind speed, etc. Nine RAOB stations in the TP area are selected in this study, indicated by the red triangles in
Figure 1a. Both terrain and land surface type play crucial roles in determining the quality of satellite radiation retrieval [
49]; the geolocation and land use categories for the nine selected RAOB stations are provided in
Table 2, along with their precise geographical coordinates. The land use data are sourced from the Copernicus Global Land Service: land cover 100 m, collection 3, epoch 2019, covering the globe [
50]. It can be seen from the table that the elevation of the nine stations ranges from 2295.2 m to 4507 m, of which the Nagqu has the highest topography, located in Tibet. As for the land use category of these stations, Dulan, Qamdo, Garze, and Yushu are classified as Grassland, while others are classified as Urban. According to
Figure 1a, it can be found that all the RAOB stations are distributed in the eastern part of the TP. Before processing the radiosonde data, we conduct quality control to ensure the highest possible data quality.
2.3. ERA5 ATP
The ERA5 ATP used in this study is obtained from the Copernicus Climate Change Service (
https://cds.climate.copernicus.eu/datasets/reanalysis-era5-pressure-levels?tab=overview, accessed on 5 November 2024). ERA5 is the fifth-generation atmospheric reanalysis of the European Centre for Medium-Range Weather Forecasts (ECMWF), offering hourly and monthly reanalysis datasets with a horizontal resolution of 0.25° and vertical resolution across 37 pressure levels ranging from 1 hPa to 1000 hPa. The pressures of these 37 levels are 1 hPa, 2 hPa, 3 hPa, 5 hPa, 7 hPa, 10 hPa, 20 hPa, 30 hPa, 50 hPa, 70 hPa, 100 hPa, 125 hPa, 150 hPa, 175 hPa, 200 hPa, 225 hPa, 250 hPa, 300 hPa, 350 hPa, 400 hPa, 450 hPa, 500 hPa, 550 hPa, 600 hPa, 650 hPa, 700 hPa, 750 hPa, 775 hPa, 800 hPa, 825 hPa, 850 hPa, 875 hPa, 900 hPa, 925 hPa, 950 hPa, 975 hPa, and 1000 hPa. In this study, the daily ATP is derived with the arithmetic averaging of hourly datasets.
2.4. Methodology
The TP region is defined by the Integration dataset of the Tibet Plateau boundary [
51], as shown in
Figure 1a. To effectively and accurately evaluate the performance of FY-4B/GIIRS ATP in the eastern TP, it is essential to align the temporal and spatial positions of satellite data and atmospheric vertical pressure layers with the RAOB ATP. Temporally, the time difference in all ATP is limited to 1 h. Spatially, some researchers [
38,
52] used the matching window, which means the range of matchups is limited by using a given latitude and longitude grid centered on the RAOB station. Within the grid, satellite products are averaged and then matched into the RAOB ATP at the corresponding time. In this study, the distance of the matching window is limited to 0.5° × 0.5°, taking Garze as an example (
Figure 1b). However, due to the removal of data from the cloud area, the number of available data is limited. So, the inverse distance weight (IDW) interpolation is also adopted for spatial matching. The IDW method is a simple approach that does not require extensive computations. It is considered one of the standard spatial interpolation techniques in geoinformation science [
53,
54,
55]. Formally, the IDW method is used to estimate the unknown value
in location
, given the observed
y value at sampled location
as follows:
Essentially, the estimated value at
is a linear combination of the weights
and the observed
y values at
where
is typically defined as follows:
with
In Equation (
2), the numerator
is the distance between
and
with a power
. The denominator is the sum of all inverse distance weights for locations
i ensuring that the sum of all
for an unsampled point equals one (Equation (
3)). The parameter
defines the weight’s geometric form. A larger
enhances the distance decay effect, making it more sensitive to distance changes, while a smaller
leads to estimates that are averages of nearby
. Thus, larger
gives more weight to the nearest points and less to those further away [
54].
Figure 2 compares the RMSE between FY-4B and RAOB and the number of effective data at nine RAOB stations for the two different matching methods. Except for Lhasa and Yushu, where the RMSE of the IDW method is slightly higher than that of the MW method, the RMSE for all other stations is lower when using the IDW method. The largest value occurs at Changdu station, reaching 1.53 K, while the differences at Golmud, Dulaan, and Naqu also exceed 1 K. For the remaining stations, the RMSE of the IDW method is marginally lower than that of the MW method. However, the count of effective data of the MW method is notably lower compared to that of the IDW method. Especially prominent at high-altitude sites such as Qamdo, Garze, Lhasa, Yushu, and Nagqu, the MW method yields effective data totals of approximately 60% or even lower compared to the IDW method. In summary, the IDW method not only allows for the acquisition of more valid data but also reduces the RMSE to some extent. Therefore, to obtain better statistical representativeness and more precise matching results, the effective dataset used in the evaluation should include as much data as possible. Therefore, in this study, the IDW method was used for spatial matching. Radiosonde observation data typically consist of measurements taken at predefined isobaric levels (standard levels), totaling 24 layers [
56].
However, due to the high altitude of our research area, the surface air pressure at the lowest-elevation Xining station is already above 750 hPa. Additionally, the top-level pressure in the University of Wyoming’s RAOB data is 100 hPa. Therefore, the vertical study region ranges from 750 hPa to 100 hPa. However, this results in fewer vertical layers. To address this, we densified the vertical layers, ultimately obtaining the following vertical layers: 750, 700, 650, 600, 550, 500, 450, 400, 350, 300, 250, 200, 150, and 100 hPa. Given that both RAOB and FY-4B have dense vertical layers and that the temperature within the troposphere can be considered to vary linearly according to the lapse rate, the vertical matching method employed linear interpolation, which interpolated both RAOB and FY-4B ATP onto the same pressure layers.
Before comparing FY-4B/GIIRS ATP with ERA5 ATP, it is necessary to first compare ERA5 with RAOB to validate ERA5’s representativeness ATP in the eastern TP. The specific approach involves using bilinear interpolation to interpolate ERA5 ATP onto the locations of the nine RAOB stations, followed by comparative validation, the same processing method as described earlier will be employed in the vertical direction. Subsequently, the FY4B ATP will be interpolated onto the ERA5 grid using the nearest neighbor method, facilitating subsequent comparisons. In the vertical direction, linear interpolation is utilized to interpolate FY-4B/GIIRS ATP onto the pressure layers of ERA5. Given the high altitude of the TP region where most locations have minimum pressure below 600 hPa, and uncertainties above 100 hPa are significant, this study focuses only on the data for 100 to 600 hPa, with 14 layers including 600, 550, 500, 450, 400, 350, 300, 250, 225, 200, 175, 150, 125, and 100 hPa.
Some objective statistical metrics are used in this study to evaluate the performance of ATP from FY-4B/GIIRS, RAOB, and ERA5: the mean bias (MB), the RMSE, the correlation coefficient (R), and the sample standard deviation (STD). These statistical metrics are expressed as follows:
where
and
are the FY-4B/GIIRS ATP products and mean of FY-4B/GIIRS ATP products, respectively;
and
are the ATP of RAOB (or ERA5) and the mean of the ATP of RAOB (or ERA5), respectively;
n is the total number of the observations; and the
and
are the bias between FY-4B/GIIRS ATP and RAOB ATP (or ERA5).
4. Comparison Between RAOB ATP and FY-4B ATP
Figure 4 shows the vertical distribution of ATP observed by FY-4B/GIIRS (blue line) and RAOB. At 00 UTC (
Figure 4a–i), FY-4B/GIIRS ATP is lower than RAOB on the surface layer in Golmud, Lhasa, and Nagqu stations while the other stations are generally higher. From the surface up to around 500 hPa, the FY-4B generally underestimates the air temperature compared to the RAOB observation, but such negative bias tends to decrease with elevation. As the pressure is lower than 500 hPa, almost none of the stations show a significant positive or negative bias, with bias fluctuating around 0 K, except the Lhasa station exhibits a relatively persistent underestimation with a maximum bias of −2 K. It is noticeable that the bias from 500 hPa to 150 hPa is relatively smaller than the lower layer in these RAOB stations. However, from 150 hPa to the uppermost layer, bias at all nine RAOB stations indicate an increasing trend towards positive bias, with Lhasa, Yushu, and Nagqu showing the most notable growth. Particularly, at the Nagqu, the bias has already reached around 4K at 100 hPa.
At 12 UTC (
Figure 4j–r), apart from the Lhasa, the vertical trend of bias at the other eight stations is generally similar to that at 00 UTC, but there are some differences. Specifically, all nine stations exhibit negative bias at the surface, with higher-elevation stations such as Qamdo (
Figure 4n), Garze (
Figure 4o), Lhasa (
Figure 4p), Yushu (
Figure 4q), and Nagqu (
Figure 4r) showing more severe underestimation compared to 00UTC. From the surface to 500hPa, all stations show a decreasing trend in bias values. From 500 hPa to the uppermost layer, except for the Lhasa station, the variation in bias at the other stations is similar to that at 00UTC, but with smaller fluctuations in bias. The Lhasa station exhibits significant differences in bias compared to 00UTC, characterized by consistent underestimation throughout the layers. Specifically, there is noticeable underestimation at the surface and near-surface, with bias approaching approximately −8K at the surface and decreasing with height, reaching a minimum around 300 hPa, followed by a slight boost in underestimation as height increases further, reaching around −2K at 150 hPa.
From the perspective of STD, at or above 150 hPa, the STD for all nine stations shows an increase at both 00UTC and 12 UTC, indicating slightly higher instability compared to other pressure levels. Between 500 and 150 hPa, although some STD is present, the small value of the bias means that even with the addition of STD, the bias still fluctuates within a narrow range. Near the surface, STD does not significantly differ from the mid-troposphere, except for the Xining (
Figure 4j), Golmud (
Figure 4k), and Hezuo (
Figure 4l) stations at 12 UTC. Overall, both at 00 UTC and 12 UTC, the vertical variation in STD over the nine RAOB stations remains relatively stable, all within 5 K, with no abrupt changes. This reflects the higher stability and consistency of FY-4B ATP products.
Having analyzed the overall distribution of ATP from FY-4B/GIIRS and RAOB, we further investigate their correlations with the RAOB ATP in the entire atmospheric layers.
Figure 5 presents the comparison results of ATP between FY-4B and RAOB at nine stations. Additionally, the regression equation (red line), R, total data number (N), MB, and RMSE are also presented in the figure. Overall, the fitting between FY-4B ATP and RAOB ATP is good, with data points distributing roughly around the 1:1 line, showing no significant deviations or dispersion. This is reflected in the R, with all nine stations having R greater than 0.99, indicating an excellent correlation between ATP from FY-4B and RAOB. Furthermore, based on the density of data point distribution, the majority of data points are concentrated near the 1:1 line, further demonstrating the good consistency between the two ATP products. This is a positive outcome, indicating the presence of consistency and similarity between FY-4B and RAOB ATP.
As the elevation of the station position increases, there is an observable decline trend in the number of effective data points, primarily attributed to the reduction in lower-level data resulting from the topographic elevation. However, despite its elevation being located between Dulan (
Figure 5d) and Garze (
Figure 5f), Qamdo (
Figure 5e) demonstrates a lower total data point volume compared to the higher-elevation Lhasa (
Figure 5g) and Yushu (
Figure 5h). This suggests a higher occurrence of lower-quality FY-4B ATP data at Qamdo.
From a quantitative analysis perspective, among the nine stations, Xining (
Figure 5a), Hezuo (
Figure 5c), and Dulan (
Figure 5d) exhibit MB values greater than 0, indicating that FY-4B tends to overestimate temperatures in these three stations and their surrounding areas to varying degrees. Conversely, in the remaining six stations and their surrounding areas, the bias is predominantly underestimated. The largest positive bias occurs at the Xining station (
Figure 5a), which has the lowest elevation among all nine stations, with a value of 0.40K. The largest negative bias is observed at the Lhasa station (
Figure 5g), with an MB value of −1.44K. For the RMSE, the highest value is observed at the Lhasa station (
Figure 5g), reaching 3.17K, while the lowest one is recorded at the Garze station (
Figure 5f), at 2.39 K. With a relatively small range of RMSE variation, it indicates that FY-4B/GIIRS ATP demonstrates good spatial consistency in quality control.
5. Spatiotemporal Evaluation of FY-4B/ATP for a Whole Year
The relationships between ERA5 ATP and nine RAOB ATP are depicted in
Figure 6. It can be visually observed that compared to FY-4B, ERA5 exhibits better consistency with RAOB since scatters at nine RAOB stations show a tighter distribution. The R at all stations except Hezuo (
Figure 6c), Garze (
Figure 6f), Lhasa (
Figure 6g), and Yushu (
Figure 6h) are 0.999 or higher. Although the R at these four stations is slightly lower, at 0.9987, 0.9989, 0.9984, and 0.9983, respectively, compared to 0.999, the comparison with FY-4B/GIIRS ATP (
Figure 6c,f,g,h) still shows improvements. The MB at all nine stations suggests that ERA5 ATP tends to underestimate temperature. From the graphs, it is evident that the majority of bias originates from temperatures above 260 K, indicating a more pronounced underestimation near the surface layer.
Although the comparison between ERA5 and RAOB reveals certain bias in ERA5 ATP, overall, ERA5 ATP still maintains good consistency with RAOB ATP, with relatively small bias values. Therefore, it can be considered sufficiently representative for evaluating the performance of FY-4B/GIIRS ATP in the eastern TP.
As shown in
Figure 7a, due to the zenith angle of FY-4B/GIIRS being too large in the western TP, where deformation rates increase with the satellite zenith angle and reach up to three times the Sub-Satellite point deformation at angles exceeding 60°, there is a significant difference in data availability between the eastern and western TP [
57,
58]. This could cause a significant decrease in the representativeness, reliability, and availability of data among the western TP during spatial analysis.
The annual mean temperature of the entire region after interpolation (
Figure 7b) reveals a distinct arc around 85–95°>E, which is the borderline of the zenith angle of 60°. The continuity of data on the western side of the arc is poor, and its reliability cannot be verified. To retain as much valid data as possible, the intersection points of the arc and the north-south boundaries of the TP, denoted as points A (87°E, 27.5°N) and B (95.5°E, 39.75°N), are selected to establish a line (
Figure 7b). Subsequently, the observations at the east of the red line in
Figure 7b are analyzed (
Figure 7c), as shown in
Figure 7c.
As depicted in
Figure 8, the annual average tropospheric temperature is dominated by positive bias orientating from south to north of the eastern TP, with a bias over 0.25 K. The largest bias is located in the central region of the study area (indicated by the blue box in
Figure 8a), where the bias ranges from 0.5 K to 0.75 K with a maximum of up to 0.84 K. While the temperature bias over the southeastern and southwestern parts of the study area exhibits as slightly negative, it is majorly less than −0.25 K.
To further investigate the vertical characteristics of bias, we select the region with the most pronounced positive bias (indicated by the blue box in
Figure 8a). The pattern in
Figure 8b suggests that FY-4B/GIIRS tends to overestimate temperature above 150 hPa, with the most significant bias at 100 hPa, where the value of bias exceeds 2 K. Bias at and below 150 hPa is relatively slighter, staying within ±2 K. At 150 hPa, the bias is relatively smaller, at only 0.16K. The above analysis suggests that the spatial average bias in this region ultimately shows positive values because the positive bias in the upper layers contributes more to the bias than the negative bias in the lower layers. The vertical distribution of STD is uniform, with an average value of 1.36 K across the entire layer. The maximum STD occurs at 100 hPa, reaching 2.23 K, while the minimum value is at 400 hPa, at 0.92 K. This indicates that the overall dispersion of bias is low, with few outliers, suggesting good quality control of the ATP from FY-4B.
Notably, the vertical bias distribution pattern shown in
Figure 7b is similar to the vertical bias distribution pattern at 00 UTC between the FY-4B and RAOB (
Figure 4a–i, above 600 hPa) discussed earlier. This similarity indicated that the ATP bias of FY-4B exhibits good spatial consistency in the eastern TP region. This provides data support for the subsequent use of the product and bias correction.
As shown in
Figure 9, the density distribution indicates that most data points across the four seasons are clustered near the 1:1 line, suggesting a good consistency between FY-4B/GIIRS ATP and ERA5 ATP. The correlation coefficient
R for all four seasons is greater than 0.99, with summer (
Figure 9c) achieving the highest
R of 0.9994, while autumn (
Figure 9d) has the lowest at 0.9978. Except for summer, where the MB is negative at −0.29 K, the MB for the other three seasons is positive. The largest positive MB occurs in spring (
Figure 9b) at 0.47 K, while the smallest positive MB is in autumn at only 0.03 K, and winter (
Figure 9a) has an MB of 0.33 K. From the perspective of RMSE, although autumn has the smallest MB, it exhibits the largest RMSE at 1.55 K. This suggests that autumn may contain comparable positive and negative bias that offset each other in the MB calculation. Winter and spring have very similar RMSE values of 1.17 K and 1.19 K, respectively, while summer has the smallest RMSE at 1.00 K.
Figure 10 gives the horizontal distribution of bias in different seasons. We note that the pattern of the bias over the eastern TP region is similar in winter, spring, and autumn, but their values show some differences. In spring (
Figure 10b), most of the areas are dominated by positive bias, with the highest value reaching 1.52 K, and within the rectangular region, most bias exceeds 0.75 K. During the winter (
Figure 10a), areas with positive bias are fewer compared to the spring, but the eastern TP still predominantly exhibits positive bias, with the highest value being 1.08 K, which is lower than in the spring, and most bias exceeds 0.5 K within the rectangular region. However, in the northern part of the rectangular, the bias slightly increases compared to the spring. Additionally, in areas with a negative bias, the values also slightly increase, with the maximum negative bias reaching −1.18 K. Although the pattern is similar to winter and spring, the values of bias are relatively lower in autumn (
Figure 10d). Its highest positive bias and the lowest negative bias are 0.79 K and −0.34 K, respectively, indicating a significant decrease compared to winter and spring. In summer, the bias distribution pattern differs significantly from the other three seasons, shifting from a predominantly positive bias to a negative bias-dominated pattern. Although most areas exhibit negative bias, there are still some regions with positive bias. These areas, where positive bias persists, are consistent with regions of significant positive bias observed in the other seasons.
We also analyze the vertical distribution of bias within the rectangular region for each of the four seasons (
Figure 11). The largest positive bias across all four seasons occurs at 100 hPa, with spring (
Figure 11b) showing the highest bias at 3.21 K, followed by winter (
Figure 11a) at 2.34 K. The bias at 100 hPa in summer (
Figure 11c) and autumn (
Figure 11d) is comparatively smaller, measuring 1.97 K and 2.07 K, respectively. From 100 hPa to 150 hPa, the positive bias for all four seasons decreases as the pressure increases, approaching 0 K at 150 hPa. A positive bias predominates from 150 hPa to 200 hPa in winter, spring, and autumn. Within this pressure range, the peak positive bias for winter and spring occurs at 200 hPa, with values of 2.04 K and 1.50 K, respectively. In autumn, the maximum positive bias in this range is observed at 175 hPa, with a value of 0.91 K. At 250 hPa, significant negative bias is observed in winter and autumn, with values of −0.81 K and −1.06 K, respectively, while no significant bias is noted in spring. In winter, there is a noticeable positive bias at 300 hPa, with a value of 1.11 K, while spring and autumn exhibit a smaller positive bias. Between 350 hPa and 450 hPa, winter and autumn show no significant bias, whereas spring presents a slight positive bias. From 450 hPa to 600 hPa, winter and spring exhibit negative bias. The largest negative bias in winter occurs at 500 hPa, with a value of −1.44 K, while in spring, the largest negative bias is observed at 600 hPa, recorded at −1.22 K. In autumn, no significant bias is present within this pressure layer. The vertical distribution of bias in summer exhibits a pattern distinct from that of other seasons. Aside from small positive bias at 450 hPa and 500 hPa, negative bias are present from 150 hPa to 600 hPa, with the largest negative bias occurring at 250 hPa, recorded at −1.03 K.
Moreover, as can be seen from
Figure 11, the negative bias in the lower layers (below 450 hPa) during the winter and spring is also more pronounced than in the summer and autumn. This indicates that the bias across the entire layers exhibits a synchronous variation trend, meaning that as the positive bias in the mid-to-upper layers increases, the negative bias in the lower layers also increases simultaneously. In the autumn season (
Figure 11d), the vertical pattern of bias is similar to the pattern of spring (
Figure 11a) and winter (
Figure 11b), but the values of bias are slight. In the summer (
Figure 11c), although there is a remarkable difference in the vertical bias pattern, with predominantly negative bias overall, positive bias is still present in the upper layers (above 150 hPa). The average STD of vertical bias for the four seasons is 1.19 K, 1.45 K, 0.90 K, and 0.96 K, respectively. Similarly, the STD for winter and spring is higher than that for summer and autumn.
6. Summary and Discussions
Due to the complex terrain and sparse population in the TP region, it is hard to obtain exact and wide-ranging atmospheric temperature profiles from conventional observations. At present, the Geostationary Interferometric Infrared Sounder (GIIRS) onboard the FY-4 series satellite can effectively eliminate this deficiency. Based on the ATP data from RAOB, ERA5 and the latest FY-4B/GIIRS products, this study uses various spatiotemporal matching ways to minimize the spatiotemporal differences between different data and investigates the performance of FY-4B/GIIRS ATP products over the eastern TP region.
Our research demonstrates the product quality of ATP from FY-4B/GIIRS is closely related to the cloud conditions. Under clear sky conditions, the percentage of ATP with “Perfect” and “Good” QC Flags is 68.45%. Under cloudy conditions, however, the proportion of “Perfect” and “Good” declines to 48.3%, representing a reduction of approximately 20% compared to clear sky conditions. This results in a distinct decrease in the availability of products when there is substantial cloud cover over the TP region. FY-4B/GIIRS ATP exhibits great consistency with RAOB ATP, with an average RMSE of only 2.58 K, ranging from 2.39 K to 3.17 K. The bias shows slight variations among different RAOB sites and between day and night, especially near the surface and in the upper troposphere. The dispersion of bias across various sites is also relatively uniform. In the lower layers, FY-4B/GIIRS temperatures are generally lower than those from RAOB, while they are higher than RAOB in the upper layers. The bias is slight in the middle layers. Bias does not exhibit a significant correlation with altitude or land use type, suggesting that altitude and land use type may not be the primary factors affecting the detection and retrieval performance of FY-4B/GIIRS in the eastern TP. Moreover, the bias of FY-4B/GIIRS ATP demonstrates different characteristics in different regions, but overall, overestimation is predominant over the eastern TP, especially in winter and spring. This may be related to snow cover, as the spatial distribution pattern of annual average bias correlates to some extent with the spatial distribution of annual snow days in the TP region according to Huang et al. [
59]. Snow cover directly impacts the albedo and land surface emissivity (LSE), which are closely related to satellite detection results.
However, due to the limitation of satellite zenith angle, only data from the eastern part of the TP are available for evaluation and analysis in this study, limiting a comprehensive analysis of the quality of FY-4B/GIIRS ATP products across the entire TP. Consequently, the finding cannot be generalized to the western TP area. However, with FY-4B drifting from 133°E to 105°>E on 5 March 2024 and replacing FY-4A as the main operational system (
http://www.nsmc.org.cn/nsmc/cn/satellite/FY4B.html, accessed on 5 November 2024), the satellite zenith angle in the TP region would decrease. As a result, complete detection data for the TP region will be obtainable. Nonetheless, this study systematically evaluates the quality of FY-4B/GIIRS ATP products accumulated over one year in the eastern TP region. It unearths the potential application value of FY-4B in the TP region and provides a certain preliminary research foundation for the subsequent related studies.