Next Article in Journal
A Contrast of Recent Changing Tendencies in Genesis Productivity of Tropical Cloud Clusters over the Western North Pacific in May and October
Next Article in Special Issue
Analyzing Atmospheric Circulation Patterns Using Mass Fluxes Calculated from Weather Balloon Measurements: North Atlantic Region as a Case Study
Previous Article in Journal
Reaction Characteristics of NOx and N2O in Selective Non-Catalytic Reduction Using Various Reducing Agents and Additives
Previous Article in Special Issue
Polyphony of Short-Term Climatic Variations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Chinese Sunspot Drawings and Their Digitizations-(VI) Extreme Value Theory Applied to the Sunspot Number Series from the Purple Mountain Observatory

1
Center of Astronomy and Space Science, China Three Gorges University, Yichang 443002, China
2
Purple Mountain Observatories, Chinese Academy of Sciences, Nanjing 210008, China
3
National Astronomical Observatories, Chinese Academy of Sciences, Beijing 100101, China
*
Author to whom correspondence should be addressed.
Atmosphere 2021, 12(9), 1176; https://doi.org/10.3390/atmos12091176
Submission received: 23 July 2021 / Revised: 6 September 2021 / Accepted: 9 September 2021 / Published: 13 September 2021
(This article belongs to the Special Issue Links between Solar Activity and Atmospheric Circulation)

Abstract

:
Based on the daily sunspot number (SN) data (1954–2011) from the Purple Mountain Observatory, the extreme value theory (EVT) is employed for the research of the long-term solar activity. It is the first time that the EVT is applied on the Chinese SN. Two methods are used for the research of the extreme events with EVT. One method is the block maxima (BM) approach, which picks the maximum SN value of each block. Another one is the peaks-over-threshold (POT) approach. After a declustering process, a threshold value (here it is 300) is set to pick the extreme values. The negative shape parameters are obtained by the two methods, respectively, indicating that there is an upper bound for the extreme SN value. Only one value of the N -year return level (RL) is estimated: N = 19 years. For N = 19 years, the RL values of SN obtained by two methods are similar with each other. The RL values are found to be 420 for the POT method and the BM method. Here, the trend of 25th solar cycle is predicted to be stronger, indicating that the length of meridional forms of atmospheric circulation will be increased.

1. Introduction

It is well known that the Sun is a dynamic and complex body, releasing energy on all wavelength intervals of the electromagnetic spectrum [1]. Solar activity has an important impact on space climate, space navigation, and high-frequency radio communications [2]. Research based on long-term solar activity helps reveal the physical connections in the solar-terrestrial space environment and the dynamic process operating in the solar interior [3,4]. Sunspots are the earliest observed solar activity, which are the most significant observational features on the photosphere [5]. The greatest observation of solar activity is represented by the time series of sunspot number (SN) [6,7]. A high probability of a strong solar storm is related to high values of the SN index [8].
Extreme value theory (EVT) is a branch of statistics, which offers techniques for the research and estimation of probabilities of predicting rare events [9]. In recent years, as a special and important tool, the extreme value theory (EVT) has been applied in various science fields and got some marvel findings. EVT has been used on climate change [10,11,12,13], economics [14], engineering [15], and astrophysics [16,17,18]. For example, Ref. Siscoe (1976) [19] used the EVT to analyze the largest SN per solar cycle (SC) and the result supported the argument for a change in SC statistics during the Maunder minimum. Ref. Asensio (2007) [20] applied the extreme events of the SC to investigate the laws of extreme change and the results indicated that the SC 19 was the most probable time of maximum value. Ref. Acero et al. (2017) [21] applied the EVT on the new version of the SN index at daily, monthly, and yearly timescales, and the results showed that the return level (RL) was reaching a plateau. Furthermore, they found that the daily SN values would hardly be greater than 550 in the following 1100 years. Ref. Elvidge et al. (2018) [22] used EVT to determine the probability of solar flares of Carrington-like, and the results of the EVT were consistent with flare data from the Kepler space telescope mission. The millennial years 10Be and 14C time series were used to study the solar activity [23] and the solar radio of flux at the daily scale were also used for the research of the extreme events law [24]. Ref. Love (2020) [25] did some experiments on statistical models of extreme values of magnetic superstorm intensities, and the discrete samples of the disturbance storm time (Dst) data were found to be insufficient to provide usefully accurate estimates of a statistically theoretical upper bound for magnetic storm intensity.
In these above works, two traditional methods were usually used to study long-term solar indicators. The first one is the block maxima (BM) approach [9,19], which has been used to focus on the generalized extreme value (GEV) distribution. Another one is the peaks-over-threshold (POT) approach [9], which has been used to focus on the generalized Pareto (GP) distribution.
In this paper, the daily SN data, which are from the Purple Mountain Observatory during the period of January 1954 to December 2011, are used to study the extreme events of the long-term solar activity and the RLs of the extreme events. The structure of this paper is as follows. In Section 2, the data and methods are introduced. In Section 3, we shows the experimental analysis results. Finally, Section 4 is about discussion and conclusions.

2. Data and Methods

2.1. Data

In the papers in this series, Ref. Lin (2019) [26] used the Chinese historical hand-drawing sunspot records to make a compilation of the data. Ref. Sun et al. (2019) [27] analyzed the accuracy of the data from the Purple Mountain Observatory. Ref. Wan et al. (2020) [28] studied the quasi-biennial oscillations of sunspot activity based on this dataset. Therefore, the data are reliable and can be used for scientific research. In recent years, much work of solar extremes are based on the SN from World Data Center, Sunspot Index and Long-term Solar Observations (WDC-SILSO). More details about the SN can be found in the article [29]. However, in this work, the daily SN data during the period of January 1954 to December 2011 are from the Purple Mountain Observatory. Here, the SN is the number of the sunspots observed in one day, and what we analyzed in our work is not images. The data for the entire 24 cycle are not processed because the data from the Purple Mountain Observatory are just from 1954 to 2011. The dataset is composed of a period of 58 years. Figure 1 shows the daily values of the SN during this period.

2.2. Methods

EVT is a tool to handle with the excesses for the distribution, and to characterise whether the levels of the behavior of a process is at small or large [9]. EVT is applied for the research of the extreme values in the timeseries, and for the purpose of predicting the probability of occurrence of rare events. Generally, there are two main methods for the research of extreme events. The first method is the BM approach focusing on the GEV distribution [30,31]. Another one is the POT approach focusing on the GP distribution.
Both methods are used to analyze the extreme values, there are also differences between them, such as the defined way of the extreme events [20]. There are many advantages for BM approach. For example, its theory is simple [20], and it is a better choice when data are not completely independent and identically distributed [32]. However, it is a weakness for it to use the data inefficiently. So, POT is the fundamental method for the analysis of extreme values because of efficient utilization. In our work, two methods are both applied to data for better observing the results and comparing the results with two methods.
Considering the BM approach, the first step is that we cut the data into blocks with equal length and the maximum value of each block is obtained. The second step is that the GEV distribution is fitted to the BM. GEV is a type of “law of large numbers”, with a probability distribution function, please reference to [9,21].
As for POT, the first step is to set the value of u , which means the upper threshold. The second step is that we consider all sample values exceeding the u . Then the GP distribution is fitted to the POT. The threshod is necessary but difficult, which can not be too high or too low [9]. There are two criteria for the selection of u , the assessment of the stability of the parameter estimates and the mean residual life plot. For the definition of the GP distribution, please reference to [9,21].

3. Results

3.1. Block Maxima Approach

First, we cut the data into blocks of equal length and the block length is set to 365, which means that every block consists of data from one year. Figure 2 shows the SN divided into 58 blocks with vertical lines, indicating the blocks we got and the maximum value of each block.
After that, the GEV distribution is fitted to the BM, and the method of estimating the parameters is the maximum likelihood estimation. In Figure 3, the precision of the GEV model evaluated by standard diagnostic plots is shown. The quantile–quantile (QQ) plot confirms the validity of the fitted model, and compares the empirical data quantiles and the GEV fit quantiles (a). The consequence of plotted points falls onto an straight line. The QQ-plot shows the randomly generated data, which is from the fitted GEV, against the empirical data quantiles (b), and the 95% confidence bands of the QQ-plot get close to be linear. Then, the empirical density of the observed maximum seems to be consistent with the fitted density (c), stating that a good fit exists between the two. In Figure 2, the numbers of sample points lying in the interval of [0, 100], [100, 200], [200, 300], [300, 400], [400, 500], [500, 600] are 10, 18, 6, 20, 4, 0 respectively. The number 6 is obviously lower than its adjacent numbers 18 and 20, so the trend in the interval of [200, 300] is downward, in which the trend of the model of density is upward. Therefore, there is a different trend between two lines in the interval of [200, 300]. Overall, the trend of two lines are approximately same. Therefore, the diagnostic plots support the model’s accuracy.
We usually use the RLs to explain extreme values and we estimate the N = 19 years with the 95% confidence interval (CI) using the bootstrap technique in Figure 4. The SC 25 is started from 2019 to 2030, and our data are from 1954 to 2011, so N = 19 years corresponds to the last year of this SC, namely, 2030 (2011 + 19 = 2030). We want to know how the trend of solar cycle 25 changes, so we need to obtain the extreme values with the RL for N = 19 years. As shown in Figure 4, the yearly maximum of SN value is about 420 in the future 2030. Comparing with the yearly maximums of SN value during from 2012 to 2029, the trend of RL is upward. Table 1 shows the values of three parameters of GEV distribution and their 95% CIs. μ is the location parameter, σ is the scale parameter, and ξ is the shape parameter [9]. The shape parameter and its 95% CI are always negative, indicating that the distribution has an upper bound and there is the maximum extreme value. Table 2 lists the estimates of the RL for the daily time series and for 19 years.
We also try to set the block length to 730, which means that every block consists of data from two years, and effective sample data are less. The results of diagnostic plots are not as good as the results of Figure 3.

3.2. Peaks-Over-Threshold Approach

Choosing u is the most important step. Shown in Figure 5, an approximately linear relationship exists in the interval [250, 300] of the threshold u in the mean residual life plot, when the CI becomes larger. There is the stability of the parameter estimation, which is fitted by the GP distribution over a range of thresholds. Shown in Figure 6, it can be found that the shape parameter and reparameterized scale parameter change slowly when u are set to 260, 280, 300. The effects of three standard diagnostic plots with u = 300 are best, so 300 is chosen.
An extreme value over a threshold is usually accompanied by exceedances of adjacent thresholds, leading to a cluster of consecutive exceedances [9]. However, the EVT requires the maxima to be independent so that there will not be any problems with short-range dependencies. Declustering procedure, which consists of locating clusters in the exceedance over the threshold and representing them by the maximum value inside each cluster, are applied to the sample data with r = 13 to make them independent, and here r is the run length. Because the solar global rotation is 27 and half of it is 13.5 [33], we need an integer, so 13 is chosen. In the declustering procedure, if the exceedances are separated by less than a number of r , exceedances are marked as the same cluster. Figure 7 shows the threshold exceedances histogram, which indicates the distribution over the last six SCs included 182 exceedances in this timescale. The numbers of threshold exceedances are 68 of the SC 19, 3 of the SC 20, 33 of the SC 21, 55 of the SC 22, 23 of the SC 23, and 0 of the SC 24, respectively. Figure 8 shows the decluster data above a threshold. Then we can get the new independent threshold exceedances. The next step is to apply the GP distribution to the independent threshold exceedances. The method of estimating the parameters is also the maximum likelihood estimation and the estimations of the two parameters ξ , σ of the GP distribution are −0.08 and 47.98. In Figure 9, the precision of the GP model evaluated by standard diagnostic plots is shown. The QQ-plot confirms the validity of the fitted model, and compares the empirical data quantiles and the GP fit quantiles (a). The consequence of plotted points falls onto an straight line. The QQ-plot shows the randomly generated data, which is from the fitted GP, against the empirical data quantiles (b), and the 95% confidence bands of the QQ-plot get close to be linear. Then, the empirical density of the observed maximum seems to be consistent with the fitted density (c), stating that a good fit exists between the two. Therefore, the diagnostic plots support the model’s accuracy.
Besides, r = 27 is set for the sample data and the value is about the solar global rotation. r = 54 is also set for our data and it is about two solar global rotations. Both their results of diagnostic plots are not as good as the results of Figure 9, indicating that 13 is the best choice.
Then in Figure 10, the estimate and its 95% CI of the RL for N = 19 years are obtained by bootstrapping. Table 3 shows the data in which we use the bootstrapping to obtain for the two different parameters of estimated GP distribution and their 95% CIs. The shape parameter is negative and the 95% CI is also negative, implying that there is an upper bound of the extreme SN distribution. Combining the results of three standard diagnostic plots in Figure 9 and these two parameters in Table 3, it could be seen that there exists a good GP distribution. Table 4 lists the estimates of the RL for the daily time series and for N = 19 years, indicating that the yearly maximum of SN value is about 420 in the future 2030. Comparing with the yearly maximums of SN value during from 2012 to 2029, the trend of RL is upward.

4. Discussion and Conclusions

In this work, the SN data from the Purple Mountain Observatory at the daily scale are used for the research of the extreme SN based on the EVT. Two methods, the BM approach and the POT approach, are used. Considering the extreme values in the future, we estimate the RLs for N = 19 years. The shape parameters in Table 1 and Table 3 are negative and the 95% CIs are also negative, indicating that there will be an upper bound of the extreme SN distribution for two distributions, so the RLs for two distributions can be estimated. In the Table 2 and Table 4, the results of the RL for N = 19 are both 420, implying that the yearly maximum of SN value is about 420 in the future 2030. The values obtained by these two different methods are same. The trends of RL are upward in Figure 4 and Figure 10, showing that the solar activity of 25th is obviously stronger than SC 24, and there will be an increase in the length of meridional forms of circulation [34,35,36].
There are many differences between other work and our work. For example, comparing with [21], it is the first time that we use Chinese data to study the extreme values. There are data of 58 years in our work and there are about data of 300 years in [21]. For the accuracy of the data, the more data, the better the fit. The results of three diagnostic plots in our work would be as good as results in [21] if we have more sample data. POT is usually used for the analysis of the extreme values in astrophysics [20,22,23,24], and we use both methods to analyze the results.
Table 5 lists a selection of prediction for the trend of 25th solar cycle. The solar activity cycle is predictable in nature, but the high-accuracy prediction should only be done for short- to mid-term due to its intrinsically dynamical complexity [37]. In the past few decades, many researchers have predicted SN by different methods. Ref. Wu et al. (2021) [38] used the two-parameter modified logistic prediction-extension (TMLP-E) models to predict SN of the SC 25 and SC 26, and the amplitudes of these two SCs were predicted to be at the same level as that of SC 24. Ref. Kakad et al. (2020) [39] suggested that the trend of peak SN was stronger. Ref. Sarp et al. (2018) [40] predicted that the solar maximum of SC 25 was greater than that of SC 24. Ref. Li et al. (2018) [41] forecasted the SC 25 by applying the bimodal distribution and found that the trend of solar activity was stronger. In this work, we apply the EVT to the Chinese SN, and find that the distribution of the daily SN data has an upper bound, and in Table 5, we can find that the trend of our prediction is consistent with these previous predictions [39,40,41]. Comparing with previous research, we study the daily SN data from the Purple Mountain Observatory at the daily scale by the EVT. Our results are constant with, and further support the previous works.

Author Contributions

Conceptualization: S.Z., S.-G.Z.; methodology: S.-G.Z.; software: Y.-Q.C.; validation: S.Z., S.-G.Z.,Y.-S.X.; formal analysis: T.-H.Z., G.-H.L.; investigation: Y.-Q.C., S.-G.Z.; resources: S.Z., S.-G.Z., Y.-S.X., T.-H.Z., G.-H.L.; data curation: G.-H.L., T.-H.Z.; writing—original draft preparation: Y.-Q.C.; writing—review and editing: Y.-Q.C., S.Z., Y.-S.X., S.-G.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research is supported by the National Science Foundation of China under Grant numbers U2031202, U1731124, U1531247, the special foundation work of the ministry of science and technology of the China under Grant numbers 2014FY120300, the 13th Five-year Informatization Plan of Chinese Academy of Sciences under Grant number XXH13505-04.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets generated for this study are available from the Purple Mountain Observatory.

Acknowledgments

The authors thank the Purple Mountain Observatory that provided the data.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. Deng, L.H.; Li, B.; Zheng, Y.F.; Cheng, X.M. Relative phase analyses of 10.7 cm solar radio flux with sunspot numbers. New Astron. 2013, 23, 1–5. [Google Scholar] [CrossRef]
  2. Pala, Z.; Atici, R. Forecasting Sunspot Time Series Using Deep Learning Methods. Sol. Phys. 2019, 294, 50. [Google Scholar] [CrossRef]
  3. Deng, L.H.; Li, B.; Xiang, Y.Y.; Dun, G.T. Multi-scale analysis of coronal Fe XIV emission: The role of mid-range periodicities in the Sun-heliosphere connection. J. Atmos. Sol.-Terr. Phys. 2015, 122, 18–25. [Google Scholar] [CrossRef]
  4. Deng, L.H.; Fei, Y.; Deng, H.; Mei, Y.; Wang, F. Spatial distribution of quasi-biennial oscillations in high-latitude solar activity. Mon. Not. R. Astron. Soc. 2020, 494, 4930–4938. [Google Scholar] [CrossRef]
  5. Charbonneau, P. Dynamo models of the solar cycle. Living Rev. Sol. Phys. 2020, 17, 4. [Google Scholar] [CrossRef]
  6. Vaquero, J.M.; Vázquez, M. Scientific Data Extracted from Historical Documents. In The Sun Recorded through History; Springer: New York, NY, USA, 2009; p. 361. [Google Scholar]
  7. Deng, L.-H.; Qu, Z.-Q.; Yan, X.-L.; Wang, K.R. Phase analysis of sunspot group numbers on both solar hemispheres. Res. Astron. Astrophys. 2013, 13, 104–114. [Google Scholar] [CrossRef]
  8. Lefèvre, L. Detailed Analysis of Solar Data Related to Historical Extreme Geomagnetic Storms: 1868–2010. Sol. Phys. 2016, 291, 1483–1531. [Google Scholar] [CrossRef] [Green Version]
  9. Coles, S. An Introduction to Statistical Modeling of Extreme Values; Springer: London, UK, 2001. [Google Scholar]
  10. Nogaj, M.; Yiou, P.; Parey, S.; Malek, F. Amplitude and frequency of temperature extremes over the North Atlantic region. Geophys. Res. Lett. 2006, 33, L10801. [Google Scholar] [CrossRef]
  11. Coelho, C.A.S.; Ferro, C.A.T.; Stephenson, D.B.; Steinskog, D.J. Methods for Exploring Spatial and Temporal Variability of Extreme Events in Climate Data. J. Clim. 2008, 21, 2072. [Google Scholar] [CrossRef] [Green Version]
  12. Acero, F.J.; García, J.A.; Gallego, M.C. Peaks-over-Threshold Study of Trends in Extreme Rainfall over the Iberian Peninsula. J. Clim. 2011, 24, 1089–1105. [Google Scholar] [CrossRef]
  13. Acero, F.J.; García, J.A.; Gallego, M.C.; Parey, S.; Dacunha-Castelle, D. Trends in summer extreme temperatures over the Iberian Peninsula using nonurban station data. J. Geophys. Res. (Atmos.) 2014, 119, 39–53. [Google Scholar] [CrossRef]
  14. Gilli, M.; Këllezi, E. An Application of Extreme Value Theory for Measuring Financial Risk. Comput. Econ. 2006, 27, 207–228. [Google Scholar] [CrossRef] [Green Version]
  15. Castillo, E.; Hadi, A.S.; Balakrishnan, N.; Sarabia, J.M. Extreme Value and Related Models with Applications in Engineering and Science; Wiley-Interscience: New York, NY, USA, 2004. [Google Scholar]
  16. Bhavsar, S.P.; Barrow, J.D. First ranked galaxies in groups and clusters. Mon. Not. R. Astron. Soc. 1985, 213, 857–869. [Google Scholar] [CrossRef]
  17. Bernstein, J.P.; Bhavsar, S.P. Models for the magnitude-distribution of brightest cluster galaxies. Mon. Not. R. Astron. Soc. 2001, 322, 625–630. [Google Scholar] [CrossRef] [Green Version]
  18. Deng, L.H.; Zhang, X.J.; Deng, H.; Mei, Y.; Wang, F. Systematic regularity of solar coronal rotation during the time interval 1939–2019. Mon. Not. R. Astron. Soc. 2020, 491, 848–857. [Google Scholar] [CrossRef]
  19. Siscoe, G.L. On the statistics of the largest geomagnetic storms per solar cycle. J. Geophys. Res. 1976, 81, 4782. [Google Scholar] [CrossRef]
  20. Asensio Ramos, A. Extreme value theory and the solar cycle. Astron. Astrophys. 2007, 472, 293–298. [Google Scholar] [CrossRef] [Green Version]
  21. Acero, F.J.; Carrasco, V.M.S.; Gallego, M.C.; García, J.A.; Vaquero, J.M. Extreme Value Theory and the New Sunspot Number Series. Astrophys. J. 2017, 839, 98. [Google Scholar] [CrossRef]
  22. Elvidge, S.; Angling, M.J. Using Extreme Value Theory for Determining the Probability of Carrington-Like Solar Flares. Space Weather 2018, 16, 417–421. [Google Scholar] [CrossRef]
  23. Acero, F.J.; Gallego, M.C.; García, J.A.; Usoskin, I.G.; Vaquero, J.M. Extreme Value Theory Applied to the Millennial Sunspot Number Series. Astrophys. J. 2018, 853, 80. [Google Scholar] [CrossRef] [Green Version]
  24. Acero, F.J.; Vaquero, J.M.; Gallego, M.C.; García, J.A. Extreme Value Theory Applied to the Daily Solar Radio Flux at 10.7 cm. Sol. Phys. 2019, 294, 67. [Google Scholar] [CrossRef]
  25. Love, J.J. Some Experiments in Extreme-Value Statistical Modeling of Magnetic Superstorm Intensities. Space Weather 2020, 18, e02255. [Google Scholar] [CrossRef] [Green Version]
  26. Lin, G.H. Chinese Sunspot Drawings and Their Digitization— (I) Parameter Archives. Sol. Phys. 2019, 294, 79. [Google Scholar] [CrossRef] [Green Version]
  27. Sun, X.B.; Zheng, S.; He, H.L.; Zeng, S.G. System Difference Analysis for the 55 Years’ Hand-drawing Sunspot Data of Purple Mountain Observatory. Acta Astron. Sin. 2019, 60, 55–63. [Google Scholar]
  28. Wan, M.; Zeng, S.-G.; Zheng, S.; Lin, G.-H. Chinese sunspot drawings and their digitization—(III) quasi-biennial oscillation of the hand-drawn sunspot records. Res. Astron. Astrophys. 2020, 20, 190. [Google Scholar] [CrossRef]
  29. Clette, F.; Svalgaard, L.; Vaquero, J.M.; Cliver, E.W. Revisiting the Sunspot Number. A 400-Year Perspective on the Solar Cycle. Space Sci. Rev. 2014, 186, 35–103. [Google Scholar] [CrossRef] [Green Version]
  30. Kharin, V.V.; Zwiers, F.W. Changes in the Extremes in an Ensemble of Transient Climate Simulations with a Coupled Atmosphere-Ocean GCM. J. Clim. 2000, 13, 3760–3788. [Google Scholar] [CrossRef]
  31. García, J.A.; Gallego, M.C.; Serrano, A.; Vaquero, J.M. Trends in Block-Seasonal Extreme Rainfall over the Iberian Peninsula in the Second Half of the Twentieth Century. J. Clim. 2007, 20, 113–130. [Google Scholar] [CrossRef]
  32. Ferreira, A.; Haan, L.D. On the block maxima method in extreme value theory: PWM estimators. Ann. Stat. 2013, 43, 276–298. [Google Scholar] [CrossRef] [Green Version]
  33. Heristchi, D.; Mouradian, Z. The global rotation of solar activity structures. Astron. Astrophys. 2009, 497, 835–841. [Google Scholar] [CrossRef]
  34. Lee, J.N.; Shindell, D.T.; Hameed, S. The Influence of Solar Forcing on Tropical Circulation. J. Clim. 2009, 22, 5870. [Google Scholar] [CrossRef]
  35. Kukoleva, A.A.; Kononova, N.K.; Krivolutskii, A.A. Manifestation of the Solar Cycle in the Circulation Characteristics of the Lower Atmosphere in the Northern Hemisphere. Geomagn. Aeron. 2018, 58, 775–783. [Google Scholar] [CrossRef]
  36. Guttu, S. Impacts of UV Irradiance and Medium-Energy Electron Precipitation on the North Atlantic Oscillation during the 11-Year Solar Cycle. Atmosphere 2021, 12, 1029. [Google Scholar] [CrossRef]
  37. Deng, L.H.; Li, B.; Xiang, Y.Y.; Dun, G.T. Comparison of Chaotic and Fractal Properties of Polar Faculae with Sunspot Activity. Astron. J. 2016, 151, 2. [Google Scholar] [CrossRef]
  38. Wu, S.-S.; Qin, G. Predicting Sunspot Numbers for Solar Cycles 25 and 26. arXiv 2021, arXiv:2102.06001. [Google Scholar]
  39. Kakad, B.; Kumar, R.; Kakad, A. Randomness in Sunspot Number: A Clue to Predict Solar Cycle 25. Sol. Phys. 2020, 295, 88. [Google Scholar] [CrossRef]
  40. Sarp, V.; Kilcik, A.; Yurchyshyn, V.; Rozelot, J.P.; Ozguc, A. Prediction of solar cycle 25: A non-linear approach. Mon. Not. R. Astron. Soc. 2018, 481, 2981–2985. [Google Scholar] [CrossRef] [Green Version]
  41. Li, F.Y.; Kong, D.F.; Xie, J.L.; Xiang, N.B.; Xu, J.C. Solar cycle characteristics and their application in the prediction of cycle 25. J. Atmos. Sol.-Terr. Phys. 2018, 181, 110–115. [Google Scholar] [CrossRef]
Figure 1. Daily values of the SN for the period of January 1954 to December 2011 are from the Purple Mountain Observatory.
Figure 1. Daily values of the SN for the period of January 1954 to December 2011 are from the Purple Mountain Observatory.
Atmosphere 12 01176 g001
Figure 2. The SN divided into 58 blocks based on the BM approach.
Figure 2. The SN divided into 58 blocks based on the BM approach.
Atmosphere 12 01176 g002
Figure 3. Three diagnostic plots fitting the GEV to the maximum values at a daily scale. (a) The QQ-plot compares the empirical data quantiles and the GEV fit quantiles. (b) The QQ-plot shows the randomly generated data, which is from the fitted GEV, against the empirical data quantiles, and the 95% confidence bands (black dashed line) of the QQ-plot get close to be linear. (c) The plot shows that the empirical density of the observed maximum (black solid line) against GEV fit density (blue dashed line).
Figure 3. Three diagnostic plots fitting the GEV to the maximum values at a daily scale. (a) The QQ-plot compares the empirical data quantiles and the GEV fit quantiles. (b) The QQ-plot shows the randomly generated data, which is from the fitted GEV, against the empirical data quantiles, and the 95% confidence bands (black dashed line) of the QQ-plot get close to be linear. (c) The plot shows that the empirical density of the observed maximum (black solid line) against GEV fit density (blue dashed line).
Atmosphere 12 01176 g003aAtmosphere 12 01176 g003b
Figure 4. RL plot of the maxima values for daily data with GEV distribution. The dashed lines indicate 95% CIs of the return value, the solid line is the regression line, and the point indicates the RL value for N years.
Figure 4. RL plot of the maxima values for daily data with GEV distribution. The dashed lines indicate 95% CIs of the return value, the solid line is the regression line, and the point indicates the RL value for N years.
Atmosphere 12 01176 g004
Figure 5. Mean residual life plot and there is an approximately linear relationship exists in the interval [250, 300]. The solid line indicates the mean excess for threshold and the dashed lines indicate 95% CIs of the mean excess.
Figure 5. Mean residual life plot and there is an approximately linear relationship exists in the interval [250, 300]. The solid line indicates the mean excess for threshold and the dashed lines indicate 95% CIs of the mean excess.
Atmosphere 12 01176 g005
Figure 6. Estimates of two parameters for different thresholds, and an approximately linear relationship exists in the interval [250, 300].
Figure 6. Estimates of two parameters for different thresholds, and an approximately linear relationship exists in the interval [250, 300].
Atmosphere 12 01176 g006
Figure 7. Threshold exceedances histogram, indicating the distribution over the last six SCs included 182 exceedances in the daily scale.
Figure 7. Threshold exceedances histogram, indicating the distribution over the last six SCs included 182 exceedances in the daily scale.
Atmosphere 12 01176 g007
Figure 8. Decluster data above a threshold and the new independent threshold exceedances are obtained.
Figure 8. Decluster data above a threshold and the new independent threshold exceedances are obtained.
Atmosphere 12 01176 g008
Figure 9. Three diagnostic plots fitting the GEV to the maximum values at a daily scale. (a) The QQ-plot compares the empirical data quantiles and the GP fit quantiles. (b) The QQ-plot shows the randomly generated data, which is from the fitted GP, against the empirical data quantiles, and the 95% confidence bands (black dashed line) of the QQ-plot get close to be linear. (c) The plot shows that the empirical density of the observed maximum (black solid line) against GEV fit density (blue dashed line).
Figure 9. Three diagnostic plots fitting the GEV to the maximum values at a daily scale. (a) The QQ-plot compares the empirical data quantiles and the GP fit quantiles. (b) The QQ-plot shows the randomly generated data, which is from the fitted GP, against the empirical data quantiles, and the 95% confidence bands (black dashed line) of the QQ-plot get close to be linear. (c) The plot shows that the empirical density of the observed maximum (black solid line) against GEV fit density (blue dashed line).
Atmosphere 12 01176 g009aAtmosphere 12 01176 g009b
Figure 10. RL plot of the maxima values for daily data with GP distribution. The dashed lines indicate 95% CIs of the return value, the solid line is the regression line, and the point indicates the RL value for N years.
Figure 10. RL plot of the maxima values for daily data with GP distribution. The dashed lines indicate 95% CIs of the return value, the solid line is the regression line, and the point indicates the RL value for N years.
Atmosphere 12 01176 g010
Table 1. Estimates and their 95% CIs of three GEV parameters obtained by bootstrapping.
Table 1. Estimates and their 95% CIs of three GEV parameters obtained by bootstrapping.
Location ( μ ) [95% CI]Scale ( σ ) [95% CI]Shape ( ξ ) [95% CI]
189.74 [155.49, 224.01]113.78 [87.64, 139.92]−0.24 [−0.49, 0.01]
Table 2. Estimate and its 95% CI of the 19 year-RL with bootstrapping in GEV distribution.
Table 2. Estimate and its 95% CI of the 19 year-RL with bootstrapping in GEV distribution.
19 Year-RL
420.84 (371.82, 463.88)
Table 3. Estimates and their 95% CIs of two GP parameters obtained by bootstrapping.
Table 3. Estimates and their 95% CIs of two GP parameters obtained by bootstrapping.
Scale ( σ ) [95% CI]Shape ( ξ ) [95% CI]
47.98 [30.39, 65.57]−0.08 [−0.33, 0.18]
Table 4. Estimate and its 95% CI of the 19 year-RL with bootstrapping in GP distribution.
Table 4. Estimate and its 95% CI of the 19 year-RL with bootstrapping in GP distribution.
19 Year-RL
420.99 (393.49, 461.14)
Table 5. A selection of prediction for the trend of 25th solar cycle.
Table 5. A selection of prediction for the trend of 25th solar cycle.
Method/ModelTime of PredictingTrend of Solar ActivityReference
BMA, POTSC 25strongerOur works
TMLP-E modelsSC 25 and 26Similar[38]
A model based on Shannon entropySC 25stronger[39]
Non-linear prediction algorithmSC 25stronger[40]
The bimodal distributionSC 25stronger[41]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Chen, Y.-Q.; Zheng, S.; Xiao, Y.-S.; Zeng, S.-G.; Zhou, T.-H.; Lin, G.-H. Chinese Sunspot Drawings and Their Digitizations-(VI) Extreme Value Theory Applied to the Sunspot Number Series from the Purple Mountain Observatory. Atmosphere 2021, 12, 1176. https://doi.org/10.3390/atmos12091176

AMA Style

Chen Y-Q, Zheng S, Xiao Y-S, Zeng S-G, Zhou T-H, Lin G-H. Chinese Sunspot Drawings and Their Digitizations-(VI) Extreme Value Theory Applied to the Sunspot Number Series from the Purple Mountain Observatory. Atmosphere. 2021; 12(9):1176. https://doi.org/10.3390/atmos12091176

Chicago/Turabian Style

Chen, Yan-Qing, Sheng Zheng, Yan-Shan Xiao, Shu-Guang Zeng, Tuan-Hui Zhou, and Gang-Hua Lin. 2021. "Chinese Sunspot Drawings and Their Digitizations-(VI) Extreme Value Theory Applied to the Sunspot Number Series from the Purple Mountain Observatory" Atmosphere 12, no. 9: 1176. https://doi.org/10.3390/atmos12091176

APA Style

Chen, Y. -Q., Zheng, S., Xiao, Y. -S., Zeng, S. -G., Zhou, T. -H., & Lin, G. -H. (2021). Chinese Sunspot Drawings and Their Digitizations-(VI) Extreme Value Theory Applied to the Sunspot Number Series from the Purple Mountain Observatory. Atmosphere, 12(9), 1176. https://doi.org/10.3390/atmos12091176

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop