3.1. Forecasting Weather Scenarios
The temperature modeling results are shown in
Figure 5. As can be seen on the plot, none of the proposed models can fully describe the weather data. The Prophet and Neural Prophet models smooth the data, resulting in no sharp changes in temperature compared to the real data. If we look at the observations from July to August 2017, it can be seen that the temperature values range from 13 to 26 °C, while the values predicted by the models range from 17 to 20 °C. Most models performed well at capturing the general trend in the data but were rather poor in reproducing the stochasticity of weather events. On the other hand, the grouping by mean and SARIMAX models represented stochasticity; however, the predicted temperature values differed significantly from the real data.
In the case of precipitation modeling, it should be noted that the observations were essentially stochastic and their prediction was nearly impossible. Prophet and Neural Prophet gave smoother results as in the case with temperature but missed the extreme events such as heavy rainfall in June. The grouping by mean model and SARIMAX retained stochasticity, but the peaks predicted by them did not exceed 10 mm/day, while in the real observations, the peaks reached 30 mm/day.
The weather conditions forecast accuracy indices are summarized in
Table 3. The lowest relative error was achieved by the Prophet class models. However, even in their case, the RRMSE for precipitation and wind speed predictions was up to 99%. The coefficient of determination for precipitation and wind speed forecasts was negative for all models. A negative value can be obtained when the model fits the data worse than a simple constant model, or when the predicted dependent variable values deviate strongly from the actual values in the sample. The best values for the other variables were also obtained by the Prophet class models. When using the dynamic time warping algorithm to predict average temperature and precipitation, grouping by mean showed the best results, while in the case of solar radiation and wind speed, the best results were achieved with SARIMAX.
Based on the obtained results, it can be concluded that the Prophet and Neural Prophet models are best suited for predicting temperature and solar radiation, as they have the lowest relative error. However, when forecasting precipitation and wind speed, the grouping by mean and SARIMAX models are more suitable as they retain the stochasticity of the data. In general, it is recommended to use Neural Prophet to achieve the best results in time series forecasting, as it demonstrates the highest quality of prediction combined with the good performance for all the variables considered.
3.2. Crop Growth and Development Modeling
The simulations resulted in the growth and development scenarios for soybean and sugar beet for the different forecasts obtained with the machine-learning-based models. The results for each crop were compared considering the dynamic changes in the main crop growth indicators at one-day intervals. Additionally, quality indices were calculated based on the correspondence between the dynamics of the biophysical crop growth indices obtained using different forecasting methods and based on the real weather observations. The growth curves and dynamics of the key biophysical parameters for soybean are provided in
Figure 6. The yield dynamic differs considerably between the forecasts obtained with different models. As can be seen in
Figure 6A, Neural Prophet and the grouping by mean model both forecast earlier crop maturation than is predicted based on the observed weather conditions. Prophet underpredicts the total yield by about two times. SARIMAX gives the dynamics that are the closest to the real observations for the whole vegetative period. The aboveground biomass dynamics are similar to the yield dynamics. In this case, SARIMAX also produces the dynamics that are the closest to those observed, and also allows values to be obtained that are the closest to the observed estimate of the aboveground biomass at the end of the season.
Ecosystem carbon exchange shows the lowest values in the period from June to August, which is associated with plant biomass growth. Prophet and Neural Prophet give smoother NEE dynamics compared to the actual observations. SARIMAX produces the dynamics that are closer to those observed than all other forecasts, including at the end of the vegetative period when all other models overestimate NEE.
Leaf area index is an important characteristic used to assess crop growth and development during a season. This index is a frequently used variable in the tasks of remote sensing data assimilation into simulation models with the purpose of improving their predictive properties. Therefore, the ability of the model to predict its values during the season is important in assessing its quality. As can be seen on the plot, SARIMAX allows the estimation of LAI dynamics during the season with high accuracy, while the other models have the LAI peak shifted towards the start of the season.
The total soybean yield was 3758 kg/ha in the case of the real weather observations. The closest yield estimate, 3184 kg/ha, was obtained when weather observations were averaged over 10 years, while the worst forecast, 1964 kg/ha, was produced by Prophet. All forecast variants underestimated the final yield. One of the possible reasons for this phenomenon may be the poor quality of precipitation forecasting by the models, which ultimately leads to the underestimation of precipitation amounts over the season. On the other hand, in the grouping by mean model, the total precipitation was 275 mm, which is almost two times lower than the observed value (504 mm), while the predicted total yield is only lower by 5%.
The forecast accuracy indices are provided in
Table 4. In the case of the yield prediction, the grouping by mean variant obtained the DTW score of 403.71. However, the SARIMAX model shows the
of 0.98, which is higher than 0.80 for the averaging variant. SARIMAX also obtained the highest quality scores in the case of the aboveground biomass dynamics, LAI, and NEE. It is also worth noting that all models were worse at predicting NEE dynamics than all the other parameters.
The growth curves and dynamics of the key biophysical parameters for sugar beet are provided in
Figure 7. In the case of sugar beet, unlike soybean, the models overestimate yields compared to the observations. The grouping by mean variant gives earlier crop maturation and a significant overestimation of yield. The Prophet, Neural Prophet, and SARIMAX models also overestimate yields. In contrast to yield, the aboveground biomass dynamics are better estimated by the different models. It is worth noting that unlike yield, the aboveground biomass values are underestimated by SARIMAX. It is also worth mentioning that, unlike soybean, sugar beet yields show less correlation with the aboveground biomass.
The dynamics of ecosystem carbon exchange in the case of real-life observations exhibit a rather high amplitude over the vegetation season. Prophet and Neural Prophet give smoother NEE dynamics compared to the real observations, which was the same for soybean. In turn, the SARIMAX model gives an amplitude that is closer to the observed NEE dynamics.
In the case of leaf area index, the following pattern was observed: all variants produced a forecast that was biased towards the start of the season. In the second half of the season, the Prophet and Neural Prophet models predict the LAI dynamics that replicate the observed dynamics quite well. The SARIMAX model, in its turn, underestimates the LAI values in the second half of the season by about 0.5 units. The Neural Prophet model gives the closest value to the observed estimate of yield with the total value of 24,187 kg/ha, and the worst forecast is produced by the weather averaging approach with the value of 29,395 kg/ha, which is 32% higher than the observed yield. In contrast to soybean, for sugar beet, all forecasts overestimated the total yield. We should also note a rather large variation in the totals of the precipitation values obtained by different models, starting from 291 mm in the observations to 556 mm for the Prophet model.
The forecast accuracy indices for sugar beet are provided in
Table 5. The Neural Prophet model revealed the highest quality index values in the case of the yield forecast, in particular,
= 0.95. The Neural Prophet model also obtained the best
values for all other biophysical indicators compared to other models.
3.3. Soil Dynamics Modeling
The soil conditions in the plough horizon obtained in the soybean simulations are presented in
Figure 8. As can be seen on the plot, most models gave poor quality weather forecasts when reproducing soil moisture. Prophet and Neural Prophet underpredicted soil moisture compared to the real weather observations. Using SARIMAX and the grouping by mean model allowed soil moisture values to be obtained that were closer to the actual observation, although a certain underestimation can also be seen. In the summer season, a sharp increase in soil humidity can be seen in the case of the real-life observations with peaks exceeding 0.35, while the predicted peak values in the corresponding period are not higher than about 0.3.
The nitrate leaching dynamics reveal a pattern similar to soil humidity. None of the weather forecasting methods obtained the same high leaching values as in the real observations. All the models and grouping by mean produce similar dynamics and underestimate the maximum leaching values, from 1.0–1.2 to 0.3–0.4 kgN/ha. It is also worth noting that the Prophet and Neural Prophet models produce a peak of nitrogen leaching at the start of the season, while two peaks can be distinguished in the real weather observations, one of them occurring in early May and the other in early June. This may be due to the presence of high precipitation, as can be seen in the
Figure A1. SARIMAX predicts the first peak in early May, similar to the real observations, but the second peak in early June is not predicted.
The amount of precipitation and its distribution over the season play a significant role in forming the dynamics of soil moisture and nitrate leaching. As mentioned above, precipitation is one of the parameters that are challenging to predict because of its stochasticity, which in turn impacts the possibility of modeling those soil conditions for which precipitation is the main affecting factor. For this reason, smoothed precipitation predictions underestimate the moisture levels and may lead to the underprediction of nitrate leaching.
The nitrate content dynamics do not show such pronounced fluctuations during the season as moisture and nitrate leaching. For both model forecasts and real weather observations, the nitrate content level appears to be very low throughout the season and increases only after crop harvesting in early September. The closest to the observed dynamics was achieved with the SARIMAX model. As can be seen on the plot, grouping by mean and the Neural Prophet model overestimated the nitrate content at the end of the season, while the Prophet model gave nitrate content levels that were lower than the observed ones. The organic matter content showed a dynamic that is similar to the nitrate content. The plot shows an increase in the organic matter content after harvesting, which is associated with the residual plant parts left in the soil horizon. Similar to the case with the nitrates dynamic, SARIMAX allowed us to obtain a forecast closely replicating the real-life observations.
Table 6 compares the quality indices for the soil condition forecasting models in the case of soybean. As the table shows, the RRMSE and
for soil humidity and nitrate leaching are worse than those for the organic matter and nitrate content dynamics, which can be also seen in
Figure 8. The same tendency was observed for all the weather scenario modeling methods in this study. When the models were compared to each other, the SARIMAX model achieved the best quality metrics values for all soil parameters.
The soil conditions in the plough horizon obtained in the sugar beet simulations are presented in
Figure 9. For sugar beet soil moisture, significant differences between the observations and model forecasts were also obtained.
However, unlike the case with soybean, the sugar beet models tended to overestimate soil moisture levels. The Prophet and Neural Prophet models allowed smoothed moisture dynamics with values of about 0.3 to be obtained, while the real observations showed more pronounced fluctuations and ranged between 0.1 and 0.3 on average. The SARIMAX model and the grouping by mean variant showed dynamics close to the observed one. Nitrate leaching shows a significant increase at the start of the season and then a gradual decrease, which is associated with the use of nitrogen fertilizers, which are taken into account in the modeling. The grouping by mean variant produced a dynamic that was closest to the real-life observations. Prophet and Neural Prophet showed some anomalous increases at the end of the season, which may be mentioned as shortcomings of these forecasts. The nitrate content dynamics showed a sharp peak in the first half of the season after nitrogen fertilizers were applied. The SARIMAX and the grouping by mean models produced forecasts that were quite close to the real content, while the Prophet and Neural Prophet models underestimated the content by about two times. The dynamics of the organic matter content are similar to the soybean case and show a growth after crop harvesting resulting from the plant residues remaining in the plough horizon.
The best quality soil condition forecasts for sugar beet were obtained using weather averaging over several years (
Table 7). Neural Prophet obtained the best quality indices for the soil organic matter forecast only, although the differences between this model and the averaging variant were not significant. We should also note the poor quality of the nitrogen leaching forecasts obtained with the models, with only the grouping by mean variant obtaining a positive
value equal to 0.34. It can be assumed that the quality of the soil dynamics prediction depends on the quality of the crop dynamics prediction.
As part of our research, we generalized machine learning and statistical analysis methods for working with climate data. We identified which variables can be accurately predicted using these methods and which contain uncertainty that cannot be predicted. In addition, based on the identified results, it is possible to conclude the contribution of individual variables to subsequent modeling using simulation methods.
We hope that our research might be helpful for the development of methods and techniques for the cheap generation of surrogate weather scenarios and risk estimations. However, such research might require an additional scale of computations related to one more mathematical model for risks. Nevertheless, our research shows the advantages and limitations of popular time series forecasting methods for different weather and soil parameters. We see that some parameters can be predicted accurately with our approach, and some require another approach to be found.
One of the main limitations of most seasonal forecast approaches is the complexity of modeling extreme events. This drawback has a powerful effect on the ability to predict precipitation [
52]. However, this is important for modeling nitrogen leaching for soybeans. As can be seen, for observed weather, there are several peaks of nitrogen leaching due to extreme precipitation. At the same time, only one peak is observed for the model variants since the models did not predict extreme precipitation. Thus, there may be an underestimation of essential factors, such as nitrogen leaching, which directly depend on the possibility of predicting extreme events. It is important to note that within the framework of the presented study, meteorological parameters were evaluated as independent time series data. In subsequent studies, it makes sense to consider the possibility of conducting a correlation analysis between the abovementioned variables.
Future studies could explore the reasons behind the underestimation of crop yield, explicitly investigating the role of poor precipitation forecasting in model predictions. Additionally, using transformer-based methods that are popular in NLP tasks can improve the quality of the forecast and the use of hybrid approaches based on statistical weather generators and machine learning models [
53,
54].