Next Article in Journal
The Impact of Forest Management Inventory Factors on the Ecological Service Value of Forest Water Conservation Based on Machine Learning Algorithms
Previous Article in Journal
The Role of Field Measurements of Fine Dead Fuel Moisture Content in the Canadian Fire Weather Index System—A Study Case in the Central Region of Portugal
Previous Article in Special Issue
Multi-Agent Reinforcement Learning for Stand Structure Collaborative Optimization of Pinus yunnanensis Secondary Forests
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Many Shades of the Vegetation–Climate Causality: A Multimodel Causal Appreciation

1
School of Geographical Sciences, Nanjing University of Information Science & Technology, Nanjing 210044, China
2
Hydro-Climate Extremes Laboratory, Ghent University, 9000 Ghent, Belgium
3
Department of Civil and Environmental Engineering, University of Florence, 50139 Firenze, Italy
4
Collaborative Innovation Center on Forecast and Evaluation of Meteorological Disasters, Nanjing University of Information Science and Technology, Nanjing 210044, China
5
Tai’an Meteorological Bureau, Tai’an 271000, China
6
School of Remote Sensing and Geomatics Engineering, Nanjing University of Information Science & Technology, Nanjing 210044, China
*
Author to whom correspondence should be addressed.
Forests 2024, 15(8), 1430; https://doi.org/10.3390/f15081430
Submission received: 14 July 2024 / Revised: 31 July 2024 / Accepted: 1 August 2024 / Published: 14 August 2024
(This article belongs to the Special Issue Applications of Artificial Intelligence in Forestry)

Abstract

:
The causal relationship between vegetation and temperature serves as a driving factor for global warming in the climate system. However, causal relationships are typically characterized by complex facets, particularly within natural systems, necessitating the ongoing development of robust approaches capable of addressing the challenges inherent in causality analysis. Various causality approaches offer distinct perspectives on understanding causal structures, even when experiments are meticulously designed with a specific target. Here, we use the complex vegetation–climate interaction to demonstrate some of the many facets of causality analysis by applying three different causality frameworks including (i) the kernel Granger causality (KGC), a nonlinear extension of the Granger causality (GC), to understand the nonlinearity in the vegetation–climate causal relationship; (ii) the Peter and Clark momentary conditional independence (PCMCI), which combines the Peter and Clark (PC) algorithm with the momentary conditional independence (MCI) approach to distinguish the feedback and coupling signs in vegetation–climate interaction; and (iii) the Liang–Kleeman information flow (L-K IF), a rigorously formulated causality formalism based on the Liang–Kleeman information flow theory, to reveal the causal influence of vegetation on the evolution of temperature variability. The results attempt to capture a fuller understanding of the causal interaction of leaf area index (LAI) on air temperature (T) during 1981–2018, revealing the characteristics and differences in distinct climatic tipping point regions, particularly in terms of nonlinearity, feedback signals, and variability sources. This study demonstrates that realizing a more holistic causal structure of complex problems like the vegetation–climate interaction benefits from the combined use of multiple models that shed light on different aspects of its causal structure, thus revealing novel insights that are missing when we rely on one single approach. This prompts the need to move toward a multimodel causality analysis that could reduce biases and limitations in causal interpretations.

1. Introduction

Vegetation serves as a vital life support system for human societies, interconnecting soil, atmosphere, and water; however, its interactions within the climate system are very complex [1]. On one hand, global warming profoundly impacts vegetation ecosystems through temporal and spatial changes in climate elements such as water and energy, alongside the C O 2 fertilization effect [2], which contains interactions and potential processes among various influencing factors, contributing to the complexity of the mechanisms by which climate affects vegetation. On the other hand, vegetation ecosystems modulate the climate system through biophysical and chemical processes such as growth, greening, senescence, transpiration, and photosynthesis, thereby amplifying or mitigating signals of climate [3], where the response of vegetation to the climate system is relatively weak and difficult to capture. The interaction between vegetation and climate is thus marked by a complex, bidirectional feedback, with a notable emphasis on the influence of vegetation on temperature dynamics [4]. The predominant effect of climate change on vegetation ecosystems is well documented [5], whereas the reciprocal influence of vegetation on climate change remains underexplored and presents analytical challenges [6]. Methodologically, correlation and regression analysis methods are commonly employed within ecology and geographical sciences to assess the impacts of climate change on vegetation [7,8,9,10], but these methods often falter in clarifying the bidirectional “driving-response” relationship between vegetation and climate, leading to debates and controversies [11]. Climate research, on the other hand, leans toward numerical simulations to decipher vegetation’s feedback to the climate system [12,13], which depend heavily on model parameterization schemes, creating variability in analytical outcomes and obstructing consensus [14].
In light of the aforementioned challenges, causality inferences can either quantitatively and/or qualitatively through interventions unravel cause–effect relationships between various subsystems of the climate [15,16,17]. However, a faithful causal inference is hindered by the complexity of cause–effect relationships, which are characterized by linear and nonlinear relationships, different feedback forms, time- and frequency-dependent components, and sufficient asymmetry needed to untangle the causes from the effects [18]. A good overview of these challenges is presented in [19]. Different causal frameworks have been developed over the years to tackle the different challenges mentioned above and beyond, and while most of them have seen successful applications, each approach is often blind to its limitations. This may indicate the need to move toward a multimodel approach where different approaches with different strengths and limitations are used in tandem to unravel a more comprehensive causal structure.
A classical qualitative causality test approach is the Granger causality (GC) [20], which is a statistical hypothesis test for determining whether one time series is useful in forecasting another. By definition, a variable x (Granger) causes another variable y if the knowledge of y improves the autoregressive forecast of x. Since its development, GC has found a wide application in the fields of finance [21,22,23,24], brain science [25,26,27], and climate science [28,29,30]. For instance, Friston et al. [31] pioneered the detection and estimation of directed connections in neural networks through GC. Kovács et al. [32] employed GC to reveal the underlying causal relationships between environmental drivers and global vegetation attributes, highlighting soil moisture and the availability of accumulated precipitation as critical drivers of vegetation dynamics. Attanasio et al. [33] utilized GC to investigate the relationship between anthropogenic C O 2 emissions and global temperature, uncovering statistical evidence that the increase in C O 2 content leads to a rise in global temperature. Dhamala et al. [34] proposed a time-frequency-based GC to explore multiscale relationship between various variables. However, the assumption of linearity in these earlier forms restricts the scope of Granger causality applications to linear cases. Yet, complex nonlinear feedback relationships often exist in the climate datasets including the vegetation–temperature interaction, prompting the need to develop new methods for inferring nonlinear causal relationships. In that endeavor, Marinazzo et al. [35] applied the theory of reproducing kernel Hilbert spaces to Granger causality analysis to generalize a nonlinear form of GC. This approach was applied to understand the linear and nonlinear characteristics between different variables. Bueso et al. [36] extended kernel Granger causality (KGC) to include timescale applications and applied it to the interactions between El Niño–Southern Oscillation and soil moisture (ENSO-SM).
Land–atmosphere interactions are also characterized by positive and negative feedback signs, which are important for trend analysis in climate signs [37,38]. In general, statistical correlations are often used to represent positive and negative feedback [8]. However, although correlation is a necessary prerequisite for causality, it does not imply causation [39]. Thus, a causal inference model based on a Bayesian network was proposed, by first establishing a Markov graph with correlation to determine the links between network nodes and then through a series of intervention and statistical tests, extract the causal structure from full-time graph, realizing the leap from correlation to causal inference [16]. On this basis, the Peter and Clark momentary conditional independence (PCMCI) method combines the Peter and Clark (PC) algorithm [40] with the momentary conditional independence (MCI) approach to obtain causal graphs [19,41,42,43]. PCMCI has been widely used in many geological and climatological studies [44,45]. For example, Krich et al. [46] utilized PCMCI methods to estimate causal networks in biosphere–atmosphere interactions. Capua et al. [47] used PCMCI to study tropical and mid-latitude teleconnections interacting with Indian summer monsoon rainfall. Qu et al. [48] also employed PCMCI methods to detect the lagged and contingent relationships between wildfire burn area and drought patterns alongside vegetation conditions.
Beyond the above methods, information theoretic (IT) approaches also provide ways to assess causality between time series quantitatively [49]. Schreiber [50] derived an IT measure, the transfer entropy (TE), an asymetric conditional variation of the entropy–based statistics known as mutual information, which has been applied to distinguish drivers from responses within interactions [51,52]. Additionally, IT methods based on entropy, in the Shannon sense [53], provide a way to assess causal histories in systems [54]. Ruddell used mutual information the transfer entropy to assess the cause–effect relationships within a network of ecohydrological variables.
Liang et al. [49] proposed a rigorously formulated causality formalism, from first principles, based on the Liang–Kleeman information flow theory (L-K IF) where causality is quantitatively realized. This has been verified in well-known theoretical models such as the Rossler and Lorenz systems [55], as well its maiden real-world application to reveal the causal relationship between El Nino phenomenon and the Indian Ocean dipole (IOD). Since then, the L-K IF causality has been applied to several areas of research interest in climate science [56], finance [57], and psychology [58], among others, to unravel cause–effect relationships. For instance, Stips et al. [59] used this method to study the causal relationship between greenhouse gas emissions and global warming and found that the increase in the C O 2 concentration in the past 150 years led to the rise of global surface temperature. Hagan et al. [18] applied Kalman filtering and wavelet analysis techniques to extend the Liang–Kleeman information flow method to the time-frequency domain and analyzed the bidirectional mutual feed mechanism between soil moisture and air temperature. Tao et al. [60] used the information flow method to quantitatively assess the effects of climate warming and Atlantic and Pacific decadal oscillations on global precipitation and their regional differences. Docquier et al. [61] applied the Liang–Kleeman information flow method to identify the potential causal drivers of the Arctic sea ice. They found that recent and future changes in Arctic sea ice are primarily driven by air and sea surface temperatures, as well as ocean heat transport, which in turn significantly influence temperature and ocean heat transport dynamics. More recently, Zhou et al. [62] proposed a time-dependent form of the formalism to the multivariate form of the LK causality and applied it to reveal novel findings of time-varying causal structures between soil moisture, vapor pressure deficit, and gross primary productivity.
Understanding natural systems, particularly the climate, is challenging due to the complexity of the systems and the limitations in causal inquiry methods and observations [1]. In scientific research, identifying reliable and relatively certain patterns often relies on extracting and summarizing objective facts [19]. Furthermore, historical observational data provide authentic information and evidence, while causality analysis methods offer rigorous and reliable data analysis and exploration tools [19]. Like many complex natural interactions, the holistic causal structure of the vegetation–climate interaction may not be sufficiently realized with only one approach, since each method comes with its unique strengths and limitations as shown in Figure 1.
In conducting our exploration of the causal relationships between vegetation and climate, we recognize that different causal analysis methods illuminate these causal relationships from varied aspects [63], as shown in Figure 1. This is because causality analysis of natural systems is a very complex problem that has many facets to it. As a result, different causality methods may only provide skewed perspectives of the full causal structure such as the nature of the temporal delays [64,65], degrees of nonlinearity [29], and feedback signs [4,46], among others. Consequently, relying on a single approach limits interpretation of the cause–effect relationship to a specific framework, making it blind to other potentially useful interpretations. This limitation stems from the fact that each method offers a unique lens, focusing on specific aspects of the causal relationship, and thus provides a partial view that falls short of encapsulating the entire complexity of the vegetation–temperature coupling (Figure 1). This study provides a blue–print on how to capture a fuller interpretation of the vegetation–climate coupling by harnessing a multimethod analysis approach.
Nonlinearity is a critical characteristic of the climate system. Therefore, we first conducted a causal analysis of the vegetation–climate system coupling based on the degree of nonlinearity as a function of location. Using the KGC, we aimed to understand global regions of nonlinearity in their causal relationship where knowledge of vegetation improves air temperature prediction. Next, the PCMCI was used to investigate the coupling signs, alongside providing potential temporal delays between the driver and response. Finally, Liang–Kleeman information flow (L-K IF) was applied to highlight regions of uncertainty and predictability in air temperature due to information flow from vegetation changes.
The remaining sections are structured as follows: Section 2 introduces the data and methods applied in this study. Section 3 presents the experimental results. Section 4 discusses and analyzes the experimental findings. Section 5 presents the conclusions.

2. Data and Methods

2.1. Data

2.1.1. CRU TS

For the temperature variable, datasets of global mean air temperature (T) monthly observations at meteorological stations across the world’s land areas, acquired from the Climatic Research Unit (CRU), version TS v4.2.0 [66], were used. CRU TS v4.2.0 Mean Temperature is a high-resolution monthly grid dataset interpolated from 5583 observation stations. It covers the average temperature station data over the last 35 years and was interpolated into 0.5° × 0.5° (longitude × latitude) covering the global land surface. We took the monthly mean temperature from 1981 to 2018 as the temperature variable.

2.1.2. GLASS LAI

For vegetation variables, we utilized LAI data from the Global Land Surface Satellite (GLASS) dataset [67,68]. GLASS LAI employs a generalized regression neural network to train LAI data from satellite observations and AVHRR band data from high-resolution radiometers. The trained model was then applied to estimate the LAI on a global scale. The product closely aligns with mainstream leaf area products, has a spatial resolution of 0.05 degrees and a temporal resolution of 8 days, and spanned the period from 1981 to 2018. In this study, we resampled the spatial resolution of these data to 0.5° × 0.5° (longitude × latitude) to ensure consistency in spatiotemporal resolution with CRU data.

2.1.3. GLEAM ET

Evapotranspiration is very important in land-atmosphere interactions [69]. GLEAM (Global Land Evaporation Amsterdam Model) is a suit of specially designed algorithms aimed at accurately estimating global land evaporation by assimilating soil moisture and vegetation optical depth observations [70,71]. The model divides land evaporation into several key components including transpiration, canopy interception evaporation, bare soil evaporation, ice and snow sublimation and open water evaporation. Here, we employed the transpiration from GLEAM V3.6a to represent evapotranspiration from 1981 to 2018, with a monthly temporal resolution, resampled from a spatial resolution of 0.25° × 0.25° to 0.5° × 0.5° (longitude × latitude), consistent with the vegetation and temperature data specifications.

2.2. Methods

We utilized a series of causality-based analysis methods to construct the global vegetation–temperature causal structure as comprehensively as possible, including KGC, PCMCI, and L-K IF. The main features and differences of these methods are presented in Table 1.
The methodological workflow (Figure 2) comprised three steps and is described in the following subsections.

2.2.1. Kernel Granger Causality (KGC)

In GC, a set of stationary time series { x ( t ) } t = 1 , . , N + m can be commonly modeled as [20]
X n = i = 1 m A i X n i + i = 1 m B i Y n i + E n
where Y represents another time series that can be similarly defined as X . The matrices A i and B i represent the autoregression coefficients, while E n denotes white noise [73]. The parameter m corresponds to the order of the autoregressive model, typically determined through selection criteria such as the Bayesian information criterion (BIC) [74]. A standard least squares optimization method was utilized for estimating model coefficients. Instead of measuring the strength of the causal interaction based on the basic concept of GC, kernel Granger causality (KGC) employs kernel function to project the original linear indivisible time series to reproducing Hilbert spaces ( H ). For each α { 1 , , m } , the samples of the α th component of X form a vector u α R N . Calling X the m × N matrix having vectors u α as rows, H coincides with the range of the N × N matrix K = X T X , redefining the Granger causality index as
δ G C = P u 2 1 x ~ T x ~
where x ~ is the projection of an organized vector θ = x 1 + m , , x N + m T on Hilbert space H R N .   P represents the projector on the space H . Observing H as the range of the matrix K ~ = K P K K P + P K P , the natural choice of the orthonormal basis in H is the set of the eigenvectors of K ~ . Hence, P x 2 = i = 1 m c i 2 , where c i is the Pearson’s coefficient of x and eigenvectors t i . With a hand of Bonferroni test, a filtered linear Granger causality index can be drawn as
δ F ( Y X ) = i c i 2 1 x ~ T x ~
Through spectral representation K X , X = a λ a ϕ a ( X ) ϕ a X , where ϕ a are eigenfunctions of the kernel k , it is observed that x ~ , in the feature space spanned by λ a Ψ a , coincides with the nonlinear regression of x in the original variables. Consider the space H spanned by zero-mean vectors u φ , where φ represents eigenvectors of the Gram matrix [70], with elements K i j = k X i , X j . The Gram matrix K is evaluated using both X and Y to predict α , where elements of K are defined as K i j = k Z i , Z j . The regression values now form a vector that is equal to the projection of α, representing the range of K . In subsequent analysis, we consider the inhomogeneous polynomial (IP) of integer order P as our choice to meet our task. The IP kernel [23,24] of integer order p is defined as K p X , X = 1 + X T X P . Along similar lines as described for the linear case, we constructed the kernel Granger causality, taking into account only eigenvectors that passed the Bonferroni test:
δ F K = i c i 2
The IP kernel of integer order p is expressed as X = X     X T , and X T is orthonormal to X (orthogonal unit vectors to X     X T ) up to order p and p = 1; consider only the linear regression, where p     2 suggests that the information transfer mechanism is nonlinear. The corresponding KGC formula can be expressed as
K G C x y = t P y t 2 / t ( y ( t ) y ¯ ( t ) ) 2

2.2.2. Peter and Clark Momentary Conditional Independence (PCMCI)

PCMCI combines the Peter and Clark (PC) [40] algorithm momentary conditional independence (MCI) approach. Consider an underlying time-dependent system X t = X t 1 , , X t N with
X t j = f j P X t j , η t j
where f i signifies a potential nonlinear functional dependency, and η t j denotes the mutually independent dynamic noise components; the nodes within a time series graph symbolize the variables X t j at various time lags. Here, P X t j X t = X t 1 , X t 2 , is defined to represent the causal progenitors, or “parents”, of the variable X t j , identified from the historical data across all N variables.
For each variable X t j , the PC algorithm begins with the establishment of initial parent candidates P X t j X t = X t 1 , X t 2 , . Subsequently, the PC algorithm undertakes nonconditional independence evaluations, excising X t τ i if the null hypothesis X t τ i X t j stands unrefuted at the significance threshold α P C . By means of an a-value assessment, causal links in PC algorithms can be defined as
p X t τ i X t j = m a x { 𝒮 } p X t τ i X t j 𝒮
The aggregated p-value of a causal link is determined as the maximum of all values resulting from conditional independence tests conducted on various condition sets 𝒮 , as illustrated in Equation (4). Then, the momentary conditional independence (MCI) test is used to test whether X t τ i X t j with
X t τ i / X t j P ^ X t j X t τ i , P ^ X t τ i
There are various MCI methods provided by PCMCI, including partial correlation (ParCorr), a nonlinear two-step conditional independence test, and two fully nonparametric tests: (i) a test based on Gaussian process regression and a distance correlation (GPDC) for the detection of additive nonlinear causality and (ii) another test based on conditional mutual information (CMI) for the detection of multiplicative nonlinear causality.
To study the positive and negative feedback relationship between LAI and T, we specifically focused on the linear independence test known as ParCorr.
The ParCorr conditional independence test relies on partial correlations and a t-test, assuming the following model:
X i = 𝒮 β X i + ϵ X i , X j = 𝒮 β X j + ϵ X j
where β represents coefficients and ϵ denotes Gaussian noise. This model leads to the subsequent residuals
r X i = X i 𝒮 β ^ X i , r X j = X j 𝒮 β ^ X j ,
with X i being the estimated value. ParCorr eliminates the influence of 𝒮 on X i and X j through ordinary least-squares regression, then testing the independence of the residuals by the Pearson correlation with a t-test. The independence test yields a p-value and a test statistic value I, which represents the correlation coefficient in the case of ParCorr.

2.2.3. Liang–Kleeman Information Flow (L-K IF)

The causality in Liang–Kleeman Information Flow is defined as the time rate of the flow of information transferred from variable X 2   (e.g., leaf area index (LAI)) to X 1 (e.g., temperature (T)) when the system guides a state forward, which can be formulated as:
d X 1 d t = F x , t = d X 1 * d t + T 2 1
where X 1 and X 1 * are n -dimensional vectors, F = F 1 , F 2 , , F n T is the vector field.   T 2 1 is the information flow (IF) rate from X 2 to X 1 demonstrated by Liang in [75]. The time rate of change in X 1 is precisely equivalent to the mathematical expectation of the divergence of the vector field. Specifically, for the case of bivariate analysis in this study ( n = 2 ), the information flow (IF) rate is:
T 2 1 = d X 1 d t d X 1 * d t                                                                     = E F 1 σ 1 ρ 1 x 1 E F 1 x 1 = E 1 σ 1 F 1 ρ 1 x 1                                  
where E denotes the mathematical expectation and σ 1 the marginal probability density of X 1 . It is important to note that the IF rate is asymmetric T 2 1 T 1 2 , distinguishing it from correlation.
In the real cases where the underlying system dynamics are unknown, Equation (2) is estimated using maximum likelihood estimation (MLE). Given two time series   X 1 and X 2 , the MLE of T 2 1 = 0 , as described by Liang in [49], is:
T ^ 2 1 = C 11 C 12 C 2 , d 1 C 12 2 C 1 , d 1 C 22 C o 11 C 12 2 ,
where C i j represents the sample covariance between X i an X j , and C i , d j is the covariance between X i and a forward differenced series derived from X j , i.e., X ˙ j , n = X j , n + k X j , n / ( k Δ t ) , with k 1 as an integer and Δ t as the time step. Although the Euler forward difference method has lower accuracy, it is often indispensable because it inherently assumes that present conditions can affect future outcomes, but not vice versa.
A nonzero T 2 1 indicates that X 2 causes X 1 . Conversely, X 2 is not causal if T 2 1 = 0 , aligning with the principle of nil causality, which is a core theorem in the IF theory, has been empirically tested and verified [55]. As shown in equation (13), if C 12 =0, then T 2 1 = 0 , but the reverse is not true. This highlights the concept that, in a linear sense, causation implies correlation, but correlation does not imply causation [49]. This holds even when the total information flow is zero [76], conclusively addressing the long-standing debate on causation versus correlation. The Fisher information matrix was utilized for significance testing. Its inverse provides a covariance matrix, establishing a significance level at 5%. A significant result indicates that X 2 exerts a causal influence on X 1 [49].

3. Results

3.1. Nonlinearity of the Causal Relationship between Vegetation and Temperature

The global distribution of the KGC results from LAI to T (LAI→T) from 1981 to 2018 in Figure 3 shows the spatial pattern of vegetation–temperature coupling, presenting differences at various degrees of nonlinearity ((a) P = 1, linear, (b) P = 3, nonlinear, (c) P = 5, strong nonlinear; the rest of the results are provided in Figure S1). Generally, the KGC result of LAI→T is stronger in the mid and low latitudes (intertropical regions) and weaker in the high latitudes. However, stronger signals appear in the high latitudes with increasing nonlinearity (p value). For example, relatively strong KGC values also occur in Eurasia and northern North America as shown in Figure 3b,c.
The linear KGC (P = 1) of the LAI→T distribution shown in Figure 3a predominantly highlights high-value regions in intertropical regions which coincide regions of strong land–atmosphere coupling. These areas were characterized by moderate vegetation cover where changes in LAI could impact changes in T.
In Figure 3b,c, the nonlinear KGC from LAI to T presents a more extensive range of regions with strong LAI responses to temperature, encompassing the intertropical regions, temperate grasslands and parts of high-latitude regions. Notable examples include the central grasslands of North America, temperate forests and grasslands in the Eurasian continent, the Sahel; and southern Africa and southeastern Australia in the Southern Hemisphere. Conversely, regions with weak nonlinear responses of LAI to temperature included extreme arid or cold regions, such as the deserts of Central Asia, and polar regions like Antarctica and Greenland. In these areas, the response of LAI to temperature was minimal, reflected in lower nonlinear GC values. Comparing Figure 3a,c into comparison, it is apparent that the regions of strong LAI→T are characterized by nonlinear processes, which reflects the complexity of the coupling.
For the boreal forest in northern Eurasia, linear KGC analysis at P = 1 (Figure 3a) revealed that the causality strength from LAI to T was comparatively weak. This potentially indicates that changes in T are not directly linked to changes in LAI, and is modulated by other mediating factors which may include precipitation, soil moisture and VPD variability. As a result, P = 5 (Figure 3c) shows stronger signals in the region indicating that the coupling is a potentially complex relationship, and hence, a lower predictive power of T anomalies with LAI.
The box plots in Figure 3d generally show increasing KGC values with increasing P values for LAI→T considering both median and mean distributions. Similar results are also obtained for T→LAI (Figure S1f). These results highlight the need to move beyond linear methods to adequately analyse this interaction. While linear methods may provide some useful information for the nature of the global distribution of the coupling, it may miss out on other key regions which limits their use for regional studies. Fundamentally, these KGC results indicates that accurately predicting T in almost all global regions would require considering more climate factors and processes beyond vegetation since they are mostly nonlinearly linked.

3.2. Feedback/Coupling Signs and Timescales between Vegetation and Temperature

In this section, we compute both the Pearson correlation and PCMCI to represent the linear and nonlinear correlation-based forms respectively. Figure 4 presents the linear Pearson correlation results for the interaction (Figure 4a) and PCMCI results (Figure 4b) for LAI→T globally from 1981 to 2018. We note that Figure 4a, may more rightly represent the feedback relationship while Figure 4b would represent the coupling of LAI to T. Furthermore, the analysis in Figure 4b considered evapotranspiration (ET) as a conditional variable. Positive and negative signals are distinguished by warm red and cool blue colors respectively. White regions are statistically insignificant locations. As shown in Figure 4a, positive feedback signals were predominantly observed in the northern high latitudes, such as Siberia and northern Canada, and the mid-latitude regions, including parts of Europe and North America. Conversely, negative feedback was primarily seen in tropical regions like parts of Brazil, the Sahel and India.
In Figure 4a, in the northern high latitudes and mid-latitude regions, the positive influence of LAI on T suggests that increased vegetation cover led to increases in temperatures, which could be attributed to processes such as reduced albedo, where dense vegetation absorbed more sunlight, leading to increased surface temperatures. Additionally, evapotranspiration from vegetation can contribute to local humidity. During the growing season, increased LAI enhanced photosynthetic activity, leading to higher carbon sequestration and surface cooling through evapotranspiration. This cooling effect can further enhance vegetation growth, creating positive feedback. The coupling observed in these regions, as shown in Figure 4b, may indicate reinforcement of this mechanism, where the delay in response highlighted the time taken for vegetation to significantly influence the local climate. These above potentially illustrate that in the northern high latitudes and mid-latitude regions, energy control was the dominant factor, while water control was relatively weaker. Therefore, the strength of the causal relationship reflected in the water cycle was not as strong as that in the energy cycle. We also note that the spatial distribution of the PCMCI results in agree more with that of the KGC results for P = 5, indicating that the PCMCI does capture the nonlinearity within the interaction.
Conversely, in tropical regions such as Brazil and the Sahelian region, the negative influence of LAI on T was observed. In these areas, increased vegetation cover was associated with lower temperatures. This negative relationship could be explained by the cooling effects of evapotranspiration, where the process of water vapor release from plants led to heat absorption from the surrounding air, thus cooling the surface. Furthermore, dense vegetation could enhance cloud formation, which in turn reflected solar radiation and reduced surface temperatures. These results add a layer of helpful interpretation to the KGC results in Figure 3 which only shows more of a binary causal–noncausal result.
To further analyse the coupling in detail, we selected the northern forest, Central Asian monsoon zone, Sahel region, and Amazon tropical rainforest as representative areas for further analysis. The analysis of four distinct regions—boreal forest (60–65° N, 90–95° E), East Asian monsoon region (26–31° N, 110–115° E), Sahel (5–10° N, 30–35° E), and Amazon rainforest (0–10° S, 55–65° W)—revealed significant insights into the relationship between LAI and temperature T. Across these regions, scatter plots, joint probability density functions (PDFs), and causal structure consistently showed a positive correlation between LAI and T, although with varying degrees of strength and temporal dynamics influenced by regional climatic conditions.
Overall, the plots demonstrate that the interaction is nonlinear, although the degree of nonlinearity varies from region to region. These validate the KGC results. In the boreal forest, Figure 5a shows a robust correlation of 0.88 between LAI and T at a lag of 1 month for LAI→T. Both Figure 5a,b suggest that LAI→T in the region is not a direct or strictly linear relationship. The causal structure (Figure 5c) emphasized a bidirectional causality portrayed as a positive feedback between vegetation and climate in the region.
Similarly, in the East Asian monsoon region, the scatter plots (Figure 5d,e) show a clear positive trend, indicating an even more direct LAI→T relationship at a 1 month lag. Figure 5e shows that this more direct relationship (correlation = 0.96) is likely at all vegetation and temperature scenarios in the region. The causal structure (Figure 5f) here also shows a positive vegetation–climate feedback. In the Sahel, the scatter plots from Figure 5g,h the lagged correlation between LAI and T is −0.69, and the joint PDFs (Figure 5h) demonstrate a multimodal distribution, suggesting complex interactions indicating that multiple environmental factors influences the interaction. The causal structure (Figure 5i) indicated a positive coupling of from T to LAI, and a negative coupling of LAI on regional temperature patterns.
Over the selected Amazon rainforest region, the scatter plots (Figure 5j,k) also highlighted the complexity of vegetation–climate interactions in this region (correlation = −0.43). The PDFs (Figure 5k) also showed a multimodal distribution between vegetation and climate in the region. Like the Sahel, the causal structure (Figure 5l) here also shows positive T→LAI and negative LAI→T. These plots provide a detailed view of the statistical relationship between vegetation and climate which shows the complex relationship between them.

3.3. Information Flow between Vegetation and Temperature

The Liang–Kleeman information flow here was to measure the information flow between LAI and T. Notably, similar to the PCMCI results, the Liang–Kleeman information flow (L-K IF) exhibited both positive and negative values, although their interpretations differ, as indicated in Table 1. L–K IF entropy values qualitative, measured in nats per unit time. As noted in Table 1, for PCMCI, a positive value means that the increasing or decreasing changes in the cause variable results in increases or decreases in the effect, while a negative values denote the contrary. However, in L-K IF, a positive IF indicates that one variable is a source of uncertainty for the other, such that, changes in the causal variable results in increased amplitude or variability of the other variable. Negative IF, on the other hand, suggests that changes in the causal variable reduces the amplitude of the other variable, making the causal variable a source of equilibrium and consequently, predictability. Figure 6 shows the L-K IF from LAI to T from 1981 to 2018 across the globe. As noted above, that the signs of L-K IF, based on entropy, should not be interpreted exactly as that of PCMCI, which is fundamentally based on correlation analysis. Positive values in Figure 6 suggest that vegetation was a source of uncertainty for temperature. Thus, changes in vegetation could result in anomalous temperature events such as heat waves or significant cooling of the temperature in the region. In other words, changes in vegetation could amplify temperature variability in the positive IF regions [77]. Negative L-K IF values suggested that vegetation functions to reduce the variability of temperature, keeping it within a range of equilibrium and predictability as “surprises” or anomalous events in the temperature variability would be less probable (henceforth referred to as the stabilizing effect). Overall, positive values were found in regions of strong land–atmosphere coupling [78,79]. Furthermore, these are also water–limiting regions where changes in soil moisture could potentially drive atmospheric conditions. On the other hand, the blue regions were found in energy-limiting regions across the globe where net radiation drives atmospheric conditions like temperature and precipitation by its control on evaporation [80,81,82]. In Figure 6, northern Eurasia depicts a stabilizing effect of LAI on T, as evidenced by the prevalence of negative L-K IF from LAI to T. This suggests that changes in vegetation in this region contributed to the stabilization of the temperature regime, a phenomenon potentially modulated by evapotranspiration mechanisms (see Figure S4). In northern North America, the L-K IF maps similarly denoted a modest stabilizing role of vegetation on the thermal environment, as indicated by softer blue shades in the LAI→T visual representation. In the Amazon basin, the LAI→T map presented a notably dynamic interaction, with pronounced red-orange gradients which showed that LAI is a source of uncertainty for regional climate system.

4. Discussion

In this study, we used three different causality frameworks to study complex vegetation–climate interactions, demonstrating some of the many facets of causality analysis. Previous studies were based on correlation and regression analysis methods [7,8,9], which remain inherently uncertain. In fact, Li et al. [3] cautioned that it was difficult to conclude that “vegetation greening in northern Eurasia will lead to temperature increase” based on remote sensing data and regression statistical methods. Thus, this study attempts to look into this problem by utilizing causality analysis methods to capture the essential characteristics potentially eliminating spurious causalities based from observational records.
In our study, we took advantage of KGC, which can derive the degree of nonlinearity within an interaction reflecting the different degrees of the vegetation-temperature coupling complexity as a function of location across the globe. The increasing nonlinearity with higher degrees of the polynomial kernel (P, see Figure 3) indicated that the influence of LAI on T involved multifaceted interactions, possibly including other ecological factors and feedback processes. Additionally, our study identified regions of strong nonlinear causal signals consistent with earlier studies like Schwingshackl et al. [83], who found these areas to be characterized by strong soil moisture–evapotranspiration coupling. Combining these findings, we concluded that the hydrological cycle and processes in transitional zones significantly influenced the causal relationship between vegetation and climate, often in a complex, nonlinear manner. The subtleties of these interactions challenge the notion of a straightforward, predictive relationship between vegetation and climate, pointing instead to a system characterized by complex, emergent properties.
Meanwhile, we focused on elucidating the positive and negative feedback and coupling features of the interaction PCMCI, finding that vegetation’s impact on climate exhibited positive feedback and coupling in mainly in the mid- to high-latitude regions, while showing negative feedback in lower mid-to low latitude tropical regions. These results are consistent with recognized global patterns of the feedback between vegetation and air temperature found by Forzieri et al. in [4]. Furthermore, our results showed different feedback signals of vegetation on climate under energy and moisture controls, which is very important in global warming discussions. These findings are also consistent with Krich et al. [46] who also relied on PCMCI. In this study, the combined use of PCMCI and KGC results, provide a detailed analysis of climate transition zones, offering insights into climate change and associated critical scenarios. In regions with strong nonlinearity in land–atmosphere interactions, PCMCI methods extract information on positive and negative feedbacks, along with corresponding time delays, addressing limitations of linear theories and methods that neglect time delays. This detailed analysis, particularly in critical ecological–climatic transition zones globally, enhances our understanding and provides a basis for studying climate change and associated critical scenarios. The L-K IF approach complemented these findings by capturing the dynamics of information flow within the LAI–T coupling. By examining the information flow, this method provided insight into how changes in LAI might induce variability in T, thus indicating regions where changes in LAI could lead to extreme temperature events across the globe, which we find are predominantly in water–limited regions.
In addition to natural factors, human activities also significantly impacted vegetation–climate interactions. However, we note that our study did not take anthropogenic factors into account for the purposes of simplifying the problem. This will be for future studies. The human factor has a great impact on the vegetation on the globe. Real-world scenarios involve complex indirect effects, such as global warming leading to human or animal migration and changes in water use and subsequently affecting urbanization, forest fragmentation, and thus vegetation and ecology [84]. These processes contain important causal information to more adequately understand the causal structure between vegetation and climate. For example, in Figure 4a,b of the study results, there were a few causal signals in the Amazon region that were contrary to the overall regional situation, such as in the southeast of the Amazon in Figure 4a and the scattered positive and negative signals in Figure 4b. These atypical opposite signals may be influenced by human activities, such as the severe human-induced deforestation in the southeastern Amazon found by Ometto et al. in [85] and the impact of road construction on forest fragmentation mentioned by das Neves et al. in [86]. The indirect effects or subprocesses caused by these human factors may be inferred in this paper, but ought to be done with caution.

5. Conclusions

This study systematically studied the causality between vegetation and climate from 1981 to 2018 from observations through different causal characterstics that stem from the added value of different approaches. From these various perspectives, we elucidated the complex impact of vegetation on temperature across the globe. We advocate for an integrated approach to address the climatic and ecological challenges of our time, attaching importance to vegetation dynamics in the climate system. Integrating multiple causality methods, we observed that while KGC highlighted the complexity and nonlinearity in the LAI–T relationship, PCMCI provided a nuanced understanding of the positive or negative feedback, as well as the coupling with time lag, and L-K IF shed light on the system’s equilibrium. The divergence in the conclusions drawn by each method illuminates the multifaceted nature of the LAI–T interaction. The KGC results suggested that the relationship between observed LAI and T is far more complex than a simple cause–effect, depending on the location over the globe. The PCMCI analysis, indicated that the LAI–T relationship was characterized by positive or negative influences. Finally, the L-K IF results on system equilibrium implied that vegetation could be both an agent of uncertainty or equilibrium depending on the location under consideration.
Overall, this study highlights the need to go beyond the absolute conclusions of a single method since each method has its strengths and limitations in providing a targeted perspective on the nature of the causal structure of any system under investigation. Thus, there is the need to employ a multimodel analysis that could bring us closer to obtaining more holistic views of the causal structure of natural systems.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/f15081430/s1: Figure S1: The kernel Granger causality (KGC) results between LAI and T, showing KGC results from T to LAI in panels (a) to (e) and from LAI to T in panels (g) to (k), corresponding to parameter p values of 1 to 5. Panel (f) and (l) display statistical boxplots of KGC results for T→LAI and LAI→T relationships across different degrees of nonlinearity indicated by parameter P ranging from 1 to 5. Figure S2: The global distribution of the causal effects from LAI to T based on the PCMCI method, which shows the influence of LAI to T with a time lag of one month. Figure S3. The global Liang–Kleeman Information flow (L-K IF) from T to LAI.

Author Contributions

Conceptualization, Y.S., D.F.T.H., and P.C.; data curation, Y.S., S.L., and X.Z.; formal analysis, Y.S., X.Z., and P.C.; investigation, Y.S., S.L., and X.Z.; methodology, Y.S., D.F.T.H., and F.Z.; project administration, F.Z., X.Z., and P.C.; resources, Y.S., X.Z., and P.C.; software, Y.S. and X.Z.; supervision, S.L., F.Z., and P.C.; validation, D.F.T.H., S.L., and P.C.; visualization, Y.S.; writing—original draft, Y.S.; writing—review and editing, Y.S., D.F.T.H., and P.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no funding.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

We are grateful to Daniele Marinazzo, X. San Liang, Samuel Asher Bhatti and David Docquier for their suggestions that helped to shape the motivation and structure of the study.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Cui, J.; Piao, S.; Huntingford, C.; Wang, X.; Lian, X.; Chevuturi, A.; Turner, A.; Kooperman, G. Vegetation forcing modulates global land monsoon and water resources in a CO2-enriched climate. Nat. Commun. 2020, 11, 5184. [Google Scholar] [CrossRef] [PubMed]
  2. Piao, S.; Wang, X.; Park, T.; Chen, C.; Lian, X.; He, Y.; Bjerke, J.W.; Chen, A.; Ciais, P.; Tømmervik, H.; et al. Characteristics, drivers and feedbacks of global greening. Nat. Rev. Earth Environ. 2019, 1, 14–27. [Google Scholar] [CrossRef]
  3. Li, Y.; Piao, S.; Li, L.Z.X.; Chen, A.; Wang, X.; Ciais, P.; Huang, L.; Lian, X.; Peng, S.; Zeng, Z.; et al. Divergent hydrological response to large-scale afforestation and vegetation greening in China. Sci. Adv. 2018, 4, eaar4182. [Google Scholar] [CrossRef] [PubMed]
  4. Forzieri, G.; Alkama, R.; Miralles, D.; Cescatti, A. Satellites reveal contrasting responses of regional climate to the widespread greening of Earth. Science 2017, 356, 1180–1184. [Google Scholar] [CrossRef] [PubMed]
  5. Sun, G.Q.; Li, L.; Li, J.; Liu, C.; Wu, Y.-P.; Gao, S.; Wang, Z.; Feng, G.-L. Impacts of climate change on vegetation pattern: Mathematical modeling and data analysis. Phys. Life Rev. 2022, 43, 239–270. [Google Scholar] [CrossRef] [PubMed]
  6. Mao, R.; Jinxi, S.; Bin, T.; Xu, W.; Feihe, K.; Sun, H.; Yuxin, L. Vegetation variation regulates soil moisture sensitivity to climate change on the Loess Plateau. J. Hydrol. 2022, 617, 128763. [Google Scholar] [CrossRef]
  7. Jiang, M.; He, Y.; Song, C.; Pan, Y.; Qiu, T.; Tian, S. Disaggregating climatic and anthropogenic influences on vegetation changes in Beijing-Tianjin-Hebei region of China. Sci. Total Environ. 2021, 786, 147574. [Google Scholar] [CrossRef]
  8. Zhang, P.; Cai, Y.; Yang, W.; Yi, Y.; Yang, Z.; Fu, Q. Contributions of climatic and anthropogenic drivers to vegetation dynamics indicated by NDVI in a large dam-reservoir-river system. J. Clean Prod. 2020, 256, 120477. [Google Scholar] [CrossRef]
  9. Zhao, J.; Huang, S.; Huang, Q.; Wang, H.; Leng, G.; Fang, W. Time-lagged response of vegetation dynamics to climatic and teleconnection factors. Catena 2020, 189, 104474. [Google Scholar] [CrossRef]
  10. Zhou, Z.; Ding, Y.; Shi, H.; Cai, H.; Fu, Q.; Liu, S.; Li, T. Analysis and prediction of vegetation dynamic changes in China: Past, present and future. Ecol. Indic. 2020, 117, 106642. [Google Scholar] [CrossRef]
  11. Li, Y.; Zeng, Z.; Huang, L.; Lian, X.; Piao, S. Comment on “Satellites reveal contrasting responses of regional climate to the widespread greening of Earth”. Science 2018, 360, eaap7950. [Google Scholar] [CrossRef] [PubMed]
  12. Kerr, Y.H.; Wigneron, J.P. Vegetation models and observations A review. In Passive Microwave Remote Sensing of Land—Atmosphere Interactions; CRC Press: Boca Raton, FL, USA, 2023; pp. 317–344. [Google Scholar]
  13. Lian, X.; Jeong, S.; Park, C.E.; Xu, H.; Li, L.Z.X.; Wang, T.; Gentine, P.; Peñuelas, J.; Piao, S. Biophysical impacts of northern vegetation changes on seasonal warming patterns. Nat. Commun. 2022, 13, 3925. [Google Scholar] [CrossRef] [PubMed]
  14. Gutman, G.; Skakun, S.; Gitelson, A. Revisiting the use of red and near-infrared reflectances in vegetation studies and numerical climate models. Sci. Remote Sens. 2021, 4, 100025. [Google Scholar] [CrossRef]
  15. Pearl, J.; Mackenzie, D. The Book of Why: The New Science of Cause and Effect; Basic Books: New York, USA, 2018. [Google Scholar]
  16. Pearl, J. Causality; Cambridge University Press: Cambridge, UK, 2009. [Google Scholar]
  17. Imbens, G.W.; Rubin, D.B. Causal Inference in Statistics, Social, and Biomedical Sciences; Cambridge University Press: Cambridge, UK, 2015. [Google Scholar]
  18. Hagan, D.; Wang, G.; Liang, X.; Dolman, H. A Time-Varying Causality Formalism Based on the Liang–Kleeman Information Flow for Analyzing Directed Interactions in Nonstationary Climate Systems. J. Clim. 2019, 32, 7521–7537. [Google Scholar] [CrossRef]
  19. Runge, J.; Bathiany, S.; Bollt, E.; Camps-Valls, G.; Coumou, D.; Deyle, E.; Glymour, C.; Kretschmer, M.; Mahecha, M.D.; Muñoz-Marí, J.; et al. Inferring causation from time series in Earth system sciences. Nat. Commun. 2019, 10, 2553. [Google Scholar] [CrossRef] [PubMed]
  20. Granger, C.W.J. Investigating Causal Relations by Econometric Models and Cross-spectral Methods. Econometrica 1969, 37, 424. [Google Scholar] [CrossRef]
  21. Hong, Y.; Liu, Y.; Wang, S. Granger causality in risk and detection of extreme risk spillover between financial markets. J. Econom. 2009, 150, 271–287. [Google Scholar] [CrossRef]
  22. Ajayi, R.A.; Friedman, J.; Mehdian, S.M. On the relationship between stock returns and exchange rates: Tests of Granger causality. Glob. Financ. J. 1998, 9, 241–251. [Google Scholar] [CrossRef]
  23. Shojaie, A.; Fox, E.B. Granger causality: A review and recent advances. Annu. Rev. Stat. Its Appl. 2022, 9, 289–319. [Google Scholar] [CrossRef]
  24. Kar, M.; Nazlıoğlu, Ş.; Ağır, H. Financial development and economic growth nexus in the MENA countries: Bootstrap panel granger causality analysis. Econ. Model. 2011, 28, 685–693. [Google Scholar] [CrossRef]
  25. Seth, A.K.; Barrett, A.B.; Barnett, L. Granger causality analysis in neuroscience and neuroimaging. J. Neurosci. 2015, 35, 3293–3297. [Google Scholar] [CrossRef] [PubMed]
  26. Stokes, P.A.; Purdon, P.L. A study of problems encountered in Granger causality analysis from a neuroscience perspective. Proc. Natl. Acad. Sci. USA 2017, 114, E7063–E7072. [Google Scholar] [CrossRef] [PubMed]
  27. Deshpande, G.; LaConte, S.; James, G.A.; Peltier, S.; Hu, X. Multivariate Granger causality analysis of fMRI data. Hum. Brain Mapp. 2009, 30, 1361–1373. [Google Scholar] [CrossRef]
  28. McGraw, M.C.; Barnes, E.A. Memory matters: A case for Granger causality in climate variability studies. J. Clim. 2018, 31, 3289–3300. [Google Scholar] [CrossRef]
  29. Papagiannopoulou, C.; Miralles, D.G.; Decubber, S.; Demuzere, M.; Verhoest, N.E.; Dorigo, W.A.; Waegeman, W. A non-linear Granger-causality framework to investigate climate–vegetation dynamics. Geosci. Model Dev. 2017, 10, 1945–1960. [Google Scholar] [CrossRef]
  30. Smirnov, D.A.; Mokhov, I.I. From Granger causality to long-term causality: Application to climatic data. Phys. Rev. E Stat. Nonlinear Soft Matter Phys. 2009, 80, 016208. [Google Scholar] [CrossRef] [PubMed]
  31. Friston, K.J.; Moran, R.; Seth, A. Analysing connectivity with Granger causality and dynamic causal modelling. Curr. Opin. Neurobiol. 2013, 23, 172–178. [Google Scholar] [CrossRef]
  32. Kovács, D.D.; Amin, E.; Berger, K.; Reyes-Muñoz, P.; Verrelst, J. Untangling the Causal Links between Satellite Vegetation Products and Environmental Drivers on a Global Scale by the Granger Causality Method. Remote Sens. 2023, 15, 4956. [Google Scholar] [CrossRef]
  33. Attanasio, A.; Pasini, A.; Triacca, U. A contribution to attribution of recent global warming by out-of-sample Granger causality analysis. Atmos. Sci. Lett. 2012, 13, 67–72. [Google Scholar] [CrossRef]
  34. Dhamala, M.; Rangarajan, G.; Ding, M. Estimating Granger causality from Fourier and wavelet transforms of time series data. Phys. Rev. Lett. 2008, 100, 018701. [Google Scholar] [CrossRef]
  35. Marinazzo, D.; Pellicoro, M.; Stramaglia, S. Kernel method for nonlinear granger causality. Phys. Rev. Lett. 2007, 100, 144103. [Google Scholar] [CrossRef] [PubMed]
  36. Bueso, D.; Piles, M.; Camps-Valls, G. Explicit Granger causality in kernel Hilbert spaces. Phys. Rev. E 2020, 102, 062201. [Google Scholar] [CrossRef] [PubMed]
  37. Menon, S.; Denman, K.L.; Brasseur, G.; Chidthaisong, A.; Ciais, P.; Cox, P.M.; Dickinson, R.E.; Hauglustaine, D.; Heinze, C.; Holland, E. Couplings between Changes in the Climate System and Biogeochemistry; Lawrence Berkeley National Lab. (LBNL): Berkeley, CA, USA, 2007. [Google Scholar]
  38. Rial, J.; Pielke, R.; Beniston, M.; Claussen, M.; Canadell, J.; Cox, P.; Held, H.; Noblet-Ducoudré, N.d.; Prinn, R.; Reynolds, J.; et al. Nonlinearities, Feedbacks and Critical Thresholds within the Earth’s Climate System. Clim. Chang. 2004, 65, 11–38. [Google Scholar] [CrossRef]
  39. Kennaway, R. When causation does not imply correlation. In The Interdisciplinary Handbook of Perceptual Control Theory; Elsevier: Amsterdam, The Netherlands, 2015; pp. 49–72. [Google Scholar]
  40. Spirtes, P.; Glymour, C.; Scheines, R. Causation, Prediction, and Search; MIT Press: Cambridge, MA, USA, 2001. [Google Scholar]
  41. Runge, J.; Nowack, P.; Kretschmer, M.; Flaxman, S.; Sejdinovic, D. Detecting and quantifying causal associations in large nonlinear time series datasets. Sci. Adv. 2017, 5, eaau4996. [Google Scholar] [CrossRef] [PubMed]
  42. Runge, J. Causal network reconstruction from time series: From theoretical assumptions to practical estimation. Chaos Interdiscip. J. Nonlinear Sci. 2018, 28, 075310. [Google Scholar] [CrossRef] [PubMed]
  43. Gerhardus, A.; Runge, J. High-recall causal discovery for autocorrelated time series with latent confounders. Adv. Neural Inf. Process. Syst. 2020, 33, 12615–12625. [Google Scholar]
  44. Kretschmer, M.; Coumou, D.; Donges, J.; Runge, J. Using Causal Effect Networks to Analyze Different Arctic Drivers of Midlatitude Winter Circulation. J. Clim. 2016, 29, 4069–4081. [Google Scholar] [CrossRef]
  45. Dubey, N.; Ghosh, S. The relative role of soil moisture and vapor pressure deficit in affecting the Indian vegetation productivity. Environ. Res. Lett. 2023, 18, 064012. [Google Scholar] [CrossRef]
  46. Krich, C.; Runge, J.; Miralles, D.; Migliavacca, M.; Pérez-Priego, Ó.; El-Madany, T.; Carrara, A.; Mahecha, M. Estimating causal networks in biosphere–atmosphere interaction with the PCMCI approach. Biogeosciences 2020, 17, 1033–1061. [Google Scholar] [CrossRef]
  47. Capua, G.d.; Kretschmer, M.; Donner, R.; Hurk, B.v.d.; Vellore, R.; Krishnan, R.; Coumou, D. Tropical and mid-latitude teleconnections interacting with the Indian summer monsoon rainfall: A theory-guided causal effect network approach. Earth Syst. Dynam. 2019, 11, 17–34. [Google Scholar] [CrossRef]
  48. Qu, Y.; Montzka, C.; Vereecken, H. Causation Discovery of Weather and Vegetation Condition on Global Wildfire Using the PCMCI Approach. In Proceedings of the IGARSS 2021—2021 IEEE International Geoscience and Remote Sensing Symposium, Brussels, Belgium, 11–16 July 2021; pp. 8644–8647. [Google Scholar] [CrossRef]
  49. Liang, X.S. Unraveling the cause-effect relation between time series. Phys. Rev. E 2014, 90, 052150. [Google Scholar] [CrossRef]
  50. Schreiber, T. Measuring information transfer. Phys. Rev. Lett. 2000, 85, 461. [Google Scholar] [CrossRef] [PubMed]
  51. Ruddell, B.; Kumar, P. Ecohydrologic process networks: 1. Identification. Water Resour. Res. 2009, 45, W03419. [Google Scholar] [CrossRef]
  52. Goodwell, A.; Jiang, P.; Ruddell, B.; Kumar, P. Debates—Does Information Theory Provide a New Paradigm for Earth Science? Causality, Interaction, and Feedback. Water Resour. Res. 2020, 56, e2019WR024940. [Google Scholar] [CrossRef]
  53. Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
  54. Deacon, T.W. Shannon–Boltzmann–Darwin: Redefining information (Part I). Cogn. Semiot. 2007, 1, 123–148. [Google Scholar] [CrossRef]
  55. Liang, X. Information flow and causality as rigorous notions ab initio. Phys. Rev. E 2015, 94, 052201. [Google Scholar] [CrossRef] [PubMed]
  56. Liang, X.S.; Xu, F.; Rong, Y.; Zhang, R.; Tang, X.; Zhang, F. El Niño Modoki can be mostly predicted more than 10 years ahead of time. Sci. Rep. 2021, 11, 17860. [Google Scholar] [CrossRef] [PubMed]
  57. Cincinelli, P.; Pellini, E.; Urga, G. Systemic risk in the Chinese financial system: A panel Granger causality analysis. Int. Rev. Financ. Anal. 2022, 82, 102179. [Google Scholar] [CrossRef]
  58. Cong, J.; Zhuang, W.; Liu, Y.; Yin, S.; Jia, H.; Yi, C.; Chen, K.; Xue, K.; Li, F.; Yao, D. Altered default mode network causal connectivity patterns in autism spectrum disorder revealed by Liang information flow analysis. Hum. Brain Mapp. 2023, 44, 2279–2293. [Google Scholar] [CrossRef]
  59. Stips, A.; Macias, D.; Coughlan, C.; Garcia-Gorriz, E.; Liang, X.S. On the causal structure between CO2 and global temperature. Sci. Rep. 2016, 6, 21691. [Google Scholar] [CrossRef] [PubMed]
  60. Tao, L.; Liang, X.S.; Cai, L.; Zhao, J.; Zhang, M. Relative contributions of global warming, AMO and IPO to the land precipitation variabilities since 1930s. Clim. Dyn. 2021, 56, 2225–2243. [Google Scholar] [CrossRef]
  61. Docquier, D.; Vannitsem, S.; Ragone, F.; Wyser, K.; Liang, X. Causal Links Between Arctic Sea Ice and Its Potential Drivers Based on the Rate of Information Transfer. Geophys. Res. Lett. 2021, 49, e2021GL095892. [Google Scholar] [CrossRef]
  62. Zhou, F.; Hagan, D.F.T.; Wang, G.; Liang, X.S.; Li, S.; Shao, Y.; Yeboah, E.; Wei, X. Estimating Time-Dependent Structures in a Multivariate Causality for Land–Atmosphere Interactions. J. Clim. 2024, 37, 1853–1876. [Google Scholar] [CrossRef]
  63. Krakovská, A.; Jakubík, J.; Chvosteková, M.; Coufal, D.; Jajcay, N.; Paluš, M. Comparison of six methods for the detection of causality in a bivariate time series. Phys. Rev. E 2018, 97, 042207. [Google Scholar] [CrossRef] [PubMed]
  64. Ding, Y.; Li, Z.; Peng, S. Global analysis of time-lag and -accumulation effects of climate on vegetation growth. Int. J. Appl. Earth Obs. Geoinf. 2020, 92, 102179. [Google Scholar] [CrossRef]
  65. Wu, D.; Zhao, X.; Liang, S.; Zhou, T.; Huang, K.; Tang, B.; Zhao, W. Time-lag effects of global vegetation responses to climate change. Glob. Change Biol. 2015, 21, 3520–3531. [Google Scholar] [CrossRef]
  66. Harris, I.; Osborn, T.; Jones, P.; Lister, D. Version 4 of the CRU TS monthly high-resolution gridded multivariate climate dataset. Sci. Data 2020, 7, 109. [Google Scholar] [CrossRef]
  67. Liang, S.; Cheng, J.; Jia, K.; Jiang, B.; Liu, Q.; Xiao, Z.; Yao, Y.; Yuan, W.; Zhang, X.; Zhao, X.; et al. The Global Land Surface Satellite (GLASS) Product Suite. Bull. Am. Meteorol. Soc. 2020, 102, E323–E337. [Google Scholar] [CrossRef]
  68. Liang, S.; Zhao, X.; Liu, S.; Yuan, W.; Cheng, X.; Xiao, Z.; Zhang, X.; Liu, Q.; Cheng, J.; Tang, H.; et al. A long-term Global LAnd Surface Satellite (GLASS) data-set for environmental studies. Int. J. Digit. Earth 2013, 6, 33–35. [Google Scholar] [CrossRef]
  69. Shan, Y.; Wei, J.; Zan, B. Improving Estimates of Land–Atmosphere Coupling Through a Novel Framework of Land Aridity Classification. Geophys. Res. Lett. 2024, 51, e2023GL106598. [Google Scholar] [CrossRef]
  70. Miralles, D.; Holmes, T.; Jeu, R.D.; Gash, J.; Meesters, A.; Dolman, A. Global land-surface evaporation estimated from satellite-based observations. Hydrol. Earth Syst. Sci. 2010, 15, 453–469. [Google Scholar] [CrossRef]
  71. Martens, B.; Miralles, D.; Lievens, H.; Schalie, R.V.D.; Jeu, R.; Fernández-Prieto, D.; Beck, H.; Dorigo, W.; Verhoest, N. GLEAM v3: Satellite-based land evaporation and root-zone soil moisture. Geosci. Model Dev. 2016, 10, 1903–1925. [Google Scholar] [CrossRef]
  72. Docquier, D.; Capua, G.d.; Donner, R.V.; Pires, C.A.L.; Simon, A.; Vannitsem, S. A comparison of two causal methods in the context of climate analyses. Nonlinear Process. Geophys. 2024, 31, 115–136. [Google Scholar] [CrossRef]
  73. Granger, C. Long memory relationships and the aggregation of dynamic models. J. Econom. 1980, 14, 227–238. [Google Scholar] [CrossRef]
  74. Neath, A.A.; Cavanaugh, J.E. The Bayesian information criterion: Background, derivation, and applications. Wiley Interdiscip. Rev. Comput. Stat. 2012, 4, 199–203. [Google Scholar] [CrossRef]
  75. Liang, X. The Liang-Kleeman Information Flow: Theory and Applications. Entropy 2013, 15, 327–360. [Google Scholar] [CrossRef]
  76. Liang, X. Normalized Multivariate Time Series Causality Analysis and Causal Graph Reconstruction. Entropy 2021, 23, 679. [Google Scholar] [CrossRef] [PubMed]
  77. Hagan, D.; Dolman, H.; Wang, G.; Sian, K.T.C.L.K.; Yang, K.; Ullah, W.; Shen, R. Contrasting ecosystem constraints on seasonal terrestrial CO2 and mean surface air temperature causality projections by the end of the 21st century. Environ. Res. Lett. 2022, 17, 124019. [Google Scholar] [CrossRef]
  78. Koster, R.; Dirmeyer, P.; Guo, Z.; Bonan, G.; Chan, E.; Cox, P.; Gordon, C.T.; Kanae, S.; Kowalczyk, E.; Lawrence, D.; et al. Regions of Strong Coupling Between Soil Moisture and Precipitation. Science 2004, 305, 1138–1140. [Google Scholar] [CrossRef]
  79. Miralles, D.G.; Jimenez, C.; Jung, M.; Michel, D.; Ershadi, A.; McCabe, M.F.; Hirschi, M.; Martens, B.; Dolman, A.; Fisher, J.B.; et al. The WACMOS-ET project—Part 2: Evaluation of global terrestrial evaporation data sets. Hydrol. Earth Syst. Sci. 2015, 20, 823–842. [Google Scholar] [CrossRef]
  80. Koster, R.; Guo, Z.; Dirmeyer, P.; Bonan, G.; Chan, E.; Cox, P.; Davies, H.; Gordon, C.T.; Kanae, S.; Kowalczyk, E.; et al. GLACE: The Global Land-Atmosphere Coupling Experiment. Part I: Overview. J. Hydrometeorol. 2006, 7, 590–610. [Google Scholar] [CrossRef]
  81. Dirmeyer, P.A. The terrestrial segment of soil moisture–climate coupling. Geophys. Res. Lett. 2011, 38, L16702. [Google Scholar] [CrossRef]
  82. Hsu, H.; Dirmeyer, P.A. Soil moisture-evaporation coupling shifts into new gears under increasing CO2. Nat. Commun. 2023, 14, 1162. [Google Scholar] [CrossRef] [PubMed]
  83. Schwingshackl, C.; Hirschi, M.; Seneviratne, S. Quantifying Spatiotemporal Variations of Soil Moisture Control on Surface Energy Balance and Near-Surface Air Temperature. J. Clim. 2017, 30, 7105–7124. [Google Scholar] [CrossRef]
  84. Cotton, W.R.; Pielke, R.A., Sr. Human Impacts on Weather and Climate; Cambridge University Press: Cambridge, UK, 1995. [Google Scholar]
  85. Ometto, J.; Aguiar, A.; Martinelli, L. Amazon deforestation in Brazil: Effects, drivers and challenges. Carbon Manag. 2011, 2, 575–585. [Google Scholar] [CrossRef]
  86. das Neves, P.B.T.; Blanco, C.J.C.; Duarte, A.A.A.M.; das Neves, F.B.S.; das Neves, I.B.S.; dos Santos, M.H.d.P. Amazon rainforest deforestation influenced by clandestine and regular roadway network. Land Use Policy 2021, 108, 105510. [Google Scholar] [CrossRef]
Figure 1. The many shades of the vegetation–climate causality. That is, the vegetation–temperature interaction mechanism will be revealed from different causal aspects, where LAI is the leaf area index; T is the temperature; and KGC (kernel Granger causality), PCMCI (Peter and Clark momentary conditional independence), and L-K IF (Liang–Kleeman information flow) are three different causal analysis methods.
Figure 1. The many shades of the vegetation–climate causality. That is, the vegetation–temperature interaction mechanism will be revealed from different causal aspects, where LAI is the leaf area index; T is the temperature; and KGC (kernel Granger causality), PCMCI (Peter and Clark momentary conditional independence), and L-K IF (Liang–Kleeman information flow) are three different causal analysis methods.
Forests 15 01430 g001
Figure 2. The research outline of this study, where the GLASS LAI is the leaf area index from the Global Land Surface Satellite (GLASS) dataset; CRU TS represents the temperature data from the Climatic Research Unit (CRU) dataset; ET represents the evapotranspiration data from the Global Land Evaporation Amsterdam Model (GLEAM) dataset; and KGC (kernel Granger causality), PCMCI (Peter and Clark momentary conditional independence), and L-K IF (Liang–Kleeman information flow) are three different causality analysis methods.
Figure 2. The research outline of this study, where the GLASS LAI is the leaf area index from the Global Land Surface Satellite (GLASS) dataset; CRU TS represents the temperature data from the Climatic Research Unit (CRU) dataset; ET represents the evapotranspiration data from the Global Land Evaporation Amsterdam Model (GLEAM) dataset; and KGC (kernel Granger causality), PCMCI (Peter and Clark momentary conditional independence), and L-K IF (Liang–Kleeman information flow) are three different causality analysis methods.
Forests 15 01430 g002
Figure 3. The kernel Granger causality (KGC) results from leaf area index (LAI) to temperature (T), indicated as LAI→T, are shown in (ac), representing P equal to (a) 1, (b) 3, and (c) 5. Spatial results for P equal to 4, 5 are shown in the Supplementary Materials Figure S1. The statistical boxplots of KGC results for LAI→T across different degrees of nonlinearity are shown in (d), indicated by the parameter P, which ranges from 1 to 5. In (d), the star symbol in the middle of each boxplot represents the median value, the dashed line indicates the mean value, and outliers are not displayed. Results are computed at a 5% statistical significance.
Figure 3. The kernel Granger causality (KGC) results from leaf area index (LAI) to temperature (T), indicated as LAI→T, are shown in (ac), representing P equal to (a) 1, (b) 3, and (c) 5. Spatial results for P equal to 4, 5 are shown in the Supplementary Materials Figure S1. The statistical boxplots of KGC results for LAI→T across different degrees of nonlinearity are shown in (d), indicated by the parameter P, which ranges from 1 to 5. In (d), the star symbol in the middle of each boxplot represents the median value, the dashed line indicates the mean value, and outliers are not displayed. Results are computed at a 5% statistical significance.
Forests 15 01430 g003
Figure 4. The global distribution of the Pearson correlation between LAI and T is shown in (a), while the influence of LAI on T with a one-month time lag considering the influence of ET, is shown in (b), both of which are computed at a statistical significance level of 1%, where warm colors (red-orange) indicate positive values, while cool colors (blue) indicate negative values.
Figure 4. The global distribution of the Pearson correlation between LAI and T is shown in (a), while the influence of LAI on T with a one-month time lag considering the influence of ET, is shown in (b), both of which are computed at a statistical significance level of 1%, where warm colors (red-orange) indicate positive values, while cool colors (blue) indicate negative values.
Forests 15 01430 g004
Figure 5. Time series statistics and causal analysis results for selected typical regions. The first column represents the scatter plot between LAI (x-axis) and T (y-axis) of each region, and the second column represents contour plots of the kernel densities of the scatter plot for (ac) the boreal forest (60–65° N, 90–95° E), (df) East Asian monsoon region (26–31° N, 110–115° E), (gi) Sahel (5–10° N, 30–35° E), and (jl) Amazon rainforest (0–10° S, 55–65° W). The third column, (c,f,i,l) shows the causal structure of LAI and T in these regions. The unidirectional curved arrows represent the causal relationship with a delay of 1 calculated with PCMCI, and the bidirectional straight arrows represent the results calculated by PCMCI Plus with no time delay. The colors of the arrows are blue for negative causality and red for positive causality.
Figure 5. Time series statistics and causal analysis results for selected typical regions. The first column represents the scatter plot between LAI (x-axis) and T (y-axis) of each region, and the second column represents contour plots of the kernel densities of the scatter plot for (ac) the boreal forest (60–65° N, 90–95° E), (df) East Asian monsoon region (26–31° N, 110–115° E), (gi) Sahel (5–10° N, 30–35° E), and (jl) Amazon rainforest (0–10° S, 55–65° W). The third column, (c,f,i,l) shows the causal structure of LAI and T in these regions. The unidirectional curved arrows represent the causal relationship with a delay of 1 calculated with PCMCI, and the bidirectional straight arrows represent the results calculated by PCMCI Plus with no time delay. The colors of the arrows are blue for negative causality and red for positive causality.
Forests 15 01430 g005
Figure 6. The global Information flow from LAI to T. Red colours indicate positive IF rates and blue colours indicate negative IF rates. All results are computed at a 5% statistical significance. White regions are statistically insignificant regions or masked out due to the absence of vegetation.
Figure 6. The global Information flow from LAI to T. Red colours indicate positive IF rates and blue colours indicate negative IF rates. All results are computed at a 5% statistical significance. White regions are statistically insignificant regions or masked out due to the absence of vegetation.
Forests 15 01430 g006
Table 1. Description of the causal methods used in this paper.
Table 1. Description of the causal methods used in this paper.
MethodsKernel Granger CausalityPeter and Clark Momentary
Conditional Independence
Liang–Kleeman
Information Flow
AbbreviationKGCPCMCIL-K IF
Type of methodQualitative causalityQualitative causalityQuantitative causality
Theoretical basisGranger Causality, spectral representation, kernel functionConditional independence test, structure causality graphLiang–Kleeman information flow
Use of time delaysNot by defaultAlwaysNot by default
Use of iterative
conditioning
NoYesNo
Sign meaning 1No negative valuePositive value: Increases in drivers result in an increase in the target.
Negative value: Increases in drivers result in an increase in the target.
Positive value: The driver functions to increase the variability of the target, thereby making it more uncertain.
Negative value: The driver functions to reduce variability in the target.
Key referencesMarinazzo et al. (2007) [35]Runge et al. (2019) [19]Liang (2014) [49]
1 The sign meaning property of PCMCI and L-K IF is reproduced from [72], with permission from Nonlinear Processes in Geophysics, 2024.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Shao, Y.; Hagan, D.F.T.; Li, S.; Zhou, F.; Zou, X.; Cabral, P. The Many Shades of the Vegetation–Climate Causality: A Multimodel Causal Appreciation. Forests 2024, 15, 1430. https://doi.org/10.3390/f15081430

AMA Style

Shao Y, Hagan DFT, Li S, Zhou F, Zou X, Cabral P. The Many Shades of the Vegetation–Climate Causality: A Multimodel Causal Appreciation. Forests. 2024; 15(8):1430. https://doi.org/10.3390/f15081430

Chicago/Turabian Style

Shao, Yuhao, Daniel Fiifi Tawia Hagan, Shijie Li, Feihong Zhou, Xiao Zou, and Pedro Cabral. 2024. "The Many Shades of the Vegetation–Climate Causality: A Multimodel Causal Appreciation" Forests 15, no. 8: 1430. https://doi.org/10.3390/f15081430

APA Style

Shao, Y., Hagan, D. F. T., Li, S., Zhou, F., Zou, X., & Cabral, P. (2024). The Many Shades of the Vegetation–Climate Causality: A Multimodel Causal Appreciation. Forests, 15(8), 1430. https://doi.org/10.3390/f15081430

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop