Next Article in Journal
GDnet-IP: Grouped Dropout-Based Convolutional Neural Network for Insect Pest Recognition
Next Article in Special Issue
Harnessing AI-Powered Genomic Research for Sustainable Crop Improvement
Previous Article in Journal
Relationships Between Photosynthetic Efficiency and Grain Antioxidant Content of Barley Genotypes Under Increasing Nitrogen Rates
Previous Article in Special Issue
Molecular Genetic Insights into the Stress Responses and Cultivation Management of Zoysiagrass: Illuminating the Pathways for Turf Improvement
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Integrating Environmental Covariates into Adaptability and Stability Analyses: A Structural Equation Modeling Approach for Cotton Breeding

by
Matheus Massariol Suela
1,
Moysés Nascimento
1,2,*,
Ana Carolina Campana Nascimento
1,
Camila Ferreira Azevedo
1,
Paulo Eduardo Teodoro
3,
Francisco José Correia Farias
4,
Luiz Paulo de Carvalho
4 and
Diego Jarquin
2,*
1
Laboratory of Intelligence Computational and Statistical Learning (LICAE), Department of Statistics, Federal University of Viçosa, Viçosa 36570-900, MG, Brazil
2
Agronomy Department, University of Florida, Gainesville, FL 32611-0180, USA
3
Department of Agronomy, Federal University of Mato Grosso do Sul, Chapadão do Sul 79560-000, MS, Brazil
4
National Center for Cotton Research, Brazilian Agricultural Research Corporation, Campina Grande 58428-095, PB, Brazil
*
Authors to whom correspondence should be addressed.
Agriculture 2024, 14(11), 1914; https://doi.org/10.3390/agriculture14111914
Submission received: 14 September 2024 / Revised: 22 October 2024 / Accepted: 24 October 2024 / Published: 28 October 2024
(This article belongs to the Special Issue Feature Papers in Genotype Evaluation and Breeding)

Abstract

:
Breeding programs rely on genotype-by-environment interaction (GEI) to recommend cultivars for specific locations. GEI describes how different genotypes perform under varying environmental conditions. Several methods were proposed to assess adaptability and stability across environments. These methods utilize various statistical approaches like parametric and non-parametric regression, multivariate analysis techniques, and even Bayesian frameworks and artificial intelligence. The accessibility of environmental data through platforms like NASA POWER allows breeders to integrate this information into a breeding process. It has been done by using multi-omics integration models that combine data across various biological levels to create accurate predictive models. In the context of phenotypic adaptability and stability analysis, structural equation modeling (SEM) offers an interesting approach to integrating environmental covariates. This work aimed to propose a novel approach that integrates weather information into adaptability and stability analysis, combining SEM with the established Eberhart and Russell model. Additionally, a user-friendly applet, denoted ECERSEM-AdaptStab, was made available to perform the analysis. This approach utilized data from 12 cotton cultivar trials conducted across two growing seasons at 19 sites. This approach successfully integrated environmental covariates into a phenotypic adaptability and stability analysis of cotton cultivars. Specifically, the genotypes TMG 41 WS, IMA CV 690, DP 555 BGRR, BRS 286 and BRS 369 RF were recommended for favorable environments, while the genotypes TMG 43 WS, IMA 5675 B2RF, IMA 08 WS, NUOPAL, DELTA OPAL, BRS 335, and BRS 368 RF are more suitable for unfavorable environments.

1. Introduction

In plant breeding programs, a critical component for recommending cultivars for specific locations is the interaction between the genotypes and the environment (GEI) [1]. GEI refers to the phenomenon where different genotypes perform differently depending on the environmental conditions [2]. Given this factor, particular attention has been given to cotton (Gossypium hirsutum L.), since it is widespread from the southern to the northern hemisphere, from subtropical regions to temperate latitudes [3], with 90% of the total planted area concentrated in the Brazilian Cerrado, a region that is quite large and has 12 Brazilian states [4]. Due to planting in different regions, genotype–environment interactions (GEI) are the main impediment related to the development of cultivars that adapt to environmental heterogeneity and technological specificities [5].
To identify and recommend stable and adapted genotypes across environments, several methodologies to study adaptability and stability have been developed. These methodologies differ in how the adaptability and stability parameters are defined and the methods used for analysis. Some methods rely on parametric and non-parametric regression models, such as those proposed by Finlay and Wilkinson [6] and Eberhart and Russell [7]. Multivariate analysis techniques, including GGE Biplot [8], AMMI [9], and the Extended Centroid Method [10]. Advancements have also seen the Bayesian framework [2,11,12] and artificial intelligence [13] into adaptability and stability analysis.
The advent of platforms like NASA POWER [14] has allowed access to environmental data, making it easier to collect this information and use it than in breeding programs. Specifically, it is allowing breeders to integrate environmental covariates and genomic information through a multi-omics approach [15,16].
A natural way to integrate environmental covariates into adaptability and stability analyses is through structural equation modeling (SEM). SEM allows to estimate both direct and indirect effects within a system of interrelated variables, assuming a linear model. This powerful capability paves the way for integrating weather data into adaptability and stability analyses. By combining SEM with the established Eberhart and Russell model, we can incorporate weather covariates. Specifically, this approach can allow to estimate the direct impact of weather (climate covariates) on the environmental index used in the Eberhart and Russell model, as well as its direct and indirect influence on the yield of the evaluated genotypes.
SEM has been used successfully in breeding to infer causal phenotype networks using structural equation models [17], investigate multi-trait genetic architecture of udder health in dairy cattle [18], and in GWAS studies to explore interrelated dependencies between phenotypes related to morphology in coffee [19]. However, to the best of our knowledge, this is the first attempt to leverage SEM within the adaptability and stability framework for crop breeding.
In light of this, the purposes of this study were to: (i) propose a novel approach that integrates weather information into combing structural equation modeling (SEM) with the established Eberhart and Russell model; (ii) make available an applet (Adaptability and Stability Analysis incorporating Environmental Covariates into Eberhart and Russell based on Structural Equation Models—ECERSEM-AdaptStab) that allows us to collect the weather information and provide for the user both direct and indirect effects from the climate covariates in the environmental index and on the yield of the evaluated genotypes.

2. Materials and Methods

2.1. Experimental Data Set

The data set was obtained from a cotton cultivar competition trial with 12 treatments planted in 19 environments following a randomized block design with four replications in the years 2013/2014 and 2014/2015 (Figure 1). The 12 treatments were composed of the following mid-cycle cultivars: TMG 41 WS (genotype 1), TMG 43 WS (genotype 2), IMA CV 690 (genotype 3), IMA 5675 B2RF (genotype 4), IMA 08 WS (genotype 5), NUOPAL (genotype 6), DP 555 BGRR (genotype 7), DELTA OPAL (genotype 8), BRS 286 (genotype 9), BRS 335 (genotype 10), BRS 368 RF (genotype 11), and BRS 369 RF (genotype 12). The environments were produced from the combination of 2 years and different production locations in the Brazilian Cerrado, more specifically the regions of Minas Gerais (MG), Goiás (GO), Mato Grosso (MT), Bahia (BA), Maranhão (MA), and Piauí (PI) (Figure 1). Due to the vast size of this territory, it presents different edaphoclimatic characteristics, which can be justified by the latitudinal variation, topographic amplitude, inland position, and the action of air masses [20,21,22]. All experiments had an experimental unit composed of four 5 m lines with spacings of 0.90 m between the lines and a density of 100,000 p l a n t s . h a 1 . The trait evaluated was fiber yield. The collection was carried out from samples from the two central rows of each experimental unit, and finally the humidity was corrected to 13%. More details about the experiment can be found in Teodoro et al. [23].
In order to understand the effect of environmental covariates on the performance of each genotype in each environment, the covariates related to Earth’s skin temperature (TS), temperature at 2 meters (T2M), relative humidity at 2 meters (RH2M), wind speed at 2 meters maximum (WS2M_MAX), corrected precipitation (PRECTOTCORR), and total clear sky surface photosynthetically active radiation (CLRSKY_SFC_PAR_TOT) were obtained from the NASA POWER platform (https://power.larc.nasa.gov/data-access-viewer/, accessed on 15 September 2024) [14].

2.2. Statistical Analysis

Initially, the joint analysis variance (ANOVA) was performed to verify if the effect of GEI is significant according to the model:
Y i j k = μ + G i + E j + B / E j k + G E I i j + e i j k ,
where Y i j k is the observation in the k th block, evaluated in the j th environment and i th genotype; μ is the overall mean; G i is the effect of the i th genotype (fixed effect); E j is the effect of the j th environment (fixed effect); B / E j k is the effect of block k within environment j (fixed effect); G E I i j is the effect of the genotype i x environment j interaction (fixed effect); and e i j k is the error associated with the Y i j k observation (random effect). All this analysis was carried out using the R software version 4.3.1 [24].
The traditional Eberhart and Russell methodology [7] is based on simple linear regression:
Y i j = β 0 i E R + β 1 i E R I j + δ i j + e i j ,
where Y ij is the observation of the i th (i = 1, 2, …, g) genotypes in the j th (j = 1, 2, …n) environment; β 0 i E R is the general mean for the i th genotype; β 1 i E R is the regression coefficient for the i th genotype; I j is the environmental index I j = i Y i j g i j Y i j g a ; δ i j is equal to the regression deviation of the i th genotype in the j th environment, and e ij is the effect of the mean experimental error. In this approach, genotype performance refers to how each genotype performs across different environmental conditions, and it is measure by the regression parameter, β 1 i E R . Stability parameter, σ d S E M 2 , on the other hand, informs us about the predictability of a genotype’s behavior in response to environmental stimuli. The stability can be measure also considering the coefficient of determination [25].

2.3. Structural Equation Models

Aiming to integrate weather data into adaptability and stability analyses, a structural equation modeling framework was proposed. The causal relationship between the endogenous variables (average yield of each genotype across the environments and the environmental index— I ), as well as the exogenous variables (environmental covariables), was modeled as follows [26].
y * = α + B y * + Γ x + ζ ,
where y * (average fiber yield of each genotype across the environments and the environmental index) and x (TS, T2M, RH2M, WS2M_MAX, PRECTOTCORR and CLRSKY_SFC_PAR_TOT) are (2 × 1) and (2 × 6) vectors that represent the endogenous and exogenous variables, respectively. α is a (2 × 1) vector that represents the general mean of each genotype (general mean, as Eberhart and Russell). B and Γ are (2×2) and (6×1) matrices that represent the causal coefficients representing direct effects of y -variables on y * -variables (adaptability parameter, as Eberhart and Russell) and direct effects of x -variables on y * -variables, respectively, and ζ is a (2 × 1) vector that represents the random residuals associated with endogenous variables. Using the SEM theory, it was possible to verify indirect effects of EC via the environmental index in each of the genotypes. These values are defined by the product between the environmental covariable estimates and the adaptability parameter ( γ ^ j × β ^ 1 i S E M ). The estimation for SEM model was based on maximum likelihood estimation (ML). According to Boolen [27], the coefficient of determination is given by R S E M 2 = 1 Ψ ^ Σ ^ y y , where Ψ ^ is the determinant of the estimate of the covariance matrix for the equation errors and Σ ^ y y is the determinant of the covariance matrix of y * taken from Σ ^ (estimated covariance matrix). The stability parameter is estimated by σ ^ d S E M 2 = ψ 11 = v a r ( y i ) σ y i z i Σ z i z i 1 σ z i y i , where v a r ( y i ) is the variance of y i , Σ z i z i 1 is the nonsingular covariance matrix of the column vector z i contains only those variables that belong in the y i equation, and σ z i y i is the covariance vector of ζ 1 (random residuals associated with y i ). These analyses were performed using the lavaan package [28] in R software [24].
The causal relationship considering the proposed model can be written individually as:
I = γ T S I T S + γ T 2 M I T 2 M + γ R H 2 M I R H 2 M + γ W S 2 M _ M A X I W S 2 M _ M A X + γ P R E C T O T C O R R I P R E C T O T C O R R + γ C L R S K Y _ S F C _ P A R _ T O T I C L R S K Y _ S F C _ P A R _ T O T + ζ 2 ,
y i = β 0 i S E M + γ T S y i S T + γ T 2 M y i T 2 M + γ R H 2 M y i R H 2 M + γ W S 2 M _ M A X y i W S 2 M _ M A X + γ P R E C T O T C O R R y i P R E C T O T C O R R + γ C L R S K Y _ S F C _ P A R _ T O T y i C L R S K Y _ S F C _ P A R _ T O T + β 1 i S E M I + ζ 1 ,
where I is the vector of the environmental index, y i (average fiber yield of each genotype across the environments) is the vector of the i th genotype ( i = 1 , , 12 ) , β 0 i S E M is the vector that represent the general mean of each genotype (general mean, as Eberhart and Russell). γ values represent the direct effects among environmental covariates (TS, T2M, RH2M, WS2M_MAX, PRECTOTCORR, and CLRSKY_SFC_PAR_TOT) on the index and each genotype. The β 1 i S E M values represent the direct effect among environmental indexes on each genotype (adaptability parameter, as Eberhart and Russell). ζ 1 and ζ 2 represents the random residuals associated with endogenous variables. Also, the stability can be measured considering the coefficient of determination [25]. The causal relationships represented by Equations (4) and (5) of the structural equation model are visualized in the causal diagram of Figure 2.

2.4. Integrating Weather Data into Adaptability and Stability Analyses into a Shiny Applet

All the aforementioned methodologies were integrated into the Adaptability and Stability Analysis incorporating Environmental Covariates into Eberhart and Russell based on Structural Equation Models (ECERSEM-AdaptStab). The ECERSEM-AdaptStab is a freely available tool at https://ecersem-adaptstab.shinyapps.io/ECERSEM-AdaptStab/ (accessed on 21 October 2024) and offers five functionalities: (1) data upload and covariate selection; (2) ANOVA for GEI significance; (3) data summary; (4) heatmaps for adaptability and stability parameters ( β ^ 0 i S E M , β ^ 1 i S E M , σ ^ d S E M 2 and R S E M 2 ) and; (5) heatmaps for environmental effects on genotypes and index. The workflow for using the ECERSEM-AdaptStab app is presented in Figure 3.
For part 1: (i) to ensure compatibility with the ECERSEM-AdaptStab app, the phenotypic information file must have the following format: columns named “LOCAL”, “TRAT”, “REP”, and “VARIABLE” in that specific order. It is important that the information in the first three columns is numeric and starts from “1” (for example, if the phenotypic database has 3 sites, 5 treatments, and 4 replications, we would have a table where the first site, first treatment, and first replication must be entered as “1”, going up to the values “3”, “5”, and “4”, respectively, for this hypothetical situation). (ii) The local information file should also follow this specific format: five columns named “LOCAL”, “LATITUDE”, “LONGITUDE”, “DATE_INITIAL”, and “DATE_FINAL” in that exact order. Additionally, the “LOCAL” column should contain unique identifiers (usually text labels). The user must use the decimal degrees format for latitude and longitude and “YEAR-MM-DAY” format for dates (Figure 4).
Parts 4 and 5 of the tools present heatmaps summarizing the results of the SEM analysis. Importantly, these analyses are based on a standard SEM structure, which is visualized in Figure 3. It is observed that the model contemplates the influence of environmental covariables (TS, T2M, RH2M, WS2M_MAX, PRECTOTCORR, and CLRSKY_SFC_PAR_TOT) in average fiber yield of each genotype ( y i ) and the environmental index ( I ) based on the γ ^ values, and the influence of the environmental index on each genotype (adaptability parameter) based on the β ^ values.

3. Results

A summary of the descriptive statistics, including the means, medians, standard deviations (SD), and ranges (minimum and maximum) for the fiber yield of each genotype, the environmental index, and each environmental covariate (EC), is presented in Table 1. The average of fiber yield values varied from 1645.67 (±471.06) kg . h a 1 (NUOPAL) to 1899.76 (±599.87) kg . h a 1 (genotype 7). The minimum and maximum values ranged from 483.95 kg . h a 1 (DP 555 BGRR) to 3093.16 kg . h a 1 (IMA CV 690). Eleven of the nineteen environments (1, 2, 3, 6, 7, 9, 12, 13, 16, 17, and 19) presented positive values and eight (4, 5, 8, 10, 11, 14, 15, and 18) negative values for the environmental index, with values ranging from 735.94 to −1083.63. The mean values of the EC, TS, T2M, RH2M, WS2M_MAX, PRECTOTCORR, and CLRSKY_SFC_PAR_TOT presented values ranging from 24.60 (±1.02) , 24.32 (±0.98) , 78.81 (±6.74) %, 2.66 (±1.17) m . s 1 , 5.16 (±3.19) m m . d a y 1 and 151.32 (±3.70) W . m 2 , respectively (Table 1).

3.1. Analysis of Variance for Cotton Yields in Different Environments

The joint analysis of variance for the data of yield of 12 cotton genotypes showed the significance (p-value < 0.01) of the GEI (Table 2). This result indicates the differential performance of genotypes in different environments, justifying the use of adaptability and stability analysis. Additionally, the average fiber yield was 1750.08 kg . h a 1 and the coefficient of variation obtained was 8.67%.

3.2. Yield Adaptability and Stability from Experimental Data

Observing the ECERSEM-AdaptStab and ER results, we can see that the genotypes IMA CV 690, IMA 08 WS, DP 555 BGRR, BRS 368 RF, and BRS 369 RF presented β ^ 0 higher than the general mean (1750.08) (Table 3). According to ECERSEM-AdaptStab, the genotypes TMG 41 WS, IMA CV 690, DP 555 BGRR, BRS 286, and BRS 369 RF presented β 1 > 1 and are recommended for favorable environments. On the other hand, the genotypes TMG 43 WS, IMA 5675 B2RF, IMA 08 WS, NUOPAL, DELTA OPAL, BRS 335, and BRS 368 RF were classified as genotypes with specific adaptability to unfavorable environments ( β 1 < 1 ). None of the evaluated genotypes were classified as genotypes of general adaptability ( β 1 = 1 ) . Among the evaluated genotypes, observing ECERSEM-AdaptStab only TMG 43 WS and IMA 08 WS, and in ER TMG 43 WS, IMA 5675 B2RF, and IMA 08 WS presented low stability, R 2 < 0.80 (Table 3). The results using ER show that IMA CV 690, DP 555 BGRR, DELTA OPAL, BRS 335, and BRS 369 RF presented β 1 > 1 , TMG 43 WS, IMA 5675 B2RF, and IMA 08 WS presented β 1 < 1 , and TMG 41 WS, NUOPAL, BRS 286, and BRS 368 RF presented β 1 = 1 .
The direct effects from EC on genotypes are presented in Figure 5 and in table format in Table S1. Among the evaluated genotypes, TMG 41 WS, TMG 43 WS, IMA 5675 B2RF, IMA 08 WS, and BRS 368 RF did not exhibit significant interrelationships (Figure 5). The environmental covariates TS and T2M showed larger magnitude differences between the negative and positive effects. The magnitude of the influence is indicated by the values in parentheses. TS presented a negative impact on IMA CV 690 (−636.36), DP 555 BGRR (−354.86), and BRS 286 (−550.70), while positively affecting genotypes NUOPAL (541.66), DELTA OPAL (666.4), and BRS 335 (646.6). Similar to TS, T2M negatively influenced genotypes NUOPAL (−455.39) and DELTA OPAL (−622.49) and positively influenced IMA CV 690 (490.07), DP 555 BGRR (389.41), and BRS 286 (538.3). Only three genotypes were significantly influenced by RH2M. IMA CV 690 (−30.01) was negatively affected and DELTA OPAL (42.61) and BRS 369 RF (20.84) responded positively. Wind speed (WS2M_MAX) negatively affected DP 555 BGRR (−118.73) but positively influenced NUOPAL (135.71) and BRS 335 (212.70). PRECTOTCORR presented a positive effect on IMA CV 690 (14.8). Finally, CLRSKY_SFC_PAR_TOT showed a negative influence only on NUOPAL (−25.02) and a positive influence on IMA CV 690 (30.17) and DP 555 BGRR (46.06).
The indirect effects of EC on the genotype via environmental index are presented in Figure 6. TS and WS2M_MAX positively influenced the environmental index with values of 795.90 and 321.30, respectively. T2M and CLRSKY_SFC_PAR_TOT negatively influenced the environmental index with values of −621.90 and −55.30, respectively (Figure 5, Table S1).
According to Figure 7, analyzing TS and WS2M_MAX, all genotypes showed positive total effects, with values ranging from 450.05 to 1403.84 and 225.82 to 489.14, respectively. Evaluating T2M and CLRSKY_SFC_PAR_TOT, it was observed that all genotypes showed negative total effects, with values ranging from −1198.65 to −243.58 and −75.24 to −28.81, respectively. For RH2M and PRECTOTCORR, not all genotypes showed significant effects, as the index did not significantly affect these same ECs (Figure 5). For these ECs, the total effect on genotypes was exactly the same as the direct effect (Figure 5).

4. Discussion

In this study, we proposed a structural equation modeling (SEM) approach to integrate weather data into analyses of genotype adaptability and stability. Beyond estimating adaptability and stability parameters like the Eberhart and Russell approach, SEM allows us to assess both the direct and indirect effects of environmental covariates on genotypes through the environmental index. To present the proposal, we utilized data from a cotton cultivar competition trial with 12 treatments planted across 19 environments. Additionally, a user-friendly ECERSEM-AdaptStab app for performing this analysis was developed and made available.
Similar to several studies performed to investigate cotton crop adaptability and stability in the Brazilian Cerrado, the GEI was significant for the same trait in regions similar to this study [23,25,29,30,31]. The observed coefficient of variation (CV%) for cotton fiber yield (8.67%) is consistent with values reported in the literature [12,32] and is indicating high experimental precision [33].
Initially, the results presented from ECERSEM-AdaptStab were compared with those obtained using the traditional ER methodology (Table 3). This comparison revealed that all genotypes exhibited convergent results for both the overall mean (general average) and the stability parameter based on component of variance ( σ ^ d 2 ). Regarding the stability parameter measured by the coefficient of determination ( R 2 ), only the IMA 5675 B2RF genotype showed a discrepancy. ECERSEM-AdaptStab estimated its stability to be above 80%, whereas the traditional ER method yielded a value below 80%. The genotypes TMG 43 WS, IMA CV 690, IMA 5675 B2RF, IMA 08 WS, DP 555 BGRR, and BRS 369 RF obtained similar recommendations regarding adaptability parameters, while the others (TMG 41 WS, NUOPAL, DELTA OPAL, BRS 286, BRS 335, and BRS 368 RF) had divergent classifications between ECERSEM-AdaptStab and ER (Table 3). These differences can be attributed to the influence of environmental covariates, which are not considered by the traditional ER model. By capturing these environmental effects, ECERSEM-AdaptStab may generate different parameter estimates, potentially even leading to a difference in the classification of genotype suitability compared to ER.
Given these results, it can be seen that the new strategy brings with it the advantages of incorporating information from environmental covariates directly into the model and not just working with the averages of each location. Furthermore, it is possible to separate the impact of environmental covariates on both the genotypes and the index, with this impact being exactly associated with the experimental information (location and year) imposed on the model. However, like all regression models, the addition of parameters is totally related to the degrees of freedom of the model, which directly impacts a minimum of observations for the model to work.
Multiple studies, employing the same genotype data utilized in ECERSEM-AdaptStab, presented discrepancies between the ER approach and other model-based regression. These discrepancies involve both adaptability parameters and the coefficient of determination (R2). Nascimento et al. [25], using quantile and non-parametric regressions, revealed distinct adaptability patterns among genotypes TMG 41 WS, TMG 43 WS, BRS 286, and BRS 368 RF. Nascimento et al. [12] using a Bayesian segmented regression model found discrepancies with results obtained for NUOPAL and DELTA OPAL genotypes. Similarly, Teodoro et al. [31] also observed discordances with ER using the non-parametric method proposed by Lin and Binns [34] modified by Carneiro [35] for genotypes IMA CV 690, DP 555 BGRR, and BRS 369 RF. These contrasting classifications indicated the potential influence of methodological choices on adaptability and stability recommendations.
Selecting the most suitable method for analyzing adaptability and stability depends on the specific research objectives and the nature of the available data. For example, analysis of variance (ANOVA) methods allow to estimate the contributions of genotype, environment, and GEI to the overall performance variation [36,37]. Multi-environment trial analysis methods like AMMI and GGE Biplot combine analysis of variance (ANOVA) with other techniques like principal component analysis (PCA) to identify genotypes with both high mean yields and specific adaptation to certain environments [38]. Additionally, the regression-based methods utilize the analysis to assess stability and identify genotypes with high mean yields and low responsiveness to environmental changes [6,7]. Each approach has its own characteristics and can be used alone or as complementary to each other. However, unlike the ER method and other approaches, some alternative approaches do allow for the incorporation of environmental covariates (EC) into the process.
Similar to many other crops, EC significantly interacts with the genotype of cotton plants, affecting their overall production and potentially influencing their adaptability and stability. Among the EC, heat stress causes a potential decline in cotton yields [39,40,41]. Schlenker and Roberts [42] project that cotton-growing regions could face significant warming exceeding the global average. This warming trend has the potential to cause cotton yields to decline by up to 40% by 2100. In addition to heat stress, water stress caused significant reductions in days to first flower formation, plant height, number of bolls/plant, boll weight, and seed cotton yield [43,44,45]. Other studies have also shown that increased sunlight decreased the production of some cotton cultivars due to increased evaporation and water stress in some regions of China [46], but favored increased production of other cotton cultivars in other regions, an increase explained by favoring photosynthesis [47]. It was also found that wind speed in critical periods of the cotton production window can affect cotton sowing and seedling growth [48].
Considering our results, the genotypes TMG 41 WS, TMG 43 WS, IMA 5675 B2RF, IMA 08 WS, and BRS 368 RF were not affected directly by the EC in these specific season conditions (Figure 5 and Table S1). This result suggests that these genotypes are stable over the direct effects of EC variations. On the other hand, TS and T2M (related to the heat stress) affected six (IMA CV 690, NUOPAL, DP 555 BGRR, DELTA OPAL, BRS 286, and BRS 335) and five (IMA CV 690, NUOPAL, DP 555 BGRR, DELTA OPAL, and BRS 286) genotypes, respectively, being, in our study, the most important EC in this set of environments. It is important to note that if the number of seasons were different, the results would possibly change, as it could also be different in other models based on averages of the environments already proposed, as this would directly impact the value linked to the specific environment.
Specifically, TS negatively affected IMA CV 690 and DP 555 BGRR, while T2M and CLRSKY_SFC_PAR_TOT affected the same genotypes positively. IMA CV 690 was negatively affected by RH2M, while genotype DP 555 BGRR suffered a positive effect from PRECTOTCORR and WS2. While temperature (T2M) and light availability (CLRSKY_SFC_PAR_TOT) presented similar effects on these two genotypes, caution is warranted in their recommendation. Their differential responses to other EC, like humidity (RH2M), rainfall (PRECTOTCORR), and wind speed (WS2_MAX), highlight the importance of considering information about the EC. Therefore, the recommendation for genotype IMA CV 690 should be directed to regions that present milder temperatures and good CLRSKY_SFC_PAR_TOT conditions, but regions that do not present high RH2M should also be considered. On the other hand, genotype DP 555 BGRR should be recommended for regions that also have good CLRSKY_SFC_PAR_TOT conditions, but also with good precipitation conditions and without the occurrence of WS2_MAX. Genotype BRS 369 RF, contrary to genotype IMA CV 690, should be allocated to regions where the occurrence of higher RH2M is more frequent. Finally, BRS 368 RF was not significantly influenced by any of the EC used in this work; therefore, the recommendation would be to maintain the same as would traditionally be done in classical methods, as was observed by the ECERSEM-AdaptStab approach. These results showed that variations in EC have affected these genotypes, which potentially will change their recommendations depending on climatic variations.
The indirect effects of RH2M and PRECTOCORR on all genotypes through environmental index were not significant (Figure 6). Other environmental covariates showed consistently directional indirect effects. The low difference between these results is expected because they arise from the product of two parameters. The first one, gamma ( γ ), represents the direct effect of each environmental covariate on the environmental index. The second parameter captures the direct effect of the environmental index on each genotype’s adaptability, which in this work ranges from 0.43 to 1.55. It is interesting to notice that some environmental covariates (ECs) did not present a direct effect on genotypes, but their influence was observed indirectly (Figure 6). This information, given by the SEM modeling proposal, is particularly valuable since even for ECs with no direct effect on genotypes, we can still uncover their influence by evaluating their indirect effect through the environmental index. Specifically, the genotypes TMG 41 WS, TMG 43 WS, IMA 5675 B2RF, IMA 08 WS, and BRS 368 RF presented indirect effects for TS, T2M, WS2M_MAX, and CLRSKY_SFC_PAR_TOT.
Regarding the total effects, it is possible to observe that TS and WS2M_MAX exhibited positive effects, while T2M, RH2M, PRECTOTCORR, and CLRSKY_SFC_TOT showed negative effects (Figure 7). This result allows the researcher to obtain responses about the total effect of the environmental covariate on the genotype. However, by partitioning this effect, it is possible to verify how these covariates directly influence genotypes or indirectly modulate their effects across the studied environments. In practice, by partitioning the total effects, the researchers improve their understanding of how the EC affects the genotypes, bringing more knowledge to help their decisions.
In theory of regression, including an additional explanatory variable (EC) can improve the model’s fit by capturing more variance in the trait [49]. This may also lead to smaller standard errors, making the coefficients of all variables more precise estimates. Additionally, omitting relevant variables can lead to several issues, including inflated model error and biased and inconsistent estimators, ultimately resulting in unreliable models [35]. However, introducing highly correlated explanatory variables can introduce multicollinearity. This can create unstable models with insignificant coefficients, making interpretation difficult [50]. The combined effects of multicollinearity and sample size limitations restrict the number of explanatory variables (ECs) we can effectively include in the model. The sample size is related to the choice of regression-based method for adaptability and stability analysis. To do that, the researcher should consider the number of environments included in the analysis. For instance, the Eberhart and Russell (ER) model requires at least three environments due to its two estimated parameters. Conversely, segmented regression models necessitate a minimum of six environments. These models typically allocate three environments below the average environmental index and three above to capture the non-linear response. As the proposed method incorporates EC, it necessitates fitting a model with more parameters. Researchers should account for this during the analysis.
Overall, ECERSEM-AdaptStab unveils the direct and indirect effects of environmental covariates on genotypes. Direct effects describe how individual ECs influence genotypes, while indirect effects capture how these factors combine to impact genotypes through an environmental index. In a practical setting, this information is very important for breeders as climate change intensifies. By understanding how genotypes respond directly and indirectly to environmental variations, breeders can make informed decisions about which lines to prioritize for breeding programs. Recent studies have explored the significant impact of climate change on crop breeding [51,52]. These studies emphasize the crucial role of crop breeding in adapting to these challenges, complementing crop management strategies and policy interventions to ensure global food security. However, to our knowledge, no existing adaptability and stability method explicitly incorporates information about ECs into its recommendation process.

5. Conclusions

In this study, a methodology based on SEM was proposed, which combines traditional parameters of adaptability and stability with the use of environmental covariates, proving to be advantageous given the new climate scenarios that are to come. Furthermore, a user-friendly shiny application (ECERSEM-AdaptStab) was created that incorporated all procedures, with the aim of facilitating use by plant breeders.
ECERSEM-AdaptStab identified that the genotypes TMG 41 WS, IMA CV 690, DP 555 BGRR, BRS 286 and BRS 369 RF should be recommended for favorable environments, while the genotypes TMG 43 WS, IMA 5675 B2RF, IMA 08 WS, NUOPAL, DELTA OPAL, BRS 335, and BRS 368 RF are more suitable for unfavorable environments.

Supplementary Materials

The Table S1 can be downloaded at: https://doi.org/10.5281/zenodo.13759585.

Author Contributions

Conceptualization, M.M.S. and M.N.; methodology, M.M.S. and M.N.; validation, M.M.S., M.N. and D.J.; formal analysis, M.M.S. and M.N.; investigation, M.M.S. and M.N.; data curation, P.E.T., F.J.C.F. and L.P.d.C.; writing—original draft preparation, M.M.S., M.N., A.C.C.N. and C.F.A.; writing—review and editing, M.M.S. and M.N.; visualization, M.M.S., M.N. and D.J.; supervision, M.N. and D.J.; project administration, M.N. and D.J. All authors have read and agreed to the published version of the manuscript.

Funding

We would like to thank the Foundation for Research Support of the state of Minas Gerais (FAPEMIG, APQ-01638-18), and the National Council of Scientific and Technological Development (CNPq, 408833/2023-8). MN and CFA are supported by scientific productivity (310755/2023-9 and 309856/2023-0), respectively, from Brazilian Council for Scientific and Technological Development (CNPq).

Institutional Review Board Statement

Not applicable.

Data Availability Statement

All relevant information is contained in the application (https://ecersem-adaptstab.shinyapps.io/ECERSEM-AdaptStab/). This link was accessed on 21 October 2024. The data supporting the findings of this study are available from one of the authors, Paulo Eduardo Teodoro, upon request.

Acknowledgments

We would like to thank the Federal University of Viçosa and the University of Florida for providing the necessary knowledge to develop this work, and the Federal University of Mato Grosso do Sul and the Brazilian Agricultural Research Corporation for providing the data necessary for this study.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Crossa, J. From Genotype × Environment Interaction to Gene × Environment Interaction. Curr. Genom. 2012, 13, 225–244. [Google Scholar] [CrossRef] [PubMed]
  2. Couto, M.F.; Nascimento, M.; Do Amaral, A.T.; Silva, F.F.E.; Viana, A.P.; Vivas, M. Eberhart and Russel’s Bayesian Method in the Selection of Popcorn Cultivars. Crop Sci. 2015, 55, 571–577. [Google Scholar] [CrossRef]
  3. D’Eeckenbrugge, G.C.; Lacape, J.M. Distribution and Differentiation of Wild, Feral, and Cultivated Populations of Perennial Upland Cotton (Gossypium hirsutum L.) in Mesoamerica and the Caribbean. PLoS ONE 2014, 9, e107458. [Google Scholar] [CrossRef]
  4. Dos Santos, A.; da Silva Matos, E.; da Silva Freddi, O.; Galbieri, R.; Lal, R. Cotton Production Systems in the Brazilian Cerrado: The Impact of Soil Attributes on Field-Scale Yield. Eur. J. Agron. 2020, 118, 126090. [Google Scholar] [CrossRef]
  5. Van Eeuwijk, F.A.; Bustos-Korts, D.V.; Malosetti, M. What Should Students in Plant Breeding Know about the Statistical Aspects of Genotype × Environment Interactions? Crop Sci. 2016, 56, 2119–2140. [Google Scholar] [CrossRef]
  6. Finlay, K.W.; Wilkinson, G.N. The Analysis of Adaptation in a Plant-Breeding Programme. Aust. J. Agric. Res. 1963, 14, 742–754. [Google Scholar] [CrossRef]
  7. Eberhart, S.A.; Russell, W.A. Stability Parameters for Comparing Varieties 1. Crop Sci. 1966, 6, 36–40. [Google Scholar] [CrossRef]
  8. Yan, W.; Hunt, L.A.; Sheng, Q.; Szlavnics, Z. Cultivar Evaluation and Mega-Environment Investigation Based on the GGE Biplot. Crop Sci. 2000, 40, 597–605. [Google Scholar] [CrossRef]
  9. Gauch, H.G. Statistical Analysis of Yield Trials by AMMI and GGE. Crop Sci. 2006, 46, 1488–1500. [Google Scholar] [CrossRef]
  10. Nascimento, M.; Ferreira, A.; Nascimento, A.C.C.; Salgado, C.C.; Cruz, C.D. Multiple Centroid Methodology to Analyze Genotype Adaptability. Crop Breed. Appl. Biotechnol. 2009, 9, 8–16. [Google Scholar] [CrossRef]
  11. Nascimento, M.; Silva, F.F.; Sáfadi, T.; Nascimento, A.C.C.; de Paula Ferreira, R.; Cruz, C.D. Abordagem Bayesiana Para Avaliação Da Adaptabilidade e Estabilidade de Genótipos de Alfafa. Pesqui. Agropecu. Bras. 2011, 46, 26–32. [Google Scholar] [CrossRef]
  12. Nascimento, M.; Nascimento, A.C.C.; e Silva, F.F.; Teodoro, P.E.; Azevedo, C.F.; de Oliveira, T.R.A.; do Amaral Junior, A.T.; Cruz, C.D.; Farias, F.J.C.; de Carvalho, L.P. Bayesian Segmented Regression Model for Adaptability and Stability Evaluation of Cotton Genotypes. Euphytica 2020, 216, 30. [Google Scholar] [CrossRef]
  13. Barbosa, W.F.; do Prado Hashimoto, T.; Matsuo, É.; Nascimento, A.C.C.; Azevedo, C.F.; Bezerra, A.R.G.; Nascimento, M. Artificial Neural Networks Based on Segmented Model for Adaptability and Stability Evaluation of Soybean Genotypes. Aust. J. Crop Sci. 2023, 17, 735–740. [Google Scholar] [CrossRef]
  14. Sparks, A. Nasapower: A NASA POWER Global Meteorology, Surface Solar Energy and Climatology Data Client for R. J. Open Source Softw. 2018, 3, 1035. [Google Scholar] [CrossRef]
  15. Jarquín, D.; Crossa, J.; Lacaze, X.; Du Cheyron, P.; Daucourt, J.; Lorgeou, J.; Piraux, F.; Guerreiro, L.; Pérez, P.; Calus, M.; et al. A Reaction Norm Model for Genomic Selection Using High-Dimensional Genomic and Environmental Data. Theor. Appl. Genet. 2014, 127, 595–607. [Google Scholar] [CrossRef]
  16. Manthena, V.; Jarquín, D.; Howard, R. Integrating and Optimizing Genomic, Weather, and Secondary Trait Data for Multiclass Classification. Front. Genet. 2022, 13, 1032691. [Google Scholar] [CrossRef]
  17. Rosa, G.J.M.; Valente, B.D.; De Los Campos, G.; Wu, X.L.; Gianola, D.; Silva, M.A. Inferring Causal Phenotype Networks Using Structural Equation Models. Genet. Sel. Evol. 2011, 43, 6. [Google Scholar] [CrossRef]
  18. Pegolo, S.; Momen, M.; Morota, G.; Rosa, G.J.M.; Gianola, D.; Bittante, G.; Cecchinato, A. Structural Equation Modeling for Investigating Multi-Trait Genetic Architecture of Udder Health in Dairy Cattle. Sci. Rep. 2020, 10, 7751. [Google Scholar] [CrossRef]
  19. Suela, M.M.; Azevedo, C.F.; Nascimento, A.C.C.; Momen, M.; de Oliveira, A.C.B.; Caixeta, E.T.; Morota, G.; Nascimento, M. Genome-Wide Association Study for Morphological, Physiological, and Productive Traits in Coffea Arabica Using Structural Equation Models. Tree Genet. Genomes 2023, 19, 23. [Google Scholar] [CrossRef]
  20. Correia Filho, W.L.F.; de Oliveira-Júnior, J.F.; Santiago, D.d.B.; Abdo, H.G.; Almohamad, H.; Al Dughairi, A.A.; da Silva Junior, C.A. The Assessment of Climatic, Environmental, and Socioeconomic Aspects of the Brazilian Cerrado. Ecol. Process. 2023, 12, 19. [Google Scholar] [CrossRef]
  21. Nascimento, D.T.F.; Novais, G.T. Clima Do Cerrado: Dinâmica Atmosférica e Características, Variabilidades e Tipologias Climáticas. Élisée-Rev. Geogr. UEG 2020, 9, e922021. [Google Scholar]
  22. Neri, A.V.; Schaefer, C.E.G.R.; Silva, A.F.; Souza, A.L.; Ferreira-Junior, W.G.; Meira-Neto, J.A.A. The Influence of Soils on the Floristic Composition and Community Structure of an Area of Brazilian Cerrado Vegetation. Edinburgh J. Bot. 2012, 69, 1–27. [Google Scholar] [CrossRef]
  23. Teodoro, P.E.; Farias, F.J.C.; de Carvalho, L.P.; Nascimento, M.; Peixoto, L.d.A.; Cruz, C.D.; Bhering, L.L. Identification of Optimal Environments for Cotton Cultivars in the Brazilian Cerrado. Agron. J. 2018, 110, 1226–1232. [Google Scholar] [CrossRef]
  24. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2023. [Google Scholar]
  25. Nascimento, M.; Teodoro, P.E.; Sant’anna, I.d.C.; Barroso, L.M.A.; Nascimento, A.C.C.; Azevedo, C.F.; Teodoro, L.P.R.; Farias, F.J.C.; Almeida, H.C.; de Carvalho, L.P. Influential Points in Adaptability and Stability Methods Based on Regression Models in Cotton Genotypes. Agronomy 2021, 11, 2179. [Google Scholar] [CrossRef]
  26. Hu, L.T.; Bentler, P.M. Cutoff Criteria for Fit Indexes in Covariance Structure Analysis: Conventional Criteria versus New Alternatives. Struct. Equ. Model. 1999, 6, 1–55. [Google Scholar] [CrossRef]
  27. Bollen, K.A. Structural Equations with Latent Variables, 1st ed.; Wiley: Hoboken, NJ, USA, 1989; Volume 1, ISBN 0471011711. [Google Scholar]
  28. Rosseel, Y. Lavaan: An R Package for Structural Equation Modeling. Acta Physiol. Scand. Suppl. 2012, 48, 1–36. [Google Scholar]
  29. Souza, A.A.D.; Freire, E.C.; Bruno, R.D.L.A.; Carvalho, L.P.D.; Silva Filho, J.L.D.; Pereira, W.E. Estabilidade e Adaptabilidade Do Algodoeiro Herbáceo No Cerrado Do Mato Grosso e Mato Grosso Do Sul. Pesqui. Agropecuária Bras. 2006, 41, 1125–1131. [Google Scholar] [CrossRef]
  30. Suinaga, A.; Bastos, S.; Rangel, P.; Eduardo, L.; Akiyoshi Suinaga, F.; Schetino Bastos, C.; Eduardo Pacifici Rangel, L. Phenotypic Adaptability and Stability of Cotton Cultivars in Mato Grosso State, Brazil. Pesqui. Agropecuária Trop. 2006, 36, 145–150. [Google Scholar]
  31. Teodoro, P.E.; Farias, F.J.C.; de Carvalho, L.P.; Ribeiro, L.P.; Nascimento, M.; Azevedo, C.F.; Cruz, C.D.; Bhering, L.L. Adaptability and Stability of Cotton Genotypes Regarding Fiber Yield and Quality Traits. Crop Sci. 2019, 59, 518–524. [Google Scholar] [CrossRef]
  32. Araújo, L.F.D.; Almeida, W.S.D.; Bertini, C.H.C.D.M.; Vidal Neto, F.D.C.; Bleicher, E. Correlations and Path Analysis in Components of Fiber Yield in Cultivars of Upland Cotton. Bragantia 2012, 71, 328–335. [Google Scholar] [CrossRef]
  33. Pimentel, F. Curso De Estatística Experimental, 15th ed.; Fund. Estudos Agrár; Luiz de Queiroz: São Paulo, Brazil, 2009. [Google Scholar]
  34. Lin, C.S.; Binns, M.R. A Superiority Measure of Cultivar Performance for Cultivar × Location Data. Can. J. Plant Sci. 1988, 68, 193–198. [Google Scholar] [CrossRef]
  35. Carneiro, P.C.d.S. Novas Metodologias de Análise Da Adaptabilidade e Estabilidade de Comportamento. Universidade Federal de Viçosa, Viçosa, Brazil, 1998. [Google Scholar]
  36. Plaisted, R.L.; Peterson, L.C. A Technique for Evaluating the Ability of Selections to Yield Consistently in Different Locations or Seasons. Am. J. Potato Res. 1959, 36, 381–385. [Google Scholar] [CrossRef]
  37. Wricke, G. Über Eine Methode Zur Erfassung Der Ökologischen Streubreite in Feldversuchen. Pflanzenzücht 1962, 47, 92–96. [Google Scholar]
  38. Gauch, H.G. Statistical Analysis of Regional Yield Trials: AMMI Analysis of Factorial Designs; Elsevier: Amsterdam, The Netherlands, 1992; ISBN 978-0-444-89240-9. [Google Scholar]
  39. Saini, D.K.; Impa, S.M.; McCallister, D.; Patil, G.B.; Abidi, N.; Ritchie, G.; Jaconis, S.Y.; Jagadish, K.S.V. High Day and Night Temperatures Impact on Cotton Yield and Quality—Current Status and Future Research Direction. J. Cott. Res. 2023, 6, 16. [Google Scholar] [CrossRef]
  40. Oosterhuis, D.M. Stress Physiology in Cotton, 1st ed.; Oosterhuis, D.M., Ed.; The Cotton Foundation: Cordova, TN, USA, 2011; ISBN 978-0-939809-07-3. [Google Scholar]
  41. Awaad, H.A. Sustainable Agriculture in Egypt, 1st ed.; Springer: Berlin/Heidelberg, Germany, 2021; ISBN 978-3-030-81873-9. [Google Scholar]
  42. Schlenker, W.; Roberts, M.J. Nonlinear Temperature Effects Indicate Severe Damages to U.S. Crop Yields under Climate Change. Proc. Natl. Acad. Sci. USA 2009, 106, 15594–15598. [Google Scholar] [CrossRef]
  43. Bakhsh, A.; Rehman, M.; Salman, S.; Ullah, R. Evaluation of Cotton Genotypes for Seed Cotton Yield and Fiber Quality Traits under Water Stress and Non-Stress Conditions. Sarhad J. Agric. 2019, 35, 161–170. [Google Scholar] [CrossRef]
  44. Zonta, J.H.; Bezerra, J.R.C.; Sofiatti, V.; Brandão, Z.N. Yield of Cotton Cultivars under Different Irrigation Depths in the Brazilian Semi-Arid Region. Rev. Bras. Eng. Agric. Ambient. 2015, 19, 748–754. [Google Scholar] [CrossRef]
  45. Zhang, D.; Luo, Z.; Liu, S.; Li, W.; WeiTang; Dong, H. Effects of Deficit Irrigation and Plant Density on the Growth, Yield and Fiber Quality of Irrigated Cotton. Field Crop. Res. 2016, 197, 1–9. [Google Scholar] [CrossRef]
  46. Zhou, Y.; Li, F.; Xin, Q.; Li, Y.; Lin, Z. Historical Variability of Cotton Yield and Response to Climate and Agronomic Management in Xinjiang, China. Sci. Total Environ. 2024, 912, 169327. [Google Scholar] [CrossRef]
  47. Abudoukerimu, A.; Hu, S.Q.; Nu’erpatiman, M. Effect of Climate Change on Cotton Growth Period and Yield in Kashgar City, Xinjiang Uygur Autonomous Region. Chin. J. Eco-Agric. 2015, 23, 919–930. [Google Scholar] [CrossRef]
  48. He, Z.S.; Wang, W.; Sun, H.J.; Luo, J.; Du, S.S.; Ma, J.X.; Ge, S.N.; Yao, Q.Q.; Wang, G.H.; Li, W.P.; et al. Effect of Wheat and Cotton Intercropping on Prevention of Wind Disaster at the Cotton Seedling Stage in Southern Xinjiang. China Cott. 2022, 49, 1–11. [Google Scholar] [CrossRef]
  49. Jobson, J.D. Applied Multivariate Data Analysis Regression and Experimental Design; Casella, G., Fienberg, S., Olkin, I., Eds.; Springer: New York, NY, USA, 1991; Volume 1, ISBN 978-1-4612-6960-1. [Google Scholar]
  50. Montgomery, D.C.; Peck, E.A.; Vining, G.G. Introduction to Linear Regression Analysis, 5th ed.; John Wiley & Sons: Hoboken, NJ, USA, 2012; ISBN 978-0-470-54281-1. [Google Scholar]
  51. Robles-Zazueta, C.A.; Crespo-Herrera, L.A.; Piñera-Chavez, F.J.; Rivera-Amado, C.; Aradottir, G.I. Climate Change Impacts on Crop Breeding: Targeting Interacting Biotic and Abiotic Stresses for Wheat Improvement. Plant Genome 2024, 17, e20365. [Google Scholar] [CrossRef] [PubMed]
  52. Xiong, W.; Reynolds, M.; Xu, Y. Climate Change Challenges Plant Breeding. Curr. Opin. Plant Biol. 2022, 70, 102308. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Map of cotton cultivar yield test sites, Brazil.
Figure 1. Map of cotton cultivar yield test sites, Brazil.
Agriculture 14 01914 g001
Figure 2. Standard structural equation model (SEM) structure showing the causal relationship among Earth’s skin temperature (ST), temperature at 2 meters (T2M), relative humidity at 2 meters (RH), wind speed at 2 meters maximum (WS), corrected precipitation (PRECTOTCORR), and total clear sky surface photosynthetically active radiation (CLRSKY_SFC_PAR_TOT) to the environmental index ( I ), and average fiber yield of each genotype ( y i ). The γ values represent the direct effects among environmental covariates to the index and each genotype. The α and β I y i S E M values represent the general mean of each genotype ( β 0 i S E M ), and the direct effect among environmental index to each genotype, respectively (general mean and adaptability parameter, as Eberhart and Russell, respectively). ζ 1 and ζ 2 represents the random residuals associated with endogenous variables. ψ 11 and ψ 22 represents the residual variance of the endogenous variable.
Figure 2. Standard structural equation model (SEM) structure showing the causal relationship among Earth’s skin temperature (ST), temperature at 2 meters (T2M), relative humidity at 2 meters (RH), wind speed at 2 meters maximum (WS), corrected precipitation (PRECTOTCORR), and total clear sky surface photosynthetically active radiation (CLRSKY_SFC_PAR_TOT) to the environmental index ( I ), and average fiber yield of each genotype ( y i ). The γ values represent the direct effects among environmental covariates to the index and each genotype. The α and β I y i S E M values represent the general mean of each genotype ( β 0 i S E M ), and the direct effect among environmental index to each genotype, respectively (general mean and adaptability parameter, as Eberhart and Russell, respectively). ζ 1 and ζ 2 represents the random residuals associated with endogenous variables. ψ 11 and ψ 22 represents the residual variance of the endogenous variable.
Agriculture 14 01914 g002
Figure 3. Workflow for using the ECERSEM-AdaptStab app.
Figure 3. Workflow for using the ECERSEM-AdaptStab app.
Agriculture 14 01914 g003
Figure 4. Example input for the phenotypic and local information files.
Figure 4. Example input for the phenotypic and local information files.
Agriculture 14 01914 g004
Figure 5. Heatmap that shows the direct effects between the environmental covariates and the genotypes via environmental index. Index: environmental index; TS: Earth skin temperature; T2M: temperature at 2 meters; RH2M: relative humidity at 2 meters; WS2M_MAX: wind speed at 2 meters maximum; PRECTOTCORR: precipitation corrected; CLRSKY_SFC_PAR_TOT: clear sky surface photosynthetically active radiation total. Gen1: TMG 41 WS; Gen2: TMG 43 WS; Gen3: IMA CV 690; Gen4: IMA 5675 B2RF; Gen5: IMA 08 WS; Gen6: NUOPAL; Gen7: DP 555 BGRR; Gen8: DELTA OPAL; Gen9: BRS 286; Gen10: BRS 335; Gen11: BRS 368 RF; Gen 12: BRS 369 RF.
Figure 5. Heatmap that shows the direct effects between the environmental covariates and the genotypes via environmental index. Index: environmental index; TS: Earth skin temperature; T2M: temperature at 2 meters; RH2M: relative humidity at 2 meters; WS2M_MAX: wind speed at 2 meters maximum; PRECTOTCORR: precipitation corrected; CLRSKY_SFC_PAR_TOT: clear sky surface photosynthetically active radiation total. Gen1: TMG 41 WS; Gen2: TMG 43 WS; Gen3: IMA CV 690; Gen4: IMA 5675 B2RF; Gen5: IMA 08 WS; Gen6: NUOPAL; Gen7: DP 555 BGRR; Gen8: DELTA OPAL; Gen9: BRS 286; Gen10: BRS 335; Gen11: BRS 368 RF; Gen 12: BRS 369 RF.
Agriculture 14 01914 g005
Figure 6. Heatmap that shows the indirect effects between the environmental covariates and the genotypes via environmental index. TS: Earth skin temperature; T2M: temperature at 2 meters; RH2M: relative humidity at 2 meters; WS2M_MAX: wind speed at 2 meters maximum; PRECTOTCORR: precipitation corrected; CLRSKY_SFC_PAR_TOT: clear sky surface photosynthetically active radiation total. Gen1: TMG 41 WS; Gen2: TMG 43 WS; Gen3: IMA CV 690; Gen4: IMA 5675 B2RF; Gen5: IMA 08 WS; Gen6: NUOPAL; Gen7: DP 555 BGRR; Gen8: DELTA OPAL; Gen9: BRS 286; Gen10: BRS 335; Gen11: BRS 368 RF; Gen 12: BRS 369 RF.
Figure 6. Heatmap that shows the indirect effects between the environmental covariates and the genotypes via environmental index. TS: Earth skin temperature; T2M: temperature at 2 meters; RH2M: relative humidity at 2 meters; WS2M_MAX: wind speed at 2 meters maximum; PRECTOTCORR: precipitation corrected; CLRSKY_SFC_PAR_TOT: clear sky surface photosynthetically active radiation total. Gen1: TMG 41 WS; Gen2: TMG 43 WS; Gen3: IMA CV 690; Gen4: IMA 5675 B2RF; Gen5: IMA 08 WS; Gen6: NUOPAL; Gen7: DP 555 BGRR; Gen8: DELTA OPAL; Gen9: BRS 286; Gen10: BRS 335; Gen11: BRS 368 RF; Gen 12: BRS 369 RF.
Agriculture 14 01914 g006
Figure 7. Heatmap that shows the total effects of the environmental covariates on the genotypes. TS: Earth skin temperature; T2M: temperature at 2 meters; RH2M: relative humidity at 2 meters; WS2M_MAX: wind speed at 2 meters maximum; PRECTOTCORR: precipitation corrected; CLRSKY_SFC_PAR_TOT: clear sky surface photosynthetically active radiation total. Gen1: TMG 41 WS; Gen2: TMG 43 WS; Gen3: IMA CV 690; Gen4: IMA 5675 B2RF; Gen5: IMA 08 WS; Gen6: NUOPAL; Gen7: DP 555 BGRR; Gen8: DELTA OPAL; Gen9: BRS 286; Gen10: BRS 335; Gen11: BRS 368 RF; Gen 12: BRS 369 RF.
Figure 7. Heatmap that shows the total effects of the environmental covariates on the genotypes. TS: Earth skin temperature; T2M: temperature at 2 meters; RH2M: relative humidity at 2 meters; WS2M_MAX: wind speed at 2 meters maximum; PRECTOTCORR: precipitation corrected; CLRSKY_SFC_PAR_TOT: clear sky surface photosynthetically active radiation total. Gen1: TMG 41 WS; Gen2: TMG 43 WS; Gen3: IMA CV 690; Gen4: IMA 5675 B2RF; Gen5: IMA 08 WS; Gen6: NUOPAL; Gen7: DP 555 BGRR; Gen8: DELTA OPAL; Gen9: BRS 286; Gen10: BRS 335; Gen11: BRS 368 RF; Gen 12: BRS 369 RF.
Agriculture 14 01914 g007
Table 1. Means, medians, standard deviations (SD), and ranges (minimum and maximum) for the fiber yield of each genotype, the environmental index, and each environmental covariate.
Table 1. Means, medians, standard deviations (SD), and ranges (minimum and maximum) for the fiber yield of each genotype, the environmental index, and each environmental covariate.
VariablesMinMaxMeanMedianSD
FYG1499.862416.361697.101759.41508.19
FYG2801.662297.761683.811698.99438.19
FYG3587.903093.161951.871873.70591.22
FYG4857.912220.701690.921777.35355.78
FYG5791.172473.471764.821830.74410.74
FYG6697.392585.811645.671670.49471.06
FYG7483.952786.231899.761876.04599.87
FYG8572.912592.311653.661691.46564.53
FYG9500.612604.901732.901701.33516.39
FYG10544.792614.171688.661813.26562.52
FYG11808.542889.711773.731805.11540.02
FYG12631.402712.451818.071914.97571.76
Environmental index−1083.63735.940.0087.20469.92
TS23.1526.1324.6024.291.02
T2M23.0626.0024.3224.060.98
RH2M65.7790.3878.8180.296.74
WS2M_MAX0.163.732.663.181.17
PRECTOTCORR1.289.765.165.423.19
CLRSKY_SFC_PAR_TOT141.16156.78151.32152.303.70
FYG: average fiber yield of each genotype (1, 2, …, 12); TS: Earth skin temperature; T2M: temperature at 2 meters; RH2M: relative humidity at 2 meters; WS2M_MAX: wind speed at 2 meters maximum; PRECTOTCORR: precipitation corrected; CLRSKY_SFC_PAR_TOT: clear sky surface photosynthetically active radiation total. Min: minimum; Max: maximum; SD: standard deviation.
Table 2. Joint analysis for yield of 12 early cotton genotypes evaluated in 19 Brazilian environments in the 2013/2014 and 2014/2015 crop seasons.
Table 2. Joint analysis for yield of 12 early cotton genotypes evaluated in 19 Brazilian environments in the 2013/2014 and 2014/2015 crop seasons.
Source of VariationDegrees of FreedomMean Square
Genotype11714,991.03 *
Environments1810,599,770.42 *
Block/environments5742,223.61 *
GEI198198,986.84 *
Residual62723,035.85
Mean-1750.08
Coefficient of variation (%)-8.67
* Represents significance at 1% probability by the F test.
Table 3. Adaptability and stability parameters obtained by the ECERSEM-AdaptStab app for 12 cotton genotypes in 19 environments in the Brazilian Cerrado.
Table 3. Adaptability and stability parameters obtained by the ECERSEM-AdaptStab app for 12 cotton genotypes in 19 environments in the Brazilian Cerrado.
ECERSEM-AdaptStabER
Genotypes β ^ 0 S E M β ^ 1 S E M σ ^ d S E M 2 R S E M 2 β ^ 0 E R β ^ 1 E R σ ^ d E R 2 R E R 2
TMG 41 WS1697.101.09 *29,027.54 *0.881697.100.9842,531.17 *0.82
TMG 43 WS1683.810.91 *40,204.28 *0.771683.810.80 *47,251.75 *0.74
IMA CV 6901951.871.52 *14,313.06 *0.951951.871.19 *30,820.69 *0.90
IMA 5675 B2RF1690.920.92 *19,863.71 *0.831690.920.67 *24,638.68 *0.77
IMA 08 WS1764.820.70 *39,891.97 *0.751764.820.70 *58,017.47 *0.64
NUOPAL1645.670.43 *9471.79 *0.951645.670.9422,563.10 *0.88
DP 555 BGRR1899.761.55 *5824.48 *0.981899.761.24 *12,839.75 *0.95
DELTA OPAL1653.660.92 *21,167.75 *0.921653.661.13 *35,320.60 *0.88
BRS 2861732.901.25 *22,869.41 *0.911732.901.0330,571.81 *0.87
BRS 3351688.660.52 *35,714.77 *0.881688.661.08 *58,215.95 *0.81
BRS 368 RF1773.730.78 *30,656.38 *0.881773.731.0638,783.87 *0.86
BRS 369 RF1818.071.36 *12,435.91 *0.951818.071.18 *15,010.33 *0.94
General mean1750.08---1750.08---
ECERSEM-AdaptStab: adaptability and stability analysis incorporating environmental covariates into Eberhart and Russell based on structural equation models; ER: Eberhart and Russell; * represents significance at 5% probability by the F test.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Suela, M.M.; Nascimento, M.; Nascimento, A.C.C.; Azevedo, C.F.; Teodoro, P.E.; Farias, F.J.C.; de Carvalho, L.P.; Jarquin, D. Integrating Environmental Covariates into Adaptability and Stability Analyses: A Structural Equation Modeling Approach for Cotton Breeding. Agriculture 2024, 14, 1914. https://doi.org/10.3390/agriculture14111914

AMA Style

Suela MM, Nascimento M, Nascimento ACC, Azevedo CF, Teodoro PE, Farias FJC, de Carvalho LP, Jarquin D. Integrating Environmental Covariates into Adaptability and Stability Analyses: A Structural Equation Modeling Approach for Cotton Breeding. Agriculture. 2024; 14(11):1914. https://doi.org/10.3390/agriculture14111914

Chicago/Turabian Style

Suela, Matheus Massariol, Moysés Nascimento, Ana Carolina Campana Nascimento, Camila Ferreira Azevedo, Paulo Eduardo Teodoro, Francisco José Correia Farias, Luiz Paulo de Carvalho, and Diego Jarquin. 2024. "Integrating Environmental Covariates into Adaptability and Stability Analyses: A Structural Equation Modeling Approach for Cotton Breeding" Agriculture 14, no. 11: 1914. https://doi.org/10.3390/agriculture14111914

APA Style

Suela, M. M., Nascimento, M., Nascimento, A. C. C., Azevedo, C. F., Teodoro, P. E., Farias, F. J. C., de Carvalho, L. P., & Jarquin, D. (2024). Integrating Environmental Covariates into Adaptability and Stability Analyses: A Structural Equation Modeling Approach for Cotton Breeding. Agriculture, 14(11), 1914. https://doi.org/10.3390/agriculture14111914

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop