Next Article in Journal
Evidence from Phylogenomics and Morphology Provide Insights into the Phylogeny, Plastome Evolution, and Taxonomy of Kitagawia
Next Article in Special Issue
Nutritional Balance and Genetic Diversity of Coffea canephora Genotypes
Previous Article in Journal
Tendril Anatomy: A Tool for Correct Identification among Cucurbitaceous Taxa
Previous Article in Special Issue
Description of an Arabica Coffee Ideotype for Agroforestry Cropping Systems: A Guideline for Breeding More Resilient New Varieties
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multi-Environment and Multi-Year Bayesian Analysis Approach in Coffee canephora

by
André Monzoli Covre
1,
Flavia Alves da Silva
2,*,
Gleison Oliosi
1,
Caio Cezar Guedes Correa
3,
Alexandre Pio Viana
2 and
Fabio Luiz Partelli
1,*
1
Centro Universitário Norte do Espírito Santo, Universidade Federal do Espírito Santo, Rodovia BR-101, Km 60, Litorâneo, São Mateus 29932-540, ES, Brazil
2
Laboratório de Melhoramento Genético Vegetal, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Avenida Alberto Lamego, 2000, Campos dos Goytacazes 28013-602, RJ, Brazil
3
Departamento de Agronomia, Universidade Federal do Espírito Santo, Alto Universitário, S/N Guararema, Alegre 29500-000, ES, Brazil
*
Authors to whom correspondence should be addressed.
Plants 2022, 11(23), 3274; https://doi.org/10.3390/plants11233274
Submission received: 3 September 2022 / Revised: 11 October 2022 / Accepted: 25 November 2022 / Published: 28 November 2022
(This article belongs to the Special Issue Coffee Breeding and Stress Biology)

Abstract

:
This work aimed to use the Bayesian approach to discriminate 43 genotypes of Coffea canephora cv. Conilon, which were cultivated in two producing regions to identify the most stable and productive genotypes. The experiment was a randomized block design with three replications and seven plants per plot, carried out in the south of Bahia and the north of Espírito Santo, environments with different climatic conditions, and evaluated during four harvests. The proposed Bayesian methodology was implemented in R language, using the MCMCglmm package. This approach made it possible to find great genetic divergence between the materials, and detect significant effects for both genotype, environment, and year, but the hyper-parametrized models (block effect) presented problems of singularity and convergence. It was also possible to detect a few differences between crops within the same environment. With a model with lower residual, it was possible to recommend the most productive genotypes for both environments: LB1, AD1, Peneirão, Z21, and P2.

1. Introduction

Worldwide, more than 80 tropical countries employ 25 million farmers that produce around 175 million bags of coffee every year. Commonly, Coffea arabica represents 60% and C. canephora 40% of the annual revenue of 172 million USD [1]. Brazil is an important player, due it is the main producer and exporter in the world, with an estimated increase in global demand in the 2021/22 cycle due to the advance of control of COVID-19 in important consumer centers, such as Europe and the United States [2]. In this scenario of strong C. arabica valuation due to the significant loss of production in Brazil (−25%), the industry tends to increase demand for C. canephora coffee to reduce the cost of producing blends. The Espírito Santo and Bahia states in Brazil are the largest producer with almost 17.5 thousand bags of already processed coffee, with an increase of 12% of C. canephora in the last harvest [2]. In this context, this is an important crop for the surrounding regions mainly due to the local climate that exerts a strong influence on production.
The climate in the region is predominantly within the ideal limits for the culture, mainly C.canephora which is temperature tolerant up to 37 °C by photoprotective and antioxidant mechanisms [3]. Outside the limits from 17 to 31 °C its development and production are reduced or impaired [4,5]. Although the limits are wide, it has been observed that coffee is highly responsive to small variations in temperature [6]. Breeding programs have developed new coffee genotypes, providing not only desirable agronomic characteristics, but also greater adaptability to the environment [7].
Even though breeding programs develop better genotypes available to environments, they cannot be tested in many locations. This impossibility is mainly due to the inherent characteristics of perennial plants, which have a long reproductive cycle, a high annual variation, differences in precocity between genotypes, and productive longevity [8]. So, biometric studies that fill these gaps are important to identify productive cultivars adapted to the environment, which is the main component of the competitiveness of coffee plantations. Difficulties inherent to perennial crops such as coffee, in addition to the lack of information aimed at important regions, also pose problems for statistical models [9].
The challenge for statistical modeling is focused on the quantity and quality of data obtained by this type of crop, which certainly reduces the predictive power of the models. Both from the point of view of genetic breeding and commercial orchards, this consequence comes from the use of the same genotype for several years, a low survival rate during the life of the orchards, and the need for repeated evaluations in each individual over time [9]. Therefore, using a more accurate modeling approach can undoubtedly save resources and improve the chance of success of orchards with perennial plants.
An approach that has been used more frequently in these types of problems is the Bayesian approach. This approach is widely used with genomic data in culture [10,11,12], but for phenotypic data, it has already been successfully applied to other perennial plants [13,14,15]. The Bayesian approach enables advantages compared to the frequentist (REML/BLUP). The main one is the possibility of using previous information (called informative priors) incorporated in the parameter’s distribution of the model [16]. In the frequentist’s approach, previous data are generally used in a joint analysis, which often generates hyper-parameterized models, with unplanned/incompatible experimental design or missing observations. Usually, it’s become a source of variation in the model and does not add much information, beyond the possibility of identifying if the previous data had different averages/variances from the current experiment. Another advantage is that the credibility intervals tend to be smaller than the confidence intervals with a proper prior [17]. Given the above, this work aims to use the Bayesian approach to discriminate 43 genotypes of C. canephora genotypes cv. Conilon is the most widely cultivated cultivar in Brazil for the Robusta type of coffee in the main coffee-producing regions.

2. Results

Both genotypes, locations, and year effects incorporated into the model were significant (p < 0.05) by chi-square (Table 1). The models that include the block effect, in addition to increasing the number of parameters in the model, presented problems of singularity and convergence. It is also noted that the deviance of model 3 + 5 (which corresponds to the 3rd model plus the block effect) is close to model 3, thus, the block effect was disregarded in the models. The model chosen was the 3 + 8 model, in which the year models both the intercept and slope of the location, at the cost of one more parameter in the model (six to seven), but with less deviance.
After choosing the model, it was adjusted following a Bayesian approach. The chain generated by the approach passed on the quality statistics of the Markov Chain convergence and sample size tests (Table A1). In addition to the tests, all chains were visually inspected (Figure 1 and Figure A1, Figure A2, Figure A3, Figure A4 and Figure A5), looking for initial instability in the chain that could be due to a small burn-in or problems along the chain that could come from either the model or inadequate parameters in the prior. The distribution of residues was also checked to identify any possible bias in the model or non-incorporated effect (Figure A6). No issues were found in either case. Observing the chains from parameters year inside the environment (Figure 1) of the genotypes in each year and location can be a good indication of how the genotypes are responding in these environments.
A biannually coffee yield was observed in Bahia, but it was not observed in Espírito Santo (Figure 1). Despite this biannually, the average yield in Bahia (100.35 bags of green coffee·ha−1) was not significantly different from Espírito Santo (84.05 bags of green coffee·ha−1), as well as both farmers who produce coffee in the two locations have the potential to produce similar amounts. In the years 2017 and 2018 in Bahia, the highest average production of coffee genotypes was observed, with 133.58 (lower CI = 126.60 and upper CI = 140.40) and 126.02 (lower CI = 118.96 and upper CI = 132.74) bags of green coffee·ha−1. These two years were significantly higher than the others evaluated in Bahia. The averages and credibility intervals observed in Espírito Santo did not show significant differences between the years in that location. Subsequently, some variance components were estimated from the genotype chains, mainly obtaining heritability and genotype ranking by BLUP (Table 2).
Finally, a ranking of genotype production was obtained, allowing farms in these locations to choose the most productive genotypes among those evaluated for use in their orchards (Figure 2). The heritability of the yield trait was about 0.28 with credibility intervals between 0.18 and 0.37.

3. Discussion

Accurate estimates and production pattern for a crop is important for the regional development of the crop, mainly because it allows accurate planning and profit forecasting on the cultivation activity. This importance is easily seen when a farmer must choose between two slightly different locations to plant their orchard. If initial studies do not consider that there may be a different variance in yield between the two environments and obtain a single average estimate for both, yield may be reduced by some non-incorporated effect (such as unfavorable weather). This loss can be further attenuated if considered an orchard destined for export, which generally yields 40% more than conventional ones.
The coffee crop is planted in a relatively wide spacing, where to obtain an orchard with a few hundred plants, the producer easily needs a large area. Thinking about the size needed to experiment, it was planned to be installed in a block design, as we believe that the soil might not be very homogeneous. After collecting all the data and thinking about the dependent random variable and all the explanatory variables, we tried to keep the model as inflated as possible. Later, use some form of model selection to choose a model that balances predictive accuracy and overfitting/Type I error [18,19], removing only the terms necessary to allow for a non-singular adjustment [20]. So, looking at our model simplification results, mainly looking at how much each factor contributes to reducing the model’s uncertainty, we believe that the block effect can be ignored. This factor contributed little to a better fit of the model, in addition to causing convergence problems.
After simplifying the model, we chose to use a Bayesian approach. The Bayesian approach has some advantages over frequentist analysis. The main one considered in our choice is the possibility of using informative priors on model parameters [16]. After several years of following farms and collecting data in this important coffee-producing region, in some cases with the same genotypes, we could include this prior information as a joint experiment for example. This would generate unbalanced data, sources of variation without enough repetitions, hyper parametrization, complexity, and perhaps increase model uncertainty. Another advantage is that credibility intervals are often reported as smaller than confidence intervals if an appropriate prior has been used, as observed in guava, another perennial fruit tree [14]. Due to the likelihood function, if a non-informative prior is used, a Bayesian performance at least similar to the frequentist is expected [21,22,23].
The observation of the aspect of the chain is also a quality control criterion frequently used in the adjustment of the Bayesian model to the data [24]. If any problems such as chain instability, lack of convergence, or instability were observed, we could revise our prior, chain size, or even the sources of variation in the model. If there is interest from other breeders or biometrics, our chains, and their quality tests, as well as algorithms, can be obtained from Appendix A.
Disregarding the blocking effect, all others presented at least one significant contrast (p < 0.05), moreover, looking at the residuals of the model, we did not observe any bias. This implies that the effects we considered in the model, in addition to being significant, were reasonably sufficient to control the errors. This information is particularly important to discriminate which genotypes are more promising, and the farmer in the region of the greatest importance for the crop in Brazil can better plan the harvest volume, even in regions relatively close to those evaluated in the experiment. All the genotypes studied here had already been studied for other traits related to grain quality [25], and a significant difference was observed between the genotypes, like as differences between the growth of the root system, which is directly correlated with yield, nutrient concentrations in leaves, our main source of variation of interest for recommendation to farmers.
This fluctuation in production observed in Bahia has already been observed in another trait, this is directly related to the best rainfall conditions concerning the state of Espírito Santo, the leaf concentrations of macro- and micronutrients are at better levels [26]. The rainfall and temperature as well as the reproductive periods influenced this trait, and as here the crop has closed the cycle for all genotypes, we believe that climatic data are strong candidates to enter as covariates in future models in cultivar recommendation systems. Here, our year/location as random provides a hint that something could vary between the interaction of these variables, but we don’t know what it is, although a good guess would be the weather.
Although we did not detect a significant difference between the locations, in the meteorological data we observe that Bahia, for different periods, had an average temperature higher than Espírito Santo. This may have contributed to a greater sum of degree days in the culture at that location. On the other hand, a lack of consistency in Espírito Santo may have generated greater variability in production there, which led to greater credibility intervals than those observed in Bahia.
So, was proceeded with the rank of the individuals by BLUP obtained using the Bayesian approach. The Bayesian MCMC methods consider uncertainties in the parameters throughout the inference process. On the other hand, the frequentist models are predicted by punctual estimates of variance components and considered as true values, ignoring uncertainty in the variance parameters [27].
It was possible to observe great genotypic variability in the group of studied genotypes. The most relevant information here is the group of the first five genotypes, which stood out from the others and can be used by farmers who want high yields. The credibility intervals (HPD interval) found can be considered relatively high, but this great variability may come from the interaction between the environmental genotype, mainly in the Espírito Santo locations. This does not rule out the other genotypes as potential candidates to compose an orchard with characteristics that the most productive may not have, such as good stability between environments, and great responsibility to favorable environments. However, another study with the material is needed to discriminate these genotypes for these characteristics. The heritability found was also a value within the expected, which is not very high but is consistent with the polygenic nature of the studied trait. The five genotypes shown in Figure 2 are good recommendations for people interested in growing crops in the region, or any other environment with similar conditions, especially regarding altitude. The selection of the best families was performed to be recombined and to generate new populations.

4. Materials and Methods

4.1. Plant Material and Field Experiments

The genetic material consisted of 43 coffee (C. canephora) genotypes, both adapted to tropical environments, and widely cultivated by coffee farmers in the State of Espírito Santo—Brazil. The genotypes were evaluated for high-yield potential and agronomic traits of interest. The experiment was planted at a spacing of 3.5 m × 1.0 m, in a randomized block design with three replications, and seven plants as an experimental unit. All cultural treatments were made according to the crop requirement, and pruning kept around 12,000 orthotropic branches.
The trials were carried out in two environments in Brazil, in the south of Bahia (16°36′52″ S, 39°30′33″ W, and 140 m asl) and the north of Espírito Santo (18°39′43” S, 40°25′52″ W, and 200 m asl). According to Köppen’s classification, the regional climate is Aw tropical, with hot humid summers and dry winters, mean minimum temperatures of over 15 °C (July to August) and mean maxima of over 35 °C in January to February (Figure 3).
The edaphoclimatic conditions in these two locations characterize slightly different conditions, and therefore two distinct environments for the plants. At each location, four harvests were carried out, between the harvested period 2016 to 2019 at the experimental plot level, according to the maturation cycle of each genotype. The individual yield (bags of green coffee per hectare) was used as a random variable in the statistical analyses.

4.2. Model Choice

A statistical model was implemented that contemplates the initially planned experimental design, with all the identified effects. From this model called the complete model, a simplification of the model was performed, removing non-significative effects or their combinations. After that, a Bayesian approach was used. This simplification helps to reduce the number of model parameters and complexity, in addition to helping the processing time in Bayesian chains, reducing their numbers if there are fewer effects in the model. Complete model:
y i j k = β 0 + β 1 g i + β 2 p j + β 3 ( p / t ) j k + β 4 ( p / b ) j l + ϵ i j k I
where: y i j k is the vector of the phenotypic values of the yield in the i t h genotype of j t h location of k t h year; β 0 is an associated model parameter (intercept); g i = 1 , , 43 is the parametric vector of random effects of i t h genotype associated with the vector y by the incidence matrix known β 1 , assuming that g N 0 , I Σ g   p j = 1 , , 2 is the parametric vector of fixed (systematic) effects of j t h location associated with the vector y by the incidence matrix known β 2 assuming that p N μ p , I Σ p ; t k = 1 , , 4 is the parametric vector of random effects of k t h year inside j t h location influencing both intercept and slope, associated with the vector y by the incidence matrix known β 3 , assuming that t N 0 , I Σ t ; b l = 1 , , 3 is the parametric vector of random effects of l t h block inside j t h location influencing both intercept and slope, associated with the vector y by the incidence matrix known β 4 , assuming that t N 0 , I Σ l ; and ϵ is the vector of random residual effects, assuming that ϵ N 0 , I Σ ϵ .
From this complete model(I), a simplification of the model was performed using package::functionlme4::lmer [18] in R language [28]. Were observed its goodness of fit through deviance and the number of parameters that each effect includes in the model. After simplification, a model for fit (II) using the Bayesian approach was chosen, using the same definitions for the effects as the full model described above:
y i j k = β 0 + β 1 g i + β 2 p j + β 3 ( p / t ) j k + ϵ i j k ( II )

4.3. Bayesian Approach

The Bayesian approach was used with the same model (II), applying the Monte Carlo method based on Markov Chains (MCMC), employing the package MCMCglmm::MCMCglmmas [29], in R language. A total of 500,000 iterations were estimated, discarding the first one at 50,000 (burn-in) and performing a 1:5 (thin) sampling. Quality statistics of the Markov Chain convergence were tested by the Geweke criterion [30] using the coda::geweke.diag package [31] in R language.
The formula represented in model (II) can be written more compactly, representing the system of equations with random effects models, expressed as:
E y = X β + Z μ + ϵ ( III )
where: X is the design matrix for fixed effects and contains the predictor information and β = β 0 , β 2 is the vector of parameters; Z is the design matrix for random effects and μ = μ 1 , μ 3 is the vector of parameters; and ϵ is the error associated with the model, assuming y N X β , I Σ ϵ .
In a Bayesian approach, both fixed and random effects are random, also called systematic effects, treated in the same way [29], and combine the design matrices W = X , Z and combine the vectors of parameters θ = β , μ :
E y = W θ + ϵ ( IV )
In this formula, the design matrix is exactly equivalent to the original design matrix (frequentist) except we have one additional variable. In the frequentist model, this variable is absorbed into the global intercept because it could not be uniquely estimated from the data. Now, in the Bayesian approach, this additional parameter can be estimated by inclusion from a prior. As is usual in a Bayesian analysis, if there is no information in the data it has to come from the prior.
Priors can be defined for the residuals, the fixed effects, and the random effects. MCMCglmm::MCMCglmm uses an inverse-Wishart distribution prior for the (co)variances matrices and a normal prior for the fixed effects [29]. For model (II), the prior information was based on the posterior distributions of the parameters from a meta-analysis of coffee experiments [32,33,34,35]. The prior informative probability distribution for the systematic parameters of interest was provided by: β i ~ N μ 0 , V b in which μ 0   is a vector of averages and V b is a diagonal matrix of the a priori variance of β . Also was used an inverse-Wishart distribution for both random G 0 and residuals R 0 as a priori for the covariance matrices: G 0 ~ W 1 1 Σ g , n and R 0 ~ W 1 1 Σ ϵ , n in which Σ g and Σ ϵ are scale matrices.
From chains, the a posteriori distribution was used to take averages, credibility intervals, breeding values, variance components, and broad-sense heritability, using the expression: h 2 = σ g 2 σ g 2 + σ p / t 2 + σ ϵ 2 , where: σ g 2 , σ p / t 2 and σ ϵ 2 are the genotypic, year inside the environment, and residual variances, respectively.

Author Contributions

F.L.P.: conceptualization; F.A.d.S. and C.C.G.C.: methodology, formal analysis, and figures; F.A.d.S.: writing original draft preparation; A.M.C. and G.O.: investigation, data curation; A.P.V. and F.L.P.: supervision, writing, reviewing and editing. All authors have read and agreed to the published version of the manuscript.

Funding

The authors are indebted to the pioneer breeders, the coffee farmers, who made the first steps in the process of selection of most of the superior genotypes available nowadays. Therefore, the traditional names of the clones were maintained as used among coffee growers. We further acknowledge the support of the Federal University of Espírito Santo (UFES), the National Council of Scientific and Technological Development (CNPq), and the Foundation for Research and Innovation Support of Espírito Santo (FAPES). This study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior—Brasil (CAPES)—Finance Code 001.

Data Availability Statement

The complete phenotypic information, breeding values, and generated chains used in this study were submitted to the Open Science Framework and received the public identifier DOI: https://doi.org/10.17605/OSF.IO/D8T2R, accessed on 1 November 2022.

Acknowledgments

The authors gratefully acknowledge the financial support of the institution’s FAPES, FAPERJ, CAPES, CNPq, Universidade Federal do Espírito Santo—UFES, and Universidade Estadual do Norte Fluminense Darcy Ribeiro—UENF.

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper. The plant collection and use were in accordance with all the relevant guidelines.

Appendix A

Table A1. Quality statistics of the Markov Chain convergence from 43 coffee (Coffea canephora) genotypes grown in Bahia and Espírito Santo.
Table A1. Quality statistics of the Markov Chain convergence from 43 coffee (Coffea canephora) genotypes grown in Bahia and Espírito Santo.
Heidel
Stationarity teststart iterationp-valueHalfwidth testMeanHalfwidth
genotypepassed10.939passed2941.031
environment:yearpassed10.816passed120317.331
unitspassed10.52passed7560.498
Geweke
genotype0.6881
environment:year0.6316
units0.8992
Effective size
genotype19,982.16
environment:year18,800
units18,278.89
Raf Tery
Burn-inTotalLower boundDependence factor
genotype1019,09037465.1
environment:year1018,27537464.88
units1018,60037464.97
Figure A1. Time series of the chains and density curves of the effect of intercept, Espírito Santo, genotype 02, and genotype 122.
Figure A1. Time series of the chains and density curves of the effect of intercept, Espírito Santo, genotype 02, and genotype 122.
Plants 11 03274 g0a1
Figure A2. Time series of the chains and density curves of the effect of genotypes A1, AD1, Alecrim, and AP.
Figure A2. Time series of the chains and density curves of the effect of genotypes A1, AD1, Alecrim, and AP.
Plants 11 03274 g0a2
Figure A3. Time series of the chains and density curves of the effect of genotypes B01, Bamburral, Beira Rio (8), and Bicudo.
Figure A3. Time series of the chains and density curves of the effect of genotypes B01, Bamburral, Beira Rio (8), and Bicudo.
Plants 11 03274 g0a3
Figure A4. Time series of the chains and density curves of the effect of genotypes CH1, Cheique, Clementino, and Graudão HP.
Figure A4. Time series of the chains and density curves of the effect of genotypes CH1, Cheique, Clementino, and Graudão HP.
Plants 11 03274 g0a4
Figure A5. Time series of the chains and density curves of the effect of genotypes 143, 153, 18, and 700.
Figure A5. Time series of the chains and density curves of the effect of genotypes 143, 153, 18, and 700.
Plants 11 03274 g0a5
Figure A6. Distribution of residuals for each fitted value.
Figure A6. Distribution of residuals for each fitted value.
Plants 11 03274 g0a6

References

  1. USDA United States Department of Agriculture. Yld Arab and Robu Cof. 2020. Available online: https://www.usda.gov (accessed on 1 November 2022).
  2. CONAB. Companhia Nacional de Abastecimento—Acompanhamento Da Safra Brasileira: Acompanhamento Da Safra Brasileira de Café. Terc. Levant. 2021, 8, 1–58. [Google Scholar]
  3. Martins, M.Q.; Rodrigues, W.P.; Fortunato, A.S.; Leitão, A.E.; Rodrigues, A.P.; Pais, I.P.; Martins, L.D.; Silva, M.J.; Reboredo, F.H.; Partelli, F.L.; et al. Protective Response Mechanisms to Heat Stress in Interaction with High [CO2] Conditions in Coffea spp. Front. Plant Sci. 2016, 7, 947. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Partelli, F.L.; Vieira, H.D.; Silva, M.G.; Ramalho, J.C. Seasonal Vegetative Growth of Different Age Branches of Conilon Coffee Tree. Sem. Cien. Agr. 2010, 31, 619–626. [Google Scholar] [CrossRef] [Green Version]
  5. Ramalho, J.C.; Rodrigues, A.P.; Lidon, F.C.; Marques, L.M.C.; Leitão, A.E.; Fortunato, A.S.; Pais, I.P.; Silva, M.J.; Scotti-Campos, P.; Lopes, A.; et al. Stress Cross-Response of the Antioxidative System Promoted by Superimposed Drought and Cold Conditions in Coffea spp. PLoS ONE 2018, 13, e0198694. [Google Scholar] [CrossRef] [PubMed]
  6. Kath, J.; Byrareddy, V.M.; Craparo, A.; Nguyen-Huy, T.; Mushtaq, S.; Cao, L.; Bossolasco, L. Not so Robust: Robusta Coffee Production Is Highly Sensitive to Temperature. Glob. Chang. Biol. 2020, 26, 3677–3688. [Google Scholar] [CrossRef]
  7. Partelli, F.L.; Golynski, A.; Ferreira, A.; Martins, M.Q.; Mauri, A.L.; Ramalho, J.C.; Vieira, H.D. Andina-First Clonal Cultivar of High-Altitude Conilon Coffee. Crop Breed. Appl. Biotechnol. 2019, 19, 476–480. [Google Scholar] [CrossRef] [Green Version]
  8. Whitaker, J.; Field, J.L.; Bernacchi, C.J.; Cerri, C.E.P.; Ceulemans, R.; Davies, C.A.; DeLucia, E.H.; Donnison, I.S.; McCalmont, J.P.; Paustian, K.; et al. Consensus, Uncertainties and Challenges for Perennial Bioenergy Crops and Land Use. GCB Bioenergy 2018, 10, 150–164. [Google Scholar] [CrossRef]
  9. Van Eeuwijk, F.A.; Bustos-Korts, D.; Millet, E.J.; Boer, M.P.; Kruijer, W.; Thompson, A.; Malosetti, M.; Iwata, H.; Quiroz, R.; Kuppe, C. Modelling Strategies for Assessing and Increasing the Effectiveness of New Phenotyping Techniques in Plant Breeding. Plant Sci. 2019, 282, 23–39. [Google Scholar] [CrossRef]
  10. Labouisse, J.-P.; Cubry, P.; Austerlitz, F.; Rivallan, R.; Nguyen, H. New Insights on Spatial Genetic Structure and Diversity of Coffea canephora (Rubiaceae) in Upper Guinea Based on Old Herbaria. Plant Ecol. Evol. 2020, 153, 82–100. [Google Scholar] [CrossRef]
  11. Ferrão, L.F.V.; Ferrão, R.G.; Ferrão, M.A.G.; Fonseca, A.; Carbonetto, P.; Stephens, M.; Garcia, A.A.F. Accurate Genomic Prediction of Coffea canephora in Multiple Environments Using Whole-Genome Statistical Models. Heredity 2019, 122, 261–275. [Google Scholar] [CrossRef] [Green Version]
  12. Alkimim, E.R.; Caixeta, E.T.; Sousa, T.V.; da Silva, F.L.; Sakiyama, N.S.; Zambolim, L. High-Throughput Targeted Genotyping Using next-Generation Sequencing Applied in Coffea canephora Breeding. Euphytica 2018, 214, 50. [Google Scholar] [CrossRef]
  13. Mora, F.; Serra, N. Bayesian Estimation of Genetic Parameters for Growth, Stem Straightness, and Survival in Eucalyptus Globulus on an Andean Foothill Site. Tree Genet. Genomes 2014, 10, 711–719. [Google Scholar] [CrossRef]
  14. da Silva, F.A.; Viana, A.P.; Corrêa, C.C.G.; Carvalho, B.M.; de Sousa, C.M.B.; Amaral, B.D.; Ambrósio, M.; Glória, L.S. Impact of Bayesian Inference on the Selection of Psidium Guajava. Sci. Rep. 2020, 10, 1999. [Google Scholar] [CrossRef] [Green Version]
  15. Souza, A.O.; Viana, A.P.; e Silva, F.F.; Azevedo, C.F.; da Silva, F.A.; e Silva, F.H.L. Row-Col and Bayesian Approach Seeking to Improve the Predictive Capacity and Selection of Passion Fruit. Sci. Agric. 2022, 79. [Google Scholar] [CrossRef]
  16. Beaumont, M.A.; Rannala, B. The Bayesian Revolution in Genetics. Nat. Rev. Genet. 2004, 5, 251–261. [Google Scholar] [CrossRef]
  17. Silva Junqueira, V.; de Azevedo Peixoto, L.; Galvêas Laviola, B.; Lopes Bhering, L.; Mendonça, S.; Agostini Costa, T.d.S.; Antoniassi, R. Correction: Bayesian Multi-Trait Analysis Reveals a Useful Tool to Increase Oil Concentration and to Decrease Toxicity in Jatropha Curcas L. PLoS ONE 2016, 11, e0161046. [Google Scholar] [CrossRef]
  18. Bates, D.; Mächler, M.; Bolker, B.; Walker, S. Fitting Linear Mixed-Effects Models Using Lme4. J. Stat. Softw. 2015, 67, 48. [Google Scholar] [CrossRef]
  19. Matuschek, H.; Kliegl, R.; Vasishth, S.; Baayen, H.; Bates, D. Balancing Type I Error and Power in Linear Mixed Models. J. Mem. Lang. 2017, 94, 305–315. [Google Scholar] [CrossRef]
  20. Barr, D.J.; Levy, R.; Scheepers, C.; Tily, H.J. Random Effects Structure for Confirmatory Hypothesis Testing: Keep It Maximal. J. Mem. Lang. 2013, 68, 255–278. [Google Scholar] [CrossRef] [Green Version]
  21. Sorensen, D.; Gianola, D. Likelihood, Bayesian, and MCMC Methods in Quantitative Genetics, 1st ed.; Springer Science & Business Media: New York, NY, USA, 2007; 740p, ISBN 0387954406. [Google Scholar]
  22. Sandoval, V.J.C.; Silva, F.F.; Resende, M.D.V.; Macedo, L.R.; Cecon, P.R. Bayesian Random Regression for Genetic Evaluation of South American Leaf Blight in Rubber Trees. Rev. Cien. Agro. 2017, 48, 151–156. [Google Scholar] [CrossRef] [Green Version]
  23. Junqueira, V.S.; de Azevedo Peixoto, L.; Laviola, B.G.; Bhering, L.L.; Mendonça, S.; da Silveira Agostini Costa, T.; Antoniassi, R. Bayesian Multi-Trait Analysis Reveals a Useful Tool to Increase Oil Concentration and to Decrease Toxicity in Jatropha Curcas L. PLoS ONE 2016, 11, e0157038. [Google Scholar] [CrossRef]
  24. Carlin, B.P.; Chib, S. Bayesian Model Choice via Markov Chain Monte Carlo Methods. J. R. Stat. Soc. 1995, 57, 473–484. [Google Scholar] [CrossRef]
  25. Partelli, F.L.; Oliosi, G.; Dalazen, J.R.; da Silva, C.A.; Vieira, H.D.; Espindula, M.C. Proportion of Ripe Fruit Weight and Volume to Green Coffee: Differences in 43 Genotypes of Coffea canephora. Agron. J. 2021, 113, 1050–1057. [Google Scholar] [CrossRef]
  26. Oliosi, G.; Partelli, F.L.; da Silva, C.A.; Dubberstein, D.; Gontijo, I.; Tomaz, M.A. Seasonal Variation in Leaf Nutrient Concentration of Conilon Coffee Genotypes. J. Plant Nutr. 2021, 44, 74–85. [Google Scholar] [CrossRef]
  27. Sorensen, D. Developments in Statistical Analysis in Quantitative Genetics. Genetica 2009, 136, 319–332. [Google Scholar] [CrossRef]
  28. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2020. [Google Scholar]
  29. Hadfield, J. MCMC Methods for Multi-Response Generalized Linear Mixed Models: The MCMCglmm R Package. J. Stat. Softw. 2010, 33, 1–22. [Google Scholar] [CrossRef] [Green Version]
  30. Cowles, M.K.; Carlin, B.P. Markov Chain Monte Carlo Convergence Diagnostics: A Comparative Review. J. Am. Stat. Assoc. 1996, 91, 883–904. [Google Scholar] [CrossRef]
  31. Plummer, M.; Best, N.; Cowles, K.; Vines, K. CODA: Convergence Diagnosis and Output Analysis for MCMC. R News 2006, 6, 7–11. [Google Scholar]
  32. Bonomo, D.Z.; Bonomo, R.; Partelli, F.L.; de Souza, J.M. Performance of Conilon Coffee Genotypes under Different Adjusted Crop Coefficients. IRRIGA 2017, 22, 236–248. [Google Scholar] [CrossRef]
  33. Partelli, F.L.; Giles, J.A.D.; Oliosi, G.; Covre, A.M.; Ferreira, A.; Rodrigues, V.M. Tributun: A Coffee Cultivar Developed in Partnership with Farmers. Crop Breed. Appl. Biotechnol. 2020, 20, e30002025. [Google Scholar] [CrossRef]
  34. Covre, A.M.; Partelli, F.L.; Bonomo, R.; Braun, H.; Ronchi, C.P. Vegetative Growth of Conilon Coffee Plants under Two Water Conditions in the Atlantic Region of Bahia State, Brazil. Acta Sci. Agron. 2016, 38, 535. [Google Scholar] [CrossRef] [Green Version]
  35. Silva, L.O.E.; Schmidt, R.; Valani, G.P.; Ferreira, A.; Ribeiro-Barros, A.I.; Partelli, F.L. Root Trait Variability in Coffea canephora Genotypes and Its Relation to Plant Height and Crop Yield. Agronomy 2020, 10, 1394. [Google Scholar] [CrossRef]
Figure 1. Time series of the chains of the four crops within the Bahia location, and probability density function curve of the chains (A). Observed versus predicted values for the four crops in the Bahia location (B). Time series of chains of the four crops within the Espírito Santo location, and probability density function curve of the chains (C). Observed versus predicted values for the four cultures at the Espírito Santo location (D).
Figure 1. Time series of the chains of the four crops within the Bahia location, and probability density function curve of the chains (A). Observed versus predicted values for the four crops in the Bahia location (B). Time series of chains of the four crops within the Espírito Santo location, and probability density function curve of the chains (C). Observed versus predicted values for the four cultures at the Espírito Santo location (D).
Plants 11 03274 g001
Figure 2. Yield average estimates for four harvests for 43 coffee (Coffea canephora) genotypes grown in Bahia and Espírito Santo. Means were grouped into values above one standard deviation from the mean, above the mean, below the mean, and below one standard deviation from the mean.
Figure 2. Yield average estimates for four harvests for 43 coffee (Coffea canephora) genotypes grown in Bahia and Espírito Santo. Means were grouped into values above one standard deviation from the mean, above the mean, below the mean, and below one standard deviation from the mean.
Plants 11 03274 g002
Figure 3. Information on monthly averages of total precipitation and the average temperature of climate data from two environments, Bahia and Espírito Santo, in the period between the implementation of the experiment and the last evaluated coffee crop (C. canephora).
Figure 3. Information on monthly averages of total precipitation and the average temperature of climate data from two environments, Bahia and Espírito Santo, in the period between the implementation of the experiment and the last evaluated coffee crop (C. canephora).
Plants 11 03274 g003
Table 1. Summary of ANOVA to simplify models with good-fit statistics on 43 coffee genotypes (Coffea canephora) for yield evaluated in two locations and four years.
Table 1. Summary of ANOVA to simplify models with good-fit statistics on 43 coffee genotypes (Coffea canephora) for yield evaluated in two locations and four years.
ModelNparDFChisqPr (>Chisq)Deviance
1Null2---10,579.3
2(1|genotype)3191.21.2 × 10−2110,488.1
3(1|genotype) + location4148.82.7 × 10−1210,439.2
3 + 4(1|year)51345.73.6 × 10−7710,093.5
3 + 5(1|block) *500.0-10,438.6
3 + 6(1|location/year)61557.72.5 × 10−1239880.8
3 + 7(1|location/block) *600.0-10,438.6
3 + 8(location|year)71559.61.0 × 10−1239879.0
3 + 9(location|block) **700.0-10,437.1
3 + 10(location|block) + (location|year) **103566.12.2 × 10−1229871.0
Note: From the third model onwards, all consider the effect of genotype and location. * Model failed to converge. ** boundary (singular) fit. Npar = number of parameters. DF = degrees of freedom. Chisq = chi-square test.
Table 2. The posterior mean of BLUP estimates for 43 coffee genotypes (Coffea canephora) and their higher posterior density (HPD) interval, estimates were obtained by a Bayesian model.
Table 2. The posterior mean of BLUP estimates for 43 coffee genotypes (Coffea canephora) and their higher posterior density (HPD) interval, estimates were obtained by a Bayesian model.
GenotypeBLUPHPD IntervalGenotypeBLUPHPD Interval
LowerUpperLowerUpper
LB1 *,#34.7623.4146.64L80 #0.94−10.4512.41
AD1 *,#26.4014.5237.79Valcir P0.31−11.7311.50
Peneirão *,#24.8113.2736.13Ouro Negro0.27−11.1111.65
Z2124.7412.9936.03Z37−0.63−11.8711.17
P2 *19.727.9331.13Z40−0.76−12.1610.63
0213.902.1725.28Tardio C−7.84−19.003.91
A1 #12.931.5524.62153−8.71−19.983.05
Ouro Negro 312.921.5924.73Sementes−8.83−20.562.33
AP *11.45−0.1722.85122−9.02−20.372.65
Verdim D10.23−1.0721.68CH1−10.25−21.871.22
70010.12−1.3021.48Pirata−11.70−23.25−0.05
1438.75−2.6420.24Z29−12.58−24.19−1.29
Imbigudinho *8.56−2.9620.0618−13.06−24.36−1.50
Clementino7.96−3.4319.79Z38−13.73−25.31−2.40
Z397.16−4.3718.63Verdim R−14.28−26.31−2.92
Graudão HP7.03−4.6718.30Tardio V−18.12−29.96−6.54
Bicudo #6.52−5.0817.78Z35−19.31−30.96−8.02
Z365.06−6.0416.77Cheique−20.16−31.58−8.34
Bamburral4.97−6.8316.35Alecrim−25.66−36.92−13.85
P14.78−6.5816.21Beira Rio (8)−30.68−42.06−18.86
Z182.23−8.7114.05B01−41.98−53.54−30.46
Ouro Negro 11.54−9.6012.99----
* Derived from the cultivar Monte Pascoal (Partelli et al., 2021), # derived from the cultivar Plena which is in the registration process.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Covre, A.M.; da Silva, F.A.; Oliosi, G.; Correa, C.C.G.; Viana, A.P.; Partelli, F.L. Multi-Environment and Multi-Year Bayesian Analysis Approach in Coffee canephora. Plants 2022, 11, 3274. https://doi.org/10.3390/plants11233274

AMA Style

Covre AM, da Silva FA, Oliosi G, Correa CCG, Viana AP, Partelli FL. Multi-Environment and Multi-Year Bayesian Analysis Approach in Coffee canephora. Plants. 2022; 11(23):3274. https://doi.org/10.3390/plants11233274

Chicago/Turabian Style

Covre, André Monzoli, Flavia Alves da Silva, Gleison Oliosi, Caio Cezar Guedes Correa, Alexandre Pio Viana, and Fabio Luiz Partelli. 2022. "Multi-Environment and Multi-Year Bayesian Analysis Approach in Coffee canephora" Plants 11, no. 23: 3274. https://doi.org/10.3390/plants11233274

APA Style

Covre, A. M., da Silva, F. A., Oliosi, G., Correa, C. C. G., Viana, A. P., & Partelli, F. L. (2022). Multi-Environment and Multi-Year Bayesian Analysis Approach in Coffee canephora. Plants, 11(23), 3274. https://doi.org/10.3390/plants11233274

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop