Next Article in Journal
Non-Invasive Prenatal Screening from a Genetic Counseling Prospective: Pre and Post-Genetic Counseling Regarding Rare Chromosomal Abnormalities and Incidental Finding
Next Article in Special Issue
Quantitative Trait Loci Mappings for the Sulfur Utilization Efficiency-Related Traits at the Seedling Stage of Wheat
Previous Article in Journal
Transcriptome Analysis Deciphers the Underlying Molecular Mechanism of Peanut Lateral Branch Angle Formation Using Erect Branching Mutant
Previous Article in Special Issue
PlantMine: A Machine-Learning Framework to Detect Core SNPs in Rice Genomics
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

GWAS for Drought Resilience Traits in Red Clover (Trifolium pratense L.)

1
Plant Sciences Unit, ILVO (Flanders Research Institute for Agriculture, Fisheries and Food), Caritasstraat 39, 9090 Melle, Belgium
2
Department of Plant Biotechnology and Bioinformatics, Faculty of Sciences, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
3
Molecular Plant Breeding, Institute of Agricultural Sciences, ETH Zurich, Universitaetstrasse 2, 8092 Zurich, Switzerland
4
Faculty of Bioscience Engineering, Ghent University, Coupure Links 653, 9000 Gent, Belgium
5
Laboratory of Plant Breeding, Wageningen University & Research, Droevendaalsesteeg 1, 6708 PB Wageningen, The Netherlands
*
Author to whom correspondence should be addressed.
Genes 2024, 15(10), 1347; https://doi.org/10.3390/genes15101347
Submission received: 19 September 2024 / Revised: 15 October 2024 / Accepted: 17 October 2024 / Published: 21 October 2024
(This article belongs to the Special Issue Genomic Studies of Plant Breeding)

Abstract

:
Red clover (Trifolium pratense L.) is a well-appreciated grassland crop in temperate climates but suffers from increasingly frequent and severe drought periods. Molecular markers for drought resilience (DR) would benefit breeding initiatives for red clover, as would a better understanding of the genes involved in DR. Two previous studies, as follows, have: (1) identified phenotypic DR traits in a diverse set of red clover accessions; and (2) produced genotypic data using a pooled genotyping-by-sequencing (GBS) approach in the same collection. In the present study, we performed genome-wide association studies (GWAS) for DR using the available phenotypic and genotypic data. Single nucleotide polymorphism (SNP) calling was performed using GBS data and the following two red clover genome assemblies: the recent HEN-17 assembly and the Milvus assembly. SNP positions with significant associations were used to delineate flanking regions in both genome assemblies, while functional annotations were retrieved from Medicago truncatula orthologs. GWAS revealed 19 significant SNPs in the HEN-17-derived SNP set, explaining between 5.3 and 23.2% of the phenotypic variation per SNP–trait combination for DR traits. Among the genes in the SNP-flanking regions, we identified candidate genes related to cell wall structuring, genes encoding sugar-modifying proteins, an ureide permease gene, and other genes linked to stress metabolism pathways. GWAS revealed 29 SNPs in the Milvus-derived SNP set that explained substantially more phenotypic variation for DR traits, between 5.3 and 42.3% per SNP–trait combination. Candidate genes included a DEAD-box ATP-dependent RNA helicase gene, a P-loop nucleoside triphosphate hydrolase gene, a Myb/SANT-like DNA-binding domain protein, and an ubiquitin–protein ligase gene. Most accessions in this study are genetically more closely related to the Milvus genotype than to HEN-17, possibly explaining how the Milvus-derived SNP set yielded more robust associations. The Milvus-derived SNP set pinpointed 10 genomic regions that explained more than 25% of the phenotypic variation for DR traits. A possible next step could be the implementation of these SNP markers in practical breeding programs, which would help to improve DR in red clover. Candidate genes could be further characterized in future research to unravel drought stress resilience in red clover in more detail.

1. Introduction

Grasslands cover more than a third of the European agricultural area [1]. They play important roles in livestock feeding and deliver valuable ecosystem services [1]. Red clover (T. pratense) is a short-lived perennial legume crop that is often included in grasslands for its nitrogen fixation capacity, its protein-rich forage, and its high palatability [2]. Red clover is naturally a diploid species (2n = 2x = 14) with the following two maturity types: the late flowering single-cut type; and the early flowering, quickly re-growing double-cut type [3]. Red clover is a strictly outcrossing crop, and populations or accessions are genetically and phenotypically highly diverse [4], rendering genetic work generally difficult [5].
Climate change is causing warmer summers with more frequent and more prolonged drought periods in Europe [6]. Grasslands are particularly vulnerable to drought; studies in European grasslands predict annual yield losses between 30% and 40% due to drought [7,8]. The general understanding of plant drought responses and the various coping mechanisms for abiotic stress has advanced significantly in recent years. Comprehensive reviews are available on the physiological basis of drought responses in plants [9] and on drought signaling pathways [10]. Important pathways for drought responses include calcium signaling pathways, MAPK-dependent pathways that control a wide range of developmental and stress–response functions, and pathways involving abscisic acid (ABA) [10]. The phytohormone ABA plays a key role in drought responses, influencing stomatal closure and activating drought-responsive genes through specific pathways [10]. However, other phytohormones, such as salicylic acid, auxin, ethylene, jasmonic acid, brassinosteroid and gibberellin, are involved in drought responses, although their exact roles are often not clear [11]. In addition, transcription factors are crucial components in drought responses as regulators of stress-responsive genes [12].
Red clover is regarded as a rather drought-resilient grassland crop [2], thanks to its deep taproot development [3]. Nonetheless, in the light of climate change, breeding for drought resilience (DR) becomes increasingly important in red clover. However, the cross-pollinating reproduction system of red clover implies phenotyping numerous individual plants under water-limited conditions, crossing the better individuals, and continuing selection for multiple cycles [13,14]. Moreover, DR is a quantitatively inherited trait. As a consequence, although some results have been achieved [15], breeding progress for DR is generally slow in red clover. Knowing which genomic regions are associated with a beneficial drought response can be the basis to develop molecular markers for DR in red clover. Molecular markers would allow for the screening of large numbers of young plants and genetically identify promising material at the start of a breeding trial, reducing the need for a laborious phenotypic screening [13]. In addition, candidate genes related to DR could be revealed, which may increase insights into the physiological mechanisms driving drought responses in red clover. Genome-wide association studies (GWAS) are a widely used approach with which to study the genetic basis of phenotypic variation in crops [16]. A prerequisite for GWAS is the availability of genome-wide molecular marker data. In cross-pollinated crops, which include most grassland crops like red clover, phenotypic data from field or greenhouse experiments and genotyping data are more relevant at the population level than at the individual plant level [17]; reduced representation genome sequencing methods, like genotyping-by-sequencing (GBS), are an effective method with which to quantify allelic variants at the population level (genome-wide allele frequency fingerprinting). GWAS have been used to elucidate the genetic control of complex traits in various grassland legumes such as forage quality traits [18] and autumn dormancy [19] in lucerne (Medicago sativa), and disease resistance [20] and freezing tolerance [21] in red clover. Nonetheless, our knowledge on the genetic control of drought responses in grassland legumes, and especially in red clover, remains limited.
In previous work [22], we investigated drought responses in 395 red clover accessions of the EUCLEG collection in a two-year field trial with mobile rain-out shelters. Drought caused substantial reductions in canopy cover (CC) and canopy height (CH), with visible reductions starting roughly two weeks after the onset of drought and continuing during the recovery period until late in the growing seasons. Plants reacted differently to drought in the first and second year of production, as plants were older in year 2, and may have developed deeper rooting that could alter drought responses. The same collection of 395 accessions has been genotyped by Frey et al. [20] using a pooled GBS approach followed by single nucleotide polymorphism (SNP) frequency profiling using the Milvus red clover genome assembly [23]. Recently, a new, high-quality, long read-based assembly of the red clover genome of a different red clover genotype (HEN-17) became available [5]. This HEN-17 genome assembly presents a substantially improved reference genome sequence, with improved per-base quality, more than 500 times fewer contigs, and a three-fold increase in contig N50 compared to the previous assembly [5]. The HEN-17 assembly is nearly 20% longer than the Milvus assembly (413.5 Mbp vs. 350 Mbp), which is closer to the predicted genome size of red clover. The Milvus genome annotation contains 39,948 protein coding genes; the HEN-17 genome annotation contains 33,610 protein coding genes.
The present work was performed in the framework of the EUCLEG project that focuses on legume crop breeding. In the present paper, we build further on the following two studies: the study by Frey et al. [20], in which 395 red clover populations with relevance for European breeding activities were genotyped by GBS; and the study by Vleugels et al. [22], in which the phenotypic variation for DR was investigated in the same collection. The objectives of the present study were, as follows: (i) to perform GWAS to identify quantitative trait loci (QTLs) for DR in red clover; and (ii) to identify potential candidate genes in the genomic regions underlying the QTL associated with DR in red clover. We have performed GWAS analysis in parallel based on two SNP sets, using either the Milvus assembly [23], or the HEN-17 assembly [5] as references for SNP calling.

2. Materials and Methods

2.1. Phenotypic Data for DR

Phenotypic data were used from our previous work [22], in which we phenotyped a panel of 395 red clover accessions for drought responses in two parallel fields, as follows: a ‘drought-treated’ field where drought was simulated using rain-out shelters; and a ‘control field’ that was irrigated and served to calculate relative performance indices. Four cuts were taken per year, and drought periods lasting six to eight weeks were imposed between the first and second cuts in two subsequent production years or growing seasons (Figure 1). CC and CH were monitored throughout the two growing seasons. Subsequently, relative performance indices (Yr), displaying the relative performance of the drought-treated plots compared to the control plots, were obtained for each accession and each trait, using the following formula:
Yr = (Control − Drought)/Control
Positive Yr values for CC or CH indicate lower observed values of coverage or height for that accession in the drought-treated field compared to the control field.
For the present study, we defined drought resilient accessions as accessions that display small differences in terms of CC or CH either during the drought period (cut 1–2) or in two recovery periods (cuts 2–3 and 3–4). We used Yr indices for CC and CH for both growing seasons, year 1 and year 2, from the middle of the drought treatments until the middle of the 2nd growing periods after drought, further named recovery periods. To ensure meaningful Yr indices, only observations obtained at least 10 days after a cut were included. A timeline visualizing the trial and the phenotypic data used in this paper are given in Figure 1. For year 1, we included Yr indices for CC and CH at DOY(day-of-year) 165, 178, 198, 205, 218, 225, 238 and 245; for year 2, Yr indices for CC and CH at DOY 175, 189, 195, 212, 220, 225, 245 and 258 were used. This rendered a total of 32 traits, 16 in year 1 and 16 in year 2 (Table 1). GWAS was performed individually for each Yr index, further named DR trait.

2.2. GBS

A full description of the methods for GBS genotyping and SNP calling of the 395 red clover accessions is provided in Frey et al. [20]. We used SNP data obtained with the Milvus genome assembly [23] as a reference for read mapping, as available from Frey et al. [20]. In parallel, we ran the SNP calling analysis for the GBS data of the EUCLEG red clover collection using the more recent HEN-17 genome assembly as reference sequence [5]. Read mapping was performed using Bowtie (v2.4.4) [24], while SNP calling and filtering were performed using bcftools (v1.16) [25]. SNP data were filtered for biallelic genotype calls with a minimum read depth of 30 and at least 100 accessions genotyped per SNP position.

2.3. GWAS

GWAS was performed using the R-package ‘GAPIT’ in R version 4.2.2., implemented in RStudio [26,27], applying the Bayesian-information and linkage-disequilibrium iteratively nested keyway (BLINK) model [28,29]. BLINK was chosen as the preferred model for its superior handling of genetic diversity and population structure, as well as its ability to minimize redundant signals across linkage blocks. While other models such as the general linear model (GLM) tests markers independently, BLINK offers a more refined and efficient approach by retaining only the most significant SNP from each chromosomal region. This allows for a clearer distinction of significant associations, avoiding noise from multiple linked SNPs within the same region. A key strength of BLINK is its ability to merge marker–trait associations within a linkage block, reducing false positives and enhancing the precision of detected associations. By focusing only on the most relevant SNP in each region, BLINK avoids signal dilution that can occur when multiple linked markers are considered, giving it a distinct advantage over other GWAS models.
For our analysis, BLINK was configured with default settings and a linkage disequilibrium threshold of 0.2, ensuring that SNP–trait associations were both robust and well-controlled. Associations were considered significant if they surpassed a threshold of -Log10 (p-value) ≥ 6. To evaluate the contribution of each SNP, the phenotypic variance explained (R2) was calculated via linear regression between phenotypic data and SNP genotype data using the lm() function in R 4.2.2. SNPs were selected based on an R2 threshold of greater than 0.05 and a Minor Allele Frequency (MAF) above 0.05 within the total association mapping panel. Closely linked SNPs were inspected carefully, and if one SNP exhibited lower p-value, MAF, or R2 values compared to others, the lower-performing SNP was removed. Effect sizes and the phenotypic variance explained by SNPs associated with traits were represented by the regression coefficient (β) and R2, derived from a linear model with Best Linear Unbiased Estimations (BLUEs) as the response variable, with SNPs treated as fixed effects.

2.4. Identification of Candidate Genes

For the 21 significant SNP–trait associations found in the HEN-17-derived SNP set, flanking genes were identified within a 50 kb region upstream and downstream of the significant SNP in the HEN-17 assembly [5] (GCF_020283565.1). Subsequently, we performed a BLASTx search with the corresponding gene sequence against the M. truncatula proteome to retrieve the functional gene annotations, using PLAZA5.0 [30]. Similarly, for the significant SNP–trait associations from the Milvus-derived SNP set, we first identified flanking genes in the Milvus assembly (50 kb flanking sequences upstream and downstream, assembly GCA_900079335.1; ensemble annotation v2.1 [31]). Subsequently, we used the corresponding gene sequence to perform a BLASTx search against the predicted gene set of the HEN-17 assembly, and the M. truncatula proteome, as described above, to retrieve the corresponding functional annotation descriptions. For all significant associations, all known genes in a flanking region of ±50 kb from each SNP, together with their orthologs and functional annotations in M. truncatula (if available), are given in Supplementary Data File S1. Given the low degree of linkage disequilibrium in red clover [21], in what follows we only considered candidate genes within regions of ±5 kb upstream or downstream from each significant SNP (Supplementary Data File S2).

3. Results

Mapping of the GBS read data on the Milvus genome assembly provided 20,137 SNP markers, polymorphic at MAF ≥ 0.05, spread over the seven red clover chromosomes and multiple unanchored scaffolds. The SNP calling on the HEN-17 assembly provided 59,343 SNP markers spread over the 7 chromosomes, and polymorphic at MAF ≥ 0.05.
As the recent long read-based HEN-17 assembly [5] presents an improved reference genome sequence, with substantially more genes located on chromosomes instead of on scaffolds compared to the Milvus assembly [23], we first performed GWAS using the HEN-17-derived SNP set. This resulted in 19 SNPs that were significantly associated with improved drought responses after Bonferroni correction (α = 5%), explaining between 5.3 and 23.2% of the phenotypic variation for various DR traits (Figure 2A, Table 2). Three SNPs were associated with drought responses during the drought period, while the remaining 16 SNPs were associated with responses during the recovery periods. The full results of the GWAS are shown in Supplementary Data File S1. The following two SNPs explained more than 15%: NC_060063.1_4412473 on LG5; and NC_060064.1_36266159 on LG6. No candidate genes were identified in the flanking regions (±5 kb) of these SNPs. GWAS indicated additional SNPs with minor effect. First, two genes related to cell wall structuring were highlighted as follows: genes with homology to arabinosyltransferase (year 1, CC_198: 9.2% of variation explained); and pectinase inhibitors (year 2, CC_212: 5.9% of variation explained). Second, associations were found with SNPs near genes encoding sugar-modifying proteins, such as glycosyltransferase genes for CC_195 and CC_225 in year 2, explaining 11.3% and 5.3% of the phenotypic variation, respectively. Additionally, an SNP flanking a gene encoding an α-galactosidase was significantly associated to CC_198 in year 2, explaining 8.4% of the phenotypic variation. Third, an SNP significantly associated to CH_225 in year 1, explaining 5.9% of the phenotypic variation, was located in a gene encoding an ureide permease. Finally, we found associations with an SNP explaining 6.6% of the phenotypic variation for CC_258 in year 2, located in a gene containing an Ma3-binding domain, and an SNP explaining 8.4% of the phenotypic variation for CC_198 in year 1, with its flanking gene encoding a MAP kinase.
As the associations revealed using the HEN-17-derived SNP set explained only minor percentages of phenotypic variation, we decided to re-run our GWAS using the Milvus-derived SNP set. GWAS analysis with the Milvus-derived SNP set yielded 29 SNPs that were significantly associated with DR: 14 SNPs in the drought period and 15 SNPs in the recovery period (Figure 2B, Table 2). Considerably larger percentages of phenotypic variation were explained compared to the HEN-17-derived SNP set, up to 42.3%. Eleven SNPs on LG1, LG3, LG6, LG7 and an unanchored scaffold (LG1_12991269, LG1_24162580, LG1_3190272, LG1_4925076, LG1_4925141, LG3_1774487, LG3_3530352, LG6_9239088, LG6_9239175, LG7_16795407, and scaf_678_58484), among which were two closely linked SNPs on LG1, explained more than 25% of the phenotypic variation for DR traits. The full results of the GWAS analysis are shown in Supplementary Data File S1. Next, we located the closest flanking candidate genes for the Milvus-derived SNPs that were significantly associated to DR traits. First, three associations that explained large percentages of phenotypic variation in year 1 (40.3% for CC_178, 34.4% for CH_218, and 32.9% for CH_225) were found with SNPs located in genes encoding transmembrane proteins. During the drought periods, we found associations with SNPs flanking four candidate genes. The first association was found in year 2 between CH_189 and an SNP in a flowering control gene, explaining 36.6% of the phenotypic variation. Furthermore, CH_178 and CH_189 were associated with two SNPs flanking a DEAD-box ATP-dependent RNA helicase gene (26.5% and 29.1% of the phenotypic variation explained, respectively). Also, an association was found between CC_178 and an SNP flanking a P-loop nucleoside triphosphate hydrolase gene, which explained 31.1% of the phenotypic variation. Finally, two associations were found between CC_189 and CC_195 in year 2 and an SNP in a gene encoding a Myb/SANT-like DNA-binding domain protein, explaining 18.8% and 22.2% of the phenotypic variation, respectively. In the recovery periods, an additional association was observed between CC_218 and an SNP in an ubiquitin–protein ligase gene, which explained 12.6% of the phenotypic variation.

4. Discussion

GWAS yielded surprisingly different results depending on the genome assembly that was used for SNP calling. Although the HEN-17 reference genome assembly is more complete than the Milvus genome assembly [5], it resulted in fewer significant associations and SNPs that explained lower percentages of phenotypic variation. Multiple strong associations that were detected in the Milvus-derived SNP set, were not detected in the HEN-17-derived SNP set. One possible explanation may be that red clover is a genetically diverse species [3]; the primary sequence of the genome assemblies of divergent genotypes reflects that diversity. If a major QTL locus is present in a set of accessions but absent in the genotype used for the genome assembly, no reads can map to that locus, and no SNPs are called, even if GBS data from the GWAS panel accessions contains reads derived from that locus. On a more subtle level, if the QTL locus is present but the reference sequence is divergent from the alleles present in the GWAS panel accessions, a mapping bias is created against increasingly more divergent alleles. While this allows for the identification of SNPs, it may cause a shift in observed allele frequencies (divergent alleles are lost from the alternative allele frequency count). As a consequence, the power to detect significant markers for that QTL is reduced or lost entirely. Thus, a particular genome assembly may be fit for read mapping and SNP frequency profiling for certain genotypes or populations, while not for others. The set of accessions used for this study comprised mostly European material [22]. The cultivar ‘Milvus’ originates from Switzerland, whereas ‘HEN-17’ is a North American genotype. Possibly, the majority of accessions in this study may have been genetically closer to the Milvus genotype than to HEN-17, which could explain why the Milvus-derived SNP set yielded more and stronger associations. Such within-species genome diversity has previously been described in pan-genome studies, as well as the consequences of a reference bias for quantitative genetics studies, in rice (Oryza sativa) [43,44], barley (Hordeum vulgare) [45], maize (Zea mays) [46], and in the human genome [47]. Notably, such a bias may only be noticed when multiple reference genomes are used in parallel for SNP calling and in subsequent GWAS analysis, which is not routinely conducted in published GWAS studies.
Briefly, when using the HEN-17 assembly for SNP calling, a number of associated SNPs were found flanking genes that have been suggested to be involved in drought responses. Glycosyltransferases and other genes related to cell wall structuring are known to be involved in drought responses [35,36]. In particular, rice plants with over-expression of certain glycosyltransferases display enhanced drought tolerance at the seedling and at the mature stage [35]. The expression of certain α-galactosidase genes increases tolerance to abiotic stress [33], and to drought stress in particular, for example, in New Zealand spinach (Tetragonia tetragonioides) [48]. Ureides accumulate under drought and play important roles in drought responses in legume species [48]. Finally, Ma3-binding domains and MAP kinases have been suggested to be involved in stress metabolism [10,37]. It would be interesting to further study these associations in future research.
In what follows, we focus only on the results obtained with the Milvus genome assembly for SNP calling. We performed GWAS for CC and CH, phenotyped during the drought periods and the recovery periods of two growing seasons. This revealed 10 genomic regions on LG1, LG3, LG6, LG7, and scaf_678 that explained more than 25% of the phenotypic variation for DR per SNP–trait combination at various time points during and after drought (Table 2). These SNPs could be used as molecular markers for DR breeding in red clover, and may help to speed up breeding for DR. We uncovered 10 and 13 associations in the Milvus-derived SNP set, explaining on average 18.4% and 25.5% of the phenotypic variation for CC and CH, respectively. First, we identified an SNP in an ubiquitin–protein ligase gene [39]. Ubiquitin–protein ligases are known to negatively regulate ABA-mediated drought responses by ubiquitinating receptor-like protein kinases [39,40]. Furthermore, we identified transcription factors such as MYB transcription factors. The MYB family represents a large, functionally diverse class of proteins that act as transcription factors [42]. In plants, ABA regulates drought responses following a pathway that involves MYB transcription factors [42]. In addition, we found associations with SNPs in drought-responsive genes such as a DEAD-box ATP-dependent RNA helicase gene, and an SNP flanking a P-loop nucleoside triphosphate hydrolase gene [38,49]. DEAD-box RNA helicases are a large family of genes that play key roles in abiotic stress responses through regulating membrane lipid peroxidation [38]. Their involvement in drought responses has been well documented in species including Arabidopsis thaliana, rice, and wheat (Triticum aestivum) [38,50]. In addition, we found three associations with SNPs in genes encoding transmembrane proteins. Although only few transmembrane proteins have been functionally characterized, various transmembrane proteins have been shown to be involved in drought responses, e.g., in A. thaliana, banana (Musa sp.) and tobacco (Nicotiana tabacum) [34]. Up to 30% of all eukaryote genes are predicted to encode transmembrane proteins, fulfilling functions in signal transmission, the transport of nutrients, energy conversion, and stress response [34]. Finally, we found an interesting association with a flowering control gene, which may control plant development and/or the timing of maturity. This association may explain why two distinct drought response strategies exist in red clover, ‘drought tolerance’ and ‘drought survival’, which largely coincide with the maturity type [22,41]. The set of accessions screened here comprised both maturity types that may exhibit allelic variations in the flowering control gene alongside their different drought response strategies.
Different associations were found for the different traits (CC and CH) and the different growing seasons, with remarkably little overlap. In year 2, CC_189 and CC_195 were associated with a single SNP in a Myb/SANT-like DNA-binding domain gene. One SNP–trait association appeared in the two growing seasons: CH_178 and CH_189, respectively phenotyped at the end of the drought periods in years 1 and 2, were associated with SNPs located in a DEAD-box ATP-dependent RNA helicase gene. This poor overlap is not entirely unexpected, as we showed in our previous study that CC and CH act largely independent, and that the overall phenotypic response of the screened red clover accessions differed throughout the growing seasons, and between the two growing seasons [22].
Drought responses have also been studied in other grassland legumes. In lucerne, various putative genes related to drought responses were identified, including genes encoding transcription factors, protein receptor-like kinases, DNA-binding domain proteins, and phospholipase-like proteins [51,52,53,54]. In lucerne, DR is linked to the capacity for sustaining photosynthetic activity, optimizing root development, enhancing water use efficiency, regulating osmotic potential, accumulating minerals (such as potassium) or organic solutes like proline, and adjusting carbohydrate metabolism to favor the accumulation of soluble sugars [55]. Further research is required to unravel the specific mechanisms driving DR in red clover. An important remark in this context is that we conducted GWAS on allele frequencies per accession, unveiling the substantial genetic variation across populations in outcrossing species, such as red clover, and eliminating the necessity to sequence thousands of individuals [17]. When sequencing resources are limited, population-level association studies are a valuable and effective approach for identifying key SNPs. However, to validate significant SNPs and thoroughly characterize allelic variation in candidate genes, genotyping individual genotypes is essential [20].
We have identified a number of candidate genes that are associated with drought responses in the whole set of 395 red clover EUCLEG accessions. However, it is likely that many more drought response genes are present in these accessions. In GWAS, associations that are present in a smaller fraction of accessions only, but uncommon in the total set of accessions, are unlikely to be detected. Therefore, GWAS typically will produce false-negative results, rather than false-positive results. Due to the large genetic diversity in the set of accessions screened in this study, we expect that numerous, more rare associations will be present in a limited set of (related) accessions only. To identify such associations, a separate GWAS study should be conducted with the accessions of interest only. While our study suggested a number of candidate genes, no functional validation was conducted. Future research should perform functional validation of the candidate genes suggested in this study, so that the specific mechanisms driving DR in red clover can be further studied.

5. Conclusions

The present study applied GWAS to investigate the genetic control of DR in the EUCLEG red clover collection, relevant for European breeding initiatives. The GWAS identified 48 significant SNP–trait associations for DR traits but depended on the red clover genome assembly used for SNP calling. Based on these SNP–trait associations, we proposed a total of 16 candidate genes for DR in red clover. Several genes known to be involved in drought responses, such as a MYB transcription factor and an ubiquitin–protein ligase, were found by our GWAS. Our results also suggest a link between a flowering control gene, possibly related to the maturity type, and drought responses. A next step could be the implementation of molecular markers for DR in practical breeding programs, which would enable breeders to speed up breeding progress. The genomic regions uncovered in this study can be further investigated to validate their potential contribution to genomic-assisted breeding, or to further characterize important genes associated with DR. This would help to uncover the physiological mechanisms driving DR in red clover.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes15101347/s1, Supplementary Data File S1: Known genes in a flanking region of ±50,000 bp from each SNP, together with their orthologs and functional annotations in M. truncatula (if available) for all significant associations found using the HEN-17-derived and the Milvus-derived SNP sets; Supplementary Data File S2: Known genes in a flanking region of ±5000 bp from each SNP, together with their orthologs and functional annotations in M. truncatula (if available) for all significant associations found using the HEN-17-derived and the Milvus-derived SNP sets.

Author Contributions

T.V. conceptualized the study and was in charge of writing the article. R.D. and A.S. performed data collection and preliminary data investigation. T.R., D.A.-S. and H.M. performed formal data analysis and data curation. I.R.-R. conceptualized the study and acquired funding. H.M. conceptualized the study, coordinated the work, and assisted in data interpretation. All authors have read and agreed to the published version of the manuscript.

Funding

This work has received funding from the European Union’s Horizon 2020 Program for Research and Innovation under grant agreement no. 727312 (project: “EUCLEG—Breeding forage and grain Legumes to increase EU’s and China’s protein self-sufficiency”); http://www.eucleg.eu, accessed on 19 October 2024.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data generated or analyzed during this study are included in this published article [and its Supplementary Data Files].

Acknowledgments

The authors would like to thank the partners of the EUCLEG Horizon 2020 project for fruitful discussions on GWAS methodologies. We thank Irene Borra Serrano for her assistance with designing Figure 1.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

ABA: abscisic acid; BLINK: linkage-disequilibrium iteratively nested keyway; CC: canopy cover; CH: canopy height; DOY: day of the year; DR: drought resilience; GBS: genotyping-by-sequencing; GWAS: genome-wide association study; QTL: quantitative trait locus; SNP: single nucleotide polymorphism; Yr: relative performance index.

References

  1. Smit, H.; Metzger, M.; Ewert, F. Spatial distribution of grassland productivity and land use in Europe. Agric. Syst. 2008, 98, 208–219. [Google Scholar] [CrossRef]
  2. Annicchiarico, P.; Barrett, B.; Brummer, E.C.; Julier, B.; Marshall, A.H. Achievements and challenges in improving temperate perennial forage legumes. Crit. Rev. Plant Sci. 2015, 34, 327–380. [Google Scholar] [CrossRef]
  3. Taylor, N.L.; Quesenberry, K.H. Red Clover Science; Kluwer Academic Publishers: Dordrecht, The Netherlands, 1996; p. 228. ISBN 0-7923-3887-1. [Google Scholar]
  4. Jones, C.; De Vega, J.; Lloyd, D.; Hegarty, M.; Ayling, S.; Powell, W.; Skot, L. Population structure and genetic diversity in red clover (Trifolium pratense L.). Sci. Rep. 2020, 10, 8364. [Google Scholar] [CrossRef] [PubMed]
  5. Bickhart, D.M.; Koch, L.M.; Smith, T.P.L.; Riday, H.; Sullivan, M.L. Chromosome-scale assembly of the highly heterozygous genome of red clover (Trifolium pratense L.), an allogamous forage crop species. Gigabyte 2022, 2022, gigabyte42. [Google Scholar] [CrossRef] [PubMed]
  6. IPCC. Climate Change 2023: Synthesis Report. Contribution of Working Groups I, II and III to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change; IPCC: Geneva, Switzerland, 2023; p. 81. [Google Scholar] [CrossRef]
  7. Ciais, P.; Reichstein, M.; Viovy, N.; Granier, A.; Ogée, J.; Allard, V. Europe-wide reduction in primary productivity caused by the heat and drought in 2003. Nature 2005, 437, 529–532. [Google Scholar] [CrossRef]
  8. Emadodin, I.; Corral, D.E.F.; Reinsch, T.; Kluβ, C.; Taube, F. Climate change effects on temperate grassland and its implication for forage production: A case study from northern Germany. Agriculture 2021, 11, 232. [Google Scholar] [CrossRef]
  9. Tardieu, F.; Simonneau, T.; Muller, B. The physiological basis of drought tolerance in crop plants: A scenario-dependent probabilistic approach. Annu. Rev. Plant Biol. 2018, 69, 733–759. [Google Scholar] [CrossRef]
  10. Movahedi, A.; Dzinyela, R.; Aghaei-Dargiri, S.; Alhassan, A.R.; Yang, L.; Xu, C. Advanced study of drought-responsive protein pathways in plants. Agronomy 2023, 13, 849. [Google Scholar] [CrossRef]
  11. He, C.; Du, Y.; Fu, J.; Zeng, E.; Park, S.; White, F.; Zheng, J.; Liu, S. Early drought-responsive genes are variable and relevant to drought tolerance. G3 Genes Genomes Genet. 2020, 10, 1657–1670. [Google Scholar] [CrossRef]
  12. Lui, F.; Xi, M.; Liu, T.; Wu, X.; Ju, L.; Wang, D. The central role of transcription factors in bridging biotic and abiotic stress responses for plants’ resilience. New Crops 2024, 1, 100005. [Google Scholar] [CrossRef]
  13. Cyriac, D.; Hofmann, R.W.; Stewart, A.; Sathish, P.; Winefield, C.S.; Moot, D.J. Intraspecific differences in long-term drought tolerance in perennial ryegrass. PLoS ONE 2018, 13, e0194977. [Google Scholar] [CrossRef] [PubMed]
  14. De Swaef, T.; Maes, W.H.; Aper, J.; Baert, J.; Cougnon, M.; Reheul, D.; Steppe, K.; Roldán-Ruiz, I.; Lootens, P. Applying RGB- and thermal-based vegetation indices from UAVs for high-throughput field phenotyping of drought tolerance in forage grasses. Remote Sens. 2021, 13, 147. [Google Scholar] [CrossRef]
  15. Boucher, K. Breeding for Improved Drought Tolerance in Red Clover (Trifolium pratense L.). Master’s Thesis, University of Guelph, Guelph, ON, Canada, 2016. [Google Scholar]
  16. Huang, X.; Han, B. Natural variations and genome-wide association studies in crop plants. Ann. Rev. Plant Biol. 2014, 65, 531–551. [Google Scholar] [CrossRef] [PubMed]
  17. Byrne, S.; Czaban, A.; Studer, B.; Panitz, F.; Bendixen, C.; Asp, T. Genome wide allele frequency fingerprints (GWAFFs) of populations via genotyping by sequencing. PLoS ONE 2013, 8, e57438. [Google Scholar] [CrossRef] [PubMed]
  18. Biazzi, E.; Nazzicari, N.; Pecetti, L.; Brummer, E.C.; Palmonari, A.; Tava, A.; Annicchiarico, P. Genome-wide association mapping and genomic selection for alfalfa (Medicago sativa) forage quality traits. PLoS ONE 2017, 12, e0169234. [Google Scholar] [CrossRef]
  19. Pégard, M.; Barre, P.; Delaunay, S.; Surault, F.; Karagic, D.; Milic, D.; Zoric, M.; Ruttink, T.; Julier, B. Genome-wide genotyping data renew knowledge on genetic diversity of a worldwide alfalfa collection and give insights on genetic control of phenology traits. Front. Plant Sci. 2023, 14, 1196134. [Google Scholar] [CrossRef]
  20. Frey, L.A.; Vleugels, T.; Ruttink, T.; Schubiger, F.X.; Pégard, M.; Skøt, L.; Grieder, C.; Studer, B.; Roldán-Ruiz, I.; Kölliker, R. Phenotypic variation and quantitative trait loci for resistance to southern anthracnose and clover rot in red clover. Theor. Appl. Genet. 2022, 135, 4337–4349. [Google Scholar] [CrossRef]
  21. Zanotto, S.; Ruttink, T.; Pégard, M.; Skot, L.; Grieder, C.; Kölliker, R.; Ergon, A. A genome-wide association study of freezing tolerance in red clover (Trifolium pratense L.) germplasm of European origin. Front. Plant Sci. 2023, 14, 1189662. [Google Scholar] [CrossRef]
  22. Vleugels, T.; Saleem, A.; Dubey, R.; Muylle, H.; Borra-Serrano, I.; Lootens, P.; De Swaef, T.; Roldán-Ruiz, I. Phenotypic characterization of drought responses in red clover (Trifolium pratense L.). Front. Plant Sci. 2024, 14, 1304411. [Google Scholar] [CrossRef]
  23. De Vega, J.J.; Ayling, S.; Hegarty, M.; Kudrna, D.; Goicoechea, J.L.; Ergon, Å.; Rognli, O.A.; Jones, C.; Swain, M.; Geurts, R.; et al. Red clover (Trifolium pratense L.) draft genome provides a platform for trait improvement. Sci. Rep. 2015, 5, 17394. [Google Scholar] [CrossRef]
  24. Langmead, B.; Salzberg, S. Fast gapped-read alignment with Bowtie 2. Nat. Methods 2012, 9, 357–359. [Google Scholar] [CrossRef] [PubMed]
  25. Danecek, P.; Bonfield, J.K.; Liddle, J.; Marshall, J.; Ohan, V.; Pollard, M.O.; Whitwham, A.; Keane, T.; McCarthy, S.A.; Davies, R.M.; et al. Twelve years of SAMtools and BCFtools. GigaSci 2021, 10, giab008. [Google Scholar] [CrossRef] [PubMed]
  26. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2023; Available online: https://www.R-project.org (accessed on 19 October 2024).
  27. RStudio Team. RStudio: Integrated Development for R; PBC: Boston, MA, USA, 2023; Available online: http://www.rstudio.com (accessed on 19 October 2024).
  28. Huang, M.; Liu, X.; Zhou, Y.; Summers, R.M.; Zhang, Z. BLINK: A package for the next level of genome-wide association studies with both individuals and markers in the millions. Gigascience 2019, 8, giy154. [Google Scholar] [CrossRef] [PubMed]
  29. Lipka, A.E.; Tian, F.; Wang, Q.; Peiffer, J.; Li, M.; Bradbury, P.J.; Gore, M.A.; Buckler, E.S.; Zhang, Z. GAPIT: Genome association and prediction integrated tool. Bioinformatics 2012, 28, 2397–2399. [Google Scholar] [CrossRef] [PubMed]
  30. Van Bel, M.; Silvestri, F.; Weitz, E.M.; Kreft, L.; Botzki, A.; Coppens, F.; Vandepoele, K. PLAZA 5.0: Extending the scope and power of comparative and functional genomics in plants. Nucleic Acids Res. 2022, 50, 1468–1474. [Google Scholar] [CrossRef]
  31. Martin, F.J.; Amode, M.R.; Aneja, A.; Austine-Orimoloye, O.; Azov, A.G.; Barnes, I.; Becker, A.; Bennett, R.; Berry, A.; Bhai, J.; et al. Ensembl 2023. Nucleic Acids Res. 2023, 51, 933–941. [Google Scholar] [CrossRef]
  32. Baral, B.; Izaguirre-Mayoral, M.L. Chapter four—Purine-derived ureides under drought and salinity. Adv. Agron. 2017, 146, 167–204. [Google Scholar] [CrossRef]
  33. Pennycooke, J.C.; Jones, M.L.; Stushnoff, C. Down-regulating α-galactosidase enhances freezing tolerance in transgenic Petunia. Plant Physiol. 2002, 133, 901–909. [Google Scholar] [CrossRef]
  34. Zhou, Y.; Wang, B.; Yuan, F. The role of transmembrane proteins in plant growth; development; and stress responses. Int. J. Mol. Sci. 2022, 23, 13627. [Google Scholar] [CrossRef]
  35. Liu, Q.; Dong, G.; Ma, Y.; Zhao, S.; Liu, X.; Li, X.; Li, Y.; Hou, B. Rice glycosyltransferase gene UGT85E1 is involved in drought stress tolerance through enhancing abscisic acid response. Front. Plant Sci. 2021, 12, 790195. [Google Scholar] [CrossRef]
  36. Niu, L.; Liu, L.; Wang, W. Digging for stress-responsive cell wall proteins for developing stress-resistant maize. Front. Plant Sci. 2020, 11, 576385. [Google Scholar] [CrossRef] [PubMed]
  37. Lee, D.; Park, S.J.; Ahn, C.S.; Pai, H. MRF family genes are involved in translation control; especially under energy-deficient conditions; and their expression and functions are modulated by the tor signalling pathway. Plant Cell 2017, 29, 2895–2920. [Google Scholar] [CrossRef] [PubMed]
  38. Ru, J.-N.; Hou, Z.-H.; Zheng, L.; Zhao, Q.; Wang, F.-Z.; Chen, J.; Zhou, Y.-B.; Chen, M.; Ma, Y.-Z.; Xi, Y.-J.; et al. Genome-wide analysis of DEAD-box RNA helicase family in wheat (Triticum aestivum) and functional identification of TaDEAD-box57 in abiotic stress responses. Front. Plant Sci. 2021, 12, 797276. [Google Scholar] [CrossRef] [PubMed]
  39. Chen, X.; Wang, T.; Rehman, A.U.; Wang, Y.; Qi, J.; Li, Z.; Song, C.; Wang, B.; Yang, S.; Gong, Z. Arabidopsis U-box E3 ubiquitin ligase PUB11 negatively regulates drought tolerance by degrading the receptor-like protein kinases LRR1 and KIN7. J. Integr. Plant Biol. 2021, 63, 494–509. [Google Scholar] [CrossRef] [PubMed]
  40. Li, B.W.; Gao, S.; Yang, Z.M.; Song, J.B. The F-box E3 ubiquitin ligase AtSDR is involved in salt and drought stress responses in Arabidopsis. Gene 2022, 809, 146011. [Google Scholar] [CrossRef]
  41. Loucks, C.E.S.; Deen, W.; Gaudin, A.C.M.; Earl, H.J.; Bowley, S.R.; Martin, R.C. Genotypic differences in red clover (Trifolium pratense L.) response under severe water deficit. Plant Soil 2018, 425, 401–414. [Google Scholar] [CrossRef]
  42. Roy, S. Function of MYB domain transcription factors in abiotic stress and epigenetic control of stress response in plant genome. Plant Signal. Behav. 2016, 11, e1117723. [Google Scholar] [CrossRef]
  43. Wang, W.; Mauleon, R.; Hu, Z.; Chebotarov, D.; Tai, S.; Wu, Z.; Li, M.; Zheng, T.; Fuentes, R.R.; Zhang, F.; et al. Genomic variation in 3;010 diverse accessions of Asian cultivated rice. Nature 2018, 557, 43–49. [Google Scholar] [CrossRef]
  44. Shang, L.; Li, X.; He, H.; Yuan, Q.; Song, Y.; Wei, Z.; Lin, H.; Hu, M.; Zhao, F.; Zhang, C.; et al. A super pan-genomic landscape of rice. Cell Res. 2022, 32, 878–896. [Google Scholar] [CrossRef]
  45. Jayakodi, M.; Padmarasu, S.; Haberer, G.; Bonthala, V.S.; Gundlach, H.; Monat, C.; Lux, T.; Kamal, N.; Lang, D.; Himmelbach, A.; et al. The barley pan-genome reveals the hidden legacy of mutation breeding. Nature 2020, 588, 284–289. [Google Scholar] [CrossRef]
  46. Gui, S.; Wie, W.; Jiang, C.; Luo, J.; Chen, L.; Wu, S.; Li, W.; Wang, Y.; Li, S.; Yang, N.; et al. A pan-Zea genome map for enhancing maize improvement. Genome Biol. 2022, 23, 178. [Google Scholar] [CrossRef] [PubMed]
  47. Günther, T.; Nettelblad, C. The presence and impact of reference bias on population genomic studies of prehistoric human populations. PLoS Genet. 2019, 15, e1008302. [Google Scholar] [CrossRef] [PubMed]
  48. Hara, M.; Tokunaga, K.; Kuboi, T. Isolation of a drought-responsive alkaline α-galactosidase gene from New Zealand spinach. Plant Biotechnol. 2008, 25, 497–501. [Google Scholar] [CrossRef]
  49. Biniaz, Y.; Tahmasebi, A.; Afsharifar, A.; Tahmasebi, A.; Poczai, P. Meta-analysis of common and differential transcriptomic responses to biotic and abiotic stresses in Arabidopsis thaliana. Plants 2022, 11, 502. [Google Scholar] [CrossRef] [PubMed]
  50. Nawaz, G.; Kang, H. Rice OsRH58; a chloroplast DEAD-box RNA helicase; improves salt or drought stress tolerance in Arabidopsis by affecting chloroplast translation. BMC Plant Biol. 2019, 19, 17. [Google Scholar] [CrossRef] [PubMed]
  51. Li, Z.; Zou, Y.; Li, Y.; Sun, C.; Liu, Y.; Cui, Y.; Liu, J.; Yang, Z.; Chen, Y. Exploring the salt- and drought-tolerant genes of alfalfa through expression library screening strategy. Grass Res. 2021, 1, 11. [Google Scholar] [CrossRef]
  52. Lin, S.; Medina, C.A.; Boge, B.; Hu, J.; Fransen, S.; Norberg, S.; Yu, L.X. Identification of genetic loci associated with forage quality in response to water deficit in autotetraploid alfalfa (Medicago sativa L.). BMC Plant Biol. 2020, 20, 203. [Google Scholar] [CrossRef]
  53. Yu, L. Identification of single-nucleotide polymorphic loci associated with biomass yield under water deficit in alfalfa (Medicago sativa L.) using genome-wide sequencing and association mapping. Front. Plant Sci. 2017, 8, 1152. [Google Scholar] [CrossRef]
  54. Zhang, T.; Yu, L.; Zheng, P.; Li, Y.; Rivera, M.; Main, D.; Greene, S.L. Identification of loci associated with drought resistance traits in heterozygous autotetraploid alfalfa (Medicago sativa L.) using genome-wide association studies with genotyping by sequencing. PLoS ONE 2015, 10, e0138931. [Google Scholar] [CrossRef]
  55. Inès, S.; Talbi, O.; Nasreddine, Y.; Rouached, A.; Gharred, J.; Jdey, A.; Hanana, M.; Abdelly, C. Drought tolerance traits in Medicago species: A review. Arid Land Res. Manag. 2021, 36, 67–83. [Google Scholar] [CrossRef]
Figure 1. Timeline of the phenotypic data from Vleugels et al. [22] that were used for genome-wide association study (GWAS).
Figure 1. Timeline of the phenotypic data from Vleugels et al. [22] that were used for genome-wide association study (GWAS).
Genes 15 01347 g001
Figure 2. Manhattan plots from GWAS on DR in red clover, displaying significant associations for: (A) traits in the drought period; (B) the recovery periods of both years using the HEN-17-derived SNP set; (C) traits in the drought period; and (D) the recovery period of both years using the Milvus-derived SNP set. The dotted line represents the significance threshold after Bonferroni correction (α = 5%). Colors represent the different linkage groups (LG1 to LG7), and unmapped (UM) contigs.
Figure 2. Manhattan plots from GWAS on DR in red clover, displaying significant associations for: (A) traits in the drought period; (B) the recovery periods of both years using the HEN-17-derived SNP set; (C) traits in the drought period; and (D) the recovery period of both years using the Milvus-derived SNP set. The dotted line represents the significance threshold after Bonferroni correction (α = 5%). Colors represent the different linkage groups (LG1 to LG7), and unmapped (UM) contigs.
Genes 15 01347 g002
Table 1. Relative performance indices (Yr) for CC and CH used for GWAS, with their average and SD over all 395 accessions observed in Vleugels et al. [22].
Table 1. Relative performance indices (Yr) for CC and CH used for GWAS, with their average and SD over all 395 accessions observed in Vleugels et al. [22].
PeriodDOYYr (µ ± SD) forCut
CCCH
Year 1
Drought period1650.19 ± 0.020.31 ± 0.09
1780.44 ± 0.110.37 ± 0.09Cut 2
1st recovery period1980.30 ± 0.170.35 ± 0.07
2050.28 ± 0.120.59 ± 0.09
2180.15 ± 0.060.35 ± 0.12
2250.22 ± 0.060.07 ± 0.16Cut 3
2nd recovery period2380.12 ± 0.12−0.59 ± 0.09
2450.19 ± 0.090.09 ± 0.05
Year 2
Drought period1750.26 ± 0.08−0.10 ± 0.14
1890.27 ± 0.090.33 ± 0.09
1950.14 ± 0.100.22 ± 0.10Cut 2
1st recovery period2120.45 ± 0.090.27 ± 0.10
2200.29 ± 0.080.31 ± 0.10
2250.17 ± 0.080.21 ± 0.10
2450.01 ± 0.03−0.04 ± 0.11Cut 3
2nd recovery period2580.06 ± 0.11−2.75 ± 0.24
DOY: day-of-year; CC: canopy cover; CH: canopy height; Yr: relative performance index; SD: standard deviation. Cuts are indicated with horizontal lines and drought periods are shaded.
Table 2. Marker–trait associations from GWAS on DR in red clover, displaying significant associations using the HEN-17-derived and Milvus-derived SNP sets with genes that are known to be involved in drought responses.
Table 2. Marker–trait associations from GWAS on DR in red clover, displaying significant associations using the HEN-17-derived and Milvus-derived SNP sets with genes that are known to be involved in drought responses.
TraitYearPeriodSNP IDChrPosVar Expl (%)Effect (β)Candidate GeneKnown for DRDistance (bp)Protein Annotationin PLAZA 5.0Reference
Associations using the HEN-17-derived SNP set [5]
CH_16519DNC_060060.1 34138967LG234,138,9679.1−0.14
CH_17819DNC_060065.1 42224330LG742,224,3305.6−0.19
CC_23819RNC_060059.1 25418144LG125,418,1448.1−0.34
CH_23819RNC_060059.1 39822546LG139,822,5467.10.42
CC_21819RNC_060059.1 45472125LG145,472,12511.7−0.13
CC_24519RNC_060060.1 18680074LG218,680,0747.20.19
CH_22519RNC_060060.1 24894651LG224,894,6515.9−0.27ureide permease 1-likeSNP in geneureide permease-like protein[32]
CC_20519RNC_060060.1 37047857LG237,047,8576.00.25
CC_19819RNC_060061.1 9267342LG39,267,3428.40.33uncharacterized LOC123917029+1664α-galactosidase[33]
protein-tyrosine-phosphatase MKP1-like−3852MAP kinase phosphatase[10]
CH_20519RNC_060061.1 26803013LG326,803,01311.4−0.16
CH_20519RNC_060061.1 29630157LG329,630,1576.90.25uncharacterized LOC123915918−171transmembrane protein, putative[34]
CH_23819RNC_060062.1 52642219LG452,642,21911.00.67
CH_22519RNC_060063.1 4412473LG54,412,47316.40.24
CC_19819RNC_060063.1 6098087LG56,098,0879.20.26probable arabinosyltransferase ARAD1SNP in genesecondary cell wall glycosyltransferase family 47 protein[35]
CH_23819RNC_060064.1 36266159LG636,266,15923.20.74
CC_19520DNC_060062.1 53931991LG453,931,99111.3−0.18N-acetyl-α-D-glucosaminyl L-malate synthase+616glycosyltransferase family 4 protein[35]
CC_21220RNC_060062.1 20504973LG420,504,9735.90.23pectinesterase/pectinesterase inhibitor-likeSNP in genepectinesterase/pectinesterase inhibitor[36]
CC_22520RNC_060062.1 42084107LG442,084,1075.30.14probable glycosyltransferase At5g20260−278glycosyltransferase[35]
CC_25820RNC_060063.1 49890749LG549,890,7496.60.23MA3 domain-containing translation regulatory factor 1-likeSNP in genetopoisomerase-like protein[37]
Associations using the Milvus-derived SNP set [23]
CC_17819DLG1_3190272LG13,190,27227.5−0.46
CH_17819DLG1_4925141LG14,925,14126.5−0.12DEAD-box ATP-dependent RNA helicase 41+352DEAD-box ATP-dependent RNA helicase[38]
CC_17819DLG1_24162580LG124,162,58031.1−0.13
CC_17819DLG3_1774487LG31,774,48740.3−0.11uncharacterized LOC123918397SNP in genetransmembrane protein, putative[34]
CH_17819DLG3_3530352LG33,530,35231.1−0.14
CH_16519DLG4_2357602LG42,357,6025.80.29
CH_16519Dscaf_21186_400scaf_2118640010.70.15no genes on scaffold
CC_17819Dscaf_282_123440scaf_282123,44013.10.25
CC_17819Dscaf_569_145336scaf_569145,33614.9−0.04
CH_21819RLG1_4970479LG14,970,47921.90.25
CC_21819RLG1_8876185LG18,876,1855.9−0.12
CC_21819RLG2_16875234LG216,875,23412.60.07E3 ubiquitin–protein ligase HOS1SNP in geneE3 ubiquitin–protein ligase HOS1[39,40]
CH_22519RLG6_9239088LG69,239,08832.9−0.25uncharacterized LOC123889235SNP in genetransmembrane protein, putative[34]
CH_21819RLG6_9239175LG69,239,17534.4−0.14uncharacterized LOC123889235SNP in genetransmembrane protein, putative[34]
CH_23819RLG7_16795407LG716,795,40742.30.75
CC_20519Rscaf_17454_702scaf_1745470212.30.13no genes on scaffold
CH_22519Rscaf_215_44822scaf_21544,82222.40.47
CC_20519Rscaf_298_215478scaf_298215,47811.1−0.08
CH_23819Rscaf_677_20023scaf_67720,02312.70.42
CH_23819Rscaf_678_58484scaf_67858,48425.00.36
CH_21819Rscaf_802_53097scaf_80253,09710.8−0.17
CC_21819Rscaf_918_18614scaf_91818,61411.9−0.10
CH_18920DLG1_4925076LG14,925,07629.1−0.13DEAD-box ATP-dependent RNA helicase 41+287DEAD-box ATP-dependent RNA helicase[38]
CH_18920DLG1_12991269LG112,991,26936.6−0.14flowering time control protein FPASNP in geneRNA recognition motif (RRM) containing protein[41]
CC_18920DLG1_26956262LG126,956,2628.6−0.09
CC_18920DLG3_11998225LG311,998,22518.80.17uncharacterized LOC123913798SNP in geneMyb/SANT-like DNA-binding domain protein[42]
CC_19520DLG3_11998225LG311,998,22522.20.23uncharacterized LOC123913798SNP in geneMyb/SANT-like DNA-binding domain protein[42]
CC_22020RLG3_2451723LG32,451,7235.3−0.15
CC_24520RLG6_21196281LG621,196,2818.7−0.04
Trait with day-of-year of observation; Trial year; Period: drought (D) or recovery (R); SNP ID; Chromosome (Chr); Position on chromosome (Pos); phenotypic variance explained by that SNP allele (%); Effect size on the phenotype as regression coefficient (β) of the allele frequency; Gene of interest known to be involved in drought responses flanking the significant SNP in the HEN-17 reference genome [5]; Distance (bp) between the SNP and the gene of interest if the gene is upstream (+) or downstream (−) from the SNP; Annotation, functional description of the closest M. truncatula ortholog of the candidate gene, and literature references explaining the function of the genes of interest in the context of drought stress.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Vleugels, T.; Ruttink, T.; Ariza-Suarez, D.; Dubey, R.; Saleem, A.; Roldán-Ruiz, I.; Muylle, H. GWAS for Drought Resilience Traits in Red Clover (Trifolium pratense L.). Genes 2024, 15, 1347. https://doi.org/10.3390/genes15101347

AMA Style

Vleugels T, Ruttink T, Ariza-Suarez D, Dubey R, Saleem A, Roldán-Ruiz I, Muylle H. GWAS for Drought Resilience Traits in Red Clover (Trifolium pratense L.). Genes. 2024; 15(10):1347. https://doi.org/10.3390/genes15101347

Chicago/Turabian Style

Vleugels, Tim, Tom Ruttink, Daniel Ariza-Suarez, Reena Dubey, Aamir Saleem, Isabel Roldán-Ruiz, and Hilde Muylle. 2024. "GWAS for Drought Resilience Traits in Red Clover (Trifolium pratense L.)" Genes 15, no. 10: 1347. https://doi.org/10.3390/genes15101347

APA Style

Vleugels, T., Ruttink, T., Ariza-Suarez, D., Dubey, R., Saleem, A., Roldán-Ruiz, I., & Muylle, H. (2024). GWAS for Drought Resilience Traits in Red Clover (Trifolium pratense L.). Genes, 15(10), 1347. https://doi.org/10.3390/genes15101347

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop