Next Article in Journal
Dihydroceramides Derived from Bacteroidetes Species Sensitize TRPV1 Channels
Next Article in Special Issue
De Novo Assembly and Annotation of 11 Diverse Shrub Willow (Salix) Genomes Reveals Novel Gene Organization in Sex-Linked Regions
Previous Article in Journal
Choroid Plexus Aquaporins in CSF Homeostasis and the Glymphatic System: Their Relevance for Alzheimer’s Disease
Previous Article in Special Issue
Gain or Loss? Evidence for Legume Predisposition to Symbiotic Interactions with Rhizobia via Loss of Pathogen-Resistance-Related Gene Families
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genome-Wide Association Studies of Seven Root Traits in Soybean (Glycine max L.) Landraces

1
National Agrobiodiversity Center, National Institute of Agricultural Sciences, RDA, Jeonju 5487, Republic of Korea
2
Department of Applied Plant Science, Chonnam National University, Gwangju 61186, Republic of Korea
3
Department of Applied Biosciences, Kyungpook National University, Daegu 41566, Republic of Korea
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Int. J. Mol. Sci. 2023, 24(1), 873; https://doi.org/10.3390/ijms24010873
Submission received: 14 November 2022 / Revised: 7 December 2022 / Accepted: 8 December 2022 / Published: 3 January 2023
(This article belongs to the Special Issue Comparative Genomics and Functional Genomics Analysis in Plants)

Abstract

:
Soybean [Glycine max (L.) Merr.], an important oilseed crop, is a low-cost source of protein and oil. In Southeast Asia and Africa, soybeans are widely cultivated for use as traditional food and feed and industrial purposes. Given the ongoing changes in global climate, developing crops that are resistant to climatic extremes and produce viable yields under predicted climatic conditions will be essential in the coming decades. To develop such crops, it will be necessary to gain a thorough understanding of the genetic basis of agronomic and plant root traits. As plant roots generally lie beneath the soil surface, detailed observations and phenotyping throughout plant development present several challenges, and thus the associated traits have tended to be ignored in genomics studies. In this study, we phenotyped 357 soybean landraces at the early vegetative (V2) growth stages and used a 180 K single-nucleotide polymorphism (SNP) soybean array in a genome-wide association study (GWAS) conducted to determine the phenotypic relationships among root traits, elucidate the genetic bases, and identify significant SNPs associated with root trait-controlling genomic regions/loci. A total of 112 significant SNP loci/regions were detected for seven root traits, and we identified 55 putative candidate genes considered to be the most promising. Our findings in this study indicate that a combined approach based on SNP array and GWAS analyses can be applied to unravel the genetic basis of complex root traits in soybean, and may provide an alternative high-resolution marker strategy to traditional bi-parental mapping. In addition, the identified SNPs, candidate genes, and diverse variations in the root traits of soybean landraces will serve as a valuable basis for further application in genetic studies and the breeding of climate-resilient soybeans characterized by improved root traits.

1. Introduction

Soybean [Glycine max (L.) Merr.] is an important oilseed crop and an inexpensive source of protein and oil. In regions of Southeast Asia and Africa, soybeans are commonly used as a source of traditional food and animal feed and for diverse industrial purposes. In addition to these uses, given its symbiotic associations with nitrogen-fixing rhizobacteria, cultivating soybean crops can make a valuable contribution to fixing atmospheric nitrogen, thereby reducing reliance on the excessive application of chemical fertilizers and thus facilitating sustainable cultivation [1,2].
To meet the growing demand for food that will be necessary to sustain human populations in the coming years, it will be necessary to boost productivity. However, given the current yields of major crops, it will be difficult to meet the projected future demand. In addition to population growth, the predicted changes in global climate will potentially have wide-ranging ramifications with respect to human health, world trade flow, food prices, and the ecosystem, and could also imperil global food security [3,4]. Consequently, it is deemed imperative to make efforts to enhance the yields of soybean for food production. According to recent reports, compared with the previous growth season, global soybean production in 2021/2022 will be reduced by 9.2 million tons to 372.56 million metric tons, owing to the lower yields of South American crops [5]. This reduction in yield has been influenced by several factors, including soil properties and biotic and abiotic stresses. Typically, growth and development are detrimentally affected when plants are subject to stress, which is ultimately manifested in declining yields. To overcome the challenges presented by such yield reductions, a multi-faceted research approach is required. Generally, it is considered that roots serve as a frontline defensive barrier under conditions in which plants are exposed to stress [6]. From a morpho-physiological standpoint, roots play essential roles in water and nutrient uptake, transportation, and absorption (Freschet et al., 2021). Accordingly, the structure and function of roots are of particular importance in terms of crop productivity, and not surprisingly, root research is assuming increasing prominence in agricultural sectors to increase productivity from a broad perspective. At present, however, compared with other agronomical traits, agriculture-related plant root research has been comparatively limited, as root phenotyping tends to be tedious, expensive, and technically difficult, and thus to date has made little contribution to furthering crop improvement. However, based on recent technological developments, researchers have begun to use different approaches for phenotyping the roots of diverse plant species [7,8,9,10,11].
Typically, it is considered that the landraces of cultivated crop plant species, including soybean, are characterized by wide genetic diversity and carry multiple favorable alleles and genes that confer greater adaptability to adverse conditions than that shown by modern cultivars [12,13,14]. Consequently, conducting genomics studies using landraces is viewed as a particularly fruitful approach for assessing the available diversity for targeted traits. A plant’s resilience to biotic and abiotic stresses can also be improved by modification in the root. Root traits [root system architecture (RSA) and root morphological traits], tend to be quantitative in nature. Mainly, RSA comprises multiple traits, for example, total root length, root diameter, root biomass, root volume, etc., and can be used to assess a plant’s capacity to adapt to challenging conditions. These traits are influenced by multiple factors, and thus identifying quantitative trait loci (QTLs) associated with root traits and the associated molecular markers will represent a key approach for the exploitation of root traits in future breeding programs and crop improvement.
The increasingly widely adopted genome-wide association studies (GWASs) have facilitated the discovery of the immense allelic diversity harbored within natural populations with both high resolution and accuracy, and have thereby enabled the identification of genomic regions/QTLs linked to desirable traits. Using high-density SNP markers, researchers have succeeded in discovering genomics regions, loci, or alleles for root traits in numerous plant species, including soybean [15,16,17,18]. As a consequence of the discovery of heritable natural variations in soybean root traits, several QTLs associated with RSA have been identified using a mapping approach [16,19,20]. In addition, Ye et al. (2018) identified a major QTL (qWT_Gm03) linked to waterlogging tolerance and the regulation of RSA and root plasticity in soybean. Furthermore, based on a combined approach using genotyping-by-sequencing in conjunction with whole-genome sequencing, Seck et al. (2020) recently identified 10 QTL regions associated with total root length and primary root diameter in soybean [21]. All these studies indicated the significance of RSA in root development.
Apart from the markers identified in these studies, however, there has generally been a lack of relevant markers for root traits in molecular breeding, and few underlying genes have been identified for RSA [16,20,22,23]. Moreover, the requisite functional characterization or cloning of genes has yet to be performed. Thus, there currently exists a paucity of relevant information, particularly with respect to the identification of genomic regions or loci and alleles associated with biological progression and the effects on root traits. This scarcity of functional genomic information for root traits accordingly motivates identifying a more extensive range of unique genomic regions and the associated candidate genes necessary for elucidating the genetic basis of root traits of interest.
In this study, we aimed to examine the phenotypic association among root traits, elucidate their genetic basis, and identify significant SNPs associated with regions/loci controlling root traits. To this end, we phenotyped 357 soybean landraces at the early vegetative (V2) stage of growth and conducted GWAS using a soybean 180K SNP array. This study will help us comprehend the genetic basis of soybean root development.

2. Results

2.1. Root Trait Variation

In this study, 357 soybean germplasms were analyzed for seven root traits, all seven of which showed significant differences among the landraces (Table S1). The range of the average root diameter (DIAM) was found to have a relatively even distribution across the germplasms, ranging in value from 0.404 to 0.666 mm, with a mean diameter of 0.537 mm. With respect to LAD and LAL, we obtained values ranging from 0.429 to 0.725 and 0.109 to 0.379, respectively, with corresponding mean values of 0.5637 and 0.216. The total root length (LENGTH) of the landraces ranged from 703.33 to 3212.642 cm, with a mean value of 1766.26 cm (Table S1), whereas values obtained for the number of forks (NF) ranged from 1180.00 to 11,941.67, with a mean value of 5250.51. A majority of the germplasms had fork numbers ranging from 3200 to 5200, with numbers gradually declining at the higher and lower extremes. Assessment of root tip number (NT) revealed values distributed across a range from 6222.67 to 3255.00, with a mean number of 1777.58, whereas surface area (SA) values ranged from 124.15 to 491.667 cm2, with a mean value of 294.382 cm2 (Table S1). Based on the determinations of skewness and kurtosis, we established all seven traits (DIAM, LAL, LAD, LENGTH, NT, NF, and SA) to be normally distributed (Table 1, Figure S1A–G). Among these traits, the highest coefficient of variation (CV) was obtained for NF (32.23%), followed by NT (29.53%), LENGTH (23.92%), SA (22.50%), LAL (15.12%), LAD (9.25%), and DIAM (8.68%). These findings indicate a notable variation in the levels of significant difference among the measured traits.

2.2. Population Structure and Principal Component and Phylogenetic Analyses

Grouping the 357 soybean landraces into clusters corresponding to their major categories is a key step in highlighting the basic population structure of these landraces. Although genetic studies of landraces can reveal distinct genotypes, direct and precise genotype partitioning is difficult. As a result, we used K-means clustering algorithms to identify and group these landraces based on their similarities. We used the elbow method, which enables the determination of the number of clusters by the curve’s elbow (Figure S2). Using the elbow approach, we detected three clusters/groups (i.e., K = 3), with groups I, II, and III containing 92, 105, and 160 landraces, respectively, all three of which were found to have varying degrees of admixture. Landraces from South Korea were more or less equally distributed between groups I and III, with 51 and 59 landraces, respectively, whereas 71 of the 90 landraces from China were clustered in group II. North Korean landraces were found to be distributed within all three groups, with groups I, II, and III containing 22, 16, and 20 landraces, respectively. Similarly, Japanese landraces had representatives in each of the three groups with 11, 13, and 70 clustering in groups I, II, and III, respectively. Overall, we found that except for five landraces, the soybean landraces of South Korean origin could mainly be divided into two groups, whereas the majority of landraces originating from China fall into group II, and a mixed type of separation was observed for landraces of North Korean origin among three groups, without any distinct origin-based separation.
By way of confirming the population structure of the assessed soybean landraces, we performed phylogenetic and principal component analyses (PCA). For phylogenetic tree construction, we used all 357 landraces, which were compared based on genetic similarity matrix information. The results of both PCA and phylogenetic analysis were consistent with the aforementioned population structure (Figure 1). Phylogenetic analysis partitioned the 357 landraces into three major groups (Figure 1B), among which group I comprised 143 landraces [South Korea (13), Japan (27), North Korea (27), and China (82)], group II comprised 114 landraces [South Korea (45), Japan (59), North Korea (8), and China (2)], and group III comprised 100 landraces [South Korea (57), Japan (14), North Korea (23), and China (6)]. Similarly, PCA-based clustering revealed that the 357 soybean lines can be divided into three major groups, among which there was a certain degree of overlap comprising areas of admixture (Figure 1A). PC1 and PC2 accounted for 34.23% and 14.10% of the observed variation, respectively (Figure 1C). The first three, PC1, PC2, and PC3, contributed more than fifty percent (58.20%) variation (Table S2). Based on these analyses, we can thus conclude that the 357 assessed soybean landraces can be divided into three major groups/subpopulations, which represent an admixture of three ancestral populations.

2.3. Estimation of Linkage Disequilibrium Decay

To gain an estimate of LD decay, we performed pairwise comparisons between all filtered SNPs. At a cut-off value of r2 = 0.1, the average LD decay distance of the 357 soybean landraces was approximately 200 kb, r2 = 0.2 (Figure 1D). Furthermore, we determine the pattern of LD across the genome to establish the number of haplotype blocks containing SNPs that can be used to determine the candidate gene identification range.

2.4. Genome-Wide Association Analysis

To identify SNPs that are significantly associated with the seven assessed root traits (DIAM, LAD, LAL, LENGTH, NF, NT, and SA), we conducted GWAS using a mixed linear model (MLM) and accordingly identified 112 SNPs with significant associations, among which 28, 23, 21, 16, 16, 4, and 4 SNPs were associated with LAD, NT, DIAM, LAL, NF, LENGTH, and SA, respectively, with a threshold of ≥4 − log10(p) (Table 1). The SNPs associated with LAD were distributed on seven chromosomes (Chr. 3, 6, 12, 13, 14, 16X, and 18). Similarly, SNPs associated with NT were distributed on five chromosomes (Chr. 1, 9, 11, 13, and 2). The chromosomal distribution of other SNPs associated with DIAM, LAL, LENGTH, NF, and SA are shown in Table 1. Among the detected SNPs, that with the highest −log10 (p) value (5.78) was the LAL-associated AX-90477018 located at 7419551 bp on Chr. 7, followed by SNP AX-90402282 located at 58817483 bp on Chr. 18 with a −log10 (p) value of 5.28.
We also established that 24 of the identified SNPs were co-associated with two of the seven traits. Among the 28 significant SNPs associated with the LAD trait, 17 SNPs present on Chr. 3 (6 SNPs), Chr. 6 (7), Chr. 14 (2), and Chr. 18 (2) were also commonly associated with DIAM. Likewise, three SNPs were identified as commonly associated with LENGTH and NT on Chr. 11 and a further four SNPs distributed on Chr. 8 were found to be commonly associated with NF and SA. Manhattan and quantile-quantile plots of all the significant SNPs associated with individual traits are shown in Figure 2A,B, Figure 3A,B, and Figure 4A–C.

2.5. Putative Candidate Genes

Candidate genes were identified within 100-kb regions on either side of 82 significant SNPs, with a total of 570 putative candidate genes being identified for the most significant SNPs associated with the seven root traits, based on MLM modeling (Table S3). A complete list of all genes associated with significant SNPs is presented in Table S3. Based on obtained annotation information, possible role in root development, and expression profiles related to root organs/tissues (e.g., roots, root stripped, root nodules, and root tips), we drew up a shortlist of 55 candidate genes (Table S4). We first used the ePlant (https://bar.utoronto.ca/eplant_soybean/, accessed on 16 April 2022) database to analyze the expression patterns of 55 candidate genes in different tissues. The result showed that all candidate genes were expressed in soybean root tissues (Figure S3). Furthermore, we selected the six highest expressed genes (Glyma.11g209100, Glyma.09g051100, Glyma.01g220600, Glyma.05g225700 Glyma.11g209200, and Glyma.13g261700) in the root, and data analysis was completed for these six genes using RNA-Seq soybean libraries (4085) and compared with other tissues (leaf, seedling, shoot, stem, meristem, flower, pod, nodule, seed, embryo, and endosperm). The result showed differential expression levels of six selected candidate genes in the othertissues (Figure S4A–F).

2.6. Comparative Genome and Orthologous Gene Analysis

For comparative genomic research to estimate the integrity of the shortlisted candidate genes first, we chose four closely related species, which are listed as follows: soybean, barrel medic (Medicago truncatula), adzuki bean (Vigna augularis), and common bean (Phaseolus vulgaris). A total of 13,191 core gene clusters were found in the four species, and 960 gene clusters were unique to soybeans (Figure S5A), with specific gene clusters accounting for ~3.73% (960/22159). Then, we used the root tissue-specific 55 candidate genes for comparative genomic analysis, but did not find any unique gene cluster (Figure S5B); however, 41 core gene clusters were found in a comparative analysis of candidate genes and mungbean, adzuki bean, and common bean, with specific core genes clusters accounting for 74.54% (41/55), a result which showed that these candidate genes have common ortholog among interspecies. In Figure S5C, 41 core genes are involved in various biological, molecular, and cellular processes. Detailed statistics of shared gene clusters and protein count concerning the cluster are represented in Figure S5D,E.

3. Discussion

Although the roots of plants play essential roles in anchoring and the absorption of water and nutrients, their subterranean distribution throughout the plant life cycle makes it difficult to continuously monitor root attributes or traits. Moreover, compared with the traits of other plant parts, relatively less attention has been focused on root genomics. In recent years, however, there has been a gradual increase in the recognition of the importance of root traits, and it is now believed that by optimally exploiting root trait information, we may be on the verge of a “second green revolution” in agriculture [24].
In the present study, we detected significant differences among soybean landraces with respect to seven assessed root traits (DIAM, LAL, LAD, LENGTH, NT, NF, and SA) at (p < 0.0001) level (Table S1). Genetic analysis of the 357 assessed landraces revealed the presence of distinct genotypes among these landraces, with genotypic cluster analysis indicating three major clusters, characterized by varying degrees of admixture (Figure 1A) and a lack of distinct separation based on landrace origin. However, the majority of landraces of Korean origin were found to be divided mainly between two subpopulation groups (I and III). The results of subsequent phylogenetic and PCA analyses consistently revealed the clustering of the assessed landraces into the three major groups indicated by population structure analysis (Figure 1). In our PCA, principal components 1 and 2 were found to account for 34.23% and 14.10% of the observed variance, respectively; however, we detected no distinct separation corresponding to the geographical origin of landraces. These findings tend to be consistent with those of previous genetic studies conducted on soybean, which have reported similar clustering results [25,26,27].
In self-pollinated crops, the rate of LD decay is typically higher than that detected in cross-pollinated plants [28]. Based on physical distance, the LD decay of the entire genome is assumed to increase. In the present study, we obtained an estimated decay of r2 = 0.2 within approximately 200 kb, which is slightly lower than the previously reported LD decay in soybean (235–375 kb) [29,30,31], although it is higher than that reported for other self-pollinated species such as rice and sorghum, with calculated LD decays of (123–167 kb) [32] and (150 kb) [33], respectively. We speculate that these differences could be attributed to a lower genome marker coverage and the smaller number of genotypes in the case of rice and sorghum.
Although numerous studies have examined the agronomic traits of soybean, very few have adopted a GWAS approach to assessing the genetic basis of root traits in soybean. To facilitate such studies, a necessary prerequisite is genetic diversity among the assessed germplasm resources. In terms of allelic diversity for various agronomic traits, the most extensive differences can be found among wild soybeans, followed by landraces, with cultivated soybeans being characterized by being the least diverse. Accordingly, for GWAS in the present study, we used a diverse collection of landraces (n = 357) to identify genetic loci associated with root traits. The primary goal of this study was to determine the genetic basis of phenotypic variation in the soybean root system. To this end, we used a 180KSoyaSNP array for our analyses of root traits. In GWAS studies, marker density is of particular importance with respect to establishing entire genome coverage and capturing all the available haplotypic variation, with a larger number of markers being conducive to obtaining sufficient LD to identify significant marker–trait associations. To the best of our knowledge, the marker dataset used in the present study is the second-largest examined to date for root trait GWAS, after that recently reported by Seck et al. [21]. Comparatively, previous studies that have examined root traits based on a combination of QTL mapping and GWAS have tended to use smaller sets of markers ranging in number from 232 to 38,052 [16,22,23,34].
In this study, we identified a total of 112 significant SNPs associated with one or more of the seven assessed root traits, which enabled us to identify genomic regions associated with desired traits. Similarly, recent GWAS studies conducted for root traits in soybean detected 10 genomic regions for the length and diameter of roots and four QTL regions for the number of lateral roots [21,23]. Given the sufficient marker density obtained in the present study, we were able to detect a large number of significant SNPs with a −log10(p) value >4.0 for the all-measured traits using an MLM model. The phenotypes showed high stability across replicate analyses, and we observed significant genetic diversity among the 357 landraces with respect to the seven assessed root traits. Moreover, we also succeeded in detecting a large number of SNPs associated with the traits of interest. Based on comparisons of the significant SNPs detected in the present study and those previously reported for root and shoot traits, we identified significant SNPs on chromosome 13 that differ from the SNPs previously reported for root traits, although interestingly these SNPs map to locations within the vicinity of those detected for NT (AX-90325580, AX-90370286, AX-90457815, AX-90452276, AX-90480597, AX-90305162, and AX-90433497) [15,35]. In addition, soybean locus ss715616115 on chromosome 13 (position 33415484), previously reported by Zeng et al. (2017) to be associated with salt tolerance, is also located in the vicinity of the aforementioned SNPs. In addition, the ss715624611 locus associated with salt tolerance on chromosome 16 (position 33383414) harbors the SNP AX-90502945 (position 33527990) associated with LAD. Furthermore, common SNPs detected for NF and SA (AX-90428520, AX-90467878, AX-90468823, and AX-90412442) in the present study map within previously identified QTLs associated with different root traits. For example, SNPs associated with root dry weight, taproot length, shoot length, lateral root number, and total root length were detected on chromosome 8 at the seedling stage in root phenotyping studies [22,34,36,37,38]. Similarly, SNPs present on chromosome 6 are commonly associated with DIAM and LAD (AX-90513850, AX-90502675, AX-90329829, AX-90504776, AX-90434642, AX-90416589, and AX-90426968) and SNPs on chromosome 18 associated with DIAM, LAD, and LAL are located in the vicinity of previously reported SNPs for SA, root volume, and branching number in soybean [15], as well as photosynthetic traits associated with phosphorus efficiency [39]. To the best of our knowledge, however, most of the remaining SNPs we identified as being associated with root traits have not been reported previously.
Having identified SNPs associated with the traits of interest, we proceeded to identify the putative candidate genes for the significant SNPs, which are listed in Table S3. In total, we detected 570 genes within the 100-kb regions up and downstream of the SNPs, which we subsequently whittled down to a shortlist of 55 genes based on annotation information, possible roles in root development, and expression profiles related to root organs/tissues. Recently, Seck et al. (2020), using a similar approach, identified two relevant candidate genes (Glyma.03g065700 and Glyma.07g096000) associated with soybean root development and root tips. Furthermore, six genes that showed higher expression in the root (Figure S3) were compared with other plant tissue to see the expression across tissues, which showed a differential expression pattern. A similar approach was used recently to elucidate the role of candidate genes for tocopherol content in soybean seed [40]. Among the genes, we identified Glyma.09g073300 and Glyma.01g220600, the functional annotations of which in the public domain are auxin-responsive protein-related protein and aquaporin PIP1-4-related genes, respectively. Considering other highly expressed genes in the root would be of particular interest to further validate and assess the expression of these genes in the root tissues of contrasting landraces. Apart from these genes, further research will be necessary to validate and confirm the identity of the remaining genes, particularly those genes for which annotation information is currently unavailable. Moreover, it is desirable to analyze the expression of all the promising candidate genes to determine genotype/population differences, as well as their patterns of expression at different stages of root development.

4. Materials and Methods

4.1. Plant Materials

In this study, we used 357 soybean landraces collected from the National Agrobiodiversity Center gene bank of the Rural Development Administration. The collected landraces were derived from the following four countries: South Korea (115), North Korea (58), China (90), and Japan (94). The seeds of these germplasms were sown in polyvinyl chloride tubes (8 cm in diameter and 40 cm in height) containing horticultural soil (Tobirang, Baekkwang Fertility, Republic of Korea) in a randomized block design with three replicates. All landraces were grown for 40 days and were harvested at the V2 stage of growth.

4.2. Root Phenotype Evaluation

For greenhouse experiments, individual plant samples were removed and measured at the V2 growth stage, and having been rinsed with clean fresh water, their root samples were measured. Seven root traits, namely, average root diameter (DIAM), link average diameter (LAD), link average length (LAL), total root length (LENGTH), number of root forks (NF), number of root tips (NT), and surface area (SA), were scanned and analyzed using the Win-RHIZO system (LA6400XL; Canada Regent Instruments). We had performed this root phenotyping previously [41] and used the phenotypic data for conducting GWAS of the 357 assessed landraces.

4.3. Genotyping, Population Structure, and Linkage Disequilibrium Analyses

For DNA extraction, we collected samples of young trifoliate leaves from individual plants of each of the 357 soybean landraces, from which the genomic DNAs were isolated using the magnetic bead method (Qiagen, Hilden, Germany) following the manufacturer’s instructions. Genotyping of the 357 individuals was performed using an Axiom® 180k SoyaSNP array following a previously described method [42]. The genotypic data thus obtained were filtered based on missing IDs/SNPs and a minor allele frequency of ≤0.01, and haplotype phasing and imputation of individuals with a minimum heterozygosity proportion of ≤0.05 were conducted using BEAGLE version 3.3.2 (https://faculty.washington.edu/browning/beagle/beagle.html, accessed on 14 April 2022) based on a hidden Markov process algorithm. The number of subpopulations (k) was determined based on cluster analysis, and further ancestry estimation was performed based on ADMIXTURE analysis [43]. A bar plot of the 357 individuals was generated using STRUCTURE 2.3.4 [44,45] for a k = value of 3. Linkage disequilibrium (LD) analysis was performed using genome-wide SNP markers, and correlation (r2) values among the 84,210 SNP markers were calculated using QTLmaxV2 software based on PLINK, with a window size of 400 SNP markers, and estimated as the squared values of the correlation coefficient between SNP marker pairs [46].

4.4. Phylogenetic and Principal Component Analyses

Phylogenetic analysis was performed using the similarity identity-by-state coefficient matrix calculated using Plink v2.0 (www.cog-genomics.org/plink/2.0/, accessed on 14 April 2022) [47], and a phylogenetic tree was constructed based on the genetic similarity matrix unweighted paired group method using arithmetic averages (UPGMA). Principal component analysis (PCA) was performed using the R package ggplot2 distance matrix.

4.5. Genome-Wide Association Studies

All GWAS analyses were performed for seven root traits of the 357 landraces using GWASpro (QTLmaxV2) [48] with a linear mixed model and default settings. We filtered the 180,375 SNPs based on the previously mentioned parameters, and the 84,210 SNP markers thus obtained were used for GWAS analysis. We set the threshold for the −log10(p) values ≥ 4 at alpha = 5 to search for significant SNPs.

4.6. Candidate Gene Identification

We used the most significant SNPs detected in GWAS to identify potential candidate genes, based on annotation using the SoyBase database (https://soybase.org/SequenceIntro.php, accessed on15 April 2022) according to the Wm82. a2. v1 soybean reference genome. Candidate genes were searched within a sequence range of 100 kb up and downstream each significant SNP using the genome browser, and functional annotation of the genes was performed using the Phytozome database [49].

4.7. RNA-Seq Data Analysis of Selected Candidate Genes

To determine the candidate genes expression level, we first performed an analysis using the ePlant (https://bar.utoronto.ca/eplant_soybean/, accessed on 4 December 2022) database and a heatmap was constructed for shortlisted genes using TBtools software (https://github.com/CJ-Chen/Tbtools, accessed on 4 December 2022). Furthermore, selected genes were used for differential expression analysis at different tissues using web-based publicly available RNA-Seq soybean libraries’ (4085) data with default setting (http://ipf.sustech.edu.cn/pub/soybean/, accessed on 4 December 2022) [50]. Furthermore, the whole genome and candidate genes among closely related legume species were compared and orthologous gene clusters were determined using OrthoVenn2 web portal [51].

4.8. Statistical Analysis

We used phenotypic data for the 357 landraces obtained in our recently published study [41]. To determine statistical significance, descriptive statistics were obtained for the landrace data using SAS release 9.4 (SAS, Gary, NC, USA), and a histogram was generated for the seven assessed root traits.

5. Conclusions

In this study, using GWAS, we determined the genomic regions associated with seven selected root traits in soybean landraces and identified the 112 most significant SNPs associated with the measured traits. Furthermore, based on annotation information and publicly available expression data, putative candidate genes were identified in the vicinities of the SNPs. These promising candidate genes could potentially serve as molecular targets for further studies conducted to evaluate their role in the regulation of root development. Moreover, the numerous SNPs identified in this study can be used as a potential resource in soybean breeding programs, with a view toward marker development for root traits. Collectively, our findings will provide a basis for identifying the genetic components underlying phenotypic variation in the root traits of soybean landraces.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms24010873/s1.

Author Contributions

R.T. and S.-H.K. prepared a draft of the manuscript and performed root phenotype analysis and data curation. B.-S.H. and B.-H.K. performed formal analysis and validation. B.-K.H. and Y.-H.K. inspected the experimental design and revised the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by a grant allocated to S.-H.K. from the National Institute of Agricultural Sciences (Project No. PJ01424203), the Rural Development Administration, Republic of Korea.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. van Hameren, B.; Hayashi, S.; Gresshoff, P.M.; Ferguson, B.J. Advances in the identification of novel factors required in soybean nodulation, a process critical to sustainable agriculture and food security. J. Plant Biol. Soil Health 2013, 1, 6. [Google Scholar]
  2. Marinkovic, J.; Bjelic, D.; Tintor, B.; Miladinovic, J.; Dukic, V.; Dorđevic, V. Effects of soybean co-inoculation with plant growth promoting rhizobacteria in field trial. Rom. Biotechnol. Lett. 2018, 23, 13401. [Google Scholar]
  3. Department of Economic and Social Affairs. World Population Prospects 2019; Department of Economic and Social Affairs: New York, NY, USA, 2019. [Google Scholar]
  4. Gitz, V.; Meybeck, A.; Lipper, L.; Young, C.; Braatz, S. Climate Change and Food Security: Risks and Responses; FAO: Rome, Italy, 2016; pp. 1–110. [Google Scholar] [CrossRef]
  5. Ates, A.M.; Bukowski, M. 2021/22 Global Soybean Stocks Fall in Tandem with South American Production Economic Research Service|Situation and Outlook Report Domestic Outlook International Outlook. 2022. Available online: https://www.ers.usda.gov/webdocs/outlooks/103026/ocs-22a.pdf?v=6071 (accessed on 14 October 2022).
  6. Zhang, H.; Zhao, Y.; Zhu, J.-K. Thriving under Stress: How Plants Balance Growth and the Stress Response. Dev. Cell 2020, 55, 529–543. [Google Scholar] [CrossRef]
  7. Atkinson, J.A.; Pound, M.P.; Bennett, M.J.; Wells, D.M. Uncovering the hidden half of plants using new advances in root phenotyping. Curr. Opin. Biotechnol. 2019, 55, 1–8. [Google Scholar] [CrossRef]
  8. Tayade, R.; Kim, S.-H.; Tripathi, P.; Choi, Y.-D.; Yoon, J.-B.; Kim, Y.-H. High-Throughput Root Imaging Analysis Reveals Wide Variation in Root Morphology of Wild Adzuki bean (Vigna angularis) Accessions. Plants 2022, 11, 405. [Google Scholar] [CrossRef] [PubMed]
  9. Tripathi, P.; Tayade, R.; Mun, B.-G.; Yun, B.-W.; Kim, Y. Silicon Application Differentially Modulates Root Morphology and Expression of PIN and YUCCA Family Genes in Soybean (Glycine max L.). Front. Plant Sci. 2022, 13, 842832. [Google Scholar] [CrossRef]
  10. McGrail, R.K.; Van Sanford, D.A.; McNear, D.H. Trait-Based Root Phenotyping as a Necessary Tool for Crop Selection and Improvement. Agronomy 2020, 10, 1328. [Google Scholar] [CrossRef]
  11. Guimarães, P.H.R.; de Lima, I.P.; de Castro, A.P.; Lanna, A.C.; Guimarães Santos Melo, P.; de Raïssac, M. Phenotyping Root Systems in a Set of Japonica Rice Accessions: Can Structural Traits Predict the Response to Drought? Rice 2020, 13, 67. [Google Scholar] [CrossRef]
  12. Marone, D.; Russo, M.A.; Mores, A.; Ficco, D.B.M.; Laidò, G.; Mastrangelo, A.M.; Borrelli, G.M. Importance of Landraces in Cereal Breeding for Stress Tolerance. Plants 2021, 10, 1267. [Google Scholar] [CrossRef]
  13. Arca, M.; Mary-Huard, T.; Gouesnard, B.; Bérard, A.; Bauland, C.; Combes, V.; Madur, D.; Charcosset, A.; Nicolas, S.D. Deciphering the Genetic Diversity of Landraces With High-Throughput SNP Genotyping of DNA Bulks: Methodology and Application to the Maize 50k Array. Front. Plant Sci. 2021, 11, 1926. [Google Scholar] [CrossRef]
  14. Hyten David, L.; Song, Q.; Zhu, Y.; Choi, I.-Y.; Nelson Randall, L.; Costa Jose, M.; Specht James, E.; Shoemaker Randy, C.; Cregan Perry, B. Impacts of genetic bottlenecks on soybean genome diversity. Proc. Natl. Acad. Sci. USA 2006, 103, 16666–16671. [Google Scholar] [CrossRef] [PubMed]
  15. Mandozai, A.; Moussa, A.A.; Zhang, Q.; Qu, J.; Du, Y.; Anwari, G.; Al Amin, N.; Wang, P. Genome-Wide Association Study of Root and Shoot Related Traits in Spring Soybean (Glycine max L.) at Seedling Stages Using SLAF-Seq. Front. Plant Sci. 2021, 12, 1598. [Google Scholar] [CrossRef] [PubMed]
  16. Prince, S.J.; Song, L.; Qiu, D.; Maldonado Dos Santos, J.V.; Chai, C.; Joshi, T.; Patil, G.; Valliyodan, B.; Vuong, T.D.; Murphy, M.; et al. Genetic variants in root architecture-related genes in a Glycine soja accession, a potential resource to improve cultivated soybean. BMC Genom. 2015, 16, 132. [Google Scholar] [CrossRef] [Green Version]
  17. Dhanapal, A.P.; York, L.M.; Hames, K.A.; Fritschi, F.B. Genome-Wide Association Study of Topsoil Root System Architecture in Field-Grown Soybean [Glycine max (L.) Merr.]. Front. Plant Sci. 2021, 11, 590179. [Google Scholar] [CrossRef]
  18. Li, L.; Peng, Z.; Mao, X.; Wang, J.; Chang, X.; Reynolds, M.; Jing, R. Genome-wide association study reveals genomic regions controlling root and shoot traits at late growth stages in wheat. Ann. Bot. 2019, 124, 993–1006. [Google Scholar] [CrossRef] [PubMed]
  19. Abdel-Haleem, H.; Lee, G.-J.; Boerma, R.H. Identification of QTL for increased fibrous roots in soybean. Theor. Appl. Genet. 2011, 122, 935–946. [Google Scholar] [CrossRef] [PubMed]
  20. Brensha, W.; Kantartzi, S.K.; Meksem, K.; Grier IV, R.L.; Barakat, A.; Lightfoot, D.A.; Kassem, M.A. Genetic Analysis of Root and Shoot Traits in the ‘ Essex’ By ‘ Forrest’ Recombinant Inbred Line (RIL) Population of Soybean [Glycine max (L.) Merr.]. Plant Genet. Genom. Biotechnol. 2012, 1, 1–9. [Google Scholar]
  21. Seck, W.; Torkamaneh, D.; Belzile, F. Comprehensive Genome-Wide Association Analysis Reveals the Genetic Basis of Root System Architecture in Soybean. Front. Plant Sci. 2020, 11, 590740. [Google Scholar] [CrossRef]
  22. Manavalan, L.P.; Prince, S.J.; Musket, T.A.; Chaky, J.; Deshmukh, R.; Vuong, T.D.; Song, L.; Cregan, P.B.; Nelson, J.C.; Shannon, J.G.; et al. Identification of Novel QTL Governing Root Architectural Traits in an Interspecific Soybean Population. PLoS ONE 2015, 10, e0120490. [Google Scholar] [CrossRef] [Green Version]
  23. Prince, S.J.; Valliyodan, B.; Ye, H.; Yang, M.; Tai, S.; Hu, W.; Murphy, M.; Durnell, L.A.; Song, L.; Joshi, T.; et al. Understanding genetic control of root system architecture in soybean: Insights into the genetic basis of lateral root number. Plant Cell Environ. 2019, 42, 212–229. [Google Scholar] [CrossRef] [Green Version]
  24. Lynch, J.P. Roots of the Second Green Revolution. Aust. J. Bot. 2007, 55, 493–512. [Google Scholar] [CrossRef]
  25. Jo, H.; Lee, J.Y.; Cho, H.; Choi, H.J.; Son, C.K.; Bae, J.S.; Bilyeu, K.; Song, J.T.; Lee, J.-D. Genetic Diversity of Soybeans (Glycine max (L.) Merr.) with Black Seed Coats and Green Cotyledons in Korean Germplasm. Agronomy 2021, 11, 581. [Google Scholar] [CrossRef]
  26. Chung, W.H.; Jeong, N.; Kim, J.; Lee, W.K.; Lee, Y.G.; Lee, S.H.; Yoon, W.; Kim, J.H.; Choi, I.Y.; Choi, H.K.; et al. Population structure and domestication revealed by high-depth resequencing of korean cultivated and wild soybean genomes. DNA Res. 2014, 21, 153–167. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Jeong, N.; Kim, K.S.; Jeong, S.; Kim, J.Y.; Park, S.K.; Lee, J.S.; Jeong, S.C.; Kang, S.T.; Ha, B.K.; Kim, D.Y.; et al. Korean soybean core collection: Genotypic and phenotypic diversity population structure and genome-wide association study. PLoS ONE 2019, 14, e0224074. [Google Scholar] [CrossRef] [PubMed]
  28. Flint-Garcia, S.A.; Thornsberry, J.M.; Buckler, E.S. Structure of Linkage Disequilibrium in Plants. Annu. Rev. Plant Biol. 2003, 54, 357–374. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  29. Wen, Z.; Tan, R.; Yuan, J.; Bales, C.; Du, W.; Zhang, S.; Chilvers, M.I.; Schmidt, C.; Song, Q.; Cregan, P.B.; et al. Genome-wide association mapping of quantitative resistance to sudden death syndrome in soybean. BMC Genom. 2014, 15, 809. [Google Scholar] [CrossRef] [Green Version]
  30. Li, M.; Liu, Y.; Tao, Y.; Xu, C.; Li, X.; Zhang, X.; Han, Y.; Yang, X.; Sun, J.; Li, W.; et al. Identification of genetic loci and candidate genes related to soybean flowering through genome wide association study. BMC Genom. 2019, 20, 987. [Google Scholar] [CrossRef]
  31. Liu, Z.; Li, H.; Fan, X.; Huang, W.; Yang, J.; Li, C.; Wen, Z.; Li, Y.; Guan, R.; Guo, Y.; et al. Phenotypic Characterization and Genetic Dissection of Growth Period Traits in Soybean (Glycine max) Using Association Mapping. PLoS ONE 2016, 11, e0158602. [Google Scholar] [CrossRef] [Green Version]
  32. Huang, X.; Wei, X.; Sang, T.; Zhao, Q.; Feng, Q.; Zhao, Y.; Li, C.; Zhu, C.; Lu, T.; Zhang, Z.; et al. Genome-wide association studies of 14 agronomic traits in rice landraces. Nat. Genet. 2010, 42, 961–967. [Google Scholar] [CrossRef]
  33. Morris, G.P.; Ramu, P.; Deshpande, S.P.; Hash, C.T.; Shah, T.; Upadhyaya, H.D.; Riera-Lizarazu, O.; Brown, P.J.; Acharya, C.B.; Mitchell, S.E.; et al. Population genomic and genome-wide association studies of agroclimatic traits in sorghum. Proc. Natl. Acad. Sci. USA 2013, 110, 453–458. [Google Scholar] [CrossRef] [Green Version]
  34. Liang, H.; Yu, Y.; Yang, H.; Xu, L.; Dong, W.; Du, H.; Cui, W.; Zhang, H. Inheritance and QTL mapping of related root traits in soybean at the seedling stage. Theor. Appl. Genet. 2014, 127, 2127–2137. [Google Scholar] [CrossRef] [PubMed]
  35. Van Nguyen, L.; Takahashi, R.; Githiri, S.M.; Rodriguez, T.O.; Tsutsumi, N.; Kajihara, S.; Sayama, T.; Ishimoto, M.; Harada, K.; Suematsu, K.; et al. Mapping quantitative trait loci for root development under hypoxia conditions in soybean (Glycine max L. Merr.). Theor. Appl. Genet. 2017, 130, 743–755. [Google Scholar] [CrossRef] [PubMed]
  36. Liu, Y.; Gai, J.; Lu, H. Identification of rhizosphere abiotic stress tolerance and related root traits in soybean [Glycine max (L.) Merr.]. Acta Agron. Sin. 2005, 31, 1132. [Google Scholar]
  37. Liu, B.; Fujita, T.; Yan, Z.H.; Sakamoto, S.; Xu, D.; Abe, J. QTL mapping of domestication-related traits in soybean (Glycine max). Ann. Bot. 2007, 100, 1027–1038. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  38. Prince, S.J.; Vuong, T.D.; Wu, X.; Bai, Y.; Lu, F.; Kumpatla, S.P.; Valliyodan, B.; Shannon, J.G.; Nguyen, H.T. Mapping Quantitative Trait Loci for Soybean Seedling Shoot and Root Architecture Traits in an Inter-Specific Genetic Population. Front. Plant Sci. 2020, 11, 1284. [Google Scholar] [CrossRef] [PubMed]
  39. Lü, H.; Yang, Y.; Li, H.; Liu, Q.; Zhang, J.; Yin, J.; Chu, S.; Zhang, X.; Yu, K.; Lv, L.; et al. Genome-Wide Association Studies of Photosynthetic Traits Related to Phosphorus Efficiency in Soybean. Front. Plant Sci. 2018, 9, 1226. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  40. Yu, K.; Miao, H.; Liu, H.; Zhou, J.; Sui, M.; Zhan, Y.; Xia, N.; Zhao, X.; Han, Y. Genome-wide association studies reveal novel QTLs, QTL-by-environment interactions and their candidate genes for tocopherol content in soybean seed. Front. Plant Sci. 2022, 13, 1026581. [Google Scholar] [CrossRef]
  41. Kim, S.-H.; Subramanian, P.; Hahn, B.-S.; Ha, B.-K. High-Throughput Phenotypic Characterization and Diversity Analysis of Soybean Roots (Glycine max L.). Plants 2022, 11, 2017. [Google Scholar] [CrossRef]
  42. Lee, Y.G.; Jeong, N.; Kim, J.H.; Lee, K.; Kim, K.H.; Pirani, A.; Ha, B.K.; Kang, S.T.; Park, B.S.; Moon, J.K. Development, validation and genetic analysis of a large soybean SNP genotyping array. Plant J. 2015, 81, 625–636. [Google Scholar] [CrossRef]
  43. Alexander, D.H.; Lange, K. Enhancements to the ADMIXTURE algorithm for individual ancestry estimation. BMC Bioinform. 2011, 12, 246. [Google Scholar] [CrossRef] [Green Version]
  44. Porras-Hurtado, L.; Ruiz, Y.; Santos, C.; Phillips, C.; Carracedo, Á.; Lareu, M. An overview of STRUCTURE: Applications, parameter settings, and supporting software. Front. Genet. 2013, 4, 98. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  45. Evanno, G.; Regnaut, S.; Goudet, J. Detecting the number of clusters of individuals using the software structure: A simulation study. Mol. Ecol. 2005, 14, 2611–2620. [Google Scholar] [CrossRef] [Green Version]
  46. Purcell, S.; Neale, B.; Todd-Brown, K.; Thomas, L.; Ferreira, M.A.R.; Bender, D.; Maller, J.; Sklar, P.; de Bakker, P.I.W.; Daly, M.J.; et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 2007, 81, 559–575. [Google Scholar] [CrossRef] [PubMed]
  47. Chang, C.C.; Chow, C.C.; Tellier, L.C.A.M.; Vattikuti, S.; Purcell, S.M.; Lee, J.J. Second-generation PLINK: Rising to the challenge of larger and richer datasets. GigaScience 2015, 4, s13742-015. [Google Scholar] [CrossRef] [PubMed]
  48. Kim, B.; Dai, X.; Zhang, W.; Zhuang, Z.; Sanchez, D.L.; Lübberstedt, T.; Kang, Y.; Udvardi, M.K.; Beavis, W.D.; Xu, S.; et al. GWASpro: A high-performance genome-wide association analysis server. Bioinformatics 2019, 35, 2512–2514. [Google Scholar] [CrossRef] [Green Version]
  49. Goodstein, D.M.; Shu, S.; Howson, R.; Neupane, R.; Hayes, R.D.; Fazo, J.; Mitros, T.; Dirks, W.; Hellsten, U.; Putnam, N.; et al. Phytozome: A comparative platform for green plant genomics. Nucleic Acids Res. 2012, 40, D1178–D1186. [Google Scholar] [CrossRef]
  50. Yu, Y.; Zhang, H.; Long, Y.; Shu, Y.; Zhai, J. Plant Public RNA-seq Database: A comprehensive online database for expression analysis of ~45 000 plant public RNA-Seq libraries. Plant Biotechnol. J. 2022, 20, 806–808. [Google Scholar] [CrossRef]
  51. Xu, L.; Dong, Z.; Fang, L.; Luo, Y.; Wei, Z.; Guo, H.; Zhang, G.; Gu, Y.Q.; Coleman-Derr, D.; Xia, Q.; et al. OrthoVenn2: A web server for whole-genome comparison and annotation of orthologous clusters across multiple species. Nucleic Acids Res. 2019, 47, W52–W58. [Google Scholar] [CrossRef]
Figure 1. Population structure of 357 soybean landraces. (A) A bar plot diagram showing the results of clustering analysis when the number of subgroups (K) = 3. The colors blue, green, and red represent separate groups with different levels of admixture. (B) A phylogenetic tree of 357 soybean landraces. (C) Principal component plot for the 357 soybean landraces. (D) Genome-wide linkage disequilibrium (LD) decay for all 357 soybean landraces. R2 indicates the squared allele frequency correlations between all pairs of SNP markers. The X-axis indicates the distance between marker pairs.
Figure 1. Population structure of 357 soybean landraces. (A) A bar plot diagram showing the results of clustering analysis when the number of subgroups (K) = 3. The colors blue, green, and red represent separate groups with different levels of admixture. (B) A phylogenetic tree of 357 soybean landraces. (C) Principal component plot for the 357 soybean landraces. (D) Genome-wide linkage disequilibrium (LD) decay for all 357 soybean landraces. R2 indicates the squared allele frequency correlations between all pairs of SNP markers. The X-axis indicates the distance between marker pairs.
Ijms 24 00873 g001
Figure 2. Manhattan and quantile-quantile (QQ) plots of the 357 soybean landraces using the MLM model. (A) Root average diameter (DIAM) and (B) link average diameter (LAD). The X-axis represents the chromosome number, and the Y-axis represents the −log10p) value. The threshold of 4.0 (Bonferroni correction) was adopted with the blue line in the Manhattan plots. The Manhattan and QQ plots are based on analyses of the association between 84,210 chromosomal SNPs and DIAM and LAD traits.
Figure 2. Manhattan and quantile-quantile (QQ) plots of the 357 soybean landraces using the MLM model. (A) Root average diameter (DIAM) and (B) link average diameter (LAD). The X-axis represents the chromosome number, and the Y-axis represents the −log10p) value. The threshold of 4.0 (Bonferroni correction) was adopted with the blue line in the Manhattan plots. The Manhattan and QQ plots are based on analyses of the association between 84,210 chromosomal SNPs and DIAM and LAD traits.
Ijms 24 00873 g002
Figure 3. Manhattan and quantile-quantile (QQ) plots of the 357 soybean landraces using the MLM model. (A) Link average length (LAL) and (B) total root length (LENGTH). The X-axis represents the chromosome number and the Y-axis represents the −log10(p) value. The threshold of 4.0 (Bonferroni correction) was adopted with the blue line in the Manhattan plots. The Manhattan and QQ plots are based on analyses of the associations between 84,210 chromosomal SNPs and DIAM and LAD traits.
Figure 3. Manhattan and quantile-quantile (QQ) plots of the 357 soybean landraces using the MLM model. (A) Link average length (LAL) and (B) total root length (LENGTH). The X-axis represents the chromosome number and the Y-axis represents the −log10(p) value. The threshold of 4.0 (Bonferroni correction) was adopted with the blue line in the Manhattan plots. The Manhattan and QQ plots are based on analyses of the associations between 84,210 chromosomal SNPs and DIAM and LAD traits.
Ijms 24 00873 g003
Figure 4. Manhattan and quantile-quantile (QQ) plots of the 357 soybean landraces using the MLM model. (A) Number of root forks (NF), (B) number of root tips (NT), and (C) root surface area (SA). The X-axis represents the chromosome number and the Y-axis represents the −log10(p) value. The threshold of 4.0 (Bonferroni correction) was adopted with the blue line in the Manhattan plots. The Manhattan plot and QQ plot is based on the association of 84,210 chromosomal SNPs with DIAM and LAD traits.
Figure 4. Manhattan and quantile-quantile (QQ) plots of the 357 soybean landraces using the MLM model. (A) Number of root forks (NF), (B) number of root tips (NT), and (C) root surface area (SA). The X-axis represents the chromosome number and the Y-axis represents the −log10(p) value. The threshold of 4.0 (Bonferroni correction) was adopted with the blue line in the Manhattan plots. The Manhattan plot and QQ plot is based on the association of 84,210 chromosomal SNPs with DIAM and LAD traits.
Ijms 24 00873 g004
Table 1. Significant SNPs identified for root traits using the MLM model.
Table 1. Significant SNPs identified for root traits using the MLM model.
TraitSNPChrPositionRefAltp Value−log10(p)
DIAMAX−90460045243709318GA8.60 × 10−5 4.07
AX−90505093344598254CG6.41 × 10−5 4.19
AX−90311611344782942GA7.53 × 10−5 4.12
AX−90308307344785822GA7.53 × 10−5 4.12
AX−90385325344786068AG7.53 × 10−5 4.12
AX−90442177344786975TC7.53 × 10−5 4.12
AX−90503377344787733AG7.53 × 10−54.12
AX−90345460344799953TA7.53 × 10−5 4.12
AX−90385554344815582CT7.76 × 10−5 4.11
AX−90513850610393414GA8.32 × 10−5 4.08
AX−90502675610469343GA8.32 × 10−54.08
AX−90329829610471655AG7.33 × 10−54.14
AX−90504776610472539TA8.32 × 10−54.08
AX−90434642610478559AG8.32 × 10−54.08
AX−90416589610495842AG8.32 × 10−54.08
AX−90426968610501950AG8.32 × 10−54.08
AX−905245091446332487TC8.41 × 10−54.08
AX−903322501446340213GT8.41 × 10−54.08
AX−903678871858632433TG5.02 × 10−54.3
AX−904022821858817483AG4.19 × 10−54.38
AX−904680471859089827AC5.72 × 10−54.24
LADAX−90311611344782942GA3.18 × 10−54.5
AX−90308307344785822GA3.18 × 10−54.5
AX−90385325344786068AG3.18 × 10−54.5
AX−90442177344786975TC3.18 × 10−54.5
AX−90503377344787733AG3.18 × 10−54.5
AX−90345460344799953TA3.18 × 10−54.5
AX−90513850610393414GA3.53 × 10−54.45
AX−90454547610396428AC6.94 × 10−54.16
AX−90502675610469343GA3.53 × 10−54.45
AX−90329829610471655AG2.86 × 10−54.54
AX−90504776610472539TA3.53 × 10−54.45
AX−90434642610478559AG3.53 × 10−54.45
AX−90416589610495842AG3.53 × 10−54.45
AX−90426968610501950AG3.53 × 10−54.45
AX−904812331235529714AG3.77 × 10−54.42
AX−904058181327859216TC4.91 × 10−54.31
AX−904057991327870065GA1.21 × 10−54.92
AX−904967471327877321GA8.44 × 10−54.07
AX−905219671327901991TG6.88 × 10−54.16
AX−903365111327939975CT9.79 × 10−54.01
AX−905245091446332487TC6.53 × 10−54.18
AX−903322501446340213GT6.53 × 10−54.18
AX−905029451633527990AG5.38 × 10−54.27
AX−904220711812128963GT5.35 × 10−54.27
AX−903678871858632433GA2.55 × 10−54.59
AX−904022821858817483AC5.28 × 10−65.28
AX−903651601860026624GA2.23 × 10−54.65
AX−904953751860153725AC9.16 × 10−54.04
LALAX−90523253242676896AC9.88 × 10−54.01
AX−90367890242742102TG7.71 × 10−65.11
AX−90407903242742896GA7.71 × 10−65.11
AX−90372917242754007TC7.71 × 10−65.11
AX−90456562540343307CA6.75 × 10−54.17
AX−9036525277177786AG2.20 × 10−54.66
AX−9047701877419551AG1.65 × 10−65.78
AX−90491184117212890GT4.76 × 10−54.32
AX−90348822117234607TG4.76 × 10−54.32
AX−90337775117234726TG4.36 × 10−54.36
AX−90498877117234867AG4.76 × 10−54.32
AX−904439851130156429CT7.06 × 10−54.15
AX−903403351130159162AG7.06 × 10−54.15
AX−903628271811624535GT9.16 × 10−54.04
AX−905136111811665000TC5.60 × 10−54.25
AX−903965751811670146GT5.60 × 10−54.25
LENGTHAX−9043186163169898GA6.34 × 10−54.2
AX−90446460114302823CT4.20 × 10−54.38
AX−90414551114333105CG4.20 × 10−54.38
AX−90472866114337689TG3.75 × 10−54.43
NFAX−9042852086172141GA5.20 × 10−54.28
AX−9046787886174931TC9.11 × 10−54.04
AX−9046882386176238CT9.11 × 10−54.04
AX−9041244286176371AG9.11 × 10−54.04
AX−9049353587168291CA8.54 × 10−54.07
AX−9045515687168816GT7.81 × 10−54.11
AX−9033199287170112CA2.84 × 10−54.55
AX−9036580494467094TC7.79 × 10−54.11
AX−9049528397691924AC9.53 × 10−54.02
AX−9044905997699360GT5.73 × 10−54.24
AX−9052039097709441TG8.40 × 10−54.08
AX−9043419797713796GA8.40 × 10−54.08
AX−9049806197719800AG6.01 × 10−54.22
AX−9041969797726552GA8.40 × 10−54.08
AX−9049215697734112AG8.40 × 10−54.08
AX−9049303997740050AG9.67 × 10−54.01
NTAX−90522644154922179GT8.83 × 10−54.05
AX−90466067154927068AC8.36 × 10−54.08
AX−90467825154937834TC8.83 × 10−54.05
AX−9043013697783387AG6.06 × 10−54.22
AX−9041983097795507GT6.06 × 10−54.22
AX−9031733197816745AG5.91 × 10−54.23
AX−9031386497826748CT5.91 × 10−54.23
AX−9043160497832817AG6.32 × 10−54.2
AX−9038260197840359GA6.06 × 10−54.22
AX−9031211397847844TG6.06 × 10−54.22
AX−9045590997848294AC6.06 × 10−54.22
AX−9030607897849987TC6.06 × 10−54.22
AX−90446460114302823CT2.85 × 10−54.54
AX−90414551114333105CG2.85 × 10−54.54
AX−90472866114337689TG2.52 × 10−54.6
AX−903255801336648560AT4.45 × 10−54.35
AX−903702861336650121AG4.45 × 10−54.35
AX−904578151336655953GA5.60 × 10−65.25
AX−904522761336656131GT4.45 × 10−54.35
AX−904805971336667146GA7.85 × 10−54.11
AX−903051621336668072AG7.85 × 10−54.11
AX−904334971336669907GA7.85 × 10−54.11
AX−903836502036657615GT6.56 × 10−54.18
SAAX−9042852086172141GA1.61 × 10−54.79
AX−9046787886174931TC2.30 × 10−54.64
AX−9046882386176238CT2.30 × 10−54.64
AX−9041244286176371AG2.30 × 10−54.64
REF, Reference allele based on the Williams 82 reference genome version Wm82. a2. v1; ALT, Alternative allele.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kim, S.-H.; Tayade, R.; Kang, B.-H.; Hahn, B.-S.; Ha, B.-K.; Kim, Y.-H. Genome-Wide Association Studies of Seven Root Traits in Soybean (Glycine max L.) Landraces. Int. J. Mol. Sci. 2023, 24, 873. https://doi.org/10.3390/ijms24010873

AMA Style

Kim S-H, Tayade R, Kang B-H, Hahn B-S, Ha B-K, Kim Y-H. Genome-Wide Association Studies of Seven Root Traits in Soybean (Glycine max L.) Landraces. International Journal of Molecular Sciences. 2023; 24(1):873. https://doi.org/10.3390/ijms24010873

Chicago/Turabian Style

Kim, Seong-Hoon, Rupesh Tayade, Byeong-Hee Kang, Bum-Soo Hahn, Bo-Keun Ha, and Yoon-Ha Kim. 2023. "Genome-Wide Association Studies of Seven Root Traits in Soybean (Glycine max L.) Landraces" International Journal of Molecular Sciences 24, no. 1: 873. https://doi.org/10.3390/ijms24010873

APA Style

Kim, S. -H., Tayade, R., Kang, B. -H., Hahn, B. -S., Ha, B. -K., & Kim, Y. -H. (2023). Genome-Wide Association Studies of Seven Root Traits in Soybean (Glycine max L.) Landraces. International Journal of Molecular Sciences, 24(1), 873. https://doi.org/10.3390/ijms24010873

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop