Next Article in Journal
Molecular and Cellular Advances in Endometriosis Research: Paving the Way for Future Directions
Next Article in Special Issue
Proteomic Analysis Reveals Salt-Tolerant Mechanism in Soybean Applied with Plant-Derived Smoke Solution
Previous Article in Journal
A Novel Mechanically Robust and Biodegradable Egg White Hydrogel Membrane by Combined Unidirectional Nanopore Dehydration and Annealing
Previous Article in Special Issue
Genome-Wide Identification, Characterization, and Expression Analysis of Long-Chain Acyl-CoA Synthetases in Carya illinoinensis under Different Treatments
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Association Mapping and Expression Analysis of the Genes Involved in the Wood Formation of Poplar

Co-Innovation Center for Sustainable Forestry in Southern China, Satae Key Laboratory of Tree Genetics and Breeding, Nanjing Forestry University, Nanjing 210037, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Int. J. Mol. Sci. 2023, 24(16), 12662; https://doi.org/10.3390/ijms241612662
Submission received: 6 July 2023 / Revised: 4 August 2023 / Accepted: 8 August 2023 / Published: 10 August 2023
(This article belongs to the Collection Feature Papers in Molecular Plant Sciences)

Abstract

:
Xylogenesis is a complex and sequential biosynthetic process controlled by polygenes. Deciphering the genetic architecture of this complex quantitative trait could provide valuable information for increasing wood biomass and improving its properties. Here, we performed genomic resequencing of 64 24-year-old trees (64 hybrids of section Aigeiros and their parents) grown in the same field and conducted full-sib family-based association analyses of two growth and six woody traits using GEMMA as a choice of association model selection. We identified 1342 significantly associated single nucleotide polymorphisms (SNPs), 673 located in the region upstream and downstream of 565 protein-encoding genes. The transcriptional regulation network of secondary cell wall (SCW) biosynthesis was further constructed based on the published data of poplar miRNA, transcriptome, and degradome. These provided a certain scientific basis for the in-depth understanding of the mechanism of poplar timber formation and the molecular-assisted breeding in the future.

1. Introduction

Wood, commonly referred to as secondary xylem, produced by the vascular cambium of woody plants, is an indispensable and renewable feedstock for industrial production and daily life. Wood in the vascular cambium that programs cell death (PCD) through the maturation of cambial derivatives is considered a sequential and complex biosynthetic process driven by the coordinated expression of numerous genes involved in cell division, cell expansion, secondary cell wall (SCW) thickening, and PCD [1,2,3]. The secondary xylem of forest trees is a complex tissue comprising tracheary elements, xylem parenchyma cells, and xylem fibers. Tracheids and xylem vessels are fundamental conducting elements for the transport of water and nutrients throughout the tree body and comprise dead hollow cells with patterned SCWs. Xylem parenchyma cells are usually closely connected to tracheary elements via the remnants of plasmodesmata fields, they participate in the maintenance of xylem transport capacity, and are responsible for the recovery of vascular and tracheid function after embolism events. Xylem fibers provide mechanical support to the tree body with evenly thickened SCWs. Wood formation is characterized by massive biopolymer synthesis and deposition of biopolymers in highly thickened lignified SCWs. SCWs mainly comprise cellulose, hemicellulose, and lignin, and the content, distribution, and arrangement of these biopolymers in SCWs have important effects on wood properties.
Deciphering the genetic architecture of xylogenesis could provide valuable information for increasing wood biomass and improving its properties. With the tremendous research focused on the secondary growth of forest tree growth, the anatomical, physiological, and molecular characteristics of each stage of xylogenesis have been continuously updated from a multi-dimensional perspective [4]. A multilevel transcriptional regulatory network with NAC(NAM, ATAF1/2 and CUC2)-MYB (myeloblastosis) at its core plays a pivotal role in SCW biosynthesis [5]. With the widespread application of high-throughput sequencing technologies and genetic manipulation approaches to forest trees, our understanding of SCW biosynthesis and wood properties continues to increase [6]. However, most of this information has been obtained from saplings grown in an artificial environment. Furthermore, knowledge on the genetic architecture of adult trees grown under field conditions is still lacking. Association genetics based on high-throughput single-nucleotide polymorphism (SNP) genotyping has shown unprecedented advantages in revealing the genetic regulation mechanisms of complex quantitative traits regulated by polygenes in forest trees grown under field conditions [7,8].
Association mapping, namely linkage disequilibrium (LD) mapping, can identify allelic variations or functional genes significantly associated with complex quantitative traits by inferring the correlation between phenotypic diversity and DNA polymorphisms generated during biological evolution and then analyzing the genetic effect of alleles on phenotypes. Quantitative trait locus (QTL) mapping can be further divided into targeted candidate gene association and genome-wide association studies (GWAS). The latter has been exploited in many plants because of the dramatic reduction in genome sequencing costs. GWAS for complex traits has made significant progress not only in broad-leaved species such as poplar and eucalyptus [8], but also in coniferous species such as pine [2,9] and Picea abies [10]. Intriguingly, family-based GWAS designs, which could reduce bias in the estimates of direct genetic effects, have been developed and successfully applied to humans [11], chrysanthemum [12], and eucalyptus [13].
Wood properties are complex quantitative traits controlled by polygenes and vary among species and genotypes. As an ecologically and economically important hardwood species, the species of the genus Populus in Salicaceae, collectively known as poplars, are widely distributed in the Northern Hemisphere, from the tropics to the far northern boreal latitudes. This genus comprises five taxonomically distinct sections: Leuce Duby, Tacamahaca Spach, Leucoides Spach, Aigeiros Duby, and Turanga Bunge. Among them, the section Aigeiros, comprising P. deltoides, P. nigra, and their hybrid P. × euramericana, is the most important member of Populus for plantation culture worldwide. Both pure species and their hybrids are important commercial timber species in the mid-latitude plate of the Northern Hemisphere with fast growth, straight trunks, and broad adaptability. Poplars are an important model tree species for studying wood formation. Recently, extensive research has been conducted on secondary xylem development through association mapping and genetic manipulation experiments, and the regulatory pathways involved in poplar SCW biosynthesis have been explored.
In this study, to decipher the genetic architecture of wood traits and the correlation between wood traits and other growth traits in hybrids of Populus section Aigeiros, 66 24-year-old trees (64 hybrids and their parents) grown in the same field were resequenced and tested for wood properties. Furthermore, a full-sib family-based GWAS was conducted using the general linear model (GLM) and linear mixed model (LMM). The transcriptional regulation network of these significantly associated genes was further constructed based on the published data of poplar miRNA, transcriptome, and degradome. These results provide new insights into the improvement and cultivation of large-diameter poplar timber plantations.

2. Results

2.1. Wood Property Variation of Large-Diameter Poplars

In this study, we analyzed two growth traits (diameter at breast height (DBH) and height (H)) and six wood property traits (fiber length (FL), fiber width (FW), double wall thickness (DWT), cell lumen diameter (CLD), air-dry density (AD), and basic density (BD)) in P. deltoides cv. I-69 (♀), P. × euramericana cv. I-45 (♂), and 64 F1 hybrids (Table 1). FL, FW, DWT, AD, BD, DBH, and H of I-69 were greater than those of I-45, whereas the CLD of I-69 was lower than I-45. The greatest phenotypic values of the eight traits in the F1 hybrids were greater than those in the two parents (I-69 and I-45), indicating transgressive segregation in the full-sib family. The coefficient of variation (CV, %) of the eight traits in the F1 population ranged from 3.69 (FW) to 14.74% (DBH). Two growth traits (DBH and H) had larger CV values than the other six wood property traits. This showed there were significant differences in growth traits among these clones.
The distribution of the eight traits exhibited an approximate bell shape (Figure 1). Using the Shapiro–Wilk test for the eight phenotypic datasets, the W statistic was close to 1 and p > 0.05 (Table S1), indicating that the datasets conformed to a normal distribution and were suitable for further GWAS analysis. The pairwise correlation analysis indicated there was no strong linear relationship among the eight traits, except the pair of AD and BD (Figure 1). The correlation between AD and BD was strong (r2 = 0.94), showing a significant correlation level.

2.2. Genome-Wide Variations in a Full-Sib Population

All approximately 671.6 Gbp raw reads (3,060,185,242) were generated through the Illumina whole-genome resequencing of all 66 clones. After filtering, 203,819,202 (51.5×) and 210,409,496 (53.4×) clean reads were obtained from parents I-69 and I-45, respectively. A total of 2,608,389,006 clean reads were generated for 64 F1 hybrids, with a mean sequencing depth of 9.9× each hybrid, ranging from 6.0 to 13.8 (Table S2). We identified 183,253 insertion variants, 214,337 deletion variants, and 7,800,813 SNPs from the clean sequencing data of 66 poplars (Table S3). A total of 397,590 InDels (insertions and deletions) were unevenly distributed across 19 poplar chromosomes, with an average density ranging from 749 (Chr19) to 1271 (Chr09) per Mbp (Figure S1). This also happens with SNPs and relatively more SNPs were located in the central region of all the chromosomes, except Chr08, Chr09, and Chr14 (Figure S2).
In total, 332,639 high-quality SNPs were selected under stringent quality control metrics for subsequent LD and GWAS analyses. The mean number of high-quality SNPs was 855 per Mbp window, ranging from 731 per Mbp on Chr19 to 963 per Mbp on Chr08 (Figure 2a). All selected SNPs were divided into six types of nucleotide substitution: A to G (G to A), C to T (T to C), A to T (T to A), A to C (C to A), G to T (T to G), and G to C (C to G). The ratio of transition (Ti) to transversion (Tv) is an indicator used to evaluate the quality of SNPs in a genome [14,15,16,17]. The Ti/Tv ratio of poplar SNPs was 2.44 (Figure S3), which is similar to that of rice [18], chicken [19], and humans [20].

2.3. Genetic Recombination and LD Decay

LD was estimated by calculating the squared correlation coefficient (r2) between intrachromosomal SNP pairs within the full-sib population. We plotted LD decay along the entire poplar genome using 332,639 high-quality SNVs in 66 poplar clones from a full-sib family. LD decay is an index of genetic recombination in the biparental population [21]. The LD decay distance can estimate the minimum number of molecular markers required for subsequent GWAS [22]. Randomly selected chromosomes have approximately the same LD decay pattern between them (Figure 2b). The average physical distance at which the LD value (r2) for the whole genome decayed below 0.2 was approximately 3 Kbp, which was like the LD decay distance of 2.6 Kbp (r2 = 0.2) in P. euphratica [23]. This indicates that at least 133,000 SNPs are necessary for further GWAS [24]. Thus, it was likely sufficient to perform a GWAS on a dataset of 332,639 high-quality SNPs.

2.4. Association Model Selection

Two classical GWAS models, LMM (e.g., GEMMA and GCTA) and GLM (e.g., GAPIT and PLINK), have been applied to map QTLs associated with complex traits in plants [25]. In this study, we used four tools, GEMMA, GCTA, GAPIT, and PLINK, to associate eight traits with 332,639 high-quality SNPs from a full-sib family. To compare the association analysis results using all four approaches, we selected the top 500 trait-associated SNPs based on the p-value. A total of 723 SNPs were associated with FL, 775 SNPs for FW, 870 SNPs for DWT, 924 SNPs for CLD, 807 SNPs for AD, 902 SNPs for BD, 777 SNPs for DBH, and 778 SNPs for H. The number of SNPs identified using all four methods ranged from 172 (CLD) to 311 (FL) (Figure 3). The numbers of SNPs identified by a single tool were 219 SNPs for FL, 255 SNPs for FW, 359 SNPs for DWT, 404 SNPs for CLD, 282 SNPs for AD, BD, 259 SNPs for DBH, and 257 SNPs for H. The similarity between the association results obtained using the four tools was the difference between the eight trait datasets (Figure 3).
The QQ (Q-Q) plot and the genomic inflation factor (λ statistic) are two frequently used indicators for evaluating GWAS models/tools [26,27]. No one method is significantly better than the others for all eight traits (Figure 4). However, similar results were observed in the genomic inflation factor (λ statistic) analysis. GEMMA obtained a λ value closer to one in six traits (e.g., DWT, CLD, AD, BD, DBH, and H) than the other three association tools (Table 2). Although the best λ values for the remaining two traits (FL and FW) were obtained from the GAPIT.GLM results, the top two λ values were close to those of the GEMMA results. The average λ (1.04) value of the eight traits using GEMMA was closer to 1 than those of the other three association methods (Table 2). Therefore, we selected GEMMA as the GWAS analysis tool, and the GEMMA association results were used for further analysis.

2.5. Full-Sib Population-Based Association Mapping

In this study, we identified 1342 significantly associated SNPs for all eight traits using GEMMA below the threshold of p < 0.0005 (Figure 5). In total, 376 and 370 SNPs were significantly associated with DBH and H, respectively. The number of two growth-trait-related SNPs was greater than the six wood traits (117 for FL, 138 for FW, 116 for DWT, 121 for CLD, 120 for AD, and 69 for BD). Approximately half (673) of the 1342 significant SNPs were related to the 565 protein-encoding genes in the P. trichocarpa (v4.1) genome. There were more genes associated with growth traits (177 genes for DBH and 129 genes for H) than with the six wood traits (58 genes for FL, 48 genes for FW, 50 genes for DWT, 54 genes for CLD, 62 genes for AD, and 31 genes for BD) (Table S4).
In total, 342 of the 565 significant genes were assigned functional information in the functional annotation file of the P. trichocarpa (v4.1) genome. Many genes belong to the same gene family, including four R2R3-type MYB and three LAC (laccase) family genes. A total of 41 genes were significantly associated with two traits (AD and BD, DBH and H, and DWT and CLD), and only one gene was significantly associated with three traits (CLD, AD, and BD) (Table S5). There were 18 genes associated with AD and BD traits, possibly due to the high correlation coefficient (r2 = 0.94). A total of 12 genes associated with DBH and H were identified, due in part to the high similarity in the morphogenesis of DBH and H. The GO and KEGG annotations of the 565 significant genes are listed in Tables S6 and S7. In the GO enrichment analysis, for wood traits, enrichment was in hydrolase activity, acting on ester bonds, ribosome, nucleosome, structural constituent of ribosome, and translation pathway, and for growth traits were enriched in peroxisome, cis-Golgi network, endoplasmic reticulum to Golgi vesicle-mediated transport, cytochrome-c oxidase activity, and ADP binding pathway (Table S6). In the KEGG enrichment analysis, wood traits were mainly enriched in ribosome, endocytosis, isoquinoline alkaloid biosynthesis, lysine degradation, sesquiterpenoid, and the triterpenoid biosynthesis pathway, while for growth traits, the main enrichments were in cysteine and methionine metabolism, glyoxylate and dicarboxylate metabolism, and the endocytosis pathway (Table S7).

2.6. Transcriptional Regulatory Network for SCW Biosynthesis

We adopted fragments per kilobase of transcript per million mapped reads (FPKM) values, which were used to measure the expression levels of these candidate genes in two poplar clones: CL290 (clone 290 with fast growing) and CL33 (clone 33 with slow growing). Of the 565 candidate genes, 143 (25.31%) and 165 (29.20%) were highly expressed (Expression > 10) in CL290 and CL33 cells, respectively (Figure S3, Table S8). We identified 38 differentially expressed genes (DEGs) (log2 FoldChange > 2) between CL290 and CL33 cells. A total of 21 and 17 genes were upregulated in CL290 and CL33 cells, respectively (Table S9).
A total of 119 candidate genes were regulated by 128 miRNAs (Figure S4), which comprised 270 gene-miRNA regulatory pairs (Table S10). There were 51 candidate genes targeted by more than one miRNA. For example, the gene Potri.003G050100, which encodes the HD-ZIP protein, is regulated by 14 members of the miR166 family, suggesting that the miR166-HD-ZIP module could play an important role in poplar wood formation. Potri.006G054100, which belongs to the superfamily of ARM repeat proteins, is targeted by six members of the miR167 family. The NAC-containing gene Potri.012G001400 interacted with five members of the miR164 family (Figure 6).

3. Discussion

Population structure is a frequent consideration in GWAS, and the greater the differences between populations, the easier it is for GWAS to identify them. Therefore, most contemporary studies usually aim to sample natural populations with rich genetic diversity, which is more suitable for GWAS [28]. However, it was found that growth variation in the offspring of artificial interspecific crosses was significantly genetically different, showing a great deal of variability [29]. Therefore, artificial populations of half-sib or full-sib families with a clear genetic background and rich variation have become widely studied in forest tree breeding [30,31,32]. In addition, although the genomic recombination rate in the artificial breeding population is low, it can amplify the frequency of rare mutation sites and detect some additional significant associations through adjusting the threshold of the p-value [32], also a more convenient and simpler method of preserving germplasm resources than natural populations [33]. Therefore, there is potential and feasibility to utilize a full-sib artificial hybrid population of P. deltoides with P. × euramericana for GWAS in this study.
LD is related to genetic recombination and is used to estimate the number of markers required for GWAS [34]. The resolution and number of markers required for GWAS depend on the extent of LD [35]. The LD decay was different among species in the genus Populus, an LD decay ranged from 1.4 to 5.1 Kbp (r2 threshold of 0.2) was reported in P. deltoides [36], and an LD decay of 2.6 Kbp was reported in P. euphratica at r2 = 0.2 [23]. In this study, the LD value (r2) for the whole genome decayed below 0.2 was approximately 3 kb, indicating that at least 134,000 markers were required [37]. This shows that 332,639 SNPs met the number of markers required for full-sib family-based GWAS. The QQ plot and genomic inflation λ are two metrics commonly used to select an optimization association model [26,27]. Yang et al. selected a model to associate with the leaves of the F1 pedigree of P. deltoides and P. simonii based on good genomic control (genomic inflation equal to 1) [32]. Furthermore, Zhang et al. selected the most effective model for poplar GWAS based on the degree of deviation between the observed and expected p values [34]. However, the QQ plot and λ metric suggested that no one association tool could surpass the performance of the other three tools for the eight traits, due to the possible absence of population stratification and the presence of full-sib kinship in the I-69 and I-45 families [38]. We selected GEMMA as the final GWAS model based on the average λ metric for all eight traits. GEMMA is a GWAS toolkit to rapidly implement an LMM, with population structure as a fixed effect and kinship as a random effect [39,40].
Wood formation is a complex process in which transcription factors play important regulatory roles in various processes of xylem development, including transcription factor families such as HD-ZIP, MYB, and NAC. The MYB family, as one of the largest families of transcription factors in plants, not only plays a role in plant stress, but also plays an important regulatory role in xylem development, and several MYB transcription factors related to wood formation were associated in this study. Among them, MYB118 has been verified in Arabidopsis thaliana to have the ability to induce plant-to-embryo transition, the formation of somatic embryos from root explants [41], and to play a regulatory role in xylem conduit development [42]. MYB59 is identified in A. thaliana to be involved in the transcriptional regulation of secondary xylem formation [43]. Several transcription factors have also been identified in poplar associated with secondary wall development, such as Poplar PtrMYB2, PtrMYB3, PtrMYB20, and PtrMYB21 [44]; (Homologs of MYB46 and MYB83 in A. thaliana), PtrMYB55, and PtrMYB121 [45]; (Homologs of MYB55 in A. thaliana), PtoMYB92, and PtoMYB125 [46]; (Homologs of MYB85 in A. thaliana), PtoMYB170 [47], and PtoMYB216 [48]; and (Homologs of MYB61 in A. thaliana) were identified to be involved in regulating secondary cell wall formation. Conversely, PtrMYB6 [49], PtoMYB156 [50], PtrMYB189 [51], and PdMYB221 [52] had inhibitory effects on lignin accumulation. Previous studies have shown that, like MYB genes, NAC transcription factors play an important regulatory role in xylem development [53]. The NAC080 associated in this study was identified as a transcription factor related to lignin biosynthesis in Brassica napus [54]. In poplar, PtrWND 1A, PtrWND 1B, PtrWND 2A, and PtrWND 2B [55] (Homologs of NST 1, NST 2, and SND 1 in A. thaliana), were identified to affect secondary wall thickness. Peroxidase (Pox), laccase (LAC), and 4-coumarate-CoA ligase (4CL) are key genes involved in lignin synthesis. The laccase gene LAC17, identified in this study, was characterized in A. thaliana for the lignification of vascular cells [56]. The 4CL1 gene has been identified as being involved in lignin biosynthesis based on molecular and reverse genetic characterization in P. tremula × P. alba ‘INRA-France 717-1B4′ [57]. MiRNAs that regulate lignin biosynthesis have been reported in many plants, including maize [58], Arabidopsis [59], and poplar [60]. In this study, one NAC gene, one LAC gene, and two Pox protein-encoding genes were regulated by miRNAs. These gene-miRNA regulatory pairs may play important roles in wood formation.
In this study, we performed resequencing using GWAS on 64 full-sibling families derived from a cross between P. deltoides cv. I-69 and P. × euramericana cv. I-45. We determined six wood traits (FL, FW, DWT, CLD, AD, and BD) and two growth traits (H and DBH), and identified several SNP loci associated with these material traits. Additionally, we constructed a transcriptional regulatory network of SCW biosynthesis through transcriptome analysis. These findings provide a solid scientific foundation for gaining a deeper understanding of the mechanism behind poplar timber formation and for future molecular-assisted breeding efforts.

4. Materials and Methods

4.1. Plant Materials

In the 1970s, excellent clones of section Aigeiros from Europe and the United States were introduced into China. They have had strong adaptability and rapid growth in the plains of Huanghuai and Jianghuai, and in the middle and lower reaches of the Yangtze River [61]. Since then, researchers have conducted a series of artificial hybridizations of the section Aigeiros. The full-sib population used in this study was derived from a cross between P. deltoides cv. I-69 and P. × euramericana cv. I-45 in the early 1980s. The full-sib progenies, along with I-69 and I-45, were planted and spaced in a 6 × 6 m configuration at the Zhangji Forest Farm (34.14° N, 117.38° W) in Jiangsu Province, China. In total, 66 24-year-old trees (64 progenies and 2 parents) were selected for genome resequencing and wood property measurement.

4.2. Wood Property Determination

For each clone, discs (50 mm thick) were collected at ground level and at 1.3 m. The wood was sampled in two ways, the first was to take wedge-shaped strips of the discs cut radially through the center of the circle, and then split the wedge-shaped blocks into matchstick size for fiber morphology determination. The second was to take 2 cm × 2 cm strips from the center of the circle, and then quintupling the samples into five pieces of wood, including early wood and late wood, for the determination of BD and AD.
To measure fiber morphology using the Franklin method [62], the wood samples were put into a beaker and repeatedly boiled in water until the wood strips sank. The water was discarded and glacial acetic acid was added to the beaker in the volume ratio of 1:1. Hydrogen peroxide was mixed into the solution for dissociation and placed at room temperature for one week. Once the specimens became white and fluffy, they were removed and the acid was discarded. The specimens were washed with deionized water 4–5 times until the acid was washed out and the specimens were neutral. The specimens were broken into single fibers with a disperser, and then the instrument Optest FQAII (OpTest Equipment Inc., Hawkesbury, ON, Canada) was used to set up 2000 fibers, and FL, FW, DWT, and CLD were determined according to the operation procedure. The average of 3 groups was taken for each sample.
BD was determined by soaking the wood blocks in deionized water, changing the water every 2 days to prevent rotting and deterioration, for 2 weeks until the samples were saturated with water. The samples were taken out and the volume of the wood blocks was read on a GP-120W Wood Basic Density Tester (MatsuHaku Inc., Taiwan, China) (accurate to 0.001 cm3). The samples were then placed in a BINDER/ED Series oven (Binder Inc., Tuttlingen, Germany) and dried at 105 °C until constant weight (about 48 h), and then quickly weighed with an L-IC precision electronic balance (METTLER TOLEDO Inc., Greifensee, Switzerland) (accurate to 0.0001 g) after drying and cooling. BD was calculated according to the following formula: ρ = m0/Vmax where ρ is the basic density of the wood block (g/cm3); m0 is the weight of the wood block at full dryness (g); and Vmax is the saturated volume of the wood block (cm3).
AD was determined by placing the wood blocks at room temperature to constant weight (27 °C, 1–2 months) and measuring AD using the sealing wax method on a GP-120W Wood Basic Density Tester (accurate to 0.001 g).

4.3. Genome Resequencing and SNP Genotyping

Genomic DNA was extracted from young leaves using the DNeasy Plant Mini Kit (Qiagen, Hilden, Germany). DNA libraries were constructed and sequenced on an Illumina HiSeq2000 platform to generate raw paired-end (PE) reads. Raw reads were trimmed using Fastp (v.0.23.2) [63] with the following parameter settings: (a) reads with ≤5 unidentified nucleotides (N), (b) trimmed reads with a minimum length of 60 bp, and (c) reads with >40% bases having a minimal Phred-scaled quality of 15. The trimmed reads were mapped against the P. trichocarpa (v4.1, https://phytozome.jgi.doe.gov accessed on 16 July 2022) genome using bwa-mem2 (v.2.0pre2, https://github.com/bwa-mem2/bwa-mem2 accessed on 16 July 2022). Sequence alignment format (SAM) files were converted to binary SAM (BAM) files using SAMtools (v.1.9) [64]. PCR duplicates were eliminated using MarkDuplicates implemented in the Picard software package (v.2.27.4; https://github.com/broadinstitute/picard/releases/ accessed on 16 July 2022). Raw DNA variants were called using mpileup implemented in BCF tools (v.1.9; http://samtools.github.io/bcftools/ accessed on 16 July 2022). Tri- and tetra-allelic SNPs were discarded from the raw variants. The identified variants were filtered using BCFtools under the following parameters: (a) SNPs within 5 bp of an InDel, (b) SNPs with a missing rate of >5%, (c) SNP with a <5% minor allele frequency (MAF), (d) SNPs detected at an average depth of <5×, and (e) Hardy–Weinberg equilibrium (HWE) test p > 10−6 using PLINK (v.1.9, https://www.cog-genomics.org/plink/ accessed on 17 July 2022) [65].

4.4. LD Decay Analysis

These high-quality SNPs in the VCF format were used for the LD decay analysis using PopLDdecay (v.3.42) with default parameters [66]. The average r2 value between the SNP markers was assessed for each of the 19 chromosomes and the entire genome within the full-sib population. The LD decay was plotted using the distance against the average r2 value within 50-Kbp windows the R package ggplot2 (v.3.3.6).

4.5. Statistical Algorithms

Four software programs—GLM of GAPIT (v.3.0) [67], GLM of PLINK [65], LMM of GEMMA (v.0.98.5) [68], and LMM of GCTA (v.1.94.1) [69]—were used for the LD analysis. p < 0.0005 was considered significant. Manhattan and quantile–quantile plots (QQ plot) were visualized using the R package CMplot (v.4.1.0, https://github.com/YinLiLin/CMplot accessed on 5 August 2022).
The top 500 SNPs were selected from the eight traits to compare the models. The intersection of the top SNPs was visualized using the UpSetR R package (v.1.4.0) [70]. The most appropriate method for our full-sib family was selected using the average genomic inflation factor (λ statistic). The genomic inflation factor (λ statistic) had a wide range of values for different methods, suggesting that different methods can affect the accuracy of the GWAS. The λ statistic value was calculated based on the result of the GWAS analysis by determining a ratio between the median of the resulting chi-squared test statistics and the expected median of the chi-squared distribution in by R package GWAS.utils (v.0.1, https://github.com/sinarueeger/GWAS.utils accessed on 5 August 2022) [71,72].

4.6. Enrichment and Transcriptional Expression Analysis

The potential function annotation, Gene Ontology (GO), and Kyoto Encyclopedia of Genes and Genomes (KEGG) information were obtained from the P. trichocarpa v4.1 assembly and annotation. Enrichment analyses were performed using the ClusterProfiler R package (v4.2.2; https://github.com/YuLab-SMU/clusterProfiler accessed on 5 August 2022) [72]. Gene network analysis was performed on candidate genes using STRING (v.11.5, https://cn.string-db.org/ accessed on 5 August 2022). Transcriptome, sRNA, and degradome data were obtained from a previous study [73].

5. Conclusions

In this study, we conducted an association study of two growth traits and six woody traits of 66 24-year-old clones from an Aigioros full-sib family, using high-quality 332,639 SNPs exceeding the LD-based estimate of markers required for GWAS. We identified 1342 SNPs and 565 protein-encoding genes significantly associated with the eight poplar traits. Published poplar multiomics data (e.g., miRNA, transcriptome, and degradome) have demonstrated that some of the significantly associated genes, such as NAC, R2R3-type MYB, and lignin-related genes, could be involved in wood formation. These results elucidate information for dissecting the molecular mechanism of wood formation and developing new poplar varieties with large-diameter timber.

Supplementary Materials

The supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms241612662/s1.

Author Contributions

M.X. and H.P. conceived the project and designed the entire experiment. Y.W., H.Z. and S.Z. carried out GWAS analysis and prepared this manuscript. T.S. conducted SNP annotation and GO/KEGG enrichment analysis. H.P. and H.Z. performed trait measurement. M.X. revised the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Key Research and Development Program of China [grant number 2021YFD2201205], the National Natural Science Foundation of China [grant number 31971679], the Natural Science Foundation of the Jiangsu Higher Education Institutions of China [grant number 19KJB180001], and the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Sequencing data generated in the study are available in the NCBI Sequence Read Archive (SRA) under BioProject accession PRJNA1001796.

Acknowledgments

We sincerely thank Novogene Technology Co., Ltd., Beijing, China, for performing the high throughput sequencing. The authors were also grateful to the anonymous reviewers and editors for their constructive comments.

Conflicts of Interest

The authors declare that they have no conflict of interest.

References

  1. Shen, H.; Xing, X.; Guan, Y.; Zhou, L.; Liu, S.; Gao, H. Radial variation studies on wood properties of Populus deltoides parents and their hybrids. BioResources 2021, 16, 4905. [Google Scholar] [CrossRef]
  2. Ding, X.; Diao, S.; Luan, Q.; Wu, H.X.; Zhang, Y.; Jiang, J. A transcriptome-based association study of growth, wood quality, and oleoresin traits in a slash pine breeding population. PLoS Genet. 2022, 18, e1010017. [Google Scholar] [CrossRef] [PubMed]
  3. Plomion, C.; Leprovost, G.; Stokes, A. Wood Formation in Trees. Plant Physiol. 2001, 127, 1513–1523. [Google Scholar] [CrossRef] [PubMed]
  4. Zhang, J.; Nieminen, K.; Serra, J.A.A.; Helariutta, Y. The formation of wood and its control. Curr. Opin. Plant Biol. 2014, 17, 56–63. [Google Scholar] [CrossRef] [PubMed]
  5. Nakano, Y.; Yamaguchi, M.; Endo, H.; Rejab, N.A.; Ohtani, M. NAC-MYB-based transcriptional regulation of secondary cell wall biosynthesis in land plants. Front. Plant Sci. 2015, 6, 288. [Google Scholar] [CrossRef] [Green Version]
  6. Zhang, J.; Xie, M.; Tuskan, G.A.; Muchero, W.; Chen, J.-G. Recent Advances in the Transcriptional Regulation of Secondary Cell Wall Biosynthesis in the Woody Plants. Front. Plant Sci. 2018, 9, 1535. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Cortés, A.J.; Restrepo-Montoya, M.; Bedoya-Canas, L.E. Modern Strategies to Assess and Breed Forest Tree Adaptation to Changing Climate. Front. Plant Sci. 2020, 11, 583323. [Google Scholar] [CrossRef]
  8. Du, Q.; Lu, W.; Quan, M.; Xiao, L.; Song, F.; Li, P.; Zhou, D.; Xie, J.; Wang, L.; Zhang, D. Genome-Wide Association Studies to Improve Wood Properties: Challenges and Prospects. Front. Plant Sci. 2018, 9, 1912. [Google Scholar]
  9. Weiss, M.; Sniezko, R.A.; Puiu, D.; Crepeau, M.W.; Stevens, K.; Salzberg, S.L.; Langley, C.H.; Neale, D.B.; De La Torre, A.R. Genomic basis of white pine blister rust quantitative disease resistance and its relationship with qualitative resistance. Plant J. 2020, 104, 365–376. [Google Scholar] [CrossRef]
  10. Chen, Z.-Q.; Zan, Y.; Milesi, P.; Zhou, L.; Chen, J.; Li, L.; Cui, B.; Niu, S.; Westin, J.; Karlsson, B.; et al. Leveraging breeding programs and genomic data in Norway spruce (Picea abies L. Karst) for GWAS analysis. Genome Biol. 2021, 22, 179. [Google Scholar] [CrossRef]
  11. Howe, L.J.; Nivard, M.G.; Morris, T.T.; Hansen, A.F. Within-sibship genome-wide association analyses decrease bias in estimates of direct genetic effects. Nat. Genet. 2022, 54, 581–592. [Google Scholar] [PubMed]
  12. Sumitomo, K.; Shirasawa, K.; Isobe, S.; Hirakawa, H.; Harata, A.; Nakano, M.; Nakano, Y.; Yagi, M.; Hisamatsu, T.; Yamaguchi, H.; et al. A genome-wide association and fine-mapping study of white rust resistance in hexaploid chrysanthemum cultivars with a wild diploid reference genome. Hortic. Res. 2022, 9, uhac170. [Google Scholar] [PubMed]
  13. Ghosh Dasgupta, M.; Abdul Bari, M.P.; Shanmugavel, S.; Dharanishanthi, V.; Muthupandi, M.; Kumar, N.; Chauhan, S.S.; Kalaivanan, J.; Mohan, H.; Krutovsky, K.V.; et al. Targeted re-sequencing and genome-wide association analysis for wood property traits in breeding population of Eucalyptus tereticornis × E. grandis. Genomics 2021, 113, 4276–4292. [Google Scholar] [PubMed]
  14. Park, S.; Kumar, P.; Shi, A.; Mou, B. Population genetics and genome-wide association studies provide insights into the influence of selective breeding on genetic variation in lettuce. Plant Genome 2021, 14, e20086. [Google Scholar] [PubMed]
  15. Wang, J.; Raskin, L.; Samuels, D.C.; Shyr, Y.; Guo, Y. Genome measures used for quality control are dependent on gene function and ancestry. Bioinformatics 2015, 31, 318–323. [Google Scholar] [PubMed] [Green Version]
  16. Guo, M.; Yue, W.; Samuels, D.C.; Yu, H.; He, J.; Zhao, Y.-Y.; Guo, Y. Quality and concordance of genotyping array data of 12,064 samples from 5840 cancer patients. Genomics 2019, 111, 950–957. [Google Scholar] [CrossRef] [PubMed]
  17. Guo, Y.; Long, J.; He, J.; Li, C.-I.; Cai, Q.; Shu, X.-O.; Zheng, W.; Li, C. Exome sequencing generates high quality data in non-target regions. BMC Genom. 2012, 13, 194. [Google Scholar]
  18. Yang, G.; Luo, W.; Zhang, J.; Yan, X.; Du, Y.; Zhou, L.; Li, W.; Wang, H.; Chen, Z.; Guo, T. Genome-Wide Comparisons of Mutations Induced by Carbon-Ion Beam and Gamma-Rays Irradiation in Rice via Resequencing Multiple Mutants. Front. Plant Sci. 2019, 10, 1514. [Google Scholar]
  19. Han, W.; Xue, Q.; Li, G.; Yin, J.; Zhang, H.; Zhu, Y.; Xing, W.; Cao, Y.; Su, Y.; Wang, K.; et al. Genome-wide analysis of the role of DNA methylation in inbreeding depression of reproduction in Langshan chicken. Genomics 2020, 112, 2677–2687. [Google Scholar]
  20. Thami, P.; Choga, W.; Mulisa, D.; Dandara, C.; Shevchenko, A.; Leteane, M.; Novitsky, V.; O’Brien, S.; Essex, M.; Gaseitsiwe, S.; et al. Whole Genome Sequencing-based Characterization of Human Genome Variation and Mutation Burden in Botswana. bioRxiv 2020, 1, 1–34. [Google Scholar] [CrossRef]
  21. Halkett, F.; Simon, J.-C.; Balloux, F. Tackling the population genetics of clonal and partially clonal organisms. Trends Ecol. Evol. 2005, 20, 194–201. [Google Scholar] [PubMed]
  22. Meng, B.; Wang, T.; Luo, Y.; Guo, Y.; Xu, D.; Liu, C.; Zou, J.; Li, L.; Diao, Y.; Gao, Z.; et al. Identification and Allele Combination Analysis of Rice Grain Shape-Related Genes by Genome-Wide Association Study. Int. J. Mol. Sci. 2022, 23, 1065. [Google Scholar] [PubMed]
  23. Jia, H.; Liu, G.; Li, J.; Zhang, J.; Sun, P.; Zhao, S.; Zhou, X.; Lu, M.; Hu, J. Genome resequencing reveals demographic history and genetic architecture of seed salinity tolerance in Populus euphratica. J. Exp. Bot. 2020, 71, 4308–4320. [Google Scholar] [PubMed]
  24. Alqudah, A.M.; Sallam, A.; Stephen Baenziger, P.; Börner, A. GWAS: Fast-forwarding gene identification and characterization in temperate Cereals: Lessons from Barley—A review. J. Adv. Res. 2020, 22, 119–135. [Google Scholar] [PubMed]
  25. Tibbs Cortes, L.; Zhang, Z.; Yu, J. Status and prospects of genome-wide association studies in plants. Plant Genome 2021, 14, e20077. [Google Scholar] [PubMed]
  26. Monnot, S.; Desaint, H.; Mary-Huard, T.; Moreau, L.; Schurdi-Levraud, V.; Boissot, N. Deciphering the Genetic Architecture of Plant Virus Resistance by GWAS, State of the Art and Potential Advances. Cells 2021, 10, 3080. [Google Scholar]
  27. Li, X.; Nie, C.; Liu, Y.; Chen, Y.; Lv, X.; Wang, L.; Zhang, J.; Li, K.; Jia, Y.; Ban, L.; et al. A genome-wide association study explores the genetic determinism of host resistance to Salmonella pullorum infection in chickens. Genet. Sel. Evol. GSE 2019, 51, 1–12. [Google Scholar]
  28. Korte, A.; Farlow, A. The advantages and limitations of trait analysis with GWAS: A review. Plant Methods 2013, 9, 29. [Google Scholar]
  29. Zhang, X.F.; Li, H.G.; You, L.X.; Xao, J. Variation and genetic stability of two-year-old liriodendron seedling growth for 39 mating combinations. J. Zhejiang A F Univ. 2011, 28, 103–108. [Google Scholar]
  30. Chen, Y.; Wu, H.; Yang, W.; Zhao, W.; Tong, C. Multivariate linear mixed model enhanced the power of identifying genome-wide association to poplar tree heights in a randomized complete block design. G3 Genes Genomes Genet. 2021, 11, jkaa053. [Google Scholar]
  31. Lu, N.; Zhang, M.; Xiao, Y.; Han, D.; Liu, Y.; Zhang, Y.; Yi, F.; Zhu, T.; Ma, W.; Fan, E.; et al. Construction of a high-density genetic map and QTL mapping of leaf traits and plant growth in an interspecific F1 population of Catalpa bungei × Catalpa duclouxii Dode. BMC Plant Biol. 2019, 19, 596. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  32. Yang, W.; Yao, D.; Wu, H.; Zhao, W.; Chen, Y.; Tong, C. Multivariate genome-wide association study of leaf shape in a Populus deltoides and P. simonii F1 pedigree. PLoS ONE 2021, 16, e0259278. [Google Scholar]
  33. Li, W.; Boer, M.P.; Zheng, C.; Joosen, R.V.L.; van Eeuwijk, F.A. An IBD-based mixed model approach for QTL mapping in multiparental populations. Theor. Appl. Genet. 2021, 134, 3643–3660. [Google Scholar] [CrossRef]
  34. Zhang, Q.; Su, Z.; Guo, Y.; Zhang, S.; Jiang, L.; Wu, R. Genome-wide association studies of callus differentiation for the desert tree, Populus euphratica. Tree Physiol. 2020, 40, 1762–1777. [Google Scholar] [CrossRef] [PubMed]
  35. Nyaga, C.; Gowda, M.; Beyene, Y.; Muriithi, W.T.; Makumbi, D.; Olsen, M.S.; Suresh, L.M.; Bright, J.M.; Das, B.; Prasanna, B.M. Genome-Wide Analyses and Prediction of Resistance to MLN in Large Tropical Maize Germplasm. Genes 2020, 11, 16. [Google Scholar]
  36. Fahrenkrog, A.M.; Neves, L.G.; Resende, M.F.R., Jr.; Dervinis, C.; Davenport, R.; Barbazuk, W.B.; Kirst, M. Population genomics of the eastern cottonwood (Populus deltoides). Ecol. Evol. 2017, 7, 9426–9440. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  37. Lu, Y.; Shah, T.; Hao, Z.; Taba, S.; Zhang, S.; Gao, S.; Liu, J.; Cao, M.; Wang, J.; Prakash, A.B.; et al. Comparative SNP and Haplotype Analysis Reveals a Higher Genetic Diversity and Rapider LD Decay in Tropical than Temperate Germplasm in Maize. PLoS ONE 2011, 6, e24861. [Google Scholar]
  38. George, A.W.; Cavanagh, C. Genome-wide association mapping in plants. Theor. Appl. Genet. 2015, 128, 1163–1174. [Google Scholar]
  39. Vogt, F.; Shirsekar, G.; Weigel, D. vcf2gwas: Python API for comprehensive GWAS analysis using GEMMA. Bioinformatics 2022, 38, 839–840. [Google Scholar]
  40. Hoffman, G.E. Correcting for Population Structure and Kinship Using the Linear Mixed Model: Theory and Extensions. PLoS ONE 2013, 8, e75707. [Google Scholar]
  41. Wang, X.; Niu, Q.W.; Teng, C.; Li, C.; Mu, J.; Chua, N.H.; Zuo, J. Overexpression of PGA37/MYB118 and MYB115 promotes vegetative-to-embryonic transition in Arabidopsis. Cell Res. 2009, 19, 224–235. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  42. Nakano, Y. MYB transcription factors orchestrating the developmental program of xylem vessels in Arabidopsis roots. Plant Biotechnol. 2010, 27, 267–272. [Google Scholar]
  43. Oh, S.; Park, S.; Han, K.H. Transcriptional regulation of secondary growth in Arabidopsis thaliana. J. Exp. Bot. 2003, 54, 2709–2722. [Google Scholar] [PubMed] [Green Version]
  44. McCarthy, R.L.; Zhong, R.; Fowler, S.; Lyskowski, D.; Piyasena, H.; Carleton, K.; Spicer, C.; Ye, Z.H. The poplar MYB transcription factors, PtrMYB3 and PtrMYB20, are involved in the regulation of secondary wall biosynthesis. Plant Cell Physiol. 2010, 51, 1084–1090. [Google Scholar] [PubMed] [Green Version]
  45. Liu, Y.; Man, J.; Wang, Y.; Yuan, C.; Shi, Y.; Liu, B.; Hu, X.; Wu, S.; Zhang, T.; Lian, C. Overexpression of PtrMYB121 Positively Regulates the Formation of Secondary Cell Wall in Arabidopsis thaliana. Int. J. Mol. Sci. 2020, 21, 7734. [Google Scholar] [PubMed]
  46. Li, C.; Wang, X.; Ran, L.; Tian, Q.; Fan, D.; Luo, K. PtoMYB92 is a Transcriptional Activator of the Lignin Biosynthetic Pathway During Secondary Cell Wall Formation in Populus tomentosa. Plant Cell Physiol. 2015, 56, 2436–2446. [Google Scholar] [PubMed] [Green Version]
  47. Xu, C.; Fu, X.; Liu, R.; Guo, L.; Ran, L.; Li, C.; Tian, Q.; Jiao, B.; Wang, B.; Luo, K. PtoMYB170 positively regulates lignin deposition during wood formation in poplar and confers drought tolerance in transgenic Arabidopsis. Tree Physiol. 2017, 37, 1713–1726. [Google Scholar] [PubMed] [Green Version]
  48. Tian, Q.; Wang, X.; Li, C.; Lu, W.; Yang, L.; Jiang, Y.; Luo, K. Functional characterization of the poplar R2R3-MYB transcription factor PtoMYB216 involved in the regulation of lignin biosynthesis during wood formation. PLoS ONE 2013, 8, e76369. [Google Scholar]
  49. Wang, L.; Lu, W.; Ran, L.; Dou, L.; Yao, S.; Hu, J.; Fan, D.; Li, C.; Luo, K. R2R3-MYB transcription factor MYB6 promotes anthocyanin and proanthocyanidin biosynthesis but inhibits secondary cell wall formation in Populus tomentosa. Plant J. 2019, 99, 733–751. [Google Scholar]
  50. Yang, L.; Zhao, X.; Ran, L.; Li, C.; Fan, D.; Luo, K. PtoMYB156 is involved in negative regulation of phenylpropanoid metabolism and secondary cell wall biosynthesis during wood formation in poplar. Sci. Rep. 2017, 7, 41209. [Google Scholar] [CrossRef] [Green Version]
  51. Jiao, B.; Zhao, X.; Lu, W.; Guo, L.; Luo, K. The R2R3 MYB transcription factor MYB189 negatively regulates secondary cell wall biosynthesis in Populus. Tree Physiol. 2019, 39, 1187–1200. [Google Scholar] [PubMed]
  52. Tang, X.; Zhuang, Y.; Qi, G.; Wang, D.; Liu, H.; Wang, K.; Chai, G.; Zhou, G. Poplar PdMYB221 is involved in the direct and indirect regulation of secondary wall biosynthesis during wood formation. Sci. Rep. 2015, 5, 12240. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  53. Liu, G.S.; Li, H.L.; Grierson, D.; Fu, D.Q. NAC Transcription Factor Family Regulation of Fruit Ripening and Quality: A Review. Cells 2022, 11, 525. [Google Scholar] [PubMed]
  54. Ding, Y.; Yu, S.; Wang, J.; Li, M.; Qu, C.; Li, J.; Liu, L. Comparative transcriptomic analysis of seed coats with high and low lignin contents reveals lignin and flavonoid biosynthesis in Brassica napus. BMC Plant Biol. 2021, 21, 246. [Google Scholar]
  55. Tonfack, L.B.; Hussey, S.G.; Veale, A.; Myburg, A.A.; Mizrachi, E. Analysis of orthologous Secondary wall-associated NAC Domain1(SND1) promotor activity in herbaceous and woody angiosperms. Int. J. Mol. Sci. 2019, 20, 4623. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  56. Blaschek, L.; Murozuka, E.; Serk, H.; Menard, D.; Pesquet, E. Different combinations of laccase paralogs nonredundantly control the amount and composition of lignin in specific cell types and cell wall layers in Arabidopsis. Plant Cell 2023, 35, 889–909. [Google Scholar] [PubMed]
  57. Voelker, S.L.; Lachenbruch, B.; Meinzer, F.C.; Jourdes, M.; Ki, C.; Patten, A.M.; Davin, L.B.; Lewis, N.G.; Tuskan, G.A.; Gunter, L.; et al. Antisense down-regulation of 4CL expression alters lignification, tree growth, and saccharification potential of field-grown poplar. Plant Physiol. 2010, 154, 874–886. [Google Scholar]
  58. Sun, Q.; Liu, X.; Yang, J.; Liu, W.; Du, Q.; Wang, H.; Fu, C.; Li, W.-X. MicroRNA528 Affects Lodging Resistance of Maize by Regulating Lignin Biosynthesis under Nitrogen-Luxury Conditions. Mol. Plant 2018, 11, 806–814. [Google Scholar]
  59. Wang, C.-Y.; Zhang, S.; Yu, Y.; Luo, Y.-C.; Liu, Q.; Ju, C.; Zhang, Y.-C.; Qu, L.-H.; Lucas, W.J.; Wang, X.; et al. MiR397b regulates both lignin content and seed number in Arabidopsis via modulating a laccase involved in lignin biosynthesis. Plant Biotechnol. J. 2014, 12, 1132–1142. [Google Scholar]
  60. Lu, S.; Li, Q.; Wei, H.; Chang, M.-J.; Tunlaya-Anukit, S.; Kim, H.; Liu, J.; Song, J.; Sun, Y.-H.; Yuan, L.; et al. Ptr-miR397a is a negative regulator of laccase genes affecting lignin content in Populus trichocarpa. Proc. Natl. Acad. Sci. USA 2013, 110, 10848–10853. [Google Scholar]
  61. Wang, M.H.M. Trait analysis of three new clones in Aigeiros. For. Sci. 1979, 1, 9–14. [Google Scholar]
  62. Franklin, G.L. Preparation of Thin Sections of Synthetic Resins and Wood-Resin Composites, and a New Macerating Method for Wood. Nature 1945, 155, 51. [Google Scholar] [CrossRef]
  63. Chen, S.; Zhou, Y.; Chen, Y.; Gu, J. fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 2018, 34, i884–i890. [Google Scholar] [CrossRef] [PubMed]
  64. Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R.; Genome Project Data Processing, S. The Sequence Alignment/Map format and SAMtools. Bioinformatics 2009, 25, 2078–2079. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  65. Purcell, S.; Neale, B.; Todd-Brown, K.; Thomas, L.; Ferreira, M.A.R.; Bender, D.; Maller, J.; Sklar, P.; de Bakker, P.I.W.; Daly, M.J.; et al. PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses. Am. J. Hum. Genet. 2007, 81, 559–575. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  66. Zhang, C.; Dong, S.-S.; Xu, J.-Y.; He, W.-M.; Yang, T.-L. PopLDdecay: A fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics 2019, 35, 1786–1788. [Google Scholar] [CrossRef] [PubMed]
  67. Wang, J.; Zhang, Z. GAPIT Version 3: Boosting Power and Accuracy for Genomic Association and Prediction. Genom. Proteom. Bioinform. 2021, 19, 629–640. [Google Scholar] [CrossRef] [PubMed]
  68. Zhou, X.; Stephens, M. Genome-wide efficient mixed-model analysis for association studies. Nat. Genet. 2012, 44, 821–824. [Google Scholar]
  69. Yang, J.; Lee, S.H.; Goddard, M.E.; Visscher, P.M. GCTA: A Tool for Genome-wide Complex Trait Analysis. Am. J. Hum. Genet. 2011, 88, 76–82. [Google Scholar] [CrossRef] [Green Version]
  70. Conway, J.R.; Lex, A.; Gehlenborg, N. UpSetR: An R package for the visualization of intersecting sets and their properties. Bioinformatics 2017, 33, 2938–2940. [Google Scholar] [CrossRef] [Green Version]
  71. Emrani, H.; Masoudi, A.A.; Vaez Torshizi, R.; Ehsani, A. Genome-wide association study of shank length and diameter at different developmental stages in chicken F2 resource population. Anim. Genet. 2020, 51, 722–730. [Google Scholar] [CrossRef]
  72. Wu, T.; Hu, E.; Xu, S.; Chen, M.; Guo, P.; Dai, Z.; Feng, T.; Zhou, L.; Tang, W.; Zhan, L.; et al. Clusterprofiler 4.0: A universal enrichment tool for interpreting omics data. Innovation 2021, 2, 100141. [Google Scholar] [CrossRef]
  73. Ding, C.; Shen, T.; Ran, N.; Zhang, H.; Pan, H.; Su, X.; Xu, M. Integrated Degradome and Srna Sequencing Revealed miRNA-mRNA Regulatory Networks between the Phloem and Developing Xylem of Poplar. Int. J. Mol. Sci. 2022, 23, 4537. [Google Scholar] [CrossRef]
Figure 1. Scatterplot matrix for eight traits (fiber length (FL), fiber width (FW), double wall thickness (DWT), cell lumen diameter (CLD), air-dry density (AD), basic density (BD), diameter at breast height (DBH), and height (H)). Lower triangle: Scatter plots of units between pairs of eight poplar traits. Diagonal: density plot of eight traits. Upper triangle: correlation coefficient (Pearson’s r) between the eight traits within the full-sib poplar family. *, 0.01 < p < 0.05; **, 0.001 < p < 0.01; ***, p < 0.001.
Figure 1. Scatterplot matrix for eight traits (fiber length (FL), fiber width (FW), double wall thickness (DWT), cell lumen diameter (CLD), air-dry density (AD), basic density (BD), diameter at breast height (DBH), and height (H)). Lower triangle: Scatter plots of units between pairs of eight poplar traits. Diagonal: density plot of eight traits. Upper triangle: correlation coefficient (Pearson’s r) between the eight traits within the full-sib poplar family. *, 0.01 < p < 0.05; **, 0.001 < p < 0.01; ***, p < 0.001.
Ijms 24 12662 g001
Figure 2. (a) SNP (single-nucleotide polymorphism) density plot across all 19 poplar chromosomes. The horizontal axis at the top represents the physical location (Mbp) on the chromosome. The color bar indicates the number of SNPs within the size of the 0.5 Mbp window. (b) Plot of the LD (linkage disequilibrium) rate (r2) against the physical distance in Kbp. The red line indicates the pattern of LD decay for the entire genome, and the gray line represents the LD decay for each of 10 randomly selected chromosomes.
Figure 2. (a) SNP (single-nucleotide polymorphism) density plot across all 19 poplar chromosomes. The horizontal axis at the top represents the physical location (Mbp) on the chromosome. The color bar indicates the number of SNPs within the size of the 0.5 Mbp window. (b) Plot of the LD (linkage disequilibrium) rate (r2) against the physical distance in Kbp. The red line indicates the pattern of LD decay for the entire genome, and the gray line represents the LD decay for each of 10 randomly selected chromosomes.
Ijms 24 12662 g002
Figure 3. Upset plot of the top 500 SNPs (single-nucleotide polymorphisms) of four methods GWAS (genome-wide association studies) results for eight poplar traits. Subplot (a): fiber length (FL). Subplot (b): fiber width (FW). Subplot (c): double wall thickness (DWT). Subplot (d): cell lumen diameter (CLD). Subplot (e): air-dry density (AD). Subplot (f): basic density (BD). Subplot (g): diameter at breast height (DBH). Subplot (h): height (H). Each row in the lower panel represents a method, and the corresponding colored bars on the lower left represent the number of each method, whereas the main bar plot (top) is the number of SNPs in different sets.
Figure 3. Upset plot of the top 500 SNPs (single-nucleotide polymorphisms) of four methods GWAS (genome-wide association studies) results for eight poplar traits. Subplot (a): fiber length (FL). Subplot (b): fiber width (FW). Subplot (c): double wall thickness (DWT). Subplot (d): cell lumen diameter (CLD). Subplot (e): air-dry density (AD). Subplot (f): basic density (BD). Subplot (g): diameter at breast height (DBH). Subplot (h): height (H). Each row in the lower panel represents a method, and the corresponding colored bars on the lower left represent the number of each method, whereas the main bar plot (top) is the number of SNPs in different sets.
Ijms 24 12662 g003
Figure 4. Quantile–quantile plot (QQ plot) of fiber length (FL), fiber width (FW), double wall thickness (DWT), cell lumen diameter (CLD), air-dry density (AD), basic density (BD), diameter at breast height (DBH), and height (H) for four GWAS (genome-wide association studies) methods (GAPIT.GLM, GCTA, GEMMA, and PLINK).
Figure 4. Quantile–quantile plot (QQ plot) of fiber length (FL), fiber width (FW), double wall thickness (DWT), cell lumen diameter (CLD), air-dry density (AD), basic density (BD), diameter at breast height (DBH), and height (H) for four GWAS (genome-wide association studies) methods (GAPIT.GLM, GCTA, GEMMA, and PLINK).
Ijms 24 12662 g004
Figure 5. Manhattan plots of the GWAS (genome-wide association studies) results for eight poplar characters. SNPs (single-nucleotide polymorphisms) were removed when p > 0.005 in all traits from inside to outside. Fiber length (FL), fiber width (FW), double wall thickness (DWT), cell lumen diameter (CLD), air-dry density (AD), basic density (BD), diameter at breast height (DBH), and height (H). Notes: when p > 0.0005.
Figure 5. Manhattan plots of the GWAS (genome-wide association studies) results for eight poplar characters. SNPs (single-nucleotide polymorphisms) were removed when p > 0.005 in all traits from inside to outside. Fiber length (FL), fiber width (FW), double wall thickness (DWT), cell lumen diameter (CLD), air-dry density (AD), basic density (BD), diameter at breast height (DBH), and height (H). Notes: when p > 0.0005.
Ijms 24 12662 g005
Figure 6. Typical pathways of the NAC-containing gene. The “T” arrows indicate interacting miRNAs, and the shades of color represent different category values.
Figure 6. Typical pathways of the NAC-containing gene. The “T” arrows indicate interacting miRNAs, and the shades of color represent different category values.
Ijms 24 12662 g006
Table 1. Phenotypic information (fiber length (FL), fiber width (FW), double wall thickness (DWT), cell lumen diameter (CLD), air-dry density (AD), basic density (BD), diameter at breast height (DBH), and height (H)) of P. deltoides cv. I-69 and P. × euramericana cv. I-45 and their F1 Population.
Table 1. Phenotypic information (fiber length (FL), fiber width (FW), double wall thickness (DWT), cell lumen diameter (CLD), air-dry density (AD), basic density (BD), diameter at breast height (DBH), and height (H)) of P. deltoides cv. I-69 and P. × euramericana cv. I-45 and their F1 Population.
TraitsMaleFemaleF1 Population
Mean ± SDMinMaxCV (%)
FL (mm)0.941.141.09 ± 0.090.891.308.55
FW (μm)23.1723.3823.00 ± 0.8520.9824.753.69
DWT (μm)8.0711.7710.93 ± 0.958.7214.218.65
CLD (μm)15.0911.6112.07 ± 1.039.9014.848.51
AD (g/cm3)0.430.520.51 ± 0.030.440.596.59
BD (g/cm3)0.330.420.40 ± 0.20.350.476.23
DBH (cm)17.4029.3032.61 ± 4.8120.8043.2014.74
H (m)18.9023.9024.59 ± 2.2817.1028.609.89
Table 2. Genomic inflation factor (λ statistic) of the GWAS (genome-wide association studies) results of eight traits (fiber length (FL), fiber width (FW), double wall thickness (DWT), cell lumen diameter (CLD), air-dry density (AD), basic density (BD), diameter at breast height (DBH), and height (H)) using four tools.
Table 2. Genomic inflation factor (λ statistic) of the GWAS (genome-wide association studies) results of eight traits (fiber length (FL), fiber width (FW), double wall thickness (DWT), cell lumen diameter (CLD), air-dry density (AD), basic density (BD), diameter at breast height (DBH), and height (H)) using four tools.
GAPIT.GLMGCTAGEMMAPLINK
FL1.1161.1441.1531.131
FW1.0601.0931.0621.080
DWT1.4841.5071.0241.504
CLD1.7371.8551.0001.866
AD1.1771.2011.0091.187
BD1.1811.4221.0471.407
DBH1.4981.3791.0281.376
H1.1011.0680.9811.071
Average λ1.3011.3401.0401.334
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, Y.; Zhang, H.; Zhu, S.; Shen, T.; Pan, H.; Xu, M. Association Mapping and Expression Analysis of the Genes Involved in the Wood Formation of Poplar. Int. J. Mol. Sci. 2023, 24, 12662. https://doi.org/10.3390/ijms241612662

AMA Style

Wang Y, Zhang H, Zhu S, Shen T, Pan H, Xu M. Association Mapping and Expression Analysis of the Genes Involved in the Wood Formation of Poplar. International Journal of Molecular Sciences. 2023; 24(16):12662. https://doi.org/10.3390/ijms241612662

Chicago/Turabian Style

Wang, Yaolin, Heng Zhang, Sheng Zhu, Tengfei Shen, Huixin Pan, and Meng Xu. 2023. "Association Mapping and Expression Analysis of the Genes Involved in the Wood Formation of Poplar" International Journal of Molecular Sciences 24, no. 16: 12662. https://doi.org/10.3390/ijms241612662

APA Style

Wang, Y., Zhang, H., Zhu, S., Shen, T., Pan, H., & Xu, M. (2023). Association Mapping and Expression Analysis of the Genes Involved in the Wood Formation of Poplar. International Journal of Molecular Sciences, 24(16), 12662. https://doi.org/10.3390/ijms241612662

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop