Next Article in Journal
Olfactory Gene Families in Scopula subpunctaria and Candidates for Type-II Sex Pheromone Detection
Next Article in Special Issue
Three Starch Synthase IIa (SSIIa) Alleles Reveal the Effect of SSIIa on the Thermal and Rheological Properties, Viscoelasticity, and Eating Quality of Glutinous Rice
Previous Article in Journal
Antithrombin Activity and Association with Risk of Thrombosis and Mortality in Patients with Cancer
Previous Article in Special Issue
Screening of Induced Mutants Led to the Identification of Starch Biosynthetic Genes Associated with Improved Resistant Starch in Wheat
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Understanding the Potential Gene Regulatory Network of Starch Biosynthesis in Tartary Buckwheat by RNA-Seq

Research Center of Buckwheat Industry Technology, Guizhou Normal University, Guiyang 550001, China
*
Authors to whom correspondence should be addressed.
Int. J. Mol. Sci. 2022, 23(24), 15774; https://doi.org/10.3390/ijms232415774
Submission received: 30 September 2022 / Revised: 12 November 2022 / Accepted: 1 December 2022 / Published: 12 December 2022

Abstract

:
Starch is a major component of crop grains, and its content affects food quality and taste. Tartary buckwheat is a traditional pseudo-cereal used in food as well as medicine. Starch content, granule morphology, and physicochemical properties have been extensively studied in Tartary buckwheat. However, the complex regulatory network related to its starch biosynthesis needs to be elucidated. Here, we performed RNA-seq analyses using seven Tartary buckwheat varieties differing in starch content and combined the RNA-seq data with starch content by weighted correlation network analysis (WGCNA). As a result, 10,873 differentially expressed genes (DEGs) were identified and were functionally clustered to six hierarchical clusters. Fifteen starch biosynthesis genes had higher expression level in seeds. Four trait-specific modules and 3131 hub genes were identified by WGCNA, with the lightcyan and brown modules positively correlated with starch-related traits. Furthermore, two potential gene regulatory networks were proposed, including the co-expression of FtNAC70, FtPUL, and FtGBSS1-3 in the lightcyan module and FtbHLH5, C3H, FtBE2, FtISA3, FtSS3-5, and FtSS1 in the brown. All the above genes were preferentially expressed in seeds, further suggesting their role in seed starch biosynthesis. These results provide crucial guidance for further research on starch biosynthesis and its regulatory network in Tartary buckwheat.

1. Introduction

Fagopyrum tataricum, also called Tartary buckwheat, belongs to the Fagopyrum genus of Polygonaceae Mill and is a type of traditional pseudo-cereal for both medicine and food. Tartary buckwheat is famous for its high nutritional value, i.e., balanced amino acids, vitamins, minerals, and flavonoids (especially rutin and quercetin) [1,2,3]. These compounds have numerous effects and can prevent several chronic human diseases, such as hypertension, obesity, cardiovascular diseases, and gallstone formation [3,4].
Starch, comprised of amylose and amylopectin, is the major component of Tartary buckwheat seeds, accounting for the highest proportion of their dry weight [5]. Starch content, especially amylose content and the ratio of amylose to amylopectin content (amylose/amylopectin), affects food quality and taste to a great degree [6,7,8]. The amylose content of Tartary buckwheat seeds ranges from 10–28%, which depends on the germplasm difference and also differences in measurement method [5,9,10]. For granule morphology, Tartary buckwheat starch has a size ranging from 2–14 µm and three kinds of shapes, namely polygonal, oval, and spherical with smooth surface [11,12]. In terms of physicochemical properties, the amylose of Tartary buckwheat is similar to that of common buckwheat (F. esculentum), wheat (Triticum aestivum) and barley (Hordeum vulgare), with 19.3–20.2% iodine affinity, 1.36–1.48 blue values, and 645–657 nm maximum absorption wavelength [13]. The starch solubility in Tartary buckwheat ranges from 9.9–10.4% at 90 °C, which is much lower than that for maize and potato [12]. Meanwhile, peak viscosity and breakdown of Tartary buckwheat are higher, and its pasting time is shorter than that of common buckwheat. It is suggested that Tartary buckwheat starch could be exploited as a suitable raw material for retrograded starch in food processing [12].
In endosperm, starch biosynthesis is a complex process controlled by the synergistic action of five starch synthetases, namely ADP-glucose pyrophosphorylase (AGPase), granule-bound starch synthase (GBSS), soluble starch synthase (SS), starch branching enzyme (BE), and starch debranching enzyme (DBE) [14,15]. AGPase is a hetero-tetramer formed by two large subunits (AGPL) and two small subunits (AGPS) and is responsible for the synthesis of ADP glucose (the major substrate for starch synthesis), which is the first committed and rate-limiting step in the pathway of starch synthesis [16]. GBSS is required for amylose biosynthesis in the form of oligomers after being phosphorylated on the surface of starch granules. On the other hand, the coordination of SS, BE, and DBE is required for amylopectin biosynthesis [14]. In rice and maize, there are five types of SS existing in the endosperm, among which SSI, SSII, and SSIII are responsible for a-glucan chain elongation during amylopectin synthesis, SSIV is responsible for starch granule initiation with the interaction of other proteins, and SSV is involved in manipulating starch granule initiation [17,18]. BE is responsible for producing branches for amylopectin that are connected by a-1,6-glycoside bonds [15]. DBE, comprised of isoamylase (ISA) and pullulanase (PUL), is responsible for correcting incorrect branches in starch synthesis to ensure the orderly synthesis of amylopectin [15].
Genes encoding the starch biosynthesis enzymes have been massively characterized and functionally analyzed in cereals, such as rice (Oryza sativa), maize (Zea mays), wheat, and barley [14,15]. To date, 3, 4, 2, 9, 4, 3, and 1 gene(s) in rice, and 4, 4, 3,10, 4, 3, and 1 gene(s) in maize that encode AGPS, AGPL, GBSS, SS, BE, ISA, and PUL, respectively, have been identified and their functions studied in depth [15,19,20]. In recent years, focus has gradually been transferred to the regulation mechanism and the regulatory network of starch biosynthesis that is controlled by the regulators, especially the transcription factors (TFs) [21]. Numerous TFs, including bZIP, AP2/ERF, bHLH, NAC, MYB, GRAS, WRKY, MADS, and DOF, are involved in the regulatory network of starch biosynthesis [21]. On one hand, some of the TFs mediate the expression of starch biosynthesis genes through directly binding to the cis-acting elements in their promoters. These TF families include NAC, AP2/ERF, bZIP, MYB, DOF, GRAS, and WRKY [22,23,24,25,26]. A popular example in maize is ZmNAC128 and ZmNAC130, which can bind to the prompters of Brittle 2 (BT2), Zpu1 (encoding DBE), GBSSI, Sh2 (encoding AGPL), SSV, ISA2, and SSIIa, and thus positively affect starch synthesis in the endosperm of maize [22]. On the other hand, some TFs cannot directly bind to the promoters of the starch biosynthesis genes. They bind to the promoters of the TFs, which act in a direct manner, thus forming a regulatory complex to mediate the expression of starch biosynthesis genes in an indirect manner. These TF families include bHLH, NAC, AP2/ERF, bZIP, MADS, and DOF [21,27,28,29]. For example, a novel FLOURY ENDOSPERM2 (FLO2) could interact with the bHLH proteins in a transcriptional complex that can regulate storage starch by influencing the development of endosperm in rice [27].
In Tartary buckwheat, starch-related research has focused on the starch content, granule morphology, and physicochemical properties [9,11,12,13]. The complex regulatory network of starch biosynthesis in Tartary buckwheat seeds needs to be elucidated. Up to now, only one starch-related gene involved in amylose biosynthesis, namely FtGBSSI (Genbank accession: KC990539.1), has been isolated [30]. Meanwhile, a transcriptome analysis revealed that 23 differentially expressed genes (DEGs) correspond to starch biosynthesis [31]. These works are not sufficient for us to understand the gene regulatory network of starch biosynthesis. Here, we have selected seven varieties that differ in starch content to perform RNA-seq analyses and identify the starch biosynthesis genes at a global level. Meanwhile, we combined the RNA-seq data with starch content using weighted gene correlation network analysis (WGCNA), and a series of hub genes and the regulatory network including both starch biosynthesis genes and the candidate TFs were proposed for Tartary buckwheat. This work will not only enrich the gene resources related to starch, but also provide a molecular insight into the starch biosynthesis regulatory mechanism in Tartary buckwheat.

2. Results

2.1. Starch Content among Tartary Buckwheat Seeds

Amylose and amylopectin contents were measured in the matured seeds of 20 Tartary buckwheat varieties (lines), which presented obvious characteristics in their seed size, morphology, shell color, and shell thickness. The results showed that the amylose content ranged from 17.61% to 21.00%, with an average of 19.24%. The amylopectin content ranged from 19.67% to 28.49%, with an average of 23.63%. The ratio of amylopectin/amylose ranged from 1.05 to 1.43, with an average of 1.23 (Supplementary Table S1). We subsequently picked seven varieties that differed in starch content and measured their amylose and amylopectin contents in late-filling-stage seeds, namely PK1, BK2, QK2, XMQ, DMQ, JQ2, and M11 (see full name in Materials and Methods). The amylose content ranged from 3.34% to 11.84% (Figure 1a). The amylopectin content ranged from 48.39% to 66.98% (Figure 1b). The ratio of amylopectin/amylose ranged from 5.66 to 14.51 (Figure 1c). Interestingly, the amylose content, amylopectin content, and amylopectin/amylose produced a gradient. Among these, PK1 had the lowest amylose content, the lowest amylopectin content, and the highest amylopectin/amylose. Therefore, PK1 was used as the control variety in the following analysis.

2.2. Global Analysis of RNA-Seq Data in Tartary Buckwheat Seeds

To discover the global changes in gene expression level, high-throughput RNA-seq was carried out for late-filling-stage seeds of seven selected varieties (PK1, BK2, QK2, XMQ, DMQ, JQ2, and M11), with three biological replicates for each variety. In each sample, 40,261,724–102,065,872 clean reads and 6.04–15.31 G clean bases were obtained. Q30 ranged from 88.86% to 89.77% and GC content ranged from 47.64% to 50.03% (Supplementary Table S2). The clean reads were then mapped to the genome of Tartary buckwheat. As a result, 35,065,134–88,503,506 (85.52%–87.99%) clean reads were uniquely mapped, among which 17,430,434–43,984,338 (42.48%–43.81%) were uniquely mapped to the “+” strand, whereas 17,634,700–44,519,168 (43.05%–44.18%) were uniquely mapped to the “−” strand of the coding sequences (Supplementary Table S2). After mapping, 24,804–26,719 genes were detected in each library, coming to a total of 29,978 genes (Supplementary Table S2). According to the Pearson correlation analysis, the repeatability of the biological replicates was acceptable, with correlations ranging from 0.97 to 1 (Supplementary Figure S1).
Using PK1 as a control, 10,873 DEGs were identified, among which 9573, 3490, 1655, 977, 909, and 906 were identified in the comparisons of M11_vs._PK1, JQ2_vs._PK1, DMQ_vs._PK1, BK2_vs._PK1, QK2_vs._PK1, and XMQ_vs._PK1, respectively (Figure 2a). Of these, 307 DEGs were common in all six comparisons. In addition, 119 DGEs were common in five comparisons except for BK2_vs._PK1; 39 DGEs were common in five comparisons except for QK2_vs._PK1; 34 DGEs were common in five comparisons except for XMQ_vs._PK1; 17 DGEs were common in five comparisons except for M11_vs._PK1; and 17 DGEs were common in five comparisons except for JQ2_vs._PK1 (Figure 2a). Meanwhile, the hierarchical cluster analysis classified all DEGs to six clusters, named as C1 to C6 (Figure 2b,c). Among these, C1 included 996 genes that were highly expressed in PK1, BK2, and QK2, but minimally expressed in JQ2; C2 included 4649 genes that were most highly expressed in PK1, BK2, and QK2 but least expressed in M11; C3 included 210 genes that were least expressed in PK1; C4 included 3476 genes that were highly expressed in M11 but minimally expressed in the other five varieties; C5 included 212 genes that were least expressed in PK1 and M11; C6 included 1327 genes that were highly expressed in JQ2 but minimally expressed in PK1.
The metabolic pathways that the DEGs were related to were analyzed based on KEGG annotation. Among the top 20 enriched pathways in the six comparisons, four metabolic pathways, namely “starch and sucrose metabolism”, “biosynthesis of secondary metabolites”, “circadian rhythm—plant”, and “metabolic pathways” were enriched in all six comparisons (Supplementary Figure S2). Interestingly, the value of the factor “starch and sucrose metabolism” increased in the enrichment of DEGs identified by comparing the varieties containing higher starch content to PK1, such as M11_vs._PK1 and JQ2_vs._PK1. This suggested that the “starch and sucrose metabolism” pathway was positively related to starch content.
Meanwhile, the identified DEGs in each comparison were annotated to a KOG category. As a result, “carbohydrate transport and metabolism”, “secondary metabolite biosynthesis”, “posttranslational modification, protein turnover, chaperones”, “transport and catabolism”, “signal transduction mechanisms”, and “general function prediction only” were listed in the top annotated categories in all comparisons (Supplementary Figure S3). The category “carbohydrate transport and metabolism” included the process of starch biosynthesis and metabolism, which was consistent with “starch and sucrose metabolism” enriched in the KEGG pathway.
We also performed GO analysis for the DEGs and the top 50 enriched items related to biological processes, cellular components, and molecular functions were identified (Supplementary Figure S4). All common items in six comparisons belonged to biological processes. Among these, “secondary metabolic process” was enriched in all six comparisons; “response to toxic substances”, “response to hydrogen peroxide”, “killing of cells of other organisms”, “disruption of cells of other organisms”, and “cell killing” were enriched in five comparisons except for M11_vs._PK1.

2.3. Expression of Genes Related to the Starch Biosynthesis Pathway

In plants, AGPases are hetero-tetramers formed by two AGPLs and two AGPSs that catalyze the first committed and rate-limiting step in the starch synthesis pathway [16]. Then GBSS is required for amylose biosynthesis, meanwhile, the coordination of SS, BE, and DBE is required for amylopectin biosynthesis [14]. We identified all genes related to starch biosynthesis pathway at the genome-wide scale previously [32], resulting in 6 ADPase encoding genes (FtAGPL1~FtAGPL4 and FtAGPS1~FtAGPS2), 5 GBSS encoding genes (FtGBSS1~FtGBSS5), 12 SS encoding genes (FtSS1, FtSS2-1~FtSS2-3, FtSS3-1~FtSS3-5, and FtSS4-1~FtSS4-3), 4 BE encoding genes (FtBE1~FtBE4), and 4 DBE encoding genes (FtISA1~FtISA3 and FtPUL) (Supplementary Table S3). Their expression patterns were analyzed based on the transcriptome data. As a result, all 31 genes above were characterized and expressed in seven Tartary buckwheat seeds (Figure 3). Among these, 15 genes, including 3 ADPase encoding genes (FtAGPL2, FtAGPL4, and FtAGPS1), 3 GBSS encoding genes (FtGBSS1-2, FtGBSS1-3, and FtGBSS1-5), 5 SS encoding genes (FtSS1, FtSS2-1, FtSS3-2, FtSS3-5, and FtSS4-3), 2 BE encoding genes (FtBE1 and FtBE2), and 2 DBE encoding genes (FtISA3 and FtPUL) had higher expression levels, suggesting they had potential functions in the starch biosynthesis of Tartary buckwheat seeds. In addition, 20 out of the 31 genes in the starch biosynthesis pathway were identified as DEGs, including 12 in the hierarchical cluster of C2, 4 in C4, and 4 in C6 (Supplementary Table S3, Figure 3). It is worth mentioning that all four genes in C6 (FtAGPL2, FtGBSS1-3, FtGBSS1-5, and FtPUL) were more highly expressed in the varieties with higher starch content (Figure 2c).

2.4. Trait-Specific Modules and Hub Genes Identified by WGCNA Analysis

WGCNA was performed using 22,191 identified genes whose average count number was equal to or larger than 10 in at least one variety. As a result, nine modules were identified based on the association of gene expression levels with starch-related traits (Figure 4a). Among these, four modules (greenyellow, pink, lightcyan, and brown) were identified as trait-specific modules at the p value < 0.05 level. The greenyellow and pink modules were negatively correlated to the amylose content, amylopectin content, and amylose/amylopectin, but positively correlated to amylopectin/amylose. On the other hand, the lightcyan module was positively correlated to amylose content, amylopectin content, and amylose/amylopectin, but negatively correlated to amylopectin/amylose. The brown module was positively correlated to amylose content, amylopectin content, and amylose/amylopectin. Gene expression patterns of four trait-specific modules are shown in Figure 4b. The expression level of genes in the greenyellow module decreased with an increase in the amylose and amylopectin content in seven varieties. The expression level of genes in the pink module also decreased with an increase in the amylose and amylopectin content in six varieties, except for M11. The expression level of the genes in the lightcyan module were lower in PK1 and BK2 but higher in JQ2. The expression level of the genes in the brown module were extremely high in M11 but low in PK1 and BK2.
The hub genes were then identified using the threshold of the top 30% in intramodule connectivity, absolute KME (module eigengene-based connectivity) value ≥0.8, and absolute GS (gene trait significance) value ≥0.2, resulting in a total of 3131 hub genes in four trait-specific modules, including 2409, 217, 415, and 90 hub genes in greenyellow, pink, brown, and lightcyan, respectively (Supplementary Table S4).
KEGG pathway analysis was performed for the hub genes in four trait-specific modules and the significantly enriched pathways were screened at a corrected p-value < 0.05 level, resulting in 43, 15, 2, and 5 pathways, which were enriched in the greenyellow, pink, lightcyan, and brown modules, respectively (Supplementary Table S5). In the greenyellow module, most of the significantly enriched pathways were related to metabolism, in particular to fatty acid biosynthesis (00061), amino acid biosynthesis (valine, leucine, and isoleucine biosynthesis, 00290; lysine biosynthesis, 00300; arginine biosynthesis, 00220; phenylalanine, tyrosine, and tryptophan biosynthesis, 00400), N-glycan biosynthesis (00510 and 00513), energy metabolism, transport, and recycling (A09100; B09102; pyruvate metabolism, 00620; glycolysis/gluconeogenesis, 00010; citrate cycle (TCA cycle), 00020; carbohydrate metabolism, B09101; amino acid metabolism, B09105, 00260, and 00340). Others were related to genetic information processing (A09120, B09122, 03010, 03050, B09123, and 00970), brite hierarchies (03011, 04147, B09182, 03036, 03012, 01007, A09180, 02044, 02048, 03019, and 03051), and cellular processes (phagosome, 04145) (Supplementary Table S5). In the pink module, the enriched pathways were divided into two classes, one of which was related to genetic information processing, including genetic information processing (A09120), proteasome (3050), ribosome biogenesis in eukaryotes (3008), translation (B09122), spliceosome (3040), transcription (B09121), folding, sorting, and degradation (B09123), mRNA surveillance pathway (3015), and nucleocytoplasmic transport (3013); the other was related to brite hierarchies, including protein families: genetic information (B09182), ribosome biogenesis (3009), brite hierarchies (A09180), proteasome (3051), spliceosome (3041), and messenger RNA biogenesis (3019) (Supplementary Table S5).
In the lightcyan module, no pathway was enriched at a corrected p-value < 0.05 level. However, two metabolism-related pathways, namely starch and sucrose metabolism (00500) and carbohydrate metabolism (B09101) were listed on the top two enriched pathways (corrected p-value < 0.1, Supplementary Table S5 and Figure 4c). Two genes in the starch biosynthesis pathway, FtGBSS1-3 and FtPUL, were in this module (Supplementary Table S3). In the brown module, the significantly enriched pathways were carbohydrate metabolism (B09101), starch and sucrose metabolism (00500), metabolism (A09100), and galactose metabolism (00052) (Supplementary Table S5 and Figure 4c). Four genes in the starch biosynthesis pathway, FtBE2, FtSS3-5, FtISA3, and FtSS1, were in this module (Supplementary Table S3).

2.5. Identification of TFs in Four Trait-Specific Modules and Their Co-Expression Network Related to the Starch Biosynthesis Pathway

TFs in the hub genes of four trait-specific modules were screened; the resulting 60, 1, 4, and 36 TFs were enriched in the greenyellow, pink, lightcyan, and brown modules, respectively (Supplementary Table S4 and Figure 5a). In the greenyellow module, a total of 60 TFs were classified to 40 TF families, including 6, 6, 4, and 4 TFs in the bHLH, MYB, C2C2-GATA, and zf-HD families, respectively. In the pink module, only one bZIP TF (FtPinG0006546100.01) was screened. In the lightcyan module, four TFs were screened, namely two NAC (FtPinG0002339300.01, named as FtNAC70, and FtPinG0009656900.01, named as FtNAC77/15) [33,34,35], one AP2/ERF-ERF (FtPinG0002070100.01), and one RWP-RK (FtPinG0002032100.01). In the brown module, a total of 36 TFs were classified to 22 TF families, including 5, 3, and 3 TFs in AP2/ERF-ERF, bHLH, and the CCCH-type Zn-finger (C3H) protein family, respectively.
TFs are often in the upstream of a regulatory network, so we identified the TFs in the hub genes of the top 50 in intramodule connectivity, resulting in nine TFs in total, including four TFs in the lightcyan module and five TFs in the brown module (Supplementary Table S4). Interestingly, all four TFs in the lightcyan module, namely FtNAC70 and FtNAC15, one AP2/ERF-ERF (FtPinG0002070100.01), and one RWP-RK (FtPinG0002032100.01), were clustered to C6 based on their gene expression patterns; meanwhile, all five TFs in the brown module, namely one E2F-DP (FtPinG0001244300.01), two bHLH (FtPinG0000281400.01, named as FtbHLH5, and FtPinG0005387300.01, named as FtbHLH91) [36], one WRKY (FtPinG0005111700.01, named as FtWRKY39) [37], and one C3H-type Zn-finger protein (FtPinG0003310400.01), were clustered to C4 based on their gene expression patterns.
Nine TFs in the hub genes of the top 50 in intramodule connectivity were taken to perform a co-expression network with the starch biosynthesis genes in the hub genes of the top 30% in intramodule connectivity in the same module. The result showed that one TF, FtNAC70, was co-expressed with two starch biosynthesis genes, FtPUL and FtGBSS1-3, in the lightcyan module (Figure 5b). Meanwhile, two TFs, C3H and FtbHLH5, were co-expressed with four starch biosynthesis genes, FtBE2, FtISA3, FtSS3-5, and FtSS1, in the brown module (Figure 5c).
Pearson correlations of the genes in two co-expression networks were calculated. The result showed that three genes in the lightcyan module, FtNAC70, FtPUL, and FtGBSS1-3, were positively correlated to each other at a significance level of 0.01 (Pearson R ≥ 0.79, Figure 5d). Two TFs in the brown module, FtbHLH5 and C3H, were positively correlated to each other (Pearson R = 1) but negatively correlated to four starch biosynthesis genes, FtBE2, FtISA3, FtSS3-5, and FtSS1 (Pearson R ≤ −0.91), at the significance level of 0.01 (Figure 5e).

2.6. Tissue-Specific Expression Patterns of Nine Candidate Genes in the Starch Biosynthesis Pathway

Nine candidate genes identified by the co-expression network in the lightcyan and brown modules (Figure 5b,c) were applied to perform a tissue-specific expression analysis. The result showed that all of the nine genes were preferentially expressed in seeds, rather than the other four tissues (roots, stems, leaves, and flowers) (Figure 6). Among these, FtNAC70, C3H, and FtSS3-5 were highly expressed in seeds and minimally expressed in the other four tissues; FtbHLH5, FtGBSS1-3, and FtBE2 were highest expressed in roots, followed by stems, and minimally expressed in the other three tissues; FtSS1, FtISA2, and FtPUL were highly expressed in seeds and moderately expressed in the other four tissues.

3. Discussion

Seeds are the edible organs of plants, and starch accounts for the highest proportion of the dry weight of seeds in Tartary buckwheat [12]. A food’s quality and taste are greatly influenced by its starch content, especially amylose and the ratio of amylose to amylopectin [6,7,8]. Previous studies have reported that the amylose content of Tartary buckwheat seeds ranged from 10% to 28% [5,9,10]. The amylose content in the seeds of 20 Tartary buckwheat varieties (lines) ranged from 17.61% to 28.49%, which was consistent with previous studies. In addition, our results showed that the ratio of amylose to amylopectin content differed among the varieties (lines), which provides a basis for selecting varieties with different ratios of amylose to amylopectin for the use of Tartary buckwheat in food processing.
RNA-seq is a well-established and widely used method for identifying global gene regulation at a transcriptional level. We carried out RNA-seq analyses for seven varieties that differed in their starch content, resulting in 29,978 mapped genes and 10,873 DEGs, using PK1 as a control. Through metabolic pathway enrichment analysis, the “starch and sucrose metabolism” pathway was found to be significantly enriched and was positively related to starch content. Meanwhile, “carbohydrate transport and metabolism”, which included the process of starch biosynthesis and metabolism, was listed in the top annotated KOG categories. The KOG categories result was consistent with the KEGG enrichment result, suggesting the starch biosynthesis pathway differed in the seven varieties. In recent transcriptome analyses of developing Chinese chestnut (Castanea mollissima Blume) seed kernels, genes involved in starch biosynthesis, including AGPase, GBSS, SS, SBE, and ISA encoding genes, were also enriched in the developing seed [38,39], which were consistent with our result for Tartary buckwheat seed.
Based on our RNA-seq data, we identified 31 genes related to the starch biosynthesis pathway, including FtAGPL1~FtAGPL4, FtAGPS1~FtAGPS2, FtGBSS1~FtGBSS5, FtSS1, FtSS2-1~FtSS2-3, FtSS3-1~FtSS3-5, FtSS4-1~FtSS4-3, FtBE1~FtBE4, FtISA1~FtISA3, and FtPUL. Of these, 15 genes, namely FtAGPL2, FtAGPL4, FtAGPS1, FtGBSS1-2, FtGBSS1-3, FtGBSS1-5, FtSS1, FtSS2-1, FtSS3-2, FtSS3-5, FtSS4-3, FtBE1, FtBE2, FtISA3, and FtPUL, had a higher expression level, suggesting they might be responsible for starch biosynthesis in seeds rather than other tissues. To date, only one starch-related gene, namely FtGBSSI (Genbank accession: KC990539.1), has been isolated and involved in amylose biosynthesis [30]. This gene was referred to as FtGBSS1-2 and was highly expressed in the seeds in our study. Another reported starch-related gene in F. esculentum, namely FeGBSS (Genbank accession: HW041459.1), was highly homologous to FtGBSS1-1; however, FtGBSS1-1 was not preferentially expressed in the seeds in our study, suggesting it might participate in amylose biosynthesis in other tissues than the seeds of Tartary buckwheat. In addition, we previously performed a transcriptome analysis of three seed developmental stages in Tartary buckwheat and identified 23 DEGs corresponding to starch biosynthesis, among which six DEGs were expressed positively with seed development and showed higher expression in the later developmental seeds [31]. At this time, we selected seven varieties that differed in starch content for transcriptome analysis and identified 31 genes corresponding to starch biosynthesis, among which 10 genes showed higher expression in the varieties with high starch content. This work would largely enrich the starch-related gene resources for Tartary buckwheat.
With the combination of the four starch-related traits and the gene expression data achieved through WGCNA analysis, nine modules were identified, of which four modules (greenyellow, pink, lightcyan, and brown) were identified as trait-specific modules. Though the greenyellow and pink modules included more hub genes and more enriched pathways, the starch-related pathway was absent in them but significantly enriched in the lightcyan and brown modules, which were positively correlated to amylose content, amylopectin content, and amylose/amylopectin based on the module-trait relationship. Among the starch biosynthesis genes, FtGBSS1-3 and FtPUL were listed in the lightcyan module and FtBE2, FtSS3-5, FtISA3, and FtSS1 were listed in the brown module.
According to the results of TF analysis, 4 and 36 TFs were identified in the lightcyan and brown modules, respectively, among which nine TFs were in hub genes of the top 50 in intramodule connectivity. Some of the above TFs, such as NAC, AP2/ERF-ERF, bHLH, and WRKY, were homologous to the functionally characterized regulators related to starch metabolism in cereal crops. For example, NAC family genes are involved in starch biosynthesis in rice and maize. ZmNAC128 and ZmNAC130 could bind to the prompters of the starch biosynthesis genes, including Brittle 2 (BT2), Zpu1 (encoding DBE in maize), GBSSI, Sh2 (encoding AGPL), SSV, ISA2, and SSIIa, and thus positively affect starch synthesis in the endosperm of maize [22]. TaNAC019 was reported to regulate starch biosynthesis by the regulation of SSIIa and Susy1, and thus accelerate starch accumulation in wheat [23]. In our study, two NAC TFs, namely FtNAC70 and FtNAC15, were identified as top-50 hub genes. Both of their expression patterns have been reported to be increasing with the seed development of Tartary buckwheat [33,34]. It is worth mentioning that the tissue-specific expression analysis of FtNAC70 showed that it was a seed-specific NAC TF. Meanwhile, its expression was significantly positively correlated to FtGBSS1-3 (Pearson R = 0.84) and FtPUL (Pearson R = 0.79), suggesting that FtNAC70 should be a key positive regulator in regulating starch biosynthesis in Tartary buckwheat.
TFs in the bHLH family could regulate starch biosynthesis in an indirect manner. A novel FLOURY ENDOSPERM2 (FLO2) could interact with the bHLH proteins in a transcriptional complex that could regulate starch storage by influencing the development of endosperm in rice [27]. OPAQUE11, an endosperm-specific bHLH TF, is a central hub of the regulatory network for maize endosperm development and nutrient metabolism, a mutant of which showed decreased starch and protein with a small and opaque endosperm [28]. In our study, two bHLH TFs, namely FtbHLH5 and FtbHLH91, were identified as top-50 hub genes. FtbHLH5 was more highly expressed in seeds and stems, whereas FtbHLH91 was more highly expressed in leaves rather than other tissues [36]. In addition, both of the two bHLH TFs were increasingly expressed with seed development [36], suggesting that FtbHLH5 might be involved in regulating starch biosynthesis in seeds and stems, whereas FtbHLH91 might be involved in regulating starch biosynthesis in the stem. The expression of FtbHLH5 was significantly negatively correlated to FtBE2 (Pearson R = −0.96), FtSS3-5 (Pearson R = −0.93), FtSS1 (Pearson R = −0.93), and FtISA3 (Pearson R = −0.92), suggesting that FtbHLH5 should be a key negative regulator in regulating starch biosynthesis in Tartary buckwheat.
An AP2/ERF-ERF family gene, ZmEREB156, was sucrose- and ABA-inducible and positively regulated starch synthesis by directly binding to the promoter of ZmSSIIIa in maize [24]. Sugar Signaling in Barley2 (SUSIBA2), a plant-specific WRKY TF, could bind to the promoters of BEIIb; the heterologous expression of it in rice produced a rice variety with a high starch content [26]. A mutant of OS1, encoding an RWP-RK TF, downregulated certain genes in specific cell types, including a majority of zein- and starch-related genes in central starch endosperm cells [40]. In our study, an AP2/ERF-ERF TF (FtPinG0002070100.01), a WRKY TF (FtWRKY39), and an RWP-RK TF (FtPinG0002032100.01), were identified as top-50 hub genes, suggesting they might be involved in starch biosynthesis in Tartary buckwheat. The expression level of FtWRKY39 increased gradually with the development of seeds [37], suggesting it could regulate starch biosynthesis during seed development.
Though no evidence has been shown at present, an E2F-DP TF (FtPinG0001244300.01) and a C3H-type Zn-finger protein (FtPinG0003310400.01) were also identified as top-50 hub genes, suggesting they might be involved in starch biosynthesis in Tartary buckwheat. Notably, the C3H-type Zn-finger protein was in the co-expression network of FtbHLH5, significantly positively correlated to FtbHLH5 (Pearson R = 1.00) and significantly negatively correlated to FtBE2, FtSS3-5, FtSS1, and FtISA3. In addition, C3H was preferentially expressed in seeds rather than other tissues. These suggested that C3H might interact with FtbHLH5 and be another key negative regulator in regulating starch biosynthesis in Tartary buckwheat.

4. Materials and Methods

4.1. Plant Material

A total of 20 Tartary buckwheat varieties/lines were used in this study, namely Biku 2 (BK2), Damiqiao (DMQ), Jinkuqiao (JKQ), Jinqiao 2 (JQ2), Jiujiangkuqiao (JJ), Miku 11 (M11), Miku 127 (M127), Miku 13 (M13), Miku 18 (M18), Miku 5 (M5), Miku 55 (M55), Pinku 1 (PK1), Qianheiqiao 1 (QHQ1), Qianku 2 (QK2), Sanya A5 (SYA5), Sanya B60 (SYB60), Sanya B68 (SYB68), Sanya B69 (SYB69), Sanya C28 (SYC28), and Xiaomiqiao (XMQ). In the early August of 2020, their seeds were sowed in the Anshun experimental field of our laboratory and grew with normal field management. When they were nearly matured, seeds at late filling stage were taken with three replicates, immediately frozen in liquid nitrogen, shelled on ice, and stored in an ultra-low-temperature freezer for starch measurement and RNA-seq. When the plants were matured, the ripened seeds were harvested and dried for starch measurement. Then the ripened seeds of all 20 varieties/lines and the late-filling-stage seeds of seven varieties, namely PK1, BK2, QK2, XMQ, DMQ, JQ2, and M11 were used for starch measurement, with three biological replicates and two (or three) technical replicates. Meanwhile, the late-filling-stage seeds of the seven mentioned varieties were used for RNA-seq analysis, with three biological replicates.

4.2. Starch Content Measurement

Amylose and amylopectin content were measured in accordance with a previous report [41], with some modification. Briefly, the amylose and amylopectin standards were brought from Sigma-Aldrich (Merck KGaA, Darmstadt, Germany). To obtain a mixed standard curve, 11 mixed standards with a series of amylose and amylopectin concentrations were made, namely 0, 5, 10, 15, 20, 25, 30, 35, 40, 45, and 50 µg/mL amylose, and 100, 90, 80, 70, 60, 50, 40, 30, 20, 10, and 0 µg/mL amylopectin in every standard, respectively. The pH was adjusted to 3.0 using 2 mol/L HCl. Then, 0.5 mL iodine reagent was added and mixed. After standing at room temperature for 25 min, the mixtures were applied to a spectrophotometer for amylose measurement at 597 nm and 480 nm and amylopectin measurement at 541 nm and 700 nm. The standard curves of amylose and amylopectin were established using OD(597 nm–480 nm) and OD(541 nm–700 nm), respectively. The equation of the standard curve for amylose was
Y = 1.28 × 10−2 X − 2.17 × 10−2
and the equation of the standard curve for amylopectin was
Y = 2.8 × 10−3 X + 0.219 × 10−1
For each variety/line of Tartary buckwheat, 0.1000 g ground powder of the seeds was accurately weighed and placed in a 50 mL centrifuge tube. Then, 10 mL absolute ethanol was added and mixed. The amylose and amylopectin were extracted in a water bath at 80 °C for 30 min. After cooling, the samples were centrifuged at 12,000 rpm at 25 °C for 5 min. The supernatant was discarded, and 10 mL KOH (1 mol/L) was added to the retained precipitate. The sample was placed into a water bath at 100 °C for 15 min, followed by being transferred to a freezer at 4 °C. Afterward, a 1 mL sample was transferred into a new 50 mL centrifuge tube and 10 mL ddH2O was added. Then, 200 µL solution was placed into a new 2 mL centrifuge tube and 1 mL ddH2O was added. The pH was adjusted to 3.0 using 2 mol/L HCl. Then, 20 µL iodine reagent was added and mixed. The volume was titrated to 2 mL with ddH2O. After standing at room temperature for 25 min, the mixtures were applied to a spectrophotometer for amylose measurement at 597 nm and 480 nm and amylopectin measurement at 541 nm and 700 nm. Then, the amylose and amylopectin content were calculated using Equations (1) and (2), respectively.

4.3. RNA-Seq Analysis

RNA was extracted using the RNAprep Pure Plant Plus Kit produced by TIANGEN (DP441, Beijing, China). Library construction was carried out by Metware Biotechnology Co., Ltd. (Wuhan, China). After that, the library was sequenced on an Illumina NovaSeq 6000 platform (California, USA) and 150 bp paired-end reads were generated. Data filtering and quality control were also completed by this company using fastp, resulting in more than 6 G clean data with Q20 > 95% and Q30 > 85% for each sample [42,43]. Afterward, the clean reads were mapped to the genome data for Tartary buckwheat http://www.mbkbase.org/Pinku1/ (accessed on 8 September 2022) and the chromosomal location of the mapped genes were obtained by HISAT2 with default parameters [44]. The mapped genes were functionally annotated to the databases of NR ftp://ftp.ncbi.nih.gov/blast/db (accessed on 8 September 2022), KEGG https://www.genome.jp/kegg (accessed on 8 September 2022), KOG https://www.ncbi.nlm.nih.gov/COG/ (accessed on 8 September 2022), and GO http://geneontology.org/ (accessed on 8 September 2022) using the BLAST program (e-value was set as 1 × 10−5). Gene expression was quantified by FPKM (fragments per kilobase of transcript per million fragments mapped) as the following equation:
FPKM = mapped   fragments   of   transcript total   count   of   mapped   fragments   millions ×   length   of   fragments
Pearson correlation was analyzed and visualized using “corrplot” in R language https://cran.r-project.org/web/packages/corrplot/index.html (accessed on 8 September 2022).
DEGs were identified by “DESeq2” in R language with default parameters, using absolute value of Log2(FoldChange) ≥1 and padj (adjusted p-value) < 0.05 [45]. The upset plot of the DEGs was visualized using “UpSetR” in R language [46]. The hierarchical cluster of the DEGs was performed by “ggplot2”, “pheatmap”, “reshape2”, and “factoextra” in R language https://cran.r-project.org/web/packages/available_packages_by_name.html#available-packages-C (accessed on 8 September 2022). For the hierarchical cluster of all identified DEGs, the scale was set as “row”. For the hierarchical cluster of partial analyzed DEGs, the scale was set as “none”. KEGG pathway and GO item enrichment were performed by TBtools [47]. The top 20 KEGG pathways, top 50 enriched GO items, and KOG categories were visualized by “ggplot2” in R language https://cran.r-project.org/web/packages/ggplot2/index.html (accessed on 8 September 2022).
For tissue-specific analysis, gene expression data of four different tissues (root, stem, leaf, and flower) were obtained from the published genome data for Tartary buckwheat [48], whereas gene expression data for seeds was calculated by mean FPKM value in the seeds of the seven varieties in this study.

4.4. WGCNA

WGCNA was performed using “WGCNA” in R language [49]. After filtering the genes whose average count number was less than 10 in every variety, the top 50% mad genes among the remaining genes (11,073 genes) were used for WGCNA analysis. In this study, the parameters were set as softpower = 12, minModuleSize = 30, MEDissThres = 0.2, and deepSplit = 2. The module–trait relationship was made using the combination of gene expression data with four starch-related traits, namely amylose content, amylopectin content, amylose/amylopectin, and amylopectin/amylose. Then, the hub genes were identified by the threshold of top 30% in intramodule connectivity, absolute KME value ≥0.8, and absolute GS value ≥0.2. The TFs in the DEGs were identified using the iTAK plant transcription factor and protein kinase identifier and classifier, http://itak.feilab.net/cgi-bin/itak/online_itak.cgi (accessed on 8 September 2022). The gene co-expression networks of the starch biosynthesis genes and TFs were visualized using Cytoscape [50].

5. Conclusions

In this work, we performed RNA-seq analyses using seven Tartary buckwheat varieties that differed in starch content and combined their RNA-seq data with starch content using WGCNA. As a result, 10, 873 DEGs were identified and functionally clustered to six hierarchical clusters based on their expression patterns. Subsequently, starch biosynthesis genes were identified, and 15 of them, including FtAGPL2, FtAGPL4, FtAGPS1, FtGBSS1-2, FtGBSS1-3, FtGBSS1-5, FtSS1, FtSS2-1, FtSS3-2, FtSS3-5, FtSS4-3, FtBE1 FtBE2, FtISA3 and FtPUL, had a higher expression level in seeds. In addition, four trait-specific modules (greenyellow, pink, lightcyan, and brown) and 3131 hub genes in these modules were identified by WGCNA, among which the lightcyan and brown modules were positively correlated to amylose content, amylopectin content, and amylose/amylopectin. Starch and sucrose metabolism and carbohydrate metabolism pathways were enriched in both the lightcyan and brown modules. Based on TF analysis in trait-specific modules, FtNAC70 was co-expressed with FtPUL and FtGBSS1-3 in the lightcyan module; FtbHLH5 and C3H were co-expressed with FtBE2, FtISA3, FtSS3-5, and FtSS1 in the brown module. All nine genes in the co-expression networks were preferentially expressed in seeds rather than other tissues, suggesting they might be responsible for starch biosynthesis in the seeds. These results would not only enrich the gene resources related to starch, but also provide a molecular insight into the starch biosynthesis regulatory mechanism in Tartary buckwheat. The results here appear to be an excellent starting point for further research on Tartary buckwheat. This work will be of benefit in the future for a complementary theoretical study using quantum chemical topology to identify the vdW interactions that play a role in the RNA-seq analysis. The estimation of non-covalent interaction energies present in the structures of RNA presented here could help to better guide further research on starch biosynthesis and its regulatory network in Tartary buckwheat.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms232415774/s1, Supplementary Figure S1. Sample cluster of the transcriptome. Each square indicates the Pearson’s correlation coefficient of a pair of samples. Supplementary Figure S2. Top 20 enriched KEGG pathways of the transcriptome. The rich factor is the ratio of the number of DEGs to that of all genes annotated to a pathway term. A higher rich factor indicates greater intensity. Supplementary Figure S3. Histogram of the KOG (eukaryotic ortholog groups of proteins) classification of the transcriptome. Supplementary Figure S4. Top 50 enriched GO items of the transcriptome. The results are summarized in three main categories: biological process, cellular component, and molecular function. Supplementary Table S1. Amylose and amylopectin contents in the matured seeds of 20 Tartary buckwheat varieties (lines). Supplementary Table S2. Summary statistics of RNA-seq results. Supplementary Table S3. List of genes related to the starch biosynthesis pathway. Supplementary Table S4. Hub genes identified in four trait-specific modules, namely greenyellow, pink, brown, and lightcyan. Supplementary Table S5. Significantly enriched pathways for the hub genes in four trait-specific modules at a corrected p-value < 0.05 level.

Author Contributions

Conceptualization, J.H. and Q.C.; methodology, J.D.; software, R.R. and J.H.; validation, B.T.; formal analysis, J.H.; investigation, B.T.; resources, M.W.; data curation, Y.L.; writing—original draft preparation, J.H. and B.T.; writing—review and editing, J.H. and T.S.; visualization, J.H.; supervision, F.L.; project administration, J.H.; funding acquisition, J.H., J.D. and Q.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the National Natural Science Foundation of China (32060508, 31860408), the Science and Technology Foundation of Guizhou Province (QianKeHeJiChu-ZK [2021] General 109 and QianKeHeJiChu [2020]1Y095), and the Earmarked Fund for China Agriculture Research System (CRAS-07-A5).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw sequence data are deposited in the Genome Sequence Archive in the National Genomics Data Center, Chinese Academy of Sciences, under accession number CRA008415, which is publicly accessible at https://ngdc.cncb.ac.cn/gsa (accessed on 8 September 2022).

Acknowledgments

The authors gratefully acknowledge the anonymous reviewers, editors and other editorial members who contribute to this manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Luthar, Z.; Zhou, M.; Golob, A.; Germ, M. Breeding buckwheat for increased levels and improved quality of protein. Plants 2021, 10, 14. [Google Scholar] [CrossRef] [PubMed]
  2. Zhu, F. Chemical composition and health effects of Tartary buckwheat. Food Chem. 2016, 203, 231–245. [Google Scholar] [CrossRef] [PubMed]
  3. Luthar, Z.; Golob, A.; Germ, M.; Vombergar, B.; Kreft, I. Tartary buckwheat in human nutrition. Plants 2021, 10, 700. [Google Scholar] [CrossRef] [PubMed]
  4. Lu, C.L.; Zheng, Q.; Shen, Q.; Song, C.; Zhang, Z.M. Uncovering the relationship and mechanisms of Tartary buckwheat (Fagopyrum tataricum) and Type II diabetes, hypertension, and hyperlipidemia using a network pharmacology approach. PeerJ 2017, 5, e4042. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Zhu, F. Buckwheat starch: Structures, properties, and applications. Trends Food Sci. Technol. 2016, 49, 121–135. [Google Scholar] [CrossRef]
  6. Zou, Y.; Yuan, C.; Cui, B.; Sha, H.; Liu, P.; Lu, L.; Wu, Z. High-amylose corn starch/konjac glucomannan composite film: Reinforced by incorporating β-cyclodextrin. J. Agric. Food Chem. 2021, 69, 2493–2500. [Google Scholar] [CrossRef]
  7. Tao, K.; Yu, W.; Prakash, S.; Gilbert, R.G. High-amylose rice: Starch molecular structural features controlling cooked rice texture and preference. Carbohydr. Polym. 2019, 219, 251–260. [Google Scholar] [CrossRef]
  8. Zhou, X.; Wang, S.; Zhou, Y. Study on the structure and digestibility of high amylose Tartary buckwheat (Fagopyrum tataricum Gaertn.) starch-flavonoid prepared by different methods. J. Food Sci. 2021, 86, 1463–1474. [Google Scholar] [CrossRef]
  9. Zhang, W.; Yang, Q.; Xia, M.; Bai, W.; Wang, P.; Gao, X.; Li, J.; Feng, B.; Gao, J. Effects of nitrogen level on the physicochemical properties of Tartary buckwheat (Fagopyrum tataricum (L.) Gaertn.) starch. Int. J. Biol. Macromol. 2019, 129, 799–808. [Google Scholar] [CrossRef]
  10. Yang, L.; Chen, Q.; Li, H. Analysis of grain characters and quality in new varieties of Tartary buckwheat. Guangdong Agric. Sci. 2018, 45, 7–13. [Google Scholar]
  11. Liu, H.; Lv, M.; Peng, Q.; Shan, F.; Wang, M. Physicochemical and textural properties of Tartary buckwheat starch after heat–moisture treatment at different moisture levels. Starch-Stärke 2015, 67, 276–284. [Google Scholar] [CrossRef]
  12. Gao, J.; Kreft, I.; Chao, G.; Wang, Y.; Liu, X.; Wang, L.; Wang, P.; Gao, X.; Feng, B. Tartary buckwheat (Fagopyrum tataricum Gaertn.) starch, a side product in functional food production, as a potential source of retrograded starch. Food Chem. 2016, 190, 552–558. [Google Scholar] [CrossRef] [PubMed]
  13. Yoshimoto, Y.; Egashira, T.; Hanashiro, I.; Ohinata, H.; Takase, Y.; Takeda, Y. Molecular structure and some physicochemical properties of buckwheat starches. Cereal Chem. 2004, 81, 515–520. [Google Scholar] [CrossRef]
  14. Jeon, J.-S.; Ryoo, N.; Hahn, T.-R.; Walia, H.; Nakamura, Y. Starch biosynthesis in cereal endosperm. Plant Physiol. Biochem. 2010, 48, 383–392. [Google Scholar] [CrossRef]
  15. Huang, L.; Tan, H.; Zhang, C.; Li, Q.; Liu, Q. Starch biosynthesis in cereal endosperms: An updated review over the last decade. Plant Commun. 2021, 2, 100237. [Google Scholar] [CrossRef] [PubMed]
  16. Meng, Q.; Zhang, W.; Hu, X.; Shi, X.; Chen, L.; Dai, X.; Qu, H.; Xia, Y.; Liu, W.; Gu, M.; et al. Two ADP-glucose pyrophosphorylase subunits, OsAGPL1 and OsAGPS1, modulate phosphorus homeostasis in rice. Plant J 2020, 104, 1269–1284. [Google Scholar] [CrossRef]
  17. Abt, M.R.; Zeeman, S.C. Evolutionary innovations in starch metabolism. Curr. Opin. Plant Biol. 2020, 55, 109–117. [Google Scholar] [CrossRef]
  18. Liu, H.; Yu, G.; Wei, B.; Wang, Y.; Zhang, J.; Hu, Y.; Liu, Y.; Yu, G.; Zhang, H.; Huang, Y. Identification and phylogenetic analysis of a novel starch yynthase in maize. Front. Plant Sci. 2015, 6, 1013. [Google Scholar] [CrossRef] [Green Version]
  19. Ohdan, T.; Francisco, P.B.J.; Sawada, T.; Hirose, T.; Terao, T.; Satoh, H.; Nakamura, Y. Expression profiling of genes involved in starch synthesis in sink and source organs of rice. J. Exp. Bot. 2005, 56, 3229–3244. [Google Scholar] [CrossRef] [Green Version]
  20. Yan, H.-B.; Pan, X.-X.; Jiang, H.-W.; Wu, G.-J. Comparison of the starch synthesis genes between maize and rice: Copies, chromosome location and expression divergence. Theor. Appl. Genet. 2009, 119, 815–825. [Google Scholar] [CrossRef]
  21. Li, R.; Tan, Y.; Zhang, H. Regulators of starch biosynthesis in cereal crops. Molecules 2021, 26, 7092. [Google Scholar] [CrossRef] [PubMed]
  22. Zhang, Z.; Dong, J.; Ji, C.; Wu, Y.; Messing, J. NAC-type transcription factors regulate accumulation of starch and protein in maize seeds. Proc. Natl. Acad. Sci. USA 2019, 116, 11223–11228. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Gao, Y.; An, K.; Guo, W.; Chen, Y.; Zhang, R.; Zhang, X.; Chang, S.; Rossi, V.; Jin, F.; Cao, X.; et al. The endosperm-specific transcription factor TaNAC019 regulates glutenin and starch accumulation and its elite allele improves wheat grain quality. Plant Cell 2021, 33, 603–622. [Google Scholar] [CrossRef]
  24. Huang, H.; Xie, S.; Xiao, Q.; Wei, B.; Zheng, L.; Wang, Y.; Cao, Y.; Zhang, X.; Long, T.; Li, Y.; et al. Sucrose and ABA regulate starch biosynthesis in maize through a novel transcription factor, ZmEREB156. Sci. Rep. 2016, 6, 27590. [Google Scholar] [CrossRef] [PubMed]
  25. Wang, J.C.; Xu, H.; Zhu, Y.; Liu, Q.Q.; Qiao, Q.; Cai, X.L. OsbZIP58, a basic leucine zipper transcription factor, regulates starch biosynthesis in rice endosperm. J. Exp. Bot. 2013, 64, 3453–3466. [Google Scholar] [CrossRef]
  26. Su, J.; Hu, C.; Yan, X.; Jin, Y.; Chen, Z.; Guan, Q.; Wang, Y.; Zhong, D.; Jansson, C.; Wang, F.; et al. Expression of barley SUSIBA2 transcription factor yields high-starch low-methane rice. Nature 2015, 523, 602–606. [Google Scholar] [CrossRef]
  27. She, K.C.; Kusano, H.; Koizumi, K.; Yamakawa, H.; Hakata, M.; Imamura, T.; Fukuda, M.; Naito, N.; Tsurumaki, Y.; Yaeshima, M.; et al. A novel factor FLOURY ENDOSPERM2 is involved in regulation of rice grain size and starch quality. Plant Cell 2010, 22, 3280–3294. [Google Scholar] [CrossRef] [Green Version]
  28. Feng, F.; Qi, W.; Lv, Y.; Yan, S.; Xu, L.; Yang, W.; Yuan, Y.; Chen, Y.; Zhao, H.; Song, R. OPAQUE11 is a central hub of the regulatory network for maize endosperm development and nutrient metabolism. Plant Cell 2018, 30, 375–396. [Google Scholar] [CrossRef] [Green Version]
  29. Zhang, Z.; Zheng, X.; Yang, J.; Messing, J.; Wu, Y. Maize endosperm-specific transcription factors O2 and PBF network the regulation of protein and starch synthesis. Proc. Natl. Acad. Sci. USA 2016, 113, 10842–10847. [Google Scholar] [CrossRef] [Green Version]
  30. Wang, X.; Feng, B.; Xu, Z.; Sestili, F.; Zhao, G.; Xiang, C.; Lafiandra, D.; Wang, T. Identification and characterization of granule bound starch synthase I (GBSSI) gene of Tartary buckwheat (Fagopyrum tataricum Gaertn.). Gene 2014, 534, 229–235. [Google Scholar] [CrossRef]
  31. Huang, J.; Deng, J.; Shi, T.; Chen, Q.; Liang, C.; Meng, Z.; Zhu, L.; Wang, Y.; Zhao, F.; Yu, S.; et al. Global transcriptome analysis and identification of genes involved in nutrients accumulation during seed development of rice Tartary buckwheat (Fagopyrum Tararicum). Sci. Rep. 2017, 7, 11792. [Google Scholar] [CrossRef] [PubMed]
  32. Shi, T.; Tang, B.; Ren, R.; Zhu, L.; Deng, J.; Liang, C.; Wang, Y.; Huang, J. Genome-wide identification and gene expression analyses of AGPase encoding genes FtAGPL and FtAGPS in Tartary buckwheat (Fagopyrum Tararicum). J. Guizhou Norm. Univ. 2021, 39, 52–57. [Google Scholar]
  33. Huang, J.; Ren, R.; Rong, Y.; Tang, B.; Deng, J.; Chen, Q.; Shi, T. Identification, expression, and functional study of seven NAC transcription factor genes involved in stress response in Tartary buckwheat (Fagopyrum tataricum (L.) Gaertn.). Agronomy 2022, 12, 849. [Google Scholar] [CrossRef]
  34. Huang, J.; Rong, Y.; Meng, Z.; Tang, B.; Zhang, J.; Xia, Z.; Chen, Q. Cloning and expression of FtNAC15 transcription factor in Fagopyrum tataricum. Acta Agric. Univ. Jiangxiensis 2019, 41, 1183–1191. [Google Scholar]
  35. Liu, M.; Ma, Z.; Sun, W.; Huang, L.; Wu, Q.; Tang, Z.; Bu, T.; Li, C.; Chen, H. Genome-wide analysis of the NAC transcription factor family in Tartary buckwheat (Fagopyrum tataricum). BMC Genom. 2019, 20, 113. [Google Scholar] [CrossRef] [Green Version]
  36. Sun, W.; Jin, X.; Ma, Z.; Chen, H.; Liu, M. Basic helix–loop–helix (bHLH) gene family in Tartary buckwheat (Fagopyrum tataricum): Genome-wide identification, phylogeny, evolutionary expansion and expression analyses. Int. J. Biol. Macromol. 2020, 155, 1478–1490. [Google Scholar] [CrossRef]
  37. Sun, W.; Ma, Z.; Chen, H.; Liu, M. Genome-wide investigation of WRKY transcription factors in Tartary buckwheat (Fagopyrum tataricum) and their potential roles in regulating growth and development. PeerJ 2020, 8, e8727. [Google Scholar] [CrossRef] [Green Version]
  38. Li, S.; Shi, Z.; Zhu, Q.; Tao, L.; Liang, W.; Zhao, Z. Transcriptome sequencing and differential expression analysis of seed starch accumulation in Chinese chestnut Metaxenia. BMC Genom. 2021, 22, 617. [Google Scholar] [CrossRef]
  39. Shi, L.; Wang, J.; Liu, Y.; Ma, C.; Guo, S.; Lin, S.; Wang, J. Transcriptome analysis of genes involved in starch biosynthesis in developing Chinese chestnut (Castanea mollissima Blume) seed kernels. Sci. Rep. 2021, 11, 3570. [Google Scholar] [CrossRef]
  40. Song, W.; Zhu, J.; Zhao, H.; Li, Y.; Liu, J.; Zhang, X.; Huang, L.; Lai, J. OS1 functions in the allocation of nutrients between the endosperm and embryo in maize seeds. J. Integr. Plant Biol. 2019, 61, 706–727. [Google Scholar] [CrossRef] [Green Version]
  41. Jiao, M.; Gao, H.; Wang, W.; Tian, Y. Comparison of four methods for the determination of amylose and amylopectin. Sci. Technol. Food Ind. 2019, 40, 259–264. [Google Scholar]
  42. Conesa, A.; Madrigal, P.; Tarazona, S.; Gomez-Cabrero, D.; Cervera, A.; McPherson, A.; Szcześniak, M.W.; Gaffney, D.J.; Elo, L.L.; Zhang, X.; et al. A survey of best practices for RNA-seq data analysis. Genome Biol. 2016, 17, 13. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  43. Chen, S.; Zhou, Y.; Chen, Y.; Gu, J. fastp: An ultra-fast all-in-one FASTQ preprocessor. Cold Spring Harb. Lab. 2018, 34, i884–i890. [Google Scholar] [CrossRef] [PubMed]
  44. Kim, D.; Paggi, J.M.; Park, C.; Bennett, C.; Salzberg, S.L. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 2019, 37, 907–915. [Google Scholar] [CrossRef] [PubMed]
  45. Love, M.I.; Huber, W.; Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014, 15, 550. [Google Scholar] [CrossRef] [Green Version]
  46. Conway, J.R.; Lex, A.; Gehlenborg, N. UpSetR: An R package for the visualization of intersecting sets and their properties. Bioinformatics 2017, 33, 2938–2940. [Google Scholar] [CrossRef] [Green Version]
  47. Chen, C.; Chen, H.; Zhang, Y.; Thomas, H.R.; Frank, M.H.; He, Y.; Xia, R. TBtools: An integrative toolkit developed for interactive analyses of big biological data. Mol. Plant 2020, 13, 1194–1202. [Google Scholar] [CrossRef]
  48. Zhang, L.; Li, X.; Ma, B.; Gao, Q.; Du, H.; Han, Y.; Li, Y.; Cao, Y.; Qi, M.; Zhu, Y.; et al. The Tartary buckwheat genome provides insights into rutinbiosynthesis and abiotic stress tolerance. Mol. Plant 2017, 10, 1224–1237. [Google Scholar] [CrossRef] [Green Version]
  49. Langfelder, P.; Horvath, S. WGCNA: An R package for weighted correlation network analysis. BMC Bioinform. 2008, 9, 559. [Google Scholar] [CrossRef] [Green Version]
  50. Shannon, P.; Markiel, A.; Ozier, O.; Baliga, N.S.; Wang, J.T.; Ramage, D.; Amin, N.; Schwikowski, B.; Ideker, T. Cytoscape: A software environment for integrated models of biomolecular interaction networks. J Genome Res. 2003, 13, 2498–2504. [Google Scholar] [CrossRef]
Figure 1. Starch content in late-filling-stage seeds of seven Tartary buckwheat varieties. (a) Amylose content; (b) amylopectin content; (c) amylopectin/amylose. The data represent the mean ± SD that came from three biological replicates and at least two technical replicates for each biological replicate.
Figure 1. Starch content in late-filling-stage seeds of seven Tartary buckwheat varieties. (a) Amylose content; (b) amylopectin content; (c) amylopectin/amylose. The data represent the mean ± SD that came from three biological replicates and at least two technical replicates for each biological replicate.
Ijms 23 15774 g001
Figure 2. Analyses of differentially expressed genes (DEGs) in seven Tartary buckwheat varieties. (a) Upset plot of the DEGs in six comparisons. Black bars on the lower left represent the number of DEGs in comparisons of seven selected varieties. The black dots on the lower right represent the common DEGs existing in six comparisons. Black bars above represent the number of common DEGs in six comparisons; (b) Functional category of DEGs by hierarchical cluster; (c) Gene expression patterns of the six clusters that correspond to the hierarchical cluster result. Six main clusters were presented as C1–C6. Gene expression values are normalized to log10 (FPKM).
Figure 2. Analyses of differentially expressed genes (DEGs) in seven Tartary buckwheat varieties. (a) Upset plot of the DEGs in six comparisons. Black bars on the lower left represent the number of DEGs in comparisons of seven selected varieties. The black dots on the lower right represent the common DEGs existing in six comparisons. Black bars above represent the number of common DEGs in six comparisons; (b) Functional category of DEGs by hierarchical cluster; (c) Gene expression patterns of the six clusters that correspond to the hierarchical cluster result. Six main clusters were presented as C1–C6. Gene expression values are normalized to log10 (FPKM).
Ijms 23 15774 g002
Figure 3. Expression patterns of genes related to the starch biosynthesis pathway. The light-brown rectangles and dark-brown ovals represent the substrates and the enzymes in the starch biosynthesis pathway, respectively. The heatmap shows the expression patterns of genes in the starch biosynthesis pathway. Genes marked with “*” were identified as DEGs. AGPase, ADP-glucose pyrophosphorylase; AGPL, large subunit of AGPase; AGPS, small subunit of AGPase; GBSS, granule-bound starch synthase; SS, soluble starch synthase; BE, starch branching enzyme; DBE, starch debranching enzyme; ISA, isoamylase; PUL, pullulanase.
Figure 3. Expression patterns of genes related to the starch biosynthesis pathway. The light-brown rectangles and dark-brown ovals represent the substrates and the enzymes in the starch biosynthesis pathway, respectively. The heatmap shows the expression patterns of genes in the starch biosynthesis pathway. Genes marked with “*” were identified as DEGs. AGPase, ADP-glucose pyrophosphorylase; AGPL, large subunit of AGPase; AGPS, small subunit of AGPase; GBSS, granule-bound starch synthase; SS, soluble starch synthase; BE, starch branching enzyme; DBE, starch debranching enzyme; ISA, isoamylase; PUL, pullulanase.
Ijms 23 15774 g003
Figure 4. Weighted gene correlation network analysis (WGCNA) for DEGs and the enriched pathways of trait-specific modules in Tartary buckwheat seeds. (a) Module–trait relationships by WGCNA analysis. The correlations and the corresponding p-values (in parentheses) are indicated in the heatmap. The panel on the left side shows nine identified modules. amylose/amylopectin is shorten for the ratio of amylose to amylopectin; amylopectin/amylose is short for the ratio of amylopectin to amylose; (b) Expression patterns of the trait-specific modules (p-value < 0.05) in correspondence to the module–trait relationship heatmap; (c) KEGG enrichment analysis of the lightcyan and brown modules. Rich factor is the ratio of the number of DEGs to that of all genes annotated to a pathway term. A higher rich factor indicates greater intensity.
Figure 4. Weighted gene correlation network analysis (WGCNA) for DEGs and the enriched pathways of trait-specific modules in Tartary buckwheat seeds. (a) Module–trait relationships by WGCNA analysis. The correlations and the corresponding p-values (in parentheses) are indicated in the heatmap. The panel on the left side shows nine identified modules. amylose/amylopectin is shorten for the ratio of amylose to amylopectin; amylopectin/amylose is short for the ratio of amylopectin to amylose; (b) Expression patterns of the trait-specific modules (p-value < 0.05) in correspondence to the module–trait relationship heatmap; (c) KEGG enrichment analysis of the lightcyan and brown modules. Rich factor is the ratio of the number of DEGs to that of all genes annotated to a pathway term. A higher rich factor indicates greater intensity.
Ijms 23 15774 g004
Figure 5. TFs identified in four trait-specific modules and their co-expression networks related to the starch biosynthesis pathway. (a) Distribution of transcription factor families in four trait-specific modules. C3H, CCCH-type Zn-finger protein; (b,c) Co-expression networks of TFs in the hub genes in the top 50 in intramodule connectivity (represented by dark-red ovals) and starch biosynthesis genes in the hub genes of the top 30% in intramodule connectivity (represented by clay-brown ovals) in the lightcyan (b) and brown (c) modules. Edge width indicates the weight of the relationship between two genes. (d,e) Pearson correlation between the genes in correspondence to (b,e). ** indicates that the correlation reached a significant level of 0.01. FtPUL, FtPinG0000055300.01; FtNAC70, FtPinG0002339300.01; FtGBSS1-3, FtPinG0000359400.01; FtBE2, FtPinG0000080700.01; FtISA3, FtPinG0009517500.01; C3H, FtPinG0003310400.01; FtbHLH5, FtPinG0000281400.01; FtSS3-5, FtPinG0003226800.01; FtSS1, FtPinG0005939600.01.
Figure 5. TFs identified in four trait-specific modules and their co-expression networks related to the starch biosynthesis pathway. (a) Distribution of transcription factor families in four trait-specific modules. C3H, CCCH-type Zn-finger protein; (b,c) Co-expression networks of TFs in the hub genes in the top 50 in intramodule connectivity (represented by dark-red ovals) and starch biosynthesis genes in the hub genes of the top 30% in intramodule connectivity (represented by clay-brown ovals) in the lightcyan (b) and brown (c) modules. Edge width indicates the weight of the relationship between two genes. (d,e) Pearson correlation between the genes in correspondence to (b,e). ** indicates that the correlation reached a significant level of 0.01. FtPUL, FtPinG0000055300.01; FtNAC70, FtPinG0002339300.01; FtGBSS1-3, FtPinG0000359400.01; FtBE2, FtPinG0000080700.01; FtISA3, FtPinG0009517500.01; C3H, FtPinG0003310400.01; FtbHLH5, FtPinG0000281400.01; FtSS3-5, FtPinG0003226800.01; FtSS1, FtPinG0005939600.01.
Ijms 23 15774 g005
Figure 6. Tissue-specific expression patterns of nine candidate genes in the regulatory network of the starch biosynthesis pathway.
Figure 6. Tissue-specific expression patterns of nine candidate genes in the regulatory network of the starch biosynthesis pathway.
Ijms 23 15774 g006
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Huang, J.; Tang, B.; Ren, R.; Wu, M.; Liu, F.; Lv, Y.; Shi, T.; Deng, J.; Chen, Q. Understanding the Potential Gene Regulatory Network of Starch Biosynthesis in Tartary Buckwheat by RNA-Seq. Int. J. Mol. Sci. 2022, 23, 15774. https://doi.org/10.3390/ijms232415774

AMA Style

Huang J, Tang B, Ren R, Wu M, Liu F, Lv Y, Shi T, Deng J, Chen Q. Understanding the Potential Gene Regulatory Network of Starch Biosynthesis in Tartary Buckwheat by RNA-Seq. International Journal of Molecular Sciences. 2022; 23(24):15774. https://doi.org/10.3390/ijms232415774

Chicago/Turabian Style

Huang, Juan, Bin Tang, Rongrong Ren, Min Wu, Fei Liu, Yong Lv, Taoxiong Shi, Jiao Deng, and Qingfu Chen. 2022. "Understanding the Potential Gene Regulatory Network of Starch Biosynthesis in Tartary Buckwheat by RNA-Seq" International Journal of Molecular Sciences 23, no. 24: 15774. https://doi.org/10.3390/ijms232415774

APA Style

Huang, J., Tang, B., Ren, R., Wu, M., Liu, F., Lv, Y., Shi, T., Deng, J., & Chen, Q. (2022). Understanding the Potential Gene Regulatory Network of Starch Biosynthesis in Tartary Buckwheat by RNA-Seq. International Journal of Molecular Sciences, 23(24), 15774. https://doi.org/10.3390/ijms232415774

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop