Next Article in Journal
Prevalence and Genetic Characterization of Morphologically Indistinguishable Sarcocysts of Sarcocystis cruzi in Cattle and Sarcocystis poephagicanis in Yaks
Previous Article in Journal
Genotypic and Phenotypic Diversity of Endemic Golden Camellias Collected from China
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Cell Wall-Related Gene Families of Wheat (Triticum aestivum)

Corn, Soybean, and Wheat Quality Research Unit, Agricultural Research Services, United States Department of Agriculture, 1680 Madison Ave., Wooster, OH 44691, USA
Diversity 2023, 15(11), 1135; https://doi.org/10.3390/d15111135
Submission received: 25 September 2023 / Revised: 3 November 2023 / Accepted: 6 November 2023 / Published: 7 November 2023
(This article belongs to the Section Plant Diversity)

Abstract

:
Wheat crops provide 20% of calories worldwide. Cell walls function in plant growth, are part of biotic and abiotic stress resistance, and provide plant mechanical strength and adaptability. These functions factor into the productivity of wheat. The genes that produce and maintain the plant cell wall are up to 10% of the genome in many varied families. Previously, curated cell wall gene families have been published for maize and rice, two other important crop grasses. Here, 81 cell wall-related wheat gene families curated via sequence similarity to maize and rice and unique family protein motif searches are presented. A total of 4086 wheat, 1118 maize, 1036 rice, and 955 Arabidopsis genes were aligned and placed into gene family trees to present homologs for all four species. Due to hexaploidy, many wheat cell wall gene families show expected triplication of genes per family over maize, rice, and Arabidopsis. However, several families contained more wheat genes than expected. The utility of this research is demonstrated with an example from a pre-harvest sprouting study to identify specific gene families rather than the less descriptive identification available with standard bioinformatic searches.

1. Introduction

Wheat accounts for twenty percent of worldwide calories [1]. Seven hundred and ninety-three million metric tons of wheat were produced worldwide in 2023. The top five producers were China, the European Union, India, Russia, and the United States [2]. In addition to human consumption, wheat grain that does not meet human use quality standards can be consumed by animals [3]. One factor leading to the loss of wheat grain quality is pre-harvest sprouting, which has become more prevalent with climate change [4]. Plant cell wall genes comprise about ten percent of a plant genome and are important for growth and development, pathogen resistance, abiotic stress, food quality, conversion of plant material for fuels, and ruminant animal digestion [5,6,7,8]. Plant cell walls may affect flour quality and pre-harvest sprouting in wheat [9,10]. Cell wall-related genes representing almost ten percent of differentially expressed genes between a treated and untreated pre-harvest sprouting-resistant wheat variety were reported for the first time. However, some of their descriptions from bioinformatic searches were too vague for assignment to gene families and thus function [10]. Cell wall genes have been curated for maize, rice, and Arabidopsis into families with unique protein motifs to identify them. However, a curated accounting for all cell wall-related wheat genes and family affiliation with insight into function is lacking [11,12,13,14,15].
Plant cell walls are built from a multitude of carbohydrate molecules assembled into complex structures with over 80 gene families encoding >1000 genes to produce their components and maintain their structure and function throughout the plant’s life. All plant cell walls have in common cellulose microfibrils forming the bulk of the cell wall structure linked together by other carbohydrate molecules. Primary cell walls vary based on species, with major architectural differences between commelinids such as grasses with Type II primary cell walls and dicots/noncommelinids with Type I primary cell walls. Type II primary cell walls contain a large portion of glucoarabinoxylans (GAXs) linking cellulose structures reinforced with phenylpropanoids. Type I primary cell walls contain a large portion of xyloglucans (XyG) and pectic polysaccharides such as homogalaturonan (HG) and rhamnogalacturonan (RG) I, linking the cellulose structure reinforced with Hyp- and Gly-rich proteins [16]. In addition to a primary cell wall, plants often have a secondary one. The secondary cell walls of plants are thickened and contain high amounts of lignin to provide rigidity in structures such as the stem and make up the bulk of the biomass used in bioenergy [17]. There are many groups of glycosyl transferases (GT) and glycosyl hydrolases (GH) and a few groups of polysaccharide lyases (PL) and carbohydrate esterases (CEs) with roles in the production and maintenance of plant cell walls. The families are described on the CAZY website [11,18].
There are many conserved functions within the cell wall of species such as maize, rice, and Arabidopsis, including the production of cellulose, hemicelluloses, and lignin, covered in detail in earlier publications but much less in wheat. Many important family groups carry out specific purposes or encode biochemical pathways. The nucleotide sugar transporters form a large six-group gene family in plants that transfer sugars to proteins, lipids, and polysaccharides in the Golgi [19]. The phenylpropanoid pathway (PP) encodes the biosynthetic pathway to produce monolignols, the building blocks for lignin, and includes ten gene families: 4-coumarate CoA ligase (4CL), p-coumaroyl shikimate 3′-hydroxylase (C3′H), cinnamate 4-hydroxylase (C4H), ferulate (coniferaldehyde) 5-hydroxylase (F5H), (hydroxy)cinnamyl alcohol dehydrogenase (CAD), caffeoyl CoA O-methyltransferase (CCoAOMT), caffeic acid (5-hydroxy-coniferaldehyde) O-methyltransferase (COMT), (hydroxy)cinnamoyl CoA reductase (CCR), hydroxycinnamoyl CoA:shikimate hydroxycinnamoyl transferase (HCT), and phenylalanine ammonia lyase (PAL) [20].
The GT families form a large set of cell wall-related genes. The sucrose synthases (GT4) and nucleotide–sugar interconversion pathway (GT1) form the basic sugar building blocks, later linked to many cell wall carbohydrates. The nucleotide–sugar interconversion pathway families include: membrane-anchored UDP-D-glucuronate decarboxylase (AUD) and soluble UDP-D-glucuronate decarboxylase (SUD), UDP-D-glucose 4-epimerase (AXS), UDP-D-glucuronate 4-epimerase (GAE), GDP-4-keto-6-deoxy-D-mannose 3,5-epimerase-4-reductase (GER), GDP-D-mannose-4,6-dehydratase (GMD), GDP-D-mannose 3,5-epimerase (GME), UDP-L-rhamnose synthase (RHM), UDP-4-keto-6-deoxy-D-glucose 3,5-epimerase-4-reductase (UER), UDP-D-glucose dehydrogenase (UGD), UDP-D-glucose 4-epimerase (UGE), and UDP-D-xylose 4-epimerase (UXE) genes [21,22]. The GT2 family has members that produce many carbohydrate backbones, including the cellulose synthase family (CesA), which produces cellulose everywhere, and the cellulose synthase-like (Csl) D family, which produces cellulose in root hairs and pollen tubes [23,24]. The CslA family synthesizes the (1,4)-β-D-mannan backbone of galactomannan. The β-1,4 glucan backbone of XyG is produced by the CslC family. The CslF family synthesizes a grass cell wall unique mixed linkage (1,3;1,4)-β-D-glucan [25,26,27]. Products of the Csl E, G, and H/J families are still unknown [28]. The GT48 family encodes callose synthases, which synthesize callose from UDP-glucose. Callose production is important for the development of mature cell walls as well as responses to biotic and abiotic stresses [29]. Many other GT families either have few members or few of their members have been characterized. The GT14 family is not well characterized, but one member is a glucuronosyl transferase involved in the cell wall elongation of seedlings in Arabidopsis. The GT61 family are arabinosyl, arabinofuranosyl, and xylosyl transferases involved in producing arabinoxylan and GAX from a xylan backbone [30,31,32]. The GT77 family is a group of arabinosyl transferases [33]. The GT8 family is a large group of genes subdivided into five groups; some are galacturonosyl transferases linked to pectin biosynthesis [34]. The gene At3g61130_GAUT1 from group D of GT8 is thought to synthesize HG [35], while the PARVUS gene, At1g19300_GATL1, is necessary for glucuronoxylan synthesis [36]. The GT31 family is a six-group family with one member having a galactosyl transferase activity and other members potentially involved in building the galactan backbone of arabinogalactan proteins (AGPs) [37,38]. Two genes from the GT34 family, At3g62720_XXT1 and At4g02500_XXT2, encode xylosyltransferases specific for XyG biosynthesis [39]. The GT37 family encodes fucosyltransferases in Arabidopsis, including At2g03220.1_FUT1 [40,41]. The GT47 family is a large set of genes in five groups whose members can have galactosyl—(At2g20370.1_MUR3), arabinosyl—(At2g35100.1_ARAD1), xylosyl—(At5g33290.1_XGD1), or glucuronosyl—(At5g22940.1 and fra8) transferase activity [42,43,44,45,46]. The genes At1g27440_IRX10 and At5g61840_IRX10-like from GT47 are needed for xylan backbone synthesis [47].
Glycosyl hydrolases in the cell wall provide several functions in plant growth, including defense and reproduction. The GH16 family, xyloglucan endotransglucosylase/hydrolase (XTH), and expansin families both function in the loosening of cell walls during growth [48,49]. The GH17 family encodes β-(1,3)-glucanases that appear involved in fugal defense and seed germination [50,51]. The GH18 family are classified as chitinases that degrade fungal cell walls but may also function in cell wall loosening [52,53,54]. The GH9 family has been characterized in bacteria where they function as endo-β-(1,4)-glucanases [11,55]. The GH28 family encodes polygalacturonases (PGases) that are highly expressed during seed pod dehiscence in Arabidopsis thaliana and Brassica napus [56]. The GH35 family encodes β-galactosidases. A mutation in the At5g63810.1_BGAL10 gene leads to shorter siliques and an altered XyG in Arabidopsis [57].
Some CE and PL cell wall-related genes have been observed to alter the composition or molecular bonding for the growth and development of cell walls in different tissues. The gene family CE13 (pectin acetylesterases, PAEs) regulates the degree of pectin acetylation by cleaving acetylester bonds from pectin [58,59]. The CE8 family (pectin methylesterases, PMEs) demethylesterifies pectin, which can increase cell wall strength but also act with PGases during fruit ripening to degrade the pectin matrix, softening the fruit [60]. A pectate lyase gene in the PL1 family was observed to be highly upregulated during induced differentiation of cultured Zinnia mesophyll cells. Gene expression was associated with vascular bundles, shoot primordia, and pectate lyase activity found in elongating and differentiating cells in vitro [61]. The PL4 family encodes RG lyase activity, cleaving the bonds between rhamnose and galacturonic residues of the plant cell wall pectin RG I. In tomatoes, a PL4 gene was observed to affect fruit firmness during ripening and pollen tube elongation [62].
There are gene families involved in modifying the cell wall functioning to strengthen cell walls or prepare them for different stages of growth and development through cell wall remodeling. While the biological role of proteases is not yet well understood, they are important to degrade other proteins as cells undergo development, such as seed germination or response to the environment through abiotic or biotic stress [63]. Laccases polymerize monolignols into larger subunits in the early stages of lignification. Peroxidases are involved in lignification with the presence of hydrogen peroxide [64]. The COBRA gene family alters cell wall mechanical strength. Mutants in At5g15630.1_COBL4 lead to normal-looking plants with reduced stem strength, while mutations in the closest rice homolog, Os03g30250.1_BC1, lead to reduced cell wall thickness and brittleness of plant organs [65,66]. The prolyl-4-hydroxylase (P4H) gene family appears to be involved in proper cell wall organization and architecture by establishing sites for O-glycolylation in root hairs [67]. The Arabidopsis skewed-growth proteins (SKU) are involved in directional root growth through cell wall expansion and are not gravity-dependent [68,69].
The hexaploid wheat genome consists of three complete seven-chromosome sub-genomes (A, B, and D) with a high sequence conservation between homologous genes that have arisen through several rounds of genome hybridization [70]. In addition to differences in ploidy level, genes can be duplicated in a genome and, over time, develop new functions through the mutation of one gene copy (neofunctionalization). In some cases, both gene copies mutate to each perform part of the function of the original gene (subfunctionalization). This has been illustrated in the Arabidopsis and rice monosaccharide transporter gene families [71]. In addition to partitioning multiple functions from one gene into two, neofunctionalized expression or subfunctionalized expression can occur through expression differences brought about by changes in the promoter regions of duplicated genes instead of the protein-coding region [72].
The objectives of this research were to provide a curated set of wheat cell wall-related genes complementing those previously published for maize and determine how the wheat homologs are related by sequence to the cell wall genes of other species. Arabidopsis, maize, and rice often have genes of defined function in cell wall-related families. However, the function is not as often defined in wheat. A potential function for wheat homologs or at least the correct family classification would improve the understanding of how those wheat genes might function when discovered through bioinformatics analyses. A recent bioinformatic study on pre-harvest sprouting in wheat was used to illustrate the utility of this research to improve gene classification and how they might function. While preparing the families of cell wall-related wheat genes, it was discovered that more wheat genes were present than expected due to hexaploidy, and many tandem duplications of genes occurred in these wheat gene families. They are discussed in greater detail.

2. Materials and Methods

Searching for cell wall-related wheat genes and building trees based on sequence similarity for each gene family was performed as previously described [13]. The following methods were used without any updates to the process. Previously published cell wall-related maize sequences were used to search version 2.1 of the reference wheat genome from the International Wheat Genome Sequencing Consortium [73]. The search was performed by downloading and formatting the wheat V2.1 high-confidence and low-confidence (LC) protein fasta files as databases with formatdb and performing protein–protein BLASTs using blastall-blastp from NCBI in a custom batch file under Windows PowerShell [74]. A Perl script version of the original C++ program was used to retrieve wheat gene names considered significant via a protein alignment score of 200 or more to the maize query genes and place wheat gene names in a text file. The NCBI fastacmd under Windows PowerShell was used to retrieve the full-length wheat protein sequences from the same wheat high-confidence and LC protein databases. At this point, all gene models were kept as indicated by “.#” where different protein sequence models are numbered sequentially after a period and the gene name. To recover the names of V2.1 wheat cell wall genes from V1.0, a similar strategy was used. Version 1 gene sequences were recovered via fastacmd from its database and then used in blastall -blastp from NCBI to find V2.1 genes. The top hit was taken and was always on the same chromosome, with expected scores near zero and protein alignment scores above 1000.
Previously published rice (release 7), maize (version 2.59a), and Arabidopsis (TAIR10) sequences were added to the wheat sequences for each gene family. The websites to download each of the protein sequence files are wheat [75], maize [76], rice [77], and Arabidopsis [78]. Gene names were kept from the files, except LOC_, which was removed from the front of rice genes, and abbreviations of known genes or a group designation were added to the end of the gene behind an underscore (_). Trees were made via slow/accurate sequence alignment using the neighbor-joining method followed by bootstrapping 1000 times with bootstrap numbers positioned at the nodes in clustalW [79,80,81]. Individual trees were visualized in TreeDyn with a leaf length scaled to substitutions per site and exported to postscript files to be converted to PDFs using Adobe Acrobat [82]. In some cases, large gene families were broken apart by groups previously published [12,13,17] to make their trees easier to read. In cases of ambiguity as to whether to include a gene in the family, the proper protein motifs were verified using the EBI protein motif viewer [14,83] or PROSITE [15,84] as compared to the closest rice, maize, or Arabidopsis gene. To determine the best gene model or test for full length and correct family placement, multiple wheat genes were aligned using Clustal Omega [85] with the closest rice or maize gene by the initial tree. The protein identity of at least 60% was used as a cutoff along with the protein motif analysis, which in all accepted cases identified the family-specific motifs [86]. After confirming the proper inclusion in the gene family and the best protein model, the final trees were produced using clustalW and TreeDyn as above.
Tandem duplications of wheat genes were chosen from Supplemental File S1 based on their assigned number after the “G” in the name, which lists the genes in numerical order along the chromosome and their closeness on the tree. Physical locations were found by downloading the GFF3 file from the International Wheat Genome Sequencing Consortium for version 2.1 of the wheat sequence, high confidence or LC, as appropriate, and searching for the gene name in Microsoft WordPad [73]. The start and stop locations in the basepairs (bp) of the gene were used. The distance between the genes was calculated by subtracting the stop location of the previous gene from the start location of the next gene numerically on the list and then dividing the value by 1000 to convert to kilobases (kb). Distances were reported between gene 1 and gene 2, then gene 2 and gene 3, etc., in order until the end of the repeated genes along the chromosome location. Figures were produced by colorizing potential tandem duplicated genes in TreeDyn, exporting them to PDFs, loading the PDF tree files into Adobe Photoshop, adding any arrows to indicate potential tandem duplications, and exporting them as TIFF or JPEG images.

3. Results and Discussion

A total of 4086 wheat genes across 81 cell wall-related families were annotated and performed by searching Chinese spring wheat genome protein sequences with maize cell wall family genes followed by manual curation using sequence alignment and protein motif calling. Wheat has a hexaploid genome comprising three sub-genomes (A, B, and D). Thus, three homologous wheat genes (Traes) were expected for every maize (GRMZM or AC), rice (Os), and Arabidopsis (At) gene. The number of wheat genes was several hundred more than would be predicted based on the 1118, 1036, and 955 total genes from maize, rice, and Arabidopsis, respectively (Table 1). The specific versions of the maize, rice, and Arabidopsis genes were chosen because they had previously been curated as cell wall-related [12,13,17]. Many of the increased wheat genes were from families PP, GT2 (Csl), GT14, GT48 (callose synthase), GT61, GT77, expansin, laccase, peroxidase, protease, and most GH families. Other gene families remain close to an expected 3:1 ratio of wheat compared to the diploid species (Table 1). The trees of all cell wall-related gene families for the four species were bootstrapped 1000 times to indicate likely groupings from clustalW and are available as Supplemental File S1 in a PDF format to find species homologs. Examples of gene family differences are highlighted in the figures and described below. All gene names, families, and species were listed in an Excel file in Supplemental File S2 to aid gene-of-interest searches. High-confidence wheat genes, defined by the IWGSC, have known gene models from other plants and expression in wheat tissues. Low-confidence wheat genes with LC following their names contain the appropriate gene family protein motifs but unknown expression from the IWGSC [73].
The monolignol biochemical pathway is encoded by the PP family in both monocots and dicots. PAL is the first committed step in the process, while CAD is the last step, producing either sinapyl alcohol (syringyl lignin subunit) or coniferyl alcohol (guaiacyl lignin subunit) [20]. The CAD gene family includes 9 Arabidopsis, 12 rice, and 4 maize genes. However, wheat had 48, which was four times that of rice and five times that of Arabidopsis genes. At the top of Figure 1, fifteen wheat genes are distributed among the A, B, and D sub-genomes on chromosomes 6 and 7, closely related to two rice and one maize gene: Os08g16910.1, Os10g29470.1, and GRMZM2G118610_P01. The wheat gene naming convention indicates genes close to each other on the physical map based on the numbers after the “G” being or nearly sequential. The numbers can be nearly sequential if gene models are later removed [73]. Rice and Arabidopsis genes follow the same convention. At least two wheat genes close on the physical map were also near each other on the gene family tree, indicating similar protein sequences for wheat chromosomes 2A, 2B, 3A, 3B, 3D, 6A, 6B, and 6D, an indication of tandem duplications [71]. For example, the physical locations for TraesCS6D03G0816500, TraesCS6D03G0816600, and TraesCS6D03G0817100 were ~26 kb away from each other on the IWGSC 2.1 reference sequence GFF3 map (Table 2) [73]. Three CAD genes were on the 6A and 6B chromosomes, ranging from 17.6 to 43.3 kb apart (Figure 1, Table 2). Gene duplications may be present in the middle of Figure 1 with nine wheat genes on chromosomes 2A, 2B, and 2D for one rice gene, Os04g15920.1. TraesCS2A03G0131200 and TraesCS2A03G0131300 were 19.9 kb apart, while TraesCS2B03G0191100 and TraesCS2B03G0191300 were 33.7 kb apart. Potential duplications of CAD genes were found on chromosomes 3A, 3B, and 3D from 34 to 321 kb apart (Figure 1, Table 2). As the last enzyme in the production of monolignol precursors for lignin, increased total lignin or more tissue-selective lignin deposition may occur in wheat. The PAL family had approximately five times the wheat genes of maize and rice and twelve times that of Arabidopsis. There were multiple tandem duplications on the A, B, and D sub-genomes of chromosomes 1 and 2, with 1.4 to 268 kb between each duplicated gene (Table 2). Increased numbers of wheat genes with many tandem duplications compared to the other three species were also observed in HCT, CCR, COMT, C3′H, C4H, F5H, and 4CL.
The GT2 gene families produce many carbohydrate molecules, such as cellulose, via the CesA family. Maize CesA genes were previously reported as duplicated and expressed in the stem, numbering 20 as opposed to 10 for other diploid species [13,17]. The wheat CesA gene family did not show duplication with 28 members, 2 less than expected from hexaploidy (Figure 2). However, Csl A, E, G, and H have more members than expected. The CslA family synthesizes the galactomannan (1,4)-β-D-mannan backbone (Figure 3) [27]. A duplication of three wheat genes on 7A, 7B, and 7D closest in sequence to maize GRMZM2G443715_P01 with distances of 3.1, 9.5, and 49.7 kb apart from one another was observed (Table 2). Two sets of wheat genes on different chromosomes are near GRMZM2G107754_P02. Both maize and wheat show duplicated sequences closest in sequence to Os08g33740.1 (Figure 3). On the other hand, Csl E (Figure 4A), Csls G, and H (Figure 4B), whose functions are unknown, showed wheat sequence duplications at the periphery of the other family members [28]. The CslE 5A, 5B, and 5D distances between genes are 2.4, 2.7, and 4.7 kb, respectively (Table 2).
Thirty-five GT48 callose synthase genes were observed in wheat, over three times those in maize and rice. However, it was about three times the number of callose synthases in Arabidopsis. Two clades in Figure 5 had two Arabidopsis genes but only one maize and rice gene and at least two sets of wheat genes. Arabidopsis genes At4g04970.1_AtGSL1 and At4g03550.1_AtGSL 5 were both on chromosome 4 with two sets of wheat genes (3A, 3B, and 3D) but one rice and maize gene. Arabidopsis genes At1g06490.1_AtGSL7 and At3g59100.1_AtGSL 11 were close to wheat genes in 2A, 2D, 3A, 3B, 3D, 5B, and 5D with only one maize and rice gene. Callose coats the surface of cell plates as cell walls mature and can have many effects. Callose deposited in plasmodesmata helped determine the size exclusion limit and plasmodesmata permeability affecting the stomata and sieve development and transportation of phloem. Callose deposition can alter cell wall permeability during stress events such as pathogen attack, chilling stress, water stress, heat, or induction of heavy metals to reduce plant damage [29].
The GT61 and GT77 families also had a large expansion in wheat genes compared to maize, rice, and Arabidopsis, 191 in total. The rice gene, Os02g22380, a GT61 family member, was required for a β-(1,2) xylosyl substitution of α-(1,3) arabinosyls along the xylan chain, unique to grasses [31]. There were two sets of wheat genes closely associated with this rice gene on chromosomes 2 and 3 (Supplemental File S1). The wheat gene TaXAT1 (TraesCS6A03G0809100.1) of the GT61 family carried out most of the α-(1,3) arabinofuranose substitution on starchy endosperm arabinoxylans. The wheat gene TaXAT2 (TraesCS1B03G1010700.2) of the GT61 family, when transformed into Arabidopsis, converted some of the dicot glucuronoxylans to arabinoxylans. The function of GT61 was hypothesized to be the synthesis of grass-specific GAXs. In addition to adding arabinose to arabinoxylans, a rice mutant XAX1, and GT61 family member was observed to have reduced hydroxycinnamic acid-modified arabinosyl-substituted xylose [32]. With GT61 family members TaXAT1 and TaXAT2 having abrabinofuranosyltransferase activity in wheat, it is possible that the role of arabinoxylan feruloylation may partly explain the expansion of the GT61 family [30,32]. Wheat grain endosperm that is milled to flour contains arabinoxylan, a source of fiber. The alteration of TaXAT1 and XAX1 changed the amount of arabinoxylan feruloylation, which changed the fraction of soluble to insoluble fiber in wheat flour. The feruloylation of arabinoxylan increases the cross-linking of molecules, leading to more insoluble fiber. However, wheat grain is an important source of soluble fiber for human health [30,32,87]. One Arabidopsis mutant, At2g35610_XEG113, classified as a GT77 family member, had elongated hypocotyls in liquid culture with the presence of xyloglucanase. It also exhibited reduced arabinose, increased mannose, and increased glucose content in the extensin-rich fraction of the cell wall. Some GT77 family members add arabinose to extensins in the plant cell wall [88]. The GT14 family was slightly expanded to 44 members in wheat. The Arabidopsis gene At5g39990.1_AtGlcAT14A was characterized as a β-glucuronosyltransferase involved in Type II arabinogalactan biosynthesis and cell elongation [89].
The GH18 family, yieldins, had a large expansion of wheat genes over maize, rice, and Arabidopsis (Figure 6). The wheat genome showed signs of several tandem duplications. Wheat chromosomes 3A, 3B, and 3D had six to eight genes, each located between 2.5 and 457 kb from each other (Table 2). A smaller set of duplications in a second clade was also present in 2A, 2B, 2D, 7A, 7B, and 7D but included three maize genes (Figure 6). The GH18 family members were originally named chitinases for their ability to hydrolyze fungal cell walls composed of chitin and β-1,3, -1,6-glucan and provide resistance to fungal diseases [52,54]. More recent studies have suggested that the GH18 family operated in cell wall loosening, but their function(s) in plants are still not well understood [53]. The GH16 (XTH) and expansin families were greatly increased in wheat, accounting for 390 genes (Table 1). These families are also involved in cell wall loosening [53]. Expansins are related to the GH45 family but do not have glycolytic activity and may perform cell wall loosening through hydrogen bond disruption at the cell wall and aid glycoside hydrolase activity on cellulose [48,53,90]. Members of the XTH family have been hypothesized to tether or re-tether xyloglucans to cellulose during growth [49]. However, XTHs have been found to be highly upregulated and involved in remodeling the cell wall during biotic and abiotic stresses, indicating other roles in grasses [91]. The GH17 family encoding β-(1,3) glucanases showed an increase in wheat with 209 wheat genes across four groups of clades (Supplemental File S1). A rice gene, OsGLN2 (Os01g71670.1), was expressed in flowers and germinating seeds, strongly induced in the presence of gibberellin A3, and greatly decreased in the presence of abscisic acid [50]. This gene and wheat homologs may play a role in pre-harvest sprouting, as it has been suggested that many genes under the control of abscisic acid in germinating seeds are involved in pre-harvest sprouting [92]. A wheat gene in the GH17 family, TaGluD (TraesCS3A03G1134100.1), was characterized as a β-1,3-glucanase involved in fungal defense [51]. The expansion of genes in the family could be due to an increased role in pathogen and abiotic stress defense.
Commelinid monocot cell walls have lower amounts of pectic polysaccharides than dicots and were observed to have fewer genes that modify them [13,16]. The PL1 (pectin and pectate lyases), PL4 (RG I lyases), and CE8 (PMEs), while larger in wheat, have a ratio of ~2:1 compared to Arabidopsis, indicating that the hexaploidy of wheat did not lead to the complexity of genes found in the model dicot (Table 1). One notable exception was the gene family CE13 (PAEs). Wheat had 53 genes, five times the number of maize, rice, and Arabidopsis genes. The PAE family functions to cleave the acetyl ester bond from pectin but not xylan, thus regulating the degree of pectin acetylation [58,59]. Overexpression of a PtPAE1 in tobacco led to frequent wilting of apical buds reduced the lengths of floral styles and filaments, and had leaves with curled edges and wavy surfaces. The levels of pectin acetylation were lower in overexpressed lines, while xylose acetylation was unaffected, and pectin methylester content was potentially increased to compensate for the altered pectin. The lengths of floral epidermal cells were reduced, indicating severe impairment in cell elongation [58]. Other research indicated that one clade of Arabidopsis PAEs had high expression under water and salt stress and was involved in abiotic stress response [59]. Pre-harvest sprouting is caused by too much water at the maturation of the seed on the spike. A total of 3 maize and rice genes and 19 wheat genes, 16 on chromosomes 3A, 3B, and 3D, formed another clade away from Arabidopsis. The wheat genes were between 2.7 and 53.1 kb away from each other, indicating tandem duplications. Within the Arabidopsis clade, PAE wheat genes were duplicated once on chromosomes 5A, 5B, and 5D, although they were between 20 and 127 kb apart (Table 2; Supplemental File S1).
Other cell wall-modifying enzymes with more numerous genes were present in wheat. Laccases had four to six times the number of wheat genes as maize, rice, and Arabidopsis, while peroxidases had five to eight times as many (Table 1). Laccases were observed to polymerize monolignols into larger lignin subunits during the early stages of lignification, while peroxidases were observed to perform lignification in the presence of hydrogen peroxide later in the development of dark-grown cell-suspension cultures of sycamore maple xylem cells [64]. A cotton laccase overexpressed in poplar showed dramatically increased laccase activity and up to 19.6% increased lignin [93]. Peroxidases are a large gene family in plants that function in cell wall lignification and elongation, pathogen and abiotic stress defense, and seed germination. Peroxidases were proposed to function in cell wall stiffening through the suberization or cross-linking of extensins and degradation of anthocyanins [94,95]. The protease family of nearly 300 genes was greatly expanded in wheat, about 6 times those in maize and rice, and 16 times in Arabidopsis (Table 1). Proteases degrade other proteins [63]. Proteases have many roles in the plant but were very prevalent in grass seeds such as wheat, barley, rye, and oat, where they formed a complex mixture of grain storage proteins with the function of hydrolysis of other seed proteins during germination [96]. The cysteine (CYS) proteases were the most enlarged protease group (Supplemental File S1). This could be expected as wheat seeds have many proteins, including gluten proteins associated with celiac disorders that CYS proteases act upon, but are not as prevalent in maize and rice [96].
The results of this research are trees for all 81 cell wall gene families of wheat, maize, rice, and Arabidopsis, available as Supplemental File S1. An Excel sheet lists the wheat gene family classification resulting from this research, the gene name, and the species for each gene in the study in the first through third columns, respectively, and is available as Supplemental File S2. Gene families not described were small or did not show drastically different gene numbers of wheat compared to maize or rice and have been previously described in many publications.
It is beyond the scope of this study to determine whether subfunctionalization or neofunctionalization has occurred in each gene family. However, other literature has demonstrated specific instances where neo/subfunctionalization has occurred in cell wall gene families. When duplicated genes diverge to give one gene a new function (neofunctionalization) or the two genes each perform part of the original gene function (subfunctionalization), it is through protein-coding differences [71]. This would likely manifest in protein sequences diverging and could cause them to align to the periphery of the gene family or form a new clade. All included wheat genes that were at the tree periphery or appeared unrelated to the family were checked using Clustal Omega [86] for at least 60% protein identity and for the correct gene family-related protein motifs using the EMBL-EBI protein motif viewer or prosite [14,15]. An outlying clade in Arabidopsis PAE genes was responsive to water and salt stress, representing novel functions for the gene family through neofunctionalization [57]. Nineteen wheat genes also form a grass-only clade in the PAE family, which may represent further changes in function [71]. Similarly, promoter sequences rather than protein sequences may diverge. As the protein sequences would remain nearly the same, genes would likely group together regardless of species in tree clades, as occurred with the laccases. Poplar laccases showed different tissue expression patterns compared to Arabidopsis, suggesting the expression subfunctionalization of the gene family as poplar and Arabidopsis genes were together in clades of a tree rather than separate clades [72,97]. The laccase gene family between wheat, maize, rice, and Arabidopsis showed consistent clades with members in each clade (Supplemental File S1), just like poplar and Arabidopsis [97]. The PP genes are known to be under finer control in multiple tissues or at different times. Dicot PAL genes use only phenylalanine as a substrate, but some grass PAL genes can use phenylalanine or tyrosine as a substrate. This may have led to multiple function and expression changes over time, explaining the large PP gene expansion, potentially expressing subfunctionalization and neofunctionalization [72,98,99]. Peroxidases were hypothesized to have increased numbers in plants to maintain a diverse set of promoters for different tissues and timing of expression and protein distribution associated with large and diverse projected activities in the plant, which could be neofunctionalization and/or expression subfunctionalization [94].
As an example of the utility of the information presented, a comparison was performed with the bioinformatics available for wheat genes in a recent RNA sequencing expression analysis paper on pre-harvest sprouting resistance. The author found that many genes differentially expressed in the seed at 35 days after anthesis in a pre-harvest sprouting resistant variety, Scotty, between the control plants and those under conditions conducive to pre-harvest sprouting [10]. Many cell wall-related genes were reported in the Supplemental Files but could not be classified for potential function due to vague descriptions in available bioinformatics searches. By converting the version 1 protein to version 2.1 and searching Supplemental File S2, the specific gene families were revealed (Table 3). Five genes listed only as “cellulose synthase-like protein” became three members of CslC and two from CslF. All three CslCs are related to maize CslC12a and CslC12b, while the CslFs are related to rice CslF6 and maize CSlF2 and CSlF4 (Supplemental File S1). Five other genes were variously listed as “UDP-glucose:glycoprotein glucosyltransferase”, “Glycosyltransferase”, and “glycosyltransferase family exostosin protein” but belong to the GT families GT24, GT61, and GT47 E, respectively. The CslC family generates the xylan backbone, while some GT47 genes are needed for xylan backbone synthesis [26,47]. The GT61 genes are involved in forming GAXs. A transgenic Arabidopsis plant expressing a wheat GT61 gene was able to produce arabinoxylans from a xyloglucan backbone [30]. Genes belonging to these families are highly expressed during seed development in spring wheat [100]. This suggests the involvement of CslC, GT47E, and GT61, which combine to produce GAXs, as having a possible role in pre-harvest sprouting. Also, CslFs produce a grass-unique mixed linkage (1,3;1,4)-β-D-glucan that may be involved [25]. Finally, several members of the PP were also differentially expressed in Scotty in control versus pre-harvest sprouting conducive conditions. Most were listed with correct gene families, but “O-methyl transferase” was not family specific. The gene was revealed as belonging to the COMT family here (Table 3). Thus, the PP pathway may also have a role in pre-harvest sprouting through the production of monolignols. Similar searches could be performed quickly and easily for any group of wheat cell wall-related genes that were unclear from a basic bioinformatic search.

4. Conclusions

The purpose of this research was to produce a curated set of wheat cell wall-related genes captured in 81 gene families and compare them to homologous genes in maize, rice, and Arabidopsis previously curated. The added information can be used to gain insight into potential wheat gene function from bioinformatic searches. An example showed how information in this research better classified the cell wall-related wheat genes from an expression analysis. Some wheat cell wall-related gene families showed a large expansion beyond what would be expected due to their hexaploid compared to the diploid species maize, rice, and Arabidopsis. Some expansions appeared to be due to tandem duplication events in wheat. This work presents a more complete picture of a large and diverse set of gene families in wheat responsible for an important plant function, the production and maintenance of the cell wall.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/d15111135/s1, Supplemental File S1. All cell wall-related gene family trees for wheat, maize, rice, and Arabidopsis. Supplemental File S2. All gene names for wheat, rice, maize, and Arabidopsis and their cell wall-related gene families.

Funding

This research was supported by the U.S. Department of Agriculture, Agricultural Research Service project # 5082-43440-001-00D.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

Full gene names have been given, and the respective protein gene sequences can be downloaded from the websites for the versions of the wheat, maize, rice, and Arabidopsis as described in Section 2. The websites are long-term repositories for protein sequences.

Acknowledgments

The mention of trade names or commercial products in this publication is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the U.S. Department of Agriculture. USDA is an equal opportunity provider and employer. The findings and conclusions in this report are those of the author and should not be construed to represent any official USDA or U.S. Government determination or policy.

Conflicts of Interest

The author declares no conflict of interest.

Abbreviations

4-coumarate CoA ligase (4CL), arabinogalactan protein (AGP), basepairs (bp), cinnamate 4-hydroxylase (C4H), (hydroxy)cinnamyl alcohol dehydrogenase (CAD), caffeoyl CoA O-methyltransferase (CCoAOMT), caffeic acid (5-hydroxy-coniferaldehyde) O-methyltransferase (COMT), carbohydrate esterase (CE) cellulose synthase (CesA), (hydroxy)cinnamoyl CoA reductase (CCR), cellulose synthase-like (Csl), cysteine (CYS), ferulate (coniferaldehyde) 5-hydroxylase (F5H), glucoarabinoxylan (GAX), GDP-4-keto-6-deoxy-D-mannose 3,5-epimerase-4-reductase (GER), GDP-D-mannose-4,6-dehydratase (GMD), GDP-D-mannose 3,5-epimerase (GME), glycosyl hydrolases (GH), glycosyl transferases (GT), homogalaturonan (HG), hydroxycinnamoyl CoA:shikimate hydroxycinnamoyl transferase (HCT), kilobases (kb), low confidence wheat gene (LC), membrane-anchored UDP-D-glucuronate decarboxylase (AUD), p-coumaroyl shikimate 3′-hydroxylase (C3′H), pectin acetylesterase (PAE), pectin methylesterases (PME), phenylalanine ammonia lyase (PAL), Phenylpropanoid pathway (PP), polygalacturonase (PGase), polysaccharide lyases (PL), prolyl-4-hydroxylase (P4H), rhamnogalacturonan (RG), skewed growth proteins (SKU), soluble UDP-D-glucuronate decarboxylase (SUD) genes, UDP-D-glucose 4-epimerase (AXS), UDP-D-glucuronate 4-epimerase (GAE), UDP-L-rhamnose synthase (RHM), UDP-4-keto-6-deoxy-D-glucose 3,5-epimerase-4-reductase (UER), UDP-D-glucose dehydrogenase (UGD), UDP-D-glucose 4-epimerase (UGE), and UDP-D-xylose 4-epimerase (UXE), xyloglucan endotransglucosylase/hydrolase (XTH), xyloglucan (XyG).

References

  1. Asseng, S.; Guarin, J.R.; Raman, M.; Monje, O.; Kiss, G.; Despommier, D.D.; Meggars, F.M.; Gauthier, P.P.G. Wheat yield potential in controlled-environment vertical farms. Proc. Natl. Acad. Sci. USA 2020, 117, 19131–19135. [Google Scholar] [CrossRef]
  2. Wheat Explorer. Available online: https://ipad.fas.usda.gov/cropexplorer/cropview/commodityView.aspx?cropid=0410000 (accessed on 21 August 2023).
  3. Tripathi, M.K.; Karim, S.A.; Chaturvedi, O.H.; Verma, D.L. Nutritional value of animal feed grade wheat as a replacement for maize in lamb feeding for mutton production. J. Sci. Food Agric. 2007, 87, 2447–2455. [Google Scholar] [CrossRef]
  4. Patwa, N.; Penning, B.W. Environmental impact on cereal crop grain damage from pre-harvest sprouting and late maturity alpha-amylase. In Sustainable Agriculture in the Era of Climate Change; Roychowdhury, R., Choudhury, S., Hasanuzzaman, M., Srivastava, S., Eds.; Springer Nature: Cham, Switzerland, 2020; pp. 23–41. [Google Scholar] [CrossRef]
  5. Farrokhi, N.; Burton, R.A.; Brownfield, L.; Hrmova, M.; Wilson, S.M.; Bacic, A.; Fincher, G.B. Plant Cell wall biosynthesis: Genetic, biochemical and functional genomics approaches to the identification of key genes. Plant Biotechnol. J. 2006, 4, 145–167. [Google Scholar] [CrossRef]
  6. Shrivastava, B.; Jain, K.K.; Kalra, A.; Kuhad, R.C. Bioprocessing of wheat straw into nutritionally rich and digested cattle feed. Sci. Rep. 2014, 4, 6360. [Google Scholar] [CrossRef] [PubMed]
  7. Tsegaye, B.; Balomajumder, C.; Roy, P. Optimization of microwave and NaOH pretreatments of wheat straw for enhancing biofuel yield. Energy Convers. Manag. 2019, 186, 82–92. [Google Scholar] [CrossRef]
  8. Yong, W.; Link, B.; O’Malley, R.; Tewari, J.; Hunter, C.T.; Lu, C.A.; Li, X.; Bleecker, A.B.; Koch, K.E.; McCann, M.C.; et al. Genomics of plant cell wall biogenesis. Planta 2005, 221, 747–751. [Google Scholar] [CrossRef]
  9. Nirmal, R.C.; Furtado, A.; Rangan, P.; Henry, R.J. Fasciclin-like arabinogalactan protein gene expression is associated with yield of flour in the milling of wheat. Sci. Rep. 2017, 7, 12539. [Google Scholar] [CrossRef]
  10. Penning, B.W. Gene expression differences related to pre-harvest sprouting uncovered in related wheat varieties by RNAseq analysis. Plant Gene 2023, 33, 100404. [Google Scholar] [CrossRef]
  11. Drula, E.; Garron, M.L.; Dogan, S.; Lombard, V.; Henrissat, B.; Terrapon, N. The carbohydrate-active enzyme database: Functions and literature. Nucl. Acids Res. 2022, 50, D571–D577. [Google Scholar] [CrossRef] [PubMed]
  12. Penning, B.W.; McCann, M.C.; Carpita, N.C. Evolution of the cell wall gene families of grasses. Front. Plant Sci. 2019, 10, 1205. [Google Scholar] [CrossRef]
  13. Penning, B.W.; Hunter, C.T.; Tayengwa, R.; Eveland, A.L.; Dugard, C.K.; Olek, A.T.; Vermerris, W.; Koch, K.E.; McCarty, D.R.; Davis, M.F.; et al. Genetic Resources for maize cell wall biology. Plant Physiol. 2009, 151, 1703–1728. [Google Scholar] [CrossRef]
  14. Potter, S.C.; Luciani, A.; Eddy, S.R.; Park, Y.; Lopez, R.; Finn, R.D. HMMER webserver: 2018 update. Nucl. Acids Res. 2018, 46, W200–W204. [Google Scholar] [CrossRef] [PubMed]
  15. Sigrist, C.J.A.; de Castro, E.; Cerutti, L.; Cuche, B.A.; Hulo, N.; Bridge, A.; Bougueleret, L.; Xenarios, I. New and continuing developments at PROSITE. Nucl. Acids Res. 2013, 41, D344–D347. [Google Scholar] [CrossRef]
  16. Carpita, N.C. Structure and biogenesis of the cell walls of grasses. Annu. Rev. Plant Physiol. Plant Mol. Biol. 1996, 47, 445–476. [Google Scholar] [CrossRef]
  17. Penning, B.W.; Shiga, T.M.; Klimek, J.F.; SanMiguel, P.J.; Shreve, J.; Thimmapuram, J.; Sykes, R.W.; Davis, M.F.; McCann, M.C.; Carpita, N.C. Expression profiles of cell-wall related genes vary broadly between two common maize inbreds during stem development. BMC Genom. 2019, 20, 785. [Google Scholar] [CrossRef] [PubMed]
  18. CAZY. Available online: http://www.cazy.org/ (accessed on 10 August 2023).
  19. Orellana, A.; Moraga, C.; Araya, M.; Moreno, A. Overview of nucleotide sugar transporter gene family functions across multiple species. J. Mol. Biol. 2016, 428, 3150–3165. [Google Scholar] [CrossRef] [PubMed]
  20. Bonawitz, N.D.; Chapple, C. The genetics of lignin biosynthesis: Connecting genotype to phenotype. Annu. Rev. Genet. 2010, 44, 337–363. [Google Scholar] [CrossRef]
  21. Reiter, W.D.; Vanzin, G.F. Molecular genetics of nucleotide sugar interconversion pathways in plants. Plant Mol. Biol. 2001, 47, 95–113. [Google Scholar] [CrossRef]
  22. Yin, Y.; Huang, J.; Gu, X.; Bar-Peled, M.; Xu, Y. Evolution of plant nucleotide-sugar interconversion enzymes. PLoS ONE 2011, 6, e27995. [Google Scholar] [CrossRef]
  23. Favery, B.; Ryan, E.; Foreman, J.; Linstead, P.; Boudonck, K.; Steer, M.; Shaw, P.; Dolan, L. KOJAK encodes a cellulose synthase-like protein required for root hair cell morphogenesis in Arabidopsis. Genes Dev. 2001, 15, 79–89. [Google Scholar] [CrossRef]
  24. Holland, N.; Holland, D.; Helentjaris, T.; Dhugga, K.; Xoconostle-Cazares, B.; Delmer, D.P. A comparative analysis of the plant cellulose synthase (CesA) gene family. Plant Physiol. 2000, 123, 1313–1323. [Google Scholar] [CrossRef]
  25. Burton, R.A.; Wilson, S.M.; Hrmova, M.; Harvey, A.J.; Shirley, N.J.; Medhurst, A.; Stone, B.A.; Newbigin, E.J.; Bacic, A.; Fincher, G.B. Cellulose synthase-like CslF genes mediate the synthesis of cell wall (1,3;1,4)-b-D-glucans. Science 2006, 311, 1940–1942. [Google Scholar] [CrossRef]
  26. Cocuron, J.C.; Lerouxel, O.; Drakakaki, G.; Alonso, A.P.; Liepman, A.H.; Keegstra, K.; Raikhel, N.; Wilkerson, C.G. A gene from the cellulose synthase-like C family encodes a β-1,4 glucan synthase. Proc. Natl. Acad. Sci. USA 2007, 104, 8550–8555. [Google Scholar] [CrossRef]
  27. Dhugga, K.S.; Barreiro, R.; Whitten, B.; Stecca, K.; Hazebroek, J.; Randhawa, G.S.; Dolan, M.; Kinney, A.J.; Tomes, D.; Nichols, S.; et al. Guar seed β-mannan synthase is a member of the cellulose synthase super gene family. Science 2004, 303, 363–366. [Google Scholar] [CrossRef]
  28. Liu, X.; Zhang, H.; Zhang, W.; Xu, W.; Li, S.; Chen, X.; Chen, H. Genome-wide bioinformatics analysis of cellulose synthase gene family in common bean (Phaseolus vulgaris L.) and the expression in the pod development. BMC Genom. Data 2022, 23, 9. [Google Scholar] [CrossRef] [PubMed]
  29. Wang, B.; Andargie, M.; Fang, R. The function and biosynthesis of callose in high plants. Heliyon 2022, 8, e09248. [Google Scholar] [CrossRef]
  30. Anders, N.; Wilkinson, M.D.; Lovegrove, A.; Freeman, J.; Tryfona, T.; Pellny, T.K.; Weimar, T.; Mortimer, J.C.; Stott, K.; Baker, J.M.; et al. Glycosyl transferases in family 61 mediate arabinofuranosyl transfer onto xylan in grasses. Proc. Natl. Acad. Sci. USA 2012, 109, 989–993. [Google Scholar] [CrossRef] [PubMed]
  31. Chiniquy, D.; Sharma, V.; Schultink, A.; Baidoo, E.E.; Rautengarten, C.; Cheng, K.; Carroll, A.; Ulvskov, P.; Harholt, J.; Keasling, J.D.; et al. XAX1 from glycosyltransferase family 61 mediates xylosyl transfer to rice xylan. Proc. Natl. Acad. Sci. USA 2012, 109, 17117–17122. [Google Scholar] [CrossRef] [PubMed]
  32. Feijao, C.; Morreel, K.; Anders, A.; Tryfona, T.; Busse-Wicher, M.; Kotake, T.; Boerjan, W.; Dupree, P. Hydroxycinnamic acid-modified xylan side chains and their cross-linking products in rice cell walls are reduced in the Xylosyl arabinosyl substitution of xylan 1 mutant. Plant J. 2022, 109, 1152–1167. [Google Scholar] [CrossRef]
  33. Egelund, J.; Obel, N.; Ulvskov, P.; Geshi, N.; Pauly, M.; Bacic, A.; Petersen, B.L. Molecular characterization of two Arabidopsis thaliana glycosyltransferase mutants, rra1 and rra2, which have a reduced residual arabinose content in a polymer tightly associated with the cellulosic wall residue. Plant Mol. Biol. 2007, 64, 439–451. [Google Scholar] [CrossRef] [PubMed]
  34. Scheller, H.V.; Jensen, J.K.; Sørensen, S.O.; Harholt, J.; Geshi, N. Biosynthesis of pectin. Physiol. Plant. 2007, 129, 283–295. [Google Scholar] [CrossRef]
  35. Sterling, J.D.; Atmodjo, M.A.; Inwood, S.E.; Kolli, V.S.K.; Quigley, H.F.; Hahn, M.G.; Mohnen, D. Functional identification of an Arabidopsis pectin biosynthesic homogalacturonan galacturonosyltransferase. Proc. Natl. Acad. Sci. USA 2006, 103, 5236–5241. [Google Scholar] [CrossRef]
  36. Lee, C.; Zhong, R.; Richardson, E.A.; Himmelsbach, D.S.; McPhail, B.T.; Ye, Z.H. The PARVUS gene is expressed in cells undergoing secondary wall thickening and is essential for glucuronoxylan biosynthesis. Plant Cell Physiol. 2007, 48, 1659–1672. [Google Scholar] [CrossRef] [PubMed]
  37. Qu, Y.; Egelund, J.; Gilson, P.R.; Houghton, F.; Gleeson, P.A.; Schultz, C.J.; Bacic, A. Identification of a novel group of putative Arabidopsis thaliana β-(1,3)-galactosyltransferases. Plant Mol. Biol. 2008, 68, 43–59. [Google Scholar] [CrossRef]
  38. Strasser, R.; Bondili, J.S.; Vavra, U.; Schoberer, J.; Svoboda, B.; Glössl, J.; Léonard, R.; Stadlmann, J.; Altmann, F.; Steinkellner, H.; et al. A unique β1,3-galactosyltransferase is indispensable for the biosynthesis of N-glycans containing Lewis a structures in Arabidopsis thaliana. Plant Cell 2007, 19, 2278–2292. [Google Scholar] [CrossRef]
  39. Cavalier, D.M.; Lerouxel, O.; Neumetzler, L.; Yamauchi, K.; Reinecke, A.; Freshour, G.; Zabotina, O.A.; Hahn, M.G.; Burgert, I.; Pauly, M.; et al. Disrupting two Arabidopsis thaliana xylosyltransferase genes results in plants deficient in xyloglucan, a major primary cell wall component. Plant Cell 2008, 20, 1519–1537. [Google Scholar] [CrossRef] [PubMed]
  40. Sarria, R.; Wagner, T.A.; O’Neill, M.A.; Faik, A.; Wilkerson, C.G.; Keegstra, K.; Raikhel, N.V. Characterization of a family of Arabidopsis genes related to xyloglucan fucosyltransferase1. Plant Physiol. 2001, 127, 1595–1606. [Google Scholar] [CrossRef] [PubMed]
  41. Vanzin, G.F.; Madson, M.; Carpita, N.C.; Raikhel, N.V.; Keegstra, K.; Reiter, W.D. The mur2 mutant of Arabidopsis thaliana lacks fucosylated xyloglucan because of a lesion in fucosyltransferase AtFUT1. Proc. Natl. Acad. Sci. USA 2002, 99, 3340–3345. [Google Scholar] [CrossRef] [PubMed]
  42. Harholt, J.; Jensen, J.K.; Sørensen, S.O.; Orfila, C.; Pauly, M.; Scheller, H.V. ARABINAN DEFICIENT1 is a putative arabinosyltransferase involved in biosynthesis of pectic arabinan in Arabidopsis. Plant Physiol. 2006, 140, 49–58. [Google Scholar] [CrossRef]
  43. Jensen, J.K.; Sørensen, S.O.; Harholt, J.; Geshi, N.; Sakuragi, Y.; Møller, I.; Zandleven, J.; Bernal, A.J.; Jensen, N.B.; Sørensen, C.; et al. Identification of a xylogalacturonan xylosyltransferase involved in pectin biosynthesis in Arabidopsis. Plant Cell 2008, 20, 1289–1302. [Google Scholar] [CrossRef]
  44. Madson, M.; Dunand, C.; Li, X.; Verma, R.; Vanzin, G.F.; Caplan, J.; Shoue, D.A.; Carpita, N.C.; Reiter, W.-D. The MUR3 gene of Arabidopsis encodes a xyloglucan galactosyltransferase that is evolutionarily related to animal exostosins. Plant Cell 2003, 15, 1662–1670. [Google Scholar] [CrossRef]
  45. Peña, M.J.; Zhong, R.; Zhou, G.K.; Richardson, E.A.; O’Neill, M.A.; Darvill, A.G.; York, W.S.; Ye, Z.H. Arabidopsis irregular xylem8 and irregular xylem9: Implications for the complexity of glucuronoxylan biosynthesis. Plant Cell 2007, 19, 549–563. [Google Scholar] [CrossRef] [PubMed]
  46. Zhong, R.; Peña, M.J.; Zhou, G.K.; Nairn, C.J.; Wood-Jones, A.; Richardson, E.A.; Morrison, W.H.; Darvill, A.G.; York, W.S.; Ye, Z.-H. Arabidopsis fragile fiber8, which encodes a putative glucuronosyltransferase, is essential for normal secondary wall synthesis. Plant Cell 2005, 17, 3390–3408. [Google Scholar] [CrossRef] [PubMed]
  47. Brown, D.M.; Zhang, Z.; Stephens, E.; Dupree, P.; Turner, S.R. Characterization of IRX10 and IRX10-like reveals an essential role in glucuronoxylan biosynthesis in Arabidopsis. Plant J. 2009, 57, 732–746. [Google Scholar] [CrossRef]
  48. Cosgrove, D.J. Loosening of plant cell walls by expansins. Nature 2000, 407, 321–326. [Google Scholar] [CrossRef] [PubMed]
  49. Rose, J.K.C.; Braam, J.; Fry, S.C.; Nishitani, K. The XTH family of enzymes involved in xyloglucan endotransglucosylation and endohydrolysis: Current perspectives and a new unifying nomenclature. Plant Cell Physiol. 2002, 43, 1421–1435. [Google Scholar] [CrossRef]
  50. Akiyama, T.; Pillai, M.A.; Sentoku, N. Cloning, characterization and expression of OsGLN2, a rice endo-1,3-beta-glucanase gene regulated developmentally in flowers and hormonally in germinating seeds. Planta 2004, 220, 129–139. [Google Scholar] [CrossRef]
  51. Liu, B.; Lu, Y.; Xin, Z.; Zhang, Z. Identification and antifungal assay of a wheat B-1,3-glucanase. Biotechnol. Lett. 2009, 31, 1005–1010. [Google Scholar] [CrossRef]
  52. Minic, Z. Physiological roles of plant glycoside hydrolases. Planta 2008, 227, 723–740. [Google Scholar] [CrossRef]
  53. Nawaz, M.A.; Rehman, H.M.; Imtiaz, M.; Baloch, F.S.; Lee, J.D.; Yang, S.H.; Lee, S.I.; Chung, G. Systems identification and characterization of cell wall reassembly and degradation related genes in Glycine max (L.) Merill, a bioenergy legume. Sci. Rep. 2017, 7, 10862. [Google Scholar] [CrossRef]
  54. Schlumbaum, A.; Mauch, F.; Vögeli, U.; Boller, T. Plant chitinases are potent inhibitors of fungal growth. Nature 1986, 324, 365–367. [Google Scholar] [CrossRef]
  55. Jindou, S.; Xu, Q.; Kenig, R.; Shulman, M.; Shoham, Y.; Bayer, E.A.; Lamed, R. Novel architecture of family-9 glycoside hydrolases identified in cellulosal enzymes of Acetivibrio cellulyticus and Clostridium thermocellum. FEMS Microbiol. Lett. 2006, 254, 308–316. [Google Scholar] [CrossRef] [PubMed]
  56. Jenkins, E.S.; Paul, W.; Craze, M.; Whitelaw, A.; Weigand, A.; Roberts, J.A. Dehiscence-related expression of an Arabidopsis thaliana gene encoding a polygalacturonase in transgenic plants of Brassica napus. Plant Cell Environ. 1999, 22, 159–167. [Google Scholar] [CrossRef]
  57. Sampedro, J.; Gianzo, C.; Iglesias, N.; Guitián, E.; Revilla, G.; Zarra, I. AtBGAL10 is the main xyloglucan β-galactosidase in Arabidopsis, and its absence results in unusual xyloglucan subunits and growth defects. Plant Physiol. 2012, 158, 1146–1157. [Google Scholar] [CrossRef] [PubMed]
  58. Gou, Y.Y.; Miller, L.M.; Hou, G.; Yu, X.H.; Chen, X.Y.; Liu, C.J. Acetylesterase-mediated deacetylation of pectin impairs cell elongation, pollen germination, and plant reproduction. Plant Cell 2012, 24, 50–65. [Google Scholar] [CrossRef]
  59. Philippe, F.; Pelloux, J.; Rayon, C. Plant pectin acetylesterase structure and function: New insights from bioinformatic analysis. BMG Genom. 2017, 18, 456. [Google Scholar] [CrossRef]
  60. Louvet, R.; Cavel, E.; Gutierrez, L.; Guénin, S.; Roger, D.; Gillet, F.; Guerineau, F.; Pelloux, J. Comprehensive expression profiling of the pectin methylesterase gene family during silique development in Arabidopsis thaliana. Planta 2006, 224, 782–791. [Google Scholar] [CrossRef]
  61. Domingo, C.; Roberts, K.; Stacey, N.J.; Connerton, I.; Ruíz-Teran, F.; McCann, M.C. A pectate lyase from Zinnia elegans is auxin inducible. Plant J. 1998, 13, 17–28. [Google Scholar] [CrossRef]
  62. Ochoa-Jiménez, V.A.; Berumen-Varela, G.; Burgara-Estrella, A.; Orozco-Avitia, J.A.; Ojeda-Contreras, A.J.; Trillo-Hernández, E.A.; Rivera-Domínguez, M.; Troncoso-Rojas, R.; Báez-Sañudo, R.; Datsenka, T.; et al. Functional analysis of tomato rhamnogalacturonan lyase gene Solyc11g011300 during fruit development and ripening. J. Plant Physiol. 2018, 231, 31–40. [Google Scholar] [CrossRef]
  63. Canut, H.; Albenne, C.; Jamet, E. Post-translational modifications of plant cell wall proteins and peptides: A survey from a proteomics point of view. Biochim. Biophys. Acta–Proteins Proteom. 2016, 1864, 983–990. [Google Scholar] [CrossRef]
  64. Sterjiades, R.; Dean, J.F.D.; Gamble, G.; Himmelsbach, D.S.; Eriksson, K.L. Extracellular laccases and peroxidases from sycamore maple (Acer pseudoplatanus) cell-suspension cultures. Planta 1993, 190, 75–83. [Google Scholar] [CrossRef]
  65. Brown, D.M.; Zeef, L.A.H.; Ellis, J.; Goodacre, R.; Turner, S.R. Identification of novel genes in Arabidopsis involved in secondary cell wall formation using expression profiling and reverse genetics. Plant Cell 2005, 17, 2281–2295. [Google Scholar] [CrossRef] [PubMed]
  66. Li, Y.; Qian, Q.; Zhou, Y.; Yan, M.; Sun, L.; Zhang, M.; Fu, Z.; Wang, Y.; Han, B.; Pang, X.; et al. BRITTLE CULM1, which encodes a COBRA-like protein, affects the mechanical properties of rice plants. Plant Cell 2003, 15, 2020–2031. [Google Scholar] [CrossRef] [PubMed]
  67. Velasquez, S.M.; Ricardi, M.M.; Poulsen, C.P.; Oikawa, A.; Dilokpimol, A.; Halim, A.; Mangano, S.; Juarez, S.P.D.; Marzol, E.; Salter, J.D.S.; et al. Complex regulation of prolyl-5-hydroxylases impacts root hair expansion. Mol. Plant 2015, 8, 734–746. [Google Scholar] [CrossRef]
  68. Califar, B.; Sng, N.J.; Zupanska, A.; Paul, A.L.; Ferl, R.J. Root skewing-associated genes impact the spaceflight response of Arabidopsis thaliana. Front. Plant Sci. 2020, 11, 239. [Google Scholar] [CrossRef]
  69. Sedbrook, J.C.; Carroll, K.L.; Hung, K.F.; Masson, P.H.; Somerville, C.R. The Arabidopsis SKU5 gene encodes an extracellular glycosyl phosphatidylinositol-anchored glycoprotein involved in directional root growth. Plant Cell 2002, 14, 1635–1648. [Google Scholar] [CrossRef]
  70. Zhou, C.; Dong, Z.; Zhang, T.; Wu, J.; Yu, S.; Zeng, Q.; Han, D.; Tong, W. Genome-scale analysis of homologous genes among subgenomes of bread wheat (Triticum aestivum L.). Int. J. Mol. Sci. 2020, 21, 3015. [Google Scholar] [CrossRef]
  71. Johnson, D.A.; Thomas, M.A. The monosaccharide transporter gene family in Arabidopsis and rice: A history of duplications, adaptive evolution, and functional divergence. Mol. Biol. Evol. 2007, 24, 2412–2423. [Google Scholar] [CrossRef]
  72. Xu, A.F.; Molinuevo, R.; Fazzari, E.; Tom, H.; Zhang, Z.; Menendez, J.; Casey, K.M. Subfunctionalized expression drives evolutionary retention of ribosomal protein paralogs Rps27 and Rps27l in vertebrates. eLife 2023, 12, e78695. [Google Scholar] [CrossRef]
  73. Zhu, T.; Wang, L.; Rimbert, H.; Rodriguez, J.C.; Deal, K.R.; De Oliveira, R.; Choulet, F.; Keeble-Gagnère, G.; Tibbits, J.; Rogers, J.; et al. Optical maps refine the bread wheat Triticum aestivum cv Chinese Spring genome assembly. Plant J. 2021, 107, 303–314. [Google Scholar] [CrossRef]
  74. Altschul, S.F.; Gish, W.; Miller, W.; Myers, E.W.; Lipman, D.J. Basic local alignment search tool. J. Mol. Biol. 1990, 215, 403–410. [Google Scholar] [CrossRef] [PubMed]
  75. International Wheat Genome Sequencing Consortium. Available online: https://www.wheatgenome.org/ (accessed on 21 August 2023).
  76. MaizeGDB. Available online: https://www.maizegdb.org/ (accessed on 21 August 2023).
  77. Rice Genome Annotation Project. Available online: http://rice.uga.edu/ (accessed on 21 August 2023).
  78. The Arabidopsis Information Reseource. Available online: https://www.arabidopsis.org/ (accessed on 21 August 2023).
  79. Chenna, R.; Sugawara, H.; Koike, T.; Lopez, R.; Gibson, T.J.; Higgins, D.G.; Thompson, J.D. Multiple sequence alignment with the Clustal series of programs. Nucl. Acids Res. 2003, 31, 3497–3500. [Google Scholar] [CrossRef]
  80. Saitou, N.; Nei, M. The neighbor-joining method: A new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 1987, 4, 406–425. [Google Scholar] [CrossRef]
  81. Thompson, J.D.; Higgins, D.G.; Gibson, T.J. CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position specific gap penalties and weight matrix choice. Nucl. Acids Res. 1994, 22, 4673–4680. [Google Scholar] [CrossRef] [PubMed]
  82. Chevenet, F.; Brun, C.; Banuls, A.L.; Jacq, B.; Christen, R. TreeDyn: Towards dynamic graphics and annotations for analyses of trees. BMC Bioinform. 2006, 7, 439. [Google Scholar] [CrossRef]
  83. HMMER. Available online: https://www.ebi.ac.uk/Tools/hmmer/ (accessed on 11 August 2023).
  84. Prosite. Available online: https://prosite.expasy.org (accessed on 11 August 2023).
  85. Clustal Omega. Available online: https://www.ebi.ac.uk/Tools/msa/clustalo/ (accessed on 11 August 2023).
  86. Madeira, F.; Pearce, M.; Tivey, A.R.; Basutkar, P.; Lee, J.; Edbali, O.; Madhusoodanan, N.; Kolesnikov, A.; Lopez, R. Search and sequence analysis tools services from EMBL-EBI in 2022. Nucl. Acids Res. 2022, 50, W276–W279. [Google Scholar] [CrossRef]
  87. Pellny, T.K.; Patil, A.; Wood, A.J.; Freeman, J.; Halsey, K.; Plummer, A.; Kosik, O.; Temple, H.; Collins, J.D.; Dupree, P.; et al. Loss of TaIRX9b gene function in wheat decreases chain length and amount of arabinoxylan in grain but increases cross-linking. Plant Biotech. J. 2020, 18, 2316–2327. [Google Scholar] [CrossRef]
  88. Gille, S.; Hänsel, U.; Ziemann, M.; Pauly, M. Identification of plant cell wall mutants by means of a forward chemical genetic approach using hydrolases. Proc. Natl. Acad. Sci. USA 2009, 25, 14699–14704. [Google Scholar] [CrossRef]
  89. Knoch, E.; Dilokpimol, A.; Tryfona, T.; Poulsen, C.P.; Xiong, G.; Harholt, J.; Petersen, B.L.; Ulvskov, P.; Hadi, M.Z.; Kotake, T.; et al. A β-glucuronosyltransferase from Arabidopsis thaliana involved in biosynthesis of type II arabinogalactan has a role in cell elongation during seedling growth. Plant J. 2013, 76, 1016–1029. [Google Scholar] [CrossRef]
  90. Sampedro, J.; Cosgrove, D.J. The expansin superfamily. Genome Biol. 2005, 6, 242. [Google Scholar] [CrossRef]
  91. Tenhaken, R. Cell wall remodeling under abiotic stress. Front. Plant Sci. 2015, 5, 771. [Google Scholar] [CrossRef]
  92. Chono, M.; Honda, I.; Shinoda, S.; Kushiro, T.; Kamiya, Y.; Nambara, E.; Kawakami, N.; Kaneko, S.; Watanabe, Y. Field studies on the regulation of abscisic acid content and germinability during grain development of barley: Molecular and chemical analysis of pre-harvest sprouting. J. Exp. Bot. 2006, 57, 2421–2434. [Google Scholar] [CrossRef] [PubMed]
  93. Wang, J.; Wang, C.; Zhu, M.; Yu, Y.; Zhang, Y.; Wei, Z. Generation and characterization of transgenic poplar plants overexpressing a cotton laccase gene. Plant Cell Tissue Organ Cult. 2008, 93, 303–310. [Google Scholar] [CrossRef]
  94. Shigeto, J.; Tsutsumi, Y. Diverse functions and reactions of class III peroxidases. New Phytol. 2016, 209, 1395–1402. [Google Scholar] [CrossRef]
  95. Zipor, G.; Duarte, P.; Carqueijeiro, I.; Shahar, L.; Ovadia, R.; Teper-Bamnolker, P.; Eshel, D.; Levin, Y.; Doron-Faigenboim, A.; Sottomayor, M.; et al. In planta anthocyanin degradation by a vacuolar class III peroxidase in Brunfelsia calycina flowers. New Phytol. 2015, 205, 653–665. [Google Scholar] [CrossRef]
  96. Martinez, M.; Gómez-Cabellos, S.; Giménez, M.J.; Barro, F.; Diaz, I.; Diaz-Mendoza, M. Plant Proteases: From key enzymes in germination to allies for fighting human gluten-related disorders. Front. Plant Sci. 2019, 10, 721. [Google Scholar] [CrossRef]
  97. Berthet, S.; Thevin, J.; Baratiny, D.; Demont-Caulet, N.; Debeaujon, I.; Bidzinski, P.; Leple, J.; Huis, R.; Hawkins, S.; Gomez, L.D.; et al. Chapter 5—Role of plant laccases in lignin polymerization. In Advances in Botanical Research; Jouanin, J., Lapierre, C., Eds.; Academic Press: London, UK, 2012; Volume 61, pp. 145–172. [Google Scholar] [CrossRef]
  98. Pant, S.; Huang, Y. Genome-wide studies of PAL genes in sorghum and their responses to aphid infestation. Sci. Rep. 2022, 12, 22537. [Google Scholar] [CrossRef]
  99. Riaz, M.W.; Yousaf, M.I.; Hussain, Q.; Yasir, M.; Sajjad, M.; Shah, L. Role of lignin in wheat plant for the enhancement of resistance against lodging and biotic and abiotic stresses. Stresses 2023, 3, 434–453. [Google Scholar] [CrossRef]
  100. Pellny, T.K.; Lovegrove, A.; Freeman, J.; Tosi, P.; Love, C.G.; Knox, J.P.; Shewry, P.R.; Mitchell, R.A.C. Cell walls of developing wheat starchy endosperm: Comparison of composition and RNA-seq transcriptome. Plant Phyisol. 2012, 158, 612–627. [Google Scholar] [CrossRef]
Figure 1. The CAD gene family. Wheat CAD genes are more numerous than expected. Fifteen wheat genes are closely associated with just two rice and one maize gene (red). Nine wheat genes are associated with just one rice gene (blue). Six wheat genes are associated with one Arabidopsis and two rice genes (green). Arrows indicate twenty-one genes that appear to be tandem duplications along the wheat chromosomes. The scale is the number of substitutions per site. The numbers are branching based on 1000 bootstraps.
Figure 1. The CAD gene family. Wheat CAD genes are more numerous than expected. Fifteen wheat genes are closely associated with just two rice and one maize gene (red). Nine wheat genes are associated with just one rice gene (blue). Six wheat genes are associated with one Arabidopsis and two rice genes (green). Arrows indicate twenty-one genes that appear to be tandem duplications along the wheat chromosomes. The scale is the number of substitutions per site. The numbers are branching based on 1000 bootstraps.
Diversity 15 01135 g001
Figure 2. The CesA gene family. CesAs in wheat are as expected, while maize has a duplication of the entire gene family, twenty rather than ten genes. The scale is the number of substitutions per site. The numbers are branching based on 1000 bootstraps.
Figure 2. The CesA gene family. CesAs in wheat are as expected, while maize has a duplication of the entire gene family, twenty rather than ten genes. The scale is the number of substitutions per site. The numbers are branching based on 1000 bootstraps.
Diversity 15 01135 g002
Figure 3. The CslA gene family. Wheat CslA genes are more numerous than expected. Nine wheat genes are associated with one maize and rice gene (red). Six wheat genes from multiple chromosomes are closely related to one maize and rice gene (blue). Arrows indicate six genes that appear to be tandem duplications along the wheat chromosomes. The scale is the number of substitutions per site. The numbers are branching based on 1000 bootstraps.
Figure 3. The CslA gene family. Wheat CslA genes are more numerous than expected. Nine wheat genes are associated with one maize and rice gene (red). Six wheat genes from multiple chromosomes are closely related to one maize and rice gene (blue). Arrows indicate six genes that appear to be tandem duplications along the wheat chromosomes. The scale is the number of substitutions per site. The numbers are branching based on 1000 bootstraps.
Diversity 15 01135 g003
Figure 4. Wheat Csl E, G, and H genes are more numerous than expected. (A) CslE genes. Eight wheat genes on chromosomes 5A, 5B, and 5D are associated with two rice and one maize gene (red). Arrows indicate six genes that appear to be tandem duplications along the wheat chromosomes. (B) CslG and H genes. Wheat genes are more numerous than expected. Five wheat genes on chromosomes 3A, 3B, and 3D are shown in red. The scale is the number of substitutions per site. The numbers are branching based on 1000 bootstraps.
Figure 4. Wheat Csl E, G, and H genes are more numerous than expected. (A) CslE genes. Eight wheat genes on chromosomes 5A, 5B, and 5D are associated with two rice and one maize gene (red). Arrows indicate six genes that appear to be tandem duplications along the wheat chromosomes. (B) CslG and H genes. Wheat genes are more numerous than expected. Five wheat genes on chromosomes 3A, 3B, and 3D are shown in red. The scale is the number of substitutions per site. The numbers are branching based on 1000 bootstraps.
Diversity 15 01135 g004
Figure 5. Family GT48. Wheat genes are more numerous than expected for the callose synthase gene family. Seven wheat genes are associated with two Arabidopsis, one rice, and one maize gene (red). Seven wheat genes are associated with two Arabidopsis and one maize and rice gene (blue). The scale is the number of substitutions per site. The numbers are branching based on 1000 bootstraps.
Figure 5. Family GT48. Wheat genes are more numerous than expected for the callose synthase gene family. Seven wheat genes are associated with two Arabidopsis, one rice, and one maize gene (red). Seven wheat genes are associated with two Arabidopsis and one maize and rice gene (blue). The scale is the number of substitutions per site. The numbers are branching based on 1000 bootstraps.
Diversity 15 01135 g005
Figure 6. The GH18 gene family. Wheat yieldins are more numerous than expected. Twenty-one wheat genes on chromosomes 3A, 3B, and 3D are associated with two maize and one rice gene (red). Fifteen wheat genes on multiple chromosomes are associated with three maize and one rice gene (blue). Arrows indicate twenty genes that appear to be tandem duplications along the wheat chromosomes. The scale is the number of substitutions per site. The numbers are branching based on 1000 bootstraps.
Figure 6. The GH18 gene family. Wheat yieldins are more numerous than expected. Twenty-one wheat genes on chromosomes 3A, 3B, and 3D are associated with two maize and one rice gene (red). Fifteen wheat genes on multiple chromosomes are associated with three maize and one rice gene (blue). Arrows indicate twenty genes that appear to be tandem duplications along the wheat chromosomes. The scale is the number of substitutions per site. The numbers are branching based on 1000 bootstraps.
Diversity 15 01135 g006
Table 1. Total numbers of wheat, maize, rice, and Arabidopsis genes for each cell wall family.
Table 1. Total numbers of wheat, maize, rice, and Arabidopsis genes for each cell wall family.
Number of Genes per Family
Cell Wall FamilyWheatMaizeRiceArabidopsis
Nucleotide Sugar Transferases133635051
PP 4CL45101413
PP C3H_C4H_F5H508106
PP CAD484129
PP CCoAOMT21667
PP COMT333716
PP CCR12118247
PP HCT1493851
PP PAL471094
GT1–NSI–AUD_SUD15966
GT1–NSI–AXS3112
GT1–NSI–GAE15946
GT1–NSI–GER3112
GT1–NSI–GMD6112
GT1–NSI–GME6221
GT1–NSI–RHM_UER8434
GT1–NSI–UGD12354
GT1–NSI–UGE9345
GT1–NSI–UXE9434
GT2–CesA28201010
GT2–CSL A3810119
GT2–CSL C15865
GT2–CSL D15555
GT2–CSL E12231
GT2–CSL F18780
GT2–CSL G & H11133
GT4–Sucrose Synthases21576
GT8–A15745
GT8–B13228
GT8–C2110810
GT8–D52232215
GT8–E12553
GT106333
GT136111
GT144410119
GT163111
GT1711446
GT213111
GT229643
GT243211
GT2912853
GT303111
GT31107403933
GT326226
GT336221
GT34241776
GT3753171810
GT416232
GT432716104
GT47126503839
GT48–Callose Synthases357712
GT502111
GT573111
GT583111
GT593111
GT6111733248
GT646333
GT669422
GT683113
GT75–RGP12935
GT763211
GT7774231618
GT929333
GT963211
GH982222525
GH16–XTHs142312833
GH17209516047
GH18–Yieldins6010910
GH28–Pgases–A29121113
GH28–Pgases–B180110
GH28–Pgases–C11889
GH28–Pgases–D34796
GH28–Pgases–E15338
GH28–Pgases–F1416610
GH28–Pgases–G13148
GH35–Beta Galactosidases48161017
Expansins248556134
CE8–PMEs120344166
CE13–PAEs53111011
PL1–Pectin Lyases31121226
PL4–RG Lyases15437
P4H28101211
Laccases113242717
Peroxidases64312413573
SKU341243
COBRA-like3891111
GDPlike15767
Proteases296495018
HIP-like11333
AGPs229839
HRGPs32213
Total408611181036955
Table 2. Location of select tandem duplicated wheat genes.
Table 2. Location of select tandem duplicated wheat genes.
Gene Family Location (bp)Distance Apart (kb)
Wheat GeneStartStop
CADTraesCS2A03G013120034,450,30734,451,76019.9
TraesCS2A03G013130034,471,63734,473,153
CADTraesCS2B03G017820050,262,74950,264,07678.9
TraesCS2B03G017830050,342,94750,344,421
CADTraesCS2B03G019110054,229,49054,230,57833.8
TraesCS2B03G019130054,264,33254,266,016
CADTraesCS3A03G1041000688,689,431688,692,02834.1
TraesCS3A03G1041600688,726,119688,729,557
CADTraesCS3B03G1199000745,631,611745,634,419321.7
TraesCS3B03G1200700745,956,098745,958,925
CADTraesCS3D03G0968100552,033,322552,035,463116.5
TraesCS3D03G0968500552,151,988552,155,191
CADTraesCS6A03G0939400597,869,237597,875,25117.6
TraesCS6A03G0939600LC597,892,856597,894,20121.3
TraesCS6A03G0939900597,915,465597,921,915
CADTraesCS6B03G1146700690,063,231690,067,77026.8
TraesCS6B03G1147100690,094,598690,099,65443.3
TraesCS6B03G1147200690,142,991690,147,748
CADTraesCS6D03G0816500470,593,540470,597,69126.6
TraesCS6D03G0816600470,624,314470,629,32126.0
TraesCS6D03G0817100470,655,366470,661,613
PALTraesCS1A03G008950022,809,32822,817,0552.8
TraesCS1A03G008970022,819,86422,822,522
PALTraesCS1B03G010810030,156,99030,159,84013.2
TraesCS1B03G010830030,173,03130,175,84248.2
TraesCS1B03G010840030,224,08030,226,93767.0
TraesCS1B03G010860030,293,90730,296,75137.8
TraesCS1B03G010870030,334,55930,337,342
PALTraesCS1D03G007890020,552,68920,561,5371.4
TraesCS1D03G007920020,562,95920,565,612
PALTraesCS2A03G0922600628,113,634628,116,62032.0
TraesCS2A03G0922700628,148,609628,151,52549.4
TraesCS2A03G0922800628,200,915628,203,6598.4
TraesCS2A03G0922900628,212,066628,214,916
PALTraesCS2B03G1014000572,920,722572,923,789145.0
TraesCS2B03G1015000573,068,831573,071,748140.2
TraesCS2B03G1015300573,211,969573,214,748
PALTraesCS2D03G0862600483,823,970483,826,68784.6
TraesCS2D03G0863100483,911,277483,914,370268.0
TraesCS2D03G0863200484,182,377484,185,48823.8
TraesCS2D03G0863300484,209,252484,211,934
CslATraesCS7A03G0953700576,059,990576,064,12949.7
TraesCS7A03G0953800576,113,787576,117,206
CslATraesCS7B03G0796700537,109,006537,112,6923.1
TraesCS7B03G0796800537,115,750537,132,129
CslATraesCS7D03G0919100506,334,490506,337,8689.5
TraesCS7D03G0919200506,347,416506,352,132
CslETraesCS5A03G0631600469,521,391469,527,2742.4
TraesCS5A03G0631700469,529,676469,535,899
CslETraesCS5B03G0653700437,270,468437,275,6862.7
TraesCS5B03G0653800437,278,393437,284,101
CslETraesCS5D03G0599500370,151,008370,156,4454.7
TraesCS5D03G0599600370,161,140370,165,637
GH18TraesCS3A03G0882000623,487,813623,487,8136.4
TraesCS3A03G0882100623,494,204623,495,0972.5
TraesCS3A03G0882200623,497,609623,498,502137.5
TraesCS3A03G0882500623,636,002623,636,89513.2
TraesCS3A03G0882600623,650,108623,651,0016.9
TraesCS3A03G0882700623,657,857623,658,75026.5
TraesCS3A03G0882800623,685,242623,686,135214.1
TraesCS3A03G0883100623,900,253623,901,602
GH18TraesCS3B03G1008300655,681,956655,682,9874.1
TraesCS3B03G1008400655,687,134655,688,02727.7
TraesCS3B03G1008600655,715,751655,716,6446.3
TraesCS3B03G1008700655,722,943655,723,8366.7
TraesCS3B03G1009100655,730,571655,731,464122.3
TraesCS3B03G1009200655,853,730655,854,731
GH18TraesCS3D03G0810000480,870,250480,871,1313.9
TraesCS3D03G0810100480,875,009480,875,90231.7
TraesCS3D03G0810300LC480,907,624480,908,268199.2
TraesCS3D03G0810400481,107,472481,108,36516.6
TraesCS3D03G0810500481,124,934481,125,827457.3
TraesCS3D03G0810900481,583,081481,584,377
PAETraesCS3A03G1263600748,276,487748,280,35753.1
TraesCS3A03G1264000748,333,460748,335,35720.7
TraesCS3A03G1264100748,356,041748,358,8353.3
TraesCS3A03G1264200748,362,146748,365,2828.5
TraesCS3A03G1264400748,373,743748,377,054
PAETraesCS3B03G1508100844,439,077844,442,85627.3
TraesCS3B03G1508200844,470,116844,472,97828.1
TraesCS3B03G1508300844,501,107844,509,11213.6
TraesCS3B03G1508500844,522,685844,524,01722.3
TraesCS3B03G1508700844,546,309844,549,502
PAETraesCS3D03G1188000615,245,943615,249,49329.9
TraesCS3D03G1188400615,279,382615,282,1669.9
TraesCS3D03G1188500615,292,106615,295,8862.7
TraesCS3D03G1188600615,298,614615,301,61531.6
TraesCS3D03G1188700615,333,190615,336,31040.9
TraesCS3D03G1188900615,377,258615,380,183
PAETraesCS5A03G1140500658,736,919658,740,22772.9
TraesCS5A03G1140600658,813,140658,817,069
PAETraesCS5B03G1211700666,788,941666,792,805127.0
TraesCS5B03G1212500666,919,777666,924,347
PAETraesCS5D03G1094700532,276,790532,280,79629.9
TraesCS5D03G1094800532,310,668532,314,929
Table 3. Cell wall gene names improved from conventional bioinformatics searches using Supplemental File S2.
Table 3. Cell wall gene names improved from conventional bioinformatics searches using Supplemental File S2.
ProteinV1Description from Penning 2023ProteinV2_1Gene Family
TraesCS5A01G405700.1Cellulose synthase-like proteinTraesCS5A03G0962500.1GT2–CSL C
TraesCS5B01G410400.1Cellulose synthase-like proteinTraesCS5B03G1012800.1GT2–CSL C
TraesCS5D01G415700.1Cellulose synthase-like proteinTraesCS5D03G0916900.2GT2–CSL C
TraesCS7A01G298600.1Cellulose synthase-like proteinTraesCS7A03G0733400.1GT2–CSL F
TraesCS7B01G188400.2Cellulose synthase-like proteinTraesCS7B03G0538200.1GT2–CSL F
TraesCS1A01G152500.1UDP-glucose:glycoprotein glucosyltransferaseTraesCS1A03G0409400.1GT24
TraesCS1D01G149400.1UDP-glucose:glycoprotein glucosyltransferaseTraesCS1D03G0377700.1GT24
TraesCS6A01G102100.1GlycosyltransferaseTraesCS6A03G0237900.1GT61
TraesCS3A01G440800.1glycosyltransferase family exostosin proteinTraesCS3A03G1025100.1GT47 E
TraesCS3D01G433400.1glycosyltransferase family exostosin proteinTraesCS3D03G0953500.2GT47 E
TraesCS2A01G051600.2O-methyltransferaseTraesCS2A03G0099800.21.3 COMT
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Penning, B.W. The Cell Wall-Related Gene Families of Wheat (Triticum aestivum). Diversity 2023, 15, 1135. https://doi.org/10.3390/d15111135

AMA Style

Penning BW. The Cell Wall-Related Gene Families of Wheat (Triticum aestivum). Diversity. 2023; 15(11):1135. https://doi.org/10.3390/d15111135

Chicago/Turabian Style

Penning, Bryan W. 2023. "The Cell Wall-Related Gene Families of Wheat (Triticum aestivum)" Diversity 15, no. 11: 1135. https://doi.org/10.3390/d15111135

APA Style

Penning, B. W. (2023). The Cell Wall-Related Gene Families of Wheat (Triticum aestivum). Diversity, 15(11), 1135. https://doi.org/10.3390/d15111135

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop