1. Introduction
Cotton (
Gossypium hirsutum L.), originating from South America, belongs to the genus Gossypium in the Malvaceae family. Archaeological findings suggest its cultivation in parts of northern Chile between 3500 and 2500 BC. The discovery of cotton by ancient cultures led to its introduction to Asia and Africa by Europeans in the 15th century. Today, cotton stands as one of the most pivotal global crops with its seeds serving as a significant source of industrial oil and its fibers playing an essential role in textile industries. However, the intensifying global climate crisis has resulted in a rise in the frequency of extreme weather events, and an increase in the prevalence of pests and diseases has also had a significant impact on cotton production. The occurrence of extreme weather events, including floods, droughts, and heat waves, has the potential to result in a decline in cotton production. Flooding will saturate the soil, directly affecting the growth and physiology of the cotton plant. A reduction in yield can be attributed to the combined effects of increased evaporation and transpiration as well as a reduction in soil water availability, which are both consequences of drought [
1]. For example, the economic cost of crop yield loss caused by extreme weather events in developing countries was approximately
$80 billion from 2003 to 2014. In the United States, the number of events with losses exceeding
$1 billion caused by extreme weather events is also increasing at a steady rate [
2]. As evidenced by the data from 2022, the disease resulted in a 8.7% reduction in cotton production in the United States, amounting to a loss of 1.4 million bales. This decline in output is comparable to that observed in 2018–2021 but significantly lower than the average annual output decrease of 12.2% observed over the 22-year period from 2000 to 2021 [
3].
Actin Depolymerizing Factors (ADFs) are small, conserved microfilament-binding proteins with molecular weights typically ranging from 15 to 22 kDa. They play a crucial role in regulating the cytoskeleton [
4]. Upon binding with both filamentous actin (F-actin) and monomeric actin (G-actin), ADF proteins enhance depolymerization activity [
5]. This mechanism allows ADF proteins to modulate actin polymerization within cells, influencing essential cellular processes such as cell division, morphological changes, growth, and motility.
The origin of the ADF gene family dates back to the 1980s when the initial ADF protein was isolated from chicken brain tissue, which was initially termed cofilamentous protein (cofilin) due to its filamentous structure. Subsequent investigations unveiled proteins with sequences akin to cofilin, which were capable of depolymerizing F-actin activity across various tissues. These microfilament-depolymerizing proteins found in mammals and eukaryotic cells were later collectively termed ADF/cofilin [
6]. ADF proteins likely played a pivotal role in the evolutionary adaptation of the eukaryotic cytoskeleton. The expansion of the ADF gene family may be attributed to gene duplication events, as suggested by relevant research. These events lead to structural and functional diversifications among different ADF gene family members, enabling them to assume distinct roles in cellular processes. Widely distributed across animal and plant species, the ADF gene family even extends to fungi and other organisms. The plant ADF gene family encompasses several relatively large subfamilies like ADF, cofilin, and TWD. Despite structural variations, these family members share the ability to promote actin depolymerization and participate in biological processes such as plant cell development, growth, intercellular signaling, and stress response.
In recent years, research on the ADF gene family has predominantly focused on three key areas. Firstly, an analysis of ADF protein structure and function has unveiled the molecular mechanisms underlying ADF protein involvement in cellular processes. For instance, ADF proteins regulate actin depolymerization and polymerization states, thus influencing cell motility and morphogenesis [
7]. Secondly, investigations into the role of the ADF gene family in organismal growth and development have revealed its involvement across various kingdoms. In plants, ADF proteins foster cell activity, with specific members like AtADF4 and AtADF9 promoting Arabidopsis growth [
8]. Similarly,
ZmADF3 facilitates maize root hair elongation, while
ZmADF1,
OsADF, and
AtADF7 contribute to pollen formation and tube elongation in maize, rice, and Arabidopsis [
9,
10,
11], each exerting distinct functions. In animals, ADF proteins are implicated in embryonic and neuronal development [
12]. Finally, studies on the ADF gene family’s role in stress response highlight its significance in plant stress mitigation. Notably, proteins like
OsADF3,
AtADF2,
At-ADF4,
AtADF5, and
TaADF7 exhibit substantial effects on abiotic stress response and disease resistance enhancement [
13,
14,
15,
16]. For instance,
TaADF7 alters actin dynamics to regulate SOD accumulation in wheat, fortifying wheat resistance against stripe rust.
Recent studies on cotton ADF have revealed that the down-regulation of
GhADF6 expression significantly enhances cotton’s resistance to Verticillium wilt, suggesting a crucial role for increased filamentous actin (F-actin) abundance and dynamic protein remodeling in plant defense against pathogen invasion [
17]. This likely involves a coordinated expression regulation of actin-binding proteins, including ADF. Furthermore, transgenic cotton seedlings and mature plants expressing
GhADF1 exhibit robust tolerance to drought stress, which is accompanied by notable improvements in physiological and biochemical indicators such as photosynthetic rate, MDA content, POD activity, and CAT activity in transgenic cotton leaves. Upon the down-regulation of
GhADF1 expression, an analysis of cotton fiber properties reveals a potential role for
GhADF1 in controlling fiber elongation as well as the synthesis and deposition of secondary cell wall cellulose in cotton fibers [
18].
These studies have validated the function of corresponding genes in cotton disease resistance, cotton response to drought stress, and regulation of cotton fiber elongation. However, the gene family of ADF in cotton has yet to be fully described, and the response mechanism to other abiotic stresses remains to be analyzed. However, filamentous actin in actin has been demonstrated to possess a number of functions, including the regulation of plant growth and the response to abiotic stresses. Accordingly, the present study commenced with the initial screening of all ADF genes in cotton through the analysis of genomic data and transcriptome sequencing results. Subsequently, genes that did not respond to abiotic stress were eliminated through the use of quantitative reverse transcription polymerase chain reaction (qRT-PCR). The remaining genes were then subjected to analysis of their response to abiotic stress through the utilization of virus-induced gene silencing (VIGS) technology. The findings of this study are anticipated to serve as a genetic reference for the breeding of cotton with enhanced resistance.
3. Results
3.1. Dentification of ADF Gene Family Members of Four Cotton Species and Analysis of Their Physicochemical Properties
To identify ADF family genes in four cotton species, we employed the known protein sequences of Arabidopsis thaliana AtADFs and rice OsADFs as references. Using the local BLASTP method, we searched the genome protein databases of these cotton species. Combining the results with those from HMM searches, we submitted the resulting candidate protein sequences to Pfam, SMART, and NCBI CDD databases for a further confirmation of domain existence, manually removing redundant sequences. Subsequently, we identified 40, 39, 20, and 19 non-redundant ADF gene sequences in Gossypium hirsutum L., Gossypium barbadense Linn, Gossypium raimondii, and Asiatic cotton, respectively. These genes were sequentially named based on their chromosomal coordinates.
Supplementary Table S1 presents protein length encoded by cotton ADF genes, ranging from 53 to 395 amino acids (aa), with an average length of 151 aa. Protein molecular weights spanned from 6.10 to 44.41 kDa with predicted isoelectric points ranging from 4.46 to 9.4. Additionally, DNA sequences encoded by ADF members varied in length from 939 to 2076 base pairs (bp), averaging 1416 bp. Notably, the physicochemical properties of the 46
GhADFs exhibited differences, which were likely attributable to unconserved amino acid regions.
3.2. Phylogenetic Analysis of ADF Protein in Cotton
To delve deeper into the evolutionary relationships among cotton ADF members, we conducted multiple sequence alignments using MAFFT software (version 7). The analysis encompassed downloaded
AtADFs, rice ADFs, tomato ADFs, wheat ADFs, and full-length sequences of ADFs from the three identified cotton species. Subsequently, we employed the maximum likelihood method to explore the phylogenetic evolution of cotton ADF members. As depicted in
Figure 1, the resulting phylogenetic tree delineated cotton ADF family members into four distinct subfamilies (I, II, III, IV), aligning with previous studies. Notably, in comparison with ADF gene families from other plants, cotton ADF family members within groups III and IV appeared relatively abundant. We hypothesized that tandem duplication events or chromosome segment duplication events might underlie the expansion of these two groups. We constructed a phylogenetic tree specifically for the 40 cotton ADF members (
Figure S1), which corroborated our earlier findings, affirming the reliability of our results for further research endeavors.
3.3. Chromosomal Mapping of ADF Family Members in Cotton
Tandem duplication and chromosome segment duplication represent the primary mechanisms of plant genome amplification, increasing genome diversity. In order to gain insight into the effects of gene differentiation and duplication on the formation of cotton ADF gene families, we initiated a chromosome mapping exercise for all identified candidate genes. As illustrated in
Figure 2, 33 cotton ADFs were distributed across 26 chromosomes, with seven located on scaffolds, exhibiting a heterogeneous distribution pattern. The greatest number of family members was identified on chromosome D09, with four ADFs, which was followed by three family members each on chromosomes A01, A03, A09, and D01. Furthermore, two additional GhADFs were identified on chromosomes A12, A13, D03, and D12 with individual members present on chromosomes A05, A07, A11, D02, D07, D08, and D11. No ADF family members were identified on the remaining chromosomes. It is noteworthy that the majority of ADF genes are situated in the telomeric region of the chromosomes. The telomeric region is a region of relatively low gene density on the chromosome, which may render these genes more susceptible to replication events. This provides further evidence that the location of genes on chromosomes may influence the frequency and outcome of their replication. In particular, genes situated in the telomeric regions of chromosomes may be more prone to replication errors or events due to the intrinsic replication properties of the region. This may result in the expansion of gene families and the emergence of novel functions.
3.4. Collinearity Analysis and KA, KS Analysis of ADF Family Members of Cotton
In plants, gene family expansion is instrumental in facilitating adaptation to diverse environmental conditions. To elucidate the expansion mechanism of the ADF gene family in cotton, we conducted an analysis of gene duplication events within the cotton genome.
As detailed in
Supplementary Table S2, we identified two tandem duplication events within the cotton ADF family, involving
GhADF2 and
GhADF3 as well as
GhADF18 and
GhADF19, which are both located on chromosomes A01 and D01. Notably, these tandemly duplicated gene pairs are situated within subgroup III. It is noteworthy that the sequence similarity of these tandemly repeated gene pairs exceeded 90%, indicating their likely origination from the same ancestral gene during evolution, retaining similar biological functions.
Additionally, as depicted in
Figure 3a and
Supplementary Table S2, we observed a total of 32
GhADFs located within fragment replacement regions, yielding a total of 83 pairs of fragment replacement events.
Furthermore, we investigated the syntenic relationships of the ADF family across various cotton species.
Figure 3b illustrates 30 collinear gene pairs between the ADF family in cotton and Asian cotton, 69 syntenic gene pairs with Sea Island cotton, and 30 syntenic gene pairs with Raymond cotton. To assess whether the selection experienced by duplicated ADF family genes is advantageous, we calculated the non-synonymous differentiation level (Ka), synonymous difference level (Ks), and Ka/Ks ratios. Analysis revealed abnormal Ka/Ks values for 15 pairs of duplicated genes, which was possibly due to genome assembly issues. Additionally, a distribution of Ka/Ks values was observed, with most values less than 1, suggesting that strong purifying selection is acting upon these genes post-duplication. Notably, Ka/Ks values were equal to 0 for 6 duplicated gene pairs, which is consistent with the general observation that Ka/Ks values are typically less than 1, indicating a lack of positive selection pressure.
3.5. Gene Structure and Conserved Motif Analysis of ADF Gene in Cotton
The diversity in gene structure offers valuable insights into gene function and evolutionary relationships. Therefore, we conducted an analysis of the gene structure of ADF family genes in cotton.
According to the evolutionary tree, the gene structure of the 40 ADF members can be categorized into four groups. Group I comprises five members with three exons, while
GhADF30 stands out with five exons. Group II includes five members with three exons, one with two exons, and one with a single exon. Group III encompasses nine members with three exons and three members with two exons. In group IV, nine members feature three exons, two have seven exons, three possess two exons, and one consists of a single exon. Notably, most members exhibit three exons, aligning with typical ADF family characteristics. The gene structure remains highly conserved within members of the same evolutionary branch or subgroup, encompassing exon and intron numbers and lengths. Notably, while exon lengths within the same subgroup are similar, intron lengths display considerable variation. We speculate that changes in exon numbers, as well as the length and distribution of introns, may have contributed to functional diversity among these genes during evolution. To delve deeper into the structural and functional diversity of cotton ADF family members, we subjected the full-length sequences of 40 GhADF proteins to the online program MEME. The analysis predicted 10 conserved motifs, as depicted in
Figure 4. Notably, the number, type, and sequence of motifs among ADF members in different subgroups exhibit a degree of conservatism, validating the reliability of our constructed evolutionary tree, which is consistent with earlier findings. Group I features a distinctive motif 8, while GhADF14 and GhADF31 in group IV possess specific motifs 5, 6, 7, and 9. Motif 9 exclusively appears in groups IV and III. Most members harbor motifs 1, 2, 4, and 10, which constitute part of the conserved ADF protein domain.
3.6. Analysis of Cis-Acting Elements of ADF Gene in Cotton
During the growth and development of plants, they encounter a myriad of biotic and abiotic stresses from the external environment with cis-acting elements playing a pivotal role in regulating gene transcription initiation and interactions with regulatory factors. To delve deeper into the potential functions and transcriptional regulation of GhADFs genes, we utilized the online prediction tool PlantCARE to analyze the promoter region (2000 bp upstream of ATG) of each cotton ADF family member.
As illustrated in
Figure 5 and
Supplementary Table S3, we categorized the predicted cis-acting elements into four types, each serving distinct functions: light response, hormone response, growth and development regulation, and stress response. For this study, we focused on cis-acting elements with specific functions, excluding ubiquitous elements such as CAAT-box, TATA-box, and TATC-box.
Our analysis revealed a widespread occurrence of light-responsive cis-elements in each ADF family member of cotton, including L-box and TCT-motif. Similarly, most members harbored elements associated with hormone responses, such as ABRE, ERE, P-box, TCA-element, TGA-element, and TGACG-motif, indicating their potential involvement in regulating hormone responses. Additionally, we identified growth-related elements in the cotton ADF family, such as TGA-element, CAT-box, RY-element, CCGTCC-box, and dOCT.
Regarding abiotic stress response, the promoter region of 35 ADF family members in cotton contained elements associated with environmental stress response. For instance, seven members contained DRE elements implicated in responses to low temperature, salt, and drought stress. Moreover, 11 members featured MBS elements linked to drought stress response, while 8 members contained TC-rich repeats, and all 16 members harbored W-box elements, which are both widely involved in regulating stress response in plants.
In conclusion, our research underscores the significant role of GhADFs in hormone regulation networks and stress responses in cotton. Additionally, these findings provide valuable insights for the further exploration of adversity stress responses in cotton.
3.7. Analysis of Expression Patterns of ADF Family Members in Cotton under Different Stresses
In order to explore whether the ADF family of cotton plays an important role in the regulation of stress responses, we used transcriptome data to explore the expression levels of members under different stresses. As shown in
Figure 6, most
GhADFs responded to different degrees under the treatment of abiotic stress. After treatment at 4 °C and 37 °C for 1 h, the expression levels of
GhADF31,
GhADF5 and other genes were down-regulated, and the expression levels continued to decrease after 1 h, indicating that these genes may play a negative regulatory role in low temperature and 37 °C stress. On the contrary, the expression levels of
GhADF28,
GhADF7 and other genes increased rapidly after 1 h of low-temperature treatment, and
GhADF6,
GhADF16, and
GhADF30 reached the peak value after 1 h of low-temperature treatment, indicating that these genes can respond rapidly after low-temperature treatment of 37 °C. Under the treatment of 37 °C, the expression levels of most ADF members changed significantly at 1 h and 24 h, while under the treatment of NaCl and PEG, the expression levels of most members changed significantly at 6 h, 12 h and 24 h. At the same time, we also found that there are genes that respond to all stresses, such as
GhADF27,
GhADF28,
GhADF23, etc.; there are also some members whose expression patterns are quite different under different stresses, such as
GhADF9 at 4 °C and PEG treatment. The expression level increased significantly and reached the peak at 12 h, but there was almost no change in the expression level under the treatment of NaCl and 37 °C.
GhADF34 had no change under the treatment of 4 °C, and the expression level reached the peak at 24 h under the stress of 37 °C. Two genes,
GhADF17 and
GhADF22, did not change significantly under the four stress treatments, which may be related to other biological regulation processes.
3.8. qRT-PCR Verification of ADF Family Members of Cotton
The analysis of transcriptome data revealed that most
GhADFs members exhibited different expression patterns under various stress treatments. To validate the credibility of the aforementioned outcomes, we randomly selected some candidate genes whose expression levels changed and conducted qRT-PCR experiments. According to quantitative results, most ADF members responded to NaCl, low temperature, and PEG stress to varying degrees. As depicted in
Figure 7, under NaCl stress,
GhADF32 was up-regulated to more than three times that of the control group at 24 h,
GhADF19 was up-regulated to nearly four times that of the control group at 3 h,
GhADF13 and
GhADF20 showed the same expression pattern, and their expression levels decreased after treatment and returned to the control state at 24 h. Under low-temperature treatment, the expression level of
GhADF12 decreased after 3 h of treatment, then increased linearly, and peaked at 24 h. The expression levels of
GhADF13 and
GhADF20 continued to decrease after treatment, indicating that they may play a negative regulatory role in low-temperature response. The expressions of
GhADF5 and
GhADF35 were rapidly up-regulated after low-temperature treatment for 3 h, which may be related to the signal reception in the early stage of low-temperature regulation. Under PEG treatment, the expression levels of
GhADF32 and
GhADF20 continued to decrease after PEG treatment, and they returned to the control level at 24 h; the expression levels of
GhADF8 and
GhADF13 also continued to decrease after PEG treatment, and they returned to half the level of the control group at 24 h. The expression of
GhADF24 continued to decrease after PEG treatment, indicating that it may play a negative regulatory role, while the expressions of
GhADF5,
GhADF19, and
GhADF35 were significantly up-regulated 3 h after PEG treatment, which may be related to early signal reception. It is worth mentioning that
GhADF35 was strongly expressed in all three stresses, which may play a crucial role in early signal reception. Under PEG treatment, the expression level was nearly 20 times that of the control group; conversely,
GhADF13 showed a down-regulation trend in expression level after early stress, indicating its possible involvement in negative regulation. In summary, by conducting qRT-PCR experiments and comparing them with RNA-seq data, we observed that the RNA-seq data used were more accurate and the conclusions were relatively reliable. Therefore, this can lay a solid foundation for our subsequent selection of candidate genes.
3.9. Functional Analysis of GhADF19
According to the results of fluorescence quantification, four candidate genes
GhADF2,
GhADF5,
GhADF18, and
GhADF19 were screened for VIGS gene silencing. When the leaves of cotton seedlings injected with PDS were albino, it indicated that other genes were also silenced successfully. As shown in
Figure 8a, it was found that the growth of the leaves and plants of the control group
TRV:00 (hereinafter referred to as the control group) was significantly better than that of the silenced plants under drought stress, cold stress, and salt stress. The material measurement results are shown in
Figure 8b. The dry matter synthesis of plants with the
GhADF19 gene silenced under drought stress and cold stress decreased most significantly compared with the other three genes. The results of qRT-PCR silencing efficiency detection are shown in
Figure 8c. The gene expression levels in the young Agrobacterium cotton inoculated with
TRV:GhADF19 were significantly reduced under different stress treatments, by 68.24%, 81.19%, and 66.82%, respectively. Among them, the plants with the
GhADF19 gene silenced had the worst growth condition, with severe leaf wilting, and some plants were close to death. These results indicated that the
GhADF19 gene may be involved in the stress response of upland cotton. Phenotypic images of untreated cotton seedlings inoculated with
TRV:GhADF19 have been uploaded to the
Supplementary File (12d-COLD.HEIC, 12d-NACL.HEIC, 12d-PEG.HEIC).
5. Conclusions
The identification and analysis of the ADF gene family in four cotton species, namely Gossypium hirsutum, Gossypium barbadense Linn, Gossypium raimondii, and Asiatic cotton, revealed a total of 40, 39, 20, and 19 non-redundant ADF gene sequences, respectively. These sequences were further characterized based on their physical and chemical properties, including protein length, molecular weight, and isoelectric point. The analysis showed variations in these properties among the ADF gene family members, which was potentially attributed to unconserved amino acid regions.
Furthermore, phylogenetic analysis divided the ADF family members into four subfamilies, with groups III and IV being relatively larger compared to other plants, which was possibly due to tandem duplication events or chromosomal segment duplication events. Chromosome mapping revealed an uneven distribution of ADF genes across chromosomes and scaffolds. Additionally, gene duplication analysis and collinearity analysis highlighted the expansion of the ADF gene family in cotton through tandem duplication and chromosome segment duplication events.
Conserved gene structures and motifs were identified among ADF family members, suggesting evolutionary conservation within specific branches or subgroups. Analysis of cis-acting elements in the promoter regions revealed their roles in light response, hormone response, growth and development regulation, and stress response.
Transcriptome analysis under various abiotic stresses showed that most GhADFs responded to different degrees of stress. Further validation through qPCR experiments identified four genes, GhADF2, GhADF5, GhADF18, and GhADF19, which exhibited high responses to abiotic stress. These genes were selected for subsequent VIGS verification, confirming their importance in stress responses as evidenced by the worsened growth conditions under drought, low temperature, and salt stress compared to the control group.