Next Article in Journal
A Review on Microbial Products and Their Perspective Application as Antimicrobial Agents
Previous Article in Journal
The Expression and Activity of Rhodanese, 3-Mercaptopyruvate Sulfurtransferase, Cystathionine γ-Lyase in the Most Frequently Chosen Cellular Research Models
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

AlphaFold-Predicted Structures of KCTD Proteins Unravel Previously Undetected Relationships among the Members of the Family

1
Institute of Biostructures and Bioimaging, CNR, Via Mezzocannone 16, 80134 Naples, Italy
2
IRCCS SDN, Napoli, Via E. Gianturco 113, 80143 Naples, Italy
*
Authors to whom correspondence should be addressed.
Biomolecules 2021, 11(12), 1862; https://doi.org/10.3390/biom11121862
Submission received: 9 November 2021 / Revised: 26 November 2021 / Accepted: 7 December 2021 / Published: 10 December 2021
(This article belongs to the Section Biomacromolecules: Proteins)

Abstract

:
One of the most striking features of KCTD proteins is their involvement in apparently unrelated yet fundamental physio-pathological processes. Unfortunately, comprehensive structure–function relationships for this protein family have been hampered by the scarcity of the structural data available. This scenario is rapidly changing due to the release of the protein three-dimensional models predicted by AlphaFold (AF). Here, we exploited the structural information contained in the AF database to gain insights into the relationships among the members of the KCTD family with the aim of facilitating the definition of the structural and molecular basis of key roles that these proteins play in many biological processes. The most important finding that emerged from this investigation is the discovery that, in addition to the BTB domain, the vast majority of these proteins also share a structurally similar domain in the C-terminal region despite the absence of general sequence similarities detectable in this region. Using this domain as reference, we generated a novel and comprehensive structure-based pseudo-phylogenetic tree that unraveled previously undetected similarities among the protein family. In particular, we generated a new clustering of the KCTD proteins that will represent a solid ground for interpreting their many functions.

1. Introduction

The KCTD family (proteins containing the K-potassium Channel Tetramerization Domain) comprises twenty-five members (KCTD1-21, SHKBP1, TNFBP1, KCNRG, and BTBD10) involved in diversified yet fundamental physio-pathological processes. Their emerging role is clearly evident from the analysis of selected studies published in the last decade [1,2,3,4,5]. One of the most striking and puzzling features of these proteins is their involvement in apparently unrelated processes. KCTDs were initially identified as key factors of neurodevelopmental and neuropsychiatric disorders ([6] and references therein). The role of these proteins has been clearly assessed in neurocognitive disorders (KCTD3), epilepsy (KCTD7), bipolar disorder (KCTD12), autism and schizophrenia (KCTD13), and movement disorders (KCTD17). Moreover, almost all of the members of the family have been associated with the insurgence and the progression of different types of cancers ([7] and references therein), including leukemia, medulloblastoma, hepatocarcinoma, and breast, pancreatic, and colorectal cancer. In addition, KCTD proteins are also implicated into the insurgence of genetically inherited diseases such as the scalp–ear–nipple syndrome [8,9] and obesity/adipogenesis [10,11].
The founding feature of this protein family is the presence in all of the members of a conserved BTB (Broad complex, Tramtrak and Bric-a-brac) domain (also denoted as POZ—poxvirus zinc finger—or T1) [12]. In its canonical function, the BTB domain is involved in both homo- and hetero-oligomerizations [13]. In KCTD proteins, the pentameric association is the prevalent functional oligomerization state [14,15,16,17], although their BTB domains have shown a remarkable versatility in the formation of assemblies formed by a variable number of polypeptide chains [15,18]. These observations have led to the suggestion that the primary role of the BTB domain in KCTDs is in partnership formation rather than in dictating oligomerization. Indeed, key interactions of KCTD proteins that are established with partners such as cullin 3 [17,19,20], the transcription factor AP-2α [21], and the receptor GABAB2 [5,22] are mediated by the BTB domain. The BTB domain is located in the N-terminal region of KCTD protein and, with very few exceptions, it is preceded in the sequence by small, and likely unstructured, protein stretches. In the various members of the family, the BTB domain is followed by regions whose sequences are globally unrelated among KCTDs. Indeed, sequence similarities in the C-terminal regions can be detected in selected groups of KCTD proteins made by two or three members. Based on the alignments of the BTB domains, Skoblov et al. [23] generated an influential divergence tree of KCTD proteins in which proteins presenting sequence similarities in the C-terminal region were grouped in the same clade (Figure S1). Although, over the years, modifications of this tree have been proposed [6,15,24], it is considered a reliable classification of KCTD sequences and, presumably, also representative of structure similarities.
Despite the recognized role of these proteins in key biological processes, a full understanding of their biochemical role is far from being achieved. Although, for some of the members, a clear role as substrate adaptor in cullin-ring ligases in protein ubiquitination/degradation has been established [23], their involvement in unrelated physio-pathological processes remains an intriguing but essentially unsolved issue. It is important to note that it has been pointed out that even individual members of the family can play roles in apparently distant contexts. This observation has suggested that these KCTDs are likely involved in some basic, hitherto undiscovered, biochemical activities [25].
It is commonly believed that the poor and fragmentary structural data available for these proteins has hampered insightful definitions of structure–function relationships in these proteins. Indeed, the three-dimensional structure of the BTB domain has been reported only for a limited, although significant, number of KCTD proteins, whereas the full-length structure has only been reported for two members of the family (Table 1). Moreover, sporadic information is available regarding the folded domains present in the diversified C-terminal regions (Table 1). This scenario is going to rapidly change if the structures predicted by AlphaFold [26,27] and deposited in the AlphaFold Protein Structure Database (AF Database—https://alphafold.ebi.ac.uk/ accessed on 1 November 2021) are considered. Indeed, reliable three-dimensional structures have been reported for all KCTD proteins. Here, we globally analyzed these models by highlighting analogies and differences among the members of the family. We also assessed the accuracy of these models by comparing the experimental versus the predicted structures, for the proteins whose structure had been previously crystallographically determined. These analyses clearly indicated that, despite undetectable sequence analogies, the C-terminal region of the vast majority of KCTD proteins share a common structural domain (KCTD-CTD). Using this domain as a reference, we generated a novel comprehensive structure-based divergence tree that unraveled previously undetected similarities among the protein family. We believe that the present findings will facilitate the definition of the thus far elusive structure–function relationships of these proteins.

2. Materials and Methods

2.1. Three-Dimensional Structures of KCTD Proteins and Their Analysis

The predicted three-dimensional structures of the KCTD proteins characterized here were retrieved from the AlphaFold Protein Structure Database (AF Database) created by DeepMind and EMBL’s European Bioinformatics Institute (EMBL-EBI) (https://alphafold.ebi.ac.uk accessed on 21/08/2021). As in the database for the protein KCTD11, the model of the truncated form of the protein (Uniprot code Q693B1-1), in which the N-terminal region of the BTB domain is missing, was reported, we generated a three-dimensional model for the long variant (Uniprot code Q693B1-2), which is the functional form of the protein [28], using the Colab server (https://colab.research.google.com/github/sokrypton/ColabFold/blob/main/AlphaFold2.ipynb accessed on 15/09/2021). This server predicts protein structures starting from their sequences using a slightly simplified version of AlphaFold v2.0 that does not consider existing structural templates. The Colab server was also used to generate an unbiased template-free model of the KCTD1 protein. The reliability of the AF predictions was assessed by the Local Distance Difference Test (LDDT) score reported for each structure of the AlphaFold Database.
The experimental crystallographic structures of KCTD proteins were retrieved from the Protein Data Bank (https://www.rcsb.org accessed on 15/10/2021) using the keyword KCTD to search the database. These analyses were integrated using the BLAST server (https://blast.ncbi.nlm.nih.gov/Blast.cgi accessed on 21/08/2021) and the sequence of KCTDs as query. Both experimental and predicted structures were inspected by molecular graphics and using available software such as PDBSUM [29] and Dali (http://ekhidna2.biocenter.helsinki.fi accessed on 1 November 2021) [30,31] to identify secondary structure elements and domain boundaries.

2.2. Comparison of the Three-Dimensional Structures

Extensive comparative analysis of the structures of the KCTD proteins was performed using Dali (http://ekhidna2.biocenter.helsinki.fi/dali/ accessed on 1 November 2021). In particular, pairwise comparisons were performed between PDB structures. Moreover, we selected the C-terminal domains of the AF models and structurally aligned every pair of domain structure by using the DALI distance matrix alignment method and scoring function [32], as implemented in the Dali web server. The Dali server was also used to search the PDB for structural analogs of the predicted KCTD structures.

2.3. Generation of Structural Tree

After the generation of the pairwise structural similarity matrix by reporting the Dali Z-score, the structures could be grouped into clusters, and the relationships among various clusters could be showed by a dendrogram (a diagram representing a tree). The Dali server provided a dendrogram of the structures by using hierarchical agglomerative clustering based on the average linkage method applied to the structural similarity matrix. The structural dendrogram in Newick format, returned by Dali all-against-all comparison, was read by the PHY·FI server (https://services.birc.au.dk/phyfi/go.py accessed on 1 November 2021) [33] to obtain the graphical representation.
In addition to providing the pseudo-phylogenetic tree, Dali also performed a multidimensional scaling correspondence analysis of the Z-scores. The results of the projection of the structures on the first two eigenvectors are plotted in the correspondence analysis plot where the most similar structural neighborhoods are placed near each other.

3. Results

3.1. KCTDs in the PDB: The State of the Art

A survey of the PDB (October 2021 release) indicates that twenty-one distinct structures of KCTD proteins have been hitherto reported in the database (Table 1). Among these, only two cover a significant portion of corresponding proteins and may be considered as representative of the structures of the full-length proteins. The structure of KCTD5 (PDB entries 3DRX and 3DRY), the first reported full-length structure for a member of the family, presents a pentameric oligomerization state whose overall interdomain association depends on the salt conditions [16]. The full-length structure of KCTD1 (PDB entry 6S4L—doi 10.2210/pdb6S4L/pdb) is also a pentamer and covers, with the exception of the N- and C-terminal tails, the entire protein. All of the other structures correspond to the individual domains. The structure of the BTB domain, as an isolated entity, has been reported for KCTD1 (PDB entries 5BXD and 5BXB), KCTD5 (PDB entry 3DRZ), KCTD10 (5FTA), KCTD13 (4UIJ), KCTD16 (5A15, 6OCR, 6I0Q, and 6OCT), KCTD17 (5A6R), and SHKBP1 (4CRH). These structures have highlighted the versatility of this domain [15], which was also corroborated by molecular dynamics simulations [18], to oligomerize in different oligomeric states that include monomeric, tetrameric, open/close pentameric, and hexameric assemblies [15,17,34]. This domain has also been characterized in complex with a peptide of the GABAB2 receptor (6OCP and 6M8R) [22,35].
The other PDB structures correspond to the folded domain occurring in the C-terminal regions of the closely related proteins KCTD8 (6G57), KCTD12 (6QZL), and KCTD16 (6QB7). This domain, also denoted in the literature as the H1 domain, has also been reported in complex with the Gβ1γ2 subunits (6M8S) [22].
Table 1. Structures of KCTD proteins in the PDB and their similarity with the corresponding AlphaFold models. If multiple crystal structures of the same protein/domain were present, the one determined at the highest resolution was used for calculation of RMSD values.
Table 1. Structures of KCTD proteins in the PDB and their similarity with the corresponding AlphaFold models. If multiple crystal structures of the same protein/domain were present, the one determined at the highest resolution was used for calculation of RMSD values.
ProteinDomain/ComplexPDB Code(s)/ReferencesResolution(s) (Å)RMSD (Å) (#) 1
KCTD1BTB5BXD/5BXB [17]1.8/2.20.6 (103)/0.4 (103)
Full-length6S4L2.40.8 (205)
KCTD5BTB3DRZ [16]1.90.8 (102)
Full-length3DRX/3DRY [16]3.1/3.32.1 (163)/2.4(133)
KCTD8CTD6G572.81.8 (88)
KCTD9BTB5BXH [17]2.80.7 (97)
KCTD10BTB5FTA [15]2.60.4 (96)
KCTD12CTD6QZL2.00.7 (100)
CTD-Gβ1γ26M8S [22]3.70.8 (103)
KCTD13BTB4UIJ [15]2.70.3 (102)
KCTD16BTB5A15/6OCR/6I0Q/6OCT [15,34,35]2.8/2.3/2.30.5 (91)/0.4 (90)/0.7 (95)/0.5 (91)
CTD6QB72.20.7 (110)
BTB-GABAB2 pept6OCP/6M8R [22,35]2.3/3.20.5 (91)/0.5 (93)
BTB5A6R [15]2.80.7 (101)
BTB4CRH [15]1.71.0 (90)
1 This represents the number of the superimposed residues.

3.2. AlphaFold Versus PDB KCTD Structures

The analysis of three-dimensional structures of the human KCTD proteins predicted by AF indicates that they present significant variations among the members of the family that, however, operate on a common theme. As expected on the basis of their sequences, all AF models of KCTD proteins present a BTB domain which is canonically constituted by a single β-sheet surrounded by five helices (Figure 1). In addition to this common element, all KCTD members present other folded regions that are occasionally accompanied by unstructured fragments characterized by a low confidence in the prediction (Table 2). In line with previous indications, in most of the KCTD proteins, in addition to the BTB, there is a single extra folded domain located at the C-terminus whose size ranges from 57 to 146 residues (Table 2 and Figure 1 and Figure 2). One exception to this general trend is observed for the proteins KCTD3/SHKBP1, which present much larger C-terminal folded domains that assume the structure of an eight-bladed β-propeller (Figure 1). Other examples include KCTD9, which exhibits two extra domains in addition to the BTB that belong to the ubiquitin-like and to the pentapeptide repeat family of β-solenoid classes (Figure 1 and Figure 2), and KCTD19, which presents a collapsed multidomain structure that contains three BTB domains (Figure S2). As anticipated above, in addition to the folded domains, several KCTD proteins also exhibit long unstructured portions that can behave as intrinsically disordered regions (Table 2).
The comparison of the AF-predicted structures with those experimentally determined and reported in the PDB clearly highlights the ability of this approach to correctly predict the structures of individual domains. Indeed, as shown in Table 1, most of the root mean square deviations (RMSD) of the predicted structures of the individual BTB domains versus the experimental structures range in the interval of 0.4–0.8 Å. It is worth noting that these values are close to the errors associated with experimental structures. A slightly higher value is detected for the BTB domain of SHKBP1 (1.0 Å), which is the only monomeric crystalline structure of the family hitherto reported. A similar scenario emerges for the evaluation of the folded domain present in the C-terminal region of the proteins. Again, the RSMD values fall in the interval of 0.7–0.8 Å. The only significant exception is represented by the C-terminal domain of KCTD8 (1.8 Å) that assumes a peculiar organization in the crystalline state (Figure S3a) that is radically different from the pentameric organization of the C-terminal domain observed in the close homologs KCTD12 and KCTD16 (Figure S3). It is worth mentioning that functional studies have predicted the presence of an additional domain (H2) in the C-terminal region of the sequence of KCTD8 and KCTD16. The inspection of the corresponding AF structures shows the presence of two rather isolated helices at the C-terminus of these two proteins, with a reliable prediction score (90 > LDDT > 70). Although speculative, this observation may suggest that the H2 domain is a sort of flexible modulator of the activity of these proteins.
The excellent prediction of the full-length structure of KCTD1 (RMSD of 0.8 Å) shows the ability of the AF approach to correctly determine the relative position of the two domains. Larger RMSD values are detected for the full-length structure of KCTD5 (2.1–2.4 Å). However, it should be noted that the global structure of this protein is somehow flexible, as demonstrated by its sensitivity to salt conditions and corroborated by molecular dynamics investigations [16,36].
It should be noted that the latest version of AlphaFold (AlfaFold2), which was used to generate the models here analyzed, considers the available experimental PDB structures that are used as templates. Although it has been shown that these templates have a marginal impact on the predictions [26], in principle, the excellent agreement between the prediction and the experimental structures detected here may be biased by the AlphaFold2 protocol. In order to assess the impact of the template on these findings, we ran predictions of the KCTD1 structure using the Colab server that uses the AlphaFold approach without employing any template in the predictive scheme (see Materials and Methods for details). The analyses of the structure predicted using this approach clearly indicate the overall correctness of the structure of the two domains of the protein. A significant increase is observed in the RMSD value computed on the entire structure that passed from 0.8 (AF model versus the experimental one) to 3.1 (Colab model versus the experimental one) Å. The structure of the two individual domains are, however, correctly predicted. Collectively, these findings indicate that the inclusion of the KCTD1 template in the AF approach only (slightly) affected the relative orientation of the two domains that constitute the protein (Figure S4).

3.3. Detecting Analogies among AlphaFold Structures of KCTD

As illustrated in the previous paragraph, the release of the AF-predicted structures grossly expanded the structural data available for KCTD proteins. A global visual inspection of these models also indicates that the folded domain occurring in the C-terminal region of these proteins has some common structural features. Indeed, in almost all structures, despite the significant size variability, this domain consists of a single β-sheet that is surrounded by few helices. On the basis of this observation, we undertook a quantitative comparison of these domains using the pairwise alignment tool of Dali (see Methods for details).
As shown in Table 3, in which the Z-score values that emerged from the Dali structural alignment are grouped according to the clades identified by Baranova and colleagues [23], the folded domains of the family member belonging to the same clade present high similarities, as demonstrated by the high Z-score values (also see Figure S5). This finding is not surprising, as these clades were defined according to the sequence homology of the C-terminal region [23]. Somewhat unexpected is the finding that significant structure analogies are frequently found among KCTD proteins belonging to different clades, despite the absence of significant sequence similarities.
A deeper inspection of the table unravels other interesting trends. The only member of the family whose C-terminal region does not display any structural similarities with the C-terminal regions of the other KCTD proteins is KCTD9, which presents a β-solenoid domain in this region (Figure 1). As expected on the basis of the peculiar structure of their C-terminal region, which is characterized by the presence of a large β-propeller, KCTD3 and SHKBP1 present remarkable similarities only between each other. However, a small but significant similarity (Z-score of 2.7) is detected between the C-terminal region of SHKBP1 and the corresponding region of KCTD21. As shown in Figure S6, the C-terminal domain of KCTD21 partially overlaps with one of the blades of the SHKBP1 β-propeller. Somewhat intriguing is the result that emerged from the search of this domain in KCTD19 that, as described above, has a peculiar structural organization that is characterized by a multidomain globular fold and the presence of three BTB domains (Figure 2). Indeed, small significant similarity in terms of Z-score values is detected between KCTD4, KCTD6, and BTBD10. Surprisingly, the region of KCTD19 that presents this similarity corresponds to the second BTB domain of the protein. As shown in Figure S7, this portion of the protein can be successfully fitted to both the BTB and the C-terminal domain of KCTD4.
Remarkable similarities in terms of pairwise Z-scores between KCTD proteins not belonging to the same previously reported clade are detected between: (a) members of the clades A and F, (b) KCNRG with KCTD6 and KCTD15, and (c) KCTD7 and KCTD14.
The detected structural analogy of the domain in the sequence region that follows the BTB domain of the vast majority of KCTD proteins suggests that also these portions of these proteins have a common origin. This new identified domain that presents a single β-sheet surrounded, depending on the specific KCTD protein, by two to five α-helices is thereafter denoted as the KCTD-CTD domain. Moreover, considering the absence of KCTD-CTD in KCTD9 and SHKBP1/KCTD3, we denote them as non-canonical KCTD proteins.

3.4. Structure-Based Pseudo-Phylogenetic Tree

Once we assessed the reliability of the KCTD structures reported in the AF database and the presence of a conserved but diversified structural domain that follows the BTB domain, we decided to generate a structure-based phylogenetic-like tree by comparing the structures of the KCTD-CTD domain. The corresponding structural dendrogram, which was generated using the Dali package (see Methods for details), is reported in Figure 3. The dendrogram that emerged from this analysis highlights a clear clustering of the KCTD proteins. As expected on the basis of the results illustrated in the previous paragraph, KCTD9 and SHKBP1/KCTD3 are the proteins that are most distant from the other members of the family.
Notably, with the exception of KCTD18, all other canonical members of the family are clustered in groups that contain two or more members of the family (Figure 3). Moreover, some of the groups present some additional branching. Based on these observations, KCTD proteins are classified in clusters and sub-clusters. The first group (cluster 1) contains the proteins KCTD8, KCTD12 and KCTD16 (sub-cluster 1A) and KCTD1 and KCTD15 (sub-cluster 1B). The next group (cluster 2) comprises KCNRG and KCTD6 (sub-cluster 2A) and KCTD11 and KCTD21 (sub-cluster 2B). The upper clustering of the dendrogram is completed by cluster 3 that is made up of KCTD2, KCTD17, and KCTD5 (Figure 3). The upper portion of the diagram is completed by the isolated protein KCTD18 and by cluster 4 that embodies the distantly related KCTD4 and KCTD19. The lower portion of the tree includes clusters 5 and 6. Cluster 5 is articulated in the sub-clusters 5A (BTBD10 and KCTD20) and 5B (KCTD7 and KCTD14), whereas cluster 6 comprises KCTD10, TNFAIP1, and the slightly divergent KCTD13. Finally, in cluster 7, the two non-canonical KCTDs, KCTD3 and SHKBP1, are included.
This overall grouping is confirmed by the multidimensional scaling correspondence analysis of the Z-scores performed by Dali (Figure S8).

4. Discussion

Generalized and reliable predictions of three-dimensional protein structures from their sequences have been the holy grail of structural biology for decades. Indeed, despite the huge technical and methodological advances experienced in recent decades, the experimental determination of biomolecules of such complexity remains a challenging and lengthy task. As is universally agreed, the impact of AlphaFold predictions [26,27], whose accuracy often competes with that of experimental determinations, will certainly revolutionize the structural biology with implications that will affect distant fields of life sciences [37,38]. Here, we exploited the structural information contained in the AF database to gain insights into the relationships among the members of the KCTD family with the aim to facilitate the definition of the structure/molecular basis of their key roles in many biological processes.
The most important finding that emerged from the analysis is the discovery that, in addition to the BTB domain, the vast majority of these proteins share a structurally similar domain in the C-terminal region (KCTD-CTD) despite the general absence of sequence similarities detectable in these regions. In this scenario, the few exceptions (KCTD9, KCTD3, and SHKBP1), which present a different repertoire of domains, may be considered non-canonical members of the family. Nevertheless, the individual blades of the KCTD3/SHKBP1 β-propeller present some significant similarity with the CTD domain of KCTD21. Although this analogy may be coincidental, it can also be reminiscent of a very distant common origin of the β-propeller and the KCTD-CTD. Intermediate features are displayed by KCTD19 that presents a collapsed globular structure, in which three distinct BTB domains can be identified. Intriguingly, the third BTB domain of this protein has significant structural analogies with some CTD domains of the protein family (Table 3), although it is more similar to the other BTB domains. In structural terms, the BTB3 domain of KCTD19 is a sort of link between the BTB and the KCTD-CTD structures. Although this analogy may be due to a convergent evolution, it is intriguing to speculate that even the BTB and the CTD domain could share a common ancestor.
The finding that KCTD proteins, in addition to the BTB, also share a rather conserved CTD domain, whose structure is very likely to be correctly predicted by AF, prompted us to build a structure-based pseudo-phylogenetic tree that was generated by comparing the divergence of this domain throughout the family. This led to a new clustering of the KCTD proteins that also presents, along with analogies, significant differences when compared to the trees built on the basis of the BTB sequence comparisons. In general terms, the analysis of this pseudo-phylogenetic tree indicates that KCTD proteins belonging to different clusters may play important roles in cancer [7] or in neurodevelopmental and neuropsychiatric disorders [6], the most important diseases in which these proteins are involved. This may be the consequence of the adaptor function of some KCTDs that can recruit different substrates for ubiquitination in different physio-pathological contexts [6] or of other basic biochemical activities of these proteins yet to be identified. Nevertheless, novel clustering discriminates KCTD proteins on the basis of their ability to bind cullin 3. Indeed, independent experimental reports [15,17,19] have clearly demonstrated that members of cluster 1 (KCTD1/KCTD15 and KCTD8/KCTD12/KCTD16) are unable to bind cullin and, therefore, cannot function as adaptors of cullin-ring ligases. It is likely that the separation of these proteins from the nearby group of KCTD6/KCNRG/KCTD11/KCTD21, which do act as adaptors in the ubiquitination process, has been accompanied by a radical change of function of these proteins. Moreover, in this particular clustering, the analysis of the grouping of KCTD6, KCNRG, KCTD11, and KCTD21 indicates that these four proteins are generally related but separated in two divergent groups (KCTD6/KCNRG and KCTD11/KCTD21). Therefore, the KCASH functional subfamily of KCTD6/KCTD11/KCTD21 [39] should also include KCNRG as a member. Since the other three members of the group are able to downregulate HDAC [1,39], it would be of interest to verify whether KCNRG can also accomplish this function. Furthermore, the tree unravels the similarity of the pair KCTD20/BTBD10 with KCTD7/KCTD14. It also shows that the most divergent group within the canonical KCTDs is represented by KCTD10/KCTD13/TNFBP1, which are involved in the ubiquitination and degradation of small GTPases.
It is important to note that in the current AF database, only individual polypeptide chains are reported. As KCTD proteins generally operate as oligomers, likely pentamers, the ability of this predictive approach to provide information in spite of this important limitation is impressive. Obviously, the elucidation of the structure of KCTDs’ oligomeric states will hold important implications for their function. Our preliminary molecular modeling and dynamics studies indicate that most of the KCTD-CTD domains may form assemblies, such as those detected in the experimental structures of the CTD domains of KCTD5, KCTD1, KCTD12, and KCTD16, in which the association of the five monomers generates a channel that can be functionally important. Although further investigations are needed for a complete definition of the structural properties of these proteins, we believe that the AF database represents a treasury whose exploitation will revolutionize many areas of life sciences.
In conclusion, not only does this work give a structural interpretation of existing functional data on KCTD proteins but it also provides a new interpretative tool for the results that will emerge from future experiments.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/biom11121862/s1, Figure S1: Human KCTD protein tree, Figure S2: Schematic representation of AlphaFold KCTD19 structure, Figure S3: Schematic representation of PDB experimental structures for selected human KCTD proteins, Figure S4: Superimposition of different human KCTD1 structures, Figure S5: Similarity heatmap showing the relationships between the KCTD-CTD domains of the analyzed proteins, Figure S6: Superimposition of the C-terminal domains of AF models for SHKBP1 protein and the KCTD21 protein, Figure S7: Structural comparison of the AF-KCTD4 protein domains with the AF-KCTD19 BTB2 protein domain, Figure S8: Correspondence analysis plot.

Author Contributions

Conceptualization, L.V.; methodology, L.E., L.V.; validation, L.E., N.B., L.V.; investigation, L.E., N.B., G.S., A.R., R.B., L.V.; data curation, L.E., L.V.; writing—original draft preparation, L.E., L.V.; writing—review and editing, L.E., N.B., G.S., A.R., R.B., L.V.; supervision, L.E., L.V.; project administration, L.E., L.V. All authors have read and agreed to the published version of the manuscript.

Funding

The present work was financed by the Italian Ministry of Health by Grant GR2018-12366091 “Unravelling the role of KCTD protein family in the clinical management of childhood acute lymphoblastic leukemias”. We also thank Cineca for computational resources (ISCRA B project—ID HP10BBY7W1).

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki and approved by the Ethics Committee of the Medical Council of Rhineland Palatinate Germany (837.374.16).

Data Availability Statement

All datasets analyzed in this study are available from the corresponding authors on reasonable request.

Acknowledgments

We acknowledge the administrative and technical support from Florinda Pignatiello and Luca De Luca, respectively.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Canettieri, G.; Di Marcotullio, L.; Greco, A.; Coni, S.; Antonucci, L.; Infante, P.; Pietrosanti, L.; De Smaele, E.; Ferretti, E.; Miele, E.; et al. Histone deacetylase and Cullin3-REN(KCTD11) ubiquitin ligase interplay regulates Hedgehog signalling through Gli acetylation. Nat. Cell Biol. 2010, 12, 132–142. [Google Scholar] [CrossRef] [PubMed]
  2. Brockmann, M.; Blomen, V.A.; Nieuwenhuis, J.; Stickel, E.; Raaben, M.; Bleijerveld, O.B.; Altelaar, A.F.M.; Jae, L.T.; Brummelkamp, T.R. Genetic wiring maps of single-cell protein states reveal an off-switch for GPCR signalling. Nature 2017, 546, 307–311. [Google Scholar] [CrossRef] [PubMed]
  3. Escamilla, C.O.; Filonova, I.; Walker, A.K.; Xuan, Z.X.; Holehonnur, R.; Espinosa, F.; Liu, S.; Thyme, S.B.; Lopez-Garcia, I.A.; Mendoza, D.B.; et al. Kctd13 deletion reduces synaptic transmission via increased RhoA. Nature 2017, 551, 227–231. [Google Scholar] [CrossRef]
  4. Golzio, C.; Willer, J.; Talkowski, M.E.; Oh, E.C.; Taniguchi, Y.; Jacquemont, S.; Reymond, A.; Sun, M.; Sawa, A.; Gusella, J.F.; et al. KCTD13 is a major driver of mirrored neuroanatomical phenotypes of the 16p11.2 copy number variant. Nature 2012, 485, 363–367. [Google Scholar] [CrossRef] [PubMed]
  5. Schwenk, J.; Metz, M.; Zolles, G.; Turecek, R.; Fritzius, T.; Bildl, W.; Tarusawa, E.; Kulik, A.; Unger, A.; Ivankova, K.; et al. Native GABA(B) receptors are heteromultimers with a family of auxiliary subunits. Nature 2010, 465, 231–235. [Google Scholar] [CrossRef] [PubMed]
  6. Teng, X.; Aouacheria, A.; Lionnard, L.; Metz, K.A.; Soane, L.; Kamiya, A.; Hardwick, J.M. KCTD: A new gene family involved in neurodevelopmental and neuropsychiatric disorders. CNS Neurosci. Ther. 2019, 25, 887–902. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Angrisani, A.; Di Fiore, A.; De Smaele, E.; Moretti, M. The emerging role of the KCTD proteins in cancer. Cell Commun. Signal. 2021, 19, 56. [Google Scholar] [CrossRef]
  8. Marneros, A.G.; Beck, A.E.; Turner, E.H.; McMillin, M.J.; Edwards, M.J.; Field, M.; de Macena Sobreira, N.L.; Perez, A.B.; Fortes, J.A.; Lampe, A.K.; et al. Mutations in KCTD1 cause scalp-ear-nipple syndrome. Am. J. Hum. Genet. 2013, 92, 621–626. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  9. Smaldone, G.; Balasco, N.; Pirone, L.; Caruso, D.; Di Gaetano, S.; Pedone, E.M.; Vitagliano, L. Molecular basis of the scalp-ear-nipple syndrome unraveled by the characterization of disease-causing KCTD1 mutants. Sci. Rep. 2019, 9, 10519. [Google Scholar] [CrossRef] [PubMed]
  10. Willer, C.J.; Speliotes, E.K.; Loos, R.J.; Li, S.; Lindgren, C.M.; Heid, I.M.; Berndt, S.I.; Elliott, A.L.; Jackson, A.U.; Lamina, C.; et al. Six new loci associated with body mass index highlight a neuronal influence on body weight regulation. Nat. Genet. 2009, 41, 25–34. [Google Scholar] [CrossRef] [PubMed]
  11. Pirone, L.; Smaldone, G.; Spinelli, R.; Barberisi, M.; Beguinot, F.; Vitagliano, L.; Miele, C.; Di Gaetano, S.; Raciti, G.A.; Pedone, E. KCTD1: A novel modulator of adipogenesis through the interaction with the transcription factor AP2alpha. Biochim. Biophys. Acta Mol. Cell Biol. Lipids 2019, 1864, 158514. [Google Scholar] [CrossRef] [PubMed]
  12. Zollman, S.; Godt, D.; Prive, G.G.; Couderc, J.L.; Laski, F.A. The BTB domain, found primarily in zinc finger proteins, defines an evolutionarily conserved family that includes several developmentally regulated genes in Drosophila. Proc. Natl. Acad. Sci. USA 1994, 91, 10717–10721. [Google Scholar] [CrossRef] [Green Version]
  13. Stogios, P.J.; Downs, G.S.; Jauhal, J.J.; Nandra, S.K.; Prive, G.G. Sequence and structural analysis of BTB domain proteins. Genome Biol. 2005, 6, R82. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Smaldone, G.; Pirone, L.; Pedone, E.; Marlovits, T.; Vitagliano, L.; Ciccarelli, L. The BTB domains of the potassium channel tetramerization domain proteins prevalently assume pentameric states. FEBS Lett. 2016, 590, 1663–1671. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Pinkas, D.M.; Sanvitale, C.E.; Bufton, J.C.; Sorrell, F.J.; Solcan, N.; Chalk, R.; Doutch, J.; Bullock, A.N. Structural complexity in the KCTD family of Cullin3-dependent E3 ubiquitin ligases. Biochem. J. 2017, 474, 3747–3761. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  16. Dementieva, I.S.; Tereshko, V.; McCrossan, Z.A.; Solomaha, E.; Araki, D.; Xu, C.; Grigorieff, N.; Goldstein, S.A. Pentameric assembly of potassium channel tetramerization domain-containing protein 5. J. Mol. Biol. 2009, 387, 175–191. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Ji, A.X.; Chu, A.; Nielsen, T.K.; Benlekbir, S.; Rubinstein, J.L.; Prive, G.G. Structural Insights into KCTD Protein Assembly and Cullin3 Recognition. J. Mol. Biol. 2016, 428, 92–107. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  18. Balasco, N.; Smaldone, G.; Vitagliano, L. The Structural Versatility of the BTB Domains of KCTD Proteins and Their Recognition of the GABAB Receptor. Biomolecules 2019, 9, 323. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  19. Smaldone, G.; Pirone, L.; Balasco, N.; Di Gaetano, S.; Pedone, E.M.; Vitagliano, L. Cullin 3 Recognition Is Not a Universal Property among KCTD Proteins. PLoS ONE 2015, 10, e0126808. [Google Scholar] [CrossRef] [PubMed]
  20. Balasco, N.; Pirone, L.; Smaldone, G.; Di Gaetano, S.; Esposito, L.; Pedone, E.M.; Vitagliano, L. Molecular recognition of Cullin3 by KCTDs: Insights from experimental and computational investigations. Biochim. Biophys. Acta 2014, 1844, 1289–1298. [Google Scholar] [CrossRef]
  21. Zarelli, V.E.; Dawid, I.B. Inhibition of neural crest formation by Kctd15 involves regulation of transcription factor AP-2. Proc. Natl. Acad. Sci. USA 2013, 110, 2870–2875. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Zheng, S.; Abreu, N.; Levitz, J.; Kruse, A.C. Structural basis for KCTD-mediated rapid desensitization of GABAB signalling. Nature 2019, 567, 127–131. [Google Scholar] [CrossRef]
  23. Skoblov, M.; Marakhonov, A.; Marakasova, E.; Guskova, A.; Chandhoke, V.; Birerdinc, A.; Baranova, A. Protein partners of KCTD proteins provide insights about their functional roles in cell differentiation and vertebrate development. Bioessays 2013, 35, 586–596. [Google Scholar] [CrossRef] [PubMed]
  24. Liu, Z.; Xiang, Y.; Sun, G. The KCTD family of proteins: Structure, function, disease relevance. Cell Biosci. 2013, 3, 45. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Smaldone, G.; Beneduce, G.; Incoronato, M.; Pane, K.; Franzese, M.; Coppola, L.; Cordella, A.; Parasole, R.; Ripaldi, M.; Nassa, G.; et al. KCTD15 is overexpressed in human childhood B-cell acute lymphoid leukemia. Sci. Rep. 2019, 9, 20108. [Google Scholar] [CrossRef] [Green Version]
  26. Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Zidek, A.; Potapenko, A.; et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar] [CrossRef] [PubMed]
  27. Senior, A.W.; Evans, R.; Jumper, J.; Kirkpatrick, J.; Sifre, L.; Green, T.; Qin, C.; Zidek, A.; Nelson, A.W.R.; Bridgland, A.; et al. Improved protein structure prediction using potentials from deep learning. Nature 2020, 577, 706–710. [Google Scholar] [CrossRef] [PubMed]
  28. Correale, S.; Pirone, L.; Di Marcotullio, L.; De Smaele, E.; Greco, A.; Mazza, D.; Moretti, M.; Alterio, V.; Vitagliano, L.; Di Gaetano, S.; et al. Molecular organization of the cullin E3 ligase adaptor KCTD11. Biochimie 2011, 93, 715–724. [Google Scholar] [CrossRef] [PubMed]
  29. Laskowski, R.A.; Jablonska, J.; Pravda, L.; Varekova, R.S.; Thornton, J.M. PDBsum: Structural summaries of PDB entries. Protein Sci. 2018, 27, 129–134. [Google Scholar] [CrossRef] [PubMed]
  30. Holm, L.; Laakso, L.M. Dali server update. Nucleic Acids Res. 2016, 44, W351–W355. [Google Scholar] [CrossRef] [PubMed]
  31. Holm, L.; Rosenstrom, P. Dali server: Conservation mapping in 3D. Nucleic Acids Res. 2010, 38, W545–W549. [Google Scholar] [CrossRef]
  32. Holm, L.; Sander, C. Protein structure comparison by alignment of distance matrices. J. Mol. Biol. 1993, 233, 123–138. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Fredslund, J. PHY.FI: Fast and easy online creation and manipulation of phylogeny color figures. BMC Bioinform. 2006, 7, 315. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Sereikaite, V.; Fritzius, T.; Kasaragod, V.B.; Bader, N.; Maric, H.M.; Schindelin, H.; Bettler, B.; Stromgaard, K. Targeting the gamma-Aminobutyric Acid Type B (GABAB) Receptor Complex: Development of Inhibitors Targeting the K(+) Channel Tetramerization Domain (KCTD) Containing Proteins/GABAB Receptor Protein-Protein Interaction. J. Med. Chem. 2019, 62, 8819–8830. [Google Scholar] [CrossRef] [PubMed]
  35. Zuo, H.; Glaaser, I.; Zhao, Y.; Kurinov, I.; Mosyak, L.; Wang, H.; Liu, J.; Park, J.; Frangaj, A.; Sturchler, E.; et al. Structural basis for auxiliary subunit KCTD16 regulation of the GABAB receptor. Proc. Natl. Acad. Sci. USA 2019, 116, 8370–8379. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. Barone, D.; Balasco, N.; Vitagliano, L. KCTD5 is endowed with large, functionally relevant, interdomain motions. J. Biomol. Struct. Dyn. 2016, 34, 1725–1735. [Google Scholar] [CrossRef]
  37. Callaway, E. ‘It will change everything’: DeepMind’s AI makes gigantic leap in solving protein structures. Nature 2020, 588, 203–204. [Google Scholar] [CrossRef] [PubMed]
  38. Service, R.F. ‘The game has changed.’ AI triumphs at protein folding. Science 2020, 370, 1144–1145. [Google Scholar] [CrossRef] [PubMed]
  39. De Smaele, E.; Di Marcotullio, L.; Moretti, M.; Pelloni, M.; Occhione, M.A.; Infante, P.; Cucchi, D.; Greco, A.; Pietrosanti, L.; Todorovic, J.; et al. Identification and characterization of KCASH2 and KCASH3, 2 novel Cullin3 adaptors suppressing histone deacetylase and Hedgehog activity in medulloblastoma. Neoplasia 2011, 13, 374–385. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Representative AF-predicted models of the following KCTD proteins: KCTD1 (a), KCTD5 (b), KCTD3 (c), KCTD9 (d), and KCTD19 (e). The color code of the cartoons follows that used by AlphaFold to report the reliability of the models. Blue, cyan, orange, and yellow protein regions correspond to very high (LDDT > 90), high (90 > LDDT > 70), low (70 > LDDT > 50), and very low (LDDT < 50) model confidence, respectively.
Figure 1. Representative AF-predicted models of the following KCTD proteins: KCTD1 (a), KCTD5 (b), KCTD3 (c), KCTD9 (d), and KCTD19 (e). The color code of the cartoons follows that used by AlphaFold to report the reliability of the models. Blue, cyan, orange, and yellow protein regions correspond to very high (LDDT > 90), high (90 > LDDT > 70), low (70 > LDDT > 50), and very low (LDDT < 50) model confidence, respectively.
Biomolecules 11 01862 g001
Figure 2. Schematic representation of protein domain organization. For the KCTD10, KCTD13, and TNFAIP1 proteins, AF predicts that a potential β-strand of the flexible C-terminal region could join the β-sheet of the CTD domain. The Dom domain of KCTD19 presents some similarity to the TAFH domain.
Figure 2. Schematic representation of protein domain organization. For the KCTD10, KCTD13, and TNFAIP1 proteins, AF predicts that a potential β-strand of the flexible C-terminal region could join the β-sheet of the CTD domain. The Dom domain of KCTD19 presents some similarity to the TAFH domain.
Biomolecules 11 01862 g002
Figure 3. Dali dendrogram generated using the structure of the KCTD-CTD domain. Representative three-dimensional models of the folded domains present in the C-terminal region are also shown. Helices and β-sheets are shown in cyan and magenta, respectively. The cluster numbering is reported in red.
Figure 3. Dali dendrogram generated using the structure of the KCTD-CTD domain. Representative three-dimensional models of the folded domains present in the C-terminal region are also shown. Helices and β-sheets are shown in cyan and magenta, respectively. The cluster numbering is reported in red.
Biomolecules 11 01862 g003
Table 2. Details of the KCTD AlphaFold models.
Table 2. Details of the KCTD AlphaFold models.
ProteinAlphaFold
Code
Low Structured and/or Low Confident Regions 1BTB DomainCTD Domain
KCTD1AF-Q719H91–17/242–257A30-T133P139-L239
KCTD2AF-Q146811–71/239–263R72-T178V183-N239
KCTD3AF-Q9Y5971–17/139–181/598–815E18-L115- 4
KCTD4AF-Q8WVF51–32/198–217T33-L135T140-K259
KCTD5AF-Q9NXV21–41/211–234V42-T149V154-N211
KCTD6AF-Q8NC69-D12-D107M114-K234
KCTD7AF-Q96MP81–46/196–224P50-G158Y162-W289
KCTD8AF-Q6ZWB61–41/148–203/325–434E44-L145R205-P322
KCTD9AF-Q7L27373–88D89-S191- 4
KCTD10AF-Q9H3F61–18/274–295Y33-Q129P141-E260
KCTD11 2AF-Q693B1-G14-A123A126-H271
KCTD12AF-Q96CX21–29/132–204/P34-A131R206-E325
KCTD13AF-Q8WZ191–40/272–303/311–329K41-E142I150-T271
KCTD14AF-Q9BQ131–28T33-D124M125-W255
KCTD15AF-Q96SI11–43/266–283A56-R162A165-E265
KCTD16AF-Q68DU81–21/124–161/281–389E25-T122K162-P280
KCTD17AF-Q8N5Z51–29/197–260/265–321G30-V135P139-H196
KCTD18AF-Q6PI47242–426D12-S118P140-L242
KCTD19AF-Q17RG1108–171/259–282/
501–756/788–803
D13-E107/
V172-M258/
Q396-Q487 3
- 4
KCD20AF-Q7Z5Y71–115/359–419E117-C216D218-E356
KCTD21AF-Q4G0X4-P4-K107N109-R260
KCNRGAF-Q8N5I3246–272E5- Q104P107-I245
SH3KBP1AF-Q8TBC3139–192/604–707E19-R118- 4
TNFAIP1AF-Q138291–27/258–290/299–316K28-S130I137-E257
BTBD10AF-Q9BSF81–148/409–475M149-C266D268-W405
1 Confident regions correspond to those with AF parameter LDDT > 70. Unstructured regions correspond to long (>15 residues) fragments with a marginal content of secondary structure and a low confidence prediction (LDDT < 70). 2 The sequence numbering refers to the long variant. The structural data were derived from the model obtained with the Colab server (see Methods for details). 3 The three regions correspond to the three BTB domains of the protein. 4 The KCTD-CTD domain has been not clearly identified in the AF structure of the protein (see text).
Table 3. Dali Z-score similarity matrix derived from the structural alignment of protein C-terminal domains. The clade definition is the one used in Figure S1. Dashes are reported for pairs for which Dali does not detect any significant similarity. The names of the proteins are abbreviated due to space limitations.
Table 3. Dali Z-score similarity matrix derived from the structural alignment of protein C-terminal domains. The clade definition is the one used in Figure S1. Dashes are reported for pairs for which Dali does not detect any significant similarity. The names of the proteins are abbreviated due to space limitations.
K1K15K8K12K16K6K11K21K4K20B10K18K19KCNK7K14K10K13TNFK2K5K17K3SHKK9
Clade
A
K123.319.610.19.69.88.15.92.72.2--2.9-9.52.32.5---4.13.93.7---
K15 23.310.49.910.17.95.12.12.0--2.7-9.82.63.6---4.94.85.0---
Clade
F
K8 22.819.520.47.85.8-2.0----8.12.3----2.72.82.8---
K12 22.819.27.65.7------8.9-----2.52.52.6---
K16 22.77.55.0-2.1--2.2-8.0-2.3---3.23.03.3---
Clade
B
K6 24.47.85.83.3--3.12.211.62.22.3---3.73.83.5---
K11 26.914.25.3----8.7-----------
K21 25.22.1----9.2---------2.7-
K4 27.3---2.33.3-----------
Clade
G
K20 25.322.9---5.66.6---------
B10 25.0-2.22.15.76.6---------
K18 21.8-2.5-----------
K19 54.7--2.0---------
KCN 27.03.83.4---4.84.54.8---
K7 24.913.4---3.03.0----
K14 27.12.42.22.43.03.03.0---
Clade
C
K10 25.621.423.5------
K13 27.022.4------
TNF 26.7------
Clade
E
K2 12.110.710.8---
K5 12.010.2---
K17 11.9---
Clade
D
K3 62.750.3-
SHK 63.6-
K9 39.6
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Esposito, L.; Balasco, N.; Smaldone, G.; Berisio, R.; Ruggiero, A.; Vitagliano, L. AlphaFold-Predicted Structures of KCTD Proteins Unravel Previously Undetected Relationships among the Members of the Family. Biomolecules 2021, 11, 1862. https://doi.org/10.3390/biom11121862

AMA Style

Esposito L, Balasco N, Smaldone G, Berisio R, Ruggiero A, Vitagliano L. AlphaFold-Predicted Structures of KCTD Proteins Unravel Previously Undetected Relationships among the Members of the Family. Biomolecules. 2021; 11(12):1862. https://doi.org/10.3390/biom11121862

Chicago/Turabian Style

Esposito, Luciana, Nicole Balasco, Giovanni Smaldone, Rita Berisio, Alessia Ruggiero, and Luigi Vitagliano. 2021. "AlphaFold-Predicted Structures of KCTD Proteins Unravel Previously Undetected Relationships among the Members of the Family" Biomolecules 11, no. 12: 1862. https://doi.org/10.3390/biom11121862

APA Style

Esposito, L., Balasco, N., Smaldone, G., Berisio, R., Ruggiero, A., & Vitagliano, L. (2021). AlphaFold-Predicted Structures of KCTD Proteins Unravel Previously Undetected Relationships among the Members of the Family. Biomolecules, 11(12), 1862. https://doi.org/10.3390/biom11121862

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop