Next Article in Journal
The Protective Effect of Zebularine, an Inhibitor of DNA Methyltransferase, on Renal Tubulointerstitial Inflammation and Fibrosis
Next Article in Special Issue
Aggregation of Disordered Proteins Associated with Neurodegeneration
Previous Article in Journal
Flavin Adenine Dinucleotide (FAD) and Pyridoxal 5′-Phosphate (PLP) Bind to Sox9 and Alter the Expression of Key Pancreatic Progenitor Transcription Factors
Previous Article in Special Issue
Does Generic Cyclic Kinase Insert Domain of Receptor Tyrosine Kinase KIT Clone Its Native Homologue?
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Intrinsically Disordered Proteins: An Overview

by
Rakesh Trivedi
1,* and
Hampapathalu Adimurthy Nagarajaram
2
1
Department of Translational Molecular Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX 77054, USA
2
Laboratory of Computational Biology, Department of Systems and Computational Biology, School of Life Sciences, University of Hyderabad, Hyderabad 500046, Telangana, India
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2022, 23(22), 14050; https://doi.org/10.3390/ijms232214050
Submission received: 4 September 2022 / Revised: 7 November 2022 / Accepted: 8 November 2022 / Published: 14 November 2022
(This article belongs to the Special Issue Intrinsically Disordered Proteins (IDPs) 2.0)

Abstract

:
Many proteins and protein segments cannot attain a single stable three-dimensional structure under physiological conditions; instead, they adopt multiple interconverting conformational states. Such intrinsically disordered proteins or protein segments are highly abundant across proteomes, and are involved in various effector functions. This review focuses on different aspects of disordered proteins and disordered protein regions, which form the basis of the so-called “Disorder–function paradigm” of proteins. Additionally, various experimental approaches and computational tools used for characterizing disordered regions in proteins are discussed. Finally, the role of disordered proteins in diseases and their utility as potential drug targets are explored.

1. Introduction

The functional aspects of genes have been attributed to RNAs and proteins. Of the two, proteins are the ones that bring about the majority of diverse cellular effector functions. Two paradigms concerning the structure and function of proteins have evolved, as shown in Figure 1 [1,2,3,4,5,6]. The first paradigm corresponds to the well-established, often assumed as the default for proteins, ‘structure–function paradigm’, which states that the three-dimensional native structure under physiological conditions is the prerequisite for a protein to function. The second paradigm is the recently established ‘disorder–function paradigm’ based on the proteins that perform cellular functions without attaining a stable three-dimensional structure under physiological conditions [7]. The naturally occurring, biologically active proteins that appear to possess a high degree of conformational flexibility have been referred to as intrinsically disordered proteins (IDPs) [5,8,9,10]. In most instances, instead of the whole protein, only some regions in the protein are disordered and functional; such protein segments are known as intrinsically disordered regions (IDRs) [11,12,13,14]. Interestingly, these intrinsically disordered proteins/regions have endowed proteins with functional promiscuity [15,16,17,18].

1.1. Structure–Function Paradigm

In the early stages, protein research was focused on studying globular (ordered) proteins; therefore, the principles laid to explain the protein structures were more specific to globular proteins. These fundamentals defining the protein structures received continuous experimental support throughout the 20th century from the multitude of protein structures submitted to repositories, such as that of the Protein Data Bank (PDB) [19]. There was no doubt about the relatedness of the protein structure and function. Hence, the central philosophy of structural biology was described as follows: “a protein requires a native folded structure to perform its biological function”.
Proteins were initially considered as rigid, crystal-like biological molecules with no flexibility. This “static” view of protein function was supported by one of the best-understood protein functions, catalysis [20]. This static view of proteins led to the assumption that the complementarity of shapes at the binding interface and the emerging non-covalent forces between molecules are the major drivers of interactions involving proteins [21]. After Koshland proposed the “induced-fit model” of catalysis, the theory of presence of some degree of flexibility in protein structures to carry out their biological functions was accepted [22].
The three-dimensional native structure of a protein, which is at the center of the structure–function paradigm, is encrypted in the protein’s amino acid sequence and is governed by the intricate balance of various physical forces between its atoms [23]. Physical forces include strong covalent (peptide/amide) bonds, which connect atoms of the polypeptide sequence, and weaker attractive/repulsive forces between non-bonded atoms. All of these fundamental interactions act coherently and assist a polypeptide in attaining a well-defined three-dimensional structure under physiological conditions. A unique 3-D structure of a protein ensures a precise spatial arrangement of residues, which provides a particular physicochemical microenvironment required for the extremely specific catalysis of chemical reactions, binding of ligands, ion or small molecule translocation, or macromolecular complex assembly [24]. Along with the deposition of a large number of protein structures into the PDB, the availability of the mechanistic details of a protein’s functions, such as enzyme catalysis, receptors, transporters, membrane channels, etc., also helped the acceptance of the structure–function paradigm. However, with increasing reports of the structural/functional data deviating from the classical structure–function paradigm, there was a need to reassess this well-established paradigm, which, in turn, led to the proposition of the disorder–function paradigm.

1.2. Disorder–Function Paradigm

The early recognition of the significance of protein structures in their functioning is generally considered the prime reason for the prolonged negligence of protein disorder. Unusual behavior of proteins, such as missing electron density in the PDB structures, increased sensitivity to in vitro proteolysis, peculiar behavior during the purification process, etc., has always been observed, but in the light of the classical structure–function paradigm, these observations were initially considered as artifacts. This “dark side” of structural biology underwent an intensive reassessment against the backdrop of proteins’ considerable inherent flexibility and peculiar properties, and resulted in the recognition of the intrinsic structural disorder of proteins as a standard feature across proteomes. Altogether, the widespread occurrence of intrinsically disordered proteins or protein segments and their functional relevance has led to the introduction of a new generalization with respect to the protein structure and function, i.e., the ‘disorder–function paradigm’.
Intrinsically disordered proteins (IDPs) or regions (IDRs) remain unfolded and carry out their biological role under physiological conditions [25]. Just as in the case of ordered/globular proteins, the structural organizations of IDPs/IDRs are also governed by the same physical forces, albeit with some differences in their balance and emerging protein structure dynamics [26]. Most of the early concepts regarding the structure of IDPs/IDRs came from the unfolding/folding reaction studies of globular proteins. At the level of genome organization, large proportions of gene sequences have also been observed to code for functional long amino acid stretches, which most likely either attain a non-globular conformation or exist as unfolded entities in solution as IDPs/IDRs [27].
Over the years, our understanding of the protein structure and function has increased tremendously. However, with the increasing reports of the involvement of protein disorder in carrying out cellular functions and the development of computational tools to predict disordered residues of proteins, there has been an exponential increase in interest in studying IDPs/IDRs. This review article summarizes various features of intrinsically disordered proteins/regions that form the basis of this newly established ‘disorder–function paradigm’.

2. Intrinsic Protein Disorder

Repeated occurrences of proteins with intrinsic flexibility and properties different from those of ordered/globular proteins gradually resulted in the development of the notion that non-rigid proteins are not exceptions. Around 2000, these naturally flexible proteins were accepted as a general category of proteins [27,28,29,30]. The conformational flexibility of these “non-traditional” proteins was proposed to be the source of their biological functions [31]. Over the years, various terms have been introduced by different authors to describe these proteins with inherent flexibility. It was only in recent years that the phrase “intrinsically disordered proteins” (IDPs) became more widely used than other terms [28]. Most acceptably, “intrinsic protein disorder” defines the biologically active proteins or protein segments that exist as ensembles of unfolded, collapsed, extended, non-globular conformations at the secondary or tertiary structural level [27,28,29,30,32,33].

2.1. Natural Abundance of Intrinsically Disordered Proteins

Information about IDPs was very sparse for a very long time. Until the reports on the experimentally characterized IDPs, it appeared improbable to have such a class of proteins in abundance [34]. However, computational predictions revealed the possible widespread occurrence of IDPs and IDRs [35,36,37]. The exhaustive analysis of 31 genomes spanning three kingdoms of life revealed that a considerable number of proteins contain regions with 40 or more consecutive disordered residues [28]. The proportion of structural disorder was also found to increase progressively with genome complexity, from bacteria to archaea, and then to eukaryotes [28,38]. While 33% of eukaryotic proteins have been reported to contain at least one functionally relevant long (>30 residues) intrinsically disordered region, archaean and eubacterial proteins possess only 2.0% and 4.2% of such functional IDRs, respectively [36]. In viral proteomes, the total intrinsic disorder content is determined by the nature of the nucleic acid constituting the viral genome, and it decreases successively with an increase in the size of the viral proteome [39]. Recently, 3133 unique proteins were experimentally validated to contain functional long disordered regions (at least 30 residues) [40]. Additionally, the degree of disorderedness in proteomes and essential proteins was estimated for various genomes, and a sharp increase was observed at the prokaryote/eukaryote boundary [41]. Thus, the natural abundance of disordered proteins or protein segments across complex genomes suggests that, even though IDPs/IDRs fail to attain stable three-dimensional structures under physiological conditions, they are of high functional pertinence [27,28,29,30,42,43,44,45,46,47].

2.2. Sequence Characteristics of Intrinsically Disordered Proteins

The propensity of a protein or protein segment to fold or remain unfolded under physiological conditions is encrypted in its amino acid sequence [28,30,32,33,45,46]. In other words, the amino acid sequence composition determines whether a protein would be an ordered protein with a stable folded 3-D structure or an unfolded intrinsically disordered protein. Strong electrostatic repulsions due to a higher net charge and a lack of driving force for compaction due to low mean hydrophobicity are generally considered as the prime reasons for the unfolded, extended structure of IDPs/IDRs [45].
An in-depth comparative analysis of the sequence composition of ordered and disordered proteins revealed that residues such as Ala, Arg, Gly, Gln, Glu, Lys, Pro, and Ser (referred to as disorder-promoting residues) occurred more frequently in IDPs/IDRs. In contrast, residues such as Asn, Cys, Ile, Leu, Phe, Val, Trp, and Tyr were more common in the ordered/structured segments of the proteins (referred to as order-promoting residues) [28,48,49,50]. Comparative studies of amino acid residues in disordered and ordered regions, physicochemical property-based scales (such as the coordination number, aromaticity, strand propensities, flexibility index, volume, helix propensities, etc.), and composition-based features (e.g., any combination that has one to four residues in the group) have led to the distinction between disordered and ordered regions in proteins [28,50]. Compositional bias and sequence characteristics play a significant role in defining the interactions of disordered proteins/regions [51,52]. Furthermore, the distinct compositional bias of the intrinsically disordered proteins/regions as compared with the ordered proteins/regions forms the basis for developing many disorder-predicting computational tools. For instance, tools involving the alignment of IDPs/IDRs use specific amino acid substitution scoring matrices reflecting the frequency of occurrence of different residues in the disordered regions of proteins [53,54,55,56].

2.3. Structural Aspects of Intrinsically Disordered Proteins

In general, most proteins exist as a combination of both ordered and disordered segments in different proportions [57,58,59,60,61,62]. Unlike the ordered regions of proteins, which exist as stable secondary/tertiary structures, the disordered regions fail to attain a stable three-dimensional native structure under physiological conditions. However, the structure of IDPs/IDRs can be best defined as the ensemble of functionally relevant interconverting transient structures on a fast time scale.
Several previous studies reported that IDPs/IDRs are enriched with uncharged and polar amino acids, lacks bulky hydrophobic residues, and exist as dynamic heterogeneous ensembles of collapsed or extended structures [63,64,65,66]. Furthermore, IDPs/IDRs exhibit different degrees of foldability, ranging from potentially foldable to not foldable at all [59,60,61,67].
IDPs/IDRs do not possess a precise equilibrium value of the atomic coordinates and backbone Ramachandran angles over time; as a result, they appear as “protein clouds” [68]. Despite being highly dynamic, the structures of such protein clouds are best described as a few low-energy conformations [69,70].
In most cases, upon interaction with specific binding partners, a disordered protein/segment undergoes a disorder to order conformational transition (termed as ‘induced folding’) [27,28,42,43,44,47,70,71,72,73,74,75],. Additionally, a disordered protein can bind to multiple partners to attain distinct conformations with each of them, which, in turn, enable it to interact with different targets [28,76,77]. Moreover, there is also a “conformational preference” for the structure attained by IDPs upon binding [78,79]. Post-translational modifications (PTMs) also enable IDPs/IDRs to attain diverse conformations, thus increasing the total repertoire of structures resulting from an IDP/IDR sequence [42,72,80,81,82,83,84]. Ensembles of the conformations resulting from a single sequence make it possible for IDPs to perform multiple apparently unrelated biological functions (termed as ‘moonlighting’) required for the maintenance of life [85,86,87,88,89].

2.4. Functional Classification of Intrinsically Disordered Proteins

Several classification schemes have been proposed over the years based on the functions performed by IDPs/IDRs [42,90]. Tompa et al. annotated IDPs/IDRs in six different functional categories depending on the presence/absence and the strength of the binding of the disordered proteins/regions to their ligands [29,91]. Later, this stratification was further extended to define eight functional classes of IDPs, namely entropic chains, modification sites, disordered chaperones, molecular effectors, molecular recognition assemblers, molecular recognition scavengers, metal sponges, and unknown, as shown in Figure 2 [92]. These functional subtypes can be present either alone or in combination within the same protein if the protein has several disordered regions [59,60,61]. In the following sections, different functional classes are described in some detail with relevant examples.

2.4.1. Entropy Chains

Entropic chains take advantage of their conformational flexibility and perform biological functions without becoming a structured entity [93,94]. Examples of entropic chains include flexible linkers and spacers [29,91]. Flexible linkers regulate the relative movement of domains positioned at the two ends of the linkers, whereas spacers specify the inter-domain distances. In addition to providing a means of distance regulation between domains/regions within proteins, both flexible linkers and spacers enable extraordinary freedom for inter-domain orientations. In general, the conformational ensembles generated through computational simulations and modeling are used to understand the flexibility and positioning of the domains in multidomain proteins [95].
Several previous studies showed the significance of conformational flexibility for IDPs/IDRs to function. For instance, a mutation in the flexible linker region connecting light and heavy chains of multidomain antibody protein reduces the linker’s flexibility, disturbs the domain’s orientation, and limits their interactions [96]. In another study, disordered and flexible linker regions of the host proteins have been shown to be the primary targets of the oncogenic adenoviral Early region 1A (E1A) protein [97]. Replication protein A subunit (70 kDa) manifests a conserved dynamic behavior regardless of the insignificant sequence conservation, which is also evidence of the importance of flexibility from the perspective of functional relevance [98]. The microtubule-associated protein 2 (MAP2) projection domain facilitates the spacing of cytoskeletal structures by repelling molecules that approach microtubules [99]. The disordered regions in the mammalian CIP/KIP family of proteins act as flexible linkers or spacers, and have a conserved propensity to remain in the disordered state [100]. Along with the kinetic properties of the binding partners, the linker flexibility and length are critical factors in determining the processivity of enzymes [101].

2.4.2. Modification Sites

Most of the regulatory and signaling proteins possess IDRs, and the intrinsic flexibility of these regions as display sites affords them an advantage over ordered regions [10,13,36,59,90]. Furthermore, the prevalence of charged side chains and low-complexity regions in the disordered proteins facilitates their multivalent electrostatic interactions with the charged lipid head groups in the membranes. Thus, both the composition of membranes and the disorderedness of IDPs/IDRs fine-tune the signaling pathways [102]. In several independent studies, approximately 15% of all disordered proteins have been reported as lipid-binding proteins [103,104].
The flexibility of IDRs helps them to undergo transient, but specific, interactions with enzymes carrying out various post-translational modifications [105,106,107]. Moreover, this intrinsic flexibility of IDRs facilitates the easy entry and recognition of post-translational modifications within IDRs by effector molecules, leading to subsequent outcomes, such as protein stability, turnover, and localization within a cell upon binding [102,105,108]. p53, p27, histone protein tails, and the CREB-kinase-inducible domain are a few examples of well-explored IDPs where PTMs are critical for functioning and regulation [108,109,110]. Recently, phosphorylation at multiple sites within the disordered regions of four representative IDPs (Ash1, E-Cadherin, CTD2, and p130Cas) has been reported to regulate the IDR-mediated protein–protein interactions through conformational plasticity [111]. Furthermore, site-specific phosphorylation within the intrinsically disordered AF1 domain of glucocorticoids receptor (GR) generates surfaces appropriate for the interaction of the AF1 domain with coregulatory binding proteins (BP), which, in turn, regulates the transcription of the downstream genes involved in GR signaling [112]. Additionally, the methylation of arginine residues in low-complexity and disordered regions has been shown to regulate genome integrity, gene transcription, splicing, mRNA–protein complex biology, protein translation and stability, and phase separation [113].

2.4.3. Disordered Chaperones

Proteins that help RNAs and other protein molecules to attain their functional folded state are known as chaperones [114,115]. About half of the RNA chaperone sequences and one-third of the protein chaperones are disordered in nature. The ability of IDRs to adapt their structures according to different binding partners and increase the lifetime of the encounter complex during binding supports their chaperone activity [116,117]. Disordered chaperones prevent toxic aggregation by quickly binding and solubilizing misfolded proteins. In addition, the transient binding of unfolded or misfolded substrate molecules to disordered chaperone regions helps them to fold/refold correctly in a thermodynamically permissible manner [118]. hnRNP A1, GroEL, α-crystallin, Hsp33, etc. are a few well-known examples of IDPs with chaperone activity [119,120].
RNA chaperones enriched with positively charged residues promote the compactness of nucleic acids conformation by acting as a counterion that locally shields repulsive electrostatic repulsions [121]. For instance, the dimerization of the hepatitis C virus genome is facilitated by its core protein with chaperone activity [121]. Additionally, disorder-enriched RNA binding proteins (RBP) have been found to stabilize the disorderedness of the interacting IDPs/IDRs and maintain their functional competence, thus making the disordered state of the human proteome available for drug targeting [122]. Furthermore, IDRs present in the human chaperones Hsp110s, Hsp105α, and Apg-1 prevent general protein aggregation and amyloid genesis [123]. With aging, the disordered chaperone-mediated quality control of intrinsically disordered proteome is lost, leading to the accumulation of protein aggregates responsible for neurodegenerative diseases [124]. In plants, dehydrins, the largest group of disordered chaperones, are expressed at higher levels under various abiotic stress conditions, such as drought, osmotic stress, or high temperatures. An intricate interplay of ordered and disordered segments of these proteins is required for proper cellular protection [125]. Additionally, disordered chaperones have been reported to lose flexibility upon binding to a substrate, which helps them to act as a molecular on/off switch, as characterized in Hsp33 [126,127,128].

2.4.4. Molecular Recognition Effectors

Effector functional class disordered regions bind permanently to other proteins and modify their actions. Such IDRs often undergo disorder to order conformational transition upon binding to their interacting partners, which is known as ‘coupled folding and binding’ [129,130,131,132,133]. The degree of folding upon the development of intermolecular contacts may vary significantly. The disordered protein partners may either fold completely or remain as an extended structure with short flanking ordered regions [134]. Recently, a ubiquitin-binding domain (referred to as DisUBM) was identified in many proteins that remained mostly disordered in a free state, and attained an α-helical structure upon binding to ubiquitin. Such naturally occurring DisUBMs are considered as general affinity enhancers of IDPs that can bind to fold proteins with a possibility of ubiquitinylation [135]. In general, disordered regions have a kinetic advantage over folded segments, even after intermolecular interactions are formed during the folding-upon-binding process [136].
Effector-disordered regions can also alter the functioning of the other parts within the same protein in two different ways: (i) competitive interactions and (ii) allosteric modulation. For example, the disordered GTPase binding domain (GBD) of Wiskott–Aldrich Syndrome protein (WASP) controls its autoinhibition through competitive binding [137]. Upon binding to cdc42 protein there is an enhanced interaction of the WASP GDB domain with the actin cytoskeleton regulatory machinery. However, GDB interaction with the actin machinery is inhibited as it folds back on the other region of WASP. In another study, IDRs of general transcription factors and transcriptional regulators were shown to compete for the folded TAZ1 domain of CREB-binding protein (CBP) in response to various types of stress stimuli. For instance, under oxygen-deficient conditions, the hypoxia-inducible transcription factor (HIF-1α) C-terminal transactivation domain drives the transcription of different critical adaptive genes. One such gene translates to HIF-1α negative feedback regulator protein CITED2, which represses the transcriptional activity of HIF-1α by directly competing for CBP’s TAZ1 binding site [138]. Allosteric coupling in disordered regions can be represented by the binding of adenovirus E1A oncoprotein with retinoblastoma protein (pRb) and the TAZ2 domain of CBP [139]. The binding of pRb to E1A increases the probability of CBP binding, whereas the initial binding of CBP to E1A decreases the possibility of pRb binding. In addition, the globular KIX domain of the CBP is known to bind cooperatively with CREB kinase-inducible transactivation domain (pKID), the activation domain of transcription factor c-Myb, and mixed-lineage leukemia (MLL) protein. The binding of the pKID and c-Myb to the KIX domain is mutually exclusive and regulated by the allosteric binding of MLL [140,141,142]. These results also suggest that the allosteric coupling can also be determined by the stability of the individual protein conformations achieved when interacting partners are binding [143,144].

2.4.5. Molecular Assemblers

As the name suggests, assemblers interact with multiple binding partners to encourage the formation of higher-order protein complexes, such as the ribosome, activated T-cell receptor complexes, and transcription pre-initiation complexes [91,145,146]. An assembly of various interacting partners of large complexes is feasible due to the multiple types of disordered functional regions, such as short linear peptide motifs (SLiMs), molecular recognition features (MoRFs), etc. In fact, with an increase in the size of the protein complexes, the disorder content of the assembler protein increases [147]. Furthermore, upon binding to their partner proteins, disordered assemblers maintain their open structure, allowing multiple proteins to bind to a single IDR [148,149]. Therefore, the overall architecture of assemblies formed by IDPs depends on the disorder content, the available number of interaction sites, and the nature of the binding molecules [150].
The assembler function of IDPs is an outcome of their multivalency and can be achieved in two distinct ways. Firstly, IDPs can act as the binder molecule by tethering all of the interacting proteins and stabilizing the complex. For example, ribosome assembly involves the cooperative binding of proteins enriched with IDRs and RNA [151]. Secondly, IDPs can also regulate the spatiotemporal assembly of the different signaling partners by acting as a scaffold, such as the IDP-mediated formation of biomolecular condensates, nuclear pore complex, and cytoskeletal assemblies [2,152,153]. Among all the functional categories, the scaffolding regions of the assemblers contain the highest degree of disorder [149].

2.4.6. Molecular Recognition Scavengers

This functional class of intrinsically disordered proteins/regions stores and neutralizes small ligands. For example, caseins and other calcium-binding phosphoproteins (SCPPs) are highly disordered proteins that can act as scavengers by solubilizing calcium phosphate clusters in milk and other biofluids [154]. The proline-rich salivary gland glycoproteins also belong to the molecular recognition scavengers class of IDPs; they function by binding and neutralizing the tannin molecules in digestive tracts [29]. Plant dehydrins are highly disordered proteins expressed at elevated levels in the late stages of embryogenesis. The VviDHN4 isoform of dehydrin acts as a scavenger by removing reactive oxygen species from the cellular environment [155]. Human peroxiredoxins-4 protein, most highly expressed and localized exclusively within the endoplasmic reticulum, acts as its major hydrogen peroxide scavenger [156]. Other examples of IDPs with scavenger activity include chromogranin protein A, which is responsible for sequestering adrenaline and ATP in the medulla of the adrenal gland [154].

2.4.7. Metal Sponges

IDPs/IDRs that are capable of storing and neutralizing heavy metals belong to the metal sponges category, such as the sulfonation of catecholamines in humans by catecholamine sulfotransferase enzyme (Sulfotransferase 1A3). This reaction is a detoxification pathway because it readily forms excretable water-soluble metabolites [157]. Additionally, the SmbP protein of the ammonia-oxidizing bacterium Nitrosomonas Europaea, binds to divalent cations, especially copper, to prevent cellular toxicity [158]. Similarly, the crystallin proteins from the bacteria Yersinia Pestis act as metal sponges by managing the cellular copper levels [159].
All of the other IDPs/IDRs with no experimental support or evidence proving/disproving their role in protein function were assigned to the unknown class. Upon functional annotation, the members of this class are transferred to one of the above-mentioned functional categories.

2.5. Functional Elements of Intrinsically Disordered Proteins

Various functional regions in intrinsically disordered regions have been revealed when studying different classes of functions carried out by IDRs. In general, the functional modules within IDRs have been classified into three categories: (i) Short Linear Motifs (SLiMs), (ii) Molecular Recognition Features (MoRFs), and (iii) Intrinsically Disordered Domains (IDDs) [78,160,161,162,163,164,165]. The classification and features of each of these functional modules are shown in Figure 2. The following sections discuss these functional modules of disordered proteins/regions interactions.

2.5.1. Short Linear Motifs

Short Linear Motifs (SLiMs), also known as MiniMotifs or Linear Motifs (LMs), are 3–10-residue-long peptide segments that occur within IDPs [160,166,167]. SLiMs typically comprise a stretch of loosely conserved amino acids interspersed with a few highly conserved residues [160]. As a consequence, interactions mediated by an individual SLiM motif have weak affinities. However, multiple SLiMs frequently coordinate to produce higher-affinity dynamic interfaces. Even though SLiMs are short sequences with transient binding capabilities, they are indispensable to a protein’s binding precision and functioning. The Eukaryotic Linear Motif (ELM) initiative aims to characterize and annotate motifs, mostly focusing on, but not limited to, eukaryote proteins. At present, 1,000,000 SLiMs have been predicted across the human proteome, but only a small fraction of it has been explored in detail to date [166,168,169,170,171].
Owing to their crucial role in regulating signal transduction and protein–protein interactions, SLiMs are considered as functional in almost every biological pathway. They can interact with a variety of biological entities, including globular protein domains, intrinsically disordered proteins or regions, RNA, lipids, etc. Broadly, SLiMs can be classified into two distinct groups: modification sites and ligands, with each having multiple subgroups [2]. If a linear motif act as a modification site, catalytic domains of enzymes interact with SLiMs to catalyze cellular processes, such as post-translational processing, the addition or removal of moieties, and structural alteration of the peptide backbone [172,173]. In contrast, when linear motifs behave as ligands, they act as complex-promoting motifs required for scaffolding and increasing the avidity of interactions with binding partners. Additionally, SLiMs can act as a docking motif to increase the specificity and efficiency of modification events, and as a targeting/trafficking motif to provide stability to proteins and facilitate their subcellular localization and trafficking [174,175,176,177,178,179]. For example, in the family of Rho-activated kinases (Raks), kinase function activation and uniform subcellular localization is an outcome of cleavage by caspases at SLiMs identified in disordered regions [180]. The binding of the catalytic and regulatory subunits of protein serine/threonine phosphatases (PPP) and their substrates depends on SLiMs [181]. The interaction of human-origin recognition complex subunit ORC1 and licensing factor CDC6 during pre-initiation complex formation and its regulation by CDKs are also mediated by SLiMs [182,183].
Furthermore, SLiMs can also act as molecular switches by allowing the mutually exclusive binding of different partners at the protein’s same or distinct SLiM sites [184,185,186]. Such SLiM features are essential to bring about the context-dependent moonlighting functions of IDPs.

2.5.2. Molecular Recognition Features

Molecular Recognition Features (MoRFs) or Elements (MoREs) are 10–70-residue-long peptide motifs that can occur in the disordered regions of proteins [163,164]. These motifs undergo disorder-to-order transition upon binding with their partners (i.e., folding upon binding) and attain well-defined structures, such as α-helices (α-MoRFs), β-strands (β-MoRFs), γ-coils (γ-MoRFs), or mixtures of all of these conformations (complex-MoRFs) [162,163,187]. The final conformation achieved after transition may occur either due to the conformational preference of unbound MoRFs or the induced folding after binding [78,130,188]. The structural preorganization of MoRFs is modulated mainly by the sequence itself [189]. Therefore, sequence analysis of MoRFs from the structural, conformational, and interaction mechanism aspects will help to understand the properties of MoRFs [190]. A recent detailed computational investigation of 868 complete proteomes showed that 29% of IDRs of bacteria and archaea and 21% of eukaryotes harbor MoRFs [191].
p53 protein is a classic example of a protein with multiple MoRFs in a disordered state attaining multiple conformations upon interaction with different binding partners [163,164,192,193]. MoRF at the N-terminus of p53 protein is the primary binding site of Mdm2 and the other 40 proteins. The MoRF between residues 40 and 60 is the binding site of the Mdm2 and RPA70 proteins. C-terminal MoRF binds to multiple proteins, including S100B and the cdk2–cyclin-A complex. Each of these unique combinations of MoRF and the interacting proteins results in a distinct conformation of p53.
Moreover, the T-cell surface glycoprotein CD3 zeta chain has two MoRFs, referred to here as N-terminus and C-terminus MoRF. While tyrosine kinase, tyrosine phosphatase, and nef protein bind to N-terminal MoRF, the SH2 domain of the SHC protein binds to the C-terminus MoRF [194]. Additionally, the ethylene response factor in plants has several identified MoRFs, and the binding of different interacting proteins to these motifs helps the plants to overcome multiple stresses and adapt to different biological niches [195]. In conclusion, a single MoRF can bind structurally diverse sets of protein partners to give rise to multiple distinct conformations.

2.5.3. Intrinsically Disordered Domains

Sequence-based approaches have identified various protein domains that are entirely or largely disordered both in isolation and solution. They are mostly involved in the DNA, RNA, and protein binding (e.g., Wiskott–Aldrich syndrome protein (WASP)-homology domain 2 (WH2) of actin-binding proteins) [196,197,198]. In general, the intrinsically disordered domains are a long region of disorder (>20 residues) with conserved function, conserved sequence, and conserved disorder [196,197,198,199]. Moreover, at the genomic level, the protein domains acquired during evolution as an outcome of the extension of the existing exons or exonization of previously non-coding regions tend to be highly disordered [200]. Therefore, the exonization of the previously non-coding regions could be one way of incorporating disordered segments into proteins. Some well-known examples of disordered domains include cell cycle regulatory proteins p21, p27, p57, etc. The disordered domains of these proteins have been experimentally and computationally verified [201,202].
Interestingly, specific disordered regions have been observed to co-occur frequently with particular types of structured domains in the same sequence [203,204]. Some globular domains require the presence of short, disordered regions in their vicinity, while others require disordered segments at specific locations with respect to the domain boundaries [203]. These findings suggest that the functional module in proteins can comprise either disordered/ordered regions alone or in combination. For example, ER degradation-enhancing alpha-mannosidase-like proteins (EDEM) recognize misfolded proteins and direct them to the ER-associated protein degradation (ERAD) process. EDEM3 degrades the misfolded proteins with the help of both structured domains (mannosidase homology domain (GH47), intermediate (IMD), and protease-associated (PA)) and intrinsically disordered domains [205]. Furthermore, the Bcl-2 family mediates apoptosis, and the maintenance of stem-like behavior of glioblastoma multiforme spheres by RBM14 protein also requires both structured and disordered domains [206,207]. Therefore, studying the co-occurrence of structured domains and IDRs in the same protein may prove insightful in understanding the unannotated disordered segments of proteins with respect to their biological role.

3. Experimental Approaches for Assessing Intrinsic Protein Disorder

Intrinsic protein disorder can be recognized and characterized by various direct and indirect (bio)physical methods. In contrast to direct techniques, which provide structural information about proteins, indirect approaches do not offer any structural details. Still, they suggest a behavior from which the disordered nature of the proteins can be inferred.

3.1. Indirect Methods

Early understanding of the intrinsic structural disorders of proteins was based on a few simple techniques. In general, these indirect intrinsic disorder-identification approaches can quickly provide ample insight into the structural states of a protein or its segments.
Because of the unusual amino acid composition and lack of a compact hydrophobic core, disordered proteins are evident during the purification process. Usually, the molecular mass (Mw) of IDPs estimated by sodium dodecyl sulfate–polyacrylamide gel electrophoresis (SDS–PAGE) is higher by a factor of 1.2–1.8 in comparison with that measured by mass spectrometry [29]. Indeed, due to the enrichment of acidic residues and extension in solution, IDPs bind less to SDS and migrate more slowly on the gel in comparison with globular proteins [208]. The aberrant mobility of IDPs is also observed in size-exclusion chromatography (SEC) or gel-filtration (GF) experiments, as a result of which the apparent Mw of proteins with disordered regions is higher [45]. Furthermore, the flexible regions of proteins are known to have increased sensitivity to proteolytic degradation. IDPs, which are more affected than ordered proteins, based on limited in vitro proteolysis, exhibit high inherent flexibility [29,42,45,72].
Other peculiar biochemical behaviors of IDPs/IDRs include insensitivity to high temperatures and stability under acidic treatment. The resistance of IDPs/IDRs to boiling temperatures and acidic pH values has been ascribed to their lower contents of hydrophobic residues and enrichment of polar/charged residues, respectively [209,210,211]. Neutralizing acidic groups at lower pH levels reduces the net charge on IDPs/IDRs, leading to their increased solubility and a more compact structural state [45]. In contrast to IDPs/IDRs, the aggregation/precipitation of globular/ordered proteins occur at elevated temperatures and under low-pH conditions. While high-temperature conditions expose the hydrophobic core of ordered proteins, acidic conditions cause protonation of their negatively charged side chains, leading to charge imbalances, followed by the disruption of salt bridges and aggregate formation [29].

3.2. Direct Methods

Several techniques provide both steady-state and dynamic structural information on IDPs/IDRs at the residue level. These methods capitalize on the significantly distinct conformational behavior of IDPs compared with that of globular proteins [29]. Some of the most commonly used direct methods are as follows:

3.2.1. X-ray Crystallography

The diffraction intensity and X-ray pattern scattered by electrons in the protein structure are used to construct a three-dimensional (3-D) model of electron density, which, in turn, is used to deduce the atomic nuclei positions in the protein molecule [29]. Disordered regions in X-ray structures appear as missing regions [28]. This method can provide protein structure resolution down to 1Å. Still, additional experimental support is required to be certain about the structural disorder, as missing electron density regions can also result from technical failures in crystallography [212].

3.2.2. Circular Dichroism (CD)

Circular dichroism (CD) is an absorption spectroscopy-based approach that relies on measuring the difference in the absorption spectra of right-handed and left-handed circularly polarized light. Optically active chiral molecules preferentially absorb either right-handed or left-handed circularly polarized light. Near-UV (250–350 nm) and far-UV (190–230 nm) CD signals are generally used to determine different aspects of the structure of proteins in solutions. The near-UV CD spectrum represents the tertiary structure around aromatic residues Phe, Tyr, and Trp [213,214]. While intense and detailed spectra characterize ordered proteins, those of IDPs are of low intensity and low complexity. The far-UV CD spectra of the secondary structural elements of proteins are quite distinct; therefore, they are used to determine the proportion of ⍺-helix, β-sheet, turn, PPII helix, and coil conformations in proteins [215]. If the far-UV CD spectrum is indicative of predominantly coil conformations, it indicates the disordered nature of the protein. In the case of proteins having both disordered and ordered regions, the CD does not provide clear information, as it lacks residue-specific details [28].

3.2.3. Nuclear Magnetic Resonance (NMR)

NMR is the most common quantitative technique used for studying IDPs. The spinning ability of the charged atomic nuclei forms the basis of the 3-D structure determination of proteins in solutions using NMR. The directions of these spins are random, but the application of the external magnetic field can align these nuclei in directions either parallel or antiparallel to the applied magnetic field. These two states of nuclei have different energy levels, a low-energy state and a high-energy state. The low-energy state attains a high-energy state upon irradiation with electromagnetic radiation, and free inductive decay (FID) is obtained as the nuclei undergo relaxation. Fourier transformation of FID results in a NMR spectrum with peaks from different types of nuclei in the molecule, which, in turn, is used to characterize the local covalent and spatial arrangement of atoms [29,47,216,217].
In a protein, each nucleus of the individual residues experiences a different magnetic field depending on its microenvironment (referred to as the ‘shielding effect’ or ‘chemical shift’). The chemical shift of the peptide backbone (1H, 13CO, 13C, and 13Cβ) can be used to determine the secondary structure type of the given peptide segment [218,219]. Amino acids in ordered proteins are packed in different kinds of chemical environments, as a result of which their NMR spectrum resembles a combination of spectra of various secondary structure elements. In contrast, the NMR spectra of disordered proteins with extensive conformational averaging appear as a summation of the random coil spectra of residues of proteins [216,218]. In addition to the fine structural details, NMR also provides specific information at the residue level [28].

3.2.4. Small-Angle X-ray Scattering (SAXS)

The SAXS technique can quickly define the structural characteristics of proteins of sizes ranging from a few kilo-Daltons to several giga-Daltons under various experimental setups [220,221,222,223]. Briefly, this method involves exposing samples placed in quartz capillary tubes to a collimated monochromatic X-ray beam source and capturing scattered photons with a detector [224]. Comparative analysis of the electron density distributions of the protein sample and pure solvent/buffer is then conducted to determine various parameters of the proteins in the solution, such as the molecular mass, volume, radius of gyration, folding state, etc. [220]. Moreover, SAXS data can also be used to define protein flexibility and the intrinsically disordered state of proteins in solutions [223,225]. The scattering profiles of the proteins obtained from SAXS experiments are most commonly represented as Kratky plots (s2I(s) as a function of s, where s and I represent the momentum transfer function and scattering intensity), which are used to obtain structural insights into the protein.
In contrast to globular proteins’ bell-shaped Kratky’s plot with well-defined maxima, disordered protein-specific Kratky plots exhibit a plateau for a given range of the momentum transfer function (s), followed by a monotonic increase [226,227]. Additionally, the experimentally determined radius of gyration (Rg) of IDPs from the SAXS curve can be directly compared with the theoretical or experimental Rg values of a globular and random coil for a given number of residues. The Rg values of IDPs lie between those of highly compact globular proteins (lowest Rg values) and completely disordered/unfolded proteins represented by random coils (highest Rg values) [228]. Altogether, this method offers fast structural characterization of proteins in solutions with a relatively easy sample preparation protocol and can capture data under near-native conditions [229,230]. As the sensitivity of SAXS depends on the particle size, prior removal of the macromolecular aggregates during sample preparation using a method such as sedimentation or size-exclusion chromatography is suggested [231].
Finally, it is worth mentioning that SAXS-based studies of IDPs can use valuable a priori complementary information from several other experimental and in silico protein structure determination methods. For instance, X-ray crystallography depicts the structured regions of a protein, while SAXS defines the protein segments with missing electron density [232]. Similarly, NMR provides information about different domains/complex sub-units during analyses of bimolecular complexes and multi-domain proteins, and SAXS defines their relative inter-domain positions [233]. Furthermore, other complementary techniques, such as CD, spectroscopy, chromatography, etc., and SAXS, can be used for the biophysical characterization of IDPs [234]. A low-resolution protein structure defined through the ab initio modeling of SAXS data alone can be further refined using inputs from protein structure prediction tools, such as I-TASSER, CORAL, etc. [235,236]. Recently, various protein structure determination/prediction techniques and SAXS have been used to characterize partially disordered mycobacterial ESX-secretion-associated protein K (EspK) [237].

3.2.5. Cryo-Electron Microscopy (Cryo-EM)

In the last five years, the research area involving the structural characterization of proteins and other biological entities has been revolutionized by the development of cryo-electron-microscopy-based techniques [238,239,240,241]. These methods overcome the limitations of primary methods, i.e., X-ray crystallography and NMR, and allow the structural characterization of relatively large, structurally heterogeneous, flexible, and dynamic assemblies at sub-nanometer atomic resolution (below 4 Å) [241,242,243]. Typically, a cryo-EM workflow contains three main steps: (a) vitrification (rapid cooling without ice crystal formation) of specimens in an aqueous solution, (b) image acquisition at a low electron dose using electron microscopy, and (c) 3D model reconstruction and validation. Single-particle analysis (SPA) and sub-tomogram averaging (STA) models are most commonly used for the structural annotation of proteins [244]. However, while the globular/ordered/structured regions of proteins can be structurally resolved using cryo-EM, the predicted intrinsically disordered regions in the proximity of flexible regions escape structural assignment [245]. Therefore, similar to x-ray crystallography, a high degree of intrinsic disorder restricts the implementation of cryo-EM techniques. Alternatively, the structure and dynamics of IDPs/IDRs can be investigated by complementing higher-resolution NMR studies of IDRs with the modeling capabilities of cryo-EM [246,247]. In conclusion, 3D cryo-EM maps in conjunction with high-resolution data from NMR can model IDPs under physiologically relevant conditions and provide insights into their functional behavior [243].

4. Computational Tools for Disorder Prediction

The biased amino acid compositions and peculiar sequence characteristics of IDPs/IDRs have encouraged the development of various reliable computational tools for studying intrinsic protein disorders. As a result, disorder predictors have been grouped into three distinct classes based on the underlying concepts.

4.1. Propensity-Based Predictors

In principle, a disorder predictor is classified as propensity-based if it depends on some essential physical or chemical characteristics of residues or on prior knowledge of the biological background of intrinsic protein disorder. Disorder-predicting tools, such as FoldIndex, NORSp, GlobPlot, CH plot, and PreLink belong to this category [45,245,246,247,248,249,250,251].

4.2. Machine Learning Algorithms (MLAs) Based Predictors

This class of advanced predictors relies on algorithms trained on data sets of experimentally characterized disordered regions and can differentiate disorder and order encoding sequences [29]. Currently, the experimentally characterized disordered proteins are publicly available on three databases: MobiDB (http://mobidb.bio.unipd.it/; accessed on 7 November 2022), IDEAL (https://ngdc.cncb.ac.cn/databasecommons/database/id/198; accessed on 7 November 2022), and DisProt (http://www.disprot.org/; accessed on 7 November 2022) [92,252,253]. PONDR, Spritz, DisEMBL, RONN, and DISOPRED are a few predictors that fit into this category [254,255,256,257,258,259].
Recently, the field of protein structure prediction has been revolutionized by the development of the deep learning-based method AlphaFold [260]. This software generates a per-residue confidence score (pLDDT) based on the protein’s amino acid sequence. The most recent version of this tool, i.e., Alphafold2, has been reported to achieve protein structure prediction accuracy competitive with that of experimental determination [261,262,263]. However, this program gives a low confidence score (pLDDT < 50) for intrinsically unstructured or disordered proteins/regions, and the inconclusive predicted structure resembles a ribbon. In addition, this method does not anticipate the relative likelihood of diverse IDP conformations and the folding pathways followed by IDPs/IDRs attaining an ordered structure upon interaction with other biomolecules [70,264]. At present, the AlphaFold Protein Structure Database is considered as the most complete and precise representation of the human proteome [265,266].

4.3. Inter-Residue Contact-Based Predictors

Predictors based on the idea that IDPs/IDRs are disordered because they cannot make enough inter-residue contacts required to compensate for the loss of configurational entropy during folding are grouped together as inter-residue contact-based predictors. The above conclusions may be derived by either simple statistics involving contact numbers or through sophisticated techniques of determining the total stabilization energy of a protein. Computational tools, such as IUPred, FoldUnfold, and Ucon belong to this class [267,268,269,270].
At present, there is no “best” disorder prediction computational tool. Therefore, to avoid the limitations of a given tool, prediction results from different disorder predictors relying on distinct principles should be combined to provide a consensus prediction, as implemented by meta-predictors (for example, PONDR-FIT) [271]. Alternatively, publicly available meta-servers (for example, MeDor and metaPRDOS can also be used for quick and simultaneous analysis of protein disorder using multiple predictors [272,273].
In several recent articles, extensive comparisons of various computational disorder prediction methods’ performance and comprehensive online resources useful for studying IDPs/IDRs were provided [274,275,276].

5. Evolution of IDPs/IDRs

The evolution of proteins involves changes in the form of insertions, deletions, or substitutions in their amino acid sequences. Over time, such changes can accumulate in the proteins, giving rise to taxonomic classes having substantial differences in their amino acid compositions [277]. In general, the structure and function of proteins are well conserved, but several exceptions exist. Several previous studies suggested that, even if the protein sequence diverges extensively, the protein function is well-conserved [278]. Hence, proteins are generally considered as the ‘chemical fingerprints’ of evolutionary history, as they manifest the underlying genetic changes as amino acid sequences.
The evolution of intrinsic disorder exhibits a wavy pattern in which highly disordered primordial proteins with predominantly RNA-chaperone-like activities were slowly replaced with highly structured proteins [118,279]. Later, because of its peculiar features regarding the regulation of complex cellular processes, protein disorder was reinvented at various succeeding evolutionary stages, resulting in the creation of more complex organisms from the last universal ancestor [280,281].
Several mechanisms, such as de novo generation, horizontal gene transfer, and lateral gene transfer, can give rise to genes that encode IDPs [57,282]. Approximately 14% of Pfam domains, predicted to be mostly disordered and shared by many protein families, appear to have originated from domain duplications and module exchange between genes [196]. The high frequency of occurrence of tandemly repeated sequences in IDPs/IDRs suggests that the expansion of internal repeat regions (microsatellite and minisatellite coding regions) is another possible way by which the IDPs encoding genes arose [283,284]. Looking at the exceptional functional variability conferred to IDPs/IDRs due to the genetic instability of repetitive elements, the mechanism of the extension of repeat elements appears as the frequent method of disorder spread during evolution and rapid genomic changes in adaptation [36,285,286]. Furthermore, these IDPs/IDRs can also act as hot spots for mutations, leading to the loss of different functional modalities and thus resulting in various types of diseases, including cancer [287,288,289]. Seera and Nagarajaram have recently shown that the disease-causing missense mutations within IDRs reduce the overall conformation heterogeneity of the IDRs as compared to their wild type counterparts, and the few ‘locked’ dominant conformations presumably limit their interaction with the cognate partners [290].
Recent studies have shown that disordered protein segments are encoded by GC-enriched gene regions, which, in turn, directly correspond to the disorderedness of the encoded proteins [291,292]. This GC enrichment is due to the prevalence of amino acids coded by GC-rich codons (G, A, R, and P) in the disordered regions of proteins [291]. At the residue level, a relatively higher rate of evolutionary changes in the disordered regions of proteins was observed compared with that in the ordered/globular domains, as there were no structural constraints to maintaining a 3-D structure [293]. However, in certain cases, structured domains and disordered regions of proteins have been observed to co-evolve at higher rates [294,295]. Despite these rapid changes, the biological functions of the structured domains and disordered regions are always conserved [296]. Hence, a deeper understanding of the conformation ensemble–function relationship will help to decipher the evolutionary trajectory of IDPs.
Based on the conservation of sequences coding for protein disorder, disordered residues have been classified as constrained (both sequence and disorder are conserved) or flexible (only protein disorder is conserved). Together, constrained and flexible disorder residues are known as conserved disorder. On the other hand, if neither disorder nor the residues encoding it are conserved, such a disorder class is known as non-conserved disorder [297]. This integrated structural and evolutionary approach has recently been used to define the determinants of the functional adaptability of the neutrophin family of proteins involved in neuronal development [298].
Considering that the disordered regions in proteins have a distinct amino acid composition and evolutionary rate as compared with that of ordered regions, the substitution frequencies of residues in the disordered regions must also be distinct from those found in ordered regions. Thus, identifying the evolutionary and functional features of IDPs/IDRs has become a computational challenge, as most of the sequence analysis tools and parameter optimization procedures are aimed at ordered/structured regions of proteins. Recently, methods evaluating disordered proteins’ molecular features and sequence composition in a position-specific manner have been developed. These advancements have allowed researchers to pursue alignment-based evolutionary studies on IDPs/IDRs without aligning the residues discretely [299,300,301].

6. IDPs/IDRs in Diseases

Like structured proteins, the expression, localization, and interactions of intrinsically disordered proteins (IDPs) are also highly coordinated and regulated. Multiple checkpoints at various stages of the expression of IDPs-specific genes (from transcript synthesis to protein degradation) ensure the availability of IDPs in appropriate quantities and for the desired duration, preventing any ectopic interactions [302]. Several studies have shown the role of IDPs/IDRs in different human disorders, including diabetes, cancer, amyloidosis, neurodegenerative, and cardiovascular diseases [303,304]. Some well-studied examples of IDPs associated with human disease are p53, Mdm2, PTEN, c-Myc, AF4, BRCA1, EWS, Bcl-2, c-Fos, HPV oncoproteins, etc. [303,305,306,307]. Moreover, the deposition of ⍺-synuclein, tau, and amyloid-β proteins leads to Alzheimer’s disease, the accumulation of ⍺-synuclein results in Parkinson’s disease, and aggregates of PrPSC cause prion diseases. The expansion of CAG triplet repeats in disease genes, which introduces disorder, results in the family of polyQ diseases, such as Kennedy’s disease, Huntington’s disease, etc. [308,309,310,311,312,313,314].
In the last two decades, the role of IDPs in human diseases has been actively studied, giving rise to new mechanistic findings that have led to the formation of the D2 concept (‘Disorder to Disorders’) [303]. Several comprehensive reviews and thematic series articles have been published covering the significance of IDPs in diseases [315,316,317]. For instance, Coskuner and Uversky described various hypotheses proposed to explain the molecular mechanisms of the pathogenesis of Alzheimer’s and Parkinson’s diseases and suggested the need for the development of new techniques through the integration of quantum and statistical mechanics, thermodynamics, bioinformatics, and machine learning approaches, which, in turn, may lead to the development of new experimental approaches [318,319,320,321,322,323,324,325]. However, at present, there are several limitations and challenges associated with in silico studies of IDP-associated neurodegenerative disorders [326,327]. Another study found that an NADH-stabilized 26S proteasomal complex could degrade IDPs efficiently. Therefore, the accumulation of disease-causing disordered proteins, such as tau, c-Fos, p53, etc., can be prevented by the selective degradation of IDPs in an ATP-independent manner [328]. Moreover, the analysis of components of the ATP-dependent ubiquitin-proteasome degradation system (UPS) revealed the importance of the disorder content and MoRFs of the complex in neurodegenerative disorders and cancers [329]. However, identifying key mutations, PTM sites, and functional motifs in the disordered regions, exploring the evolutionary history of IDPs involved in diseases, understanding the cooperative functioning of ordered and disordered domains, and dissecting the IDPs’ interactome are some of the many active research areas involving IDPs/IDRs and diseases [288,330,331,332,333,334].

7. IDPs/IDRs as Drug Targets

With increasing evidence of their involvement in molecular functions complementing globular domains, essential biological processes, protein–nucleic acid interactions, protein–protein interactions, and diseases, IDRs/IDPs have emerged as one of the prime targets for drug discovery or repurposing [142,335,336,337,338,339]. However, IDP characteristics, such as a lack of a sTable 3D structure, very high flexibility, conformational ensembles, susceptibility to proteolytic cleavage, protein aggregation, etc., limit the application of the most-established experimental assays and computational methods that would otherwise work for ordered/globular proteins [340,341,342,343,344]. Therefore, IDP-specific drug screening/development is mainly a tradeoff between binding affinity/specificity and the alternation in the functioning of disordered proteins with other features, such as solubility, crowding, efflux, metabolism, etc., a potentially relevant role [345].
Broadly, disordered proteins/regions have been used in drug development procedures by targeting their conformational changes, interactions, and self-aggregating behavior [346]. For example, the inhibitor 10058F4 of Myc proto-oncogene protein (MYC) binds to MYC and prevents conformational disorder-to-order transition, which, in turn, blocks MYC-MAX complex-driven tumorigenesis [33,347,348,349,350]. Similarly, Methyl-CpG-binding domain protein 2 (MBD2) inhibitors restrict the folding of MBD2 upon binding to its partner p66α. This MBD2-p66α is known to regulate the Mi-2/NuRD chromatin remodeling complex involved in promoting metastasis in various cancer cells through epithelial–mesenchymal transition (EMT) [351,352]. In contrast to ordered proteins, the protein–protein interactions involving IDPs offer uneven, shorter, compact, and more mimicable surfaces for the tighter binding of small drug molecules [353,354,355]. In recent times, potential drug molecules have been designed to target either the disordered segment or the binding region of the interacting molecule. For instance, nutlins binding to Mdm2 prevent the interaction of Mdm2 with the disordered regions of p53, which activates the p53 pathway, leading to apoptosis, cell-cycle arrest, and the inhibition of the uncontrolled cell growth of human tumor xenografts [356]. Additionally, an FDA-approved compound, trifluoperazine dihydrochloride, was found to bind to a disordered region of multifunctional protein nuclear protein 1 (NUPR1) and arrest pancreatic ductal adenocarcinoma (PDAC) development [357]. Moreover, the disordered proteins from pathogens can also be targeted to interrupt their interaction with host proteins, which they utilize for their survival and pathogenesis [358]. In a recent review, Santofimia et al. comprehensively described targeting IDPs in various protein–protein and protein–nucleic acid interactions involved in cancer [359]. Furthermore, compounds, such as curcumin, rosmarinic acid, ferulic acid, and safranal, have also been reported to prevent the aggregation of α-synuclein protein by binding to monomers, thus inhibiting the polymerization of these proteins, which results in various neuronal malignancies [360,361]. In summary, deciphering the sequence–ensemble–function relationship of IDPs/IDRs and the development of efficient computational modeling approaches will help to unravel the enormous potential of disordered proteins as drug targets.

8. Conclusions

The spread and versatility of proteins’ functions carried by intrinsically disordered regions within them are phenomenal. The overview of the protein structure–function relationship presented in this review with a focus on various aspects of intrinsically disordered proteins/regions can be very helpful in understanding the fundamentals of biologically active structureless proteins. It also offers a novel perspective for characterizing proteins with unknown functions.
Over the past two decades, the research field of protein disorder has witnessed a revolution with respect to the understanding of various aspects of IDPs/IDRs such as sequence, abundance, structure, function, regulation, evolution, etc. However, there is still a requirement for new theoretical, experimental, and computational models specific to intrinsically disordered regions in proteins that can explain both the diversifying behavior of IDPs and the unifying principle of protein structure–function relationships.

Author Contributions

Conceptualization, R.T. and H.A.N.; writing—original draft preparation, R.T. and H.A.N.; writing—review and editing, R.T. and H.A.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

RT is thankful to the Centre for DNA Fingerprinting and Diagnostics (CDFD), Hyderabad, India, for providing computational facilities and literature while he was a PhD student, which helped in compiling this review. HAN gratefully acknowledges the core funding as well as the DBT-BUILDER funding to the School of Life Sciences, University of Hyderabad, India.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

BP: Binding protein; CBP: CREB binding protein; CD: Circular dichroism; Cryo-EM: Cryo-electron microscopy; E1A: Early region 1A; EDEM: ER degradation-enhancing alpha-mannosidase-like protein; ERAD: ER-associated protein degradation; ELMs: Eukaryotic linear motifs; EMT: Epithelial–mesenchymal transition; EspK: ESX-secretion-associated protein K; FID: Free inductive decay; GBD: GTPase binding domain; GF: Gel filtration; GR: Glucocorticoids receptor; GH47: Mannose homology domain; HIF-1α: Hypoxia-inducible factor 1α; IDPs: Intrinsically disordered proteins; IDRs: Intrinsically disordered regions; IMD: Intermediate domain; MAP2: Microtubule-associated protein 2; MLL: Mixed-lineage leukemia; MoRFs: Molecular recognition features; SLiMs: Short linear peptide motifs; LMs: Linear motifs; PA: Protease-associated; PDAC: Pancreatic ductal adenocarcinoma; PDB: Protein Data Bank; pLDDT: Per-residue confidence score; pRb: Retinoblastoma protein; PPP: Serine/Threonine phosphatases; PTMs: Post-translational modifications; MBD2: Methyl-CpG binding domain protein2; MOREs: Molecular recognition elements; Raks: Rho-activated kinases; RBP: RNA-binding proteins; SAXS: Small-angle X-ray scattering; SEC: Size-exclusion chromatography; SPA: Single particle analysis; STA: Sub-tomogram imaging; MLAs: Machine learning algorithms; MYC: Myc proto-oncogene protein; NMR: Nuclear magnetic resonance; NUPR1: Nuclear protein 1; UPS: Ubiquitin-proteasome degradation system; WASP: Wiskott–Aldrich syndrome protein; WH2: WASP homology domain 2.

References

  1. Babu, M.M.; Kriwacki, R.W.; Pappu, R.V. Structural biology. Versatility from protein disorder. Science 2012, 337, 1460–1461. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Van der Lee, R.; Buljan, M.; Lang, B.; Weatheritt, R.J.; Daughdrill, G.W.; Dunker, A.K.; Fuxreiter, M.; Gough, J.; Gsponer, J.; Jones, D.T.; et al. Classification of intrinsically disordered regions and proteins. Chem. Rev. 2014, 114, 6589–6631. [Google Scholar] [CrossRef] [PubMed]
  3. Luo, Y.; Ye, S.; Li, X.; Lu, W. Emerging Structure-Function Paradigm of Endocrine FGFs in Metabolic Diseases. Trends Pharmacol. Sci. 2019, 40, 142–153. [Google Scholar] [CrossRef] [PubMed]
  4. Lin, D.H.; Hoelz, A. The Structure of the Nuclear Pore Complex (An Update). Annu. Rev. Biochem. 2019, 88, 725–783. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Uversky, V.N. Protein intrinsic disorder and structure-function continuum. Prog. Mol. Biol. Transl. Sci. 2019, 166, 1–17. [Google Scholar] [PubMed]
  6. Mak, H.; Thurston, T.L.M. Interesting Biochemistries in the Structure and Function of Bacterial Effectors. Front. Cell Infect. Microbiol. 2021, 11, 608860. [Google Scholar] [CrossRef]
  7. Pinet, L.; Assrir, N.; van Heijenoort, C. Expanding the Disorder-Function Paradigm in the C-Terminal Tails of Erbbs. Biomolecules 2021, 11, 1690. [Google Scholar] [CrossRef]
  8. Ghag, G.; Holler, C.J.; Taylor, G.; Kukar, T.L.; Uversky, V.N.; Rangachari, V. Disulfide bonds and disorder in granulin-3: An unusual handshake between structural stability and plasticity. Protein Sci. 2017, 9, 1759–1772. [Google Scholar] [CrossRef] [Green Version]
  9. Davey, N.E. The functional importance of structure in unstructured protein regions. Curr. Opin. Struct. Biol. 2019, 56, 155–163. [Google Scholar] [CrossRef]
  10. Bondos, S.E.; Dunker, A.K.; Uversky, V.N. On the roles of intrinsically disordered proteins and regions in cell communication and signaling. Cell Commun. Signal. 2021, 19, 88. [Google Scholar] [CrossRef]
  11. Dunker, A.K.; Babu, M.M.; Barbar, E.; Blackledge, M.; Bondos, S.E.; Dosztányi, Z.; Dyson, H.J.; Forman-Kay, J.; Fuxreiter, M.; Gsponer, J.; et al. What’s in a name? Why these proteins are intrinsically disordered. Intrinsically Disord. Proteins 2013, 1, e24157. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Bickers, S.C.; Sayewich, J.S.; Kanelis, V. Intrinsically disordered regions regulate the activities of ATP binding cassette transporters. Biochim. Biophys. Acta Biomembr. 2020, 1862, 183202. [Google Scholar] [CrossRef] [PubMed]
  13. Brodsky, S.; Jana, T.; Mittelman, K.; Chapal, M.; Kumar, D.K.; Carmi, M.; Barkai, N. Intrinsically Disordered Regions Direct Transcription Factor In Vivo Binding Specificity. Mol. Cell 2020, 79, 459–471. [Google Scholar] [CrossRef] [PubMed]
  14. Jamecna, D.; Antonny, B. Intrinsically disordered protein regions at membrane contact sites. Biochim. Biophys. Acta Mol. Cell Biol. Lipids 2021, 1866, 159020. [Google Scholar] [CrossRef]
  15. Ahrens, J.B.; Nunez-Castilla, J.; Siltberg-Liberles, J. Evolution of intrinsic disorder in eukaryotic proteins. Cell. Mol. Life Sci. 2017, 74, 3163–3174. [Google Scholar] [CrossRef]
  16. Protter, D.S.W.; Rao, B.S.; Van Treeck, B.; Lin, Y.; Mizoue, L.; Rosen, M.K.; Parker, R. Intrinsically Disordered Regions Can Contribute Promiscuous Interactions to RNP Granule Assembly. Cell Rep. 2018, 22, 1401–1412. [Google Scholar] [CrossRef] [Green Version]
  17. Macossay-Castillo, M.; Marvelli, G.; Guharoy, M.; Jain, A.; Kihara, D.; Tompa, P.; Wodak, S.J. The Balancing Act of Intrinsically Disordered Proteins: Enabling Functional Diversity while Minimizing Promiscuity. J. Mol. Biol. 2019, 431, 1650–1670. [Google Scholar] [CrossRef]
  18. Chakrabarti, P.; Chakravarty, D. Intrinsically disordered proteins/regions and insight into their biomolecular interactions. Biophysical Chemistry 2022, 283, 106769. [Google Scholar] [CrossRef]
  19. Huang, Y.H.; Rose, P.W.; Hsu, C.N. Citing a Data Repository: A Case Study of the Protein Data Bank. PLoS ONE 2015, 10, e0136631. [Google Scholar] [CrossRef]
  20. Fischer, E. Einfiuss der Configurationa Uf die Wirkung der Enzyme. Ber. Dtsch. Chem. Ges. 1894, 11, 3479–3483. [Google Scholar] [CrossRef]
  21. Borgia, A.; Borgia, M.; Bugge, K.; Kissling, V.M.; Heidarsson, P.O.; Fernandes, C.B.; Sottini, A.; Soranno, A.; Buholzer, K.J.; Nettels, D.; et al. Extreme disorder in an ultrahigh-affinity protein complex. Nature 2018, 555, 61–66. [Google Scholar] [CrossRef] [Green Version]
  22. Koshland, D. Application of a theory of enzyme specificity to protein synthesis. Proc. Natl. Acad. Sci. USA 1958, 44, 98–104. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Mirzaie, M. Identification of native protein structures captured by principal interactions. BMC Bioinform. 2019, 20, 604. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. El-Gebali, S.; Mistry, J.; Bateman, A.; Eddy, S.R.; Luciani, A.; Potter, S.C.; Qureshi, M.; Richardson, L.J.; Salazar, G.A.; Smart, A.; et al. The Pfam protein families database in 2019. Nucleic Acids Res. 2019, 47, D427–D432. [Google Scholar] [CrossRef] [PubMed]
  25. Lermyte, F. Roles, Characteristics, and Analysis of Intrinsically Disordered Proteins: A Minireview. Life 2020, 10, 320. [Google Scholar] [CrossRef] [PubMed]
  26. Yang, J.; Zeng, Y.; Liu, Y.; Gao, M.; Liu, S.; Su, Z.; Huang, Y. Electrostatic interactions in molecular recognition of intrinsically disordered proteins. J. Biomol. Struct. Dyn. 2020, 38, 4883–4894. [Google Scholar] [CrossRef]
  27. Wright, P.E.; Dyson, H.J. Intrinsically unstructured proteins: Re-assessing the protein structure-function paradigm. J. Mol. Biol. 1999, 293, 321–331. [Google Scholar] [CrossRef] [Green Version]
  28. Dunker, A.K.; Lawson, J.D.; Brown, C.J.; Williams, R.M.; Romero, P.; Oh, J.S.; Oldfield, C.J.; Campen, A.M.; Ratliff, C.M.; Hipps, K.W.; et al. Intrinsically disordered protein. J. Mol. Graph. Model. 2001, 19, 26–59. [Google Scholar] [CrossRef] [Green Version]
  29. Tompa, P. Intrinsically unstructured proteins. Trends Biochem. Sci. 2002, 27, 527–533. [Google Scholar] [CrossRef]
  30. Uversky, V.; Gillespie, J.; Fink, A. Why Are “Natively Unfolded” Proteins Unstructured Under Physiologic Conditions? Proteins Struct. Funct. Genet. 2000, 41, 415–427. [Google Scholar] [CrossRef]
  31. Wallin, S. Intrinsically disordered proteins: Structural and functional dynamics. Res. Rep. Biol. 2017, 8, 7–16. [Google Scholar] [CrossRef] [Green Version]
  32. Dunker, A.K.; Garner, E.; Guilliot, S.; Romero, P.; Albrecht, K.; Hart, J.; Obradovic, Z.; Kissinger, G.; Villafranca, J.E. Protein disorder and the evolution of molecular recognition: Theory, predictions and observations. Pac. Symp. Biocomput. 1998, 3, 473–484. [Google Scholar]
  33. Uversky, V.N.; Dunker, A.K. Understanding protein non-folding. Biochim. Biophys. Acta (BBA) Proteins Proteom. 2010, 1804, 1231–1264. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Uversky, V.N. Intrinsically disordered proteins from A to Z. Int. J. Biochem. Cell Biol. 2011, 43, 1090–1103. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Uversky VN The mysterious unfoldome: Structureless, under appreciated, yet vital part of any given proteome. J. Biomed. Biotechnol. 2010, 2010, 568068.
  36. Ward, J.J.; Sodhi, J.S.; McGuffin, L.J.; Buxton, B.F.; Jones, D.T. Prediction and Functional Analysis of Native Disorder in Proteins from the Three Kingdoms of Life. J. Mol. Biol. 2004, 337, 635–645. [Google Scholar] [CrossRef]
  37. Dunker, A.K.; Obradovic, Z.; Romero, P.; Garner, E.C. Intrinsic Protein Disorder in Complete Genomes. Genome Inform. 2000, 11, 161–171. [Google Scholar]
  38. Hu, G.; Wang, K.; Song, J.; Uversky, V.N.; Kurgan, L. Taxonomic Landscape of the Dark Proteomes: Whole-Proteome Scale Interplay Between Structural Darkness, Intrinsic Disorder, and Crystallization Propensity. Proteomics 2018, 18, e1800243. [Google Scholar] [CrossRef] [PubMed]
  39. Kumar, N.; Kaushik, R.; Tennakoon, C.; Uversky, V.N.; Longhi, S.; Zhang, K.Y.J.; Bhatia, S. Comprehensive Intrinsic Disorder Analysis of 6108 Viral Proteomes: From the Extent of Intrinsic Disorder Penetrance to Functional Annotation of Disordered Viral Proteins. J. Proteome Res. 2021, 20, 2704–2713. [Google Scholar] [CrossRef]
  40. Monzon, A.M.; Necci, M.; Quaglia, F.; Walsh, I.; Zanotti, G.; Piovesan, D.; Tosatto, S.C.E. Experimentally Determined Long Intrinsically Disordered Protein Regions Are Now Abundant in the Protein Data Bank. Int. J. Mol. Sci. 2020, 21, 4496. [Google Scholar] [CrossRef]
  41. Tompa, P.; Dosztanyi, Z.; Simon, I. Prevalent Structural Disorder in E. coli and S. cerevisiae Proteomes. J. Proteome Res. 2006, 5, 1996–2000. [Google Scholar] [CrossRef] [PubMed]
  42. Dunker, A.K.; Brown, C.J.; Lawson, J.D.; Iakoucheva, L.M.; Obradovic, Z. Intrinsic Disorder and Protein Function. Biochemistry 2002, 41, 6573–6582. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  43. Dunker, A.K.; Cortese, M.S.; Romero, P.; Iakoucheva, L.M.; Uversky, V.N. Flexible nets. The roles of intrinsic disorder in protein interaction networks. FEBS J. 2005, 272, 5129–5148. [Google Scholar] [CrossRef] [PubMed]
  44. Dyson, H.J.; Wright, P.E. Intrinsically unstructured proteins and their functions. Nat. Rev. Mol. Cell Biol. 2005, 6, 197–208. [Google Scholar] [CrossRef] [PubMed]
  45. Uversky, V. What does it mean to be natively unfolded? Eur. J. Biochem. 2002, 269, 2–12. [Google Scholar] [CrossRef]
  46. Uversky, V.N. Natively unfolded proteins: A point where biology waits for physics. Protein Sci. 2002, 11, 739–756. [Google Scholar] [CrossRef] [Green Version]
  47. Dyson, H.J.; Wright, P.E. Coupling of folding and binding for unstructured proteins. Curr. Opin. Struct. Biol. 2002, 12, 54–60. [Google Scholar] [CrossRef]
  48. Radivojac, P.; Iakoucheva, L.M.; Oldfield, C.J.; Obradovic, Z.; Uversky, V.N.; Dunker, A.K. Intrinsic Disorder and Functional Proteomics. Biophys. J. 2007, 92, 1439–1456. [Google Scholar] [CrossRef] [Green Version]
  49. Romero, P.; Obradovic, Z.; Li, X.; Garner, E.; Brown, C.; Dunker, A.K. Sequence complexity of disordered protein. Proteins Struct. Funct. Genet. 2001, 42, 38–48. [Google Scholar] [CrossRef]
  50. Williams, R.; Obradovi, Z.; Mathura, V.; Braun, W.; Garner, E.; Young, J.; Takayama, S.; Brown, C.J.; Dunker, A.K. The protein non-folding problem: Amino acid determinants of intrinsic order and disorder. Pac. Symp. Biocomput. 2001, 6, 89–100. [Google Scholar]
  51. Holt, C.; Raynes, J.K.; Carver, J.A. Sequence characteristics responsible for protein-protein interactions in the intrinsically disordered regions of caseins, amelogenins, and small heat-shock proteins. Biopolymers 2019, 110, e23319. [Google Scholar] [CrossRef] [PubMed]
  52. Kastano, K.; Mier, P.; Dosztanyi, Z.; Promponas, V.J.; Andrade-Navarro, M.A. Functional Tuning of Intrinsically Disordered Regions in Human Proteins by Composition Bias. Biomolecules 2022, 12, 1486. [Google Scholar] [CrossRef] [PubMed]
  53. Trivedi, R.; Nagarajaram, H.A. Amino acid substitution scoring matrices specific to intrinsically disordered regions in proteins. Sci Rep. 2019, 9, 16380. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  54. Trivedi, R.; Nagarajaram, H.A. Substitution scoring matrices for proteins—An overview. Protein Sci. 2020, 29, 2150–2163. [Google Scholar] [CrossRef]
  55. Jarnot, P.; Ziemska-Legiecka, J.; Grynberg, M.; Gruca, A. Insights from analyses of low complexity regions with canonical methods for protein sequence comparison. Brief Bioinform. 2022, 23, bbac299. [Google Scholar] [CrossRef]
  56. McFadden, W.M.; Yanowitz, J.L. idpr: A package for profiling and analyzing Intrinsically Disordered Proteins in R. PLoS ONE 2022, 17, e0266929. [Google Scholar] [CrossRef]
  57. Tompa, P. Structure and Function of Intrinsically Disordered Proteins; CRC Press: Boca Raton, FL, USA; Taylor and Francis Group: Abingdon, UK, 2010. [Google Scholar]
  58. Uversky, V.N.; Oldfield, C.J.; Dunker, A.K. Showing your ID: Intrinsic disorder as an ID for recognition, regulation and cell signaling. J. Mol. Recognit. 2005, 18, 343–384. [Google Scholar] [CrossRef]
  59. Uversky, V. Intrinsic Disorder-based Protein Interactions and their Modulators. Curr. Pharm. Des. 2013, 19, 4191–4213. [Google Scholar] [CrossRef]
  60. Uversky, V.N. The most important thing is the tail: Multitudinous functionalities of intrinsically disordered protein termini. FEBS Lett. 2013, 587, 1891–1901. [Google Scholar] [CrossRef] [Green Version]
  61. Uversky, V.N. Unusual biophysics of intrinsically disordered proteins. Biochim. Biophys. Acta (BBA) Proteins Proteom. 2013, 1834, 932–951. [Google Scholar] [CrossRef]
  62. Morffy, N.; Strader, L.C. Structural Aspects of Auxin Signaling. Cold Spring Harb. Perspect. Biol. 2022, 14, a039883. [Google Scholar] [CrossRef] [PubMed]
  63. Crick, S.L.; Jayaraman, M.; Frieden, C.; Wetzel, R.; Pappu, R.V. Fluorescence correlation spectroscopy shows that monomeric poly- glutamine molecules form collapsed structures in aqueous solutions. Proc. Natl. Acad. Sci. USA 2006, 103, 16764–16769. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  64. Walters, R.H.; Murphy, R.M. Examining Polyglutamine Peptide Length: A Connection between Collapsed Conformations and Increased Aggregation. J. Mol. Biol. 2009, 393, 978–992. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  65. Mukhopadhyay, S.; Krishnan, R.; Lemke, E.A.; Lindquist, S.; Deniz, A.A. A natively unfolded yeast prion monomer adopts an ensemble of collapsed and rapidly fluctuating structures. Proc. Natl. Acad. Sci. USA 2007, 104, 2649–2654. [Google Scholar] [CrossRef]
  66. Wang, X.; Vitalis, A.; Wyczalkowski, M.A.; Pappu, R.V. Characterizing the conformational ensemble of monomeric polyglutamine. Proteins Struct. Funct. Bioinform. 2006, 63, 297–311. [Google Scholar] [CrossRef]
  67. Mukhopadhyay, S. The Dynamism of Intrinsically Disordered Proteins: Binding-Induced Folding, Amyloid Formation, and Phase Separation. J. Phys. Chem. B 2020, 124, 11541–11560. [Google Scholar] [CrossRef]
  68. Dunker, A.K.; Uversky, V.N. Drugs for ‘protein clouds’: Targeting intrinsically disordered transcription factors. Curr. Opin. Pharmacol. 2010, 10, 782–788. [Google Scholar] [CrossRef]
  69. Choy, W.Y.; Forman-Kay, J.D. Calculation of ensembles of structures representing the unfolded state of an SH3 domain. J. Mol. Biol. 2001, 308, 1011–1032. [Google Scholar] [CrossRef]
  70. Strodel, B. Energy Landscapes of Protein Aggregation and Conformation Switching in Intrinsically Disordered Proteins. J. Mol. Biol. 2021, 433, 167182. [Google Scholar] [CrossRef]
  71. Dunker, A.K.; Obradovic, Z. The protein trinity—Linking function and disorder. Nat. Biotechnol. 2001, 19, 805–806. [Google Scholar] [CrossRef]
  72. Dunker, A.K.; Brown, C.J.; Obradovic, Z. Identification and Functions of Usefully Disordered Proteins. Adv. Protein Chem. 2002, 62, 25–49. [Google Scholar] [PubMed]
  73. Toto, A.; Malagrino, F.; Visconti, L.; Troilo, F.; Pagano, L.; Brunori, M.; Jemth, P.; Gianni, S. Templated folding of intrinsically disordered proteins. J. Biol. Chem. 2020, 295, 6586–6593. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  74. Malagrino, F.; Visconti, L.; Pagano, L.; Toto, A.; Troilo, F.; Gianni, S. Understanding the Binding Induced Folding of Intrinsically Disordered Proteins by Protein Engineering: Caveats and Pitfalls. Int. J. Mol. Sci. 2020, 21, 3484. [Google Scholar] [CrossRef] [PubMed]
  75. Malagrino, F.; Diop, A.; Pagano, L.; Nardella, C.; Toto, A.; Gianni, S. Unveiling induced folding of intrinsically disordered proteins—Protein engineering, frustration and emerging themes. Curr. Opin. Struct. Biol. 2022, 72, 153–160. [Google Scholar] [CrossRef] [PubMed]
  76. Kriwacki, R.W.; Hengst, L.; Tennant, L.; Reed, S.I.; Wright, P.E. Structural studies of p2lWa1CiPl/Sdil in the free and Cdk2-bound state: Conformational disorder mediates binding diversity. Proc. Natl. Acad. Sci. USA 1996, 93, 11504–11509. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  77. Uversky, V.N. Analyzing IDPs in Interactomes. Methods Mol Biol. 2020, 2141, 895–945. [Google Scholar]
  78. Fuxreiter, M.; Simon, I.; Friedrich, P.; Tompa, P. Preformed Structural Elements Feature in Partner Recognition by Intrinsically Unstructured Proteins. J. Mol. Biol. 2004, 338, 1015–1026. [Google Scholar] [CrossRef]
  79. Choi, U.B.; Sanabria, H.; Smirnova, T.; Bowen, M.E.; Weninger, K.R. Spontaneous Switching among Conformational Ensembles in Intrinsically Disordered Proteins. Biomolecules 2019, 9, 114. [Google Scholar] [CrossRef] [Green Version]
  80. Follis, A.V.; Llambi, F.; Kalkavan, H.; Yao, Y.; Phillips, A.H.; Park, C.G.; Marassi, F.M.; Green, D.R.; Kriwacki, R.W. Regulation of apoptosis by an intrinsically disordered region of Bcl-xL. Nat. Chem. Biol. 2018, 14, 458–465. [Google Scholar] [CrossRef]
  81. Miao, Y.; Tipakornsaowapak, T.; Zheng, L.; Mu, Y.; Lewellyn, E. Phospho-regulation of intrinsically disordered proteins for actin assembly and endocytosis. FEBS J. 2018, 285, 2762–2784. [Google Scholar] [CrossRef]
  82. Csizmok, V.; Forman-Kay, J.D. Complex regulatory mechanisms mediated by the interplay of multiple post-translational modifications. Curr. Opin. Struct. Biol. 2018, 48, 58–67. [Google Scholar] [CrossRef] [PubMed]
  83. Bludau, I.; Willems, S.; Zeng, W.F.; Strauss, M.T.; Hansen, F.M.; Tanzer, M.C.; Karayel, O.; Schulman, B.A.; Mann, M. The structural context of posttranslational modifications at a proteome-wide scale. PLoS Biol. 2022, 20, e3001636. [Google Scholar] [CrossRef] [PubMed]
  84. Ye, H.; Han, Y.; Li, P.; Su, Z.; Huang, Y. The Role of Post-Translational Modifications on the Structure and Function of Tau Protein. J. Mol. Neurosci. 2022, 72, 1557–1571. [Google Scholar] [CrossRef] [PubMed]
  85. Jeffery, C.J. Moonlighting proteins. Trends Biochem. Sci. 1999, 24, 8–11. [Google Scholar] [CrossRef]
  86. Jeffery, C.J. Molecular mechanisms for multitasking: Recent crystal structures of moonlighting proteins. Curr. Opin. Struct. Biol. 2004, 14, 663–668. [Google Scholar] [CrossRef]
  87. Meng, F.; Kurgan, L. High-throughput prediction of disordered moonlighting regions in protein sequences. Proteins 2018, 86, 1097–1110. [Google Scholar] [CrossRef]
  88. Balcerak, A.; Trebinska-Stryjewska, A.; Konopinski, R.; Wakula, M.; Grzybowska, E.A. RNA-protein interactions: Disorder, moonlighting and junk contribute to eukaryotic complexity. Open Biol. 2019, 9, 190096. [Google Scholar] [CrossRef] [Green Version]
  89. Rafiee, M.R.; Zagalak, J.A.; Sidorov, S.; Steinhauser, S.; Davey, K.; Ule, J.; Luscombe, N.M. Chromatin-contact atlas reveals disorder-mediated protein interactions and moonlighting chromatin-associated RBPs. Nucleic Acids Res. 2021, 49, 13092–13107. [Google Scholar] [CrossRef]
  90. Gsponer, J.; Babu, M.M. The rules of disorder or why disorder rules. Prog. Biophys. Mol. Biol. 2009, 99, 94–103. [Google Scholar] [CrossRef]
  91. Tompa, P. The interplay between structure and function in intrinsically unstructured proteins. FEBS Lett. 2005, 579, 3346–3354. [Google Scholar] [CrossRef] [Green Version]
  92. Sickmeier, M.; Hamilton, J.A.; LeGall, T.; Vacic, V.; Cortese, M.S.; Tantos, A.; Szabo, B.; Tompa, P.; Chen, J.; Uversky, V.N.; et al. DisProt: The Database of Disordered Proteins. Nucleic Acids Res. 2007, 35, D786–D793. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  93. Piovesan, D.; Tosatto, S.C.E. Mobi 2.0: An improved method to define intrinsic disorder, mobility and linear binding regions in protein structures. Bioinformatics 2018, 34, 122–123. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  94. Vymetal, J.; Vondrasek, J.; Hlouchova, K. Sequence Versus Composition: What Prescribes IDP Biophysical Properties? Entropy 2019, 21, 654. [Google Scholar] [CrossRef] [Green Version]
  95. Thomasen, F.E.; Lindorff-Larsen, K. Conformational ensembles of intrinsically disordered proteins and flexible multidomain proteins. Biochem. Soc. Trans. 2022, 50, 541–554. [Google Scholar] [CrossRef]
  96. Weber, B.; Hora, M.; Kazman, P.; Gobl, C.; Camilloni, C.; Reif, B.; Buchner, J. The antibody light-chain linker regulates domain orientation and amyloidogenicity. J. Mol. Biol. 2018, 430, 4925–4940. [Google Scholar] [CrossRef] [PubMed]
  97. Murrali, M.G.; Felli, I.C.; Pierattelli, R. Adenoviral E1A Exploits Flexibility and Disorder to Target Cellular Proteins. Biomolecules 2020, 10, 1541. [Google Scholar] [CrossRef] [PubMed]
  98. Daughdrill, G.W.; Narayanaswami, P.; Gilmore, S.H.; Belczyk, A.; Brown, C.J. Dynamic behavior of an intrinsically unstructured linker domain is conserved in the face of negligible amino acid sequence conservation. J. Mol. Evol. 2007, 65, 277–288. [Google Scholar] [CrossRef]
  99. Cunningham, C.C.; Leclerc, N.; Flanagan, L.A.; Lu, M.; Janmey, P.A.; Kosik, K.S. Microtubule-associated protein 2c reorganizes both microtubules and microfilaments into distinct cytological structures in an actin-binding protein-280-deficient melanoma cell line. J. Cell Biol. 1997, 136, 845–857. [Google Scholar] [CrossRef] [Green Version]
  100. Fahmi, M.; Ito, M. Evolutionary Approach of Intrinsically Disordered CIP/KIP Proteins. Sci. Rep. 2019, 9, 1575. [Google Scholar] [CrossRef] [Green Version]
  101. Szabo, B.; Horvath, T.; Schad, E.; Murvai, N.; Tantos, A.; Kalmar, L.; Chemes, L.B.; Han, K.H.; Tompa, P. Intrinsically Disordered Linkers Impart Processivity on Enzymes by Spatial Confinement of Binding Domains. Int. J. Mol. Sci. 2019, 20, 2119. [Google Scholar] [CrossRef] [Green Version]
  102. Cornish, J.; Chamberlain, S.G.; Owen, D.; Mott, H.R. Intrinsically disordered proteins and membranes: A marriage of convenience for cell signalling? Biochem. Soc. Trans. 2020, 48, 2669–2689. [Google Scholar] [CrossRef] [PubMed]
  103. Fuglebakk, E.; Reuter, N. A model for hydrophobic protrusions on peripheral membrane proteins. PLoS Comput. Biol. 2018, 14, e1006325. [Google Scholar] [CrossRef] [PubMed]
  104. Deryusheva, E.; Nemashkalova, E.; Galloux, M.; Richard, C.A.; Eleouet, J.F.; Kovacs, D.; Van Belle, K.; Tompa, P.; Uversky, V.; Permyakov, S. Does Intrinsic Disorder in Proteins Favor Their Interaction with Lipids? Proteomics 2019, 19, e1800098. [Google Scholar] [CrossRef] [PubMed]
  105. Diella, F.; Haslam, N.; Chica, C.; Budd, A.; Michael, S.; Brown, N.P.; Trave, G.; Gibson, T.J. Understanding eukaryotic linear motifs and their role in cell signaling and regulation. Front. Biosci. 2008, 13, 6580–6603. [Google Scholar] [CrossRef] [Green Version]
  106. Galea, C.A.; Wang, Y.; Sivakolundu, S.G.; Kriwacki, R.W. Regulation of Cell Division by Intrinsically Unstructured Proteins: Intrinsic Flexibility, Modularity, and Signaling Conduits. Biochemistry 2008, 47, 7598–7609. [Google Scholar] [CrossRef]
  107. Hoermann, B.; Kohn, M. Evolutionary crossroads of cell signaling: PP1 and PP2A substrate sites in intrinsically disordered regions. Biochem. Soc. Trans. 2021, 49, 1065–1074. [Google Scholar] [CrossRef]
  108. Galea, C.A.; Pagala, V.R.; Obenauer, J.C.; Park, C.G.; Slaughter, C.A.; Kriwacki, R.W. Proteomic studies of the intrinsically unstructured mammalian proteome. J Proteome Res. 2006, 5, 2839–2948. [Google Scholar] [CrossRef]
  109. Bode, A.M.; Dong, Z. Post-translational modification of p53 in tumorigenesis. Nat. Rev. Cancer 2004, 4, 793–805. [Google Scholar] [CrossRef]
  110. Kouzarides, T. Chromatin modifications and their function. Cell 2007, 128, 693–705. [Google Scholar] [CrossRef] [Green Version]
  111. Jin, F.; Grater, F. How multisite phosphorylation impacts the conformations of intrinsically disordered proteins. PLoS Comput. Biol. 2021, 17, e1008939. [Google Scholar] [CrossRef]
  112. Kumar, R.; Thompson, E.B. Role of Phosphorylation in the Modulation of the Glucocorticoid Receptor’s Intrinsically Disordered Domain. Biomolecules 2019, 9, 95. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  113. Lorton, B.M.; Shechter, D. Cellular consequences of arginine methylation. Cell. Mol. Life Sci. 2019, 76, 2933–2956. [Google Scholar] [CrossRef] [PubMed]
  114. Schroeder, R.; Barta, A.; Semrad, K. Strategies for RNA folding and assembly. Nat. Rev. Mol. Cell Biol. 2004, 5, 908–919. [Google Scholar] [CrossRef] [PubMed]
  115. Young, J.C.; Agashe, V.R.; Siegers, K.; Hartl, F.U. Pathways of chaperone-mediated protein folding in the cytosol. Nat. Rev. Mol. Cell Biol. 2004, 5, 781–791. [Google Scholar] [CrossRef]
  116. Kovacs, D.; Tompa, P. Diverse functional manifestations of intrinsic structural disorder in molecular chaperones. Biochem. Soc. Trans. 2012, 40, 963–968. [Google Scholar] [CrossRef]
  117. Rogers, J.M.; Steward, A.; Clarke, J. Folding and Binding of an Intrinsically Disordered Protein: Fast, but Not ‘Diffusion-Limited’. J. Am. Chem. Soc. 2013, 135, 1415–1422. [Google Scholar] [CrossRef]
  118. Tompa, P.; Csermely, P. The role of structural disorder in the function of RNA and protein chaperones. FASEB J. 2004, 18, 1169–1175. [Google Scholar] [CrossRef] [Green Version]
  119. Lindner, R.A.; Kapur, A.; Mariani, M.; Titmuss, S.J.; Carver, J.A. Structural alterations of alpha-crystallin during its chaperone action. Eur. J. Biochem. 1998, 258, 170–183. [Google Scholar] [CrossRef]
  120. Treweek, T.M.; Rekas, A.; Walker, M.J.; Carver, J.A. A quantitative NMR spectroscopic examination of the flexibility of the C-terminal extensions of the molecular chaperones, αA- and αB-crystallin. Exp Eye Res. 2010, 91, 691–699. [Google Scholar] [CrossRef]
  121. Holmstrom, E.D.; Liu, Z.; Nettels, D.; Best, R.B.; Schuler, B. Disordered RNA chaperones can enhance nucleic acid folding via local charge screening. Nat. Commun. 2019, 10, 2453. [Google Scholar] [CrossRef] [Green Version]
  122. Lee, H.M.; Kwon, S.B.; Son, A.; Kim, D.H.; Kim, K.H.; Lim, J.; Kwon, Y.G.; Kang, J.S.; Lee, B.K.; Byun, Y.H.; et al. Stabilization of Intrinsically Disordered DKK2 Protein by Fusion to RNA-Binding Domain. Int. J. Mol. Sci. 2019, 20, 2847. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  123. Yakubu, U.M.; Morano, K.A. Suppression of aggregate and amyloid formation by a novel intrinsically disordered region in metazoan Hsp110 chaperones. J. Biol. Chem. 2021, 296, 100567. [Google Scholar] [CrossRef] [PubMed]
  124. Liu, A.Y.; Minetti, C.A.; Remeta, D.P.; Breslauer, K.J.; Chen, K.Y. HSF1, Aging, and Neurodegeneration. Adv. Exp. Med. Biol. 2022, in press. [Google Scholar]
  125. Murvai, N.; Kalmar, L.; Szabo, B.; Schad, E.; Micsonai, A.; Kardos, J.; Buday, L.; Han, K.H.; Tompa, P.; Tantos, A. Cellular Chaperone Function of Intrinsically Disordered Dehydrin ERD14. Int. J. Mol. Sci. 2021, 22, 6190. [Google Scholar] [CrossRef] [PubMed]
  126. Mitrea, D.M.; Kriwacki, R.W. Regulated unfolding of proteins in signaling. FEBS Lett. 2013, 587, 1081–1088. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  127. Bardwell, J.C.; Jakob, U. Conditional disorder in chaperone action. Trends Biochem. Sci. 2012, 37, 517–525. [Google Scholar] [CrossRef] [PubMed]
  128. Tempra, C.; Scollo, F.; Pannuzzo, M.; Lolicato, F.; La Rosa, C. A unifying framework for amyloid-mediated membrane damage: The lipid-chaperone hypothesis. Biochim. Biophys. Acta Proteins Proteom. 2022, 1870, 140767. [Google Scholar] [CrossRef]
  129. Sugase, K.; Dyson, H.J.; Wright, P.E. Mechanism of coupled folding and binding of an intrinsically disordered protein. Nature 2007, 447, 1021–1025. [Google Scholar] [CrossRef]
  130. Wright, P.E.; Dyson, H.J. Linking folding and binding. Curr. Opin. Struct. Biol. 2009, 19, 31–38. [Google Scholar] [CrossRef] [Green Version]
  131. Gianni, S.; Jemth, P. Affinity versus specificity in coupled binding and folding reactions. Protein Eng. Des. Sel. 2019, 32, 355–357. [Google Scholar] [CrossRef]
  132. Zou, R.; Zhou, Y.; Wang, Y.; Kuang, G.; Ågren, H.; Wu, J.; Tu, Y. Free Energy Profile and Kinetics of Coupled Folding and Binding of the Intrinsically Disordered Protein p53 with MDM2. J. Chem. Inf. Model. 2020, 60, 1551–1558. [Google Scholar] [CrossRef] [PubMed]
  133. Morris, O.M.; Torpey, J.H.; Isaacson, R.L. Intrinsically disordered proteins: Modes of binding with emphasis on disordered domains. Open Biol. 2021, 11, 210222. [Google Scholar] [CrossRef] [PubMed]
  134. Schuler, B.; Borgia, A.; Borgia, M.B.; Heidarsson, P.O.; Holmstrom, E.D.; Nettels, D.; Sottini, A. Binding without folding—The biomolecular function of disordered polyelectrolyte complexes. Curr. Opin. Struct. Biol. 2020, 60, 66–76. [Google Scholar] [CrossRef] [PubMed]
  135. Dreier, J.E.; Prestel, A.; Martins, J.M.; Brøndum, S.S.; Nielsen, O.; Garbers, A.E.; Suga, H.; Boomsma, W.; Rogers, J.M.; Hartmann-Petersen, R.; et al. A context-dependent and disordered ubiquitin-binding motif. Cell. Mol. Life Sci. 2022, 79, 484. [Google Scholar] [CrossRef] [PubMed]
  136. Robustelli, P.; Piana, S.; Shaw, D.E. Mechanism of Coupled Folding-upon-Binding of an Intrinsically Disordered Protein. J Am Chem Soc. 2020, 142, 11092–11101. [Google Scholar] [CrossRef]
  137. Kim, A.; Kakalis, L.; Abdul-Manan, N.; Liu, G.; Rosen, M. Autoinhibition and activation mechanisms of the Wiskott-Aldrich syndrome protein. Nature 2000, 404, 151–158. [Google Scholar] [CrossRef]
  138. Berlow, R.B.; Dyson, H.J.; Wright, P.E. Multivalency enables unidirectional switch-like competition between intrinsically disordered proteins. Proc. Natl. Acad. Sci. USA 2022, 119, e2117338119. [Google Scholar] [CrossRef]
  139. Ferreon, A.C.; Ferreon, J.C.; Wright, P.E.; Deniz, A.A. Modulation of allostery by protein intrinsic disorder. Nature 2013, 498, 390–394. [Google Scholar] [CrossRef] [Green Version]
  140. Radhakrishnan, I.; Perez-Alvarado, G.C.; Parker, D.; Dyson, H.J.; Montminy, M.R.; Wright, P.E. Solution structure of the KIX domain of CBP bound to the transactivation domain of CREB: A model for activator: Coactivator interactions. Cell 1997, 91, 741–752. [Google Scholar] [CrossRef] [Green Version]
  141. De Guzman, R.N.; Goto, N.K.; Dyson, H.J.; Wright, P.E. Structural basis for cooperative transcription factor binding to the CBP coactivator. J. Mol. Biol. 2006, 355, 1005–1013. [Google Scholar] [CrossRef]
  142. Berlow, R.B.; Dyson, H.J.; Wright, P.E. Expanding the Paradigm: Intrinsically Disordered Proteins and Allosteric Regulation. J Mol. Biol. 2018, 430, 2309–2320. [Google Scholar] [CrossRef] [PubMed]
  143. Hilser, V.J.; Thompson, E.B. Intrinsic disorder as a mechanism to optimize allosteric coupling in proteins. Proc. Natl. Acad. Sci. USA 2007, 104, 8311–8315. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  144. Flock, T.; Weatheritt, R.J.; Latysheva, N.S.; Babu, M.M. Controlling entropy to tune the functions of intrinsically disordered regions. Curr. Opin. Struct. Biol. 2014, 26, 62–72. [Google Scholar] [CrossRef]
  145. Fuxreiter, M.; Tompa, P.; Simon, I.; Uversky, V.N.; Hansen, J.C.; Asturias, F.J. Malleable machines take shape in eukaryotic transcriptional regulation. Nat. Chem. Biol. 2008, 4, 728–737. [Google Scholar] [CrossRef] [Green Version]
  146. Peng, Z.; Oldfield, C.J.; Xue, B.; Mizianty, M.J.; Dunker, A.K.; Kurgan, L.; Uversky, V.N. A creature with a hundred waggly tails: Intrinsically disordered proteins in the ribosome. Cell. Mol. Life Sci. 2013, 71, 1477–1504. [Google Scholar] [CrossRef] [PubMed]
  147. Hegyi, H.; Schad, E.; Tompa, P. Structural disorder promotes assembly of protein complexes. BMC Struct. Biol. 2007, 7, 65. [Google Scholar] [CrossRef]
  148. Gunasekaran, K.; Tsai, C.J.; Kumar, S.; Zanuy, D.; Nussinov, R. Extended disordered proteins: Targeting function with less scaffold. Trends Biochem. Sci. 2003, 28, 81–85. [Google Scholar] [CrossRef]
  149. Cortese, M.S.; Uversky, V.N.; Dunker, A.K. Intrinsic disorder in scaffold proteins: Getting more from less. Prog. Biophys. Mol. Biol. 2008, 98, 85–106. [Google Scholar] [CrossRef] [Green Version]
  150. Clerc, I.; Sagar, A.; Barducci, A.; Sibille, N.; Bernado, P.; Cortes, J. The diversity of molecular interactions involving intrinsically disor dered proteins: A molecular modeling perspective. Comput. Struct. Biotechnol. J. 2021, 19, 3817–3828. [Google Scholar] [CrossRef]
  151. Shajani, Z.; Sykes, M.T.; Williamson, J.R. Assembly of bacterial ribosomes. Annu. Rev. Biochem. 2011, 80, 501–526. [Google Scholar] [CrossRef] [Green Version]
  152. Bondos, S.E.; Dunker, A.K.; Uversky, V.N. Intrinsically disordered proteins play diverse roles in cell signaling. Cell Commun. Signal. 2022, 20, 20. [Google Scholar] [CrossRef] [PubMed]
  153. Fung, H.Y.J.; Birol, M.; Rhoades, E. IDPs in macromolecular complexes: The roles of multivalent interactions in diverse assemblies. Curr Opin Struct Biol. 2018, 49, 36–43. [Google Scholar] [CrossRef] [PubMed]
  154. Holt, C. Unfolded phosphopolypeptides enable soft and hard tissues to coexist in the same organism with relative ease. Curr. Opin. Struct. Biol. 2013, 23, 420–425. [Google Scholar] [CrossRef] [PubMed]
  155. Vazquez-Hernandez, M.; Romero, I.; Sanchez-Ballesta, M.T.; Merodio, C.; Escribano, M.I. Functional characterization of VviDHN2 and VviDHN4 dehydrin isoforms from Vitis vinifera (L.): An in silico and in vitro approach. Plant Physiol. Biochem. 2021, 158, 146–157. [Google Scholar] [CrossRef] [PubMed]
  156. Elko, E.A.; Manuel, A.M.; White, S.; Zito, E.; van der Vliet, A.; Anathy, V.; Janssen-Heininger, Y.M.W. Oxidation of peroxiredoxin-4 induces oligomerization and promotes interaction with proteins governing protein folding and endoplasmic reticulum stress. J. Biol. Chem. 2021, 296, 100665. [Google Scholar] [CrossRef]
  157. Bidwell LM, McManus ME, Gaedigk A, Kakuta Y, Negishi M, Pedersen L, Martin JL Crystal structure of human catecholamine sulfotransferase. J. Mol. Biol. 1999, 293, 521–530.
  158. Barney, B.M.; LoBrutto, R.; Francisco, W.A. Characterization of a small metal binding protein from Nitrosomonas europaea. Biochemistry 2004, 43, 11206–11213. [Google Scholar] [CrossRef]
  159. Jobby, M.K.; Sharma, Y. Calcium-binding crystallins from Yersinia pestis. Characterization of two single betagamma-crystallin domains of a putative exported protein. J. Biol. Chem. 2005, 280, 1209–1216. [Google Scholar] [CrossRef] [Green Version]
  160. Davey, N.E.; Van Roey, K.; Weatheritt, R.J.; Toedt, G.; Uyar, B.; Altenberg, B.; Budd, A.; Diella, F.; Dinkel, H.; Gibson, T.J. Attributes of short linear motifs. Mol. BioSyst. 2012, 8, 268–281. [Google Scholar] [CrossRef]
  161. Fuxreiter, M.; Tompa, P.; Simon, I. Local structural disorder imparts plasticity on linear motifs. Bioinformatics 2007, 23, 950–956. [Google Scholar] [CrossRef]
  162. Lee, S.; Kim, D.; Han, J.; Cha, E.; Lim, J.; Cho, Y.; Lee, C.; Han, K. Understanding Pre-Structured Motifs (PreSMos) in Intrinsically Unfolded Proteins. Curr. Protein Pept. Sci. 2012, 13, 34–54. [Google Scholar] [CrossRef] [PubMed]
  163. Mohan, A.; Oldfield, C.J.; Radivojac, P.; Vacic, V.; Cortese, M.S.; Dunker, A.K.; Uversky, V.N. Analysis of Molecular Recognition Features (MoRFs). J. Mol. Biol. 2006, 362, 1043–1059. [Google Scholar] [CrossRef] [PubMed]
  164. Oldfield, C.; Cheng, Y.; Cortese, M.; Romero, P.; Uversky, V.; Dunker, A.K. Coupled Folding and Binding with R-Helix-Forming Molecular Recognition. Biochemistry 2005, 44, 12454–12470. [Google Scholar] [CrossRef] [PubMed]
  165. Hinds, M.G.; Smits, C.; Fredericks-Short, R.; Risk, J.M.; Bailey, M.; Huang, D.C.; Day, C.L. Bim, Bad and Bmf: Intrinsically unstructured BH3-only proteins that undergo a localized conformational change upon binding to prosurvival Bcl-2 targets. Cell Death Differ. 2007, 14, 128–136. [Google Scholar] [CrossRef] [Green Version]
  166. Dinkel, H.; Michael, S.; Weatheritt, R.J.; Davey, N.E.; Van Roey, K.; Altenberg, B.; Toedt, G.; Uyar, B.; Seiler, M.; Budd, A.; et al. ELM—The database of eukaryotic linear motifs. Nucleic Acids Res. 2012, 40, D242–D251. [Google Scholar] [CrossRef] [Green Version]
  167. Van Roey, K.; Uyar, B.; Weatheritt, R.J.; Dinkel, H.; Seiler, M.; Budd, A.; Gibson, T.J.; Davey, N.E. Short Linear Motifs: Ubiquitous and Functionally Diverse Protein Interaction Modules Directing Cell Regulation. Chem. Rev. 2014, 114, 6733–6778. [Google Scholar] [CrossRef]
  168. Dinkel, H.; Van Roey, K.; Michael, S.; Davey, N.E.; Weatheritt, R.J.; Born, D.; Speck, T.; Kruger, D.; Grebnev, G.; Kuban, M.; et al. The eukaryotic linear motif resource ELM: 10 years and counting. Nucleic Acids Res. 2014, 42, D259–D266. [Google Scholar] [CrossRef]
  169. Dinkel, H.; Van Roey, K.; Michael, S.; Kumar, M.; Uyar, B.; Altenberg, B.; Milchevskaya, V.; Schneider, M.; Kühn, H.; Behrendt, A.; et al. ELM 2016—Data update and new functionality of the eukaryotic linear motif resource. Nucleic Acids Res. 2016, 44, D294–D300. [Google Scholar] [CrossRef]
  170. Tompa, P.; Davey, N.E.; Gibson, T.J.; Babu, M.M. A million peptide motifs for the molecular biologist. Mol. Cell 2014, 55, 161–169. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  171. Gouw, M.; Michael, S.; Sámano-Sánchez, H.; Kumar, M.; Zeke, A.; Lang, B.; Bely, B.; Chemes, L.B.; Davey, N.E.; Deng, Z.; et al. The eukaryotic linear motif resource—2018 update. Nucleic Acids Res. 2018, 46, D428–D434. [Google Scholar] [CrossRef]
  172. Zhou, X.; Kops, O.; Werner, A.; Lu, P.; Shen, M.; Stoller, G.; Kullertz, G.; Stark, M.; Fischer, G.; Lu, K. Pin1-Dependent Prolyl Isomerization Regulates Dephosphorylation of Cdc25C and Tau Proteins. Mol. Cell 2000, 6, 873–883. [Google Scholar] [CrossRef]
  173. Pop, C.; Salvesen, G.S. Human Caspases: Activation, Specificity, and Regulation. J. Biol. Chem. 2009, 284, 21777–21781. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  174. Kalderon, D.; Roberts, B.L.; Richardson, W.D.; Smith, A.E. A Short Amino Acid Sequence Able to Specify Nuclear Location. Cell 1984, 39, 499–509. [Google Scholar] [CrossRef]
  175. Schmid, E.M.; McMahon, H.T. Integrating molecular and network biology to decode endocytosis. Nature 2007, 448, 883–888. [Google Scholar] [CrossRef]
  176. Li, J.; McQuade, T.; Siemer, A.B.; Napetschnig, J.; Moriwaki, K.; Hsiao, Y.S.; Damko, E.; Moquin, D.; Walz, T.; McDermott, A.; et al. The RIP1/RIP3 necrosome forms a functional amyloid signaling complex required for programmed necrosis. Cell 2012, 150, 339–350. [Google Scholar] [CrossRef] [Green Version]
  177. Li, P.; Banjade, S.; Cheng, H.; Kim, S.; Chen, B.; Guo, L.; Llaguno, M.; Hollingsworth, J.; King, D.; Banani, S.; et al. Phase transitions in the assembly of multivalent signalling proteins. Nature 2012, 483, 336–340. [Google Scholar] [CrossRef] [Green Version]
  178. Pfleger, C.; Kirschner, M. The KEN box: An APC recognition signal distinct from the D box targeted by Cdh1. Genes Dev. 2000, 14, 655–665. [Google Scholar] [CrossRef]
  179. He, J.; Chao, W.C.; Zhang, Z.; Yang, J.; Cronin, N.; Barford, D. Insights into Degron Recognition by APC/C Coactivators from the Structure of an Acm1-Cdh1 Complex. Mol. Cell 2013, 50, 649–660. [Google Scholar] [CrossRef] [Green Version]
  180. Gagliardi, P.A.; Primo, L. Irreversible Activation of Rho-activated Kinases Resulted from Evolution of Proteolytic Sites within Disordered Regions in Coiled-coil Domain. Mol. Biol. Evol. 2019, 36, 376–392. [Google Scholar] [CrossRef]
  181. Brautigan, D.; Shenolikar, S. Protein Serine/Threonine Phosphatases: Keys to Unlocking Regulators and Substrates. Annu. Rev. Biochem. 2018, 87, 921–964. [Google Scholar] [CrossRef] [Green Version]
  182. Hossain, M.; Bhalla, K.; Stillman, B. Multiple, short protein binding motifs in ORC1 and CDC6 control the initiation of DNA replication. Mol. Cell 2021, 81, 1951–1969. [Google Scholar] [CrossRef] [PubMed]
  183. Faustova, I.; Loog, M. SLiMs in intrinsically disordered protein regions regulate the cell cycle dynamics of ORC1-CDC6 interaction and pre-replicative complex assembly. Mol. Cell 2021, 81, 1861–1862. [Google Scholar] [CrossRef] [PubMed]
  184. Gibson, T.J. Cell regulation: Determined to signal discrete cooperation. Trends Biochem. Sci. 2009, 34, 471–482. [Google Scholar] [CrossRef] [PubMed]
  185. Van Roey, K.; Dinkel, H.; Weatheritt, R.J.; Gibson, T.J.; Davey, N.E. The switches.ELM resource: A compendium of conditional regulatory interaction interfaces. Sci. Signal. 2013, 6, rs7. [Google Scholar] [CrossRef] [PubMed]
  186. Zanzoni, A.; Ribeiro, D.M.; Brun, C. Understanding protein multifunctionality: From short linear motifs to cellular functions. Cell Mol Life Sci. 2019, 76, 4407–4412. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  187. Dunker, A.K.; Oldfield, C.J.; Meng, J.; Romero, P.; Yang, J.Y.; Chen, J.; Vacic, V.; Obradovic, Z.; Uversky, V.N. The unfoldomics decade: An update on intrinsically disordered proteins. BMC Genom. 2008, 9, S1. [Google Scholar] [CrossRef] [Green Version]
  188. Espinoza-Fonseca, L.M. Reconciling binding mechanisms of intrinsically disordered proteins. Biochem. Biophys. Res. Commun. 2009, 382, 479–482. [Google Scholar] [CrossRef]
  189. Das, R.K.; Crick, S.L.; Pappu, R.V. N-Terminal Segments Modulate the α-Helical Propensities of the Intrinsically Disordered Basic Regions of bZIP Proteins. J. Mol. Biol. 2012, 416, 287–299. [Google Scholar] [CrossRef]
  190. Sharma, R.; Sharma, A.; Patil, A.; Tsunoda, T. Discovering MoRFs by trisecting intrinsically disordered protein sequence into terminals and middle regions. BMC Bioinform. 2019, 19, 378. [Google Scholar] [CrossRef]
  191. Yan, J.; Dunker, A.K.; Uversky, V.N.; Kurgan, L. Molecular recognition features (MoRFs) in three domains of life. Mol. Biosyst. 2016, 12, 697–710. [Google Scholar] [CrossRef] [Green Version]
  192. Chi, S.W.; Lee, S.H.; Kim, D.H.; Ahn, M.J.; Kim, J.S.; Woo, J.Y.; Torizawa, T.; Kainosho, M.; Han, K.H. Structural Details on mdm2-p53 Interaction. J. Biol. Chem. 2005, 280, 38795–38802. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  193. Borcherds, W.; Kashtanov, S.; Wu, H.; Daughdrill, G.W. Structural divergence is more extensive than sequence divergence for a family of intrinsically disordered proteins. Proteins Struct. Funct. Bioinform. 2013, 81, 1686–1698. [Google Scholar] [CrossRef] [PubMed]
  194. Katuwawala, A.; Peng, Z.; Yang, J.; Kurgan, L. Computational Prediction of MoRFs, Short Disorder-to-order Transitioning Protein Binding Regions. Comput. Struct. Biotechnol. J. 2019, 17, 454–462. [Google Scholar] [CrossRef]
  195. Sun, X.; Malhis, N.; Zhao, B.; Xue, B.; Gsponer, J.; Rikkerink, E.H.A. Computational Disorder Analysis in Ethylene Response Factors Uncovers Binding Motifs Critical to Their Diverse Functions. Int. J. Mol. Sci. 2019, 21, 74. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  196. Tompa, P.; Fuxreiter, M.; Oldfield, C.J.; Simon, I.; Dunker, A.K.; Uversky, V.N. Close encounters of the third kind: Disordered domains and the interactions of proteins. BioEssays 2009, 31, 328–335. [Google Scholar] [CrossRef]
  197. Chen, J.W.; Romero, P.; Uversky, V.N.; Dunker, A.K. Conservation of intrinsic disorder in protein domains and families: I. A database of conserved predicted disordered regions. J. Proteome Res. 2006, 5, 879–887. [Google Scholar] [CrossRef] [Green Version]
  198. Chen, J.W.; Romero, P.; Uversky, V.N.; Dunker, A.K. Conservation of Intrinsic Disorder in Protein Domains and Families: II. Functions of Conserved Disorder. J. Proteome Res. 2006, 5, 888–898. [Google Scholar] [CrossRef]
  199. Zhou, J.; Oldfield, C.J.; Yan, W.; Shen, B.; Dunker, A.K. Intrinsically disordered domains: Sequence ➔ disorder ➔ function relationships. Protein Sci. 2019, 28, 1652–1663. [Google Scholar] [CrossRef]
  200. Buljan, M.; Frankish, A.; Bateman, A. Quantifying the mechanisms of domain gain in animal proteins. Genome Biol. 2010, 11, R74. [Google Scholar] [CrossRef] [Green Version]
  201. Russo, A.A.; Jeffrey, P.D.; Patten, A.K.; Massague, J.; Pavletich, N.P. Crystal structure of the p27Kip1 cyclin-dependent-kinase inhibitor bound to the cyclin A-Cdk2 complex. Nature 1996, 382, 325–331. [Google Scholar] [CrossRef]
  202. Galea, C.A.; Nourse, A.; Wang, Y.; Sivakolundu, S.G.; Heller, W.T.; Kriwacki, R.W. Role of intrinsic flexibility in signal transduction mediated by the cell cycle regulator, p27 Kip1. J. Mol. Biol. 2008, 376, 827–838. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  203. Pentony, M.M.; Jones, D.T. Modularity of intrinsic disorder in the human proteome. Proteins Struct. Funct. Bioinform. 2010, 78, 212–221. [Google Scholar] [CrossRef] [PubMed]
  204. Teraguchi, S.; Patil, A.; Standley, D. Intrinsically disordered domains deviate significantly from random sequences in mammalian proteins. BMC Bioinform. 2010, 11, S7. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  205. Manica, G.; Ghenea, S.; Munteanu, C.V.A.; Martin, E.C.; Butnaru, C.; Surleac, M.; Chiritoiu, G.N.; Alexandru, P.R.; Petrescu, A.J.; Petrescu, S.M. EDEM3 Domains Cooperate to Perform Its Overall Cell Functioning. Int. J. Mol. Sci. 2021, 22, 2172. [Google Scholar] [CrossRef]
  206. Rautureau, G.J.; Day, C.L.; Hinds, M.G. Intrinsically disordered proteins in bcl-2 regulated apoptosis. Int. J. Mol. Sci. 2010, 11, 1808–1824. [Google Scholar] [CrossRef] [Green Version]
  207. Kai, M. Roles of RNA-Binding Proteins in DNA Damage Response. Int. J. Mol. Sci. 2016, 17, 310. [Google Scholar] [CrossRef] [Green Version]
  208. Blocquel, D.; Habchi, J.; Gruet, A.; Blangy, S.; Longhi, S. Compaction and binding properties of the intrinsically disordered C-terminal domain of Henipavirus nucleoprotein as unveiled by deletion studies. Mol. BioSyst. 2012, 8, 392–410. [Google Scholar] [CrossRef]
  209. Etoh, Y.; Simon, M.; Green, H. Involucrin acts as transglutaminase substrate at multiple sites. Biochem. Biophys. Res. Commun. 1986, 136, 51–60. [Google Scholar] [CrossRef]
  210. Lynch, W.; Riseman, V.; Bretscher, A. Smooth Muscle CaldesmonIs an Extended Flexible Monomeric Protein in Solution That Can Readily Undergo Reversible Intra- and Intermolecular Sulfhydryl Cross-linking. J. Biol. Chem. 1987, 262, 7429–7437. [Google Scholar] [CrossRef]
  211. Weinreb, P.; Zhen, W.; Poon, A.; Conway, K.; Lansbury, P.J. NACP, a protein implicated in Alzheimer’s disease and learning, is natively unfolded. Biochemistry 1996, 35, 13709–13715. [Google Scholar] [CrossRef]
  212. Huber, R. Conformational flexibility in protein molecules. Nature 1979, 280, 538–539. [Google Scholar] [CrossRef] [PubMed]
  213. Ohgushi, M.; Wada, A. ‘Molten-globule state’: A compact form of globular proteins with mobile side chains. FEBS Lett. 1983, 164, 21–24. [Google Scholar] [CrossRef] [Green Version]
  214. Kuwajima, K. A Folding Model of a-Lactalbumin Deduced from the Three-state Denaturation Mechanism. J. Mol. Biol. 1977, 114, 241–258. [Google Scholar] [CrossRef]
  215. Fasman, G.D. Circular Dichroism and the Conformational Analysis of Biomolecules; Plenum Press: New York, NY, USA, 1996. [Google Scholar]
  216. Dyson, H.J.; Wright, P.E. Insights into the structure and dynamics of unfolded proteins from nuclear magnetic resonance. Adv. Protein Chem. 2002, 62, 311–340. [Google Scholar]
  217. Dyson, H.J.; Wright, P.E. Unfolded proteins and protein folding studied by NMR. Chem. Rev. 2004, 104, 3607–3622. [Google Scholar] [CrossRef]
  218. Poulsen, F.M. A Brief Introduction to NMR Spectroscopy of Proteins. 2002. Available online: https://users.cs.duke.edu/~brd/Teaching/Bio/asmb/Papers/Intro-reviews/flemming.pdf (accessed on 7 November 2022).
  219. Cavalli, A.; Salvatella, X.; Dobson, C.M.; Vendruscolo, M. Protein structure determination from NMR chemical shifts. Proc. Natl. Acad. Sci. USA 2007, 104, 9615–9620. [Google Scholar] [CrossRef] [Green Version]
  220. Kikhney, A.G.; Svergun, D.I. A practical guide to small angle X-ray scattering (SAXS) of flexible and intrinsically disordered proteins. FEBS Lett. 2015, 589, 2570–2577. [Google Scholar] [CrossRef] [Green Version]
  221. Korasick, D.A.; Tanner, J.J. Determination of protein oligomeric structure from small-angle X-ray scattering. Protein Sci. 2018, 27, 814–824. [Google Scholar] [CrossRef]
  222. Da Vela, S.; Svergun, D.I. Methods, development and applications of small-angle X-ray scattering to characterize biological macromolecules in solution. Curr. Res. Struct. Biol. 2020, 27, 164–170. [Google Scholar] [CrossRef]
  223. Grawert, T.W.; Svergun, D.I. Structural Modeling Using Solution Small-Angle X-ray Scattering (SAXS). J. Mol. Biol. 2020, 432, 3078–3092. [Google Scholar] [CrossRef]
  224. Jacques, D.A.; Trewhella, J. Small-angle scattering for structural biology--Expanding the frontier while avoiding the pitfalls. Protein Sci. 2010, 19, 642–657. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  225. Czaplewski, C.; Gong, Z.; Lubecka, E.A.; Xue, K.; Tang, C.; Liwo, A. Recent Developments in Data-Assisted Modeling of Flexible Proteins. Front. Mol. Biosci. 2021, 8, 765562. [Google Scholar] [CrossRef] [PubMed]
  226. Perez, J.; Vachette, P.; Russo, D.; Desmadril, M.; Durand, D. Heat-induced unfolding of neocarzinostatin, a small all-β protein investigated by small-angle X-ray scattering. J. Mol. Biol. 2001, 308, 721–743. [Google Scholar] [CrossRef] [PubMed]
  227. Bernado, P.; Blackledge, M. A Self-Consistent Description of the Conformational Behavior of Chemically Denatured Proteins from NMR and Small Angle Scattering. Biophys. J. 2009, 97, 2839–2845. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  228. Bernado, P.; Svergun, D.I. Structural analysis of intrinsically disordered proteins by small-angle X-ray scattering. Mol. Biosyst. 2012, 8, 151–167. [Google Scholar] [CrossRef]
  229. Jeffries, C.M.; Svergun, D.I. High-Throughput Studies of Protein Shapes and Interactions by Synchrotron Small-Angle X-ray Scattering. In Structural Proteomics; Methods in Molecular Biology; Owens, R., Ed.; Humana: New York, NY, USA, 2015; Volume 1261, pp. 277–301. [Google Scholar]
  230. Graewert, M.A.; Jeffries, C.M. Sample and Buffer Preparation for SAXS. Adv. Exp. Med. Biol. 2017, 1009, 11–30. [Google Scholar]
  231. David, G.; Perez, J. Combined sampler robot and high-performance liquid chromatography: A fully automated system for biological small-angle X-ray scattering experiments at the Synchrotron SOLEIL SWING beamline. J. Appl. Crystallogr. 2009, 42, 892–900. [Google Scholar] [CrossRef]
  232. Schulte, T.; Lofling, J.; Mikaelsson, C.; Kikhney, A.; Hentrich, K.; Diamante, A.; Ebel, C.; Normark, S.; Svergun, D.; Birgitta, H.N.; et al. The basic keratin 10-binding domain of the virulence-associated pneumococcal serine-rich protein PsrP adopts a novel MSCRAMM fold. Open Biol. 2014, 41, 30090. [Google Scholar] [CrossRef]
  233. Rodriguez, Z.P. Conjugation of NMR and SAXS for flexible and multidomain protein structure determination: From sample preparation to model refinement. Prog. Biophys. Mol. Biol. 2020, 150, 140–144. [Google Scholar] [CrossRef]
  234. Gast, K.; Damaschun, H.; Eckert, K.; Schulze-Forster, K.; Maurer, H.R.; Muller-Frohne, M.; Zirwer, D.; Czarnecki, J.; Damaschun, G. Prothymosin alpha: A biologically active protein with random coil conformation. Biochemistry 1995, 34, 13211–13218. [Google Scholar] [CrossRef]
  235. Zhang, Y. I-TASSER server for protein 3D structure prediction. BMC Bioinform. 2008, 9, 40. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  236. Petoukhov, M.V.; Franke, D.; Shkumatov, A.V.; Tria, G.; Kikhney, A.G.; Gajda, M.; Gorba, C.; Mertens, H.D.T.; Konarev, P.V.; Svergun, D.I. New developments in the ATSAS program package for small-angle scattering data analysis. J. Appl. Crystallogr. 2012, 45, 342–350. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  237. Gijsbers, A.; Sánchez-Puig, N.; Gao, Y.; Peters, P.J.; Ravelli, R.B.G.; Siliqi, D. Structural Analysis of the Partially Disordered Protein EspK from Mycobacterium Tuberculosis. Crystals 2021, 11, 18. [Google Scholar] [CrossRef]
  238. Nogales, E. The development of cryo-EM into a mainstream structural biology technique. Nat. Methods 2016, 13, 24–27. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  239. Shoemaker, S.C.; Ando, N. X-rays in the cryo-electron microscopy era: Structural biology’s dynamic future. Bichemistry 2018, 57, 277–285. [Google Scholar] [CrossRef] [PubMed]
  240. Hanske, J.; Sadian, Y.; Muller, C.W. The cryo-em resolution revolution and transcription complexes. Curr. Opin. Struct. Biol. 2018, 52, 8–15. [Google Scholar] [CrossRef]
  241. Nwanochie, E.; Uversky, V.N. Structure Determination by Single-Particle Cryo-Electron Microscopy: Only the Sky (and Intrinsic Disorder) is the Limit. Int. J. Mol. Sci. 2019, 20, 4186. [Google Scholar] [CrossRef] [Green Version]
  242. Cheng, Y. Single-particle cryo-EM-How did it get here and where will it go. Science 2018, 361, 876–880. [Google Scholar] [CrossRef] [Green Version]
  243. Musselman, C.A.; Kutateladze, T.G. Characterization of functional disordered regions within chromatin-associated proteins. IScience 2021, 24, 102070. [Google Scholar] [CrossRef]
  244. Bharat, T.A.M.; Russo, C.J.; Lowe, J.; Passmore, L.A.; Scheres, S.H.W. Advances in single-particle electron cryo microscopy structure determination applied to sub-tomogram averaging. Structure 2015, 23, 1743–1753. [Google Scholar] [CrossRef] [Green Version]
  245. Yan, Z.; Bai, X.; Yan, C.; Wu, J.; Li, Z.; Xie, T.; Peng, W.; Yin, C.; Li, X.; Scheres, S.H.W.; et al. Structure of the rabbit ryanodine receptor ryr1 at near-atomic resolution. Nature 2015, 517, 50–55. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  246. Gibbs, E.B.; Cook, E.C.; Showalter, S.A. Application of NMR to studies of intrinsically disordered proteins. Arch. Biochem. Biophys. 2017, 628, 57–70. [Google Scholar] [CrossRef] [PubMed]
  247. Geraets, J.A.; Pothula, K.R.; Schroder, G.F. Integrating cryo-EM and NMR data. Curr. Opin. Struc. Biol. 2020, 61, 173–181. [Google Scholar] [CrossRef] [PubMed]
  248. Prilusky, J.; Felder, C.E.; Zeev-Ben-Mordehai, T.; Rydberg, E.H.; Man, O.; Beckmann, J.S.; Silman, I.; Sussman, J.L. FoldIndex(C): A simple tool to predict whether a given protein sequence is intrinsically unfolded. Bioinformatics 2005, 21, 3435–3438. [Google Scholar] [CrossRef]
  249. Liu, J.; Rost, B. NORSp: Predictions of long regions without regular secondary structure. Nucleic Acids Res. 2003, 31, 3833–3835. [Google Scholar] [CrossRef] [Green Version]
  250. Linding, R.; Russell, R.; Neduva, V.; Gibson, T. GlobPlot: Exploring protein sequences for globularity and disorder. Nucleic Acids Res. 2003, 31, 3701–3708. [Google Scholar] [CrossRef] [Green Version]
  251. Coeytaux, K.; Poupon, A. Prediction of unfolded segments in a protein sequence based on amino acid composition. Bioinformatics 2005, 21, 1891–1900. [Google Scholar] [CrossRef] [Green Version]
  252. Di Domenico, T.; Walsh, I.; Martin, A.J.; Tosatto, S.C. MobiDB: A comprehensive database of intrinsic protein disorder annotations. Bioinformatics 2012, 28, 2080–2081. [Google Scholar] [CrossRef] [Green Version]
  253. Fukuchi, S.; Sakamoto, S.; Nobe, Y.; Murakami, S.D.; Amemiya, T.; Hosoda, K.; Koike, R.; Hiroaki, H.; Ota, M. IDEAL: Intrinsically Disordered proteins with Extensive Annotations and Literature. Nucleic Acids Res. 2012, 40, D507–D511. [Google Scholar] [CrossRef]
  254. Romero, P.; Obradovic, Z.; Dunker, A.K. Folding minimal sequences: The lower bound for sequence complexity of globular proteins. FEBS Lett. 1999, 462, 363–367. [Google Scholar] [CrossRef] [Green Version]
  255. Vullo, A.; Bortolami, O.; Pollastri, G.; Tosatto, S.C. Spritz: A server for the prediction of intrinsically disordered regions in protein sequences using kernel machines. Nucleic Acids Res. 2006, 34, W164–W168. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  256. Linding, R.; Jensen, L.; Diella, F.; Bork, P.; Gibson, T.; Russell, R. Protein Disorder Prediction: Implications for Structural Proteomics. Structure 2003, 11, 1453–1459. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  257. Yang, Z.R.; Thomson, R.; McNeil, P.; Esnouf, R.M. RONN: The bio-basis function neural network technique applied to the detection of natively disordered regions in proteins. Bioinformatics 2005, 21, 3369–3376. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  258. Ward, J.J.; McGuffin, L.J.; Bryson, K.; Buxton, B.F.; Jones, D.T. The DISOPRED server for the prediction of protein disorder. Bioinformatics 2004, 20, 2138–2139. [Google Scholar] [CrossRef] [Green Version]
  259. Jones, D.T.; Cozzetto, D. DISOPRED3: Precise disordered region predictions with annotated protein-binding activity. Bioinformatics 2015, 31, 857–863. [Google Scholar] [CrossRef] [Green Version]
  260. Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Zidek, A.; Potapenko, A.; et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar] [CrossRef]
  261. Zweckstetter, M. NMR hawk-eyed view of AlphaFold2 structures. Protein Sci. 2021, 30, 2333–2337. [Google Scholar] [CrossRef]
  262. AlQuraishi, M. Protein-structure prediction revolutionized. Nature 2021, 596, 487–488. [Google Scholar] [CrossRef]
  263. He, X.H.; You, C.Z.; Jiang, H.L.; Jiang, Y.; Xu, H.E.; Cheng, X. AlphaFold2 versus experimental structures: Evaluation on G protein-coupled receptors. Acta Pharmacol. Sin. 2022. [Google Scholar] [CrossRef]
  264. Ruff, K.M.; Pappu, R.V. AlphaFold and Implications for Intrinsically Disordered Proteins. J. Mol. Biol. 2021, 433, 167208. [Google Scholar] [CrossRef]
  265. Varadi, M.; Anyango, S.; Deshpande, M.; Nair, S.; Natassia, C.; Yordanova, G.; Yuan, D.; Stroe, O.; Wood, G.; Laydon, A.; et al. AlphaFold Protein Structure Database: Massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res. 2022, 50, D439–D444. [Google Scholar] [CrossRef] [PubMed]
  266. Tunyasuvunakool, K.; Adler, J.; Wu, Z.; Green, T.; Zielinski, M.; Zidek, A.; Bridgland, A.; Cowie, A.; Meyer, C.; Laydon, A.; et al. Highly accurate protein structure prediction for the human proteome. Nature 2021, 596, 590–596. [Google Scholar] [CrossRef] [PubMed]
  267. Dosztanyi, Z.; Csizmok, V.; Tompa, P.; Simon, I. IUPred: Web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy content. Bioinformatics 2005, 21, 3433–3434. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  268. Dosztanyi, Z.; Csizmok VTompa, P.; Simon, I. The Pairwise Energy Content Estimated from Amino Acid Composition Discriminates between Folded and Intrinsically Unstructured Proteins. J. Mol. Biol. 2005, 347, 827–839. [Google Scholar] [CrossRef] [PubMed]
  269. Garbuzynskiy, S.O.; Lobanov, M.Y.; Galzitskaya, O.V. To be folded or to be unfolded? Protein Sci. 2008, 13, 2871–2877. [Google Scholar] [CrossRef] [PubMed]
  270. Schlessinger, A.; Punta, M.; Rost, B. Natively unstructured regions in proteins identified from contact predictions. Bioinformatics 2007, 23, 2376–2384. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  271. Xue BDunbrack, R.L.; Williams, R.W.; Dunker, A.K.; Uversky, V.N. PONDR-FIT: A meta-predictor of intrinsically disordered amino acids. Biochim. Biophys. Acta 2010, 1804, 996–1010. [Google Scholar]
  272. Lieutaud, P.; Canard, B.; Longhi, S. MeDor: A metaserver for predicting protein disorder. BMC Genom. 2008, 9, S25. [Google Scholar] [CrossRef] [Green Version]
  273. Ishida, T.; Kinoshita, K. Prediction of disordered regions in proteins based on the meta-approach. Bioinformatics 2008, 24, 1344–1348. [Google Scholar] [CrossRef] [Green Version]
  274. Meng, F.; Uversky, V.N.; Kurgan, L. Comprehensive review of methods for prediction of intrinsic disorder and its molecular functions. Cell. Mol. Life Sci. 2017, 74, 3069–3090. [Google Scholar] [CrossRef]
  275. Liu, Y.; Wang, X.; Liu, B. A comprehensive review and comparison of existing computational methods for intrinsically disordered protein and region prediction. Brief Bioinform. 2019, 20, 330–346. [Google Scholar] [CrossRef] [PubMed]
  276. Kurgan, L. Resources for computational prediction of intrinsic disorder in proteins. Methods 2022, 204, 132–141. [Google Scholar] [CrossRef] [PubMed]
  277. Jordan, I.; Kondrashov, F.; Adzhubei, I.A.; Wolf, Y.; Koonin, E.; Kondrashov, A.; Sunyaev, S. A universal trend of amino acid gain and loss in protein evolution. Nature 2005, 433, 633–638. [Google Scholar] [CrossRef] [PubMed]
  278. Brooks, D.J.; Fresco, J.R. Increased Frequency of Cysteine, Tyrosine, and Phenylalanine Residues Since the Last Universal Ancestor. Mol. Cell. Proteom. 2002, 1, 125–131. [Google Scholar] [CrossRef] [Green Version]
  279. Cristofari, G.; Darlix, J.L. The ubiquitous nature of RNA chaperone proteins. Prog. Nucl. Acid Res. Mol. Biol. 2002, 72, 223–268. [Google Scholar]
  280. Doolittle, W.F. Uprooting the tree of life. Sci. Am. 2000, 282, 90–95. [Google Scholar] [CrossRef]
  281. Theobald, D.L. A formal test of the theory of universal common ancestry. Nature 2010, 465, 219–222. [Google Scholar] [CrossRef]
  282. Van Oss, S.B.; Carvunis, A.R. De novo gene birth. PLoS Genet. 2019, 15, e1008160. [Google Scholar] [CrossRef] [Green Version]
  283. Tompa, P. Intrinsically unstructured proteins evolve by repeat expansion. BioEssays 2003, 25, 847–855. [Google Scholar] [CrossRef]
  284. Lise, S.; Jones, D. Sequence Patterns Associated with Disordered Regions in Proteins. Proteins Struct. Funct. Genet. 2005, 58, 144–150. [Google Scholar] [CrossRef] [PubMed]
  285. Brown, C.J.; Johnson, A.K.; Dunker, A.K.; Daughdrill, G.W. Evolution and disorder. Curr. Opin. Struct. Biol. 2011, 21, 441–446. [Google Scholar] [CrossRef] [PubMed]
  286. Pancsa, R.; Tompa, P. Structural Disorder in Eukaryotes. PLoS ONE 2012, 7, e34687. [Google Scholar] [CrossRef] [PubMed]
  287. Meyer, K.; Kirchner, M.; Uyar, B.; Cheng, J.Y.; Russo, G.; Hernandez-Miranda, L.R.; Szymborska, A.; Zauber, H.; Rudolph, I.M.; Willnow, T.E.; et al. Mutations in Disordered Regions Can Cause Disease by Creating Dileucine Motifs. Cell 2018, 175, 239–253. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  288. Pajkos, M.; Zeke, A.; Dosztányi, Z. Ancient Evolutionary Origin of Intrinsically Disordered Cancer Risk Regions. Biomolecules 2020, 10, 1115. [Google Scholar] [CrossRef] [PubMed]
  289. Meszaros, B.; Hajdu-Soltesz, B.; Zeke, A.; Dosztanyi, Z. Mutations of Intrinsically Disordered Protein Regions Can Drive Cancer but Lack Therapeutic Strategies. Biomolecules 2021, 11, 381. [Google Scholar] [CrossRef]
  290. Seera, S.; Nagarajaram, H.A. Effect of Disease Causing Missense Mutations on Intrinsically Disordered Regions in Proteins. Protein Pept. Lett. 2022, 29, 254–267. [Google Scholar] [CrossRef]
  291. Peng, Z.; Uversky, V.; Kurgan, L. Genes encoding intrinsic disorder in Eukaryota have high GC content. Intrinsically Disord. Proteins 2016, 4, e1262225. [Google Scholar] [CrossRef]
  292. Basile, W.; Sachenkova, O.; Light, S.; Elofsson, A. High GC content causes orphan proteins to be intrinsically disordered. PLoS Comput. Biol. 2017, 13, e1005375. [Google Scholar] [CrossRef] [Green Version]
  293. Brown, C.J.; Takayama, S.; Campen, A.M.; Vise, P.; Marshall, T.W.; Oldfield, C.J.; Williams, C.J.; Dunker, A.K. Evolutionary Rate Heterogeneity in Proteins with Long Disordered Regions. J. Mol. Evol. 2002, 55, 104–110. [Google Scholar] [CrossRef]
  294. Homma, K.; Anbo, H.; Noguchi, T.; Fukuchi, S. Both Intrinsically Disordered Regions and Structural Domains Evolve Rapidly in Immune-Related Mammalian Proteins. Int. J. Mol. Sci. 2018, 19, 3860. [Google Scholar] [CrossRef] [Green Version]
  295. Pancsa, R.; Zsolyomi, F.; Tompa, P. Co-Evolution of Intrinsically Disordered Proteins with Folded Partners Witnessed by Evolutionary Couplings. Int. J. Mol. Sci. 2018, 19, 3315. [Google Scholar] [CrossRef] [PubMed]
  296. Zarin, T.; Strome, B.; Nguyen Ba, A.N.; Alberti, S.; Forman-Kay, J.D.; Moses, A.M. Proteome-wide signatures of function in highly diverged intrinsically disordered regions. eLife 2019, 8, e46883. [Google Scholar] [CrossRef] [PubMed]
  297. Bellay, J.; Han, S.; Michaut, M.; Kim, T.; Costanzo, M.; Andrews, B.J.; Boone, C.; Bader, G.D.; Myers, C.L.; Kim, P.M. Bringing order to protein disorder through comparative genomics and genetic interactions. Genome Biol. 2011, 12, R14. [Google Scholar] [CrossRef] [PubMed]
  298. Covaceuszach, S.; Peche, L.Y.; Konarev, P.V.; Lamba, D. A combined evolutionary and structural approach to disclose the primary structural determinants essential for proneurotrophins biological functions. Comput. Struct. Biotechnol. J. 2021, 19, 2891–2904. [Google Scholar] [CrossRef]
  299. Millard, P.S.; Bugge, K.; Marabini, R.; Boomsma, W.; Burow, M.; Kragelund, B.B. IDDomainSpotter: Compositional bias reveals domains in long disordered protein regions-Insights from transcription factors. Protein Sci. 2020, 29, 169–183. [Google Scholar] [CrossRef] [Green Version]
  300. Cohan, M.C.; Shinn, M.K.; Lalmansingh, J.M.; Pappu, R.V. Uncovering Non-random Binary Patterns Within Sequences of Intrinsically Disordered Proteins. J. Mol. Biol. 2022, 434, 167373. [Google Scholar] [CrossRef]
  301. Lu, A.X.; Lu, A.X.; Pritisanac, I.; Zarin, T.; Forman-Kay, J.D.; Moses, A.M. Discovering molecular features of intrinsically disordered regions by using evolution for contrastive learning. PLoS Comput. Biol. 2022, 18, e1010238. [Google Scholar] [CrossRef]
  302. Gsponer, J.; Futschik, M.E.; Teichmann, S.A.; Babu, M.M. Tight Regulation of Unstructured Proteins: From Transcript Synthesis to Protein Degradation. Science 2008, 322, 1365–1368. [Google Scholar] [CrossRef] [Green Version]
  303. Uversky, V.N.; Oldfield, C.J.; Dunker, A.K. Intrinsically disordered proteins in human diseases: Introducing the D2 concept. Annu. Rev. Biophys. 2008, 37, 215–246. [Google Scholar] [CrossRef]
  304. Langella, E.; Buonanno, M.; De Simone, G.; Monti, S.M. Intrinsically disordered features of carbonic anhydrase IX proteoglycan-like domain. Cell. Mol. Life Sci. 2021, 78, 2059–2067. [Google Scholar] [CrossRef]
  305. Chang, B.S.; Minn, A.J.; Muchmore, S.W.; Fesik, S.W.; Thompson, C.B. Identification of a novel regulatory domain in Bcl-X(L) and Bcl-2. EMBO J. 1997, 16, 968–977. [Google Scholar] [CrossRef] [PubMed]
  306. Lee, H.; Mok, K.H.; Muhandiram, R.; Park, K.; Suk, J.; Kim, D.; Chang, J.; Sung, Y.C.; Choi, K.Y.; Han, K. Local Structural Elements in the Mostly Unstructured Transcriptional Activation Domain of Human p53. J. Biol. Chem. 2000, 275, 29426–29432. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  307. Garg, N.; Kumar, P.; Gadhave, K.; Giri, R. The dark proteome of cancer: Intrinsic disorderedness and functionality of HIF-1α along with its interacting proteins. Prog. Mol. Biol. Transl. Sci. 2019, 166, 371–403. [Google Scholar] [PubMed]
  308. Glenner, G.G.; Wong, C.W. Alzheimer’s disease and Down’s syndrome: Sharing of a unique cerebrovascular amyloid fibril protein. Biochem. Biophys. Res. Commun. 1984, 122, 1131–1135. [Google Scholar] [CrossRef]
  309. Lee, V.; Balin, B.; Otvos, L.J.; Trojanowski, J. A68: A major subunit of paired helical filaments and derivatized forms of normal Tau. Science 1991, 251, 675–678. [Google Scholar] [CrossRef] [PubMed]
  310. Ueda, K.; Fukushima, H.; Masliah, E.; Xia, Y.; Iwai, A.; Yoshimoto, M.; Otero, D.; Kondo, J.; Ihara, Y.; Saitoh, T. Molecular cloning of cDNA encoding an unrecognized component of amyloid in Alzheimer disease. Proc. Natl. Acad. Sci. USA 1993, 90, 11282–11286. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  311. Prusiner, S. Shattuck lecture: Neurodegenerative diseases and prions. N. Engl. J. Med. 2001, 344, 1516–1526. [Google Scholar] [CrossRef]
  312. Eftekharzadeh BHyman, B.T.; Wegmann, S. Structural studies on the mechanism of protein aggregation in age related neurodegenerative diseases. Mech. Ageing Dev. 2016, 156, 1–13. [Google Scholar] [CrossRef]
  313. Kovacs, G.G. Concepts and classification of neurodegenerative diseases. Handb. Clin. Neurol. 2017, 145, 301–307. [Google Scholar]
  314. Martinelli, A.H.S.; Lopes, F.C.; John, E.B.O.; Carlini, C.R.; Ligabue-Braun, R. Modulation of Disordered Proteins with a Focus on Neurodegenerative Diseases and Other Pathologies. Int. J. Mol. Sci. 2019, 20, 1322. [Google Scholar] [CrossRef] [Green Version]
  315. Kulkarni, P.; Uversky, V.N. Intrinsically Disordered Proteins in Chronic Diseases. Biomolecules 2019, 9, 147. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  316. Monti, S.M.; De Simone, G.; Langella, E. The Amazing World of IDPs in Human Diseases. Biomolecules 2021, 11, 333. [Google Scholar] [CrossRef] [PubMed]
  317. Monti, S.M.; De Simone, G.; Langella, E. The Amazing World of IDPs in Human Diseases II. Biomolecules 2022, 12, 369. [Google Scholar] [CrossRef] [PubMed]
  318. Brundin, P.; Melki, R. Prying into the Prion Hypothesis for Parkinson’s Disease. J. Neurosci. 2017, 37, 9808–9818. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  319. Weller, J.; Budson, A. Current understanding of Alzheimer’s disease diagnosis and treatment. F1000Research 2018, 7, 1161. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  320. Dos Santos Picanco, L.C.; Ozela, P.F.; de Fatima de Brito Brito, M.; Pinheiro, A.A.; Padilha, E.C.; Braga, F.S.; de Paula da Silva, C.H.T.; Dos Santos, C.B.R.; Rosa, J.M.C.; da Silva Hage-Melim, L.I. Alzheimer’s Disease: A Review from the Pathophysiology to Diagnosis, New Perspectives for Pharmacological Treatment. Curr. Med. Chem. 2018, 25, 3141–3159. [Google Scholar] [CrossRef] [PubMed]
  321. Ledeen, R.W.; Wu, G. Gangliosides, α-Synuclein, and Parkinson’s Disease. Prog. Mol. Biol. Transl. Sci. 2018, 156, 435–454. [Google Scholar]
  322. DeTure, M.A.; Dickson, D.W. The neuropathological diagnosis of Alzheimer’s disease. Mol. Neurodegener. 2019, 14, 32. [Google Scholar] [CrossRef] [Green Version]
  323. Coskuner, O.; Uversky, V.N. Intrinsically disordered proteins in various hypotheses on the pathogenesis of Alzheimer’s and Parkinson’s diseases. Prog. Mol. Biol. Transl. Sci. 2019, 166, 145–223. [Google Scholar]
  324. Foster, E.M.; Dangla-Valls, A.; Lovestone, S.; Ribe, E.M.; Buckley, N.J. Clusterin in Alzheimer’s Disease: Mechanisms, Genetics, and Lessons from Other Pathologies. Front. Neurosci. 2019, 13, 164. [Google Scholar] [CrossRef] [Green Version]
  325. Coskuner-Weber, O.; Mirzanli, O.; Uversky, V.N. Intrinsically disordered proteins and proteins with intrinsically disordered regions in neurodegenerative diseases. Biophys. Rev. 2022, 14, 679–707. [Google Scholar] [CrossRef] [PubMed]
  326. Akbayrak, I.Y.; Caglayan, S.I.; Ozcan, Z.; Uversky, V.N.; Coskuner-Weber, O. Current Challenges and Limitations in the Studies of Intrinsically Disordered Proteins in Neurodegenerative Diseases by Computer Simulations. Curr. Alzheimer Res. 2020, 17, 805–818. [Google Scholar] [CrossRef] [PubMed]
  327. Roterman, I.; Stapor, K.; Fabian, P.; Konieczny, L. In Silico Modeling of the Influence of Environment on Amyloid Folding Using FOD-M Model. Int. J. Mol. Sci. 2021, 19, 10587. [Google Scholar] [CrossRef] [PubMed]
  328. Tsvetkov, P.; Myers, N.; Adler, J.; Shaul, Y. Degradation of Intrinsically Disordered Proteins by the NADH 26S Proteasome. Biomolecules 2020, 10, 1642. [Google Scholar] [CrossRef]
  329. Gadhave, K.; Kumar, P.; Kapuganti, S.K.; Uversky, V.N.; Giri, R. Unstructured Biology of Proteins from Ubiquitin-Proteasome System: Roles in Cancer and Neurodegenerative Diseases. Biomolecules 2020, 10, 796. [Google Scholar] [CrossRef] [PubMed]
  330. Ortega-Alarcon, D.; Claveria-Gimeno, R.; Vega, S.; Jorge-Torres, O.C.; Esteller, M.; Abian, O.; Velazquez-Campoy, A. Molecular Context-Dependent Effects Induced by Rett Syndrome-Associated Mutations in MeCP2. Biomolecules 2020, 10, 1533. [Google Scholar] [CrossRef]
  331. Ortega-Alarcon, D.; Claveria-Gimeno, R.; Vega, S.; Jorge-Torres, O.C.; Esteller, M.; Abian, O.; Velazquez-Campoy, A. Stabilization Effect of Intrinsically Disordered Regions on Multidomain Proteins: The Case of the Methyl-CpG Protein 2, MeCP2. Biomolecules 2021, 11, 1216. [Google Scholar] [CrossRef]
  332. Neira, J.L.; Rizzuti, B.; Jimenez-Alesanco, A.; Palomino-Schätzlein, M.; Abian, O.; Velazquez-Campoy, A.; Iovanna, J.L. A Phosphorylation-Induced Switch in the Nuclear Localization Sequence of the Intrinsically Disordered NUPR1 Hampers Binding to Importin. Biomolecule 2020, 10, 1313. [Google Scholar] [CrossRef]
  333. Wong, E.T.C.; So, V.; Guron, M.; Kuechler, E.R.; Malhis, N.; Bui, J.M.; Gsponer, J. Protein-Protein Interactions Mediated by Intrinsically Disordered Protein Regions Are Enriched in Missense Mutations. Biomolecules 2020, 10, 1097. [Google Scholar] [CrossRef]
  334. Raut, K.K.; Ponniah, K.; Pascal, S.M. Structural Analysis of the cl-Par-4 Tumor Suppressor as a Function of Ionic Environment. Biomolecules 2021, 11, 3386. [Google Scholar] [CrossRef]
  335. Wang, J.; Cao, Z.; Zhao, L.; Li, S. Novel strategies for drug discovery based on Intrinsically Disordered Proteins (IDPs). Int. J. Mol. Sci. 2011, 12, 3205–3219. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  336. Melo, A.M.; Coraor, J.; Alpha-Cobb, G.; Elbaum-Garfinkle, S.; Nath, A.; Rhoades, E. A functional role for intrinsic disorder in the tau-tubulin complex. Proc. Natl. Acad. Sci. USA 2016, 113, 14336–14341. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  337. Hu, G.; Wu, Z.; Uversky, V.N.; Kurgan, L. Functional Analysis of Human Hub Proteins and Their Interactors Involved in the Intrinsic Disorder-Enriched Interactions. Int. J. Mol. Sci. 2017, 18, 2761. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  338. Necci, M.; Piovesan, D. CAID Predictors, DisProt Curators, Tosatto SCE Critical assessment of protein intrinsic disorder prediction. Nat. Methods 2021, 18, 472–481. [Google Scholar] [CrossRef] [PubMed]
  339. Necci, M.; Piovesan, D.; Tosatto, S.C. Large-scale analysis of intrinsic disorder flavors and associated functions in the protein sequence universe. Protein Sci. 2016, 25, 2164–2174. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  340. Lin, X.; Li, X.; Lin, X. A Review on Applications of Computational Methods in Drug Screening and Design. Molecules 2020, 25, 1375. [Google Scholar] [CrossRef] [Green Version]
  341. Larsson, P.; Engqvist, H.; Biermann, J.; Werner Rönnerman, E.; Forssell-Aronsson, E.; Kovács, A.; Karlsson, P.; Helou, K.; Parris, T.Z. Optimization of cell viability assays to improve replicability and reproducibility of cancer drug sensitivity screens. Sci. Rep. 2020, 10, 5798. [Google Scholar] [CrossRef] [Green Version]
  342. Kitaeva, K.V.; Rutland, C.S.; Rizvanov, A.A.; Solovyeva, V.V. Cell Culture Based in vitro Test Systems for Anticancer Drug Screening. Front. Bioeng. Biotechnol. 2020, 8, 322. [Google Scholar] [CrossRef] [Green Version]
  343. Rothenaigner, I.; Hadian, K. Brief Guide: Experimental Strategies for High-Quality Hit Selection from Small-Molecule Screening Campaigns. SLAS Discov. 2021, 7, 851–854. [Google Scholar] [CrossRef]
  344. Tanoli, Z.; Aldahdooh, J.; Alam, F.; Wang, Y.; Seemab, U.; Fratelli, M.; Pavlis, P.; Hajduch, M.; Bietrix, F.; Gribbon, P.; et al. Minimal information for chemosensitivity assays (MICHA): A next-generation pipeline to enable the FAIRification of drug screening experiments. Brief Bioinform. 2022, 23, bbab350. [Google Scholar] [CrossRef]
  345. Rizzuti, B.; Lan, W.; Santofimia-Castaño, P.; Zhou, Z.; Velázquez-Campoy, A.; Abián, O.; Peng, L.; Neira, J.L.; Xia, Y.; Iovanna, J.L. Design of Inhibitors of the Intrinsically Disordered Protein NUPR1: Balance between Drug Affinity and Target Function. Biomolecules 2021, 11, 1453. [Google Scholar] [CrossRef] [PubMed]
  346. Choudhary, S.; Lopus, M.; Hosur, R.V. Targeting disorders in unstructured and structured proteins in various diseases. Biophys. Chem. 2022, 281, 106742. [Google Scholar] [CrossRef] [PubMed]
  347. Metallo, S.J. Intrinsically disordered proteins are potential drug targets. Curr. Opin. Chem. Biol. 2010, 14, 481–488. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  348. Kumar, D.; Sharma, N.; Giri, R. Therapeutic Interventions of Cancers Using Intrinsically Disordered Proteins as Drug Targets: C-Myc as Model System. Cancer Inform. 2017, 16, 1176935117699408. [Google Scholar] [CrossRef]
  349. Chen, H.; Liu, H.; Qing, G. Targeting oncogenic Myc as a strategy for cancer treatment. Signal Transduct Target Ther. 2018, 3, 5. [Google Scholar] [CrossRef] [Green Version]
  350. Na, I.; Choi, S.; Son, S.H.; Uversky, V.N.; Kim, C.G. Drug Discovery Targeting the Disorder-To-Order Transition Regions through the Conformational Diversity Mimicking and Statistical Analysis. Int. J. Mol. Sci. 2020, 21, 5248. [Google Scholar] [CrossRef]
  351. Kim, M.Y.; Na, I.; Kim, J.S.; Son, S.H.; Choi, S.; Lee, S.E.; Kim, J.H.; Jang, K.; Alterovitz, G.; Chen, Y.; et al. Rational discovery of antimetastatic agents targeting the intrinsically disordered region of MBD2. Sci. Adv. 2019, 5, eaav9810. [Google Scholar] [CrossRef] [Green Version]
  352. Brabletz, T.; Kalluri, R.; Nieto, M.A.; Weinberg, R.A. EMT in cancer. Nat. Rev. Cancer 2018, 18, 128–134. [Google Scholar] [CrossRef]
  353. Cheng, Y.; LeGall, T.; Oldfield, C.J.; Mueller, J.P.; Van, Y.Y.; Romero, P.; Cortese, M.S.; Uversky, V.N.; Dunker, A.K. Rational drug design via intrinsically disordered protein. Trends Biotechnol. 2006, 24, 435–442. [Google Scholar] [CrossRef]
  354. Uversky, V.N. Intrinsically disordered proteins and novel strategies for drug discovery. Expert Opin. Drug Discov. 2012, 7, 475–488. [Google Scholar] [CrossRef]
  355. Uversky, V.N. Intrinsic Disorder, Protein-Protein Interactions, and Disease. Adv. Protein Chem. Struct. Biol. 2018, 110, 85–121. [Google Scholar] [PubMed]
  356. Vassilev, L.T.; Vu, B.T.; Graves, B.; Carvajal, D.; Podlaski, F.; Filipovic, Z.; Kong, N.; Kammlott, U.; Lukacs, C.; Klein, C.; et al. In Vivo Activation of the p53 Pathway by Small-Molecule Antagonists of MDM2. Science 2004, 303, 844–848. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  357. Neira, J.L.; Bintz, J.; Arruebo, M.; Rizzuti, B.; Bonacci, T.; Vega, S.; Lanas, A.; Velázquez-Campoy, A.; Iovanna, J.L.; Abián, O. Identification of a Drug Targeting an Intrinsically Disordered Protein Involved in Pancreatic Adenocarcinoma. Sci. Rep. 2017, 7, 39732. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  358. Mishra, P.; Choudhary, S.; Mukherjee, S.; Sengupta, D.; Sharma, S.; Hosur, R.V. Molten globule nature of Plasmodium falciparum P2 homo-tetramer. Biochem. Biophys. Rep. 2015, 1, 97–107. [Google Scholar] [CrossRef] [PubMed]
  359. Santofimia-Castano, P.; Rizzuti, B.; Xia, Y.; Abian, O.; Peng, L.; Velazquez-Campoy, A.; Neira, J.L.; Iovanna, J. Targeting intrinsically disordered proteins involved in cancer. Cell Mol Life Sci. 2020, 77, 1695–1707. [Google Scholar] [CrossRef] [Green Version]
  360. Pietrobono, D.; Giacomelli, C.; Trincavelli, M.L.; Daniele, S.; Martini, C. Inhibitors of protein aggregates as novel drugs in neurodegenerative diseases. Glob. Drugs Ther. 2017, 2, 1–5. [Google Scholar]
  361. Save, S.S.; Rachineni, K.; Hosur, R.V.; Choudhary, S. Natural compound safranal driven inhibition and dis-aggregation of α-synuclein fibrils. Int. J. Biol. Macromol. 2019, 141, 585–595. [Google Scholar] [CrossRef]
Figure 1. The two paradigms of protein structure and function. According to the well-established ‘structure–function paradigm’, a three-dimensional native structure under physiological conditions is vital for a protein to perform its biological function (for example, enzyme-catalyzed reactions). The more recent ‘disorder–function paradigm’ states that a protein can carry out its biological function without attaining a 3-D stable folded structure under physiological conditions (for example, protein binding to other cellular molecules). For representative purpose, residues coding for ordered/globular domains are shown in ‘green’ color, and residues coding for disordered proteins/segments are shown in ‘red’. At the proteome level, the structured domains and intrinsically disordered regions (IDRs) are two functional building blocks of proteins.
Figure 1. The two paradigms of protein structure and function. According to the well-established ‘structure–function paradigm’, a three-dimensional native structure under physiological conditions is vital for a protein to perform its biological function (for example, enzyme-catalyzed reactions). The more recent ‘disorder–function paradigm’ states that a protein can carry out its biological function without attaining a 3-D stable folded structure under physiological conditions (for example, protein binding to other cellular molecules). For representative purpose, residues coding for ordered/globular domains are shown in ‘green’ color, and residues coding for disordered proteins/segments are shown in ‘red’. At the proteome level, the structured domains and intrinsically disordered regions (IDRs) are two functional building blocks of proteins.
Ijms 23 14050 g001
Figure 2. Functional aspects of IDPs/IDRs. Intrinsically disordered proteins/regions’ functional classes and elements are described here. The functional class scheme describes eight different categories into which IDPs/IDRs can be grouped based on their biological function. The different functional classes include Entropic chains, Modification sites, Disordered chaperones, Molecular effectors, Molecular recognition assemblers, Molecular recognition scavengers, Metal sponges, and Unknown. IDPs/IDRs’ functions are mediated mainly through three types of structural elements, namely Short Linear Motifs (SLiMs), Molecular Recognition Features (MORFs), and Intrinsically Disordered Domains (IDDS).
Figure 2. Functional aspects of IDPs/IDRs. Intrinsically disordered proteins/regions’ functional classes and elements are described here. The functional class scheme describes eight different categories into which IDPs/IDRs can be grouped based on their biological function. The different functional classes include Entropic chains, Modification sites, Disordered chaperones, Molecular effectors, Molecular recognition assemblers, Molecular recognition scavengers, Metal sponges, and Unknown. IDPs/IDRs’ functions are mediated mainly through three types of structural elements, namely Short Linear Motifs (SLiMs), Molecular Recognition Features (MORFs), and Intrinsically Disordered Domains (IDDS).
Ijms 23 14050 g002
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Trivedi, R.; Nagarajaram, H.A. Intrinsically Disordered Proteins: An Overview. Int. J. Mol. Sci. 2022, 23, 14050. https://doi.org/10.3390/ijms232214050

AMA Style

Trivedi R, Nagarajaram HA. Intrinsically Disordered Proteins: An Overview. International Journal of Molecular Sciences. 2022; 23(22):14050. https://doi.org/10.3390/ijms232214050

Chicago/Turabian Style

Trivedi, Rakesh, and Hampapathalu Adimurthy Nagarajaram. 2022. "Intrinsically Disordered Proteins: An Overview" International Journal of Molecular Sciences 23, no. 22: 14050. https://doi.org/10.3390/ijms232214050

APA Style

Trivedi, R., & Nagarajaram, H. A. (2022). Intrinsically Disordered Proteins: An Overview. International Journal of Molecular Sciences, 23(22), 14050. https://doi.org/10.3390/ijms232214050

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop