Previous Article in Journal
Small Extracellular Vesicle (sEV) Uptake from Lung Adenocarcinoma and Squamous Cell Carcinoma Alters T-Cell Cytokine Expression and Modulates Protein Profiles in sEV Biogenesis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Intrinsic Disorder and Phase Separation Coordinate Exocytosis, Motility, and Chromatin Remodeling in the Human Acrosomal Proteome

by
Shivam Shukla
1,†,
Sean S. Lastorka
2,† and
Vladimir N. Uversky
2,3,*
1
Department of Integrative Biology, College of Arts and Sciences, University of South Florida-St. Petersburg, 140 7th Ave. South, St. Petersburg, FL 33701, USA
2
Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA
3
USF Health Byrd Alzheimer’s Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Proteomes 2025, 13(2), 16; https://doi.org/10.3390/proteomes13020016
Submission received: 15 January 2025 / Revised: 23 April 2025 / Accepted: 25 April 2025 / Published: 28 April 2025

Abstract

:
Intrinsic disorder refers to protein regions that lack a fixed three−dimensional structure under physiological conditions, enabling conformational plasticity. This flexibility allows for diverse functions, including transient interactions, signaling, and phase separation via disorder-to-order transitions upon binding. Our study focused on investigating the role of intrinsic disorder and liquid−liquid phase separation (LLPS) in the human acrosome, a sperm-specific organelle essential for fertilization. Using computational prediction models, network analysis, Structural Classification of Proteins (SCOP) functional assessments, and Gene Ontology, we analyzed 250 proteins within the acrosomal proteome. Our bioinformatic analysis yielded 97 proteins with high levels (>30%) of structural disorder. Further analysis of functional enrichment identified associations between disordered regions overlapping with SCOP domains and critical acrosomal processes, including vesicle trafficking, membrane fusion, and enzymatic activation. Examples of disordered SCOP domains include the PLC-like phosphodiesterase domain, the t-SNARE domain, and the P-domain of calnexin/calreticulin. Protein–protein interaction networks revealed acrosomal proteins as hubs in tightly interconnected systems, emphasizing their functional importance. LLPS propensity modeling determined that over 30% of these proteins are high-probability LLPS drivers (>60%), underscoring their role in dynamic compartmentalization. Proteins such as myristoylated alanine-rich C-kinase substrate and nuclear transition protein 2 exhibited both high LLPS propensities and high levels of structural disorder. A significant relationship (p < 0.0001, R² = 0.649) was observed between the level of intrinsic disorder and LLPS propensity, showing the role of disorder in facilitating phase separation. Overall, these findings provide insights into how intrinsic disorder and LLPS contribute to the structural adaptability and functional precision required for fertilization, with implications for understanding disorders associated with the human acrosome reaction.

1. Introduction

1.1. Spermiogenesis, Structure, Formation, and Function of the Acrosome

Spermatogenesis is the process in which spermatozoa are produced from spermatogonial stem cells. The process takes place in the seminiferous tubules of the testis starting at puberty and continuing for the entirety of an adult male’s life. In the early phases of spermatogenesis, also known as the proliferation phase, spermatogonial stem cells multiply through mitosis and develop into the primary spermatocyte. The primary spermatocytes then undergo meiosis, first dividing into secondary spermatocytes and ultimately resulting in spermatids. The spermatids will then advance to the epididymis where spermiogenesis will take place. During spermiogenesis, the spermatids undergo cytoplasmic rearrangement to form the motile spermatozoa. To form mature spermatozoa, the nucleus elongates, mitochondria condense into the midpiece, the cytoplasm streamlines, the flagellum develops, and the specialized acrosomal organelle forms [1,2].
The acrosome is a sac-like organelle located in the heads of mature spermatozoa and plays a key role in fertilization. It is subdivided into anterior and posterior sections. The membranes of this organelle can be divided into two regions, the outer acrosomal membrane (OAM), which interacts with the plasma membrane, and the inner acrosomal membrane (IAM), which is intricately connected with the nuclear envelope. The lumen of the acrosome, also referred to as the acrosomal matrix, is where its many hydrolytic, proteolytic, receptor, and regulatory proteins are found [1].
The formation of the acrosome starts in the Golgi phase of spermatogenesis as proacrosomal vesicles are released from the Golgi apparatus and fuse together. It then grows and spreads over the anterior half of the nuclear envelope during the cap phase. In the elongation phase, the acrosomal matrix is formed. In the maturation phase, the acrosome takes its characteristic cap-like shape, and the components of the matrix organize themselves into the anterior and posterior sections [1,2].
The main function of the acrosome in fertilization is through the acrosome reaction, binding to and penetrating the zona pellucida (ZP), and fusion of the sperm and egg membrane [1]. The acrosome reaction is calcium ion mediated exocytosis, which releases many hydrolytic and proteolytic enzymes, allowing the sperm to penetrate the oocyte. This reaction is triggered by glycoproteins on the ZP, as well as by the progesterone binding to receptors on the sperm’s plasma membrane, triggering a G protein signal transduction cascade. Once the acrosome reaction has taken place, the protein components are released and begin to interact with the ZP, allowing for the penetration and fusion processes to occur [3].
The decline of male fertility has become a major concern in the 21st century, with sperm dysfunction playing a central role in many infertility cases [4]. Prior studies have demonstrated that IDPs can play essential roles in sperm development, as shown by the IDP SPE− −18 in Caenorhabditis elegans, which is required for the assembly of fibrous bodies and proper progression of spermatogenesis [5].

1.2. Intrinsic Disorder in Human Proteins

Intrinsically disordered proteins (IDPs) and intrinsically disordered regions (IDRs) represent a unique class of functional proteins that lack a stable three-dimensional structure under physiological conditions [6,7,8,9,10,11,12,13,14]. Unlike globular proteins, which adopt well-defined folds, IDPs/IDRs are characterized by structural plasticity, allowing them to exist as dynamic ensembles of rapidly interconverting conformations (or highly dynamic sets of short-lived structures) [7,9,11]. In fact, this inability to fold is unequally distributed within a protein molecule, with its different parts being under-folded to different degrees. This defines the mosaic structure and astonishing multi-level spatiotemporal heterogeneity of IDP/IDR that represent a complex combination of foldons (independently foldable units), inducible foldons (disordered regions that can (partially) fold at interaction with the binding partners), morphing inducible foldons (disordered regions that can differently fold at interaction with different binding partners), non-foldons (non-foldable protein regions), semi-foldons (regions that are always in a semi-folded form), and unfoldons (ordered regions that have to undergo an order-to-disorder transition to become functional) [15,16]. One should also keep in mind that it is extremely unlikely that different foldons in a given protein would possess identical conformational stability. Furthermore, foldons are known to continually unfold and refold even under native conditions [17,18,19]. As a result, at any given moment, even a well-folded, ordered protein would have a mosaic structure, possessing a set of temporary folded and unfolded foldons [16]. Intrinsic disorder is crucial for structural and functional diversification of proteins contributing to the proteoform concept and the protein structure–function continuum model, where multiple proteoforms (i.e., different molecular forms in which the protein product of a single gene can be found) with different structural features and various functions constitute a highly dynamic conformational ensemble in which a given protein exists [15,20,21].
This inherent flexibility facilitates diverse biological roles and functions, such as in cell signaling, molecular recognition, as well as the regulation of various cellular processes [22,23,24,25,26,27,28,29,30], with the functional repertoire of IDPs/IDRs being typically complementary to functions of ordered proteins and domains [6,10,13,31,32]. The functional versatility of IDPs is linked to their disorder, where differently under-folded pieces of the protein structural mosaic might have well-defined and specific functions [16]. Such structural mosaic defines complex ‘anatomy’ of an IDP that might contain multiple relatively short, differently ordered/disordered functional elements defining its complex molecular ‘physiology’ reflected in its multifunctionality and ability to be involved in interaction with, regulation of, and be controlled by multiple structurally unrelated partners [16,33]. Furthermore, the lack of a fixed structure allows IDPs/IDRs to become engaged in unrestrained interactions with multiple molecules [6,9,10,12,13,14,15,23,34,35,36], often through disorder-to-order transitions upon binding, being even capable of folding differently at interaction with different partners [37,38,39]. This conformational adaptability enhances their binding affinity and specificity, making IDPs important players in cellular processes like protein–protein and protein–nucleic acid interactions [6,40]. Furthermore, IDPs often serve as hubs in interaction networks, and have roles in signaling cascades and regulatory pathways [38,41,42,43,44,45]. Furthermore, structural ‘floppiness’ also defines the ability of IDP/IDR to be controlled and regulated at multiple levels [15,35,40,46], with various post-translational modifications (PTMs) being one of the most important means of such disorder-centered regulation [47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63].
Bioinformatics studies indicated that IDPs and hybrid proteins with ordered domains and IDRs are highly abundant in nature [64,65,66,67,68,69,70,71]. The prevalence of intrinsic disorder in proteomes correlates positively with organismal complexity. Eukaryotic proteomes, for example, exhibit a significantly higher proportion of disorder compared to prokaryotes, with estimates suggesting that more than 60% of eukaryotic proteins contain at least one disordered region. This correlation underscores the evolutionary advantage and function of disorder in facilitating complex regulatory mechanisms and molecular interactions essential for multicellular life [72,73,74,75,76,77,78,79,80,81].
Despite their functional benefits, the unstructured nature of IDPs renders them susceptible to misfolding and aggregation under dysregulated conditions. This susceptibility is shown in numerous human diseases, including neurodegenerative disorders, cancer, and cardiovascular diseases [82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101,102,103]. For example, amyloidogenic IDPs such as α-synuclein and tau are central to the pathology of Parkinson’s disease and Alzheimer’s disease, respectively [104,105]. The duality of IDPs, as both functional and potentially pathological entities, highlights the importance of understanding the mechanisms underlying their behavior.
Moreover, the interplay between order and disorder in proteins extends their functional use. As was already emphasized, many IDPs exhibit hybrid structural features, comprising ordered domains alongside disordered regions. This arrangement allows for spatial and temporal modulation of protein activity, as seen in proteins involved in transcriptional regulation and signal transduction [106]. The coupling of intrinsic disorder with specific post-translational modifications further increases their functional diversity and adaptability, showing their significance in cellular homeostasis and evolution.
The study of IDPs is important, as it provides insights into the principles of protein structure and function, challenging the traditional structure–function paradigm. Their intrinsic disorder, while structurally unconventional, is an essential part of their biological utility, from mediating dynamic interactions to organizing cellular architecture through phase separation. As the roles of IDPs in health and disease continue to be studied, more work remains to understand their structural and functional biology and biophysical properties.

1.3. Spontaneous Liquid–Liquid Phase Separation

Intrinsically disordered proteins (IDPs) play a central role in liquid–liquid phase separation (LLPS) [107,108,109,110,111,112,113,114,115,116], a process where biomolecules condense into dense, protein- and nucleic acid-rich droplets that are distinct from the surrounding dilute phase. These phase-separated assemblies, or biomolecular condensates, contribute to cellular organization by forming membraneless organelles, such as nucleoli, stress granules, and Cajal bodies [117,118]. The ability of IDPs to drive LLPS is dictated by their unique sequence characteristics, including charge distribution, hydropathy, and aromatic content, as well as their enrichment in low-complexity domains and intrinsically disordered regions. These features cause multivalent interactions through weak, transient associations that promote the assembly of dense biomolecular condensates.
The sequence composition of IDPs influences LLPS. For instance, the patterning of charged residues within an IDP, such as the distribution of positively and negatively charged amino acids, can modulate phase behavior. Sequences with block-like charge arrangements exhibit larger phase separation windows than those with randomly distributed charges. This relationship shows the importance of electrostatic interactions in IDP-driven coacervation [119,120]. Similarly, aromatic and hydrophobic residues in IDPs can contribute to phase separation by mediating π-π stacking or hydrophobic interactions within condensates [118].
IDPs associated with LLPS serve diverse biological functions. For instance, phase-separated droplets act as reservoirs for biomolecules, regulate biochemical reactions by concentrating reactants, and serve as structural scaffolds for assembling macromolecular complexes [121,122]. IDP-mediated phase separation also plays a critical role in pathological contexts. Aberrant LLPS has been implicated in the formation of amyloid-like aggregates in diseases such as amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD), where phase-separated intermediates composed of IDPs, such as FUS and TDP-43, transition to pathological solid states [123]. Although these examples stem from neurodegenerative diseases, similar dysregulation of phase separation and intrinsic disorder in the acrosome could potentially contribute to sperm dysfunction and male infertility.
From a biophysical perspective, the ability of IDPs to undergo LLPS is regulated by environmental conditions, including pH, salt concentration, and temperature. These factors change the physicochemical properties of IDPs, such as their net charge and hydrophobicity, thereby influencing the phase diagram of their condensates [120]. Advanced computational and experimental approaches, including molecular dynamics simulations and fluorescence microscopy, have been employed to understand the molecular principles regulating IDP-mediated LLPS. Field-theoretic simulation (FTS) approaches, for example, have provided insights into how charge patterning and sequence variation dictate phase behavior in polyampholyte IDPs [119,120].
Overall, the relationship between IDPs and LLPS highlights a mechanism of cellular organization that operates beyond the classical paradigms of compartmentalization. The ability of IDPs to phase-separate, forming dynamic and tunable assemblies, highlights their biological importance and function, as well as their potential as research targets in therapeutic interventions for modulating biomolecular condensates.

2. Materials and Methods

2.1. Compilation of Human Acrosomal Proteins and Visualization

On 19 September 2024, proteins related to the human acrosome were compiled from UniProt (https://www.uniprot.org (accessed 19 September 2024)) using the search term ‘acrosome OR acrosomal’ and filtering the results to only include SwissProt reviewed, human entries. UniProt IDs, protein names, gene names, sequences in FASTA format and functional information were extracted for each protein and used to build the dataset.
If available, experimentally derived protein structures were obtained from the Protein Data Bank (https://www.rcsb.org (accessed 19 September 2024)). If they were not available, predicted protein structures were obtained from the AlphaFold Protein Structure Database (https://alphafold.ebi.ac.uk (accessed 19 September 2024)).
An overview of our computational methods is shown in a flowchart (Figure 1) for clarity.

2.2. Computational Prediction of Intrinsic Disorder in Human Acrosomal Proteins

The prediction of intrinsic disorder in the dataset of proteins was conducted using six widely recognized computational models, accessed via the Rapid Intrinsic Disorder Analysis Online RIDAO platform (https://ridao.app (accessed 24 September 2024)) [124]. These models include PONDR® VLXT, which integrates three neural networks designed to predict disorder across the full length of a protein sequence, combining results from terminal and internal region predictors [125]. PONDR® VL3, which focuses on long disordered regions and uses multiple neural networks to generate predictions [126]. PONDR® VSL2B which uses support vector machines (SVMs) and is optimized for both short and long disordered regions [127]. The IUPred model, which is different from the previous machine learning models, estimates disorder by evaluating inter-residue interaction energies, relying on energy-based potentials to predict regions unlikely to form stable structures, and which is run in two modes, to predict short and long IDRs [128,129]. Finally, PONDR-FIT is an artificial neural network (ANN) meta-predictor that combines outputs from multiple disorder prediction models [130]. These models were aggregated by RIDAO [124] to provide a comprehensive disorder prediction profile for each acrosomal protein.
Protein disorder was evaluated using two metrics obtained from each of the six models: the average disorder score (ADS) and the percentage of predicted intrinsically disordered residues (PPIDR). For each protein, the ADS was calculated by taking the arithmetic mean of all residue-specific disorder scores. The PPIDR was determined by counting the number of residues with a disorder score greater than 0.5, dividing this by the total number of residues, and expressing the result as a percentage. Using these metrics, we classified proteins into distinct categories. Based on ADS, proteins were considered highly ordered if their score was below 0.25, moderately disordered if between 0.25 and 0.5, and highly disordered if 0.5 or higher. Similarly, using PPIDR, proteins were classified as highly ordered (PPIDR < 10%), moderately disordered (10% ≤ PPIDR < 30%), or highly disordered (PPIDR ≥ 30%). These categories were based on established baselines of disorder found in the literature.
Then, RIDAO was further used to obtain the outputs from two binary predictors, the charge-hydropathy (CH) plot [11] and the cumulative distribution function (CDF) plot [131], which were combined to generate a CH-CDF phase diagram [132,133,134]. This allowed for the classification of proteins based on their location within the CH-CDF phase space. Proteins in Quadrant I can be categorized as ordered (predicted to be ordered by both models), Quadrant III can be categorized as native coils or pre-molten globules (disordered by both methods), Quadrant II can be categorized as hybrid proteins or putative native molten globules (ordered by CH but disordered by CDF), and Quadrant IV can be categorized as proteins that are disordered according to the CH plot but ordered by the CDF plot [132].

2.3. Impact of Intrinsic Disorder on Function in Human Acrosomal Proteins

Disorder predictions and related functional annotations for the dataset of proteins were retrieved from the D2P2 database, which provides predictions for certain indexed proteins from fully sequenced genomes [135]. D2P2 aggregates results from several well-established computational per-residue disorder prediction models, including PONDR® VLXT [125], PONDR® VSL2B [127], IUPred (both short and long forms) [128,129], PrDOS [136], PV2, and three variants of ESpritz (NMR, DisProt, and X-ray) [137]. The predicted locations of disordered regions are displayed as nine colored bars in the D2P2 visual interface, each representing a different model. In addition to individual predictions, D2P2 also presents a consensus view of disorder across all predictors using a blue-green-white bar, where blue indicates regions where the disorder predictions intersect the SCOP domain prediction and green indicates regions that represent disorder that is not found within a predicted SCOP domain [135]. Above this consensus bar, two lines with numbered, colored bars show the predicted locations of SCOP (Structural Classifications of Proteins) domains [138,139], identified by the SUPERFAMILY predictor [140]. Empirically confirmed post-translational modifications (PTMs), identified by the PhosphoSitePlus platform [141], are shown as differently colored circles at the bottom of the plot. Functional annotations of disordered regions are also provided, such as the predicted disorder-based binding sites (MoRF regions) identified by the ANCHOR algorithm [142,143], which are displayed as yellow patterned bars.
Acrosomal proteins in the dataset were classified based on the presence of at least one residue with 75% consensus for predicted disorder and at least one strong SCOP domain hit via D2P2. Proteins meeting these criteria were further analyzed to identify instances of overlap between consensus disordered regions and SCOP domains, which may suggest potential functional implications of the intrinsic disorder. The superfamilies of the SCOP domains affected by disordered regions were further analyzed based on the number of residues impacted and to determine whether certain SCOP superfamilies were more frequently affected by disorder within proteins associated with the acrosome.

2.4. Gene Ontology Enrichment Analysis of Acrosomal Proteins

Gene Ontology (GO) enrichment analysis was conducted using the g:GOSt tool, part of the g:Profiler (https://biit.cs.ut.ee/gprofiler/gost (accessed 26 September 2024)), to understand the biological processes, molecular functions, and cellular components associated with proteins that exhibited overlap between SCOP domains and regions of predicted intrinsic disorder. The analysis was performed specifically on proteins with at least one SCOP functional domain and 75% consensus for disordered residues, as identified through the D2P2 platform.
The tool g:GOSt looks at functional enrichment analysis by mapping input proteins to known GO terms and testing for overrepresentation of specific GO categories in the proteins of interest. The tool uses a hypergeometric test to assess enrichment, and statistical significance is controlled using the Benjamini–Hochberg false discovery rate (FDR) correction to minimize type I errors. For the purposes of our study, g:GOSt was used to categorize proteins into three main GO functional domains: Biological Process (BP) (e.g., cellular activities, reproductive functions), Molecular Function (MF) (e.g., binding activity, enzymatic activity), and Cellular Component (CC) (e.g., membrane regions, subcellular structures). Only significantly enriched terms with an adjusted FDR < 0.05 were considered in the final analysis. In addition to GO terms, g:GOSt also yielded information on KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways, Reactome pathways, and other functional categories, including protein complexes and human phenotype ontology. This approach identified important biological roles and functional characteristics associated with disorder in SCOP domains. This will allow for insight into potential functional implications of disordered regions in acrosomal proteins.

2.5. Intractability Analysis of Human Acrosomal Proteins

The interactions of the 250 acrosomal proteins were retrieved and visualized using Search Tool for the Retrieval of Interacting Genes (STRING) (https://string-db.org (accessed 6 October 2025)). STRING utilizes experimentally validated information and predictions to generate models of protein–protein interactions (PPI) [144]. Proteins in the network maps are represented as nodes and the interactions are represented as edges. The model’s interactions are broken up into seven categories, each represented by different color edges. Known interactions are represented by light blue for evidence from a database and purple represents experimentally determined interactions. For predicted interactions, green represents proteins in the same gene neighborhood, red represents gene fusion, blue represents gene co-occurrence, black represents co-expression, light purple represents protein homology, and green represents text mining evidence [144].
String was utilized to generate three types of models: a PPI network between the 250 human acrosomal proteins, a PPI network centered on the 250 human acrosomal proteins, and individual networks centered on individual human acrosomal proteins. Models were generated at 4 different confidence intervals 0.15 (low confidence), 0.4 (medium confidence), 0.7 (high confidence), and 0.9 (highest confidence) and were run with a maximum of 500 interactions in the first shell for the global and the individual PPI networks [106,145]. For the PPI networks centered on individual human acrosomal proteins, a confidence threshold of 0.4 was used to maintain consistency across all networks and to align with thresholds commonly used in previous studies [145]. With each model, STRING embedded processes generate network statistics, such as the number of nodes, the average node degree which represents the average number of interactions per protein, the average clustering coefficient which is a number between 0 and 1 (0 represents each neighboring protein which are only connected to one protein, while 1 represents each neighboring protein which are interacting with each other), and a PPI enrichment p-value, which is the probability of having more interactions than predicted for a random set of proteins [106,144,145,146].

2.6. Liquid–Liquid Phase Separation Propensity in Human Acrosomal Proteins

To evaluate the potential of the acrosomal proteins for liquid–liquid phase separation (LLPS), we employed FuzDrop, a computational tool that predicts LLPS propensity based on protein sequence features [147,148]. FuzDrop uses sequence complexity and specific interaction motifs, such as cation-π or hydrophobic contacts, to determine the likelihood of droplet state formation in proteins. The LLPS analysis provided insights into sequence regions that support non-specific interactions crucial for droplet formation.
For each protein, we calculated the probability of forming a droplet state (probability of liquid–liquid phase separation; pLLPS) through FuzDrop [147,148]. Proteins with a pLLPS of ≥0.60 were classified as droplet-drivers, meaning they can independently undergo LLPS. On the other hand, proteins with pLLPS < 0.60 were classified as droplet-clients, which need additional interactions to partition into condensates. These droplet-clients may still participate in LLPS through specific droplet-promoting regions, defined by consecutive residues with high probability values (pDP ≥ 0.60). This analysis allowed us to distinguish between proteins that can spontaneously phase-separate and those dependent on cellular or environmental contexts.
Next, droplet-promoting regions were identified based on residue-level droplet-promoting probabilities (pDP), with regions classified by at least five consecutive residues scoring pDP ≥ 0.60. These regions are considered main contributors to phase separation propensity and were used to assess LLPS potential, including within proteins classified as droplet-clients. We then assessed the presence of aggregation hot-spots, which are residues with high droplet-promoting probabilities (pDP ≥ 0.60) and high interaction mode divergence (SBIND ≥ 2.2). These hot-spots indicate a predisposition to aggregation within condensates, especially under conditions that promote a shift from liquid-like to solid-like states. The analysis also included cellular context-dependence, with SBIND values calculated to show the potential for residues to switch between binding modes based on cellular environment. Residues with SBIND ≥ 2.25 were classified as context-dependent interaction zones, which means that the binding mode diversity might influence LLPS behavior and potential of aggregation.
To analyze the relationship between the probability of liquid–liquid phase separation (pLLPS) and the percentage of predicted intrinsically disordered residues (PPIDR) from three models (PONDR® VXLT, PONDR® VL3, and PONDR® VSL2B), we employed a regression model. Due to the presence of boundary values in pLLPS (0 and 1), we adjusted these values to avoid computational issues by applying small corrections (0 replaced with 10−6, and 1 replaced with 1 10−6). Then, a Box–Cox transformation was used on the adjusted pLLPS values to normalize the distribution, with an optimal lambda of 0.0606 determined empirically. This transformation resulted in the creation of a new variable, transformed pLLPS, which we used as the response variable in the model.
Then, we fitted a second-degree polynomial regression model to assess the relationship between transformed pLLPS and PPIDR, capturing any potential non-linear patterns in the data. We then conducted diagnostic analyses to evaluate the assumptions underlying our regression model. A Residuals vs. Fitted plot was generated to assess linearity, allowing us to verify that residuals displayed no systematic pattern across fitted values. To evaluate the normality of residuals, we used a Q–Q plot, comparing the distribution of residuals against a theoretical normal distribution. Then, we used a Scale-Location plot to assess homoscedasticity, to ensure that the residuals maintained a constant level of variance across the fitted values. Lastly, we assessed the Residuals vs. Leverage plot to identify any potential influential data points with high leverage or large residuals that might disproportionally affect the model.

2.7. Data Analysis via R

All data analysis was conducted in R (version 4.4.2) with RStudio (version 2024.04.2+764) by employing a wide range of packages to process, visualize, and analyze the acrosomal protein dataset. Data manipulation was performed using the packages dplyr and tidyr, which allowed for the merging of datasets, reshaping data from wide to long formats, and calculating summary statistics. Importation and management of CSV and Excel data files was completed using the packages readr and readxl, respectively. For correlation analysis and model fitting, R’s built-in stats package functions were used. The ggplot2 package was used to create various visualizations, including scatter plots, boxplots, histograms, and pie charts. The package gridExtra was applied for arranging multiple plots into a single figure. Additional enhancements to these plots, such as label placement in scatter plots, were achieved using the package ggrepel. To create formatted tables of results, the kableExtra package was utilized.

3. Results

3.1. Compilation of Human Acrosomal Proteins and Visualization

The UniProt database search yielded a total of 250 proteins associated with the human acrosome, which were used for the present study. Representative examples of these 250 proteins were selected and are shown in Figure 2, which shows illustrative examples of experimentally derived protein structures obtained from the Protein Data Bank, and Figure 3, which displays predicted structures of the 10 most disordered proteins obtained from the AlphaFold Protein Structure Database.

3.2. Computational Prediction of Intrinsic Disorder in Human Acrosomal Proteins

Using RIDAO, we generated disorder predictions for our 250 acrosomal proteins, analyzing each protein across the models. Key descriptive statistics for 3 of the models (PONDR® VXLT, PONDR® VSL2B, and PONDR® VL3) disorder scores (minimum, mean, median, and maximum) are summarized in Table 1. For example, as per PONDR® VL3, the mean PPIDR score across all proteins was 33.37, with values ranging from 0 to 100, indicating a wide variation in intrinsic disorder across our dataset. Similarly, based on corresponding ADS values, the PONDR® VXLT model exhibited the lowest minimum score (0.0602), while PONDR® VSL2B had the highest maximum score (0.9907). A series of boxplots to show percent disorder values across our dataset between different models was also created (Figure 4).
These findings indicate that the degree of intrinsic disorder in acrosomal proteins spans a broad spectrum, potentially reflecting diversity of their disorder-based functionalities. Proteins with low disorder scores are likely to be involved in roles requiring structural stability, such as enzymatic catalysis, transport, scaffolding, or maintaining the architecture of the acrosome, whereas those with high disorder scores may contribute to dynamic processes, such as vesicle trafficking or molecular recognition. The observed range suggests a selective advantage for both ordered and disordered structures within the acrosomal proteome, enabling the acrosome to fulfill its complex roles in fertilization.
We classified proteins by their PPIDR values into three categories: highly ordered (0−10%), moderately disordered (10−30%), and highly disordered (>30%). As illustrated in Figure 5, this classification showed that 97 proteins were highly disordered, 106 were moderately disordered, and 47 were highly ordered on average across all models. These proportions emphasize a skew toward moderate and high levels of disorder in the acrosomal protein dataset. This distribution aligns with previous findings in other specialized cellular structures, such as the nucleolus and synaptic vesicles [153,154], where intrinsic disorder is prevalent and facilitates rapid assembly and disassembly of macromolecular complexes. The high proportion of moderately and highly disordered proteins in the acrosomal dataset may similarly reflect an adaptation for flexible interactions required during the fertilization process.
Next, we examined the relationship between the PPIDR and ADS scores to better understand correlations between these two disorder metrics. As shown in Figure 6, the scatter plot of PPIDR vs. ADS reveals a consistent trend (p < 0.001) with higher ADS values generally associated with increased PPIDR, suggesting a relationship between these metrics across the majority of proteins. This trend further supports the idea that disorder−related properties, such as extended conformations or binding promiscuity, are integral to the function of many acrosomal proteins. Notably, proteins with high ADS values may exhibit increased potential for transient interactions, such as binding to sperm or oocyte receptors, or participating in acrosomal vesicle fusion.
We further categorized our proteins based on Charge-Hydropathy (CH) and Cumulative Distribution Function (CDF) scores. Figure 7A shows a scatter CH vs. CDF graph, whereas Figure 7B presents a bar chart showing quadrant distributions, where each quadrant represents a different prediction alignment between CH and CDF: Q1 (lower-right quadrant) includes proteins predicted as ordered by both CH and CDF; Q2 (lower-left quadrant) includes those ordered by CH but disordered by CDF; Q3 (upper-left quadrant) includes proteins disordered by both predictors; and Q4 (upper-right quadrant) represents those disordered by CH but ordered by CDF. This quadrant analysis provides a structured view of how each protein’s disorder may vary based on predictor alignment.
The quadrant distribution offers additional insight into the complexity of disorder prediction and its functional implications. Q3 proteins, predicted as disordered by both CH and CDF, align well with the high levels of intrinsic disorder observed in the acrosomal proteome and likely represent proteins with a high degree of flexibility essential for dynamic processes like vesicle fusion or enzymatic activation. In contrast, Q1 proteins, which are ordered by both predictors, may play more static structural roles, supporting the acrosomal matrix or organizing its architecture. Q2 and Q4 proteins, where predictor alignments diverge, present a middle ground. Proteins in Q2 (ordered by CH but disordered by CDF) may exhibit conditional disorder, becoming functional only under specific physiological conditions such as changes in pH, ion concentration, or binding partner availability. Similarly, Q4 proteins (disordered by CH but ordered by CDF) may represent cases where localized order is embedded within a largely disordered protein, enabling dual functionalities. These nuances highlight how prediction alignments offer additional perspectives on the roles of intrinsic disorder in the acrosomal proteome.
These findings suggest the critical role of intrinsic disorder in acrosomal function. The acrosome, as a specialized organelle, must balance structural stability with dynamic adaptability during sperm capacitation, the acrosome reaction, and fertilization. Our analysis highlights how intrinsic disorder in acrosomal proteins contributes to this balance, enabling complex processes such as enzymatic activation, membrane remodeling, and molecular recognition. These results provide a foundation for future studies investigating how intrinsic disorder in acrosomal proteins contributes to reproductive biology and potential implications for fertility disorders.

3.3. Impact of Disorder on Function in Human Acrosomal Proteins

To further examine intrinsic disorder in acrosomal proteins, we utilized D2P2 to identify proteins with disordered residues in SCOP functional domains, using a 75% agreement threshold. Out of 233 proteins initially analyzed, 190 met the high−confidence disorder threshold, and 148 were found to contain identifiable SCOP domains. Among these, 48 proteins showed overlap between disordered regions and SCOP domains. Figure 8 displays the number of proteins progressing through each analysis stage, ultimately identifying the 48 proteins with SCOP-associated disordered regions. Furthermore, Table 2 provides a comprehensive list of SCOP superfamilies affected by disorder, highlighting those that are commonly disordered and frequently observed across different proteins. The identification of these SCOP-associated disordered regions underscores the selective pressure for disorder in specific functional domains, which may play a crucial role in the specialized functions of acrosomal proteins.
The most common SCOP superfamilies affected by disorder included domains associated with structural flexibility and protein–protein interactions. This prevalence supports the hypothesis that intrinsic disorder is a key driver of functional adaptability, particularly in dynamic environments like the acrosome. Several superfamilies appeared multiple times across different proteins, indicating a widespread occurrence of intrinsic disorder within certain functional domains. Notably, the PLC-like phosphodiesterase superfamily showed the highest incidence of disorder, with 111 disordered residues across multiple proteins (1-phosphatidylinositol 4,5-bisphosphate phosphodiesterase delta-4 (UniProt ID: Q9BRC7) and 1-phosphatidylinositol 4,5-bisphosphate phosphodiesterase beta-1 (UniProt ID: Q9NQ66)). This observation suggests that intrinsic disorder within the PLC-like domains may facilitate enzymatic flexibility and regulation during membrane-related signaling processes, which are vital in the context of sperm-egg recognition and fertilization.
The t-SNARE proteins superfamily also exhibited significant disorder, with 73 residues disordered across proteins such as Syntaxin-2 (UniProt ID: P32856), Syntaxin-1B (UniProt ID: P61266), and Syntaxin-1A (UniProt ID: Q16623). The disordered regions in these proteins are likely critical for vesicle docking and membrane fusion, processes essential for acrosomal exocytosis. Additionally, the Concanavalin A-like lectins/glucanases superfamily exhibited 40 disordered residues in both Calreticulin (UniProt ID: P27797) and E3 ubiquitin-protein ligase TRIM36 (UniProt ID: Q9NQ86), suggesting a role in mediating dynamic interactions during immune-related functions or protein ubiquitination pathways in acrosomal environments.
In our dataset, several acrosomal proteins exhibited multiple SCOP domains within disordered regions, indicating complex structural architectures. For example, Calreticulin (UniProt ID: P27797) contained both a C-type lectin domain and a collagen-like domain within disordered regions, facilitating its role in pathogen recognition and immune response. This structural configuration likely enhances the ability of Calreticulin to engage in multiple interactions simultaneously, reflecting the importance of intrinsic disorder in multifunctional proteins. Similarly, E3 ubiquitin-protein ligase TRIM36 (UniProt ID: Q9NQ86) comprised a serine protease domain and a C-type lectin domain within disordered regions, contributing to its function in the lectin pathway of the complement system. These multi-domain configurations suggest that proteins with multiple SCOP domains may exhibit complex functional roles, potentially influenced by the presence of intrinsic disorder within these domains. Such findings reinforce the idea that intrinsic disorder enhances versatility and adaptability in proteins that interact with diverse molecular partners.
Several superfamilies showed substantial disorder within individual proteins, highlighting the occurrence of intrinsic disorder within specific functional domains. For instance, the P-domain of calnexin/calreticulin superfamily had a high magnitude of disorder, with 70 disordered residues identified in the protein Calreticulin (UniProt ID: P27797). This domain’s disorder is likely crucial for its role in calcium binding and molecular chaperoning. Additionally, the Fibronectin type III domain, commonly involved in cell adhesion and interaction, exhibited 23 disordered residues and was present in protein Fibronectin type-III domain-containing protein 3A (UniProt ID: Q9Y2H6). This finding suggests that disorder in this domain may enhance flexibility and promote interactions with a range of cellular partners, further emphasizing the link between intrinsic disorder and functional plasticity.
The Ankyrin repeat superfamily also showed disorder, with 23 disordered residues found in the protein Neurogenic locus notch homolog protein 1 (UniProt ID: P46531), known for its role in protein–protein interactions. Other frequently disordered superfamilies included the ARM repeat (17 residues in Maestro heat-like repeat-containing protein family member 2B (UniProt ID: Q7Z745)) and the Sec1/munc18-like (SM) proteins (17 residues in Syntaxin-binding protein 1 (UniProt ID P61764)), both of which contribute to functional versatility in protein complexes. Collectively, these findings highlight how intrinsic disorder in SCOP domains supports functional diversity, particularly in proteins central to acrosomal processes.
We further analyzed this subset of 48 proteins with SCOP domains within disordered regions by overlaying the data on the CH/CDF plot, as shown in Figure 9. This overlay provides a visual representation of the relationship between disorder predictions by Charge-Hydropathy and Cumulative Distribution Function metrics and the presence of disordered residues within SCOP domains. Of the 48 proteins with SCOP domains within the disordered region, 35 were in Q1 (ordered by both), 9 were in Q2 (CH ordered but CDF disordered), 3 were in Q3 (disordered by both), and 1 was in Q4 (CH disordered but CDF ordered).

3.4. Gene Ontology Enrichment Analysis of Acrosomal Proteins

Gene Ontology (GO) enrichment analysis was performed using g:GOSt to understand the functional roles of disordered regions overlapping SCOP domains in acrosomal proteins (Table 3). The analysis provided insights into molecular functions, biological processes, and cellular components enriched among these proteins (Figure 10). Notably, the term ‘acrosomal vesicle’ (GO:0001669, p_adj = 5.395 × 10⁻³⁸) was the most significant, highlighting the functional relevance of intrinsic disorder in reproductive and cellular processes. The ancestor chart for ‘acrosomal vesicle’ (GO:0001669) is shown in Figure 11, as obtained from the European Bioinformatics Institute (https://www.ebi.ac.uk/QuickGO/ (accessed 27 October 2024)).
For molecular function, significant enrichment was observed for enzyme binding (GO:0019899), indicating that intrinsic disorder in these proteins may facilitate dynamic interactions with enzymatic partners, potentially critical for acrosomal activity. SNARE binding (GO:0000149) was also highly enriched, suggesting that these proteins play an integral role in membrane fusion events essential for the acrosomal reaction during fertilization. Additional enriched functions included cation binding and phospholipase activity, reflecting the importance of these proteins in calcium-mediated signaling and membrane remodeling within the acrosome.
Within the domain of biological processes, the most significant enrichment was observed for developmental processes involved in reproduction (GO:0003006), consistent with the essential role of these proteins in fertilization. Proteins with disordered regions were also enriched for acrosomal vesicle (GO:0098589), further emphasizing their association with the acrosome. Terms such as regulation of synaptic vesicle priming and cell projection organization were also enriched, highlighting their involvement in vesicle trafficking and cytoskeletal dynamics, both of which are processes critical for sperm functionality.
For cellular components, proteins were primarily localized to the acrosomal vesicle and the plasma membrane region, consistent with their roles in acrosomal membrane dynamics. Enrichment for the zonula adherens component suggested potential roles in adhesion, showing the interaction between the acrosome and the oocyte during fertilization.
These findings underscore the importance of intrinsic disorder in mediating the functional versatility of acrosomal proteins. The enrichment of terms associated with dynamic molecular interactions, membrane remodeling, and vesicular trafficking aligns with the known physiological roles of the acrosome in fertilization. The association of disordered regions with these processes suggests that intrinsic disorder contributes to the structural flexibility required for the complex interactions and functions of acrosomal proteins.

3.5. Interactivity Analysis of Human Acrosomal Proteins

A key feature of intrinsically disordered proteins is their high level of binding promiscuity and ability to interact with many partners through different binding modes [11]. The intractability of the human acrosomal proteins was investigated using Search Tool for the Retrieval of Interacting Genes (STRING). Figure 12 shows the PPI network for 245 of the human acrosomal proteins that were in the STRING database. No data were available in STRING for five of the proteins in our dataset: putative spermatogenesis-associated protein 31C1 (UniProt ID: P0DKV0), putative spermatogenesis-associated protein 31C2 (UniProt ID: B4DYI2), IQ domain-containing protein N (UniProt ID: Q9H0B3), spermatogenesis-associated protein 31D4 (UniProt ID: Q6ZUB0), and putative protein SPATA31F2P (UniProt ID: Q63HN1).
To capture the most interactions, a low confidence interval (0.15) was first utilized. The resulting PPI network showed that all 245 acrosomal proteins were interacting with each other, with the exception of one protein: protein SPATA31F3 (UniProt ID: A6NFA0). This PPI network comprised 245 nodes (human acrosomal proteins) connected by 2821 edges (PPIs). The average node degree (number of edges per node) was 23 and its average local clustering coefficient (which is a number between 0 and 1 that represents how complete the neighborhoods of interactions are, 0 meaning proteins in the neighborhood only connect to the same protein, and 1 meaning every protein in the neighborhood is interacting with each other) of 0.356. For a random group of proteins of the same size, the expected number of edges would be 942 at this confidence interval, which is significantly lower than our PPI network’s 2821, indicating at the low confidence level, this PPI network has roughly 3 times as many interactions as expected (PPI enrichment p-value of <10−16). This shows that most of the human acrosomal proteins are interacting with each other.
When the STRING confidence score was increased to a moderate threshold (0.4), the number of interacting nodes decreased from 244 to 209 (Figure 13A). The 209 nodes are connected by 581 edges, with an average node degree of 4.74, an average local clustering coefficient 0.423, an expected number of edges of 113, and a PPI enrichment p-value of <10−16. Increasing the confidence interval to high level (0.7) caused the PPI network to break down into small clusters of interacting proteins and many non-interacting proteins (Figure 13B).
To explore the global interactivity of human acrosomal proteins, we incorporated their first shell of interactors—proteins that directly interact with our proteins of interest. These direct interactors are visualized in Figure 14. The PPI network was set to the highest confidence interval (0.9) and 500 interactants (the maximum value STRING allows) resulted in 745 nodes connected by 2523 edges, that is characterized by an average node degree of 6.77, and an average local clustering coefficient of 0.602. The expected number of edges for a PPI network of this size is 872, which indicates that the human acrosomal protein PPI network has significantly more interactions then expected with a PPI enrichment p-value of < 10−16.

3.6. Liquid–Liquid Phase Separation Propensity in Human Acrosomal Proteins

We used computational models to predict the liquid–liquid phase separation (LLPS) propensity for the 250 proteins in our acrosomal dataset. To quantify and understand LLPS trends, descriptive statistics for the LLPS propensity scores (pLLPS) are summarized in Table 4. The mean pLLPS across all proteins was 0.4511, with a median of 0.2864 and a standard deviation of 0.3334. The scores ranged from 0.0922 to 1, with the lower quartile (Q1) at 0.167 and the upper quartile (Q3) at 0.7946. These findings indicate substantial variability in LLPS propensity among the acrosomal proteins.
To explore the relationship between intrinsic disorder and LLPS propensity, we conducted a regression analysis using the average PPIDR score from three disorder models (PONDR® VXLT, PONDR® VL3, and PONDR® VSL2B) as the independent variable and pLLPS as the dependent variable. A Box–Cox transformation of the pLLPS scores (λ ≈ 0.0606) was applied to stabilize variance and improve linearity.
The subsequent second-degree polynomial regression model revealed a significant positive relationship between intrinsic disorder and LLPS propensity, with a p-value < 0.0001 and an R-squared value of 0.649 (Figure 15). Diagnostic plots indicated that the model assumptions were satisfied, supporting the robustness of this relationship.
The observed link between intrinsic disorder and LLPS propensity aligns with the known role of intrinsically disordered regions (IDRs) in facilitating phase separation. IDRs, characterized by their lack of a fixed tertiary structure, enable dynamic interactions essential for condensate formation, a key feature of phase-separated systems. In the context of the acrosome, proteins with a high LLPS propensity likely contribute to the dynamic remodeling of the acrosomal vesicle and the assembly of molecular condensates critical for fertilization. For instance, proteins with high pLLPS scores may facilitate rapid assembly and disassembly of protein complexes during the acrosome reaction, enabling precise spatiotemporal control over enzymatic and structural functions.
The wide range of pLLPS scores suggests functional diversity within the acrosomal proteome. Proteins with low PLLPS may perform structural roles, providing stability to the acrosomal matrix, while those with high pLLPS likely play dynamic roles in processes such as vesicle fusion, membrane remodeling, and molecular recognition. This variability underscores the dual requirements for structural integrity and functional adaptability within the acrosome.
The polynomial nature of the relationship between PPIDR and pLLPS highlights a nuanced interplay between intrinsic disorder and LLPS propensity. While higher levels of disorder generally enhance phase separation propensity, the saturation observed in the quadratic relationship suggests a threshold effect, where excessively disordered proteins may lack sufficient interaction interfaces for effective condensate formation. This balance may reflect evolutionary pressures to optimize disorder for LLPS functionality while maintaining the ability to form specific, transient interactions.
These findings provide a foundation for further investigation into the role of LLPS in acrosomal biology. Future studies could focus on experimentally validating the LLPS behavior of high-pLLPS proteins using in vitro or in vivo approaches, such as fluorescence recovery after photobleaching (FRAP) or phase separation assays. Additionally, exploring the involvement of post-translational modifications (PTMs) in modulating LLPS propensity could provide insights into the regulation of acrosomal protein dynamics during fertilization.

3.7. Impact of Disorder and Liquid–Liquid Phase Separation in the Five Most Disordered Acrosomal Proteins

3.7.1. Myristoylated Alanine-Rich C-Kinase Substrate Protein (UniProt ID: P29966)

The myristoylated alanine-rich C-kinase substrate (MARCKS) protein was identified as the most disordered among acrosomal proteins, with a percentage of predicted disordered residues of 97.0% ± 1.30% and an average disorder score of 0.890 ± 0.035. Figure 16 presents the complete computational analysis of the MARCKS protein. This protein is 332 amino acids long, and most of the residues show signs of disorder (Figure 16A). MARCKS has 1 Pfam conserved domain spanning from residue 2 to 329 corresponding to the MARCKS family. This protein also has multiple posttranslational modification sites spanning across the protein with most of them being in the first 176 residues and as well as three molecular recognition features associated with ANCHOR binding regions (Figure 16B). The PPI network (Figure 16C) generated by STRING for MARCKS at the medium confidence interval of 0.4 shows that the network comprises 81 nodes and 859 edges. The network has an average node degree of 21.2 interactions per node and the local clustering coefficient of 0.703. The expected number of edges is 393 and has a PPI enrichment p-value was highly significant (p < 10−16).
MARCKS has an important regulatory role in acrosomal exocytosis, mainly by modulating calcium mobilization and interacting with phosphatidylinositol 4,5-bisphosphate (PIP2) [155]. With a liquid–liquid phase separation (LLPS) propensity of 0.9995 (as per the FuzDrop analysis results), MARCKS likely contributes to the formation of dynamic, membrane-less compartments, which allow for spatial and temporal coordination of signaling molecules, which are essential for exocytosis. This high LLPS propensity and intrinsic disorder likelihood enables it to act as a molecular scaffold, holding PIP2 in concentrated, reversible phases that can be quickly disbanded upon phosphorylation by protein kinase C (PKC). The flexibility that the intrinsic disorder provides the MARCKS’ protein likely enhances its ability to form and dissolve these LLPS-driven condensates as needed, enabling the fast release of PIP2 for downstream signaling during acrosomal exocytosis [155,156]. This idea is in line with the observations that LLPS can be related to the orchestration of the kinase signaling representing a controllable mechanism of dynamic compartmentalization of kinases and their substrates and co-factors, as well as a potential means of the isolation of kinases from their inhibitors [157].
Multifunctionality and binding promiscuity of MARCKS show that this protein serves as an illustration of the aforementioned structure–function continuum model, where a protein exists as a dynamic conformational ensemble containing multiple proteoforms characterized by a broad spectrum of structural features and possessing various functional potentials. These proteoforms originate due to the presence of a large number of different PTMs, as well as because of the highly disordered status of this protein and structural changes induced by its functionality. Looking at the MARCKS from the perspective of a proteoform-based structure–function continuum helps better understanding of structure, regulation, and functionality of this important protein.
MARCKS’ disorder can be particularly important in its effector domain (ED), which interacts with PKC and calmodulin. When phosphorylated by PKC, MARCKS dissociates from the membrane, releasing PIP2 for phospholipase C (PLC) activity, which converts it to inositol trisphosphate (IP3) and diacylglycerol (DAG). This reaction is needed for initiating the release of calcium, a primary driver of membrane fusion in the acrosome reaction [158]. The high LLPS propensity of MARCKS supports a transient, phase-separated environment in which calcium-sensitive signaling molecules can be temporarily contained and released in a coordinated fashion. This shows its role as a highly dynamic regulatory scaffold within the sperm head. The ability of MARCKS to undergo phase separation could potentially streamline the availability of PIP2 and calcium within these condensates, which would emphasize the importance of structural reorganization to the functional requirements of the acrosomal exocytosis process [155].
Moreover, MARCKS’ ability to form LLPS-driven condensates can potentially be important for its interaction with actin, by cross-linking and stabilizing the cytoskeleton to prevent premature exocytosis under resting conditions. Upon PKC activation, MARCKS can change from a stabilizing role to a facilitative one, freeing calcium and reorganizing the actin filaments for exocytotic membrane fusion. This interaction between intrinsic disorder and LLPS likely enhances MARCKS’ function as a reversible regulator, with phosphorylation dictating its role and allowing specific control over acrosomal readiness and exocytotic response in fertilization [155].

3.7.2. Nuclear Transition Protein 2 (UniProt ID: Q05952)

Figure 17 presents the computational results of nuclear transition protein 2 or TNP2, which has a percentage of predicted disorder residues of 92.5% ± 3.22% and an average disorder score of 0.864 ± 0.036. This protein is 138 amino acids long all of which show signs of disorder (Figure 17A). TNP2 has 1 Pfam conserved domain spanning all 138 amino acids corresponding to the PF01254.13 family. This protein also has 1 posttranslational modification site at locus 17 and has 4 molecular recognition features associated with ANCHOR binding regions (Figure 17B). The PPI network (Figure 17C) generated by STRING for CEP131 at the medium confidence interval of 0.4 shows that the network comprises 46 nodes and 211 edges. The network has an average node degree of 9.17 interactions per node and the local clustering coefficient of 0.753. The expected number of edges is 51 and has a PPI enrichment p-value of less than 1.0 × 10−16.
TNP2 is an important protein involved in spermiogenesis. Its key function is facilitating chromatin remodeling through aiding in the removal of histones and taking their place within the chromatin before being replaced by protamines [159,160]. Studies have also shown that they are involved with DNA repair by preventing DNA breaks in late spermiogenesis [160]. Its ability to interact with diverse types of biologic molecules is likely facilitated by its high intrinsic disorder. The high probability of spontaneous liquid–liquid phase separation of 0.994 and the presence of two droplet-promoting regions (DPRs, residues 1–98 and 128–138) predicted by FuzDrop may be related to TNP2’s role of replacing histones, as histones have been found to have parts of their formation process linked to their LLPS [161]. Analysis of TNP2 localization patterns within the rat spermatid nucleus revealed that TNP2 is preferentially localized to the GC-rich DNA sequences, forming specific foci, some of which also contain TNP1 [162].
Clearly, this protein exists as a complex and highly dynamic conformational ensemble, in which one can find basic (or intrinsic, or conformational) proteoforms generated by the presence of intrinsic disorder, inducible (or modified) proteoforms generated by PTMs, and functioning proteoforms originating due to the structural alterations induced in a protein by interaction with its partners.

3.7.3. Centrosomal Protein of 131 kDa (UniProt ID: Q9UPN4)

Figure 18 presents the results of centrosomal protein of 131 kDA (CEP131) which has a percentage of predicted disorder residues of 82.0% ± 7.35% and an average disorder score of 0.702 ± 0.043. This protein is 1083 amino acids long and has large areas of disorder spanning from N terminus to the C terminus (Figure 18A). CEP131 has 4 Pfam conserved domains spanning from Amino 3 to 104, 231 to 424, 264 to 553, and 526 to 719, respectively. This protein also has multiple posttranslational modification sites spanning across the protein with most of them being in the first 250 residues and has 20 molecular recognition features associated with ANCHOR binding regions (Figure 18B). It was pointed out that this protein has 3 isoforms produced by alternative splicing [163]. Therefore, in addition to the conformational and PTM-induced proteoforms, CEP131 has another type of inducible (or modified) proteoform, which is generated by alternative splicing. The PPI network (Figure 18C) generated by STRING for CEP131 at the medium confidence interval of 0.4 shows that the network comprises 101 nodes and 1392 edges. The network has an average node degree of 27.6 interactions per node and the local clustering coefficient of 0.735. The expected number of edges is 144 and has a PPI enrichment p-value of less than 10−16.
Centrosomal protein of 131 kDA (CEP131) is a protein that has a wide range of functions which may be linked to its high intrinsic disorder. CEP131 plays a key role in flagella and cilia formation as sperm cells lacking the proteins have malformed flagella [164,165]. Its ability to form flagella may be linked to interactions or a shared pathway CEP70 [165]. It has also been shown to interact with microtubule-based transport proteins such as dynein and could play a role in transport of material transport [166]. Its interactions with these microtubule proteins may also play a role in the formation of the acrosome as studies have shown mice lacking CEP131 have malformed sperm heads and fragmented acrosomes [164,167].
CEP131 has a high propensity for liquid–liquid phase separation (LLPS) with a probability of spontaneous liquid–liquid phase separation of 0.9989, suggesting that its significant intrinsic disorder and ability to form dynamic biomolecular condensates may facilitate its roles in flagella and acrosome formation, microtubule-based transport, and centrosome duplication. It was pointed out that in mice, male infertility is linked to the CEP131 loss that triggers defects in the manchette and flagella, indicating a possible link between CEP131 and the organization of cellular structures relying on LLPS [164]. Furthermore, another centrosomal protein, CEP112, was shown to undergo LLPS and form biomolecular condensates that recruit essential proteins and mRNAs [168].
Outside of its structural role in sperm, CEP131 plays a role in centrosome duplication and is up regulated in many cancers [169,170]. CEP131 is phosphorylated PIK4 with many of its phosphorylation sites located in its disordered regions. Once phosphorylated CEP 131 can recruit STIL which is a key protein for centrosome duplication [169]. CEP131 has also been found to interact with other regulatory genes such as ARID3A and NPM [170,171].

3.7.4. Cylicin-1 (UniProt ID: P35663)

Figure 19 presents the computational analysis of Cylicin-1, or CYLC1, showing a predicted percentage of disordered residues at 81.1% ± 4.14% and an average disorder score of 0.745 ± 0.042. CYLC1 is a 651-amino-acid-long protein with most disorder localized toward the C-terminus starting at residue 268, while the N-terminal region shows smaller disordered segments (Figure 19A). This protein contains two Pfam conserved domains spanning amino acids 7–116 and 241–464, respectively. Multiple post-translational modification sites are clustered between residues 250 and 600, alongside 12 molecular recognition features (MoRFs) associated with ANCHOR-predicted binding regions (Figure 19B). The protein–protein interaction (PPI) network generated by STRING at a medium confidence interval (0.4) reveals 34 nodes and 75 edges, with an average node degree of 4.41 interactions per node, a local clustering coefficient of 0.877, and a statistically significant PPI enrichment (p-value = 4.84 × 10⁻10) (Figure 19C).
The high intrinsic disorder observed in CYLC1 likely facilitates its functional versatility, particularly in its role within the perinuclear theca (PT), a cytoskeletal element important for maintaining the acrosome’s attachment to the nucleus during spermiogenesis. CYLC1 has been shown to interact with itself and other PT proteins, such as ACTRT1, ACTL7A, and SPACA1, forming a structural ‘sandwich’ that anchors the acrosome to the nuclear envelope [172]. The disordered regions in CYLC1, particularly in the C-terminal domain, may enhance its capacity for multivalent interactions, allowing dynamic assembly and disassembly of protein complexes. This flexibility likely contributes to the stability of the acrosome–nucleus connection, as evidenced by the severe acrosome detachment and male subfertility observed in CYLC1 knockout mice [172].
Moreover, the clustering of MoRFs and post-translational modification (PTM) sites within the disordered regions of CYLC1 suggests a regulatory role in binding and interaction dynamics. These features may allow CYLC1 to act as a molecular scaffold, mediating the assembly of protein complexes that stabilize the acrosome’s structural integrity. PTMs such as phosphorylation could modulate the disorder-to-order transitions in these regions, dynamically regulating the protein’s interaction with binding partners during acrosomal maturation and exocytosis [117,172].
CYLC1’s intrinsic disorder might also influence its role in phase separation. The probability of spontaneous liquid–liquid phase separation of CYLC1 was 0.9965, which is incredibly high. The protein is also predicted by FuzDrop to contain 4 DPRs, residues 39–53, 101–245, 267–621, and 629–643. The enrichment of disordered and low-complexity regions makes CYLC1 a candidate for contributing to membraneless compartmentalization within the PT. Such phase-separated domains could act as reservoirs for critical proteins and molecular factors required for acrosomal function and fertilization, facilitating spatial and temporal coordination of these processes [118,172].
The STRING analysis further supports CYLC1’s role as a hub in the acrosomal interactome, with significant PPI enrichment and a high clustering coefficient indicating a tightly interconnected network of interactions. The observed disorder and network centrality together show CYLC1’s importance in ensuring the structural and functional integrity of the acrosome, with potential implications for male fertility in humans and animal models [172].
Similar to MARCKS and TNP2, multifunctionality and binding promiscuity of CYLC1 are determined by its highly disordered nature and the presence of numerous PTMs, generating basic and inducible proteoforms, respectively.

3.7.5. Coiled-Coil Domain-Containing Protein 136 (CCDC136) (UniProt ID: Q96JN2)

Figure 20 presents the computational analysis of CCDC136, showing a predicted percentage of disordered residues at 72.9% ± 10.02% and an average disorder score of 0.639 ± 0.062. CCDC136 is a 1154-amino-acid-long protein with disordered regions distributed throughout its sequence (Figure 20A). The protein contains two Pfam conserved domains spanning residues 391–670 and 658–873, respectively, and features 22 molecular recognition features (MoRFs) predicted by ANCHOR. In addition, CCDC136 has multiple post-translational modification (PTM) sites scattered across its sequence, further highlighting its potential regulatory complexity (Figure 20B). Furthermore, complexity of this protein is further increased by the presence of four isoforms generated by alternative splicing, where the isoform #2 is different from the canonical form by changing the 922–957 region from IKELQTKLRELQLQYQASMDEQGRLLVVQEQLEGQL to NMFGLWKPMVFLAIAAVA LYVLPNMRQQESEFCLME and missing the 958–1154 region [163], the isoform #3 has an MEAGAGAGAGAAGWSCPGPGPTVTTLGSYEASEGCERKKGQRWGSLERRGM extension at N-terminus and is missing regions 212–223 and 364–1121 [163], and the isoform #4, being associated with gastric cancer, is characterized by changes in the 1122–1154 region from SSPTPNPPIFSLPLVGLVVISALLWCWWAETSS to NMFGLWKPMVFLAIAAVALYVLPNMRQQESEFCLME [173]. The protein–protein interaction (PPI) network generated by STRING at a medium confidence interval of 0.4 (Figure 20C) shows that CCDC136 interacts with 41 nodes via 86 edges, with an average node degree of 4.2 interactions per node, a local clustering coefficient of 0.735, and a statistically significant PPI enrichment (p-value < 1.0 × 10⁻¹⁶). Its predicted liquid–liquid phase separation propensity (pLLPS) is high, at 0.9894, suggesting a strong potential for forming biomolecular condensates.
The intrinsic disorder of CCDC136 likely plays a key role in mediating its function during acrosome biogenesis and fertilization. Studies indicate that CCDC136 localizes to the acrosome during spermatogenesis, where it is critical for the correct assembly of this specialized organelle [174]. The protein’s disordered regions may provide the flexibility required for dynamic interactions with other acrosome-related proteins, such as SPACA1 and PICK1, which are involved in the early stages of acrosome formation and are significantly downregulated in CCDC136 knockout mice [174]. The distribution of MoRFs across CCDC136 supports its role as a scaffold that mediates transient, multivalent interactions essential for acrosomal maturation.
CCDC136’s high pLLPS score (0.9894) and the presence of multiple DPRs (residues 1–55, 70–99, 149–170, 195–227, 240–288, 303–320, 518–531, 789–800, 832–845, 908–928, and 967–1143) further suggests its involvement in forming liquid-like condensates within the acrosome. These condensates may act as compartments for concentrating acrosome-specific proteins, ensuring their correct localization and function during spermiogenesis. The ability to undergo phase separation could also help stabilize the structural integrity of the acrosome and facilitate its rapid reorganization during the acrosome reaction, a process critical for successful fertilization [118]. In CCDC136 knockout mice, the disruption of acrosome morphology and the associated male infertility underscore the importance of this protein’s role in both structural and functional aspects of spermatogenesis [174].
Moreover, the presence of PTMs within disordered regions of CCDC136 provides an additional regulatory layer, potentially controlling the protein’s interactions and phase-separation behavior. PTMs such as phosphorylation may modulate CCDC136’s role in acrosome formation by dynamically altering its binding affinity to partner proteins or its ability to assemble into condensates. This aligns with the broader understanding of disordered proteins as dynamic regulators in cellular processes, where disorder and phase separation allow for reversible, context-dependent assembly of protein complexes [117].
The STRING network further illustrates CCDC136’s centrality in the acrosomal protein interaction network, with its significant enrichment of interactions indicating a pivotal role in coordinating processes necessary for spermatogenesis and fertilization. The interplay between CCDC136’s intrinsic disorder, conserved domains, and its ability to form condensates highlights its dual role as both a structural and regulatory component of the acrosome. Again, the proteoform-based structure–function continuum model provides mechanistic explanation of the multifunctionality of CCDC136 and its ability to interact with multiple partners and control their functionality.

3.7.6. Coordinated Roles of Highly Disordered Acrosomal Proteins in Fertility

Summarizing, our analyses of the most disordered acrosomal proteins provide important clues on the links between their functionality, intrinsic disorder propensity, and LLPS potential.
MARCKS, with its important regulatory role in acrosomal exocytosis—the central event of the acrosome reaction—is a highly disordered protein (PPIDR = 97.0% ± 1.30%; ADS = 0.890 ± 0.035) and exhibits an exceptionally high LLPS propensity (pLLPS = 0.9995). TNP2 (PPIDR = 92.5% ± 3.22%; ADS = 0.864 ± 0.036; pLLPS = 0.9440) plays a key role in sperm chromatin remodeling by facilitating the removal of histones and their replacement within the chromatin, prior to being displaced by very alkaline protamines. This process, known as sperm chromatin protamination, is fundamental to sperm maturation and leads to the tightly compacted toroidal chromatin architecture of the sperm head, which is essential for fertilization. Centrosomal protein CEP131 (PPIDR = 82.0% ± 7.35%; ADS = 0.702 ± 0.053; pLLPS = 0.9989) is critical for flagella formation and thus, sperm motility. Importantly, both acrosome reaction and motility activation are coordinated responses triggered during zona pellucida (ZP) penetration and are regulated by progesterone and FSH produced by the egg and surrounding cumulus cells. CYLC1 (PPIDR = 81.1% ± 4.14%; ADS = 0.745 ± 0.042; pLLPS = 0.9965), which functions in anchoring the acrosome to the sperm nucleus, also serves as a highly connected hub within the acrosome interactome of intrinsically disordered proteins.
These observations highlight how highly disordered acrosomal proteins, beyond their direct roles in acrosomal exocytosis and LLPS, contribute to core processes essential to sperm fertility—including chromatin remodeling and flagella formation. Collectively, they underscore the multifunctionality conferred by intrinsic disorder, which enables these proteins to coordinate diverse molecular events that are fundamental to human fertility and spermiogenesis.

4. Conclusions

Our study investigated the role of intrinsic disorder and liquid–liquid phase separation (LLPS) in the human acrosomal proteome through bioinformatics and computational analysis. This analysis shows that approximately 81% of the 250 human acrosomal proteins in our study had signs of moderate to high levels of disorder, and the flexibility of these disordered regions may aid their roles in fertilization. These disordered and LLPS-prone regions likely enhance acrosomal protein function by enabling rapid, multivalent interactions and dynamic molecular assembly, which are critical for processes like vesicle fusion, enzymatic release, and sperm-egg membrane interactions during fertilization. This is further supported by the fact that the most common SCOP superfamilies found among the disordered regions were associated with structural flexibility and protein–protein interactions, which are hallmarks of IDPs and IDRs. The STRING analysis revealed a prominent level of interaction between the acrosomal proteins and other proteins showing their high levels of promiscuity. When looking into the LLPS propensity, 30% of the acrosomal proteins were high-probability LLPS drivers. In the context of the acrosome, the proteins with high LLPS propensity contribute to the acrosomal vesicle and the assembly of molecular condensates critical for fertilization.
While our findings highlight the significance of intrinsic disorder and LLPS in the acrosomal proteome, it is also important to recognize that ordered proteins play essential roles in fertilization as well, and a comprehensive understanding of acrosomal function should consider both structured and disordered components.
Results of our study add a new perspective to the functionality and formation of the human acrosome through the identification of IDPs and IDRs, and LLPS within its proteome. These proteins with high intrinsic disorder and LLPS propensity, such as myristoylated alanine rich C-kinase substrate (MARCKS; UniProt ID: P29966) and nuclear transition protein 2 (TNP2; UniProt ID: Q05952) could serve as potential targets for understanding acrosome deformities and infertility. Given growing global concerns about declining male fertility [4], understanding the roles of intrinsically disordered and LLPS-prone proteins in acrosomal structure and function may not only advance basic reproductive biology but also hold applied value for developing diagnostic or predictive tools in assisted reproduction.

Author Contributions

Conceptualization, V.N.U.; methodology, S.S., S.S.L., and V.N.U.; software, S.S. and S.S.L.; validation, S.S., S.S.L., and V.N.U.; formal analysis, S.S. and S.S.L.; investigation, S.S. and S.S.L.; data curation, S.S., S.S.L., and V.N.U.; writing—original draft preparation, S.S. and S.S.L.; writing—review and editing, S.S., S.S.L., and V.N.U.; visualization, S.S. and S.S.L.; supervision, V.N.U. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within this article.

Acknowledgments

We thank Mitul Shukla for his assistance with data aggregation, technical support, and providing access to his computing resources.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of this study; in the collection, analyses, or interpretation of data; in the writing of this manuscript; or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:
ADSAverage Disorder Score
ANNArtificial Neural Network
ATPAdenosine Triphosphate
CDFCumulative Distribution Function
CEPCentrosomal Protein
CHCharge-Hydropathy
Cryo-EMCryo-Electron Microscopy
D2P2Database of Disordered Protein Predictions
DAGDiacylglycerol
DPRDroplet Promoting Regions
FDRFalse Discovery Rate
FRAPFluorescence Recovery After Photobleaching
GOGene Ontology
IAMInner Acrosomal Membrane
IDPIntrinsically Disordered Protein
IDRIntrinsically Disordered Region
IP3Inositol Trisphosphate
KEGGKyoto Encyclopedia of Genes and Genomes
LLPSLiquid–Liquid Phase Separation
MARCKSMyristoylated Alanine-Rich C-Kinase Substrate
MoRFMolecular Recognition Feature
NMRNuclear Magnetic Resonance
OAMOuter Acrosomal Membrane
PIP2Phosphatidylinositol 4,5-Bisphosphate
PKCProtein Kinase C
PLCPhospholipase C
pLDDTPredicted Local Distance Difference Test
pLLPSProbability of Liquid–Liquid Phase Separation
PONDRPredictor of Natural Disordered Regions
PPIProtein–Protein Interaction
PPIDRPercentage of Predicted Intrinsically Disordered Residues
PTMPost-Translational Modification
Q–QQuantile−Quantile
RIDAORapid Intrinsic Disorder Analysis Online
SBINDSequence-Based Interaction Mode Divergence
SCOPStructural Classification of Proteins
SNARESoluble N-ethylmaleimide-Sensitive Factor Attachment Protein Receptor
SVMSupport Vector Machine
TNP2Nuclear Transition Protein 2
ZPZona Pellucida

References

  1. Ito, C.; Toshimori, K. Acrosome markers of human sperm. Anat. Sci. Int. 2016, 91, 128–142. [Google Scholar] [CrossRef] [PubMed]
  2. Abou-Haila, A.; Tulsiani, D.R.P. Mammalian Sperm Acrosome: Formation, Contents, and Function. Arch. Biochem. Biophys. 2000, 379, 173–182. [Google Scholar] [CrossRef] [PubMed]
  3. Florman, H.M.; Jungnickel, M.K.; Sutton, K.A. Regulating the acrosome reaction. Int. J. Dev. Biol. 2008, 52, 503–510. [Google Scholar] [CrossRef] [PubMed]
  4. Erenpreisa, J.; Vainshelbaum, N.M.; Lazovska, M.; Karklins, R.; Salmina, K.; Zayakin, P.; Rumnieks, F.; Inashkina, I.; Pjanova, D.; Erenpreiss, J. The Price of Human Evolution: Cancer-Testis Antigens, the Decline in Male Fertility and the Increase in Cancer. Int. J. Mol. Sci. 2023, 24, 11660. [Google Scholar] [CrossRef]
  5. Price, K.L.; Presler, M.; Uyehara, C.M.; Shakes, D.C. The intrinsically disordered protein SPE-18 promotes localized assembly of MSP in Caenorhabditis elegans spermatocytes. Development 2021, 148, dev195875. [Google Scholar] [CrossRef]
  6. Wright, P.E.; Dyson, H.J. Intrinsically unstructured proteins: Re−assessing the protein structure-function paradigm. J. Mol. Biol. 1999, 293, 321–331. [Google Scholar] [CrossRef]
  7. Uversky, V.N. Natively unfolded proteins: A point where biology waits for physics. Protein Sci. 2002, 11, 739–756. [Google Scholar] [CrossRef]
  8. Dunker, A.K.; Lawson, J.D.; Brown, C.J.; Williams, R.M.; Romero, P.; Oh, J.S.; Oldfield, C.J.; Campen, A.M.; Ratliff, C.M.; Hipps, K.W.; et al. Intrinsically disordered protein. J. Mol. Graph. Model. 2001, 19, 26–59. [Google Scholar] [CrossRef]
  9. Dunker, A.K.; Obradovic, Z. The protein trinity—Linking function and disorder. Nat. Biotechnol. 2001, 19, 805–806. [Google Scholar] [CrossRef]
  10. Tompa, P. Intrinsically unstructured proteins. Trends Biochem. Sci. 2002, 27, 527–533. [Google Scholar] [CrossRef]
  11. Uversky, V.N.; Gillespie, J.R.; Fink, A.L. Why are “natively unfolded” proteins unstructured under physiologic conditions? Proteins Struct. Funct. Bioinform. 2000, 41, 415–427. [Google Scholar] [CrossRef]
  12. Dyson, H.J.; Wright, P.E. Intrinsically unstructured proteins and their functions. Nat. Rev. Mol. Cell Biol. 2005, 6, 197–208. [Google Scholar] [CrossRef]
  13. Uversky, V.N.; Dunker, A.K. Understanding Protein Non-Folding. Biochim. Biophys. Acta 2010, 1804, 1231–1264. [Google Scholar] [CrossRef]
  14. Dunker, A.K.; Silman, I.; Uversky, V.N.; Sussman, J.L. Function and structure of inherently disordered proteins. Curr. Opin. Struct. Biol. 2008, 18, 756–764. [Google Scholar] [CrossRef]
  15. Uversky, V.N. Unusual biophysics of intrinsically disordered proteins. Biochim. Biophys. Acta 2013, 1834, 932–951. [Google Scholar] [CrossRef]
  16. Uversky, V.N. Functional roles of transiently and intrinsically disordered regions within proteins. FEBS J. 2015, 282, 1182–1189. [Google Scholar] [CrossRef]
  17. Maity, H.; Maity, M.; Englander, S.W. How cytochrome c folds, and why: Submolecular foldon units and their stepwise sequential stabilization. J. Mol. Biol. 2004, 343, 223–233. [Google Scholar] [CrossRef]
  18. Maity, H.; Maity, M.; Krishna, M.M.; Mayne, L.; Englander, S.W. Protein folding: The stepwise assembly of foldon units. Proc. Natl. Acad. Sci. USA 2005, 102, 4741–4746. [Google Scholar] [CrossRef]
  19. Maity, H.; Rumbley, J.N.; Englander, S.W. Functional role of a protein foldon--an Omega-loop foldon controls the alkaline transition in ferricytochrome c. Proteins 2006, 63, 349–355. [Google Scholar] [CrossRef]
  20. Gupta, M.N.; Uversky, V.N. Protein structure-function continuum model: Emerging nexuses between specificity, evolution, and structure. Protein Sci. 2024, 33, e4968. [Google Scholar] [CrossRef]
  21. Uversky, V.N. Protein intrinsic disorder and structure-function continuum. Prog. Mol. Biol. Transl. Sci. 2019, 166, 1–17. [Google Scholar] [CrossRef]
  22. Wright, P.E.; Dyson, H.J. Intrinsically disordered proteins in cellular signalling and regulation. Nat. Rev. Mol. Cell Biol. 2015, 16, 18–29. [Google Scholar] [CrossRef]
  23. Dunker, A.K.; Brown, C.J.; Obradovic, Z. Identification and functions of usefully disordered proteins. Adv. Protein Chem. 2002, 62, 25–49. [Google Scholar]
  24. Tompa, P.; Csermely, P. The role of structural disorder in the function of RNA and protein chaperones. FASEB J. 2004, 18, 1169–1175. [Google Scholar] [CrossRef]
  25. Bondos, S.E.; Dunker, A.K.; Uversky, V.N. On the roles of intrinsically disordered proteins and regions in cell communication and signaling. Cell Commun. Signal. 2021, 19, 88. [Google Scholar] [CrossRef]
  26. Bondos, S.E.; Dunker, A.K.; Uversky, V.N. Intrinsically disordered proteins play diverse roles in cell signaling. Cell Commun. Signal. 2022, 20, 20. [Google Scholar] [CrossRef]
  27. Kulkarni, P.; Bhattacharya, S.; Achuthan, S.; Behal, A.; Jolly, M.K.; Kotnala, S.; Mohanty, A.; Rangarajan, G.; Salgia, R.; Uversky, V. Intrinsically Disordered Proteins: Critical Components of the Wetware. Chem. Rev. 2022, 122, 6614–6633. [Google Scholar] [CrossRef]
  28. Arai, M.; Suetaka, S.; Ooka, K. Dynamics and interactions of intrinsically disordered proteins. Curr. Opin. Struct. Biol. 2024, 84, 102734. [Google Scholar] [CrossRef]
  29. Cai, L.; Wang, G.G. Through the lens of phase separation: Intrinsically unstructured protein and chromatin looping. Nucleus 2023, 14, 2179766. [Google Scholar] [CrossRef]
  30. Miao, J.; Chong, S. Roles of intrinsically disordered protein regions in transcriptional regulation and genome organization. Curr. Opin. Genet. Dev. 2025, 90, 102285. [Google Scholar] [CrossRef]
  31. Dunker, A.K.; Brown, C.J.; Lawson, J.D.; Iakoucheva, L.M.; Obradovic, Z. Intrinsic disorder and protein function. Biochemistry 2002, 41, 6573–6582. [Google Scholar] [CrossRef]
  32. Oldfield, C.J.; Dunker, A.K. Intrinsically disordered proteins and intrinsically disordered protein regions. Annu. Rev. Biochem. 2014, 83, 553–584. [Google Scholar] [CrossRef]
  33. Uversky, V.N. p53 Proteoforms and Intrinsic Disorder: An Illustration of the Protein Structure-Function Continuum Concept. Int. J. Mol. Sci. 2016, 17, 1874. [Google Scholar] [CrossRef]
  34. Dyson, H.J.; Wright, P.E. Coupling of folding and binding for unstructured proteins. Curr. Opin. Struct. Biol. 2002, 12, 54–60. [Google Scholar] [CrossRef]
  35. Uversky, V.N. Multitude of binding modes attainable by intrinsically disordered proteins: A portrait gallery of disorder-based complexes. Chem. Soc. Rev. 2011, 40, 1623–1634. [Google Scholar] [CrossRef]
  36. Oldfield, C.J.; Cheng, Y.; Cortese, M.S.; Romero, P.; Uversky, V.N.; Dunker, A.K. Coupled folding and binding with alpha-helix-forming molecular recognition elements. Biochemistry 2005, 44, 12454–12470. [Google Scholar] [CrossRef]
  37. Hsu, W.L.; Oldfield, C.; Meng, J.; Huang, F.; Xue, B.; Uversky, V.N.; Romero, P.; Dunker, A.K. Intrinsic protein disorder and protein-protein interactions. Pac. Symp. Biocomput. 2012, 2012, 116–127. [Google Scholar] [CrossRef]
  38. Oldfield, C.J.; Meng, J.; Yang, J.Y.; Yang, M.Q.; Uversky, V.N.; Dunker, A.K. Flexible nets: Disorder and induced fit in the associations of p53 and 14-3-3 with their partners. BMC Genom. 2008, 9 (Suppl. 1), S1. [Google Scholar] [CrossRef]
  39. Hsu, W.L.; Oldfield, C.J.; Xue, B.; Meng, J.; Huang, F.; Romero, P.; Uversky, V.N.; Dunker, A.K. Exploring the binding diversity of intrinsically disordered proteins involved in one-to-many binding. Protein Sci. 2013, 22, 258–273. [Google Scholar] [CrossRef]
  40. van der Lee, R.; Buljan, M.; Lang, B.; Weatheritt, R.J.; Daughdrill, G.W.; Dunker, A.K.; Fuxreiter, M.; Gough, J.; Gsponer, J.; Jones, D.T.; et al. Classification of Intrinsically Disordered Regions and Proteins. Chem. Rev. 2014, 114, 6589–6631. [Google Scholar] [CrossRef]
  41. Martin-Yken, H.; Francois, J.M.; Zerbib, D. Knr4: A disordered hub protein at the heart of fungal cell wall signalling. Cell Microbiol. 2016, 18, 1217–1227. [Google Scholar] [CrossRef] [PubMed]
  42. Cumberworth, A.; Lamour, G.; Babu, M.M.; Gsponer, J. Promiscuity as a functional trait: Intrinsically disordered regions as central players of interactomes. Biochem. J. 2013, 454, 361–369. [Google Scholar] [CrossRef] [PubMed]
  43. Hu, G.; Wu, Z.; Uversky, V.N.; Kurgan, L. Functional Analysis of Human Hub Proteins and Their Interactors Involved in the Intrinsic Disorder-Enriched Interactions. Int. J. Mol. Sci. 2017, 18, 2761. [Google Scholar] [CrossRef] [PubMed]
  44. Dosztanyi, Z.; Chen, J.; Dunker, A.K.; Simon, I.; Tompa, P. Disorder and sequence repeats in hub proteins and their implications for network evolution. J. Proteome Res. 2006, 5, 2985–2995. [Google Scholar] [CrossRef]
  45. Dunker, A.K.; Cortese, M.S.; Romero, P.; Iakoucheva, L.M.; Uversky, V.N. Flexible nets. The roles of intrinsic disorder in protein interaction networks. FEBS J. 2005, 272, 5129–5148. [Google Scholar] [CrossRef]
  46. Habchi, J.; Tompa, P.; Longhi, S.; Uversky, V.N. Introducing protein intrinsic disorder. Chem. Rev. 2014, 114, 6561–6588. [Google Scholar] [CrossRef]
  47. Iakoucheva, L.M.; Radivojac, P.; Brown, C.J.; O’Connor, T.R.; Sikes, J.G.; Obradovic, Z.; Dunker, A.K. The importance of intrinsic disorder for protein phosphorylation. Nucleic Acids Res. 2004, 32, 1037–1049. [Google Scholar] [CrossRef]
  48. Pejaver, V.; Hsu, W.L.; Xin, F.; Dunker, A.K.; Uversky, V.N.; Radivojac, P. The structural and functional signatures of proteins that undergo multiple events of post-translational modification. Protein Sci. 2014, 23, 1077–1093. [Google Scholar] [CrossRef]
  49. Bah, A.; Forman-Kay, J.D. Modulation of Intrinsically Disordered Protein Function by Post-translational Modifications. J. Biol. Chem. 2016, 291, 6696–6705. [Google Scholar] [CrossRef]
  50. Csizmok, V.; Forman-Kay, J.D. Complex regulatory mechanisms mediated by the interplay of multiple post-translational modifications. Curr. Opin. Struct. Biol. 2018, 48, 58–67. [Google Scholar] [CrossRef]
  51. Danis, C.; Dupré, E.; Hanoulle, X.; Landrieu, I.; Lasorsa, A.; Neves, J.F.; Schneider, R.; Smet-Nocca, C. Nuclear Magnetic Resonance Spectroscopy Insights into Tau Structure in Solution: Impact of Post-translational Modifications. In Tau Biology; Takashima, A., Wolozin, B., Buee, L., Eds.; Springer Singapore: Singapore, 2019; pp. 35–45. [Google Scholar]
  52. Eisenhaber, B.; Eisenhaber, F. Posttranslational Modifications and Subcellular Localization Signals: Indicators of Sequence Regions without Inherent 3D Structure? Curr. Protein Pept. Sci. 2007, 8, 197–203. [Google Scholar] [CrossRef] [PubMed]
  53. Flock, T.; Weatheritt, R.J.; Latysheva, N.S.; Babu, M.M. Controlling entropy to tune the functions of intrinsically disordered regions. Curr. Opin. Struct. Biol. 2014, 26, 62–72. [Google Scholar] [CrossRef] [PubMed]
  54. Miao, Y.; Tipakornsaowapak, T.; Zheng, L.; Mu, Y.; Lewellyn, E. Phospho-regulation of intrinsically disordered proteins for actin assembly and endocytosis. FEBS J. 2018, 285, 2762–2784. [Google Scholar] [CrossRef] [PubMed]
  55. Modic, M.; Adamek, M.; Ule, J. The impact of IDR phosphorylation on the RNA binding profiles of proteins. Trends Genet. 2024, 40, 580–586. [Google Scholar] [CrossRef]
  56. Owen, I.; Shewmaker, F. The Role of Post-Translational Modifications in the Phase Transitions of Intrinsically Disordered Proteins. Int. J. Mol. Sci. 2019, 20, 5501. [Google Scholar] [CrossRef]
  57. Phillips, A.H.; Kriwacki, R.W. Intrinsic protein disorder and protein modifications in the processing of biological signals. Curr. Opin. Struct. Biol. 2020, 60, 1–6. [Google Scholar] [CrossRef]
  58. Roque, A.; Ponte, I.; Suau, P. Post-translational modifications of the intrinsically disordered terminal domains of histone H1: Effects on secondary structure and chromatin dynamics. Chromosoma 2017, 126, 83–91. [Google Scholar] [CrossRef]
  59. Toruno, T.Y.; Shen, M.; Coaker, G.; Mackey, D. Regulated Disorder: Posttranslational Modifications Control the RIN4 Plant Immune Signaling Hub. Mol. Plant Microbe Interact. 2019, 32, 56–64. [Google Scholar] [CrossRef]
  60. Van Roey, K.; Davey, N.E. Motif co-regulation and co-operativity are common mechanisms in transcriptional, post-transcriptional and post-translational regulation. Cell Commun. Signal. 2015, 13, 45. [Google Scholar] [CrossRef]
  61. Yan, X.; Zhang, M.; Wang, D. Interplay between posttranslational modifications and liquid—liquid phase separation in tumors. Cancer Lett. 2024, 584, 216614. [Google Scholar] [CrossRef]
  62. Ye, H.; Han, Y.; Li, P.; Su, Z.; Huang, Y. The Role of Post-Translational Modifications on the Structure and Function of Tau Protein. J. Mol. Neurosci. 2022, 72, 1557–1571. [Google Scholar] [CrossRef] [PubMed]
  63. Darling, A.L.; Uversky, V.N. Intrinsic Disorder and Posttranslational Modifications: The Darker Side of the Biological Dark Matter. Front. Genet. 2018, 9, 158. [Google Scholar] [CrossRef] [PubMed]
  64. Dunker, A.K.; Obradovic, Z.; Romero, P.; Garner, E.C.; Brown, C.J. Intrinsic protein disorder in complete genomes. Genome Inf. Ser. Workshop Genome Inf. 2000, 11, 161–171. [Google Scholar]
  65. Ward, J.J.; Sodhi, J.S.; McGuffin, L.J.; Buxton, B.F.; Jones, D.T. Prediction and functional analysis of native disorder in proteins from the three kingdoms of life. J. Mol. Biol. 2004, 337, 635–645. [Google Scholar] [CrossRef]
  66. Xue, B.; Dunker, A.K.; Uversky, V.N. Orderly order in protein intrinsic disorder distribution: Disorder in 3500 proteomes from viruses and the three domains of life. J. Biomol. Struct. Dyn. 2012, 30, 137–149. [Google Scholar] [CrossRef]
  67. Peng, Z.; Yan, J.; Fan, X.; Mizianty, M.J.; Xue, B.; Wang, K.; Hu, G.; Uversky, V.N.; Kurgan, L. Exceptionally abundant exceptions: Comprehensive characterization of intrinsic disorder in all domains of life. Cell Mol. Life Sci. 2015, 72, 137–151. [Google Scholar] [CrossRef]
  68. Kumar, N.; Kaushik, R.; Tennakoon, C.; Uversky, V.N.; Longhi, S.; Zhang, K.Y.J.; Bhatia, S. Comprehensive Intrinsic Disorder Analysis of 6108 Viral Proteomes: From the Extent of Intrinsic Disorder Penetrance to Functional Annotation of Disordered Viral Proteins. J. Proteome Res. 2021, 20, 2704–2713. [Google Scholar] [CrossRef]
  69. Skupien-Rabian, B.; Jankowska, U.; Swiderska, B.; Lukasiewicz, S.; Ryszawy, D.; Dziedzicka-Wasylewska, M.; Kedracka-Krok, S. Proteomic and bioinformatic analysis of a nuclear intrinsically disordered proteome. J. Proteom. 2016, 130, 76–84. [Google Scholar] [CrossRef]
  70. Tahzima, R.; Haegeman, A.; Massart, S.; Hebrard, E. Flexible spandrels of the global plant virome: Proteomic-wide evolutionary patterns of structural intrinsic protein disorder elucidate modulation at the functional virus-host interplay. Prog. Mol. Biol. Transl. Sci. 2021, 183, 355–409. [Google Scholar] [CrossRef]
  71. Tompa, P.; Dosztányi, Z.; Simon, I. Prevalent Structural Disorder in E. coli and S. cerevisiae Proteomes. J. Proteome Res. 2006, 5, 1996–2000. [Google Scholar] [CrossRef]
  72. Tompa, P. Intrinsically disordered proteins: A 10-year recap. Trends Biochem. Sci. 2012, 37, 509–516. [Google Scholar] [CrossRef] [PubMed]
  73. Dunker, A.K.; Bondos, S.E.; Huang, F.; Oldfield, C.J. Intrinsically disordered proteins and multicellular organisms. Semin. Cell Dev. Biol. 2015, 37, 44–55. [Google Scholar] [CrossRef] [PubMed]
  74. Niklas, K.J.; Dunker, A.K.; Yruela, I. The evolutionary origins of cell type diversification and the role of intrinsically disordered proteins. J. Exp. Bot. 2018, 69, 1437–1446. [Google Scholar] [CrossRef]
  75. Yruela, I.; Oldfield, C.J.; Niklas, K.J.; Dunker, A.K. Evidence for a Strong Correlation Between Transcription Factor Protein Disorder and Organismic Complexity. Genome Biol. Evol. 2017, 9, 1248–1265. [Google Scholar] [CrossRef]
  76. Kulkarni, P.; Salgia, R. Comprehending phenotypic plasticity in cancer and evolution. iScience 2024, 27, 109308. [Google Scholar] [CrossRef]
  77. Kulkarni, V.; Kulkarni, P. Intrinsically disordered proteins and phenotypic switching: Implications in cancer. Prog. Mol. Biol. Transl. Sci. 2019, 166, 63–84. [Google Scholar] [CrossRef]
  78. Rangarajan, N.; Kulkarni, P.; Hannenhalli, S. Evolutionarily conserved network properties of intrinsically disordered proteins. PLoS ONE 2015, 10, e0126729. [Google Scholar] [CrossRef]
  79. Handa, T.; Kundu, D.; Dubey, V.K. Perspectives on evolutionary and functional importance of intrinsically disordered proteins. Int. J. Biol. Macromol. 2023, 224, 243–255. [Google Scholar] [CrossRef]
  80. Pajkos, M.; Dosztanyi, Z. Functions of intrinsically disordered proteins through evolutionary lenses. Prog. Mol. Biol. Transl. Sci. 2021, 183, 45–74. [Google Scholar] [CrossRef]
  81. Uversky, V.N. On the Roles of Protein Intrinsic Disorder in the Origin of Life and Evolution. Life 2024, 14, 1307. [Google Scholar] [CrossRef]
  82. Uversky, V.N.; Dave, V.; Iakoucheva, L.M.; Malaney, P.; Metallo, S.J.; Pathak, R.R.; Joerger, A.C. Pathological unfoldomics of uncontrolled chaos: Intrinsically disordered proteins and human diseases. Chem. Rev. 2014, 114, 6844–6879. [Google Scholar] [CrossRef] [PubMed]
  83. Uversky, V.N.; Oldfield, C.J.; Dunker, A.K. Intrinsically disordered proteins in human diseases: Introducing the D2 concept. Annu. Rev. Biophys. 2008, 37, 215–246. [Google Scholar] [CrossRef] [PubMed]
  84. Ayyadevara, S.; Ganne, A.; Balasubramaniam, M.; Shmookler Reis, R.J. Intrinsically disordered proteins identified in the aggregate proteome serve as biomarkers of neurodegeneration. Metab. Brain Dis. 2022, 37, 147–152. [Google Scholar] [CrossRef] [PubMed]
  85. Tsoi, P.S.; Quan, M.D.; Ferreon, J.C.; Ferreon, A.C.M. Aggregation of Disordered Proteins Associated with Neurodegeneration. Int. J. Mol. Sci. 2023, 24, 3380. [Google Scholar] [CrossRef]
  86. Babu, M.M. The contribution of intrinsically disordered regions to protein function, cellular complexity, and human disease. Biochem. Soc. Trans. 2016, 44, 1185–1200. [Google Scholar] [CrossRef]
  87. Babu, M.M.; van der Lee, R.; de Groot, N.S.; Gsponer, J. Intrinsically disordered proteins: Regulation and disease. Curr. Opin. Struct. Biol. 2011, 21, 432–440. [Google Scholar] [CrossRef]
  88. Chepsergon, J.; Moleleki, L.N. “Order from disordered”: Potential role of intrinsically disordered regions in phytopathogenic oomycete intracellular effector proteins. Curr. Opin. Plant Biol. 2023, 75, 102402. [Google Scholar] [CrossRef]
  89. Choudhary, S.; Lopus, M.; Hosur, R.V. Targeting disorders in unstructured and structured proteins in various diseases. Biophys. Chem. 2022, 281, 106742. [Google Scholar] [CrossRef]
  90. Cieplak, M.; Chwastyk, M.; Mioduszewski, Ł.; de Aquino, B.R.H. Chapter Two—Transient knots in intrinsically disordered proteins and neurodegeneration. In Progress in Molecular Biology and Translational Science; Uversky, V.N., Ed.; Academic Press: Cambridge, MA, USA, 2020; Volume 174, pp. 79–103. [Google Scholar]
  91. Kodavati, M.; Maloji Rao, V.H.; Provasek, V.E.; Hegde, M.L. Regulation of DNA damage response by RNA/DNA-binding proteins: Implications for neurological disorders and aging. Ageing Res. Rev. 2024, 100, 102413. [Google Scholar] [CrossRef]
  92. Lee, K.E.; Pulido, J.S.; da Palma, M.M.; Procopio, R.; Hufnagel, R.B.; Reynolds, M. A Comprehensive Report of Intrinsically Disordered Regions in Inherited Retinal Diseases. Genes 2023, 14, 1601. [Google Scholar] [CrossRef]
  93. Longhena, F.; Spano, P.; Bellucci, A. Targeting of Disordered Proteins by Small Molecules in Neurodegenerative Diseases. In Targeting Trafficking in Drug Development; Ulloa-Aguirre, A., Tao, Y.-X., Eds.; Springer International Publishing: Cham, Switzerland, 2018; pp. 85–110. [Google Scholar]
  94. Malinovska, L.; Kroschwald, S.; Alberti, S. Protein disorder, prion propensities, and self-organizing macromolecular collectives. Biochim. Biophys. Acta (BBA)-Proteins Proteom. 2013, 1834, 918–931. [Google Scholar] [CrossRef] [PubMed]
  95. Martinelli, A.H.S.; Lopes, F.C.; John, E.B.O.; Carlini, C.R.; Ligabue-Braun, R. Modulation of Disordered Proteins with a Focus on Neurodegenerative Diseases and Other Pathologies. Int. J. Mol. Sci. 2019, 20, 1322. [Google Scholar] [CrossRef] [PubMed]
  96. Mooney, S.M.; Jolly, M.K.; Levine, H.; Kulkarni, P. Phenotypic plasticity in prostate cancer: Role of intrinsically disordered proteins. Asian J. Androl. 2016, 18, 704–710. [Google Scholar] [CrossRef] [PubMed]
  97. Raasakka, A.; Kursula, P. Flexible Players within the Sheaths: The Intrinsically Disordered Proteins of Myelin in Health and Disease. Cells 2020, 9, 470. [Google Scholar] [CrossRef]
  98. Russo, A.; Manna, S.L.; Novellino, E.; Malfitano, A.M.; Marasco, D. Molecular signaling involving intrinsically disordered proteins in prostate cancer. Asian J. Androl. 2016, 18, 673–681. [Google Scholar]
  99. Santofimia-Castano, P.; Rizzuti, B.; Xia, Y.; Abian, O.; Peng, L.; Velazquez-Campoy, A.; Neira, J.L.; Iovanna, J. Targeting intrinsically disordered proteins involved in cancer. Cell Mol. Life Sci. 2020, 77, 1695–1707. [Google Scholar] [CrossRef]
  100. Saurabh, S.; Nadendla, K.; Purohit, S.S.; Sivakumar, P.M.; Cetinel, S. Fuzzy Drug Targets: Disordered Proteins in the Drug-Discovery Realm. ACS Omega 2023, 8, 9729–9747. [Google Scholar] [CrossRef]
  101. Scollo, F.; La Rosa, C. Amyloidogenic Intrinsically Disordered Proteins: New Insights into Their Self-Assembly and Their Interaction with Membranes. Life 2020, 10, 144. [Google Scholar] [CrossRef]
  102. Shahrajabian, M.H.; Sun, W. Characterization of Intrinsically Disordered Proteins in Healthy and Diseased States by Nuclear Magnetic Resonance. Rev. Recent Clin. Trials 2024, 19, 176–188. [Google Scholar] [CrossRef]
  103. Vacic, V.; Iakoucheva, L.M. Disease mutations in disordered regions--exception to the rule? Mol. Biosyst. 2012, 8, 27–32. [Google Scholar] [CrossRef]
  104. Chiti, F.; Dobson, C.M. Protein misfolding, functional amyloid, and human disease. Annu. Rev. Biochem. 2006, 75, 333–366. [Google Scholar] [CrossRef] [PubMed]
  105. Dobson, C.M. Protein folding and misfolding. Nature 2003, 426, 884–890. [Google Scholar] [CrossRef] [PubMed]
  106. Van Bibber, N.W.; Haerle, C.; Khalife, R.; Xue, B.; Uversky, V.N. Intrinsic Disorder in Tetratricopeptide Repeat Proteins. Int. J. Mol. Sci. 2020, 21, 3709. [Google Scholar] [CrossRef]
  107. Meng, F.; Na, I.; Kurgan, L.; Uversky, V.N. Compartmentalization and Functionality of Nuclear Disorder: Intrinsic Disorder and Protein-Protein Interactions in Intra-Nuclear Compartments. Int. J. Mol. Sci. 2015, 17, 24. [Google Scholar] [CrossRef]
  108. Darling, A.L.; Liu, Y.; Oldfield, C.J.; Uversky, V.N. Intrinsically Disordered Proteome of Human Membrane−Less Organelles. Proteomics 2018, 18, e1700193. [Google Scholar] [CrossRef]
  109. Uversky, V.N. The roles of intrinsic disorder-based liquid-liquid phase transitions in the “Dr. Jekyll-Mr. Hyde” behavior of proteins involved in amyotrophic lateral sclerosis and frontotemporal lobar degeneration. Autophagy 2017, 13, 2115–2162. [Google Scholar] [CrossRef]
  110. Fonin, A.V.; Darling, A.L.; Kuznetsova, I.M.; Turoverov, K.K.; Uversky, V.N. Intrinsically disordered proteins in crowded milieu: When chaos prevails within the cellular gumbo. Cell Mol. Life Sci. 2018, 75, 3907–3929. [Google Scholar] [CrossRef]
  111. Uversky, V.N. Protein intrinsic disorder-based liquid-liquid phase transitions in biological systems: Complex coacervates and membrane-less organelles. Adv. Colloid Interface Sci. 2017, 239, 97–114. [Google Scholar] [CrossRef]
  112. Antifeeva, I.A.; Fonin, A.V.; Fefilova, A.S.; Stepanenko, O.V.; Povarova, O.I.; Silonov, S.A.; Kuznetsova, I.M.; Uversky, V.N.; Turoverov, K.K. Liquid-liquid phase separation as an organizing principle of intracellular space: Overview of the evolution of the cell compartmentalization concept. Cell Mol. Life Sci. 2022, 79, 251. [Google Scholar] [CrossRef]
  113. Uversky, V.N. Intrinsically disordered proteins in overcrowded milieu: Membrane-less organelles, phase separation, and intrinsic disorder. Curr. Opin. Struct. Biol. 2017, 44, 18–30. [Google Scholar] [CrossRef]
  114. Fonin, A.V.; Antifeeva, I.A.; Kuznetsova, I.M.; Turoverov, K.K.; Zaslavsky, B.Y.; Kulkarni, P.; Uversky, V.N. Biological soft matter: Intrinsically disordered proteins in liquid-liquid phase separation and biomolecular condensates. Essays Biochem. 2022, 66, 831–847. [Google Scholar] [CrossRef] [PubMed]
  115. Turoverov, K.K.; Kuznetsova, I.M.; Fonin, A.V.; Darling, A.L.; Zaslavsky, B.Y.; Uversky, V.N. Stochasticity of Biological Soft Matter: Emerging Concepts in Intrinsically Disordered Proteins and Biological Phase Separation. Trends Biochem. Sci. 2019, 44, 716–728. [Google Scholar] [CrossRef] [PubMed]
  116. Uversky, V.N.; Kuznetsova, I.M.; Turoverov, K.K.; Zaslavsky, B. Intrinsically disordered proteins as crucial constituents of cellular aqueous two phase systems and coacervates. FEBS Lett. 2015, 589, 15–22. [Google Scholar] [CrossRef]
  117. Alberti, S.; Gladfelter, A.; Mittag, T. Considerations and Challenges in Studying Liquid-Liquid Phase Separation and Biomolecular Condensates. Cell 2019, 176, 419–434. [Google Scholar] [CrossRef]
  118. Patel, A.; Malinovska, L.; Saha, S.; Wang, J.; Alberti, S.; Krishnan, Y.; Hyman, A.A. ATP as a biological hydrotrope. Science 2017, 356, 753–756. [Google Scholar] [CrossRef]
  119. Das, R.K.; Pappu, R.V. Conformations of intrinsically disordered proteins are influenced by linear sequence distributions of oppositely charged residues. Proc. Natl. Acad. Sci. USA 2013, 110, 13392–13397. [Google Scholar] [CrossRef]
  120. McCarty, J.; Delaney, K.T.; Danielsen, S.P.O.; Fredrickson, G.H.; Shea, J.E. Complete Phase Diagram for Liquid-Liquid Phase Separation of Intrinsically Disordered Proteins. J. Phys. Chem. Lett. 2019, 10, 1644–1652. [Google Scholar] [CrossRef]
  121. Banani, S.F.; Lee, H.O.; Hyman, A.A.; Rosen, M.K. Biomolecular condensates: Organizers of cellular biochemistry. Nat. Rev. Mol. Cell Biol. 2017, 18, 285–298. [Google Scholar] [CrossRef]
  122. Shin, Y.; Brangwynne, C.P. Liquid phase condensation in cell physiology and disease. Science 2017, 357, eaaf4382. [Google Scholar] [CrossRef]
  123. Lin, Y.; Protter, D.S.W.; Rosen, M.K.; Parker, R. Formation and Maturation of Phase-Separated Liquid Droplets by RNA-Binding Proteins. Mol. Cell 2015, 60, 208–219. [Google Scholar] [CrossRef]
  124. Dayhoff, G.W., 2nd; Uversky, V.N. Rapid prediction and analysis of protein intrinsic disorder. Protein Sci. 2022, 31, e4496. [Google Scholar] [CrossRef] [PubMed]
  125. Li, X.; Romero, P.; Rani, M.; Dunker, A.K.; Obradovic, Z. Predicting Protein Disorder for N-, C-, and Internal Regions. Genome Inf. Ser. Workshop Genome Inf. 1999, 10, 30–40. [Google Scholar]
  126. Radivojac, P.; Obradovic, Z.; Brown, C.J.; Dunker, A.K. Prediction of boundaries between intrinsically ordered and disordered protein regions. In Biocomputing; World Scientific: Singapore, 2003; pp. 216–227. [Google Scholar]
  127. Peng, K.; Radivojac, P.; Vucetic, S.; Dunker, A.K.; Obradovic, Z. Length-dependent prediction of protein intrinsic disorder. BMC Bioinform. 2006, 7, 208. [Google Scholar] [CrossRef] [PubMed]
  128. Dosztanyi, Z.; Csizmok, V.; Tompa, P.; Simon, I. The pairwise energy content estimated from amino acid composition discriminates between folded and intrinsically unstructured proteins. J. Mol. Biol. 2005, 347, 827–839. [Google Scholar] [CrossRef]
  129. Meszaros, B.; Erdos, G.; Dosztanyi, Z. IUPred2A: Context-dependent prediction of protein disorder as a function of redox state and protein binding. Nucleic Acids Res. 2018, 46, W329–W337. [Google Scholar] [CrossRef]
  130. Xue, B.; Dunbrack, R.L.; Williams, R.W.; Dunker, A.K.; Uversky, V.N. PONDR-FIT: A meta-predictor of intrinsically disordered amino acids. Biochim. Biophys. Acta 2010, 1804, 996–1010. [Google Scholar] [CrossRef]
  131. Oldfield, C.J.; Cheng, Y.; Cortese, M.S.; Brown, C.J.; Uversky, V.N.; Dunker, A.K. Comparing and combining predictors of mostly disordered proteins. Biochemistry 2005, 44, 1989–2000. [Google Scholar] [CrossRef]
  132. Mohan, A.; Sullivan, W.J., Jr.; Radivojac, P.; Dunker, A.K.; Uversky, V.N. Intrinsic disorder in pathogenic and non-pathogenic microbes: Discovering and analyzing the unfoldomes of early-branching eukaryotes. Mol. Biosyst. 2008, 4, 328–340. [Google Scholar] [CrossRef]
  133. Huang, F.; Oldfield, C.; Meng, J.; Hsu, W.L.; Xue, B.; Uversky, V.N.; Romero, P.; Dunker, A.K. Subclassifying disordered proteins by the CH−CDF plot method. In Biocomputing; World Scientific: Singapore, 2012; pp. 128–139. [Google Scholar]
  134. Huang, F.; Oldfield, C.J.; Xue, B.; Hsu, W.L.; Meng, J.; Liu, X.; Shen, L.; Romero, P.; Uversky, V.N.; Dunker, A. Improving protein order-disorder classification using charge-hydropathy plots. BMC Bioinform. 2014, 15 (Suppl. 17), S4. [Google Scholar] [CrossRef]
  135. Oates, M.E.; Romero, P.; Ishida, T.; Ghalwash, M.; Mizianty, M.J.; Xue, B.; Dosztanyi, Z.; Uversky, V.N.; Obradovic, Z.; Kurgan, L.; et al. D(2)P(2): Database of disordered protein predictions. Nucleic Acids Res. 2013, 41, D508–D516. [Google Scholar] [CrossRef]
  136. Ishida, T.; Kinoshita, K. PrDOS: Prediction of disordered protein regions from amino acid sequence. Nucleic Acids Res. 2007, 35, W460–W464. [Google Scholar] [CrossRef] [PubMed]
  137. Walsh, I.; Martin, A.J.; Di Domenico, T.; Tosatto, S.C. ESpritz: Accurate and fast prediction of protein disorder. Bioinformatics 2012, 28, 503–509. [Google Scholar] [CrossRef] [PubMed]
  138. Andreeva, A.; Howorth, D.; Brenner, S.E.; Hubbard, T.J.; Chothia, C.; Murzin, A.G. SCOP database in 2004: Refinements integrate structure and sequence family data. Nucleic Acids Res. 2004, 32, D226–D229. [Google Scholar] [CrossRef] [PubMed]
  139. Murzin, A.G.; Brenner, S.E.; Hubbard, T.; Chothia, C. SCOP: A structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol. 1995, 247, 536–540. [Google Scholar] [CrossRef]
  140. de Lima Morais, D.A.; Fang, H.; Rackham, O.J.; Wilson, D.; Pethica, R.; Chothia, C.; Gough, J. SUPERFAMILY 1.75 including a domain-centric gene ontology method. Nucleic Acids Res. 2011, 39, D427–D434. [Google Scholar] [CrossRef]
  141. Hornbeck, P.V.; Kornhauser, J.M.; Tkachev, S.; Zhang, B.; Skrzypek, E.; Murray, B.; Latham, V.; Sullivan, M. PhosphoSitePlus: A comprehensive resource for investigating the structure and function of experimentally determined post-translational modifications in man and mouse. Nucleic Acids Res. 2012, 40, D261–D270. [Google Scholar] [CrossRef]
  142. Dosztanyi, Z.; Meszaros, B.; Simon, I. ANCHOR: Web server for predicting protein binding regions in disordered proteins. Bioinformatics 2009, 25, 2745–2746. [Google Scholar] [CrossRef]
  143. Meszaros, B.; Simon, I.; Dosztanyi, Z. Prediction of protein binding regions in disordered proteins. PLoS Comput. Biol. 2009, 5, e1000376. [Google Scholar] [CrossRef]
  144. Szklarczyk, D.; Franceschini, A.; Kuhn, M.; Simonovic, M.; Roth, A.; Minguez, P.; Doerks, T.; Stark, M.; Muller, J.; Bork, P.; et al. The STRING database in 2011: Functional interaction networks of proteins, globally integrated and scored. Nucleic Acids Res. 2011, 39, D561–D568. [Google Scholar] [CrossRef]
  145. Van Bibber, N.W.; Haerle, C.; Khalife, R.; Dayhoff, G.W., 2nd; Uversky, V.N. Intrinsic Disorder in Human Proteins Encoded by Core Duplicon Gene Families. J. Phys. Chem. B 2020, 124, 8050–8070. [Google Scholar] [CrossRef]
  146. Szklarczyk, D.; Kirsch, R.; Koutrouli, M.; Nastou, K.; Mehryary, F.; Hachilif, R.; Gable, A.L.; Fang, T.; Doncheva, N.T.; Pyysalo, S.; et al. The STRING database in 2023: Protein-protein association networks and functional enrichment analyses for any sequenced genome of interest. Nucleic Acids Res. 2023, 51, D638–D646. [Google Scholar] [CrossRef] [PubMed]
  147. Hardenberg, M.; Horvath, A.; Ambrus, V.; Fuxreiter, M.; Vendruscolo, M. Widespread occurrence of the droplet state of proteins in the human proteome. Proc. Natl. Acad. Sci. USA 2020, 117, 33254–33262. [Google Scholar] [CrossRef] [PubMed]
  148. Hatos, A.; Tosatto, S.C.E.; Vendruscolo, M.; Fuxreiter, M. FuzDrop on AlphaFold: Visualizing the sequence-dependent propensity of liquid-liquid phase separation and aggregation of proteins. Nucleic Acids Res. 2022, 50, W337–W344. [Google Scholar] [CrossRef] [PubMed]
  149. Berman, H.; Henrick, K.; Nakamura, H. Announcing the worldwide Protein Data Bank. Nat. Struct. Mol. Biol. 2003, 10, 980. [Google Scholar] [CrossRef]
  150. Suetake, T.; Hayashi, F.; Yokoyama, S. RIKEN Structural Genomics/Proteomics Initiative (RSGI). 2D9L. Available online: https://www.rcsb.org/structure/2D9L (accessed on 13 April 2025).
  151. Vance, T.D.R.; Yip, P.; Jiménez, E.; Li, S.; Gawol, D.; Byrnes, J.; Usón, I.; Ziyyat, A.; Lee, J.E. SPACA6 ectodomain structure reveals a conserved superfamily of gamete fusion-associated proteins. Commun. Biol. 2022, 5, 984. [Google Scholar] [CrossRef]
  152. He, L.; Yu, Z.; Geng, Z.; Huang, Z.; Zhang, C.; Dong, Y.; Gao, Y.; Wang, Y.; Chen, Q.; Sun, L.; et al. Structure, gating, and pharmacology of human CaV3.3 channel. Nat. Commun. 2022, 13, 2084. [Google Scholar] [CrossRef]
  153. Lavering, E.D.; Gandhamaneni, M.; Weeks, D.L. Intrinsically disordered regions are not sufficient to direct the compartmental localization of nucleolar proteins in the nucleus. PLoS Biol. 2023, 21, e3002378. [Google Scholar] [CrossRef]
  154. Snead, D.; Eliezer, D. Intrinsically disordered proteins in synaptic vesicle trafficking and release. J. Biol. Chem. 2019, 294, 3325–3342. [Google Scholar] [CrossRef]
  155. Rodriguez Peña, M.J.; Castillo Bennett, J.V.; Soler, O.M.; Mayorga, L.S.; Michaut, M.A. MARCKS Protein Is Phosphorylated and Regulates Calcium Mobilization during Human Acrosomal Exocytosis. PLoS ONE 2013, 8, e64551. [Google Scholar] [CrossRef]
  156. Breitbart, H. Intracellular calcium regulation in sperm capacitation and acrosomal reaction. Mol. Cell. Endocrinol. 2002, 187, 139–144. [Google Scholar] [CrossRef]
  157. López-Palacios, T.P.; Andersen, J.L. Kinase regulation by liquid-liquid phase separation. Trends Cell Biol. 2023, 33, 649–666. [Google Scholar] [CrossRef] [PubMed]
  158. Arbuzova, A.; Schmitz, A.A.; Vergeres, G. Cross-talk unfolded: MARCKS proteins. Biochem. J. 2002, 362, 1–12. [Google Scholar] [CrossRef] [PubMed]
  159. Barral, S.; Morozumi, Y.; Tanaka, H.; Montellier, E.; Govin, J.; De Dieuleveult, M.; Charbonnier, G.; Couté, Y.; Puthier, D.; Buchou, T.; et al. Histone Variant H2A.L.2 Guides Transition Protein-Dependent Protamine Assembly in Male Germ Cells. Mol. Cell 2017, 66, 89–101.e8. [Google Scholar] [CrossRef]
  160. Zhao, M.; Shirley, C.R.; Hayashi, S.; Marcon, L.; Mohapatra, B.; Suganuma, R.; Behringer, R.R.; Boissonneault, G.; Yanagimachi, R.; Meistrich, M.L. Transition nuclear proteins are required for normal chromatin condensation and functional sperm development. Genesis 2004, 38, 200–213. [Google Scholar] [CrossRef]
  161. Shakya, A.; Park, S.; Rana, N.; King, J.T. Liquid-Liquid Phase Separation of Histone Proteins in Cells: Role in Chromatin Organization. Biophys. J. 2020, 118, 753–764. [Google Scholar] [CrossRef]
  162. Kolthur-Seetharam, U.; Pradeepa, M.M.; Gupta, N.; Narayanaswamy, R.; Rao, M.R. Spatiotemporal organization of AT- and GC-rich DNA and their association with transition proteins TP1 and TP2 in rat condensing spermatids. J. Histochem. Cytochem. 2009, 57, 951–962. [Google Scholar] [CrossRef]
  163. Gerhard, D.S.; Wagner, L.; Feingold, E.A.; Shenmen, C.M.; Grouse, L.H.; Schuler, G.; Klein, S.L.; Old, S.; Rasooly, R.; Good, P.; et al. The Status, Quality, and Expansion of the NIH Full-Length cDNA Project: The Mammalian Gene Collection (MGC). Genome Res. 2004, 14, 2121–2127. [Google Scholar] [CrossRef]
  164. Hall, E.A.; Keighren, M.; Ford, M.J.; Davey, T.; Jarman, A.P.; Smith, L.B.; Jackson, I.J.; Mill, P. Acute Versus Chronic Loss of Mammalian Azi1/Cep131 Results in Distinct Ciliary Phenotypes. PLoS Genet. 2013, 9, e1003928. [Google Scholar] [CrossRef]
  165. Wilkinson, C.J.; Carl, M.; Harris, W.A. Cep70 and Cep131 contribute to ciliogenesis in zebrafish embryos. BMC Cell Biol. 2009, 10, 17. [Google Scholar] [CrossRef]
  166. Staples, C.J.; Myers, K.N.; Beveridge, R.D.; Patil, A.A.; Lee, A.J.; Swanton, C.; Howell, M.; Boulton, S.J.; Collis, S.J. The centriolar satellite protein Cep131 is important for genome stability. J. Cell Sci. 2012, 125, 4770–4779. [Google Scholar] [CrossRef]
  167. Liu, Q.; Guo, Q.; Guo, W.; Song, S.; Wang, N.; Chen, X.; Sun, A.; Yan, L.; Qiao, J. Loss of CEP70 function affects acrosome biogenesis and flagella formation during spermiogenesis. Cell Death Dis. 2021, 12, 478. [Google Scholar] [CrossRef] [PubMed]
  168. Zhang, X.; Huang, G.; Jiang, T.; Meng, L.; Li, T.; Zhang, G.; Wu, N.; Chen, X.; Zhao, B.; Li, N.; et al. CEP112 coordinates translational regulation of essential fertility genes during spermiogenesis through phase separation in humans and mice. Nat. Commun. 2024, 15, 8465. [Google Scholar] [CrossRef] [PubMed]
  169. Kim, D.H.; Ahn, J.S.; Han, H.J.; Kim, H.M.; Hwang, J.; Lee, K.H.; Cha-Molstad, H.; Ryoo, I.J.; Jang, J.H.; Ko, S.K.; et al. Cep131 overexpression promotes centrosome amplification and colon cancer progression by regulating Plk4 stability. Cell Death Dis. 2019, 10, 570. [Google Scholar] [CrossRef] [PubMed]
  170. Liu, X.-H.; Yang, Y.-F.; Fang, H.-Y.; Wang, X.-H.; Zhang, M.-F.; Wu, D.-C. CEP131 indicates poor prognosis and promotes cell proliferation and migration in hepatocellular carcinoma. Int. J. Biochem. Cell Biol. 2017, 90, 1–8. [Google Scholar] [CrossRef]
  171. Shen, M.; Li, S.; Zhao, Y.; Liu, Y.; Liu, Z.; Huan, L.; Qiao, Y.; Wang, L.; Han, L.; Chen, Z.; et al. Hepatic ARID3A facilitates liver cancer malignancy by cooperating with CEP131 to regulate an embryonic stem cell-like gene signature. Cell Death Dis. 2022, 13, 732. [Google Scholar] [CrossRef]
  172. Jin, H.J.; Fan, Y.; Yang, X.; Dong, Y.; Zhang, X.Z.; Geng, X.Y.; Yan, Z.; Wu, L.; Ma, M.; Li, B.; et al. Disruption in CYLC1 leads to acrosome detachment, sperm head deformity, and male in/subfertility in humans and mice. eLife 2024, 13, RP95054. [Google Scholar] [CrossRef]
  173. Zhang, X.M.; Sheng, S.R.; Wang, X.Y.; Bin, L.H.; Wang, J.R.; Li, G.Y. Expression of tumor related gene NAG6 in gastric cancer and restriction fragment length polymorphism analysis. World J. Gastroenterol. 2004, 10, 1361–1364. [Google Scholar] [CrossRef]
  174. Geng, Q.; Ni, L.; Ouyang, B.; Hu, Y.; Zhao, Y.; Guo, J. A Novel Testis-Specific Gene, Ccdc136, Is Required for Acrosome Formation and Fertilization in Mice. Reprod. Sci. 2016, 23, 1387–1396. [Google Scholar] [CrossRef]
Figure 1. Flowchart of Computational Methods. Overview of the bioinformatics pipeline used to analyze the human acrosomal proteome. A dataset of acrosomal proteins was compiled from UniProt and analyzed using RIDAO to predict intrinsic disorder (PPIDR and ADS), D²P² to identify SCOP domains and functional features, FuzDrop to evaluate LLPS propensity, and STRING to model protein-protein interactions. Outputs from all tools were integrated and statistically analyzed using R for visualization and interpretation.
Figure 1. Flowchart of Computational Methods. Overview of the bioinformatics pipeline used to analyze the human acrosomal proteome. A dataset of acrosomal proteins was compiled from UniProt and analyzed using RIDAO to predict intrinsic disorder (PPIDR and ADS), D²P² to identify SCOP domains and functional features, FuzDrop to evaluate LLPS propensity, and STRING to model protein-protein interactions. Outputs from all tools were integrated and statistically analyzed using R for visualization and interpretation.
Proteomes 13 00016 g001
Figure 2. Structural Representations of Selected Proteins. From left to right, the structures shown are: Arf-GAP domain and FG repeat-containing protein 1 (obtained via Solution NMR; PDB ID: 2D9L; [149,150], Sperm acrosome membrane-associated protein 6 (obtained via X-ray diffraction; PDB ID: 7TA2; [151]), and Voltage-dependent T-type calcium channel subunit alpha-1H (obtained via Cryo-EM; PDB ID: 7WLJ; [152]). Each structure is colored in a rainbow gradient from the N-terminus (blue) to the C-terminus (red), highlighting the secondary structural features, including α-helices, β-strands, and loops.
Figure 2. Structural Representations of Selected Proteins. From left to right, the structures shown are: Arf-GAP domain and FG repeat-containing protein 1 (obtained via Solution NMR; PDB ID: 2D9L; [149,150], Sperm acrosome membrane-associated protein 6 (obtained via X-ray diffraction; PDB ID: 7TA2; [151]), and Voltage-dependent T-type calcium channel subunit alpha-1H (obtained via Cryo-EM; PDB ID: 7WLJ; [152]). Each structure is colored in a rainbow gradient from the N-terminus (blue) to the C-terminus (red), highlighting the secondary structural features, including α-helices, β-strands, and loops.
Proteomes 13 00016 g002
Figure 3. Predicted Structures of the Top 10 Most Disordered Proteins. AlphaFold was used to model the predicted structures of the 10 most disordered proteins in our dataset, creating a structural portrait gallery. Proteins are displayed by UniProt ID from left to right, starting with the top row: Calcium-binding and spermatid-specific protein 1 (UniProt ID: Q96KC9), regulating synaptic membrane exocytosis protein 1 (UniProt ID: Q86UR5), acrosomal protein SP-10 (UniProt ID: P26436), TATA element modulatory factor (UniProt ID: P82094), Golgin subfamily A member 1 (UniProt ID: Q92805), nuclear transition protein 2 (UniProt ID: Q05952), coiled-coil domain-containing protein 136 (UniProt ID: Q96JN2), myristoylated alanine-rich C-kinase substrate (UniProt ID: P29966), Cylicin-1 (UniProt ID: P35663), and centrosomal protein of 131 kDa (UniProt ID: Q9UPN4). Colors represent the predicted local distance difference test (pLDDT) scores: very high confidence (pLDDT > 90, dark blue), high confidence (90 > pLDDT > 70, light blue), low confidence (70 > pLDDT > 50, yellow), and very low confidence (pLDDT < 50, orange).
Figure 3. Predicted Structures of the Top 10 Most Disordered Proteins. AlphaFold was used to model the predicted structures of the 10 most disordered proteins in our dataset, creating a structural portrait gallery. Proteins are displayed by UniProt ID from left to right, starting with the top row: Calcium-binding and spermatid-specific protein 1 (UniProt ID: Q96KC9), regulating synaptic membrane exocytosis protein 1 (UniProt ID: Q86UR5), acrosomal protein SP-10 (UniProt ID: P26436), TATA element modulatory factor (UniProt ID: P82094), Golgin subfamily A member 1 (UniProt ID: Q92805), nuclear transition protein 2 (UniProt ID: Q05952), coiled-coil domain-containing protein 136 (UniProt ID: Q96JN2), myristoylated alanine-rich C-kinase substrate (UniProt ID: P29966), Cylicin-1 (UniProt ID: P35663), and centrosomal protein of 131 kDa (UniProt ID: Q9UPN4). Colors represent the predicted local distance difference test (pLDDT) scores: very high confidence (pLDDT > 90, dark blue), high confidence (90 > pLDDT > 70, light blue), low confidence (70 > pLDDT > 50, yellow), and very low confidence (pLDDT < 50, orange).
Proteomes 13 00016 g003
Figure 4. Percent Disorder Across Prediction Models. This boxplot compares the distribution of percent disorder (PER) predicted by seven different models: IUPred long, IUPred short, mean disorder propensity (MDP), PONDR-FIT, PONDR® VL3, PONDR® VLXT, and PONDR® VSL2B (PER-IUPL, PER-IUPS, PER-MDP, PER-PFIT, PER-VL3, PER-VLXT, and PER-VSL2B, respectively). Each box represents the interquartile range (IQR), with the median marked by a horizontal line and outliers shown as individual points. The PER-VSL2B model predicts the highest percent disorder overall, while models such as PER-IUPL and PER-IUPS show lower median values, indicating variability in disorder prediction across different tools.
Figure 4. Percent Disorder Across Prediction Models. This boxplot compares the distribution of percent disorder (PER) predicted by seven different models: IUPred long, IUPred short, mean disorder propensity (MDP), PONDR-FIT, PONDR® VL3, PONDR® VLXT, and PONDR® VSL2B (PER-IUPL, PER-IUPS, PER-MDP, PER-PFIT, PER-VL3, PER-VLXT, and PER-VSL2B, respectively). Each box represents the interquartile range (IQR), with the median marked by a horizontal line and outliers shown as individual points. The PER-VSL2B model predicts the highest percent disorder overall, while models such as PER-IUPL and PER-IUPS show lower median values, indicating variability in disorder prediction across different tools.
Proteomes 13 00016 g004
Figure 5. Protein Disorder Categories by Model. This figure shows the distribution of proteins categorized into three disorder classes across multiple prediction models. Proteins are classified as highly disordered (>30% disordered residues, red), moderately disordered (10–30% disordered residues, blue), and highly ordered (0−10% disordered residues, green). Each pie chart corresponds to a specific prediction model. The numbers (n) represent the count of proteins within each disorder category for the respective model.
Figure 5. Protein Disorder Categories by Model. This figure shows the distribution of proteins categorized into three disorder classes across multiple prediction models. Proteins are classified as highly disordered (>30% disordered residues, red), moderately disordered (10–30% disordered residues, blue), and highly ordered (0−10% disordered residues, green). Each pie chart corresponds to a specific prediction model. The numbers (n) represent the count of proteins within each disorder category for the respective model.
Proteomes 13 00016 g005
Figure 6. Scatter Plot of PPIDR vs. ADS Across All Models Analyzed. This scatter plot shows the relationship between PPIDR (percent disorder) and ADS (average disorder score) across multiple prediction models. Each dot represents a protein, colored by the respective disorder prediction model IUPred long, IUPred short, MDP, PONDR-FIT, PONDR® VL3, PONDR® VLXT, and PONDR® VSL2B (IUPL, IUPS, MDP, PFIT, VL3, VLXT, and VSL2B). A strong positive correlation is observed, with a correlation coefficient of r = 0.947 and a statistically significant p-value (p < 0.001), indicating a strong linear relationship between percent disorder and average disorder score.
Figure 6. Scatter Plot of PPIDR vs. ADS Across All Models Analyzed. This scatter plot shows the relationship between PPIDR (percent disorder) and ADS (average disorder score) across multiple prediction models. Each dot represents a protein, colored by the respective disorder prediction model IUPred long, IUPred short, MDP, PONDR-FIT, PONDR® VL3, PONDR® VLXT, and PONDR® VSL2B (IUPL, IUPS, MDP, PFIT, VL3, VLXT, and VSL2B). A strong positive correlation is observed, with a correlation coefficient of r = 0.947 and a statistically significant p-value (p < 0.001), indicating a strong linear relationship between percent disorder and average disorder score.
Proteomes 13 00016 g006
Figure 7. (A) CH-CDF plot of intrinsic disorder status. This scatter plot displays the relationship between CH scores (distance from CH boundary) and CDF scores (average distance from CDF boundary), classifying proteins into four quadrants based on their intrinsic disorder status. Quadrants are defined as Q1 (Ordered by both CH and CDF boundaries, blue), Q2 (CH Ordered, CDF Disordered, green), Q3 (Disordered by both CH and CDF boundaries, red), and Q4 (CH Disordered, CDF Ordered, orange). This classification highlights the agreement or disagreement between CH and CDF disorder predictions. (B). Number and ratio of proteins in each quadrant. The bar chart shows the distribution of proteins across four quadrants based on CH-CDF disorder classification. The percentages on the top of each bar represent the percent of total proteins in that category.
Figure 7. (A) CH-CDF plot of intrinsic disorder status. This scatter plot displays the relationship between CH scores (distance from CH boundary) and CDF scores (average distance from CDF boundary), classifying proteins into four quadrants based on their intrinsic disorder status. Quadrants are defined as Q1 (Ordered by both CH and CDF boundaries, blue), Q2 (CH Ordered, CDF Disordered, green), Q3 (Disordered by both CH and CDF boundaries, red), and Q4 (CH Disordered, CDF Ordered, orange). This classification highlights the agreement or disagreement between CH and CDF disorder predictions. (B). Number and ratio of proteins in each quadrant. The bar chart shows the distribution of proteins across four quadrants based on CH-CDF disorder classification. The percentages on the top of each bar represent the percent of total proteins in that category.
Proteomes 13 00016 g007
Figure 8. Number of Proteins at Each Analysis Stage. This bar chart displays the number of proteins retained through successive stages of analysis. The stages include D2P2 hits (233 proteins), proteins with 75% agreement on disorder prediction (190 proteins), SCOP hits (148 proteins), and SCOP domains located within disordered regions (48 proteins). This stepwise filtering highlights the progressive narrowing of the dataset to identify proteins with structural disorder within SCOP domains overlap.
Figure 8. Number of Proteins at Each Analysis Stage. This bar chart displays the number of proteins retained through successive stages of analysis. The stages include D2P2 hits (233 proteins), proteins with 75% agreement on disorder prediction (190 proteins), SCOP hits (148 proteins), and SCOP domains located within disordered regions (48 proteins). This stepwise filtering highlights the progressive narrowing of the dataset to identify proteins with structural disorder within SCOP domains overlap.
Proteomes 13 00016 g008
Figure 9. CH-CDF Plot with Disorder-SCOP Overlap. This plot illustrates the relationship between CH scores (distance from CH boundary) and CDF scores (average distance from CDF boundary) for proteins with and without disorder-SCOP overlaps. Data points are classified into four quadrants: Q1 (Ordered by both CH and CDF boundaries, blue), Q2 (CH Ordered, CDF Disordered, green), Q3 (Disordered by both CH and CDF boundaries, red), and Q4 (CH Disordered, CDF Ordered, orange). Circles (●) indicate proteins with disordered regions (classified by 75% agreement in disorder) that overlap with SCOP domains, while triangles (▲) indicate no overlap. This visualization highlights the distribution of structural classification across the disorder-CDF boundary space.
Figure 9. CH-CDF Plot with Disorder-SCOP Overlap. This plot illustrates the relationship between CH scores (distance from CH boundary) and CDF scores (average distance from CDF boundary) for proteins with and without disorder-SCOP overlaps. Data points are classified into four quadrants: Q1 (Ordered by both CH and CDF boundaries, blue), Q2 (CH Ordered, CDF Disordered, green), Q3 (Disordered by both CH and CDF boundaries, red), and Q4 (CH Disordered, CDF Ordered, orange). Circles (●) indicate proteins with disordered regions (classified by 75% agreement in disorder) that overlap with SCOP domains, while triangles (▲) indicate no overlap. This visualization highlights the distribution of structural classification across the disorder-CDF boundary space.
Proteomes 13 00016 g009
Figure 10. Gene Ontology Analysis of Disorder-Affected Proteins. This bubble plot displays enriched Gene Ontology (GO) terms for disorder-affected proteins across three GO categories: Molecular Function (GO:MF), Biological Process (GO:BP), and Cellular Component (GO:CC). The size of each bubble corresponds to the number of proteins associated with the term, while the y-axis represents the statistical significance (−log10 p-value). KEGG, Reactome (REAC), and other pathways are also included, illustrating the functional and structural roles of intrinsically disordered proteins within biological systems. Values above the dotted line threshold were capped.
Figure 10. Gene Ontology Analysis of Disorder-Affected Proteins. This bubble plot displays enriched Gene Ontology (GO) terms for disorder-affected proteins across three GO categories: Molecular Function (GO:MF), Biological Process (GO:BP), and Cellular Component (GO:CC). The size of each bubble corresponds to the number of proteins associated with the term, while the y-axis represents the statistical significance (−log10 p-value). KEGG, Reactome (REAC), and other pathways are also included, illustrating the functional and structural roles of intrinsically disordered proteins within biological systems. Values above the dotted line threshold were capped.
Proteomes 13 00016 g010
Figure 11. Gene Ontology (GO) Hierarchical Annotation for ‘Acrosomal Vesicle’ (GO:0001669). This hierarchical chart depicts the Gene Ontology relationships for GO:0001669 (acrosomal vesicle), a cellular component critical for acrosome structure and function during fertilization. This chart traces the acrosomal vesicle as part of higher-level biological processes, such as vesicle fusion (GO:0099500), organelle membrane fusion (GO:0090174), and membrane fusion (GO:0061025). These components are further nested under broad categories like organelle organization (GO:0006996) and cellular anatomical structure (GO:0110165). The acrosomal vesicle is functionally linked to structures like the acrosomal matrix (GO:0043159), acrosomal lumen (GO:0043160), and acrosomal membrane (GO:0002080), highlighting its role in vesicle organization and exocytosis. This chart was created using EMBL’s European Bioinformatics Institute’s QuickGO tool.
Figure 11. Gene Ontology (GO) Hierarchical Annotation for ‘Acrosomal Vesicle’ (GO:0001669). This hierarchical chart depicts the Gene Ontology relationships for GO:0001669 (acrosomal vesicle), a cellular component critical for acrosome structure and function during fertilization. This chart traces the acrosomal vesicle as part of higher-level biological processes, such as vesicle fusion (GO:0099500), organelle membrane fusion (GO:0090174), and membrane fusion (GO:0061025). These components are further nested under broad categories like organelle organization (GO:0006996) and cellular anatomical structure (GO:0110165). The acrosomal vesicle is functionally linked to structures like the acrosomal matrix (GO:0043159), acrosomal lumen (GO:0043160), and acrosomal membrane (GO:0002080), highlighting its role in vesicle organization and exocytosis. This chart was created using EMBL’s European Bioinformatics Institute’s QuickGO tool.
Proteomes 13 00016 g011
Figure 12. Inter-PPI Network of Human Acrosomal Proteins with 0.15 Confidence. STRING-based analysis of the inter-set interactivity of 245 human acrosomal proteins at the low confidence interval (0.15) to ensure maximum inclusion. Interactions are based on experimental and predicted information. All proteins except for one (UniProt ID: A6NFA0) have determined interactions.
Figure 12. Inter-PPI Network of Human Acrosomal Proteins with 0.15 Confidence. STRING-based analysis of the inter-set interactivity of 245 human acrosomal proteins at the low confidence interval (0.15) to ensure maximum inclusion. Interactions are based on experimental and predicted information. All proteins except for one (UniProt ID: A6NFA0) have determined interactions.
Proteomes 13 00016 g012
Figure 13. Inter-PPI Networks of Human Acrosomal Proteins with 0.4 and 0.7 Confidence. STRING-based analysis of the inter-set interactivity of 245 human acrosomal proteins at the (A) medium confidence interval (0.4) and (B) high confidence interval (0.7).
Figure 13. Inter-PPI Networks of Human Acrosomal Proteins with 0.4 and 0.7 Confidence. STRING-based analysis of the inter-set interactivity of 245 human acrosomal proteins at the (A) medium confidence interval (0.4) and (B) high confidence interval (0.7).
Proteomes 13 00016 g013
Figure 14. Global PPI Network at a 0.9 Confidence Interval. STRING-based analysis of 245 human acrosomal proteins with the 500 highest interacting proteins. The highest confidence was selected to see the most accurate interactions.
Figure 14. Global PPI Network at a 0.9 Confidence Interval. STRING-based analysis of 245 human acrosomal proteins with the 500 highest interacting proteins. The highest confidence was selected to see the most accurate interactions.
Proteomes 13 00016 g014
Figure 15. Relationship between Average PPIDR and Transformed pLLPS. This scatter plot illustrates the relationship between the average PPIDR of three models (x-axis) and transformed pLLPS (y-axis). A fitted regression line with a 95% confidence interval (shaded region) shows a significant relationship. The model explains 64.9% of the variance (R2 = 0.649) and is highly significant (p-value < 0.0001).
Figure 15. Relationship between Average PPIDR and Transformed pLLPS. This scatter plot illustrates the relationship between the average PPIDR of three models (x-axis) and transformed pLLPS (y-axis). A fitted regression line with a 95% confidence interval (shaded region) shows a significant relationship. The model explains 64.9% of the variance (R2 = 0.649) and is highly significant (p-value < 0.0001).
Proteomes 13 00016 g015
Figure 16. Functional Disorder Analysis of Myristoylated Alanine-Rich C-kinase Substrate (MARCKS) UniProt ID: P29966. (A) Intrinsic disorder profile generated by RIDAO. (B) Functional disorder profile generated by D2P2. (C) PPI network of MARCKS at 0.4 confidence interval.
Figure 16. Functional Disorder Analysis of Myristoylated Alanine-Rich C-kinase Substrate (MARCKS) UniProt ID: P29966. (A) Intrinsic disorder profile generated by RIDAO. (B) Functional disorder profile generated by D2P2. (C) PPI network of MARCKS at 0.4 confidence interval.
Proteomes 13 00016 g016
Figure 17. Functional Disorder Analysis of Nuclear Transition Protein 2 (TNP2) UniProt ID: Q05952. (A) Intrinsic disorder profile generated by RIDAO. (B) Functional disorder profile generated by D2P2. (C) PPI network of TNP2 at a 0.4 confidence interval.
Figure 17. Functional Disorder Analysis of Nuclear Transition Protein 2 (TNP2) UniProt ID: Q05952. (A) Intrinsic disorder profile generated by RIDAO. (B) Functional disorder profile generated by D2P2. (C) PPI network of TNP2 at a 0.4 confidence interval.
Proteomes 13 00016 g017
Figure 18. Functional Disorder Analysis of Centrosomal Protein of 131 kDA (CEP131). UniProt ID: (Q9UPN4) (A) Intrinsic disorder profile generated by RIDAO. (B) Functional disorder profile generated by D2P2. (C) PPI network of CEP 131 at a 0.4 confidence interval.
Figure 18. Functional Disorder Analysis of Centrosomal Protein of 131 kDA (CEP131). UniProt ID: (Q9UPN4) (A) Intrinsic disorder profile generated by RIDAO. (B) Functional disorder profile generated by D2P2. (C) PPI network of CEP 131 at a 0.4 confidence interval.
Proteomes 13 00016 g018
Figure 19. Functional Disorder Analysis of Cylicin-1 (CYLC1); UniProt ID: P35663. (A) Intrinsic disorder profile generated by RIDAO. (B) Functional disorder profile generated by D2P2. (C) PPI network of CYLC1 at a 0.4 confidence interval.
Figure 19. Functional Disorder Analysis of Cylicin-1 (CYLC1); UniProt ID: P35663. (A) Intrinsic disorder profile generated by RIDAO. (B) Functional disorder profile generated by D2P2. (C) PPI network of CYLC1 at a 0.4 confidence interval.
Proteomes 13 00016 g019
Figure 20. Functional Disorder Analysis of Coiled-Coil Domain-Containing Protein 136 (CCDC136); UniProt ID: Q96JN2. (A) Intrinsic disorder profile generated by RIDAO. (B) Functional disorder profile generated by D2P2. (C) PPI network of CCDC136 at a 0.4 confidence interval.
Figure 20. Functional Disorder Analysis of Coiled-Coil Domain-Containing Protein 136 (CCDC136); UniProt ID: Q96JN2. (A) Intrinsic disorder profile generated by RIDAO. (B) Functional disorder profile generated by D2P2. (C) PPI network of CCDC136 at a 0.4 confidence interval.
Proteomes 13 00016 g020
Table 1. Summary Statistics for ADS and PPIDR Across PONDR® VXLT, PONDR® VSL2B, and PONDR® VL3 Models. This table presents the minimum, mean, median, and maximum values for ADS (average disorder score) and PPIDR (percent predicted intrinsically disordered residues) for three disorder prediction models: PONDR® VXLT, PONDR® VSL2B, and PONDR® VL3.
Table 1. Summary Statistics for ADS and PPIDR Across PONDR® VXLT, PONDR® VSL2B, and PONDR® VL3 Models. This table presents the minimum, mean, median, and maximum values for ADS (average disorder score) and PPIDR (percent predicted intrinsically disordered residues) for three disorder prediction models: PONDR® VXLT, PONDR® VSL2B, and PONDR® VL3.
PONDR® VXLTPONDR® VSL2BPONDR® VL3
ADSPPIDRADSPPIDRADSPPIDR
Minimum0.06020.790.17554.790.09950
Mean0.343532.090.458240.830.399933.37
Median0.317728.880.414134.040.369327.14
Maximum0.829590.960.99071000.9240100
Table 2. SCOP Superfamilies Affected by Disorder and Associated Proteins. This table lists SCOP superfamilies impacted by intrinsic disorder, including the total number of residues affected and the corresponding associated proteins. This highlights the overlap between structural disorder and SCOP domain regions.
Table 2. SCOP Superfamilies Affected by Disorder and Associated Proteins. This table lists SCOP superfamilies impacted by intrinsic disorder, including the total number of residues affected and the corresponding associated proteins. This highlights the overlap between structural disorder and SCOP domain regions.
SuperfamilyDisordered Residues (n)Proteins Affected (UniProt IDs)
ARM repeat17Q7Z745
Actin-like ATPase domain2Q9Y615
Ankyrin repeat23P46531
C-terminal domain of PLC-beta40Q9NQ66
C2 domain (Calcium/lipid-binding domain, CaLB)14O14795
CAD and PB1 domains15P19878
CATH1P17612
Calpain large subunit, middle domain (domain III)3Q9UMQ6
Concanavalin A-like lectins/glucanases40P27797, Q9NQ86
Cysteine proteinases13O75604
Cysteine-rich domain4Q9H0H5
Dimerization-anchoring domain of cAMP-dependent PK regulatory subunit4Q9BZX4
EF-hand1Q9NQ66
FAD-linked reductases, C-terminal domain1Q96RQ9
FAD/NAD(P)-binding domain3Q96RQ9
FYVE/PHD zinc finger36Q86UR5
Family A G protein-coupled receptor-like5P14416, P21731
Fibronectin type III23Q9Y2H6
Growth factor receptor domain6Q6UW60
HRDC-like17O75575
Immunoglobulin3P35613, Q92692
Integrin domains4P56199
MIR domain5Q9UKY4, Q9Y6A1
Neurotransmitter-gated ion-channel transmembrane pore40P23415, P48167
Nucleotide cyclase10O60266
Nucleotide-diphospho-sugar transferases12P15291
P-domain of calnexin/calreticulin70P27797
P-loop containing nucleoside triphosphate hydrolases9P20340
PDZ domain-like10Q86UR5
PLC-like phosphodiesterases111Q9BRC7, Q9NQ66
Phospholipase A2, PLA215Q9NZ20
Protein kinase-like (PK-like)31Q13976, Q9UQB9
Rhodanese/Cell cycle control phosphatase7P40818
SGNH hydrolase6Q6P1J6
SH3-domain24P19878
Sec1/munc18-like (SM) proteins17P61764
Serpins5P05154
Subtilisin-like11Q6UW60
Thioredoxin-like4Q8N4E4
Transcription factor IIA (TFIIA), β-barrel domain6Q9UNN4
Trypsin-like serine proteases11Q6UWB4
Tubulin C-terminal domain-like5Q9NY65
USP8 N-terminal domain-like20P40818
WD40 repeat-like10P43034
Ypt/Rab-GAP domain of gyp1p8Q96BZ9
alpha-catenin/vinculin-like10P35221
t-SNARE proteins73P32856, P61266, Q16623
Table 3. Enriched Gene Ontology (GO) Terms for Disorder-Affected Proteins. This table lists enriched GO terms across Molecular Function (GO:MF), Biological Process (GO:BP), and Cellular Component (GO:CC) categories. Each row includes the GO Term ID, name, and the adjusted p-value (p_adj), indicating the statistical significance of enrichment. Notably, terms such as ‘acrosomal vesicle’ (GO:0001669, p_adj = 5.395 × 10⁻38) highlight the functional relevance of intrinsic disorder in reproductive and cellular processes.
Table 3. Enriched Gene Ontology (GO) Terms for Disorder-Affected Proteins. This table lists enriched GO terms across Molecular Function (GO:MF), Biological Process (GO:BP), and Cellular Component (GO:CC) categories. Each row includes the GO Term ID, name, and the adjusted p-value (p_adj), indicating the statistical significance of enrichment. Notably, terms such as ‘acrosomal vesicle’ (GO:0001669, p_adj = 5.395 × 10⁻38) highlight the functional relevance of intrinsic disorder in reproductive and cellular processes.
SourceTerm IDTerm Namep_adj (Query)
GO:MFGO:0019899enzyme binding7.900 × 10−4
GO:0000149SNARE binding1.876 × 10−3
GO:0016934extracellularly glycine−gated chloride channel …1.430 × 10−2
GO:0043169cation binding4.442 × 10−2
GO:0005484SNAP receptor activity4.490 × 10−2
GO:0004620phospholipase activity4.815 × 10−2
GO:BPGO:0003006developmental process involved in reproduction3.127 × 10−11
GO:0010807regulation of synaptic vesicle priming1.386 × 10−6
GO:0065008regulation of biological quality9.343 × 10−6
GO:0099170postsynaptic modulation of chemical synaptic t …6.720 × 10−4
GO:0007405neuroblast proliferation1.993 × 10−3
GO:0048870cell motility5.806 × 10−3
GO:0000910cytokinesis7.271 × 10−3
GO:0045787positive regulation of cell cycle2.231 × 10−2
GO:0031175neuron projection development2.257 × 10−2
GO:1904100positive regulation of protein O−linked glycosyl …2.547 × 10−2
GO:0010560positive regulation of glycoprotein biosynthet …3.189 × 10−2
GO:CCGO:0001669acrosomal vesicle5.395 × 10−38
GO:0098590plasma membrane region1.175 × 10−6
GO:0042995cell projection1.406 × 10−6
GO:0005915zonula adherens3.998 × 10−2
Table 4. LLPS Propensity Summary. Summary of descriptive statistics for the LLPS propensity of proteins, including measures of central tendency (mean and median), variability (standard deviation), and distribution (minimum, maximum, and quartiles).
Table 4. LLPS Propensity Summary. Summary of descriptive statistics for the LLPS propensity of proteins, including measures of central tendency (mean and median), variability (standard deviation), and distribution (minimum, maximum, and quartiles).
StatisticLLPS Propensity Statistics
Count (N)250
Mean (x¯)0.451146
Std. Deviation (σ)0.33337186
Minimum (xmin)0.0922
First Quartile (Q1))0.166975
Median (Q2)0.28645
Third Quartile (Q3)0.7946
Maximum (xmax)1
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Shukla, S.; Lastorka, S.S.; Uversky, V.N. Intrinsic Disorder and Phase Separation Coordinate Exocytosis, Motility, and Chromatin Remodeling in the Human Acrosomal Proteome. Proteomes 2025, 13, 16. https://doi.org/10.3390/proteomes13020016

AMA Style

Shukla S, Lastorka SS, Uversky VN. Intrinsic Disorder and Phase Separation Coordinate Exocytosis, Motility, and Chromatin Remodeling in the Human Acrosomal Proteome. Proteomes. 2025; 13(2):16. https://doi.org/10.3390/proteomes13020016

Chicago/Turabian Style

Shukla, Shivam, Sean S. Lastorka, and Vladimir N. Uversky. 2025. "Intrinsic Disorder and Phase Separation Coordinate Exocytosis, Motility, and Chromatin Remodeling in the Human Acrosomal Proteome" Proteomes 13, no. 2: 16. https://doi.org/10.3390/proteomes13020016

APA Style

Shukla, S., Lastorka, S. S., & Uversky, V. N. (2025). Intrinsic Disorder and Phase Separation Coordinate Exocytosis, Motility, and Chromatin Remodeling in the Human Acrosomal Proteome. Proteomes, 13(2), 16. https://doi.org/10.3390/proteomes13020016

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop