Next Article in Journal
MASLD Under the Microscope: How microRNAs and Microbiota Shape Hepatic Metabolic Disease Progression
Previous Article in Journal
Comparison of Current International Guidelines for the Management of Alopecia Areata—Comprehensive Review
Previous Article in Special Issue
Acute Hyperglycemia-Induced Inflammation in MIO-M1 Cells: The Role of Aldose Reductase
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Theoretical Methods for Assessing the Density of Protein Nanodroplets

by
Midhun Mohan Anila
,
Michał Wojciechowski
,
Mateusz Chwastyk
and
Bartosz Różycki
*
Institute of Physics, Polish Academy of Sciences, Al. Lotników 32/46, 02-668 Warsaw, Poland
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2025, 26(17), 8631; https://doi.org/10.3390/ijms26178631
Submission received: 4 August 2025 / Revised: 1 September 2025 / Accepted: 2 September 2025 / Published: 4 September 2025
(This article belongs to the Collection Feature Paper Collection in Biochemistry)

Abstract

Many intrinsically disordered proteins (IDPs) are known to undergo liquid–liquid phase separation (LLPS), which is a physical process that drives the formation of biomolecular condensates and membraneless organelles in biological cells. Molecular dynamics (MD) simulations provide valuable tools to explore both the molecular mechanisms of LLPS and the physical properties of biomolecular condensates. However, a direct comparison of MD simulation results with phase diagrams obtained experimentally is normally prevented not only by the high computational costs of simulating large biomacromolecular systems on sufficient timescales but also by conceptual challenges. Specifically, there currently seems to be no standard or unambiguous method of defining and determining volumes occupied by coexisting phases at the nanoscale, with typically no more than a few hundred biomacromolecules in the simulation box. The goal of this work is to fill in this gap in the methodology. Focusing on α -synuclein as a model IDP, we test and compare three methods for determining the molecular density of protein nanodroplets, or clusters, generated in MD simulations or using other molecular modeling approaches. Two of the methods are based on approximating nanodroplets with homogeneous spheres and ellipsoids, respectively. The third method, which is expected to yield the most physically accurate results, is based on the SPACEBALL algorithm, with optimized, cluster-specific radii for volume probes. Our results contribute to the construction of accurate phase diagrams on the basis of MD simulations of IDP systems.

1. Introduction

Liquid–liquid phase separation (LLPS) is a fundamental biophysical process in which proteins and other biomolecules demix from a homogeneous solution to form liquid droplets [1]. This phenomenon plays a crucial role in cellular organization, enabling the formation of membraneless organelles such as stress granules, nucleoli, and P-bodies, which regulate various biological processes, including RNA metabolism, signal transduction, and protein homeostasis [2,3,4]. LLPS is driven primarily by multivalent interactions between intrinsically disordered regions and modular protein domains, and can be influenced by such physical factors as temperature, pH, salt concentration, and molecular crowding [5].
A crucial tool for understanding the conditions under which phase separation occurs, as well as the physical properties of the resulting phases, is the phase diagram [6]. A typical phase diagram for LLPS maps parameters such as temperature, protein concentration, and salt concentration, delineating regions where a homogeneous mixture exists and where phase separation leads to the formation of biomolecular condensates [7,8,9]. The binodal curve defines the boundary between the single-phase and two-phase regions, while the spinodal region represents conditions where phase separation occurs spontaneously via nucleation-independent mechanisms [10,11]. Understanding such phase diagrams is essential for deciphering cellular compartmentalization mechanisms and developing strategies to modulate phase behavior in disease contexts.
Several methods are used to construct phase diagrams, including experimental, theoretical, and computational approaches. Experimental techniques involve differential scanning calorimetry [12,13], X-ray diffraction [14], and optical or electron microscopy [15] to identify phase transitions and equilibrium states. Thermodynamic modeling, such as the CALPHAD (Calculation of Phase Diagrams) method [16,17,18], combines experimental data with thermodynamic equations to predict phase diagrams. Computational methods, including molecular dynamics (MD) [19] and density functional theory [20], can provide insights into phase stability at the molecular scale, even for complex biomacromolecular systems. However, constructing phase diagrams for biomacromolecular systems on the basis of MD simulations is a very difficult task, not only because of the high computational costs but also due to conceptual challenges related to finite-size effects. To the best of our knowledge, there is currently no standard or unambiguous method of defining and determining the volumes occupied by coexisting phases at the nanoscale with typically no more than a few hundred biomacromolecules in the simulation box. Our objective is to fill this gap in the methodology by presenting a well-defined and robust method of computing the volumes and densities of protein nanodroplets or clusters generated in MD simulations or by other molecular modeling techniques.
Here, we test and assess different methods for calculating the molecular density of nanodroplets, or clusters, formed by intrinsically disordered proteins (IDPs). For the purpose of this study, we use α -synuclein, which is a physiologically relevant and well-studied IDP known to undergo LLPS at certain conditions [21]. Specifically, α -synuclein is a 140-residue IDP abundantly expressed in the mammalian brain, where it functions primarily at the presynaptic terminal to regulate synaptic vesicle trafficking, neurotransmitter release, and lipid interactions [22,23]. Its chaperone-like activity in modulating SNARE complex assembly further underscores its central role in maintaining synaptic functions [22]. Under physiological conditions, α -synuclein remains conformationally dynamic and lacks a stable secondary structure; however, it adopts an α -helical conformation upon membrane binding, with residues 3-37 and 45-92 forming curved α -helices, as revealed by NMR structural studies [24]. The relevance of this dynamic behavior is reinforced by MD simulations, which consistently depict α -synuclein as highly disordered, exhibiting persistent secondary-structure fluctuations and minimal propensity for β -sheet formation in its monomeric form [25,26,27,28].
The LLPS of α -synuclein is primarily mediated by weak multivalent interactions involving the NAC domain and the acidic C-terminal region, and is proposed to be critical for normal cellular functions, including synaptic organization, while also serving as a potential precursor to pathological aggregation into amyloid fibrils [29,30,31]. Depending on experimental conditions (such as salt concentration, temperature, pH value, and polyethylene glycol (PEG) concentration), the LLPS of α -synuclein has been observed in a broad range of protein concentrations, between 15 and 500 μ M [31,32,33]. The concentration of α -synuclein within droplets has been found to range between 260 and 600 μ M in the presence of 10% PEG [34]. However, some publications report no LLPS of α -synuclein at protein concentrations of up to 500 μ M at pH 7.4 [35].
The amount of research on α -synuclein, as well as the physiological relevance of this protein, provide a rationale for selecting clusters of α -synuclein molecules—generated either in MD simulations or using other molecular modeling approaches—as model systems for developing and evaluating methods for protein cluster density determination. Among the methods evaluated in this study, the one that is expected to yield the most physically accurate results is based on the SPACEBALL algorithm, which was originally developed for geometrical analyses of protein structures with cavities [36]. Importantly, the original parametrization of SPACEBALL is suitable for computing cavity volumes in protein structures and not for determining densities of IDP clusters. Here, we present a robust approach to optimizing the SPACEBALL parameters for any given IDP cluster, which enables cluster density computation with satisfactory precision, which is a critical step toward constructing accurate phase diagrams.
Recently, a SPACEBALL-based approach has been introduced to construct a phase diagram of the van der Waals fluid [37]. This approach involves a combination of three key elements: (i) counting free molecules that are not bound to any cluster, (ii) computing the density of the largest cluster, and (iii) calculating the density of the entire system. The high-density branch of the binodal line is then determined from the number of free molecules not in contact with any cluster. In contrast, the low-density branch of the binodal line is obtained from the SPACEBALL-calculated densities. Specifically, the position of the low-density branch is identified by analyzing the system density at which fluctuations in the largest cluster density become apparent. Moreover, the spinodal line is determined by monitoring the system-wide density variations. The onset of strong fluctuations indicates the left spinodal boundary, while their disappearance marks the right spinodal branch. The main purpose of our present work is to adapt and re-parameterize the SPACEBALL-based method of density calculation to systems of IDPs, with the aim of applying this method in future studies to explore phase diagrams of IDPs.
Our SPACEBALL-based method of computing protein nanodroplet density is clearly distinct from the slab method, which is commonly used in the MD community to determine the densities of two coexisting phases in polymer or IDP systems [38,39]. In the framework of the slab method, MD simulations are performed using a cuboid box of side lengths L x , L y and L z , where L z is taken to be much larger than L x and L y . The polymers or IDPs under study are prearranged into a slab perpendicular to the z axis and spanning through the simulation box in the x and y directions. Due to the periodic boundary conditions applied in all three directions of the simulation box, the slab is typically stable during the MD simulations. After equilibration, the slab is taken as the condensed phase, whereas the IDPs in the remaining part of the simulation box represent the dilute phase. The condensed phase density, as well as the dilute phase density, is obtained from the time-averaged density profile along the z-axis. These densities can then be used to construct the phase diagram.
Although the slab method is useful and relatively simple to implement, it suffers from a few drawbacks. Firstly, there is no formula or protocol for selecting the box side lengths: L x , L y , and L z . However, the choice of the box side lengths is relevant as it determines both the time of equilibration and the thickness of the slab at equilibrium, and thus can influence the estimation of the densities of the two coexisting phases. Secondly, it is assumed that the slab remains perpendicular to the z axis throughout the simulation. However, thermal fluctuations can cause bending or shear deformations in the slab during the simulation, thus affecting the time-averaged density profile along the z axis, which can lead to errors in the estimation of the densities of the two phases. Thirdly, when density fluctuations are large (e.g., in the vicinity of the critical point), the slab interface is rough and cavities can form spontaneously inside the slab, giving rise to significant uncertainties of the time-averaged density profile along the z axis. On the other hand, none of these issues pertain to our SPACEBALL-based method, simply because our method does not impose any restrictions on the shape of IDP nanodroplets condensates. Thus, our SPACEBALL-based method seems to be more robust, geometry-independent, and less sensitive to interface fluctuations than the slab method.

2. Results

We performed coarse-grained MD simulations of fifty molecules of α -synuclein at three temperatures, as described in Section 4.1. Then, we used a contact-based method to identify α -synuclein clusters in the MD trajectories, and selected two sets of geometrically diverse clusters for further analysis, as described in Section 4.2. Additionally, as described in Section 4.3, we introduced three groups of clusters with well-defined volumes and shapes (namely, spheres, ellipsoids, and cylinders) for assessments of density determination methods. The spherical, ellipsoidal, and cylindrical clusters were constructed using Packmol software [40] and provided reference density values against which the calculation results could be validated.
In this section, we discuss three methods for density calculation. Each of the methods is introduced, applied, and discussed in a separate subsection below. The first two of the methods are based on approximating molecular clusters by homogeneous spheres and ellipsoids, respectively, as detailed in Section 2.1 and Section 2.2. These methods are rather conceptually simple and computationally fast. The third method is computationally more demanding and time-consuming as it employs the SPACEBALL software [28,36]. The method boils down to identifying optimal parameters for SPACEBALL computations of protein cluster volumes, as discussed in Section 2.3.

2.1. Spherical Approximation

The radius of gyration, denoted herein by R g , is often used in polymer physics to describe the dimensions of polymer chains. It is straightforward to compute the radius of gyration for each of the protein clusters introduced in Section 4.2 and Section 4.3. The resulting R g values can then be used to estimate the cluster density. Firstly, the shape of a given cluster is approximated by a sphere whose mass distribution is uniform and has a gyration radius equal to the R g value of the cluster, as illustrated in Figure 1. The radius of such a sphere is R = R g 5 / 3 1.291 R g and its volume is V S = 4 π R 3 / 3 9.013 R g 3 . Then, the density of the cluster is taken as d = N / V S 0.111 N / R g 3 , where N denotes the total number of residues forming the cluster and R g is the radius of gyration of the cluster.
For the purpose of this study, we compare the densities obtained for the spherical, cylindrical, and ellipsoidal clusters under the spherical approximation, d, with their respective theoretical values, d theory , introduced in Section 4.3. Specifically, Figure 2 shows the density ratio d / d theory  versus the number n of chains in a given cluster of the spherical, ellipsoidal, or cylindrical shape. Since the spherical approximation is based on the assumption of uniform mass distribution, it overestimates the densities of spherical clusters in which protein chains are distributed randomly but not entirely homogeneously. On the other hand, the spherical approximation does not capture the elongated shape of the cylindrical clusters, underestimating their densities. These two effects cancel out to some degree for the ellipsoidal clusters, for which d is in agreement with d theory within less than about 20%.
The discrepancies between the theoretical and computed densities, as seen in Figure 2, arise because the radius of gyration represents the mass-weighted average distance from the center of mass, rather than the true spatial extent of the cluster. As a result, the approximated spherical volume does not capture the actual volume occupied by the cluster, particularly in the case of highly irregular or loosely connected structures. Consequently, the density calculations based on this approach do not accurately reflect the physical extension of the clusters, as illustrated in Figure 1, where an α -synuclein cluster (shown in green) is approximated by a sphere (shown in red) whose radius of gyration equals the cluster’s radius of gyration. The spherical approximation fails to encompass several regions of the cluster that extend beyond the spherical boundary. This example underscores the inability of this method to capture shape anisotropy and non-uniform mass distribution, rendering it unreliable for estimating the density of irregular-shaped clusters. In addition, cavities or voids between chains or sub-clusters are not taken into account under the spherical approximation, which can result in an underestimation of the cluster volume and, consequently, an overestimation of the density. Nevertheless, the densities obtained under the spherical approximation can provide information about the compactness and packing of protein chains within the clusters.

2.2. Ellipsoidal Approximation

The spherical approximation, based on the concept of the radius of gyration, is not well-suited for protein clusters with non-globular or irregular shapes, as indicated in the previous subsection. To address this limitation, we now employ an alternative method [41] based on the tensor of inertia, which offers a compact representation of the mass distribution relative to rotational axes. Although the tensor of inertia can be computed with respect to any point in space, it is most commonly calculated around the center of mass. Then, for a system of N point masses, the tensor of inertia is defined as
I = I x x I x y I x z I y x I y y I y z I z x I z y I z z
with diagonal elements, representing the moments of inertia about the coordinate axes, given by
I x x = k = 1 N ( y k 2 + z k 2 ) , I y y = k = 1 N ( x k 2 + z k 2 ) , I z z = k = 1 N ( x k 2 + y k 2 )
and off-diagonal elements, also called the products of inertia, computed as
I x y = I y x = k = 1 N x k y k , I x z = I z x = k = 1 N x k z k , I y z = I z y = k = 1 N y k z k
The tensor of inertia can be computed using Equations (1)–(3) and then diagonalized for each of the protein clusters introduced in Section 4.2 and Section 4.3. The resulting eigenvalues λ x , λ y and λ z determine the principal moments of inertia, corresponding to the axes along which the cluster has no products of inertia. An ellipsoid with a uniform mass distribution and the principal moments of inertia equal λ x , λ y , and λ z has the semi-axes lengths R x = 5 ( λ y + λ z λ x ) / ( 2 N ) , R y = 5 ( λ x + λ z λ y ) / ( 2 N ) and R z = 5 ( λ x + λ y λ z ) / ( 2 N ) [42]. Here, the cluster shape is approximated by such an ellipsoid with volume V = 4 π R x R y R z / 3 , as illustrated in Figure 3. Therefore, the cluster density is estimated as d = 3 N / ( 4 π R x R y R z ) , where N is the total number of residues in the cluster.
The accuracy of the density estimation depends sensitively on how well the ellipsoidal model captures the actual shape of a given cluster. Furthermore, under the ellipsoidal approximation, internal cavities or voids between chains or sub-clusters are not taken into account, which can result in an underestimation of the cluster volume and, consequently, an overestimation of the density. Nevertheless, the densities obtained within the ellipsoidal model provide a measure of how compactly the residues are packed within the clusters.
The data shown in Figure 4 provide a comparison of the theoretical densities of the spherical, ellipsoidal and cylindrical clusters, d theory , with the corresponding densities obtained under the ellipsoidal approximation, d. Specifically, Figure 4 shows the density ratio d / d theory  versus the number n of chains per cluster. As one would expect, the ellipsoidal and spherical approaches yield identical results for each of the spherical clusters. Moreover, the ellipsoidal approximation is seen to systematically overestimate the density of clusters of all shapes.
Although the ellipsoidal method shares many of the limitations of the spherical approach, it is expected to yield more accurate density estimates for elongated or anisotropic clusters, as their shapes are generally better approximated by ellipsoids. Major limitations of this method are illustrated in Figure 3, which shows an α -synuclein cluster (green) enclosed within an ellipsoid (red) whose semi-axes correspond to the principal dimensions derived from the moment of inertia of the cluster. As can be seen in Figure 3, the ellipsoid fails to fully capture the actual volume of the cluster, particularly in regions with a non-uniform mass distribution or internal cavities. This geometric mismatch contributes to the observed overestimation of the computed densities, as seen in Figure 4.

2.3. SPACEBALL

The density of protein clusters can also be computed using the SPACEBALL software [36] (version 2.0). Although the SPACEBALL calculation is computationally more demanding than the approximations presented in Section 2.1 and Section 2.2, it is expected to yield the most accurate results for protein clusters with irregular shapes.
The SPACEBALL algorithm combines grid-scanning and sphere-filling techniques [36]. The volume calculation starts with enclosing a given protein cluster by a cuboid box. Next, a cubic grid with the lattice constant a is generated within this box. The default value of the lattice constant, a = 0.3 nm, roughly corresponds to the water molecule diameter. Next, spherical probes are placed at the lattice points that delineate the walls of the box. Then, the walls are moved inward along their normal directions—corresponding to the Cartesian axes—in discrete steps equal to the lattice constant a. As the probes move inward, each sphere is checked for overlap with protein atoms. If a probe overlaps with any atom of the protein cluster, its movement is stopped, while other probes on the same wall continue moving until they also encounter an overlap with protein atoms. The positions of non-overlapping probes are recorded on the lattice. When a probe overlaps with at least one atom of the protein cluster, its position is marked differently. This marking procedure leads to a simple classification of the lattice points: those with no contact are considered outside the protein, while those that encounter the protein cluster define its surface. In addition, the unmarked lattice points within the protein-cluster boundary define its interior volume.
A key parameter of the SPACEBALL method is the radius r probe of the wall probes. This parameter governs the resolution of surface detection and, consequently, the accuracy of the volume estimate. Importantly, the optimal radius of the wall probes depends on the shape and density of the cluster under study. Namely, if the probe radius is too small, the probes may penetrate the cluster interior, creating artificial cavities and underestimating the cluster volume. Conversely, if the probe radius is too large, it may overestimate the cluster volume by including artificial ‘pockets’ on the cluster surface, thereby artificially enlarging the cluster. Therefore, optimizing the probe radius is critical for the accuracy of the volume calculation method. Here, we introduce a general and robust approach to identify an optimal probe radius based on the geometric properties of the cluster.
Our approach for selecting the optimal probe radius is based on identifying protein chains that are located on the cluster surface, as illustrated in Figure 5. Firstly, for each of the chains in a given cluster, the mean distance of its atoms from the geometrical center of the cluster is calculated. The chains are then sorted based on these mean distances, and a set of most peripheral chains is selected as surface chains (shown in Figure 5 in red). The remaining chains (shown in Figure 5 in blue) are assumed to form the interior of the cluster. The surface chains constitute a certain fraction ϕ of all chains in the cluster. Next, the geometrical center G j of each of the surface chains is determined. Then, each of the surface chains is assigned its nearest neighbor chain based on minimal distances between the chain centers G j . Finally, the average distance between the geometrical centers of the surface chains and their nearest neighbors is taken as a pre-optimal radius of the wall probes, which we denote herein as a surface radius r probe S . A Python script computing r probe S for a given protein cluster is deposited on the GitHub platform; see https://github.com/midhunanila/SPACEBALL_DENSITY (accessed on 2 September 2025).
The surface radius r probe S is specific for a given cluster and depends on the fraction ϕ of chains identified as the surface chains. Figure 6 shows how r probe S varies with ϕ for the spherical, ellipsoidal, and cylindrical clusters with 100, 200, 400, and 600 chains. Importantly, the surface radius is seen to be systematically larger for the loose clusters with 100 chains (data points in blue) than for the compact clusters with 600 chains (data points in red), independent of the surface chain fraction ϕ or the cluster shape. As a matter of fact, r probe S is found to monotonically decrease with an increase in the number of chains, meaning that the sparser the cluster is, the larger the selected probe radius in order to avoid penetration of the wall probes into the cluster interior. This observation is consistent with the method for identifying the surface radius of the wall probes, as described in the paragraph above.
Another relevant piece of information that follows from the results presented in Figure 6 is that the surface radius does not change significantly as the surface chain fraction ϕ varies between 0.3 and 0.5. This observation implies that the SPACEBALL calculation results should be practically unaffected by the exact value of ϕ , as long as it is in the range between 0.3 and 0.5. Therefore, in the following, we take the optimal radius r probe O of the wall probes as the surface radius r probe S at ϕ = 0.5 , i.e., r probe O = r probe S ( ϕ = 0.5 ) . Python scripts computing the optimal probe radius for a given protein cluster are deposited on the GitHub platform; see https://github.com/midhunanila/SPACEBALL_DENSITY (accessed on 2 September 2025).
Table 1 contains the optimal values of the wall probe radius, r probe O = r probe S ( ϕ = 0.5 ) , for the spherical, ellipsoidal, and cylindrical clusters. For each of these shapes, the values of r probe O are seen to decrease with an increase in the number n of chains in the cluster, which means that the denser the cluster is, the smaller the optimal probe radius required to properly delineate the cluster surface. For n = 500 and n = 600 , respectively, r probe O 3.9 nm and r probe O 3.8 nm, independent of the cluster shape, indicating that the cluster density is the major factor determining the optimal probe radius in the case of sufficiently dense clusters.
Table 1 also contains the densities of the spherical, ellipsoidal, and cylindrical clusters computed using SPACEBALL with the specified values of r probe O , as well as the corresponding theoretical densities d theory S , d theory E , and d theory C . The theoretical and computed densities are also compared in Figure 7. The SPACEBALL calculation results compare much better with the theoretical densities than the densities computed under the spherical and ellipsoidal approximations.
Figure 8 shows snapshots of the spherical, ellipsoidal and cylindrical clusters with 100, 200, 400, and 600 chains. The α -synuclein chains are depicted in green, whereas envelopes of these clusters—as determined by SPACEBALL with the optimal probe radii listed in Table 1—are marked in red. For clarity, the envelopes are shown as semi-transparent surfaces in the top panels, whereas the bottom panels show cross-sections through the clusters with opaque envelopes. These snapshots together demonstrate that, firstly, the shape of each of the clusters is clearly and precisely delineated by the SPACEBALL envelopes and, secondly, the envelopes comprise no artificial cavities or pockets. If the selected probe radii were smaller, the wall probes would likely penetrate the cluster interior, creating artificial cavities, and if the selected probe radii were larger, the envelopes would contain artificial pockets, thereby artificially enlarging the computed volumes. Thus, our method for identifying the optimal radius of the wall probes for the SPACEBALL calculation leads to meaningful shapes and volumes for the clusters.
Another way to test our method for selecting the optimal probe radius is to run SPACEBALL in series with different probe radius values and compare the resulting densities d of the spherical, ellipsoidal, and cylindrical clusters with the theoretical densities d theory S , d theory E , and d theory C of these clusters. Figure 9 shows the density rations d / d theory S , d / d theory E , and d / d theory C  versus the probe radius, r probe , for the three sets of clusters. The lines in blue, yellow, green, and red correspond to the clusters comprising 100, 200, 400, and 600 chains. The cross symbols (×) indicate the optimal values of the wall probes, r probe O , presented in Table 1. As can be seen in Figure 9, the values of r probe O compare well with the corresponding radii of the wall probes at which the computed and theoretical densities are equal, d = d theory S , d = d theory E , and d = d theory C , which further confirms that our choice of optimal probe radii is reasonable.
Another important conclusion that follows from the results shown in Figure 9 is that small variations in the probe radii around the r probe O values cause only marginal changes in the computed densities of the clusters. This means that even if the optimal probe radii listed in Table 1 are not perfectly accurate, the densities of the clusters are expected to be captured reasonably well using the SPACEBALL method.
Although the SPACEBALL calculations are computationally more demanding and time-consuming than the calculations based on the approximate methods introduced in Section 2.1 and Section 2.2, they yield results superior to those obtained under the spherical and ellipsoidal approximations. The advantage of the SPACEBALL method over the approximate methods is illustrated in Figure 10, where the ratios of the computed and theoretical densities, d / d theory S , d / d theory E , and d / d theory C , are shown for the spherical, ellipsoidal, and cylindrical clusters with 100, 200, 300, 400, 500, and 600 chains. Clearly, in all of the cases studied here, the densities obtained using SPACEBALL with the optimal probe radius r probe O are closer to the theoretical densities than the densities estimated within the spherical and ellipsoidal approximations.

3. Discussion

Having optimized the SPACEBALL-based method with protein clusters of well-defined shapes, as generated using Packmol software (see Section 4.3), we now apply this method to the α -synuclein clusters obtained from the coarse-grained MD simulations (see Section 4.2). The clusters selected for the density calculation are quite diverse and have rather irregular shapes. In the clusters called ‘large cluster A’ and ‘large cluster B’, obtained for a high temperature, the α -synuclein chains are assembled rather loosely. In the clusters called ‘small cluster A’ and ‘small cluster B’, obtained for a low temperature, the chains are packed more densely. And the clusters called ‘medium cluster A’ and ‘medium cluster B’, obtained for a medium temperature, have an intermediate degree of chain packing.
To compute the density of each of the clusters using SPACEBALL, the optimal radius r probe O of the wall probes needs to be selected. Figure 11 shows how the surface radius r probe S of the wall probes varies with the surface chain fraction ϕ for the small, medium, and large clusters of group A and B. Importantly, for any of the six clusters, r probe S is observed not to change significantly as ϕ is varied between 0.3 and 0.5. This observation implies that the SPACEBALL calculation results should be robust and practically unaffected by the exact value of ϕ , as long as it is in the range between 0.3 and 0.5. Therefore, in the following, we use ϕ = 0.5 for the selection of the optimal radius of the wall probes for the SPACEBALL calculation. In other words, we take r probe O = r probe S ( ϕ = 0.5 ) .
Table 2 lists the values of r probe O for the small, medium, and large clusters of group A and B. These optimal probe radii range from about 3.9 to 4.9 nm and, thus, are comparable to or even larger than the radius of gyration of the single chains of α -synuclein forming the clusters. This observation suggests that all of these clusters are rather loose—at least on their surface—and contain space for α -synuclein chains to diffuse and change conformation due to thermal fluctuations.
Table 2 also provides the densities of the small, medium, and large clusters, computed using SPACEBALL with the specified values of r probe O . The corresponding envelopes of these clusters are presented in Figure 12. In panels A and C, the cluster envelopes are shown as semi-transparent surfaces. For comparison, panels B and D show cross-sections through the clusters with opaque envelopes. Each of these envelopes is seen to comprise no artificial overhangs, cavities or pockets. Moreover, the cluster shapes are captured well by the envelopes generated by SPACEBALL with the optimal probe radii.
Interestingly, the computed densities of the small, medium, and large clusters do not differ much, with all falling in a range between 0.22 and 0.31 res/ nm 3 . The small clusters, generated in the MD simulations at the lowest temperature, were found to have a density of about 0.3 res/ nm 3 . On the other hand, the large and medium clusters were found to have a lower density of about 0.25 res/ nm 3 on average. These results indicate that the densities of the simulation-derived clusters originate from inter- and intra-molecular interactions captured in the coarse-grained MD simulations. These results should thus provide a reliable estimate for the density of the condensed phase of α -synuclein.
It should be noted that even if the optimal probe radii listed in Table 2 are not perfectly accurate, the densities of the clusters are expected to be captured reasonably well using the SPACEBALL-based method. This conclusion can be drawn from the data presented in Figure 13, where the density of each of the clusters is shown as a function of the radius r probe of the wall probes used in the SPACEBALL calculation. The optimal probe radii from Table 2 are marked as cross symbols (×). It can be seen from the curves in Figure 13 that small variations in the probe radii around r probe O cause only small changes in the cluster density. In other words, if the optimal probe radii were changed by, say, 20%, the computed densities of the clusters would be affected only to a small degree.
The three methods of cluster density calculation are compared in Figure 14, which displays the density of each of the small, medium and large clusters. The bars in blue, orange, and green correspond to the spherical, ellipsoidal, and SPACEBALL-based methods. Notably, the spherical and ellipsoidal approximations yield densities larger than those determined by SPACEBALL, especially for the small clusters with a large degree of chain packing. This result is understandable because the approximated spherical and ellipsoidal volumes do not capture the actual volumes of the simulation-derived clusters with irregular shapes and non-uniform chain distributions, which can be clearly seen in Figure 1 and Figure 3, where a medium-size cluster from the coarse-grained MD simulations is approximated by a uniform sphere and a uniform ellipsoid, respectively. In these examples, both the sphere and the ellipsoid fail to encompass certain regions of the cluster, underestimating the actual volume occupied by the cluster and, thus, overestimating its density.

4. Materials and Methods

4.1. MD Simulations

To generate protein nanodroplets, or clusters, for testing methods for density calculations, we performed coarse-grained MD simulations of fifty chains of α -synuclein. The initial conformations of these chains were obtained from all-atom implicit-solvent MD simulations of a single molecule of α -synuclein, which were conducted using the generalized Born (GB) model with the NAMD software [43]. The electrostatic screening length corresponding to a monovalent ion concentration of 0.3 M was used to reduce computational costs without significantly compromising solvation effects [44]. An NMR structure of micelle-bound α -synuclein (PDB ID: 1XQ8 [24]) was used as input. The system was subjected to energy minimization to remove steric clashes and stabilize the structure prior to equilibration. This was followed by 1 ns of equilibration at a temperature of 298 K, maintained through a Langevin thermostat with a damping coefficient of 1 ps 1 . The simulation employed the CHARMM36 force field. Non-bonded interactions were handled using a cutoff of 1.4 nm, a switching distance of 1.3 nm, and a pair list distance of 1.6 nm to ensure accurate treatment of long-range forces. A time step of 2 fs was used. Rigid bonds were applied to hydrogen atoms to maintain numerical stability. Following the equilibration, a production run was carried out for a total simulation time of 3 μ s. From this all-atom implicit-solvent MD trajectory, the fifty most extended conformations were selected for subsequent coarse-grained MD simulations.
The coarse-grained MD simulations were conducted using the dynamic structure-based (DSB) model with a non-radial multi-body pseudo-improper-dihedral potential and implicit solvent [45]. In the DSB, amino acid residues are represented by single beads. The dynamics of the beads are derived using the Langevin equation. Temperature is expressed in units of ϵ / k B , where k B is the Boltzmann constant and ϵ represents the energy scale, which is in the order of 1.5 kcal/mol. The time unit τ 1 ns corresponds to 200 simulation steps.
We used the P F M J 0.1 C variant of the DSB model with the force-field parameters and the damping coefficient, as introduced by Mioduszewski et al. [45]. Fifty chains of α -synuclein, obtained from the all-atom implicit-solvent MD simulations with the GB model, were placed randomly inside a cubic box using the Packmol software [40], as shown in the left panel of Figure 15. The initial density of the system was smaller than 0.001 beads per nm 3 , ensuring that each of the chains did not come into contact with any other. The system was pre-equilibrated for 100 τ , and then the simulation box was compressed uniformly in all three directions at a rate of 0.002 nm/ τ until a target box size was achieved. Next, the fifty chains of α -synuclein were simulated at constant volume in a cubic box of side length L with periodic boundary conditions. The simulations were carried out with three different box lengths L and temperatures T over a sub-millisecond timescale, as specified in Table 3. The right panel of Figure 15 shows a snapshot from the simulation with L = 20.6 nm taken at the time instance t = 70 μ s.

4.2. Cluster Analysis of the Coarse-Grained MD Trajectories

Each of the coarse-grained simulations provided a set of configurations of the system comprising fifty chains of α -synuclein. In order to select clusters of α -synuclein chains for subsequent density calculations, the periodic boundaries used in the MD simulations were unwrapped and agglomerative clustering was applied to each of the configurations obtained from the coarse-grained MD simulations. At the beginning of the clustering procedure, each of the α -synuclein chains in a given configuration was taken as a single cluster. Then the clusters were merged successively. Two clusters were merged if any chain from one cluster was identified to make at least one inter-molecular contact with any chain from the other cluster. The clustering procedure ended when no more clusters could be merged in a given configuration. The agglomerative clustering was applied successively to each of the configurations individually, typically identifying several clusters of different sizes and a number of individual, unassociated chains.
An important factor in the clustering algorithm is the criterion for inter-molecular contacts. Here, we applied a simple distance criterion with the effective radii of amino acid residues used previously in one-bead-per-residue models of intrinsically disordered proteins [46,47]. Namely, we assumed that two beads with indices i and j and the effective radii σ i and σ j were in contact if their distance was smaller than a cut-off distance d cut off = 0.75 ( σ i + σ j ) [48].
The clustering of the configurations obtained from the coarse-grained MD simulations allowed us to identify a huge number of α -synuclein clusters of various sizes and shapes. Typically, clusters formed in the MD simulations with low temperatures ( T = 0.45   ϵ / k B or T = 0.5   ϵ / k B ) appeared to be somewhat compact, whereas clusters obtained from the MD simulations with the high temperature ( T = 1.2   ϵ / k B ) were found to be rather loose and irregular in shape.
To proceed with testing different methods for density calculation at the supra-molecular level, we randomly selected several distinct clusters with different shapes and volumes, each comprising fifty chains of α -synuclein. The spatial dimensions of the selected clusters were quantified by their radius of gyration, R g , and their maximum diameter D max . Specifically, from the MD simulations at a high temperature ( T = 1.2   ϵ / k B ) we selected two clusters (denoted herein as ‘large cluster A’ and ‘large cluster B’) of comparable spatial dimensions and of very different shapes (see Table 4 and Figure 16). Next, from the MD simulations at the intermediate temperature ( T = 0.5   ϵ / k B ), we selected two clusters (denoted as ’medium cluster A’ and ’medium cluster B’) with almost identical R g and D max values and with clearly different shapes (see Table 4 and Figure 16). Finally, from the MD simulations at the low temperature ( T = 0.45   ϵ / k B ) we selected two clusters (denoted as ’small cluster A’ and ’small cluster B’) that had similar spatial dimensions and rather distinct molecular arrangements (see Table 4 and Figure 16). As quantified by their R g and D max values, the large clusters were more extended than the medium-size clusters, whereas the small clusters were more compact than the medium-size clusters (see Table 4). With this selection of diverse clusters, our goal is a comprehensive examination of three methods for density calculation of protein clusters.

4.3. Generation of Spherical, Ellipsoidal, and Cylindrical Clusters

In principle, the aforementioned goal seems rather straightforward to achieve. However, a major obstacle lies in the absence of reference density values against which the calculated results can be validated. Another difficulty arises from the highly irregular shapes of the α -synuclein clusters obtained from the coarse-grained MD simulations, as shown in Figure 16, for which no unambiguous method of density calculation currently exists; the development of such a method is actually the main objective of this work. Consequently, comparing different density calculation approaches with only the α -synuclein clusters identified in the coarse-grained MD simulations would have a rather limited significance.
To address these issues, we generated clusters with well-defined shapes and volumes, as shown in Figure 17. Firstly, we determined the average radius of gyration of the α -synuclein structures obtained from the all-atom MD simulations. Then, we selected an α -synuclein structure whose radius of gyration was closest to the average value. Next, we used the Packmol software [40] to distribute multiple copies of the selected structure within three geometric shapes—a sphere, an ellipsoid, and a cylinder—whose volumes could be calculated using standard geometric formulas. As a result, we obtained systems consisting of 100, 200, 300, 400, 500, and 600 copies of the selected structure with a minimum interatomic distance of 0.2 nm, preventing molecular clashes or overlaps. The generated sphere had a radius of R S = 35 nm. The ellipsoid had principal semi-axes lengths of R x = 25 nm, R y = 35 nm and R z = 50 nm along the x-, y-, and z-axes, respectively. The cylinder was aligned along the x-axis and had a radius of R C = 25 nm and a height of H C = 93 nm. The sphere, the ellipsoid, and the cylinder thus had comparable volumes. With N denoting the total number of residues forming the cluster, the theoretical density for the spherical clusters was determined using d theory S = 3 N / ( 4 π R S 3 ) . For the cylindrical clusters, the theoretical density was calculated as d theory C = N / ( π R C 2 H ) . For the ellipsoidal clusters, the theoretical density was determined using d theory E = 3 N / ( 4 π R x R y R z ) . We used these systems as a reference for testing density calculation methods, as described in the Results.

5. Conclusions

Liquid droplets of IDPs observed in microscopy experiments are spherical and at least several micrometers in diameter [49]. In contrast, protein nonodroplets in MD simulations are at least two orders of magnitude smaller and comprise, at most, a few hundred IDPs [50]. As a result, such IDP nanodroplets have irregular shapes and ambiguous boundaries; thus, their densities are challenging to compute and compare with phase diagrams that are determined experimentally.
We implemented and tested three methods for computing the densities of protein nanodroplets, or clusters, obtained from MD simulations or other molecular modeling techniques. Two of them are based on simple geometrical considerations, as discussed in Section 2.1 and Section 2.2, and involve no adjustable parameters. Our tests indicate that these methods only perform well for homogeneous and globular clusters, whose shapes can be reasonably approximated by spheres or ellipsoids, and fail to correctly capture the volume of clusters with irregular shapes containing protrusions, cavities, or overhangs. In addition, their underlying assumption of uniform mass distribution within clusters leads to density overestimations, as can be seen in Figure 2 and Figure 4. Nevertheless, these methods can be useful in some applications because of their computational efficiency. The third method is based on the SPACEBALL algorithm, which is computationally more demanding and time-consuming than the other methods, but is capable of accurately determining the volumes of clusters of practically any shape, as discussed in Section 2.3. A central parameter of the SPACEBALL algorithm, which governs the resolution of surface and volume detection, is the radius of volume probes. We devised a robust way of optimizing the probe radius for any given protein cluster. The optimization is based on the spatial distribution of protein chains on the cluster surface, and leads to satisfactory accuracy in density calculations using SPACEBALL, as can be seen in Figure 7 and Figure 10. Python scripts for optimizing the probe radius for any given protein nanodroplet, as well as input parameters for the subsequent calculations of the nanodroplet density using SPACEBALL, are deposited on the GitHub platform; see https://github.com/midhunanila/SPACEBALL_DENSITY (accessed on 2 September 2025). It is worth noting at this point that our procedure for the probe radius optimization may fail or lead to unsatisfactory results in cases of very small clusters, when distinguishing between protein chains on the cluster surface and inside the cluster is impossible. Nevertheless, the SPACEBALL-based method yields generally superior results to those obtained under the spherical and ellipsoidal approximations.
The α -synuclein clusters obtained from the coarse-grained MD simulations with different temperatures differ in both their morphology and packing degree, as illustrated in Figure 16 and Figure 12. The SPACEBALL-based analysis presented in Section 3 revealed, however, that the densities of these clusters fall in a rather narrow range between 0.22 and 0.31 res/ nm 3 . The small clusters, generated in the MD simulations at lower temperatures, are found to have a density of about 0.3 res/ nm 3 . The larger clusters, generated in the MD simulations at higher temperatures, are found to have a lower density of about 0.25 res/ nm 3 on average. Importantly, it should be noted that these densities are much lower than the average density of a folded protein, which is estimated to be about 7 res/ nm 3 [51]. Therefore, even the α -synuclein clusters produced at low temperatures are rather loose and comprise substantial amounts of solvent.
Here, we used the coarse-grained DSB model to generate clusters of α -synuclein chains. It should be noted, however, that the methods of computing nanodroplet density discussed in Section 2.1, Section 2.2 and Section 2.3 are independent of the simulation models or force fields used to generate the nanodroplet structures. Our density calculation methods are applicable to protein nanodroplets obtained from all-atom or coarse-grained MD simulations with any force field or using any other molecular modeling technique.
Recently, the LLPS of α -synuclein has been studied in MD simulations employing the Martini force field [41]. Fifty chains of α -synuclein were simulated at different conditions in a cubic box with a side length ranging from about 48 to 66 nm. The concentration of α -synuclein within nanodroplets was determined using a variant of the ellipsoidal approximation. The average concentration of α -synuclein in the simulated nanodroplets was found to be about 30 mM, which is an order of magnitude higher than the ∼0.25 res/ nm 3 (corresponding to the α -synuclein concentration of about 3 mM) that we report in Table 2. This comparison indicates that our estimation of α -synuclein concentration in nanodroplets, i.e., about 3 mM, may be closer to the experimentally determined values [34] than the concentration of about 30 mM obtained from the Martini simulations [41].

Author Contributions

Conceptualization: M.M.A., M.W., M.C. and B.R.; Data Curation: M.M.A.; Formal Analysis: M.M.A., M.W., M.C. and B.R.; Funding Acquisition: M.C. and B.R.; Investigation: M.M.A., M.W., M.C. and B.R.; Methodology: M.M.A., M.W., M.C. and B.R.; Project Administration: M.C. and B.R.; Resources: M.C. and B.R.; Software: M.M.A., M.W. and M.C.; Supervision: M.C. and B.R.; Validation: M.M.A., M.W. and B.R.; Visualization: M.M.A.; Writing—Original Draft Preparation: M.M.A., M.W., M.C. and B.R.; Writing—Review and Editing: B.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research has received support from the National Science Center, Poland, under grants number 2018/31/B/NZ1/00047 (to M.C.) and 2020/39/B/NZ1/00377 (to B.R.).

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Boeynaems, S.; Alberti, S.; Fawzi, N.L.; Mittag, T.; Polymenidou, M.; Rousseau, F.; Schymkowitz, J.; Shorter, J.; Wolozin, B.; Van Den Bosch, L.; et al. Protein phase separation: A new phase in cell biology. Trends Cell Biol. 2018, 28, 420–435. [Google Scholar] [CrossRef]
  2. Alberti, S.; Dormann, D. Liquid–liquid phase separation in disease. Annu. Rev. Genet. 2019, 53, 171–194. [Google Scholar] [CrossRef] [PubMed]
  3. Banani, S.F.; Lee, H.O.; Hyman, A.A.; Rosen, M.K. Biomolecular condensates: Organizers of cellular biochemistry. Nat. Rev. Mol. Cell Biol. 2017, 18, 285–298. [Google Scholar] [CrossRef] [PubMed]
  4. Weber, S.C. Evidence for and against liquid-liquid phase separation in the nucleus. Non-Coding RNA 2019, 5, 50. [Google Scholar] [CrossRef] [PubMed]
  5. Kim, T.; Yoo, J.; Do, S.; Hwang, D.S.; Park, Y.; Shin, Y. RNA-mediated demixing transition of low-density condensates. Nat. Commun. 2023, 14, 2425. [Google Scholar] [CrossRef]
  6. Arter, W.E.; Qi, R.; Erkamp, N.A.; Krainer, G.; Didi, K.; Welsh, T.J.; Acker, J.; Nixon-Abell, J.; Qamar, S.; Guillén-Boixet, J.; et al. Biomolecular condensate phase diagrams with a combinatorial microdroplet platform. Nat. Commun. 2022, 13, 7845. [Google Scholar] [CrossRef]
  7. Brangwynne, C.P.; Mitchison, T.J.; Hyman, A.A. Active liquid-like behavior of nucleoli determines their size and shape in Xenopus laevis oocytes. Proc. Natl. Acad. Sci. USA 2011, 108, 4334–4339. [Google Scholar] [CrossRef]
  8. Brangwynne, C.P.; Eckmann, C.R.; Courson, D.S.; Rybarska, A.; Hoege, C.; Gharakhani, J.; Jülicher, F.; Hyman, A.A. Germline P granules are liquid droplets that localize by controlled dissolution/condensation. Science 2009, 324, 1729–1732. [Google Scholar] [CrossRef]
  9. Elbaum-Garfinkle, S. Matter over mind: Liquid phase separation and neurodegeneration. J. Biol. Chem. 2019, 294, 7160–7168. [Google Scholar] [CrossRef]
  10. Gebauer, D.; Kellermeier, M.; Gale, J.D.; Bergström, L.; Cölfen, H. Pre-nucleation clusters as solute precursors in crystallisation. Chem. Soc. Rev. 2014, 43, 2348–2371. [Google Scholar] [CrossRef]
  11. Alert, R.; Tierno, P.; Casademunt, J. Formation of metastable phases by spinodal decomposition. Nat. Commun. 2016, 7, 13067. [Google Scholar] [CrossRef] [PubMed]
  12. Yaragalla, S.; Mishra, R.K.; Thomas, S.; Kalarikkal, N.; Maria, H.J. Carbon-Based Nanofillers and Their Rubber Nanocomposites: Fundamentals and Applications; Elsevier: New York, NY, USA, 2019. [Google Scholar]
  13. Freire, E. Differential scanning calorimetry. Protein Stability And Folding: Theory and Practice; Humana: Totowa, NJ, USA, 1995; pp. 191–218. [Google Scholar]
  14. Guinier, A. X-Ray Crystallographic Technology; Hilger and Watts: London, UK, 1952. [Google Scholar]
  15. Stahlberg, H.; Walz, T. Molecular electron microscopy: State of the art and current challenges. ACS Chem. Biol. 2008, 3, 268–281. [Google Scholar] [CrossRef]
  16. Saunders, N.; Miodownik, A.P. CALPHAD (Calculation of Phase DIAGRAMS): A Comprehensive Guide; Elsevier: New York, NY, USA, 1998; Volume 1. [Google Scholar]
  17. Lukas, H.; Fries, S.G.; Sundman, B. Computational Thermodynamics: The Calphad Method; Cambridge University Press: New York, NY, USA, 2007. [Google Scholar]
  18. Kattner, U.R. The CALPHAD method and its role in material and process development. Tecnol. Metal. Mater. Min. 2016, 13, 3–15. [Google Scholar] [CrossRef] [PubMed]
  19. Schlick, T. Pursuing Laplace’s vision on modern computers. In Mathematical Approaches to Biomolecular Structure and Dynamics; Springer: New York, NY, USA, 1996; pp. 219–247. [Google Scholar]
  20. Van Mourik, T.; Gdanitz, R.J. A critical note on density functional theory studies on rare-gas dimers. J. Chem. Phys. 2002, 116, 9620–9623. [Google Scholar] [CrossRef]
  21. Ray, S.; Singh, N.; Kumar, R.; Patel, K.; Pandey, S.; Datta, D.; Mahato, J.; Panigrahi, R.; Navalkar, A.; Mehra, S.; et al. α-Synuclein aggregation nucleates through liquid–liquid phase separation. Nat. Chem. 2020, 12, 705–716. [Google Scholar] [CrossRef]
  22. Lashuel, H.A.; Overk, C.R.; Oueslati, A.; Masliah, E. The many faces of α-synuclein: From structure and toxicity to therapeutic target. Nat. Rev. Neurosci. 2013, 14, 38–48. [Google Scholar] [CrossRef]
  23. Pieri, L.; Madiona, K.; Bousset, L.; Melki, R. Fibrillar α-synuclein and huntingtin exon 1 assemblies are toxic to the cells. Biophys. J. 2012, 102, 2894–2905. [Google Scholar] [CrossRef]
  24. Ulmer, T.S.; Bax, A.; Cole, N.B.; Nussbaum, R.L. Structure and dynamics of micelle-bound human α-synuclein. J. Biol. Chem. 2005, 280, 9595–9603. [Google Scholar] [CrossRef]
  25. Ball, K.A.; Phillips, A.H.; Nerenberg, P.S.; Fawzi, N.L.; Wemmer, D.E.; Head-Gordon, T. Homogeneous and heterogeneous tertiary structure ensembles of amyloid-β peptides. Biochemistry 2011, 50, 7612–7628. [Google Scholar] [CrossRef]
  26. Cote, Y.; Delarue, P.; Scheraga, H.A.; Senet, P.; Maisuradze, G.G. From a highly disordered to a metastable state: Uncovering insights of α-synuclein. ACS Chem. Neurosci. 2018, 9, 1051–1065. [Google Scholar] [CrossRef]
  27. Ilie, I.M.; Nayar, D.; den Otter, W.K.; van der Vegt, N.F.; Briels, W.J. Intrinsic conformational preferences and interactions in α-synuclein fibrils: Insights from molecular dynamics simulations. J. Chem. Theory Comput. 2018, 14, 3298–3310. [Google Scholar] [CrossRef]
  28. Chwastyk, M.; Panek, E.A.; Malinowski, J.; Jaskólski, M.; Cieplak, M. Properties of cavities in biological structures—A survey of the Protein Data Bank. Front. Mol. Biosci. 2020, 7, 591381. [Google Scholar] [CrossRef]
  29. Mukherjee, S.; Sakunthala, A.; Gadhe, L.; Poudyal, M.; Sawner, A.S.; Kadu, P.; Maji, S.K. Liquid-liquid phase separation of α-synuclein: A new mechanistic insight for α-synuclein aggregation associated with Parkinson’s disease pathogenesis. J. Mol. Biol. 2023, 435, 167713. [Google Scholar] [CrossRef]
  30. Khare, S.D.; Chinchilla, P.; Baum, J. Multifaceted interactions mediated by intrinsically disordered regions play key roles in alpha synuclein aggregation. Curr. Opin. Struct. Biol. 2023, 80, 102579. [Google Scholar] [CrossRef]
  31. Sawner, A.S.; Ray, S.; Yadav, P.; Mukherjee, S.; Panigrahi, R.; Poudyal, M.; Patel, K.; Ghosh, D.; Kummerant, E.; Kumar, A.; et al. Modulating α-synuclein liquid–liquid phase separation: Published as part of the Biochemistry virtual special issue “Protein Condensates”. Biochemistry 2021, 60, 3676–3696. [Google Scholar] [CrossRef]
  32. Sternke-Hoffmann, R.; Sun, X.; Menzel, A.; Pinto, M.D.S.; Venclovaite, U.; Wördehoff, M.; Hoyer, W.; Zheng, W.; Luo, J. Phase Separation and Aggregation of Synuclein Diverge at Different Salt Conditions. Adv. Sci. 2024, 11, 2308279. [Google Scholar] [CrossRef]
  33. Ziaunys, M.; Sulskis, D.; Veiveris, D.; Kopustas, A.; Snieckute, R.; Mikalauskaite, K.; Sakalauskas, A.; Tutkus, M.; Smirnovas, V. Liquid–liquid phase separation of alpha-synuclein increases the structural variability of fibrils formed during amyloid aggregation. FEBS J. 2024, 291, 4522–4538. [Google Scholar] [CrossRef]
  34. Dada, S.T.; Hardenberg, M.C.; Toprakcioglu, Z.; Mrugalla, L.K.; Cali, M.P.; McKeon, M.O.; Klimont, E.; Michaels, T.C.; Knowles, T.P.; Vendruscolo, M. Spontaneous nucleation and fast aggregate-dependent proliferation of α-synuclein aggregates within liquid condensates at neutral pH. Proc. Natl. Acad. Sci. USA 2023, 120, e2208792120. [Google Scholar] [CrossRef]
  35. Siegert, A.; Rankovic, M.; Favretto, F.; Ukmar-Godec, T.; Strohäker, T.; Becker, S.; Zweckstetter, M. Interplay between tau and α-synuclein liquid–liquid phase separation. Protein Sci. 2021, 30, 1326–1336. [Google Scholar] [CrossRef]
  36. Chwastyk, M.; Jaskolski, M.; Cieplak, M. Structure-based analysis of thermodynamic and mechanical properties of cavity-containing proteins–case study of plant pathogenesis-related proteins of class 10. FEBS J. 2014, 281, 416–429. [Google Scholar] [CrossRef]
  37. Pham, D.Q.H.; Anila, M.M.; Chwastyk, M. Clustering-based Method for Constructing the Phase Diagram of the van der Waals Model Fluid. Curr. Protein Pept. Sci. 2025; online ahead of print. [Google Scholar] [CrossRef]
  38. Blas, F.J.; MacDowell, L.G.; de Miguel, E.; Jackson, G. Vapor-liquid interfacial properties of fully flexible Lennard-Jones chains. J. Chem. Phys. 2008, 129, 144703. [Google Scholar] [CrossRef]
  39. Dignon, G.L.; Zheng, W.; Best, R.B.; Kim, Y.C.; Mittal, J. Relation between single-molecule properties and phase behavior of intrinsically disordered proteins. Proc. Natl. Acad. Sci. USA 2018, 115, 9929–9934. [Google Scholar] [CrossRef]
  40. Martínez, L.; Andrade, R.; Birgin, E.G.; Martínez, J.M. PACKMOL: A package for building initial configurations for molecular dynamics simulations. J. Comput. Chem. 2009, 30, 2157–2164. [Google Scholar] [CrossRef]
  41. Wasim, A.; Menon, S.; Mondal, J. Modulation of α-synuclein aggregation amid diverse environmental perturbation. eLife 2024, 13, RP95180. [Google Scholar] [CrossRef]
  42. Landau, L.D.; Lifshitz, E.M. Course of Theoretical Physics; Elsevier: New York, NY, USA, 2013. [Google Scholar]
  43. Phillips, J.C.; Hardy, D.J.; Maia, J.D.; Stone, J.E.; Ribeiro, J.V.; Bernardi, R.C.; Buch, R.; Fiorin, G.; Hénin, J.; Jiang, W.; et al. Scalable molecular dynamics on CPU and GPU architectures with NAMD. J. Chem. Phys. 2020, 153, 044130. [Google Scholar] [CrossRef]
  44. Tanner, D.E.; Chan, K.Y.; Phillips, J.C.; Schulten, K. Parallel generalized Born implicit solvent calculations with NAMD. J. Chem. Theory Comput. 2011, 7, 3635–3642. [Google Scholar] [CrossRef]
  45. Mioduszewski, Ł.; Rozycki, B.; Cieplak, M. Pseudo-improper-dihedral model for intrinsically disordered proteins. J. Chem. Theory Comput. 2020, 16, 4726–4733. [Google Scholar] [CrossRef]
  46. Kim, Y.C.; Hummer, G. Coarse-grained models for simulations of multiprotein complexes: Application to ubiquitin binding. J. Mol. Biol. 2008, 375, 1416–1433. [Google Scholar] [CrossRef]
  47. Dignon, G.L.; Zheng, W.; Kim, Y.C.; Best, R.B.; Mittal, J. Sequence determinants of protein phase behavior from a coarse-grained model. PLoS Comput. Biol. 2018, 14, e1005941. [Google Scholar] [CrossRef]
  48. Różycki, B.; Mioduszewski, Ł.; Cieplak, M. Unbinding and unfolding of adhesion protein complexes through stretching: Interplay between shear and tensile mechanical clamps. Proteins Struct. Funct. Bioinform. 2014, 82, 3144–3153. [Google Scholar] [CrossRef] [PubMed]
  49. Chiu, Y.P.; Sun, Y.C.; Qiu, D.C.; Lin, Y.H.; Chen, Y.Q.; Kuo, J.C.; Huang, J.-R. Liquid-liquid phase separation and extracellular multivalent interactions in the tale of galectin-3. Nat. Commun. 2020, 11, 1229. [Google Scholar] [CrossRef]
  50. Benayad, Z.; von Bulow, S.; Stelzl, L.S.; Hummer, G. Simulation of FUS protein condensates with an adapted coarse-grained model. J. Chem. Theory Comput. 2020, 17, 525–537. [Google Scholar] [CrossRef]
  51. Chen, C.R.; Makhatadze, G.I. ProteinVolume: Calculating molecular van der Waals and void volumes in proteins. BMC Bioinform. 2015, 16, 101. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Illustration of the spherical approximation for the cluster density calculation. The α -synuclein chains forming a cluster are shown in green and a homogeneous sphere approximating the cluster’s volume is semi-transparent and colored in red. Importantly, the sphere has the same radius of gyration as the cluster.
Figure 1. Illustration of the spherical approximation for the cluster density calculation. The α -synuclein chains forming a cluster are shown in green and a homogeneous sphere approximating the cluster’s volume is semi-transparent and colored in red. Importantly, the sphere has the same radius of gyration as the cluster.
Ijms 26 08631 g001
Figure 2. The density ratio d / d theory  versus the number n of chains in a given cluster of the spherical (circles), ellipsoidal (squares), or cylindrical (triangles) shape. Here, d theory and d denote, respectively, the theoretical density specified in Section 4.3 and the density computed under the spherical approximation.
Figure 2. The density ratio d / d theory  versus the number n of chains in a given cluster of the spherical (circles), ellipsoidal (squares), or cylindrical (triangles) shape. Here, d theory and d denote, respectively, the theoretical density specified in Section 4.3 and the density computed under the spherical approximation.
Ijms 26 08631 g002
Figure 3. Illustration of the ellipsoidal approximation for the cluster density calculation. The α -synuclein chains forming a medium-size cluster are shown in green. The homogeneous ellipsoid with semi-axis lengths derived from the cluster’s principal moments of inertia is semi-transparent and shown in red.
Figure 3. Illustration of the ellipsoidal approximation for the cluster density calculation. The α -synuclein chains forming a medium-size cluster are shown in green. The homogeneous ellipsoid with semi-axis lengths derived from the cluster’s principal moments of inertia is semi-transparent and shown in red.
Ijms 26 08631 g003
Figure 4. The density ratio d / d theory  versus the number n of chains, forming a given cluster of the spherical (circles), ellipsoidal (squares) or cylindrical (triangles) shape. Here, d theory and d denote, respectively, the theoretical density specified in Section 4.3 and the density computed under the ellipsoidal approximation.
Figure 4. The density ratio d / d theory  versus the number n of chains, forming a given cluster of the spherical (circles), ellipsoidal (squares) or cylindrical (triangles) shape. Here, d theory and d denote, respectively, the theoretical density specified in Section 4.3 and the density computed under the ellipsoidal approximation.
Ijms 26 08631 g004
Figure 5. Schematic illustration of how the optimal probe radius is chosen for the cluster volume calculation using SPACEBALL. Each of the proteins forming the cluster is depicted as a chain of beads. The surface chains are marked in red. The remaining chains, marked in blue, form the interior of the cluster. The distinction between the surface chains and the interior chains is made on the basis of the distances from each of the chains to the geometrical center of the cluster. These distances are indicated by the green dotted lines. In contrast, the distances between the geometrical centers of the surface chains and their nearest neighbors are marked as the black dotted lines. The average distance between the geometrical centers of the surface chains and their nearest neighbors is selected as the surface radius, which is denoted as r probe S and depends on the surface chain fraction ϕ . The surface radius r probe S determined for ϕ = 0.5 is then taken as the optimal radius r probe O of the wall probes for the volume calculation using SPACEBALL.
Figure 5. Schematic illustration of how the optimal probe radius is chosen for the cluster volume calculation using SPACEBALL. Each of the proteins forming the cluster is depicted as a chain of beads. The surface chains are marked in red. The remaining chains, marked in blue, form the interior of the cluster. The distinction between the surface chains and the interior chains is made on the basis of the distances from each of the chains to the geometrical center of the cluster. These distances are indicated by the green dotted lines. In contrast, the distances between the geometrical centers of the surface chains and their nearest neighbors are marked as the black dotted lines. The average distance between the geometrical centers of the surface chains and their nearest neighbors is selected as the surface radius, which is denoted as r probe S and depends on the surface chain fraction ϕ . The surface radius r probe S determined for ϕ = 0.5 is then taken as the optimal radius r probe O of the wall probes for the volume calculation using SPACEBALL.
Ijms 26 08631 g005
Figure 6. The surface radius r probe S  versus the surface chain fraction ϕ for the spherical, ellipsoidal and cylindrical clusters with 100 (data points in blue), 200 (data points in orange), 400 (data points in green), and 600 (data points in red) chains.
Figure 6. The surface radius r probe S  versus the surface chain fraction ϕ for the spherical, ellipsoidal and cylindrical clusters with 100 (data points in blue), 200 (data points in orange), 400 (data points in green), and 600 (data points in red) chains.
Ijms 26 08631 g006
Figure 7. The density ratio d / d theory  versus the number n of chains in a given cluster of the spherical (circles), ellipsoidal (squares), or cylindrical (triangles) shape. Here, d theory and d denote, respectively, the theoretical density specified in Section 4.3 and the density computed using SPACEBALL with the optimal probe radii given in Table 1.
Figure 7. The density ratio d / d theory  versus the number n of chains in a given cluster of the spherical (circles), ellipsoidal (squares), or cylindrical (triangles) shape. Here, d theory and d denote, respectively, the theoretical density specified in Section 4.3 and the density computed using SPACEBALL with the optimal probe radii given in Table 1.
Ijms 26 08631 g007
Figure 8. Snapshots of the spherical (A,B), ellipsoidal (C,D) and cylindrical (E,F) clusters with 100 (first column), 200 (second column), 400 (third column), and 600 (fourth column) chains (green), together with envelopes (red) enclosing the clusters. The envelopes are generated by SPACEBALL and shown as semi-transparent surfaces in panels A, C, and E. For comparison, panels B, D, and F show cross-sections of the clusters with opaque envelopes.
Figure 8. Snapshots of the spherical (A,B), ellipsoidal (C,D) and cylindrical (E,F) clusters with 100 (first column), 200 (second column), 400 (third column), and 600 (fourth column) chains (green), together with envelopes (red) enclosing the clusters. The envelopes are generated by SPACEBALL and shown as semi-transparent surfaces in panels A, C, and E. For comparison, panels B, D, and F show cross-sections of the clusters with opaque envelopes.
Ijms 26 08631 g008
Figure 9. The ratio of the computed density and the theoretical density versus the radius of the wall probes applied in the SPACEBALL computation to the spherical, ellipsoidal, and cylindrical clusters with 100 (data points in blue), 200 (data points in yellow), 400 (data points in green), and 600 (data points in red) chains. The cross symbols (×) indicate the values of the optimal probe radius r probe O presented in Table 1. The horizontal dashed lines indicate d = d theory S , d = d theory E and d = d theory C , i.e., perfect agreement between the computed and theoretical densities.
Figure 9. The ratio of the computed density and the theoretical density versus the radius of the wall probes applied in the SPACEBALL computation to the spherical, ellipsoidal, and cylindrical clusters with 100 (data points in blue), 200 (data points in yellow), 400 (data points in green), and 600 (data points in red) chains. The cross symbols (×) indicate the values of the optimal probe radius r probe O presented in Table 1. The horizontal dashed lines indicate d = d theory S , d = d theory E and d = d theory C , i.e., perfect agreement between the computed and theoretical densities.
Ijms 26 08631 g009
Figure 10. Comparison of the densities obtained using the spherical approximation, the ellipsoidal approximation, and the SPACEBALL method. The ratios of the computed and theoretical densities, d / d theory S , d / d theory E and d / d theory C , are shown for the spherical, ellipsoidal and cylindrical clusters with 100 (bars in blue), 200 (bars in orange), 300 (bars in purple), 400 (bars in green), 500 (bars in brown) and 600 (bars in red) chains. The dashed lines indicate a perfect agreement between the computed and theoretical densities, i.e., d = d theory S , d = d theory E and d = d theory C .
Figure 10. Comparison of the densities obtained using the spherical approximation, the ellipsoidal approximation, and the SPACEBALL method. The ratios of the computed and theoretical densities, d / d theory S , d / d theory E and d / d theory C , are shown for the spherical, ellipsoidal and cylindrical clusters with 100 (bars in blue), 200 (bars in orange), 300 (bars in purple), 400 (bars in green), 500 (bars in brown) and 600 (bars in red) chains. The dashed lines indicate a perfect agreement between the computed and theoretical densities, i.e., d = d theory S , d = d theory E and d = d theory C .
Ijms 26 08631 g010
Figure 11. The surface radius r probe  versus the surface chain fraction ϕ for the clusters of small (data points in blue), medium (data points in orange), and large (data points in green) size. Panels (A,B) correspond to the two sets of clusters.
Figure 11. The surface radius r probe  versus the surface chain fraction ϕ for the clusters of small (data points in blue), medium (data points in orange), and large (data points in green) size. Panels (A,B) correspond to the two sets of clusters.
Ijms 26 08631 g011
Figure 12. Snapshots of the large (first column), medium (second column), and small (third column) clusters of α -synuclein chains (green), together with envelopes (red) enclosing the clusters. The envelopes are generated by SPACEBALL and shown as semi-transparent surfaces in panels (A,C). For comparison, panels (B,D) show cross-sections through the clusters with opaque envelopes.
Figure 12. Snapshots of the large (first column), medium (second column), and small (third column) clusters of α -synuclein chains (green), together with envelopes (red) enclosing the clusters. The envelopes are generated by SPACEBALL and shown as semi-transparent surfaces in panels (A,C). For comparison, panels (B,D) show cross-sections through the clusters with opaque envelopes.
Ijms 26 08631 g012
Figure 13. Density of the large (data points in blue), medium (data points in orange), and small (data points in green) clusters of α -synuclein chains versus the radius of the wall probes used in the SPACEBALL calculation. Panels (A,B) correspond to the two sets of clusters. The cross symbols (×) indicate the values of the optimal probe radius r probe O listed in Table 2.
Figure 13. Density of the large (data points in blue), medium (data points in orange), and small (data points in green) clusters of α -synuclein chains versus the radius of the wall probes used in the SPACEBALL calculation. Panels (A,B) correspond to the two sets of clusters. The cross symbols (×) indicate the values of the optimal probe radius r probe O listed in Table 2.
Ijms 26 08631 g013
Figure 14. The densities of the large, medium, and small clusters determined using the spherical approximation (bars in blue), the ellipsoidal approximation (bars in orange), and the SPACEBALL method (bars in green). Panels (A,B) correspond to the two sets of clusters.
Figure 14. The densities of the large, medium, and small clusters determined using the spherical approximation (bars in blue), the ellipsoidal approximation (bars in orange), and the SPACEBALL method (bars in green). Panels (A,B) correspond to the two sets of clusters.
Ijms 26 08631 g014
Figure 15. Cluster formation during the coarse-grained MD simulation of fifty chains of α -synuclein. The left panel shows the initial configuration of the simulation system, where the fifty chains of α -synuclein are distributed randomly in the cubic box of side length L = 200 nm. The right panel displays the system configuration after 70 μ s of MD with the fifty chains of α -synuclein in a cubic box of side length L = 20.6 nm. Clusters of α -synuclein, identified by the method described in Section 4.2, are highlighted in different colors.
Figure 15. Cluster formation during the coarse-grained MD simulation of fifty chains of α -synuclein. The left panel shows the initial configuration of the simulation system, where the fifty chains of α -synuclein are distributed randomly in the cubic box of side length L = 200 nm. The right panel displays the system configuration after 70 μ s of MD with the fifty chains of α -synuclein in a cubic box of side length L = 20.6 nm. Clusters of α -synuclein, identified by the method described in Section 4.2, are highlighted in different colors.
Ijms 26 08631 g015
Figure 16. Snapshots of the large, medium, and small clusters selected for assessment of different methods of density calculation. Each of the clusters contains fifty chains of α -synuclein (green). The gyration radius and the maximum diameter of each of the clusters is presented in Table 4. Panels (A,B) correspond to the two sets of clusters described in the main text and indicated in Table 4. Each of the clusters was generated in the coarse-grained MD simulations and identified by agglomerative clustering with the cut-off distance d cut off = 0.75 ( σ i + σ j ) , as described in the main text.
Figure 16. Snapshots of the large, medium, and small clusters selected for assessment of different methods of density calculation. Each of the clusters contains fifty chains of α -synuclein (green). The gyration radius and the maximum diameter of each of the clusters is presented in Table 4. Panels (A,B) correspond to the two sets of clusters described in the main text and indicated in Table 4. Each of the clusters was generated in the coarse-grained MD simulations and identified by agglomerative clustering with the cut-off distance d cut off = 0.75 ( σ i + σ j ) , as described in the main text.
Ijms 26 08631 g016
Figure 17. Snapshots of twelve clusters of α -synuclein chains (green) generated using Packmol and used for the assessment of different methods of density calculation. The clusters are shaped like a sphere (A), an ellipsoid (B), and a cylinder (C), and comprise 100, 200, 400, and 600 chains of α -synuclein (from the left to the right).
Figure 17. Snapshots of twelve clusters of α -synuclein chains (green) generated using Packmol and used for the assessment of different methods of density calculation. The clusters are shaped like a sphere (A), an ellipsoid (B), and a cylinder (C), and comprise 100, 200, 400, and 600 chains of α -synuclein (from the left to the right).
Ijms 26 08631 g017
Table 1. The optimal probe radii r probe O alongside the SPACEBALL-computed densities of the spherical, ellipsoidal, and cylindrical clusters with 100, 200, 300, 400, 500, and 600 chains.
Table 1. The optimal probe radii r probe O alongside the SPACEBALL-computed densities of the spherical, ellipsoidal, and cylindrical clusters with 100, 200, 300, 400, 500, and 600 chains.
n r probe O (nm)d (res/ nm 3 ) d theory S (res/ nm 3 )
1008.860.0660.078
2005.880.1540.156
3005.070.2360.234
4004.540.3060.312
5004.060.3940.390
6003.930.4540.468
n r probe O (nm)d (res/ nm 3 ) d theory E (res/ nm 3 )
1007.580.0740.076
2005.720.1480.152
3004.740.2500.228
4004.450.2990.304
5003.820.3770.380
6003.800.4880.456
n r probe O (nm)d (res/ nm 3 ) d theory C (res/ nm 3 )
1007.490.0780.076
2005.150.1750.153
3004.680.2340.230
4003.920.3230.306
5003.950.3870.383
6003.690.4670.460
Table 2. The optimal probe radius r probe O and the computed density of the clusters obtained from the coarse-grained MD simulations.
Table 2. The optimal probe radius r probe O and the computed density of the clusters obtained from the coarse-grained MD simulations.
Cluster SetCluster Extension r probe O (nm)d (res/ nm 3 )
Alarge4.370.220
medium4.610.236
small4.440.310
Blarge4.860.259
medium4.450.258
small3.940.301
Table 3. Parameters of the coarse-grained MD simulations of fifty chains of α -synuclein.
Table 3. Parameters of the coarse-grained MD simulations of fifty chains of α -synuclein.
Temperature T ( ϵ / k B )Box Length L (nm)Simulation Time ( μ s)
0.4518.5481
0.5020.6487
1.2026.1313
Table 4. Dimensions of the α -synuclein clusters derived from the coarse-grained MD simulations and selected for an assessment of density calculation methods. Each of the clusters is formed of fifty chains. The cluster size (large, medium, or small) is indicated in the second column and quantified by the gyration radius ( R g ) and the maximum diameter ( D max ) in the third and fourth column, respectively. The simulation temperature (T) is given in the fifth column.
Table 4. Dimensions of the α -synuclein clusters derived from the coarse-grained MD simulations and selected for an assessment of density calculation methods. Each of the clusters is formed of fifty chains. The cluster size (large, medium, or small) is indicated in the second column and quantified by the gyration radius ( R g ) and the maximum diameter ( D max ) in the third and fourth column, respectively. The simulation temperature (T) is given in the fifth column.
Cluster SetCluster Extension R g (nm) D max (nm)T ( ϵ / k B )
Alarge14.258.61.2
medium12.144.30.5
small10.236.70.45
Blarge1352.31.2
medium1244.30.5
small9.938.70.45
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Anila, M.M.; Wojciechowski, M.; Chwastyk, M.; Różycki, B. Theoretical Methods for Assessing the Density of Protein Nanodroplets. Int. J. Mol. Sci. 2025, 26, 8631. https://doi.org/10.3390/ijms26178631

AMA Style

Anila MM, Wojciechowski M, Chwastyk M, Różycki B. Theoretical Methods for Assessing the Density of Protein Nanodroplets. International Journal of Molecular Sciences. 2025; 26(17):8631. https://doi.org/10.3390/ijms26178631

Chicago/Turabian Style

Anila, Midhun Mohan, Michał Wojciechowski, Mateusz Chwastyk, and Bartosz Różycki. 2025. "Theoretical Methods for Assessing the Density of Protein Nanodroplets" International Journal of Molecular Sciences 26, no. 17: 8631. https://doi.org/10.3390/ijms26178631

APA Style

Anila, M. M., Wojciechowski, M., Chwastyk, M., & Różycki, B. (2025). Theoretical Methods for Assessing the Density of Protein Nanodroplets. International Journal of Molecular Sciences, 26(17), 8631. https://doi.org/10.3390/ijms26178631

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop