1. Introduction
Mammalian reproduction depends on the ability of sperm cells to locate an egg within the complex microenvironment of the female reproductive tract. Reproductive tracts impose numerous barriers to sperm success, ultimately reducing millions of potential fertilizers to only a few dozen cells that manage to find their way to the ampulla of the oviduct, where fertilization takes place. Although rudimentary forms of navigation, such as chemotaxis and rheotaxis, have been investigated in mammals [
1,
2], the physiological relevance of these phenomena remains unclear and may be more significant for externally fertilizing organisms that navigate in three dimensions [
3,
4]. For nearly all eukaryotes, the sperm search for an egg depends, at least in part, on intrinsically controlled motility patterns. Additionally, among species for which forms of taxis have been shown to be important, it is likely that sperm find themselves in regions or conditions where external guidance cues may be weak or absent, highlighting the importance of understanding how intrinsically controlled motility contributes to the search process.
Although the intracellular biochemical mechanisms that control sperm post-ejaculatory behaviors have been well studied in rodents and humans [
5], the stochasticity of sperm search owing to complex variation in intrinsically controlled motility patterns remains poorly understood, particularly at the scale of sperm cell populations, which vary from
to
cells per ejaculate depending on the species.
Sperm cells are microswimmers that are subject to the physical limitations of life at low Reynolds numbers [
6], i.e., inertia plays no role and the motion is overdamped. Furthermore, mammalian sperm move on the epithelial surface of the reproductive tract, and two-dimensional modeling is a reasonable simplification [
7]. Sperm motility is driven by active flagellar beating and can be modeled as a random walk [
8]. Contemporary computer-aided sperm motility analysis generally focuses on averaged motility patterns on timescales of just a few seconds [
9].
A human sperm cell has a width of about 2 μm. Human oviducts are called “fallopian tubes”. Fertilization generally occurs in the lining of a fallopian tube. After release in the vagina, only a fraction of less than one-thousandth of the approximately hundred million sperm cells in a human ejaculate reaches the fallopian tubes. The lining of the fallopian tubes has folds but can still roughly be thought of as the inside of a cylinder. The cylinder is 10 to 14 cm long and about 1 cm in diameter. The egg cell is an ovoid with a diameter of about 0.15 mm. In other words, we are looking at something the size of a pea that is looking for a big beach ball in a search area that is the size of ten football fields.
An important parameter in this study is the persistence length. The persistence length characterizes the sperm’s trajectory, and it can only be observed when the trajectory is followed on a timescale longer than just a few seconds. The persistence length is the distance over which a curve remains directionally correlated, and the concept is also frequently applied in the study of polymers [
10,
11,
12]. We will present the formal definition of the persistence length in
Section 2. Suffice it to say at this point that a trajectory with a long persistence length appears as a relatively straight path. A smaller persistence length, on the other hand, corresponds to more frequent changes in direction. Sperm within one ejaculate can exhibit persistence lengths that may differ by orders of magnitude (cf.
Figure 1).
Recently, transitions between high-persistence-length movement and low-persistence-length movement have been observed in bovine sperm [
13]. This suggests that sperm may transition between mere dispersal and localized search behavior. In confined or semi-confined domains, such as an oviduct in vivo or a culture dish surface in vitro, effective sperm navigation likely requires a strategy that balances rapid traversal across a searchable surface area with a thorough exploration of the local space [
8]. The efficiency of sperm search may depend on the transition rates among movement patterns. Transitions between different movement patterns have also been observed in
E. coli bacteria [
14]. In this case, the transitions between the “movement states” were well described with ordinary chemical kinetics (i.e., exponentially distributed dwelling times in states).
Here, we hypothesize that this two-state random walk—switching from a high-persistence length to a low-persistence length—can enhance search efficiency in domains where uniform coverage is essential. We develop a two-state active particle model to explore this hypothesis on a disk with an absorbing boundary, where we let searchers emerge from the center of the disk. In
Section 2, we show through numerical simulation how changes in the persistence length can improve the coverage of the search domain. As a baseline, we examine the case of a searcher with a constant persistence length. Next, we study two enhancements. The first one involves a well-tuned transition rate (i.e., exponentially distributed waiting times) to a persistence length that is about an order of magnitude smaller. There is evidence that the change of state can be catalyzed by chemicals that are present in the environment [
13,
14]. Taking into account this positional guidance, we also study, as a second case, a modified version of our first model: we take the stochasticity out of the switch and let the searching sperm cell change its motility state upon entering the outer section of the disk. In
Section 3, we use the diffusion equation to add mathematical rigor to the results from
Section 2.
Figure 1.
Representative video microscopy trajectories for approximately 20 human sperm over the course of 26 s in a 563 × 563 μm
2 field of view at 5× magnification at the objective under negative phase contrast. Following 30 min liquefaction at 37 °C in a 5% CO
2 incubator, sperm were isolated by differential centrifugation in 50% isotonic Percoll. Sperm were resuspended in Biggers, Whitten, and Wiggingham media [
15], imaged in a 20 μm deep chambered slide, and temperature was maintained during imaging with a Peltier-heated stage at 37 °C. The sperm cells were observed to move at roughly the same speed, but the persistence lengths differed widely. It is for distinguishability that trajectories have been given different colors.
Figure 1.
Representative video microscopy trajectories for approximately 20 human sperm over the course of 26 s in a 563 × 563 μm
2 field of view at 5× magnification at the objective under negative phase contrast. Following 30 min liquefaction at 37 °C in a 5% CO
2 incubator, sperm were isolated by differential centrifugation in 50% isotonic Percoll. Sperm were resuspended in Biggers, Whitten, and Wiggingham media [
15], imaged in a 20 μm deep chambered slide, and temperature was maintained during imaging with a Peltier-heated stage at 37 °C. The sperm cells were observed to move at roughly the same speed, but the persistence lengths differed widely. It is for distinguishability that trajectories have been given different colors.
2. A Two-State Random Searcher on a Disk
Our geometric setup is simple. Searching particles appear in the center of a disk and follow a random trajectory until they hit the absorbing boundary at the circumference. We set the disk’s radius at
(cf.
Figure 2). Each moving searcher starts out with a large persistence length that is on the order of magnitude of the radius of the disk. The switch is to a persistence length that is about an order of magnitude smaller. We gather statistics for three cases: (
i) a persistence length that is fixed at half the disk’s radius and does not change, (
ii) a persistence length that is initially set at one-and-a-half times the disk’s radius, with a transition rate that is set such that the irreversible switch to the smaller persistence length occurs on average at about
, and (
iii) persistence lengths as in the second case, but with the irreversible switch occurring at the moment that
is first crossed. We consider a search effective if there is no place for the target to “hide” on the disk. In practical terms, this means that all parts of the disk are equally searched. We divide the disk into annuli and numerically investigate. The circular symmetry of this setup keeps the analysis very manageable. In the next section, we set up and analyze a diffusion equation to help us understand the results of the stochastic simulations.
After the sperm is randomly distributed into different parts of the reproductive tract, each sperm performs a local search within a reasonable distance relative to its own body length. Such a search can be well modeled using the disk-with-absorbing-boundary setup described in the previous paragraph.
We model the sperm cell as an active particle that moves at a constant speed
. Fluctuations in the direction of motion
are accounted for with a diffusion coefficient
. This stochasticity is due to active flagellar fluctuations rather than Brownian motion [
16]. Let
be the unit vector in the direction of motion, and the following equation describes the sperm cell’s motion:
At any time, the unit vector in the direction of motion is given by
. The term
represents Gaussian-distributed white noise. We use the Euler–Maruyama scheme for the actual numerical simulations: at every timestep, the angle
is updated through the addition of a small random number. The actual
is implemented by taking a set of Gaussian-distributed numbers that have zero average and unit variance. Subsequent values of
are obtained by dividing these numbers by the square root of the timestep
.
The trajectories generated by Equation (
1) display a persistence length
. The persistence length is a notion that is also well known in polymer theory [
10,
11,
12]. We parametrize the trajectory with the contour length
ℓ. Obviously, the direction
changes as a function of
ℓ. We consider the orientations
and
, i.e., the orientations at two points that are a distance
ℓ away from each other along the contour. A good measure for the correlation between
and
is the average
, where
, and the average is taken over all positions
along the entire trajectory such that the different segments of length
ℓ do not overlap. For a very stiff, almost straight trajectory,
is close to one for all
ℓ. The persistence length
is defined by
[
10,
11,
12]. A smaller persistence length makes a trajectory appear more tightly wound (cf.
Figure 2).
Related to the persistence length is the Kuhn length. Imagine a point P along the trajectory. Points that are to the left and right of P come with orientations that are not correlated to the orientation at P. With this realization, we can then imagine the trajectory as being built up from straight segments with a length of . At each junction, the angle between the linked segments is a random number from a flat distribution between 0 and . The polymer or trajectory conceived as such is called a Freely Jointed Chain (FJC), and is known as the Kuhn length.
The FJC comes with concise formulae. For a chain of a total length
L and with
N Kuhn segments of length
, we have
for the total contour length
L and the average square end-to-end distance
, respectively. These formulae are well known, widely applied, and explained in many authoritative textbooks, although mostly in the context of polymer theory [
10,
11,
12].
For a searching particle that emerges at the center of the disk, it makes sense to search in such a way that any random point on the disk has an equal likelihood of being “found”. An annulus at a distance
r from the center of width
covers an area
(cf.
Figure 2). A search by a moving particle is thus more likely to be successful if the dwelling time at radius
r is proportional to
r itself. For an effective search, the trajectory should become tighter as the rim is approached, i.e., the Kuhn length
should decrease during the search. A simple physiological course of action to achieve such a decrease in
is to emerge from the center with an initial Kuhn length
and a well-tuned transition rate to a smaller value
.
Consider the disk with radius
, as depicted in
Figure 2.
Figure 2a shows a search with
and
, which implies a persistence length of
.
Figure 2b depicts the realization of a search with an initial
and a transition rate
to a final
. Note that the
in
Figure 2a is the geometric average of
and
.
The unit disk has three times as much area outside
as it has inside
. It thus makes sense to explore the disk’s outer part (
) with three times the contour length of the inner part. After some algebra, it follows from Equation (
2) that with a Kuhn length that is one-ninth of the original one, approximately the same end-to-end distance is covered with three times the contour length. Therefore, we take
. Finally, it needs to be noted that, for all generated trajectories, we take
for the timestep.
In
Figure 2, the unit disk is divided into nineteen annuli, each with a width of 1/20. In the center, there is one small circle with radius 1/20. A search is most effective if each location on the disk has an equal chance of being visited by the searcher. In terms of the annuli, this means that in each annulus, the searching particle spends the same amount of time per unit of area, i.e.,
(cf.
Figure 3) should be constant. Each of the three bar charts in
Figure 3 is the result of 5000 trajectories generated with Equation (
1). For each trajectory, the amount of time in each annulus was recorded and then divided by the area of that annulus.
Figure 3 represents the average over 5000 trajectories for three cases. For
Figure 3a, the radial diffusion coefficient was held constant at
. For
Figure 3b, each of the 5000 searchers starts at the origin with
and a transition rate from
to
.
Figure 3c depicts the result for searchers that can detect their own positions and irreversibly switch from
to
when
is first traversed.
Going from
Figure 3a to
Figure 3b, we see an improved search. The average and standard deviations of
over the twenty regions go from 1.1 and 1.5 to 1.7 and 1.7, which implies that the relative standard deviation (the standard deviation divided by the average) goes from 1.3 to 0.98. Furthermore, the escape time
, i.e., the average time it takes to go from the center of the disk to
, increases from
to
in the setup with the switch of
.
There is another small improvement as we go from
Figure 3b to
Figure 3c, i.e., to a setup where the searcher “reads” its position and makes the switch contingent on the position. For the search in
Figure 3c, the average and standard deviations are 1.6 and 1.5, resulting in a relative standard deviation of 0.94. The value of
is the same as for the non-position-guided switch, i.e.,
. Particularly striking in
Figure 3c is how
stays constant in the region between about
and
.
All in all, both switch-included searches last about 70% longer than the original search in
Figure 2a and
Figure 3a. Both of these enhanced searches are also more uniform in that among the annuli in the unit circle, the search times per area (
) have a relative standard deviation that is reduced by about 25% compared to the original switchless search. The improvements going from
Figure 3a to
Figure 3b and then to
Figure 3c may appear small. However, in the 1930s, the “New Evolutionary Synthesis” combined genetics and population dynamics to put evolution on a more quantitative and scientifically rigorous foundation. It was found that even small selective advantages drive evolution. The authoritative textbook by D. Futuyma [
17] summarizes it as follows: “… a character state with even a minuscule advantage will be fixed by natural selection. Hence, even very slight differences among species, in seemingly trivial characters such as the distribution of hairs on a fly or veins on a leaf, could conceivably have evolved as adaptations”.
3. The Evolution of the Probability Distribution on the Disk
The previous section involved an ordinary differential equation with a stochastic input at every timestep. The ensuing stochastic differential equation describes a trajectory of the diffusing particle. It is possible to formulate an equivalent partial differential equation that describes the time evolution of the particle’s probability distribution
. The underlying theory is covered in many standard textbooks [
18,
19,
20].
The equation that rules the 2D diffusion on the disk is
Here,
D and
stand for the diffusion coefficient and the Laplace operator, respectively. The diffusion coefficient
D in this equation has a simple relation to the
in Equation (
1). For a particle diffusing in 2D, the mean square displacement and the elapsed time are related through
. The
can be identified with
S in Equation (
2). The contour length
L can be identified with
. This leads first to
, and after invoking
and
, we find
In polar coordinates and with circular symmetry, Equation (
3) takes the form
Before further analysis, it is worth reflecting on Equation (
5) in light of what was established in the previous section. The first term on the right-hand side of Equation (
5) is a diffusion term that does not discriminate between the directions
r and
. The second term on the right-hand side is a drift term that contains a characteristic speed
. This may be puzzling as the diffusion should have no apparent direction. In his classic text, Feller [
21] explained the apparent anisotropy as follows (Volume 2, Chapter 10, Section 6): “The existence of a drift away from the origin can be understood if one considers a plane Brownian motion starting at the point
r of the
x-axis. For reasons of symmetry its
abscissa at epoch
is equally likely to be
or
. In the first case certainly
, but this relation can occur also in the second case. Thus the relation
has probability
, and on the average
R is bound to increase”.
With an initial condition
and on a disk with infinite radius, Equation (
5) is readily solved by the spreading Gaussian:
This solution comes with a root-mean-square distance to the origin,
, which increases following
. Taking the derivative with respect to time on both sides, we find that the speed of the root-mean-square distance
.
For a particle that drifts away from the origin with a speed , the time spent in an annulus of area is found to be . So, the search time per area, , appears to be independent of r. This uniform coverage of the search area occurs with any drift speed that is proportional to .
However, it is not hard to see that such
r-independence of
only works for a disk with a radius
. If a diffusing particle is absorbed upon hitting the boundary at a finite
, any trajectory leading to a position
after passing through an
region is eliminated. This leads to a decreasing
as
r increases. Such a decrease is indeed what we see in
Figure 3. Below, we add mathematical rigor to these insights and observations.
An absorbing boundary at
means that
. Solutions for setups with an absorbing boundary condition can commonly be found by placing an “image initial condition”,
, outside the disk. The image is negative, and the positioning of the image initial condition has to be such that the ensuing flow toward
(governed also by Equation (
5)) leads to
. The full solution
then satisfies Equation (
5) in the area
.
For 1D diffusion along an axis, the image method is easily implemented and readily leads to a solution. But complications arise in the 2D case. As previously mentioned in the context of Equation (
5), the directions toward the disk’s center and away from the disk’s center are
not equivalent. For an image that is initially positioned at
, i.e.,
, a solution according to Equation (
5) can be derived after some nontrivial mathematical operations [
22]:
, where
represents the modified Bessel function of the first kind. The problem with this solution is that we do
not have
at all times.
Below, we derive some relevant features of the diffusion on the 2D disk without an actual closed-form solution. Through Fick’s Law, the outflow from the disk can be identified with
. If we set
for the slope of the
curve at
(cf.
Figure 4), we have
at
.
is also a good approximation on
near the outer edge of the disk.
In our 2D case, the flux is the amount of probability that crosses at per unit of time and per unit of distance along the circumference of the circle with radius . Next, it is the ratio that represents the average speed necessary to bring about this flux. With , we obtain for the drift speed of the diffusing particle. We observe that, unlike the case discussed earlier, the drift speed now increases as the absorbing boundary of the disk becomes closer. This can be readily intuited upon the realization that with nearer, ever more possible trajectories hit and disappear from the domain.
As before, we divide our disk into annuli with a width
. With the drift speed derived in the last paragraph, we find that the dwelling time per annulus
. From there, we derive a search time per area that depends on
r:
It is obvious that
vanishes as
.
Figure 5 shows the results of a simulation. We let 1000 particles diffuse from the center of the disk to the absorbing boundary at
. The motion of the particles is again ruled by Equation (
1), with
and
. Through Equation (
4), this leads to
. The timestep used for the Euler scheme is again
. As before, the unit disk is segmented into a small
disk and nineteen annuli with a width of 0.05. Like before, the twenty bars in the histogram (
Figure 5a) give the average time per unit area spent in each of the twenty segments.
Figure 5b gives
for the four outermost annuli, together with the best-fitting straight line. Equation (
7) gives 63.7 for this slope. The slope of the fitted line is 67.7. The small discrepancy is not only due to the neglected curvature but also to the finite persistence length that is implicit when simulating Equation (
1). In our case,
. The partial differential equations that are central in this section, i.e., Equations (
3) and (
5), actually correspond to a fractal trajectory with zero persistence length.
A diffusion coefficient that decreases as the particle diffuses away from the center leads to a
D that effectively decreases with
r. As pointed out in the previous section, and as should now also be clear from Equation (
7), this flattens out
as a function of
r and leads to a more efficient search of the disk’s area.
Finally, it is worth pointing out that Equation (
7) and the drift speed
are independent of the actual slope
. This is a nice feature because as the particle diffuses away from
, the slope actually changes over time. Immediately after
, the slope increases. Later in the relaxation, the slope starts decreasing again. It is remarkable that Equation (
7) and the drift speed
remain the same throughout this process.
4. Discussion
We have studied a simple, bare-bones model to keep the analysis intuitive and to have manageable mathematics and numerics. Active particles can interact with each other in intricate ways (see, e.g., Ref [
23]), but we have studied independent, non-interacting searchers.
Figure 6 presents a search space that more closely resembles an actual oviduct. In the oviduct, the searching sperm cells enter on one side. Here, we let them enter at an arbitrary point on the left edge. We account for being on the inside of a cylinder through the periodicity of the vertical coordinate. The infundibulum is the interface between the fallopian tube and the ovary. Sperm can “get lost” there, and we model this by implementing an absorbing boundary on the right edge. The two-state strategy we studied on the disk would also likely be effective in this case. A good dispersal of searchers could be achieved by taking off with a large persistence length in an arbitrary direction from an arbitrary point on the left edge. Each individual sperm cell would then perform a more detailed search after it switches to the smaller persistence length.
Reproductive biologists have been aware for a long time that sperm in the oviduct can switch to a less linear, more vigorous type of motion. They have termed this type of movement “hyperactivation” [
13,
24,
25,
26,
27]. Many research efforts have been devoted to determining which types of chemicals can trigger the switch to hyperactivation. In terms of our Equation (
1), hyperactivation corresponds to a larger
and an ensuing smaller value of the persistence length
. It is commonly assumed that the purpose of hyperactive “wriggling” is detachment from the epithelium when the sperm cell is stuck there. In this work, we suggest that the switch to hyperactivation may simply be part of an optimized search strategy.
Over the past one and a half decades, an extensive literature about active Brownian particles has developed. Our Equation (
1) commonly features in review articles on the subject as a fundamental model to which more effects can be added [
28,
29]. Some of the trajectories in
Figure 1 exhibit chirality. Chirality has often been included in studies that model the motion of sperm cells [
13] or other microswimmers [
30]. In our minimal model, we did not take it into account. Chirality could readily be added to the model in Equation (
1) through the addition of an angular velocity
on the right-hand side of the second equation. However, such an addition would significantly complicate the subsequent analysis, and a derivation, such as that in
Section 3, would be harder and less intuitive. It is obvious that chirality, or any added curvature for that matter, is like a smaller Kuhn length in that it leads to an area being more thoroughly searched. Nonetheless, the central result of our work, i.e., that a well-tuned switch to a much larger
enhances the search, will only be affected if
(see, e.g., Equation (
3) in [
13]).
Routine semen analysis in clinical practice relies on microscopic inspection of sperm characteristics and statistical comparison with fertile samples [
31,
32]. Contrary to common belief, fast motility alone is not necessarily an indicator of high fertilizing potential due to the complexity of the sperm search task and statistical variation among the large number of sperm present in an ejaculate [
33]. Efficient search likely requires a combination of motility patterns, including high-persistence-length movement for rapid dispersal and low-persistence-length movement for thorough local exploration. Our modeling suggests that a well-tuned combination of both modes may enhance search efficiency on restricted domains such as those found in the complex environment of the female reproductive tract, where sperm must balance wide-area coverage with detailed exploration to locate the egg.
The two-state random walkers (
Figure 2b and
Figure 3b,c) that we studied represent improvements over the regular random walk (
Figure 2a and
Figure 3a). In closing, it is instructive to look at our subject matter and results from the wider perspective of computation and problem solving in biological systems. Searches are also part of macroscopic everyday reality. When a search-and-rescue team searches for a lost or missing person in a large wilderness area, a common strategy is to divide the search area into smaller areas. These smaller areas are labeled, and then different teams or individuals are assigned to different areas in the grid to conduct detailed searches. Obviously, sperm cells cannot achieve such a level of organization and coordination.
The random walk with identical searchers with a constant persistence length (cf.
Figure 2a) is the most brute-force approach to the search problem. It is the Monte Carlo modus operandi and requires the mobilization of many searchers to achieve a non-negligible probability of fertilization. The random walk shown in
Figure 2b and
Figure 3b contains a transition to a smaller persistence length. We have shown that this feature leads to improved efficacy. Through the transition rate
k, this search involves a characteristic time
. So, the improvement is due to the usage of a very crude clock and the search being split up into a “before” and “after”.
Figure 3c shows a bar chart corresponding to a random-walk searcher that does not have a clock but instead uses a very crude GPS. The searcher only “knows” whether it is in Region 1 or 2, and switches the persistence length the first time it “sets foot” in Region 2.
Biologically, a well-tuned k or an effective partitioning into two regions would be an outcome of natural selection. This improvement over the pure Monte Carlo algorithm can be viewed as enhanced agency and as something “learned”—a genetic legacy built up after much past “trial and error”. Neither of the two improvements involves any sperm-to-sperm communication.
The analogy with a team on a search-and-rescue mission underscores an important point. Popular culture often views ejaculate as a swimming contest. However, it is more appropriate to view ejaculate as tissue. A heart pumps, an ear detects sound, and ejaculate searches. Successful ejaculate is ejaculate that yields a high probability that one sperm cell finds and fertilizes an egg cell. It is obvious that there is selective pressure toward more successful ejaculate.
Sperm vary significantly in form and function within ejaculates, among individuals, and within individuals over time [
34,
35,
36]. There is evidence that increased between-male competition for mates over time can drive apparent adaptations in sperm morphology and motility [
37]. Simple features such as swimming velocity do not necessarily enable strong predictions about fertility competence when the individuals being compared are from the same species [
38]. Additionally, the effect of sperm competition within an ejaculate remains unresolved [
39]. Ultimately, a better understanding of the relationship between sperm form, function, and the physiological demands of the search task will enable these questions to be answered.