D’Alembert’s Direct and Inertial Forces Acting on Populations: The Price Equation and the Fundamental Theorem of Natural Selection

Frank, Steven A.

doi:10.3390/e17107087

Department of Ecology & Evolutionary Biology, University of California, Irvine, CA 92697, USA

Entropy 2015, 17(10), 7087-7100; https://doi.org/10.3390/e17107087

Submission received: 4 September 2015 / Revised: 5 October 2015 / Accepted: 15 October 2015 / Published: 20 October 2015

(This article belongs to the Section Information Theory, Probability and Statistics)

Abstract

I develop a framework for interpreting the forces that act on any population described by frequencies. The conservation of total frequency, or total probability, shapes the characteristics of force. I begin with Fisher’s fundamental theorem of natural selection. That theorem partitions the total evolutionary change of a population into two components. The first component is the partial change caused by the direct force of natural selection, holding constant all aspects of the environment. The second component is the partial change caused by the changing environment. I demonstrate that Fisher’s partition of total change into the direct force of selection and the forces from the changing environmental frame of reference is identical to d’Alembert’s principle of mechanics, which separates the work done by the direct forces from the work done by the inertial forces associated with the changing frame of reference. In d’Alembert’s principle, there exist inertial forces from a change in the frame of reference that exactly balance the direct forces. I show that the conservation of total probability strongly shapes the form of the balance between the direct and inertial forces. I then use the strong results for conserved probability to obtain general results for the change in any system quantity, such as biological fitness or energy. Those general results derive from simple coordinate changes between frequencies and system quantities. Ultimately, d’Alembert’s separation of direct and inertial forces provides deep conceptual insight into the interpretation of forces and the unification of disparate fields of study.

Keywords: Hamiltonian dynamics; information geometry; population genetics; theoretical biology; theoretical physics

1. Introduction

The fundamental theorem of natural selection divides total evolutionary change into two components [1]. The first component is the partial change caused by the direct force of natural selection. The second component is the partial change caused by all other forces.

The theorem states that the change in fitness caused by the direct force of natural selection equals the genetic variance in fitness. We can interpret “genetic variance” to mean the component of variance associated with things that are transmitted through time. Natural selection is the force that changes the frequencies of those transmissible things.

Fisher wrote clearly about the distinction between the direct force of natural selection and the other evolutionary forces [1,2]. Yet much confusion followed in the history of the subject. Essentially all commentators considered only the total evolutionary change, rather than Fisher’s split into two partial components.

A correct interpretation of Fisher’s partial components eventually developed, starting with Price [3] and Ewens [4]. However, both of those authors concluded that Fisher’s split of total change into components provided little value.

In this article, I show that Fisher’s split of evolutionary change is equivalent to d’Alembert’s split of the general causes of dynamics into direct and inertial forces. d’Alembert’s principle is the foundation for essentially all of the key results of theoretical physics, starting with Newton’s laws and leading to the subsequent generalizations via Lagrangian and Hamiltonian mechanics.

Lanczos [5], in his great synthesis of the variational principles of mechanics, elevates d’Alembert’s principle to the key insight that ties together the whole subject. To Lanczos, the tremendous value of d’Alembert’s principle follows from the fact that it “focuses attention on the forces, not on the moving body …” In the same way, Fisher’s goal was to isolate and interpret the force of natural selection, rather than to emphasize the dynamics of total change.

The study and interpretation of force requires separating the action of a force from the frame of reference. A force affects change, and the measurement and interpretation of that change depends on the changing frame of reference of the system. To understand the force as distinct from the frame of reference, force and frame of reference must be separated.

That separation between force and frame of reference is exactly what Fisher did and was exactly how he discussed his analysis. I argue here that connecting Fisher’s theorem to d’Alembert’s principle will help to clarify the separation of direct force and frame of reference.

In Fisher’s analysis, he was vague about the mathematical form of the changes associated with the frame of reference. Here, by using the Price equation, I make explicit the connections between Fisher’s theorem and d’Alembert’s principle.

My argument follows three steps. First, I derive the general form of the Price equation. Second, I connect the Price equation to d’Alembert’s principle. Third, I discuss the fundamental theorem of natural selection in the context of d’Alembert’s separation of the direct forces and the inertial forces associated with the changing frame of reference. By d’Alembert’s separation, we obtain a partition of total evolutionary change in fitness into the change by the direct force of natural selection and the change by the inertial forces of the changing environmental frame of reference.

The analysis is much more general and powerful than a theorem limited to natural selection. Instead, we find a broad analysis of the dynamics of any population or aggregation that can be characterized by frequencies. The conservation of total frequency, or total probability, establishes a symmetry that defines many of the characteristics of aggregate dynamics. Those characteristics of aggregate dynamics apply to natural selection, to many problems in mechanics, and to any analysis of the changes in probability distributions.

2. The Price Equation

The Price equation [6,7] describes the change in an average value obtained over some aggregation or population. Each component of the population has a weighting, q, and a value, z. Begin with a discrete analog of the product rule for differentiation

\begin{aligned} Δ (q z) &= (q + Δ q) (z + Δ z) - q z \\ &= (Δ q) z + (q + Δ q) Δ z \\ &= (Δ q) z + q^{'} Δ z \end{aligned}

in which $q^{'} = q + Δ q$ and $z^{'} = z + Δ z$. The same rule can be applied to vectors. By using dot product notation, we obtain an abstract form of the Price equation [7,8,9]

Δ (q \cdot z) = Δ q \cdot z + q^{'} \cdot Δ z

(1)

in which a dot product is understood in the usual way as $q \cdot z = \sum q_{i} z_{i}$.

This equation can be interpreted in various ways. For our purposes, we can take $q_{i}$ to be the frequency associated with a subset, i, of the initial population, such that the total frequency is $\sum q_{i} = 1$. Thus, $\bar{z} = \sum q_{i} z_{i}$ is the average of z, in which $z_{i}$ is a function that maps i to some value. Similarly, we have a second population, with frequencies $q_{i}^{'}$ and values $z_{i}^{'}$, in which $\sum q_{i}^{'} = 1$.

One can use various rules for the relations between $q_{i}$ and $q_{i}^{'}$ and between $z_{i}$ and $z_{i}^{'}$, allowing a wide variety of different perspectives on the transformations that relate the two populations [7]. For our purposes, we can operate abstractly and not worry about the particular rules. Our only restriction is that we can map the index i between the two populations.
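As a numerical illustration of the exact Price equation, Equation (1), the following minimal sketch splits the change in an average into the frequency-change term and the value-change term. The function name `price_terms` and all numbers are hypothetical, chosen only for illustration; they do not come from the article.

```python
def dot(a, b):
    """Dot product of two equal-length sequences."""
    return sum(x * y for x, y in zip(a, b))


def price_terms(q, z, q_new, z_new):
    """Return (total change, dq . z, q' . dz) for the exact Price equation."""
    dq = [b - a for a, b in zip(q, q_new)]
    dz = [b - a for a, b in zip(z, z_new)]
    total = dot(q_new, z_new) - dot(q, z)
    return total, dot(dq, z), dot(q_new, dz)


# Hypothetical frequencies and values for two populations (illustration only).
q = [0.5, 0.3, 0.2]
z = [1.0, 2.0, 4.0]
q_new = [0.4, 0.35, 0.25]
z_new = [1.5, 2.0, 3.0]

total, freq_term, value_term = price_terms(q, z, q_new, z_new)

# The two partial terms add up to the total change, with no approximation.
assert abs(total - (freq_term + value_term)) < 1e-12
```

The identity holds for changes of any size, because the second term uses the new frequencies, q', rather than the old ones.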

3. Fitness as a Change in Frequency

The function $z_{i}$ can map subset i to any value. When studying frequency changes, let us rename the variable as $m \equiv z$, and choose

m_{i} = \log \frac{q_{i}^{'}}{q_{i}}

to describe the ratio of frequencies between the two populations associated with i. We can think of $m_{i}$ as a growth rate, or as a kind of force that moves the system from $q_{i}$ to $q_{i}^{'}$. In particular, the above expression is equivalent to exponential growth driven by $m_{i}$ as

q_{i}^{'} = q_{i} e^{m_{i}}

We may call $m_{i}$ fitness, because it expresses the relative growth of the weighting associated with i. The term $m_{i}$ is, in effect, a growth rate relative to an unspecified underlying scale of change. We can take $m_{i}$ as a given force of growth and derive $q_{i}^{'}$, or we can take the outcome $q_{i}^{'}$ as given, and derive the effective force, $m_{i}$, that is consistent with the outcome.

If we thought of i as a particular individual or a particular type, then $m_{i}$ would express the growth rate associated with that individual or type between the two populations. However, the equations allow us simply to make the definition that relates $q_{i}$ to $q_{i}^{'}$, and not restrict ourselves to a particular interpretation of what i means in those terms.

I confine my analysis to small differences, $Δ q_{i} \to d q_{i} \equiv {\dot{q}}_{i}$, in which ${\dot{q}}_{i} = q_{i}^{'} - q_{i}$ is small. For small differences we have (see Methods for assumptions)

m_{i} = \frac{{\dot{q}}_{i}}{q_{i}}

Using this definition and the substitution $m_{i} \equiv z_{i}$ in the Price equation (Equation (1) from the prior section), we obtain a general expression for the total change in fitness as

\dot{\bar{m}} = \dot{q} \cdot m + q \cdot \dot{m}

in which we ignore the second-order term $\dot{q} \cdot \dot{m}$ in this description of small changes, with $Δ z \to d z \equiv \dot{m}$.
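The relation between the exact logarithmic fitness and its small-change form can be checked numerically. The sketch below, with hypothetical frequencies and a hypothetical displacement direction, confirms that the exponential form recovers the new frequencies and that the gap between log(q'/q) and Δq/q shrinks roughly with the square of the step size.

```python
import math

# Hypothetical starting frequencies and a displacement direction summing to zero.
q = [0.5, 0.3, 0.2]
direction = [-0.1, 0.05, 0.05]


def max_gap(scale):
    """Largest difference between log(q'/q) and (q' - q)/q for a given step size."""
    q_new = [a + scale * d for a, d in zip(q, direction)]
    m_log = [math.log(b / a) for a, b in zip(q, q_new)]
    m_ratio = [(b - a) / a for a, b in zip(q, q_new)]
    # Round trip: exponential growth driven by m recovers the new frequencies.
    recovered = [a * math.exp(m) for a, m in zip(q, m_log)]
    assert all(abs(r - b) < 1e-12 for r, b in zip(recovered, q_new))
    return max(abs(x - y) for x, y in zip(m_log, m_ratio))


gap_large = max_gap(1e-1)
gap_small = max_gap(1e-2)

# A tenfold smaller step gives a gap that is smaller by roughly a hundredfold.
print(gap_large / gap_small)
```

This quadratic shrinkage is what one expects from the expansion log(1 + x) = x - x²/2 + …, which is the sense in which second-order terms are dropped.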

4. Conservation of Total Probability, Entropy Momentum, and Fisher Information

With the definition of fitness as a growth rate, $m_{i} = {\dot{q}}_{i} / q_{i}$, average fitness is

\bar{m} = q \cdot m = \sum {\dot{q}}_{i} = 0

This equation expresses the conservation of total probability or total frequency. It follows that the change in average fitness, $\dot{\bar{m}}$, must also be zero

\dot{\bar{m}} = \dot{q} \cdot m + q \cdot \dot{m} = 0

(2)

The term $\dot{q} \cdot m$ has a wide variety of interpretations related to information theory and classical mechanics. For example, this term expresses entropy momentum or Fisher information [10,11], as

\dot{q} \cdot m = \sum {\dot{q}}_{i} \dot{\log q_{i}} = \sum \frac{{\dot{q}}_{i}^{2}}{q_{i}}

The term $m_{i} = \dot{\log q_{i}} = \log (q_{i}^{'} / q_{i})$ is the change in entropy in each dimension, i, describing an entropy velocity or nondimensional entropy momentum relative to an unspecified underlying scale of change. Thus, $\dot{q} \cdot m$ may be interpreted as the gain in entropy momentum, which must be balanced by the loss of entropy momentum in the second term, $q \cdot \dot{m}$, to achieve overall conservation, $\dot{\bar{m}} = 0$.
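A short numerical check of these two statements may help. The sketch below uses hypothetical frequencies and a hypothetical small displacement that sums to zero, and confirms that average fitness vanishes while the first term of Equation (2) equals the sum of squared displacements divided by frequency.

```python
def dot(a, b):
    """Dot product of two equal-length sequences."""
    return sum(x * y for x, y in zip(a, b))


# Hypothetical frequencies and a small displacement that conserves total frequency.
q = [0.5, 0.3, 0.2]
dq = [-0.002, 0.0005, 0.0015]

# Fitness in the small-change form, m_i = dq_i / q_i.
m = [d / a for d, a in zip(dq, q)]

mean_fitness = dot(q, m)                        # equals sum(dq), which is zero
direct_term = dot(dq, m)                        # first term of Equation (2)
fisher = sum(d * d / a for d, a in zip(dq, q))  # sum of dq_i^2 / q_i

assert abs(mean_fitness) < 1e-15
assert abs(direct_term - fisher) < 1e-15
```

By Equation (2), the second term, q · ṁ, must then equal the negative of `direct_term` to first order, whatever process produced the displacement.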

Note that I have used $- \log q_{i}$ as the entropy in each dimension, consistent with the information theory concept of self-information or surprise as $- \log q_{i}$. That definition leads to system entropy as the expectation over the different dimensions, $- \sum q_{i} \log q_{i}$. Some people prefer to define the entropy in each dimension as $- q_{i} \log q_{i}$, and system entropy as the sum over each dimension, in which case my usage of entropy or information momentum does not make sense.

The term $\sum {\dot{q}}_{i}^{2} / q_{i}$ is widely used as the Fisher information metric, particularly in the study of information geometry [11]. Thus, the first term in $\dot{\bar{m}} = 0$ is the gain in Fisher information, and the second term is an exact balancing loss in Fisher information. The balance leads to an overall conservation of Fisher information, as emphasized by Frieden [10].

We have transcended our original formulation of biological fitness in these descriptions of probability, information, and entropy. The expressions here apply to any problem that can be expressed in terms of changing frequencies in populations or aggregates, subject to the conservation of total frequency.

5. d’Alembert’s Principle

We may write d’Alembert’s principle [5] as

(F + I) \dot{q} = 0

Here, all terms are vectors, and the implicit dot product with $\dot{q}$ distributes over the parentheses. The vector $q$ locates the system, and $\dot{q}$ is a virtual displacement of the system from its current location to a nearby location. A virtual displacement is like an imaginary displacement, in which the system is held fixed in its current state, and then one moves its location without changing anything else. All forces and the frame of reference for measurement are held constant [5].

A virtual displacement must be consistent with all forces of constraint. In our case, the primary force of constraint on a virtual displacement, $\dot{q}$, is that the sum of the frequencies is one. Thus, $\sum {\dot{q}}_{i} = 0$ expresses the force of constraint set by the conservation of total frequency or probability. Because a virtual displacement must be consistent with the forces of constraint, we need only analyze those forces that are in addition to the forces of constraint. In particular, we need to track the direct forces, $F$, and inertial forces, $I$.

The term $F$ is the vector of direct forces acting on the system, and the term $I$ is the vector of inertial forces that balance the direct forces to achieve no net change. d’Alembert’s principle can be thought of as a generalization of Newton’s second law of motion [5], in which $\tilde{F} = μ \tilde{A}$ is read as the total force, $\tilde{F}$, equals mass, μ, times total acceleration, $\tilde{A}$. Total force and total acceleration must include forces of constraint. If we write total inertial force as $\tilde{I} = - μ \tilde{A}$, then Newton’s law is $\tilde{F} + \tilde{I} = 0$.

When we study an actual system, we are usually interested in how the direct, or applied, forces influence dynamics. To do that, we need to separate the direct forces from the constraining forces. For example, in studying the frequency dynamics and evolutionary change caused by natural selection, we usually wish to analyze the direct force of growth rate, or fitness, separately from the force of constraint imposed by the conservation of total probability.

In d’Alembert’s formulation, the direct and inertial forces typically do not sum to zero, $F + I \neq 0$, because those terms do not include the constraining forces. Instead, in d’Alembert’s expression $(F + I) \dot{q} = 0$, the term $\dot{q} \cdot F$ combines the direct and constraining forces, and the term $\dot{q} \cdot I$ combines all inertial forces, including any forces of constraint. Newton’s law is a special case of the more general principle of d’Alembert [5].

6. Interpretation of d’Alembert’s Principle

Here is a simple intuitive description of d’Alembert’s principle [12]. You are sitting in a car at rest, and the car suddenly accelerates. You feel thrown back into the seat. But, even as the car gains speed, you effectively do not move in relation to the frame of reference of the car: your velocity relative to the car remains zero. That net zero velocity can be thought of as the balance between the direct force of the seat pushing on you and the inertial force sending you back as the car accelerates forward.

As long as your frame of reference moves with you, then your net motion in your frame of reference is zero. Put another way, there is always a changing frame of reference that zeroes net change by balancing the work of direct forces on a system against the work of a balancing inertial force. Although the system is a dynamic expression of changing components, it also has an overall static, equilibrium quality that aids analysis. As Lanczos [5] emphasizes, d’Alembert’s principle “focuses attention on the forces, not on the moving body …”

7. d’Alembert and the Conservation of Total Probability

This section transforms the conservation of total probability expressed by Equation (2) into a form of d’Alembert’s principle. We first note that (see Methods for $\dot{\log m}$ notation)

q \cdot \dot{m} = (\frac{q}{\dot{q}} ⊙ \dot{m}) \dot{q} = (\frac{\dot{m}}{m}) \dot{q} = \dot{\log m} \cdot \dot{q}

The symbol “⊙” denotes element-wise multiplication of vectors, the ratio denotes element-wise division, and dot products distribute over parentheses. With this expression, we can rewrite our general result in Equation (2) for the conservation of total probability, or the change in fitness, in the general form of d’Alembert, $(F + I) \dot{q} = 0$, as

(m + \dot{\log m}) \dot{q} = 0

(3)

We equate this expression with d’Alembert by interpreting $m \equiv F$ as the force of growth, or fitness, or, more generally, the direct forces acting on frequency change. We interpret $\dot{\log m} \equiv I$ as the inertial forces, which typically are described in terms of acceleration with respect to the frame of reference.

8. Direct and Inertial Forces

The expression in Equation (3) describes d’Alembert’s principle for systems that follow conservation of total probability. This section considers how we should interpret $(F + I) \dot{q} = 0$ for the direct and inertial forces in terms of Newtonian concepts of force and acceleration.

The dot product expression in Equation (3) can be written as a sum over the individual dimensions of the system

(m + \dot{\log m}) \dot{q} = \sum (m_{i} + \dot{\log m_{i}}) {\dot{q}}_{i}

The first term on each side, $\dot{q} \cdot m \equiv \dot{q} \cdot F$, is the virtual displacement times the direct force. We may call this term the virtual work of the direct forces, because physical work is displacement times force. We can write this component of virtual work solely in terms of frequencies from our prior definition of $m_{i} = {\dot{q}}_{i} / q_{i}$.

The second term on each side, $\dot{q} \cdot \dot{\log m} \equiv \dot{q} \cdot I$, is the virtual work of the inertial forces. To interpret the inertial forces with respect to acceleration, it is useful to express $\dot{\log m}$ as

\dot{\log m_{i}} = \frac{{\ddot{q}}_{i}}{{\dot{q}}_{i}} - \frac{{\dot{q}}_{i}}{q_{i}}

(4)

The term ${\ddot{q}}_{i}$ is the second order infinitesimal change, or acceleration. Thus, $I \equiv \dot{\log m}$ expresses how the changing frame of reference, arising from changed frequencies, leads to inertial forces that are accelerations.

We can now write d’Alembert’s principle under the conservation of total probability solely in terms of the probabilities, or frequencies, as

(m + \dot{\log m}) \dot{q} = \sum (\frac{{\dot{q}}_{i}}{q_{i}} + \frac{{\ddot{q}}_{i}}{{\dot{q}}_{i}} - \frac{{\dot{q}}_{i}}{q_{i}}) {\dot{q}}_{i} = 0

(5)

Distributing the virtual displacement, ${\dot{q}}_{i}$, across the parentheses in the sum and splitting the sum into direct and inertial components yields

\sum \frac{{\dot{q}}_{i}^{2}}{q_{i}} + \sum ({\ddot{q}}_{i} - \frac{{\dot{q}}_{i}^{2}}{q_{i}}) = \sum {\ddot{q}}_{i} = 0

(6)

The sum of ${\ddot{q}}_{i}$ is zero because $\sum {\dot{q}}_{i} = 0$ by conservation of total probability, and thus the accelerations, ${\ddot{q}}_{i}$, also sum to zero. However, in a particular dimension, there may be an imbalance between direct and inertial force, ${\ddot{q}}_{i}$. That imbalance arises because the force of constraint on total probability differs across dimensions.
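To make the balance in Equation (6) concrete, the sketch below evaluates the direct and inertial sums along one smooth path of frequencies. The path itself is an assumption made only for illustration: each frequency is taken proportional to w_i exp(a_i t), with hypothetical weights w and rates a, and the dots are read as time derivatives along that path. The derivatives are written in closed form for this particular path.

```python
import math


def path_derivatives(w, a, t):
    """Frequencies q and their first and second time derivatives on the
    assumed path q_i(t) proportional to w_i * exp(a_i * t)."""
    weights = [wi * math.exp(ai * t) for wi, ai in zip(w, a)]
    total = sum(weights)
    q = [x / total for x in weights]
    a_bar = sum(qi * ai for qi, ai in zip(q, a))
    var_a = sum(qi * (ai - a_bar) ** 2 for qi, ai in zip(q, a))
    # For this path: dq_i = q_i (a_i - a_bar), ddq_i = q_i ((a_i - a_bar)^2 - var_a).
    dq = [qi * (ai - a_bar) for qi, ai in zip(q, a)]
    ddq = [qi * ((ai - a_bar) ** 2 - var_a) for qi, ai in zip(q, a)]
    return q, dq, ddq


# Hypothetical weights, rates, and time point.
q, dq, ddq = path_derivatives(w=[0.5, 0.3, 0.2], a=[0.1, -0.2, 0.4], t=0.7)

# Direct and inertial sums from Equation (6).
direct = sum(d * d / x for d, x in zip(dq, q))
inertial = sum(dd - d * d / x for dd, d, x in zip(ddq, dq, q))

assert abs(sum(dq)) < 1e-12          # total frequency is conserved
assert abs(sum(ddq)) < 1e-12         # accelerations sum to zero
assert abs(direct + inertial) < 1e-12
```

The individual entries of `ddq` are generally nonzero, which illustrates the point that the balance holds for the total but not separately in each dimension.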

9. Unitary Coordinates and Path Lengths

From Equations (5) and (6), we may express d’Alembert’s balance between the total direct and inertial components as

(m + \dot{\log m}) \dot{q} = \sum \frac{{\dot{q}}_{i}^{2}}{q_{i}} - \sum \frac{{\dot{q}}_{i}^{2}}{q_{i}} = 0

(7)

The $\sum {\dot{q}}_{i}^{2} / q_{i}$ terms can be understood as distances by considering the curvature caused by the constraining force of the conservation of total probability. To get a proper sense of distance in that curved geometric space, we need to change the coordinates.

Let the new coordinates be $r = \sqrt{q}$. Then the total Euclidean length of the vector $r$ is the square root of the sum of squares in each dimension, which is

|r| = \sqrt{\sum r_{i}^{2}} = \sqrt{\sum q_{i}} = 1

Vector lengths in the new coordinates are always one, which provides a pure expression of the conservation of total probability. In general, the $q$ may be arbitrary weightings, such that $\sum q_{i}$ is conserved, and thus $\sum {\dot{q}}_{i} = 0$. Here, I focus on conserved probability, in which the $q_{i}$ are positive and sum to one.

The path lengths of motion take on simple interpretations in terms of distance in the unitary coordinates. The transformed coordinates yield

\sum \frac{{\dot{q}}_{i}^{2}}{q_{i}} = 4 \sum {\dot{r}}_{i}^{2}

which shows the simple Euclidean interpretation of squared distance in the $r$ coordinates as a sum of squared differences. This expression of distance is also equivalent to the Fisher information metric [10,11]. However, geometry is perhaps more fundamental than information, because the distance arises inevitably from curvature of paths caused by analyzing probability displacement subject to unitary conservation of total probability.
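The change to square-root coordinates can be checked with a small finite displacement. In the sketch below the frequencies and displacement are hypothetical; the two distance expressions agree only approximately because the displacement is finite rather than infinitesimal.

```python
import math

# Hypothetical frequencies and a small displacement that sums to zero.
q = [0.5, 0.3, 0.2]
dq = [-2e-4, 0.5e-4, 1.5e-4]
q_new = [a + d for a, d in zip(q, dq)]

# Unitary coordinates r = sqrt(q) before and after the displacement.
r = [math.sqrt(x) for x in q]
r_new = [math.sqrt(x) for x in q_new]

# The vector r has unit Euclidean length because the frequencies sum to one.
length = math.sqrt(sum(x * x for x in r))

# Squared distance in frequency coordinates versus square-root coordinates.
fisher = sum(d * d / x for d, x in zip(dq, q))
euclid = 4 * sum((b - a) ** 2 for a, b in zip(r, r_new))
rel_gap = abs(fisher - euclid) / fisher

assert abs(length - 1.0) < 1e-12
assert rel_gap < 1e-2
```

Shrinking `dq` further reduces `rel_gap`, consistent with the equality holding exactly in the limit of small changes.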

10. Geometry

This section briefly reviews the geometry of frequency change dynamics that follow from two assumptions. The first assumption is that direct force, $m_{i}$, causes exponential growth

q_{i}^{'} = q_{i} e^{m_{i}}

This growth expression establishes a natural logarithmic scaling for comparing frequencies, because

m_{i} = \log \frac{q_{i}^{'}}{q_{i}}

When changes are small, $m_{i} = \dot{\log q_{i}} = {\dot{q}}_{i} / q_{i}$. We could interpret those changes with respect to $\log q_{i}$ as entropy or information. But the geometry of force and growth may be a better way to think about the fundamental nature of these expressions.

The second assumption is that total frequency or probability is conserved, $\sum {\dot{q}}_{i} = 0$. That conservation imposes a constraint on paths of change. The constraint may be expressed by the geometry of the unitary coordinates, $r = \sqrt{q}$, which yields a conserved length $|r| = 1$. The path lengths for virtual displacements times direct or inertial forces are $\sum {\dot{q}}_{i}^{2} / q_{i} = 4 \sum {\dot{r}}_{i}^{2}$. The essential geometry arising from growth and from conservation of total probability sets the form of the distances.

11. Canonical Coordinates and Conservation in Each Dimension

Hamiltonian expressions in canonical coordinates often provide the deepest insight into the symmetries of a system [13]. To obtain the Hamiltonian, the use of $r = \sqrt{q}$ coordinates was a first step, because we can rewrite d’Alembert’s principle in Equation (7) as

\frac{1}{4} (m + \dot{\log m}) \dot{q} = \sum {\dot{r}}_{i}^{2} - \sum {\dot{r}}_{i}^{2} = 0

However, the net balance only applies to the total system rather than separately in each dimension. If we can find the proper canonical coordinates, then the forces of constraint will appear independently in each dimension, and the balance of direct and inertial forces will also appear independently in each dimension.

In a Hamiltonian formulation, we assign two values to each component, usually considered as position and momentum [13]. In our nondimensional system, our primary factor is the conservation of total probability, which we express through the unitary coordinates $r = \sqrt{q}$, such that the length of $r$ is always one

{|r|}^{2} = \sqrt{q} \cdot \sqrt{q} = 1

If, for each point, we take $r_{i} = \sqrt{q_{i}}$ for position and $p_{i} = \sqrt{q_{i}}$ for momentum, then $r \cdot p = 1$, and the conserved Hamiltonian is

H = \dot{r} \cdot p - r \cdot \dot{p} = 0

This expression satisfies the requirements for Hamiltonian canonical coordinates of position and momentum, which are that $\partial H / \partial r_{i} = - {\dot{p}}_{i}$ and $\partial H / \partial p_{i} = {\dot{r}}_{i}$. The differential of the Hamiltonian often provides a useful expression

\dot{H} = \ddot{r} \cdot p - r \cdot \ddot{p} = 0

(8)

which, in each separate dimension, is zero

{\dot{H}}_{i} = {\ddot{r}}_{i} p_{i} - r_{i} {\ddot{p}}_{i} = 0

(9)

because $r_{i} = p_{i} = \sqrt{q_{i}}$, and

{\ddot{r}}_{i} = {\ddot{p}}_{i} = \frac{1}{2 \sqrt{q_{i}}} ({\ddot{q}}_{i} - \frac{{\dot{q}}_{i}^{2}}{2 q_{i}})

Thus, we can write the differential of the Hamiltonian in each dimension as

4 {\dot{H}}_{i} = (\frac{{\dot{q}}_{i}^{2}}{q_{i}} - 2 {\ddot{q}}_{i}) - (\frac{{\dot{q}}_{i}^{2}}{q_{i}} - 2 {\ddot{q}}_{i}) = 0

Here, the curvature from the force of constraint is divided into equal and opposite contributions in the direct and inertial force components, recovering a Newtonian ${\tilde{F}}_{i} - μ {\tilde{A}}_{i} = 0$ perspective independently in each dimension.
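For readers who want the intermediate steps, the second derivative of the square-root coordinate follows from two applications of the chain and product rules. This is a routine calculus sketch using only the definitions above, with the dots read as derivatives along the path of change.

```latex
\begin{aligned}
r_i &= \sqrt{q_i}, \\
\dot{r}_i &= \frac{\dot{q}_i}{2\sqrt{q_i}}, \\
\ddot{r}_i &= \frac{\ddot{q}_i}{2\sqrt{q_i}} - \frac{\dot{q}_i^{2}}{4\,q_i^{3/2}}
            = \frac{1}{2\sqrt{q_i}}\left(\ddot{q}_i - \frac{\dot{q}_i^{2}}{2 q_i}\right), \\
4\,\ddot{r}_i\,p_i &= 2\,\ddot{q}_i - \frac{\dot{q}_i^{2}}{q_i}
\quad\text{with } p_i = \sqrt{q_i}.
\end{aligned}
```

Because $p_i = r_i$, the product $4\,r_i\,\ddot{p}_i$ takes the same value, so the two contributions to $4\dot{H}_i$ are identical in magnitude and cancel in each dimension.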

We can rewrite Equation (8) as a d’Alembert’s principle expression

\dot{H} = (p ⊙ \dot{\log} \dot{r} - r ⊙ \dot{\log} \dot{p}) \dot{r} = 0

for virtual displacement $\dot{r}$, direct force $F = - p ⊙ \dot{\log} \dot{r}$, and inertial force $I = r ⊙ \dot{\log} \dot{p}$. The symbol “⊙” denotes element-wise multiplication of vectors, and dot products distribute over parentheses. Thus, $\dot{H} = (F + I) \dot{r} = 0$, with the Newtonian equality $F_{i} + I_{i} = 0$ satisfied in each dimension.

12. Coordinates for Quantities Correlated with Force

We can analyze any quantitative system property by transforming coordinates. We start with the general results for the conservation of total probability and information momentum, $\dot{\bar{m}} = 0$. We then obtain an expression for the change in the system quantity, $\dot{\bar{z}}$, by the change in coordinates $(m, \dot{m}) \mapsto (z, \dot{z})$, in which the different coordinates now have an arbitrary relation rather than the earlier equivalence. That change in coordinates generalizes the $\dot{\bar{m}}$ form of the Price equation (Equation (2)), to give the change in the average value of z as

\dot{\bar{z}} = \dot{q} \cdot z + q \cdot \dot{z}

The $z_{i}$ values are the averages of z in each dimension, i. Because z can be any quantity, calculated in any way, this equation gives the most general expression for $\dot{\bar{z}}$, the change in the average of z. One can think of $\bar{z} = \sum q_{i} z_{i}$ as a functional of the arbitrary function, z, that maps $i \mapsto z_{i}$. The only restriction on the expression for $\dot{\bar{z}}$ shown here is that changes be small. For large changes, the exact form of the Price equation in Equation (1) should be used.

We can relate $\dot{\bar{m}}$ to $\dot{\bar{z}}$ by writing the change in coordinates, $m \mapsto z$ and $\dot{m} \mapsto \dot{z}$, as the regression equations

\begin{aligned} z &= β_{z m} m + ϵ \\ \dot{z} &= β_{\dot{z} \dot{m}} \dot{m} + γ \end{aligned}

in which the regression coefficients, β, are obtained by minimizing the length of the “error” vector. To analyze the length of the error vector, we can use standard identities from the theory of least squares for regression [14].

In particular, the first regression equation follows from choosing $β_{z m}$ to minimize ${|ϵ_{q}|}^{2} = \sum q_{i} ϵ_{i}^{2}$, in which $ϵ_{q} = \sqrt{q_{i}} ϵ_{i}$ denotes a $\sqrt{q}$ weighted vector. Choosing $β_{z m}$ to minimize the length of $ϵ_{q}$ leads to $m_{q} \cdot ϵ_{q} = 0$, because the minimum length of $ϵ_{q}$ occurs when that vector is orthogonal to $m_{q}$. Note that ${\dot{q}}_{i} = q_{i} m_{i}$, thus

\dot{q} \cdot ϵ = \sum q_{i} m_{i} ϵ_{i} = m_{q} \cdot ϵ_{q} = 0

In the equation for $\dot{z}$, minimizing ${|γ_{q}|}^{2}$ sets $β_{\dot{z} \dot{m}}$. We also have, by standard theory, $q \cdot γ = 0$.

Using these identities,

\dot{q} \cdot z = β_{z m} \dot{q} \cdot m + \dot{q} \cdot ϵ = β_{z m} \dot{q} \cdot m

(10)

q \cdot \dot{z} = β_{\dot{z} \dot{m}} q \cdot \dot{m} + q \cdot γ = β_{\dot{z} \dot{m}} q \cdot \dot{m}

(11)

from which we obtain the change $\dot{\bar{z}}$ in terms of the original coordinates for $\dot{\bar{m}}$ as

\dot{\bar{z}} = β_{z m} \dot{q} \cdot m + β_{\dot{z} \dot{m}} q \cdot \dot{m} = (β_{z m} - β_{\dot{z} \dot{m}}) \dot{q} \cdot m

(12)

the right expression arising from the fact that $\dot{q} \cdot m + q \cdot \dot{m} = 0$. The total change, $\dot{\bar{z}}$, is split into the virtual work term, $β_{z m} \dot{q} \cdot m$, and the inertial force term, $β_{\dot{z} \dot{m}} q \cdot \dot{m}$. The regression coefficients rescale coordinates $(m, \dot{m}) \mapsto (z, \dot{z})$.
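The first regression identity, Equation (10), can be illustrated numerically. The sketch below assumes the regression of z on m is fitted without a separate intercept, exactly as the equation z = β m + ϵ is written, with squared errors weighted by q. The trait values z are hypothetical, and `weighted_slope` is an illustrative helper name rather than anything defined in the article.

```python
def dot(a, b):
    """Dot product of two equal-length sequences."""
    return sum(x * y for x, y in zip(a, b))


def weighted_slope(q, m, z):
    """Slope beta minimizing sum_i q_i * (z_i - beta * m_i)^2 (no intercept term)."""
    num = sum(qi * mi * zi for qi, mi, zi in zip(q, m, z))
    den = sum(qi * mi * mi for qi, mi in zip(q, m))
    return num / den


# Hypothetical frequencies, small displacement (sums to zero), and trait values.
q = [0.5, 0.3, 0.2]
dq = [-0.002, 0.0005, 0.0015]
z = [1.0, 2.0, 4.0]

m = [d / a for d, a in zip(dq, q)]      # fitness, m_i = dq_i / q_i
beta = weighted_slope(q, m, z)
eps = [zi - beta * mi for zi, mi in zip(z, m)]

# Residuals are orthogonal to the displacement, so dq . z = beta * dq . m.
assert abs(dot(dq, eps)) < 1e-12
assert abs(dot(dq, z) - beta * dot(dq, m)) < 1e-12
```

Because average fitness is zero, adding an intercept to the fit would leave the slope unchanged; only the residuals would shift by a constant, which is itself orthogonal to the displacement.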

If $\bar{z}$ is a conserved quantity, or the system is at an equilibrium with respect to $\bar{z}$, then $\dot{\bar{z}} = 0$. We can write a d’Alembert form

\dot{\bar{z}} = (β_{z m} m - β_{\dot{z} \dot{m}} m) \dot{q} = 0

which, when $\dot{q} \cdot m \neq 0$, implies $β_{z m} = β_{\dot{z} \dot{m}}$, and the d’Alembert equality holds separately in each dimension. In this case, the dynamics of $z$ are influenced by both the conservation of probability and by additional constraints set by the conservation of $\bar{z}$. We may, of course, choose the changing reference frame, $\dot{z}$, such that $\dot{\bar{z}} \neq 0$, in which case the direct and inertial forces do not completely balance.

13. The Fundamental Theorem

We may set $β_{\dot{z} \dot{m}} = 0$, either because the changing value of $\bar{z}$ is unaffected by the changing reference frame, or because the effects of the changing reference frame are ignored by assumption. We then have an expression for the partial change caused by the direct forces, holding constant the frame of reference

{\dot{\bar{z}}}_{s} = \dot{q} \cdot z = β_{z m} \dot{q} \cdot m

in which the s subscript emphasizes that this is a partial change ascribed to the direct forces, or the forces of selection. This form includes, as special cases, Fisher’s fundamental theorem of natural selection, the breeder’s equation of genetics, and other common expressions for the change in populations caused by natural selection.

Note that $\dot{q} \cdot m = V_{m}$, the variance of $m$, because

\dot{q} \cdot m = \sum {\dot{q}}_{i} m_{i} = \sum q_{i} (\frac{{\dot{q}}_{i}}{q_{i}}) m_{i} = \sum q_{i} m_{i}^{2}

which is the variance of $m$, because $\bar{m} = 0$.

If we take $z = m$ in order to study the change in fitness caused by the direct forces, then ${\dot{\bar{m}}}_{s} = V_{m}$: the change in mean fitness caused by selection, ${\dot{\bar{m}}}_{s}$, is the variance in fitness, $V_{m}$. Fisher was interested in the transmissible change in $\bar{m}$ associated with genetic factors, $g$, thus he partitioned fitness as $m = g + δ$. Here, the genetic factors are partial regressions associated with particular genes, such that $g$ is chosen to maximize the amount of the total variance in fitness, $V_{m}$, associated with the transmissible genes [4,9,15,16]. The δ terms are residuals in the regression, such that one gets the additive partition of total variance from classical regression theory as $V_{m} = V_{g} + V_{δ}$.
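The additive partition of variance can be illustrated with a deliberately simple stand-in for the genetic regression. The sketch below assumes a single hypothetical predictor x (for example, the count of one allele carried by each type) and a frequency-weighted least-squares line with an intercept. Fisher's partial regressions over many genes are more general, so this is only a one-predictor illustration, and all names and numbers are hypothetical.

```python
def weighted_mean(q, x):
    """Frequency-weighted mean of x."""
    return sum(qi * xi for qi, xi in zip(q, x))


def weighted_var(q, x):
    """Frequency-weighted variance of x."""
    mu = weighted_mean(q, x)
    return sum(qi * (xi - mu) ** 2 for qi, xi in zip(q, x))


def fitted_values(q, x, y):
    """Fitted values of the weighted least-squares line y ~ a + b * x."""
    x_bar = weighted_mean(q, x)
    y_bar = weighted_mean(q, y)
    cov = sum(qi * (xi - x_bar) * (yi - y_bar) for qi, xi, yi in zip(q, x, y))
    slope = cov / weighted_var(q, x)
    return [y_bar + slope * (xi - x_bar) for xi in x]


# Hypothetical frequencies, small displacement (sums to zero), and predictor.
q = [0.4, 0.3, 0.2, 0.1]
dq = [-0.002, 0.0005, 0.0005, 0.001]
x = [0.0, 1.0, 1.0, 2.0]

m = [d / a for d, a in zip(dq, q)]              # fitness, m_i = dq_i / q_i
g = fitted_values(q, x, m)                      # part of fitness predicted by x
delta = [mi - gi for mi, gi in zip(m, g)]       # residual part

V_m = weighted_var(q, m)
V_g = weighted_var(q, g)
V_delta = weighted_var(q, delta)

assert abs(V_m - (V_g + V_delta)) < 1e-9 * V_m
```

The partition is exact because least-squares residuals are uncorrelated with the fitted values under the same weights.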

The change in fitness caused by the direct forces can now be written as

{\dot{\bar{m}}}_{s} = V_{g} + V_{δ}

and thus the transmissible change in fitness caused by natural selection and associated with genetic factors is

{\dot{\bar{m}}}_{s | g} = V_{g}

in which

V_{g}

is the variance in the transmissible effects of the genetic factors on fitness, or the genetic variance in fitness. That partial change in fitness caused by direct forces and associated with transmissible factors is what Fisher emphasized in his fundamental theorem of natural selection. By defining the genetic factors,

g

, as the only direct forces of interest, the residual forces of selection, δ, are added to the other inertial forces that define the changing frame of reference.

In models of evolutionary change, Fisher chose to ascribe the direct force of change associated with $g$ to natural selection, and all other forces to the inertial frame that he called environmental causes. That d’Alembert interpretation of the split between direct and inertial forces provides a clear way in which to understand Fisher’s fundamental theorem of natural selection. There is, of course, an arbitrary aspect to such a partition, because the split between direct and inertial forces depends entirely on how one chooses to define the frames of reference. For example, a change in how one defines the set of potentially transmissible factors, $g$, alters how one splits forces between direct and inertial components [15].

14. Conclusions

The fundamental equations for change are identical between many laws of physics and evolutionary change by natural selection. However, the different histories of those subjects and the long and confused debates in biology about Fisher’s fundamental theorem have obscured the simple, common basis of the underlying theory.

I unified different theories by combining d’Alembert’s conceptual frame with the abstract expressions of the Price equation. That combination led to a simple and very general basis for understanding populations or aggregations, in which one can interpret total frequency or total probability as a conserved quantity. By combining conservation of total frequency with a notion of change based on exponential growth, I showed the geometric and algebraic forms of change that arise from d’Alembert’s partition of direct and inertial forces. I also provided an elegant Hamiltonian expression in canonical coordinates, which recovers the Newtonian balance of force and acceleration independently in each dimension for the corresponding direct and inertial forces of d’Alembert.

Finally, I showed that arbitrary system quantities, such as biological traits, or any total system quantity such as energy, can be interpreted through two steps. First, begin with the universal results that arise from conservation of total probability and the notion of change as exponential growth. Second, apply a simple coordinate transformation between frequency change and system quantities to obtain general expressions for the change in system quantities.

Acknowledgments

National Science Foundation grant DEB–1251035 supports my research. I completed this work while on fellowship at the Wissenschaftskolleg zu Berlin.

Conflicts of Interest

The author declares no conflict of interest.

Appendix A: Methods

The assumption of small changes associated with the overdot notation does not imply that forces are weak. Instead, the scale of change is small, in the sense typically associated with continuous time derivatives in differential equations. However, I have avoided classical derivative notation and differential equations in order to retain the more general form of the abstract Price equation [7,8].

For example, in the definition $m_{i} = \dot{q}_{i} / q_{i}$, the overdot notation can be interpreted as a small change in $q_{i}$, such that $\dot{q}_{i} \equiv d q_{i}$. Fitness in biology is sometimes given as an absolute number or as a nondimensional change in frequency, consistent with $m_{i}$, and sometimes as a rate or Malthusian parameter, which might be given as

$$M_{i} = \frac{m_{i}}{d\tau} = \frac{\dot{q}_{i}}{q_{i}\, d\tau} = \frac{d \log q_{i}}{d\tau} \qquad (13)$$

Here, $d\tau$ is the underlying scale of change, which is typically a small change in time. However, we can take $d\tau$ as an abstraction of the underlying scale of change, which may have any units or be nondimensional. If we take the units on $\tau$ as the square of time, then we move toward traditional definitions of force or acceleration. Because $d\tau$ is small, the quantities of rates, forces, or accelerations may be large.
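A short numerical sketch may help separate the small nondimensional change $m_{i}$ from the rate $M_{i}$. It assumes, purely for illustration, that a single frequency grows exponentially at a hypothetical rate `M_true` over a step `dtau`, and it ignores the renormalization across types that conservation of total frequency would require.

```python
import math

q_i = 0.2       # hypothetical initial frequency
M_true = 3.0    # hypothetical Malthusian rate per unit tau (may be large)

rows = []
for dtau in (1e-2, 1e-4, 1e-6):
    # Exponential growth over one small step of size dtau.
    q_new = q_i * math.exp(M_true * dtau)

    dq_i = q_new - q_i                 # small change in frequency
    m_i = dq_i / q_i                   # nondimensional fitness, small
    rate_from_m = m_i / dtau           # M_i = m_i / dtau
    rate_from_log = (math.log(q_new) - math.log(q_i)) / dtau  # d log q_i / dtau

    rows.append((dtau, m_i, rate_from_m, rate_from_log))
    print(dtau, m_i, rate_from_m, rate_from_log)
```

As `dtau` shrinks, `m_i` becomes small while both rate estimates stay near `M_true`, which is the sense in which small changes do not imply weak forces.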

In the text, we are always looking at equivalences between the left-hand and right-hand sides of equations. So we can always multiply or divide by various functions of $d\tau$ interpreted with respect to arbitrary dimensions. The abstraction in the text is intentional, because the interdisciplinary connections between seemingly different subjects and results arise only when one focuses on the abstract structure of the key results. For example, the need for such abstraction arose elsewhere when studying the relation between Fisher’s fundamental theorem and Fisher information [7,8,17].

The abstract structure shows the unity among a broad array of fundamental expressions in mechanics, in biology, in information theory and information geometry, and in many other kinds of problems that can be cast in variational form.

I have made the assumption that the scale of change is small, and thus all quantities with overdots are small. In biology, that assumption is often associated with models of populations with overlapping generations described in continuous time differential equations [16]. In mechanics, that assumption corresponds to the classical differential equation expressions in continuous time.

The analysis of discrete changes that are not small, typically associated with discrete time models, remains an open problem. The exact Price expression in Equation (1) gives a hint at how to proceed when changes are not small. The connection to the continuous expressions of mechanics and d’Alembert might be achieved by careful use of differential geometry and by constructing discrete changes as sums of small changes along continuous paths. Some results based on the analysis of the exact, discrete Price equation may provide a point of departure [7,8].

The $\dot{\log m_{i}}$ notation is interpreted as

$$\dot{\log m_{i}} = \frac{d m_{i}}{m_{i}},$$

which is the change in the relative distance of $m_{i}$ from zero. This interpretation is consistent with the expression of $\dot{\log m_{i}}$ in terms of the changes in $q_{i}$ given in Equation (4).

References

1. Fisher, R.A. The Genetical Theory of Natural Selection, 2nd ed.; Dover: New York, NY, USA, 1958.
2. Frank, S.A.; Slatkin, M. Fisher’s fundamental theorem of natural selection. Trends Ecol. Evol. 1992, 7, 92–95.
3. Price, G.R. Fisher’s ‘fundamental theorem’ made clear. Ann. Hum. Genet. 1972, 36, 129–140.
4. Ewens, W.J. An interpretation and proof of the fundamental theorem of natural selection. Theor. Popul. Biol. 1989, 36, 167–180.
5. Lanczos, C. The Variational Principles of Mechanics, 4th ed.; Dover Publications: New York, NY, USA, 1986.
6. Price, G.R. Extension of covariance selection mathematics. Ann. Hum. Genet. 1972, 35, 485–490.
7. Frank, S.A. Natural selection. IV. The Price equation. J. Evol. Biol. 2012, 25, 1002–1019.
8. Frank, S.A. Natural selection. V. How to read the fundamental equations of evolutionary change in terms of information theory. J. Evol. Biol. 2012, 25, 2377–2396.
9. Frank, S.A. Natural selection. VI. Partitioning the information in fitness and characters by path analysis. J. Evol. Biol. 2013, 26, 457–471.
10. Frieden, B.R. Science from Fisher Information: A Unification; Cambridge University Press: Cambridge, UK, 2004.
11. Amari, S.; Nagaoka, H. Methods of Information Geometry; Oxford University Press: New York, NY, USA, 2000.
12. Wikipedia. Fictitious force. Wikipedia, The Free Encyclopedia, 2015. Available online: https://en.wikipedia.org/wiki/Fictitious_force (accessed on 22 May 2015).
13. Landau, L.D.; Lifshitz, E.M. Mechanics, 3rd ed.; Butterworth-Heinemann: London, UK, 1976; Volume 1.
14. Draper, N.R.; Smith, H. Applied Regression Analysis, 3rd ed.; Wiley: Hoboken, NJ, USA, 1998.
15. Frank, S.A. The Price equation, Fisher’s fundamental theorem, kin selection, and causal analysis. Evolution 1997, 51, 1712–1729.
16. Crow, J.F.; Kimura, M. An Introduction to Population Genetics Theory; Burgess: Minneapolis, MN, USA, 1970.
17. Frank, S.A. Natural selection maximizes Fisher information. J. Evol. Biol. 2009, 22, 231–244.

© 2015 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/4.0/).

