1. Prelude on Stochastic Motions
Stochastic differential equations, whose solutions are stochastic motions (processes) describing the position of an entity, are currently used in a diversity of scientific areas to model various phenomena. It is a common practice in many of these models to derive a stochastic differential equation by ad hoc arguments rather than from the first principles. The classic concept of velocity is not applicable in these models (whenever the position process is driven by a Wiener process for instance), and consequently the traditional formulations of the laws of motion (originating in classical mechanics) are not applicable either. This raises the question of finding the ‘axioms of motion’ needed to put ‘stochastic modelling’ on a firmer basis. Put differently, the aim is to focus not only on ‘kinematics’ (describing the stochastic motion) but also on ‘dynamics’ (explaining its cause).
2. Dual Meaning of Newton’s First Law
Newton’s first law of motion was stated in the first two editions of his
Principia [
1] (published in 1687 and 1713) as follows: ‘
Corpus omne perseverare in statu suo quiescendi vel movendi uniformiter in directum, nisi quatenus a viribus impressis cogitur statum illum mutare’. An accurate translation of the Latin original into English given in [
1] (p. 416) reads:
Every body perseveres in its state of being at rest or of moving uniformly straight forward except insofar as it is compelled to change its state by forces impressed. The accuracy of this translation is reached by converting the Latin word ‘
perseverare’ into the English word ‘
perseveres’ and the Latin phrase ‘
nisi quatenus’ into the English phrase ‘
except insofar as’ so that the Latin original is genuinely replicated in English.
Earlier translations of the Latin original into English, including the very first one by Motte in 1729, were somewhat inaccurate in this regard due to the natural branching of the Latin word ‘
perseverare’ into the English words ‘
continue/preserve’ combined with ‘
until/so-far-as-not’ (see [
2] for a detailed historical and conceptual analysis). This gave rise to two forms of Newton’s first law in English translations, one ‘
temporal’ and another ‘
quantitative’, both being contained in the Latin original. The
temporal form addresses an interval of time
when external forces are absent and the body retains its natural state of
rest or
uniform rectilinear motion. The
quantitative form addresses an instant of time
T when external forces are present but the body is still in its natural state (of
rest or
uniform rectilinear motion) which will be changed only in the very next instant of time. (See also [
1] (pp. 109–111) for additional arguments which relate Newton’s first law of motion to Newton’s second law of motion.)
It appears that the dual meaning of Newton’s first law in the Latin original was intentional. The temporal form embodies Galileo’s vision of the ‘uniform rectilinear motion’ of a body as its natural state and thus departs from the view held by Aristotle (and Kepler) that the only natural state of a body is to be at ‘rest’ (implying the need of a ‘mover’ to maintain a ‘uniform rectilinear motion’ itself). The quantitative form embodies Galileo’s perception of the ‘force of inertia’ exerted by the body during a change in its natural state and thus departs from the view held by Descartes (and Huygens) that such a ‘force’ did not exist. The dual meaning of Newton’s first law of motion is obtained by addressing simultaneously (i) a body’s maintenance of a natural state of motion (when external forces are absent) and (ii) a body’s resistance to a change in the natural state of motion (when external forces are present). This synthesis has been recognised as the main power of Newton’s first law (see [
2] for further details).
3. The Question of Merger
Part (i) of the dual meaning addresses a highly idealised situation which does not occur in reality and thus could only be understood as a powerful thought experiment. Part (ii) of the dual meaning is far removed from part (i) because the presence of external forces excludes their absence. On deeper reflection, this raises the question whether it is possible to merge the (seemingly disjoint) parts (i) and (ii) of the dual meaning and obtain a single meaning of Newton’s first law of motion. It appears to be evident that if such a merger is possible, then one should focus on the highly idealised part (i) and move it closer to its (seemingly distant) relative embodied in part (ii) of the dual meaning.
A starting point in realising the merger is to recall that the absence of external forces can also be thought of as if the resulting force were zero (both in magnitude and direction) when external forces are present. This realisation moves part (i) an infinitesimal step closer to part (ii) of the dual meaning. To move beyond the infinitesimal step, it is apparent that the ‘nature’ ought to exert the resulting forces in opposite directions so to cancel each other out, however, in such a way that the cancellation fails at infinitesimally small time scales that are beyond any reach. Denoting the position of an entity in this static/dynamic equilibrium by at time , and realising the motion under the same conditions twice, the observable outcome would be that X exhibits two different sample paths both satisfying the same initial condition . This means that the motion X is not only chaotic but also stochastic.
It is important to bear in mind that the infinitesimal cancellation of the resulting forces fails in every possible direction of the state space of X with no imposed bound on the total number of external forces including their magnitude and/or direction either. Clearly, part (i) of the dual meaning expanded in that way addresses a highly idealised situation as well, which however has the advantage of being closer to reality if one accounts for all forces in the ‘nature’. Moreover, the static/dynamic equilibrium of the ‘nature’ just described evidently merges parts (i) and (ii) of the dual meaning and establishes a single meaning of Newton’s first law of motion.
4. Stationary Independent Increments
The static/dynamic equilibrium can be fully described by focusing on increments
for
and inferring that they are (a)
independent and (b)
stationary (i.e., equally distributed whenever
) for any
given and fixed. Both properties embody a canonical meaning which is close to naive intuition that underpins the static/dynamic equilibrium.
The property (a) shows that the stochastic motion X has no memory and always renews itself completely afresh. Moreover, setting whenever the expected values are finite for , one can easily verify using (a) that the centred motion is a martingale (i.e., the best prediction of the future position given the observed positions for is the present position for given and fixed). On the other hand, setting whenever the expected values are finite for , one can easily verify using (b) that m satisfies the Cauchy functional equation for so that for (whenever measurable) with some real constant a. Hence we see that for with and showing that the stochastic motion X in the static/dynamic equilibrium behaves on average analogously to the natural motions of Aristotle () and/or Galileo ().
Newton’s first law of motion then extends as follows: Every entity perseveres in its state of independent and stationary increments except insofar as it is compelled to change its state by forces impressed. The term ‘entity’ applies generally to anything having real or distinct existence, including any dual nature depending on means of observation, for instance. The motion X in the latter case may be rather seen as a motion of the probability law of for , while itself may be viewed as one of its sample path realisations.
In addition to the properties (a) and (b) in relation to (
1) above, the static/dynamic equilibrium also includes the continuity property
in probability (i.e., the probability that a distance from
to
is larger than any small positive number tends to zero) as
in
. It needs to be noted that this weak continuity property (as well as the properties (a) and (b) in relation to (
1) above) is truly a property of the probability law of
X and a sample path
need not be continuous on
. It is known however that a stochastic motion
X satisfying the properties (a) and (b) in relation to (
1) above, together with the weak continuity property (
2) itself, can always be realised through its sample paths which are
right-continuous and have
left limits as functions of time. The times at which jumps of the sample paths occur are
unpredictable.
5. General Description and Examples
The structure of stochastic motions
X having independent and stationary increments (
1) and satisfying the weak continuity property (
2) has been completely described firstly by Einstein and Wiener in the continuous sample path case and then by de Finetti, Kolmogorov, and Lévy in the general/discontinuous sample path case. The resulting stochastic motions (or processes) bear the name of Lévy in the modern literature (see e.g., [
3]). Leaving aside the more familiar Wiener process, whose sample paths (although nowhere differentiable) are continuous, a key starting fact in the discontinuous sample path case is that
is a Poisson process for
with intensity
for any measurable set
not containing a (small) open ball with centre at 0, where we set
for
. (In particular it means that the compensated process
is a martingale.) The measure
defined by (
4) satisfies
and is called the
Lévy measure. Extensions from
to
with
and more general state spaces of
X are straightforward.
The
Lévy-Itô decomposition of
X states that
where
does not depend on any chosen truncation level
and
is a standard Wiener process. The finite constant
plays the role of a drift, the first integral on the right-hand side of (
5) represents a mixture of compensated Poisson processes (with bounded jumps), and the second integral on the right-hand side of (
5) represents a mixture of Poisson processes (with unbounded jumps). From (
5) we see that a ‘free particle’ moves on a ‘straight line’ when
and
(see [
4]) and undertakes ‘quantum jumps’ (as the motion of probability laws) when
(see [
5]).
The
Lévy–Khintchine formula of
X states that
for
where the exponent function
is explicitly given by
for
. Recalling that
X has independent and stationary increments, we see that (
6)+(
7) completely determine all possible probability laws of
X.
Examples of stochastic motions
X having independent and stationary increments (
1) and satisfying the weak continuity property (
2) include: (1)
Straight line for
(Aristotle
& Galileo
); (2)
Wiener process for
(Einstein and Wiener
); (3)
Poisson process (
; the length of time between two jumps is
distributed; the size of each jump is 1); (4)
Compound Poisson process (
is any finite measure on
; the length of time between two jumps is
distributed with
; the size of each jump is
distributed); (5)
Stable process (
where
,
with
and
); (6)
Gamma process (
). For more details see [
3] and the references therein.
6. Synopsis
Merging the dual meanings of Newton’s first law of motion into a single/canonical meaning, as embodied in independent and stationary increments of the stochastic motion, one unleashes the full power of the extended first law. The merger itself removes the conceptual barrier separating the traditional formulation of Newton’s first law from its applicability to more general stochastic motions to which, among other things, the classic concept of velocity is not applicable (recall that the sample paths of a standard Wiener process are nowhere differentiable).
In accordance with the traditional interpretation, the extended first law implies that stochastic motions having independent and stationary increments represent inertial frames, which then can be used to account for the presence of external forces and derive the equation of motion using a simple principle of their superposition. The extended first law thus aids in deriving the equation of motion, and this appears to be its single most useful practical implication, leaving aside any deeper/philosophical ones.
For such an in-depth analysis, we may point out a derivation of the equation of motion for a Brownian particle, which is under the influence of an external force in the fluid with a non-constant temperature (see Sections 4–6 and the resulting Equation (6.4) in [
6]). This equation of motion is ‘driven’ by a standard Wiener process and its solution represents the position of the Brownian particle as a function of time. On more careful inspection of the derivation, one sees that the underlying inertial frame of a standard Wiener process plays a fundamental role in accounting for the external forces present, which in turn yields the equation of motion itself. The analogous arguments clearly apply in more general settings as well whenever the inertial frames are identified with Wiener processes.
Equations of motions ‘driven’ by more general Lévy processes have also been studied in the literature, though with no awareness or reference to inertial frames of the extended first law. The foundations laid down for a unifying view of the extended first law open up new avenues for research in these and related settings.