We could start this exposition in a number of ways, but the easiest is perhaps to explain how, in classical mechanics, time could be also thought of as a dynamical variable just like position and momentum. The idea is to start with a time-independent Hamiltonian
and construct a new Hamiltonian
where
is conjugate to the time variable
t. The equations of motion for this new Hamiltonian (with respect to a new time
) are:
from which it follows that
and that
is conserved in time. We can set
without any loss of generality and can also assume that the value of
is such that
. In other words, the total energy equals zero. We can already see how, at the level of
, there is no evolution; however, the dynamics generated by the original Hamiltonian remain the same. Namely,
This is the formalism of super Hamiltonians (see, for instance, Kramers [
1]). It is well known that, when we quantize this system,
H becomes the Hamiltonian operator and
. The resulting equation
is the Schrödinger equation.
This already tells us that even the time-dependent Schrödinger equation can be though of in a timeless way (even Schrödinger himself apparently wrote down the time-independent form of his equation first). However, to nudge the classical discussion even closer to the Page–Wootters logic, let us now treat the variables as actually pertaining to another subsystem, which is distinct from the one with the conjugate variables .
To make the notation more transparent, the first subsystem will be the one with the variables , while the second subsystem will have variables . Suppose that the two subsytems are non-interacting, which means that the total Hamiltonian is given by . If the total system is closed, the total energy is conserved. We can again assume without loss of generality that the total energy is zero , which means that if and , then .
In the Hamilton–Jacobi formalism, we thus write:
where
is the total action variable, which is the sum of the individual actions as the subsystems are non-interacting. Each subsystem could be assigned its own time variable
Because the total energy is conserved, this immediately implies that (we are again omitting any additive constants without loss of generality). Just like in the quantum (Page–Wootters) formalism, the time variables of the two subsystems are correlated, not because the subsystems are coupled, but because the overall energy is conserved. Therefore, we can say that one subsystem “evolves” dynamically with respect to the other, one even though the total system is stationary (i.e., we have that ).
In the Page–Wootters (PW) work [
2], this relative evolution is immediately transparent through the use of Lie algebras and groups. We first briefly summarize the quantum derivation in order to follow it up with the exact classical analogue.
In the PW formalism, the two subsystems are usually called the system and the clock. The total system is referred to as the universe. The state of the universe is of the form
, and it is an eigenstate of the total Hamiltonian (with zero eigenvalue assumed for simplicity), which itself is the sum of the system and the clock Hamiltonians,
. The index
t can be thought of as just a dummy index; however, it is clear that it will become the time label of the clock, with respect to which the system states
will evolve. Furthermore, we assume that
It is already clear from here that since , the dynamics of the system unfold in sync with the dynamics of the clock, and without any overall dynamics. Namely, if we apply a small time shift to the clock so that the state changes , then the system will undergo the change .
Another way of saying the same and obtaining the dynamics of the system is to compute [
3]
which, after a sequence of steps, leads us to
i.e., the Schrödinger equation for the system (the bracket
is the commutator).
Can this derivation also be paralleled in classical mechanics? Yes. We can assume that the classical Hamiltonian is, once again, the sum of the system and clock Hamiltonians and that the classical state of the system is a phase space distribution in which the states of the system are correlated to the states of the clock. The dynamics of classical states can be represented as (see e.g., [
4]):
where
H is the classical Hamiltonian and
is the classical phase space distribution at time
t (note the absence of the imaginary
i).
plays the classical role of the state of the system.
The notation with tilde is a shorthand notation for the following operation
where
is the
-dimensional antisymmetric matrix,
s are the so-called unified coordinates that represent all the conjugate variables (the first
k coordinates are
qs and the next
k coordinates are conjugate momenta
p) and
labels the
-th position coordinate. The Hamilton equations of motion in this notation are given by
The tilde definition is designed so that the Poisson bracket can be written as
. The iterations of this formula give us the exponential form of the classical propagator
[
4].
If we now write the Hamiltonian as the sum of two Hamiltonians, so that , and assume that the total energy is zero, we will end up reproducing the classical derivation of the evolution of the system relative to the clock. Instead of the Schrödinger equation, we will, of course, obtain the Liouville equation in which the commutator will be replaced by the Poisson bracket.
The group-theoretic formulation of both classical and quantum dynamics therefore helps us expose the analogies as far as the timeless dynamics are concerned. We would now like to discuss one clear difference. Unsurprisingly, it is ultimately related to quantum entanglement. The PW state of the universe is a pure state which corresponds to the maximum knowledge. The reduced states of the system and the clock are mixed, but they are correlated with each other. Classically, as is well known, this is an impossible state of affairs. Classical phase space densities can be correlated, but this automatically implies that their overall state cannot be pure (classically, this means a state with perfectly known positions and momenta of all particles). For instance, we can write to total classical density as
where
is the probability corresponding to the state
t (which, as in the quantum case, could be assumed to be uniform). To each classical state of the clock, there corresponds one classical state of the system. Formally,
, which requires us to assume that the classical states of the clock are “orthogonal" to each other (meaning that their overlap in phase space is zero, as would be the case with two delta functions centred on two different points in phase space). The simplest, non-trivial, instance of this is the state that contains only two terms (
) where the states of the clock are two orthogonal delta functions (infinitely localised states) and the state of the system are two arbitrary distributions in phase space. As the clock “changes” from one state to the other, the system also “evolves”’ from the first to the second distribution.
In classical physics, the total state (of the universe) cannot be less mixed than the state of any of the subsystems. This presents a problem since, in a classical universe, its analogue PW state can only be explained statistically. All classical mixtures are due to either a lack of knowledge or can be seen as describing an ensemble of universes. If the universe is mixed because of the former, would this lack of knowledge be fundamental and why? On the other hand, a statistical ensemble of universes is not a good hypothesis unless we have a strong rationale for introducing it. In quantum mechanics, on the other hand, these problems do not arise. While a mixed state between the system and the clock would also suffice for the PW construction, the point is that quantum physics always allows us to view any mixed state as a pure state in an enlarged Hilbert Space. A quantum universe therefore seems more satisfactory even just from just this formal perspective. An argument could be made, though, that a pure state is somehow special, and that this special state in turn requires an explanation, whereas a maximally mixed state is more natural and follows from some kind of a maximum entropy principle. Either way, it is possibly an additional strength of this approach to time that the exact overall state of the universe may not even be relevant (in the sense that the amplitudes between different states do not have any observable meaning).
Setting aside the issue of the state of the universe, we would finally like to demonstrate that both classical and quantum discussions can be set in a way that all other conservation laws (other than energy) are included. This is straightforward once it is realised that the Hamiltonian is just one of the relevant generators of transformations. Other generators would be the generators of translations, rotations, boosts and so on, whatever is believed to be needed to characterise all the symmetries of the universe.
Let us enumerate all the relevant generators of displacements as
, such that, say,
and so on. The closed universe consisting of two subsystems would satisfy the following constraint for all
i,
. In other words, not only would the total energy be zero, but so would the total momentum, total angular momentum and so on [
5]. Analogous to Page–Wootters, the momentum of one subsystem “changing” by some value
p, would “result” in the other subsystem changing by
, and the same for all other relevant quantities. This can ultimately be achieved at the level of fields (see e.g., [
6] for a symmetry-based approach to quantum field theory).
In cosmology, we can think of the two subsystems as being the matter fields (the system) and the gravitational field (the clock) [
7,
8,
9,
10,
11]. But any quantum evolution that rests on the entanglement between different subsystems can be though of in this way. In quantum physics, this takes us back to Mott’s analysis of alpha-particle tracks in a cloud chamber [
12], and ultimately to Everett’s relative state interpretation [
13] of quantum physics. As the present paper shows, all of them could be phrased perfectly well within classical mechanics, with the main difference being that classical correlations invariably imply some kind of a lack of knowledge at the level of the universe. Our work does not take into account the fact that, in general, the subsystems would also interact with one another, and that the total Hamiltonian considered here is therefore only approximate). This raises a number of questions that are worth investigating in the future.