1. Introduction
The problem of reconciling quantum theory (QT) and general relativity (GR) has most commonly been addressed by preserving the framework of QT essentially unchanged while modifying the structure and dynamics of space-time. This is not unreasonable. Einstein’s equation,
, relates geometry on the left to matter on the right. Since our best theories for the matter right hand side are QTs it is natural to try to construct a theory in which the geometrical left hand side is also of quantum mechanical origin [
1,
2,
3].
Further thought however shows that this move carries a considerable risk, particularly because the old process of quantization involves ad hoc rules which, however successful in the past, have led to conceptual difficulties that would immediately spread and also infect the gravitational field. One example is the old quantum measurement problem and its closely related cousin the problem of macroscopic superpositions. Do quantum superpositions of space-times even make sense? In what direction would the future be? Another example is the cosmological constant problem. Does the zero point energy of quantum fields gravitate? Why does it not give rise to unacceptably large space-time curvatures? Considerations such as these suggest that the issue of whether and how to quantize gravity hinges on a deeper understanding of the foundations of QT and also on a deeper understanding of GR and of geometry itself—what, after all, is
distance? Why are QT and GR framed in such different languages? Recent developments indicate that they might be closer than previously thought—the link is entropy. Indeed, in the entropic dynamics approach [
4,
5,
6] QT is derived as an application of entropic methods of inference [
7] with a central role assigned to concepts of information geometry [
8,
9,
10,
11,
12,
13,
14,
15,
16]. And, on the GR side, the link between gravity and entropy has been recognized from the early work of Bekenstein and Hawking and further reaffirmed in more recent thermodynamic approaches to GR [
17,
18,
19,
20,
21,
22].
In a previous paper [
23] we used the method of maximum entropy to construct a model of physical space in which points are blurred; they are defined with a finite resolution. Such a blurred space is a statistical manifold and therefore it is automatically endowed with a Riemannian metric given by information geometry. Our goal here is to further close the gap between QT and GR by formulating the corresponding Lorentzian geometry of space-time.
The extension from space to space-time is not just a simple matter of applying information geometry to four dimensions rather than three. The problem is that information geometry leads to metrics that are positive—statistical manifolds are inevitably Riemannian—which cannot reproduce the light-cone structure of space-time. Some additional ingredient is needed. We do not model space-time as a statistical manifold. Instead, space-time is modelled as a four-dimensional manifold such that the geometry of all space-like embedded surfaces is given by information geometry. We find that in the limit of a flat space-time our model coincides with a stochastic model of space-time proposed long ago by Ingraham by following a very different line of argument [
24].
Blurred space is a curious hybrid: some features are typical of discrete spaces while other features are typical of continuous manifolds [
25,
26]. For example, there is a minimum length and blurred points have a finite volume. The volume of a region of space is a measure of the number of blurred points within it, and it is also a measure of its bulk entropy. Under Lorentz transformations the minimum length suffers a dilation which is more analogous to the relativistic time dilation than to the familiar length contraction.
The dynamics of blurred space, its
geometrodynamics, is constructed by requiring that as three-dimensional space undergoes the deformations associated with time evolution it sweeps a four-dimensional space-time. As shown in a remarkable paper by Hojman, Kuchar̆, and Teitelboim [
27] in the context of the familiar sharp space-time this requirement is sufficient to determine the dynamics. Exactly the same argument can be deployed here. The result is that in the absence of matter the geometrodynamics of four-dimensional blurred space-time is given by Einstein’s equations. The coupling of gravity to matter will not be addressed in this work.
2. The Information Geometry of Blurred Space
To set the stage we recall the model of blurred space as a smooth three-dimensional manifold
the points of which are defined with a finite resolution [
23]. It is noteworthy that, unlike the very rough space-time foams expected in some models of quantum gravity, one expects blurred space to be very smooth because irregularities at scales smaller than the local uncertainty are suppressed. Blurriness is implemented as follows: when we say that a test particle is located at
(with coordinates
,
) it turns out that it is actually located at some unknown neighboring
. The probability that
lies within
is
. Since to each point
one associates a distribution
the space
is a statistical manifold automatically endowed with a metric. Indeed, when points are blurred one cannot fully distinguish the point at
x described by the distribution
from another point at
described by
. The quantitative measure of distinguishability [
7,
11] is the
information distance,
where the metric tensor
—
the information metric—is given by,
(We adopt the standard notation
and
.) Thus, in a blurred space
distance is distinguishability.
In
Section 4 we will briefly address the physical/geometrical interpretation of
. For now we merely state [
23] that
measures the distance between two neighboring points in units of the local uncertainty defined by the distribution
, that is, information length is measured in units of the local blur.
In order to completely define the information geometry of
which will allow us to introduce notions of parallel transport, curvature, and so on, one must specify a connection or covariant derivative ∇. The natural choice is the Levi-Civita connection, defined so that
. Indeed, as argued in [
28], the Levi-Civita connection is to be preferred because, unlike the other
-connections [
11], it does not require imposing any additional structure on the Hilbert space of functions
.
The next step is to use the method of maximum entropy to assign the blur distribution
. The challenge is to identify the constraints that capture the physically relevant information. One might be tempted to consider imposing constraints on the expected values of
and
but this does not work because in a curved space neither of these constraints is covariant. This technical difficulty is evaded by maximizing entropy on the flat space
that is tangent to
at
P and then using the
exponential map (see [
23]) to “project” the distribution from the flat
to the curved space
. It is important to emphasize that the validity of this construction rests on the assumption that the normal neighborhood of every point
x—the region about
x where the exponential map is 1-1—is sufficiently large. The assumption is justified provided the scale of the blur is much smaller than the scale over which curvature effects are appreciable.
Consider a point
with generic coordinates
and a positive definite tensor field
. The components of
are
. The distribution
on
is assigned on the basis of information about the expectation
and the variance-covariance matrix
On
it is always possible to transform to new coordinates
such that
where
. If
were a metric tensor the new coordinates would be called Riemann Normal Coordinates at
P (
). The new components of
y are
and the constraints (
3) take the simpler form,
We can now maximize the entropy
relative to the measure
subject to (
7) and normalization. Since
is flat we can take
to be constant and we may ignore it. The result in
is
Using the inverse of Equation (
6) we can transform back to the original coordinates
,
The resulting distribution is also Gaussian,
and the matrix
of Lagrange multipliers turns out to be the inverse of the correlation matrix
,
.
Next we use the exponential map to project
coordinates on the flat
to the
coordinates on the curved
,
The corresponding distribution
induced on
by
on
is
or
Thus, in
the distribution
retains the Gaussian form. We can now invert (
4) and transform back to the original generic frame of coordinates
and define
by
which is an identity between scalars and holds in all coordinate systems. In the original
coordinates the distribution
will not, in general, be Gaussian,
Finally we substitute (
16) into (
2) to calculate the information metric
. (The integral is easily handled in
.) The result is deceptively simple,
The main result of [
23] was to show that the metric
of a blurred space is a statistical concept that measures the “degree of distinguishability” between neighboring points. The metric is given by the Lagrange multipliers
associated to the covariance tensor
that describes the blurriness of space.
3. Space-Time and the Geometrodynamics of Pure Gravity
The constraint that determines the dynamics is the requirement that blurred space be a three-dimensional spacelike “surface” embedded in four-dimensional space-time. As shown in [
27] the reason this condition is so constraining is that when evolving from an initial to a final surface every intermediate surface must also be embeddable in the same space-time and, furthermore, the sequence of intermediate surfaces—the path or foliation—is not unique. Such a “foliation invariance”, which amounts to the local relativity of simultaneity, is a requirement of consistency: if there are two alternative paths to evolve from an initial to a final state, then the two paths must lead to the same result.
Space-time is foliated by a sequence of space-like surfaces
. Points on the surface
are labeled by coordinates
(
) and space-time events are labeled by space-time coordinates
(
). The embedding of
within space-time is defined by four functions
. An infinitesimal deformation of
to a neighboring
is specified by
. The deformation vector
is decomposed into normal and tangential components,
where
is the unit normal to the surface and the three vectors
are tangent to the coordinate lines
(
,
).
We assume a phase space endowed with a symplectic structure: the basic dynamical variables are the surface metric
and its canonically conjugate momentum
. This leads to a Hamiltonian dynamics where the super-Hamiltonian
and the super-momentum
generate normal and tangential deformations respectively. In order for the dynamics to be consistent with the kinematics of deformations the Poisson brackets of
and
must obey two sets of conditions [
29,
30]. First, they must close in the same way as the “group” of deformations, that is, they must provide a representation of the “algebra” of deformations [
31],
And second, the initial values of the variables
and
must be restricted to obey the weak constraints
A remarkable feature of the resulting dynamics is that once the constraints (
22) are imposed on one initial surface
they will be satisfied automatically on all subsequent surfaces. As shown in [
27] the generators that satisfy (
19)–(
21) are
where
and
are constants which, once the coupling to matter is introduced, can be related to Newton’s constant
and to the cosmological constant
. Equations (
22)–(
25) are known to be equivalent to Einstein’s equations in vacuum.
To summarize: (a) Space-time is constructed so that the geometry of any embedded spacelike surface is given by information geometry. (b) The geometrodynamics of blurred space is given by Einstein’s equations. These are the main conclusions of this paper.
4. Discussion
Dimensionless Distance?—As with any information geometry the distance
given in Equations (
1) and (
2) turns out to be dimensionless. The interpretation [
23] is that an information distance is measured distances in units of the local uncertainty—the blur. To make this explicit we write the distribution (
14) that describes a blurred point in
in the form
so that the information distance between two neighboring points is
Since the blur
is the only unit of length available to us (there are no external rulers) it follows that
but it is nevertheless useful to write our equations showing
explicitly. In (
26) the two points
x and
are meant to be simultaneous.
Minimum Length—To explore the geometry of blurred space it helps to distinguish the abstract “mathematical” points that are sharply defined by the coordinates
x from the more “physical” blurred points. We shall call them c-points and b-points respectively. In
the distance between two c-points located at
x and at
is given by (
27). To find the corresponding distance
between two b-points located at
x and at
we recall that when we say a test particle is at
x it is actually located at
so that
Taking the expectation over
y with the probability (
26)—use
and
—we find
We see that even as
and the two b-points coincide we still expect a minimum
rms distance of
.
Blur Dilation—The size of the blur of space is a length but it does not behave as the length of a rod. When referred to a moving frame it does not undergo a Lorentz contraction. It is more analogous to time dilation: just as a clock marks time by ticking along the time axis, so are lengths measured by ticking s along them. By the principle of relativity all inertial observers measure the same blur in their own rest frames — the proper blur . Relative to another inertial frame the blur is dilated to where is the usual relativistic factor. This implies the proper blur is indeed the minimum attainable.
The Volume of a Blurred Point: Is Space Continuous or Discrete?—A b-point is smeared over the whole of space but we can still define a useful measure of its volume by adding all volume elements
weighed by the scalar density
. Therefore in
units a blurred point has unit volume. This means that we can measure the volume of a finite region of space by
counting the number of b-points it contains. It also means that the number of distinguishable b-points within a region of finite volume is finite which is a property one would normally associate to discrete spaces. In this sense blurred space is both continuous and discrete. (See also [
26].)
The Entropy of Space—The statistical state of blurred space is the joint distribution of all the
variables associated to every b-point
x. We assume that the
variables at different
xs are independent, and therefore their joint distribution is a product,
From (
11) and (
17) the distribution
in the tangent space
is Gaussian,
which shows explicitly how the information metric
determines the statistical state of space.
Next we calculate the total entropy of space,
relative to the uniform distribution
which is independent of
y — a constant. Since the
y’s in Equation (
30) are independent variables the entropy is additive,
, and we only need to calculate the entropy
associated to a b-point at a generic location
x,
Thus, the entropy per b-point is a numerical constant
and the entropy of any region
R of space,
, is just its volume,
Thus, the entropy of a region of space is proportional to the number of b-points within it and is proportional to its volume.
Canonical Quantization of Gravity?—The picture of space as a smooth blurred statistical manifold stands in sharp contrast to ideas inspired from various models of quantized gravity in which the short distance structure of space is dominated by extreme fluctuations. From our perspective it is not surprising that attempts to quantize gravity by imposing commutation relations on the metric tensor have not been successful. The information geometry approach suggests a reason why: quantizing the Lagrange multipliers would be just as misguided as formulating a quantum theory of fluids by imposing commutation relations on those Lagrange multipliers like temperature, pressure, or chemical potential, that define the thermodynamic macrostate.
Physical Consequences of a Minimum Length?—A minimum length will eliminate the short wavelength divergences in QFT. This in turn will most likely illuminate our understanding of the cosmological constant and affect the scale dependence of running coupling constants. One also expects that QFT effects that are mediated by short wavelength excitations should be suppressed. For example, the lifetime of the proton ought to be longer than predicted by grand-unified theories formulated in Minkowski space-time. The nonlocality implicit in a minimum length might lead to possible violations of CPT symmetry with new insights into matter-antimatter asymmetry. Of particular interest would be the early universe cosmology where inflation might amplify minimum-length effects possibly making them observable.