1. Introduction
The study of estimation efficiency within the framework of information geometry has evolved significantly since the pioneering work of Rao [1] and the subsequent works of others—see [2,3,4]—and may be found in more recent papers and books such as [5,6]. The Fisher information metric, providing a canonical Riemannian structure on parametric statistical models, allows an intrinsic quantification of statistical distinguishability and the derivation of sharp risk bounds. This was developed in [7], where intrinsic versions of classical results, such as the Cramér–Rao inequality, were established under regularity conditions; in that work, we developed what can be termed an intrinsic approach to the analysis of point estimation. Given a statistical model—that is, after fixing the collection of possible stochastic mechanisms assumed to generate the observed sample—the term intrinsic refers to properties that are inherent to the estimator itself rather than to the particular parameterization used to describe the model. Intrinsic properties, in contrast to classical ones, are invariant under reparameterizations of the model, which may be interpreted as changes of coordinates in the space of probabilistic mechanisms under consideration; see also [8].
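To make this invariance tangible, the following minimal sketch (ours, not from the paper) verifies symbolically that the Fisher information of an exponential model transforms as a metric tensor under the reparameterization from the rate to the mean, so that Fisher-based quantities such as the Rao distance do not depend on the chosen coordinates.

```python
# Sketch: Fisher information transforms tensorially under reparameterization.
# Exponential model, rate parameter l versus mean parameter t = 1/l.
import sympy as sp

x = sp.symbols('x', positive=True)
l, t = sp.symbols('l t', positive=True)

def fisher_info(logpdf, param):
    """One-parameter Fisher information: expected squared score."""
    score = sp.diff(logpdf, param)
    return sp.simplify(sp.integrate(score**2 * sp.exp(logpdf), (x, 0, sp.oo)))

I_rate = fisher_info(sp.log(l) - l*x, l)      # I(l) = 1/l**2
I_mean = fisher_info(-sp.log(t) - x/t, t)     # I(t) = 1/t**2

# Tensor transformation rule: I(t) = I(l) * (dl/dt)**2, evaluated at l = 1/t.
assert sp.simplify((I_rate * sp.diff(1/t, t)**2).subs(l, 1/t) - I_mean) == 0
```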
However, whether one adopts a classical or an intrinsic perspective, the risk functions of different estimators often intersect. As a consequence, the comparison of estimators cannot, in general, be based on pointwise risk criteria alone, unless additional structural properties are imposed, such as unbiasedness or equivariance in the case of families that are invariant under the action of a group (see [9]). One natural way to address this issue is to assess the performance of an estimator—intrinsic or classical—over an entire region of the parameter space, for example, by integrating its risk with respect to the Riemannian volume measure or by considering the supremum of the risk over that region. This work extends those findings through indices that quantify estimator performance over regions of the parameter space rather than at single points. Specifically, the aim of this paper is to derive lower bounds for two global risk measures of an estimator over a subset of the parameter space under the intrinsic geometry induced by the Fisher information: the average risk and the maximum risk. In the next section, we outline the setting of the problem and recall some results on local risk bounds, which will later be applied to obtain global bounds. Related contributions can be found in [2], where the analysis is carried out in a classical unidimensional framework, and in [10], which develops a classical, non-intrinsic perspective.
Building on these foundations, the notion of global efficiency has recently attracted renewed attention, with emphasis on the behavior of estimators not only locally but across entire regions of the parameter space. This is particularly relevant given that the interplay between geometry and physics has been further enriched by applications of Fisher information to variational principles in classical and quantum mechanics; see [11,12,13,14,15].
2. The Intrinsic Analysis Framework
Let $\mathcal{X}$ be a sample space, $\mathfrak{A}$ a $\sigma$-algebra of subsets of $\mathcal{X}$, and $\mu$ a $\sigma$-finite positive measure on $(\mathcal{X}, \mathfrak{A})$. A parametric statistical model is defined as the triple $(\mathcal{X}, f, \Theta)$, where $(\mathcal{X}, \mathfrak{A}, \mu)$ is a measure space, $\Theta$ is a smooth real manifold, known as the parameter space, and $f$ is a non-negative measurable map, $f: \mathcal{X} \times \Theta \to \mathbb{R}$, such that $dP_\theta = f(\cdot, \theta)\, d\mu$ defines a probability measure on the measurable space $(\mathcal{X}, \mathfrak{A})$ for every $\theta \in \Theta$. Here, $\mu$ is referred to as the reference measure and $f$ as the model function.
For simplicity, in this paper, we shall focus on the case in which $\Theta$ is an open, connected subset of $\mathbb{R}^n$. In this setting, it is customary to use the same symbol to denote both the points of $\Theta$ and their coordinate representations. Adopting this convention, the results are presented in this familiar form hereafter, even though the statements can be formulated in greater generality.
Additionally, it will be assumed that the model function $f$ satisfies the following regularity conditions:
(i) For fixed $x$, the real function $\theta \mapsto f(x, \theta)$ is a smooth function on the manifold $\Theta$.
(ii) The score functions of $x$, $\partial \ln f(x, \theta)/\partial \theta^i$, are linearly independent and have finite moments of order $r$ for a convenient $r$.
(iii) Partial derivatives of the required orders and integration of $f$ with respect to $\mu$ can always be interchanged.
(iv) The model is identifiable: the map $\theta \mapsto P_\theta$, with $dP_\theta = f(\cdot, \theta)\, d\mu$, is one-to-one.
Within this framework, the probabilistic mechanism that generates the data under analysis can be equivalently represented by a probability measure, a density function, or a parameter, that is, by a point in the parametric manifold $\Theta$. When these conditions are satisfied, the parametric statistical model is said to be regular. Initially, $\Theta$ is regarded as a Riemannian manifold endowed with an arbitrary fundamental tensor $h$ on $\Theta$, whose components are denoted by $h_{ij}$. Nevertheless, it is well known that the parameter space admits a natural Riemannian structure induced by the probability measures, referred to as the information metric, whose fundamental tensor components $g_{ij}$ coincide with those of the Fisher information matrix. For further details, see [1,3,4,6,7], among many others.
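As a concrete illustration (a minimal sketch, not taken from the paper), the components $g_{ij}$ of the information metric for the univariate normal model can be computed symbolically, giving the familiar Fisher matrix $\operatorname{diag}(1/\sigma^2, 2/\sigma^2)$ in the coordinates $(\mu, \sigma)$:

```python
# Fisher information matrix g_ij = E[(d log f/d theta_i)(d log f/d theta_j)]
# for the univariate normal model f(x; mu, sigma).
import sympy as sp

x, mu = sp.symbols('x mu', real=True)
sigma = sp.symbols('sigma', positive=True)

logf = -sp.log(sigma) - sp.log(2*sp.pi)/2 - (x - mu)**2/(2*sigma**2)
pdf = sp.exp(logf)
params = (mu, sigma)

g = sp.Matrix(2, 2, lambda i, j: sp.simplify(sp.integrate(
    sp.diff(logf, params[i]) * sp.diff(logf, params[j]) * pdf,
    (x, -sp.oo, sp.oo))))

print(g)  # Matrix([[sigma**(-2), 0], [0, 2/sigma**2]])
```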
In this context, for a given sample size $k$, an estimator of the true parameter $\theta$—that is, the parameter associated with the true probabilistic mechanism generating the observed sample—is defined as a measurable map $U: \mathcal{X}^k \to \Theta$, under the assumption that the probability measure on $\mathcal{X}^k$ is the corresponding $k$-fold product measure determined by $P_\theta$.
2.1. Local Bounds
Let $h_{ij}$ denote the components of the metric tensor associated with the Riemannian metric on $\Theta$, and let $g_{ij}$ denote the components of the information metric on $\Theta$. Consider the Levi–Civita connection corresponding to this metric, and define
$$A = \exp_\theta^{-1}(U), \qquad B = E_\theta[A],$$
where $\exp_\theta^{-1}$ is the inverse of the exponential map induced by this connection (see Appendix A). Observe that $A$ encodes the deviation between the true parameter $\theta$ and its estimate $U$, quantified by the tangent vector at $\theta$ of the geodesic connecting both points, whose length equals the corresponding Riemannian distance. The term $B$ is the expectation of this deviation vector. For simplicity, it is assumed that the estimators $U$ are such that $A$ is defined almost everywhere with respect to the sampling distribution, and $B$ is a smooth vector field on $\Theta$. The existence of such a field is ensured whenever the mean square Riemannian distance exists.
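For intuition about the fields $A$ and $B$, the following small numerical sketch (ours; the unit sphere stands in for a curved parameter space) implements the exponential map and its inverse, checking that the deviation vector $A = \exp_p^{-1}(q)$ has norm equal to the geodesic distance between $p$ and $q$:

```python
# Sketch on the unit sphere S^2 (constant positive curvature), using the
# standard closed-form exp and log maps; notation is illustrative.
import numpy as np

def exp_map(p, v):
    """Riemannian exponential at p applied to tangent vector v."""
    nv = np.linalg.norm(v)
    if nv < 1e-15:
        return p
    return np.cos(nv)*p + np.sin(nv)*v/nv

def log_map(p, q):
    """Inverse exponential: tangent vector at p pointing to q (q != -p)."""
    c = np.clip(np.dot(p, q), -1.0, 1.0)
    theta = np.arccos(c)                    # geodesic distance d(p, q)
    if theta < 1e-15:
        return np.zeros_like(p)
    return theta*(q - c*p)/np.linalg.norm(q - c*p)

p = np.array([0.0, 0.0, 1.0])
q = np.array([1.0, 0.0, 0.0])
A = log_map(p, q)                           # deviation vector at p
print(np.linalg.norm(A), np.arccos(np.dot(p, q)))   # both equal pi/2
print(np.allclose(exp_map(p, A), q))                # True
```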
Let $T_\theta\Theta$ denote the tangent space at $\theta$. For each unit vector $\xi \in T_\theta\Theta$, define
$$c(\xi) = \sup\{\, t > 0 : d(\theta, \gamma_\xi(t)) = t \,\},$$
where $d$ denotes the Riemannian distance and $\gamma_\xi$ is a geodesic defined on an open interval containing zero, satisfying $\gamma_\xi(0) = \theta$ and $\dot\gamma_\xi(0) = \xi$, that is, the tangent vector at $\theta$ is equal to $\xi$. Define
$$\mathcal{D}_\theta = \{\, t\,\xi : \xi \in T_\theta\Theta,\ \|\xi\| = 1,\ 0 \le t < c(\xi) \,\}.$$
It is known that $\exp_\theta$ is a diffeomorphism mapping $\mathcal{D}_\theta$ onto its image (see Hicks [16]). An intrinsic extension of the Cramér–Rao bound is obtained, generalizing the formulation of [7]. The previous result relied exclusively on the information metric, whereas the current framework admits an arbitrary Riemannian metric for the quantification of estimator loss.
Theorem 1 (Riemannian Cramér–Rao lower bound). Let $U$ be an estimator based on a sample of size $k$, corresponding to an $n$-dimensional regular parametric family of density functions. Assume that the parameter manifold $\Theta$ is simply connected and that $U \in \exp_\theta(\mathcal{D}_\theta)$ almost surely, so that the estimator takes values in a normal neighborhood of $\theta$. Suppose further that the mean squared Riemannian distance, with respect to the metric $h$, between the true parameter and the estimator exists for all $\theta$, and that the covariant derivative of the bias field $B$ may be computed by differentiating under the integral sign. Then the lower bound (2) holds, where $\operatorname{div}$ represents the divergence operator. Observe that the divergence of a vector field on a Riemannian manifold is the scalar function that quantifies the net rate at which the vector field flows outward from (or inward toward) a point.
Proof. Let $C$ be any vector field. Applying the Cauchy–Schwarz inequality twice yields an upper bound for the expectation of $\langle A, C\rangle$ in terms of the mean squared norm of $A$, where $\langle\cdot,\cdot\rangle$ and $\|\cdot\|$ denote, respectively, the inner product and the norm defined on each tangent space. Let $C$ be a suitable gradient field, where $\operatorname{grad}$ denotes the gradient operator. Taking expectations and using the repeated index convention, the expectation of $\langle A, C\rangle$ can be expressed through the divergence of the bias field $B$. Combining the resulting identities, the theorem follows. □
Remark 1. We can choose a geodesic spherical coordinate system with origin $\theta$; under this coordinate system, the volume element admits a radial expression, where $g$ is the determinant of the metric tensor. Bishop's comparison theorems (see [17], pp. 71–73) can then be used to estimate the expected divergence of $A$. In the Euclidean case, this divergence equals $n$. When the sectional curvatures are non-positive, the comparison yields a value of at least $n$; finally, when the supremum of the sectional curvatures is positive and the diameter of the manifold satisfies the corresponding restriction, it yields a value of at most $n$. In any case, the estimate holds with a constant $n^*$ in place of $n$, with $n^* \ge n$ or $n^* \le n$ depending on the sign of the sectional curvature.
Corollary 1. Suppose that there is a global chart in which the metric components are those of the Euclidean metric. Identifying the points with their coordinates, the bound of Theorem 1 takes the classical form, where MSE and Bias are the ordinary mean squared error and bias under the assumed global chart, and we use the repeated index summation convention. Proof. It follows straightforwardly from the previous theorem and the facts that $d$ is the Euclidean distance, $A = U - \theta$, and $B$ is the ordinary bias vector. □
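A familiar one-dimensional instance of such bias-corrected bounds is $\mathrm{MSE}(\theta) \ge (1 + b'(\theta))^2/(k\, I(\theta)) + b(\theta)^2$. The following Monte Carlo sketch (the shrinkage estimator $c\bar{X}$ and all constants are our illustrative choices, not the paper's) checks this for the normal mean model, where the linear estimator attains the bound exactly:

```python
# Monte Carlo check of the classical biased Cramer-Rao bound
#   MSE(theta) >= (1 + b'(theta))**2 / (k*I(theta)) + b(theta)**2
# for the estimator c*mean(X) under N(theta, 1), where I(theta) = 1
# and b(theta) = (c - 1)*theta. This linear estimator attains the bound.
import numpy as np

rng = np.random.default_rng(0)
theta, k, c, reps = 1.5, 20, 0.8, 200_000

xbar = rng.normal(theta, 1.0, size=(reps, k)).mean(axis=1)
mse = np.mean((c*xbar - theta)**2)

bias, dbias = (c - 1)*theta, (c - 1)      # b(theta) and b'(theta)
bound = (1 + dbias)**2 / k + bias**2      # = c**2/k + (c-1)**2*theta**2

print(mse, bound)   # both close to 0.8**2/20 + 0.04*2.25 = 0.122
```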
Corollary 2 (Intrinsic Cramér–Rao lower bound). If the loss metric coincides with the information metric, the bound holds with $\rho$, the Rao distance, that is, the Riemannian distance induced by the information metric. In particular, if all the sectional Riemannian curvatures $K$ are bounded from above by a non-positive constant, the bound (5) holds, and if all sectional Riemannian curvatures $K$ are bounded from above by a positive constant and the diameter of the manifold is suitably bounded, the bound (6) holds. Proof. If the Riemannian metric is the Fisher metric, the induced distance is the Rao distance. To prove (5) and (6), see [7]. □
Note that the geometry of the model influences the lower bounds of the Riemannian risk. This influence is intricate, as it depends not only on the Riemannian structure of the parameter space but also on the probability distribution that the estimator induces in that space. For a given bias structure, specified by the bias vector field $B$ and its divergence, the behavior of the lower bounds of the Riemannian risk is determined by the curvature. When the curvature is negative, these bounds tend to increase as the curvature bound decreases (see (5)), while for positive curvature, they tend to decrease as the curvature bound increases, with the diameter of the manifold $\Theta$ playing a significant role in the calculation of this lower bound.
Intuitively, geodesics are the straightest possible paths in a curved space. When sectional curvatures are positive, geodesics starting from a single point initially spread out but eventually begin to converge: positive curvature pulls geodesics together. When the curvature is zero, geodesics starting from a point spread uniformly in all directions; the space is flat, and the geodesics neither attract nor repel each other. When the curvature is negative, geodesics starting from a point diverge from one another: even if they start close together, they spread apart rapidly as one moves along them, since negative curvature pushes geodesics away from each other. This behavior has consequences for the intrinsic risk. When the model has positive sectional curvatures, the risk can decrease, since the geodesics bend together and estimators might behave more similarly across nearby parameters, while when the sectional curvatures are negative, the risk can increase, since the geodesics spread apart and estimators may differ more across the space.
2.2. Global Bounds
It is well known that, for a general loss function, there is no estimator whose risk function is uniformly smaller than that of every other estimator. Consequently, given a particular estimator, it is natural to assess its performance over a specified region of the statistical model by integrating its risk function over that region and normalizing the result by the corresponding Riemannian volume. In what follows, the square of the Rao distance is adopted as the loss function and the Riemannian metric is taken to be the Fisher information metric. This setting corresponds to the intrinsic analysis framework developed in [7].
Let $W \subseteq \Theta$ be a measurable subset satisfying $0 < V(W) < \infty$, where $V$ denotes the Riemannian measure. The Riemannian average of the mean squared Rao distance is defined as
$$\frac{1}{V(W)} \int_W E_\theta\!\left[\rho^2(U, \theta)\right] dV(\theta).$$
The resulting performance index represents a weighted average of the mean squared Rao distance. This formulation is compatible with a Bayesian perspective: a uniform prior with respect to the Riemannian volume can be regarded as a noninformative prior; see [18]. Furthermore, as shown in [19], when the parameter space is a locally compact topological group, the corresponding Riemannian volume coincides, up to a multiplicative constant, with a left-invariant Haar measure. In general, this volume is invariant under any group that leaves the parametric family of densities unchanged.
In the first part of the article, lower bounds for this global index are derived on geodesic balls of radius $R$ centered at a point $\gamma$.
An alternative measure of global estimator performance is given by the maximum risk over a region of the parameter space,
$$\sup_{\theta \in W} E_\theta\!\left[\rho^2(U, \theta)\right],$$
corresponding to the minimax approach. The final part of the paper is devoted to the derivation of lower bounds for this maximum risk.
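As a toy illustration of the two indices (the model, region, and estimator family are our illustrative choices, not the paper's), consider the normal mean model $N(\theta, 1)$ with the Euclidean Fisher geometry, the region $W = [-a, a]$, and the shrinkage family $c\bar{X}$, whose risk $c^2/k + (1-c)^2\theta^2$ gives both indices in closed form:

```python
# Average and maximum risk over W = [-a, a] for the estimator c*mean(X)
# under N(theta, 1): risk(theta) = c**2/k + (1 - c)**2 * theta**2.
# Averaging theta**2 over [-a, a] gives a**2/3; its maximum is a**2.
k, a = 10, 0.5

def avg_risk(c):
    return c**2/k + (1 - c)**2 * a**2/3

def max_risk(c):
    return c**2/k + (1 - c)**2 * a**2

for c in (1.0, 0.9, 0.8):          # c = 1 is the unbiased sample mean
    print(c, avg_risk(c), max_risk(c))
# For a small enough region, some c < 1 beats the unbiased estimator
# on both the average and the maximum risk over W.
```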
3. Variational Methods to Obtain Global Bounds
The local bounds established in Corollary 2 indicate that the expected squared Rao distance between the true probabilistic mechanism generating the sample and its corresponding estimates is bounded from below by a quantity depending on the intrinsic bias structure of the estimator.
Global bounds can be obtained by using variational methods. A study in this direction was previously conducted in [2]. The approach consists in integrating the local bounds for the mean squared Rao distance, as derived above, under the assumption that the Riemannian metric coincides with the Fisher information metric, over a submanifold $W$ with boundary $\partial W$. Specifically, this integration produces a functional of the bias field involving a comparison constant $n^*$, where $n^* = n$ if the sectional curvatures are nonpositive and $n^*$ depends on the curvature bound otherwise. The functional depends solely on the vector field $B$, and the problem reduces to finding the vector field $B$ that minimizes it. Since the minimization is performed over a class of vector fields larger than that of smooth bias fields, the resulting minimum provides a lower bound for the average of the mean squared Rao distance.
Observe that the expression (2) yields a pointwise lower bound for the intrinsic risk, whose dependence on the estimator's bias is immediately evident. Allowing for a non-negligible bias may lead to an artificial reduction of the risk—whether classical or intrinsic—but only at the expense of increasing the bias itself. This trade-off would at best indicate satisfactory performance for a specific probabilistic mechanism, corresponding to a single point in the parameter space $\Theta$, while typically resulting in poor performance over a substantial region of the parameter space, primarily due to the growth of the bias. The minimization of (2) thus emerges naturally when considering this pointwise intrinsic bound. To assess the performance of an estimator over an entire region of the parameter space, it is therefore reasonable to consider estimators with various bias structures, since, in principle, biased estimators may outperform unbiased ones when evaluated over a given region. The problem may then be formulated as follows: among all estimators exhibiting a prescribed form of bias, determine the bias structure that minimizes a lower bound on the risk over the region of interest. In particular, this formulation leads to a relatively simple variational problem, considerably more tractable than the one posed directly in terms of the field $A$.
In applications, the integration region $W$ should be selected as the subset of $\Theta$ within which the true probabilistic mechanism that generates the data is expected to lie. Consequently, since this region can be chosen arbitrarily, it will be chosen so that any well-behaved estimator is expected to exhibit a low or vanishing risk on the boundary $\partial W$, and consequently a small or negligible squared norm of the bias vector, since the squared norm of the bias is dominated by the risk. Hence, $W$ is chosen so that the bias values on its boundary can be considered zero or negligible. These assumptions can reasonably be made within a broad class of statistical models. In situations where they fail to hold—typically because the true parameter value lies on the boundary of $\Theta$—the boundary conditions must be modified appropriately to reflect the specific structure of the model under consideration.
Lemma 1. The field $B$ minimizes the functional if and only if it satisfies condition (9), and the minimum value is given by (10), where $B$ satisfies (9) and $d\sigma$ denotes the element of the induced surface area on $\partial W$.

Proof. Consider the first variation of the functional at $B$ in the direction of an arbitrary smooth vector field $Y$. A direct computation yields the Gâteaux variations of the functional, which show that the functional is minimized at any point $B$ for which these variations vanish. In addition, since the integral term in (11) is strictly positive for every nonzero smooth vector field $Y$, the functional is strictly convex; see, for example, [20]. Consequently, every stationary point is necessarily a global minimizer. Using the identity (12), the stationarity condition can be rewritten and, by the Gauss divergence theorem, expressed as the vanishing of an interior integral together with a boundary integral, where $d\sigma$ denotes the Riemannian measure induced on $\partial W$ and $\nu$ is the outward unit normal vector field on $\partial W$. Equation (9) follows from the fact that the preceding equality holds for all $Y$.

For the second part of the proposition, applying condition (9) together with (12) and substituting into the functional yields the first equality in (10). From the second stationarity condition in (9), the boundary contribution vanishes, and another application of the Gauss divergence theorem gives the second equality in (10). □
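Since the proof invokes the Gauss divergence theorem twice, a quick numerical sanity check of that identity on the Euclidean unit disk may be useful (the vector field is an arbitrary illustrative choice):

```python
# Numerical check of the Gauss divergence theorem on the Euclidean unit disk:
#   integral_W div(X) dV = integral over the boundary of <X, nu> dsigma,
# with the illustrative field X(x, y) = (x**2, y), so div X = 2*x + 1.
import numpy as np
from scipy import integrate

# Volume side: integrate div X over the disk in polar coordinates.
vol, _ = integrate.dblquad(
    lambda r, t: (2*r*np.cos(t) + 1) * r,   # div X times the area element r
    0, 2*np.pi, 0, 1)

# Boundary side: the outward normal on the unit circle is (cos t, sin t).
flux, _ = integrate.quad(
    lambda t: np.cos(t)**3 + np.sin(t)**2, 0, 2*np.pi)

print(vol, flux)   # both equal pi
```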
Remark 2. The minimal value of the functional depends solely on the divergence of the optimal field, $\operatorname{div} B$. Let $\psi = \operatorname{div} B$. It follows from condition (9) that $\psi$ satisfies a boundary value problem, where $\Delta$ denotes the Laplace–Beltrami operator associated with the Riemannian metric on $W$. We have obtained an explicit solution to this problem in the case where $W$ is the geodesic ball of radius $R$, under the assumption of constant sectional curvature $K$.

Theorem 2. Let the parametric statistical model be a Riemannian manifold with constant sectional curvature $K$. Then, for geodesic balls centered at $\gamma$ of radius $R$ less than the injectivity radius at $\gamma$, and satisfying the convergence condition appearing in the proof, the average of the mean squared Rao distance satisfies the lower bound (15), with the constants and the comparison function defined in (16).

Proof. By symmetry and uniqueness, the solution of the boundary value problem in the geodesic ball depends only on the geodesic distance to its center. Using geodesic spherical coordinates with origin at the center of the geodesic ball, the Riemannian volume element factorizes radially (see Appendix A). Hence the boundary value problem reduces to an ordinary differential equation in the radial variable. After a change of variable, the equation takes a form amenable to a power series treatment. Assuming a power series expansion $\psi(r) = \sum_{m \ge 0} a_m r^{2m}$, with $a_0 \ne 0$, and substituting into the equation, we obtain a recurrence relation for the coefficients. For $m \ge 1$, this recurrence determines $a_m$ in terms of $a_{m-1}$, and consequently the full series in terms of $a_0$, which is determined from the boundary condition at $r = R$. It is straightforward to verify that this series converges under the radius condition of the statement, which holds automatically for nonnegative sectional curvature.

To compute the minimal value, note that in spherical coordinates (see Appendix A) the integral over the ball reduces to a one-dimensional radial integral involving the area of the $n$-dimensional unit radius sphere $S$. Evaluating this integral with the series solution yields the stated bound. □
Corollary 3. When the parametric statistical model is a Euclidean manifold, the lower bound (18) holds for the Riemannian average of the mean squared Rao distance on a ball centered at $\gamma$ of radius $R$ less than the injectivity radius at $\gamma$, where ${}_0F_1$ denotes the confluent hypergeometric limit function; see (A2) in Appendix A. Moreover, if the Euclidean manifold $\Theta$ is complete and simply connected, then the lower bound (19) holds globally. Proof. The result follows directly as a particular case of Equation (15) with constant sectional curvature $K = 0$. The second assertion is obtained by taking the limit $R \to \infty$ in Equation (18). □
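Our reading of the series recurrence in the proof of Theorem 2 (stated here as an assumption, since the displayed equations are omitted) is that in the Euclidean case the radial problem is $\psi'' + \frac{n-1}{r}\psi' = \lambda\psi$, whose solution regular at the origin is $\psi(r) = {}_0F_1(; n/2; \lambda r^2/4)$, consistent with the ${}_0F_1$ appearing in the corollary. A quick numerical check:

```python
# Check that psi(r) = 0F1(; n/2; lam*r**2/4) satisfies the radial equation
#   psi'' + (n - 1)/r * psi' = lam * psi
# (the Euclidean radial form we assume underlies the 0F1 in Corollary 3).
import mpmath as mp

n, lam = 3, 5.0
psi = lambda r: mp.hyp0f1(n/2, lam*r**2/4)

r = mp.mpf('0.7')
lhs = mp.diff(psi, r, 2) + (n - 1)/r * mp.diff(psi, r)
rhs = lam * psi(r)
print(lhs, rhs)   # agree to high precision
```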
Note again that the geometry of the model influences the lower bounds of the Riemannian risk—both in its local and integral forms—when these bounds are extended over a region of the parameter space, and that it does so in a non-trivial manner, a fact that merits further investigation.
Example 1. Consider the $n$-variate normal distribution with known covariance matrix $\Sigma$. For a sample of size $k$, the Riemannian risk—measured as the mean squared Rao distance—associated with the sample mean coincides with the lower bound (19) derived in the preceding corollary. Observe that in this case the parameter space is $\mathbb{R}^n$. It is well known that this model is invariant under the action of a subgroup of the affine group that leaves the sample variance–covariance matrix unchanged. In this setting, the unique equivariant estimator throughout the parameter space is the sample mean $\bar{X}$. The induced group acting on the parameter space is, in this case, transitive and commutative, implying that the estimator is intrinsically unbiased; see [21], and, in the context of intrinsic analysis, [22]. The intrinsic risk of $\bar{X}$ is $n/k$, which coincides with the intrinsic Cramér–Rao bound [7]. However, in a geodesic ball of radius $R$, an estimator may achieve an integrated intrinsic risk strictly smaller than this value, provided that estimators with appropriate bias are allowed, although this quantity tends to the unrestricted bound as $R \to \infty$, as we naturally expect. This is consistent with the existence of shrinkage estimators of the James–Stein type; see [23]. In the present example, though, such estimators cannot be equivariant under the group action. Figure 1a,b illustrate these phenomena in two representative cases. The methods presented above apply to any statistical model satisfying standard regularity conditions, independently of the particular Riemannian geometry induced by the information metric. Moreover, they remain valid regardless of the probability distribution that the estimator induces on the parameter space. The only aspect that may vary is the complexity of the resulting calculations.
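The risk value $n/k$ can be verified by simulation (a sketch; $\Sigma$, $\mu$, $n$, and $k$ are arbitrary illustrative choices): with the per-observation Fisher metric, the squared Rao distance between two mean vectors is $(\mu_1 - \mu_2)^{\top}\Sigma^{-1}(\mu_1 - \mu_2)$, and the sample mean has intrinsic risk $n/k$.

```python
# Monte Carlo check that the intrinsic risk of the sample mean of an
# n-variate normal with known covariance Sigma equals n/k, where
#   rho(mu1, mu2)**2 = (mu1 - mu2)' Sigma^{-1} (mu1 - mu2).
import numpy as np

rng = np.random.default_rng(1)
n, k, reps = 3, 25, 100_000

A = rng.normal(size=(n, n))
Sigma = A @ A.T + n*np.eye(n)     # an arbitrary positive definite covariance
Sinv = np.linalg.inv(Sigma)
mu = rng.normal(size=n)

xbar = rng.multivariate_normal(mu, Sigma/k, size=reps)   # law of the mean
diff = xbar - mu
rho2 = np.einsum('ri,ij,rj->r', diff, Sinv, diff)

print(rho2.mean(), n/k)   # both close to 0.12
```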
In the univariate case ($n = 1$), the parameter manifold is Euclidean, and the corresponding expression simplifies to a closed form which agrees with the result originally obtained by Chentsov [2].
We now examine the Euclidean case in Cartesian coordinates. Let us fix a coordinate system with origin at an arbitrary point and consider a cube centered at the origin. In this setting, the corresponding variational problem reduces to solving a Dirichlet boundary value problem on the cube. Looking for a solution in separated form, with real-valued functions each depending on a single coordinate, the problem reduces to a one-dimensional equation. A convenient particular solution is built from a function $g$ satisfying a linear second-order ordinary differential equation with constant coefficients, whose unique solution is given by a hyperbolic-cosine profile. Substituting this expression into the definition of the functional yields a lower bound which constitutes an improvement over the result obtained by Chentsov [2].
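The one-dimensional reduction indicated above is a constant-coefficient two-point problem. A symbolic sketch of its hyperbolic-cosine solution (the constants $c$, $a$, and the boundary value $g_0$ are placeholders, since the exact boundary data come from the omitted displays) is:

```python
# Illustrative solution of the 1D reduction g'' = c*g on [-a, a] with
# symmetric boundary values g(+-a) = g0 (all constants are placeholders):
#   g(x) = g0 * cosh(sqrt(c)*x) / cosh(sqrt(c)*a).
import sympy as sp

x = sp.symbols('x', real=True)
c, a, g0 = sp.symbols('c a g0', positive=True)

g = g0 * sp.cosh(sp.sqrt(c)*x) / sp.cosh(sp.sqrt(c)*a)

assert sp.simplify(sp.diff(g, x, 2) - c*g) == 0   # satisfies the ODE
assert sp.simplify(g.subs(x, a) - g0) == 0        # boundary condition at +a
assert sp.simplify(g.subs(x, -a) - g0) == 0       # boundary condition at -a
```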
Furthermore, by Corollary 1, a similar inequality can be established in the general non-Euclidean case (with a fixed coordinate system). Specifically, the averaged mean squared error satisfies an analogous bound, in which the constant involved denotes an upper bound of the corresponding metric coefficients within the region.
Analogous lower bounds can also be obtained for more general settings.
Theorem 3. Let the parametric statistical model be represented by a Riemannian manifold whose sectional curvatures are bounded from above by a constant $\lambda$. Then the average of the mean squared Rao distance satisfies a lower bound expressed in terms of the area of the boundary of an $n$-dimensional geodesic ball centered at $\gamma$ of radius $R$ less than the injectivity radius at $\gamma$, its corresponding volume, and the solution of the boundary value problem (17) on a manifold of constant sectional curvature $\lambda$.

Proof. Using geodesic spherical coordinates, let $\psi$ denote the solution to the boundary value problem (17) on the manifold with sectional curvatures bounded above by $\lambda$, and let $\psi_\lambda$ denote the solution to the same problem on a manifold of constant sectional curvature $\lambda$. By Bishop's comparison theorem, the radial volume densities of the two manifolds are comparable, which yields a differential inequality relating $\psi$ to the Laplace–Beltrami operator $\Delta_\lambda$ of the constant-curvature manifold. Since this inequality holds for all radii $r < R$, the comparison theorem for elliptic differential equations (see [24], Theorem 6, p. 243) provides a pointwise comparison between $\psi$ and $\psi_\lambda$, with equality on the boundary. Using (9) and (13), we then obtain the claimed bound. □
Remark 3. Estimates for the volumes of geodesic balls provided in the Appendix are instrumental in obtaining explicit expressions for the lower bounds derived above. In particular, if the sectional curvatures of the parametric manifold are bounded from below by $\kappa$ and from above by $\lambda$, then, according to Proposition A3, the ratio between the area and the volume of the geodesic ball of radius $R$ satisfies two-sided estimates in terms of the comparison function associated with the curvature bounds, as defined in (16).

4. Lower Bounds for the Maximum Risk
Although one may employ the Riemannian average of the risk to obtain bounds on the maximum risk, alternative minimax bounds can be derived by a more direct argument, as shown below.
Lemma 2. Let $X$ be a smooth vector field on the parameter manifold $\Theta$ satisfying a suitable normalization, let $f$ be a nonnegative smooth function on $\Theta$, and let $W \subseteq \Theta$ be a submanifold with a smooth boundary $\partial W$. Then inequality (21) holds, where $d\sigma$ denotes the element of the induced surface area on $\partial W$.

Theorem 4. Let $U$ be an estimator taking values in a submanifold $W$ with smooth boundary. Then the maximum risk of $U$ over $W$ satisfies inequality (22), and therefore the right-hand side of (22) is a lower bound for the risk of the local minimax estimator on $W$.

Proof. Integrating inequality (21) with respect to the Riemannian measure and applying Fubini's theorem yields the stated bound. □
Corollary 4. When the parameter manifold is Euclidean, given a geodesic ball centered at $\gamma$ of radius $R$ less than the injectivity radius at $\gamma$, the lower bound (23) holds for the local minimax risk. If the Euclidean manifold $\Theta$ is complete and simply connected, a corresponding global lower bound follows. Proof. Computing the boundary-area and volume terms of the Euclidean geodesic ball and substituting them into Theorem 4 yields the stated inequality. The global bound follows by taking the limit $R \to \infty$. □
The lower bounds for the local minimax risk in geodesic balls of radius $R$ in Example 1 are calculated using (23) and displayed graphically in Figure 2a,b.
Another lower bound for the integrated Riemannian risk is stated in the following theorem.
Theorem 5. The following lower bound holds for the Riemannian average of the mean squared Rao distance over the geodesic ball centered at $\gamma$ of radius $R$ less than the injectivity radius at $\gamma$: inequality (24), where the comparison constant equals $n$ when the sectional curvatures are nonpositive and takes a curvature-dependent value when the sectional curvatures are bounded above by a positive constant.

Proof. Consider inequality (22) with the mean squared Rao distance as the function and the geodesic ball as the region, and integrate with respect to the Riemannian measure on the ball. The volume of the geodesic ball, as a function of its radius, is positive and increases monotonically. Using this monotonicity, we obtain the stated bound, which completes the proof. □
Corollary 5. When the parametric statistical model is a Euclidean manifold, the Riemannian average of the mean squared Rao distance over the geodesic ball centered at $\gamma$ of radius $R$ less than the injectivity radius at $\gamma$ satisfies an explicit lower bound; if the Euclidean manifold $\Theta$ is complete and simply connected, a corresponding lower bound holds over the entire manifold.

Proof. For a Euclidean manifold of dimension $n$, the volume of the geodesic ball of radius $r$ is
$$V(r) = \frac{\pi^{n/2}}{\Gamma\!\left(\frac{n}{2} + 1\right)}\, r^n.$$
Substituting this expression into inequality (24) yields the first bound. The second assertion follows by taking the limit $R \to \infty$. □
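For reference, the ball volume used in the proof and the resulting area-to-volume ratio $A(r)/V(r) = n/r$ (the Euclidean case of the estimate in Remark 3) can be tabulated with a few lines (a standard computation, not code from the paper):

```python
# Volume V(r) = pi**(n/2) * r**n / Gamma(n/2 + 1) of the Euclidean n-ball
# and the boundary-area-to-volume ratio A(r)/V(r) = V'(r)/V(r) = n/r.
from math import pi, gamma

def ball_volume(n, r):
    return pi**(n/2) * r**n / gamma(n/2 + 1)

def area_volume_ratio(n, r):
    return n / r      # since A(r) = dV/dr = n*V(r)/r in Euclidean space

r = 2.0
for n in (1, 2, 3, 5):
    print(n, ball_volume(n, r), area_volume_ratio(n, r))
# n = 2, r = 2: V = 4*pi and A/V = 1 (circumference 4*pi over area 4*pi).
```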
5. Concluding Remarks
This article introduces two performance indices for statistical estimators restricted to a prescribed region of the parameter space. These indices are based on the intrinsic risk function developed in [7]. Specifically, for a bounded region $W$ of the parameter space, we consider (i) the integral of the intrinsic risk over $W$ with respect to the associated Riemannian volume measure, and (ii) the maximum value of the intrinsic risk attained within $W$. Both criteria are compatible with Bayesian methodologies.
In this framework, if it is known a priori that the true parameter belongs to a given region W, biased estimators can be constructed that outperform unbiased estimators in terms of either criterion. When there is strong practical evidence that the parameter lies within a sufficiently small region, allowing controlled bias may be preferable whenever it reduces the average Riemannian risk or the worst-case risk over W.
A representative example is examined in which the statistical model is the multivariate normal distribution with known covariance matrix and a simple random sample of size $k$. Numerical illustrations (Figure 1 and Figure 2) show that restricting attention to a region $W$ allows the construction of estimators with strictly smaller integrated risk or smaller maximal risk than classical unbiased estimators, at the cost of introducing a region-dependent bias term. The magnitude and behavior of these improvements depend on the underlying Riemannian geometry induced by the statistical model, suggesting several directions for further investigation.
When the model is invariant under the action of a transformation group $G$, coherence considerations require restricting attention to equivariant estimators. In this setting, the corresponding intrinsic risk must remain constant along the orbits of the induced action in the parameter space. As shown in [21], if an intrinsically unbiased estimator exists that uniformly minimizes the Riemannian risk, then this estimator must be equivariant. The converse does not hold: equivariance, together with uniform optimality, does not, in general, imply intrinsic unbiasedness. Additional structural assumptions—such as the transitivity of the $G$-action and the commutativity of the induced action on the parameter space—are required to recover this implication. In that case, the intrinsically unbiased estimator that uniformly minimizes the Riemannian risk is preferable.