The Distribution and Quantiles of the Sample Mean from a Stationary Process

Christopher S. Withers

doi:10.3390/axioms14060406

Callaghan Innovation (Formerly Industrial Research Ltd.), 101 Allington Road, Wellington 6012, New Zealand

Axioms2025, 14(6), 406;https://doi.org/10.3390/axioms14060406

This article belongs to the Special Issue New Perspectives in Mathematical Statistics

Version Notes

Order Reprints

Abstract

Edgeworth–Cornish–Fisher expansions are hugely important, as they give the distribution, density and quantiles of any standard estimate. Here we show that the sample mean of a univariate or multivariate stationary process is a standard estimate, so that all the known results for standard estimates can be applied. We also show how to allow for missing data and weighted means.

Keywords:

Edgeworth–Cornish–Fisher expansions; sample mean; stationary process

MSC:

62E20

1. Introduction and Summary

Finding the distribution, density and quantiles of an estimate is of great importance. This has been accomplished for standard estimates by extending the expansions of Edgeworth, Cornish, and Fisher. In this section we summarize these results for a univariate standard estimate.

Section 2 gives our main result: we show that the mean of a sample from a univariate stationary process satisfies a special truncated form of the cumulant expansion (1) below, so that all the results of this section can be applied. It also considers the case where observations are not sequential, as for the case of missing data. And it considers unbiased weighted sample means.

In Section 4, we extend this to the mean of a sample from a multivariate stationary process, after summarising the multivariate Edgeworth expansions for a standard estimate in Section 3.

We now summarise the known results for the distribution and quantiles of a univariate standard estimate. Let

\hat{w}

be an estimate of an unknown

w \in R

based on a sample of size n. References [1,2] gave expansions in

n^{- 1 / 2}

for its distribution and quantiles when its cumulants satisfied an artifical assumption removed by [3]. We call

\hat{w}

a standard estimate with respect to n, if

E \hat{w} \to w

as

n \to \infty

, and its rth cumulant can be expanded as

\begin{matrix} κ_{r} (\hat{w}) \approx \sum_{i = r - 1}^{\infty} a_{r i} n^{- i}, r \geq 1, \end{matrix}

(1)

where ≈ indicates an asymptotic expansion that need not converge. For example, in [4], this holds if

\hat{w}

is a smooth function of the mean of independent identically distributed (i.i.d.) random variables. The cumulant coefficients of

\hat{w}, a_{r i}

, may depend on n but must be bounded as

n \to \infty

, and

a_{21}

must be bounded away from 0 as

n \to \infty

. For

\hat{w}

non-lattice, the distribution, density and quantiles of

Y_{n} = {(n / a_{21})}^{1 / 2} (\hat{w} - w)

have asymptotic expansions in powers of

n^{- 1 / 2}

:

\begin{matrix} P_{n} (x) = P r o b . (Y_{n} \leq x) \approx Φ (x) - ϕ (x) \sum_{r = 1}^{\infty} h_{0 r} (x) n^{- r / 2}, \end{matrix}

(2)

\begin{matrix} p_{n} (x) = d P_{n} (x) / d x \approx ϕ (x) [1 + \sum_{r = 1}^{\infty} h_{1 r} (x) n^{- r / 2}], \end{matrix}

(3)

\begin{matrix} Φ^{- 1} (P_{n} (x)) \approx x - \sum_{r = 1}^{\infty} f_{r} (x) n^{- r / 2}, P_{n}^{- 1} (Φ (x)) \approx x + \sum_{r = 1}^{\infty} g_{r} (x) n^{- r / 2}, \end{matrix}

(4)

where

Φ

and

ϕ

are the distribution and density of a unit normal random variable

N \sim N (0, 1)

, and

h_{k r} (x), f_{r} (x), g_{r} (x)

are polynomials of degrees

k + 3 r - 1, r + 1, r + 1

both in x and in the standardized cumulant coefficients

\begin{matrix} A_{r i} = a_{r i} / a_{21}^{r / 2} . \end{matrix}

(5)

In practice, one truncates these expansions for the distribution, density and quantiles of

\hat{w}

. The leading

h_{k r} (x), f_{r} (x), g_{r} (x)

given below suffice to see if divergence has begun. For

k \geq 0,

the kth Hermite polynomial is

\begin{matrix} H_{k} = H_{k} (x) = ϕ {(x)}^{- 1} {(- d / d x)}^{k} ϕ (x) = E {(x + i N)}^{k} where i = \sqrt{- 1} : \\ H_{0} = 1, H_{1} = x, H_{2} = x^{2} - 1, H_{3} = x^{3} - 3 x, H_{4} = x^{4} - 6 x^{2} + 3, \\ H_{5} = x^{5} - 10 x^{3} + 15 x, H_{6} = x^{6} - 15 x^{4} + 45 x^{2} - 15, \end{matrix}

(6)

and so on. See [5] for (6). For the expansions to

O (n^{- 5 / 2})

of (2) and (4), see [3].

h_{k r} (x)

of (2), (3) has the form

\begin{matrix} h_{k r} (x) = \sum [P_{r j} H_{k + j - 1} : 1 \leq j \leq 3 r, r - j even] \end{matrix}

where

P_{r j}

are polynomials in

{A_{r j}}

of (5), given for

r \leq 4

in the appendix of [6], or easily derived from [3]. The terms needed in expansions (2)–(4) to

O (n^{- 3 / 2})

are

\begin{matrix} h_{01} (x) = f_{1} (x) = g_{1} (x) = A_{11} + A_{32} H_{2} / 6, h_{11} (x) = A_{11} H_{1} + A_{32} H_{3} / 6, \\ h_{k 2} (x) = (A_{11}^{2} + A_{22}) H_{1 + k} / 2 + (A_{43} + 4 A_{11} A_{32}) H_{3 + k} / 24 + A_{32}^{2} H_{5 + k} / 72, \end{matrix}

(7)

\begin{matrix} f_{2} (x) = (A_{22} / 2 - A_{11} A_{32} / 3) H_{1} + A_{43} H_{3} / 24 - A_{32}^{2} (4 x^{3} - 7 x) / 36, \\ g_{2} (x) = A_{22} H_{1} / 2 + A_{43} H_{3} / 24 - A_{32}^{2} (2 x^{3} - 5 x) / 36 . \end{matrix}

(8)

By [7], the log density has a simpler form than the density:

\begin{matrix} ln [p_{n} (x) / ϕ (x)] = \sum_{r = 1}^{\infty} b_{r} (x) n^{- r / 2} where b_{1} (x) = h_{11} (x), \\ b_{2} (x) = - A_{11}^{2} / 2 + (A_{22} - A_{32} A_{11}) H_{2} / 2 + A_{43} H_{4} / 24 - A_{32}^{2} (3 x^{4} - 12 x^{2} + 5) / 24 . \end{matrix}

(9)

For

r > 1, b_{r} (x)

is a polynomial of an order only

r + 2

, while

h_{1 r} (x)

is of an order

3 r

. See [6] for other

h_{k r} (x)

and

b_{r} (x)

. If

E \hat{w} = w

, then

A_{11} = 0

so that (2)–(4) and (9) hold with

\begin{matrix} h_{01} (x) = f_{1} (x) = g_{1} (x) = b_{1} (x) = A_{32} H_{2} / 6, h_{11} (x) = A_{32} H_{3} / 6, \\ h_{k 2} (x) = A_{22} H_{1 + k} / 2 + A_{43} H_{3 + k} / 24 + A_{32}^{2} H_{5 + k} / 72 for k = 0, 1, \end{matrix}

(10)

\begin{matrix} f_{2} (x) = A_{22} H_{1} / 2 + A_{43} H_{3} / 24 - A_{32}^{2} (4 x^{3} - 7 x) / 36, \\ b_{2} (x) = A_{22} H_{2} / 2 + A_{43} H_{4} / 24 - A_{32}^{2} (3 x^{4} - 12 x^{2} + 5) / 24, \end{matrix}

(11)

and

g_{2} (x)

of (8) is unchanged.

Note 1.

The original Edgeworth expansion was for

\hat{w}

the mean of n i.i.d. random variables from a distribution with the rth cumulant

κ_{r}

. So (1) holds with

a_{r, r - 1} = κ_{r}, a_{r i} = 0, i \geq r

. An explicit formula for its general term was given in [8].

Edgeworth–Cornish–Fisher expansions were first extended to general parametric and non-parametric standard estimates in [3,4] and to functions of them in [9].

In [10], we gave the extended Edgeworth-Cornish-Fisher expansions for smooth functions of the sample cross-moments of a linear process. We now show that this extends easily to a stationary process.

2. The Cumulants of a Stationary Sample Mean

When

\hat{w}

is the mean of i.i.d. random variables, the cumulant expansion (1) has only one term. Remarkably, for the sample mean of a stationary process, its cumulant expansion has exactly two terms.

Suppose that

\bar{X}

is the mean of a sample

X_{1}, \dots, X_{n}

from a real stationary process

{X_{i}}

. So

\bar{X}

is an unbiased estimate of

μ = E X_{0}

. We now show that its rth cumulant has the form

\begin{matrix} κ_{r} (\bar{X}) = a_{n r, r - 1} n^{1 - r} + a_{n r r} n^{- r}, r > 1, \end{matrix}

(12)

where

a_{n r i}

are bounded as n increases, and

a_{n 21}

is bounded away from 0. This makes it a special case of a standard estimate, so that Section 1 applies with

a_{r i} = a_{n r i}

for

i = r - 1

and

i = r

, and

a_{r i} = 0

for

i > r

and

a_{11} = 0

.

If

a_{n r i} = a_{r i} + O (e^{- n λ_{r}})

where

λ_{r i} > 0,

then

a_{n r i}

can be replaced by

a_{r i} .

Here,

x_{n} = O (y_{n})

means that

x_{n} / y_{n}

is bounded.

Suppose that the stationary process has cross-cumulants,

\begin{matrix} k (i_{1} \dots i_{r}) = κ_{i_{1}, \dots, i_{r}} = κ (X_{i_{1}}, \dots, X_{i_{r}}) . \end{matrix}

(13)

Given integers

i_{1}, \dots, i_{r},

set

\begin{matrix} i_{0} = {min}_{k = 1}^{r} i_{k}, I_{k} = i_{k} - i_{0} \geq 0, I_{0} = {max}_{k = 1}^{r} I_{k} = {max}_{k = 1}^{r} i_{k} - i_{0} . \end{matrix}

(14)

Since

{X_{i}}

is stationary,

\begin{matrix} k (i_{1} \dots i_{r}) = k (I_{1} \dots I_{r}) . \end{matrix}

(15)

These are not changed by permuting subscripts. Also, at least one

I_{k}

is zero. For

r \geq 2,

transforming from

i_{k}

to

T_{k} = i_{k} - i_{1}

for

k = 2, \dots, r

,

\begin{matrix} n^{r} κ_{r} (\bar{X}) = \sum_{i_{1}, \dots, i_{r} = 1}^{n} k (i_{1} \dots i_{r}) = \sum_{| T_{k} | < n, k = 2, \dots, r} [n - δ_{r} (T)] k (0 T_{2} \dots T_{r}) \end{matrix}

(16)

\begin{matrix} where δ_{r} (T) = max (0, T_{2}, \dots, T_{r}) - min (0, T_{2}, \dots, T_{r}) . \end{matrix}

(17)

For example,

δ_{2} (T) = | T_{2} |

,

\begin{matrix} δ_{3} (T) = T_{3} I (0 \leq T_{2} < T_{3}) + (T_{3} - T_{2}) I (T_{2} \leq 0 < T_{3}) - T_{2} I (T_{2} < T_{3} \leq 0) . \end{matrix}

So for

r \geq 2, κ_{r} (\bar{X}) = \sum_{i = r - 1}^{r} a_{n r i} n^{- i}

, where

\begin{matrix} a_{n 21} = \sum_{| T | < n} k (0 T), a_{n r, r - 1} = \sum_{| T_{i} | < n, i = 2, \dots, r} k (0 T_{2} \dots T_{r}), \end{matrix}

(18)

\begin{matrix} a_{n r r} = - \sum_{| T_{i} | < n, i = 2, \dots, r} δ_{r} (T) k (0 T_{2} \dots T_{r}) . \end{matrix}

(19)

This proves that (12) holds. So the Edgeworth–Cornish–Fisher expansions (2)–(4) and (9)–(11) apply to

(\hat{w}, w) = (\bar{X}, μ)

with

a_{r i}

in (1) replaced by these

a_{n r i}

, so that

A_{r i} = a_{n r i} / a_{n 21}^{r / 2} .

If the cross-cumulants

k (0 T_{2} \dots T_{r})

decrease exponentially in

T_{2}

, as is true for a stationary ARMA process by [10], then for

r \geq 2

,

\begin{matrix} a_{n r, r - 1} = a_{r, r - 1} + O (e^{- n λ_{r}}), a_{n r r} = a_{r r} + O (e^{- n λ_{r}}) where λ_{r} > 0, \\ a_{21} = \sum_{| T | < \infty} k (0 T), a_{r, r - 1} = \sum_{| T_{i} | < \infty, i = 2, \dots, r} k (0 T_{2} \dots T_{r}), \end{matrix}

(20)

\begin{matrix} a_{r r} = - \sum_{| T_{i} | < \infty, i = 2, \dots, r} δ_{r} (T) k (0 T_{2} \dots T_{r}), \end{matrix}

(21)

so that for

X_{0}

non-lattice, these Edgeworth–Cornish–Fisher expansions apply to

(\hat{w}, w) = (\bar{X}, μ)

with these

a_{r i}

. According to [3] or the appendix to [6], (10)–(8),

h_{k r}, f_{r}, g_{r}

simplify for

r = 3, 4

. This expands the results given in [10] for a linear process.

For convergence in law of

n^{1 / 2} (\bar{X} - μ)

to

N (0, a_{21})

with

a_{21}

of (20), under mixing conditions on a stationary process, see Sections 18.4 and 18.5 of [11]. They also show how to express

a_{21 n}

and

a_{21}

in terms of the spectral distribution and density.

Missing values. Now suppose that we only have observations at times

t_{1}, \dots, t_{n}

. Our estimate of

μ = E X_{0}

is then

\begin{matrix} {\hat{μ}}_{t} = n^{- 1} \sum_{i = 1}^{n} X_{t_{i}} . So E {\hat{μ}}_{t} = μ, and for S_{k} = t_{i_{k}} - t_{i_{1}}, r \geq 2, \\ n^{r} κ_{r} ({\hat{μ}}_{t}) = \sum_{i_{1}, \dots, i_{r} = 1}^{n} k (t_{i_{1}} \dots t_{i_{r}}) = \sum_{i_{1}, \dots, i_{r} = 1}^{n} k (0 S_{2} \dots S_{r}) = n a_{n r, r - 1} say . \end{matrix}

So if

a_{n 21}

is bounded away from 0 and

a_{n r, r - 1}

is bounded in n, we can apply Section 1 with

a_{r i} = 0

for

i \geq r \geq 1

.

Weighted means. Let

w_{n 1}, \dots, w_{n n}

be given numbers adding to n. For example, the standardized form of the Chernoff weight

i / n

, giving more weight to more recent observations, is

w_{n i} = 2 i / (n + 1) .

See [12]. An unbiased estimate of

μ

is the weighted sample mean,

{\hat{μ}}_{w} = n^{- 1} \sum_{i = 1}^{n} w_{n i} X_{i} .

\begin{matrix} For r \geq 2, n^{r} κ_{r} ({\hat{μ}}_{w}) = \sum_{i_{1}, \dots, i_{r} = 1}^{n} w_{n i_{1}} \dots w_{n i_{r}} k (i_{1}, \dots, i_{r}) = n a_{n r, r - 1} \end{matrix}

So if

a_{n 21}

is bounded away from 0 and

a_{n r, r - 1}

is bounded in n, we can apply Section 1 with

a_{r i} = 0

for

i \geq r \geq 1

. Missing values can also be treated by giving them a weight of 0.

3. Multivariate Edgeworth Expansions

Ordinary Bell polynomials. For a sequence from R, say

e = (e_{1}, e_{2}, \dots),

the partial ordinary Bell polynomial

{\tilde{B}}_{r s} = {\tilde{B}}_{r s} (e)

, is defined by the identity

\begin{matrix} S^{s} = \sum_{r = s}^{\infty} z^{r} {\tilde{B}}_{r s} (e) where S = \sum_{r = 1}^{\infty} z^{r} e_{r}, z \in R . \\ So, {\tilde{B}}_{r 0} = δ_{r 0}, {\tilde{B}}_{r 1} = e_{r}, {\tilde{B}}_{r r} = e_{1}^{r}, {\tilde{B}}_{21} = 2 e_{1} e_{2}, \end{matrix}

where

δ_{00} = 1, δ_{r 0} = 0

for

r \neq 0 .

They are tabled on p. 309 of [13]. The complete ordinary Bell polynomial,

{\tilde{B}}_{r} (e)

, is defined in terms of S by

\begin{matrix} e^{S} = \sum_{r = 0}^{\infty} z^{r} {\tilde{B}}_{r s} (e) . So {\tilde{B}}_{r} (e) = \sum_{s = 0}^{r} {\tilde{B}}_{r s} (e) / s! : \end{matrix}

(22)

\begin{matrix} {\tilde{B}}_{0} (e) = 1, {\tilde{B}}_{1} (e) = e_{1}, {\tilde{B}}_{2} (e) = e_{2} + e_{1}^{2} / 2, {\tilde{B}}_{3} (e) = e_{3} + e_{1} e_{2} + e_{1}^{3} / 6 . \end{matrix}

(23)

Now suppose that

\begin{matrix} e_{j} (s) = \sum_{r = 1}^{j + 2} {\bar{b}}_{r + j}^{1 - r} {\bar{s}}_{1} \dots {\bar{s}}_{r} / r! where {\bar{s}}_{k} = s_{i_{k}}, {\bar{b}}_{r + j}^{1 - r} = b_{r + j}^{i_{1} \dots i_{r}}, b_{2 d + 1}^{i_{1} \dots i_{r}} = 0, \end{matrix}

(24)

for some constants

b_{j}^{i_{1} \dots i_{r}}

. Then for

r \geq 1

, we can write

\begin{matrix} {\tilde{B}}_{r} (e (s)) = \sum_{k = 1}^{3 r} [{\bar{P}}_{r}^{1 - k} {\bar{s}}_{1} \dots {\bar{s}}_{k} : k - r even], \end{matrix}

(25)

where (24) and (25) use the tensor summation convention of implicitly summing

i_{1}, i_{2}, \dots

over their range

1, \dots, p

, and

{\bar{P}}_{r}^{1 - k}

is a polynomial in

{{\bar{b}}_{j}^{1 - r}}

. For

r = 1, 2, {\tilde{B}}_{r} (e (s))

is given by (25) in terms of

\begin{matrix} {\bar{P}}_{1}^{1} = {\bar{b}}_{2}^{1}, {\bar{P}}_{1}^{1 - 3} = {\bar{b}}_{4}^{1 - 3} / 6, {\bar{P}}_{2}^{12} = {\bar{b}}_{2}^{1} {\bar{b}}_{2}^{2} / 2 + {\bar{b}}_{4}^{12} / 2, \\ {\bar{P}}_{2}^{1 - 4} = {\bar{b}}_{6}^{1 - 4} / 24 + S {\bar{b}}_{2}^{1} {\bar{b}}_{4}^{2 - 4} / 6, {\bar{P}}_{2}^{1 - 6} = S {\bar{b}}_{4}^{1 - 3} {\bar{b}}_{4}^{4 - 6} / 72, \end{matrix}

and the operator

S

symmetrises over

i_{1}, \dots, i_{k}

.

Multivariate estimates. Suppose that

\hat{w}

is a standard estimate of

w \in R^{p}

with respect to n. That is,

E \hat{w} \to w

as

n \to \infty

, and for

r \geq 1

,

1 \leq i_{1}, \dots, i_{r} \leq p,

the rth order cumulants of

\hat{w}

can be expanded as

\begin{matrix} {\bar{k}}^{1 - r} = κ ({\hat{w}}^{i_{1}}, \dots, {\hat{w}}^{i_{r}}) \approx \sum_{j = r - 1}^{\infty} {\bar{k}}_{j}^{1 - r} n^{- j} where {\bar{k}}_{j}^{1 - r} = k_{j}^{i_{1} \dots i_{r}}, \end{matrix}

(26)

and the cumulant coefficients

{\bar{k}}_{j}^{1 - r} = k_{j}^{i_{1} \dots i_{r}}

may depend on n but are bounded as

n \to \infty

. So

{\bar{k}}_{0}^{1} = w^{i_{1}} .

Y_{n} = n^{1 / 2} (\hat{w} - w)

converges in law to the multivariate normal

N_{p} (0, V)

with

p \times p

covariance

V = (k_{1}^{i_{1} i_{2}}), p \times p

, and distribution and density

Φ_{V} (x)

and

ϕ_{V} (x)

. So V may depend on n, but we assume that

d e t (V)

is bounded away from 0 as

n \to \infty

. By [14], for

\hat{w}

non-lattice, the density and distribution of

Y_{n}

can be expanded as

\begin{matrix} p_{Y_{n}} (x) \approx \sum_{r = 0}^{\infty} n^{- r / 2} p_{r} (x), P r o b . (Y_{n} \leq x) \approx \sum_{r = 0}^{\infty} n^{- r / 2} P_{r} (x), x \in R^{p}, \end{matrix}

(27)

\begin{matrix} where p_{0} (x) = ϕ_{V} (x), P_{0} (x) = Φ_{V} (x), \end{matrix}

(28)

\begin{matrix} p_{r} (x) = {\tilde{B}}_{r} (e (- \partial / \partial x)) ϕ_{V} (x) for r \geq 1, \\ P_{r} (x) = {\tilde{B}}_{r} (e (- \partial / \partial x)) Φ_{V} (x) = \int_{- \infty}^{x} p_{r} (x) ϕ_{V} (x) d x for r \geq 1, \end{matrix}

(29)

and

b_{2 d}^{i_{1} \dots i_{r}} = k_{d}^{i_{1} \dots i_{r}}

in (24), so that for example,

\begin{matrix} e_{1} (s) = {\bar{k}}_{1}^{1} {\bar{s}}_{1} + {\bar{k}}_{2}^{1 - 3} {\bar{s}}_{1} {\bar{s}}_{2} {\bar{s}}_{3} / 6, e_{2} (s) = {\bar{k}}_{2}^{12} {\bar{s}}_{1} {\bar{s}}_{2} / 2 + {\bar{k}}_{3}^{1 - 4} {\bar{s}}_{1} \dots {\bar{s}}_{4} / 24 . \end{matrix}

This gives the Edgeworth expansion for the distribution of

Y_{n}

to

O (n^{- 3 / 2})

. See [15] for more terms. According to (25),

\begin{matrix} p_{r} (x) / ϕ_{V} (x) = \sum_{k = 1}^{3 r} [{\bar{P}}_{r}^{1 - k} {\bar{H}}^{1 - k} (x, V) : k - r even], \\ P_{r} (x) = \sum_{k = 1}^{3 r} [{\bar{P}}_{r}^{1 - k} {\bar{H}}_{*}^{1 - k} (x, V) : k - r even], where \\ {\bar{P}}_{1}^{1} = {\bar{k}}_{1}^{1}, {\bar{P}}_{1}^{1 - 3} = {\bar{k}}_{2}^{1 - 3} / 6, {\bar{P}}_{2}^{12} = {\bar{k}}_{1}^{1} {\bar{k}}_{1}^{2} + {\bar{k}}_{2}^{12} / 2, \\ {\bar{P}}_{2}^{1 - 4} = {\bar{k}}_{3}^{1 - 4} / 24 + S {\bar{k}}_{1}^{1} {\bar{k}}_{2}^{2 - 4} / 6, {\bar{P}}_{2}^{1 - 6} = S {\bar{k}}_{2}^{1 - 3} {\bar{k}}_{2}^{4 - 6} / 36, \end{matrix}

(30)

{\bar{H}}^{1 - k} = {\bar{H}}^{1 - k} (x, V)

is the multivariate Hermite polynomial,

\begin{matrix} {\bar{H}}^{1 - k} (x, V) = ϕ_{V} {(x)}^{- 1} (- {\bar{\partial}}_{1}) \dots (- {\bar{\partial}}_{k}) ϕ_{V} (x) where \partial_{i} = \partial / \partial x_{i}, {\bar{\partial}}_{k} = \partial_{i_{k}}, \\ and {\bar{H}}_{*}^{1 - k} = {\bar{H}}_{*}^{1 - k} (x, V) = (- {\bar{\partial}}_{1}) \dots (- {\bar{\partial}}_{k}) Φ_{V} (x) = \int_{- \infty}^{x} {\bar{H}}^{1 - k} (x, V) ϕ_{V} (x) d x . \end{matrix}

(

{\bar{H}}_{*}^{1 - k}

deserves a name. Let us call it the multivariate Hermite function.) So

P_{r} (x)

is just

p_{r} (x)

with

{\bar{H}}^{1 - k} ϕ_{V} (x)

replaced by

{\bar{H}}_{*}^{1 - k}

. For example, according to (27), the leading corrections to the Central Limit Theorem are given by

\begin{matrix} p_{1} (x) = e_{1} (- \partial / \partial x) ϕ_{V} (x) = ({\bar{k}}_{1}^{1} {\bar{H}}^{1} + {\bar{k}}_{2}^{1 - 3} {\bar{H}}^{1 - 3} / 6) ϕ_{V} (x), \\ P_{1} (x) = e_{1} (- \partial / \partial x) Φ_{V} (x) = {\bar{k}}_{1}^{1} {\bar{H}}_{*}^{1} + {\bar{k}}_{2}^{1 - 3} {\bar{H}}_{*}^{1 - 3} / 6, \end{matrix}

(31)

\begin{matrix} p_{2} (x) = ϕ_{V} (x) \sum_{k = 2, 4, 6} {\bar{P}}_{2}^{1 - k} {\bar{H}}^{1 - k} (x, V), \\ P_{2} (x) = \sum_{k = 2, 4, 6} {\bar{P}}_{2}^{1 - k} {\bar{H}}_{*}^{1 - k} (x, V), \end{matrix}

(32)

for

{\bar{P}}_{2}^{1 - k}

of (30). (So

h_{1 r} (x)

of Section 1 is a one-dimensional form of

p_{r} (x)

). By [5], for

i = \sqrt{- 1}

,

\begin{matrix} {\bar{H}}^{1 - k} (x, V) = E Π_{j = 1}^{k} ({\bar{y}}_{j} + i {\bar{Y}}_{j}) where {\bar{y}}_{j} = y_{i_{j}}, {\bar{Y}}_{j} = Y_{i_{j}}, y = V^{- 1} x, \\ Y \sim N_{p} (0, V^{- 1}) . So, H^{1} = y_{1}, {\bar{H}}^{1} = {\bar{y}}_{1}, H^{1 - 3} = y_{1} y_{2} y_{3} - \sum^{3} V^{12} y_{3}, \end{matrix}

where

V^{i_{1} i_{2}}

is the

(i_{1}, i_{2})

element of

V^{- 1},

and

\sum^{3} V^{12} y_{3} = V^{12} y_{3} + V^{13} y_{2} + V^{23} y_{1} .

The

H^{1 - k}

needed for

p_{2} (x)

are

\begin{matrix} H^{12} = y_{1} y_{2} - V^{12}, so that {\bar{H}}^{12} = {\bar{y}}_{1} {\bar{y}}_{2} - {\bar{V}}^{12} f o r {\bar{V}}^{12} = V^{i_{1} i_{2}}, \\ H^{1 - 4} = y_{1} \dots y_{4} - \sum^{6} V^{12} y_{3} y_{4} + \sum^{3} V^{12} V^{34}, \\ H^{1 - 6} = y_{1} \dots y_{6} - \sum^{15} V^{12} y_{3} \dots y_{6} + \sum^{45} V^{12} V^{34} y_{5} y_{6} - \sum^{45} V^{12} V^{34} V^{56} . \end{matrix}

According to [7], the log density can be expanded as

\begin{matrix} ln [p_{n} (x) / ϕ_{V} (x)] \approx \sum_{r = 1}^{\infty} n^{- r / 2} b_{r} (x) . \end{matrix}

(33)

\begin{matrix} So p_{n} (x) / ϕ_{V} (x) \approx \sum_{r = 0}^{\infty} n^{- r / 2} {\tilde{B}}_{r} (b (x)) where b = (b_{2}, b_{2}, \dots), \end{matrix}

(34)

for

{\tilde{B}}_{r} (e)

of (22). If

E \hat{w} = w

, then

{\bar{k}}_{1}^{1} = 0

, so that for

r = 1, 2, p_{r} (x), P_{r} (x)

are given by (31)–(32) in terms of

\begin{matrix} {\bar{P}}_{1}^{1} = 0, {\bar{P}}_{1}^{1 - 3} = {\bar{k}}_{2}^{1 - 3} / 6, {\bar{P}}_{2}^{12} = {\bar{k}}_{2}^{12} / 2, {\bar{P}}_{2}^{1 - 4} = {\bar{k}}_{3}^{1 - 4} / 24, {\bar{P}}_{2}^{1 - 6} = S {\bar{k}}_{2}^{1 - 3} {\bar{k}}_{2}^{4 - 6} / 36 . \end{matrix}

Note 2.

The term Hermite function is also used by [16] for the non-polynomial solution of the 2nd order differential equation for

H_{n} (x)

of Section 1, given by [17] in terms of confluent hypergeometric functions.

4. Application to Stationary Vector $\bar{X}$

Suppose that

\dots, X_{- 1}, X_{0}, X_{1}, \dots

lie in

R^{p}

and are stationary with mean

μ = E X_{0}

and finite moments. Suppose that

\bar{X}

is the mean of a sample

X_{1}, \dots, X_{n}

. So

E \bar{X} = μ .

For

j = 1, \dots, p

, denote the jth component of

X_{i}

and

\bar{X}

by

X_{i}^{j}

and

{\bar{X}}^{j},

and the cross-cumulants of

{X_{i}}

, by

\begin{matrix} k (\binom{j_{1} \dots j_{r}}{i_{1} \dots i_{r}}) = κ (X_{i_{1}}^{j_{1}}, \dots, X_{i_{r}}^{j_{r}}) for 1 \leq j_{1}, \dots, j_{r} \leq p . \end{matrix}

(35)

Given a sequence of integers

i_{1}, \dots, i_{r}

, define

i_{0}, I_{k}

as in (14), and again transform from

i_{k}

to

T_{k} = i_{k} - i_{1}

for

k = 2, \dots, r

. (15) becomes

\begin{matrix} k (\binom{j_{1} \dots j_{r}}{i_{1} \dots i_{r}}) = k (\binom{j_{1} \dots j_{r}}{I_{1} \dots I_{r}}) . \end{matrix}

(36)

In general,

k (\binom{j_{1} j_{2}}{0 I}) \neq k (\binom{j_{1} j_{2}}{0 - I})

. For

r \geq 2

and

δ_{r} (T)

of (17), by (16),

\begin{matrix} n^{r} κ ({\bar{X}}^{j_{1}}, \dots, {\bar{X}}^{j_{r}}) = \sum_{i_{1}, \dots, i_{r} = 1}^{n} k (\binom{j_{1} \dots j_{r}}{i_{1} \dots i_{r}}) \\ = \sum_{| T_{k} | < n, k = 2, \dots, r} [n - δ_{r} (T)] k (\binom{j_{1} \dots j_{r}}{0 T_{2} \dots T_{r}}) . \end{matrix}

(37)

So

f o r r \geq 2, κ ({\bar{X}}^{j_{1}}, \dots, {\bar{X}}^{j_{r}}) = \sum_{e = r - 1}^{r} k_{n e}^{j_{1} \dots j_{r}} n^{- e}

where

\begin{matrix} k_{n 1}^{j_{1} j_{2}} = \sum_{| T | < n} k (\binom{j_{1} j_{2}}{0 T}), k_{n, r - 1}^{j_{1} \dots j_{r}} = \sum_{| T_{i} | < n, i = 2, \dots, r} k (\binom{j_{1} \dots j_{r}}{0 T_{2} \dots T_{r}}), \end{matrix}

(38)

\begin{matrix} and k_{n r}^{j_{1} \dots j_{r}} = - \sum_{| T_{i} | < n, i = 2, \dots, r} δ_{r} (T) k (\binom{j_{1} \dots j_{r}}{0 T_{2} \dots T_{r}}) . \end{matrix}

(39)

This proves that a two-term version of (26) holds. So the expansions (27)–(29) and (33) hold for the density and distribution of

n^{1 / 2} (\bar{X} - μ)

with

V = (k_{n 1}^{j_{1} j_{2}})

of (38),

k_{i}^{j_{1} \dots j_{r}} = k_{n i}^{j_{1} \dots j_{r}}

of (38), (39), and

{\bar{k}}_{1}^{1} = 0

. If the cross-cumulants

k (\binom{j_{1} \dots j_{r}}{0 T_{2} \dots T_{r}})

decrease exponentially in

T_{2}

, then for

r \geq 2

,

\begin{matrix} k_{n, r - 1}^{j_{1} \dots j_{r}} = k_{r - 1}^{j_{1} \dots j_{r}} + O (e^{- n λ_{r}}), and k_{n r}^{j_{1} \dots j_{r}} = k_{r}^{j_{1} \dots j_{r}} + O (e^{- n λ_{r}}) where λ_{r} > 0, \\ k_{1}^{j_{1} j_{2}} = \sum_{| T | < \infty} k (\binom{j_{1} j_{2}}{0 T}), k_{r - 1}^{j_{1} \dots j_{r}} = \sum_{| T_{i} | < \infty, i = 2, \dots, r} k (\binom{j_{1} \dots j_{r}}{0 T_{2} \dots T_{r}}), \end{matrix}

(40)

\begin{matrix} and k_{r}^{j_{1} \dots j_{r}} = - \sum_{| T_{i} | < \infty, i = 2, \dots, r} δ_{r} (T) k (\binom{j_{1} \dots j_{r}}{0 T_{2} \dots T_{r}}), \end{matrix}

(41)

so that for

X_{0}

non-lattice, the expansions (27)–(29) and (33) hold for the density and distribution of

Y_{n} = n^{1 / 2} (\bar{X} - μ)

with

V = (k_{1}^{j_{1} j_{2}})

of (40), and

{\bar{k}}_{1}^{1} = 0

.

These results extend to missing data and weighted means as in Section 2.

5. Discussion and Conclusions

References [1,2] showed that their quantile expansion, updated here as (4), can give great accuracy, in fact to many decimal places. However, for a given n and a large enough x, or for a given x and a small enough n, the expansions (2)–(4) will clearly diverge.

We have shown that the sample mean from a stationary process is a standard estimate of the mean of the process, so that we can apply the Edgeworth–Cornish–Fisher expansions given in Section 1 and Section 3 for any standard estimate. We also showed that remarkably, its cumulant expansion has only two terms. This simplifies the forms for

h_{k r} (x), f_{r} (x), g_{r} (x),

and

r \geq 3

of Section 1, and for

p_{r} (x), P_{r} (x),

and

r \geq 3

of Section 3. And we showed that these results extend to missing data and to weighted means.

As

E \bar{X} = μ, a_{11} = 0

. So the expansions of Section 1 to

O (n^{- 3 / 2})

require

n, x

and

a_{32}, a_{22}, a_{43}

of (18) and (19) as input, or alternatively,

a_{32}, a_{22}, a_{43}

of (20) and (21). That is, one needs

n, x

and the first three cross-cumulants of (13),

k (0 T_{2} \dots T_{r})

, for

r = 2, 3, 4 .

Similarly, for multivariate series, the Edgeworth expansions of (27) to

O (n^{- 3 / 2})

for

Y_{n} = n^{1 / 2} (\bar{X} - μ)

require

p_{r} (x), P_{r} (x), r = 1, 2

of (31)–(32) with

{\bar{k}}_{1}^{1} = 0

. These require

V = (k_{1}^{j_{1} j_{2}}), {\bar{k}}_{1}^{1} = 0, {\bar{k}}_{2}^{1 - 3}, {\bar{k}}_{3}^{1 - 4}

of (38), (39) as inputs, or their limits in (40), (41). That is, one needs

n, x

and the cross-cumulants,

k (\binom{j_{1} \dots j_{r}}{0 T_{2} \dots T_{r}})

of (35) for

r = 2, 3, 4

.

In general, non-parametric methods are to be preferred to parametric methods, as a wrong parametric model will give wrong results, even asymptotically. However, most econometric models are parametric. Reference [10] considered the case of the stationary linear process

X_{i} = \sum_{j = 0}^{\infty} ρ_{j} e_{i - j},

where

ρ_{j}, j \geq 0,

are given constants, and

{e_{i}}

are i.i.d. random variables with finite cumulants

τ_{r}, r \geq 1 .

For this very general class of semi-parametric models,

k (i_{1}, \dots, i_{r}) = α (i_{1}, \dots, i_{r}) τ_{r} where α (i_{1}, \dots, i_{r}) = \sum_{j = 0}^{\infty} ρ_{j + i_{1}} \dots ρ_{j + i_{r}} .

Example 1.

For the AR(1) model,

X_{i} - φ X_{i - 1} = e_{i}

with

| φ | < 1

,

ρ_{j} = φ^{j}, α (i_{1}, \dots, i_{r}) = φ^{I_{1} + \dots + I_{r}}

for

I_{k}

of (14). So the

k (i_{1}, \dots, i_{r})

we need are given by φ and

τ_{r}, r = 2, 3, 4 .

For related work, see [18,19].

For a particular case like this, one can write a general computational code to plot graphs or make tables for various scenarios.

The theory of linear processes is well developed. See, for example, [20,21,22,23].

6. Four Examples of Future Directions

Traditionally, analysis of time series relies on parametric models, such as the autoregressive model of Example 1, or much more generally, ARIMA models, or the use of spectral theory that moves considerations from the time domain to the frequency domain. But as noted, a non-parametric approach is much to be preferred as the wrong parametric model gives incorrect results. A non-parametric estimate of the asymptotic variance

a_{21} = \sum_{| T | < \infty} k (0 T)

of (20), is

{\hat{a}}_{21} = \sum_{| T | < N_{n}} \hat{k} (0 T)

where

\hat{k} (0 T)

is the empirical estimate of

k (0 T)

and

N_{n} \to \infty .

This should give one- and two-sided confidence intervals for

μ

of error

O (n^{- 1 / 2})

and

O (n^{- 1})

. It should also be possible to reduce the one-sided error to

O (n^{- 1})

or

O (n^{- 3 / 2})

with (more complicated) confidence intervals, as performed for the i.i.d. case in [4].

A second area where an extension should be possible is to sample central moments and smooth functions of them, as achieved for i.i.d. observations for non-parametric and parametric statistics in [3,4,15] and for linear processes in [10]. Again, it should be possible to develop confidence intervals.

A third, more difficult extension would be a small sample version, using the method of [14]. This would remove the problem of series divergence.

A fourth extension would be to a kernel estimate of the marginal density of

X_{0}

. The authors of [24] gave Cornish–Fisher expansions for these based on a random sample. The expansions are in powers of

{(n h)}^{- 1 / 2}

, not

n^{- 1 / 2}

, where

h = c n^{- α}

and

α > 0

can be made as small as desired by choice of kernel.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The author C.S.W. was employed by Industrial Research Ltd., now called Callaghan Innovation. The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Cornish, E.A.; Fisher, R.A. Moments and cumulants in the specification of distributions. Rev. de l’Inst. Int. de Statist. 1937, 5, 307–322, Reproduced in The Collected Papers of R.A. Fisher, 4. [Google Scholar] [CrossRef]
Fisher, R.A.; Cornish, E.A. The percentile points of distributions having known cumulants. Technometrics 1960, 2, 209–225. [Google Scholar] [CrossRef]
Withers, C.S. Asymptotic expansions for distributions and quantiles with power series cumulants. J. R. Statist. Soc. B 1984, 46, 389–396. [Google Scholar] [CrossRef]
Withers, C.S. Expansions for the distribution and quantiles of a regular functional of the empirical distribution with applications to non-parametric confidence intervals. Annals Statist. 1983, 11, 577–587. [Google Scholar] [CrossRef]
Withers, C.S. A simple expression for the multivariate Hermite polynomials. Stat. Prob. Lett. 2020, 47, 165–169. [Google Scholar] [CrossRef]
Withers, C.S.; Nadarajah, S.N. Expansions for log densities of asymptotically normal estimates. Stat. Pap. 2010, 51, 247–257. [Google Scholar] [CrossRef]
Withers, C.S.; Nadarajah, S. Expansions for log densities of multivariate estimates. Methodol. Comput. Appl. Probab. 2016, 18, 911–920. [Google Scholar] [CrossRef]
Withers, C.S.; Nadarajah, S.N. Charlier and Edgeworth expansions via Bell polynomials. Probab. Math. Stat. 2009, 29, 271–280. [Google Scholar]
Withers, C.S. The distribution and quantiles of a function of parameter estimates. Ann. Inst. Statist. Math. A 1982, 34, 55–68. [Google Scholar] [CrossRef]
Withers, C.S.; Nadarajah, S. Cornish-Fisher expansions for sample autocovariances and other functions of sample moments of linear processes. Braz. J. Probab. Stat. 2012, 26, 149–166. [Google Scholar] [CrossRef]
Ibragimov, I.A.; Linnik, Y.V. Independent and Stationary Sequences of Random Variables; Wolters-Noordhoff: Groningen, The Netherlands, 1971. [Google Scholar]
Chernoff, H.; Zacks, S. Estimating the current mean of a normal distribution which is subjected to changes in time. Ann. Math. Statist. 1964, 35, 999–1018. [Google Scholar] [CrossRef]
Comtet, L. Advanced Combinatorics; Reidel: Dordrecht, The Netherlands, 1974. [Google Scholar]
Withers, C.S.; Nadarajah, S. Tilted Edgeworth expansions for asymptotically normal vectors. Ann. Inst. Stat. Math. 2010, 62, 1113–1142. [Google Scholar] [CrossRef]
Withers, C.S. 5th-Order multivariate Edgeworth expansions for parametric estimates. Mathematics 2024, 12, 905. [Google Scholar] [CrossRef]
Arfken, G.B.; Weber, H.J.; Harris, F.E. Mathematical Methods for Physicists, 7th ed.; Academic Press: Cambridge, MA, USA, 2012. [Google Scholar] [CrossRef]
Abramowitz, M.; Stegun, I.A. Handbook of Mathematical Functions; U.S. Department of Commerce, National Bureau of Standards, Applied Mathematics Series; Dover Publications: Mineola, NY, USA, 1964; p. 55. [Google Scholar]
Phillips, P.C.B. A general theorem in the theory of asymptotic expansions as an approximation to the finite sample distributions of econometric estimators. Econometrika 1977, 45, 1517–1534. [Google Scholar] [CrossRef]
Phillips, P.C.B. Asymptotic expansions in nonstationary vector autoregressions. Econom. Theory 1987, 3, 45–68. [Google Scholar] [CrossRef]
Hannan, E.J. Time Series Analysis; Wiley: New York, NY, USA, 1962. [Google Scholar]
Hannan, E.J. Multiple Time Series; Wiley: New York, NY, USA, 1970. [Google Scholar]
Kendall, M.G.; Ord, K. Time Series, 3rd ed.; Griffin: London, UK, 1990. [Google Scholar]
Taniguchi, M.; Kakizawa, Y. Asymptotic Theory of Statistical Inference for Time Series; Springer: New York, NY, USA, 2000. [Google Scholar]
Withers, C.S.; Nadarajah, S.N. Edgeworth and Cornish Fisher expansions and confidence intervals for the distribution, density and quantiles of kernel density estimates, and confidence intervals for densities. Statistica 2008, 68, 281–301. [Google Scholar]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

The Distribution and Quantiles of the Sample Mean from a Stationary Process

Abstract

1. Introduction and Summary

2. The Cumulants of a Stationary Sample Mean

3. Multivariate Edgeworth Expansions

4. Application to Stationary Vector $\bar{X}$

5. Discussion and Conclusions

6. Four Examples of Future Directions

Funding

Data Availability Statement

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics

The Distribution and Quantiles of the Sample Mean from a Stationary Process

Abstract

1. Introduction and Summary

2. The Cumulants of a Stationary Sample Mean

3. Multivariate Edgeworth Expansions

4. Application to Stationary Vector X ¯

5. Discussion and Conclusions

6. Four Examples of Future Directions

Funding

Data Availability Statement

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics

4. Application to Stationary Vector $\bar{X}$