Article

Edgeworth Coefficients for Standard Multivariate Estimates

by
Christopher Stroude Withers
Callaghan Innovation (Formerly Industrial Research Ltd.), 101 Allington Road, Wellington 6012, New Zealand
Axioms 2025, 14(8), 632; https://doi.org/10.3390/axioms14080632
Submission received: 26 June 2025 / Revised: 30 July 2025 / Accepted: 5 August 2025 / Published: 13 August 2025
(This article belongs to the Special Issue New Perspectives in Mathematical Statistics, 2nd Edition)

Abstract

I give for the first time explicit formulas for the coefficients needed for the fourth-order Edgeworth expansions of a multivariate standard estimate. I call these the Edgeworth coefficients. They are Bell polynomials in the cumulant coefficients. Standard estimates include most estimates of interest, including smooth functions of sample means and other empirical estimates. I also give applications to ellipsoidal and hyperrectangular sets.

1. Introduction and Summary

Suppose that $\hat w$ is a standard estimate, as defined in Section 2, of an unknown parameter $w \in R^q$ of a statistical model, based on a sample of size $n$. For example, $\hat w$ may be a smooth function of a sample mean, or a smooth functional of an empirical distribution. A smooth function of a standard estimate is also a standard estimate: see [1]. Section 2 summarises the multivariate Edgeworth expansions of Withers and Nadarajah (2010) [2] for the distribution and density of $X_n = n^{1/2}(\hat w - w)$ in powers of $n^{-1/2}$ about the multivariate normal, in terms of the Edgeworth coefficients, the $P_r$-coefficients of (18). For $r \geq 1$, these $P_r$ are needed for the $(r+1)$st term of the multivariate Edgeworth expansions. They are Bell polynomials in the cumulant coefficients of (9). They are given for $r = 1$ by (19), and for $r = 2, 3$ by (19)–(21) in terms of the symmetrizing operator $S$, and explicitly in Appendix A. Section 3 derives expansions on ellipsoidal and hyperrectangular sets.
When $q = 2$, Section 4 simplifies these Edgeworth coefficients using an alternative notation. Examples include the distribution and density of a sample mean and of bivariate entangled gamma random variables.
Section 5 and Section 6 give conclusions and discussion and suggest some future directions. Appendix B explicitly gives the bivariate Hermite polynomials needed for bivariate Edgeworth expansions to $O(n^{-2})$. The results of [3,4] could be considered heuristic. However, Section 5 of [5] showed that the Cornish–Fisher expansions (4) are valid under the usual conditions for the validity of Edgeworth expansions. Their Section 5 appears to show that this is also true for the multivariate case. See also [6]. Appendix C gives for the first time a theorem for the validity of the Edgeworth expansions given here. Appendix D gives some corrigenda to the references. When $q = 1$, see also [7].
Univariate estimates. Suppose that $\hat w$ is a standard estimate of $w \in R$ with respect to $n$ (typically the sample size). That is, $\hat w$ is non-lattice, $E\hat w \to w$ as $n \to \infty$, and its $r$th cumulant can be expanded as

$$\kappa_r(\hat w) \approx \sum_{j=r-1}^{\infty} n^{-j}\,a_{rj} \quad \text{for } r \geq 1, \quad (1)$$

where the cumulant coefficients $a_{rj}$ may depend on $n$ but are bounded as $n \to \infty$, and $a_{21}$ is bounded away from 0. Here and below, $\approx$ indicates an asymptotic expansion that need not converge. Thus, (1) holds in the sense that

$$\kappa_r(\hat w) = \sum_{j=r-1}^{I-1} n^{-j}\,a_{rj} + O(n^{-I}) \quad \text{for } I \geq r \geq 1,$$
where $y_n = O(x_n)$ means that $y_n/x_n$ is bounded in $n$. Ref. [4] replaced the artificial assumptions of [3,8] by (1) and gave the distribution, density and quantiles of

$$U_n = (n/a_{21})^{1/2}(\hat w - w)$$

as asymptotic expansions in powers of $n^{-1/2}$:

$$P_n(u) = \mathrm{Prob.}(U_n \leq u) \approx \Phi(u) - \phi(u)\sum_{r=1}^{\infty} n^{-r/2}\,h_r(u), \quad (2)$$

$$p_n(u) = dP_n(u)/du \approx \phi(u)\Big[1 + \sum_{r=1}^{\infty} n^{-r/2}\,\bar h_r(u)\Big], \quad (3)$$

$$\Phi^{-1}(P_n(u)) \approx u - \sum_{r=1}^{\infty} n^{-r/2}\,f_r(u), \quad P_n^{-1}(\Phi(u)) \approx u + \sum_{r=1}^{\infty} n^{-r/2}\,g_r(u), \quad (4)$$

where $\mathrm{Prob.}(A)$ is the probability that $A$ is true,

$$\Phi(u) = \mathrm{Prob.}(N \leq u) = \int_{-\infty}^{u}\phi(u)\,du, \quad \phi(u) = (2\pi)^{-1/2}e^{-u^2/2}, \quad N \sim N(0,1)$$

is a unit normal random variable with even moments $E\,N^{2k} = 1\cdot3\cdots(2k-1)$, and $h_r(u)$, $\bar h_r(u)$, $f_r(u)$ and $g_r(u)$ are polynomials in $u$ and also in the standardized cumulant coefficients $A_{ri} = a_{ri}/a_{21}^{r/2}$. For example,

$$h_1(u) = f_1(u) = g_1(u) = A_{11} + A_{32}H_2/6, \quad \bar h_1(u) = A_{11}H_1 + A_{32}H_3/6,$$
$$h_2(u) = (A_{11}^2 + A_{22})H_1/2 + (A_{11}A_{32} + A_{43}/4)H_3/6 + A_{32}^2H_5/72,$$
where $H_k$ is the $k$th Hermite polynomial. By [9], for $I = \sqrt{-1}$,

$$H_k = H_k(u) = \phi(u)^{-1}(-d/du)^k\phi(u) = E(u + IN)^k \ \text{for } k \geq 0: \quad (6)$$
$$H_0 = 1, \quad H_1 = u, \quad H_2 = u^2 - 1, \quad H_3 = u^3 - 3u, \quad H_4 = u^4 - 6u^2 + 3.$$

(A2) gives $H_k$ for $k \leq 9$.
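As a quick numerical companion to (6): the $H_k$ satisfy the standard three-term recurrence $H_{k+1}(u) = u\,H_k(u) - k\,H_{k-1}(u)$, a known property of the probabilists' Hermite polynomials (not stated above). A minimal sketch; the function name is illustrative:

```python
def hermite(k: int, u: float) -> float:
    """H_k(u) of (6) via the recurrence H_{k+1} = u*H_k - k*H_{k-1},
    seeded with H_0 = 1 and H_1 = u."""
    h_prev, h = 1.0, u
    if k == 0:
        return h_prev
    for j in range(1, k):      # build H_2, ..., H_k
        h_prev, h = h, u * h - j * h_prev
    return h
```

For example, `hermite(3, u)` reproduces $u^3 - 3u$, and `hermite(4, u)` reproduces $u^4 - 6u^2 + 3$ from the list above.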
Since $h_r(u)$ is even/odd for $r$ odd/even,

$$\mathrm{Prob.}(|U_n| \leq u) \approx 2\Phi(u) - 1 - 2\phi(u)\sum_{r=1}^{\infty} n^{-r}\,h_{2r}(u) \quad \text{for } u > 0.$$

From (4), it follows that

$$U_n \approx N + \sum_{r=1}^{\infty} n^{-r/2}\,g_r(N) \quad \text{where } N \sim N(0,1). \quad (8)$$
For a discussion of when to truncate (2)–(4), see [10].
Note 1.
Edgeworth's expansion [11] was for $\hat w$, the mean of $n$ independent identically distributed random variables on $R$ from a distribution with $r$th cumulant $\kappa_r$. Thus, (1) holds with $a_{r,r-1} = \kappa_r$ and all other $a_{ri} = 0$. An explicit formula for its general term was given in [12].
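To illustrate Note 1 numerically: for the mean of $n$ standard exponentials (my own illustrative choice), $a_{21} = \kappa_2 = 1$, $A_{11} = 0$ and $A_{32} = \kappa_3 = 2$, so the one-term correction in (2) is $h_1(u) = 2H_2(u)/6$, and the exact distribution of $U_n$ follows from the Erlang distribution of the sum. A sketch, checking that the corrected CDF beats the plain normal approximation:

```python
import math

def phi(u):  # standard normal density
    return math.exp(-u * u / 2) / math.sqrt(2 * math.pi)

def Phi(u):  # standard normal CDF
    return 0.5 * (1 + math.erf(u / math.sqrt(2)))

def erlang_cdf(t, n):
    """P(Gamma(n, 1) <= t) for integer n: 1 - e^{-t} sum_{k<n} t^k/k!."""
    term, s = 1.0, 1.0
    for k in range(1, n):
        term *= t / k
        s += term
    return 1.0 - math.exp(-t) * s

n, u = 20, 2.0
exact = erlang_cdf(n + math.sqrt(n) * u, n)       # P(U_n <= u) exactly
clt = Phi(u)                                       # zeroth-order term
h1 = 2.0 * (u * u - 1) / 6                         # A_32 * H_2(u) / 6
edgeworth = Phi(u) - phi(u) * h1 / math.sqrt(n)    # (2) truncated at r = 1
```

With $n = 20$ and $u = 2$, the one-term Edgeworth correction reduces the normal-approximation error by roughly an order of magnitude.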
For examples of the many applications of Cornish–Fisher expansions and the extensions of [13], see [14,15,16]. Applications in finance include [17,18]. For an application to Rayleigh fading amplitudes, see [19]. Ref. [20] used them for GPS accuracy. Refs. [21,22,23] successfully used them for system reliability, even though binomial random variables fall on a lattice. Ref. [24] used them for cosmology. Ref. [25] used them for optimal electric power flow.
Now suppose that $\hat w^*$ is another standard estimate of $w$ with the same asymptotic variance $a_{21}/n$. Denote its standardized cumulant coefficients by $A_{ri}^*$. Suppose that

$$U_n^* = (n/a_{21})^{1/2}(\hat w^* - w) \ \text{has distribution } \Phi_n(u) = \int_{-\infty}^{u}\phi_n(u)\,du \to \Phi(u) \ \text{as } n \to \infty.$$

Then, one can expand $P_n(u)$ about $\Phi_n(u)$ rather than $\Phi(u)$. The above expressions for $P_n(u)$, $h_k(u)$ become

$$P_n(u) = \mathrm{Prob.}(U_n \leq u) \approx \Phi_n(u) - \phi_n(u)\sum_{r=1}^{\infty} n^{-r/2}\,h_{rn}(u), \ \text{where}$$
$$h_{1n}(u) = A_{110} + A_{320}H_{2n}/6, \quad h_{2n}(u) = (A_{110}^2 + A_{220})H_{1n}/2 + (A_{110}A_{320} + A_{430}/4)H_{3n}/6 + A_{320}^2H_{5n}/72,$$
$$A_{rj0} = A_{rj} - A_{rj}^*, \quad \text{and} \quad H_{kn} = H_{kn}(u) = \phi_n(u)^{-1}(-d/du)^k\phi_n(u).$$

Therefore, if, for example, we choose $\hat w^*$ so that $A_{110} = A_{320} = 0$, then

$$h_{1n}(u) = 0, \quad h_{2n}(u) = A_{220}H_{1n}/2 + A_{430}H_{3n}/24,$$

and the number of terms in the analogues of $h_r$, $f_r$, $g_r$ is greatly reduced. The disadvantage is that $H_{kn}(u)$ is more complicated than $H_k(u)$. See [19]. For expansions about a matching Student's, gamma, or F distribution, see [14,15,26].
Regularity conditions. The expansions of [27] and subsequent extensions by [1,3,4] all build on the fact that, for $f(x)$ a density on $R$ with finite cumulants, $\exp\{A(-d/dx)^r/r!\}\,f(x)$ is a density with the same cumulants, except that its $r$th cumulant has increased by $A$; similarly for a density on $R^q$. For a sample mean, many authors have given precise conditions for the Edgeworth expansions. See, for example, p. 229 of [28] and (19.17), (20.48), (23.3) of [29]. The latter gives corrections for a lattice sample mean. Ref. [30] and its references also give conditions for their validity.

2. Multivariate Edgeworth Expansions

Suppose that $\hat w$ is a standard estimate of $w \in R^q$ with respect to $n$. That is, $\hat w$ is non-lattice, $E\hat w \to w$ as $n \to \infty$, and for $r \geq 1$, $1 \leq i_1, \ldots, i_r \leq q$, the $r$th order cumulants of $\hat w$ can be expanded as

$$\bar k^{1\cdots r} = k^{i_1\cdots i_r} = \kappa(\hat w^{i_1}, \ldots, \hat w^{i_r}) \approx \sum_{d=r-1}^{\infty} n^{-d}\,\bar k_d^{1\cdots r}, \quad \text{where } \bar k_d^{1\cdots r} = k_d^{i_1\cdots i_r}, \quad (9)$$

and the cumulant coefficients $\bar k_d^{1\cdots r}$ may depend on $n$ but are bounded as $n \to \infty$. Thus, the bar replaces each $i_k$ by $k$. The use of $i_k$ is reserved for this purpose. For example, $\bar k_0^1 = w^{i_1}$ and $\bar k_1^{12} = k_1^{i_1 i_2}$. I use this bar notation repeatedly to avoid double subscripts $i_k$. Therefore, I use $I$, not $i$, for $\sqrt{-1}$ below.

$$\text{As } n \to \infty, \quad X_n = n^{1/2}(\hat w - w) \xrightarrow{L} X \sim N_q(0, V) \quad \text{for } V = (\bar k_1^{12}), \ q \times q, \quad (10)$$
the multivariate normal on $R^q$, with density and distribution

$$\phi_V(x) = (2\pi)^{-q/2}(\det V)^{-1/2}\exp(-x'V^{-1}x/2), \quad \Phi_V(x) = \int_{-\infty}^{x}\phi_V(x)\,dx.$$

$V$ may depend on $n$, but I assume that $\det V$ is bounded away from 0.

Set $Y = V^{-1}X \sim N_q(0, V^{-1})$, $\bar Y^j = Y^{i_j}$, and $\bar\mu^{1\cdots k} = E\,\bar Y^1\cdots\bar Y^k$ of (A3). So $E\,\bar Y^1\bar Y^2 = \bar V^{12}$, where $\bar V^{12} = V^{i_1 i_2}$ is the $(i_1, i_2)$ element of $V^{-1}$. (12)
Set

$$\bar b_{2d}^{1\cdots k} = \bar k_d^{1\cdots k}, \quad \bar b_{2d+1}^{1\cdots k} = 0, \quad \bar t_k = t_{i_k}, \quad e_r(t) = \sum_{k=1}^{r+2}\bar b_{k+r}^{1\cdots k}\,\bar t_1\cdots\bar t_k/k!, \quad t \in R^q. \quad (13)$$

So,

$$e_1(t) = \bar k_1^1\,\bar t_1 + \bar k_2^{1\cdots 3}\,\bar t_1\bar t_2\bar t_3/3!, \quad e_2(t) = \bar k_2^{12}\,\bar t_1\bar t_2/2 + \bar k_3^{1\cdots 4}\,\bar t_1\cdots\bar t_4/4!,$$
$$e_3(t) = \bar k_2^1\,\bar t_1 + \bar k_3^{1\cdots 3}\,\bar t_1\cdots\bar t_3/3! + \bar k_4^{1\cdots 5}\,\bar t_1\cdots\bar t_5/5!,$$

where here and below, I use the tensor summation convention of implicitly summing each $i_k$ over its range $1, \ldots, q$. For example,

$$\bar b_2^1\,\bar t_1 = \bar k_1^1\,\bar t_1 = \sum_{i_1=1}^{q} k_1^{i_1}\,t_{i_1} \quad \text{and} \quad \bar b_2^{12}\,\bar t_1\bar t_2 = \bar k_1^{12}\,\bar t_1\bar t_2 = \sum_{i_1, i_2=1}^{q} k_1^{i_1 i_2}\,t_{i_1}t_{i_2}.$$
Ordinary Bell polynomials. For a sequence $e = (e_1, e_2, \ldots)$ from $R$, the partial ordinary Bell polynomial $\tilde B_{rs} = \tilde B_{rs}(e)$ is defined by the identity

$$S^s = \sum_{r=s}^{\infty} z^r\,\tilde B_{rs}(e) \ \text{for } s = 0, 1, 2, \ldots \text{ and } z \in R, \ \text{where } S = \sum_{r=1}^{\infty} z^r e_r. \quad (14)$$

So,

$$\tilde B_{r0} = \delta_{r0}, \quad \tilde B_{r1} = e_r, \quad \tilde B_{rr} = e_1^r, \quad \tilde B_{32} = 2e_1e_2, \quad (15)$$

where $\delta_{00} = 1$ and $\delta_{r0} = 0$ for $r \neq 0$. They are tabled on p. 309 of [31]. (The partial exponential Bell polynomials are not used in this paper.) The complete ordinary Bell polynomial, $\tilde B_r(e)$, is defined in terms of $S$ of (14) by

$$e^S = \sum_{r=0}^{\infty} z^r\,\tilde B_r(e). \ \text{So } \tilde B_0(e) = 1, \text{ and for } r \geq 1, \ \tilde B_r(e) = \sum_{s=1}^{r}\tilde B_{rs}(e)/s!: \quad (16)$$

$$\tilde B_1(e) = e_1, \quad \tilde B_2(e) = e_2 + e_1^2/2, \quad \tilde B_3(e) = e_3 + e_1e_2 + e_1^3/6. \quad (17)$$
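The complete ordinary Bell polynomials of (16) and (17) can be generated without tabling them: differentiating $e^S$ with respect to $z$ gives the convolution recurrence $r\tilde B_r = \sum_{j=1}^{r} j\,e_j\,\tilde B_{r-j}$ (a consequence of (16), not stated above). A small sketch with exact rational arithmetic; names are illustrative:

```python
from fractions import Fraction

def complete_ordinary_bell(e, r):
    """B~_r(e) for e = [e_1, e_2, ...] via the recurrence
    r * B_r = sum_{j=1}^r j * e_j * B_{r-j}, which follows from
    differentiating exp(S) with S = sum_j z^j e_j as in (16)."""
    B = [Fraction(1)]                      # B~_0 = 1
    for m in range(1, r + 1):
        s = sum(Fraction(j) * e[j - 1] * B[m - j] for j in range(1, m + 1))
        B.append(s / m)
    return B[r]
```

For numeric $e_1 = 2$, $e_2 = 3$, $e_3 = 5$, this reproduces (17): $\tilde B_2 = e_2 + e_1^2/2 = 5$ and $\tilde B_3 = e_3 + e_1e_2 + e_1^3/6 = 37/3$.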
$$\text{Then, for } r \geq 1, \quad \tilde B_r(e(t)) = \sum_{k=1}^{3r}\,[\,\bar P_r^{1\cdots k}\,\bar t_1\cdots\bar t_k : k - r \text{ even}\,], \quad (18)$$

where the $r$th Edgeworth coefficient, $\bar P_r^{1\cdots k} = P_r^{i_1\cdots i_k}$, is a function of the $\bar k_d^{1\cdots r}$. One could use unsymmetrized $\bar P_r^{1\cdots k}$. For example, by (17), $\tilde B_2(e(t))$ needs the cross term in $e_1(t)^2/2$, $\bar k_1^4\,\bar k_2^{1\cdots 3}\,\bar t_1\cdots\bar t_4/6$. Thus, we could use $\bar k_1^4\,\bar k_2^{1\cdots 3}/6$ in $\bar P_2^{1\cdots 4}$. But as (68) below illustrates, there is a big advantage in making $\bar P_r^{1\cdots k}$ symmetric in $i_1, \ldots, i_k$ using the operator $S$ that symmetrizes over $i_1, \ldots, i_k$. Therefore, the $\bar P_r^{1\cdots k}$ that we need for $r = 1, 2, 3$ are
$$\bar P_1^1 = \bar k_1^1, \quad \bar P_1^{1\cdots 3} = \bar k_2^{1\cdots 3}/3!, \quad \bar P_2^{12} = \bar k_2^{12}/2 + \bar k_1^1\bar k_1^2/2, \quad (19)$$

$$\bar P_2^{1\cdots 4} = \bar k_3^{1\cdots 4}/4! + S\,\bar k_1^4\bar k_2^{1\cdots 3}/6, \quad \bar P_2^{1\cdots 6} = S\,\bar k_2^{1\cdots 3}\bar k_2^{4\cdots 6}/72, \quad (20)$$

$$\bar P_3^1 = \bar k_2^1, \quad \bar P_3^{1\cdots 3} = \bar k_3^{1\cdots 3}/6 + S\,\bar k_2^{12}\bar k_1^3/2 + \bar k_1^1\bar k_1^2\bar k_1^3/6,$$
$$\bar P_3^{1\cdots 5} = \bar k_4^{1\cdots 5}/5! + S_1/24 + S_2/12 + S_3/12, \ \text{where } S_1 = S\,\bar k_3^{1\cdots 4}\bar k_1^5, \ S_2 = S\,\bar k_2^{12}\bar k_2^{345}, \ S_3 = S\,\bar k_1^1\bar k_1^2\bar k_2^{345},$$
$$\bar P_3^{1\cdots 7} = S\,\bar k_2^{123}\bar k_3^{4\cdots 7}/144 + S\,\bar k_2^{123}\bar k_2^{4\cdots 6}\bar k_1^7/72, \quad \bar P_3^{1\cdots 9} = S\,\bar k_2^{1\cdots 3}\bar k_2^{4\cdots 6}\bar k_2^{7\cdots 9}/6^4. \quad (21)$$
These formulas are mostly new. The terms involving $S$ are given explicitly for the first time in Appendix A. By [2], the distribution and density of $X_n$ of (10) can be expanded as

$$\mathrm{Prob.}(X_n \leq x) \approx \sum_{r=0}^{\infty} n^{-r/2}\,P_r(x), \quad p_{X_n}(x) \approx \sum_{r=0}^{\infty} n^{-r/2}\,p_r(x), \quad x \in R^q, \quad (22)$$

where $P_0(x) = \Phi_V(x)$, $p_0(x) = \phi_V(x)$, and $P_r(x) = \tilde B_r(e(-\partial/\partial x))\,\Phi_V(x)$, $p_r(x) = \tilde B_r(e(-\partial/\partial x))\,\phi_V(x)$ for $r \geq 1$. Set $\partial_i = \partial/\partial x_i$, $\bar\partial_k = \partial_{i_k}$ and $\bar O^{1\cdots k} = O^{i_1\cdots i_k} = (-\bar\partial_1)\cdots(-\bar\partial_k)$. Thus, by (18), for $r \geq 1$,

$$P_r(x) = \sum_{k=1}^{3r}\,[\,P_{rk}(x) : k - r \text{ even}\,], \quad (23)$$

$$\text{and} \quad p_r(x)/\phi_V(x) = \sum_{k=1}^{3r}\,[\,\tilde p_{rk} : k - r \text{ even}\,] = \tilde p_r(x), \quad (24)$$

$$\text{where} \quad P_{rk}(x) = \bar P_r^{1\cdots k}\,\bar H_*^{1\cdots k}, \quad \tilde p_{rk} = \bar P_r^{1\cdots k}\,\bar H^{1\cdots k}, \quad (25)$$

$$\bar H_*^{1\cdots k} = \bar H_*^{1\cdots k}(x, V) = \bar O^{1\cdots k}\,\Phi_V(x) = \int_{-\infty}^{x}\bar H^{1\cdots k}\,\phi_V(x)\,dx, \quad (26)$$

$$\text{and} \quad \bar H^{1\cdots k} = H^{i_1\cdots i_k} = \bar H^{1\cdots k}(x, V) = \phi_V(x)^{-1}\,\bar O^{1\cdots k}\,\phi_V(x) \quad (27)$$
is the multivariate Hermite polynomial. By [9], for $I = \sqrt{-1}$,

$$\bar H^{1\cdots k} = E\prod_{j=1}^{k}(\bar y_j + I\,\bar Y^j) \ \text{for } Y \text{ of (12), where } \bar y_j = y_{i_j}, \ y = V^{-1}x. \quad (28)$$

$$\text{Thus,} \quad \bar H^1 = \bar y_1, \quad \bar H^{12} = \bar y_1\bar y_2 - \bar V^{12},$$
$$\bar H^{1\cdots 3} = \bar y_1\bar y_2\bar y_3 - \sum^3\bar y_1\bar V^{23}, \ \text{where } \sum^3\bar y_1\bar V^{23} = \bar y_1\bar V^{23} + \bar y_2\bar V^{13} + \bar y_3\bar V^{12},$$
$$\bar H^{1\cdots 4} = \bar y_1\cdots\bar y_4 - \sum^6\bar V^{12}\bar y_3\bar y_4 + \bar\mu^{1\cdots 4},$$
$$\bar H^{1\cdots 5} = \bar y_1\cdots\bar y_5 - \sum^{10}\bar V^{12}\bar y_3\bar y_4\bar y_5 + \sum^5\bar y_5\,\bar\mu^{1\cdots 4},$$
$$\bar H^{1\cdots 6} = \bar y_1\cdots\bar y_6 - \sum^{15}\bar y_1\cdots\bar y_4\bar V^{56} + \sum^{15}\bar y_1\bar y_2\,\bar\mu^{3\cdots 6} - \bar\mu^{1\cdots 6},$$

where $\sum^m$ denotes the sum over the $m$ distinct permutations of the indices,
for $\bar\mu^{1\cdots 2k}$ of (12). These give $p_1(x)$ and $p_2(x)$, and so $p_{X_n}(x)$ of (22) to $O(n^{-3/2})$. For the $\bar H^{1\cdots k}$ with $7 \leq k \leq 9$ needed for $p_3(x)$ when $q = 2$, see Appendix B. $P_{rk}(x)$ is just $\tilde p_{rk}$ with $\bar H^{1\cdots k}$ replaced by $\bar H_*^{1\cdots k}$ of (26). For example,
$$\bar H_*^1 = \bar J^1, \quad \bar H_*^{12} = \bar J^{12} - \bar V^{12}\,\Phi_V(x), \quad \bar H_*^{1\cdots 3} = \bar J^{123} - \sum^3\bar J^1\bar V^{23},$$
$$\text{where } \bar J^{1\cdots k} = \bar J^{1\cdots k}(x, V) = E\,\bar Y^1\cdots\bar Y^k\,I(X \leq x) = \bar V^{1,k+1}\cdots\bar V^{k,2k}\,\bar M^{k+1\cdots 2k},$$
$$\text{and } \bar M^{a\cdots b} = \bar M^{a\cdots b}(x, V) = E\,\bar X^a\cdots\bar X^b\,I(X \leq x) = \int_{-\infty}^{x}\bar x_a\cdots\bar x_b\,\phi_V(x)\,dx.$$

I call $\{\bar M^{a\cdots b}(x, V)\}$ the partial moments of $\Phi_V(x)$.
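The explicit $\bar H^{1\cdots k}$ listed above can be checked numerically. A recursion equivalent to the expectation formula (28) — assumed here from the standard multivariate Hermite-polynomial literature rather than taken from this paper — is $\bar H^{1\cdots k} = \bar y_k\,\bar H^{1\cdots k-1} - \sum_{j<k}\bar V^{jk}\,\bar H^{1\cdots k-1\,\setminus j}$. A sketch with 0-based indices and illustrative names:

```python
def mv_hermite(idx, y, Vinv):
    """H^{i_1...i_k}(x, V) of (27)-(28), given y = V^{-1} x and
    Vinv[a][b] = (V^{-1})_{ab}, via the recursion
    H^{1..k} = y_k * H^{1..k-1} - sum_j Vinv[j][k] * H^{1..k-1 minus j}.
    (Recursion assumed from the Hermite literature, not this paper.)"""
    if not idx:
        return 1.0
    *head, last = idx
    val = y[last] * mv_hermite(head, y, Vinv)
    for pos, j in enumerate(head):
        val -= Vinv[j][last] * mv_hermite(head[:pos] + head[pos + 1:], y, Vinv)
    return val
```

For $k = 2$ this reproduces $\bar H^{12} = \bar y_1\bar y_2 - \bar V^{12}$, and for $k = 3$ the three-term symmetrized sum displayed above.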
By (22), the Edgeworth expansions to $O(n^{-2})$ for the distribution and density of $X_n$ of (10) about those of $X = N_q(0, V)$ are given by

$$P_1(x) = e_1(-\partial/\partial x)\,\Phi_V(x) = \sum_{r=1}^{3}\bar b_{r+1}^{1\cdots r}\,\bar O^{1\cdots r}\,\Phi_V(x)/r! = \sum_{k=1,3}P_{1k}(x),$$
$$\text{where } P_{11}(x) = \bar k_1^1\,(-\bar\partial_1)\,\Phi_V(x), \quad P_{13}(x) = \bar k_2^{1\cdots 3}\,\bar O^{1\cdots 3}\,\Phi_V(x)/6,$$
$$p_1(x) = \bar k_1^1\,(-\bar\partial_1)\,\phi_V(x) + \bar k_2^{1\cdots 3}\,\bar O^{1\cdots 3}\,\phi_V(x)/6,$$
$$\tilde p_1(x) = p_1(x)/\phi_V(x) = \sum_{k=1,3}\tilde p_{1k}, \quad \tilde p_{11} = \bar k_1^1\,\bar H^1, \quad \tilde p_{13} = \bar k_2^{1\cdots 3}\,\bar H^{1\cdots 3}/6,$$

$$P_2(x) = \sum_{k=2,4,6}P_{2k}(x), \quad \tilde p_2(x) = \sum_{k=2,4,6}\tilde p_{2k},$$

$$P_3(x) = \sum_{k=1,3,5,7,9}P_{3k}(x), \quad \tilde p_3(x) = \sum_{k=1,3,5,7,9}\tilde p_{3k},$$
for $\tilde p_{rk}$ and $P_{rk}$ of (25). Each has $q^k$ terms, but many are duplicates, as I make $\bar P_r^{1\cdots k}$ symmetric in $i_1, \ldots, i_k$. Let us call (22) the basic Edgeworth expansions. Typically, $E\hat w$ is a one-to-one function of $w$. In this case, an alternative is to use the zero-mean Edgeworth expansions, that is, the expansions for

$$X_n^0 = n^{1/2}(\hat w - E\hat w) = X_n + \delta_n, \quad \text{where } \delta_n = -n^{1/2}(E\hat w - w) \approx -\sum_{d=1}^{\infty} n^{1/2-d}\,(k_d^i),$$

by (9) with $r = 1$. That is,

$$\mathrm{Prob.}(X_n \leq x) = \mathrm{Prob.}(X_n^0 \leq x + \delta_n), \quad p_{X_n}(x) = p_{X_n^0}(x + \delta_n), \ \text{where}$$
$$\mathrm{Prob.}(X_n^0 \leq x) \approx \sum_{r=0}^{\infty} n^{-r/2}\,P_r^0(x), \quad p_{X_n^0}(x) \approx \sum_{r=0}^{\infty} n^{-r/2}\,p_r^0(x), \quad x \in R^q,$$
$$P_0^0(x) = \Phi_V(x), \quad p_0^0(x) = \phi_V(x), \ \text{and for } r \geq 1,$$
$$P_r^0(x) = \sum_{k=1}^{3r}\,[\,P_{rk}^0(x) : k - r \text{ even}\,], \quad p_r^0(x)/\phi_V(x) = \sum_{k=1}^{3r}\,[\,\tilde p_{rk}^0 : k - r \text{ even}\,] = \tilde p_r^0(x),$$
$$P_{rk}^0(x) = \bar P_{r0}^{1\cdots k}\,\bar H_*^{1\cdots k}, \quad \tilde p_{rk}^0 = \bar P_{r0}^{1\cdots k}\,\bar H^{1\cdots k},$$
and $\bar P_{r0}^{1\cdots k}$ is the zero-mean Edgeworth coefficient, that is, $\bar P_r^{1\cdots k}$ with $\bar k_d^1 \equiv 0$. These are mostly simpler. By (19)–(21),

$$\bar P_{10}^1 = 0, \quad \bar P_{10}^{1\cdots 3} = \bar k_2^{1\cdots 3}/3!, \quad \bar P_{20}^{12} = \bar k_2^{12}/2, \quad \bar P_{20}^{1\cdots 4} = \bar k_3^{1\cdots 4}/4!, \quad \bar P_{20}^{1\cdots 6} = S\,\bar k_2^{1\cdots 3}\bar k_2^{4\cdots 6}/72,$$
$$\bar P_{30}^1 = 0, \quad \bar P_{30}^{1\cdots 3} = \bar k_3^{1\cdots 3}/6, \quad \bar P_{30}^{1\cdots 5} = \bar k_4^{1\cdots 5}/5! + S\,\bar k_2^{12}\bar k_2^{3\cdots 5}/12,$$
$$\bar P_{30}^{1\cdots 7} = S\,\bar k_2^{123}\bar k_3^{4\cdots 7}/144, \quad \bar P_{30}^{1\cdots 9} = \bar P_3^{1\cdots 9}.$$
The simplest example is when sampling from a distribution $F_0(x - w)$, where $F_0(x)$ is a known distribution with mean $0 \in R^q$; then, the cumulant coefficients do not depend on $w$ apart from the mean.
For the location-scale model, when sampling from a distribution $F_0(V^{-1/2}(x - w))$, where $F_0(x)$ is a known distribution with finite moments, mean $0 \in R^q$ and covariance $I_q$: if $V = V(w)$ depends on $w$, then so do the cumulant coefficients. To make an inference on $w$, one needs to consider the Studentised statistic. For this and more general models, see [32,33].
Returning to the expansion for the density of $X_n$: the density of $X_n$ relative to its asymptotic value is

$$p_{X_n}(x)/\phi_V(x) \approx 1 + \sum_{r=1}^{\infty} n^{-r/2}\,\tilde p_r(x) = 1 + n^{-1/2}\,\tilde p_1(x) + O(n^{-1}) \quad \text{for } x \in R^q,$$

for $\tilde p_r(x)$ of (24). Thus, $n^{-1/2}\tilde p_1(x)$ is a simple measure of the inaccuracy of the Central Limit Theorem (CLT) approximation.
Example 1.
If the distribution of $\hat w$ is symmetric about $w$, then, for $r$ odd, $p_r(x) = \tilde p_r(x) = P_r(x) = 0$, and by (20), $\tilde p_{26}(x) = 0$. Thus,

$$p_{X_n}(x)/\phi_V(x) = 1 + n^{-1}\,\tilde p_2(x) + O(n^{-2}),$$
$$\text{where } \tilde p_2(x) = \tilde p_{22} + \tilde p_{24}, \quad \tilde p_{22} = \bar k_2^{12}\,\bar H^{12}/2, \quad \tilde p_{24} = \bar k_3^{1\cdots 4}\,\bar H^{1\cdots 4}/24,$$

since $\bar P_2^{12} = \bar k_2^{12}/2$ and $\bar P_2^{1\cdots 4} = \bar k_3^{1\cdots 4}/24$. In this case, $n^{-1}\tilde p_2(x)$ is a measure of the inaccuracy of the CLT approximation. Also,

$$\mathrm{Prob.}(X_n \leq x) = \Phi_V(x) + n^{-1}\,P_2(x) + O(n^{-2}), \quad \text{where } P_2 = P_{22} + P_{24}.$$
Example 2.
Let $\hat w$ be the sample mean from a distribution with finite cross cumulants $\bar\kappa^{1\cdots r}$, $r \geq 1$. Then, $E\hat w = w$, and only the leading coefficient in (9) is non-zero. Thus,

$$\bar k_1^1 = \bar k_2^{12} = \bar k_2^1 = \bar k_3^{123} = 0, \quad \text{and} \quad \tilde p_{11} = \tilde p_{22} = \tilde p_{31} = \tilde p_{33} = 0.$$

For $r = 1, 2, 3$, the non-zero zero-mean Edgeworth coefficients $\bar P_{r0}^{1\cdots k}$ are

$$\bar P_{10}^{1\cdots 3} = \bar\kappa^{1\cdots 3}/3!, \quad \bar P_{20}^{1\cdots 4} = \bar\kappa^{1\cdots 4}/4!, \quad \bar P_{20}^{1\cdots 6} = S\,\bar\kappa^{1\cdots 3}\bar\kappa^{4\cdots 6}/72,$$
$$\bar P_{30}^{1\cdots 5} = \bar\kappa^{1\cdots 5}/5!, \quad \bar P_{30}^{1\cdots 7} = S\,\bar\kappa^{123}\bar\kappa^{4\cdots 7}/144, \quad \bar P_{30}^{1\cdots 9} = S\,\bar\kappa^{1\cdots 3}\bar\kappa^{4\cdots 6}\bar\kappa^{7\cdots 9}/6^4.$$

By (34)–(36), for $1 \leq r \leq 3$, $P_r^0(x)$ and $\tilde p_r^0(x)$ have only $r$ terms. Note that

$$\bar P_{r0}^{1\cdots k} = 0 \ \text{for } 1 \leq k \leq r + 1, \quad \text{and} \quad \bar P_{r0}^{1\cdots k} = \bar\kappa^{1\cdots k}/k! \ \text{for } k = r + 2.$$
Note 2.
For $H_k(x)$, the univariate Hermite polynomial of (6), and $1 \leq j \leq q$, I define the $j$th marginal Hermite polynomial as

$$H_j^k = E\,(y_j + I\,Y_j)^k = \tau_j^k\,H_k(\tau_j^{-1}y_j), \quad \text{where } I = \sqrt{-1}, \ \tau_j = (V^{jj})^{1/2}:$$

$$H_j^1 = y_j, \quad H_j^2 = y_j^2 - V^{jj}, \quad H_j^3 = y_j^3 - 3V^{jj}y_j, \quad H_j^4 = y_j^4 - 6V^{jj}y_j^2 + 3(V^{jj})^2.$$
$$\text{Thus, if } q = 1, \quad H_1^k(x, V) = \sigma^k H_k(\sigma x) = H^{1\cdots k}(x, V), \quad \text{where } \sigma^2 = V^{-1}.$$
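The scaling identity $H_j^k = \tau_j^k H_k(\tau_j^{-1}y_j)$ is easy to check against the explicit $H_j^2$, $H_j^3$ above. A sketch (illustrative names; $V^{jj} = 1.7$ is an arbitrary test value):

```python
def hermite(k, u):
    """Univariate H_k of (6) via the three-term recurrence."""
    h_prev, h = 1.0, u
    if k == 0:
        return h_prev
    for j in range(1, k):
        h_prev, h = h, u * h - j * h_prev
    return h

def marginal_hermite(k, y, Vjj):
    """H_j^k = tau^k H_k(y / tau) with tau = sqrt(V^{jj})."""
    tau = Vjj ** 0.5
    return tau ** k * hermite(k, y / tau)
```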
Ref. [34] gave explicit expressions for a multivariate version of the Cornish–Fisher expansion when $q = 2$. Ref. [35] showed how to extend (4) to the multivariate case by replacing (8) by

$$X_n \approx X + \sum_{r=1}^{\infty} n^{-r/2}\,g_r(X).$$

It would be very useful to obtain these multivariate $g_r(X)$ explicitly. How does one extend the univariate formula for $g_r(x)$ in terms of $h_r(x)$?
Note 3.
Standard estimates have a natural extension to Type b estimates. These are estimates for which the cumulant expansion (9) is replaced by

$$\bar k^{1\cdots r} = k^{i_1\cdots i_r} = \kappa(\hat w^{i_1}, \ldots, \hat w^{i_r}) \approx \sum_{j=2r-2}^{\infty} n^{-j/2}\,\bar b_j^{1\cdots r}, \quad \text{where } \bar b_j^{1\cdots r} = b_j^{i_1\cdots i_r}.$$

Examples are one-sided confidence interval limits. These have the form

$$\hat w \approx \sum_{j=0}^{\infty} n^{-j/2}\,t_j(\hat\theta),$$

where $\hat\theta$ is a standard estimate and the $t_j$ are smooth functions. See [2] for more details.

3. Secondary or Derived Expansions

Let $V$ have the spectral form $H\Lambda H'$, where $H'H = I_q$. Set $S = V^{-1/2} = H\Lambda^{-1/2}H'$ and $T = V^{1/2} = H\Lambda^{1/2}H'$.
As in (25)–(27), I use tensor summation and the bar notation, and

$$N \sim N_q(0, I_q), \quad X = TN \sim N_q(0, V), \quad Y = V^{-1}X = SN \sim N_q(0, V^{-1}).$$
From the second Edgeworth expansion in (22), it follows that for $C \subset R^q$,

$$\mathrm{Prob.}(X_n \in C) \approx \sum_{r=0}^{\infty} n^{-r/2}\,P_r(C), \quad \text{where}$$
$$P_r(C) = E\,\tilde p_r(X)\,I(X \in C) = \int_C \tilde p_r(x)\,\phi_V(x)\,dx. \quad \text{Thus,}$$
$$P_0(C) = \mathrm{Prob.}(X \in C) = \int_C d\Phi_V(x) = \Phi_V(C),$$

and for $r \geq 1$,

$$P_r(C) = \sum_{k=1}^{3r}\,[\,P_{rk}(C) : k - r \text{ even}\,], \quad \text{where}$$
$$P_{rk}(C) = E\,\tilde p_{rk}(X)\,I(X \in C) = \int_C \tilde p_{rk}(x)\,\phi_V(x)\,dx = \bar P_r^{1\cdots k}\,\bar H_*^{1\cdots k}(C),$$
$$\text{and} \quad \bar H_*^{1\cdots k}(C) = E\,\bar H^{1\cdots k}(X, V)\,I(X \in C) = \int_C \bar H^{1\cdots k}\,\phi_V(x)\,dx:$$
$$P_1(C) = P_{11}(C) + P_{13}(C), \quad P_{r1}(C) = \bar P_r^1\,\bar H_*^1(C), \quad P_{r3}(C) = \bar P_r^{1\cdots 3}\,\bar H_*^{1\cdots 3}(C),$$
$$P_2(C) = \sum_{k=2,4,6}P_{2k}(C), \ \text{where } P_{r2}(C) = \bar P_r^{12}\,\bar H_*^{12}(C), \ P_{r4}(C) = \bar P_r^{1\cdots 4}\,\bar H_*^{1\cdots 4}(C), \ P_{r6}(C) = \bar P_r^{1\cdots 6}\,\bar H_*^{1\cdots 6}(C).$$

As $\bar H^{1\cdots k}$ is a linear combination of the $\bar y_1\cdots\bar y_s$, we can write $\bar H_*^{1\cdots k}(C)$ in terms of

$$\bar m^{1\cdots s} = E\,\bar Y^1\cdots\bar Y^s\,I(VY \in C) = \int_C \bar y_1\cdots\bar y_s\,\phi_V(x)\,dx: \quad (51)$$

$$\bar H_*^1(C) = \bar m^1, \quad \bar H_*^{12}(C) = \bar m^{12} - \Phi_V(C)\,\bar V^{12}, \quad \bar H_*^{1\cdots 3}(C) = \bar m^{1\cdots 3} - \sum^3\bar m^1\,\bar V^{23},$$
$$\bar H_*^{1\cdots 4}(C) = \bar m^{1\cdots 4} - \sum^6\bar m^{12}\,\bar V^{34} + \Phi_V(C)\,\bar\mu^{1\cdots 4},$$
$$\bar H_*^{1\cdots 6}(C) = \bar m^{1\cdots 6} - \sum^{15}\bar m^{1\cdots 4}\,\bar V^{56} + \sum^{15}\bar m^{12}\,\bar\mu^{3\cdots 6} - \Phi_V(C)\,\bar\mu^{1\cdots 6}.$$
This gives $P_1(C)$ and $P_2(C)$ of (45). Therefore, it gives $\mathrm{Prob.}(X_n \in C)$ to $O(n^{-3/2})$. For $\hat w$ a sample mean, $P_{22}(C) = 0$, and for $k = 4, 6$, the $\bar P_2^{1\cdots k}$ needed for $P_{2k}(C)$ are given by Example 2.
If $C = -C$, then for $r$ odd, $\bar m^{1\cdots r} = P_{rk}(C) = P_r(C) = 0$, so that

$$\mathrm{Prob.}(X_n \in C) \approx \sum_{r=0}^{\infty} n^{-r}\,P_{2r}(C) = \Phi_V(C) + n^{-1}\,P_2(C) + O(n^{-2}).$$

I now consider three such $C$'s.
Example 3.
Ellipsoidal $C$. Take $C = \{x : x'V^{-1}x \leq u\}$ for some $u > 0$. Then,

$$C = -C, \quad \Phi_V(C) = \mathrm{Prob.}(N'N < u) = \mathrm{Prob.}(\chi_q^2 < u).$$

(We might choose $u$ such that $\Phi_V(C) = 0.5$ or $0.9$.) $\bar m^{1\cdots s}$ of (51) is given by

$$\bar m^{1\cdots s} = E\,\bar Y^1\cdots\bar Y^s\,I(N'N < u) = \bar S^{1,s+1}\cdots\bar S^{s,2s}\,\bar R^{s+1\cdots 2s},$$
$$\text{where } \bar R^{1\cdots k} = E\,\bar N^1\cdots\bar N^k\,I(N'N < u).$$

Thus, $R^{12} = 0$ and $R^{11} = E\,Z_1\,I(Z_1 + Z_2 < u) = R_2$, where

$$Z_1 = N_1^2 \sim \chi_1^2 \ \text{and} \ Z_2 = N'N - N_1^2 \sim \chi_{q-1}^2 \ \text{are independent. Thus,}$$
$$\bar m^{12} = R_{11}\,\bar S_1^{12}, \ \text{where } \bar S_1^{12} = \sum_{i_3, i_4}\bar S^{13}\,\bar S^{24}\,I(i_3 = i_4) = \sum_{i_3=1}^{q}\bar S^{13}\,\bar S^{23} = \bar V^{12}.$$
$$\text{Thus, by (50),} \quad P_{r2}(C) = (R_2 - \Phi_V(C))\,\bar P_r^{12}\,\bar V^{12}.$$
Similarly, $\bar R^{1\cdots k} = 0$ unless $k$ is even and $\{i_1, \ldots, i_k\} = \{1^{J_1}, \ldots, q^{J_q}\}$, where $J_1, \ldots, J_q$ are even integers and $j^k$ is a string of $j$'s of length $k$. For $Z_1, Z_2$ of (56), set

$$R_{2k} = E\,N_1^{2k}\,I(N'N < u) = E\,Z_1^k\,I(Z_1 + Z_2 < u),$$
$$R_{2k_1,2k_2} = E\,N_1^{2k_1}N_2^{2k_2}\,I(N'N < u) = E\,Z_1^{k_1}Z_2^{k_2}\,I(Z_1 + Z_2 + Z_3 < u),$$

where now $Z_1 = N_1^2$, $Z_2 = N_2^2$ and $Z_3 = N'N - Z_1 - Z_2 \sim \chi_{q-2}^2$, so that $Z_1, Z_2, Z_3$ are independent. Set

$$I_1 = I(i_5 = i_6 \neq i_7 = i_8) + I(i_5 = i_7 \neq i_6 = i_8) + I(i_5 = i_8 \neq i_6 = i_7). \quad \text{Then,}$$
$$\bar m^{1\cdots 4} = \sum_{i_5,\ldots,i_8}\bar S^{15}\cdots\bar S^{48}\,[\,R_4\,I(i_5 = \cdots = i_8) + R_{22}\,I_1\,] = R_4\,\bar S_1^{1\cdots 4} + R_{22}\,\bar S_2^{1\cdots 4},$$
$$\text{where } \bar S_1^{1\cdots k} = \sum_{j=1}^{q} S^{i_1 j}\cdots S^{i_k j}, \quad \bar S_2^{1\cdots 4} = \bar S^{12:34} + \bar S^{13:24} + \bar S^{14:23},$$
$$\text{and } \bar S^{12:34} = \sum_{j \neq k} S^{i_1 j}S^{i_2 j}S^{i_3 k}S^{i_4 k} = \bar V^{12}\bar V^{34} - \bar S_1^{1\cdots 4} \ \text{by (57). Thus,}$$
$$\bar S_2^{1\cdots 4} = \bar\mu^{1\cdots 4} - 3\,\bar S_1^{1\cdots 4}, \quad \bar m^{1\cdots 4} = \bar S_1^{1\cdots 4}\,r_4 + \bar\mu^{1\cdots 4}\,R_{22} \ \text{for } r_4 = R_4 - 3R_{22},$$
$$P_{r4}(C) = \bar P_r^{1\cdots 4}\,[\,\bar S_1^{1\cdots 4}\,r_4 + 3\,\bar V^{12}\bar V^{34}\,r_5\,], \quad \text{where } r_5 = R_{22} - 2R_2 + \Phi_V(C).$$

Similarly, with $\sum'_{j,k,l}$ denoting a sum over distinct $j, k, l$,

$$\bar m^{1\cdots 6} = \sum \bar S^{17}\cdots\bar S^{6,12}\,[\,R_6\,I(i_7 = \cdots = i_{12}) + R_{42}\,I_2 + R_{222}\,I_3\,],$$

where $R_{222} = E\,N_1^2N_2^2N_3^2\,I(N'N < u)$, $I_2$ is the sum of the 15 indicators of the pattern $i_7 = \cdots = i_{10} \neq i_{11} = i_{12}$, and $I_3$ is the sum of the 90 indicators of the pattern $i_7 = i_8 \neq i_9 = i_{10} \neq i_{11} = i_{12}$, with

$$\bar S^{1\cdots 4:56} = \sum_{j \neq k} S^{i_1 j}\cdots S^{i_4 j}S^{i_5 k}S^{i_6 k}, \quad \bar S^{12:34:56} = \sum{}'_{j,k,l}\,S^{i_1 j}S^{i_2 j}S^{i_3 k}S^{i_4 k}S^{i_5 l}S^{i_6 l}. \quad \text{Thus,}$$
$$\bar P_r^{1\cdots 6}\,\bar m^{1\cdots 6} = \bar P_r^{1\cdots 6}\,[\,\bar S_1^{1\cdots 6}\,R_6 + 15\,\bar S^{1\cdots 4:56}\,R_{42} + 90\,\bar S^{12:34:56}\,R_{222}\,],$$
$$P_{r6}(C) = \bar P_r^{1\cdots 6}\,[\,\bar m^{1\cdots 6} - 15\,\bar m^{1\cdots 4}\,\bar V^{56} + 15\,\bar m^{12}\,\bar\mu^{3\cdots 6} - \Phi_V(C)\,\bar\mu^{1\cdots 6}\,]$$
$$= \bar P_r^{1\cdots 6}\,[\,\bar S_1^{1\cdots 6}\,R_6 + 15\,\bar S^{1\cdots 4:56}\,R_{42} + 90\,\bar S^{12:34:56}\,R_{222} - 15\,\bar S_1^{1\cdots 4}\,\bar V^{56}\,r_4 + 3\,\bar V^{12}\bar V^{34}\bar V^{56}\,r_6\,],$$
$$\text{where } r_6 = -15R_{42} + 15R_2 - \Phi_V(C).$$
$P_2(C)$ of (52) is now given by (58), (60) and (61). Note that

$$\bar V^{12}\bar V^{34}\bar V^{56} = \bar S_1^{1\cdots 6} + 3\,\bar S^{1\cdots 4:56} + \bar S^{12:34:56}.$$
We can write the $R_{2k_1,\ldots}$ in terms of $F_k(u) = \mathrm{Prob.}(\chi_k^2 \leq u)$:

$$R_{2k} = E\,Z_1^k\,F_{q-1}(u - Z_1) = \int_0^u z_1^k\,F_{q-1}(u - z_1)\,dF_1(z_1) = G_{q-1}(u{:}k),$$
$$R_{2k_1,2k_2} = E\,Z_2^{k_2}\,G_{q-2}(u - Z_2{:}k_1) = \int_0^u z_2^{k_2}\,G_{q-2}(u - z_2{:}k_1)\,dF_1(z_2) = G_{q-2}(u{:}k_1k_2),$$
$$R_{222} = E\,Z_3\,G_{q-3}(u - Z_3{:}11) = \int_0^u z_3\,G_{q-3}(u - z_3{:}11)\,dF_1(z_3).$$
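As a numerical check of the first of these formulas, under my own illustrative choices $q = 3$ and $u = 4$ (so that $F_{q-1}(x) = 1 - e^{-x/2}$ in closed form): quadrature of $G_{q-1}(u{:}1)$ against a Monte Carlo evaluation of $R_2 = E\,N_1^2\,I(N'N < u)$:

```python
import math, random

def phi(t):  # standard normal density
    return math.exp(-t * t / 2) / math.sqrt(2 * math.pi)

# G_{q-1}(u:1) = int_0^u z F_{q-1}(u - z) dF_1(z), with q = 3, u = 4,
# F_2(x) = 1 - exp(-x/2) and dF_1(z) = phi(sqrt(z))/sqrt(z) dz.
u, m = 4.0, 20000
h = u / m
quad = 0.0
for i in range(m):
    z = (i + 0.5) * h          # midpoint rule
    quad += z * (1 - math.exp(-(u - z) / 2)) * phi(math.sqrt(z)) / math.sqrt(z) * h

# Monte Carlo evaluation of R_2 = E N_1^2 I(N'N < u) for q = 3.
random.seed(1)
trials = 400_000
mc = 0.0
for _ in range(trials):
    n1 = random.gauss(0, 1); n2 = random.gauss(0, 1); n3 = random.gauss(0, 1)
    if n1 * n1 + n2 * n2 + n3 * n3 < u:
        mc += n1 * n1
mc /= trials
```

The two estimates agree to Monte Carlo accuracy (both are near $0.45$ for these choices).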
For $X, X_n$ of (10), set

$$Q = X'V^{-1}X \sim \chi_q^2, \quad Q_n = X_n'V^{-1}X_n, \quad \hat Q_n = X_n'\hat V^{-1}X_n = n(\hat w - w)'\hat V^{-1}(\hat w - w),$$

where $\hat V$ is an estimate of $V$ (empirical, semi-parametric, or parametric, depending on the model used). Therefore,

$$\mathrm{Prob.}(Q_n \leq u) = \mathrm{Prob.}(\chi_q^2 \leq u) + n^{-1}\,P_2(C) + O(n^{-2}) = 1 - \alpha + O(n^{-1}) \ \text{if } u = \chi_{q,1-\alpha}^2,$$

the $1 - \alpha$ quantile of $\chi_q^2$. It is often true that

$$\mathrm{Prob.}(\hat Q_n \leq u) = \mathrm{Prob.}(\chi_q^2 \leq u) + O(n^{-1}).$$
(An exact confidence region for $w$ is only possible if $\hat w$ is normal: see 5.3.2 of [36].) $\chi_q^2/q = G_\gamma/\gamma$, where $\gamma = q/2$ and $G_\gamma$ is a gamma random variable with mean $\gamma$. Thus, if $q = 2$,

$$\Phi_V(C) = \mathrm{Prob.}(G_1 < u/2) = 1 - e^{-u/2} = 1 - \alpha \ \text{if } 0 < u = -2\ln\alpha,$$

so $Q_n < -2\ln\alpha$ with probability $1 - \alpha + O(n^{-1})$, and typically,

$$\hat Q_n \leq -2\ln\alpha \ \text{with probability } 1 - \alpha + O(n^{-1}).$$

By way of illustration, Figure 1 plots the elliptical contours $x = X_n$ when $Q_n = -2\ln\alpha$, for $1 - \alpha = 0.5, 0.9, 0.99$, when

$$V = \begin{pmatrix} 2 & 1 \\ 1 & 2 \end{pmatrix}, \quad V^{-1} = \frac{1}{3}\begin{pmatrix} 2 & -1 \\ -1 & 2 \end{pmatrix}.$$

Therefore, the asymptotic correlation of $\hat w_1$ and $\hat w_2$ is $1/2$.
One can do similarly for $q > 2$, using $\chi_{q,1-\alpha}^2$ and two-dimensional slices of these ellipsoids. For related references on confidence regions, see [10].
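A Monte Carlo sketch of the $q = 2$ ellipsoid above (the simulation set-up is my own; since the $O(n^{-1})$ term is ignored, the check uses exact normal $X$ rather than $X_n$): with $u = -2\ln\alpha$, the region $\{x : x'V^{-1}x < u\}$ covers $X \sim N_2(0, V)$ with probability exactly $1 - \alpha$:

```python
import math, random

random.seed(0)

V = [[2.0, 1.0], [1.0, 2.0]]               # the V of Figure 1
Vinv = [[2 / 3, -1 / 3], [-1 / 3, 2 / 3]]  # V^{-1}

# Cholesky-style factor T with T T' = V, so X = T N ~ N_2(0, V).
a = math.sqrt(V[0][0])
b = V[0][1] / a
c = math.sqrt(V[1][1] - b * b)

alpha = 0.1
u = -2 * math.log(alpha)                   # P(chi^2_2 < u) = 1 - alpha
hits, trials = 0, 200_000
for _ in range(trials):
    n1, n2 = random.gauss(0, 1), random.gauss(0, 1)
    x0, x1 = a * n1, b * n1 + c * n2
    q = Vinv[0][0] * x0 * x0 + 2 * Vinv[0][1] * x0 * x1 + Vinv[1][1] * x1 * x1
    hits += q < u
cover = hits / trials
```

Because $x'V^{-1}x = N'N$ when $x = TN$, the empirical coverage `cover` should be close to $0.9$ for $\alpha = 0.1$, matching $\Phi_V(C) = 1 - e^{-u/2}$.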
The $R$ functions. If $q = 1$, then $R_{2k_1,2k_2} = R_{222} = 0$ for $k_2 > 0$, and

$$R_{2k} = 2\,g_{2k}(u^{1/2}), \quad \text{where } g_k(u) = E\,N^k\,I(0 < N < u) = \int_0^u n^k\,\phi(n)\,dn$$

is given in Example 4 using a recurrence formula. (A simpler way is to just use (7).)
Now take $q = 2$. Then,

$$R_{2k_1,2k_2} = 4\,E\,N_1^{2k_1}N_2^{2k_2}\,I(N_1^2 + N_2^2 < u,\ N_1 > 0,\ N_2 > 0) = 4\,g_{2k_1,2k_2}(u^{1/2}),$$
$$\text{where } g_{k_1k_2}(v) = \int_0^v dn_1\,\phi(n_1)\,n_1^{k_1}\,g_{k_2}\big((v^2 - n_1^2)^{1/2}\big).$$

To obtain a recurrence formula for $g_{k_1k_2} = g_{k_1k_2}(v)$, note that $n_1^{k_1}\phi(n_1)\,dn_1 = -n_1^{k_1-1}\,d\phi(n_1)$ and set

$$f_k(u) = g_k(u^{1/2}), \quad A = n_1^{k_1-1}\,f_{k_2}(v^2 - n_1^2). \quad \text{Then,}$$
$$g_{k_1k_2} = -\int_0^v A\,d\phi(n_1) = -[A\,\phi(n_1)]_0^v + \int_0^v \phi(n_1)\,dA = I(k_1 = 1)\,\phi(0)\,g_{k_2}(v) + (k_1 - 1)A_1 + A_2,$$
$$A_1 = \int_0^v dn_1\,\phi(n_1)\,n_1^{k_1-2}\,f_{k_2}(v^2 - n_1^2) = g_{k_1-2,k_2},$$
$$A_2 = -\int_0^v dn_1\,\phi(n_1)\,n_1^{k_1-1}(2n_1)\,\dot f_{k_2}(v^2 - n_1^2), \quad \dot f_k(u) = df_k(u)/du.$$
$$\text{Since } f_k(v^2) = g_k(v) \ \text{and} \ 2v\,\dot f_k(v^2) = \dot g_k(v) = v^k\phi(v), \ \text{we have}$$
$$\dot f_{k_2}(v^2 - n_1^2) = \tfrac{1}{2}(v^2 - n_1^2)^{(k_2-1)/2}\,\phi\big((v^2 - n_1^2)^{1/2}\big),$$
$$\text{and, as } \phi(n_1)\,\phi\big((v^2 - n_1^2)^{1/2}\big) = \phi(0)\,\phi(v), \quad A_2 = -\phi(0)\,\phi(v)\,A_3, \ \text{where}$$
$$A_3 = \int_0^v dn_1\,n_1^{k_1}\,(v^2 - n_1^2)^{(k_2-1)/2} = v^{k_1+k_2}\,B_{12}/2 \ \text{for } B_{12} = B\big((k_1 + 1)/2,\,(k_2 + 1)/2\big).$$
$$\text{Thus,} \quad A_2 = -\phi(0)\,\phi(v)\,v^{k_1+k_2}\,B_{12}/2 = A_{k_1k_2}, \ \text{and for } k_1 \geq 1,$$
$$g_{k_1k_2} = (k_1 - 1)\,g_{k_1-2,k_2} + G_{k_1k_2}, \quad \text{where } G_{k_1k_2} = I(k_1 = 1)\,\phi(0)\,g_{k_2}(v) + A_{k_1k_2}.$$
This recurrence formula for $g_{k_1k_2}$ of (63) gives

$$R_{22} = 4\,g_{22}(u^{1/2}), \ \text{where } g_{22}(v) = g_{22} = g_{02} + A_{22} \ \text{and } A_{22} = -v^4 e^{-v^2/2}/32, \ \text{as } B_{12} = \pi/8;$$
$$R_{42} = 4\,g_{42}(u^{1/2}), \ \text{where } g_{42}(v) = g_{42} = 3\,g_{22} + A_{42} \ \text{and } A_{42} = -v^6 e^{-v^2/2}/64, \ \text{as } B_{12} = \pi/16.$$

Therefore, we need $g_{02}$. By Example 4,

$$g_2(u) = \Phi_0(u) - u\,\phi(u), \quad \text{where } \Phi_0(u) = \Phi(u) - 1/2.$$
Transforming to $u^2 = v^2 - n_1^2$,

$$g_{02} = \int_0^v dn_1\,\phi(n_1)\,g_2(u) = e^{-v^2/2}(A - B), \ \text{where } A = \int_0^v (v^2 - u^2)^{-1/2}\,\Phi_0(u)\,u\,\phi(u)\,du,$$
$$B = \phi(0)\int_0^v u^2\,(v^2 - u^2)^{-1/2}\,\phi(u)\,du = (2\pi)^{-1}(v^2/2)\,I(-v^2/2), \ \text{and}$$
$$I(t) = \int_0^1 x^{1/2}(1 - x)^{-1/2}e^{tx}\,dx = \sum_{i=0}^{\infty} B(i + 3/2,\,1/2)\,t^i/i!,$$
$$B(i + 3/2,\,1/2) = \pi^{1/2}\,\Gamma(i + 3/2)/(i + 1)!, \quad \Gamma(i + 3/2) = \Gamma(3/2)\,[3/2]_i, \ \text{where } [t]_i = t(t + 1)\cdots(t + i - 1).$$
$$\text{So} \quad B = (v^2/8)\,{}_1F_1(3/2,\,2;\,-v^2/2),$$

where ${}_1F_1(3/2, 2; t)$ is the confluent or degenerate hypergeometric function: see Section 9.21 of [37] and p. 504 of [38].
Now, suppose that $q > 2$. Then,

$$R_{2k_1,2k_2} = 4\,E\,N_1^{2k_1}N_2^{2k_2}\,I(N_1^2 + N_2^2 < u - Z,\ N_1 > 0,\ N_2 > 0) = 4\,E\,g_{2k_1,2k_2}\big((u - Z)^{1/2}\big),$$
$$\text{where } Z = \sum_{i=3}^{q} N_i^2 \sim \chi_{q-2}^2, \ \text{and}$$
$$R_{222} = 8\,E\,N_1^2N_2^2N_3^2\,I(N'N < u,\ N_1, N_2, N_3 > 0) = 8\,E\,N_3^2\,R_{22}(u - N_3^2 - Z)\,I(N_3 > 0),$$

where $R_{22}(u) = R_{22}$ of (63), and here $Z = \sum_{i=4}^{q} N_i^2 \sim \chi_{q-3}^2$ is independent of $N_3$.
By Theorem 2.2 of [10], for $r \geq 1$,

$$P_r(C) = \sum_{j=1}^{3r} b_{2r,j}\,(U - 1)^j\,F_q(x), \ \text{where } F_q(x) = \mathrm{Prob.}(\chi_q^2 < x), \ U^jF_q(x) = F_{q+2j}(x),$$
$$b_{2r,j} = \bar b_{2r}^{1\cdots 2j}\,\bar\mu^{1\cdots 2j}, \quad \bar\mu^{1\cdots 2j} = E\,\bar X^1\cdots\bar X^{2j}.$$

For example, $P_1(C)$ is given by

$$b_{21} = (\bar k_2^{12} + \bar k_1^1\bar k_1^2)\,\bar V^{12}, \quad b_{22} = \bar k_3^{1\cdots 4}\,\bar V^{12}\bar V^{34}/4 + \bar k_1^1\bar k_2^{34}\,\bar V^{12}\bar V^{34},$$
$$b_{23} = \bar k_2^{123}\bar k_2^{456}\,(\bar V^{12}\bar V^{34}\bar V^{56}/4 + \bar V^{14}\bar V^{25}\bar V^{36}/6).$$

Its extension to $\hat Q_n$ is given in Section 3 of [10] for parametric and non-parametric models.
Example 4.
Take $C = \{x : |(V^{-1/2}x)_j| \leq u_j,\ j = 1, \ldots, q\}$. Thus, $C = -C$. For $Y = V^{-1/2}N$ and $N \sim N_q(0, I_q)$,

$$\Phi_V(C) = \prod_{j=1}^{q} G_0(u_j), \quad \text{where } G_0(u) = \mathrm{Prob.}(|N_1| < u) = 2\Phi(u) - 1.$$

(We might choose $u_j \equiv u$ such that $\Phi_V(C) = 0.5$ or $0.9$.) For $S = V^{-1/2}$,

$$\bar m^{1\cdots k} = E\,\bar Y^1\cdots\bar Y^k\,I(|N_j| < u_j,\ j = 1, \ldots, q) = \bar S^{1,k+1}\cdots\bar S^{k,2k}\,\bar R^{k+1\cdots 2k},$$
$$\text{where now } \bar R^{1\cdots k} = E\,\bar N^1\cdots\bar N^k\,I(|N_j| < u_j,\ j = 1, \ldots, q). \quad \text{Set}$$
$$R_{2k_1,\ldots,2k_s} = E\,N_1^{2k_1}\cdots N_s^{2k_s}\,I(|N_j| < u_j,\ j = 1, \ldots, q) = \Big[\prod_{j=1}^{s} G_{k_j}(u_j)\Big]\Big[\prod_{j=s+1}^{q} G_0(u_j)\Big],$$
$$\text{where } G_k(u) = E\,N_1^{2k}\,I(|N_1| < u) = 2\,g_{2k}(u), \ \text{and}$$
$$g_k(u) = g_k = \int_0^u n^k\,\phi(n)\,dn = -\int_0^u n^{k-1}\,d\phi(n) = -[n^{k-1}\phi(n)]_0^u + (k - 1)\,g_{k-2}:$$
$$g_0 = \Phi(u) - 1/2, \quad g_1 = \phi(0) - \phi(u), \quad g_2 = g_0 - u\,\phi(u), \quad g_3 = 2\,\phi(0) - (u^2 + 2)\,\phi(u),$$
$$g_4 = 3\,g_0 - (u^3 + 3u)\,\phi(u), \quad g_5 = 2\cdot4\,\phi(0) - (u^4 + 4u^2 + 4\cdot2)\,\phi(u), \quad g_6 = 3\cdot5\,g_0 - (u^5 + 5u^3 + 5\cdot3\,u)\,\phi(u),$$
$$g_7 = 2\cdot4\cdot6\,\phi(0) - (u^6 + 6u^4 + 6\cdot4\,u^2 + 6\cdot4\cdot2)\,\phi(u), \quad g_8 = 3\cdot5\cdot7\,g_0 - (u^7 + 7u^5 + 7\cdot5\,u^3 + 7\cdot5\cdot3\,u)\,\phi(u).$$

As $R^{12} = 0$, $\bar m^{12} = \bar V^{12}R_2$ by (57), where $R_{2k} = 2^q\,g_{2k}(u_1)\prod_{j=2}^{q} g_0(u_j)$. Set

$$R_{2k_1,2k_2} = 2^q\Big[\prod_{j=1}^{2} g_{2k_j}(u_j)\Big]\Big[\prod_{j=3}^{q} g_0(u_j)\Big], \quad R_{2k_1,2k_2,2k_3} = 2^q\Big[\prod_{j=1}^{3} g_{2k_j}(u_j)\Big]\Big[\prod_{j=4}^{q} g_0(u_j)\Big].$$
Thus, now, $\bar m^{1\cdots 4}$ is given by (59) and (60) in terms of these new $R_{2k_1,\ldots}$. Similarly, $\bar m^{1\cdots 6}$ and $P_{rk}(C)$ are given by Example 3 with these new $R_{2k_1,\ldots}$. Therefore, we now have $P_2(C)$ of (52). If we choose $u_j \equiv u$, then

$$R_{2k} = 2^q\,g_{2k}\,g_0^{q-1}, \quad R_{2k_1,2k_2} = 2^q\Big[\prod_{j=1}^{2} g_{2k_j}\Big]g_0^{q-2}, \quad R_{2k_1,2k_2,2k_3} = 2^q\Big[\prod_{j=1}^{3} g_{2k_j}\Big]g_0^{q-3}.$$
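The recurrence for $g_k$ in Example 4 is easy to implement; a sketch (illustrative names), seeded with $g_0$ and $g_1$ and checked against direct quadrature:

```python
import math

def phi(u):  # standard normal density
    return math.exp(-u * u / 2) / math.sqrt(2 * math.pi)

def g(k, u):
    """g_k(u) = int_0^u n^k phi(n) dn via the Example 4 recurrence
    g_k = -u^{k-1} phi(u) + (k-1) g_{k-2}, seeded with
    g_0 = Phi(u) - 1/2 and g_1 = phi(0) - phi(u)."""
    if k == 0:
        return 0.5 * math.erf(u / math.sqrt(2))   # Phi(u) - 1/2
    if k == 1:
        return phi(0.0) - phi(u)
    return -u ** (k - 1) * phi(u) + (k - 1) * g(k - 2, u)

# crude midpoint quadrature of int_0^1 n^4 phi(n) dn as an independent check
h = 1.0 / 2000
quad = sum(((i + 0.5) * h) ** 4 * phi((i + 0.5) * h) * h for i in range(2000))
```

The recursion reproduces the closed forms listed above, e.g. $g_2 = g_0 - u\phi(u)$ and $g_3 = 2\phi(0) - (u^2 + 2)\phi(u)$.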
Example 5.
Take $C = \{x : |x_j| \leq u_j,\ j = 1, \ldots, q\}$. Thus, $C = -C$. This choice gives a set of $q$ simultaneous intervals. For $Y = SN$ and $T = V^{1/2}$,

$$\Phi_V(C) = \mathrm{Prob.}(|(VY)_j| < u_j,\ j = 1, \ldots, q) = \mathrm{Prob.}(|(TN)_j| < u_j,\ j = 1, \ldots, q),$$
$$\bar m^{1\cdots k} = E\,\bar Y^1\cdots\bar Y^k\,I(|(VY)_j| < u_j,\ j = 1, \ldots, q) = \bar S^{1,k+1}\cdots\bar S^{k,2k}\,\bar R^{k+1\cdots 2k},$$
$$\text{where now } \bar R^{1\cdots k} = E\,\bar N^1\cdots\bar N^k\,I(|(TN)_j| < u_j,\ j = 1, \ldots, q).$$

Thus, $\bar R^{1\cdots k} = 0$ for $k$ odd. ($R^{12}$ is no longer zero.) But each of these non-zero terms requires $q$ numerical integrations. Consider the case $q = 2$. By the inclusion–exclusion identity

$$I(|X_j| < u_j,\ j = 1, 2) = I(X_1 < u_1, X_2 < u_2) - I(X_1 < -u_1, X_2 < u_2) - I(X_1 < u_1, X_2 < -u_2) + I(X_1 < -u_1, X_2 < -u_2),$$

we obtain

$$R^{jk} = F^{jk}(u_1, u_2) - F^{jk}(-u_1, u_2) - F^{jk}(u_1, -u_2) + F^{jk}(-u_1, -u_2),$$
$$\text{where } F^{jk}(u_1, u_2) = E\,N_jN_k\,I(X_1 < u_1, X_2 < u_2) = \int\!\!\int dn_1\,dn_2\,\phi(n_1)\,\phi(n_2)\,n_jn_k\,I\Big(\sum_{l=1}^{2} T_{jl}\,n_l < u_j,\ j = 1, 2\Big).$$
For more on these types of expansions and their inverses, see [13] and their simplifications given in Sections 4 and 5 of [16]. However, in these papers, the need to express results in terms of $S = V^{-1/2}$ did not arise. See also [39]. For the case of a sample mean with estimated covariance, it is possible to avoid the use of $S = V^{-1/2}$: see [40].

4. The Distribution of X n = n 1 / 2 ( w ^ − w ) for q = 2

As above, we set y = V − 1 x , Y = V − 1 X ∼ N q ( 0 , V − 1 ) . Taking q = 2 , we switch notation to
μ a b = μ 1 a 2 b = E Y 1 a Y 2 b , so that μ 20 = V 11 , μ 11 = V 12 ,
H a b = H a b ( x ) = H 1 a 2 b = E ( y 1 + I Y 1 ) a ( y 2 + I Y 2 ) b for I = √ − 1 .
Thus, H k 0 and H 0 k are given by (42) with j = 1 and 2: H 10 = y 1 , H 01 = y 2 ,
H 11 = y 1 y 2 − μ 11 , H 21 = y 1 2 y 2 − μ 20 y 2 − 2 μ 11 y 1 , H 12 = y 1 y 2 2 − μ 02 y 1 − 2 μ 11 y 2 ,
and so on. The μ a b and H a b needed here are given in Appendix B. Let us write the cumulant expansion (9) as
κ a b ( w ^ 1 , w ^ 2 ) ∼ ∑ d = a + b − 1 ∞ n − d k a b d for a + b ≥ 1 , where k a b d = k d 1 a 2 b .
Thus, k 100 = w 1 , k 010 = w 2 . The density of X n = n 1 / 2 ( w ^ − w ) , p X n ( x ) , is given by (22) and (25) in terms of p ˜ r ( x ) and p ˜ r k of (25). As P ¯ r 1 ⋯ k is symmetric,
p ˜ r k = ∑ b = 0 k P r ( k − b , b ) H k − b , b , where P r ( a b ) = a + b a P r 1 a 2 b ,
for P ¯ r 1 k of (19)–(21). Therefore, P r ( b a ) is just P r ( a b ) with 1 and 2 reversed.
Set ∑ 2 P r ( a b ) H a b = P r ( a b ) H a b + P r ( b a ) H b a .
Then, p ˜ r ( x ) is given for r = 1 , 2 , 3 by (34)–(36) in terms of
p ˜ r 1 = ∑ 2 P r ( 10 ) H 10 , p ˜ r 3 = ∑ 2 [ P r ( 30 ) H 30 + P r ( 21 ) H 21 ] , r odd , p ˜ 22 = ∑ 2 P 2 ( 20 ) H 20 + P 2 ( 11 ) H 11 , p ˜ 24 = ∑ 2 [ P 2 ( 40 ) H 40 + P 2 ( 31 ) H 31 ] + P 2 ( 22 ) H 22 , p ˜ 26 = ∑ 2 [ P 2 ( 60 ) H 60 + P 2 ( 51 ) H 51 + P 2 ( 42 ) H 42 ] + P 2 ( 33 ) H 33 , p ˜ 35 = ∑ 2 [ P 3 ( 50 ) H 50 + P 3 ( 41 ) H 41 + P 3 ( 32 ) H 32 ] , p ˜ 37 = ∑ 2 [ P 3 ( 70 ) H 70 + P 3 ( 61 ) H 61 + P 3 ( 52 ) H 52 + P 3 ( 43 ) H 43 ] , p ˜ 39 = ∑ 2 [ P 3 ( 90 ) H 90 + P 3 ( 81 ) H 81 + P 3 ( 72 ) H 72 + P 3 ( 63 ) H 63 + P 3 ( 54 ) H 54 ] .
The P r ( a b ) needed in (68) for p ˜ r k , r 3 , are as follows, in terms of k a b d of (67):
For p ˜ 11 , P 1 ( 10 ) = k 101 , ( so P 1 ( 01 ) = k 011 ) . For p ˜ 13 , P 1 ( 30 ) = k 302 / 6 , ( so P 1 ( 03 ) = k 032 / 6 , ) P 1 ( 21 ) = k 212 / 2 . For p ˜ 22 , P 2 ( 20 ) = k 202 / 2 + k 101 2 / 2 , P 2 ( 11 ) = k 112 + k 101 k 011 . For p ˜ 24 , P 2 ( 40 ) = k 403 / 24 + k 101 k 302 / 6 , P 2 ( 31 ) = k 313 / 6 + k 101 k 212 / 2 + k 011 k 302 / 6 , P 2 ( 22 ) = k 223 / 4 + k 011 k 212 / 2 + k 101 k 122 / 2 . For p ˜ 26 , P 2 ( 60 ) = k 302 2 / 72 , P 2 ( 51 ) = k 302 k 212 / 12 , P 2 ( 42 ) = k 302 k 122 / 12 + k 212 2 / 8 , P 2 ( 33 ) = k 302 k 032 / 36 + k 212 k 122 / 4 . For p ˜ 31 , P 3 ( 10 ) = k 102 . For p ˜ 33 , P 3 ( 30 ) = k 303 / 6 + k 202 k 101 / 2 + k 101 3 / 6 , P 3 ( 21 ) = k 213 / 2 + k 101 k 112 + k 011 k 202 / 2 + k 101 2 k 011 / 2 . For p ˜ 35 , P 3 ( 50 ) = k 504 / 120 + k 403 k 101 / 24 + k 302 [ k 202 + k 101 2 ] / 12 , P 3 ( 41 ) / 5 = P 3 1 4 2 = k 414 / 120 + S 1 / 24 + S 2 / 12 + S 3 / 12 , where S 1 = ( 4 k 101 k 313 + k 011 k 403 ) / 5 , S 2 = ( 7 k 202 k 212 + 3 k 112 k 302 ) / 10 , S 3 = ( 3 k 101 k 011 k 302 + 7 k 101 2 k 212 ) / 10 , P 3 ( 32 ) = 10 P 3 11122 = k 324 / 12 + S 1 / 24 + S 2 / 12 + S 3 / 12 , where S 1 = ( 3 k 101 k 223 + 2 k 011 k 313 ) / 5 , S 2 = ( 3 k 202 k 122 + k 022 k 302 + 6 k 112 k 212 ) / 10 , S 3 = ( 3 k 101 2 k 122 + k 011 2 k 302 + 6 k 101 k 011 k 212 ) / 10 . For p ˜ 37 , P 3 ( 70 ) = k 403 k 302 / 144 + k 302 2 k 101 / 72 , P 3 ( 61 ) = k 212 k 403 / 48 + k 302 k 313 / 36 + k 011 k 302 2 / 72 + k 101 k 302 k 212 / 12 , P 3 ( 52 ) = k 403 k 122 / 48 + k 313 k 212 / 12 + k 223 k 302 / 24 + k 302 k 212 k 011 / 12 + k 101 ( 2 k 302 k 122 + 3 k 212 2 ) / 24 , P 3 ( 43 ) = 35 P 3 1 4 2 3 = A / 144 + B / 72 for A = k 403 k 032 + 12 k 313 k 122 + 18 k 223 k 212 + 4 k 133 k 302 , B = 2 k 101 ( k 302 k 032 + 9 k 212 k 122 ) + 3 k 011 ( 2 k 302 k 122 + 3 k 212 2 ) .
For p ˜ 39 , P 3 ( 90 ) = k 302 3 / 6 4 , P 3 ( 81 ) = k 302 2 k 212 / 144 , and P 3 ( 72 ) = k 302 ( k 302 k 122 + 3 k 212 2 ) / 144 , P 3 ( 63 ) = ( 3 k 302 2 k 032 + 16 k 302 k 212 k 122 + 9 k 212 3 ) / 432 , P 3 ( 54 ) = ( 2 k 212 k 302 k 032 + 3 k 122 2 k 302 + 9 k 212 2 k 122 ) / 144 .
Equations (22) and (25) now give the density of X n = n 1 / 2 ( w ^ − w ) to O ( n − 2 ) .
Set H a b * = ( 1 ) a ( 2 ) b Φ V ( x ) = x H a b ϕ V ( x ) d x . Thus , H a b * = H a 1 , b 1 ϕ V ( x ) if a 2 , b 1 , H 10 * = x 2 ϕ V ( x ) d x 2 = 1 Φ V ( x ) , H a 0 * = ( 1 ) a 1 H 10 * if a 2 , H 01 * = x 1 ϕ V ( x ) d x 1 = 2 Φ V ( x ) , H 0 b * = ( 2 ) b 1 H 01 * if b 1 .
The distribution of X n = n 1 / 2 ( w ^ − w ) is given by (22) and (26) in terms of P r k ( x ) . P r k ( x ) is just the p ˜ r k above with H a b replaced by H a b * of (71). That is,
P r k ( x ) = ∑ b = 0 k P r ( k − b , b ) H k − b , b * .
Equations (22) and (26) now give Prob. ( X n ≤ x ) to O ( n − 2 ) for X n = n 1 / 2 ( w ^ − w ) . For example, for x = ( 1 , 1 ) and x = ( 2 , 2 ) , the values of H a b are given in Example A1.
By (45), P ( X n ∈ C ) is given to O ( n − 2 ) by P r ( C ) , r = 1 , 2 , 3 , of (45) and (52). These are given by replacing p ˜ r k , H a b above by P r k ( C ) of (48) and the H a b C = H 1 a 2 b ( C ) of (49).
By Section 3, for Y N 2 ( 0 , V 1 ) , and μ a b = E Y 1 a Y 2 b , m a b = E Y 1 a Y 2 b I ( V Y C ) is given by
H 10 C = m 10 , H 20 C = m 20 − Φ V ( C ) μ 20 , H 11 C = m 11 − Φ V ( C ) μ 11 , H 30 C = m 30 − 3 m 10 μ 20 , H 21 C = m 21 − 2 m 10 μ 11 − m 01 μ 20 , H 40 C = m 40 − 6 m 20 μ 20 + Φ V ( C ) μ 40 , H 31 C = m 31 − 3 m 20 μ 11 − 3 m 11 μ 20 + Φ V ( C ) μ 31 , H 22 C = m 22 − m 20 μ 02 − m 02 μ 20 − 4 m 11 μ 11 + Φ V ( C ) μ 22 , H 60 C = m 60 − 15 m 40 μ 20 + 15 m 20 μ 40 − Φ V ( C ) μ 60 , H 51 C = m 51 − 10 m 31 μ 20 + 5 m 11 μ 40 − 5 m 40 μ 11 + 10 m 20 μ 31 − Φ V ( C ) μ 51 , H 42 C = m 42 − 6 m 22 μ 20 + m 02 μ 40 − 8 m 31 μ 11 + 8 m 11 μ 31 − m 40 μ 02 + 6 m 20 μ 22 − Φ V ( C ) μ 42 , H 33 C = m 33 + 3 ∑ 2 ( m 20 μ 13 − m 13 μ 20 ) − 9 m 22 μ 11 + 9 m 11 μ 22 − Φ V ( C ) μ 33 .
These expressions can be read off those for H a b in Appendix B. If C = − C , then 0 = m a b = H a b C for a + b odd. For Examples 3 and 4, m 11 = μ 11 R 2 , and P r 2 ( C ) of (48) is given by (58) in terms of P ¯ r 12 V ¯ 12 = ∑ 2 P r ( 20 ) μ 20 + 2 P r ( 11 ) μ 11 . For r = 2 , this is given by (70).
Example 6.
Let w ^ be a sample mean of a distribution with cumulants κ a b . By Example 2, for ( r k ) = ( 11 ) , ( 22 ) , ( 31 ) , ( 33 ) ,   P r k = p ˜ r k = 0 , and we need the following Edgeworth coefficients.
For p ˜ 13 and P 13 ( x ) : P 1 ( 30 ) = κ 30 / 3 ! , P 1 ( 21 ) = κ 21 / 2 , P 1 ( 12 ) = κ 12 / 2 . For p ˜ 24 , P 24 ( x ) : P 2 ( 40 ) = κ 40 / 4 ! , P 2 ( 31 ) = κ 31 / 6 , P 2 ( 22 ) = κ 22 / 4 . For p ˜ 26 , P 26 ( x ) : P 2 ( 60 ) = κ 30 2 / 72 , P 2 ( 51 ) = κ 30 κ 21 / 12 , P 2 ( 42 ) = ( 2 κ 30 κ 12 + 3 κ 21 2 ) / 24 , P 2 ( 33 ) = ( κ 30 κ 03 + 9 κ 21 κ 12 ) / 36 . For p ˜ 35 , P 35 ( x ) : P 3 ( 50 ) = κ 50 / 5 ! , P 3 ( 41 ) = κ 41 / 4 ! , P 3 ( 32 ) = κ 32 / 12 . For p ˜ 37 , P 37 ( x ) : P 3 ( 70 ) = κ 40 κ 30 / 144 , P 3 ( 61 ) = κ 21 κ 40 / 48 + κ 30 κ 31 / 36 , P 3 ( 52 ) = κ 40 κ 12 / 48 + κ 31 κ 21 / 12 + κ 22 κ 30 / 24 , P 3 ( 43 ) = A / 144 for A = κ 40 κ 03 + 12 κ 31 κ 12 + 18 κ 22 κ 21 + 4 κ 13 κ 30 .
For p ˜ 39 , P 39 ( x ) : P 3 ( 90 ) = κ 30 3 / 6 4 , P 3 ( 81 ) = κ 30 2 κ 21 / 144 , P 3 ( 72 ) = κ 30 ( κ 30 κ 12 + 3 κ 21 2 ) / 144 , P 3 ( 63 ) = ( 3 κ 30 2 κ 03 + 16 κ 30 κ 21 κ 12 + 9 κ 21 3 ) / 432 , P 3 ( 54 ) = ( 2 κ 21 κ 30 κ 03 + 3 κ 12 2 κ 30 + 9 κ 21 2 κ 12 ) / 144 .
For a < b , P r ( a b ) is P r ( b a ) with superscripts one and two reversed. The P 2 ( a b ) needed for P 2 ( C ) of (45) and (52) are those given above for p ˜ 24 , p ˜ 26 .
For a specific case of a bivariate sample mean, consider the following.
Example 7.
An  entangled gamma  model. Let G 0 , G 1 , G 2 be independent gamma random variables with means γ = γ 0 , γ 1 , γ 2 . For i = 1 , 2 , set X i = G 0 + G i , w i = E X i = γ 0 + γ i , and let w ^ be the mean of a random sample of size n distributed as X = ( X 1 , X 2 ) . Thus, E w ^ = w , and n w ^ = L ( G n 0 + G n 1 , G n 0 + G n 2 ) , where G n 0 , G n 1 , G n 2 are independent gamma random variables with means n γ 0 , n γ 1 , n γ 2 . The rth order cumulants of X are κ i r = ( r 1 ) ! w i , and otherwise, ( r 1 ) ! γ 0 . For example, κ 20 = κ 11 = w 1 , κ 02 = κ 22 = w 2 , κ 11 = κ 12 = γ 0 , κ 30 = 2 w 1 , κ 03 = 2 w 2 , κ 21 = κ 112 = κ 12 = κ 122 = 2 ! γ 0 , and
V = w 1 γ 0 γ 0 w 2 , that is, V has diagonal ( w 1 , w 2 ) and off-diagonal γ 0 , while V − 1 has diagonal ( w 2 , w 1 ) and off-diagonal − γ 0 , each divided by D = det V = w 1 w 2 − γ 0 2 .
Thus, y = V − 1 x = ( w 2 x 1 − γ 0 x 2 , − γ 0 x 1 + w 1 x 2 ) / D . Set ν i = w i / γ 0 . Then, V has correlation ( ν 1 ν 2 ) − 1 / 2 . This ranges from 0 at γ 0 = 0 to 1 at γ 0 = ∞ . Thus, w ^ 1 and w ^ 2 are positively entangled. (For a negatively entangled example, replace G n 0 + G n 2 by − G n 0 + G n 2 : the correlation is then − ( ν 1 ν 2 ) − 1 / 2 . An extension to R q is n w ^ i = λ i G n 0 + G n i , i = 1 , ⋯ , q . ) For c = 0 , 1 , ⋯ , set
∑ 2 w 1 c H a b = w 1 c H a b + w 2 c H b a .
By Example 6, for r 3 , { p ˜ r k } are given by the equations following (69) in terms of
P 1 ( 10 ) = 0 , P 1 ( 30 ) = w 1 / 3 , ( so P 1 ( 01 ) = 0 , P 1 ( 03 ) = w 2 / 3 , ) P 1 ( 21 ) = γ 0 , P 2 ( 20 ) = P 2 ( 11 ) = 0 , P 2 ( 40 ) = w 1 / 4 , P 2 ( 31 ) = γ 0 , P 2 ( 22 ) = 3 γ 0 / 2 , P 2 ( 60 ) = w 1 2 / 18 , P 2 ( 51 ) = w 1 γ 0 / 3 , P 2 ( 42 ) = w 1 γ 0 / 3 + γ 0 2 / 2 , P 2 ( 33 ) = w 1 w 2 / 9 + γ 0 2 , P 3 ( 10 ) = P 3 ( 30 ) = P 3 ( 21 ) = 0 , P 3 ( 50 ) = w 1 / 5 , P 3 ( 41 ) = γ 0 , P 3 ( 32 ) = 2 γ 0 , P 3 ( 70 ) = w 1 2 / 12 , P 3 ( 61 ) = 7 γ 0 w 1 / 12 , P 3 ( 52 ) = ( 3 γ 0 w 1 + 4 γ 0 2 ) / 4 , P 3 ( 43 ) = ( w 1 w 2 + 4 γ 0 w 1 + 30 γ 0 2 ) / 12 , P 3 ( 90 ) = w 1 3 / 162 , P 3 ( 81 ) = w 1 2 γ 0 / 18 , P 3 ( 72 ) = γ 0 w 1 ( w 1 + 3 γ 0 ) / 18 , P 3 ( 63 ) = ( 3 w 1 2 w 2 + 16 w 1 γ 0 2 + 9 γ 0 3 ) / 54 , P 3 ( 54 ) = ( 2 γ 0 w 1 w 2 + 3 γ 0 2 w 1 + 9 γ 0 3 ) / 18 .
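As a check on the model itself, the correlation ( ν 1 ν 2 ) − 1 / 2 of the entangled gamma model is easy to verify by Monte Carlo; a small sketch (the sample size and seed are arbitrary choices of mine):

```python
import math
import random

def sample_entangled_gamma(gam0, gam1, gam2, n, seed=0):
    """Draw n pairs X = (G0 + G1, G0 + G2) from independent unit-scale
    gamma variables with shapes gam0, gam1, gam2."""
    rng = random.Random(seed)
    out = []
    for _ in range(n):
        g0 = rng.gammavariate(gam0, 1.0)
        out.append((g0 + rng.gammavariate(gam1, 1.0),
                    g0 + rng.gammavariate(gam2, 1.0)))
    return out

def corr(pairs):
    """Sample correlation of a list of (x, y) pairs."""
    n = len(pairs)
    mx = sum(x for x, _ in pairs) / n
    my = sum(y for _, y in pairs) / n
    vx = sum((x - mx) ** 2 for x, _ in pairs) / n
    vy = sum((y - my) ** 2 for _, y in pairs) / n
    cxy = sum((x - mx) * (y - my) for x, y in pairs) / n
    return cxy / math.sqrt(vx * vy)
```

For example, with shapes ( γ 0 , γ 1 , γ 2 ) = ( 2 , 1 , 3 ) the sample correlation should be close to 1 / √ ( ν 1 ν 2 ) = 1 / √ 3.75 .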
Now, consider the  entangled exponential  case γ i 1 . Thus, w i 2 , V and V 1 are given by (62),
P 1 ( 30 ) = 2 / 3 , P 1 ( 21 ) = 1 , P 2 ( 40 ) = 1 / 2 , P 2 ( 31 ) = 1 , P 2 ( 22 ) = 3 / 2 , P 2 ( 60 ) = 2 / 9 , P 2 ( 51 ) = 2 / 3 , P 2 ( 42 ) = 7 / 6 , P 2 ( 33 ) = 13 / 9 , P 3 ( 50 ) = 2 / 5 , P 3 ( 41 ) = 1 , P 3 ( 32 ) = 2 , P 3 ( 70 ) = 1 / 3 , P 3 ( 61 ) = 7 / 6 , P 3 ( 52 ) = 5 / 2 , P 3 ( 43 ) = 7 / 2 , P 3 ( 90 ) = 4 / 81 , P 3 ( 81 ) = 2 / 9 , P 3 ( 72 ) = 5 / 9 , P 3 ( 63 ) = 65 / 54 , P 3 ( 54 ) = 23 / 18 .
For x = ( 1 , 1 ) and x = ( 2 , 2 ) , the values of H a b are given in Example A1, and by symmetry, 2 can be replaced by 2 in (69) and the other equations for p ˜ r k .
Thus, if x = ( 1 , 1 ) , then y = ( 1 , 1 ) / 3 and p ˜ 1 ( x ) = − 62 / 81 ≈ − 0.7654 , so that our measure of the inaccuracy of the CLT is | n − 1 / 2 p ˜ 1 ( x ) | = 31 / 81 ≈ 0.3827 for n = 4 and 62 / 3 5 ≈ 0.2551 for n = 9 .
For x = ( 1 , 1 ) : p ˜ 24 = 7 / 18 ≈ 0.3889 , p ˜ 26 = − 697 / 3 8 ≈ − 0.1062 , p ˜ 2 ( x ) = 3709 / ( 2 ⋅ 3 8 ) ≈ 0.2827 , p ˜ 35 = 1594 / ( 5 ⋅ 3 5 ) ≈ 1.3119 , p ˜ 37 = 16330 / 3 7 ≈ 7.4669 , p ˜ 39 = − 3508524 / 3 12 ≈ − 6.6019 , p ˜ 3 ( x ) = 5784408 / ( 5 ⋅ 3 12 ) ≈ 2.1769 , so p X n ( x ) / ϕ V ( x ) ≈ 1 − 0.7654 n − 1 / 2 + 0.2827 n − 1 + 2.1769 n − 3 / 2 + O ( n − 2 ) ≈ 1 − 0.1914 + 0.0177 + 0.0340 if n = 16 , so that only three terms can be used, and ≈ 1 − 0.0957 + 0.0044 + 0.0043 if n = 64 .
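The truncated series is cheap to tabulate in n; the following sketch uses the exact fractions for p ˜ 1 , p ˜ 2 , p ˜ 3 at x = ( 1 , 1 ) quoted above (with the signs of the displayed expansion) and reproduces the n = 16 and n = 64 terms.

```python
from fractions import Fraction

# Exact Edgeworth coefficients at x = (1, 1) for the entangled
# exponential model, as quoted in the text
p1 = Fraction(-62, 81)
p2 = Fraction(3709, 2 * 3 ** 8)
p3 = Fraction(5784408, 5 * 3 ** 12)

def ratio_terms(n):
    """Successive correction terms of p_{X_n}(x) / phi_V(x) to O(n^-2)."""
    return [float(p1) / n ** 0.5, float(p2) / n, float(p3) / n ** 1.5]
```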
If x = ( 2 , 2 ) , then y = ( 2 , 2 ) / 3 and p ˜ 1 ( x ) = − 64 / 81 ≈ − 0.7901 , and our measure of the inaccuracy of the CLT is | n − 1 / 2 p ˜ 1 ( x ) | = 32 / 81 ≈ 0.395 for n = 4 , and 16 / 81 ≈ 0.197 for n = 16 .
For x = ( 2 , 2 ) : p ˜ 24 = 7 / 9 ≈ 0.7778 , p ˜ 26 = 9380 / 3 8 ≈ 1.4251 , p ˜ 2 ( x ) = 4247 / 3 8 ≈ 0.6473 , p ˜ 35 = 608 / ( 5 ⋅ 3 7 ) ≈ 0.0556 , p ˜ 37 = 22096 / 3 8 ≈ 3.3678 , p ˜ 39 = − 12331328 / 3 13 ≈ − 7.7345 , p ˜ 3 ( x ) ≈ − 4.3111 , so p X n ( x ) / ϕ V ( x ) ≈ 1 − 0.7901 n − 1 / 2 + 0.6473 n − 1 − 4.3111 n − 3 / 2 + O ( n − 2 ) ≈ 1 − 0.1975 + 0.0405 − 0.0674 if n = 16 .
Example 8.
As in Example 1, suppose that the distribution of w ^ is symmetric about w. By (40),
2 p ˜ 22 = k 2 11 H 20 + 2 k 2 12 H 11 + k 2 22 H 02 , 24 p ˜ 24 = k 3 1111 H 40 + 4 k 3 1112 H 31 + 6 k 3 1122 H 22 + 4 k 3 1222 H 13 + k 3 2222 H 04 ,
and P 2 k ( x ) needed for (41) is p ˜ 2 k with H a b replaced by H a b * of (71). Now, suppose that w ^ = W ^ 1 W ^ 2 , where W ^ 1 and W ^ 2 are independent copies of a random vector W ^ . Then, the cumulants of w ^ of odd order are zero, and the cumulants of even order are twice those of W ^ .
Consider the case W ^ = w ^ , the bivariate entangled gamma of Example 7. Thus, w = ( 0 , 0 ) , the odd cumulants of w ^ are zero, and the odd p ˜ r ( x ) , P r ( x ) are zero. Denote V , x , y , X , Y of Example 4 as V 0 , x 0 , y 0 , X 0 , Y 0 . Then,
V = 2 V 0 , X = 2 1 / 2 X 0 , Y = 2 1 / 2 Y 0 , H a b = 2 ( a + b ) / 2 H a b ( x 0 , V 0 ) where x = 2 1 / 2 x 0 , y = 2 1 / 2 y 0 .
By Example 2, p ˜ 22 = 0 , P 2 ( 40 ) = k 3 1 4 / 24 , P 2 ( 31 ) = κ 31 / 6 , P 2 ( 22 ) = κ 22 / 4 , where κ 40 = 12 ( γ 1 + γ 3 ) , κ 31 = κ 22 = 12 γ 3 .
So, p ˜ 2 ( x ) = ∑ 2 [ ( γ 1 + γ 3 ) H 40 + 4 γ 3 H 31 ] + 6 γ 3 H 22 .
Now, consider the exponential case γ i 1 . Thus,
p ˜ 2 ( x ) = p ˜ 24 = ∑ 2 [ H 40 + 2 H 31 ] + 2 H 22 = 7 / 9 ≈ 0.778 if x = ( 1 , 1 ) or 14 / 9 ≈ 1.556 if x = ( 2 , 2 ) .
Thus, for n = 16 , n − 1 p ˜ 2 ( x ) ≈ 0.049 if x = ( 1 , 1 ) , or 0.194 if x = ( 2 , 2 ) . Here, I used the values of H a b given in the example of [41].

5. Conclusions

Most estimates of interest are standard estimates, including functions of sample moments, like the sample correlation, and any smooth multivariate function of Fisher's k-statistics; see [5] for these. In Section 2, I gave the density and distribution of X n = n 1 / 2 ( w ^ − w ) to O ( n − 2 ) , for w ^ any standard estimate, in terms of functions of the cumulant coefficients k ¯ j 1 ⋯ r of (9), the coefficients P ¯ r 1 ⋯ k of (18)–(21). Section 4 gave explicit detail when q = 2 using the alternative notation P r ( a b ) . Section 3 gave as examples Edgeworth expansions for the probability of X n lying in an ellipsoidal or hyperrectangular set.

6. Discussion

A good approximation for the distribution of an estimate is vital for accurate inference. It enables one to explore the distribution's dependence on underlying parameters. Equation (22) gives expansions in powers of n − 1 / 2 for the distribution and density of a multivariate standard estimate, in terms of the Edgeworth coefficients P ¯ r 1 ⋯ k of (18). As noted at the end of Sections 1 and 5 of [35], the Cornish–Fisher expansions (4) are valid under the usual conditions for validity of the Edgeworth expansion (2), and their Section 5 appears to show that this is also true for the multivariate case. Appendix C shows for the first time how to put the extended Edgeworth expansions of (22) on a rigorous basis. Examples of standard estimates are moment estimates and maximum likelihood estimates. The underlying cumulant coefficients will be functions of the parameters of the model on which w ^ is based. For w ^ a k-statistic or a polynomial in k-statistics, the cumulant coefficients are polynomials in the cumulants of the sample.
The analytic results given here avoid the need for simulation, jack-knife or bootstrap methods, while providing greater accuracy. Ref. [42] used the Edgeworth expansion to show that the bootstrap gives accuracy to O ( n − 1 ) . Ref. [43] said that “2nd order correctness usually cannot be bettered”, but these analytic results show that it can. Simulation, while popular, can at best shine a light on behaviour when there are only a small number of parameters.
Estimates based on a sample of independent but not identically distributed random vectors are also generally standard estimates. For example, for a univariate sample mean w ¯ = n − 1 ∑ j = 1 n X j n , where X j n has rth cumulant κ r j n , κ r ( w ¯ ) = n 1 − r κ r , where κ r = n − 1 ∑ j = 1 n κ r j n is the average rth cumulant. For some examples, see [2,44,45,46]. The latter is for a function of a weighted mean of complex random matrices. It gives an application in electrical engineering of channel capacity for multiple arrays.
Estimates based on a stationary sample can also be standard estimates. See [47].
For conditions for the validity of multivariate Edgeworth expansions, see [30] and its references.
Refs. [3,8] did not deal with the question of when these expansions diverge. I showed how to confront this in the numerical examples of Example 7 and [4].
Lastly, I discuss numerical computation. I used [41] for the numerical calculations in Example 7. One could also download R-4.4.1 for Windows and search for the routines needed. In the R package mvtnorm, dmvnorm computes the density of a multivariate normal with given mean and covariance matrix, qmvnorm gives its quantiles, and rmvnorm generates multivariate normal variables. Googling “bivariate hermite polynomials r” gives https://cran.r-project.org/web/packages/calculus/vignettes/hermite.html (accessed on 25 June 2025); one can then use install.packages(“calculus”). However, I have not used this route.
Some future directions.
  • Ref. [13] showed how to generalise the expansions of Cornish and Fisher about N ( 0 , 1 ) to expansions about an arbitrary continuous distribution. Their results are cumbersome as they involve partition theory. In [16], I overcame this using Bell polynomials. It would be straightforward to apply these to expansions about χ q 2 in Example 3 to obtain the percentiles of n ( w ^ w ) V 1 ( w ^ w ) and n ( w ^ w ) V ^ 1 ( w ^ w ) . However, in the latter case, we first need to derive the cumulant coefficients of V ^ 1 / 2 ( w ^ w ) from those of w ^ . This can be done by applying [1].
  • It would be very useful to obtain the multivariate g r ( X ) of (44) explicitly.
  • The multivariate expansions considered here have been about the multivariate normal. However, as noted at the end of Section 1, expansions about other distributions can greatly reduce the number of terms in each P r ( x ) , p ˜ r ( x ) and P r ( C ) by matching bias and/or skewness. While this was derived for q = 1 in [10,14,15] for expansions about Student's t distribution, the F-distribution and the gamma distribution, to date, this has yet to be derived for multivariate expansions about, for example, a multivariate gamma distribution.
  • The results given here are the first step for constructing confidence intervals and confidence regions. To do this one estimates the cumulant coefficients. See [32,33,48] for the case q = 1 .
  • The results here can be extended to tilted (saddle-point) expansions by applying the results of [2]. These are very useful where convergence fails, that is, where the CLT cannot be improved upon, typically due to w ^ being in a tail. The tilted version of the multivariate distribution and density of a standard estimate are given by Corollaries 3 and 4 there. Tilting was first used in statistics by [27]. He gave an approximation to the density of a sample mean. See also [49]. Ref. [7] gave a univariate extension to S N where S N was the sum of N independent and identically distributed observations, and N was Poisson. The extension of the present results from w ^ n = w ^ to w ^ N would be useful for both univariate and multivariate observations. For a review of references on tilting, see [2].
  • Ref. [41] wrote a Python program to obtain both analytic and numerical values of multivariate normal moments and multivariate Hermite polynomials when q = 2 . It would be useful to have these extended to q = 3 and 4. (The alternative notation for μ ¯ 1 ⋯ k and H ¯ 1 ⋯ k when q = 3 or 4 is straightforward.)
  • The end of Appendix C suggests a way of giving more theorems for Edgeworth expansions.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the author.

Conflicts of Interest

Author Christopher Stroude Withers was employed by the company Industrial Research Ltd. before retiring. The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Appendix A. The Edgeworth Coefficients P ¯ r 1 k Needed for (18)

Here, I give for the first time the symmetric form of the coefficients P ¯ r 1 k needed for (18) for r 3 , that is, for the Edgeworth expansions (22) to O ( n 2 ) , using the symmetrising operator S . They are given for r = 1 by (19), and for r = 2 , 3 by (20) and (21) and the following.
P ¯ 2 1 ⋯ 4 needs S k ¯ 1 1 k ¯ 2 234 , where S a 1 b 234 = ( a 1 b 234 + a 2 b 341 + a 3 b 412 + a 4 b 123 ) / 4 . P ¯ 2 1 ⋯ 6 needs S k ¯ 2 123 k ¯ 2 456 , where S a 123 a 456 = S 123.456 = ( a 123 a 456 + a 124 a 356 + a 125 a 346 + a 126 a 345 + a 134 a 256 + a 135 a 246 + a 136 a 245 + a 145 a 236 + a 146 a 235 + a 156 a 234 ) / 10 . P ¯ 3 1 ⋯ 3 needs S k ¯ 2 12 k ¯ 1 3 , where S a 12 b 3 = ( a 12 b 3 + a 13 b 2 + a 23 b 1 ) / 3 . P ¯ 3 1 ⋯ 5 needs S 1 = S k ¯ 3 1 ⋯ 4 k ¯ 1 5 , S 2 = S k ¯ 2 12 k ¯ 2 345 , S 3 = S k ¯ 2 123 k ¯ 1 4 k ¯ 1 5 , where S a 1 b 2 ⋯ 5 = ( a 1 b 2 ⋯ 5 + ⋯ + a 5 b 1 ⋯ 4 ) / 5 , S a 12 b 345 = ( 12.345 + 13.245 + 14.235 + 15.234 + 23.145 + 24.135 + 25.134 + 34.125 + 35.124 + 45.123 ) / 10 for 12.345 = a 12 b 345 , and for 1.2 . = 1.2 . 345 = a 1 a 2 b 345 , S a 1 a 2 b 345 = S a 12 b 345 at a 12 = a 1 a 2 , P ¯ 3 1 ⋯ 7 needs A = S k ¯ 2 123 k ¯ 3 4 ⋯ 7 , and B = S k ¯ 2 123 k ¯ 2 456 k ¯ 1 7 , where for 123 . = a 123 b 4 ⋯ 7 , A = S a 123 b 4 ⋯ 7 = ( 123 . + 124 . + ⋯ + 567 . ) / 7 3 , B = S a 123 a 456 b 7 = S 123.456 . 7 = ( b 7 S 123.456 + ⋯ + b 1 S 234.567 ) / 7 , where S 123.456 = S a 1 ⋯ 3 a 4 ⋯ 6 of ( A 1 ) , P ¯ 3 1 ⋯ 9 = S k ¯ 2 1 ⋯ 3 k ¯ 2 4 ⋯ 6 k ¯ 2 7 ⋯ 9 / 6 4 , where for 123.456 . 789 = a 1 ⋯ 3 a 4 ⋯ 6 a 7 ⋯ 9 and 123 = 123 . [ S 456.789 of ( A 1 ) ] , S a 1 ⋯ 3 a 4 ⋯ 6 a 7 ⋯ 9 = [ 123 + 124 + 125 + 126 + 127 + 128 + 129 + 134 + 135 + 136 + 137 + 138 + 139 + 145 + 146 + 147 + 148 + 149 + 156 + 157 + 158 + 159 + 167 + 168 + 169 + 178 + 179 + 189 ] / 28 .

Appendix B. μab and Hab of (66) for a + b ≤ 9

Here, we give the bivariate normal moments μ a b of (65) and the bivariate Hermite polynomials H a b of (66) for a + b ≤ 9 . These are needed for the Edgeworth expansions to O ( n − 2 ) . The first nine univariate Hermite polynomials are
H 0 = 1 , H 1 = u , H 2 = u 2 1 , H 3 = u 3 3 u , H 4 = u 4 6 u 2 + 3 , H 5 = u 5 10 u 3 + 15 u , H 6 = u 6 15 u 4 + 45 u 2 15 , H 7 = u 7 21 u 5 + 105 u 3 105 u , H 8 = u 8 28 u 6 + 210 u 4 420 u 2 + 105 , H 9 = u 9 36 u 7 + 378 u 5 1260 u 3 + 945 u .
These are needed for h r ( u ) , r = 1 , 2 , 3 of (2), (3), that is, for the univariate Edgeworth expansions to O ( n − 2 ) . See [5].
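The listed polynomials satisfy the recursion H k + 1 ( u ) = u H k ( u ) − k H k − 1 ( u ) , which gives a quick way to generate and check them; a short sketch (coefficient lists are in ascending powers of u ):

```python
def hermite_coeffs(k):
    """Coefficients (ascending powers of u) of the k-th Hermite polynomial,
    generated via the recursion H_{k+1} = u H_k - k H_{k-1}."""
    prev, cur = [1], [0, 1]          # H_0, H_1
    if k == 0:
        return prev
    for m in range(1, k):
        nxt = [0] + cur              # u * H_m
        for i, c in enumerate(prev): # minus m * H_{m-1}
            nxt[i] -= m * c
        prev, cur = cur, nxt
    return cur
```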
Let X ∼ N q ( 0 , V ) be a q-variate normal random variable with mean 0 ∈ R q , positive-definite covariance V , with density and distribution ϕ V ( x ) , Φ V ( x ) of (11). Set V a b = E Y a Y b , as in (12), so that V − 1 = ( V a b ) . Y has odd moments of zero and even moments
μ 1 ⋯ 2 k = E Y 1 ⋯ Y 2 k = ∑ 2 k − 1 V 12 μ 3 ⋯ 2 k = 1.3 ⋯ ( 2 k − 1 ) ∑ V 12 ⋯ V 2 k − 1 , 2 k .
For example ,   μ 1 4 = 3 V 12 V 34 = V 12 V 34 + V 13 V 24 + V 14 V 32 ,
μ 1 6 = 5 V 12 μ 3 6 = 15 V 12 V 34 V 56 = V 12 V 34 V 56 + + V 16 V 25 V 34 ,
where r m sums over the m permutations of 1 , 2 , , r giving distinct terms. For example,
3 3 V 12 y 3 = V 12 y 3 + V 13 y 2 + V 23 y 1 .
Ref. [41] wrote a Python program to obtain the bivariate moments using (A3). Let H 1 ⋯ k = H ¯ 1 ⋯ k ( x , V ) be the multivariate Hermite polynomial defined by (27) and (28). For k ≤ 6 , H 1 ⋯ k is given by (12)–(33). In these expressions, 1 , 2 , ⋯ , k can be replaced by any integer in 1 , 2 , ⋯ , q . For example,
H 11 = y 1 2 − V 11 , H 112 = y 1 2 y 2 − 2 V 12 y 1 − V 11 y 2 .
Now, consider the bivariate case, q = 2 , and denote the moments of Y by
μ a b = E Y 1 a Y 2 b .
Two special cases are
μ 2 k , 0 = 1.3 ( 2 k 1 ) μ 20 k , k 0 , where μ 20 = E Y 1 2 = V 11 , μ 2 k 1 , 1 = ( 2 k 1 ) μ 11 μ 2 k 2 , 0 = 1.3 ( 2 k 1 ) μ 20 k 1 μ 11 , k 1 .
The μ a b needed here are those up to order a + b = 8 :
μ 20 = E Y 1 2 = V 11 , μ 11 = E Y 1 Y 2 = V 12 , μ 40 = 3 μ 20 2 , μ 31 = 3 μ 20 μ 11 , μ 22 = μ 20 μ 02 + 2 μ 11 2 , μ 60 = 15 μ 20 3 , μ 51 = 15 μ 20 2 μ 11 , μ 42 = μ 40 μ 02 + 4 μ 11 μ 31 = 3 μ 20 ( μ 20 μ 02 + 4 μ 11 2 ) , μ 33 = 2 μ 02 μ 31 + 3 μ 11 μ 22 = 3 μ 11 ( 3 μ 20 μ 02 + 2 μ 11 2 ) , μ 80 = 105 μ 20 4 , μ 71 = 105 μ 20 3 μ 11 , μ 62 = μ 02 μ 60 + 6 μ 11 μ 51 = 15 μ 20 2 ( μ 20 μ 02 + 6 μ 11 2 ) , μ 53 = 2 μ 02 μ 51 + 5 μ 11 μ 42 = 15 μ 20 μ 11 ( 3 μ 20 μ 02 + 4 μ 11 2 ) , μ 44 = 3 μ 02 μ 42 + 4 μ 11 μ 33 = 3 ( 3 c 2 + 24 c d + 8 d 2 ) for c = μ 20 μ 02 , d = μ 11 2 .
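This table can be generated mechanically from Stein's identity E Y 1 f ( Y ) = μ 20 E ∂ 1 f ( Y ) + μ 11 E ∂ 2 f ( Y ) , which yields the recursion μ a b = ( a − 1 ) μ 20 μ a − 2 , b + b μ 11 μ a − 1 , b − 1 . A sketch in exact rational arithmetic (the recursion is standard; the code is mine):

```python
from fractions import Fraction

def make_mu(mu20, mu11, mu02):
    """Return mu(a, b) = E Y1^a Y2^b for the bivariate normal via Stein's
    recursion mu_ab = (a-1) mu20 mu_{a-2,b} + b mu11 mu_{a-1,b-1}."""
    def mu(a, b):
        if a < 0 or b < 0:
            return 0
        if a == 0 and b == 0:
            return 1
        if a >= 1:
            return (a - 1) * mu20 * mu(a - 2, b) + b * mu11 * mu(a - 1, b - 1)
        return (b - 1) * mu02 * mu(a, b - 2)  # recurse on the second index
    return mu
```

For example, mu(4, 2) reproduces the closed form 3 μ 20 ( μ 20 μ 02 + 4 μ 11 2 ) above.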
For example, to derive μ 53 from μ 1 8 of (A3), replace k 5 by 1 and k = 6 , 7 , 8 by 2. μ b a can be read off μ a b using symmetry. For example, μ 02 = V 22 . Formulas for the general  μ a b were given by [50,51]. By (66),
H a b = ∑ j = 0 a a j y 1 a − j ∑ k = 0 b b k y 2 b − k I j + k μ j k .
This is the formula used in [41] to obtain bivariate Hermite polynomials. H a b is said to be of order a + b . Most of those of order up to a + b = 9 are needed to expand the density of n 1 / 2 ( w ^ − w ) to O ( n − 2 ) , but not those of order 8 (nor those of order 6 if the distribution of w ^ is symmetric about w ). But we include them here for completeness. We can write H k 0 in terms of μ a 0 of (A6) and y = V − 1 x :
H 10 = y 1 , H 20 = y 1 2 − μ 20 , H 30 = y 1 3 − 3 y 1 μ 20 , H 40 = y 1 4 − 6 y 1 2 μ 20 + μ 40 , H 50 = y 1 5 − 10 y 1 3 μ 20 + 5 y 1 μ 40 , H 60 = y 1 6 − 15 y 1 4 μ 20 + 15 y 1 2 μ 40 − μ 60 , H 70 = y 1 7 − 21 y 1 5 μ 20 + 35 y 1 3 μ 40 − 7 y 1 μ 60 , H 80 = y 1 8 − 28 y 1 6 μ 20 + 70 y 1 4 μ 40 − 28 y 1 2 μ 60 + μ 80 , H 90 = y 1 9 − 36 y 1 7 μ 20 + 126 y 1 5 μ 40 − 84 y 1 3 μ 60 + 9 y 1 μ 80 .
These are actually simpler formulas than their univariate forms because of the use of μ a b . The other H a b up to order nine are
H 11 = y 1 y 2 − μ 11 , H 21 = y 2 H 20 − 2 y 1 μ 11 , H 31 = y 2 H 30 − 3 y 1 2 μ 11 + μ 31 = y 1 3 y 2 − 3 y 1 y 2 μ 20 − 3 y 1 2 μ 11 + μ 31 , H 22 = y 2 2 H 20 − y 1 2 μ 02 − 4 y 1 y 2 μ 11 + μ 22 , H 41 = y 2 H 40 − 4 y 1 3 μ 11 + 4 y 1 μ 31 = y 1 4 y 2 − 4 y 1 3 μ 11 − 6 y 1 2 y 2 μ 20 + 4 y 1 μ 31 + y 2 μ 40 , H 32 = y 2 2 H 30 − y 1 3 μ 02 − 6 y 1 2 y 2 μ 11 + 3 y 1 μ 22 + 2 y 2 μ 31 , H 51 = y 2 H 50 − 5 y 1 4 μ 11 + 10 y 1 2 μ 31 − μ 51 , H 42 = y 2 2 H 40 + 8 y 1 y 2 ( − y 1 2 μ 11 + μ 31 ) − y 1 4 μ 02 + 6 y 1 2 μ 22 − μ 42 , H 33 = y 2 3 H 30 + 3 y 2 2 ( − 3 y 1 2 μ 11 + μ 31 ) + 3 y 1 y 2 ( − y 1 2 μ 02 + 3 μ 22 ) + 3 y 1 2 μ 13 − μ 33 = y 1 3 y 2 3 − 9 y 1 2 y 2 2 μ 11 + 3 ∑ 2 ( y 1 2 μ 13 − y 1 3 y 2 μ 02 ) + 9 y 1 y 2 μ 22 − μ 33 , H 61 = y 2 H 60 − 6 y 1 5 μ 11 + 20 y 1 3 μ 31 − 6 y 1 μ 51 , H 52 = y 2 2 H 50 + 2 y 2 ( − 5 y 1 4 μ 11 + 10 y 1 2 μ 31 − μ 51 ) − y 1 5 μ 02 + 10 y 1 3 μ 22 − 5 y 1 μ 42 , H 43 = H 40 y 2 3 + 12 y 2 2 ( − y 1 3 μ 11 + y 1 μ 31 ) − 3 y 2 ( y 1 4 μ 02 − 6 y 1 2 μ 22 + μ 42 ) + 4 y 1 3 μ 13 − 4 y 1 μ 33 , H 71 = y 2 H 70 − 7 y 1 6 μ 11 + 35 y 1 4 μ 31 − 21 y 1 2 μ 51 + μ 71 , H 62 = y 2 2 H 60 + 2 y 2 ( − 6 y 1 5 μ 11 + 20 y 1 3 μ 31 − 6 y 1 μ 51 ) − y 1 6 μ 02 + 15 y 1 4 μ 22 − 15 y 1 2 μ 42 + μ 62 , H 53 = y 2 3 H 50 + 3 y 2 2 ( − 5 y 1 4 μ 11 + 10 y 1 2 μ 31 − μ 51 ) + 3 y 2 ( − y 1 5 μ 02 + 10 y 1 3 μ 22 − 5 y 1 μ 42 ) + 5 y 1 4 μ 13 − 10 y 1 2 μ 33 + μ 53 , H 44 = y 2 4 H 40 + 16 y 2 3 ( − y 1 3 μ 11 + y 1 μ 31 ) + 6 y 2 2 ( − y 1 4 μ 02 + 6 y 1 2 μ 22 − μ 42 ) + 16 y 2 ( y 1 3 μ 13 − y 1 μ 33 ) + y 1 4 μ 04 − 6 y 1 2 μ 24 + μ 44 , H 81 = y 2 H 80 − 8 y 1 7 μ 11 + 56 y 1 5 μ 31 − 56 y 1 3 μ 51 + 8 y 1 μ 71 , H 72 = y 2 2 H 70 + 2 y 2 ( − 7 y 1 6 μ 11 + 35 y 1 4 μ 31 − 21 y 1 2 μ 51 + μ 71 ) − y 1 7 μ 02 + 21 y 1 5 μ 22 − 35 y 1 3 μ 42 + 7 y 1 μ 62 , H 63 = y 2 3 H 60 + 3 y 2 2 ( − 6 y 1 5 μ 11 + 20 y 1 3 μ 31 − 6 y 1 μ 51 ) + 3 y 2 ( − y 1 6 μ 02 + 15 y 1 4 μ 22 − 15 y 1 2 μ 42 + μ 62 ) + 6 y 1 5 μ 13 − 20 y 1 3 μ 33 + 6 y 1 μ 53 , H 54 = y 2 4 H 50 + 4 y 2 3 ( − 5 y 1 4 μ 11 + 10 y 1 2 μ 31 − μ 51 ) + 6 y 2 2 ( − y 1 5 μ 02 + 10 y 1 3 μ 22 − 5 y 1 μ 42 ) + 4 y 2 ( 5 y 1 4 μ 13 − 10 y 1 2 μ 33 + μ 53 ) + y 1 5 μ 04 − 10 y 1 3 μ 24 + 5 y 1 μ 44 .
The method of the proof is illustrated as follows for I = √ − 1 :
H a 1 = E ( y 1 + I Y 1 ) a ( y 2 + I Y 2 ) = y 2 E ( y 1 + I Y 1 ) a + E ( y 1 + I Y 1 ) a I Y 2 .
Now, expand the second term to get
H a 1 = y 2 H a 0 + [ ( − 1 ) k a 2 k − 1 y 1 a − 2 k + 1 μ 2 k − 1 , 1 : 1 ≤ k ≤ ( a + 1 ) / 2 ] .
Similarly, for H a 2 , expand
H a 2 = E ( y 1 + I Y 1 ) a ( y 2 2 + 2 I y 2 Y 2 − Y 2 2 ) .
The above formulas for H a b can be called their short form as each uses H a 0 . The explicit form when H a 0 of (A8) is substituted can be called its long form. My short forms for H a b were checked against the long forms given by [41]. Here is a selection of his results for comparison.
H 51 = − 15 μ 11 μ 20 2 + 30 μ 11 μ 20 y 1 2 − 5 μ 11 y 1 4 + 15 μ 20 2 y 1 y 2 − 10 μ 20 y 1 3 y 2 + y 1 5 y 2 H 42 = − 3 μ 02 μ 20 2 + 6 μ 02 μ 20 y 1 2 − μ 02 y 1 4 − 12 μ 11 2 μ 20 + 12 μ 11 2 y 1 2 + 24 μ 11 μ 20 y 1 y 2 − 8 μ 11 y 1 3 y 2 + 3 μ 20 2 y 2 2 − 6 μ 20 y 1 2 y 2 2 + y 1 4 y 2 2 H 33 = − 9 μ 02 μ 11 μ 20 + 9 μ 02 μ 11 y 1 2 + 9 μ 02 μ 20 y 1 y 2 − 3 μ 02 y 1 3 y 2 − 6 μ 11 3 + 18 μ 11 2 y 1 y 2 + 9 μ 11 μ 20 y 2 2 − 9 μ 11 y 1 2 y 2 2 − 3 μ 20 y 1 y 2 3 + y 1 3 y 2 3 H 70 = − 105 μ 20 3 y 1 + 105 μ 20 2 y 1 3 − 21 μ 20 y 1 5 + y 1 7 H 61 = − 90 μ 11 μ 20 2 y 1 + 60 μ 11 μ 20 y 1 3 − 6 μ 11 y 1 5 − 15 μ 20 3 y 2 + 45 μ 20 2 y 1 2 y 2 − 15 μ 20 y 1 4 y 2 + y 1 6 y 2 H 52 = − 15 μ 02 μ 20 2 y 1 + 10 μ 02 μ 20 y 1 3 − μ 02 y 1 5 − 60 μ 11 2 μ 20 y 1 + 20 μ 11 2 y 1 3 − 30 μ 11 μ 20 2 y 2 + 60 μ 11 μ 20 y 1 2 y 2 − 10 μ 11 y 1 4 y 2 + 15 μ 20 2 y 1 y 2 2 − 10 μ 20 y 1 3 y 2 2 + y 1 5 y 2 2 H 43 = − 36 μ 02 μ 11 μ 20 y 1 + 12 μ 02 μ 11 y 1 3 − 9 μ 02 μ 20 2 y 2 + 18 μ 02 μ 20 y 1 2 y 2 − 3 μ 02 y 1 4 y 2 − 24 μ 11 3 y 1 − 36 μ 11 2 μ 20 y 2 + 36 μ 11 2 y 1 2 y 2 + 36 μ 11 μ 20 y 1 y 2 2 − 12 μ 11 y 1 3 y 2 2 + 3 μ 20 2 y 2 3 − 6 μ 20 y 1 2 y 2 3 + y 1 4 y 2 3 H 80 = 105 μ 20 4 − 420 μ 20 3 y 1 2 + 210 μ 20 2 y 1 4 − 28 μ 20 y 1 6 + y 1 8 H 54 = 45 μ 02 2 μ 20 2 y 1 − 30 μ 02 2 μ 20 y 1 3 + 3 μ 02 2 y 1 5 + 360 μ 02 μ 11 2 μ 20 y 1 − 120 μ 02 μ 11 2 y 1 3 + 180 μ 02 μ 11 μ 20 2 y 2 − 360 μ 02 μ 11 μ 20 y 1 2 y 2 + 60 μ 02 μ 11 y 1 4 y 2 − 90 μ 02 μ 20 2 y 1 y 2 2 + 60 μ 02 μ 20 y 1 3 y 2 2 − 6 μ 02 y 1 5 y 2 2 + 120 μ 11 4 y 1 + 240 μ 11 3 μ 20 y 2 − 240 μ 11 3 y 1 2 y 2 − 360 μ 11 2 μ 20 y 1 y 2 2 + 120 μ 11 2 y 1 3 y 2 2 − 60 μ 11 μ 20 2 y 2 3 + 120 μ 11 μ 20 y 1 2 y 2 3 − 20 μ 11 y 1 4 y 2 3 + 15 μ 20 2 y 1 y 2 4 − 10 μ 20 y 1 3 y 2 4 + y 1 5 y 2 4
For example the short and long forms for H 33 have 5 and 10 terms, and those for H 43 have 8 and 13 terms.
Example A1.
Take V , V 1 of (62). Then,
μ 20 = 2 / 3 , μ 11 = − 1 / 3 , μ 40 = 4 / 3 , μ 31 = − 2 / 3 , μ 22 = 2 / 3 , μ 60 = 40 / 9 , μ 51 = − 20 / 9 , μ 42 = 16 / 9 , μ 33 = − 14 / 9 , μ 80 = 560 / 27 , μ 71 = − 280 / 27 , μ 62 = 200 / 27 , μ 53 = − 160 / 27 , μ 44 = 152 / 27 .
Thus, if x = ( 1 , 1 ) , then y = ( 1 , 1 ) / 3 ,
H 10 = 1 / 3 0.3333 , H 20 = 5 / 9 0.5556 , H 11 = 4 / 9 0.4444 , H 30 = 17 / 27 0.6296 , H 21 = 1 / 27 0.0370 , H 40 = 73 / 81 0.9012 , H 31 = 62 / 81 0.7654 , H 22 = 55 / 81 0.6790 , H 60 = 1709 / 3 6 2.344 , H 51 = 1576 / 3 6 2.1619 , H 42 = 1361 / 3 6 1.8669 , H 33 = 1216 / 3 6 1.6680 , H 50 = 481 / 3 5 1.9794 , H 41 = 131 / 3 5 0.5391 , H 32 = 49 / 3 5 0.2016 , H 70 = 19 , 025 / 3 7 8.6991 , H 61 = 6949 / 3 7 3.1774 , H 52 = 3275 / 3 7 1.4975 , H 43 = 847 / 3 7 0.3873 , H 90 = 965953 / 3 9 49.0755 , H 81 = 403847 / 3 9 20.5176 H 72 = 205165 / 3 9 10.4235 , H 63 = 96 , 767 / 3 9 4.91627 , H 54 = 29773 / 3 9 1.5126 .
If x = ( 2 , 2 ) , then y = ( 2 , 2 ) / 3 ,
H 10 = 2 / 3 0.6667 , H 20 = 2 / 9 0.2222 , H 11 = 7 / 9 0.7778 , H 30 = 28 / 27 1.0370 , H 21 = 8 / 27 0.2963 , H 40 = 20 / 81 0.2469 ,
H 31 = 74 / 81 0.9136 , H 22 = 70 / 81 0.8642 , H 60 = 1864 / 3 6 2.5569 , H 51 = 964 / 3 6 1.3224 , H 42 = 1520 / 729 2.0850 , H 33 = 1702 / 3 6 2.3347 , H 50 = 632 / 3 5 2.6008 , H 41 = 376 / 3 5 1.5473 , H 32 = 92 / 3 5 0.3786 , H 70 = 19024 / 3 7 8.6987 , H 61 = 15104 / 3 7 6.9063 , H 52 = 7504 / 3 7 3.4312 , H 43 = 2576 / 3 7 1.1779 , H 90 = 680480 / 3 9 34.5720 , H 81 = 689248 / 3 9 35.0174 , H 72 = 433520 / 3 9 22.0251 , H 63 = 243568 / 3 9 12.3745 , H 54 = 74960 / 3 9 3.8084 .
These results were computed by [41], using (A7), and are used in Example 7. Note how for these examples, when the elements of x and V are integers, H a b is an integer divided by ( det V ) a + b .
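Values such as those of Example A1 can be reproduced directly from the definition H a b = E ( y 1 + I Y 1 ) a ( y 2 + I Y 2 ) b by binomial expansion; a self-contained sketch in exact arithmetic (only terms with j + k even survive, since odd moments vanish):

```python
from fractions import Fraction
from math import comb

def make_mu(mu20, mu11, mu02):
    """E Y1^a Y2^b for the bivariate normal via Stein's recursion."""
    def mu(a, b):
        if a < 0 or b < 0:
            return 0
        if a == 0 and b == 0:
            return 1
        if a >= 1:
            return (a - 1) * mu20 * mu(a - 2, b) + b * mu11 * mu(a - 1, b - 1)
        return (b - 1) * mu02 * mu(a, b - 2)
    return mu

def hermite2(a, b, y1, y2, mu):
    """H_ab = sum_{j,k} C(a,j) C(b,k) y1^{a-j} y2^{b-k} I^{j+k} mu_{jk},
    I = sqrt(-1); for even j + k, I^{j+k} = (-1)^{(j+k)/2}."""
    total = 0
    for j in range(a + 1):
        for k in range(b + 1):
            if (j + k) % 2:
                continue  # mu_{jk} = 0 for odd j + k
            sign = -1 if (j + k) % 4 == 2 else 1
            total += (comb(a, j) * comb(b, k)
                      * y1 ** (a - j) * y2 ** (b - k) * sign * mu(j, k))
    return total
```

With μ 20 = μ 02 = 2 / 3 , μ 11 = − 1 / 3 and y = ( 1 , 1 ) / 3 , this gives, for example, H 20 = − 5 / 9 , H 11 = 4 / 9 and H 31 = − 62 / 81 .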

Appendix C. Regularity Conditions for the Edgeworth Expansions of (22)

Here, I build on (20.53) of [29]. By their (20.50), this holds for all q (their k), not just for q = 1. Unlike theirs, my version is explicit.
Theorem A1.
Let Z_1, …, Z_n be independent identically distributed (i.i.d.) random vectors in R^q with mean zero, covariance V, and distribution function Q(z). Set I = √-1. Assume that for some integer s ≥ 3, E|Z_1|^s < ∞, and that Cramér's condition holds:
lim sup_{|t|→∞} |Q̂(t)| < 1, where for t ∈ R^q, Q̂(t) = E e^{I t′Z_1} = ∫ e^{I t′z} dQ(z).
Then,
sup_{x ∈ R^q} (1 + |x|^s) |Prob(n^{1/2}Z̄ ≤ x) − S_{n,s−2}(x, κ)| = o(n^{−(s−2)/2}) as n → ∞, where S_{n,s−2}(x, κ) = Σ_{r=0}^{s−2} n^{−r/2} P_r(x, κ),
and P_r(x, κ) is P_{r0}(x) of (38) with P̄_{r0}^{1⋯k} of Example 2 and κ̄^{1⋯k} = κ̄^{1⋯k}(Z_1), the cumulants of Q(z).
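For q = 1 the content of this expansion is easy to see numerically. The sketch below is a hedged illustration, not the paper's notation: it takes Z_i = X_i − 1 with X_i ~ Exp(1), so the third cumulant is κ_3 = 2 and the exact law of the sum is Gamma(n, 1), whose distribution function has the closed Poisson form 1 − Σ_{k<n} e^{−t}t^k/k!. The one-term expansion S_{n,1}(x) = Φ(x) − n^{−1/2}φ(x)κ_3(x^2 − 1)/6 already removes most of the normal-approximation error.

```python
import math

def phi(x):  # standard normal density
    return math.exp(-x * x / 2) / math.sqrt(2 * math.pi)

def Phi(x):  # standard normal distribution function
    return 0.5 * (1 + math.erf(x / math.sqrt(2)))

def exact_cdf(x, n):
    """P(n^{1/2} * Zbar <= x) for Z_i = X_i - 1, X_i ~ Exp(1):
    the sum of n Exp(1) variables is Gamma(n, 1), whose distribution
    function is 1 - sum_{k<n} e^{-t} t^k / k! at t = n + sqrt(n) x."""
    t = n + math.sqrt(n) * x
    term, total = math.exp(-t), 0.0
    for k in range(n):
        total += term
        term *= t / (k + 1)
    return 1 - total

def edgeworth_cdf(x, n, kappa3=2.0):
    """One-term expansion S_{n,1}(x) = Phi(x) - n^{-1/2} phi(x) kappa3 (x^2 - 1)/6."""
    return Phi(x) - phi(x) * kappa3 * (x * x - 1) / (6 * math.sqrt(n))

n, x = 20, 0.0
print(abs(Phi(x) - exact_cdf(x, n)))               # normal-approximation error, about 3e-2
print(abs(edgeworth_cdf(x, n) - exact_cdf(x, n)))  # Edgeworth error, much smaller
```

At x = 0 the second-order term also vanishes (the odd Hermite polynomials are zero there), so the one-term correction is especially accurate at this point.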
Corollary A1.
Let Z_{n1}, …, Z_{nn} be i.i.d. random vectors in R^q with sample mean Z̄_n, distribution function Q_n(z) with mean 0, covariance V_n whose minimum eigenvalue is bounded away from 0, and Fourier transform Q̂_n(t) satisfying
lim sup_{|t|→∞} lim sup_{n→∞} |Q̂_n(t)| < 1.
Suppose also that for some integer s ≥ 3, lim sup_{n→∞} E|Z_{n1}|^s < ∞. Then,
sup_{x ∈ R^q} (1 + |x|^s) |Prob(n^{1/2}Z̄_n ≤ x) − S_{n,s−2}(x, κ_n)| = o(n^{−(s−2)/2})
for S_{n,s−2}(x, κ) of Theorem A1 with {κ̄^{1⋯k}} replaced by {κ̄_n^{1⋯k}}, the cumulants of Z_{n1} up to order s.
If all cumulants are finite, then we can represent ŵ − Eŵ as Z̄_n by requiring that their cumulants be the same, that is, by choosing
κ̄_n^{1⋯k} = n^{r−1} κ̄^{1⋯k}(ŵ − Eŵ).
However, if we only require cumulants up to order s to match, then we can apply the corollary to obtain
Corollary A2.
Let Z_{n1}, …, Z_{nn} be i.i.d. random vectors in R^q chosen so that (A12) holds for cumulants up to order s. Suppose that the regularity conditions of Corollary A1 hold. Then, for X_n = n^{1/2}(ŵ − Eŵ),
sup_{x ∈ R^q} (1 + |x|^s) |Prob(X_n ≤ x) − S_{n,s−2}(x, κ_n)| = o(n^{−(s−2)/2})
for S_{n,s−2}(x, κ) of Corollary A1 with κ̄_n^{1⋯k} = κ̄^{1⋯k}(ŵ), where
for s = 3, assume that κ̄^{12}(ŵ) = n^{−1}k̄_1^{12} + o(n^{−1}) and κ̄^{1⋯3}(ŵ) = n^{−2}k̄_2^{1⋯3} + o(n^{−2});
for s = 4, assume that κ̄^{12}(ŵ) = n^{−1}k̄_1^{12} + n^{−2}k̄_2^{12} + o(n^{−2}) and κ̄^{1⋯4}(ŵ) = n^{−3}k̄_3^{1⋯4} + o(n^{−3});
for s = 5, assume that κ̄^{1⋯3}(ŵ) = n^{−2}k̄_2^{1⋯3} + n^{−3}k̄_3^{1⋯3} + o(n^{−3}) and κ̄^{1⋯5}(ŵ) = n^{−4}k̄_4^{1⋯5} + o(n^{−4}).
Also, for s = 3, 4, 5,
sup_{x ∈ R^q} |Prob(X_n ≤ x) − S_{n,s−2}(x, k_{ns})| = o(n^{−(s−2)/2}),
where k̄_{ns}^{1⋯k} is κ̄_n^{1⋯k} truncated by dropping the o(·) terms.
The reason (A16) drops the |x|^s factor is that for k ≥ 1,
sup_x |H̄_*^{1⋯k}| < ∞ but sup_x |x|^s |H̄_*^{1⋯k}| = ∞.
We skip further details of the proof.
Expanding Eŵ about w, since P̄_1^1 = k̄_1^1 and P̄_3^1 = k̄_2^1, we obtain
Corollary A3.
Suppose that the conditions of Corollary A2 hold, and that
for s = 3, Eŵ = w + n^{−1}k̄_1^1 + o(n^{−1}), and also for s = 5, Eŵ = w + n^{−1}k̄_1^1 + n^{−2}k̄_2^1 + o(n^{−2}). Then, for X_n = n^{1/2}(ŵ − w) and S_{n,s−2}(x, k) = Σ_{r=0}^{s−2} n^{−r/2} P_r(x),
sup_{x ∈ R^q} |Prob(X_n ≤ x) − S_{n,s−2}(x, k)| = o(n^{−(s−2)/2})
for P r ( x ) of (22) and (23).
One could also treat the density expansion of (22) similarly by building on (19.17) of [29], and for corrections for a lattice sample mean, one could build on their (23.3).

Appendix D. Some Corrigenda to the References

For typos and corrections for [2,4,15,16,19,32], see pp. 23–25 of [1].
Typos and corrections for [33]:
p. 4231, seventh and eighth lines from the bottom: replace “ n j ” by “ n i ”.
p. 4233, line 13: replace “ σ ^ q 2 1 ( w ^ , x ) ” by “ σ ^ 1 q 2 1 ( w ^ , x ) ”.
p. 4233, at the end of the seventh to last line, replace “+n^{−3/2}q_3(ŵ, x).” by “+n^{−3/2}q_3(ŵ, x)t(w).”
p. 4236, second to last line: “1982a” should read “1983b”; in the last line, “Example 3.3” should read “Example 3.4”.
p. 4241, in the last equation, replace (a_i^2σ_i^2/n_i)^{1/2} by (a_i^2σ_i^2/n_i)^{3/2}.
p. 4250: in the first equation in Example 3.5, “ 2 w ” should read “ 2 w ) ”.
p. 4253: in the first equation, “σ_1” should read “b_{22}^{1/2}”; in the fourth equation, q_1(w, x) = −σx, not σx; in the seventh to last line, (2.1) should be (4.1); in the sixth to last line, “{Z_n ≤ x}, which Section 4” should read “{n^{1/2}Z_n ≤ x}, which by Section 4”.
p. 4254, line 1: replace + 2 I 2 21 01 by + 2 I 2 12 10 .
Typos in [14]: p. 217, line 7: replace “stem” by “step”. p. 220: replace the first two words “That is,” by “Suppose now that”. p. 220: after “replace” in line 6, insert “Y_n by Y_n,”. p. 226: replace lines 5–7, “Suppose that … This is”, as follows: “Suppose that for ν in N^p and |ν| = Σ_{j=1}^p ν_j, ℓ_ν = n^{a(|ν|)}λ_ν satisfies ℓ_ν = O(1), where a(r) = r/2 − I(r ≥ 3), as n → ∞. This is”. p. 226: replace κ_r on the LHS of the fourth displayed equation by k_r. p. 226: replace k_r on the RHS of the sixth displayed equation by K_r. p. 227: replace r in (7.5) and the following equation by |ν|. p. 227: replace “variance” in (7.6) by “covariance”.
Typos and corrections to [10]: p. 4369, lines 8–11 should read:
He_1(r) = r, so He_{j_1}(y, V) = z_{j_1} and H̃e_{j_1}(y, V) = y_{j_1};
He_2(r) = r^2 − 1, so He_{j_1j_2}(y, V) = z_{j_1}z_{j_2} − V_{j_1j_2} and H̃e_{j_1j_2}(y, V) = y_{j_1}y_{j_2} − V_{j_1j_2};
He_3(r) = r^3 − 3r, so He_{j_1j_2j_3}(y, V) = z_{j_1}z_{j_2}z_{j_3} − 3z_{j_1}V_{j_2j_3} and H̃e_{j_1j_2j_3}(y, V) = y_{j_1}y_{j_2}y_{j_3} − 3y_{j_1}V_{j_2j_3};
He_4(r) = r^4 − 6r^2 + 3, so He_{j_1⋯j_4}(y) = z_{j_1}⋯z_{j_4} − 6z_{j_1}z_{j_2}V_{j_3j_4} + 3V_{j_1j_2}V_{j_3j_4} and H̃e_{j_1⋯j_4}(y) = y_{j_1}⋯y_{j_4} − 6y_{j_1}y_{j_2}V_{j_3j_4} + 3V_{j_1j_2}V_{j_3j_4},
where H̃e_{j_1⋯j_r}(y, V) = He_{j_1⋯j_r}(V^{−1}y, V^{−1}) is the dual Hermite polynomial; see [9].
p. 4370: in Theorem 2.2, d(j_1, …, j_{2i}) = E Ȳ_{j_1}⋯Ȳ_{j_{2i}}, where Ȳ ~ N_p(0, V^{−1}).
p. 4379: Replace the third line after (A.3) by K(t, Y_n) = Σ_{i=0}^∞ n^{−i} Σ_{r=1}^{i+1} n^{r/2}k_{r,i}(t)/r! = Σ_{k=1}^∞ ℓ_k(t)n^{−k/2}/k!. In the second equation after (A.3), replace Σ_{r=1} by Σ_{r=1}^{i+2}. For example, ℓ_1 = k_{11}(t) + k_{32}(t)/3!, ℓ_2/2! = k_{22}(t)/2! + k_{43}(t)/4!, ℓ_3/3! = k_{12}(t) + k_{33}(t)/3! + k_{54}(t)/5!. The next equation should read Q_n(t) = exp{Σ_{k=1}^∞ ℓ_k(t)n^{−k/2}/k! − t′Vt/2} = Σ_{r=0}^∞ b_r(t)n^{−r/2} exp{−t′Vt/2}/r!, where B_r(ℓ) is the complete exponential Bell polynomial. Thus, for r ≥ 1, B_r(ℓ) = Σ_{k=1}^r B_{rk}(ℓ), where B_{rk}(ℓ) is the partial exponential Bell polynomial tabled on pp. 307–308 of [31] for 1 ≤ r ≤ 12.
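For reference, the complete exponential Bell polynomials B_r above satisfy B_0 = 1 and the recurrence B_{r+1}(x_1, …, x_{r+1}) = Σ_{k=0}^r C(r, k) B_{r−k} x_{k+1}, which gives a short alternative to the tables of [31]. A minimal sketch (the function name is mine):

```python
from math import comb

def complete_bell(xs):
    """Complete exponential Bell polynomials B_0..B_n evaluated at
    x_1..x_n, via B_{r+1} = sum_{k=0}^{r} C(r, k) B_{r-k} x_{k+1}."""
    n = len(xs)
    B = [1]  # B_0 = 1
    for r in range(n):
        B.append(sum(comb(r, k) * B[r - k] * xs[k] for k in range(r + 1)))
    return B

# With every x_i = 1, B_n is the n-th Bell number: 1, 1, 2, 5, 15, 52.
print(complete_bell([1, 1, 1, 1, 1]))  # [1, 1, 2, 5, 15, 52]
```

For example, B_3(x_1, x_2, x_3) = x_1^3 + 3x_1x_2 + x_3, which at x = (1, 1, 1) gives the Bell number 5.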

References

  1. Withers, C.S. 5th-Order multivariate Edgeworth expansions for parametric estimates. Mathematics 2024, 12, 905. [Google Scholar] [CrossRef]
  2. Withers, C.S.; Nadarajah, S. Tilted Edgeworth expansions for asymptotically normal vectors. Ann. Inst. Stat. Math. 2010, 62, 1113–1142. [Google Scholar] [CrossRef]
  3. Cornish, E.A.; Fisher, R.A. Moments and Cumulants in the Specification of Distributions. Rev. l’Inst. Int. Statist. 1937, 5, 307–322. [Google Scholar]
  4. Withers, C.S. Asymptotic expansions for distributions and quantiles with power series cumulants. J. R. Statist. Soc. B 1984, 46, 389–396, Erratum in J. R. Statist. Soc. B 1986, 48, 256. [Google Scholar]
  5. Stuart, A.; Ord, K. Kendall’s Advanced Theory of Statistics, 2, 5th ed.; Griffin: London, UK, 1991. [Google Scholar]
  6. Kolassa, J.E. Series Approximation Methods in Statistics; Springer: Berlin/Heidelberg, Germany, 1997. [Google Scholar]
  7. Jensen, J.L. Uniform saddlepoint approximations. Adv. Appl. Prob. 1988, 20, 622–634. [Google Scholar]
  8. Fisher, R.A.; Cornish, E.A. The percentile points of distributions having known cumulants. Technometrics 1960, 2, 209–225. [Google Scholar] [CrossRef]
  9. Withers, C.S. A simple expression for the multivariate Hermite polynomials. Stat. Prob. Lett. 2000, 47, 165–169. [Google Scholar]
  10. Withers, C.S.; Nadarajah, S.N. Improved confidence regions based on Edgeworth expansions. Comput. Stat. Data Anal. 2012, 56, 4366–4380. [Google Scholar] [CrossRef]
  11. Edgeworth, F.Y. The law of error. Proc. Camb. Philos. Soc. 1905, 20, 36–65. [Google Scholar]
  12. Withers, C.S.; Nadarajah, S.N. Charlier and Edgeworth expansions via Bell polynomials. Probab. Math. Stat. 2009, 29, 271–280. [Google Scholar]
  13. Hill, G.W.; Davis, A.W. Generalised asymptotic expansions of Cornish-Fisher type. Ann. Math. Statist. 1968, 39, 1264–1273. [Google Scholar]
  14. Withers, C.S.; Nadarajah, S. Generalized Cornish-Fisher expansions. Bull. Braz. Math. Soc. New Ser. 2011, 42, 213–242. [Google Scholar] [CrossRef]
  15. Withers, C.S.; Nadarajah, S. Expansions about the gamma for the distribution and quantiles of a standard estimate. Methodol. Comput. Appl. Prob. 2014, 16, 693–713. [Google Scholar] [CrossRef][Green Version]
  16. Withers, C.S.; Nadarajah, S. Edgeworth-Cornish-Fisher-Hill-Davis expansions for normal and non-normal limits via Bell polynomials. Stochastics Int. J. Probab. Stoch. Processes 2015, 87, 794–805. [Google Scholar] [CrossRef]
  17. Simonato, J.G. The performance of Johnson distributions for computing value at risk and expected shortfall. J. Deriv. 2011, 19, 7–24. [Google Scholar] [CrossRef]
  18. Zhang, L.; Mykland, P.A.; Aït-Sahalia, Y. Edgeworth expansions for realized volatility and related estimators. J. Econom. 2011, 160, 190–203. [Google Scholar]
  19. Withers, C.S.; Nadarajah, S.N. Moment generating functions for Rayleigh Random Variables. Wirel. Pers. Commun. 2008, 46, 463–468. [Google Scholar]
  20. Song, T.; Wang, S.; An, W. GPS positioning accuracy estimation using Cornish-Fisher expansions. In Proceedings of the 2009 WRI International Conference on Communications and Mobile Computing, Kunming, China, 6–8 January 2009; pp. 152–155. [Google Scholar]
  21. Abdel-Wahed, A.R.; Winterbottom, A. Approximating posterior distributions of system reliability. Statistician 1983, 32, 224–228. [Google Scholar]
  22. Winterbottom, A. Asymptotic expansions to improve large sample confidence intervals for system reliability. Biometrika 1980, 67, 351–357. [Google Scholar] [CrossRef]
  23. Winterbottom, A. The interval estimation of system reliability component test data. Oper. Res. 1984, 32, 628–640. [Google Scholar] [CrossRef]
  24. Sellentin, E.; Jaffe, A.H.; Heavens, A.F. On the use of the Edgeworth expansion in cosmology I: How to foresee and evade its pitfalls. arXiv 2017, arXiv:1709.03452. [Google Scholar] [CrossRef]
  25. Perninge, M. Stochastic Optimal Power Flow by Multivariate Edgeworth Expansions; Electric Power Systems Research; Elsevier: Amsterdam, The Netherlands, 2014. [Google Scholar]
  26. Withers, C.S.; Nadarajah, S.N. Cornish-Fisher expansions about the F-distribution. Appl. Math. Comput. 2012, 218, 7947–7957. [Google Scholar] [CrossRef]
  27. Daniels, H.E. Saddlepoint approximations in statistics. Ann. Math. Statist. 1954, 25, 631–650. [Google Scholar]
  28. Cramer, H. Mathematical Methods of Statistics; Princeton University Press: Princeton, NJ, USA, 1946. [Google Scholar]
  29. Bhattacharya, R.N.; Ranga Rao, R. Normal Approximation and Asymptotic Expansions; Wiley: New York, NY, USA, 1976. [Google Scholar]
  30. Skovgaard, I.M. On multivariate Edgeworth expansions. Int. Statist. Rev. 1986, 54, 169–186. [Google Scholar]
  31. Comtet, L. Advanced Combinatorics; Reidel: Dordrecht, The Netherlands, 1974. [Google Scholar]
  32. Withers, C.S. Nonparametric confidence intervals for functions of several distributions. Ann. Inst. Statist. Math. 1988, 40, 727–746. [Google Scholar]
  33. Withers, C.S. Accurate confidence intervals when nuisance parameters are present. Comm. Statist.-Theory Methods 1989, 18, 4229–4259. [Google Scholar]
  34. Takeuchi, K. A multivariate generalization of Cornish-Fisher expansion and its applications. Keizaigaku Ronshu 1978, 44, 1–12. (In Japanese) [Google Scholar]
  35. Takemura, A.; Takeuchi, K. Some results on univariate and multivariate Cornish-Fisher expansions: Algebraic properties and validity. Sankhya Ser. A 1988, 50, 111–136. [Google Scholar]
  36. Anderson, T.W. An Introduction to Multivariate Statistical Analysis; John Wiley: New York, NY, USA, 1958. [Google Scholar]
  37. Gradshteyn, I.S.; Ryzhik, I.M. Tables of Integrals, Series and Products, 6th ed.; Academic Press: New York, NY, USA, 2000; (The Generalized Hypergeometric Function Is Defined in Section 9.14.). [Google Scholar]
  38. Abramowitz, M.; Stegun, I.A. Handbook of Mathematical Functions; Applied Mathematics Series 55; U.S. Department of Commerce; National Bureau of Standards: Gaithersburg, MD, USA, 1964. [Google Scholar]
  39. Bentkus, V.; Götze, F.; van Zwet, W.R. An Edgeworth expansion for symmetric statistics. Ann. Stat. 1997, 25, 851–896. [Google Scholar] [CrossRef]
  40. Xu, J.; Gupta, A.K. Improved confidence regions for a mean vector under general conditions. Comput. Stat. Data Anal. 2006, 51, 1051–1062. [Google Scholar]
  41. Teal, P. A code to calculate bivariate Hermite polynomials. Its input is V11,V12,V22 and y1,y2, not V11,V12,V22 and x1,x2. 2024. Available online: https://github.com/paultnz/bihermite/blob/main/bihermite.py (accessed on 26 June 2025).
  42. Hall, P. The Bootstrap and Edgeworth Expansion; Springer: New York, NY, USA, 1992. [Google Scholar]
  43. Hall, P. Rejoinder: Theoretical Comparison of Bootstrap Confidence Intervals. Ann. Stat. 1988, 16, 981–985. [Google Scholar] [CrossRef]
  44. Skovgaard, I.M. Edgeworth expansions of the distributions of maximum likelihood estimators in the general (non i.i.d.) case. Scand. J. Statist. 1981, 8, 227–236. [Google Scholar]
  45. Skovgaard, I.M. Transformation of an Edgeworth expansion by a sequence of smooth functions. Scand. J. Statist. 1981, 8, 207–217. [Google Scholar]
  46. Withers, C.S.; Nadarajah, S. The distribution and percentiles of channel capacity for multiple arrays. Sadhana SADH Indian Acad. Sci. 2020, 45, 155. [Google Scholar] [CrossRef]
  47. Withers, C.S. Edgeworth-Cornish-Fisher expansion for the mean when sampling from a stationary process. Axioms 2025, 14, 406. [Google Scholar] [CrossRef]
  48. Withers, C.S. Expansions for the distribution and quantiles of a regular functional of the empirical distribution with applications to nonparametric confidence intervals. Annals Statist. 1983, 11, 577–587. [Google Scholar]
  49. Field, C.A.; Hampel, F.R. Small sample asymptotic distributions of M-estimators of location. Biometrika 1982, 69, 29–46. [Google Scholar]
  50. Isserlis, L. On a formula for the product-moment coefficient of any order of a normal frequency distribution in any number of variables. Biometrika 1918, 12, 134–139. [Google Scholar] [PubMed]
  51. Withers, C.S. New methods for multivariate normal moments. Stats 2025, 8, 46. [Google Scholar] [CrossRef]
Figure 1. x = X n when Q n = 2 ln α for 1 α = 0.5 , the inner ellipse, 0.9 , 0.99 , the outer ellipse, courtesy of Dr Paul Teal.