Next Article in Journal
Dynamics of Predator-Prey Model with Fear Factor and Impulsive Nonlinear Effects
Next Article in Special Issue
Optimizing a Bayesian Method for Estimating the Hurst Exponent in Behavioral Sciences
Previous Article in Journal
Dynamical Systems with Fractional Derivatives: Focus on Phase Portraits and Plasma Wave Propagation Using Lakshmanan–Porsezian–Daniel Model
Previous Article in Special Issue
Measuring Semantic Stability: Statistical Estimation of Semantic Projections via Word Embeddings
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Distribution and Quantiles of the Sample Mean from a Stationary Process

by
Christopher S. Withers
Callaghan Innovation (Formerly Industrial Research Ltd.), 101 Allington Road, Wellington 6012, New Zealand
Axioms 2025, 14(6), 406; https://doi.org/10.3390/axioms14060406
Submission received: 7 April 2025 / Revised: 16 May 2025 / Accepted: 23 May 2025 / Published: 27 May 2025
(This article belongs to the Special Issue New Perspectives in Mathematical Statistics)

Abstract

:
Edgeworth–Cornish–Fisher expansions are hugely important, as they give the distribution, density and quantiles of any standard estimate. Here we show that the sample mean of a univariate or multivariate stationary process is a standard estimate, so that all the known results for standard estimates can be applied. We also show how to allow for missing data and weighted means.

1. Introduction and Summary

Finding the distribution, density and quantiles of an estimate is of great importance. This has been accomplished for standard estimates by extending the expansions of Edgeworth, Cornish, and Fisher. In this section we summarize these results for a univariate standard estimate.
Section 2 gives our main result: we show that the mean of a sample from a univariate stationary process satisfies a special truncated form of the cumulant expansion (1) below, so that all the results of this section can be applied. It also considers the case where observations are not sequential, as for the case of missing data. And it considers unbiased weighted sample means.
In Section 4, we extend this to the mean of a sample from a multivariate stationary process, after summarising the multivariate Edgeworth expansions for a standard estimate in Section 3.
We now summarise the known results for the distribution and quantiles of a univariate standard estimate. Let w ^ be an estimate of an unknown w R based on a sample of size n. References [1,2] gave expansions in n 1 / 2 for its distribution and quantiles when its cumulants satisfied an artifical assumption removed by [3]. We call w ^ a standard estimate with respect to n, if E w ^ w as n , and its rth cumulant can be expanded as
κ r ( w ^ ) i = r 1 a r i n i , r 1 ,
where ≈ indicates an asymptotic expansion that need not converge. For example, in [4], this holds if w ^ is a smooth function of the mean of independent identically distributed (i.i.d.) random variables. The cumulant coefficients of w ^ , a r i , may depend on n but must be bounded as n , and a 21 must be bounded away from 0 as n . For w ^ non-lattice, the distribution, density and quantiles of
Y n = ( n / a 21 ) 1 / 2 ( w ^ w )
have asymptotic expansions in powers of n 1 / 2 :
P n ( x ) = P r o b . ( Y n x ) Φ ( x ) ϕ ( x ) r = 1 h 0 r ( x ) n r / 2 ,
p n ( x ) = d P n ( x ) / d x ϕ ( x ) [ 1 + r = 1 h 1 r ( x ) n r / 2 ] ,
Φ 1 ( P n ( x ) ) x r = 1 f r ( x ) n r / 2 , P n 1 ( Φ ( x ) ) x + r = 1 g r ( x ) n r / 2 ,
where Φ and ϕ are the distribution and density of a unit normal random variable N N ( 0 , 1 ) , and h k r ( x ) , f r ( x ) , g r ( x ) are polynomials of degrees k + 3 r 1 , r + 1 , r + 1 both in x and in the standardized cumulant coefficients
A r i = a r i / a 21 r / 2 .
In practice, one truncates these expansions for the distribution, density and quantiles of w ^ . The leading h k r ( x ) , f r ( x ) , g r ( x ) given below suffice to see if divergence has begun. For k 0 , the kth Hermite polynomial is
H k = H k ( x ) = ϕ ( x ) 1 ( d / d x ) k ϕ ( x ) = E ( x + i N ) k where i = 1 : H 0 = 1 , H 1 = x , H 2 = x 2 1 , H 3 = x 3 3 x , H 4 = x 4 6 x 2 + 3 , H 5 = x 5 10 x 3 + 15 x , H 6 = x 6 15 x 4 + 45 x 2 15 ,
and so on. See [5] for (6). For the expansions to O ( n 5 / 2 ) of (2) and (4), see [3]. h k r ( x ) of (2), (3) has the form
h k r ( x ) = [ P r j H k + j 1 : 1 j 3 r , r j   even ]
where P r j are polynomials in { A r j } of (5), given for r 4 in the appendix of [6], or easily derived from [3]. The terms needed in expansions (2)–(4) to O ( n 3 / 2 ) are
h 01 ( x ) = f 1 ( x ) = g 1 ( x ) = A 11 + A 32 H 2 / 6 , h 11 ( x ) = A 11 H 1 + A 32 H 3 / 6 , h k 2 ( x ) = ( A 11 2 + A 22 ) H 1 + k / 2 + ( A 43 + 4 A 11 A 32 ) H 3 + k / 24 + A 32 2 H 5 + k / 72 ,
f 2 ( x ) = ( A 22 / 2 A 11 A 32 / 3 ) H 1 + A 43 H 3 / 24 A 32 2 ( 4 x 3 7 x ) / 36 , g 2 ( x ) = A 22 H 1 / 2 + A 43 H 3 / 24 A 32 2 ( 2 x 3 5 x ) / 36 .
By [7], the log density has a simpler form than the density:
ln [ p n ( x ) / ϕ ( x ) ] = r = 1 b r ( x ) n r / 2 where b 1 ( x ) = h 11 ( x ) , b 2 ( x ) = A 11 2 / 2 + ( A 22 A 32 A 11 ) H 2 / 2 + A 43 H 4 / 24 A 32 2 ( 3 x 4 12 x 2 + 5 ) / 24 .
For r > 1 , b r ( x ) is a polynomial of an order only r + 2 , while h 1 r ( x ) is of an order 3 r . See [6] for other h k r ( x ) and b r ( x ) . If E w ^ = w , then A 11 = 0 so that (2)–(4) and (9) hold with
h 01 ( x ) = f 1 ( x ) = g 1 ( x ) = b 1 ( x ) = A 32 H 2 / 6 , h 11 ( x ) = A 32 H 3 / 6 , h k 2 ( x ) = A 22 H 1 + k / 2 + A 43 H 3 + k / 24 + A 32 2 H 5 + k / 72 for   k = 0 , 1 ,
f 2 ( x ) = A 22 H 1 / 2 + A 43 H 3 / 24 A 32 2 ( 4 x 3 7 x ) / 36 , b 2 ( x ) = A 22 H 2 / 2 + A 43 H 4 / 24 A 32 2 ( 3 x 4 12 x 2 + 5 ) / 24 ,
and g 2 ( x ) of (8) is unchanged.
Note 1.
The original Edgeworth expansion was for  w ^  the mean of n i.i.d. random variables from a distribution with the rth cumulant  κ r So (1) holds with  a r , r 1 = κ r , a r i = 0 , i r An explicit formula for its general term was given in [8].
Edgeworth–Cornish–Fisher expansions were first extended to general parametric and non-parametric standard estimates in [3,4] and to functions of them in [9].
In [10], we gave the extended Edgeworth-Cornish-Fisher expansions for smooth functions of the sample cross-moments of a linear process. We now show that this extends easily to a stationary process.

2. The Cumulants of a Stationary Sample Mean

When w ^ is the mean of i.i.d. random variables, the cumulant expansion (1) has only one term. Remarkably, for the sample mean of a stationary process, its cumulant expansion has exactly two terms.
Suppose that X ¯ is the mean of a sample X 1 , , X n from a real stationary process { X i } . So X ¯ is an unbiased estimate of μ = E X 0 . We now show that its rth cumulant has the form
κ r ( X ¯ ) = a n r , r 1 n 1 r + a n r r n r , r > 1 ,
where a n r i are bounded as n increases, and a n 21 is bounded away from 0. This makes it a special case of a standard estimate, so that Section 1 applies with a r i = a n r i for i = r 1 and i = r , and a r i = 0 for i > r and a 11 = 0 .
If a n r i = a r i + O ( e n λ r ) where λ r i > 0 , then a n r i can be replaced by a r i . Here, x n = O ( y n ) means that x n / y n is bounded.
Suppose that the stationary process has cross-cumulants,
k ( i 1 i r ) = κ i 1 , , i r = κ ( X i 1 , , X i r ) .
Given integers i 1 , , i r , set
i 0 = min k = 1 r i k , I k = i k i 0 0 , I 0 = max k = 1 r I k = max k = 1 r i k i 0 .
Since { X i } is stationary,
k ( i 1 i r ) = k ( I 1 I r ) .
These are not changed by permuting subscripts. Also, at least one I k is zero. For r 2 , transforming from i k to T k = i k i 1 for k = 2 , , r ,
n r κ r ( X ¯ ) = i 1 , , i r = 1 n k ( i 1 i r ) = | T k | < n , k = 2 , , r [ n δ r ( T ) ] k ( 0 T 2 T r )
where δ r ( T ) = max ( 0 , T 2 , , T r ) min ( 0 , T 2 , , T r ) .
For example, δ 2 ( T ) = | T 2 | ,
δ 3 ( T ) = T 3 I ( 0 T 2 < T 3 ) + ( T 3 T 2 ) I ( T 2 0 < T 3 ) T 2 I ( T 2 < T 3 0 ) .
So for r 2 , κ r ( X ¯ ) = i = r 1 r a n r i n i , where
a n 21 = | T | < n k ( 0 T ) , a n r , r 1 = | T i | < n , i = 2 , , r k ( 0 T 2 T r ) ,
a n r r = | T i | < n , i = 2 , , r δ r ( T ) k ( 0 T 2 T r ) .
This proves that (12) holds. So the Edgeworth–Cornish–Fisher expansions (2)–(4) and (9)–(11) apply to ( w ^ , w ) = ( X ¯ , μ ) with a r i in (1) replaced by these a n r i , so that A r i = a n r i / a n 21 r / 2 .
If the cross-cumulants k ( 0 T 2 T r ) decrease exponentially in T 2 , as is true for a stationary ARMA process by [10], then for r 2 ,
a n r , r 1 = a r , r 1 + O ( e n λ r ) , a n r r = a r r + O ( e n λ r ) where λ r > 0 , a 21 = | T | < k ( 0 T ) , a r , r 1 = | T i | < , i = 2 , , r k ( 0 T 2 T r ) ,
a r r = | T i | < , i = 2 , , r δ r ( T ) k ( 0 T 2 T r ) ,
so that for X 0 non-lattice, these Edgeworth–Cornish–Fisher expansions apply to ( w ^ , w ) = ( X ¯ , μ ) with these a r i . According to [3] or the appendix to [6], (10)–(8), h k r , f r , g r simplify for r = 3 , 4 . This expands the results given in [10] for a linear process.
For convergence in law of n 1 / 2 ( X ¯ μ ) to N ( 0 , a 21 ) with a 21 of (20), under mixing conditions on a stationary process, see Sections 18.4 and 18.5 of [11]. They also show how to express a 21 n and a 21 in terms of the spectral distribution and density.
Missing values. Now suppose that we only have observations at times t 1 , , t n . Our estimate of μ = E X 0 is then
μ ^ t = n 1 i = 1 n X t i . So E μ ^ t = μ , and for S k = t i k t i 1 , r 2 , n r κ r ( μ ^ t ) = i 1 , , i r = 1 n k ( t i 1 t i r ) = i 1 , , i r = 1 n k ( 0 S 2 S r ) = n a n r , r 1   say .
So if a n 21 is bounded away from 0 and a n r , r 1 is bounded in n, we can apply Section 1 with a r i = 0 for i r 1 .
Weighted means. Let w n 1 , , w n n be given numbers adding to n. For example, the standardized form of the Chernoff weight i / n , giving more weight to more recent observations, is w n i = 2 i / ( n + 1 ) . See [12]. An unbiased estimate of μ is the weighted sample mean, μ ^ w = n 1 i = 1 n w n i X i .
For   r 2 , n r κ r ( μ ^ w ) = i 1 , , i r = 1 n w n i 1 w n i r k ( i 1 , , i r ) = n a n r , r 1
So if a n 21 is bounded away from 0 and a n r , r 1 is bounded in n, we can apply Section 1 with a r i = 0 for i r 1 . Missing values can also be treated by giving them a weight of 0.

3. Multivariate Edgeworth Expansions

Ordinary Bell polynomials. For a sequence from R, say e = ( e 1 , e 2 , ) ,  the partial ordinary Bell polynomial  B ˜ r s = B ˜ r s ( e ) , is defined by the identity
S s = r = s z r B ˜ r s ( e ) where S = r = 1 z r e r , z R . So ,   B ˜ r 0 = δ r 0 , B ˜ r 1 = e r , B ˜ r r = e 1 r , B ˜ 21 = 2 e 1 e 2 ,
where δ 00 = 1 , δ r 0 = 0 for r 0 . They are tabled on p. 309 of [13]. The complete ordinary Bell polynomial, B ˜ r ( e ) , is defined in terms of S by
e S = r = 0 z r B ˜ r s ( e ) . So B ˜ r ( e ) = s = 0 r B ˜ r s ( e ) / s ! :
B ˜ 0 ( e ) = 1 , B ˜ 1 ( e ) = e 1 , B ˜ 2 ( e ) = e 2 + e 1 2 / 2 , B ˜ 3 ( e ) = e 3 + e 1 e 2 + e 1 3 / 6 .
Now suppose that
e j ( s ) = r = 1 j + 2 b ¯ r + j 1 r s ¯ 1 s ¯ r / r ! where s ¯ k = s i k , b ¯ r + j 1 r = b r + j i 1 i r , b 2 d + 1 i 1 i r = 0 ,
for some constants b j i 1 i r . Then for r 1 , we can write
B ˜ r ( e ( s ) ) = k = 1 3 r [ P ¯ r 1 k s ¯ 1 s ¯ k : k r   even ] ,
where (24) and (25) use the tensor summation convention of implicitly summing i 1 , i 2 , over their range 1 , , p , and P ¯ r 1 k is a polynomial in { b ¯ j 1 r } . For r = 1 , 2 , B ˜ r ( e ( s ) ) is given by (25) in terms of
P ¯ 1 1 = b ¯ 2 1 , P ¯ 1 1 3 = b ¯ 4 1 3 / 6 , P ¯ 2 12 = b ¯ 2 1 b ¯ 2 2 / 2 + b ¯ 4 12 / 2 , P ¯ 2 1 4 = b ¯ 6 1 4 / 24 + S b ¯ 2 1 b ¯ 4 2 4 / 6 , P ¯ 2 1 6 = S b ¯ 4 1 3 b ¯ 4 4 6 / 72 ,
and the operator S symmetrises over i 1 , , i k .
Multivariate estimates. Suppose that w ^ is a standard estimate of w R p with respect to n. That is, E w ^ w as n , and for r 1 , 1 i 1 , , i r p , the rth order cumulants of w ^ can be expanded as
k ¯ 1 r = κ ( w ^ i 1 , , w ^ i r ) j = r 1 k ¯ j 1 r n j where k ¯ j 1 r = k j i 1 i r ,
and the cumulant coefficients  k ¯ j 1 r = k j i 1 i r may depend on n but are bounded as n . So k ¯ 0 1 = w i 1 .
Y n = n 1 / 2 ( w ^ w )
converges in law to the multivariate normal N p ( 0 , V ) with p × p covariance V = ( k 1 i 1 i 2 ) , p × p , and distribution and density Φ V ( x ) and ϕ V ( x ) . So V may depend on n, but we assume that d e t ( V ) is bounded away from 0 as n . By [14], for w ^ non-lattice, the density and distribution of Y n can be expanded as
p Y n ( x ) r = 0 n r / 2 p r ( x ) , P r o b . ( Y n x ) r = 0 n r / 2 P r ( x ) , x R p ,
where p 0 ( x ) = ϕ V ( x ) , P 0 ( x ) = Φ V ( x ) ,
p r ( x ) = B ˜ r ( e ( / x ) ) ϕ V ( x ) for r 1 , P r ( x ) = B ˜ r ( e ( / x ) ) Φ V ( x ) = x p r ( x ) ϕ V ( x ) d x for r 1 ,
and b 2 d i 1 i r = k d i 1 i r in (24), so that for example,
e 1 ( s ) = k ¯ 1 1 s ¯ 1 + k ¯ 2 1 3 s ¯ 1 s ¯ 2 s ¯ 3 / 6 , e 2 ( s ) = k ¯ 2 12 s ¯ 1 s ¯ 2 / 2 + k ¯ 3 1 4 s ¯ 1 s ¯ 4 / 24 .
This gives the Edgeworth expansion for the distribution of Y n to O ( n 3 / 2 ) . See [15] for more terms. According to (25),
p r ( x ) / ϕ V ( x ) = k = 1 3 r [ P ¯ r 1 k H ¯ 1 k ( x , V ) : k r even ] , P r ( x ) = k = 1 3 r [ P ¯ r 1 k H ¯ * 1 k ( x , V ) : k r even ] , where P ¯ 1 1 = k ¯ 1 1 , P ¯ 1 1 3 = k ¯ 2 1 3 / 6 , P ¯ 2 12 = k ¯ 1 1 k ¯ 1 2 + k ¯ 2 12 / 2 , P ¯ 2 1 4 = k ¯ 3 1 4 / 24 + S k ¯ 1 1 k ¯ 2 2 4 / 6 , P ¯ 2 1 6 = S k ¯ 2 1 3 k ¯ 2 4 6 / 36 ,
H ¯ 1 k = H ¯ 1 k ( x , V ) is the multivariate Hermite polynomial,
H ¯ 1 k ( x , V ) = ϕ V ( x ) 1 ( ¯ 1 ) ( ¯ k ) ϕ V ( x ) where i = / x i , ¯ k = i k , and   H ¯ * 1 k = H ¯ * 1 k ( x , V ) = ( ¯ 1 ) ( ¯ k ) Φ V ( x ) = x H ¯ 1 k ( x , V ) ϕ V ( x ) d x .
( H ¯ * 1 k deserves a name. Let us call it the multivariate Hermite function.) So P r ( x ) is just p r ( x ) with H ¯ 1 k ϕ V ( x ) replaced by H ¯ * 1 k . For example, according to (27), the leading corrections to the Central Limit Theorem are given by
p 1 ( x ) = e 1 ( / x ) ϕ V ( x ) = ( k ¯ 1 1 H ¯ 1 + k ¯ 2 1 3 H ¯ 1 3 / 6 ) ϕ V ( x ) , P 1 ( x ) = e 1 ( / x ) Φ V ( x ) = k ¯ 1 1 H ¯ * 1 + k ¯ 2 1 3 H ¯ * 1 3 / 6 ,
p 2 ( x ) = ϕ V ( x ) k = 2 , 4 , 6 P ¯ 2 1 k H ¯ 1 k ( x , V ) , P 2 ( x ) = k = 2 , 4 , 6 P ¯ 2 1 k H ¯ * 1 k ( x , V ) ,
for P ¯ 2 1 k of (30). (So h 1 r ( x ) of Section 1 is a one-dimensional form of p r ( x ) ). By [5], for i = 1 ,
H ¯ 1 k ( x , V ) = E Π j = 1 k ( y ¯ j + i Y ¯ j ) where y ¯ j = y i j , Y ¯ j = Y i j , y = V 1 x , Y N p ( 0 , V 1 ) . So ,   H 1 = y 1 , H ¯ 1 = y ¯ 1 , H 1 3 = y 1 y 2 y 3 3 V 12 y 3 ,
where V i 1 i 2 is the ( i 1 , i 2 ) element of V 1 , and 3 V 12 y 3 = V 12 y 3 + V 13 y 2 + V 23 y 1 .
The H 1 k needed for p 2 ( x ) are
H 12 = y 1 y 2 V 12 , so   that   H ¯ 12 = y ¯ 1 y ¯ 2 V ¯ 12 f o r V ¯ 12 = V i 1 i 2 , H 1 4 = y 1 y 4 6 V 12 y 3 y 4 + 3 V 12 V 34 , H 1 6 = y 1 y 6 15 V 12 y 3 y 6 + 45 V 12 V 34 y 5 y 6 45 V 12 V 34 V 56 .
According to [7], the log density can be expanded as
ln [ p n ( x ) / ϕ V ( x ) ] r = 1 n r / 2 b r ( x ) .
So   p n ( x ) / ϕ V ( x ) r = 0 n r / 2 B ˜ r ( b ( x ) ) where b = ( b 2 , b 2 , ) ,
for B ˜ r ( e ) of (22). If E w ^ = w , then k ¯ 1 1 = 0 , so that for r = 1 , 2 , p r ( x ) , P r ( x ) are given by (31)–(32) in terms of
P ¯ 1 1 = 0 , P ¯ 1 1 3 = k ¯ 2 1 3 / 6 , P ¯ 2 12 = k ¯ 2 12 / 2 , P ¯ 2 1 4 = k ¯ 3 1 4 / 24 , P ¯ 2 1 6 = S k ¯ 2 1 3 k ¯ 2 4 6 / 36 .
Note 2.
The term Hermite function is also used by [16] for the non-polynomial solution of the 2nd order differential equation for H n ( x ) of Section 1, given by [17] in terms of confluent hypergeometric functions.

4. Application to Stationary Vector X ¯

Suppose that , X 1 , X 0 , X 1 , lie in R p and are stationary with mean μ = E X 0 and finite moments. Suppose that X ¯ is the mean of a sample X 1 , , X n . So E X ¯ = μ . For j = 1 , , p , denote the jth component of X i and X ¯ by X i j and X ¯ j , and the cross-cumulants of { X i } , by
k j 1 j r i 1 i r = κ ( X i 1 j 1 , , X i r j r )   for   1 j 1 , , j r p .
Given a sequence of integers i 1 , , i r , define i 0 , I k as in (14), and again transform from i k to T k = i k i 1 for k = 2 , , r . (15) becomes
k j 1 j r i 1 i r = k j 1 j r I 1 I r .
In general, k j 1 j 2 0 I k j 1 j 2 0 I . For r 2 and δ r ( T ) of (17), by (16),
n r κ ( X ¯ j 1 , , X ¯ j r ) = i 1 , , i r = 1 n k j 1 j r i 1 i r = | T k | < n , k = 2 , , r [ n δ r ( T ) ] k j 1 j r 0 T 2 T r .
So f o r r 2 , κ ( X ¯ j 1 , , X ¯ j r ) = e = r 1 r k n e j 1 j r n e where
k n 1 j 1 j 2 = | T | < n k j 1 j 2 0 T , k n , r 1 j 1 j r = | T i | < n , i = 2 , , r k j 1 j r 0 T 2 T r ,
and k n r j 1 j r = | T i | < n , i = 2 , , r δ r ( T ) k j 1 j r 0 T 2 T r .
This proves that a two-term version of (26) holds. So the expansions (27)–(29) and (33) hold for the density and distribution of n 1 / 2 ( X ¯ μ ) with V = ( k n 1 j 1 j 2 ) of (38), k i j 1 j r = k n i j 1 j r of (38), (39), and k ¯ 1 1 = 0 . If the cross-cumulants k j 1 j r 0 T 2 T r decrease exponentially in T 2 , then for r 2 ,
k n , r 1 j 1 j r = k r 1 j 1 j r + O ( e n λ r ) ,   and   k n r j 1 j r = k r j 1 j r + O ( e n λ r ) where λ r > 0 , k 1 j 1 j 2 = | T | < k j 1 j 2 0 T , k r 1 j 1 j r = | T i | < , i = 2 , , r k j 1 j r 0 T 2 T r ,
and k r j 1 j r = | T i | < , i = 2 , , r δ r ( T ) k j 1 j r 0 T 2 T r ,
so that for X 0 non-lattice, the expansions (27)–(29) and (33) hold for the density and distribution of Y n = n 1 / 2 ( X ¯ μ ) with V = ( k 1 j 1 j 2 ) of (40), and k ¯ 1 1 = 0 .
These results extend to missing data and weighted means as in Section 2.

5. Discussion and Conclusions

References [1,2] showed that their quantile expansion, updated here as (4), can give great accuracy, in fact to many decimal places. However, for a given n and a large enough x, or for a given x and a small enough n, the expansions (2)–(4) will clearly diverge.
We have shown that the sample mean from a stationary process is a standard estimate of the mean of the process, so that we can apply the Edgeworth–Cornish–Fisher expansions given in Section 1 and Section 3 for any standard estimate. We also showed that remarkably, its cumulant expansion has only two terms. This simplifies the forms for h k r ( x ) , f r ( x ) , g r ( x ) , and r 3 of Section 1, and for p r ( x ) , P r ( x ) , and r 3 of Section 3. And we showed that these results extend to missing data and to weighted means.
As E X ¯ = μ , a 11 = 0 . So the expansions of Section 1 to O ( n 3 / 2 ) require n , x and a 32 , a 22 , a 43 of (18) and (19) as input, or alternatively, a 32 , a 22 , a 43 of (20) and (21). That is, one needs n , x and the first three cross-cumulants of (13), k ( 0 T 2 T r ) , for r = 2 , 3 , 4 .
Similarly, for multivariate series, the Edgeworth expansions of (27) to O ( n 3 / 2 ) for Y n = n 1 / 2 ( X ¯ μ ) require p r ( x ) , P r ( x ) , r = 1 , 2 of (31)–(32) with k ¯ 1 1 = 0 . These require V = ( k 1 j 1 j 2 ) , k ¯ 1 1 = 0 , k ¯ 2 1 3 , k ¯ 3 1 4 of (38), (39) as inputs, or their limits in (40), (41). That is, one needs n , x and the cross-cumulants, k j 1 j r 0 T 2 T r of (35) for r = 2 , 3 , 4 .
In general, non-parametric methods are to be preferred to parametric methods, as a wrong parametric model will give wrong results, even asymptotically. However, most econometric models are parametric. Reference [10] considered the case of the stationary linear process
X i = j = 0 ρ j e i j ,
where ρ j , j 0 , are given constants, and { e i } are i.i.d. random variables with finite cumulants τ r , r 1 . For this very general class of semi-parametric models,
k ( i 1 , , i r ) = α ( i 1 , , i r ) τ r where α ( i 1 , , i r ) = j = 0 ρ j + i 1 ρ j + i r .
Example 1.
For the AR(1) model, X i φ X i 1 = e i with | φ | < 1 ,
ρ j = φ j , α ( i 1 , , i r ) = φ I 1 + + I r
for I k of (14). So the k ( i 1 , , i r ) we need are given by φ and τ r , r = 2 , 3 , 4 . For related work, see [18,19].
For a particular case like this, one can write a general computational code to plot graphs or make tables for various scenarios.
The theory of linear processes is well developed. See, for example, [20,21,22,23].

6. Four Examples of Future Directions

Traditionally, analysis of time series relies on parametric models, such as the autoregressive model of Example 1, or much more generally, ARIMA models, or the use of spectral theory that moves considerations from the time domain to the frequency domain. But as noted, a non-parametric approach is much to be preferred as the wrong parametric model gives incorrect results. A non-parametric estimate of the asymptotic variance a 21 = | T | < k ( 0 T ) of (20), is
a ^ 21 = | T | < N n k ^ ( 0 T )
where k ^ ( 0 T ) is the empirical estimate of k ( 0 T ) and N n . This should give one- and two-sided confidence intervals for μ of error O ( n 1 / 2 ) and O ( n 1 ) . It should also be possible to reduce the one-sided error to O ( n 1 ) or O ( n 3 / 2 ) with (more complicated) confidence intervals, as performed for the i.i.d. case in [4].
A second area where an extension should be possible is to sample central moments and smooth functions of them, as achieved for i.i.d. observations for non-parametric and parametric statistics in [3,4,15] and for linear processes in [10]. Again, it should be possible to develop confidence intervals.
A third, more difficult extension would be a small sample version, using the method of [14]. This would remove the problem of series divergence.
A fourth extension would be to a kernel estimate of the marginal density of X 0 . The authors of [24] gave Cornish–Fisher expansions for these based on a random sample. The expansions are in powers of ( n h ) 1 / 2 , not n 1 / 2 , where h = c n α and α > 0 can be made as small as desired by choice of kernel.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The author C.S.W. was employed by Industrial Research Ltd., now called Callaghan Innovation. The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Cornish, E.A.; Fisher, R.A. Moments and cumulants in the specification of distributions. Rev. de l’Inst. Int. de Statist. 1937, 5, 307–322, Reproduced in The Collected Papers of R.A. Fisher, 4. [Google Scholar] [CrossRef]
  2. Fisher, R.A.; Cornish, E.A. The percentile points of distributions having known cumulants. Technometrics 1960, 2, 209–225. [Google Scholar] [CrossRef]
  3. Withers, C.S. Asymptotic expansions for distributions and quantiles with power series cumulants. J. R. Statist. Soc. B 1984, 46, 389–396. [Google Scholar] [CrossRef]
  4. Withers, C.S. Expansions for the distribution and quantiles of a regular functional of the empirical distribution with applications to non-parametric confidence intervals. Annals Statist. 1983, 11, 577–587. [Google Scholar] [CrossRef]
  5. Withers, C.S. A simple expression for the multivariate Hermite polynomials. Stat. Prob. Lett. 2020, 47, 165–169. [Google Scholar] [CrossRef]
  6. Withers, C.S.; Nadarajah, S.N. Expansions for log densities of asymptotically normal estimates. Stat. Pap. 2010, 51, 247–257. [Google Scholar] [CrossRef]
  7. Withers, C.S.; Nadarajah, S. Expansions for log densities of multivariate estimates. Methodol. Comput. Appl. Probab. 2016, 18, 911–920. [Google Scholar] [CrossRef]
  8. Withers, C.S.; Nadarajah, S.N. Charlier and Edgeworth expansions via Bell polynomials. Probab. Math. Stat. 2009, 29, 271–280. [Google Scholar]
  9. Withers, C.S. The distribution and quantiles of a function of parameter estimates. Ann. Inst. Statist. Math. A 1982, 34, 55–68. [Google Scholar] [CrossRef]
  10. Withers, C.S.; Nadarajah, S. Cornish-Fisher expansions for sample autocovariances and other functions of sample moments of linear processes. Braz. J. Probab. Stat. 2012, 26, 149–166. [Google Scholar] [CrossRef]
  11. Ibragimov, I.A.; Linnik, Y.V. Independent and Stationary Sequences of Random Variables; Wolters-Noordhoff: Groningen, The Netherlands, 1971. [Google Scholar]
  12. Chernoff, H.; Zacks, S. Estimating the current mean of a normal distribution which is subjected to changes in time. Ann. Math. Statist. 1964, 35, 999–1018. [Google Scholar] [CrossRef]
  13. Comtet, L. Advanced Combinatorics; Reidel: Dordrecht, The Netherlands, 1974. [Google Scholar]
  14. Withers, C.S.; Nadarajah, S. Tilted Edgeworth expansions for asymptotically normal vectors. Ann. Inst. Stat. Math. 2010, 62, 1113–1142. [Google Scholar] [CrossRef]
  15. Withers, C.S. 5th-Order multivariate Edgeworth expansions for parametric estimates. Mathematics 2024, 12, 905. [Google Scholar] [CrossRef]
  16. Arfken, G.B.; Weber, H.J.; Harris, F.E. Mathematical Methods for Physicists, 7th ed.; Academic Press: Cambridge, MA, USA, 2012. [Google Scholar] [CrossRef]
  17. Abramowitz, M.; Stegun, I.A. Handbook of Mathematical Functions; U.S. Department of Commerce, National Bureau of Standards, Applied Mathematics Series; Dover Publications: Mineola, NY, USA, 1964; p. 55. [Google Scholar]
  18. Phillips, P.C.B. A general theorem in the theory of asymptotic expansions as an approximation to the finite sample distributions of econometric estimators. Econometrika 1977, 45, 1517–1534. [Google Scholar] [CrossRef]
  19. Phillips, P.C.B. Asymptotic expansions in nonstationary vector autoregressions. Econom. Theory 1987, 3, 45–68. [Google Scholar] [CrossRef]
  20. Hannan, E.J. Time Series Analysis; Wiley: New York, NY, USA, 1962. [Google Scholar]
  21. Hannan, E.J. Multiple Time Series; Wiley: New York, NY, USA, 1970. [Google Scholar]
  22. Kendall, M.G.; Ord, K. Time Series, 3rd ed.; Griffin: London, UK, 1990. [Google Scholar]
  23. Taniguchi, M.; Kakizawa, Y. Asymptotic Theory of Statistical Inference for Time Series; Springer: New York, NY, USA, 2000. [Google Scholar]
  24. Withers, C.S.; Nadarajah, S.N. Edgeworth and Cornish Fisher expansions and confidence intervals for the distribution, density and quantiles of kernel density estimates, and confidence intervals for densities. Statistica 2008, 68, 281–301. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Withers, C.S. The Distribution and Quantiles of the Sample Mean from a Stationary Process. Axioms 2025, 14, 406. https://doi.org/10.3390/axioms14060406

AMA Style

Withers CS. The Distribution and Quantiles of the Sample Mean from a Stationary Process. Axioms. 2025; 14(6):406. https://doi.org/10.3390/axioms14060406

Chicago/Turabian Style

Withers, Christopher S. 2025. "The Distribution and Quantiles of the Sample Mean from a Stationary Process" Axioms 14, no. 6: 406. https://doi.org/10.3390/axioms14060406

APA Style

Withers, C. S. (2025). The Distribution and Quantiles of the Sample Mean from a Stationary Process. Axioms, 14(6), 406. https://doi.org/10.3390/axioms14060406

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop