Abstract
Edgeworth–Cornish–Fisher expansions are hugely important, as they give the distribution, density and quantiles of any standard estimate. Here we show that the sample mean of a univariate or multivariate stationary process is a standard estimate, so that all the known results for standard estimates can be applied. We also show how to allow for missing data and weighted means.
MSC:
62E20
1. Introduction and Summary
Finding the distribution, density and quantiles of an estimate is of great importance. This has been accomplished for standard estimates by extending the expansions of Edgeworth, Cornish, and Fisher. In this section we summarize these results for a univariate standard estimate.
Section 2 gives our main result: we show that the mean of a sample from a univariate stationary process satisfies a special truncated form of the cumulant expansion (1) below, so that all the results of this section can be applied. It also considers the case where observations are not sequential, as for the case of missing data. And it considers unbiased weighted sample means.
In Section 4, we extend this to the mean of a sample from a multivariate stationary process, after summarising the multivariate Edgeworth expansions for a standard estimate in Section 3.
We now summarise the known results for the distribution and quantiles of a univariate standard estimate. Let be an estimate of an unknown based on a sample of size n. References [1,2] gave expansions in for its distribution and quantiles when its cumulants satisfied an artifical assumption removed by [3]. We call a standard estimate with respect to n, if as , and its rth cumulant can be expanded as
where ≈ indicates an asymptotic expansion that need not converge. For example, in [4], this holds if is a smooth function of the mean of independent identically distributed (i.i.d.) random variables. The cumulant coefficients of , may depend on n but must be bounded as , and must be bounded away from 0 as . For non-lattice, the distribution, density and quantiles of
have asymptotic expansions in powers of :
where and are the distribution and density of a unit normal random variable , and are polynomials of degrees both in x and in the standardized cumulant coefficients
In practice, one truncates these expansions for the distribution, density and quantiles of . The leading given below suffice to see if divergence has begun. For the kth Hermite polynomial is
and so on. See [5] for (6). For the expansions to of (2) and (4), see [3]. of (2), (3) has the form
where are polynomials in of (5), given for in the appendix of [6], or easily derived from [3]. The terms needed in expansions (2)–(4) to are
By [7], the log density has a simpler form than the density:
For is a polynomial of an order only , while is of an order . See [6] for other and . If , then so that (2)–(4) and (9) hold with
and of (8) is unchanged.
Note 1.
The original Edgeworth expansion was for the mean of n i.i.d. random variables from a distribution with the rth cumulant . So (1) holds with . An explicit formula for its general term was given in [8].
Edgeworth–Cornish–Fisher expansions were first extended to general parametric and non-parametric standard estimates in [3,4] and to functions of them in [9].
In [10], we gave the extended Edgeworth-Cornish-Fisher expansions for smooth functions of the sample cross-moments of a linear process. We now show that this extends easily to a stationary process.
2. The Cumulants of a Stationary Sample Mean
When is the mean of i.i.d. random variables, the cumulant expansion (1) has only one term. Remarkably, for the sample mean of a stationary process, its cumulant expansion has exactly two terms.
Suppose that is the mean of a sample from a real stationary process . So is an unbiased estimate of . We now show that its rth cumulant has the form
where are bounded as n increases, and is bounded away from 0. This makes it a special case of a standard estimate, so that Section 1 applies with for and , and for and .
If where then can be replaced by Here, means that is bounded.
Suppose that the stationary process has cross-cumulants,
Given integers set
Since is stationary,
These are not changed by permuting subscripts. Also, at least one is zero. For transforming from to for ,
For example, ,
So for , where
This proves that (12) holds. So the Edgeworth–Cornish–Fisher expansions (2)–(4) and (9)–(11) apply to with in (1) replaced by these , so that
If the cross-cumulants decrease exponentially in , as is true for a stationary ARMA process by [10], then for ,
so that for non-lattice, these Edgeworth–Cornish–Fisher expansions apply to with these . According to [3] or the appendix to [6], (10)–(8), simplify for . This expands the results given in [10] for a linear process.
For convergence in law of to with of (20), under mixing conditions on a stationary process, see Sections 18.4 and 18.5 of [11]. They also show how to express and in terms of the spectral distribution and density.
Missing values. Now suppose that we only have observations at times . Our estimate of is then
So if is bounded away from 0 and is bounded in n, we can apply Section 1 with for .
Weighted means. Let be given numbers adding to n. For example, the standardized form of the Chernoff weight , giving more weight to more recent observations, is See [12]. An unbiased estimate of is the weighted sample mean,
So if is bounded away from 0 and is bounded in n, we can apply Section 1 with for . Missing values can also be treated by giving them a weight of 0.
3. Multivariate Edgeworth Expansions
Ordinary Bell polynomials. For a sequence from R, say the partial ordinary Bell polynomial , is defined by the identity
where for They are tabled on p. 309 of [13]. The complete ordinary Bell polynomial, , is defined in terms of S by
Now suppose that
for some constants . Then for , we can write
where (24) and (25) use the tensor summation convention of implicitly summing over their range , and is a polynomial in . For is given by (25) in terms of
and the operator symmetrises over .
Multivariate estimates. Suppose that is a standard estimate of with respect to n. That is, as , and for , the rth order cumulants of can be expanded as
and the cumulant coefficients may depend on n but are bounded as . So
converges in law to the multivariate normal with covariance , and distribution and density and . So V may depend on n, but we assume that is bounded away from 0 as . By [14], for non-lattice, the density and distribution of can be expanded as
and in (24), so that for example,
This gives the Edgeworth expansion for the distribution of to . See [15] for more terms. According to (25),
is the multivariate Hermite polynomial,
( deserves a name. Let us call it the multivariate Hermite function.) So is just with replaced by . For example, according to (27), the leading corrections to the Central Limit Theorem are given by
for of (30). (So of Section 1 is a one-dimensional form of ). By [5], for ,
where is the element of and
The needed for are
According to [7], the log density can be expanded as
for of (22). If , then , so that for are given by (31)–(32) in terms of
Note 2.
The term Hermite function is also used by [16] for the non-polynomial solution of the 2nd order differential equation for of Section 1, given by [17] in terms of confluent hypergeometric functions.
4. Application to Stationary Vector
Suppose that lie in and are stationary with mean and finite moments. Suppose that is the mean of a sample . So For , denote the jth component of and by and and the cross-cumulants of , by
Given a sequence of integers , define as in (14), and again transform from to for . (15) becomes
In general, . For and of (17), by (16),
So where
This proves that a two-term version of (26) holds. So the expansions (27)–(29) and (33) hold for the density and distribution of with of (38), of (38), (39), and . If the cross-cumulants decrease exponentially in , then for ,
so that for non-lattice, the expansions (27)–(29) and (33) hold for the density and distribution of with of (40), and .
These results extend to missing data and weighted means as in Section 2.
5. Discussion and Conclusions
References [1,2] showed that their quantile expansion, updated here as (4), can give great accuracy, in fact to many decimal places. However, for a given n and a large enough x, or for a given x and a small enough n, the expansions (2)–(4) will clearly diverge.
We have shown that the sample mean from a stationary process is a standard estimate of the mean of the process, so that we can apply the Edgeworth–Cornish–Fisher expansions given in Section 1 and Section 3 for any standard estimate. We also showed that remarkably, its cumulant expansion has only two terms. This simplifies the forms for and of Section 1, and for and of Section 3. And we showed that these results extend to missing data and to weighted means.
As . So the expansions of Section 1 to require and of (18) and (19) as input, or alternatively, of (20) and (21). That is, one needs and the first three cross-cumulants of (13), , for
Similarly, for multivariate series, the Edgeworth expansions of (27) to for require of (31)–(32) with . These require of (38), (39) as inputs, or their limits in (40), (41). That is, one needs and the cross-cumulants, of (35) for .
In general, non-parametric methods are to be preferred to parametric methods, as a wrong parametric model will give wrong results, even asymptotically. However, most econometric models are parametric. Reference [10] considered the case of the stationary linear process
where are given constants, and are i.i.d. random variables with finite cumulants For this very general class of semi-parametric models,
Example 1.
For the AR(1) model, with ,
for of (14). So the we need are given by φ and For related work, see [18,19].
For a particular case like this, one can write a general computational code to plot graphs or make tables for various scenarios.
The theory of linear processes is well developed. See, for example, [20,21,22,23].
6. Four Examples of Future Directions
Traditionally, analysis of time series relies on parametric models, such as the autoregressive model of Example 1, or much more generally, ARIMA models, or the use of spectral theory that moves considerations from the time domain to the frequency domain. But as noted, a non-parametric approach is much to be preferred as the wrong parametric model gives incorrect results. A non-parametric estimate of the asymptotic variance of (20), is
where is the empirical estimate of and This should give one- and two-sided confidence intervals for of error and . It should also be possible to reduce the one-sided error to or with (more complicated) confidence intervals, as performed for the i.i.d. case in [4].
A second area where an extension should be possible is to sample central moments and smooth functions of them, as achieved for i.i.d. observations for non-parametric and parametric statistics in [3,4,15] and for linear processes in [10]. Again, it should be possible to develop confidence intervals.
A third, more difficult extension would be a small sample version, using the method of [14]. This would remove the problem of series divergence.
A fourth extension would be to a kernel estimate of the marginal density of . The authors of [24] gave Cornish–Fisher expansions for these based on a random sample. The expansions are in powers of , not , where and can be made as small as desired by choice of kernel.
Funding
This research received no external funding.
Data Availability Statement
The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.
Conflicts of Interest
The author C.S.W. was employed by Industrial Research Ltd., now called Callaghan Innovation. The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
References
- Cornish, E.A.; Fisher, R.A. Moments and cumulants in the specification of distributions. Rev. de l’Inst. Int. de Statist. 1937, 5, 307–322, Reproduced in The Collected Papers of R.A. Fisher, 4. [Google Scholar] [CrossRef]
- Fisher, R.A.; Cornish, E.A. The percentile points of distributions having known cumulants. Technometrics 1960, 2, 209–225. [Google Scholar] [CrossRef]
- Withers, C.S. Asymptotic expansions for distributions and quantiles with power series cumulants. J. R. Statist. Soc. B 1984, 46, 389–396. [Google Scholar] [CrossRef]
- Withers, C.S. Expansions for the distribution and quantiles of a regular functional of the empirical distribution with applications to non-parametric confidence intervals. Annals Statist. 1983, 11, 577–587. [Google Scholar] [CrossRef]
- Withers, C.S. A simple expression for the multivariate Hermite polynomials. Stat. Prob. Lett. 2020, 47, 165–169. [Google Scholar] [CrossRef]
- Withers, C.S.; Nadarajah, S.N. Expansions for log densities of asymptotically normal estimates. Stat. Pap. 2010, 51, 247–257. [Google Scholar] [CrossRef]
- Withers, C.S.; Nadarajah, S. Expansions for log densities of multivariate estimates. Methodol. Comput. Appl. Probab. 2016, 18, 911–920. [Google Scholar] [CrossRef]
- Withers, C.S.; Nadarajah, S.N. Charlier and Edgeworth expansions via Bell polynomials. Probab. Math. Stat. 2009, 29, 271–280. [Google Scholar]
- Withers, C.S. The distribution and quantiles of a function of parameter estimates. Ann. Inst. Statist. Math. A 1982, 34, 55–68. [Google Scholar] [CrossRef]
- Withers, C.S.; Nadarajah, S. Cornish-Fisher expansions for sample autocovariances and other functions of sample moments of linear processes. Braz. J. Probab. Stat. 2012, 26, 149–166. [Google Scholar] [CrossRef]
- Ibragimov, I.A.; Linnik, Y.V. Independent and Stationary Sequences of Random Variables; Wolters-Noordhoff: Groningen, The Netherlands, 1971. [Google Scholar]
- Chernoff, H.; Zacks, S. Estimating the current mean of a normal distribution which is subjected to changes in time. Ann. Math. Statist. 1964, 35, 999–1018. [Google Scholar] [CrossRef]
- Comtet, L. Advanced Combinatorics; Reidel: Dordrecht, The Netherlands, 1974. [Google Scholar]
- Withers, C.S.; Nadarajah, S. Tilted Edgeworth expansions for asymptotically normal vectors. Ann. Inst. Stat. Math. 2010, 62, 1113–1142. [Google Scholar] [CrossRef]
- Withers, C.S. 5th-Order multivariate Edgeworth expansions for parametric estimates. Mathematics 2024, 12, 905. [Google Scholar] [CrossRef]
- Arfken, G.B.; Weber, H.J.; Harris, F.E. Mathematical Methods for Physicists, 7th ed.; Academic Press: Cambridge, MA, USA, 2012. [Google Scholar] [CrossRef]
- Abramowitz, M.; Stegun, I.A. Handbook of Mathematical Functions; U.S. Department of Commerce, National Bureau of Standards, Applied Mathematics Series; Dover Publications: Mineola, NY, USA, 1964; p. 55. [Google Scholar]
- Phillips, P.C.B. A general theorem in the theory of asymptotic expansions as an approximation to the finite sample distributions of econometric estimators. Econometrika 1977, 45, 1517–1534. [Google Scholar] [CrossRef]
- Phillips, P.C.B. Asymptotic expansions in nonstationary vector autoregressions. Econom. Theory 1987, 3, 45–68. [Google Scholar] [CrossRef]
- Hannan, E.J. Time Series Analysis; Wiley: New York, NY, USA, 1962. [Google Scholar]
- Hannan, E.J. Multiple Time Series; Wiley: New York, NY, USA, 1970. [Google Scholar]
- Kendall, M.G.; Ord, K. Time Series, 3rd ed.; Griffin: London, UK, 1990. [Google Scholar]
- Taniguchi, M.; Kakizawa, Y. Asymptotic Theory of Statistical Inference for Time Series; Springer: New York, NY, USA, 2000. [Google Scholar]
- Withers, C.S.; Nadarajah, S.N. Edgeworth and Cornish Fisher expansions and confidence intervals for the distribution, density and quantiles of kernel density estimates, and confidence intervals for densities. Statistica 2008, 68, 281–301. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).