Previous Article in Journal
Bootstrap Initialization of MLE for Infinite Mixture Distributions with Applications in Insurance Data
Previous Article in Special Issue
Climate Policy Uncertainty and Sovereign Credit Risk: A Multivariate Quantile on Quantile Regression Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Symmetric Positive Semi-Definite Fourier Estimator of Spot Covariance Matrix with High Frequency Data

1
Department of Mathematical Sciences, Ritsumeikan University, Kusatsu 525-8577, Japan
2
Department of Business Economics, Tokyo University of Science, Tokyo 113-8654, Japan
3
Department of Economics and Management, University of Florence, Via delle Pandette 9, 50127 Firenze, Italy
4
Department of Economics, Social Studies, Applied Mathematics and Statistics, University of Turin, Corso Unione Sovietica 218/bis, 10134 Torino, Italy
*
Authors to whom correspondence should be addressed.
Risks 2025, 13(10), 197; https://doi.org/10.3390/risks13100197
Submission received: 30 April 2025 / Revised: 11 July 2025 / Accepted: 17 September 2025 / Published: 9 October 2025
(This article belongs to the Special Issue Integrating New Risks into Traditional Risk Management)

Abstract

This paper proposes a nonparametric estimator of the spot volatility matrix with high-frequency data. Our newly proposed Positive Definite Fourier (PDF) estimator produces symmetric positive semi-definite estimates and is consistent with a suitable choice of the localizing kernel. The PDF estimator is based on a modification of the Fourier estimation method introduced by Malliavin and Mancino. The estimator has two parameters: the frequency N, which controls the biases due to the asynchronicity effect and the market microstructure noise effect; and the localization parameter M for the employed Gaussian kernel. The sensitivity of the PDF estimator to the choice of these two parameters is studied in a simulated environment. The accuracy and the ability of the estimator to produce positive semi-definite covariance matrices are evaluated by an extensive numerical analysis, against competing estimators present in the literature. The results of the simulations are confirmed under different scenarios, including the dimensionality of the problem, the asynchronicity of data, and several different specifications of the market microstructure noise. The computational time required by the estimator and the stability of estimation are also tested with empirical data.

1. Introduction

Empirical studies have pointed out the importance of evaluating distinct time variations in correlations between asset prices. In particular, several recent studies address the efficient estimation of covariances using high-frequency data asynchronously sampled across different assets. While the literature about the estimation of integrated covariances has been steadily growing, it remains still sparse for the spot covariances estimation. An early proposal to cope with spot covariances estimation using asynchronous high-frequency data was made in Malliavin and Mancino (2009). Differently from other estimators that pre-process data to make them synchronous (e.g., linear interpolation, piecewise-constant previous-tick interpolation or the refresh-time procedure proposed by Barndorff-Nielsen et al. (2011)), the Fourier estimator is based on an integration procedure that uses all the available data. Using all data, with no preliminary manipulation (such as pre-averaging, see, e.g., Aït-Sahalia and Jacod (2014)), allows for directly feeding unevenly sampled returns, or even asynchronous data in the multivariate case. A significant property for an estimator of integrated or spot covariances requires the estimated covariance matrix to be positive semi-definite. This has important consequences in several contexts, such as the recently developed field of principal component analysis with high-frequency data (Liu and Ngo (2017), Aït-Sahalia and Xiu (2019), Chen et al. (2020)) or the asset allocation framework (see, e.g., Engle and Colacito (2006)). This property has sometimes been studied for integrated covariances estimators (see, e.g., Barndorff-Nielsen and Shephard (2004), Barndorff-Nielsen et al. (2011), Mancino et al. (2017), Park et al. (2016), Shepard and Xiu (2017), Cui et al. (2019)). However, to the best of our knowledge, the spot covariances estimator proposed in the present paper is the first to guarantee positive semi-definiteness for the estimated matrix. For example, when dealing with spot volatility, Chen et al. (2020) integrate the estimations before computing the eigenvalues of the covariance matrix; instead, Bu et al. (2023) impose positive semi-definiteness by applying suitable shrinkage techniques that effectively impose manipulation of the estimated matrix.
This work aims to propose a novel spot covariance estimator, to prove its positivity and consistency, and to analyze its finite-sample properties in a simulated environment. Our starting point is the spot Fourier estimator by Malliavin and Mancino (2009) (see also Mancino et al. (2017) for several results and applications of the estimation method). The spot Fourier estimator is defined in three steps. First, the discrete Fourier coefficients of the asset returns are computed using all the available observations. Second, it is proven that their convolution product leads to a consistent estimator of the Fourier coefficients of the volatility function. Finally, the Fourier-Fejér inversion formula provides the final spot volatility estimator. Due to a lack of symmetry in the Fejér kernel, however, the spot Fourier estimator may fail to yield positive semi-definite estimations when the asset prices are observed on asynchronous grids. To overcome this difficulty and ensure symmetric and positive semi-definite estimates, this paper introduces a modified version of the Fourier estimator, which we call the PDF estimator. Theorem 1 proves that the modified estimator fulfills the desired property. Theorem 2 gives bounds for its asymptotic error, providing sufficient conditions for the consistency of the estimator based on the rates of N and M with respect to the sampling frequency.
The new estimator relies on two parameters: the cutting frequency N, and the localizing frequency M. We address the question of how to optimally choose them to minimize the error, according to the asymptotic conditions in Theorem 2, via a simulation study. By setting N = c N ρ n α and M = c M ρ n β , where ρ n is the mesh of the given sampling, and α , β are suggested by Theorem 2, a grid of possible values of the constants c N and c M is tested against several different model specifications for both the efficient price process and the additive microstructure component. Concerning the parameter c M , which controls the localization Gaussian kernel, we find that it exhibits a stable optimal value across the scenarios considered, necessitating only a small downward correction in the presence of noise. A similar behavior was observed for the original Fourier estimator in Mancino et al. (2025). Moreover, for the four models of the efficient price, we find that the difference between close values of c N and c M is relatively small: therefore, making a slightly sub-optimal choice causes no significant increase in the error. Finally, we evaluate the finite-sample performance of the proposed PDF estimator by comparing its accuracy and the percentage of its positive semi-definite estimates against those obtained using the smoothed two-scale estimator by Mykland et al. (2019) and the local method of moments estimator by Bibinger et al. (2019), which are both able to manage asynchronous observations. We run the comparison focusing on the main problems that affect the estimation of variance-covariance matrices from high-frequency data. First of all, we address the dimensionality problem, by evaluating the estimates computed when the number of assets increases. Secondly, we focus on the level of asynchronicity, considering different intensities of the Poisson processes that drive the observation frequency. Lastly, we analyze market microstructure noise, considering noise coming from rounding, i.i.d. noise, auto-correlated noise, noise correlated with the efficient price process, and heteroskedastic noise. The exercise shows that only the PDF estimator consistently produces positive semi-definite estimations in 100% of the cases (as guaranteed by the theory), while maintaining a hedge with respect to its competitors in terms of mean square error.
The robustness of all the simulation results is confirmed by modifying the simulation model behind the analysis; in particular, we consider: a Heston Stochastic Volatility model (Heston (1993)), a One Factor Volatility model and a Two Factor Volatility model (Chernov et al. (2003)), and a Rough Heston model (El Euch and Rosenbaum (2019)), and in each case we get comparable results.
Finally, in an empirical exercise that employs trade data from twelve US firms collected from January 2023 to December 2023, we verify the stability of the estimations to changes in the cutting frequency N and in the location parameter M, looking at the same time at the computational time required in such real-world cases, which turns out to be not sensible to the choice of M.
The remainder of this work is organized as follows. Section 2 introduces the positive semi-definite (PDF) Fourier estimator of spot covariance, and its positivity is proved. Section 3 studies the asymptotic error of the PDF estimator with Gaussian kernel, proving its consistency and delivering its rate of convergence both for irregular and regular sampling schemes. Section 4 presents the simulation study, including a sensitivity analysis on the parameters of the estimator in terms of integrated mean square error, and a comparison against alternative estimators present in the literature, in which accuracy and ability to produce positive semi-definite matrices are considered. Section 5 presents the study with empirical data, while Section 6 concludes.

2. The Positive Semi-Definite Spot Covariance Estimator

Assume that the asset price is described by a d-dimensional Itô semimartingale X = ( X 1 , , X d )
X t j = x 0 j + 0 t b j ( s ) d s + k = 1 d 0 t σ j k ( s ) d W s k , j = 1 , , d
with W = ( W 1 , , W d ) a d-dimensional Brownian motion on the filtered probability space ( Ω , ( F t ) t [ 0 , T ] , P ) and b j and σ j k are adapted continuous processes. The d × d instantaneous (spot) covariance matrix V ( t ) has entries
V j , j ( t ) k = 1 d σ j k ( t ) σ j k ( t ) , for j , j = 1 , , d and t [ 0 , T ] .
For simplicity of notation, we assume T = 1 , without loss of generality.
We assume that the prices are observed on discrete, irregular and asynchronous time grids
0 = t 0 j < t 1 j < < t n j j = 1 for j = 1 , , d .
Let ρ n j = max 0 h n j 1 | t h + 1 j t h j | and ρ n = max j = 1 , , d ρ n j . In the following, Δ ( X l j ) denotes the discrete return X t l j X t l 1 j for j = 1 , , d and l = 1 , , n j .
We start introducing the classical Fourier estimation method, in order to better clarify the issue arising for the lack of symmetry and to understand the modification of this estimator we propose here.
For j = 1 , , d , define the discrete Fourier coefficients of the log-returns as
c k ( Δ X n j j ) : = l = 1 n j e 2 π i k t l j Δ ( X l j ) ,
and, for any j , j , define an unbiased estimator of the k-th Fourier coefficient of the covariance process, through the following convolution formula:1
c k ( V n , N j , j ) : = 1 2 N + 1 | h | N c h ( Δ X n j j ) c k h ( Δ X n j j ) .
Finally, for any t ( 0 , 1 ) , the spot covariance estimator is defined as
V ^ n , N , M j , j ( t ) : = | k | M 1 | k | M + 1 e 2 π i k t c k ( V n , N j , j ) .
By means of the Dirichlet kernel D N ( x ) : = 1 2 N + 1 | k | N e 2 π i k x , it is possible to express (1) as
c k ( V n , N j , j ) = 1 2 N + 1 l = 1 n j l = 1 n j D N ( t l j t l j ) e 2 π i k t l j Δ ( X l j ) Δ ( X l j ) .
Therefore, the Fourier spot volatility estimator (2) can be rewritten, using two kernels, as
V n , N , M j , j ( t ) = 1 2 N + 1 l = 1 n j l = 1 n j F M ( t t l j ) D N ( t l j t l j ) Δ ( X l j ) Δ ( X l j ) ,
where F M ( x ) : = | k | M 1 | k | M + 1 e 2 π i k x is the Féjèr kernel.
The Fourier estimation method was introduced in Malliavin and Mancino (2009) to estimate instantaneous multivariate volatilities from high-frequency observations of diffusion processes in a non-parametric way and without any stationarity assumption. The authors aimed at overcoming some difficulties arising from the application of the quadratic variation formula in the commonly used realized covariation methods with financial data. Detailed results and discussion on both statistical properties as well as numerical implementation and empirical applications can be found in the book Mancino et al. (2017).
The asymptotic properties of the Fourier spot volatility estimator (2) have been studied in Malliavin and Mancino (2009) (in the absence of noise) and in Mancino et al. (2025) (in the presence of noise). However, while the positivity of the Fourier estimator of the integrated covariance matrix is shown and debated also in Mancino et al. (2017), the spot covariance estimator may fail in producing symmetric positive semi-definite estimations, being F M ( t t l j ) D N ( t l j t l j ) not symmetric in j , j and could lead to complex eigenvalues in V ^ K , S ( t ) . In addition, simple symmetrizations such as ( V ^ j , j + V ^ j , j ) / 2 are not guaranteed to be positive-definite, possibly with negative eigenvalues.
In this paper, we propose the following estimator of spot covariance, which overcomes the issue of positivite semidefiniteness of the Fourier estimator (3).
Definition 1. 
Let K be a finite subset of Z , S { S ( k ) finite Z 2 : k K , ( s , s ) S ( k ) s + s = k } , and c be a complex function on K ; we define the estimator for V j , j ( t ) as
V ^ K , S j , j ( t ) = l = 1 n j l = 1 n j k K c ( k ) e 2 π i k t ( s , s ) S e 2 π i s t l j e 2 π i s t l j Δ ( X l j ) Δ ( X l j ) .
Remark 1. 
If we take K = { 0 , ± 1 , ± 2 , , ± M } for some positive integer M and S ( k ) = { ( s , s ) | s + s = k , s = 0 , ± 1 , ± 2 , , ± N } for some positive integer N, and
c ( k ) = 1 | k | M + 1 1 2 N + 1 ,
we obtain
V ^ K , S j , j ( t ) = k = M M 1 | k | M + 1 e 2 π i k t s = N N l = 1 n j l = 1 n j e 2 π i s t l j e 2 π i ( k s ) t l j Δ ( X l j ) Δ ( X l j ) .
The estimator (5) can be expressed as follows
V ^ K , S j , j ( t ) = 1 2 N + 1 l = 1 n j l = 1 n j F M ( t t l j ) D N ( t l j t l j ) Δ ( X l j ) Δ ( X l j ) .
Therefore, with a suitable choice of function c ( · ) , the estimator (4) coincides with the Fourier spot covariance estimator (3) introduced by Malliavin and Mancino (2009).
The main theoretical result of this work concerns the positive semi-definiteness of the proposed estimator (4) under suitable hypothesis and is stated in the following theorem.
Theorem 1. 
Let N and M be positive integers. Suppose that K = { 0 , ± 1 , ± 2 , , ± 2 N } , c M ( k ) is a positive semi-definite function on K and
S ( k ) = { ( N + k + v , N v ) : v = 0 , , 2 N k } 0 k 2 N { ( N + k v , N + v ) : v = 0 , , 2 N + k } 2 N k < 0 .
Then, V ^ K , S ( t ) defined in (4) is symmetric and positive semi-definite.
The proof of Theorem 1 is reported in the Appendix A.
Moreover, it emerges that V ^ K , S ( t ) can be rewritten as
V ^ N , M j , j ( t ) = 1 2 N + 1 l = 1 n j l = 1 n j u = N N u = N N c M ( u u ) e 2 π i u ( t t l j ) e 2 π i u ( t t l j ) Δ ( X l j ) Δ ( X l j ) ,
for two assets j and j and t [ 0 , 1 ] , where c M ( k ) is still a positive semi-definite function.2 We call the class of the estimators parameterized by the positive semi-definite function c M  the positive semi-definite Fourier (PDF) estimator.
By Bochner’s theorem, we know that, for each positive semi-definite function c M , there exists a bounded measure μ M on R such that
c M ( x ) = R e 2 π i y x μ M ( d y ) .
Therefore, we may also rewrite the PDF estimator (7) using the measure μ M instead of the positive semi-definite function c M ( k ) , and obtain
V ^ N , M j , j ( t ) = 1 2 N + 1 l = 1 n j l = 1 n j R D N ( t t l j + y ) D N ( t t l j + y ) μ M ( d y ) Δ ( X l j ) Δ ( X l j ) .
Thus, we can also say that the PDF estimators are parameterized by a measure μ M .
In the next Section, we prove the consistency of the estimator (7) (equivalently, (8)).

3. Asymptotic Properties of the PDF Estimator with Gaussian Kernel

In this section, we consider the case where μ M is the Gaussian kernel, or more precisely,
μ M ( d y ) = M 2 π e M y 2 2 d y ,
which is equivalent to
c M ( x ) = e 2 π 2 x 2 M .
While the parameter N controls the microstructure noise effect, as it will appear in the intensive simulation study carried on in the next Section, the parameter M controls the localizing kernel and the estimation error. As we will see, it is needed N ,   M as ρ n 0 with appropriate rates. We will call the estimator Gaussian PDF estimator, or GPDF for short.
In this section, we give an estimate of the error V j , j V ^ N , M j , j of the GPDF estimator under the following assumptions.
For simplicity, we consider d = 2 . Moreover, it is not restrictive to assume that the drift b 0 for the efficient price process3.
Further, assume that
(A)
the volatility processes V j , j , j , j = 1 , 2 satisfy
V : = max j , j ( E [ sup t [ 0 , 2 π ] | j , j | V j , j ( t ) | 2 ] ) 1 / 2 <
and σ j : = ( σ 1 j , σ 2 j ) , j = 1 , 2 are all twice Malliavin differentiable and
C : = max j , j E [ sup s , u , v [ 0 , 2 π ] | σ j ( v ) v ( σ j ( u ) s V j , j ( u ) ) | ] < ,
where ∇ denotes the Malliavin derivative. Further, we assume that V j , j , j , j = 1 , 2 are κ -Hölder continuous for some κ ( 0 , 1 ) in the sense that
k Z | k | 2 κ E [ | ( F V j , j ) ( k ) | 2 ] = : C κ < ,
where ( F V j , j ) ( k ) is the k-th Fourier coefficient of V j , j , i.e.,
( F V j , j ) ( k ) = 0 1 V j , j ( s ) e 2 π i k s d s .
Remark 2. 
The Malliavin differentiability assumption for the volatility process is not restrictive, as it is satisfied by most relevant stochastic volatility models, e.g., Heston model, and also by rough volatility models, see Clement and Gloter (2011); Alòs and Ewald (2008); Inahama (2014). Thus, our result applies to all those models. As far as it concerns volatility jumps, they are not ruled out by the Malliavin differentiability but it is required here that the volatility process is Hölder-continuous. In fact, if the volatility process V j , j ( t ) has a fixed time of discontinuity t, then the Fourier-Fejér inversion formula gives that (6) converges to ( V j , j ( t ) + V j , j ( t ) ) / 2 for a fixed time of discontinuity and V j , j ( t ) for any continuity time.
Theorem 2. 
(i) 
Under the assumption (A), for any j , j = 1 , 2 the L 2 -error between V j , j and the estimator V ^ N , M j , j is estimated as
E 0 1 ( V j , j ( t ) V ^ N , M j , j ( t ) ) 2 d t π 2 V 2 ρ n 2 N 2 M 2 π + ( 4 C + 2 V 2 ) 4 π 2 ρ n 2 N 2 + ( 2 N + 1 ) 1 M 2 π + 2 C κ ( 2 N ) 2 κ + 2 π 2 M κ .
(ii) 
In the case of synchronous and regular sampling, when t k j = k / n for k = 0 , 1 , , n , j = 1 , 2 , Equation (10) is improved as
E 0 1 ( V j , j ( t ) V ^ N , M j , j ( t ) ) 2 d t π 2 V 2 ρ n 2 M 2 π M 4 π 2 + 1 + ( 4 C + 2 V 2 ) ( 2 N + 1 ) 1 M 2 π + 2 C κ ( 2 N ) 2 κ + 2 π 2 M κ .
(iii) 
Consequently, for the general sampling scheme, if4  N ρ n α and M ρ n β , the consistency is attained if
0 < β < 4 3 , β 2 < α < 1 4 β + 1
and
E 0 1 ( V j , j ( t ) V ^ N , M j , j ( t ) ) 2 d t 1 / 2 = O ρ n min ( 1 α β 4 , α 2 β 4 , κ β 2 ) .
Further, the best rate is given as
max 0 < β < 4 3 , β 2 < α < 1 4 β + 1 min ( 1 α β 4 , α 2 β 4 , κ β 2 ) = 2 κ 6 κ + 3 ,
where the maximum is attained when α = 2 / 3 and β = 4 / ( 6 κ + 3 ) .
(iv) 
In the case of synchronous and regular sampling, when t k j = k / n for k = 0 , 1 , , n , j = 1 , 2 , the consistency is attained if
α > β 2 , 0 < β < 4 3
and
E 0 1 ( V j , j ( t ) V ^ N , M j , j ( t ) ) 2 d t 1 / 2 = O n min ( α 2 β 4 , 1 3 4 β , κ β 2 ) .
The best rate is given as
max α > β 2 , 0 < β < 4 3 min ( α 2 β 4 , 1 3 4 β , κ β 2 ) = 2 κ 2 κ + 3 ,
where the maximum is attained when β = 4 / ( 2 κ + 3 ) and α > β 2 .
A proof of Theorem 2 will be given in Appendix B.
Remark 3. 
In Theorem 2, when κ = 1 / 2 , the best rate under the general sampling scheme is 1 / 6 and under synchronous and equally spaced sampling, it is 1 / 4 .

4. Simulation Study

4.1. Simulation Settings

In this section we present an extensive numerical simulated study. The aim of this study is twofold: first in Section 4.2 we analyze the sensitivity of the estimator to the choice of parameters N and M and, with an unfeasible optimization, we find their optimal choices in different scenarios, having as a guide the theoretical results established in the previous section. Secondly, in Section 4.3 we evaluate the accuracy of the proposed GPDF estimator and its ability to produce symmetric and positive semi-definite estimations in a comparison with two alternative estimators that are present in the literature.
To give robustness to the results of our study, in the following analysis we consider many different simulation scenarios, focusing on both the two components of high-frequency financial data: the efficient price and the additive noise component given by market microstructure, so that the observed price X ˜ is
X ˜ t j = X t j + η t j , j = 1 , , d ,
with η being the noise component.
In particular, for the efficient price process we consider the following specifications:
while for the additive microstructure noise we take into account the following cases:
  • no noise case;
  • noise coming from rounding;
  • i.i.d. noise;
  • autocorrelated noise;
  • general noise auto-correlated and dependent of the efficient price process.
Since in the different cases where noise is present we analyze, respectively, 2, 4, 3, 4, different parameterizations, in our simulated analysis we study a total of 56 different scenarios.
For simplicity of the computations, through Section 4.2 and Section 4.3, all the simulated analysis is conducted on the interval [ 0 , 1 ] ; for that reason, and in the light of Section 3, we use the GPDF estimator (7) given by
V ^ N , M j , j ( t ) = 1 2 N + 1 l = 1 n j l = 1 n j u = N N u = N N e 2 π 2 ( u u ) 2 M e 2 π i u ( t t l j ) e 2 π i u ( t t l j ) Δ ( X l j ) Δ ( X l j ) .
where not stated otherwise, the simulations consist of K = 5000 daily trajectories, considering a trading day of length 6.5 h, and are carried out on an equally spaced grid of width 2 s. To introduce asynchronicity in the data, observations are drawn from a Poisson process with an average of one observation every 10 s. Moreover, where not explicitly stated, the correlation between Brownian motions driving the efficient processes of different assets, following Bibinger et al. (2019), is fixed to mimic the median estimated realized correlation of the Nasdaq components, i.e.,
W j , W i = 0.312 , j , i = 1 , , d j i .
In Section 4.1.1 and Section 4.1.2, we define the models used for the efficient price process and the microstructure noise. As a robustness check, additional unreported simulations have been carried out under slightly different choices for the parameters of the reference models, with results analogous to those reported in Section 4.3.

4.1.1. Efficient Price Process

Heston Model
The Heston stochastic volatility model by Heston (1993) is possibly the most widely used stochastic volatility model in the high-frequency econometric literature. It takes the form
d X t j = ( μ ( σ t j ) 2 / 2 ) d t + σ t j d W t j d ( σ t j ) 2 = γ ( θ ( σ t j ) 2 ) d t + ν σ t j d Z t j ,
with W j , Z j = λ to account for the leverage effect. The parameters are set to be
( μ , γ , θ , ν , λ ) = ( 0.05 / 252 , 5 / 252 , 0.1 , 0.5 / 252 , 0.5 ) ,
that is the same choice made by Zu and Boswijk (2014); Mancino et al. (2017) and Figueroa-López and Wu (2022).
Factor Volatility Models
Factor volatility models have been long used in the literature; see, for example, Huang and Tauchen (2005). First, we consider the One-Factor Stochastic Volatility model (SV1F) of the form
d X t j = μ d t + σ t j d W t j σ t j = e β 0 + β 1 τ t j d τ t j = α τ t j + d Z t j
for j = 1 , , d , with W j , Z j = λ , and Z j , Z j = 0 for j j . The simulation is carried out using as parameters
( μ , β 1 , α , β 0 , λ ) = ( 0.03 , 0.125 , 0.025 , β 1 / ( 2 α ) , 0.3 ) ,
which are the parameters used also in Zu and Boswijk (2014); Mancino et al. (2017); Figueroa-López and Wu (2022) and Mancino et al. (2024).
Second, we consider the Two-Factor Stochastic Volatility model (SV2F), proposed by Chernov et al. (2003), which is able to reproduce higher values of volatility of volatility. It takes the form
d X t j = μ d t + s-exp [ β 0 + β 1 τ t j , 1 + β 2 τ t j , 2 ] d W t j d τ t j , 1 = α 1 τ t j , 1 + d Z t j , 1 d τ t j , 2 = α 2 τ t j , 2 + ( 1 + β v τ t j , 2 ) d Z t j , 2
for j = 1 , , d , with W j , Z j , 1 = W j , Z j , 2 = λ , and Z j , i , Z j , i = 0 for j j and i i , i , i = 1 , 2 . For the parameters involved, our choice is to use
( μ , β 0 , β 1 , β 2 , β v , α 1 , α 2 , λ ) = ( 0.03 , 1.1 , 0.04 , 0.3 , 0.003 , 0.6 , 0.25 ) .
Rough Volatility
Starting with the seminal paper by Gatheral et al. (2018), a new strand of financial econometric literature has grown considering dynamics of the volatility process that are not driven by a standard Brownian motion but instead are driven by a fractional Brownian motion, with Hurst index H < 0.5 , which corresponds to the cases where κ < 0.5 . Theorem 2 states that the proposed PDF estimator is consistent even in the presence of rough volatility.
Rough volatility may also be modeled using the stochastic Volterra equation, as in the rough Heston model studied by El Euch and Rosenbaum (2019) and which we intend to use in this study.
X t j = X 0 j + 0 t X t j s σ s j d W s j ( σ t j ) 2 = ( σ 0 j ) 2 + 0 t K ( t s ) ( θ γ ( σ s j ) 2 ) d s + ν σ s j d Z s j ,
with W j , Z j = λ and K ( t ) = C t H 1 2 for H 0 , 1 2 and constant C. In order to simulate the rough Heston model we apply the discrete-time Euler-type scheme studied in Richard et al. (2023), referring in particular to Equation (14) thereof. The parameters of the model are set to ensure that in the exercise the simulated volatility process does not exhibits negative values, and in particular they take values:
( θ , γ , ν , λ , H ) = ( 0.2 , 0.3 , 0.2 , 0.7 , 0.1 ) ,
where the choice for the Hurst parameters is driven by the empirical evidence present in the literature; see, e.g., Gatheral et al. (2018).

4.1.2. Market Microstructure Noise Specifications

It is a known fact (see, e.g., Bandi and Russell (2008)) that high-frequency data are contaminated by the so-called market microstructure noise. In particular, the observed price is the sum of the efficient price and the noise component. The origin of noise is linked to specific characteristics of the microstructure of financial markets, such as bid-ask spread, rounding, strategic trading (see, e.g., Hasbrouck (2007)), and several models specifications for the noise have been proposed in the literature of high-frequency financial econometrics.
Noise Coming from Rounding
In the presence of rounding the observed price process has the following form:
X ˜ t j = log ( [ exp ( X t j ) / r ] r ) ,
where X denotes the efficient price process.
We consider two levels of rounding, corresponding to r = 1 or 5 cents, which are the most used in financial markets.
Noise i.i.d.
The most widely used characterization of noise is to consider it an i.i.d. additive component, as in (14), with mean equal to zero and given constant variance
η i . i . d . E [ η i j ] = 0 , E [ ( η i j ) 2 ] = var ( Δ X 10 s e c j ) σ η 2 , i , j
where X 10 s e c j denotes the regularly spaced series obtained by subsampling the originally simulated series every 10 s.
Here, as can often be found in the literature, we specify a Gaussian distribution for the noise. We consider four values for the variance of the noise component: σ η 2 = 1 , 1.5 , 2 , 2.5 .
Autocorrelated Noise
Here autocorrelation is introduced in the noise component, while keeping the additive form of Equation (14). In particular, it is modeled through an Ornstein–Uhlenbeck (OU) process defined as
d η t j = θ η η t j d t + σ η d E t j ,
where E is a standard Brownian motion independent of W. Three different levels of autocorrelation are considered, using θ η = 0.2 , 0.3 , 0.4 . σ η 2 is set to obtain a level of variance comparable to the second case in the previous scenario.
General Noise Correlated with the Efficient Price Process
In the final noise scenario, we opt for the general structure used in Jacod et al. (2019), which allows for both autocorrelation and dependence on the price process. For j { 1 , 2 } and i = 0 , 1 , , n j , using the simplified notation η i j : = η t i j j , the latter reads as
η i j = ψ i j χ i j ,
where χ i satisfies
χ i j = Z i j + l = 1 L g ( 1 + g ) ( l 1 + g ) l ! Z i l j , g ( 0.5 , 0.5 ) , Z i j i . i . d . N ( 0 , z ) ,
and ψ i is sampled from
d ψ j ( t ) = u ( h ( t ) ψ j ( t ) ) d t + v d W t j , h ( t ) = 1 + w cos 2 π T t ,
with W 1 and W 2 being the Brownian motions driving the dynamics of the efficient prices p 1 and p 2 .
This last model attempts to replicate the slow-decaying autocorrelation in the noise process empirically observed by Jacod et al. (2017), while accounting for the possible dependence between noise and the efficient price component, as observed, for example, by Hansen and Lunde (2006). This formulation still includes an OU dynamics in Equation (17), but modified to also account for heteroskedasticity in the noise, reproducing in particular a U-shaped pattern for the volatility of the noise, given by the deterministic component h.
For the simulation of the noise, we use the following parameter selection:
( z , L , u , v ) = ( var ( Δ X 10 s e c j ) · 2.5 , 100 , 10 , 0.5 ) ,
with the four possible cases obtained coupling g = 0.3 , 0.45 and w = 0.3 , 0.9 .
In a last, unreported, exercise, we also add a rounding of 1 cent to this simulation scheme, without significant changes in the results.

4.2. Selection of Parameters N and M

In this section we want to evaluate the sensitivity of the estimator to the choice of the parameter N and M appearing in the definition of the GPDF estimator. The performance of the estimator for each couple of parameters is evaluated over the entire time interval, across K simulated independent trajectories. In all the following analyses, the path of the spot variance is reconstructed on a regular grid of width 30 min.
In the optimization study, following the results of Theorem 2, we specify the cutting frequency N and the localization parameter M in terms of ρ n , using the fact that N ρ n α c N and M ρ n β c M , for a suitable choice of α and β , depending on the κ -Hölder continuity of the simulated volatility process5. In particular, we optimize over a grid defined by
c N = 0.5 , 1 , 3 , 5 , 7 , 9
and
c M = 0.5 , 1 , 2 , 3 , 4 , 5 .
For each scenario, in a setting with d = 2 , on the defined grid of values for the couple ( c N , c M ) , we look at the estimation error of variance V ^ 1 , 1 and covariance V ^ 1 , 2 , using in particular the integrated mean squared error
M I S E j = K 1 k = 1 K 0 1 ( V ^ 1 , j V 1 , j ) 2 d t , j = 1 , 2
and choosing as optimal the pair that minimizes
0.1 · M I S E 1 + 0.9 · M I S E 2 ,
where a higher weight is given to the estimation of the covariance, being the dominant component of a generic variance–covariance matrix with d 2 d covariance terms.6
Table 1 shows the optimal couple of c N and c M for each scenario. The first result we notice is that the optimal value for c M is pretty stable across the models considered in the analysis for the efficient price process. When the data are affected by noise, the optimal c N is reduced, coherently with known results for the other Fourier-type estimators of both integrated and spot volatility, see, e.g., Mancino et al. (2017) and Mancino et al. (2024), with a reduction that is stronger when the variance of noise is higher and in the presence of strong autocorrelation. Concerning the parameter c M , it exhibits a more stable optimal value in the scenario considered, with only a small downward correction needed in the presence of noise. In this case, c M manages the localizing Gaussian kernel, but a similar behavior was also observed for the original Fourier estimator in Mancino et al. (2024). Moreover, for the four models of the efficient price, the difference between close values of c N and c M is relatively small, meaning that making a slightly sub-optimal choice does not induce a significant increase in the error. This is shown in Table 2, where the M I S E 2 on the defined grid is reported for selected scenarios of under the Heston and the SVF2 model. From Table 2 it is also clear that overall, the estimator, on the grid used for this exercise, is more sensible to the choice of c M than to the choice of c N , with suboptimal values of c M producing a higher increase in the estimation error w.r.t. suboptimal values of c N . The figures for the remaining scenarios not reported in Table 2 are analogous.

4.3. Performance Comparison

After having analyzed the sensibility of our estimator to the choice of N and M, in this section we replicate the extensive simulation study adopted in Section 4.2 to evaluate the accuracy of the proposed PDF estimator in comparison with the one of two competing estimators that are present in the literature. Focusing on the estimators consistent in the presence of asynchronous observations contaminated by microstructure noise, we consider the following.
  • The Gaussian positive definite Fourier estimator proposed in this work (GPDF);
  • the smoothed two-scale spot estimator, by Mykland et al. (2019) (STS);
  • the local method of moments spot estimators, by Bibinger et al. (2019) (LMM).
The kernel-based estimator proposed by Bu et al. (2023), while it may be extended to manage irregular and asynchronous observations, relies on specifically tuned shrinkage techniques to impose positive semi-definiteness of the estimation and is therefore not considered in this analysis.
The analysis is carried out considering a maximum of d = 40 assets, and the performances of each estimator are evaluated according to the mean integrated square error (MISE) and its relative counterpart (RMISE), defined as
M I S E = ( K d 2 ) 1 k = 1 K 0 1 j , i = 1 d ( V ^ k i j ( t ) V k i j ( t ) ) 2 d t ,
R M I S E = ( K d 2 ) 1 k = 1 K 0 1 j , i = 1 d ( V ^ k i j ( t ) V k i j ( t ) ) 2 / V k i j ( t ) 2 d t .
Unreported results show that using a different loss function, in particular the Frobenius norm, the Euclidean norm, or the l 1 norm of the difference between the estimated and the real spot volatility matrix, does not affect the rankings that emerge from Table 3, Table 4, Table 5, Table 6, Table 7 and Table 8. Most importantly, in this analysis we pay particular attention to the percentage of symmetric positive semi-definite (spsd) variance–covariance matrix that each estimator is able to produce in the different scenarios.
For the LMM estimator, we use the parameters that were found to be optimal in the numerical analysis by Bibinger et al. (2019), while for the STS estimator we choose the parameters in a neighborhood of the one used by Chen et al. (2020), minimizing the mean square error obtained on auxiliary simulations. In the following, when not explicitly stated otherwise, the results are meant to be achieve under the Heston specification for the efficient price process.

4.3.1. Absence of Noise

In Section 4.3.1 and Section 4.3.2, we report the results of the comparison, in terms of MISE and percentage of spsd estimates produced by the three competing estimators, when the efficient price process follows the Heston model. In this setting, the results in terms of RMISE are analogous. We begin considering noise-free data and we focus on two major features of high-frequency covariance matrix estimation: the dimensionality of the matrix and the asynchronicity of observations.
Table 3 shows the results for increasing values of d. In this simulated exercise, the modified Fourier estimator performs the best in terms of MSE, for any dimension of the volatility matrix. The effectiveness of the STS estimator to produce positive semi-definite estimations seems to decrease as the number of assets increases, in particular with d > 20 , while the other two estimators both produce 100% of spsd matrices, with a slight derease for the LMM estimator observed for d = 40 . For the important role that it plays in estimating variance–covariance matrices and for the influence that it has on the positivity of the estimation, dimensionality will always be taken into consideration in the remaining analysis. For simplicity of exposition, the dimensions considered in the following are limited to d = 5 , 10 , 15 , 20 .
In the no-noise setting, we also address the issue of different levels of asynchronicity in the data. To do so, we examine the changes in the performance of the three estimators as the average time between two consecutive observations increases. In particular, we extract the observations from the simulated trajectories according to homogeneous Poisson processes that produce on average one observation every 15, 20, and 30 s. Table 4 shows that, still maintaining an edge in terms of MISE with respect to the competitors, the proposed estimator is the only one that is able to produce spsd estimations in 100% of the cases, while both the STS and the LMM estimator can fail with increased frequency when the asynchronicity increases, even though the impact of this kind of changes seems to be quite small in terms of percentage of positive estimation obtained. It also seems that the accuracy of all the estimators decreases with higher values of Δ ¯ t ; this is of course in line with the fact that the consistency of these estimators is an asymptotic property.

4.3.2. Data Contaminated by Microstructure Noise

In this Section we run our comparison considering the noise specification described in Section 4.1.2. It is useful to note that the LMM estimator entails an explicit noise correction, and the STS estimator relies on pre-averaging of the observed data on a synchronous and equally spaced grid. For the proposed GPDF estimator instead, in line with the original Fourier estimator of spot volatility, there is no need to manipulate the data or correct the estimator to manage the presence of noise, but it is sufficient to cut the frequency N, as shown in Section 4.2.
Table 5 shows that the presence of rounding does not appear to significantly affect the accuracy of the estimators and the positive semi-definiteness of the estimations. This effect may be due to the scheme adopted to simulate irregularly sampled data, that implies a sub-sampling with respect to the rounded simulated series, reducing the intensity of this source of noise, whose only impact is to slightly decrease the percentage of spsd estimation for the STS estimator.
Table 6 shows that i.i.d. noise, especially with high noise variance σ η 2 , is able to negatively affect the ability of the STS and, marginally, of the LMM estimators to produce spsd estimations, with a stronger impact as the dimension of the estimated matrix grows. Also, the accuracy of all the estimators deteriorates with higher noise, with the GPDF confirmed as the top performer also in this scenario, but with reduced differences in accuracy among the three estimators. Since i.i.d. noise, from a market microstructure perspective, is usually linked to the presence bid-ask spread as modeled, e.g., as in Roll (1984), and being the bid-ask spread usually related to the liquidity of an asset, the ability of managing this kind of noise may be regarded as the ability to estimate correctly covariance also for illiquid assets.
Table 7 shows that autocorrelated noise is again able to significantly effect the ability of the STS estimators to produce spsd matrices, in particular with low values of θ η , while the LMM estimator seems to be only slightly affected. Low values of θ η , i.e., higher values of autocorrelation in the noise process, reduce the accuracy of the three competitors, while maintaining their ranking substantially unchanged.
Table 8 shows the results for the last specification of noise that we consider, that is, with general noise allowing for auto-covariance, correlation with the efficient price and time-varying noise variance. Our results show that, in this setting, both the LMM and the STS estimators may have difficulties in reaching satisfactory percentages of spsd estimations, depending on the intensity of the microstructure component. We confirm once again the ability of the GPDF estimator to produce variance–covariance matrix with the desired property, and with relatively low estimation error. at the same time, we still find that increasing the dimensionality of the estimation exercise hinders the ability of traditional estimators to produce spsd matrices.

4.4. Alternative Volatility Models

In the previous Sections the comparison results have been obtained in the case when the simulated efficient price process is an Heston model. Even though the error produced by the three estimators may be different changing the simulation model behind our analysis, and in particular the differences in MISE between the GPDF and the LMM estimators are reduced when using the SVF2 or the Rough Heston model, the results are substantially confirmed: the positive semi-definite Fourier estimator remain the only one able to consistently produce positive semi-definite estimations, and is the best performer in terms of MSE in almost any scenario. Table 9 shows the percentage of psd estimations obtained under the alternative volatility models, in absence of microstructure noise, together with the ranking of the estimators in terms of MISE. We can see that, in this exercise, it seems that, moving to the SV1F, to the SV2F or Rough Heston, does not influence significantly the ability of the estimators of producing positive matrices. Also the ranking of the estimators is essentially unaffected. More extensive results about the alternative models, showing the percentage of spsd estimations in the cases with rounding, i.i.d. and general noise, are reported in Appendix C. It is worth noting, as a final consideration, that the results in terms or RMISE are analogous, and always see the GPDF estimator having a competitive edge.

5. Empirical Study

In this section, we aim at using historical market data to comment, in a realistic case of use, on the usability of the proposed GPDF estimator in real world applications, complementing the analysis of the theoretical properties and finite sample performance that was carried out in the previous sections. In particular, we focus on two topics at same time: first, the stability of the estimation in the region of parameters identified as optimal in Section 4.2, since a clear identification of the appropriate scenario for noise might be challenging; second the speed of execution of the computation, in relation to the choice of N and M.
The exercise is carried out on a selection of 12 firms traded on the NYSE, coming from a variety of business sectors. In particular we consider: Accenture (ACN), American Tower corporation (AMT), The Boeing Company (BA), ConocoPhillips (COP), General Electric Company (GE), The Goldman Sachs Group (GS), Eli Lilly and Company (LLY), McDonald’s Corporation (MCD), NextEra Energy (NEE), Netflix (NFLX), Nike (NKE) and Walmart (WMT). The stocks, although all of them are from large companies, may exhibits different liquidity profiles, thus creating a mixture of different specification of noise that might be better captured in the most general form defined in Section 4.1.2. The selected stocks are recorded at the trade frequency for the time horizon of one year, from January 2023 to December 2023. Similar to the approach used in the previous sections, the volatility matrix is estimated every 30 min for each of the 248 full trading days of the sample.
In Section 4.2, the optimal choice for c M in the presence of noise is consistently 0.5, while the value of c N that minimize the MISE appears to lay in the range between 1 and 3, thus we perform the estimation exercise focusing on c N { 1 , 2 , 3 } and c M { 0.5 , 1 } , where c M = 1 is added to evaluate the impact of moving to a different c M on the computational time and the impact of shifting to more sub-optimal parameterizations on the estimation itself. Table 10 shows the results of this analysis. In particular, for each couple of selected ( c N , c M ) , it reports the computational time in seconds7 (computed as the average time needed to estimate the trajectory of the volatility matrix in one day on the defined time grid) and the average relative distance between the estimation and the baseline case of c N = 3 and c M = 0.5 , when the estimation is computed in the middle of the trading day (i.e., in t = 0.5 for t [ 0 , 1 ] ) and the relative distance is defined as the ratio between the Euclidean norm of the difference between the two estimations and the Euclidean norm of the baseline case:
d e m p = k = 1 K | | V ^ 0.5 , k V ^ b a s e 0.5 , k | | | | V ^ b a s e 0.5 , k | | ,
with K being the number of days in the sample.
Overall, the distance in estimation seems to be relatively close and, in most cases, it is smaller or comparable to the relative difference observed in the simulation exercise between the MISE of the GPDF and the ones of the alternative competitors. This confirms that, in the region identified in the simulation study (and even for worsen values of c M ), the estimations appears to be sufficiently stable even when applied to empirical data.

6. Conclusions

In the present paper, a modified version of the classical Fourier estimator for spot covariance by Malliavin and Mancino (2009) has been proposed to overcome the difficulty of obtaining symmetric and positive semi-definite estimation of the spot variance-covariance matrix. We showed that the proposed estimator is positive semi-definite and consistent with a suitable choice of the tuning parameters N , M . To the best of our knowledge, this is the first non-parametric estimator of the spot covariance that guarantees the positiveness. Based on the theoretical results obtained, a numerical study has been carried out to evaluate the optimal choice of the parameters in a variety of settings. The optimal couple seems to be quite stable, and, as usual for the class of the Fourier estimators, in the presence of asynchronicity and noisy data, the parameter N should be reduced with respect to the optimal no-noise case, which is the Nyquist frequency. Moreover, a thorough simulation study has been carried out to evaluate the accuracy of the estimator and its actual ability in producing psd estimations. Comparing the results with the ones of two alternative estimators present in the literature, the STS and the LMM estimators, we found out that the proposed PDF estimator usually outperforms the competitors in terms of mean square error and is the only one that, in this study, was able to always produce psd estimations. The simulation analysis was focused on many challenging aspects of high-frequency covariance estimations, such as the dimensionality of the problem, the degree of asynchronicity between assets and the presence of multiple specifications of market microstructure noise. The robustness of our results are confirmed using alternative data generating processes. The empirical exercise demonstrates the applicability of the method using real-world data. Therefore, we believe that the PDF estimator of spot covariance may be more competitive in terms of empirical applications due to its properties.

Author Contributions

Methodology, J.A., M.E.M., T.M. and Y.Y.; Investigation, R.K.; Data curation, T.M.; Writing—original draft, J.A., N.-L.L. and M.E.M. All authors have read and agreed to the published version of the manuscript.

Funding

Maria Elvira Mancino was partially supported from the European Union Next-GenerationEU—National Recovery and Resilience Plan (NRRP)—MISSION 4 COMPONENT 2, INVESTIMENT 1.1 grant number P2022AX5TH.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author(s).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proof of Theorem 1

Let a j for j = 1 , 2 , 3 be arbitrary functions on Z , from the definitions of K and S ( k ) we notice that:
k K ( s , s ) S ( k ) a 1 ( k ) a 2 ( s ) a 3 ( s ) = k = 0 2 N v = 0 2 N k a 1 ( k ) a 2 ( N + k + v ) a 3 ( N v ) + k = 2 N 1 v = 0 2 N + k a 1 ( k ) a 2 ( N + k v ) a 3 ( N + v ) A + B .
For the first term we have
A = k = 0 2 N u = k N N a 1 ( k ) a 2 ( k u ) a 3 ( u ) = u = N N k = 0 u + N a 1 ( k ) a 2 ( k u ) a 3 ( u ) = u = N N u = N N a 1 ( u + u ) a 2 ( u ) a 3 ( u ) ,
where we set u = N v in the first line, changed the order of the summations in the second line, and put u = k u . Similarly, using the convention that u = 0 1 = 0 , for the second term we have
B = k = 2 N 1 u = N N + k a 1 ( k ) a 2 ( k u ) a 3 ( u ) = u = N N k = u N 1 a 1 ( k ) a 2 ( k u ) a 3 ( u ) = u = N N u = N u 1 a 1 ( u + u ) a 2 ( u ) a 3 ( u ) .
Thus we see that
k K ( s , s ) S ( k ) a 1 ( k ) a 2 ( s ) a 3 ( s ) = u = N N u = N N a 1 ( u + u ) a 2 ( u ) a 3 ( u ) .
When a 1 ( k ) = c ( k ) e 2 π i k t , a 2 ( s ) = e 2 π i s t j l and a 3 ( s ) = e 2 π i s t l j , using the change of variable u u , we obtain
V ^ N j , j ( τ ) = l = 1 n j l = 1 n j u = N N u = N N c ( u u ) e 2 π i u ( τ t l j ) e 2 π i u ( τ t l j ) Δ ( X l j ) Δ ( X l j ) .
Then, for x C d
j , j V ^ N j , j ( τ ) x j x j ¯ = u = N N u = N N j = 1 d x j l = 1 n j e 2 π i u ( τ t l j ) Δ X l j j = 1 d x j l = 1 n j e 2 π i u ( τ t l j ) Δ X l j = u = N N u = N N f ( u ) f ( u ) ¯ 0 .
with f ( u ) j = 1 d x j l = 1 n j e 2 π i u ( τ t l j ) Δ X l j . The proof is complete. □

Appendix B. Proof of Theorem 2

For simplicity let j = 1 , j = 2 . By introducing the notation
φ n j ( s ) = k = 0 n j 1 t k j 1 [ t k j , t k + 1 j ) ( s ) , s [ 0 , 1 ) ,
we can rewrite V ^ N , M 1 , 2
V ^ N , M 1 , 2 ( t ) = 1 2 N + 1 R 0 1 D N ( t φ n 1 ( s ) + y ) d X s 1 0 1 D N ( t φ n 2 ( u ) + y ) d X u 2 μ M ( d y ) = 1 2 N + 1 R 0 1 D N ( t φ n 1 ( s ) + y ) D N ( t φ n 2 ( s ) + y ) V 1 , 2 ( s ) d s μ M ( d y ) + 1 2 N + 1 R 0 1 0 s D N ( t φ n 1 ( u ) + y ) D N ( t φ n 2 ( s ) + y ) d X u 1 d X s 2 μ M ( d y ) + 1 2 N + 1 R 0 1 0 s D N ( t φ n 2 ( u ) + y ) D N ( t φ n 1 ( s ) + y ) d X u 2 d X s 1 μ M ( d y ) .
We put
I ( t ) : = 1 2 N + 1 R 0 1 D N ( t φ n 1 ( s ) + y ) D N ( t φ n 2 ( s ) + y ) D N ( t s + y ) 2 V 1 , 2 ( s ) d s μ M ( d y )
I I ( t ) : = 1 2 N + 1 R 0 1 0 s D N ( t φ n 1 ( u ) + y ) D N ( t φ n 2 ( s ) + y ) d X u 1 d X s 2 μ M ( d y ) + 1 2 N + 1 R 0 1 0 s D N ( t φ n 1 ( s ) + y ) D N ( t φ n 2 ( u ) + y ) d X u 2 d X s 1 μ M ( d y )
and
I I I ( t ) : = 1 2 N + 1 0 1 R D N ( t s + y ) 2 μ M ( d y ) V 1 , 2 ( s ) d s V 1 , 2 ( t ) = R 0 1 F 2 N ( t s + y ) ( V 1 , 2 ( s ) V 1 , 2 ( t ) ) d s μ M ( d y ) ,
where F 2 N is the Fejér kernel defined in Remark 1. Then,
V 1 , 2 ( t ) V ^ N 1 , 2 ( t ) = I ( t ) + I I ( t ) + I I I ( t ) .
The following L 2 -estimates of I ( t ) , I I ( t ) , and I I I ( t ) are true for any μ M so far as 0 < c M ( k ) < 1 , which is true for the Gaussian case.
Lemma A1. 
We have
E 0 1 ( I ( t ) ) 2 d t π 2 V 2 ρ n 2 N 2 | k | 2 N | c M ( k ) | 2 ,
and, in the synchronous and regular case, when φ n 1 φ n 2 ,
E 0 1 ( I ( t ) ) 2 d t π 2 V 2 ρ n 2 | k | 2 N | c M ( k ) | 2 k 2 .
Proof. 
Since
0 1 ( I ( t ) ) 2 d t = 1 ( 2 N + 1 ) 2 [ 0 , 1 ] 3 d t d s d u R μ M ( d y ) R μ M ( d y ) V 1 , 2 ( s ) V 1 , 2 ( u ) × ( D N ( t φ n 1 ( s ) + y ) D N ( t φ n 2 ( s ) + y ) D N ( t s + y ) 2 ) × ( D N ( t φ n 1 ( u ) + y ) D N ( t φ n 2 ( u ) + y ) D N ( t u + y ) 2 ) = 1 ( 2 N + 1 ) 2 N k 1 , k 2 , k 3 , k 4 N R e 2 π i ( k 1 + k 2 ) y μ M ( d y ) R e 2 π i ( k 3 + k 4 ) y μ M ( d y ) × 0 1 e 2 π i ( k 1 + k 2 + k 3 + k 4 ) t d t 0 1 ( e 2 π i k 1 φ n 1 ( s ) 2 π i k 2 φ n 2 ( s ) e 2 π i ( k 1 + k 2 ) s ) V 1 , 2 ( s ) d s × 0 1 ( e 2 π i k 3 φ n 1 ( u ) 2 π i k 2 φ n 2 ( u ) e 2 π i ( k 3 + k 4 ) u ) V 1 , 2 ( u ) d u ,
we obtain (A1) once we establish
E 0 1 ( e 2 π i k 1 φ n 1 ( s ) 2 π i k 2 φ n 2 ( s ) e 2 π i ( k 1 + k 2 ) s ) V 1 , 2 ( s ) d s π V ( | k 1 | + | k 2 | ) ρ n .
To prove Equation (A3), we first observe that
| 0 1 ( e 2 π i k 1 φ n 1 ( s ) 2 π i k 2 φ n 2 ( s ) e 2 π i ( k 1 + k 2 ) s ) V 1 , 2 ( s ) d s | sup t [ 0 , 1 ] | V 1 , 2 ( t ) | 0 1 | 1 e 2 π i ( k 1 ( s φ n 1 ( s ) ) + i k 2 ( s φ n 2 ( s ) ) | d s ,
and
0 1 | 1 e 2 π i ( k 1 ( s φ n 1 ( s ) ) + i k 2 ( s φ n 2 ( s ) ) | d s 2 π 0 1 | k 1 ( s φ n 1 ( s ) ) + k 2 ( s φ n 2 ( s ) ) | d s 2 π | k 1 | 0 1 | s φ n 1 ( s ) | d s + 2 π | k 2 | 0 1 | s φ n 2 ( s ) | d s .
Then, (A3) holds since
0 1 | s φ n j ( s ) | d s = k = 0 n j 1 t k j t k + 1 j ( s t k j ) d s = 1 2 k = 0 n j 1 ( t k + 1 j t k j ) 2 ρ ( n j ) 2 k = 0 n j 1 ( t k + 1 j t k j ) ρ n 2
for j = 1 , 2 .
For (A2), we just need
0 1 | 1 e 2 π i ( k 1 ( s φ n 1 ( s ) ) + i k 2 ( s φ n 2 ( s ) ) | d s 2 π | k 1 + k 2 | 0 1 | s φ n 1 ( s ) | d s .
Lemma A2. 
For the general case, it holds
E 0 1 ( I I ( t ) ) 2 d t ( 4 C + 2 V 2 ) 4 π 2 ρ n 2 N 2 + ( 2 N + 1 ) 1 | k | 2 N c M ( k ) 2 ,
and, when t k j = k / n for k = 0 , 1 , , n , j = 1 , 2 ,
0 1 E ( I I ( t ) ) 2 d t 4 C + 2 V 2 2 N + 1 | k | N c M ( k ) 2 .
Proof. 
We first show that
E [ ( I I ( t ) ) 2 ] 4 C + 2 V 2 ( 2 N + 1 ) 2 [ 0 , 1 ] 2 ( G ( s , u ) ) 2 d s d u ,
where
G ( s , u ) G 1 , 2 ( s , u ) : = R μ M ( d y ) D N ( t φ n 1 ( s ) + y ) D N ( t φ n 2 ( u ) + y ) .
Let
A j , j : = R 0 1 0 s D N ( t φ n j ( u ) + y ) D N ( t φ n j ( s ) + y ) d X u j d X s j μ M ( d y )
for j , j { 1 , 2 } , so that
| I I ( t ) | 2 = 1 ( 2 N + 1 ) 2 | A 1 , 2 + A 2 , 1 | 2 2 ( 2 N + 1 ) 2 ( | A 1 , 2 | 2 + | A 2 , 1 | 2 ) .
Then, we have
E [ | A j , j | 2 ] = R 2 μ M 2 ( d y d y ) 0 1 ( D N ( t φ n j ( s ) + y ) D N ( t φ n j ( s ) + y ) × E V j , j ( s ) 0 s D N ( t φ n j ( u ) + y ) d X u j 0 s D N ( t φ n j ( u ) + y ) d X u j d s = 0 1 0 s ( G j , j ( s , u ) ) 2 E [ V j , j ( s ) V j , j ( u ) ] d u d s + R 2 μ M 2 ( d y d y ) ( 0 1 ( D N ( t φ n j ( s ) + y ) D N ( t φ n j ( s ) + y ) × E [ V j , j ( s ) 0 s 0 u ( D N ( t φ n j ( u ) + y ) D N ( t φ n j ( v ) + y ) + D N ( t φ n j ( u ) + y ) D N ( t φ n j ( v ) + y ) ) d X v j d X u j ] d s ) .
By the Malliavin integration by parts formula,
E [ V j , j ( s ) 0 s 0 u D N ( t φ n j ( u ) + y ) D N ( t φ n j ( v ) + y ) d X v j d X u j ] = 0 s D N ( t φ n j ( u ) + y ) E [ σ j ( u ) s V j , j ( u ) 0 u D N ( t φ n j ( v ) + y ) d X v j ] d u = 0 s D N ( t φ n j ( u ) + y ) 0 u E [ σ j ( v ) v ( σ j ( u ) s V j , j ( u ) ) ] D N ( t φ n j ( v ) + y ) d v d u .
Then applying Malliavin integration by parts formula again, we see that
0 1 d s R 2 μ M 2 ( d y d y ) ( D N ( t φ n j ( s ) + y ) D N ( t φ n j ( s ) + y ) E [ V j , j ( s ) 0 s 0 u ( D N ( t φ n j ( u ) + y ) D N ( t φ n j ( v ) + y ) + D N ( t φ n j ( u ) + y ) D N ( t φ n j ( v ) + y ) ) d X v j d X u j ] = 2 0 1 0 s 0 u G j , j ( s , u ) G j , j ( s , v ) E [ σ j ( v ) v ( σ j ( u ) s V j , j ( u ) ) ] d v d u d s 2 C 0 1 0 s 0 u | G j , j ( s , u ) G j , j ( s , v ) | d v d u d s C [ 0 , 1 ] 2 ( G j , j ( s , u ) ) 2 d u d s .
Thus we obtain (A7).
We proceed to prove (A5) and (A6). Observe that
[ 0 , 2 π ] 3 ( G ( s , u ) ) 2 d u d s d t = [ 0 , 2 π ] 2 d s d u R 2 μ M 2 ( d y d y ) N k 1 , k 2 , k 3 , k 4 N e 2 π i ( k 1 + k 2 + k 3 + k 4 ) t e 2 π i ( k 1 + k 2 ) φ n 1 ( s ) 2 π i ( k 3 + k 4 ) φ n 2 ( u ) e 2 π i ( k 1 + k 3 ) y + 2 π ( k 2 + k 4 ) y = N k 1 , k 2 , k 3 , k 4 N k 1 + k 2 + k 3 + k 4 = 0 c M ( k 1 + k 3 ) c M ( k 2 + k 4 ) 0 1 e 2 π i ( k 1 + k 2 ) ( φ n 1 ( s ) s ) d s 0 1 e 2 π i ( k 3 + k 4 ) ( φ n 2 ( s ) s ) d s .
When t k k / n , we have
0 1 e 2 π i k φ ( s ) d s = 1 n l = 0 n 1 e 2 π i l k n = 1 { k = 0 } ,
hence we obtain (A6). For the general case, we have
0 2 π e 2 π i k φ ( s ) d s = 0 1 ( e 2 π i k φ ( s ) e 2 π i k s ) d s + 0 1 e 2 π i k s d s = 0 1 ( e 2 π i k φ ( s ) e 2 π i k s ) d s + 1 { k = 0 } .
By (A4), we obtain (A5). □
Lemma A3. 
E 0 1 ( I I I ( t ) ) 2 d t 2 C κ ( 2 N ) 2 κ + sup 0 < | k | 2 N 1 c M ( k ) | k | 2 κ .
Proof. 
We first note that
1 2 π 0 2 π ( I I I ( t ) ) 2 d t = | k | > 2 N | ( F V ) ( k ) | 2 + 2 N k 2 N 1 ( 1 | k | 2 N + 1 ) c M ( k ) 2 | ( F V ) ( k ) | 2 ,
and
| k | > 2 N | ( F V ) ( k ) | 2 ( 2 N ) κ | k | > 2 N | k | 2 κ | ( F V ) ( k ) | 2 .
On the other hand, since 0 < c M ( k ) < 1 ,
0 < 1 ( 1 | k | 2 N + 1 ) c M ( k ) < 1 ,
we have
1 ( 1 | k | 2 N + 1 ) c M ( k ) 2 2 ( 1 c M ( k ) ) 2 + 2 ( | k | 2 N + 1 c M ( k ) ) 2 2 ( 1 c M ( k ) ) κ + 2 | k | c M ( k ) 2 N + 1 2 κ ,
and therefore we have
2 N k 2 N 1 ( 1 | k | 2 N + 1 ) c M ( k ) 2 | ( F V ) ( k ) | 2 2 sup 0 < | k | 2 N 1 c M ( k ) | k | 2 κ + ( 2 N + 1 ) 2 κ 2 N k 2 N | k | 2 α | ( F V ) ( k ) | 2 .
Combining (A10) and (A11), we get (A9) by the assumption (9). □
Now we are ready to prove Theorem 2
Proof of Theorem 2. 
First we prove (i) and (ii). We now set
c M ( k ) = e 2 π 2 k 2 M .
Then,
| k | 2 N | c M ( k ) | 2 2 k = 1 2 N k 1 k e 2 π 2 x 2 M d x = R e 2 π 2 x M d x = M 2 π
and
| k | 2 N | c M ( k ) | 2 k 2 2 l = 0 2 N l l + 1 ( x + 1 ) 2 e 2 π 2 x 2 M d x = R ( x + 1 ) 2 e 2 π 2 x 2 M d x = M 2 π M 4 π 2 + 1 .
Further, we have
1 c M ( k ) | k | 2 = 1 e 2 π 2 k 2 M | k | 2 2 π 2 M
so that
sup 0 < | k | 2 N 1 c M ( k ) | k | 2 κ 2 π 2 M κ .
We prove now (iii) and (iv). If N ρ n α and M ρ n β , we have
ρ n 2 N 2 M 2 π ρ n 2 2 α β 2 ,
( 2 N + 1 ) 1 M 2 π ρ n α β 2 ,
2 C κ ( 2 N ) 2 κ ρ n 2 κ α . 2 C κ 2 π 2 M κ ρ n κ β ,
and
ρ n 2 M 2 π M 4 + π 2 n 2 3 β 2 .
Finally, in order to attain the consistency of the proposed estimator under the general sampling scheme, we need to assume
2 2 α β 2 > 0 , α > β 2 , α > 0 , β > 0 ,
which is equivalent to the condition (12). In such a case κ β < 2 κ α .
When the sampling is synchronous and regularly spaced, the necessary condition for consistency clearly becomes (13). □

Appendix C. Additional Results of Comparison for Alternative Models

Table A1. The % of psd matrix produced by each estimator, when the efficient price process is produced by alternative models, in presence of i.i.d. noise.
Table A1. The % of psd matrix produced by each estimator, when the efficient price process is produced by alternative models, in presence of i.i.d. noise.
EstimatorSV1FSV2FRHSV1FSV2FRHSV1FSV2FRHSV1FSV2FRH
d = 5, σ η = 1 d = 5, σ η = 1.5 d = 5, σ η = 2 d = 5, σ η = 2.5
PDF100%100%100%100%100%100%100%100%100%100%100%100%
LMM100%100%100%100%100%100%100%99.92%99.53%97.12%94.18%95.06%
STS100%100%100%100%100%100%100%99.76%99.62%98.53%96.64%98.01%
d = 10, σ η = 1 d = 10, σ η = 1.5 d = 10, σ η = 2 d = 10, σ η = 2.5
PDF100%100%100%100%100%100%100%100%100%100%100%100%
LMM100%100%100%99.84%100%100%99.58%99.13%95.3390.27%86.01%86.83%
STS99.68%99.93%99.95%98.82%98.71%98.55%87.86%89.35%88.16%55.23%49.79%59.43%
d = 15, σ η = 1 d = 15, σ η = 1.5 d = 15, σ η = 2 d = 15, σ η = 2.5
PDF100%100%100%100%100%100%100%100%100%100%100%100%
LMM100%100%99.98%99.97%99.74%99.63%99.15%98.02%98.31%78.15%10.71%76.37%
STS96.69%99.35%96.93%76.07%72.45%75.91%34.70%29.01%28.72%6.95%7.44%8.26%
d = 20, σ η = 1 d = 20, σ η = 1.5 d = 20, σ η = 2 d = 20, σ η = 2.5
PDF100%100%100%100%100%100%100%100%100%100%100%100%
LMM100%100%99.99%99.94%98.38%99.07%97.21%95.80%95.99%66.13%62.74%64.50%
STS64.78%67.24%65.87%12.89%12.45%16.93%4.33%2.47%0.98%0.0%0.0%0.0%
Table A2. The % of psd matrix produced by each estimator, when the efficient price process is produced by alternative models, in presence of general noise.
Table A2. The % of psd matrix produced by each estimator, when the efficient price process is produced by alternative models, in presence of general noise.
EstimatorSV1FSV2FRHSV1FSV2FRHSV1FSV2FRHSV1FSV2FRH
d = 5, g = 0.3 , w = 0.3 d = 5, g = 0.3 , w = 0.9 d = 5, g = 0.45 , w = 0.3 d = 5, g = 0.45 , w = 0.9
PDF100%100%100%100%100%100%100%100%100%100%100%100%
LMM100%99.98%99.84%98.43%98.21%98.02%93.21%90.42%90.16%99.86%98.41%97.62%
STS99.01%99.73%99.61%98.23%98.11%99.04%99.16%99.73%99.88%98.90%98.75%99.00%
d = 10, g = 0.3 , w = 0.3 d = 10, g = 0.3 , w = 0.9 d = 10, g = 0.45 , w = 0.3 d = 10, g = 0.45 , w = 0.9
PDF100%100%100%100%100%100%100%100%100%100%100%100%
LMM100%99.56%99.58%95.86%95.92%96.02%79.20%80.11%81.56%91.25%89.37%90.84%
STS92.80%93.04%93.52%79.23%73.21%80.01%90.93%89.15%89.37%76.41%74.86%77.75%
d = 15, g = 0.3 , w = 0.3 d = 15, g = 0.3 , w = 0.9 d = 15, g = 0.45 , w = 0.3 d = 15, g = 0.45 , w = 0.9
PDF100%100%100%100%100%100%100%100%100%100%100%100%
LMM100%98.12%99.03%85.48%88.23%90.03%68.21%67.39%68.98%80.16%77.56%81.43%
STS51.67%45.47%50.93%45.88%43.92%45.78%39.66%35.31%37.40%79.65%38.04%39.61%
d = 20, g = 0.3 , w = 0.3 d = 20, g = 0.3 , w = 0.9 d = 20, g = 0.45 , w = 0.3 d = 20, g = 0.45 , w = 0.9
PDF100%100%100%100%100%100%100%100%100%100%100%100%
LMM99.98%96.15%98.78%84.38%81.51%85.00%53.11%52.05%55.15%75.82%74.83%72.68%
STS8.99%7.15%8.01%21.67%21.42%12.89%4.57%3.66%3.01%75.43%14.38%15.02%

Notes

1
Hereafter, we accept abuse of notation by denoting the estimator V ^ n j , n j , N , M j , j ( t ) as V ^ n , N , M j , j ( t ) .
2
Here the notation V ^ N , M j , j ( t ) highlights the dependence on the two parameters N , M .
3
The fact that the drift does not contribute to the asymptotics can be proved analogously as in Malliavin and Mancino (2009).
4
Here a n b n means both lim sup n a n / b n and lim sup n b n / a n are finite.
5
Since all the simulations are conducted under irregularly-spaced and asynchronous observations, we follow point ( i i i ) of Theorem 2. Moreover, for the the Heston, the SVF1 and the SVF2 models the Hölder parameter is κ = 1 2 , while the Rough Heston model it depends on the chosen Hurst exponent.
6
Note that, according with Theorem 1, the semi-definite positiveness of the proposed estimator is granted when the optimal cutting frequency N is the same for each spot volatility-covariance entries estimates.
7
Computations are performed using a machine with 2.30 GHz clock speed Intel i7-11800H, 16 GB RAM on Arch Linux kernel 6.14.10-arch1-1 with MATLAB R2024a update 6.

References

  1. Aït-Sahalia, Yacine, and Dacheng Xiu. 2019. Principal component analysis of high-frequency data. Journal of the American Statistical Association 114: 287–303. [Google Scholar]
  2. Aït-Sahalia, Yacine, and Jean Jacod. 2014. High-Frequency Financial Econometrics. Princeton: Princeton University Press. [Google Scholar]
  3. Alòs, Elisa, and Christian-Oliver Ewald. 2008. Malliavin differentiability of the heston volatility and applications to option pricing. Advances in Applied Probability 40: 144–62. [Google Scholar] [CrossRef]
  4. Bandi, Federico M., and Jeffrey R. Russell. 2008. Microstructure noise, realized variance, and optimal sampling. The Review of Economic Studies 75: 339–69. [Google Scholar] [CrossRef]
  5. Barndorff-Nielsen, Ole E., and Neil Shephard. 2004. Econometric analysis of realized covariation: High frequency-based covariance, regression, and correlation in financial economics. Econometrica 72: 885–925. [Google Scholar] [CrossRef]
  6. Barndorff-Nielsen, Ole E., Peter Reinhard Hansen, Asger Lunde, and Neil Shephard. 2011. Multivariate realised kernels: Consistent positive semi-definite estimators of the covariation of equity prices with noise and non-synchronous trading. Journal of Econometrics 162: 149–69. [Google Scholar] [CrossRef]
  7. Bibinger, Markus, Nikolaus Hautsch, Peter Malec, and Markus Reiss. 2019. Estimating the spot covariation of asset prices—Statistical theory and empirical evidence. Journal of Business & Economic Statistics 37: 419–35. [Google Scholar]
  8. Bu, Ruijun, Degui Li, Oliver Linton, and Hanchao Wang. 2023. Nonparametric estimation of large spot volatility matrices for high-frequency financial data. arXiv arXiv:2307.01348. [Google Scholar] [CrossRef]
  9. Chen, Dachuan, Per A. Mykland, and Lan Zhang. 2020. The five trolls under the bridge: Principal component analysis with asynchronous and noisy high frequency data. Journal of the American Statistical Association 115: 1960–77. [Google Scholar] [CrossRef]
  10. Chernov, Mikhail, A. Ronald Gallant, Eric Ghysels, and George Tauchen. 2003. Alternative models for stock price dynamics. Journal of Econometrics 116: 225–57. [Google Scholar] [CrossRef]
  11. Clement, Emmanuelle, and Arnaud Gloter. 2011. Limit theorems in the Fourier transform method for the estimation of multivariate volatility. Stochastic Processes and Their Applications 121: 1097–124. [Google Scholar] [CrossRef]
  12. Cui, Liyuan, Yongmiao Hong, Yingxing Li, and Junhui Wang. 2024. Regularized High-Dimensional Positive Definite Covariance Estimator with High-Frequency Data. Management Science 70: 7242–7264. [Google Scholar] [CrossRef]
  13. El Euch, Omar, and Mathieu Rosenbaum. 2019. The characteristic function of rough Heston models. Mathematical Finance 29: 3–38. [Google Scholar] [CrossRef]
  14. Engle, Robert, and Riccardo Colacito. 2006. Testing and valuing dynamic correlations for asset allocation. Journal of Business & Economic Statistics 24: 238–53. [Google Scholar] [CrossRef]
  15. Figueroa-López, José E., and Bei Wu. 2022. Kernel estimation of spot volatility with microstructure noise using pre-averaging. Econometric Theory 40: 558–607. [Google Scholar] [CrossRef]
  16. Gatheral, Jim, Thibault Jaisson, and Mathieu Rosenbaum. 2018. Volatility is rough. Quantitative Finance 18: 933–49. [Google Scholar] [CrossRef]
  17. Hansen, Peter R., and Asger Lunde. 2006. Realized variance and market microstructure noise. Journal of Business & Economic Statistics 24: 127–161. [Google Scholar] [CrossRef]
  18. Hasbrouck, Joel. 2007. Empirical Market Microstructure: The Institutions, Economics, and Econometrics of Securities Trading. Oxford: Oxford University Press. [Google Scholar]
  19. Heston, Steven L. 1993. A closed-form solution for options with stochastic volatility with applications to bond and currency options. The Rreview of Financial Studies 6: 327–43. [Google Scholar] [CrossRef]
  20. Huang, Xin, and George Tauchen. 2005. The relative contribution of jumps to total price variance. Journal of Financial Econometrics 3: 456–99. [Google Scholar] [CrossRef]
  21. Inahama, Yuzuru. 2014. Malliavin differentiability of solutions of rough differential equations. Journal of Functional Analysis 267: 1566–84. [Google Scholar] [CrossRef]
  22. Jacod, Jean, Yingying Li, and Xinghua Zheng. 2017. Statistical properties of microstructure noise. Econometrica 85: 1133–74. [Google Scholar] [CrossRef]
  23. Jacod, Jean, Yingying Li, and Xinghua Zheng. 2019. Estimating the integrated volatility with tick observations. Journal of Econometrics 208: 80–100. [Google Scholar] [CrossRef]
  24. Liu, Nien-Lin, and Hoang-Long Ngo. 2017. Approximation of eigenvalues of spot cross volatility matrix with a view toward principal component analysis. Japan Journal of Industrial and Applied Mathematics 34: 747–61. [Google Scholar] [CrossRef]
  25. Malliavin, Paul, and Maria Elvira Mancino. 2009. A Fourier transform method for nonparametric estimation of multivariate volatility. The Annals of Statistics 37: 1983–2010. [Google Scholar] [CrossRef]
  26. Mancino, Maria Elvira, Maria Cristina Recchioni, and Simona Sanfelici. 2017. Fourier-Malliavin Volatility Estimation: Theory and Practice. Berlin: Springer International Publishing. [Google Scholar]
  27. Mancino, Maria Elvira, Tommaso Mariotti, and Giacomo Toscano. 2024. Asymptotic normality and finite-sample robustness of the Fourier spot volatility estimator in the presence of microstructure noise. Journal of Business & Economic Statistics, 1–23. [Google Scholar] [CrossRef]
  28. Mancino, Maria Elvira, Tommaso Mariotti, and Giacomo Toscano. 2025. Spot beta estimation with asynchronous noisy prices. Quantitative Finance 25: 733–55. [Google Scholar] [CrossRef]
  29. Mykland, Per A., Lan Zhang, and Dachuan Chen. 2019. The algebra of two scales estimation, and the s-tsrv: High frequency estimation that is robust to sampling times. Journal of Econometrics 208: 101–19. [Google Scholar] [CrossRef]
  30. Park, Sujin, Seok Young Hong, and Oliver Linton. 2016. Estimating the quadratic covariation matrix for asynchronously observed high frequency stock returns corrupted by additive measurement error. Journal of Econometrics 191: 325–47. [Google Scholar] [CrossRef]
  31. Richard, Alexandre, Xiaolu Tan, and Fan Yang. 2023. On the discrete-time simulation of the rough heston model. SIAM Journal on Financial Mathematics 14: 223–49. [Google Scholar] [CrossRef]
  32. Roll, Richard. 1984. A simple implicit measure of the effective bid-ask spread in an efficient market. The Journal of Finance 39: 1127–39. [Google Scholar] [CrossRef]
  33. Shepard, Neil, and Dacheng Xiu. 2017. Econometric analysis of multivariate realised qml: Estimation of the covariation of equity prices under asynchronous trading. Journal of Econometrics 201: 19–42. [Google Scholar] [CrossRef]
  34. Zu, Yang, and H. Peter Boswijk. 2014. Estimating spot volatility with high-frequency financial data. Journal of Econometrics 181: 117–35. [Google Scholar] [CrossRef]
Table 1. Optimal couple of c N , c M in the considered grid across the different models for volatility and microstructure noise.
Table 1. Optimal couple of c N , c M in the considered grid across the different models for volatility and microstructure noise.
HestonSVF1SVF2RH
No noise
5, 15, 15, 15, 1
rNoise from rounding
0.015, 15, 15, 15, 1
0.055, 15, 15, 15, 1
σ η I.i.d. noise
13, 0.53, 0.53, 0.53, 0.5
1.53, 0.53, 0.53, 0.53, 0.5
23, 0.53, 0.53, 0.53, 0.5
2.51, 0.51, 0.51, 0.51, 0.5
θ Auto-correlated noise
0.21, 0.51, 0.51, 0.51, 0.5
0.31, 0.51, 0.51, 0.51, 0.5
0.43, 0.53, 0.53, 0.53, 0.5
g , w General noise
0.3, 0.33, 0.53, 0.53, 0.53, 0.5
0.3, 0.93, 0.53, 0.51, 0.51, 0.5
0.45, 0.31, 0.51, 0.51, 0.51, 0.5
0.45, 0.91, 0.51, 0.51, 0.51, 0.5
Table 2. M I S E 2 in estimating covariance over the considered grid, for selected scenarios.
Table 2. M I S E 2 in estimating covariance over the considered grid, for selected scenarios.
Heston—No noise
c N / c M 0.512345
0.53.068 · 10 4 4.370 · 10 4 6.193 · 10 4 7.539 · 10 4 8.646 · 10 4 9.510 · 10 4
11.539 · 10 4 2.124 · 10 4 2.971 · 10 4 3.619 · 10 4 4.164 · 10 4 4.331 · 10 4
35.172 · 10 5 7.101 · 10 5 1.002 · 10 4 1.232 · 10 4 1.427 · 10 4 1.553 · 10 4
53.985 · 10 5 5.238 · 10 5 7.081 · 10 5 8.488 · 10 5 9.660 · 10 5 1.023 · 10 4
74.657 · 10 5 5.541 · 10 5 6.846 · 10 5 7.847 · 10 5 8.687 · 10 5 9.636 · 10 5
95.732 · 10 5 6.524 · 10 5 6.989 · 10 5 8.874 · 10 5 9.113 · 10 5 9.995 · 10 5
Heston—I.i.d. noise σ η = 2.5
c N / c M 0.512345
0.53.215 · 10 4 4.562 · 10 4 6.456 · 10 4 7.856 · 10 4 9.006 · 10 4 9.228 · 10 4
11.046 · 10 4 1.509 · 10 4 2.173 · 10 4 2.676 · 10 4 3.096 · 10 4 3.261 · 10 4
31.543 · 10 4 2.125 · 10 4 2.968 · 10 4 3.613 · 10 4 4.159 · 10 4 4.365 · 10 4
51.768 · 10 4 2.408 · 10 4 3.349 · 10 4 4.073 · 10 4 4.683 · 10 4 4.883 · 10 4
72.506 · 10 4 3.286 · 10 4 4.447 · 10 4 5.347 · 10 4 6.115 · 10 4 6.510 · 10 4
92.732 · 10 4 3.682 · 10 4 4.770 · 10 4 5.618 · 10 4 6.663 · 10 4 6.798 · 10 4
SVF2—No noise
c N / c M 0.512345
0.58.197 · 10 4 1.234 · 10 3 1.855 · 10 3 2.328 · 10 3 2.723 · 10 3 2.982 · 10 4
14.804 · 10 4 6.877 · 10 4 9.867 · 10 4 1.212 · 10 3 1.403 · 10 3 1.633 · 10 4
32.392 · 10 4 3.066 · 10 4 4.066 · 10 4 4.826 · 10 4 5.462 · 10 4 5.701 · 10 4
51.860 · 10 4 2.161 · 10 4 2.639 · 10 4 3.020 · 10 4 3.354 · 10 4 3.411 · 10 4
72.068 · 10 4 2.183 · 10 4 2.449 · 10 4 2.699 · 10 4 2.931 · 10 4 3.028 · 10 4
92.236 · 10 4 2.421 · 10 4 2.563 · 10 4 2.784 · 10 4 2.988 · 10 4 3.295 · 10 4
SVF2—I.i.d. noise σ η = 2.5
c N / c M 0.512345
0.58.616 · 10 4 1.311 · 10 3 1.974 · 10 3 2.473 · 10 3 2.884 · 10 3 3.115 · 10 3
14.529 · 10 4 6.001 · 10 4 8.176 · 10 4 9.825 · 10 4 1.117 · 10 3 1.228 · 10 3
35.701 · 10 4 8.084 · 10 4 1.144 · 10 3 1.394 · 10 3 1.604 · 10 3 1.773 · 10 4
56.232 · 10 4 8.579 · 10 4 1.163 · 10 3 1.377 · 10 3 1.553 · 10 3 1.640 · 10 3
76.173 · 10 4 8.607 · 10 4 1.254 · 10 3 1.574 · 10 3 1.849 · 10 3 1.930 · 10 3
96.454 · 10 4 8.773 · 10 4 1.296 · 10 3 1.602 · 10 3 1.890 · 10 3 1.981 · 10 3
Table 3. Accuracy (MISE) and % of spsd matrix produced by each estimator, when the dimension d of V increases.
Table 3. Accuracy (MISE) and % of spsd matrix produced by each estimator, when the dimension d of V increases.
EstimatorMISE% SPSDMISE% SPSD
d = 2d = 20
GPDF6.773 · 10 5 100%5.395 · 10 5 100%
LMM2.563 · 10 4 100%1.133 · 10 4 100%
STS2.240 · 10 4 100%2.156 · 10 4 88.60%
d = 5d = 25
GPDF5.670 · 10 5 100%5.258 · 10 5 100%
LMM1.495 · 10 4 100%9.659 · 10 5 100%
STS2.201 · 10 4 100%2.138 · 10 4 66.22%
d = 10d = 30
GPDF5.316 · 10 5 100%5.142 · 10 5 100%
LMM1.215 · 10 4 100%9.604 · 10 5 100%
STS2.022 · 10 4 100%2.006 · 10 4 9.87%
d = 15d = 40
GPDF5.299 · 10 5 100%5.223 · 10 5 100%
LMM9.977 · 10 4 100%9.501 · 10 5 99.54%
STS2.019 · 10 4 95.43%1.994 · 10 4 3.02%
Table 4. Accuracy and % of spsd matrix produced by each estimator, when the average time between consecutive observations Δ ¯ t changes.
Table 4. Accuracy and % of spsd matrix produced by each estimator, when the average time between consecutive observations Δ ¯ t changes.
EstimatorMISE% SPSDMISE% SPSDMISE% SPSD
d = 5, Δ ¯ t = 15 d = 5, Δ ¯ t = 20 d = 5, Δ ¯ t = 30
GPDF6.314 · 10 5 100%7.798 · 10 5 100%9.723 · 10 5 100%
LMM1.610 · 10 4 100%1.885 · 10 4 100%2.503 · 10 4 99.54%
STS2.426 · 10 4 100%2.603 · 10 4 100%2.686 · 10 4 100%
d = 10, Δ ¯ t = 15 d = 10, Δ ¯ t = 20 d = 10, Δ ¯ t = 30
GPDF6.397 · 10 5 100%7.339 · 10 5 100%9.188 · 10 5 100%
LMM1.956 · 10 4 100%1.882 · 10 4 100%1.975 · 10 4 98.58%
STS2.303 · 10 4 100%2.344 · 10 4 100%2.395 · 10 4 100%
d = 15, Δ ¯ t = 15 d = 15, Δ ¯ t = 20 d = 15, Δ ¯ t = 30
GPDF6.322 · 10 5 100%7.451 · 10 5 100%9.169 · 10 5 100%
LMM1.717 · 10 4 100%1.445 · 10 4 98.05%1.918 · 10 4 97.68%
STS2.285 · 10 4 99.90%2.310 · 10 4 99.15%2.383 · 10 4 98.54 %
d = 20, Δ ¯ t = 15 d = 20, Δ ¯ t = 20 d = 20, Δ ¯ t = 30
GPDF6.325 · 10 5 100%7.258 · 10 5 100%9.224 · 10 5 100%
LMM1.651 · 10 4 99.83%1.322 · 10 4 90.18%1.782 · 10 4 96.67%
STS2.213 · 10 4 91.45%2.289 · 10 5 88.01%2.330 · 10 4 82.66 %
Table 5. Accuracy and % of spsd matrix produced by each estimator, when a rounding of 1 or 5 cents is present.
Table 5. Accuracy and % of spsd matrix produced by each estimator, when a rounding of 1 or 5 cents is present.
EstimatorMISE% SPSDMISE% SPSD
d = 5, r = 0.01d = 5, r = 0.05
GPDF5.583 · 10 5 100%5.587 · 10 5 100%
LMM1.638 · 10 4 100%1.641 · 10 4 100%
STS2.188 · 10 4 100%2.188 · 10 4 100%
d = 10, r = 0.01d = 10, r = 0.05
GPDF5.376 · 10 5 100%5.377 · 10 5 100%
LMM1.094 · 10 4 100%1.095 · 10 4 100%
STS2.022 · 10 4 100%2.023 · 10 4 100%
d = 15, r = 0.01d = 15, r = 0.05
GPDF5.284 · 10 5 100%5.284 · 10 5 100%
LMM1.111 · 10 4 100%1.113 · 10 4 100%
STS1.998 · 10 4 99.97%1.997 · 10 4 99.85%
d = 20, r = 0.01d = 20, r = 0.05
GPDF5.203 · 10 5 100%5.204 · 10 5 100%
LMM1.011 · 10 4 100%1.011 · 10 4 100%
STS1.902 · 10 4 97.66%1.902 · 10 4 97.03%
Table 6. Accuracy and % of spsd matrix produced by each estimator, when the data is contaminated by i.i.d. noise.
Table 6. Accuracy and % of spsd matrix produced by each estimator, when the data is contaminated by i.i.d. noise.
EstimatorMISE% SPSDMISE% SPSDMISE% SPSDMISE% SPSD
d = 5, σ η = 1 d = 5, σ η = 1.5 d = 5, σ η = 2 d = 5, σ η = 2.5
GPDF8.229 · 10 5 100%8.242 · 10 5 100%1.350 · 10 4 100%2.054 · 10 4 100%
LMM1.485 · 10 4 100%1.588 · 10 4 100%1.727 · 10 4 100%1.991 · 10 4 100%
STS2.478 · 10 4 100%2.567 · 10 4 100%2.910 · 10 4 100%3.361 · 10 4 97.25%
d = 10, σ η = 1 d = 10, σ η = 1.5 d = 10, σ η = 2 d = 10, σ η = 2.5
GPDF6.922 · 10 5 100%7.901 · 10 5 100%1.503 · 10 4 100%1.889 · 10 4 100%
LMM1.320 · 10 4 100%1.191 · 10 4 100%1.583 · 10 4 100%2.198 · 10 4 99.25%
STS2.236 · 10 4 98.85%2.369 · 10 4 98.25%2.610 · 10 4 86.47%3.028 · 10 4 55.67%
d = 15, σ η = 1 d = 15, σ η = 1.5 d = 15, σ η = 2 d = 15, σ η = 2.5
GPDF6.873 · 10 5 100%7.588 · 10 5 100%1.474 · 10 4 100%1.801 · 10 4 100%
LMM1.225 · 10 4 100%1.348 · 10 4 100%1.920 · 10 4 100%1.955 · 10 4 98.42%
STS2.042 · 10 4 97.43%2.187 · 10 4 76.01%2.738 · 10 4 28.10%3.008 · 10 4 20.13%
d = 20, σ η = 1 d = 20, σ η = 1.5 d = 20, σ η = 2 d = 20, σ η = 2.5
GPDF6.468 · 10 5 100%7.622 · 10 5 100%1.497 · 10 4 100%1.628 · 10 4 100%
LMM1.008 · 10 4 100%1.581 · 10 4 97.83%1.621 · 10 4 96.26%2.044 · 10 4 92.10%
STS2.250 · 10 4 65.27%2.390 · 10 4 15.17%2.503 · 10 4 6.27%2.817 · 10 4 0.0%
Table 7. Accuracy and % of spsd matrix produced by each estimator, when the data is contaminated by autocorrelated noise.
Table 7. Accuracy and % of spsd matrix produced by each estimator, when the data is contaminated by autocorrelated noise.
EstimatorMISE% SPSDMISE% SPSDMISE% SPSD
d = 5, θ η = 0.2 d = 5, θ η = 0.3 d = 5, θ η = 0.4
GPDF2.093 · 10 4 100%1.956 · 10 4 100%1.743 · 10 4 100%
LMM2.187 · 10 4 100%2.632 · 10 4 100%2.693 · 10 4 100%
STS4.134 · 10 4 90.47%3.026 · 10 4 98.98%2.945 · 10 4 97.23%
d = 10, θ η = 0.2 d = 10, θ η = 0.3 d = 10, θ η = 0.4
GPDF1.887 · 10 4 100%1.810 · 10 4 100%1.752 · 10 4 100%
LMM2.789 · 10 4 100%2.863 · 10 4 98.94%1.953 · 10 4 100%
STS3.934 · 10 4 22.71%3.007 · 10 4 65.58%2.666 · 10 4 85.43%
d = 15, θ η = 0.2 d = 15, θ η = 0.3 d = 15, θ η = 0.4
GPDF1.755 · 10 4 100%1.721 · 10 4 100%1.599 · 10 4 100%
LMM2.662 · 10 4 100%2.659 · 10 4 93.44%1.781 · 10 4 95.90%
STS3.728 · 10 4 2.01%3.001 · 10 4 6.25%2.789 · 10 4 20.99%
d = 20, θ η = 0.2 d = 20, θ η = 0.3 d = 20, θ η = 0.4
GPDF1.743 · 10 4 100%1.657 · 10 4 100%1.580 · 10 4 100%
LMM2.514 · 10 4 93.87%2.501 · 10 4 93.48%1.751 · 10 4 93.60%
STS3.899 · 10 4 1.02%2.921 · 10 4 2.03%2.700 · 10 4 2.58%
Table 8. Accuracy and % of spsd matrix produced by each estimator, when the data is contaminated by the general noise process.
Table 8. Accuracy and % of spsd matrix produced by each estimator, when the data is contaminated by the general noise process.
EstimatorMISE% SPSDMISE% SPSDMISE% SPSDMISE% SPSD
d = 5, g = 0.3 , w = 0.3 d = 5, g = 0.3 , w = 0.9 d = 5, g = 0.45 , w = 0.3 d = 5, g = 0.45 , w = 0.9
GPDF2.543 · 10 4 100%2.521 · 10 4 100%2.845 · 10 4 100%2.981 · 10 4 100%
LMM2.701 · 10 4 100%2.695 · 10 4 99.52%3.103 · 10 4 100%3.499 · 10 4 99.40%
STS2.919 · 10 4 99.87%3.184 · 10 4 99.00%4.396 · 10 4 99.83%4.716 · 10 4 98.51%
d = 10, g = 0.3 , w = 0.3 d = 10, g = 0.3 , w = 0.9 d = 10, g = 0.45 , w = 0.3 d = 5, g = 0.45 , w = 0.9
GPDF1.762 · 10 4 100%2.123 · 10 4 100%2.458 · 10 4 100%2.702 · 10 4 100%
LMM2.198 · 10 4 99.76%2.707 · 10 4 97.17%2.660 · 10 4 97.89%3.543 · 10 4 91.06%
STS2.702 · 10 4 99.26%3.005 · 10 4 79.58%3.722 · 10 4 90.36%4.601 · 10 4 77.10%
d = 15, g = 0.3 , w = 0.3 d = 15, g = 0.3 , w = 0.9 d = 15, g = 0.45 , w = 0.3 d = 5, g = 0.45 , w = 0.9
GPDF1.508 · 10 4 100%1.702 · 10 4 100%2.344 · 10 4 100%2.523 · 10 4 100%
LMM1.891 · 10 4 98.04%3.085 · 10 4 87.25%2.622 · 10 4 90.93%3.582 · 10 4 81.48%
STS2.671 · 10 4 50.90%2.901 · 10 4 47.12%3.600 · 10 4 39.94%4.543 · 10 4 40.23%
d = 20, g = 0.3 , w = 0.3 d = 20, g = 0.3 , w = 0.9 d = 20, g = 0.45 , w = 0.3 d = 5, g = 0.45 , w = 0.9
GPDF1.305 · 10 4 100%1.587 · 10 4 100%2.317 · 10 4 100%2.458 · 10 4 100%
LMM2.674 · 10 4 96.15%2.991 · 10 4 81.34%2.956 · 10 4 85.17%3.478 · 10 4 78.63%
STS2.667 · 10 4 8.21%2.901 · 10 4 21.94%3.478 · 10 4 3.77%4.293 · 10 4 16.42%
Table 9. The % of psd matrix produced by each estimator, when the efficient price process is produced by alternative models.
Table 9. The % of psd matrix produced by each estimator, when the efficient price process is produced by alternative models.
SVF1SVF2Rough H.
EstimatorMISE% SPSDMISE% SPSDMISE% SPSD
d = 2
GPDF2.433 · 10 5 100%2.441 · 10 3 100%4.575 · 10 3 100%
LMM6.799 · 10 5 100%4.663 · 10 3 100%6.020 · 10 3 100%
STS9.345 · 10 5 100%6.217 · 10 3 100%6.995 · 10 3 100%
d = 5
GPDF1.805 · 10 5 100%1.496 · 10 3 100%2.421 · 10 3 100%
LMM4.022 · 10 5 100%2.036 · 10 5 100%3.640 · 10 3 100%
STS7.002 · 10 5 100%3.391 · 10 3 100%4.473 · 10 3 100%
d = 10
GPDF1.756 · 10 5 100%6.802 · 10 4 100%1.588 · 10 3 100%
LMM3.670 · 10 5 100%1.215 · 10 3 100%2.036 · 10 3 99.98%
STS7.287 · 10 5 100%1.902 · 10 3 100%3.654 · 10 3 99.95%
d = 15
GPDF1.699 · 10 5 100%5.185 · 10 4 100%1.403 · 10 3 100%
LMM3.476 · 10 5 100%9.004 · 10 4 100%1.890 · 10 3 99.64%
STS6.995 · 10 5 100%1.634 · 10 3 99.92%3.338 · 10 3 99.74%
d = 20
GPDF1.781 · 10 5 100%4.561 · 10 4 100%1.241 · 10 3 100%
LMM3.286 · 10 5 100%8.639 · 10 4 99.76%1.703 · 10 3 99.55%
STS6.994 · 10 5 99.56%1.464 · 10 3 96.85%3.263 · 10 3 98.23%
Table 10. Average computational time (in seconds) for estimation of volatility paths for different couple of ( c N , c M ) and relative distance to baseline case ( c N = 3 , c M = 0.5 ) .
Table 10. Average computational time (in seconds) for estimation of volatility paths for different couple of ( c N , c M ) and relative distance to baseline case ( c N = 3 , c M = 0.5 ) .
( c N , c M ) ( 1 , 0.5 ) ( 2 , 0.5 ) ( 3 , 0.5 ) ( 1 , 1 ) ( 2 , 1 ) ( 3 , 1 )
avg. comp. time1.05522.1133.10141.06662.31613.6283
d e m p 0.26740.129800.39660.34040.3124
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Akahori, J.; Kambara, R.; Liu, N.-L.; Mancino, M.E.; Mariotti, T.; Yasuda, Y. Symmetric Positive Semi-Definite Fourier Estimator of Spot Covariance Matrix with High Frequency Data. Risks 2025, 13, 197. https://doi.org/10.3390/risks13100197

AMA Style

Akahori J, Kambara R, Liu N-L, Mancino ME, Mariotti T, Yasuda Y. Symmetric Positive Semi-Definite Fourier Estimator of Spot Covariance Matrix with High Frequency Data. Risks. 2025; 13(10):197. https://doi.org/10.3390/risks13100197

Chicago/Turabian Style

Akahori, Jiro, Reika Kambara, Nien-Lin Liu, Maria Elvira Mancino, Tommaso Mariotti, and Yukie Yasuda. 2025. "Symmetric Positive Semi-Definite Fourier Estimator of Spot Covariance Matrix with High Frequency Data" Risks 13, no. 10: 197. https://doi.org/10.3390/risks13100197

APA Style

Akahori, J., Kambara, R., Liu, N.-L., Mancino, M. E., Mariotti, T., & Yasuda, Y. (2025). Symmetric Positive Semi-Definite Fourier Estimator of Spot Covariance Matrix with High Frequency Data. Risks, 13(10), 197. https://doi.org/10.3390/risks13100197

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop