A Note on Wavelet-Based Estimator of the Hurst Parameter

The signals in numerous fields usually have scaling behaviors (long-range dependence and self-similarity) which is characterized by the Hurst parameter H. Fractal Brownian motion (FBM) plays an important role in modeling signals with self-similarity and long-range dependence. Wavelet analysis is a common method for signal processing, and has been used for estimation of Hurst parameter. This paper conducts a detailed numerical simulation study in the case of FBM on the selection of parameters and the empirical bias in the wavelet-based estimator which have not been studied comprehensively in previous studies, especially for the empirical bias. The results show that the empirical bias is due to the initialization errors caused by discrete sampling, and is not related to simulation methods. When choosing an appropriate orthogonal compact supported wavelet, the empirical bias is almost not related to the inaccurate bias correction caused by correlations of wavelet coefficients. The latter two causes are studied via comparison of estimators and comparison of simulation methods. These results could be a reference for future studies and applications in the scaling behavior of signals. Some preliminary results of this study have provided a reference for my previous studies.


Introduction
The signals in numerous fields usually have scaling behavior (long-range dependence and self-similarity) which has been recognized as a key property for data characterization and decision making (see e.g., [1][2][3][4][5]). It is usually characterized by the Hurst parameter H [6]. The key point for detecting the scaling behavior is the estimation of the Hurst parameter H. The Hurst parameter was first computed via R/S statistic by Hurst [7] for the study of hydrological properties of Nile river. Hurst found that R/S statistic on the Nile data grew approximately as n H , H = 0.74. n is the number of observations. This phenomenon is called the Hurst phenomenon. To study the Hurst phenomenon, Mandelbrot introduced the concept of self-similar and explained the Hurst phenomenon successfully using self-similar fractional Brownian motion (FBM) [8]. A continuous process X(t) is said to be self-similar, if for a > 0, X(at) d = a H X(t), H is the self-similar parameter. When H > 0.5, the increments of FBM are long-range dependent, i.e., the summation of their auto-covariances is divergent. Thus, fractal Brownian motion and its increments (fractional Gaussian noise (FGN)) play important roles in modeling signals with self-similarity and long-range dependence. Most studies on this issue are based on FBM.
Wavelet analysis is a common method for signal processing (see e.g., [9,10]) and has been widely used for the fractal analysis of signals due to its multiresolution. Nicolis et al. [11] defined three kinds of wavelet-based entropy for studying the two-dimensional fractional Brownian field. Li et al. [12] used wavelet fractal and twin support vector machine to study the classification of heart sound signals. Ramírez-Pacheco et al. [13] studied fractal signal classification using non-extensive wavelet-based entropy.
The wavelet-based estimator of the Hurst parameter was well-established by Abry et al. (see [14][15][16][17][18][19][20][21]). Compared with other estimators, such as the R/S method, the periodogram, the variogram (semi-parametric or nonparametric estimator) and the parametric method, the wavelet-based estimator performs well in both the statistical and computational sense, and is superior in robustness (see [18,19] and the references therein). Besides, the wavelet-based method can also eliminate some trends (linear trends, polynomial trend, or more) by the property of its vanishing moments [17], which makes the estimator robust to some nonstationarities. More simulation studies for the estimation of Hurst parameter can be seen in [22]. Based on the standard wavelet-based estimator, some robust estimators are proposed. Soltani et al. [23] proposed an improved wavelet-based estimator via the average of two wavelet coefficients of half length apart and taking the logarithm first. Shen et al. [24] proposed a robust estimator of self-similar parameter using wavelet transform, which was less sensitive to some non-stationary traffic conditions. Park & Park [25] introduced a robust wavelet-based estimator which took the logarithm of wavelet coefficients first and averaged them later. Feng & Vidakovic [26] estimated the Hurst parameter via a general trimean estimator on nondecimated wavelet coefficients Kang & Vidakovic [27] proposed a robust estimator of Hurst parameter via medians of log-squared nondecimated wavelet coefficients.
Despite extensive studies of standard wavelet-based estimator proposed by Abry et al., there is still a lack of comprehensive and detailed numerical simulation study on fractal Brownian motion, especially for the selection of parameters and the empirical bias. I have not seen studies on the changes of bias and variance with all different Hs and with different data lengths, which I think is important for the selection of the lower octave bound j 1 , especially at small values of H. j 1 is selected via the minimum mean square error. Thus, this paper conducts a detailed numerical simulation study on the selection of parameters including the following contents.

•
The changes of bias and variance with all different Hs, different data lengths, different j 1 s and different wavelets; • The relations of selected j 1 with data length and H; For the causes of the empirical bias which exist in standard wavelet-based estimator, the following three causes in the case of FBM are concluded.

•
The initialization for initial approximation wavelet coefficients which introduces errors in used detailed wavelet coefficients.

•
The inaccurate bias correction caused by correlations of wavelet coefficients.

•
The method of simulation for FBM is not enough exact that the empirical bias is caused.
There exist many studies on the reduction of empirical bias caused by the first two reasons, but lack of study on determining which is the main cause of empirical bias. It is important for reducing empirical bias via appropriate techniques. Combining with results of parameters selection, this paper analyzes the above three causes of empirical bias in the case of FBM via comparison of estimators and comparison of simulation methods. The results obtained from above numerical simulations can be a reference for future studies and applications in the scaling behavior of signals. Some preliminary results of this study have provided a reference for my previous studies on wavelet-based estimation of Hurst parameters [28][29][30].
This paper is organized as follows. In Section 2, this paper introduces two available estimators for the Hurst parameter, and the initialization methods for the initial approximation wavelet coefficients. The simulation methods of FBM are described in Section 3. The main results are reported and discussed in Section 4 and my works are concluded in Section 5.

Definitions and Properties
Fractional Brownian motion {X(t), t ∈ R} with Hurst parameter H > 0 is a real-valued mean-zero Gaussian process with the following covariance structure: It is a self-similar process with stationary increment. Its wavelet coefficient is defined by ψ(t) is the mother wavelet, which is defined through a scaling function φ(t). Usually we choose ψ(t) as a base function, and we can change it to ψ j,k (t) = 2 −j/2 ψ(2 −j t − k), j, k ∈ Z. The factors 2 j and j are called the scale and octave respectively. please note that FBM is usually denoted by B H . this paper uses the symbol X instead of B H since the methods in this section for FBM can be applicable to a more general process named self-similar process with stationary increment and finite variance [20].
Some key properties of the wavelet coefficient of X are given in the following lemma. The proof of this lemma can be found in [19,20,[31][32][33].
is an orthonormal wavelet with compact support and have N ≥ 1 vanishing moments. The wavelet coefficients of X(t) given in (2) have these properties below, (1) Ed X (j, k) = 0 and d X (j, k) is Gaussian, for any j, k ∈ Z.

Remark 1.
In view of Equation (5), to avoid long-range dependence for d X (j, k), i.e., to ensure that ∑ j,k∈Z E|d X (j, k)d X (0, 0)| < ∞, one needs to choose that is, have at least N = 2. Under this condition, the correlation of d X (j, k) tends rapidly to 0 at large lags.
According to Remark 1 and Equation (5), let the number of vanishing moments N ≥ 2. It is reasonable to impose the following assumptions.

•
For fixed j, the d X (j, ·) are independent and identically distributed; • The processes d X (j, ·) and d X (j , ·), j = j , are independent.
Under these two assumptions, according to Lemma 1, there exist two available least squares estimators for the Hurst parameter. [19,20,34], one of which is first applied in the case of FBM for the bias study of common estimator.

The First Estimator
The first estimator is the standard wavelet-based estimator of Hurst parameter which is proposed by Abry et al and commonly used in applications of various fields. In view of Equations (3) and (4), I can check the following formula.
Take the logarithm: So the estimation of H can be conducted by a linear regression in the left part versus j diagram. The Ed 2 X (j, k) is estimated by where n j stands for the number of d 2 X (j, k) actually available at octave j. Due to different variances of log 2 S(j) at different js, the weighted least squares for this regression model is needed. The weight is the reciprocal of the variance of log 2 S(j).
This can lead to the bias of estimator. Define the variables y 1 (j)s as where g(j) is calculated such that Ey 1 (j) = log 2 ES(j). To ensure the unbiasedness of the estimator, I use y 1 (j) as the response variable instead of log 2 S(j). Moreover, Vary 1 (j) = Var log 2 S(j). The calculation of g(j) and Vary 1 (j) are shown in [18][19][20].

The Second Estimator
As mentioned above, since E log 2 S(j) = log 2 ES(j), the first estimator needs to correct bias. For avoiding this case, the second least squares estimator for Hurst parameter is proposed. This estimator is also originally proposed by Abry et al. [34] and then studied by Park & Park [25] for the purpose of robustness .
Based on Equations (3) and (4), Now take the logarithm first and then the expectation, obtain the following new equation.
So the estimation of H can be conducted by a weighted linear regression in the left part versus j diagram.
where n j stands for the number of d 2 X (j, k) actually available at octave j. Compared with the first estimator, the second estimator changes the order of expectation and logarithmic. The idea of this estimator is first proposed for analyzing the α-stable self-similar processes with infinite second-order statistics and long-range dependence [34].
Define the variables y 2 (j) as We can check that Ey 2 (j) = ELS(j) = E log 2 d 2 X (j, k). Let y 2 (j) be the response variable of weighted linear regression. The unbiasedness of the estimator follows from the unbiasedness of y 2 (j).
Similar to the calculation shown in [18][19][20], the variance of y 2 (j) can be calculated for the weight of regression.
The Var log 2 d 2 X (j, ·) is estimated using its sample variance at each octave.

Explicit Formula of theTwo Estimators
Let j 1 denote the lower bound of j, and j 2 denote the upper bound of j, i.e., the values of j are chosen j 1 ≤ j ≤ j 2 . According to the weighted least squares, the explicit formula of estimators can be obtained as follows,Ĥ where ω(j) = When using the first method, y(j) = y 1 (j) and σ 2 (j) = Vary 1 (j), letĤ 1 denote the first estimator. When using the second method, y(j) = y 2 (j) and σ 2 (j) = Vary 2 (j), letĤ 2 denote the second estimator.

Variance Comparison
The variances ofĤ 1 andĤ 2 can be compared via a simple theoretical analysis. In view of Equation (19), the variance ofĤ can be calculated by When n j is large, recall that the asymptotic form of Vary 1 (j) (see [19]).
Also recall Equation (17), So when n j is large, the asymptotic form of ratio can be obtained, The variance ofĤ 1 is smaller than that ofĤ 2 . Please note that the nondecimated wavelet coefficients have been used in wavelet-based estimator since they can decrease the variance due to their redundancy [26,27]. However, they can also increase the correlations in wavelet coefficients. Then when using nondecimated wavelet coefficients, we should take logarithm first. It is suitable to reduce the variance of the second estimator via nondecimated wavelet coefficients. For further considering the possible outliers caused by logarithmic transform, Kang & Vidakovic [27] suggest using medians for estimation of Hurst parameter in this case. This method is denoted by MEDL.

Calculation of Wavelet Coefficients
According to the multiresolution analysis (MRA), the wavelet coefficients can be calculated by fast pyramidal algorithm. The scaling function φ and the wavelet ψ satisfy so-called two-scale equation: where {u n } and {v n } are two existing sequences belonging to l 2 .
Define the approximation coefficients a X (j, k): In view of above formulas, the a X (0, ·) are obtained via integral. However, in practice, the data we obtained are always discrete and finite. The a X (0, ·) cannot be obtained by integral in continuous time.
When sampling frequency is high and the scale of wavelet transform is small, the typical approach is to set [35][36][37] a X (0, k) = X(k). (27) where {X(k), k ∈ Z, 1 ≤ k ≤ n} is discrete and finite FBM.
In view of Equations (25) and (26), the number of available wavelet coefficients n j decreases by half. Then n j ≈ n2 −j .

Remark 2.
For a wavelet which has time support (finite or decreases very fast as |t| → ∞), an increase in the number of vanishing moments N comes with an enlargement of the time support [20]. In the case of finite data, because of the boundary effects of wavelet transform, this will lead to the decrease of the number of available wavelet coefficients n j at each octave.

The Initialization Method
The discrete sampling for a continuous process X(t) usually implies an irrevocable loss of information on X(t) [35]. So the approach shown in Equation (27) introduces errors in d X (j, k)s. It is known [14,18] that these initialization errors are significant on small octaves but quickly decrease with increasing j. For large j, the initialization issue can be ignored. Veitch et al. [35] introduce an initialization method for discrete time series, which has been proved meaningful for correction of the initialization errors in the case of long-range dependent process.
This initialization method is based on the stochastic version of the Shannon sampling theorem [35,38]. Consider the bandlimited stationary stochastic process {X(t), t ∈ R}, construct X(t) by The X(t) is bandlimited, and has the same spectral density as that of X(t) in the frequency band [−1/2, 1/2] (otherwise is zero). It is easy to check where [35]. Please note that because of the boundary effects, the initialization will lead to the decrease of the number of available wavelet coefficients n j .

Simulation of FBM
For studying the statistical performance of the two estimators in the case of FBM, the numerical simulation of FBM is conducted. Here, this section briefly introduces two simulation methods of FBM [39][40][41][42]. Let {X(t), t ∈ [0, 1]} be a mean-zero fractional Brownian motion with Hurst parameter H ∈ (0, 1).

The Cholesky Method
The Cholesky method uses the Cholesky decomposition of the covariance matrix. The FBM generated by this method is exact in the sense of covariance structure, but this method is slow.

The Circulant Embedding Method
The simulation procedure is based on the method of circulant embedding. The algorithm of circulant embedding was originally proposed by Davies and Harte [39], and was later simultaneously generalized by Dietrich and Newsam (see [40][41][42] and the references therein). It has been regarded as a fast and exact simulation of stationary Gaussian processes [42]. I use this method to generate a fractional Gaussian noise, and construct a fractional Brownian motion via the cumulative sum of generated fractional Gaussian noise [41].
First consider the fractional Gaussian noise, which is a zero-mean stationary Gaussian process Such a stationary Gaussian noise can be efficiently and exactly generated by the method of circulant embedding and fast Fourier transform [41,42]. The fractional Brownian motion {X(t), t ∈ [0, 1]} is constructed on a uniformly spaced grid via the cumulative sum [41]

Simulation Results and Discussions
This section focuses on the numerical study of commonly used wavelet-based estimator (the first estimator) which still lacks of comprehensive and detailed numerical study on estimation of fractal Brownian motion, especially on its empirical bias and the selection of parameters. The second estimator was also compared with the commonly used estimator in this section for the purpose of empirical bias analysis. If not specified, the sample trajectory of FBM used in this section is generated by the circulant embedding method.

Selection of Parameters
It is a key step to select octaves js and the number of vanishing moments N (or wavelet) before estimation. First this subsection studies the selection of these parameters for later estimations. For octaves js, the lower bound j 1 and the upper bound j 2 need to be determined. The j 2 is chosen as the largest possible. In practice, it is set equal to where n denotes the data length and C is a constant (with value log 2 (2N + 1) corresponding to the length of the support of the wavelet [20]). As the discussion in Section 2, the initialization for a X (0, k) given in (27) introduces errors in the d X (j, k). It is known [14,18] that initialization errors are significant on small octaves but decrease with increasing j. So small octave cannot be chosen as j 1 . Based on prior studies [18,20], and this paper selects j 1 by the minimum mean square error (MSE), where the MSE is defined as It allows the tradeoff between variance and bias. The results for the selection of j 1 are shown in Table 1 and Figure 1. Figure 1 shows that the increase of j 1 causes the decrease of bias and the increase of standard deviation for all Hs. So it is suitable to choose the j 1 by the minimum of MSE. From Table 1, when H > 0.5, j 1 is chosen j 1 = 3 by minimum MSE. When 0.4 ≤ H ≤ 0.5, the RMSE of j 1 = 3 is close to that of chosen j 1 = 4. So considering most Hs, j 1 = 3 should be chosen in the case of FBM.
Please note that the results of Figure 1 and Table 1 are based on long series. In this case, the variances of all Hs are small. For small values of H, the bias is large, and the MSE is mainly determined by the bias. So the estimator of small H trends to select large j 1 which can lead to small bias. Now I study the effects of data length and the selection of j 1 at different data lengths. The results of this issue are shown in Figure 2 and Table 2. Figure 2 shows that the data length has little effect on the bias, but its decrease causes the increase of standard deviation for all Hs. The increase of standard deviation may affect the selection of j 1 . Thus, continue to use the minimum MSE to select j 1 at different data lengths for the tradeoff between variance and bias. The results of selection are shown in Table 2. From Table 2, it can be seen that the selected j 1 increases with the increase of data length, and the smaller the value of H, the faster the increase. Based on simulation results, the following formula is given for explanation.
where Bias(H, j 1 ) denotes the bias of estimator which decreases with the increase of H and the increase of j 1 . var(j 1 , n) denotes the variance which decreases with the decrease of j 1 and the increase of n. When n increases, the variance becomes smaller, the selected j 1 trends to increase for the tradeoff between variance and bias. The smaller the value of H, the larger the bias, and the more the selected j 1 increase.
For the wavelet, this paper chooses the classical Daubechies wavelets, which are orthonormal and have compact support. According to Remark 1, the number of vanishing moments must be chosen N ≥ 2. For analyzing the effect of N, I use N = 1 ∼ 8 to estimate the Hurst parameter of FBM. The results are shown in Figure 3.
From Figure 3, when N ≥ 2, the increase of N makes no improvements to the performance of the estimator. Besides, according to Remark 2, large N will cause the loss of available wavelet coefficients. So appropriately we should choose N = 3.
Finally, this subsection studies the performance of this estimator using various wavelets for further chosen of wavelet. The results are shown in Figure 4. db3 stands for Daubechies wavelet with three vanishing moments. sym4 stands for Symlets wavelet with four vanishing moments. dmey stands for discrete Meyer wavelet. bior3.1 stands for biorthogonal spline wavelets with orders N r = 3 (vanishing moments) and N d = 1. Since the Symlets wavelet with three vanishing moments has the same filters as db3, this part uses this kind of wavelet with four vanishing moments. The first three wavelets are orthogonal and have compact support. The last wavelet is biorthogonal. It can be seen in Figure 4 that performance using the first three wavelets are almost the same except for the standard deviation of dmey at H = 0.95. The biorthogonal spline wavelet performs worse than orthogonal wavelets except at H ≤ 0.1. This is due to the large bias caused by strong correlations of biorthogonal wavelet coefficients, and is consistent with the conclusion of Lemma 1. We need to use orthogonal compact supported wavelet to control these correlations via vanishing moments. Table 1. Estimation quality for FBM series. On the left, the j 1 for minimum MSE and its Bias, Std, RMSE is given. On the right, the same quantities with j 1 = 3 are also given for comparison. RMSE is the square root of MSE. All the results are the estimated versions of Bias, Std, RMSE for 1000 independent copies of FBM with length n = 2 18 . The used wavelet is the Daubechies wavelet with N = 3 vanishing moments.

Results and Discussions on Empirical Bias
This subsection conducts a detailed numerical analysis on the empirical bias exits in the commonly used wavelet-based estimator (the first estimator). Based on previous analysis, the following three possible causes of empirical bias are concluded.

•
The initialization for a X (0, k) given in (27) introduces errors in d X (j, k), and the initialization errors are significant on small octaves but decrease with increasing j.

•
The inaccurate bias correction for E log 2 S(j) = log 2 ES(j) (under independent assumptions) caused by correlations of wavelet coefficients.

•
The method of simulation for FBM is not enough exact that the empirical bias is caused.
From results of Section 4.1, I have the following information on empirical bias • The increase of N and change of wavelet made no improvements to the empirical bias. The chosen of biorthogonal wavelet makes the empirical bias worse.

•
The increase of j 1 leads to the decrease of empirical bias.

•
The empirical bias increases with the decrease of H. when choosing j 1 = 3 and N = 3, the empirical bias of estimatorĤ 1 can be ignored for H ≥ 0.4. So the estimatorĤ 1 is suitable to detect the long-range dependence (can be described by H > 0.5).
The fact that increase of j 1 leads to decrease of empirical bias is consistent with the first cause. As we know, the larger the value of H is, the smoother the sample path of FBM is, and the more exact the initialization given in (27) is. It is consistent with the fact that the empirical bias increases with the decrease of H. So I conclude that the initialization errors caused by (27) contribute to the empirical bias.
The first information indicates the empirical bias is related to correlations of wavelet coefficients. However, this effects can be fixed (maybe eliminated) via the selection of orthogonal compact supported wavelet.
Next, after choosing the orthogonal compact supported wavelet (db3) and fixing j 1 = 3, this study analyzes the latter two causes via comparing with the second estimator and comparison of simulation methods respectively.

Comparison of Estimators
Since the unbiasedness of the second estimatorĤ 2 is get naturally without independence assumptions of wavelet coefficients. This part comparesĤ 2 withĤ 1 for studying its empirical bias. The length of simulation data is n = 2 18 . (j 1 , j 2 ) are chosen (3,15). The wavelet coefficients are computed using the classical Daubechies wavelet with N = 3 vanishing moments. The results are shown in Figure 5 and Table 3.   Table 3 shows the results for the estimator of ratio given in Equation (22). It indicates that the variance ofĤ 2 is about twice that ofĤ 1 , which roughly satisfy the theoretical results given in (22). From Figure 5, it can be seen that when H < 0.4, bothĤ 1 andĤ 2 have the same obvious bias despite the theoretical unbiasedness of the two estimators under independence assumptions of wavelet coefficients. Besides, the same as the results shown in Table 3, the standard deviation (Std) ofĤ 2 is larger than that ofĤ 1 .
Because the empirical bias also exists inĤ 2 whose unbiasedness is get naturally, and is the same as that ofĤ 2 . I conclude that the empirical bias ofĤ 1 is not due to the inaccurate bias correction for E log 2 S(j) = log 2 ES(j) caused by correlations of wavelet coefficients.
Besides, considering the variances of the two estimators, we should choose the first estimatorĤ 1 for the estimation of Hurst parameter.

Comparison of Simulation Methods
For the third cause, this part appliesĤ 1 to the FBM that is exactly generated by the Cholesky method for comparison. The results are shown in Figure 6. From Figure 6, it can be seen that estimations for the FBM respectively generated by the Cholesky method and the circulant embedding method has almost the same empirical bias. I conclude that the method of simulation is not the cause of empirical bias.

Analysis of the Initialization Method
It has been shown above that the empirical bias ofĤ 1 is due to the initialization errors caused by (27). The initialization method given in (29) has proved effective for errors in the case of long-range dependent process [35]. Although FBM is not a bandlimited stationary stochastic process, I tend to check whether this method is suitable for FBM. This subsection applies the estimatorĤ 1 with this initialization to FBM for analysis, and compares it with the initialization by itself (or by Equation (27)). The length of simulation data is n = 2 18 . (j 1 , j 2 ) are chosen (3,15). The wavelet coefficients are computed using the classical Daubechies wavelet with N = 3 vanishing moments. The results are shown in Figure 7.  Figure 7 shows that both Biases and Stds for the two initializations are almost the same. It indicates that the initialization method given in (29) is inefficient in the case of FBM. Beside, it is known that the method Init2 will lead to the decrease of the number of available wavelet coefficients n j for the boundary effects, which may result in the increase of the variance of estimator. So I suggest choosing the initialization for a X (0, k) given in (27) in the future work.

Analysis of Noise Effects
At last, this paper adds this subsection for analysis of noise effects on the first estimator, which possibly happen in the real data. Various independent and identically distributed noises are added to the generated FBM for this issue. The signal-to-noise ratio (SNR) is defined as follows.
where ε means noise. Set SNR = 2 in this subsection. The results are shown in Figure 8. From Figure 8, I found that noises have some effects on the performance, and can lead to increase of bias. The effects of Gaussian and uniform noises are almost the same. The Cauchy noise can cause more increase of bias than the other two noises.

Conclusions
This paper focuses on the numerical simulation study of wavelet-based estimators in the case of FBM concluding the selection of parameters and the analysis of empirical bias which have not been studied comprehensively in previous studies. This study adds to previous numerical simulation studies of wavelet-based estimators which are not comprehensive in the case of FBM.
Results of the parameter selection showed that the increase of lower bound j 1 causes the decrease of bias and the increase of standard deviation for all Hs, and suggested j 1 = 3 via the minimum mean square error at a long data length n = 2 18 . In addition, it was also found that the empirical bias increased with the decrease of H and could be ignored for H ≥ 0.4 when j 1 = 3 and N = 3. The effects of n on performance and relations of selected j 1 with n were also concluded via simulation studies.
It was shown that the data length had little effect on the bias, but its decrease caused the increase of standard deviation for all Hs. The selected j 1 increased with the increase of data length, and the smaller the value of H, the faster the increase. For the vanishing moments N, when N ≥ 2, the increase of N made no improvements to the performance of estimator. The change of orthogonal wavelets made no improvements to the empirical bias. The chosen of biorthogonal wavelet made empirical bias worse.
The analysis of empirical bias was conducted first via comparison of two available estimators and comparison of simulation methods. The results showed that the empirical bias was due to the initialization errors caused by discrete sampling, and was not related to simulation methods. When choosing an appropriate orthogonal compact supported wavelet, the empirical bias was almost not related to the inaccurate bias correction caused by correlations of wavelet coefficients. I regret to note that the initialization method given in (29), which has proved effective in the case of long-range dependent process, made no improvements to the empirical bias. All these results will be a guide for my future studies.