A Least Squares Estimator for Gradual Change-Point in Time Series with m -Asymptotically Almost Negatively Associated Errors

: As a new member of the NA (negative associated) family, the m -AANA ( m -asymptotically almost negatively associated) sequence has many statistical properties that have not been developed. This paper mainly studies its properties in the gradual change point model. Firstly, we propose a least squares type change point estimator, then derive the convergence rates and consistency of the estimator, and provide the limit distributions of the estimator. It is interesting that the convergence rates of the estimator are the same as that of the change point estimator for independent identically distributed observations. Finally, the effectiveness of the estimator in limited samples can be veriﬁed through several sets of simulation experiments and an actual hydrological example.


Introduction
Change point problems originally arose from quality control engineering [1].Because of the heterogeneities in real data sequences, the problems of change point estimation and detection in real data sequences have drawn attention.Scholars have proposed many methods to solve various data problems (see [2][3][4][5][6][7][8][9]). The purpose of change point detection and estimation is to divide a data sequence into several homogeneous segments, and the theories have been applied in many fields like finance [10], medicine [11], environment [12] and so on.For some special problems, such as hydrological and meteorological problems, most of the change point that occur are gradual rather than abrupt.Therefore, the research on the problems of gradual change is very meaningful.
In earlier research, most theories of gradual changes were derived from two-stage regression model [13,14].Hušková [15] used the least-squares method to estimate an unknown gradual change point and the model is as follows: For n > 1, observations X 1 , X 2 , • • • , X n shall satisfy: where (a) + = max(0, a), µ, δ = 0, m is the location of the change, e 1 , e 2 , • • • , e n are i.i.d.random variables with Ee i = 0, var(e i ) = σ 2 , E|e i | 2+∆ < ∞ for some ∆ > 0. In the same year, Jarušková [16] conducted a log likelihood ratio test of the model on the basis of Hušková [15], and obtained that the asymptotic distribution of the test statistic is Gumbel distribution.
Later, Wang [17] extended the error terms e i from the traditional i.i.d.sequences to the long memory i.i.d sequences and obtained the consistency of the estimator of the mean gradual change point and the limit distribution of the test statistic.Then, Timmermann [18] tested the gradual change in general random processes and update processes respectively, and also obtained the limit distribution of test statistic.It is well known that most of the previous studies tend to focus on the abrupt change.But for some important time series, such as temperature and hydrology, there is a greater possibility of gradual mean change.
Therefore, this paper considers the gradual change problem and the following model based on Hušková [15] is constructed: where The least squares method is a classic method (see [19,20]).In model ( 1), the estimators of k * and τ * based on the least-squares-type are, respectively: where: For convenience of illustration, x tj and xj are It can be seen that most previous studies have one thing in common, namely that the error terms are independently and identically distributed.However, the constraints of independent sequences are quite strict, and in practical problems, many time series models may not meet the independent conditions.This leads to a classical question of whether the error terms in a model can be generalized to some more general cases.Therefore, the idea of extending the error terms to the m-AANA sequences is proposed in this paper.Before presenting the main asymptotic results, it is necessary to understand the following definition:

Definition 1 ([21]
).There is an fixed integer m ≥ 1, the random variable sequence {Y n , n ≥ 1} is called m-AANA sequence if there exists a non negative sequence q(n) → 0 as n → ∞ such that: for all n ≥ 1, k ≥ m and for all coordinatewise nondecreasing continuous functions f and g whenever the variances exist.The sequence {q(n), n ≥ 1} is called the mixing coefficients of {Y n , n ≥ 1}.It is not difficult to see that NA family includes NA [22], m-NA [23], AANA [24], and independent sequences.
Scholars have great interest in the sequences of NA family.For example, NA sequence is opposite to PA (positively associated) sequence, but NA sequence has better property than other existing ND (negatively dependent) sequence: under the influence of increasing functions, the disjoint subset of NA random variable sequence is still NA.Therefore, NA sequences appeared in many literary works.Przemysaw [25] obtained the convergence of partial sums of NA sequences; Yang [26] obtained Bernstein type inequalities for NA sequences; and Cai [27] obtained the Marcinkiewicz -Zygmund type strong law of large numbers of NA sequences.
There are many classical studies about m-NA and AANA sequences as generalizations of NA sequences.Hu [23] discussed the complete convergence of m-NA sequences; Yuan [28] proposed a Marcinkiewics-Zygmund type moment inequality for the maximum partial sum of AANA sequences.
For the m-AANA sequence mentioned in this paper, it is a relatively new concept and its research results are less than those of other sequences in NA family.Ko [29] extended Hájek-Rényi inequalities and the strong law of large numbers of Nam [21] to Hilbert space; Ding [30] proposed the CUSUM method to estimate the abrupt change point in a sequence with the error term of m-AANA process, but for a more general case, he did not discuss the gradual change.Therefore, another reason for constructing the main ideas of this paper is is to fill the research gaps of Ding [30].
The rest is arranged as follows: Section 2 describes the main results.A small simulation study under different parameters and an example is provided in Section 3. Section 4 contains the conclusions and outlooks, and the main results are proved in Appendix A.

Main Results
For our asymptotic results, assume the model to satisfy the following assumptions: In addition, there exists strictly ascending sequence of natural numbers {n k } with n 0 = 0 and Remark 1. Assumption 1 and 2 are the underlying assumptions, and if these assumptions cannot be met, serious problems such as bias in the estimates, inconsistency in the estimates, and invalidity of the estimates may arise in the proof process.
In addition, what can be verified is that the m-AANA random variables satisfy the central limit theorem when Assumption 1-3 are ture.
Remark 2. Interestingly, these convergence rates are the same as those in Hušková [31], but independent sequences are contained in m-AANA sequences.

Simulations and Example
In this section, assume that there is only one gradual change point of mean at k * in (1), such that: where ω 1 , ω 2 ≥ 0, ω 1 + ω 2 = 1, I n is a identity matrix, Σ n satisfies: where For comparison, we take µ = 1, τ * = 0.5, ω 1 = ω 2 = 0.5, ρ = −0.6 and δ n = (n −0.1 , 1, n 0.1 , n 0.2 ), γ = (0.75, 0.5, 0.25).It is worth noting that after verification, under these conditions, Σ n is a non singular matrix, which can be used for simulation experiments.Then, 1000 simulation processes are carried out.Figures 1-3 are based on the simulation of δ n = n −0.1 , 1, n 0.1 , n 0.2 from left to right.In Figures 1-3, the ordinate axis represents the value of τ * − τ * , and the abscissa is the size of sample n.It is not difficult to find that the larger the gradual coefficient γ, the worse the performance of our estimator.And as n increases, the estimation effect becomes better, this also implies the result in Theorem 1.For different ρ and τ * , similar results can be obtained, which will not be repeated here.
Finally, we do the change-point analysis based on the sequence of monthly average water levels of Hulun Lake in China from 1992 to 2008.For the convenience of description, we subtract the median of the observations.Then, Figures 4 and 5 can be obtained:  It can be seen from Figure 5 that we have no sufficient reason to believe that the water levels of Hulun Lake does not meet the conditions of m-AANA sequence.According to Sun [32], the water levels of Hulun Lake have declined rapidly since 2000, due to the influence of the monsoon, changes in precipitation patterns, and the degradation of frozen soil.Using the method in this paper, Table 1 shows the values of k * under different γ.
The position of 105 represents the year 2000.The average water levels of Hulun Lake have been decreasing since 2000.

Conclusions
This paper proposes a least-squares-type estimator of the gradual change point of sequence based on m-AANA noise and study the consistency of the estimator.At the same time, the convergence rates are obtained in Theorem 2: Therefore, Theorems 1 and 2 generalize the results in Hušková [31].Furthermore, due to the asymptotic normality of m-AANA sequences, this paper also derives the limit distributions of the estimator under different γ in Throrem 3. It can be known that the inappropriate γ has a great impact on the change point estimator.If γ → 0, the gradual change point in (2) may be very similar to the abrupt change point, and lose the gradual change properties.If γ → 1, the dispersion of data may be very large, which is not conducive to determining the correct change point position.So we conduct several simulations to verify the results, and the results show that the larger γ is, the worse the estimation effect is, but the consistency is still satisfied.Finally, the paper discusses the gradual change of water levels of Hulun Lake in Section 3, and the estimator successfully finds the position of the change point.
There is also some regret in this paper.For example, for series where the variances cannot be estimated, such as a financial heavy-tailed sequence with a heavy-tailed index κ n < 2, the location of the change point cannot be obtained using the method in this paper.Therefore, more suitable methods should be promoted in future works.Moreover, we suspect that there may be more common cases in the selection of e i , this is also one of the key points to be solved in the future.
can be equivalent to: after the elementary operation, V k (γ) can be divided into five parts: where: and At this moment, a lemma is needed.

Figure 4 .
Figure 4.The plot graph of monthly average water levels of Hulun Lake from 1992 to 2008.

Table 1 .
The values of k * based on water level.