Abstract
We consider parameter estimation for linear stochastic differential equations with independent experiments observed at infrequent and irregularly spaced follow-up times. The maximum likelihood method is used to obtain an asymptotically consistent estimator. A kernel-weighted score function is proposed for the parameter in drift terms. The strong consistency and the rate of convergence of the estimator are obtained. The numerical results show that the proposed estimator performs well with moderate sample sizes.
1. Introduction
Linear stochastic differential equations (LSDEs) are frequently used to model the dynamic behavior of complex systems. In many real-world applications, the parameters that define the system must be estimated from data. For example, geometric Brownian motion (GBM) is one of the most popular stochastic processes and an effective instrument for modeling and predicting random changes in stock prices [1,2]. Pharmacokinetic and pharmacodynamic models contain both deterministic and stochastic components: although drug concentrations follow predictable trends, it is not always possible to determine the precise concentration at a particular time [3].
In biometrics, a GBM model and an estimation procedure were developed for predicting the height growth of even-aged forest stands as part of a methodology for modeling growth in forest plantations [4].
Because of their growing use in a variety of domains, parameter estimation problems for stochastic differential equations have received considerable attention recently. Several estimation methods have been proposed, such as the least squares method [5,6,7,8,9], the maximum likelihood method [10,11,12,13,14], and the numerical approximation approach [15]. Other methods include generalized method of moments procedures [16,17], the local linearization method [11,18], and MCMC methods [19].
Assume that n independent and identically distributed paths are observed. This situation can arise in pharmacokinetics, for example, when a number of patients are followed: each patient is given a bolus of the medication, and the “path” of its diffusion through the body can be observed [3]. Such observations are typically sparse and recorded only at infrequent and irregularly spaced follow-up times, so the above methods are no longer applicable. For this setting, we develop a computationally efficient method that applies kernel smoothing to the parameter estimation of LSDEs. The core of the proposed approach is to “smooth” an individual’s contributions to the likelihood according to the distance between their observation times and the time of interest. Smoothing is thus carried out on an individual basis, in contrast to the population level, where all individuals are given the same weights. With a suitable choice of bandwidth, the consistency and asymptotic normality of the proposed estimator can be obtained. One can refer to [20,21] for statistical inference of diffusion processes.
In future research, we may consider parameter estimation for a more general drift term such as and nonparametric estimation of the drift function where is a measurable function, based on independent and identically distributed observations.
Our paper is organized as follows. In Section 2, we propose an estimator for the drift parameter of the LSDE; we also obtain the consistency of and and show their asymptotic normality, where the convergence rate is , which is slower than . In Section 3, numerical simulations are performed; the results indicate that the large-sample approximations are adequate in practice. Section 4 concludes the paper.
2. Models and Methods
2.1. Description of Models
Consider an LSDE model as follows
where is an unknown parameter, is a constant, and are independent standard Brownian motions. is characterized by the following properties: (1) ; (2) has independent increments, that is, for every , the future increments , , are independent of the past values , and ; (3) is continuous in t. Under the Lipschitz and linear growth conditions, the LSDE (1) has a unique strong solution ,
Let be n independent copies of . It is well known (see, e.g., [22]) that, for given t, follows a lognormal distribution
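For concreteness, suppose the LSDE (1) is the geometric Brownian motion mentioned in the Introduction. This is an assumption, though it is consistent with the lognormal law stated above. Under it, the model, its strong solution, and the marginal law at a fixed time would take the following form, where the notation $X_i(t)$ and a common deterministic initial value $x_0 > 0$ are illustrative choices.

```latex
\begin{align*}
dX_i(t) &= \mu X_i(t)\,dt + \sigma X_i(t)\,dW_i(t), \qquad X_i(0) = x_0, \quad i = 1,\dots,n, \\
X_i(t) &= x_0 \exp\Bigl\{\bigl(\mu - \tfrac{1}{2}\sigma^2\bigr)t + \sigma W_i(t)\Bigr\}, \\
\ln X_i(t) &\sim N\Bigl(\ln x_0 + \bigl(\mu - \tfrac{1}{2}\sigma^2\bigr)t,\ \sigma^2 t\Bigr).
\end{align*}
```

The second line follows from Itô's formula applied to $\ln X_i(t)$, and the third from $W_i(t) \sim N(0, t)$.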
We aim to use the observations to estimate , where the observations consist of , and . The probability density of the ith subject at time point is
where .
If the observations are continuous, consider a time point . We know that are independent and identically distributed variables on the same probability space. Then we get the likelihood function
where . The log-likelihood function is
where and the score function is
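As a worked illustration, assume the model is the geometric Brownian motion $dX_i(t) = \mu X_i(t)\,dt + \sigma X_i(t)\,dW_i(t)$ with known $\sigma$ and common initial value $x_0$ (illustrative notation, not necessarily the authors'). If every path were observed at the same fixed time $t$, the likelihood, log-likelihood, and score could be written out explicitly. Write $Y_i = \ln(X_i(t)/x_0)$ and $\nu = \mu - \sigma^2/2$.

```latex
\begin{align*}
L_n(\mu) &= \prod_{i=1}^{n} \frac{1}{X_i(t)\,\sigma\sqrt{2\pi t}}
            \exp\Bigl\{-\frac{(Y_i - \nu t)^2}{2\sigma^2 t}\Bigr\}, \\
\ell_n(\mu) &= -\sum_{i=1}^{n} \ln\bigl(X_i(t)\,\sigma\sqrt{2\pi t}\bigr)
            - \frac{1}{2\sigma^2 t}\sum_{i=1}^{n} (Y_i - \nu t)^2, \\
U_n(\mu) &= \frac{\partial \ell_n}{\partial \mu}
          = \frac{1}{\sigma^2}\sum_{i=1}^{n} (Y_i - \nu t), \\
U_n(\hat\mu) = 0 \;&\Longrightarrow\; \hat\mu = \frac{1}{nt}\sum_{i=1}^{n} Y_i + \frac{\sigma^2}{2}.
\end{align*}
```

The root requires every $X_i(t)$ to be observed at the same time $t$, which is exactly what fails under sparse, irregular follow-up.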
2.2. Kernel Estimation with Forward and Lagged Observation
The data are usually not observed continuously, and it is almost impossible for every individual to be observed at . Hence is not computable from the observations. We propose a method that formalizes the forwarding and lagging strategy, with kernel weighting enabling the use of all available forward and lagged observations. We “smooth” each observation’s contribution to the likelihood according to the distance between its observation time and the time of interest. If data continue to be collected on a subject after an observation has occurred, as in the case of recurrent events, the kernel is used to impute missing values from both forward and backward-lagged observations. We construct a smoothed log-likelihood function by kernel estimation
where is the variance of , , is the bandwidth, and the kernel function is a symmetric probability density with support and mean 0 and with a bounded first derivative. In addition, , where is twice continuously differentiable and strictly positive for . The score function is given by
Assume that the following conditions hold:
- (A.1)
- is an open set of , and for some and is the true parameter.
- (A.2)
- is twice continuously differentiable.
- (A.3)
- is a symmetric density function satisfying In addition, , , .
Condition (A.1) is a standard assumption for proving consistency, and condition (A.2) ensures that the score function admits a second-order Taylor expansion. Our method depends on a proper choice of bandwidth, which is specified in condition (A.3). The estimator is obtained by solving Equation (5), with the kernel bandwidth chosen to ensure consistency.
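As a rough illustration of the kernel-weighting idea, the following Python sketch solves one plausible smoothed score in closed form. It assumes the LSDE is a geometric Brownian motion with known σ and common initial value x0, treats each observation X_i(T_ij) as a surrogate for X_i(t), and weights it by K_h(T_ij − t) with an Epanechnikov kernel. The function names, the data layout, and this particular form of the score are assumptions of the sketch and are not taken from Equations (5) and (6).

```python
import math


def epanechnikov(u):
    """Epanechnikov kernel: 0.75 * (1 - u^2) on [-1, 1], zero elsewhere."""
    return 0.75 * (1.0 - u * u) if abs(u) <= 1.0 else 0.0


def kernel_drift_estimate(subjects, t, h, sigma, x0):
    """Kernel-weighted drift estimate at target time t (hypothetical helper).

    subjects: list of per-subject lists of (time, value) pairs.
    Assumed smoothed score (illustrative only):
        sum_ij K_h(T_ij - t) * (ln(X_ij / x0) - nu * t) = 0,
    with nu = mu - sigma^2 / 2, solved for nu in closed form.
    """
    num = 0.0
    den = 0.0
    for obs in subjects:
        for time, value in obs:
            w = epanechnikov((time - t) / h) / h  # K_h(T_ij - t)
            num += w * math.log(value / x0)
            den += w
    if den == 0.0:
        raise ValueError("no observation falls within one bandwidth of t")
    nu_hat = num / (t * den)
    return nu_hat + 0.5 * sigma ** 2


# Sanity check on paths whose Brownian term is zero, observed symmetrically around t.
mu, sigma, x0, t, h = 1.0, 0.5, 1.0, 0.5, 0.2
nu = mu - 0.5 * sigma ** 2
subjects = [
    [(t - 0.1, x0 * math.exp(nu * (t - 0.1)))],
    [(t + 0.1, x0 * math.exp(nu * (t + 0.1)))],
]
mu_hat = kernel_drift_estimate(subjects, t, h, sigma, x0)
print(mu_hat)  # should recover mu up to floating-point rounding
```

Because the kernel is symmetric, the lagged and forward observations in the check offset each other, which is the mechanism that keeps the smoothing bias small for small bandwidths.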
Lemma 1.
Under conditions (A.1)–(A.3), we have
as.
Proof of Lemma 1.
From the smoothed likelihood function (5), we have the smoothed scoring function
Let . We have
Define . Obviously, . By taking expectations together with Taylor expansion, and , we have
From condition (A.3), we have . □
The following theorem shows the consistency of the proposed estimator obtained based on solving Equation (5).
Theorem 1.
Under conditions (A.1)–(A.3), admits the consistency as .
Proof of Theorem 1.
Solving Equation (5), we have
By properties of the kernel function , we have
where is some constant. By Equation (3), we have
Hence, we have
where is some constant. By Khinchin’s law of large numbers, we have
which goes to zero, as . □
The following theorem shows the asymptotic normality of .
Theorem 2.
Assume conditions (A.1)–(A.3) hold, is consistent, and the asymptotic distribution of satisfies
as , where
and
Proof of Theorem 2.
Let be a strongly consistent sequence of , i.e., , and . We can seek a solution of the log-likelihood function , and is a strongly consistent sequence. Note that
we denote . Expand as
Let , we have . Then
Multiply both sides of Equation (7) by , we denote
Then
Hence we give the variance of ,
From Lemma 1, we have that . By the central limit theorem, , and we denote . Then
Assume that for , , where means the probability, is continuous for , and exists. Then
Using a change of variables, we have .
With notation , we have
From the Lyapunov central limit theorem, we have that converges to a continuous Gaussian process . Hence, we have
□
Remark 1.
When there are several observations , one can estimate the drift parameter μ by a standard maximum likelihood method. Let
for and . Thus, conditional on the observation times,
and they are independent. For example, if we reparameterize μ as , the MLE for ν would be given by
where . Then μ is estimated by .
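The estimator in Remark 1 can be sketched in Python under the geometric Brownian motion assumption with known σ. Log-increments between consecutive observation times of the same subject are then independent normal variables with mean ν times the time gap, which gives a closed-form MLE. The function name `increment_mle`, the data layout, and the restriction to increments between consecutive observations are assumptions of this sketch rather than details confirmed by the text.

```python
import math


def increment_mle(subjects, sigma):
    """MLE of mu from log-increments of GBM paths with known sigma (hypothetical helper).

    subjects: list of per-subject lists of (time, value) pairs.
    Each log-increment Y over a gap D is N(nu * D, sigma^2 * D), so the
    Gaussian MLE is nu_hat = sum(Y) / sum(D), and mu_hat = nu_hat + sigma^2 / 2.
    Subjects with a single observation contribute no increment.
    """
    total_y = 0.0
    total_d = 0.0
    for obs in subjects:
        ordered = sorted(obs)
        for (t0, x_prev), (t1, x_next) in zip(ordered, ordered[1:]):
            total_y += math.log(x_next / x_prev)
            total_d += t1 - t0
    if total_d == 0.0:
        raise ValueError("need at least one subject with two distinct observation times")
    return total_y / total_d + 0.5 * sigma ** 2


# Sanity check on paths whose Brownian term is zero.
mu, sigma, x0 = 1.0, 0.5, 1.0
nu = mu - 0.5 * sigma ** 2
subjects = [
    [(0.2, x0 * math.exp(nu * 0.2)), (0.7, x0 * math.exp(nu * 0.7))],
    [(0.5, x0 * math.exp(nu * 0.5))],  # single observation: no increment
]
mu_hat = increment_mle(subjects, sigma)
```

In this form a subject with only one observation adds nothing to either sum, which is one way to see why such an estimator degrades when observations are very sparse.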
Remark 2.
When there is only one observation , the estimator proposed in Equation (8) is not effective. In contrast, our estimator performs reasonably well in this extreme case, and an explicit asymptotic variance can be given for it, which is .
3. Simulation
In this section, we examine the kernel estimator, which uses both forward and backward-lagged observations. We generate 1000 datasets, and each dataset consists of subjects, with different bandwidths (BD). The process is generated through model (1); we set the initial condition , and . Then the solution is
where is a standard Brownian motion. The number of observation times for each subject is Poisson distributed with an intensity rate of 5. The observation times of each individual are generated from a uniform distribution, Unif. Results for other parameter choices are omitted because they are essentially identical. All simulations were performed in R (version 4.2) on a laptop with 8 GB of RAM.
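The data-generating step described above can be sketched as follows. The sketch is in Python rather than R and uses only the standard library. It assumes a geometric Brownian motion sampled from its exact solution; the parameter values (mu = 1, sigma = 0.5, x0 = 1), the observation window Unif(0, 1), the number of subjects, and all function names are placeholders chosen for illustration, not the settings of the study.

```python
import math
import random


def poisson(rate, rng):
    """Draw a Poisson variate by Knuth's multiplication method (fine for small rates)."""
    limit, k, p = math.exp(-rate), 0, 1.0
    while True:
        p *= rng.random()
        if p <= limit:
            return k
        k += 1


def simulate_subject(mu, sigma, x0, rate, t_max, rng):
    """Return sorted (time, value) pairs for one GBM path observed at random times."""
    n_obs = poisson(rate, rng)
    times = sorted(rng.uniform(0.0, t_max) for _ in range(n_obs))
    obs, w, prev = [], 0.0, 0.0
    for t in times:
        # Brownian increment over (prev, t] has variance t - prev.
        w += math.sqrt(t - prev) * rng.gauss(0.0, 1.0)
        obs.append((t, x0 * math.exp((mu - 0.5 * sigma ** 2) * t + sigma * w)))
        prev = t
    return obs


rng = random.Random(2022)  # fixed seed for reproducibility
data = [
    simulate_subject(mu=1.0, sigma=0.5, x0=1.0, rate=5.0, t_max=1.0, rng=rng)
    for _ in range(200)
]
mean_count = sum(len(obs) for obs in data) / len(data)
```

Sampling from the exact solution avoids discretization error, so any bias seen in a downstream estimator can be attributed to the estimator and the bandwidth rather than to the simulation scheme.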
Based on Theorems 1 and 2, we obtain a kernel estimator with asymptotically negligible bias and employ bandwidths in the range when calculating using the smoothed likelihood score function (6). The kernel function we choose is the Epanechnikov kernel, $K(x) = \tfrac{3}{4}(1 - x^2)\,\mathbf{1}\{|x| \le 1\}$. Additional simulations (not shown) indicate that the choice of kernel function has little effect on the estimator’s empirical performance.
The simulation results show that the estimates of the parameter in the model are accurate. Table 1 summarizes the main findings from the 1000 simulations. The bias is small and diminishes as the sample size grows. Performance improves with larger sample sizes and smaller bandwidths. The overall parameter estimates are evaluated by the bias and relative bias (RB), which are defined as
where denotes the true parameter.
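A minimal sketch of how these two summaries could be computed from Monte Carlo replicates is given below. It assumes the common definitions, bias as the mean estimate minus the true value and relative bias as the bias divided by the true value; the function name and the sample numbers are hypothetical.

```python
def bias_and_relative_bias(estimates, true_value):
    """Return (bias, relative bias) of Monte Carlo estimates of a nonzero parameter.

    Assumed definitions: bias = mean(estimates) - true_value,
    relative bias = bias / true_value.
    """
    mean_est = sum(estimates) / len(estimates)
    bias = mean_est - true_value
    return bias, bias / true_value


# Made-up replicate estimates, for illustration only.
bias, rb = bias_and_relative_bias([1.1, 0.9, 1.2], true_value=1.0)
```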
Table 1.
Simulation results with different n and bandwidths.
4. Conclusions
In this paper, we have presented kernel-weighting methods for the estimation of the LSDE model (1) in repeated experiments when the observation times are random and the number of observations per individual is uncertain or even sparse. This extends earlier work, which usually assumed equally spaced observation intervals and could not handle sparse observations. We consider maximum likelihood estimation of the drift parameter and, under conditions (A.1)–(A.3), establish the consistency and asymptotic normality of the proposed estimator. In the numerical studies, we set the true parameter , , and the initial condition for each individual with sparse observations (the number of observations follows a Poisson distribution with mean 5). Using the smoothed score function, we obtain the estimate of the drift parameter.
Author Contributions
All authors contributed equally to this work. All authors have read and agreed to the published version of the manuscript.
Funding
This work was partially supported by NSFC (grant 11871244) and the Fundamental Research Funds for the Central Universities, JLU.
Data Availability Statement
No data were used in this paper.
Conflicts of Interest
The authors declare no competing interests arising during the preparation or publication of this article.
References
- Black, F.; Scholes, M. The pricing of options and corporate liabilities. J. Political Econ. 1973, 81, 637–654.
- Merton, R.C. Theory of rational option pricing. Bell J. Econ. Manag. Sci. 1973, 4, 141–183.
- Donnet, S.; Samson, A. A review on estimation of stochastic differential equations for pharmacokinetic/pharmacodynamic models. Adv. Drug Deliv. Rev. 2013, 65, 929–939.
- Garcia, O. A stochastic differential equation model for the height growth of forest stands. Biometrics 1983, 39, 1059–1072.
- Hu, Y.; Long, H. Least squares estimator for Ornstein–Uhlenbeck processes driven by α-stable motions. Stoch. Process. Their Appl. 2009, 119, 2465–2480.
- Hu, Y.; Nualart, D.; Zhou, H. Drift parameter estimation for nonlinear stochastic differential equations driven by fractional Brownian motion. Stochastics 2019, 91, 1067–1091.
- Long, H.; Ma, C.; Shimizu, Y. Least squares estimators for stochastic differential equations driven by small Lévy noises. Stoch. Process. Their Appl. 2017, 127, 1475–1495.
- Neuenkirch, A.; Tindel, S. A least square-type procedure for parameter estimation in stochastic differential equations with additive fractional noise. Stat. Inference Stoch. Process. 2014, 17, 99–120.
- Gallant, A.R.; Long, J.R. Estimating stochastic differential equations efficiently by minimum chi-squared. Biometrika 1997, 84, 125–141.
- Elerian, O.; Chib, S.; Shephard, N. Likelihood inference for discretely observed nonlinear diffusions. Econometrica 2001, 69, 959–993.
- Shoji, I.; Ozaki, T. A statistical method of estimation and simulation for systems of stochastic differential equations. Biometrika 1998, 85, 240–243.
- Shimizu, Y. M-estimation for discretely observed ergodic diffusion processes with infinitely many jumps. Stat. Inference Stoch. Process. 2006, 9, 179–225.
- Shimizu, Y.; Yoshida, N. Estimation of parameters for diffusion processes with jumps from discrete observations. Stat. Inference Stoch. Process. 2006, 9, 227–277.
- Aït-Sahalia, Y. Maximum likelihood estimation of discretely sampled diffusions: A closed-form approximation approach. Econometrica 2002, 70, 223–262.
- Milshtein, G.N. A method of second-order accuracy integration of stochastic differential equations. Theory Probab. Its Appl. 1979, 23, 396–401.
- Andersen, T.G.; Sørensen, B.E. GMM estimation of a stochastic volatility model: A Monte Carlo study. J. Bus. Econ. Stat. 1996, 14, 328–352.
- Hu, Y.; Xi, Y. Estimation of all parameters in the reflected Ornstein–Uhlenbeck process from discrete observations. Stat. Probab. Lett. 2021, 174, 109099.
- Shoji, I. Approximation of continuous time stochastic processes by a local linearization method. Math. Comput. 1998, 67, 287–298.
- Martin, J.; Wilcox, L.C.; Burstedde, C.; Ghattas, O. A stochastic Newton MCMC method for large-scale statistical inverse problems with application to seismic inversion. SIAM J. Sci. Comput. 2012, 34, 1460–1487.
- Brown, B.M.; Hewitt, J.I. Asymptotic likelihood theory for diffusion processes. J. Appl. Probab. 1975, 12, 228–238.
- Bladt, M.; Sørensen, M. Statistical inference for discretely observed Markov jump processes. J. R. Stat. Soc. Ser. B 2005, 67, 395–410.
- Øksendal, B. Stochastic Differential Equations: An Introduction with Applications; Springer: Berlin/Heidelberg, Germany, 2006.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).