Two-Parameter Stochastic Weibull Diffusion Model: Statistical Inference and Application to Real Modeling Example

Nafidi, Ahmed; Bahij, Meriem; Gutiérrez-Sánchez, Ramón; Achchab, Boujemâa

doi:10.3390/math8020160

by Ahmed Nafidi ^1,†, Meriem Bahij ^1,†, Ramón Gutiérrez-Sánchez ^2,*,† and Boujemâa Achchab ^1,†

^1 Department of mathematics and informatics, LAMSAD, National School of Applied Sciences of Berrechid, University of Hassan 1, Avenue de l’université, BP 280, Berrechid 26100, Morocco

^2 Department of Statistics and Operational Research, University of Granada, Facultad de Ciencias, Campus de Fuentenueva, 18071 Granada, Spain

^* Author to whom correspondence should be addressed.

^† These authors contributed equally to this work.

Mathematics 2020, 8(2), 160; https://doi.org/10.3390/math8020160

Submission received: 10 December 2019 / Revised: 9 January 2020 / Accepted: 16 January 2020 / Published: 23 January 2020

(This article belongs to the Special Issue Stochastic Differential Equations and Their Applications)

Abstract

This paper describes the use of the non-homogeneous stochastic Weibull diffusion process, whose trend is proportional to the two-parameter Weibull probability density function. The trend function (conditioned and non-conditioned) is analyzed to obtain fits and forecasts for a real data set, taking into account the mean value of the process, the maximum likelihood estimators of the parameters of the model and the computational problems that may arise. To this end, we employ the simulated annealing method to find the values of the estimators. Finally, to evaluate the capacity of the model, the study is applied to real data, and its accuracy is discussed according to error measures.

Keywords:

Weibull distribution; stochastic diffusion process; likelihood estimation; statistical computation; simulation; age dependency ratio

1. Introduction

A diffusion process $X_{t}$ is a solution of a stochastic differential equation (SDE) of the form

d X_{t} = a (t, X_{t}, θ) d t + σ (t, X_{t}, θ) d w_{t},

where $w_{t}$ is a standard unidimensional or multidimensional Wiener process and a and $σ$ are known functions (with a vector-valued and $σ$ matrix-valued if $X_{t}$ is a multivariate process). Here $θ$ denotes the unknown parameter, and the inference problem discussed is that of estimating $θ$ under continuous or discrete observation of $X_{t}$. As an example of a stochastic process, we cite Brownian motion, which plays a central role in the development of stochastic analysis. It is a Gaussian, Markov, self-similar process that is a martingale and has stationary, independent increments. Brownian motion is also known as the Wiener process in honor of Norbert Wiener, whose work appeared in a series of papers in the early 1920s, a decade before Kolmogorov’s monograph that set probability theory on a rigorous mathematical foundation.

Stochastic modeling deals with real-world situations in which uncertainty is present and employs probabilistic tools to model them. Its purpose is to produce forecasts, to estimate the probability of their outcomes and to explain what might happen under different conditions or decisions. Stochastic diffusion processes are well suited to describing the evolution of diverse phenomena and to forecasting their future trends by means of statistical inference methods. For instance, stochastic diffusion processes have been employed with respect to demography [1], electricity consumption [2], life expectancy at birth [3], effect of therapy on tumors [4] and population extinction [5].

These models are defined by stochastic diffusion processes, which are studied using stochastic calculus and the corresponding statistical inference. In general, the solution to an Itô-type SDE is a diffusion process whose trend function $E [X (t)] = f (t)$ has a form similar to a curve associated with a known distribution. In some cases, the maximum likelihood (ML) method is feasible since the transition density function of the diffusion process is known explicitly.

The problem of estimating the parameters of the drift coefficient has attracted considerable interest in recent years. In most cases, statistical inference is based on approximating the ML methodology; see, for example, Prakasa Rao [6].

In this context, we propose in this paper a study of the Weibull-type stochastic diffusion model. The trend function (TF) of this model corresponds to the graph of the probability density function of the Weibull distribution. From the explicit expression of the transition density function of the process, the ML method is applied to obtain the estimators of the parameters of the process. The resulting likelihood equations are nonlinear, which poses a computational problem; to overcome it, we suggest using the simulated annealing (SA) method. The methodology is applied to an example with real data and illustrated with simulated data generated from the estimated parameter values. The paper is structured as follows: in Section 2, we introduce the non-homogeneous Weibull diffusion process and its probabilistic aspects. In Section 3, the parameters are estimated by the ML method with discrete sampling in time, and the approximate confidence bounds of the process are determined. Section 4 addresses the computational problems involved by means of SA. Finally, in Section 5, the method is applied to real data.

2. Stochastic Weibull Diffusion Process (SWDP)

2.1. The PDF and Moments of the Process

The SWDP, the model proposed in this study, is defined as the non-homogeneous diffusion process $\{x (t), t \in [t_{1}, T], t_{1} > 0\}$, taking values in $(0, + \infty)$, that satisfies the following Itô SDE:

d x (t) = (\frac{α}{t} - β t^{α}) x (t) d t + σ x (t) d w (t); \quad x (t_{1}) = x_{1} \ \text{a.s.},

(1)

where $w (t)$ is a univariate standard Wiener process and $x_{1}$ is a constant. The infinitesimal moments are therefore given by:

\begin{aligned} a (t, x) &= (\frac{α}{t} - β t^{α}) x, \\ b (t, x) &= σ^{2} x^{2}, \end{aligned}

(2)

where $σ > 0$, and $α$ and $β$ are real constants.

This model is an extension of the SWDP defined in Reference [7]. Indeed, replacing the term $α + 1$ in the drift coefficient of Reference [7], that is, $a (t, x) = (\frac{α}{t} - (α + 1) t^{α}) x$, by a constant $β$ yields our stochastic Weibull diffusion process with the new drift coefficient defined in Equation (2).

The functions $a (t, x)$ and $b (t, x)$, $0 < x < + \infty$, are Borel measurable. Moreover, there exists a constant $C > 0$ such that the infinitesimal moments specified in Equation (2) satisfy the uniform Lipschitz and growth conditions (see Kloeden and Platen [8]) for all $x, y \in R^{+}$ and $t \in [t_{1}, T]$.

Indeed, let $x, y \in R^{+}$ and $t \in [t_{1}, T]$. On the one hand, we have

\begin{aligned} ∣ a (t, x) - a (t, y) ∣ + ∣ \sqrt{b (t, x)} - \sqrt{b (t, y)} ∣ &= ∣ a (t, x - y) ∣ + ∣ \sqrt{b (t, x - y)} ∣, \\ &= ∣ (\frac{α}{t} - β t^{α}) (x - y) ∣ + ∣ σ (x - y) ∣, \\ &= (∣ \frac{α}{t} - β t^{α} ∣ + ∣ σ ∣) ∣ x - y ∣, \\ &\leq (\sup_{t_{1} \leq t \leq T} \{∣ \frac{α}{t} - β t^{α} ∣\} + σ) ∣ x - y ∣ . \end{aligned}

(3)

On the other hand, for the particular case $y = 0$, we have

\begin{aligned} ∣ a (t, x) ∣^{2} + ∣ \sqrt{b (t, x)} ∣^{2} &\leq {(∣ a (t, x) ∣ + ∣ \sqrt{b (t, x)} ∣)}^{2}, \\ &\leq {[(\sup_{t_{1} \leq t \leq T} \{∣ \frac{α}{t} - β t^{α} ∣\} + σ) ∣ x ∣]}^{2}, \\ &\leq {(\sup_{t_{1} \leq t \leq T} \{∣ \frac{α}{t} - β t^{α} ∣\} + σ)}^{2} (1 + ∣ x ∣^{2}), \end{aligned}

where we set

C = (\sup_{t_{1} \leq t \leq T} \{∣ \frac{α}{t} - β t^{α} ∣\} + σ),

which is finite because the drift factor is continuous on $[t_{1}, T]$ with $t_{1} > 0$.

Thus, there exists a separable, measurable and (a.s.) continuous process $\{x (t), t \in [t_{1}, T]; t_{1} > 0\}$ that is the unique (a.s.) solution of the SDE (1). This solution is obtained by using Itô’s formula. Defining the new variable $y (t) = log (x (t))$, we obtain

\begin{matrix} d y (t) = (\frac{α}{t} - β t^{α} - \frac{σ^{2}}{2}) d t + σ d w (t); y (t_{1}) = log (x_{1}) . \end{matrix}

This equation can be directly integrated, thus obtaining

y (t) - y (t_{1}) = \int_{t_{1}}^{t} (\frac{α}{s} - β s^{α} - \frac{σ^{2}}{2}) d s + σ (w (t) - w (t_{1})),

and hence

\begin{matrix} y (t) = y (t_{1}) + α log (t / t_{1}) - \frac{β}{α + 1} (t^{α + 1} - t_{1}^{α + 1}) - \frac{σ^{2}}{2} (t - t_{1}) + σ (w (t) - w (t_{1})) . \end{matrix}

(4)

The analytical expression of the solution of Equation (1) is easily deduced from Equation (4):

\begin{matrix} x (t) = x_{1} {(\frac{t}{t_{1}})}^{α} exp (- \frac{β}{α + 1} (t^{α + 1} - t_{1}^{α + 1}) - \frac{σ^{2}}{2} (t - t_{1})) e^{σ (w (t) - w (t_{1}))} . \end{matrix}

(5)

The variable $y (t)$, conditionally on $y (s) = y_{s} = log (x_{s})$, has a one-dimensional normal distribution $N_{1} [μ (s, t, x_{s}), σ^{2} (t - s)]$. Consequently, $x (t)$ conditionally on $x (s) = x_{s}$ is lognormally distributed, denoted by $Λ_{1} [μ (s, t, x_{s}), σ^{2} (t - s)]$, with $μ (s, t, x_{s})$ given by

\begin{matrix} μ (s, t, x_{s}) = log (x_{s}) + α log (t / s) - \frac{β}{α + 1} (t^{α + 1} - s^{α + 1}) - \frac{σ^{2}}{2} (t - s) . \end{matrix}

(6)

From the above, the probability density function (PDF) of the process considered has the following form:

\begin{matrix} f (y, t ∣ x_{s}, s) = \frac{1}{y} {[2 π σ^{2} (t - s)]}^{- 1 / 2} exp (- \frac{{[log (y) - μ (s, t, x_{s})]}^{2}}{2 σ^{2} (t - s)}) . \end{matrix}

(7)

2.2. Moments of the Process

To determine the moments of the process, we use a well-known property of the lognormal distribution, by which the r-th conditional moment of the process is

\begin{aligned} E [x^{r} (t) ∣ x (s) = x_{s}] &= exp (r μ (s, t, x_{s}) + \frac{r^{2} σ^{2}}{2} (t - s)), \\ &= x_{s}^{r} {(\frac{t}{s})}^{r α} e^{- \frac{r β}{α + 1} (t^{α + 1} - s^{α + 1})} e^{\frac{r}{2} (r - 1) σ^{2} (t - s)} . \end{aligned}

In particular, for $r = 1$, the conditional trend function (CTF) of the process is:

\begin{matrix} E [x (t) ∣ x (s) = x_{s}] = x_{s} {(\frac{t}{s})}^{α} e^{- \frac{β}{α + 1} (t^{α + 1} - s^{α + 1})} . \end{matrix}

(8)

Hence, under the initial condition $P [x (t_{1}) = x_{1}] = 1$, the trend function (TF) of the process is:

\begin{matrix} E [x (t)] = x_{1} \frac{e^{\frac{β}{α + 1} t_{1}^{α + 1}}}{t_{1}^{α}} t^{α} e^{- \frac{β}{α + 1} t^{α + 1}} . \end{matrix}

(9)

Remark 1.

- As mentioned above, this process is a generalisation of the one defined in Reference [7]. In fact, assuming $β = α + 1$ , the SWDP obtained becomes the SWDP based on the two-parameter Weibull distribution.
- Moreover, the trend function of the process, given in Equation (9), is proportional to the PDF of the Weibull distribution.
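The link between Equation (9) and the Weibull density can be made explicit with a short derivation. The shape $k$ and scale $λ$ below are notation introduced here for illustration only (they do not appear in the original text), and the identification assumes $α > - 1$ and $β > 0$.

```latex
% Weibull density with shape k and scale \lambda (notation assumed here)
f_{W}(t; k, \lambda) = \frac{k}{\lambda} \left(\frac{t}{\lambda}\right)^{k-1} e^{-(t/\lambda)^{k}}, \qquad t > 0.

% Choose k = \alpha + 1 and \lambda = \left(\frac{\alpha + 1}{\beta}\right)^{1/(\alpha + 1)}, so that
\left(\frac{t}{\lambda}\right)^{k} = \frac{\beta}{\alpha + 1} t^{\alpha + 1}, \qquad \frac{k}{\lambda^{k}} = \beta,

% and therefore
f_{W}(t) = \beta t^{\alpha} e^{-\frac{\beta}{\alpha + 1} t^{\alpha + 1}}, \qquad E[x(t)] = x_{1} \frac{f_{W}(t)}{f_{W}(t_{1})}.
```

Under the same assumptions, the drift factor in Equation (1) is the logarithmic derivative of this density, $\frac{d}{d t} log f_{W} (t) = \frac{α}{t} - β t^{α}$, which is why the mean of the process follows the Weibull curve.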

3. Statistical Inference

3.1. Maximum Likelihood Estimation

The drift and diffusion parameters of the process, $α$, $β$ and $σ^{2}$, are estimated by the ML method with discrete sampling. We therefore consider a discrete sample of the process $x (t_{1}), x (t_{2}), \dots, x (t_{n})$ at times $t_{1}, t_{2}, \dots, t_{n}$, and write $x (t_{i}) = x_{i}$ for $i = 1, \dots, n$ in the following. Moreover, we assume that the time gap between two successive observations is constant (i.e., $t_{i} - t_{i - 1} = h$ for $i = 2, \dots, n$). Taking $P [x (t_{1}) = x_{1}] = 1$ as the initial condition, the associated likelihood function is obtained from Equation (7) as:

\begin{matrix} L (x_{1}, \dots, x_{n}; α, β, σ^{2}) = \prod_{j = 2}^{n} f (x_{j}, t_{j} ∣ x_{j - 1}, t_{j - 1}) . \end{matrix}

(10)

Since taking derivatives of a product is tedious, the log-likelihood for Equation (10) is usually maximised, that is,

\begin{aligned} log (L (x_{1}, \dots, x_{n}; α, β, σ^{2})) = & - \frac{n - 1}{2} log (2 π h) - \frac{n - 1}{2} log (σ^{2}) - \sum_{j = 2}^{n} log (x_{j}) \\ & - \frac{1}{2 σ^{2} h} \sum_{j = 2}^{n} {[log (\frac{x_{j}}{x_{j - 1}}) - α log (\frac{t_{j}}{t_{j - 1}}) + \frac{β}{α + 1} [t_{j}^{α + 1} - t_{j - 1}^{α + 1}] + \frac{σ^{2}}{2} h]}^{2} . \end{aligned}

(11)

Applying the ML principle, the estimators $\hat{α}$, $\hat{β}$ and ${\hat{σ}}^{2}$ of $α$, $β$ and $σ^{2}$ are obtained by differentiating the log-likelihood function with respect to $σ^{2}$, $β$ and $α$ and setting the derivatives to zero, which gives the following equations:

\begin{matrix} - (n - 1) {\hat{σ}}^{2} h + \sum_{j = 2}^{n} B_{j}^{2} (\hat{α}, \hat{β}) - \frac{n - 1}{4} {\hat{σ}}^{4} h^{2} = 0, \end{matrix}

(12a)

\begin{matrix} \sum_{j = 2}^{n} (B_{j} (α, β) + \frac{σ^{2}}{2} h) (t_{j}^{α + 1} - t_{j - 1}^{α + 1}) = 0, \end{matrix}

(12b)

\begin{matrix} \sum_{j = 2}^{n} \frac{\partial B_{j} (α, β)}{\partial α} (B_{j} (α, β) + \frac{σ^{2}}{2} h) = 0 . \end{matrix}

(12c)

For $j = 2, \dots, n$, we denote:

B_{j} (α, β) = log (x_{j} / x_{j - 1}) - α log (t_{j} / t_{j - 1}) + \frac{β}{α + 1} (t_{j}^{α + 1} - t_{j - 1}^{α + 1}) .

Equation (12a) is quadratic in ${\hat{σ}}^{2} h / 2$; keeping its positive root gives the estimator ${\hat{σ}}^{2}$:

\begin{matrix} \frac{{\hat{σ}}^{2}}{2} = \frac{1}{h} [{(1 + \frac{1}{n - 1} \sum_{j = 2}^{n} B_{j}^{2} (\hat{α}, \hat{β}))}^{1 / 2} - 1] . \end{matrix}

(13)

Consequently, replacing $\frac{σ^{2}}{2}$ in Equations (12b) and (12c) by its estimator from Equation (13) yields the following nonlinear equations for the estimators $\hat{α}$ and $\hat{β}$:

\begin{matrix} \sum_{j = 2}^{n} (B_{j} (\hat{α}, \hat{β}) + \frac{{\hat{σ}}^{2}}{2} h) (t_{j}^{\hat{α} + 1} - t_{j - 1}^{\hat{α} + 1}) = 0, \\ \sum_{j = 2}^{n} \frac{\partial B_{j} (\hat{α}, \hat{β})}{\partial α} (B_{j} (\hat{α}, \hat{β}) + \frac{{\hat{σ}}^{2}}{2} h) = 0 . \end{matrix}
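The quantities above translate directly into code. The following is a minimal sketch, not the authors' implementation: the function names `b_terms`, `sigma2_hat` and `log_likelihood` are hypothetical, and the code assumes equally spaced observation times and $α \neq - 1$.

```python
import numpy as np

def b_terms(x, t, alpha, beta):
    """B_j(alpha, beta) for j = 2..n, as defined in the text."""
    x = np.asarray(x, dtype=float)
    t = np.asarray(t, dtype=float)
    p = alpha + 1.0  # assumption: alpha != -1
    return (np.log(x[1:] / x[:-1])
            - alpha * np.log(t[1:] / t[:-1])
            + beta / p * (t[1:] ** p - t[:-1] ** p))

def sigma2_hat(x, t, alpha, beta):
    """Closed-form sigma^2 of Equation (13) for given alpha and beta."""
    t = np.asarray(t, dtype=float)
    h = t[1] - t[0]  # assumption: constant time step
    b = b_terms(x, t, alpha, beta)
    # mean(b**2) equals (1 / (n - 1)) * sum of B_j^2
    return (2.0 / h) * (np.sqrt(1.0 + np.mean(b ** 2)) - 1.0)

def log_likelihood(x, t, alpha, beta, sigma2):
    """Log-likelihood of Equation (11)."""
    x = np.asarray(x, dtype=float)
    t = np.asarray(t, dtype=float)
    h = t[1] - t[0]
    n1 = len(x) - 1
    resid = b_terms(x, t, alpha, beta) + 0.5 * sigma2 * h
    return (-0.5 * n1 * np.log(2.0 * np.pi * h)
            - 0.5 * n1 * np.log(sigma2)
            - np.sum(np.log(x[1:]))
            - np.sum(resid ** 2) / (2.0 * sigma2 * h))
```

Because Equation (13) gives ${\hat{σ}}^{2}$ in closed form for any pair $(α, β)$, one possible design is to plug `sigma2_hat` into `log_likelihood` and search numerically over $α$ and $β$ only.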

3.2. Confidence Bounds of the Process

The confidence bounds (CB) of the process are obtained using the same procedure as in Reference [9]. From Equation (5), we consider the variable

Y = σ (w (t) - w (t_{1})) = log (\frac{x (t)}{x_{1}}) - α log (\frac{t}{t_{1}}) + \frac{β}{α + 1} (t^{α + 1} - t_{1}^{α + 1}) + \frac{σ^{2}}{2} (t - t_{1}) .

Since, for all $t \geq t_{1}$, the increment $w (t) - w (t_{1})$ of the Wiener process is normally distributed $N_{1} (0, t - t_{1})$, the variable $Y$ is normally distributed with mean 0 and variance $σ^{2} (t - t_{1})$, and its standardised version satisfies

Z = \frac{Y - E (Y)}{\sqrt{Var (Y)}} = \frac{log (\frac{x (t)}{x_{1}}) - α log (\frac{t}{t_{1}}) + \frac{β}{α + 1} (t^{α + 1} - t_{1}^{α + 1}) + \frac{σ^{2}}{2} (t - t_{1})}{σ \sqrt{t - t_{1}}} \sim N (0, 1) .

Thus, the 95% CB for the variable $x (t)$ is obtained from the following relation:

\begin{matrix} P [- 1.96 \leq \frac{log (\frac{x (t)}{x_{1}}) - α log (\frac{t}{t_{1}}) + \frac{β}{α + 1} (t^{α + 1} - t_{1}^{α + 1}) + \frac{σ^{2}}{2} (t - t_{1})}{σ \sqrt{t - t_{1}}} \leq 1.96] \approx 0.95 . \end{matrix}

A CB for $x (t)$ of the following form is thus obtained:

x_{lower} (t) \leq x (t) \leq x_{upper} (t),

where

\begin{aligned} x_{lower} (t) &= x_{1} exp [- 1.96 σ \sqrt{t - t_{1}} + α log (\frac{t}{t_{1}}) - \frac{β}{α + 1} (t^{α + 1} - t_{1}^{α + 1}) - \frac{σ^{2}}{2} (t - t_{1})], \\ x_{upper} (t) &= x_{1} exp [1.96 σ \sqrt{t - t_{1}} + α log (\frac{t}{t_{1}}) - \frac{β}{α + 1} (t^{α + 1} - t_{1}^{α + 1}) - \frac{σ^{2}}{2} (t - t_{1})] . \end{aligned}

(14)
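As an illustration, Equation (14) can be evaluated numerically as follows. This is a sketch under stated assumptions: the helper names `trend` and `confidence_bounds` and the argument `z` are hypothetical, and the parameters are treated as known.

```python
import numpy as np

def trend(t, t1, x1, alpha, beta):
    """Trend function of Equation (9), written relative to (t1, x1)."""
    p = alpha + 1.0  # assumption: alpha != -1
    return x1 * (t / t1) ** alpha * np.exp(-beta / p * (t ** p - t1 ** p))

def confidence_bounds(t, t1, x1, alpha, beta, sigma2, z=1.96):
    """Lower and upper bounds of Equation (14); z = 1.96 gives the 95% level."""
    t = np.asarray(t, dtype=float)
    dt = t - t1
    # Common factor: trend times the lognormal correction exp(-sigma^2 (t - t1) / 2)
    centre = trend(t, t1, x1, alpha, beta) * np.exp(-0.5 * sigma2 * dt)
    half_width = z * np.sqrt(sigma2 * dt)  # z * sigma * sqrt(t - t1) on the log scale
    return centre * np.exp(-half_width), centre * np.exp(half_width)
```

The bounds are symmetric on the log scale but not around the trend itself, and they widen as $\sqrt{t - t_{1}}$ grows.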

4. Computational Aspects

4.1. Estimated TF and Estimated CBs

By Zehna’s theorem [10], replacing the parameters by their estimators in Equations (8) and (9) gives the estimated conditional trend function (ECTF):

\begin{matrix} \hat{E} [x (t) ∣ x (s) = x_{s}] = x_{s} {(\frac{t}{s})}^{\hat{α}} e^{- \frac{\hat{β}}{\hat{α} + 1} (t^{\hat{α} + 1} - s^{\hat{α} + 1})}, \end{matrix}

(15)

and the estimated trend function (ETF) is given by:

\begin{matrix} \hat{E} [x (t)] = \frac{x_{1} e^{\frac{\hat{β}}{\hat{α} + 1} t_{1}^{\hat{α} + 1}}}{t_{1}^{\hat{α}}} t^{\hat{α}} e^{- \frac{\hat{β}}{\hat{α} + 1} t^{\hat{α} + 1}} . \end{matrix}

(16)

Likewise, the estimated confidence bounds (ECB) are constructed by replacing the parameters by their estimators in Equation (14).

4.2. Simulated Annealing Method

Simulated Annealing (SA) was first introduced in References [11,12], which reported significant initial results, following earlier work in Reference [13] that sought to minimise a function on a very large, finite set. The approach was subsequently applied to optimisation over a continuous set in Reference [14].

SA is a technique for approximating the solution of hard combinatorial optimisation problems. The problem considered is

max_{S \in F} (f (S)),

or equivalently

min_{S \in F} (- f (S)) .

At each iteration of the algorithm, we have a current solution x with objective function value $f (x)$. A neighbour $x^{'}$ is chosen from the neighbourhood of x, denoted $K (x)$ and defined as the set of all its nearest neighbours. For every move, the objective difference $η = f (x^{'}) - f (x)$ is computed. For maximisation problems, $x^{'}$ replaces x when $η \geq 0$. Otherwise, $x^{'}$ may still be accepted with probability $ω = e^{- ∣ η ∣ / T}$, where T here denotes the temperature (not the final observation time): $ω$ is compared with a randomly generated number r, uniform on (0, 1), and $x^{'}$ is accepted whenever $ω > r$. The iterations continue until a stopping criterion is met, and the resulting point $x^{*}$ is taken as an approximate solution to the problem.

In our situation, the problem is to maximise the log-likelihood function obtained in Equation (11). Equivalently, dropping the terms that do not depend on the parameters, the objective function to minimise is the following function of $α$, $β$ and $σ^{2}$:

G (α, β, σ^{2}) = \frac{n - 1}{2} log (σ^{2}) + \frac{1}{2 σ^{2} h} \sum_{j = 2}^{n} {[log (\frac{x_{j}}{x_{j - 1}}) - α log (\frac{t_{j}}{t_{j - 1}}) + \frac{β}{α + 1} [t_{j}^{α + 1} - t_{j - 1}^{α + 1}] + \frac{σ^{2}}{2} h]}^{2} .

(17)

The motivation of SA is to avoid becoming trapped in local optima by allowing occasional moves to worse (higher-cost) solutions, governed by a control parameter termed the ‘temperature’.
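The acceptance rule described above can be sketched in a few lines, written here in minimisation form so that it applies to the objective of Equation (17). This is an illustrative sketch, not the authors' Matlab implementation: the function names, the Gaussian proposal used as the "neighbourhood", the geometric cooling schedule and all default values are assumptions made for the example.

```python
import numpy as np

def objective_G(params, x, t):
    """Objective G of Equation (17); returns inf outside the admissible region."""
    alpha, beta, sigma2 = params
    if sigma2 <= 0.0 or alpha == -1.0:
        return np.inf
    x = np.asarray(x, dtype=float)
    t = np.asarray(t, dtype=float)
    h = t[1] - t[0]  # assumption: constant time step
    p = alpha + 1.0
    resid = (np.log(x[1:] / x[:-1]) - alpha * np.log(t[1:] / t[:-1])
             + beta / p * (t[1:] ** p - t[:-1] ** p) + 0.5 * sigma2 * h)
    return (0.5 * (len(x) - 1) * np.log(sigma2)
            + np.sum(resid ** 2) / (2.0 * sigma2 * h))

def simulated_annealing(objective, x0, step, temp0=1.0, cooling=0.999,
                        n_iter=5000, seed=0):
    """Minimise `objective` by simulated annealing with Gaussian proposals."""
    rng = np.random.default_rng(seed)
    current = np.asarray(x0, dtype=float)
    f_current = objective(current)
    best, f_best = current.copy(), f_current
    temp = temp0
    for _ in range(n_iter):
        # Draw a neighbour of the current point (assumed neighbourhood: Gaussian step)
        candidate = current + step * rng.standard_normal(current.shape)
        f_candidate = objective(candidate)
        eta = f_candidate - f_current  # cost difference; positive means worse
        # Always accept improvements; accept worse moves with probability exp(-eta / temp)
        if eta <= 0.0 or rng.random() < np.exp(-eta / temp):
            current, f_current = candidate, f_candidate
            if f_current < f_best:
                best, f_best = current.copy(), f_current
        temp *= cooling  # geometric cooling (assumed schedule)
    return best, f_best
```

A call such as `simulated_annealing(lambda p: objective_G(p, x, t), x0, step)` would then search over $(α, β, σ^{2})$. In practice the three parameters live on very different scales, so `step` would likely need to be a per-parameter array, and the result depends on the starting point and the cooling schedule.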

5. Application and Simulation: The Age Dependency Ratio

5.1. Application

The following time-dependent stochastic variable (stochastic process) $x (t)$ is considered: the ratio of the dependent population (those aged under 15 or over 65 years) to the working-age population (aged 15 to 65 years) during year t in Morocco. This ratio is expressed as the number of “dependents” per 100 “workers”. This indicator is a key quantity for demographic analysis and for pay-as-you-go retirement schemes, social security systems and health care insurance [15,16]. Indeed, the age dependency ratio measures the burden that the old population places on workers, and it shows how the balance between young and old populations evolves during demographic transitions. Formally, the age dependency ratio $r (t)$ is

r (t) = \frac{g ([0, 15), t) + g ([65, \infty), t)}{g ([15, 65), t)} \times 100,

where $g ([a_{1}, a_{2}), t) = \int_{[a_{1}, a_{2})} g (a, t) d a$ represents the number of individuals with age $a \in [a_{1}, a_{2})$ at time t, and $g (a, t)$ is the average number of individuals with age a at time t.

The age dependency ratio in Morocco has decreased significantly: according to the official data in Table 1, the annual age dependency ratio fell from 105.58% (i.e., 105.58 dependents per 100 persons of working age) in 1968 to 51.89% in 2017. The mean ratio during this period was 74.55%, with a minimum of 51.64% in 2015. The evolution of this ratio is associated with factors such as birth rate, fertility rate, employment trends, life expectancy and economic growth rates.

The data used for this purpose correspond to the period 1968–2017 (see Table 1) and were obtained from the World Bank database. The method applied consists of two steps:

Step 1: Data for 1968–2014 are used to estimate the process parameters as described above. Using Matlab, the following estimates are obtained: $\hat{α} = - 0.5337,$ $\hat{β} = 0.8457$ and ${\hat{σ}}^{2} = 3.8755 \times 10^{- 5} .$
Step 2: Data for 2015–2017 are used to assess the forecasts of the process. Table 2 summarises the behaviour of the conditional and non-conditional trend functions given, respectively, by Equations (15) and (16), together with the 95% confidence bounds obtained from Equation (14). The forecasting performance of the SWDP is shown in Figure 1 and Figure 2.
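The forecasting step can be illustrated with a short script that evaluates Equations (15) and (16) at the reported estimates. It is a sketch rather than the authors' code: the function name `conditional_trend` is hypothetical, time is measured in calendar years with $t_{1} = 1968$, and the observed values are copied from Table 1.

```python
import numpy as np

ALPHA, BETA = -0.5337, 0.8457   # estimates reported in Step 1
T1, X1 = 1968.0, 105.5770       # first observation in Table 1

def conditional_trend(t, s, x_s, alpha=ALPHA, beta=BETA):
    """Equation (15): expected x(t) given x(s) = x_s."""
    p = alpha + 1.0
    return x_s * (t / s) ** alpha * np.exp(-beta / p * (t ** p - s ** p))

# Observed ratios used as conditioning values (Table 1)
observed = {2014: 51.6961, 2015: 51.6429, 2016: 51.8101}

for year in (2015, 2016, 2017):
    # Equation (16) is Equation (15) conditioned on the initial point (t1, x1)
    etf = conditional_trend(float(year), T1, X1)
    # One-step-ahead forecast conditioned on the previous year's observation
    ectf = conditional_trend(float(year), year - 1.0, observed[year - 1])
    print(year, round(etf, 4), round(ectf, 4))
```

Under these assumptions, the printed values should land within a few hundredths of the trend and conditional-trend columns of Table 2; small differences are expected because the estimates are reported to four decimal places only.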

5.2. Goodness of Fit

The following accuracy measures are used: two scale-dependent measures based on absolute or squared errors, and one measure based on percentage errors:

Mean Absolute Error (MAE) = mean ( $∣ e_{t} ∣$ ),
Root Mean Square Error (RMSE) = $\sqrt{mean (e_{t}^{2})},$
Mean Absolute Percentage Error (MAPE) = mean ( $∣ 100 * e_{t} / x (t) ∣$ ),

where $e_{t} = x (t) - \hat{x} (t)$ and $\hat{x} (t)$ is obtained by substituting the parameters in Equation (5) by their estimators.

The values obtained for the above error measures, reported in Table 4, are acceptably low; in particular, the MAPE falls in the “highly accurate forecasting” range of Table 3.
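The three error measures are straightforward to compute. The sketch below uses a hypothetical function name, `error_measures`, and takes the absolute value inside the MAPE, as is standard for that measure.

```python
import numpy as np

def error_measures(observed, fitted):
    """Return MAE, RMSE and MAPE (in percent) for paired observed and fitted values."""
    observed = np.asarray(observed, dtype=float)
    fitted = np.asarray(fitted, dtype=float)
    e = observed - fitted  # e_t = x(t) - x_hat(t)
    return {
        "MAE": float(np.mean(np.abs(e))),
        "RMSE": float(np.sqrt(np.mean(e ** 2))),
        "MAPE": float(np.mean(np.abs(100.0 * e / observed))),
    }
```

For example, `error_measures([100.0, 50.0], [90.0, 55.0])` has errors 10 and -5, so the MAE is 7.5 and the MAPE is 10 (both errors are 10% of the observed value).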

5.3. Simulation

The sample paths were simulated using Equation (5), taking values of $α$, $β$, $σ^{2}$ and $x_{1}$ close to those estimated in the real-data application of Section 5.1. Ten trajectories with 500 values each were generated at equally spaced time instants (step h, given below).

Figure 3 shows the simulated trajectories of the SWDP, where the red curve represents the theoretical trend function, for the particular case $α = - 0.5337$, $β = 0.8457$, $σ^{2} = 3.8755 \times 10^{- 5}$, $h = 0.096$, $t_{1} = 1968$, $x_{1} = 105.5770$, which correspond to values close to those obtained in the study of $x (t)$.
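A simulation of this kind can be sketched by sampling the Wiener increments and inserting them into the exact solution of Equation (5). The function name `simulate_swdp`, the seed and the time grid $t_{i} = t_{1} + (i - 1) h$ are assumptions made for this example; the original text does not spell out the grid.

```python
import numpy as np

def simulate_swdp(alpha, beta, sigma2, x1, t1, h, n_steps, n_paths, seed=0):
    """Simulate SWDP paths from the exact solution, Equation (5)."""
    rng = np.random.default_rng(seed)
    t = t1 + h * np.arange(n_steps)  # assumed grid: t_i = t1 + (i - 1) h
    p = alpha + 1.0
    sigma = np.sqrt(sigma2)
    # Increments w(t_i) - w(t_{i-1}) ~ N(0, h); their running sum is w(t_i) - w(t1)
    dw = rng.normal(0.0, np.sqrt(h), size=(n_paths, n_steps - 1))
    w = np.concatenate([np.zeros((n_paths, 1)), np.cumsum(dw, axis=1)], axis=1)
    # Log of the theoretical trend function, Equation (9)
    log_trend = np.log(x1) + alpha * np.log(t / t1) - beta / p * (t ** p - t1 ** p)
    paths = np.exp(log_trend - 0.5 * sigma2 * (t - t1) + sigma * w)
    return t, paths, np.exp(log_trend)

# Parameter values quoted in the text for Figure 3
t, paths, trend = simulate_swdp(-0.5337, 0.8457, 3.8755e-5, 105.5770,
                                1968.0, 0.096, n_steps=500, n_paths=10)
```

Because the solution is available in closed form, no discretisation scheme such as Euler-Maruyama is needed: each path is exact at the grid points, and with such a small $σ^{2}$ the trajectories stay close to the trend curve.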

6. Conclusions

The SWDP was applied to analyse the age dependency ratio in Morocco. This yielded a good description of the time series considered (1968–2014) and good medium-term forecasts (2015–2017). From the results obtained (see Table 2, Figure 1 and Figure 2), we deduce that when the real case considered is modelled by the SWDP according to the estimation procedure described in Section 3, the fit and prediction achieved, based on the ETF and ECTF, present a high degree of accuracy (see Table 4).

On the one hand, as the retirement age is stable while life expectancy is rising, an increasing part of one’s lifetime is spent in retirement. On the other hand, as birth rates decrease, the part of the population that will later support the rest of the population shrinks. Since the dependency ratio indicates how many people need to be supported relative to the number of people who are working, the increasing number of retirees and the decreasing workforce drive up the dependency ratio.

An interesting area for future research would be to examine the possibility of defining a non-homogeneous Weibull model that introduces exogenous factors into the drift, similarly to the approach adopted for other diffusions [17,18]. This would enable us to study the factors affecting the evolution of the age dependency ratio, for example fertility, immigration, mortality, health and work ability.

Author Contributions

A.N. accomplished the formal analysis; A.N. and M.B. developed the methodology; M.B., B.A. and R.G.-S. analysed the data; M.B. wrote the first draft; A.N., R.G.-S. and B.A. reviewed and edited the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research was financed by LAMSAD from “Fonds propres de l’Université Hassan I” (Morocco) and FQM-147 from “Plan Andaluz de l’Investigaciòn” (Spain).

Acknowledgments

We would like to thank the Editor and all the referees for constructive comments and suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Gutiérrez, R.; Gutiérrez-Sánchez, R.; Nafidi, A. Trend analysis and computational statistical estimation in a stochastic Rayleigh model: Simulation and application. Math. Comput. Simul. 2008, 77, 209–217.
2. Giovanis, A.; Skiadas, C. A stochastic logistic innovation diffusion model studying the electricity consumption in Greece and the United States. Technol. Forecast. Soc. Chang. 1999, 61, 235–246.
3. Gutiérrez, R.; Gutiérrez-Sánchez, R.; Nafidi, A. The Stochastic Rayleigh diffusion model: Statistical inference and computational aspects. Applications to modelling of real cases. Appl. Math. Comput. 2006, 175, 628–644.
4. Albano, G.; Giorno, V.; Román-Román, P.; Torres-Ruiz, F. Inferring the effect of therapy on tumors showing stochastic Gompertzian growth. J. Theor. Biol. 2011, 276, 67–77.
5. Skvortsov, A.; Ristic, B.; Kamenev, A. Predicting population extinction from early observations of the Lotka–Volterra system. Appl. Math. Comput. 2018, 320, 371–379.
6. Rao, B.P.; Rao, B.P. Statistical Inference for Diffusion Type Processes; Arnold: London, UK, 1999.
7. Nafidi, A.; Bahij, M.; Achchab, B.; Gutiérrez-Sanchez, R. The stochastic Weibull diffusion process: Computational aspects and simulation. Appl. Math. Comput. 2019, 348, 575–587.
8. Kloeden, P.E.; Platen, E.; Gelbrich, M.; Romisch, W. Numerical Solution of Stochastic Differential Equations. SIAM Rev. 1995, 37, 272–274.
9. Katsamaki, A.; Skiadas, C. Analytic solution and estimation of parameters on a stochastic exponential model for a technological diffusion process. Appl. Stoch. Model. Data Anal. 1995, 11, 59–75.
10. Zehna, P.W. Invariance of maximum likelihood estimators. Ann. Math. Stat. 1966, 37, 744.
11. Kirkpatrick, S.; Gelatt, C.D.; Vecchi, M.P. Optimization by Simulated Annealing. Science 1983, 220, 671–680.
12. Černỳ, V. Thermodynamical approach to the traveling salesman problem: An efficient simulation algorithm. J. Optim. Theory Appl. 1985, 45, 41–51.
13. Metropolis, N.; Rosenbluth, A.W.; Rosenbluth, M.N.; Teller, A.H.; Teller, E. Equation of State Calculations by Fast Computing Machines. J. Chem. Phys. 1953, 21, 1087–1092.
14. Duflo, M. Random Iterative Models; Springer Science & Business Media: Berlin, Germany, 2013; Volume 34.
15. Boumezoued, A.; Hardy, H.L.; Karoui, N.E.; Arnold, S. Cause-of-death mortality: What can be learned from population dynamics? Insur. Math. Econ. 2018, 78, 301–315.
16. Boyle, P.P.; Freedman, R. Population waves and fertility fluctuations: Social security implications. Insur. Math. Econ. 1985, 4, 65–74.
17. Gutiérrez, R.; Gutiérrez-Sánchez, R.; Nafidi, A. Electricity consumption in Morocco: Stochastic Gompertz diffusion analysis with exogenous factors. Appl. Energy 2006, 83, 1139–1151.
18. Nafidi, A.; Gutiérrez, R.; Gutiérrez-Sánchez, R.; Ramos-Ábalos, E.; El Hachimi, S. Modelling and predicting electricity consumption in Spain using the stochastic Gamma diffusion process with exogenous factors. Energy 2016, 113, 309–318.

Figure 1. Observed data, estimated trend function (ETF) and the forecasted values.

Figure 2. Observed data, estimated conditional trend function (ECTF) and the forecasted values.

Figure 3. The Stochastic Weibull Diffusion Process (SWDP) simulated with the theoretical trend function.

Table 1. Age dependency ratio (% of working-age population) in Morocco.

Year	1968	1969	1970	1971	1972	1973
Data	105.5770	105.0150	104.2379	103.4307	102.2719	100.8111
Year	1974	1975	1976	1977	1978	1979
Data	99.0847	97.1586	95.1705	93.0931	91.0080	89.0184
Year	1980	1981	1982	1983	1984	1985
Data	87.1933	85.9912	84.8607	83.8064	82.7859	81.7496
Year	1986	1987	1988	1989	1990	1991
Data	81.1141	80.2438	79.2422	78.2533	77.3297	76.1942
Year	1992	1993	1994	1995	1996	1997
Data	75.2913	74.4163	73.2973	71.8304	70.4705	68.7203
Year	1998	1999	2000	2001	2002	2003
Data	66.7817	64.9550	63.3799	61.7694	60.5311	59.5211
Year	2004	2005	2006	2007	2008	2009
Data	58.5356	57.4950	56.5551	55.5167	54.4817	53.6134
Year	2010	2011	2012	2013	2014	2015
Data	52.9908	52.3518	51.9660	51.7834	51.6961	51.6429
Year	2016	2017
Data	51.8101	51.8878

Table 2. Predictions with trend function (TF) and conditional trend function (CTF) of the process.

Years	Real Data	Trend Function	Conditional Trend	Confidence Bounds
2015	51.6429	52.3115	50.9342	(48.0698–56.8238)
2016	51.8101	51.5407	50.8820	(47.3187–56.0351)
2017	51.8878	50.7815	51.0469	(46.5799–55.2570)

Table 3. Interpretation of typical Mean Absolute Percentage Error (MAPE) values.

MAPE	Interpretation
<10	Highly accurate forecasting
20–30	Good forecasting
30–50	Reasonable forecasting
>50	Inaccurate forecasting

Table 4. Goodness of fit of the model.

MAE	RMSE	MAPE
1.6810	1.9952	2.5312%

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
