Article

New Closed Form Estimators for the Beta Distribution

by Victor Mooto Nawa 1 and Saralees Nadarajah 2,*
1 Department of Mathematics and Statistics, University of Zambia, Lusaka 10101, Zambia
2 Department of Mathematics, University of Manchester, Manchester M13 9PL, UK
* Author to whom correspondence should be addressed.
Mathematics 2023, 11(13), 2799; https://doi.org/10.3390/math11132799
Submission received: 24 May 2023 / Revised: 15 June 2023 / Accepted: 19 June 2023 / Published: 21 June 2023

Abstract: In this paper, we detail closed form estimators for the beta distribution that are simpler than those proposed by Tamae, Irie and Kubokawa. The proposed estimators are shown to have smaller asymptotic variances and smaller asymptotic covariances than Tamae et al.'s estimators and maximum likelihood estimators for certain parameter values. The proposed estimators are also shown to perform better in a real data application.

1. Introduction

The most popular model for data in a finite interval is that based on the beta distribution. It is a two-parameter distribution with probability density function given by
$$f(x; \alpha, \beta) = \frac{x^{\alpha - 1} (1 - x)^{\beta - 1}}{B(\alpha, \beta)}$$
for $0 < x < 1$, $\alpha > 0$ and $\beta > 0$, where $B(\alpha, \beta)$ denotes the beta function defined by
$$B(\alpha, \beta) = \int_0^1 t^{\alpha - 1} (1 - t)^{\beta - 1} \, dt,$$
which can be equivalently expressed as $B(\alpha, \beta) = \Gamma(\alpha) \Gamma(\beta) / \Gamma(\alpha + \beta)$, where $\Gamma(\cdot)$ denotes the gamma function. We shall write $X \sim \mathrm{Beta}(\alpha, \beta)$ to mean that a random variable $X$ has the beta distribution. The mean and variance of $X \sim \mathrm{Beta}(\alpha, \beta)$ are
$$E(X) = \frac{\alpha}{\alpha + \beta} \quad \text{and} \quad \mathrm{Var}(X) = \sigma^2 = \frac{\alpha \beta}{(\alpha + \beta)^2 (\alpha + \beta + 1)}, \tag{1}$$
respectively. There are hundreds if not thousands of papers on the theory and applications of the beta distribution. It is impossible to cite all of these papers. Comprehensive accounts of the theory and applications of the beta distribution can be found in [1,2,3,4,5]. See also [6].
For a long time, estimating the parameters of the beta distribution and other related distributions, such as the gamma distribution, was only possible through iterative methods. For instance, if $x_1, \ldots, x_n$ are observations on $X \sim \mathrm{Beta}(\alpha, \beta)$, then the maximum likelihood estimators of $\alpha$ and $\beta$, say $\hat{\alpha}$ and $\hat{\beta}$, respectively, can be obtained by solving
$$\overline{\log x} = A, \quad \overline{\log (1 - x)} = B,$$
where a bar denotes a sample mean (for example, $\overline{\log x} = \frac{1}{n} \sum_{i = 1}^{n} \log x_i$), $A = \psi(\alpha) - \psi(\alpha + \beta)$ and $B = \psi(\beta) - \psi(\alpha + \beta)$ are evaluated at $(\hat{\alpha}, \hat{\beta})$, and $\psi(\alpha) = \frac{d \log \Gamma(\alpha)}{d \alpha}$ denotes the digamma function. Furthermore, the corresponding asymptotic variances and asymptotic covariance are given by
$$\sqrt{n} \begin{pmatrix} \hat{\alpha} - \alpha \\ \hat{\beta} - \beta \end{pmatrix} \xrightarrow{d} N \left( \begin{pmatrix} 0 \\ 0 \end{pmatrix}, \frac{1}{G} \begin{pmatrix} D & \psi_1(C) \\ \psi_1(C) & E \end{pmatrix} \right)$$
as $n \to \infty$, where $\psi_1(\alpha) = \frac{d \psi(\alpha)}{d \alpha}$ denotes the trigamma function, $C = \alpha + \beta$, $D = \psi_1(\beta) - \psi_1(C)$, $E = \psi_1(\alpha) - \psi_1(C)$ and $G = \psi_1(\alpha) \psi_1(\beta) - \psi_1(\alpha) \psi_1(C) - \psi_1(\beta) \psi_1(C)$. Furthermore, let $F = \psi_1(\alpha) + \psi_1(\beta)$.
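To make the iterative nature concrete, here is a minimal R sketch (our own illustration, not code from the paper) that solves the two likelihood equations numerically by minimizing the sum of squared score residuals; x is assumed to be a vector of observations in (0, 1):
# Minimal sketch: solve the likelihood equations numerically.
# digamma() is the psi function; the function name mle_beta is ours.
mle_beta=function(x)
{score_gap=function(p)
{if (p[1]<=0|p[2]<=0) return(1.0e20)
r1=mean(log(x))-(digamma(p[1])-digamma(p[1]+p[2]))
r2=mean(log(1-x))-(digamma(p[2])-digamma(p[1]+p[2]))
r1^2+r2^2}
optim(par=c(1,1),fn=score_gap)$par}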
Recently, however, closed form estimators for the gamma and beta distributions have been proposed. Ref. [7] proposed closed form estimators for the gamma distribution by considering the likelihood equations of the generalized gamma distribution and taking the gamma distribution as a special case. Ref. [8] proposed closed form estimators for the gamma and beta distributions using the "score adjusted approach". The estimators for the gamma distribution in [8] turned out to be the same as those obtained by [7].
Following the method applied by [8], two simpler closed form estimators for the beta distribution are proposed in this paper. The two estimators appear to have smaller asymptotic variances and covariances than Tamae et al.'s estimators for certain values of the parameters of the beta distribution.
The remainder of this paper is organized as follows. Section 2 re-derives the closed form estimators for the beta distribution due to [8]. Section 3 derives two new closed form estimators for the beta distribution. Section 4 establishes asymptotic normality of the new estimators and derives expressions for their asymptotic variances and asymptotic covariances. Section 5 conducts a numerical comparison to show that the new estimators can be better than the estimators due to [8] as well as maximum likelihood estimators. A simulation study is conducted in Section 6 to check the finite sample performance of all the estimators. Section 7 shows that the new estimators can provide a better fit to a real dataset. Finally, some conclusions are given in Section 8. All computations in the paper were performed using the R software [9]. Sample code is given in Appendix A.

2. Tamae et al.’s [8] Closed Form Estimators

Let $X \sim \mathrm{Beta}(\alpha, \beta)$. Using the facts
$$E(\log X) = \psi(\alpha) - \psi(C) \tag{2}$$
and
$$E[\log (1 - X)] = \psi(\beta) - \psi(C), \tag{3}$$
we can show that
$$E(X \log X) = \frac{\alpha}{C} \left[ \psi(\alpha) - \psi(C) + \frac{1}{\alpha} - \frac{1}{C} \right] \tag{4}$$
and
$$E[X \log (1 - X)] = \frac{\alpha}{C} \left[ \psi(\beta) - \psi(C) - \frac{1}{C} \right]. \tag{5}$$
Subtracting (5) from (4) and using (1), (2) and (3) gives
$$E(X \log X) - E[X \log (1 - X)] = E(X) \left\{ E(\log X) - E[\log (1 - X)] + \frac{1}{\alpha} \right\},$$
which implies
$$\alpha = \frac{E(X)}{E(X \log X) - E[X \log (1 - X)] - E(X) E(\log X) + E(X) E[\log (1 - X)]}. \tag{6}$$
Another way of expressing (6) is
$$E(X \log X) - E[X \log (1 - X)] = \frac{\alpha}{C} \left\{ E(\log X) - E[\log (1 - X)] + \frac{1}{\alpha} \right\}. \tag{7}$$
Substituting α from (6) into (7) and solving for β yields
$$\beta = \frac{1 - E(X)}{E(X \log X) - E[X \log (1 - X)] - E(X) E(\log X) + E(X) E[\log (1 - X)]}. \tag{8}$$
By the weak law of large numbers, we can replace the expectations in (6) and (8) by their sample versions yielding the closed form estimators proposed by [8] as
$$\hat{\alpha} = \frac{\bar{X}}{\overline{X \log X} - \overline{X \log (1 - X)} - \bar{X} \, \overline{\log X} + \bar{X} \, \overline{\log (1 - X)}} = \frac{\bar{X}}{\overline{X \log \frac{X}{1 - X}} - \bar{X} \, \overline{\log \frac{X}{1 - X}}},$$
$$\hat{\beta} = \frac{1 - \bar{X}}{\overline{X \log X} - \overline{X \log (1 - X)} - \bar{X} \, \overline{\log X} + \bar{X} \, \overline{\log (1 - X)}} = \frac{1 - \bar{X}}{\overline{X \log \frac{X}{1 - X}} - \bar{X} \, \overline{\log \frac{X}{1 - X}}}.$$
The corresponding asymptotic variances and asymptotic covariance are given by
$$\sqrt{n} \begin{pmatrix} \hat{\alpha} - \alpha \\ \hat{\beta} - \beta \end{pmatrix} \xrightarrow{d} N \left( \begin{pmatrix} 0 \\ 0 \end{pmatrix}, \left( \sigma^2 C^2 F + 1 \right) \begin{pmatrix} \alpha^2 & \alpha \beta \\ \alpha \beta & \beta^2 \end{pmatrix} - \frac{1}{C + 1} \begin{pmatrix} \alpha \beta & C^2 - \alpha \beta \\ C^2 - \alpha \beta & \alpha \beta \end{pmatrix} \right)$$
as $n \to \infty$.
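For illustration, a minimal R sketch of these closed form estimators follows (it mirrors the a2 and b2 lines of Appendix A; the function name tik_beta is ours):
tik_beta=function(x)
{w=log(x/(1-x))   # log odds
denom=mean(x*w)-mean(x)*mean(w)   # common denominator of (6) and (8)
c(alpha=mean(x)/denom,beta=(1-mean(x))/denom)}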

3. New Closed Form Estimators

In this section, we propose two new estimators for α and β . Throughout, we suppose that X B e t a ( α , β ) .
Substituting (3) into (5) and rearranging the terms gives
$$E[X \log (1 - X)] = E(X) \left\{ E[\log (1 - X)] - \frac{1}{C} \right\},$$
which implies
$$\frac{1}{C} = \frac{E(X) E[\log (1 - X)] - E[X \log (1 - X)]}{E(X)}. \tag{9}$$
Multiplying (9) by α , using (1) and solving for α gives
$$\alpha = \frac{[E(X)]^2}{E(X) E[\log (1 - X)] - E[X \log (1 - X)]}. \tag{10}$$
Substituting (10) in (9) and solving for β gives
$$\beta = \frac{E(X) - [E(X)]^2}{E(X) E[\log (1 - X)] - E[X \log (1 - X)]}. \tag{11}$$
By the weak law of large numbers, we can replace the expectations in (10) and (11) by sample versions to obtain the estimators:
$$\hat{\alpha} = \frac{\bar{X}^2}{\bar{X} \, \overline{\log (1 - X)} - \overline{X \log (1 - X)}} \quad \text{and} \quad \hat{\beta} = \frac{\bar{X} - \bar{X}^2}{\bar{X} \, \overline{\log (1 - X)} - \overline{X \log (1 - X)}}. \tag{12}$$
Substituting (1) and (2) in (4) and rearranging yields
$$E(X \log X) = E(X) \left\{ E(\log X) + \frac{1}{\alpha} - \frac{1}{C} \right\},$$
which implies
$$\frac{1}{\alpha} - \frac{1}{C} = \frac{E(X \log X) - E(X) E(\log X)}{E(X)}. \tag{13}$$
Multiplying (13) by α , using (1) and solving for α gives
$$\alpha = \frac{E(X) [1 - E(X)]}{E(X \log X) - E(X) E(\log X)}. \tag{14}$$
Substituting (14) in (13) and solving for β gives
$$\beta = \frac{[1 - E(X)]^2}{E(X \log X) - E(X) E(\log X)}. \tag{15}$$
By the weak law of large numbers, we can replace the expectations in (14) and (15) by their sample versions to obtain
$$\hat{\alpha} = \frac{\bar{X} (1 - \bar{X})}{\overline{X \log X} - \bar{X} \, \overline{\log X}} \quad \text{and} \quad \hat{\beta} = \frac{(1 - \bar{X})^2}{\overline{X \log X} - \bar{X} \, \overline{\log X}}. \tag{16}$$
Note that $\hat{\beta} / \hat{\alpha} = 1 / \bar{X} - 1$. This relationship can be useful in deriving the large sample properties given in Section 4.
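For illustration, the following R sketch computes both pairs of new estimators (mirroring the a3, b3, a4 and b4 lines of Appendix A; the function names are ours) and checks the ratio identity on simulated data:
# Estimators (12)
new_beta1=function(x)
{d=mean(x)*mean(log(1-x))-mean(x*log(1-x))
c(alpha=mean(x)^2/d,beta=(mean(x)-mean(x)^2)/d)}
# Estimators (16)
new_beta2=function(x)
{d=mean(x*log(x))-mean(x)*mean(log(x))
c(alpha=mean(x)*(1-mean(x))/d,beta=(1-mean(x))^2/d)}
x=rbeta(500,2,3)
est=new_beta2(x)
est[2]/est[1]-(1/mean(x)-1)   # zero up to rounding error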

4. Large Sample Properties

Large sample properties of the estimators given by (12) and (16) are derived in this section. Theorem 1 proves asymptotic normality of the estimators given by (12). Theorem 2 proves asymptotic normality of the estimators given by (16).
Theorem 1.
The estimators given in (12) satisfy
$$\sqrt{n} \begin{pmatrix} \hat{\alpha} - \alpha \\ \hat{\beta} - \beta \end{pmatrix} \xrightarrow{d} N \left( \begin{pmatrix} 0 \\ 0 \end{pmatrix}, \begin{pmatrix} \Sigma_{11} & \Sigma_{12} \\ \Sigma_{12} & \Sigma_{22} \end{pmatrix} \right)$$
as $n \to \infty$, where
$$\Sigma_{11} = \frac{\alpha \beta \left( 1 + C^2 D \right)}{C + 1} + \frac{\alpha (\alpha + 1) C \left( C^2 + 1 \right)}{(C + 1)^3},$$
$$\Sigma_{12} = \frac{\beta^2 C^2 D - \alpha (\beta + C)}{C + 1} + \frac{\beta C \left[ (\alpha + 2) C^2 + C + \alpha + 1 \right]}{(C + 1)^3}$$
and
$$\Sigma_{22} = \frac{\alpha \beta}{C + 1} + \frac{\beta^2 C (\beta C D - \alpha + 1)}{\alpha (C + 1)} + \frac{2 \beta C (\alpha + 1) \left( \beta C^2 - \alpha C - \alpha \right)}{\alpha (C + 1)^3}.$$
Proof. 
Let the empirical means of $X$, $\log (1 - X)$ and $X \log (1 - X)$ be denoted by $\bar{X}$, $\bar{Y}$ and $\bar{Z}$, respectively. We can easily show that
$$E(\bar{X}) = \frac{\alpha}{C},$$
$$E(\bar{Y}) = B$$
and
$$E(\bar{Z}) = \frac{\alpha}{C} \left( B - \frac{1}{C} \right).$$
We can also show that
$$\mathrm{Var} \left( \sqrt{n} \, \bar{Y} \right) = D. \tag{17}$$
Using the fact that
$$E \left[ X^2 \log^2 (1 - X) \right] = \frac{\alpha (\alpha + 1)}{C (C + 1)} \left\{ \left[ \psi(\beta) - \psi(C + 2) \right]^2 + \psi_1(\beta) - \psi_1(C + 2) \right\},$$
we can show that
$$\mathrm{Var} \left( \sqrt{n} \, \bar{Z} \right) = \sigma^2 \left( B - \frac{1}{C} \right)^2 + \frac{\alpha (\alpha + 1)}{C (C + 1)} \left[ \frac{2}{(C + 1)^2} - \frac{2}{C + 1} \left( B - \frac{1}{C} \right) + \frac{1}{C^2} + D \right]. \tag{18}$$
Similarly, we can show that
$$\mathrm{Cov} \left( \sqrt{n} \, \bar{X}, \sqrt{n} \, \bar{Y} \right) = -\frac{\alpha}{C^2}, \tag{19}$$
$$\mathrm{Cov} \left( \sqrt{n} \, \bar{X}, \sqrt{n} \, \bar{Z} \right) = \sigma^2 \left( B - \frac{1}{C} \right) - \frac{\alpha (\alpha + 1)}{C (C + 1)^2} \tag{20}$$
and
$$\mathrm{Cov} \left( \sqrt{n} \, \bar{Y}, \sqrt{n} \, \bar{Z} \right) = \frac{\alpha}{C^2} \left( \frac{2}{C} - B + C D \right). \tag{21}$$
By the central limit theorem,
$$\sqrt{n} \left[ \left( \bar{X}, \bar{Y}, \bar{Z} \right) - \left( \frac{\alpha}{C}, \; B, \; \frac{\alpha}{C} \left[ \psi(\beta) - \psi(C + 1) \right] \right) \right]^T \xrightarrow{d} N \left( \mathbf{0}_3, \Sigma \right)$$
as $n \to \infty$, where $\mathbf{0}_3 = (0, 0, 0)^T$ and
$$\Sigma = \begin{pmatrix} \mathrm{Var}(X) & \mathrm{Cov}(X, Y) & \mathrm{Cov}(X, Z) \\ \mathrm{Cov}(X, Y) & \mathrm{Var}(Y) & \mathrm{Cov}(Y, Z) \\ \mathrm{Cov}(X, Z) & \mathrm{Cov}(Y, Z) & \mathrm{Var}(Z) \end{pmatrix}$$
with $Y = \log (1 - X)$ and $Z = X \log (1 - X)$.
The entries of this matrix are given by (1) and (17)-(21) as
$$\mathrm{Var}(X) = \frac{\alpha \beta}{C^2 (C + 1)} = \sigma^2,$$
$$\mathrm{Cov}(X, Y) = -\frac{\alpha}{C^2},$$
$$\mathrm{Cov}(X, Z) = \sigma^2 \left( B - \frac{1}{C} \right) - \frac{\alpha (\alpha + 1)}{C (C + 1)^2},$$
$$\mathrm{Var}(Y) = D,$$
$$\mathrm{Cov}(Y, Z) = \frac{\alpha}{C^2} \left( \frac{2}{C} - B + C D \right)$$
and
$$\mathrm{Var}(Z) = \sigma^2 \left( B - \frac{1}{C} \right)^2 + \frac{\alpha (\alpha + 1)}{C (C + 1)} \left[ \frac{2}{(C + 1)^2} - \frac{2}{C + 1} \left( B - \frac{1}{C} \right) + \frac{1}{C^2} + D \right].$$
Let $g_1(x, y, z) = \frac{x^2}{x y - z}$, $g_2(x, y, z) = \frac{x - x^2}{x y - z}$ and $(x_0, y_0, z_0) = \left( \frac{\alpha}{C}, \; B, \; \frac{\alpha}{C} \left[ \psi(\beta) - \psi(C + 1) \right] \right)$. Then
$$\left. \frac{\partial g_1}{\partial x} \right|_{(x_0, y_0, z_0)} = C (2 - B C),$$
$$\left. \frac{\partial g_1}{\partial y} \right|_{(x_0, y_0, z_0)} = -\alpha C,$$
$$\left. \frac{\partial g_1}{\partial z} \right|_{(x_0, y_0, z_0)} = C^2,$$
$$\left. \frac{\partial g_2}{\partial x} \right|_{(x_0, y_0, z_0)} = C^2 \left( \frac{\beta - \alpha}{\alpha C} - \frac{\beta B}{\alpha} \right),$$
$$\left. \frac{\partial g_2}{\partial y} \right|_{(x_0, y_0, z_0)} = -\beta C$$
and
$$\left. \frac{\partial g_2}{\partial z} \right|_{(x_0, y_0, z_0)} = \frac{\beta C^2}{\alpha}.$$
Let
$$M = \begin{pmatrix} C (2 - B C) & -\alpha C & C^2 \\ C^2 \left( \frac{\beta - \alpha}{\alpha C} - \frac{\beta B}{\alpha} \right) & -\beta C & \frac{\beta C^2}{\alpha} \end{pmatrix}.$$
Using the delta method,
$$\sqrt{n} \begin{pmatrix} \hat{\alpha} - \alpha \\ \hat{\beta} - \beta \end{pmatrix} \xrightarrow{d} N \left( \mathbf{0}_2, M \Sigma M^T \right)$$
as $n \to \infty$, where $\mathbf{0}_2 = (0, 0)^T$ and
$$M \Sigma M^T = \begin{pmatrix} \left( M \Sigma M^T \right)_{11} & \left( M \Sigma M^T \right)_{12} \\ \left( M \Sigma M^T \right)_{12} & \left( M \Sigma M^T \right)_{22} \end{pmatrix},$$
where
$$\left( M \Sigma M^T \right)_{11} = \sigma^2 C^2 (2 - B C)^2 + 2 C^3 (2 - B C) \, \mathrm{Cov}(X, Z) + C^4 \, \mathrm{Var}(Z) - \alpha^2 C^2 D,$$
$$\left( M \Sigma M^T \right)_{12} = \sigma^2 C^3 (2 - B C) \left( \frac{\beta - \alpha}{\alpha C} - \frac{\beta B}{\alpha} \right) + \frac{C^3}{\alpha} \left( 3 \beta - 2 \beta B C - \alpha \right) \mathrm{Cov}(X, Z) + \frac{\beta C^4}{\alpha} \, \mathrm{Var}(Z) - \alpha \left( C + \beta C^2 D \right)$$
and
$$\left( M \Sigma M^T \right)_{22} = \sigma^2 C^4 \left( \frac{\beta - \alpha}{\alpha C} - \frac{\beta B}{\alpha} \right)^2 + \frac{2 \beta C^4}{\alpha} \left( \frac{\beta - \alpha}{\alpha C} - \frac{\beta B}{\alpha} \right) \mathrm{Cov}(X, Z) + \frac{\beta^2 C^4}{\alpha^2} \, \mathrm{Var}(Z) - \beta C (2 + \beta C D).$$
The theorem follows by simplification of these expressions.    □
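As a quick sanity check of Theorem 1 (our own sketch, not part of the paper), the scaled sampling variance of the estimator of alpha in (12) should be close to Sigma11 for large n:
set.seed(1)
a=2; b=3; C=a+b
D=trigamma(b)-trigamma(C)
Sigma11=a*b*(1+C^2*D)/(C+1)+a*(a+1)*C*(C^2+1)/(C+1)^3
n=5000; R=2000
ahat=replicate(R,{x=rbeta(n,a,b)
d=mean(x)*mean(log(1-x))-mean(x*log(1-x))
mean(x)^2/d})
c(n*var(ahat),Sigma11)   # the two values should be close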
Theorem 2.
The estimators given in (16) satisfy
$$\sqrt{n} \begin{pmatrix} \hat{\alpha} - \alpha \\ \hat{\beta} - \beta \end{pmatrix} \xrightarrow{d} N \left( \begin{pmatrix} 0 \\ 0 \end{pmatrix}, \begin{pmatrix} \Sigma_{11} & \Sigma_{12} \\ \Sigma_{12} & \Sigma_{22} \end{pmatrix} \right)$$
as $n \to \infty$, where
$$\Sigma_{11} = \frac{\sigma^2 \alpha^2 C^2}{\beta^2} \left[ 1 + \frac{2 C^2}{C + 1} + \frac{\beta C^3}{(\alpha + 1)(C + 1)^2} \right] + \frac{\alpha^2 C}{\beta} \left[ 2 + \frac{\alpha C E}{C + 1} \right] + \frac{\alpha^3 C^3 (\alpha + 1)}{\beta^2 (C + 1)} \left[ \frac{1}{C^2} + \frac{1}{(C + 1)^2} - \frac{1}{\alpha^2} - \frac{1}{(\alpha + 1)^2} \right],$$
$$\Sigma_{12} = \frac{\sigma^2 \alpha C^2}{\beta} \left[ \frac{\beta}{\alpha} \left( 1 + \frac{C^2}{C + 1} \right) + \frac{\beta C^3}{(\alpha + 1)(C + 1)^2} + 2 + \frac{3 C^2}{C + 1} \right] + \alpha C \left[ 1 + \frac{\alpha C E}{C + 1} \right] + \frac{\alpha^2 C^3 (\alpha + 1)}{\beta (C + 1)} \left[ \frac{1}{C^2} + \frac{1}{(C + 1)^2} - \frac{1}{\alpha^2} - \frac{1}{(\alpha + 1)^2} \right]$$
and
$$\Sigma_{22} = \sigma^2 C^2 \left[ 4 \left( 1 + \frac{\beta}{\alpha} + \frac{C^2}{C + 1} \right) + \frac{\beta}{\alpha} \left( \frac{\beta}{\alpha} + \frac{2 C^2}{C + 1} \right) + \frac{\beta C^3}{(\alpha + 1)(C + 1)^2} \right] + \frac{\alpha \beta C^2 E}{C + 1} + \frac{\alpha (\alpha + 1) C^3}{C + 1} \left[ \frac{1}{C^2} + \frac{1}{(C + 1)^2} - \frac{1}{\alpha^2} - \frac{1}{(\alpha + 1)^2} \right].$$
Proof. 
Let the empirical means of $X$, $\log X$ and $X \log X$ be denoted by $\bar{X}$, $\bar{U}$ and $\bar{V}$, respectively. We can easily show that
$$E(\bar{U}) = A$$
and
$$E(\bar{V}) = \frac{\alpha}{C} \left( A + \frac{1}{\alpha} - \frac{1}{C} \right).$$
We can also show that
$$\mathrm{Var} \left( \sqrt{n} \, \bar{U} \right) = E. \tag{22}$$
Using the fact that
$$E \left( X^2 \log^2 X \right) = \frac{\alpha (\alpha + 1)}{C (C + 1)} \left\{ \left[ \psi(\alpha + 2) - \psi(C + 2) \right]^2 + \psi_1(\alpha + 2) - \psi_1(C + 2) \right\},$$
we have
$$\mathrm{Var} \left( \sqrt{n} \, \bar{V} \right) = \sigma^2 \left( A + \frac{\beta}{\alpha C} \right)^2 + \frac{\sigma^2 C}{C + 1} \left[ \frac{\beta}{(\alpha + 1)(C + 1)} + 2 A + \frac{2 \beta}{\alpha C} \right] + \frac{\alpha (\alpha + 1)}{C (C + 1)} \left[ \frac{1}{C^2} + \frac{1}{(C + 1)^2} - \frac{1}{\alpha^2} - \frac{1}{(\alpha + 1)^2} + E \right]. \tag{23}$$
Similarly, we can show that
$$\mathrm{Cov} \left( \sqrt{n} \, \bar{X}, \sqrt{n} \, \bar{U} \right) = \frac{\beta}{C^2}, \tag{24}$$
$$\mathrm{Cov} \left( \sqrt{n} \, \bar{X}, \sqrt{n} \, \bar{V} \right) = \sigma^2 \left( A + \frac{\beta}{\alpha C} + \frac{C}{C + 1} \right) \tag{25}$$
and
$$\mathrm{Cov} \left( \sqrt{n} \, \bar{U}, \sqrt{n} \, \bar{V} \right) = \frac{\alpha}{C} \left( \frac{A \beta}{\alpha C} + E - \frac{2 \beta}{\alpha C^2} \right). \tag{26}$$
By the central limit theorem,
$$\sqrt{n} \left[ \left( \bar{X}, \bar{U}, \bar{V} \right) - \left( \frac{\alpha}{C}, \; A, \; \frac{\alpha}{C} \left[ \psi(\alpha + 1) - \psi(C + 1) \right] \right) \right]^T \xrightarrow{d} N \left( \mathbf{0}_3, \Sigma \right)$$
as $n \to \infty$, where
$$\Sigma = \begin{pmatrix} \mathrm{Var}(X) & \mathrm{Cov}(X, U) & \mathrm{Cov}(X, V) \\ \mathrm{Cov}(X, U) & \mathrm{Var}(U) & \mathrm{Cov}(U, V) \\ \mathrm{Cov}(X, V) & \mathrm{Cov}(U, V) & \mathrm{Var}(V) \end{pmatrix}$$
with $U = \log X$ and $V = X \log X$.
The entries of this matrix are given by (1) and (22)-(26) as
$$\mathrm{Var}(X) = \frac{\alpha \beta}{C^2 (C + 1)} = \sigma^2,$$
$$\mathrm{Cov}(X, U) = \frac{\beta}{C^2},$$
$$\mathrm{Cov}(X, V) = \sigma^2 \left( A + \frac{\beta}{\alpha C} + \frac{C}{C + 1} \right),$$
$$\mathrm{Var}(U) = E,$$
$$\mathrm{Cov}(U, V) = \frac{\alpha}{C} \left( \frac{A \beta}{\alpha C} + E - \frac{2 \beta}{\alpha C^2} \right)$$
and
$$\mathrm{Var}(V) = \sigma^2 \left( A + \frac{\beta}{\alpha C} \right)^2 + \frac{\sigma^2 C}{C + 1} \left[ \frac{\beta}{(\alpha + 1)(C + 1)} + 2 A + \frac{2 \beta}{\alpha C} \right] + \frac{\alpha (\alpha + 1)}{C (C + 1)} \left[ \frac{1}{C^2} + \frac{1}{(C + 1)^2} - \frac{1}{\alpha^2} - \frac{1}{(\alpha + 1)^2} + E \right].$$
Let $h_1(x, y, z) = \frac{x - x^2}{z - x y}$, $h_2(x, y, z) = \frac{1 - 2 x + x^2}{z - x y}$ and $(x_0, y_0, z_0) = \left( \frac{\alpha}{C}, \; A, \; \frac{\alpha}{C} \left[ \psi(\alpha + 1) - \psi(C + 1) \right] \right)$. Then,
$$\left. \frac{\partial h_1}{\partial x} \right|_{(x_0, y_0, z_0)} = \frac{C}{\beta} \left( \alpha A C + \beta - \alpha \right),$$
$$\left. \frac{\partial h_1}{\partial y} \right|_{(x_0, y_0, z_0)} = \frac{\alpha^2 C}{\beta},$$
$$\left. \frac{\partial h_1}{\partial z} \right|_{(x_0, y_0, z_0)} = -\frac{\alpha C^2}{\beta},$$
$$\left. \frac{\partial h_2}{\partial x} \right|_{(x_0, y_0, z_0)} = A C^2 - 2 C,$$
$$\left. \frac{\partial h_2}{\partial y} \right|_{(x_0, y_0, z_0)} = \alpha C$$
and
$$\left. \frac{\partial h_2}{\partial z} \right|_{(x_0, y_0, z_0)} = -C^2.$$
Let
$$W = \begin{pmatrix} \frac{C}{\beta} \left( \alpha A C + \beta - \alpha \right) & \frac{\alpha^2 C}{\beta} & -\frac{\alpha C^2}{\beta} \\ A C^2 - 2 C & \alpha C & -C^2 \end{pmatrix}.$$
Using the delta method,
$$\sqrt{n} \begin{pmatrix} \hat{\alpha} - \alpha \\ \hat{\beta} - \beta \end{pmatrix} \xrightarrow{d} N \left( \mathbf{0}_2, W \Sigma W^T \right)$$
as $n \to \infty$, where
$$W \Sigma W^T = \begin{pmatrix} \left( W \Sigma W^T \right)_{11} & \left( W \Sigma W^T \right)_{12} \\ \left( W \Sigma W^T \right)_{12} & \left( W \Sigma W^T \right)_{22} \end{pmatrix},$$
where
$$\left( W \Sigma W^T \right)_{11} = \frac{2 \alpha^2 C}{\beta} - \frac{\sigma^2 \alpha C^2}{\beta^2} \left( \alpha A C + \beta - \alpha \right) \left( 1 + A C + \frac{\beta}{\alpha} + \frac{2 C^2}{C + 1} \right) - \frac{\alpha^4 C^2 E}{\beta^2} + \frac{\alpha^2 C^4}{\beta^2} \, \mathrm{Var}(V),$$
$$\left( W \Sigma W^T \right)_{12} = -\frac{\sigma^2 \alpha C^2}{\beta} \left[ A C \left( A C + \frac{2 \beta}{\alpha} + \frac{2 C^2}{C + 1} \right) + \frac{\beta}{\alpha} \left( \frac{\beta}{\alpha} + \frac{C^2}{C + 1} - 1 \right) - 2 - \frac{3 C^2}{C + 1} \right] + \alpha C - \frac{\alpha^3 C^2 E}{\beta} + \frac{\alpha C^4}{\beta} \, \mathrm{Var}(V)$$
and
$$\left( W \Sigma W^T \right)_{22} = -\sigma^2 C^2 (A C - 2) \left( 2 + \frac{2 \beta}{\alpha} + \frac{2 C^2}{C + 1} + A C \right) - \alpha^2 C^2 E + C^4 \, \mathrm{Var}(V).$$
The theorem follows by simplification of these expressions.    □
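The same Monte Carlo device (again our own sketch) can be used to check Theorem 2; the Sigma11 line below transcribes the vic211 expression of Appendix A:
set.seed(1)
a=2; b=3; C=a+b
E=trigamma(a)-trigamma(C)
s2=a*b/(C^2*(C+1))
Sigma11=s2*a^2*C^2*(1+2*C^2/(C+1)+b*C^3/((a+1)*(C+1)^2))/b^2+
a^2*C*(2+a*C*E/(C+1))/b+a^3*C^3*(a+1)*(1/C^2+1/(C+1)^2-
1/a^2-1/(a+1)^2)/(b^2*(C+1))
n=5000; R=2000
ahat=replicate(R,{x=rbeta(n,a,b)
mean(x)*(1-mean(x))/(mean(x*log(x))-mean(x)*mean(log(x)))})
c(n*var(ahat),Sigma11)   # the two values should be close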

5. Numerical Comparison

In this section, we compare the asymptotic variances and asymptotic covariances of the estimators given by Theorems 1 and 2, Tamae et al.'s estimators and maximum likelihood estimators. We shall refer to Tamae et al.'s estimators as TIK estimators. Figure 1, Figure 2 and Figure 3 show how the asymptotic variances and asymptotic covariances of these estimators vary for α = 0.1, 0.2, …, 10 and β = 0.1, 0.2, …, 10. Figure 4, Figure 5 and Figure 6 show the differences in asymptotic variances and the differences in asymptotic covariances for two of the estimators at a time over the same grid of α and β.
We can observe the following from Figure 1, Figure 2 and Figure 3. The asymptotic variances of all of the estimators of α increase with respect to α and decrease with respect to β; the asymptotic variances of all of the estimators of β decrease with respect to α and increase with respect to β; and the asymptotic covariances of all of the estimators increase with respect to both α and β. All four estimators in each of Figure 1, Figure 2 and Figure 3 appear to behave similarly.
We can observe the following from Figure 4, Figure 5 and Figure 6. With respect to the asymptotic variances of the estimators of α: the estimator in Theorem 1 is more efficient than the corresponding estimator in Theorem 2 for all α > β; the estimator in Theorem 1 is more efficient than the corresponding TIK estimator in a parabolic region containing large values of α and small values of β; the maximum likelihood estimator is more efficient than the corresponding estimator in Theorem 1 for all values of α and β; the estimator in Theorem 2 is more efficient than the corresponding TIK estimator for all β > 2α; and the maximum likelihood estimator is more efficient than the corresponding estimator in Theorem 2 and the corresponding TIK estimator for all values of α and β. With respect to the asymptotic variances of the estimators of β: the estimator in Theorem 1 is more efficient than the corresponding estimator in Theorem 2 for all β < α; the estimator in Theorem 1 is more efficient than the corresponding TIK estimator for all 2β < α; the estimator in Theorem 2 is more efficient than the corresponding TIK estimator in a parabolic region containing small values of α and large values of β; and the maximum likelihood estimator is more efficient than the three remaining estimators for all values of α and β. With respect to the asymptotic covariances: the estimators in Theorem 1 are more efficient than the corresponding estimators in Theorem 2 for all β < α; the estimators in Theorem 1 are more efficient than the corresponding TIK estimators in a parabolic region containing large values of α and small values of β; the estimators in Theorem 1 are more efficient than the corresponding maximum likelihood estimators for all small values of β; the estimators in Theorem 2 are more efficient than the corresponding TIK estimators in a parabolic region containing small values of α and large values of β; the estimators in Theorem 2 are more efficient than the corresponding maximum likelihood estimators for all small values of β; and the TIK estimators are more efficient than the corresponding maximum likelihood estimators in a hyperbolic region containing either small values of both α and β, small values of α and large values of β, or large values of α and small values of β.
In summary, we can see that the estimators in Theorem 1 can be more efficient than the TIK estimators either when 2β < α or in a parabolic region containing large values of α and small values of β. The estimators in Theorem 1 can be more efficient than the maximum likelihood estimators for small values of β. The estimators in Theorem 2 can be more efficient than the TIK estimators either when β > 2α or in a parabolic region containing large values of β and small values of α. The estimators in Theorem 2 can be more efficient than the maximum likelihood estimators for small values of β. The estimators in Theorem 1 can be more efficient than the estimators in Theorem 2 when α > β.
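The regions described above can be reproduced with a short grid computation; the sketch below (ours) compares the asymptotic variances of the estimators of alpha from Theorem 1 and the TIK estimator, using the expressions from Appendix A:
grid=expand.grid(a=seq(0.1,10,0.1),b=seq(0.1,10,0.1))
sgn=with(grid,{C=a+b
D=trigamma(b)-trigamma(C)
F=trigamma(a)+trigamma(b)
s2=a*b/(C^2*(C+1))
vic111=a*b*(1+C^2*D)/(C+1)+a*(a+1)*C*(C^2+1)/(C+1)^3
TIK11=(s2*C^2*F+1)*a^2-a*b/(C+1)
sign(vic111-TIK11)})
table(sgn)   # -1: Theorem 1 estimator more efficient; 1: TIK estimator more efficient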

6. Simulation Study

Finite sample performance of the estimators given by Theorems 1 and 2, Tamae et al.'s estimators and maximum likelihood estimators is compared in this section. We use the following simulation scheme (a condensed R sketch is given after the list):
(i)
Simulate a random sample of size n from a beta distribution with parameters a = 2 and b = 2 ;
(ii)
Compute the estimators given by Theorems 1 and 2, Tamae et al.’s estimators and maximum likelihood estimators;
(iii)
Repeat steps (i) and (ii) 1000 times;
(iv)
Compute the bias and mean squared error of the estimators;
(v)
Compute also the p-value for [10]’s test of bivariate normality of the estimators;
(vi)
Repeat steps (i) to (v) for n = 10, 20, …, 1000.
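The following condensed R sketch (our reconstruction) implements steps (i)-(iv) for the Theorem 2 estimators and a single n; step (v) additionally requires an implementation of the test of [10] and is omitted here:
set.seed(1)
a=2; b=2; n=100; R=1000
est=t(replicate(R,{x=rbeta(n,a,b)
d=mean(x*log(x))-mean(x)*mean(log(x))
c(mean(x)*(1-mean(x))/d,(1-mean(x))^2/d)}))
bias=colMeans(est)-c(a,b)
mse=colMeans((est-matrix(c(a,b),R,2,byrow=TRUE))^2)
rbind(bias,mse)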
Biases versus n for all four estimators are shown in Figure 7. Mean squared errors versus n for all four estimators are shown in Figure 8. p-values versus n for all four estimators are shown in Figure 9.
We observe the following from the figures: increasing n leads to the biases generally decreasing to zero; the relative performance of the estimators with respect to bias appears similar; the biases appear reasonably small for all n ≥ 200; increasing n leads to the mean squared errors generally decreasing to zero; the mean squared errors appear largest for Tamae et al.'s estimators and smallest for maximum likelihood estimators; the mean squared errors appear reasonably small for all n ≥ 200; and asymptotic normality of all estimators appears to have been achieved for all n ≥ 400.
These observations are for a = 2 and b = 2 . However, the same observations held for a wide range of other values of a and b. In particular, increasing n always led to the biases decreasing to zero, increasing n always led to the mean squared errors decreasing to zero, and the mean squared errors always appeared largest for Tamae et al.’s estimators and smallest for maximum likelihood estimators.

7. Real Data Illustration

In this section, we compare the performance of the estimators given by Theorems 1 and 2, Tamae et al.'s estimators and maximum likelihood estimators using a real dataset. The data are the proportions voting Remain in Brexit (EU referendum) poll outcomes for 127 polls from January 2016 to the referendum in June 2016. The actual data values are 0.52, 0.55, 0.51, 0.49, 0.44, 0.54, 0.48, 0.41, 0.45, 0.42, 0.53, 0.45, 0.44, 0.44, 0.42, 0.42, 0.37, 0.46, 0.43, 0.39, 0.45, 0.44, 0.46, 0.40, 0.48, 0.42, 0.44, 0.45, 0.43, 0.43, 0.48, 0.41, 0.43, 0.40, 0.41, 0.42, 0.44, 0.51, 0.44, 0.44, 0.41, 0.41, 0.45, 0.55, 0.44, 0.44, 0.52, 0.55, 0.47, 0.43, 0.55, 0.38, 0.36, 0.38, 0.44, 0.42, 0.44, 0.43, 0.42, 0.49, 0.39, 0.41, 0.45, 0.43, 0.44, 0.51, 0.51, 0.49, 0.48, 0.43, 0.53, 0.38, 0.40, 0.39, 0.35, 0.45, 0.42, 0.40, 0.39, 0.44, 0.51, 0.39, 0.35, 0.41, 0.51, 0.45, 0.49, 0.40, 0.48, 0.41, 0.46, 0.47, 0.43, 0.45, 0.48, 0.49, 0.40, 0.40, 0.40, 0.39, 0.41, 0.39, 0.48, 0.48, 0.37, 0.38, 0.42, 0.51, 0.45, 0.40, 0.54, 0.36, 0.43, 0.49, 0.41, 0.36, 0.42, 0.38, 0.55, 0.44, 0.54, 0.41, 0.52, 0.42, 0.38, 0.42, 0.44.
We fitted the beta distribution using the four estimators. The estimates of ( α , β ) of the beta distribution were (45.315, 57.107), (45.196, 56.974), (44.061, 55.543) and (46.139, 58.163) for the maximum likelihood estimators, TIK estimators, the estimators given by Theorem 1 and the estimators given by Theorem 2, respectively.
The standard errors of the estimates of (α, β) can be obtained using Theorems 1 and 2. However, the theorems are asymptotic and may not reflect the finite sample variability of the estimators. We used the following leave-one-out procedure to check variability (an R sketch is given after the list):
(i)
Remove the ith observation from the data;
(ii)
Refit the beta distribution to the modified data using the four estimators;
(iii)
Repeat steps (i) and (ii) for i = 1, 2, …, 127.
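A short R sketch of this procedure (ours) for the Theorem 1 estimator of alpha, where x is the vector of 127 poll proportions above:
loo=sapply(seq_along(x),function(i)
{xi=x[-i]
d=mean(xi)*mean(log(1-xi))-mean(xi*log(1-xi))
mean(xi)^2/d})
sd(loo)   # compare with the value 0.435 reported below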
The histograms (not shown here) looked similar for all four estimators. However, the standard deviations of the histograms for the estimate of α were 0.448, 0.448, 0.435 and 0.459 for the maximum likelihood estimator, the TIK estimator, the estimator given by Theorem 1 and the estimator given by Theorem 2, respectively. The standard deviations of the histograms for the estimate of β were 0.600, 0.599, 0.580 and 0.617 for the same four estimators, respectively. Hence, the estimators given by Theorem 1 provide the best performance with respect to accuracy of fit.
The probability and quantile plots (not shown here) looked similar for all four estimators. However, the sums of squares of the deviations between expected and observed probabilities were 0.2970, 0.2932, 0.2884 and 0.2981 for the maximum likelihood estimators, the TIK estimators, the estimators given by Theorem 1 and the estimators given by Theorem 2, respectively. The sums of squares of the deviations between expected and observed quantiles were 0.0101, 0.0101, 0.0100 and 0.0102 for the same four estimators, respectively. Hence, the estimators given by Theorem 1 provide the best performance with respect to goodness of fit.

8. Conclusions

Motivated by [8], we have proposed two closed form estimators for the parameters of the beta distribution. We have proved their asymptotic normality and derived expressions for their asymptotic variances and asymptotic covariances. Through a numerical comparison and a real data application, we have shown that the proposed estimators can be more efficient than Tamae et al.'s estimators as well as maximum likelihood estimators for certain parameter values. We have also performed a simulation study to check the finite sample behavior of all the estimators.
Future work includes a theoretical comparison of the performance of the proposed estimators, Tamae et al.'s estimators and maximum likelihood estimators for finite n. Another direction is to see whether closed form estimators can be derived for known extensions of the beta distribution, including bivariate, multivariate, matrix variate and complex variate beta distributions [1,11].

Author Contributions

Methodology, V.M.N.; Software, S.N.; Formal analysis, V.M.N.; Writing—original draft, V.M.N.; Supervision, S.N. All authors have read and agreed to the published version of the manuscript.

Funding

This paper has received no external funding.

Data Availability Statement

Data and code are given as part of the manuscript.

Acknowledgments

The authors would like to thank the Editor and the four referees for careful reading and comments which greatly improved the paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. R Code

The following code computes the estimators given by Theorems 1 and 2, Tamae et al.’s estimators and maximum likelihood estimators.
# x: vector of observations in (0,1)
# Negative log-likelihood of the beta distribution
f=function(p)
{tt=1.0e20
if (p[1]>0&p[2]>0) tt=-sum(dbeta(x,shape1=p[1],shape2=p[2],log=TRUE))
return(tt)}
# Maximum likelihood estimates
est=optim(par=c(1,1),fn=f)
a=est$par[1]
b=est$par[2]
# TIK estimators of Tamae et al. [8]
a2=mean(x)/(mean(x*log(x/(1-x)))-mean(x)*mean(log(x/(1-x))))
b2=(1-mean(x))/(mean(x*log(x/(1-x)))-mean(x)*mean(log(x/(1-x))))
# Estimators given by (12) (Theorem 1)
a3=(mean(x))**2/(mean(x)*mean(log(1-x))-mean(x*log(1-x)))
b3=(mean(x)-(mean(x))**2)/(mean(x)*mean(log(1-x))-mean(x*log(1-x)))
# Estimators given by (16) (Theorem 2)
a4=(mean(x))*(1-mean(x))/(mean(x*log(x))-mean(x)*mean(log(x)))
b4=(1-mean(x))**2/(mean(x*log(x))-mean(x)*mean(log(x)))
The following code computes the asymptotic variances and asymptotic covariances of the estimators given by Theorems 1 and 2, Tamae et al.’s estimators and maximum likelihood estimators.
# Quantities A-G and sigma^2 as defined in Section 1 (a, b play the roles of alpha, beta)
A=digamma(a)-digamma(a+b)
B=digamma(b)-digamma(a+b)
C=a+b
D=trigamma(b)-trigamma(a+b)
E=trigamma(a)-trigamma(a+b)
F=trigamma(a)+trigamma(b)
G=trigamma(a)*trigamma(b)-trigamma(a)*trigamma(a+b)-trigamma(b)*trigamma(a+b)
s2=a*b/((a+b)**2*(a+b+1))
# Theorem 1: asymptotic variances and covariance
vic111=a*b*(1+C*C*D)/(C+1)+a*(a+1)*C*(C*C+1)/(C+1)**3
vic112=(b*b*C*C*D-a*(b+C))/(C+1)+(b*C)*((a+2)*C*C+C+a+1)/(C+1)**3
vic122=a*b/(C+1)+b*b*C*(b*C*D-a+1)/(a*(C+1))+
2*(a+1)*b*C*(b*C*C-a*C-a)/(a*(C+1)**3)
# Theorem 2: asymptotic variances and covariance
vic211=s2*a*a*C*C*(1+2*C*C/(C+1)+b*C**3/((a+1)*(C+1)**2))/(b*b)+
a*a*C*(2+a*C*E/(C+1))/b+a**3*C**3*(a+1)*(1/C**2+1/(C+1)**2-
1/(a*a)-1/(a+1)**2)/(b*b*(C+1))
vic212=s2*a*C*C*((b/a)*(1+C*C/(C+1))+b*C**3/((a+1)*(C+1)**2)+
2+3*C*C/(C+1))/b+a*C*(1+a*C*E/(C+1))+a*a*C**3*(a+1)*(1/C**2+
1/(C+1)**2-1/a**2-1/(a+1)**2)/(b*(C+1))
vic222=s2*C*C*(4*(1+b/a+C**2/(C+1))+(b/a)*(b/a+2*C**2/(C+1))+
b*C**3/((a+1)*(C+1)**2))+a*b*C*C*E/(C+1)+a*(a+1)*C**3*(1/C**2+
1/(C+1)**2-1/a**2-1/(a+1)**2)/(C+1)
# TIK estimators: asymptotic variances and covariance
TIK11=(s2*C*C*F+1)*a**2-a*b/(C+1)
TIK22=(s2*C*C*F+1)*b**2-a*b/(C+1)
TIK12=(s2*C*C*F+1)*a*b-(C*C-a*b)/(C+1)
# Maximum likelihood estimators: asymptotic variances and covariance
MLE11=D/G
MLE22=E/G
MLE12=(trigamma(a+b))/G

References

  1. Gupta, A.K.; Nadarajah, S. Handbook of Beta Distribution and Its Applications; CRC Press: New York, NY, USA, 2004.
  2. Johnson, N.L.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions; John Wiley and Sons: New York, NY, USA, 1995; Volume 2.
  3. Kotz, S.; van Dorp, J.R. Beyond Beta: Other Continuous Families of Distributions with Bounded Support and Applications; World Scientific Publishing Company: Hackensack, NJ, USA, 2004.
  4. Larson, H.J. Introduction to Probability Theory and Statistical Inference; John Wiley and Sons: New York, NY, USA, 1982.
  5. Seber, G.A.F. The Linear Model and Hypothesis: A General Unifying Theory; Springer: Cham, Switzerland, 2015.
  6. Ferrari, S.L.P.; Cribari-Neto, F. Beta regression for modelling rates and proportions. J. Appl. Stat. 2004, 31, 799-815.
  7. Ye, Z.-S.; Chen, N. Closed-form estimators for the gamma distribution derived from likelihood equations. Am. Stat. 2017, 71, 177-181.
  8. Tamae, H.; Irie, K.; Kubokawa, T. A score-adjusted approach to closed form estimators for the gamma and beta distributions. Jpn. J. Stat. Data Sci. 2020, 3, 543-561.
  9. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2023.
  10. Henze, N.; Zirkler, B. A class of invariant consistent tests for multivariate normality. Commun. Stat.-Theory Methods 1990, 19, 3595-3617.
  11. Balakrishnan, N.; Lai, C.D. Continuous Bivariate Distributions; Springer Verlag: New York, NY, USA, 2009.
Figure 1. Asymptotic variance of the estimator of α versus α and β for the estimator given by Theorem 1 (top left), the estimator given by Theorem 2 (top right), TIK estimator (bottom left), and maximum likelihood estimator (bottom right).
Figure 2. Asymptotic variance of the estimator of β versus α and β for the estimator given by Theorem 1 (top left), the estimator given by Theorem 2 (top right), TIK estimator (bottom left), and maximum likelihood estimator (bottom right).
Figure 3. Asymptotic covariance between the estimators of α and β versus α and β for the estimators given by Theorem 1 (top left), the estimators given by Theorem 2 (top right), TIK estimators (bottom left), and maximum likelihood estimators (bottom right).
Figure 4. Sign of the difference between the asymptotic variances of the estimators of α given by Theorems 1 and 2 (top left); sign of the difference between the asymptotic variances of the estimator of α given by Theorem 1 and TIK estimator (top right); sign of the difference between the asymptotic variances of the estimator of α given by Theorem 1 and maximum likelihood estimator (middle left); sign of the difference between the asymptotic variances of the estimator of α given by Theorem 2 and TIK estimator (middle right); sign of the difference between the asymptotic variances of the estimator of α given by Theorem 2 and maximum likelihood estimator (bottom left); sign of the difference between the asymptotic variances of TIK estimator of α and maximum likelihood estimator (bottom right).
Figure 5. Sign of the difference between the asymptotic variances of the estimators of β given by Theorems 1 and 2 (top left); sign of the difference between the asymptotic variances of the estimator of β given by Theorem 1 and TIK estimator (top right); sign of the difference between the asymptotic variances of the estimator of β given by Theorem 1 and maximum likelihood estimator (middle left); sign of the difference between the asymptotic variances of the estimator of β given by Theorem 2 and TIK estimator (middle right); sign of the difference between the asymptotic variances of the estimator of β given by Theorem 2 and maximum likelihood estimator (bottom left); sign of the difference between the asymptotic variances of TIK estimator of β and maximum likelihood estimator (bottom right).
Figure 6. Sign of the difference between the asymptotic covariances of the estimators given by Theorems 1 and 2 (top left); sign of the difference between the asymptotic covariances of the estimators given by Theorem 1 and TIK estimator (top right); sign of the difference between the asymptotic covariances of the estimators given by Theorem 1 and maximum likelihood estimator (middle left); sign of the difference between the asymptotic covariances of the estimators given by Theorem 2 and TIK estimator (middle right); sign of the difference between the asymptotic covariances of the estimators given by Theorem 2 and maximum likelihood estimator (bottom left); sign of the difference between the asymptotic covariances of TIK estimator and maximum likelihood estimator (bottom right).
Figure 7. Biases of the estimators of a (left) and b (right) versus n.
Figure 8. Mean squared errors of the estimators of a (left) and b (right) versus n.
Figure 9. P-values for [10]'s test of bivariate normality of the estimators versus n. The horizontal line corresponds to 0.05.