Measuring Dispersion and Serial Dependence in Ordinal Time Series Based on the Cumulative Paired ϕ-Entropy

Weiß, Christian H.

doi:10.3390/e24010042

Open AccessArticle

Measuring Dispersion and Serial Dependence in Ordinal Time Series Based on the Cumulative Paired ϕ-Entropy

by

Christian H. Weiß

Department of Mathematics and Statistics, Helmut Schmidt University, 22043 Hamburg, Germany

Entropy 2022, 24(1), 42; https://doi.org/10.3390/e24010042

Submission received: 1 December 2021 / Revised: 22 December 2021 / Accepted: 23 December 2021 / Published: 26 December 2021

(This article belongs to the Section Information Theory, Probability and Statistics)

Download

Browse Figures

Versions Notes

Abstract

:

The family of cumulative paired

ϕ

-entropies offers a wide variety of ordinal dispersion measures, covering many well-known dispersion measures as a special case. After a comprehensive analysis of this family of entropies, we consider the corresponding sample versions and derive their asymptotic distributions for stationary ordinal time series data. Based on an investigation of their asymptotic bias, we propose a family of signed serial dependence measures, which can be understood as weighted types of Cohen’s

κ

, with the weights being related to the actual choice of

ϕ

. Again, the asymptotic distribution of the corresponding sample

κ_{ϕ}

is derived and applied to test for serial dependence in ordinal time series. Using numerical computations and simulations, the practical relevance of the dispersion and dependence measures is investigated. We conclude with an environmental data example, where the novel

ϕ

-entropy-related measures are applied to an ordinal time series on the daily level of air quality.

Keywords:

Cohen’s

κ

; dispersion; entropy; ordinal time series; serial dependence

1. Introduction

During the last years, ordinal data in general [1] and ordinal time series in particular [2] received a great amount of interest in research and applications. Here, a random variable X is said to be ordinal if X has a bounded qualitative range exhibiting a natural order among the categories. We denote the range as

S = {s_{0}, s_{1}, \dots, s_{m}}

with some

m \in N = {1, 2, \dots}

, and we assume that

s_{0} < \dots < s_{m}

. The realized data are denoted as

x_{1}, \dots, x_{n}

with

n \in N

. They are assumed to stem either from independent and identically distributed (i. i. d.) replications of X (then, we refer to

x_{1}, \dots, x_{n}

as an ordinal random sample), or from a stationary ordinal stochastic process

{(X_{t})}_{Z = {\dots, - 1, 0, 1, \dots}}

(then,

x_{1}, \dots, x_{n}

are said to be an ordinal time series).

In what follows, we take up several recent works on measures of dispersion and serial dependence in ordinal (time series) data. Regarding ordinal dispersion, the well-known measures such as variance or mean absolute deviation cannot be used as the data are not quantitative. Therefore, several tailor-made measures for ordinal dispersion have been developed and investigated in the literature, see, among others, [3,4,5,6,7,8,9,10,11,12,13,14,15]. The unique feature of all these measures is that they rely on the cumulative distribution function (CDF) of X, i.e., on

f = {(f_{0}, \dots, f_{m - 1})}^{⊤}

with

f_{i} = P (X \leq s_{i})

for

i = 0, \dots, m

(

f_{m}

is omitted in f as it necessarily equals one). They classify any one-point distribution on

S

as a scenario of minimal dispersion, i.e., if all probability mass concentrates on one category from

S

(maximal consensus):

f_{o n e} \in \{(\begin{matrix} 1 \\ 1 \\ ⋮ \\ 1 \end{matrix}), (\begin{matrix} 0 \\ 1 \\ ⋮ \\ 1 \end{matrix}), \dots, (\begin{matrix} 0 \\ ⋮ \\ 0 \\ 1 \end{matrix}), (\begin{matrix} 0 \\ ⋮ \\ 0 \\ 0 \end{matrix})\} .

By contrast, maximal dispersion is achieved exactly in the case of the extreme two-point distribution (polarized distribution),

f_{t w o} = {(\frac{1}{2}, \dots, \frac{1}{2})}^{⊤}

, where we have 50 % probability mass in each of the outermost categories (maximal dissent). Further details on ordinal dispersion measures are presented in Section 2 below.

Building upon earlier works by Klein [16], Yager [17], it was recently shown by Klein Doll [18], Klein et al. [19] that the aforementioned ordinal dispersion measures can be subsumed under the family of so-called “cumulative paired

ϕ

-entropies” (see Section 2), abbreviated as

{CPE}_{ϕ}

, which constitutes the starting point of the present article. Our first main task is to derive the asymptotic distribution of the corresponding sample version,

{\hat{CPE}}_{ϕ}

, for both i. i. d. and time series data, and to check the finite sample performance of the resulting approximate distribution, see Section 3 and Section 5.

In the recent paper by Weiß [20] on the asymptotics of some well-known dispersion measures for nominal data (i.e., qualitative data without a natural ordering), it turned out that the corresponding dispersion measures—if these are applied to time series data—are related to specific measures of serial dependence. Therefore, our second main task is to explore for a similar relation in the ordinal case, if considering the

{CPE}_{ϕ}

-family for measuring dispersion. Ordinal dependence measures can be defined in analogy to the popular autocorrelation function (ACF) for quantitative time series, namely by using the lagged bivariate CDF

f_{i j} (h) = P (X_{t} \leq s_{i}, X_{t - h} \leq s_{j})

for time lags

h \in N

as their base [14]. For the novel family of

κ_{ϕ} (h)

measures, which cover the existing ordinal Cohen’s

κ

[14,15] as a special case, we derive the asymptotics under the null hypothesis of i. i. d. time series data, see Section 4. This result is used in Section 5 to test for significant serial dependence, in analogy to the application of the sample ACF to quantitative time series. In Section 6, we discuss an illustrative real-world data example from an environmental application, namely regarding the daily level of air quality. The article concludes in Section 7.

2. The Family of Cumulative Paired $ϕ$ -Entropies

Klein Doll [18], Klein et al. [19] proposed and investigated a family of cumulative paired

ϕ

-entropies. Although their main focus was on continuously distributed random variables, they also referred to the ordinal case and pointed out that many well-known ordinal dispersion measures are included in this family. Here, we exclusively concentrate on the ordinal case as introduced in Section 1, and we define the (normalized) cumulative paired

ϕ

-entropy as (see Section 2.3 in Klein et al. [19])

{CPE}_{ϕ} (f) = \frac{1}{2 m ϕ (1 / 2)} \sum_{i = 0}^{m - 1} (ϕ (f_{i}) + ϕ (1 - f_{i})) .

(1)

The entropy generating function (EGF)

ϕ

is defined on

[0; 1]

, it satisfies

ϕ (0) = ϕ (1) = 0

, and it is assumed to be concave on

[0; 1]

. Later in Section 3, when deriving the asymptotic distribution of the sample counterpart

{\hat{CPE}}_{ϕ} = {CPE}_{ϕ} (\hat{f})

, we shall also require that

ϕ

is (twice) differentiable. As pointed out in Section 2.3 and 3.1 of Klein et al. [19], several well-known measures of ordinal dispersion can be expressed by (1) with an appropriate choice of

ϕ

.

Leik’s ordinal variation [11] corresponds to the choice $ϕ (z) = min {z, 1 - z}$ (which is not differentiable in $z = 1 / 2$ ) because of the equality $| 2 z - 1 | = 1 - 2 min {z, 1 - z}$ :

$LOV = 1 - \frac{1}{m} \sum_{i = 0}^{m - 1} | 2 f_{i} - 1 | = \frac{2}{m} \sum_{i = 0}^{m - 1} min {f_{i}, 1 - f_{i}} .$

(2)
An analogous argument applies to the whole family of ordinal variation measures, ${OV}_{q}$ with $q \geq 1$ [3,9,10,13]. Choosing $ϕ_{q} (z) = 1 - {(1 - 2 min {z, 1 - z})}^{q} = 1 - {| 2 z - 1 |}^{q}$ with $ϕ_{q} (1 / 2) = 1$ , we have the relation

${OV}_{q} = 1 - {(\frac{1}{m} \sum_{i = 0}^{m - 1} {| 2 f_{i} - 1 |}^{q})}^{1 / q} = 1 - {(1 - {CPE}_{ϕ} (f))}^{1 / q} .$

(3)

Note that $q = 1$ leads to the LOV, and the case $q = 2$ is known as the coefficient of ordinal variation, $COV = {OV}_{2}$ [4,8].
Related to the previous case ${OV}_{q}$ with $q = 2$ , ${CPE}_{ϕ} (f)$ becomes the widely-used index of ordinal variation [3,7,8] if choosing $ϕ (z) = z (1 - z)$ :

$IOV = \frac{4}{m} \sum_{i = 0}^{m - 1} f_{i} (1 - f_{i}) = 1 - \frac{1}{m} \sum_{i = 0}^{m - 1} {(2 f_{i} - 1)}^{2} .$

(4)
The cumulative paired (Shannon) entropy [12] corresponds to the choice $ϕ (z) = - z ln z$ (with the convention $0 ln 0 = 0$ ):

$CPE = \frac{- 1}{m ln 2} \sum_{i = 0}^{m - 1} (f_{i} ln f_{i} + (1 - f_{i}) ln (1 - f_{i})) .$

(5)
$ϕ (z) = - z ln z$ can be embedded into the family of a-entropies [21,22],

$ϕ_{a} (z) = \frac{z - z^{a}}{a - 1} with a > 0 and a \neq 1,$

(6)

as the boundary case $a \to 1$ . Plugging-in (6) into (1), one obtains

${CPE}_{a} = \frac{2^{a - 1}}{m (2^{a - 1} - 1)} \sum_{i = 0}^{m - 1} (1 - f_{i}^{a} - {(1 - f_{i})}^{a}) .$

(7)

Note that both $a = 2$ and $a = 3$ in (7) lead to the IOV (4).

The EGFs

ϕ

involved in (2)–(4) are symmetric around

1 / 2

, i.e., they satisfy

ϕ (z) = ϕ (1 - z)

. This is also illustrated by Figure 1, where the cases

q = 2

(left) and

a = 2

(right; both in bold black) agree with each other except a scaling factor. The plotted EGFs

ϕ_{q} (z)

are maximal in

1 / 2

with

ϕ_{q} (1 / 2) = 1

. The EGF

ϕ_{a} (z)

is maximal in

a^{1 / (1 - a)}

with

ϕ_{a} (a^{1 / (1 - a)}) = a^{a / (1 - a)}

for

a \neq 1

, whereas in the boundary case

a \to 1

,

ϕ_{a} (z)

takes its maximal value

1 / e

at

1 / e

. However, since

{CPE}_{ϕ}

in (1) is normalized, it is not necessary to care for a possible rescaling of

ϕ_{a} (z)

if computing

{CPE}_{ϕ}

.

Remark 1.

The dotted curve in the right part of Figure 1, which connects the maxima of

ϕ_{a} (z)

for different a, is computed by using the Lambert W function (also referred to as the product logarithm). This can be seen as follows:

ϕ_{a} (z)

is maximal in

z_{0} = a^{1 / (1 - a)}

with

ϕ_{a} (z_{0}) = z_{0}^{a}

. It holds that

a \cdot z_{0}^{a} = a^{1 + a / (1 - a)} = z_{0} .

Using that

z_{0}^{a} = exp (a ln z_{0})

, this implies that

(a ln z_{0}) \cdot exp (a ln z_{0}) = z_{0} ln z_{0} .

The Lambert W function is defined to solve the equation

w exp w = z

as

w = W (z)

, so we get

a ln z_{0} = W (z_{0} ln z_{0}) \Leftrightarrow a = W (z_{0} ln z_{0}) / ln z_{0} .

Thus, since

ϕ_{a} (z_{0}) = a^{a / (1 - a)} = a^{1 / (1 - a)} / a = z_{0} / a

, the dotted curve in Figure 1 follows the function

z ln z / W (z ln z)

. More precisely, since

z ln z

is minimal in

z = 1 / e

with minimal value

- 1 / e

, we have to use the principal branch

W (z) = W_{0} (z)

for

z \leq 1 / e

, and the secondary branch

W (z) = W_{- 1} (z)

for

z > 1 / e

.

Let us conclude this section with some examples of

{CPE}_{ϕ}

measures (see Figure 2). For all examples, we set

m = 4

, i.e., we have five ordinal categories like, for example, in the case of a simple Likert scale. In the left part of Figure 2, f was computed according to the binomial distribution

Bin (4, p)

, which has maximal dispersion for

p = 0.5

. This is also recognized by any of the plotted measures

{CPE}_{ϕ}

, with their maximal dispersion values varying around 0.6. This medium level of dispersion is plausible because

Bin (4, 0.5)

is far away from the extreme two-point distribution. The right part of Figure 2, by contrast, shows the

{CPE}_{ϕ}

for the two-point distribution with

f_{0} = p

(

= f_{1} = \dots = f_{m - 1}

). So

p = 0

corresponds to a one-point distribution in

s_{m}

(minimal dispersion), and

p = 0.5

to the extreme two-point distribution (maximal dispersion). Accordingly, all measures reach their extreme values 0 and 1, respectively. It is interesting to note that outside these extreme cases, the dispersion measures judge the actual dispersion level quite differently; see the related discussion in Kvålseth [10], Weiß [13].

3. Asymptotic Distribution of Sample ${CPE}_{ϕ}$

From now on, we focus on the sample version of

{CPE}_{ϕ}

from (1), i.e., on

{\hat{CPE}}_{ϕ} = {CPE}_{ϕ} (\hat{f})

, where

\hat{f}

denotes the vector of cumulative relative frequencies computed from

X_{1}, \dots, X_{n}

. To derive the asymptotic distribution of

{\hat{CPE}}_{ϕ}

, which is to be used as an approximation to the true distribution of

{\hat{CPE}}_{ϕ}

, we recall the following results from Weiß [14]. Provided that the data-generating process (DGP) satisfies appropriate mixing conditions, e.g.,

α

-mixing with exponentially decreasing weights (which includes the i. i. d.-case), it holds that

\begin{matrix} \sqrt{n} (\hat{f} - f) \overset{d}{\to} N (0, Σ) with Σ = {(σ_{i j})}_{i, j = 0, \dots, m - 1}, where \\ σ_{i j} \overset{a}{=} f_{min {i, j}} - f_{i} f_{j} + \sum_{h = 1}^{\infty} (f_{i j} (h) + f_{j i} (h) - 2 f_{i} f_{j}) . \end{matrix}

(8)

For an analogous result in the presence of missing data, see Theorem 1 in Weiß [15]. In (8), finite (co)variances are ensured if we require that

\sum_{h = 1}^{\infty} (f_{i j} (h) - f_{i} f_{j}) < \infty

holds for all

i, j

(“short memory”). In particular, all sums

\sum_{h = 1}^{\infty} (f_{i j} (h) + f_{j i} (h) - 2 f_{i} f_{j})

vanish in the i. i. d.-case. Otherwise, they account for the serial dependence of the DGP. This can be seen by considering the trace of

Σ

, which equals

\sum_{i = 0}^{m - 1} (f_{i} (1 - f_{i}) + 2 \sum_{h = 1}^{\infty} (f_{i i} (h) - f_{i}^{2})) = \sum_{i = 0}^{m - 1} f_{i} (1 - f_{i}) (1 + 2 \sum_{h = 1}^{\infty} κ_{ord} (h)) .

Here, the term

\sum_{i = 0}^{m - 1} f_{i} (1 - f_{i})

agrees with the IOV in (4) except the normalizing factor

\frac{4}{m}

, i.e., it corresponds to

{CPE}_{ϕ}

with

ϕ (z) = z (1 - z)

. The term

κ_{ord} (h)

, in turn, is the ordinal Cohen’s

κ

[14] defined by

κ_{ord} (h) = \frac{\sum_{i = 0}^{m - 1} (f_{i i} (h) - f_{i}^{2})}{\sum_{i = 0}^{m - 1} f_{i} (1 - f_{i})} .

(9)

It is a measure of signed serial dependence, which evaluates the extent of (dis)agreement between

X_{t}

and

X_{t - h}

by positive (negative) values.

Based on Taylor expansions of

{\hat{CPE}}_{ϕ} = {CPE}_{ϕ} (\hat{f})

in f, we shall now study its asymptotic behavior. To establish asymptotic normality and to derive the asymptotic variance of

{\hat{CPE}}_{ϕ}

, we need

ϕ

to be differentiable. For an asymptotic bias correction, which relies on a second-order Taylor expansion,

ϕ

even has to be twice differentiable (then, the concavity of

ϕ

implies that

ϕ^{″} (z) < 0

).

Remark 2.

From the examples discussed in Section 2, the EGF corresponding to the LOV (i.e.,

ϕ_{q}

with

q = 1

) is not differentiable (in

z = 1 / 2

).

ϕ_{q}

is only once differentiable for

1 < q < 2

, while

q \geq 2

ensures

ϕ_{q}

to be at least twice differentiable; see Example 1 below. Accordingly, in these cases, it is not possible to establish asymptotic normality (

q = 1

) or an asymptotic bias correction (

1 < q < 2

), respectively. In fact, Weiß [13] was faced with the same problem when studying the asymptotics of the sample

{OV}_{q}

, and a solution to it was not possible. In simulations, he showed that even modified asymptotics (using a folded-normal distribution) did not lead to an acceptable approximation quality. We shall therefore exclude such cases from our further discussion. If, in applications, the cases

q = 1

or

1 < q < 2

appear to be relevant, a bootstrap approach might be an option to gain insight into the sample distribution of

{\hat{CPE}}_{ϕ}

.

Assuming

ϕ

to be (twice) differentiable, the partial derivatives of

{CPE}_{ϕ} (f)

according to (1) are

\begin{matrix} \frac{\partial}{\partial f_{k}} C P E_{ϕ} (f) = & \frac{1}{2 m ϕ (1 / 2)} (ϕ^{'} (f_{k}) - ϕ^{'} (1 - f_{k})) = : d_{k}, \\ \frac{\partial^{2}}{\partial^{2} f_{k}} C P E_{ϕ} (f) = & \frac{1}{2 m ϕ (1 / 2)} (ϕ^{″} (f_{k}) + ϕ^{″} (1 - f_{k})) = : h_{k k}, \\ \frac{\partial^{2}}{\partial f_{k} \partial f_{l}} C P E_{ϕ} (f) = & 0 for k \neq l . \end{matrix}

(10)

We denote the gradient (Jacobian) of

{CPE}_{ϕ} (f)

by

D = (d_{0}, \dots, d_{m - 1})

, and the Hessian equals

H = d i a g (h_{00}, \dots, h_{m - 1, m - 1})

. Note that if

ϕ

is symmetric around

1 / 2

, i.e., if

ϕ (z) = ϕ (1 - z)

, then

d_{k} = ϕ^{'} (f_{k}) / (m ϕ (1 / 2))

and

h_{k k} = ϕ^{″} (f_{k}) / (m ϕ (1 / 2))

.

Example 1.

Let us compute the derivatives required in (10) for the EGF examples presented in Section 2.

For

ϕ_{a} (z) = \frac{z - z^{a}}{a - 1}

, the constant factor becomes

\frac{1}{2 m ϕ_{a} (1 / 2)} = \frac{2^{a - 1} (a - 1)}{m (2^{a - 1} - 1)}

, and the derivatives are

ϕ_{a}^{'} (z) = \frac{1 - a z^{a - 1}}{a - 1}

and

ϕ_{a}^{″} (z) = - a z^{a - 2}

. Here,

ϕ_{a}^{'} (z)

exists in the boundary value

z = 0

only if

a > 1

, and

ϕ_{a}^{″} (z)

if

a \geq 2

. Important special cases are

ϕ (z) = - z ln z \Rightarrow \frac{1}{2 m ϕ (1 / 2)} = \frac{1}{m ln 2}, ϕ^{'} (z) = - 1 - ln z, ϕ^{″} (z) = - 1 / z for a \to 1,

ϕ (z) = z (1 - z) \Rightarrow \frac{1}{2 m ϕ (1 / 2)} = \frac{2}{m}, ϕ^{'} (z) = 1 - 2 z, ϕ^{″} (z) = - 2 for a = 2,

and for

a = 3

,

ϕ (z) = \frac{1}{2} z (1 - z^{2}) \Rightarrow \frac{1}{2 m ϕ (1 / 2)} = \frac{8}{3 m}, ϕ^{'} (z) = \frac{1}{2} (1 - 3 z^{2}), ϕ^{″} (z) = - 3 z .

Note that both

a = 2, 3

lead to the same expressions for

d_{k}, h_{k k}

in (10); see Table 1. This is in accordance with the equivalence of these cases as discussed after (7).

For

ϕ_{q} (z) = 1 - {(1 - 2 min {z, 1 - z})}^{q} = 1 - {| 2 z - 1 |}^{q}

with

\frac{1}{2 m ϕ_{q} (1 / 2)} = \frac{1}{2 m}

, the derivatives are expressed using the sign function,

sgn (\cdot)

, which is not continuous at 0. Note that the following relations hold:

\frac{d}{d x} | x | = sgn (x) f o r x \neq 0, x = sgn (x) \cdot | x |, | x | = sgn (x) \cdot x .

For

q \geq 2

, it then follows by applying the chain rule and the product rule that

\begin{matrix} ϕ_{q}^{'} (z) = & {- 2 q | 2 z - 1 |}^{q - 1} sgn (2 z - 1) = - 2 q (2 z - 1) {| 2 z - 1 |}^{q - 2}, \\ ϕ_{q}^{″} (z) = & {- 4 q | 2 z - 1 |}^{q - 2} - 4 q (q - 2) (2 z - 1) {| 2 z - 1 |}^{q - 3} sgn (2 z - 1) \\ = & {- 4 q (q - 1) | 2 z - 1 |}^{q - 2} . \end{matrix}

Note that these derivatives are continuous in

z = 1 / 2

for

q \geq 2

, using that

0^{0} = 1

. The final expressions for (10) are summarized in Table 1.

Table 1. Partial derivatives (10) for EGFs discussed in Example 1.

EGF $ϕ (z)$	$d_{k}$	$h_{kk}$
$(z - z^{a}) / (a - 1)$	$\frac{2^{a - 1} a}{m (2^{a - 1} - 1)} ({(1 - f_{k})}^{a - 1} - f_{k}^{a - 1})$	$- \frac{2^{a - 1} a (a - 1)}{m (2^{a - 1} - 1)} (f_{k}^{a - 2} + {(1 - f_{k})}^{a - 2})$
$- z ln z$	$\frac{1}{m ln 2} (ln (1 - f_{k}) - ln f_{k})$	$\frac{- 1}{m ln 2} (1 / f_{k} + 1 / (1 - f_{k}))$
$z (1 - z), \frac{1}{2} z (1 - z^{2})$	$\frac{4}{m} (1 - 2 f_{k})$	$- \frac{8}{m}$
${1 - \| 2 z - 1 \|}^{q}$	$- \frac{2 q}{m} (2 f_{k} - 1) {\| 2 f_{k} - 1 \|}^{q - 2}$	$- \frac{4 q (q - 1)}{m} {\| 2 f_{k} - 1 \|}^{q - 2}$

Using (10), the second-order Taylor expansion equals

{CPE}_{ϕ} (\hat{f}) \approx {CPE}_{ϕ} (f) + \sum_{k = 0}^{m - 1} d_{k} ({\hat{f}}_{k} - f_{k}) + \frac{1}{2} \sum_{k = 0}^{m - 1} h_{k k} {({\hat{f}}_{k} - f_{k})}^{2} .

(11)

According to (8), the linear term in (11) is asymptotically normally distributed (“Delta method”), provided that

D

does not vanish (see Remark 3 below). Then, we conclude from (8) that

\sqrt{n} ({CPE}_{ϕ} (\hat{f}) - {CPE}_{ϕ} (f)) \overset{d}{\to} N (0, D Σ D^{⊤}), D Σ D^{⊤} = \sum_{i, j = 0}^{m - 1} d_{i} d_{j} σ_{i j} .

(12)

The approximate bias correction relies on the quadratic term in (11), because

E [{\hat{f}}_{k} - f_{k}] = 0

. Using (8), we conclude that

n E [{CPE}_{ϕ} (\hat{f}) - {CPE}_{ϕ} (f)] \approx \frac{1}{2} \sum_{i = 0}^{m - 1} h_{i i} σ_{i i} .

(13)

Let us summarize the results implied by (12) and (13) in the following theorem.

Theorem 1.

Under the mixing assumptions stated before (8), assuming the EGF ϕ to be twice differentiable, and recalling the

d_{k}, h_{k k}

from (10) where

D

must not vanish, it holds that

\begin{matrix} \sqrt{n} ({CPE}_{ϕ} (\hat{f}) - {CPE}_{ϕ} (f)) \overset{d}{\to} N (0, σ_{ϕ}^{2}), σ_{ϕ}^{2} = σ_{ϕ, i i d}^{2} (1 + 2 \sum_{h = 1}^{\infty} ϑ_{ϕ} (h)) \\ w i t h σ_{ϕ, i i d}^{2} = \sum_{i, j = 0}^{m - 1} d_{i} d_{j} (f_{min {i, j}} - f_{i} f_{j}) a n d ϑ_{ϕ} (h) = \frac{\sum_{i, j = 0}^{m - 1} d_{i} d_{j} (f_{i j} (h) - f_{i} f_{j})}{\sum_{i, j = 0}^{m - 1} d_{i} d_{j} (f_{min {i, j}} - f_{i} f_{j})} . \end{matrix}

In addition, the bias-corrected mean of

{CPE}_{ϕ} (\hat{f})

is

\begin{matrix} E [{CPE}_{ϕ} (\hat{f})] \approx {CPE}_{ϕ} (f) + \frac{1}{2 n} (\sum_{i = 0}^{m - 1} h_{i i} f_{i} (1 - f_{i})) (1 + 2 \sum_{h = 1}^{\infty} κ_{ϕ} (h)), \\ w h e r e κ_{ϕ} (h) = \frac{\sum_{i = 0}^{m - 1} h_{i i} (f_{i i} (h) - f_{i}^{2})}{\sum_{i = 0}^{m - 1} h_{i i} f_{i} (1 - f_{i})} . \end{matrix}

Note that the second-order derivatives are negative due to the concavity of

ϕ

, so

{CPE}_{ϕ} (\hat{f})

exhibits a negative bias.

ϑ_{ϕ} (h)

expresses the effect of serial dependence on

σ_{ϕ}^{2}

. For i. i. d. ordinal data,

ϑ_{ϕ} (h) = 0

, so Theorem 1 simplifies considerably in this case, namely to

σ_{ϕ}^{2} = σ_{ϕ, i i d}^{2}

. The bias of

{CPE}_{ϕ} (\hat{f})

is affected by serial dependence via

κ_{ϕ} (h)

, which is a

κ

-type measure reflecting the extent of (dis)agreement between lagged observations, recall (9). More precisely,

κ_{ϕ} (h)

can be interpreted as weighted type of

κ_{ord} (h)

, where the weights

h_{i i}

depend on the particular choice of

ϕ

. It thus provides a novel family of measures of signed serial dependence, the asymptotics of which are analyzed in Section 4 below.

Example 2.

In the special case

ϕ (z) = z (1 - z)

(as well as for

ϕ (z) = \frac{1}{2} z (1 - z^{2})

), which corresponds to the IOV in (4),

h_{i i} = - \frac{8}{m}

is constant (see Table 1). Thus,

κ_{ϕ} (h) = κ_{ord} (h)

in this case (see (9)). Furthermore, the first factor of the bias in Theorem 1 becomes

\frac{1}{2 n} \sum_{i = 0}^{m - 1} h_{i i} f_{i} (1 - f_{i}) = - \frac{1}{n} \cdot \frac{4}{m} \sum_{i = 0}^{m - 1} f_{i} (1 - f_{i}) = - \frac{1}{n} IOV .

Hence, the bias is determined by both the serial dependence and the dispersion of the process.

As another simple example, consider the choice

ϕ (z) = - z ln z

for the

CPE

in (5). Then, using

h_{i i} = \frac{- 1}{m ln 2} (\frac{1}{f_{i}} + \frac{1}{1 - f_{i}}) = \frac{- 1}{m ln 2} \frac{1}{f_{i} (1 - f_{i})}

from Table 1, we get

\frac{1}{2 n} \sum_{i = 0}^{m - 1} h_{i i} f_{i} (1 - f_{i}) = \frac{1}{2 n} \frac{- 1}{m ln 2} \sum_{i = 0}^{m - 1} 1 = \frac{- 1}{(2 ln 2) n} .

Thus, we have a unique i. i. d.-bias, independent of the actual marginal CDF f. Under serial dependence, we get

κ_{ϕ} (h) = \frac{\sum_{i = 0}^{m - 1} h_{i i} (f_{i i} (h) - f_{i}^{2})}{\sum_{i = 0}^{m - 1} h_{i i} f_{i} (1 - f_{i})} = \frac{1}{m} \sum_{i = 0}^{m - 1} \frac{f_{i i} (h) - f_{i}^{2}}{f_{i} (1 - f_{i})} = : κ_{o r d}^{*} (h) .

(14)

So, besides the pair

(IOV, κ_{ord} (h))

,

(CPE, κ_{ord}^{*} (h))

also belongs to the

({CPE}_{ϕ}, κ_{ϕ} (h))

-family.

A few examples are plotted in Figure 3, where the DGP

X_{t} = s_{I_{t}}

assumes the rank counts

I_{t}

to have

Bin (4, p)

-marginals. In the top panel, the DGP is i. i. d., whereas

(I_{t})

follows a so-called first-order binomial autoregressive (BAR

(1)

) model with dependence parameter

ρ = 0.4

([2] Section 3.3) in the lower panel, i.e., the DGP exhibits a medium level of positive dependence. While the resulting dependence structure is investigated in more detail in Section 4, Figure 3 considers the asymptotic standard error (SE)

σ_{ϕ}

and bias

n (E [{CPE}_{ϕ} (\hat{f})] - {CPE}_{ϕ} (f))

according to Theorem 1. Obviously, an increase of serial dependence causes an increase of both SE and bias. While most measures have a rather stable SE for varying p (except for extremely small p, where we are close to a one-point distribution), the EGF

ϕ_{a}

with

a = 1 / 2

varies a lot. In particular, the bias takes rather extreme values with decreasing p for this case, which can be explained by the strongly negative exponents at

f_{k}, 1 - f_{k}

in

h_{k k}

from Table 1. Thus, choices

a < 1

seem not advisable for practice. The boundary case

a = 1

has a constant bias for an i. i. d. DGP. For

ϕ_{q}

with

q = 4

, we note an oscillating behavior of both bias and SE. The lowest bias and SE are achieved for the cases

ϕ_{a}

with

a > 1

.

The newly obtained measure

κ_{ord}^{*} (h)

from (14) constitutes a counterpart to the nominal measures

κ^{*} (h), κ^{☆} (h)

in Weiß [20]. It is worth pointing out that the latter measures were derived from the nominal entropy and extropy, respectively, while the

CPE

in (5) can be interpreted as a combination of cumulative entropy and extropy. It has to be noted that

κ_{ord}^{*} (h)

also shares a disadvantage with

κ^{*} (h)

: if only one of the

f_{i}

equals 0 or 1, we suffer from a division by 0 in (14). For

κ_{ord} (h)

according to (9), by contrast, a division by 0 only happens in the (deterministic) case of a one-point distribution. As a workaround when computing

κ_{ord}^{*} (h)

, it is recommended to replace the affected summands in (14) by 0.

Remark 3.

If

f = f_{t w o}

, then all

d_{k} = 0

in (10). Therefore, the linear term in (11) vanishes. In fact, for any two-point distribution on

s_{0}

and

s_{m}

, we necessarily have

f_{0} = \dots = f_{m - 1}

and

{\hat{f}}_{0} = \dots = {\hat{f}}_{m - 1}

. Therefore,

{CPE}_{ϕ} (\hat{f})

reduces to

{CPE}_{ϕ} (\hat{f}) = \frac{1}{2 ϕ (1 / 2)} (ϕ ({\hat{f}}_{0}) + ϕ (1 - {\hat{f}}_{0}))

, and the quadratic term in (11) to

\frac{m}{2} h_{00} {({\hat{f}}_{0} - f_{0})}^{2}

. Hence, in this special case,

n ({CPE}_{ϕ} (\hat{f}) - {CPE}_{ϕ} (f)) \overset{a}{\sim} \frac{m}{2} h_{00} σ_{00} \cdot χ_{1}^{2} .

For example, plugging-in

h_{00} = - \frac{8}{m}

for

ϕ (z) = z (1 - z)

corresponding to the IOV in (4), we obtain the result in Remark 7.1.2 in Weiß [14].

4. Asymptotic Distribution of Sample $κ_{ϕ} (h)$

The bias equation in Theorem 1 gives rise to a novel family of serial dependence measures for ordinal time series, namely

κ_{ϕ} (h) = \frac{\sum_{i = 0}^{m - 1} (ϕ^{″} (f_{i}) + ϕ^{″} (1 - f_{i})) (f_{i i} (h) - f_{i}^{2})}{\sum_{i = 0}^{m - 1} (ϕ^{″} (f_{i}) + ϕ^{″} (1 - f_{i})) f_{i} (1 - f_{i})}

(15)

for a given EGF

ϕ

. Some examples are plotted in the left part of Figure 4, where the DGP

X_{t} = s_{I_{t}}

assumes that the rank counts

I_{t}

follow the BAR

(1)

model with marginal distribution

Bin (4, 0.3)

and dependence parameter

ρ

; recall the discussion of Figure 3. So, the rank counts

{(I_{t})}_{Z}

have the first-order ACF

ρ

, whereas the plotted

κ_{ϕ} (1)

have absolute value

\leq | ρ |

.

In practice, the sample version of this measure,

{\hat{κ}}_{ϕ} (h)

, is particularly important, where the cumulative (bivariate) probabilities are replaced by the corresponding relative frequencies. For uncovering significant deviations from the null hypothesis of serial independence (then,

κ_{ϕ} (h) = 0

), the asymptotic distribution of

{\hat{κ}}_{ϕ} (h)

under the null of i. i. d. time series data is required. For its derivation, we proceed in an analogous way as in Section 3. As the starting point, we have to extend the asymptotics of the marginal sample CDF in (8) by also considering the bivariate sample CDF

{\hat{f}}_{i i} (h)

. Let

f_{(h)} = {(f_{0}, \dots, f_{m - 1}, f_{00} (h), \dots, f_{m - 1, m - 1} (h))}^{⊤}

, and denote its sample version by

{\hat{f}}_{(h)}

. Then, under the same mixing conditions as in Section 3, Weiß [14] established the asymptotic normality

\sqrt{n} ({\hat{f}}_{(h)} - f_{(h)}) \overset{d}{\to} N (0, Σ^{(h)}) with Σ^{(h)} = {(σ_{i, j}^{(h)})}_{i, j = 0, \dots, 2 m - 1},

(16)

and he derived general expressions for the asymptotic (co)variances

σ_{i, j}^{(h)}

. Analogous results for the case of missing data can be found in Supplement S.3 of Weiß [15]. For the present task, the asymptotics of the i. i. d.-case are sufficient. Then,

f_{i i} (h) = f_{i}^{2}

for all

i = 0, \dots, m - 1

, and the covariances in (16) are given by

\begin{matrix} σ_{i, j}^{(h)} = σ_{i, j} = & f_{min {i, j}} - f_{i} f_{j} (see (8)), \\ σ_{i, m + j}^{(h)} = & 2 f_{j} (f_{min {i, j}} - f_{i} f_{j}), \\ σ_{m + i, m + j}^{(h)} = & (f_{min {i, j}} + 3 f_{i} f_{j}) (f_{min {i, j}} - f_{i} f_{j}) \end{matrix} for i, j \in {0, \dots, m - 1},

(17)

see Weiß [14] (as well as p. 8 in Supplement S.3 of Weiß [15] if being concerned with missing data).

Next, we derive the asymptotics of

{\hat{κ}}_{ϕ} (h)

under the i. i. d.-null, and this requires to derive the second-order Taylor expansion for

{\hat{κ}}_{ϕ} (h)

; details are postponed to Appendix A.1. As higher-order derivatives of

ϕ

, which are initially used while deriving a bias correction of

{\hat{κ}}_{ϕ} (h)

, cancel out, the final result still relies on derivatives of

ϕ

up to order 2 only.

Theorem 2.

Under the null hypothesis of i. i. d. data, i.e., if

κ_{ϕ} (h) = 0

for all lags

h \in N

, and assuming the EGF ϕ to be twice differentiable, it holds that

\begin{matrix} \sqrt{n} ({\hat{κ}}_{ϕ} (h) - κ_{ϕ} (h)) \overset{d}{\to} N (0, σ_{κ}^{2}) w i t h σ_{κ}^{2} = \sum_{j, k = 0}^{m - 1} u_{j} u_{k} {(f_{min {j, k}} - f_{j} f_{k})}^{2}, \\ w h e r e u_{j} = \frac{ϕ^{″} (f_{j}) + ϕ^{″} (1 - f_{j})}{\sum_{i = 0}^{m - 1} (ϕ^{″} (f_{i}) + ϕ^{″} (1 - f_{i})) f_{i} (1 - f_{i})} . \end{matrix}

In addition, the bias-corrected mean of

{\hat{κ}}_{ϕ} (h)

is

E [{\hat{κ}}_{ϕ} (h)] \approx - \frac{1}{n}

.

Note that we have a unique bias correction for any of the measures

{\hat{κ}}_{ϕ} (h)

, independent of the choice of the EGF

ϕ

. Thus, for application in practice, it remains to compute the asymptotic variance

σ_{κ}^{2}

in Theorem 2. This only requires knowledge about

ϕ^{″} (z)

to evaluate the

u_{j}

, but not about higher-order derivatives of the EGF

ϕ

(see Example 3 for illustration). Further examples are plotted in the right part of Figure 4, where

σ_{κ}

was computed for the marginal distribution

Bin (4, p)

. The oscillating behavior of

σ_{κ}

for

ϕ_{q} (z)

with

q = 4

is quite striking. It is also interesting to note that among the plotted

κ

-measures, the novel

{\hat{κ}}_{ord}^{*} (h)

(case

a = 1

) has the lowest variance.

Example 3.

While we have a unique bias correction for

{\hat{κ}}_{ϕ} (h)

, the asymptotic variance

σ_{κ}^{2}

according to Theorem 2 differs for different choices of the EGF ϕ, as the involved

u_{j}

depend on

ϕ^{″} (z)

. For example,

for $ϕ_{a} (z) = \frac{z - z^{a}}{a - 1}$ , we have $ϕ_{a}^{″} (z) = - a z^{a - 2}$ according to Example 1,
while for $ϕ_{q} (z) = 1 - {| 2 z - 1 |}^{q}$ with $q \geq 2$ , we have $ϕ_{q}^{″} (z) = - 4 q (q - 1) {| 2 z - 1 |}^{q - 2}$ .

Important special cases are:

For $ϕ (z) = z (1 - z)$ , i.e., for the basic $κ_{o r d} (h)$ according to (9), we have

$ϕ^{″} (z) = - 2, so u_{j} = {(\sum_{i = 0}^{m - 1} f_{i} (1 - f_{i}))}^{- 1} .$

Thus, we get

$σ_{κ}^{2} = \frac{\sum_{j, k = 0}^{m - 1} {(f_{min {j, k}} - f_{j} f_{k})}^{2}}{{(\sum_{i = 0}^{m - 1} f_{i} (1 - f_{i}))}^{2}};$

see Theorem 7.2.1 in Weiß [14].
For $ϕ (z) = - z ln z$ , i.e., for the novel $κ_{o r d}^{*} (h)$ according to (14), we have

$ϕ^{″} (z) = - \frac{1}{z}, so u_{j} = \frac{1}{m} \frac{1}{f_{j} (1 - f_{j})} .$

Thus, we get

$σ_{κ}^{2} = \frac{1}{m^{2}} \sum_{j, k = 0}^{m - 1} \frac{{(f_{min {j, k}} - f_{j} f_{k})}^{2}}{f_{j} (1 - f_{j}) f_{k} (1 - f_{k})} = \frac{1}{m} + \frac{2}{m^{2}} \sum_{j < k} \frac{f_{j} (1 - f_{k})}{f_{k} (1 - f_{j})} .$

For any other choice of

ϕ_{a} (z)

and

ϕ_{q} (z)

,

σ_{κ}^{2}

is easily computed using the aforementioned expressions for

ϕ_{a}^{″} (z)

and

ϕ_{q}^{″} (z)

from Example 1. Since the obtained closed-form formulae do not much simplify, further details are omitted here.

5. Simulation Results

In what follows, we discuss results from a simulation study, being tabulated in Appendix B, where

10^{4}

replications per scenario were used throughout. In view of our previous findings, achieved when discussing the asymptotics plotted in Figure 3 and Figure 4, we do not further consider the choice

a = 1 / 2 < 1

for the EGF

ϕ_{a}

, but we use

a = 5 / 2 > 2

instead. The latter choice, in turn, was not presented before as the resulting asymptotic curves could hardly be distinguished from the case

a = 2

. So, altogether,

a = 1, 3 / 2, 2, 5 / 2

as well as

q = 4

were taken into account for simulations. The ordinal data were generated via binomial rank counts,

X_{t} = s_{I_{t}}

with

I_{t} \sim Bin (m, p)

, which either exhibit serial dependence caused by a BAR

(1)

DGP with dependence parameter

ρ

, or which are i. i. d. (corresponding to

ρ = 0

).

Let us start with the ordinal dispersion measures

{\hat{CPE}}_{ϕ}

. Table A1 presents the simulated means (upper part) and SEs (lower part) for the case of i. i. d. ordinal data, and these are compared to the asymptotic values obtained from Theorem 1. Generally, we have an excellent agreement between simulated and asymptotic values, i.e., the derived asymptotic approximation to the true distribution of

{\hat{CPE}}_{ϕ}

works well in practice. This is even more remarkable as also the low sample size

n = 50

is included. There is a somewhat larger deviation only for the mean in the case

a = 1

, i.e., for the CPE (5), if

n \leq 100

and

p = 0.1

. In this specific case, the simulated sample distribution might be quite close to a one-point distribution, which might cause computational issues for (5); recall that the convention

0 ln 0 = 0

has to be used. However, as the approximation quality is good throughout, a pivotal argument for the choice of

ϕ

in practice might be that the least SEs are observed if using

ϕ_{a}

with

a = 3 / 2, 2, 5 / 2

.

Table A2 considers exactly the same marginals as before, but now in the presence of additional serial dependence (

ρ = 0.4

). Comparing Table A1 and Table A2, it becomes clear that the additional dependence causes increased bias and SE. However, and this is the crucial point for practice, the asymptotic approximations from Theorem 1 work as well as they do in the i. i. d.-case. If there are visible deviations at all, then these happen again mainly for

p = 0.1

and low sample sizes. Overall, we have an excellent approximation quality throughout, but with least SEs again for

a = 3 / 2, 2, 5 / 2

.

While the

{CPE}_{ϕ}

-type dispersion measures and their asymptotics perform well, essentially for any choice of

ϕ

, the gap becomes wider when looking at the serial dependence measure

{\hat{κ}}_{ϕ} (h)

. The asymptotics in Theorem 2 refer to the i. i. d.-case, which is used as the null hypothesis (

H_{0}

) if testing for significant serial dependence. Thus, let us start by investigating again the mean and SE of

{\hat{κ}}_{ϕ} (1)

for i. i. d. data (same DGPs as in Table A1); see the results in Table A3. For the asymptotic mean, we have the unique approximation

- 1 / n

, and this works well except for

p = 0.1

and low sample sizes. In particular, for

ϕ_{a}

with

a = 1

, i.e., for

{\hat{κ}}_{ord}^{*} (1)

, we get notable deviations. The reason is given by the computation of

{\hat{κ}}_{ord}^{*} (1)

, where division by zero might happen (in the simulations, this was circumvented by replacing a zero by

10^{- 6}

). In a weakened form, we observe a similar issue for the case

a = 3 / 2

; generally, we are faced with the zero problem if

a < 2

because of the second-order derivatives of

ϕ_{a} (z)

. Analogous deviations are observed for the SEs. Here, generally, the simulated SEs tend to be larger than the asymptotic ones. As a consequence, if using the asymptotic SEs for calculating the critical values when testing

H_{0}

, we expect a tendency to oversizing.

If looking at the simulated rejection rates in Table A4, first at the size values (

ρ = 0

) being printed in italic font, we indeed see sizes being slightly larger than the nominal 5%-level, as long as

n \leq 100

. For larger sample sizes, by contrast, the

{\hat{κ}}_{ϕ} (1)

-test satisfies the given size requirement quite precisely. Here, we computed the critical values by plugging-in the respective sample CDF

\hat{f}

into the formula for

σ_{κ}^{2}

. In Table A4, power values for

ρ \neq 0

are also shown. Note that for a BAR

(1)

process,

ρ

can take any positive value in

(0; 1)

, but the attainable range of negative values is bounded from below by

max \{- \frac{1 - p}{p}, - \frac{p}{1 - p}\}

[2]. Thus, only

ρ = - 0.4, - 0.2

are considered in Table A4. Generally, the

{\hat{κ}}_{ϕ} (1)

-tests are powerful regarding both positive and negative dependencies, but the actual power performance differs a lot for different

ϕ

. The worst power is observed for

ϕ_{q}

with

q = 4

, followed by

ϕ_{a}

with

a = 1

. Regarding the remaining

ϕ_{a}

-cases,

a = 3 / 2

does slightly worse than

a = 2, 5 / 2

, and we often have a slight advantage for

a = 5 / 2

, especially for negative dependencies.

To sum up, while the whole

({CPE}_{ϕ}, κ_{ϕ} (h))

-family is theoretically appealing, and while there are hardly any noteworthy problems with the sample dispersion measures

{\hat{CPE}}_{ϕ}

, the performance of

{\hat{κ}}_{ϕ} (h)

clearly depends on the choice of

ϕ

. It is recommended to use the family of a-entropies (6), and there,

a \geq 2

is preferable. The measure

{\hat{κ}}_{ord}^{*} (1)

from (14), for example, although theoretically appealing as a combination of entropy and extropy, has a relatively bad finite-sample performance. The probably most well-known pair,

(IOV, κ_{ord} (h))

, has a good performance, although there appears to be a slight advantage if choosing a somewhat larger than 2, such as

a = 5 / 2

(recall that

a = 3

leads back to the case

a = 2

).

6. Data Application

Ordinal time series are observed in quite diverse application areas. Economic examples include time series on credit ratings [14] or on fear states at the stock market [20], and a climatological example is the level of cloudiness of the sky [23]. Health-related examples are time series of electroencephalographic (EEG) sleep states [24], the pain severity of migraine attacks, and the level of perceived stress [15]. In this section, we are concerned with an environmental application, namely the level of air quality. Different definitions of air quality have been reported in the literature. In Chen Chiu [25], the air quality index (AQI) is used for expressing the daily air quality, with levels ranging from

s_{0} = “ good ”

to

s_{5} = “ hazardous ”

. Another case study is reported by Liu et al. [26], who use the classification of the Chinese government, which again distinguishes

m + 1 = 6

levels, but now ranging from

s_{0} = “ excellent ”

to

s_{5} = “ severely polluted ”

. The latter article investigates daily time series from thirty Chinese cities for the period December 2013–July 2019, i.e., the sample size equals

n = 2 068

. In what follows, we use one of the time series studied by Liu et al. [26], namely the daily air quality levels

x_{1}, \dots, x_{n}

in Shanghai, for illustrating our novel results about cumulative paired

ϕ

-entropies.

The considered time series is plotted in the top panel of Figure 5. The bottom left graph shows the sample version of the probability mass function (PMF)

P (X = s_{i})

, i.e., the relative frequencies of the categories. It exhibits a unimodal shape with mode (=median) in

s_{1} = “ good ”

. The serial dependence structure is analyzed in the bottom right graph, where

{\hat{κ}}_{ϕ} (h)

with a-entropy having

a = 5 / 2

is used, as this is the recommended choice according to Section 5. All of the plotted

{\hat{κ}}_{ϕ} (h)

-values are significantly different from 0 at the 5 %-level, where the critical values (plotted as dashed lines in Figure 5) are computed as

{- 0.029, 0.028}

according to Theorem 2 (and by plugging-in the sample CDF). We recognize a medium level of dependence (

{\hat{κ}}_{ϕ} (1) \approx 0.378

), which quickly decreases with increasing time lag h, similar to an AR-type process.

Let us now have a closer look at the dispersion properties of the Shanghai series. The different choices of the

{CPE}_{ϕ}

-measure considered so far provide slightly different results regarding the extent of dispersion. In accordance with Figure 2, the largest point estimates are computed for

a = 1 / 2

(0.514) and

q = 4

(0.465), followed by

a = 1

with 0.394, whereas

a = 3 / 2

(0.349),

a = 2

(0.332), and

a = 5 / 2

(0.328) lead to similar but clearly lower values. Comparing the sample PMF in Figure 5 to the extreme scenarios of a one-point and an extreme two-point distribution, the PMF appears to be more close to a one-point than to a two-point distribution, i.e., the lower ones among the above dispersion values seem to be more realistic here.

The novel asymptotics of Theorem 1 allow to judge the estimation uncertainty for the above point estimates. To keep the discussion simple, let us focus again on the case

a = 5 / 2

. In the first step, we compute the i. i. d.-approximations of bias and SE,

\frac{1}{2 n} (\sum_{i = 0}^{m - 1} h_{i i} f_{i} (1 - f_{i}))

and

n^{- 1 / 2} σ_{ϕ, i i d}

, respectively. By plugging-in the sample CDF, these are obtained as

- 1.54 \cdot 10^{- 4}

and

7.83 \cdot 10^{- 3}

, respectively. However, these i. i. d.-results are misleading in the present example as the data exhibit significant serial dependence (recall Figure 5). As we know from Theorem 1, the bias has to be increased by the factor

1 + 2 \sum_{h = 1}^{\infty} κ_{ϕ} (h)

, and the SE by

(1 + 2 \sum_{h = 1}^{\infty} ϑ_{ϕ} (h))^{1 / 2}

. These factors shall now be computed based on the so-called “ZOBPAR model” proposed by Liu et al. [26], which constitutes a rank-count approach,

X_{t} = s_{I_{t}}

. In view of the AR

(1)

-like dependence structure and the high frequency for

s_{1}

, namely 0.560, the conditional distribution of

I_{t} | I_{t - 1}, \dots

is assumed to be a truncated Poisson distribution, truncated to the range

{0, \dots, 5}

, with time-varying Poisson parameter

λ_{t} = 0.3489 + 0.7594 I_{t - 1}

and additional one-inflation parameter 0.3463 ([26] Table III). For this model fit, we compute

1 + 2 \sum_{h = 1}^{\infty} κ_{ϕ} (h) \approx 1.907, \sqrt{1 + 2 \sum_{h = 1}^{\infty} ϑ_{ϕ} (h)} \approx 1.343 .

Thus, an approximate 95 %-confidence interval (CI) for

{CPE}_{ϕ}

is given by

\approx (0.308; 0.349)

. CIs for the remaining

{CPE}_{ϕ}

-measures are computed analogously, leading to

(0.487; 0.544)

for

a = 1 / 2

, to

(0.372; 0.418)

for

a = 1

, to

(0.328; 0.371)

for

a = 3 / 2

, to

(0.312; 0.353)

for

a = 2

, and to

(0.439; 0.492)

for

q = 4

.

7. Conclusions

In this article, we considered the family of cumulative paired

ϕ

-entropies. For each appropriate choice of the EGF

ϕ

, an ordinal dispersion measure

{CPE}_{ϕ} (f)

is implied. For example, particular choices from the families of a-entropies or q-entropies, respectively, lead to well-known dispersion measures from the literature. The first main contribution of this work was the derivation of the asymptotic distribution of the sample version

{\hat{CPE}}_{ϕ}

for ordinal time series data. These asymptotics can be used to approximate the true distribution of

{\hat{CPE}}_{ϕ}

, e.g., to compute approximate confidence intervals. Simulations showed that these asymptotics lead to an excellent finite-sample performance. Based on the obtained expression for the asymptotic bias of

{\hat{CPE}}_{ϕ}

, we recognized that each EGF

ϕ

also implies a

κ

-type serial dependence measures, i.e., altogether, we have a matched pair

({CPE}_{ϕ}, κ_{ϕ} (h))

for each EGF

ϕ

. Again, we analyzed the asymptotics of the sample version

{\hat{κ}}_{ϕ} (h)

, and these can be utilized for testing for significant serial dependence in the given ordinal time series. This time, however, the finite-sample performance clearly depends on the choice of

ϕ

. Choosing

{\hat{κ}}_{ϕ} (h)

based on an a-entropy with

a \geq 2

, such as

a = 5 / 2

, ensures good finite-sample properties. The practical application of the measures

({CPE}_{ϕ}, κ_{ϕ} (h))

and their asymptotics was demonstrated with an ordinal time series on the daily level of air quality in Shanghai.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data discussed in Section 6 were taken from Liu et al. [26] and are available at https://doi.org/10.1111/jtsa.12625 (accessed on 22 December 2021).

Acknowledgments

The author thanks the two referees for their useful comments on an earlier draft of this article.

Conflicts of Interest

The author declares no conflict of interest.

Appendix A. Proofs

Appendix A.1. Proof of Theorem 2

First, we derive the second-order Taylor expansion for

{\hat{κ}}_{ϕ} (h)

. For this purpose, define

g : {(0; 1)}^{2 m} \to R

by mapping

y = (y_{0}, \dots, y_{2 m - 1})

onto

g (y) = \frac{\sum_{i = 0}^{m - 1} (ϕ^{″} (y_{i}) + ϕ^{″} (1 - y_{i})) (y_{m + i} - y_{i}^{2})}{\sum_{i = 0}^{m - 1} (ϕ^{″} (y_{i}) + ϕ^{″} (1 - y_{i})) y_{i} (1 - y_{i})} .

So, according to (15),

g (f_{(h)}) = κ_{ϕ} (h)

and

g ({\hat{f}}_{(h)}) = {\hat{κ}}_{ϕ} (h)

. In fact, because of the i. i. d.-assumption, we even have

g (f_{(h)}) = 0

.

For the intended Taylor expansion of

{\hat{κ}}_{ϕ} (h)

, we have to compute all partial derivatives of

g (y)

up to order 2, and to evaluate these derivatives in

y = f_{(h)}

, using that

f_{i i} (h) = f_{i}^{2}

for all

i = 0, \dots, m - 1

under the i. i. d.-assumption, and thus

g (f_{(h)}) = 0

. We use the notations

u_{j} : = \frac{ϕ^{″} (f_{j}) + ϕ^{″} (1 - f_{j})}{\sum_{i = 0}^{m - 1} (ϕ^{″} (f_{i}) + ϕ^{″} (1 - f_{i})) f_{i} (1 - f_{i})}, v_{j} : = \frac{ϕ^{‴} (f_{j}) - ϕ^{‴} (1 - f_{j})}{\sum_{i = 0}^{m - 1} (ϕ^{″} (f_{i}) + ϕ^{″} (1 - f_{i})) f_{i} (1 - f_{i})} .

(A1)

Note that if

ϕ

is symmetric around

1 / 2

, i.e., if

ϕ (z) = ϕ (1 - z)

, then

u_{j}, v_{j}

simplify because of

ϕ^{″} (f_{j}) + ϕ^{″} (1 - f_{j}) = 2 ϕ^{″} (f_{j})

and

ϕ^{‴} (f_{j}) - ϕ^{‴} (1 - f_{j}) = 2 ϕ^{‴} (f_{j})

. Using (A1), we get for

0 \leq j, k \leq m - 1

that

\begin{array}{l} d_{j}^{(κ)} : = \frac{\partial}{\partial y_{j}} g (f_{(h)}) = - 2 f_{j} u_{j}, \\ d_{m + j}^{(κ)} : = \frac{\partial}{\partial y_{m + j}} g (f_{(h)}) = u_{j}, \\ h_{j, k}^{(κ)} : = \frac{\partial^{2}}{\partial y_{j} \partial y_{k}} g (f_{(h)}) = 2 f_{j} f_{k} ((1 - f_{k}) u_{j} v_{k} + (1 - f_{j}) u_{k} v_{j}) \\ + 2 (f_{j} + f_{k} - 4 f_{j} f_{k}) u_{j} u_{k} - 2 δ_{j, k} (u_{j} + 2 f_{j} v_{j}), \\ h_{j, m + k}^{(κ)} : = \frac{\partial^{2}}{\partial y_{j} \partial y_{m + k}} g (f_{(h)}) = (δ_{j, k} - f_{j} (1 - f_{j}) u_{k}) v_{j} - (1 - 2 f_{j}) u_{j} u_{k}, \\ h_{m + j, m + k}^{(κ)} : = \frac{\partial^{2}}{\partial y_{m + j} \partial y_{m + k}} g (f_{(h)}) = 0 . \end{array}

(A2)

The proof of Equation (A2) is provided by Appendix A.2. We denote the gradient (Jacobian) of

g (f_{(h)})

by

D_{κ} = (d_{0}^{(κ)}, \dots, d_{2 m - 1}^{(κ)})

, and the Hessian equals

H_{κ} = d i a g (h_{00}^{(κ)}, \dots, h_{2 m - 1, 2 m - 1}^{(κ)})

.

Example A1.

Let us pick up Example 1 and continue with the derivatives of the EGFs, as required for evaluating (A1) and (A2).

For

ϕ_{a} (z) = \frac{z - z^{a}}{a - 1}

, we have

ϕ_{a}^{″} (z) = - a z^{a - 2}

, and thus

ϕ_{a}^{‴} (z) = - a (a - 2) z^{a - 3}

. Specific examples are

$a \to 1$ corresponding to the EGF $ϕ (z) = - z ln z$ , then

$ϕ^{″} (z) = - \frac{1}{z}, ϕ^{″} (z) = \frac{1}{z^{2}}, so u_{j} = \frac{1}{m} \frac{1}{f_{j} (1 - f_{j})}, v_{j} = \frac{1}{m} \frac{2 f_{j} - 1}{f_{j}^{2} {(1 - f_{j})}^{2}};$
$a = 2$ corresponding to the EGF $ϕ (z) = z (1 - z)$ , then

$ϕ^{″} (z) = - 2, ϕ^{″'} (z) = 0, so u_{j} = {(\sum_{i = 0}^{m - 1} f_{i} (1 - f_{i}))}^{- 1}, v_{j} = 0;$
$a = 3$ corresponding to the EGF $ϕ (z) = \frac{1}{2} z (1 - z^{2})$ , then

$ϕ^{″} (z) = - 3 z, ϕ^{‴} (z) = - 3, so u_{j}, v_{j} are as before .$

For the equivalence of the $a = 2, 3$ , recall (7).

For

ϕ_{q} (z) = 1 - {(1 - 2 min {z, 1 - z})}^{q} = 1 - {| 2 z - 1 |}^{q}

with

q > 3

, we get

\begin{matrix} ϕ_{q}^{″} (z) = & {- 4 q (q - 1) | 2 z - 1 |}^{q - 2}, ϕ_{q}^{‴} (z) = - 8 q (q - 1) (q - 2) {| 2 z - 1 |}^{q - 3} sgn (2 z - 1) . \end{matrix}

Recall that (A1) simplifies because of the symmetry of

ϕ_{q} (z)

around

1 / 2

.

Using (A2), the second-order Taylor expansion of

{\hat{κ}}_{ϕ} (h)

equals

{\hat{κ}}_{ϕ} (h) \approx κ_{ϕ} (h) + D_{κ} ({\hat{f}}_{(h)} - f_{(h)}) + \frac{1}{2} {({\hat{f}}_{(h)} - f_{(h)})}^{⊤} H_{κ} ({\hat{f}}_{(h)} - f_{(h)}) .

(A3)

In analogy to Section 3, we now conclude that

\begin{matrix} \sqrt{n} ({\hat{κ}}_{ϕ} (h) - κ_{ϕ} (h)) \overset{d}{\to} N (0, σ_{κ}^{2}) with σ_{κ}^{2} = D_{κ} Σ^{(h)} D_{κ}^{⊤}, \\ where σ_{κ}^{2} = \sum_{j, k = 0}^{m - 1} u_{j} u_{k} {(f_{min {j, k}} - f_{j} f_{k})}^{2} . \end{matrix}

(A4)

The proof of Equation (A4) is provided by Appendix A.3.

σ_{κ}^{2}

is computed explicitly by plugging-in (A1).

An approximate bias correction is obtained from (A3) (see the analogous arguments in Section 3). Using that

h_{m + j, m + k}^{(κ)} = 0

, we get that

\begin{matrix} E [{\hat{κ}}_{ϕ} (h)] \approx 0 + \frac{1}{2 n} \sum_{j, k = 0}^{m - 1} (h_{j, k}^{(κ)} σ_{j, k} + 2 h_{j, m + k}^{(κ)} σ_{j, m + k}^{(h)}) = - \frac{1}{n}, \end{matrix}

(A5)

see the proof in Appendix A.4. Using (A4) and (A5), the derivation of Theorem 2 is complete.

Appendix A.2. Proof of Equation (A2)

For

g (y) = \frac{\sum_{i = 0}^{m - 1} (ϕ^{″} (y_{i}) + ϕ^{″} (1 - y_{i})) (y_{m + i} - y_{i}^{2})}{\sum_{i = 0}^{m - 1} (ϕ^{″} (y_{i}) + ϕ^{″} (1 - y_{i})) y_{i} (1 - y_{i})},

we compute all partial derivatives up to order 2. For doing this, we have to require that the EGF

ϕ

is even four times differentiable. Then,

\frac{\partial}{\partial y_{m + j}} g (y) = \frac{ϕ^{″} (y_{j}) + ϕ^{″} (1 - y_{j})}{\sum_{i = 0}^{m - 1} (ϕ^{″} (y_{i}) + ϕ^{″} (1 - y_{i})) y_{i} (1 - y_{i})} and \frac{\partial^{2}}{\partial y_{m + j} \partial y_{m + k}} g (y) = 0 for all 0 \leq j, k \leq m - 1 .

Next, using the product and chain rule,

\frac{\partial}{\partial y_{j}} g (y) = \frac{\frac{\partial}{\partial y_{j}} (ϕ^{″} (y_{j}) + ϕ^{″} (1 - y_{j})) (y_{m + j} - y_{j}^{2})}{\sum_{i = 0}^{m - 1} (ϕ^{″} (y_{i}) + ϕ^{″} (1 - y_{i})) y_{i} (1 - y_{i})} - \frac{g (y) \frac{\partial}{\partial y_{j}} (ϕ^{″} (y_{j}) + ϕ^{″} (1 - y_{j})) y_{j} (1 - y_{j})}{\sum_{i = 0}^{m - 1} (ϕ^{″} (y_{i}) + ϕ^{″} (1 - y_{i})) y_{i} (1 - y_{i})} .

Here,

\begin{matrix} \frac{\partial}{\partial y_{j}} (ϕ^{″} (y_{j}) + ϕ^{″} (1 - y_{j})) (y_{m + j} - y_{j}^{2}) \\ = (ϕ^{‴} (y_{j}) - ϕ^{‴} (1 - y_{j})) (y_{m + j} - y_{j}^{2}) - 2 y_{j} (ϕ^{″} (y_{j}) + ϕ^{″} (1 - y_{j})), \end{matrix}

and

\begin{matrix} \frac{\partial}{\partial y_{j}} (ϕ^{″} (y_{j}) + ϕ^{″} (1 - y_{j})) y_{j} (1 - y_{j}) \\ = (ϕ^{‴} (y_{j}) - ϕ^{‴} (1 - y_{j})) y_{j} (1 - y_{j}) + (1 - 2 y_{j}) (ϕ^{″} (y_{j}) + ϕ^{″} (1 - y_{j})) . \end{matrix}

Thus,

\begin{matrix} \frac{\partial}{\partial y_{j}} g (y) = & {(\sum_{i = 0}^{m - 1} (ϕ^{″} (y_{i}) + ϕ^{″} (1 - y_{i})) y_{i} (1 - y_{i}))}^{- 1} \\ \cdot ((ϕ^{‴} (y_{j}) - ϕ^{‴} (1 - y_{j})) (y_{m + j} - y_{j}^{2} - g (y) y_{j} (1 - y_{j})) \\ - (ϕ^{″} (y_{j}) + ϕ^{″} (1 - y_{j})) (2 y_{j} + g (y) (1 - 2 y_{j}))) . \end{matrix}

Consequently,

\begin{matrix} \frac{\partial^{2}}{\partial y_{j} \partial y_{m + k}} g (y) = \frac{δ_{j, k} (ϕ^{‴} (y_{j}) - ϕ^{‴} (1 - y_{j}))}{\sum_{i = 0}^{m - 1} (ϕ^{″} (y_{i}) + ϕ^{″} (1 - y_{i})) y_{i} (1 - y_{i})} - \frac{\frac{\partial}{\partial y_{m + k}} g (y)}{\sum_{i = 0}^{m - 1} (ϕ^{″} (y_{i}) + ϕ^{″} (1 - y_{i})) y_{i} (1 - y_{i})} \\ \cdot ((ϕ^{‴} (y_{j}) - ϕ^{‴} (1 - y_{j})) y_{j} (1 - y_{j}) + (1 - 2 y_{j}) (ϕ^{″} (y_{j}) + ϕ^{″} (1 - y_{j}))) . \end{matrix}

Finally,

\begin{matrix} \frac{\partial^{2}}{\partial y_{j} \partial y_{k}} g (y) = - \frac{\frac{\partial}{\partial y_{j}} g (y) \frac{\partial}{\partial y_{k}} (ϕ^{″} (y_{k}) + ϕ^{″} (1 - y_{k})) y_{k} (1 - y_{k})}{\sum_{i = 0}^{m - 1} (ϕ^{″} (y_{i}) + ϕ^{″} (1 - y_{i})) y_{i} (1 - y_{i})} \\ + {(\sum_{i = 0}^{m - 1} (ϕ^{″} (y_{i}) + ϕ^{″} (1 - y_{i})) y_{i} (1 - y_{i}))}^{- 1} \\ \begin{matrix} \cdot \frac{\partial}{\partial y_{k}} ((ϕ^{‴} (y_{j}) - ϕ^{‴} (1 - y_{j})) (y_{m + j} - y_{j}^{2} - g (y) y_{j} (1 - y_{j})) \\ - (ϕ^{''} (y_{j}) + ϕ^{''} (1 - y_{j})) (2 y_{j} + g (y) (1 - 2 y_{j}))) \end{matrix}\} = : A . \end{matrix}

Here, we have for

k \neq j

that

\begin{matrix} A = - ( & y_{j} (1 - y_{j}) (ϕ^{‴} (y_{j}) - ϕ^{‴} (1 - y_{j})) + (1 - 2 y_{j}) (ϕ^{″} (y_{j}) + ϕ^{″} (1 - y_{j}))) \frac{\partial}{\partial y_{k}} g (y) . \end{matrix}

For

k = j

, we have that

\begin{matrix} A = & \frac{\partial}{\partial y_{j}} (ϕ^{‴} (y_{j}) - ϕ^{‴} (1 - y_{j})) (y_{m + j} - y_{j}^{2} - g (y) y_{j} (1 - y_{j})) \\ - \frac{\partial}{\partial y_{j}} (ϕ^{″} (y_{j}) + ϕ^{″} (1 - y_{j})) (2 y_{j} + g (y) (1 - 2 y_{j})) \\ = & (ϕ^{(4)} (y_{j}) + ϕ^{(4)} (1 - y_{j})) (y_{m + j} - y_{j}^{2} - g (y) y_{j} (1 - y_{j})) \\ - (ϕ^{‴} (y_{j}) - ϕ^{‴} (1 - y_{j})) y_{j} (1 - y_{j}) \frac{\partial}{\partial y_{j}} g (y) \\ - 2 (ϕ^{‴} (y_{j}) - ϕ^{‴} (1 - y_{j})) (2 y_{j} + g (y) (1 - 2 y_{j})) \\ - (ϕ^{″} (y_{j}) + ϕ^{″} (1 - y_{j})) (2 - 2 g (y) + (1 - 2 y_{j}) \frac{\partial}{\partial y_{j}} g (y)) . \end{matrix}

So, altogether,

\begin{matrix} A = & - δ_{j k} (2 (ϕ^{″} (y_{j}) + ϕ^{″} (1 - y_{j})) (1 - g (y)) + 2 (ϕ^{‴} (y_{j}) - ϕ^{‴} (1 - y_{j})) (2 y_{j} + g (y) (1 - 2 y_{j})) \\ - (ϕ^{(4)} (y_{j}) + ϕ^{(4)} (1 - y_{j})) (y_{m + j} - y_{j}^{2} - g (y) y_{j} (1 - y_{j}))) \\ - (y_{j} (1 - y_{j}) (ϕ^{‴} (y_{j}) - ϕ^{‴} (1 - y_{j})) + (1 - 2 y_{j}) (ϕ^{″} (y_{j}) + ϕ^{″} (1 - y_{j}))) \frac{\partial}{\partial y_{k}} g (y) . \end{matrix}

Hence,

\begin{matrix} \frac{\partial^{2}}{\partial y_{j} \partial y_{k}} g (y) = - \frac{(y_{k} (1 - y_{k}) (ϕ^{‴} (y_{k}) - ϕ^{‴} (1 - y_{k})) + (1 - 2 y_{k}) (ϕ^{″} (y_{k}) + ϕ^{″} (1 - y_{k}))) \frac{\partial}{\partial y_{j}} g (y)}{\sum_{i = 0}^{m - 1} (ϕ^{″} (y_{i}) + ϕ^{″} (1 - y_{i})) y_{i} (1 - y_{i})} \\ - \frac{(y_{j} (1 - y_{j}) (ϕ^{‴} (y_{j}) - ϕ^{‴} (1 - y_{j})) + (1 - 2 y_{j}) (ϕ^{″} (y_{j}) + ϕ^{″} (1 - y_{j}))) \frac{\partial}{\partial y_{k}} g (y)}{\sum_{i = 0}^{m - 1} (ϕ^{″} (y_{i}) + ϕ^{″} (1 - y_{i})) y_{i} (1 - y_{i})} \\ - δ_{j k} {(\sum_{i = 0}^{m - 1} (ϕ^{″} (y_{i}) + ϕ^{″} (1 - y_{i})) y_{i} (1 - y_{i}))}^{- 1} \\ \cdot (2 (ϕ^{″} (y_{j}) + ϕ^{″} (1 - y_{j})) (1 - g (y)) + 2 (ϕ^{‴} (y_{j}) - ϕ^{‴} (1 - y_{j})) (2 y_{j} + g (y) (1 - 2 y_{j})) \\ - (ϕ^{(4)} (y_{j}) + ϕ^{(4)} (1 - y_{j})) (y_{m + j} - y_{j}^{2} - g (y) y_{j} (1 - y_{j}))) . \end{matrix}

For the required Taylor expansion, we have to evaluate all these derivatives in

y = f_{(h)}

, using that

f_{i i} (h) = f_{i}^{2}

for all

i = 0, \dots, m - 1

, and thus

g (f_{(h)}) = 0

. We use the notations introduced in (A1), i.e.,

u_{j} : = \frac{ϕ^{″} (f_{j}) + ϕ^{″} (1 - f_{j})}{\sum_{i = 0}^{m - 1} (ϕ^{″} (f_{i}) + ϕ^{″} (1 - f_{i})) f_{i} (1 - f_{i})}, v_{j} : = \frac{ϕ^{‴} (f_{j}) - ϕ^{‴} (1 - f_{j})}{\sum_{i = 0}^{m - 1} (ϕ^{″} (f_{i}) + ϕ^{″} (1 - f_{i})) f_{i} (1 - f_{i})} .

Then, we get for

0 \leq j \leq m - 1

that

\begin{matrix} d_{j}^{(κ)} : = & \frac{\partial}{\partial y_{j}} g (f_{(h)}) = \frac{0 - 2 f_{j} (ϕ^{″} (f_{j}) + ϕ^{″} (1 - f_{j}))}{\sum_{i = 0}^{m - 1} (ϕ^{″} (f_{i}) + ϕ^{″} (1 - f_{i})) f_{i} (1 - f_{i})} = - 2 f_{j} u_{j}, \\ d_{m + j}^{(κ)} : = & \frac{\partial}{\partial y_{m + j}} g (f_{(h)}) = u_{j} . \end{matrix}

For the second-order derivatives, we get for

0 \leq j, k \leq m - 1

that

\begin{matrix} h_{j, k}^{(κ)} : = & \frac{\partial^{2}}{\partial y_{j} \partial y_{k}} g (f_{(h)}) \\ = & - \frac{(f_{k} (1 - f_{k}) (ϕ^{‴} (f_{k}) - ϕ^{‴} (1 - f_{k})) + (1 - 2 f_{k}) (ϕ^{″} (f_{k}) + ϕ^{″} (1 - f_{k}))) d_{j}^{(κ)}}{\sum_{i = 0}^{m - 1} (ϕ^{″} (f_{i}) + ϕ^{″} (1 - f_{i})) f_{i} (1 - f_{i})} \\ - \frac{(f_{j} (1 - f_{j}) (ϕ^{‴} (f_{j}) - ϕ^{‴} (1 - f_{j})) + (1 - 2 f_{j}) (ϕ^{″} (f_{j}) + ϕ^{″} (1 - f_{j}))) d_{k}^{(κ)}}{\sum_{i = 0}^{m - 1} (ϕ^{″} (f_{i}) + ϕ^{″} (1 - f_{i})) f_{i} (1 - f_{i})} \\ - \frac{δ_{j k} 2 ((ϕ^{″} (f_{j}) + ϕ^{″} (1 - f_{j})) + 2 f_{j} (ϕ^{‴} (f_{j}) - ϕ^{‴} (1 - f_{j})))}{\sum_{i = 0}^{m - 1} (ϕ^{″} (f_{i}) + ϕ^{″} (1 - f_{i})) f_{i} (1 - f_{i})} \\ = & - f_{k} (1 - f_{k}) v_{k} d_{j}^{(κ)} - (1 - 2 f_{k}) u_{k} d_{j}^{(κ)} \\ - f_{j} (1 - f_{j}) v_{j} d_{k}^{(κ)} - (1 - 2 f_{j}) u_{j} d_{k}^{(κ)} - 2 δ_{j k} u_{j} - 4 δ_{j k} f_{j} v_{j} \\ = & - f_{k} (1 - f_{k}) (- 2 f_{j} u_{j}) v_{k} - (1 - 2 f_{k}) (- 2 f_{j} u_{j}) u_{k} \\ - f_{j} (1 - f_{j}) (- 2 f_{k} u_{k}) v_{j} - (1 - 2 f_{j}) (- 2 f_{k} u_{k}) u_{j} - 2 δ_{j k} (u_{j} + 2 f_{j} v_{j}) \\ = & 2 f_{j} f_{k} (1 - f_{k}) u_{j} v_{k} + 2 f_{j} f_{k} (1 - f_{j}) u_{k} v_{j} \\ + 2 f_{j} (1 - 2 f_{k}) u_{j} u_{k} + 2 f_{k} (1 - 2 f_{j}) u_{j} u_{k} - 2 δ_{j k} (u_{j} + 2 f_{j} v_{j}) \\ = & 2 f_{j} f_{k} ((1 - f_{k}) u_{j} v_{k} + (1 - f_{j}) u_{k} v_{j}) + 2 (f_{j} + f_{k} - 4 f_{j} f_{k}) u_{j} u_{k} - 2 δ_{j, k} (u_{j} + 2 f_{j} v_{j}) . \end{matrix}

Similarly,

\begin{matrix} h_{j, m + k}^{(κ)} : = & \frac{\partial^{2}}{\partial y_{j} \partial y_{m + k}} g (f_{(h)}) = \frac{δ_{j, k} (ϕ^{‴} (f_{j}) - ϕ^{‴} (1 - f_{j}))}{\sum_{i = 0}^{m - 1} (ϕ^{″} (f_{i}) + ϕ^{″} (1 - f_{i})) f_{i} (1 - f_{i})} \\ - \frac{d_{m + k}^{(κ)} (ϕ^{‴} (f_{j}) - ϕ^{‴} (1 - f_{j})) f_{j} (1 - f_{j})}{\sum_{i = 0}^{m - 1} (ϕ^{″} (f_{i}) + ϕ^{″} (1 - f_{i})) f_{i} (1 - f_{i})} - \frac{d_{m + k}^{(κ)} (1 - 2 f_{j}) (ϕ^{″} (f_{j}) + ϕ^{″} (1 - f_{j}))}{\sum_{i = 0}^{m - 1} (ϕ^{″} (f_{i}) + ϕ^{″} (1 - f_{i})) f_{i} (1 - f_{i})} \\ = & (δ_{j, k} - d_{m + k}^{(κ)} f_{j} (1 - f_{j})) v_{j} - d_{m + k}^{(κ)} (1 - 2 f_{j}) u_{j} \\ = & (δ_{j, k} - f_{j} (1 - f_{j}) u_{k}) v_{j} - (1 - 2 f_{j}) u_{j} u_{k} . \end{matrix}

Finally,

h_{m + j, m + k}^{(κ)} : = \frac{\partial^{2}}{\partial y_{m + j} \partial y_{m + k}} g (f_{(h)}) = 0 .

This completes the proof of (A2).

Appendix A.3. Proof of Equation (A4)

σ_{κ}^{2} = D_{κ} Σ^{(h)} D_{κ}^{⊤}

is computed as follows:

\begin{matrix} σ_{κ}^{2} = & \sum_{j, k = 0}^{m - 1} (d_{j}^{(κ)} d_{k}^{(κ)} σ_{j, k} + 2 d_{j}^{(κ)} d_{m + k}^{(κ)} σ_{j, m + k}^{(h)} + d_{m + j}^{(κ)} d_{m + k}^{(κ)} σ_{m + j, m + k}^{(h)}) \\ \overset{(A 2)}{=} & \sum_{j, k = 0}^{m - 1} (4 f_{j} f_{k} u_{j} u_{k} σ_{j, k} - 4 f_{j} u_{j} u_{k} σ_{j, m + k}^{(h)} + u_{j} u_{k} σ_{m + j, m + k}^{(h)}) \\ \overset{(17)}{=} & \sum_{j, k = 0}^{m - 1} (- 4 f_{j} f_{k} u_{j} u_{k} (f_{min {j, k}} - f_{j} f_{k}) + u_{j} u_{k} (f_{min {j, k}} + 3 f_{j} f_{k}) (f_{min {j, k}} - f_{j} f_{k})) \\ = & \sum_{j, k = 0}^{m - 1} u_{j} u_{k} {(f_{min {j, k}} - f_{j} f_{k})}^{2} . \end{matrix}

This completes the proof of (A4).

Appendix A.4. Proof of Equation (A5)

Using that

h_{m + j, m + k}^{(κ)} = 0

, we get that

\begin{matrix} E [{\hat{κ}}_{ϕ} (h)] \approx & 0 + \frac{1}{2 n} \sum_{j, k = 0}^{m - 1} (h_{j, k}^{(κ)} σ_{j, k} + 2 h_{j, m + k}^{(κ)} σ_{j, m + k}^{(h)}) \\ \overset{(17)}{=} & \frac{1}{2 n} \sum_{j, k = 0}^{m - 1} σ_{j, k} (h_{j, k}^{(κ)} + 4 f_{k} h_{j, m + k}^{(κ)}) \\ \overset{(A 2)}{=} & \frac{1}{2 n} \sum_{j, k = 0}^{m - 1} σ_{j, k} (2 f_{j} f_{k} ((1 - f_{k}) u_{j} v_{k} + (1 - f_{j}) u_{k} v_{j}) \\ + 2 (f_{j} + f_{k} - 4 f_{j} f_{k}) u_{j} u_{k} - 2 δ_{j, k} (u_{j} + 2 f_{j} v_{j}) \\ + 4 f_{k} (δ_{j, k} - f_{j} (1 - f_{j}) u_{k}) v_{j} - 4 f_{k} (1 - 2 f_{j}) u_{j} u_{k}) \\ = & \frac{1}{2 n} \sum_{j, k = 0}^{m - 1} σ_{j, k} (4 f_{j} f_{k} (1 - f_{j}) u_{k} v_{j} + 4 f_{k} (1 - 2 f_{j}) u_{j} u_{k} - 2 δ_{j, k} u_{j} - 4 δ_{j, k} f_{j} v_{j} \\ + 4 δ_{j, k} f_{j} v_{j} - 4 f_{k} f_{j} (1 - f_{j}) u_{k} v_{j} - 4 f_{k} (1 - 2 f_{j}) u_{j} u_{k}) \\ = & \frac{1}{2 n} \sum_{j, k = 0}^{m - 1} σ_{j, k} (0 - 2 δ_{j, k} u_{j}) = - \frac{1}{n} \sum_{j = 0}^{m - 1} u_{j} σ_{j, j} = - \frac{1}{n}, \end{matrix}

where the last equality follows from (A1) by using

σ_{j, j} = f_{j} (1 - f_{j})

according to (17). This completes the proof of (A5).

Appendix B. Tables

Table A1. Simulated vs. asymptotic mean and SE of

{\hat{CPE}}_{ϕ}

for specific cases of

ϕ_{a} (z) = (z - z^{a}) / (a - 1)

and

ϕ_{q} (z) = 1 - {| 2 z - 1 |}^{q}

, where i. i. d. rank counts

I_{t} \sim Bin (4, p)

and sample size n.

Table A1. Simulated vs. asymptotic mean and SE of

{\hat{CPE}}_{ϕ}

for specific cases of

ϕ_{a} (z) = (z - z^{a}) / (a - 1)

and

ϕ_{q} (z) = 1 - {| 2 z - 1 |}^{q}

, where i. i. d. rank counts

I_{t} \sim Bin (4, p)

and sample size n.

		Simulated Mean					Asymptotic Mean
$p$	$n$	$a = 1$	$a = 1.5$	$a = 2$	$a = 2.5$	$q = 4$	$a = 1$	$a = 1.5$	$a = 2$	$a = 2.5$	$q = 4$
0.1	50	0.305	0.282	0.273	0.272	0.336	0.301	0.282	0.273	0.272	0.337
	100	0.310	0.285	0.276	0.274	0.341	0.308	0.285	0.276	0.274	0.341
	250	0.313	0.287	0.278	0.276	0.343	0.312	0.287	0.278	0.276	0.343
	500	0.314	0.288	0.278	0.277	0.344	0.314	0.288	0.278	0.277	0.344
	1000	0.314	0.288	0.279	0.277	0.344	0.315	0.288	0.279	0.277	0.344
0.3	50	0.538	0.500	0.484	0.481	0.610	0.538	0.500	0.484	0.481	0.610
	100	0.546	0.506	0.490	0.486	0.618	0.545	0.505	0.489	0.486	0.617
	250	0.550	0.508	0.492	0.488	0.622	0.550	0.509	0.492	0.489	0.622
	500	0.551	0.509	0.493	0.489	0.624	0.551	0.510	0.493	0.490	0.624
	1000	0.552	0.510	0.494	0.490	0.625	0.552	0.510	0.494	0.490	0.625
0.5	50	0.601	0.554	0.536	0.532	0.679	0.602	0.555	0.536	0.532	0.679
	100	0.609	0.560	0.541	0.537	0.687	0.609	0.561	0.541	0.537	0.688
	250	0.613	0.564	0.544	0.540	0.693	0.614	0.564	0.545	0.540	0.693
	500	0.616	0.566	0.546	0.542	0.696	0.615	0.565	0.546	0.542	0.695
	1000	0.616	0.566	0.546	0.542	0.696	0.616	0.566	0.546	0.542	0.696
0.1	50	0.048	0.044	0.043	0.043	0.053	0.049	0.044	0.043	0.043	0.054
	100	0.034	0.031	0.030	0.030	0.037	0.034	0.031	0.030	0.030	0.038
	250	0.022	0.020	0.019	0.019	0.024	0.022	0.020	0.019	0.019	0.024
	500	0.015	0.014	0.014	0.014	0.017	0.015	0.014	0.014	0.014	0.017
	1000	0.011	0.010	0.010	0.009	0.012	0.011	0.010	0.010	0.010	0.012
0.3	50	0.053	0.051	0.051	0.050	0.059	0.053	0.051	0.051	0.051	0.058
	100	0.037	0.036	0.036	0.036	0.041	0.037	0.036	0.036	0.036	0.041
	250	0.024	0.023	0.023	0.023	0.026	0.024	0.023	0.023	0.023	0.026
	500	0.017	0.016	0.016	0.016	0.018	0.017	0.016	0.016	0.016	0.018
	1000	0.012	0.011	0.011	0.011	0.013	0.012	0.011	0.011	0.011	0.013
0.5	50	0.056	0.055	0.054	0.054	0.065	0.055	0.055	0.054	0.054	0.065
	100	0.039	0.038	0.038	0.038	0.046	0.039	0.039	0.038	0.038	0.046
	250	0.025	0.025	0.025	0.025	0.029	0.024	0.025	0.024	0.024	0.029
	500	0.017	0.017	0.017	0.017	0.021	0.017	0.017	0.017	0.017	0.021
	1000	0.012	0.012	0.012	0.012	0.015	0.012	0.012	0.012	0.012	0.015

Table A2. Simulated vs. asymptotic mean and SE of

{\hat{CPE}}_{ϕ}

for specific cases of

ϕ_{a} (z) = (z - z^{a}) / (a - 1)

and

ϕ_{q} (z) = 1 - {| 2 z - 1 |}^{q}

, where BAR

(1)

rank counts

I_{t} \sim Bin (4, p)

with

ρ = 0.4

and sample size n.

Table A2. Simulated vs. asymptotic mean and SE of

{\hat{CPE}}_{ϕ}

for specific cases of

ϕ_{a} (z) = (z - z^{a}) / (a - 1)

and

ϕ_{q} (z) = 1 - {| 2 z - 1 |}^{q}

, where BAR

(1)

rank counts

I_{t} \sim Bin (4, p)

with

ρ = 0.4

and sample size n.

		Simulated Mean					Asymptotic Mean
$p$	$n$	$a = 1$	$a = 1.5$	$a = 2$	$a = 2.5$	$q = 4$	$a = 1$	$a = 1.5$	$a = 2$	$a = 2.5$	$q = 4$
0.1	50	0.298	0.276	0.268	0.266	0.330	0.292	0.275	0.267	0.266	0.330
	100	0.306	0.282	0.273	0.271	0.337	0.304	0.282	0.273	0.271	0.337
	250	0.312	0.286	0.277	0.275	0.342	0.311	0.286	0.277	0.275	0.342
	500	0.313	0.287	0.278	0.276	0.343	0.313	0.287	0.278	0.276	0.343
	1000	0.314	0.288	0.279	0.277	0.344	0.314	0.288	0.278	0.277	0.344
0.3	50	0.529	0.491	0.476	0.473	0.598	0.528	0.491	0.476	0.472	0.597
	100	0.540	0.501	0.485	0.481	0.611	0.540	0.501	0.485	0.481	0.611
	250	0.547	0.506	0.490	0.487	0.620	0.548	0.507	0.490	0.487	0.620
	500	0.551	0.509	0.493	0.489	0.623	0.550	0.509	0.492	0.489	0.623
	1000	0.552	0.510	0.493	0.490	0.624	0.551	0.510	0.493	0.490	0.624
0.5	50	0.590	0.545	0.527	0.523	0.667	0.592	0.545	0.527	0.523	0.666
	100	0.604	0.555	0.537	0.532	0.682	0.604	0.556	0.537	0.533	0.682
	250	0.612	0.562	0.543	0.539	0.691	0.612	0.562	0.543	0.539	0.691
	500	0.614	0.564	0.545	0.540	0.694	0.614	0.564	0.545	0.541	0.694
	1000	0.615	0.565	0.546	0.542	0.695	0.615	0.566	0.546	0.542	0.695
0.1	50	0.066	0.062	0.061	0.061	0.070	0.070	0.065	0.064	0.064	0.073
	100	0.047	0.045	0.044	0.044	0.050	0.049	0.046	0.045	0.045	0.052
	250	0.031	0.029	0.028	0.028	0.032	0.031	0.029	0.029	0.029	0.033
	500	0.022	0.020	0.020	0.020	0.023	0.022	0.021	0.020	0.020	0.023
	1000	0.015	0.014	0.014	0.014	0.016	0.016	0.015	0.014	0.014	0.016
0.3	50	0.064	0.061	0.061	0.060	0.071	0.065	0.063	0.062	0.062	0.073
	100	0.046	0.044	0.043	0.043	0.050	0.046	0.044	0.044	0.044	0.051
	250	0.029	0.028	0.027	0.027	0.032	0.029	0.028	0.028	0.028	0.032
	500	0.021	0.020	0.019	0.019	0.023	0.021	0.020	0.020	0.020	0.023
	1000	0.014	0.014	0.014	0.014	0.016	0.015	0.014	0.014	0.014	0.016
0.5	50	0.065	0.063	0.062	0.062	0.074	0.064	0.064	0.064	0.064	0.076
	100	0.046	0.046	0.045	0.045	0.054	0.045	0.046	0.045	0.045	0.054
	250	0.029	0.029	0.029	0.029	0.034	0.029	0.029	0.029	0.028	0.034
	500	0.020	0.020	0.020	0.020	0.024	0.020	0.020	0.020	0.020	0.024
	1000	0.014	0.014	0.014	0.014	0.017	0.014	0.014	0.014	0.014	0.017

Table A3. Simulated vs. asymptotic mean and SE of

{\hat{κ}}_{ϕ} (1)

for specific cases of

ϕ_{a} (z) = (z - z^{a}) / (a - 1)

and

ϕ_{q} (z) = 1 - {| 2 z - 1 |}^{q}

, where i. i. d. rank counts

I_{t} \sim Bin (4, p)

and sample size n.

Table A3. Simulated vs. asymptotic mean and SE of

{\hat{κ}}_{ϕ} (1)

for specific cases of

ϕ_{a} (z) = (z - z^{a}) / (a - 1)

and

ϕ_{q} (z) = 1 - {| 2 z - 1 |}^{q}

, where i. i. d. rank counts

I_{t} \sim Bin (4, p)

and sample size n.

		Simulated Mean					Asymptotic Mean
$p$	$n$	$a = 1$	$a = 1.5$	$a = 2$	$a = 2.5$	$q = 4$	$a = 1$	$a = 1.5$	$a = 2$	$a = 2.5$	$q = 4$
0.1	50	−0.020	−0.019	−0.019	−0.019	−0.020	−0.020	−0.020	−0.020	−0.020	−0.020
	100	0.005	−0.008	−0.008	−0.008	−0.009	−0.010	−0.010	−0.010	−0.010	−0.010
	250	0.003	−0.004	−0.004	−0.004	−0.004	−0.004	−0.004	−0.004	−0.004	−0.004
	500	0.000	−0.001	−0.002	−0.002	−0.001	−0.002	−0.002	−0.002	−0.002	−0.002
	1000	−0.001	−0.001	−0.001	−0.001	−0.001	−0.001	−0.001	−0.001	−0.001	−0.001
0.3	50	−0.020	−0.020	−0.020	−0.020	−0.020	−0.020	−0.020	−0.020	−0.020	−0.020
	100	−0.010	−0.010	−0.010	−0.010	−0.011	−0.010	−0.010	−0.010	−0.010	−0.010
	250	−0.003	−0.004	−0.004	−0.004	−0.003	−0.004	−0.004	−0.004	−0.004	−0.004
	500	−0.002	−0.002	−0.002	−0.002	−0.002	−0.002	−0.002	−0.002	−0.002	−0.002
	1000	−0.001	−0.001	−0.001	−0.001	−0.001	−0.001	−0.001	−0.001	−0.001	−0.001
0.5	50	−0.020	−0.021	−0.021	−0.021	−0.020	−0.020	−0.020	−0.020	−0.020	−0.020
	100	−0.010	−0.010	−0.010	−0.010	−0.010	−0.010	−0.010	−0.010	−0.010	−0.010
	250	−0.004	−0.004	−0.004	−0.004	−0.004	−0.004	−0.004	−0.004	−0.004	−0.004
	500	−0.002	−0.002	−0.002	−0.002	−0.002	−0.002	−0.002	−0.002	−0.002	−0.002
	1000	−0.001	−0.001	−0.001	−0.001	−0.001	−0.001	−0.001	−0.001	−0.001	−0.001
0.1	50	0.124	0.124	0.130	0.132	0.134	0.074	0.105	0.120	0.122	0.103
	100	0.134	0.082	0.088	0.089	0.085	0.053	0.074	0.085	0.086	0.073
	250	0.086	0.050	0.055	0.056	0.050	0.033	0.047	0.054	0.055	0.046
	500	0.045	0.034	0.038	0.039	0.034	0.023	0.033	0.038	0.039	0.033
	1000	0.022	0.024	0.027	0.027	0.023	0.017	0.023	0.027	0.027	0.023
0.3	50	0.098	0.097	0.101	0.101	0.101	0.079	0.089	0.096	0.097	0.088
	100	0.066	0.067	0.070	0.071	0.067	0.056	0.063	0.068	0.068	0.062
	250	0.040	0.040	0.043	0.043	0.040	0.035	0.040	0.043	0.043	0.040
	500	0.026	0.028	0.030	0.031	0.028	0.025	0.028	0.030	0.031	0.028
	1000	0.018	0.020	0.021	0.021	0.020	0.018	0.020	0.021	0.022	0.020
0.5	50	0.084	0.092	0.098	0.099	0.089	0.080	0.086	0.092	0.094	0.080
	100	0.058	0.063	0.067	0.068	0.059	0.057	0.061	0.065	0.066	0.057
	250	0.036	0.039	0.042	0.043	0.037	0.036	0.039	0.041	0.042	0.036
	500	0.025	0.027	0.029	0.030	0.025	0.025	0.027	0.029	0.030	0.025
	1000	0.018	0.019	0.021	0.021	0.018	0.018	0.019	0.021	0.021	0.018

Table A4. Simulated rejection rate (

ρ = 0

: size;

ρ \neq 0

: power) of

{\hat{κ}}_{ϕ} (1)

-test at 5 %-level (

H_{0}

:

ρ = 0

, i.e., i. i. d.-case) for specific cases of

ϕ_{a} (z) = (z - z^{a}) / (a - 1)

and

ϕ_{q} (z) = 1 - {| 2 z - 1 |}^{q}

, where BAR

(1)

rank counts

I_{t} \sim Bin (4, 0.3)

and sample size n.

Table A4. Simulated rejection rate (

ρ = 0

: size;

ρ \neq 0

: power) of

{\hat{κ}}_{ϕ} (1)

-test at 5 %-level (

H_{0}

:

ρ = 0

, i.e., i. i. d.-case) for specific cases of

ϕ_{a} (z) = (z - z^{a}) / (a - 1)

and

ϕ_{q} (z) = 1 - {| 2 z - 1 |}^{q}

, where BAR

(1)

rank counts

I_{t} \sim Bin (4, 0.3)

and sample size n.

	Simulated Rejection Rate						Simulated Rejection Rate
$p$	$n$	$a = 1$	$a = 1.5$	$a = 2$	$a = 2.5$	$q = 4$	$a = 1$	$a = 1.5$	$a = 2$	$a = 2.5$	$q = 4$
							$ρ = 0.2$
						50	0.227	0.265	0.263	0.262	0.222
						100	0.367	0.447	0.451	0.450	0.370
						250	0.721	0.830	0.830	0.829	0.723
						500	0.950	0.982	0.982	0.981	0.947
						1000	1.000	1.000	1.000	1.000	0.999
	$ρ = - 0.4$						$ρ = 0.4$
50	0.484	0.565	0.585	0.588	0.312	50	0.663	0.749	0.756	0.757	0.625
100	0.807	0.878	0.891	0.893	0.672	100	0.917	0.959	0.962	0.962	0.897
250	0.989	0.999	1.000	1.000	0.987	250	1.000	1.000	1.000	1.000	0.999
500	0.997	1.000	1.000	1.000	1.000	500	1.000	1.000	1.000	1.000	1.000
1000	1.000	1.000	1.000	1.000	1.000	1000	1.000	1.000	1.000	1.000	1.000
	$ρ = - 0.2$						$ρ = 0.6$
50	0.151	0.188	0.195	0.196	0.094	50	0.956	0.979	0.981	0.981	0.923
100	0.298	0.365	0.374	0.375	0.231	100	1.000	1.000	1.000	1.000	0.998
250	0.643	0.744	0.751	0.751	0.584	250	1.000	1.000	1.000	1.000	1.000
500	0.925	0.965	0.966	0.965	0.900	500	1.000	1.000	1.000	1.000	1.000
1000	0.994	1.000	1.000	1.000	0.998	1000	1.000	1.000	1.000	1.000	1.000
	$ρ = 0$						$ρ = 0.8$
50	0.061	0.056	0.054	0.054	0.061	50	0.997	1.000	1.000	1.000	0.981
100	0.056	0.060	0.058	0.057	0.057	100	1.000	1.000	1.000	1.000	0.999
250	0.047	0.048	0.046	0.046	0.049	250	1.000	1.000	1.000	1.000	1.000
500	0.049	0.050	0.050	0.050	0.052	500	1.000	1.000	1.000	1.000	1.000
1000	0.053	0.047	0.048	0.049	0.050	1000	1.000	1.000	1.000	1.000	1.000

References

Agresti, A. Analysis of Ordinal Categorical Data, 2nd ed.; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2010. [Google Scholar]
Weiß, C.H. An Introduction to Discrete-Valued Time Series; John Wiley & Sons, Inc.: Chichester, UK, 2018. [Google Scholar]
Blair, J.; Lacy, M.G. Measures of variation for ordinal data as functions of the cumulative distribution. Percept. Mot. Ski. 1996, 82, 411–418. [Google Scholar] [CrossRef]
Blair, J.; Lacy, M.G. Statistics of ordinal variation. Sociol. Methods Res. 2000, 28, 251–280. [Google Scholar] [CrossRef]
Gadrich, T.; Bashkansky, E. ORDANOVA: Analysis of ordinal variation. J. Stat. Plan. Inference 2012, 142, 3174–3188. [Google Scholar] [CrossRef]
Gadrich, T.; Bashkansky, E.; Zitikis, R. Assessing variation: A unifying approach for all scales of measurement. Qual. Quant. 2015, 49, 1145–1167. [Google Scholar] [CrossRef]
Kiesl, H. Ordinale Streuungsmaße—Theoretische Fundierung und statistische Anwendungen; Josef Eul Verlag: Lohmar, Cologne, 2003. (In German) [Google Scholar]
Kvålseth, T.O. Coefficients of variation for nominal and ordinal categorical data. Percept. Mot. Ski. 1995, 80, 843–847. [Google Scholar] [CrossRef]
Kvålseth, T.O. Variation for categorical variables. In International Encyclopedia of Statistical Science; Lovric, M., Ed.; Springer: Berlin/Heidelberg, Germany, 2011; pp. 1642–1645. [Google Scholar]
Kvålseth, T.O. The lambda distribution and its applications to categorical summary measures. Adv. Appl. Stat. 2011, 24, 83–106. [Google Scholar]
Leik, R.K. A measure of ordinal consensus. Pac. Sociol. Rev. 1966, 9, 85–90. [Google Scholar] [CrossRef]
Vogel, F.; Dobbener, R. Ein Streuungsmaß für komparative Merkmale (In German). Jahrbücher für Natl. und Stat. 1982, 197, 145–158. [Google Scholar]
Weiß, C.H. On some measures of ordinal variation. J. Appl. Stat. 2019, 46, 2905–2926. [Google Scholar] [CrossRef]
Weiß, C.H. Distance-based analysis of ordinal data and ordinal time series. J. Am. Stat. Assoc. 2020, 115, 1189–1200. [Google Scholar] [CrossRef]
Weiß, C.H. Analyzing categorical time series in the presence of missing observations. Stat. Med. 2021, 40, 4675–4690. [Google Scholar] [CrossRef] [PubMed]
Klein, I. Rangordnungsstatistiken als Verteilungsmaßzahlen für ordinalskalierte Merkmale: I. Streuungsmessung; Diskussionspapier No. 27/1999; Friedrich-Alexander-Universität Erlangen-Nürnburg, Lehrstuhl für Statistik und Ökonometrie: Nuremberg, Germany, 1999; Volume 27. [Google Scholar]
Yager, R.R. Dissonance: A measure of variability for ordinal random variables. Int. J. Uncertainty, Fuzziness -Knowl.-Based Syst. 2001, 9, 39–53. [Google Scholar] [CrossRef]
Klein, I.; Doll, M. (Generalized) maximum cumulative direct, residual, and paired Φ entropy approach. Entropy 2020, 22, 91. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Klein, I.; Mangold, B.; Doll, M. Cumulative paired ϕ-entropy. Entropy 2016, 18, 248. [Google Scholar] [CrossRef] [Green Version]
Weiß, C.H. Measures of dispersion and serial dependence in categorical time series. Econometrics 17. [CrossRef] [Green Version]
Havrda, J.; Charvát, F. Quantification method of classification processes: Concept of structural a-entropy. Kybernetika 1967, 3, 30–35. [Google Scholar]
Rao, C.R. Convexity properties of entropy functions and analysis of diversity. IMS Lect. Notes—Monogr. Ser. 1984, 5, 68–77. [Google Scholar]
Weiß, C.H. Regime-switching discrete ARMA models for categorical time series. Entropy 2020, 22, 458. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Stoffer, D.S.; Tyler, D.E.; Wendt, D.A. The spectral envelope and its applications. Stat. Sci. 2000, 15, 224–253. [Google Scholar] [CrossRef]
Chen, C.W.S.; Chiu, L.M. Ordinal time series forecasting of the air quality index. Entropy 2021, 23, 1167. [Google Scholar] [CrossRef] [PubMed]
Liu, M.; Zhu, F.; Zhu, K. Modeling normalcy-dominant ordinal time series: An application to air quality level. J. Time Series Anal. 2021; forthcoming. [Google Scholar]

Figure 1. Plot of EGFs

ϕ (z)

against z. (Left):

ϕ_{q} (z) = 1 - {| 2 z - 1 |}^{q}

; (right):

ϕ_{a} (z) = (z - z^{a}) / (a - 1)

.

Figure 1. Plot of EGFs

ϕ (z)

against z. (Left):

ϕ_{q} (z) = 1 - {| 2 z - 1 |}^{q}

; (right):

ϕ_{a} (z) = (z - z^{a}) / (a - 1)

.

Figure 2. Plot of

{CPE}_{ϕ}

against p for specific cases of

ϕ_{a} (z) = (z - z^{a}) / (a - 1)

and

ϕ_{q} (z) = 1 - {| 2 z - 1 |}^{q}

. (Left): binomial distribution

Bin (4, p)

; (Right): two-point distribution with

f_{0} = p

.

Figure 2. Plot of

{CPE}_{ϕ}

against p for specific cases of

ϕ_{a} (z) = (z - z^{a}) / (a - 1)

and

ϕ_{q} (z) = 1 - {| 2 z - 1 |}^{q}

. (Left): binomial distribution

Bin (4, p)

; (Right): two-point distribution with

f_{0} = p

.

Figure 3. Plots for specific cases of

ϕ_{a} (z) = (z - z^{a}) / (a - 1)

and

ϕ_{q} (z) = 1 - {| 2 z - 1 |}^{q}

:

σ_{ϕ}

(left) and

n (E [C P E_{ϕ} (\hat{f})] - C P E_{ϕ} (f))

(right) against p of marginal

Bin (4, p)

, where i. i. d. DGP in (a,b), and BAR

(1)

DGP with dependence parameter

ρ = 0.4

in (c,d).

Figure 3. Plots for specific cases of

ϕ_{a} (z) = (z - z^{a}) / (a - 1)

and

ϕ_{q} (z) = 1 - {| 2 z - 1 |}^{q}

:

σ_{ϕ}

(left) and

n (E [C P E_{ϕ} (\hat{f})] - C P E_{ϕ} (f))

(right) against p of marginal

Bin (4, p)

, where i. i. d. DGP in (a,b), and BAR

(1)

DGP with dependence parameter

ρ = 0.4

in (c,d).

Figure 4. Plots for specific cases of

ϕ_{a} (z) = (z - z^{a}) / (a - 1)

and

ϕ_{q} (z) = 1 - {| 2 z - 1 |}^{q}

:

κ_{ϕ} (1)

against BAR

(1)

’s dependence parameter

ρ

with marginal

Bin (4, 0.3)

(left);

σ_{κ}

against p of marginal

Bin (4, 0.3)

(right).

Figure 4. Plots for specific cases of

ϕ_{a} (z) = (z - z^{a}) / (a - 1)

and

ϕ_{q} (z) = 1 - {| 2 z - 1 |}^{q}

:

κ_{ϕ} (1)

against BAR

(1)

’s dependence parameter

ρ

with marginal

Bin (4, 0.3)

(left);

σ_{κ}

against p of marginal

Bin (4, 0.3)

(right).

Figure 5. Daily air quality level in Shanghai: plot of time series

(x_{t})

in top panel; plot of sample PMF (left) and

{\hat{κ}}_{ϕ} (h)

(right) in bottom panel;

{\hat{κ}}_{ϕ} (h)

uses a-entropy with

a = 5 / 2

.

Figure 5. Daily air quality level in Shanghai: plot of time series

(x_{t})

in top panel; plot of sample PMF (left) and

{\hat{κ}}_{ϕ} (h)

(right) in bottom panel;

{\hat{κ}}_{ϕ} (h)

uses a-entropy with

a = 5 / 2

.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Weiß, C.H. Measuring Dispersion and Serial Dependence in Ordinal Time Series Based on the Cumulative Paired ϕ-Entropy. Entropy 2022, 24, 42. https://doi.org/10.3390/e24010042

AMA Style

Weiß CH. Measuring Dispersion and Serial Dependence in Ordinal Time Series Based on the Cumulative Paired ϕ-Entropy. Entropy. 2022; 24(1):42. https://doi.org/10.3390/e24010042

Chicago/Turabian Style

Weiß, Christian H. 2022. "Measuring Dispersion and Serial Dependence in Ordinal Time Series Based on the Cumulative Paired ϕ-Entropy" Entropy 24, no. 1: 42. https://doi.org/10.3390/e24010042

APA Style

Weiß, C. H. (2022). Measuring Dispersion and Serial Dependence in Ordinal Time Series Based on the Cumulative Paired ϕ-Entropy. Entropy, 24(1), 42. https://doi.org/10.3390/e24010042

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Measuring Dispersion and Serial Dependence in Ordinal Time Series Based on the Cumulative Paired ϕ-Entropy

Abstract

1. Introduction

2. The Family of Cumulative Paired $ϕ$ -Entropies

3. Asymptotic Distribution of Sample ${CPE}_{ϕ}$

4. Asymptotic Distribution of Sample $κ_{ϕ} (h)$

5. Simulation Results

6. Data Application

7. Conclusions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Proofs

Appendix A.1. Proof of Theorem 2

Appendix A.2. Proof of Equation (A2)

Appendix A.3. Proof of Equation (A4)

Appendix A.4. Proof of Equation (A5)

Appendix B. Tables

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Measuring Dispersion and Serial Dependence in Ordinal Time Series Based on the Cumulative Paired ϕ-Entropy

Abstract

1. Introduction

2. The Family of Cumulative Paired ϕ -Entropies

3. Asymptotic Distribution of Sample CPE ϕ

4. Asymptotic Distribution of Sample κ ϕ ( h )

5. Simulation Results

6. Data Application

7. Conclusions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Proofs

Appendix A.1. Proof of Theorem 2

Appendix A.2. Proof of Equation (A2)

Appendix A.3. Proof of Equation (A4)

Appendix A.4. Proof of Equation (A5)

Appendix B. Tables

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

2. The Family of Cumulative Paired $ϕ$ -Entropies

3. Asymptotic Distribution of Sample ${CPE}_{ϕ}$

4. Asymptotic Distribution of Sample $κ_{ϕ} (h)$