In this section, we conduct a Monte Carlo simulation to compare the performance of our proposed statistics (denoted by K-S) with the two statistics introduced by Davidson and Duclos [10] (denoted by DD) and Barrett and Donald [5] (denoted by BD), respectively. Let X₁, …, Xₙ and Y₁, …, Yₘ be the observations drawn from the independent populations X and Y, respectively. In Section 4.1, we give a brief introduction to these two statistics.
4.2. Simulations
Next, we compare the power of the DD test and the BD test with that of our test using the R language. To this end, we used the bootstrap method in Section 3.4 to obtain the p-values of our tests. The method of obtaining the p-values of the BD test is similar to ours. In addition, to approximate the supremum of the statistics, we applied the function gridSearch() in the NMOF package. The approximated p-values of the DD test were borrowed from Bai et al. [26] and Bai et al. [27].
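To illustrate this step, the following sketch approximates the supremum of the absolute difference between two empirical distribution functions with NMOF::gridSearch(); the two samples and the objective function are illustrative stand-ins rather than the exact statistic defined in Section 3.

```r
## Sketch: approximating a K-S-type supremum with NMOF::gridSearch().
## gridSearch() minimises, so we minimise the negative of the objective
## and flip the sign afterwards. The samples are hypothetical.
library(NMOF)

set.seed(1)
x <- rnorm(200)               # illustrative sample from population X
y <- rnorm(200, mean = 0.3)   # illustrative sample from population Y

Fx <- ecdf(x)
Gy <- ecdf(y)

neg_gap <- function(z) -abs(Fx(z) - Gy(z))   # objective at grid point z

res <- gridSearch(neg_gap,
                  lower = min(x, y),
                  upper = max(x, y),
                  n     = 1000)              # 1000 grid points

sup_stat <- -res$minfun                      # approximated supremum
sup_stat
```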
Based on the approximated p-values, we performed R Monte Carlo replications to obtain the rejection rates. In each replication, we assumed that X and Y are two normally distributed populations, and the following four cases were considered; a sketch of the rejection-rate computation is given after the list of cases.
Case 1: X and Y are assigned identical normal parameters, so they have the same distribution. The rejection rates ought to be close to the nominal significance level α.
Case 2: the parameters are chosen so that it is easy to verify that both the FSD and the SSD relationships hold.
Case 3: the parameters are chosen so that the SSD relationship holds, but we cannot say that the FSD relationship holds.
Case 4: the parameters are chosen so that both the FSD and the SSD hypotheses should be rejected. The rejection rates in this case should converge to 1 as the sample size increases.
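For illustration only, the sketch below estimates the rejection rate for a single case at one nominal level. The normal parameters, the number of replications, and the use of the one-sided ks.test() as a stand-in for the bootstrap K-S statistic of Section 3.4 are all assumptions made for this sketch, not the settings of the paper.

```r
## Sketch: empirical rejection rate for one case at one nominal level.
## ks.test(..., alternative = "less") is only a stand-in for the bootstrap
## K-S statistic; all parameters below are illustrative.
set.seed(2023)

R_rep <- 1000   # number of Monte Carlo replications (assumed)
n     <- 200    # common sample size for both populations (assumed)
alpha <- 0.05   # nominal significance level (assumed)

reject <- replicate(R_rep, {
  x <- rnorm(n, mean = 0, sd = 1)   # Case 1: X and Y identically distributed
  y <- rnorm(n, mean = 0, sd = 1)
  p <- suppressWarnings(ks.test(x, y, alternative = "less"))$p.value
  p < alpha
})

mean(reject)   # empirical rejection rate; close to alpha in Case 1
```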
In each case, the number of experimental replications, the number of bootstrap replications, and the sample sizes were selected to balance the time and accuracy of the simulation results. To improve the speed and efficiency of the simulation, we used several functions in the parallel package to carry out parallel computing. The simulation results are listed in Tables 1–6.
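The replication loop sketched above can be distributed across cores with the parallel package; the wrapper one_replication() below is hypothetical and the settings remain illustrative.

```r
## Sketch: parallelising the Monte Carlo replications with the parallel package.
library(parallel)

one_replication <- function(i, n = 200, alpha = 0.05) {
  x <- rnorm(n)                        # illustrative Case 1 samples
  y <- rnorm(n)
  p <- suppressWarnings(ks.test(x, y, alternative = "less"))$p.value
  p < alpha                            # TRUE if the test rejects
}

cl <- makeCluster(detectCores() - 1)   # leave one core free
clusterSetRNGStream(cl, iseed = 2023)  # reproducible parallel random numbers
reject <- parSapply(cl, seq_len(1000), one_replication)
stopCluster(cl)

mean(reject)                           # empirical rejection rate
```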
For Case 1, we can easily see from Tables 1–3 that the rejection rates of our K-S test are more robust than those of the DD test at two of the three nominal levels. At the remaining level of 0.1, the rejection rates of our K-S test are a little higher than the nominal level when the sample size is smaller than 200, but they decrease to 0.1 as the sample size increases. In contrast, the fluctuations of the rejection rates of the DD test are still large when the sample size increases to 400. For the BD test, the fluctuations of its rejection rates are greater than those of our K-S test at every significance level.
For Case 2, we see from Tables 1–3 that the rejection rates of all three tests are approximately zero, as they are supposed to be. Moreover, at each sample size, the rejection rates of our K-S test and the BD test are closer to zero than that of the DD test, which means that our K-S test and the BD test outperform the DD test. Meanwhile, the performance of our proposed K-S test is not inferior to that of the BD test.
The rejection rates in Cases 3 and 4 are supposed to increase to 1 as the sample size increases, and this can easily be seen from Tables 1–3. As the sample size grows, all three tests demonstrate strong power, consistent with our expectations.
We then analyze the simulation results of the SSD experiments. From Tables 4–6, it is easily seen that the rejection rates of our K-S test in Case 1 are more robust than those of the DD test and the BD test at all three nominal levels. In Case 2, at two of the three nominal levels, the rejection rates of our K-S test and the BD test decrease to zero rapidly and are lower than those of the DD test at almost every sample size. At the remaining nominal level, even the rejection rates of the DD test and the BD test decline below those of our K-S test once the sample size reaches 200; as the sample size continues to grow, however, the rejection rates of the DD test rebound markedly, whereas those of our K-S test drop to zero steadily and converge faster than those of the BD test. Unlike the FSD experiment, the rejection rates in Case 3 eventually converge to zero. All three tests perform well in both Cases 3 and 4.
According to the above simulation results, we can conclude that the three tests perform differently in Cases 1 and 2. Our K-S test performs better than the DD test and the BD test when the two distributions are identical. In Case 2, when the FSD relationship is tested, our K-S test and the BD test perform better than the DD test; moreover, when the SSD relationship is tested in Case 2, the rejection rate of our K-S test converges to zero faster than that of the BD test. In Cases 3 and 4, all three tests perform well. Therefore, on the whole, our proposed K-S tests are better than the other two tests.