Homogeneity Test and Sample Size of Relative Risk Ratios for Complex Paired Data Under Dalla’s Model

Sun, Shuman; Li, Zhiming

doi:10.3390/axioms15040268

Open AccessArticle

Homogeneity Test and Sample Size of Relative Risk Ratios for Complex Paired Data Under Dalla’s Model

by

Shuman Sun

¹ and

Zhiming Li

^2,*

¹

College of Science, Nanjing University of Posts and Telecommunications, Nanjing 210023, China

²

College of Mathematics and System Science, Xinjiang University, Urumqi 830046, China

^*

Author to whom correspondence should be addressed.

Axioms 2026, 15(4), 268; https://doi.org/10.3390/axioms15040268

Submission received: 25 January 2026 / Revised: 27 March 2026 / Accepted: 2 April 2026 / Published: 7 April 2026

Download

Browse Figures

Versions Notes

Abstract

In clinical research, unilateral data and bilateral data are commonly collected when paired organs or body parts of people receive treatment. Existing models are often inadequate for the research of combined unilateral and bilateral data. Considering population heterogeneity, this paper proposes three statistical tests and sample size estimation methods for the relative risk ratio in stratified unilateral and bilateral data under Dallal’s model. We derive test statistics (i.e., likelihood ratio, Wald-type, and score statistics) and evaluate their performance in terms of type I error rates and powers. Then, sample size determination is performed using an iterative algorithm. Monte Carlo simulations demonstrate that the score test performs well across various parameter configurations. Moreover, the estimated powers for determining sample size based on the score test are closer to the actual empirical powers. Two real examples of otolaryngology and myopathy are provided to illustrate the effectiveness of the proposed methods.

Keywords:

Dallal’s model; relative risk ratio; homogeneity test; sample size

MSC:

60-11

1. Introduction

In the clinical trial, the bilateral data are usually collected when the patients receive treatment on paired organs or body parts. Meantime, the unilateral data is often encountered when only one of the patient’s paired organs is diseased or has received treatment. Many current studies tend to analyze the unilateral or bilateral data, respectively [1,2]. Descriptive statistics and regression analysis methods are relatively straightforward, and may be insufficient for analyzing bilateral data [3]. This limitation arises from the inherent internal correlations within the paired organs or parts. Ignoring these correlations could lead to biased results.

Up to now, the analysis of interclass correlation in bilateral data has been addressed by various probability models. For instance, Rosner [4] introduced an interclass correlation model specifically for this purpose. The model supposes that the probability of a response on the other side is proportional to the prevalence rate of the corresponding group when one organ gives a response. However, Rosner’s model might lead to a poor fit if the characteristic is almost certain to occur bilaterally with widely varying group-specific prevalence. In view of this, Dallal [5] proposed an alternative model, which assumed that the probability that another organ responds when one organ responds was independent of the probability that the organ responds. Later, Donner [6] proposed another model, in which the interclass correlation coefficient was common in each of the two groups. In the statistical inference of bilateral data, it is very important to select an appropriate method to explain the interclass correlation within a subject. By ignoring this, the interclass correlation of real data will undermine the ability to identify the true therapeutic effect accurately. There have been research results on the statistical inference of bilateral data, such as asymptotic tests [7,8,9] and confidence intervals [10,11,12].

Although studies of bilateral data can well reflect the therapeutic effect of paired organ treatment. However, it is inevitable to include unilateral data when collecting actual data. At this point, the research methods that are only applicable to bilateral data become ineffective. Just as in the following two real examples of otolaryngology and myopathy. The first data example comes from a double-blinded randomized clinical trial conducted in the context of acute otitis media with effusion [13]. It can be used to compare the effect of two antibiotics (Cefaclor and Amoxicillin) in the treatment of otitis media with effusion (OME). Another example comes from an observational study on myopia patients [14]. There are 60 subjects diagnosed with myopia who will receive Orthokeratology (Ortho-k). The research on unilateral and bilateral data can better preserve the effective information of the original data. Their research results can also be easily extended to the application research of either unilateral data or bilateral data. The two aforementioned examples belong to the stratified unilateral and bilateral correlated data structure. They categorize the unilateral and bilateral correlated data based on different attributes, such as gender, age, treatment method, etc. Recently, some studies have been discussing whether patients need to be stratified and grouped for treatment under this data structure. Based on Donner’s model, Wang et al. [15] derived three homogeneity tests to detect if the risk ratio retains consistency across strata. The result showed that the score test provided a robust type I error rate and satisfactory power performance. Moreover, the complete test procedure and interval estimation of the odds ratio are discussed by Hua et al. [16,17]. Under Dallal’s model, Liang et al. [14] proposed four confidence interval methods of common risk difference and indicated that the profile likelihood confidence interval outperformed other methods. Sun et al. [18,19] researched the homogeneity test and the common test of risk difference. The result showed that the score method consistently outperforms other methods. Moreover, Sun and Li [20] discussed the common test of the risk ratio. However, the results in reference [21] indicated that a homogeneity test is usually required before conducting the common test.

However, the fitting results of the aforementioned statistical inferences all demonstrate a common characteristic: when the sample size is limited, the fitting of the statistical inference method is suboptimal; whereas as the sample size grows, the fitting effect of statistics will also enhance. Therefore, in designing clinical trials, after determining the statistical inference method, how can one select an appropriate sample size that can achieve the desired effect and simultaneously reduce resource waste? The determination of the sample size becomes extremely important for asymptotic statistical inference methods. Through the study of sample size, the statistical test can achieve a specified power at a given nominal level in all paired medical trials [22]. Qiu et al. [23] proposed an iterative algorithm for sample size determination using score and likelihood tests under two models. For the stratified bilateral data, Mou et al. [24] proposed several methods to calculate sample sizes for a common test of relative risk ratios. However, there is little research to evaluate the methods’ performance based on the unilateral and bilateral data. Furthermore, statistical inference on the risk difference for stratified unilateral and bilateral data has been established under Dallal’s model. However, the theory of relative risk ratio has not yet been incorporated into Dallal’s model. Liang et al. [14] also pointed out that the relative risk ratio is a parameter; it is worth extending the current framework for such applications. Meanwhile, the relative risk ratio would perform better than the risk difference for statistical inference when the differences between different data sets are relatively small [25]. This paper aims to investigate the homogeneity test and sample size determination for the relative risk in stratified unilateral and bilateral data under Dallal’s model.

The rest of this paper is organized as follows. Section 2 describes the data structure and introduces Dallal’s model, along with the unconstrained and constrained maximum likelihood estimations (MLEs) under different hypotheses. In Section 3, we propose three test statistics for homogeneity and determine the sample size using an iterative algorithm. Section 4 presents Monte Carlo simulations to evaluate the performance of the test statistics in terms of type I error rates and power (Section 4.1), followed by an assessment of the sample size determination based on estimated power (Section 4.2). Two real-world applications, including a study on acute otitis media and a recent study on myopia, are illustrated in Section 5. Finally, Section 6 concludes the paper.

2. Dallal’s Model

Assume that there are M subjects, divided into J strata, and each stratum has two groups. For the jth stratum (

j = 1, 2, \dots, J

),

N_{j}

represents the number of subjects providing unilateral data and

N_{j}^{'}

represents the number of subjects providing bilateral data.

l (= 0, 1)

and

l^{'} (= 0, 1, 2)

represent the number of responses provided for unilateral data and bilateral data, respectively. Suppose that

m_{l i j}^{(1)}

is the number of unilateral patients with l responses, and

m_{l^{'} i j}^{(2)}

is the number of bilateral patients with

l^{'}

responses in the ith group (

i = 1, 2

) of the jth stratum (

j = 1, 2, \dots, J

). For each stratum, we denote:

m_{i j}^{(1)} = m_{0 i j}^{(1)} + m_{1 i j}^{(1)}, m_{i j}^{(2)} = m_{0 i j}^{(2)} + m_{1 i j}^{(2)} + m_{2 i j}^{(2)},

m_{l + j}^{(1)} = m_{l 1 j}^{(1)} + m_{l 2 j}^{(1)}, m_{l^{'} + j}^{(2)} = m_{l^{'} 1 j}^{(2)} + m_{l^{'} 2 j}^{(2)} .

Let

p_{l i j}^{(1)}

and

p_{l^{'} i j}^{(2)}

be the corresponding probabilities of

m_{l i j}^{(1)}

and

m_{l^{'} i j}^{(2)}

. The observed data of the jth stratum are shown in Table 1.

For unilateral data, let

Z_{i j k}^{(1)}

be the indicator for judging whether the kth patient has a response or not in the ith group of the jth stratum. If there is a response, then

Z_{i j k}^{(1)} = 1

; otherwise,

Z_{i j k}^{(1)} = 0

. For bilateral data, define

Z_{i j k h}^{(2)} = 1

if the hth organ (

h = 1, 2

) of the kth patient has a response, and

Z_{i j k h}^{(2)} = 0

otherwise. Under Dallal’s model, we assume that:

P (Z_{i j k}^{(1)} = 1) = P (Z_{i j k h}^{(2)} = 1) = π_{i j}, P (Z_{i j k h}^{(2)} = 1 | Z_{i j k (3 - h)}^{(2)} = 1) = γ_{i j},

where

π_{i j}

(

0 \leq π_{i j} \leq 1

) represents the probability that the organ will improve, and

γ_{i j}

(

0 \leq γ_{i j} \leq 1

) represents the probability that one organ will respond when another organ improves. The correlation coefficient is

ρ_{i j} = (γ_{i j} - π_{i j}) / (1 - π_{i j})

in the i group of the jth stratum. Specifically,

γ_{i j} = π_{i j}

if two organs are completely independent, while

γ_{i j} = 1

if two organs are completely dependent. By calculation, the probabilities can be obtained by

\begin{matrix} \begin{matrix} p_{0 i j}^{(1)} & = 1 - π_{i j}, p_{1 i j}^{(1)} = π_{i j}, p_{0 i j}^{(2)} & = π_{i j} (γ_{i j} - 2) + 1, p_{1 i j}^{(2)} = 2 π_{i j} (1 - γ_{i j}), p_{2 i j}^{(2)} = π_{i j} γ_{i j}, \end{matrix} \end{matrix}

where

p_{l i j}^{(1)}, p_{l^{'} i j}^{(2)} \in [0, 1]

,

i = 1, 2, a n d j = 1, 2, \dots, J

. For the observed data

m_{ij} = {m_{0 i j}^{(1)}, m_{1 i j}^{(1)}, m_{0 i j}^{(2)},

m_{1 i j}^{(2)}, m_{2 i j}^{(2)}}

, the joint probability function is given by:

\begin{matrix} \prod_{j = 1}^{J} \prod_{i = 1}^{2} \frac{m_{i j}^{(1)}! m_{i j}^{(2)}!}{m_{0 i j}^{(1)}! m_{1 i j}^{(1)}! m_{0 i j}^{(2)}! m_{1 i j}^{(2)}! m_{2 i j}^{(2)}!} {p_{0 i j}^{(1)}}^{m_{0 i j}^{(1)}} {p_{1 i j}^{(1)}}^{m_{1 i j}^{(1)}} {p_{0 i j}^{(2)}}^{m_{0 i j}^{(2)}} {p_{1 i j}^{(2)}}^{m_{1 i j}^{(2)}} {p_{2 i j}^{(2)}}^{m_{2 i j}^{(2)}} . \end{matrix}

(1)

Let

δ_{j} = π_{2 j} / π_{1 j} (j = 1, 2, \dots, J)

be the relative risk ratio between the two groups in the jth stratum. We are interested in whether there is the same risk ratio between the the two groups across J strata. Thus, the homogeneity test is given as follows:

H_{0} : δ_{1} = δ_{2} = \dots = δ_{J} vs H_{1} : δ_{r} \neq δ_{s} (r \neq s) .

Next, the expressions or algorithms for all MLEs will be provided under the homogeneity test. The MLEs under the alternative hypothesis and null hypothesis are called the unconstrained and constrained MLEs, respectively.

Unconstrained MLEs. Based on the hypothesis

H_{1}

and Equation (1), the log-likelihood function can be expressed as follows:

\begin{matrix} l_{1} (m_{i j}; π, γ) = & \sum_{j = 1}^{J} \sum_{i = 1}^{2} [m_{0 i j}^{(1)} log (1 - π_{i j}) + m_{1 i j}^{(1)} log π_{i j} + m_{0 i j}^{(2)} log (π_{i j} (γ_{i j} - 2) + 1) \\ + m_{1 i j}^{(2)} log (2 π_{i j} (1 - γ_{i j})) + m_{2 i j}^{(2)} log (π_{i j} γ_{i j})] + log C, \end{matrix}

(2)

where

π = (π_{1}, π_{2})

,

π_{i} = {(π_{i 1}, \dots, π_{i J})}^{T}

,

γ = (γ_{1}, γ_{2})

,

γ_{i} = {(γ_{i 1}, \dots, γ_{i J})}^{T}

(i = 1, 2)

, and

C = \prod_{j = 1}^{J} \prod_{i = 1}^{2} \frac{m_{i j}^{(1)}! m_{i j}^{(2)}!}{m_{0 i j}^{(1)}! m_{1 i j}^{(1)}! m_{0 i j}^{(2)}! m_{1 i j}^{(2)}! m_{2 i j}^{(2)}!}

. Differentia (2) with respect to

π_{i j}

and

γ_{i j}

, and set them to 0. That is:

\begin{matrix} \frac{\partial l_{1}}{\partial π_{i j}} = \frac{m_{1 i j}^{(1)} + m_{1 i j}^{(2)} + m_{2 i j}^{(2)}}{π_{i j}} + \frac{m_{0 i j}^{(1)}}{π_{i j} - 1} + \frac{m_{0 i j}^{(2)} (γ_{i j} - 2)}{π_{i j} (γ_{i j} - 2) + 1} = 0, \\ \frac{\partial l_{1}}{\partial γ_{i j}} = \frac{m_{2 i j}^{(2)}}{γ_{i j}} + \frac{m_{1 i j}^{(2)}}{γ_{i j} - 1} + \frac{m_{0 i j}^{(2)} π_{i j}}{π_{i j} (γ_{i j} - 2) + 1} = 0 . \end{matrix}

Since closed-form solutions may be not available, an iterative procedure is adopted for parameter estimation. The detailed process is as follows. Firstly, initial values are calculated from the explicit formulas of the counts as follows:

π_{i j}^{(0)} = \frac{m_{1 i j}^{(1)} + m_{1 i j}^{(2)} + 2 m_{2 i j}^{(2)}}{m_{i j}^{(1)} + 2 m_{i j}^{(2)}}, γ_{i j}^{(0)} = \frac{4 N_{j} π_{i j}^{(0)} (m_{21 j}^{(2)} + m_{22 j}^{(2)})}{{(\sum_{i = 1}^{2} (m_{1 i j}^{(1)} + m_{1 i j}^{(2)} + 2 m_{2 i j}^{(2)}))}^{2}} .

Then, the (

t + 1

)th approximation

π_{i j}^{(t + 1)}

and

γ_{i j}^{(t + 1)}

can be obtained,

[\begin{matrix} π_{i j}^{(t + 1)} \\ γ_{i j}^{(t + 1)} \end{matrix}] = [\begin{matrix} π_{i j}^{(t)} \\ γ_{i j}^{(t)} \end{matrix}] + I_{1} {(π_{i j}^{(t)}, γ_{i j}^{(t)})}^{- 1} [\begin{matrix} \frac{\partial l_{1}}{\partial π_{i j}} \\ \frac{\partial l_{1}}{\partial γ_{i j}} \end{matrix}],

where

I_{1}

is the Fisher information matrix (Appendix A.1). Repeat the above step until all estimates converge. Then, the unconstrained MLEs

{\hat{π}}_{i j}

and

{\hat{γ}}_{i j}

can be obtained.

Constrained MLEs. Under the null hypothesis

H_{0} : δ_{1} = δ_{2} = \dots = δ_{J} ≜ δ

, it follows that

π_{2 j} = π_{1 j} δ

for each

j = 1, \dots, J

. Thus, Equation (2) can be expressed as follows:

\begin{matrix} l_{0} (δ, π_{1}, γ) = & \sum_{j = 1}^{J} [m_{01 j}^{(1)} log (1 - π_{1 j}) + m_{11 j}^{(1)} log π_{1 j} + m_{02 j}^{(1)} log (1 - δ π_{1 j}) + m_{12 j}^{(1)} log (δ π_{1 j}) \\ + m_{01 j}^{(2)} log (π_{1 j} (γ_{1 j} - 2) + 1) + m_{11 j}^{(2)} log (2 π_{1 j} (1 - γ_{1 j})) + m_{21 j}^{(2)} log (π_{1 j} γ_{1 j}) \\ + m_{02 j}^{(2)} log (δ π_{1 j} (γ_{2 j} - 2) + 1) + m_{12 j}^{(2)} log (2 δ π_{1 j} (1 - γ_{2 j})) \\ + m_{22 j}^{(2)} log (δ π_{1 j} γ_{2 j})] + log C, \end{matrix}

(3)

where

δ

is an parameter, which is the focus of the homogeneity test.

π_{1 j}

and

γ_{i j}

(

i = 1, 2, j = 1, 2, \dots, J

) are nuisance parameters. The constrained MLEs of

π_{1 j}, γ_{i j}

and

δ

can be denoted as

{\tilde{π}}_{1 j}, {\tilde{γ}}_{i j}

and

\tilde{δ}

. The estimates are the solution of the following equations:

\frac{\partial l_{0}}{\partial π_{1 j}} = 0, \frac{\partial l_{0}}{\partial γ_{i j}} = 0, \frac{\partial l_{0}}{\partial δ} = 0 .

However, there is no closed-form solution. It can be solved by the Newton–Raphson process and Fisher scoring method. First, take

π_{i j}^{(0)} = {\hat{π}}_{i j}, γ_{i j}^{(0)} = {\hat{γ}}_{i j}, δ^{(0)} = \frac{1}{J} \sum_{j = 1}^{J} {\hat{π}}_{2 j} / {\hat{π}}_{1 j}

as the initial values. By iterating steps (i) and (ii) until convergence, the constrained MLEs are obtained as follows:

(i): The ( $t + 1$ )th approximation $δ^{(t + 1)}$ is:

$δ^{(t + 1)} = δ^{(t)} - {(\frac{\partial^{2} l_{0}}{\partial δ^{2}})}^{- 1} \frac{\partial l_{0}}{\partial δ} |_{π_{1 j} = π_{1 j}^{(t)}, γ_{i j} = γ_{i j}^{(t)}} .$
(ii): $π_{1 j}$ and $γ_{i j}$ can be updated by the Fisher scoring algorithm:

$[\begin{matrix} π_{1 j}^{(t + 1)} \\ γ_{1 j}^{(t + 1)} \\ γ_{2 j}^{(t + 1)} \end{matrix}] = [\begin{matrix} π_{1 j}^{(t)} \\ γ_{1 j}^{(t)} \\ γ_{2 j}^{(t)} \end{matrix}] + I_{0} {(π_{1 j}^{(t)}, γ_{1 j}^{(t)}, γ_{2 j}^{(t)})}^{- 1} [\begin{matrix} \frac{\partial l_{0}}{\partial π_{1 j}} \\ \frac{\partial l_{0}}{\partial γ_{1 j}} \\ \frac{\partial l_{0}}{\partial γ_{2 j}} \end{matrix}] |_{δ = δ^{(t + 1)}},$

where $I_{0}$ is the Fisher information matrix. See Appendix A.1 for more details.

3. Asymptotic Tests and Sample Determination

In this section, three statistics are derived to investigate the homogeneity test of relative risk ratio, including the likelihood ratio, Wald-type, and score statistics. Then, the three corresponding sample size determination methods are discussed based on an iterative algorithm.

3.1. Asymptotic Tests

Likelihood ratio test. The likelihood ratio test statistic is given by:

\begin{matrix} \begin{matrix} T_{L} = 2 [l_{1} (\hat{π}, \hat{γ}) - l_{0} (\tilde{δ}, {\tilde{π}}_{1}, \tilde{γ})], \end{matrix} \end{matrix}

where

\hat{π}, \hat{γ}

are the unconstrained MLEs, and

\tilde{δ}, {\tilde{π}}_{1}, \tilde{γ}

are the constrained MLEs. Under the null hypothesis

H_{0}

,

T_{L}

is asymptotically distributed as the

χ^{2}

distribution with

J - 1

degrees of freedom. If

T_{L} > χ_{J - 1, 1 - α}^{2}

, we should reject

H_{0}

, where

χ_{J - 1, 1 - α}^{2}

is the

100 (1 - α)

percentile of

χ^{2}

distribution with

J - 1

degrees of freedom.

Wald-type test. The null hypothesis

H_{0} : δ_{1} = δ_{2} = \dots = δ_{J}

is equivalent to the matrix form

A δ^{T} = 0

, where

δ = (δ_{1}, δ_{2}, \dots, δ_{J})

, and

\begin{matrix} A = {[\begin{matrix} 1 & - 1 & 0 & 0 & \dots & 0 & 0 \\ 0 & 1 & - 1 & 0 & \dots & 0 & 0 \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ 0 & 0 & 0 & 0 & \dots & 1 & - 1 \end{matrix}]}_{(J - 1) \times J} . \end{matrix}

Thus, the Wald-type test is defined as follows:

\begin{matrix} \begin{matrix} T_{W} = (\hat{δ} A^{T}) {(A I^{- 1} A^{T})}^{- 1} (A {\hat{δ}}^{T}), \end{matrix} \end{matrix}

where

\hat{δ} = ({\hat{δ}}_{1}, {\hat{δ}}_{2}, \dots, {\hat{δ}}_{J})

is the unconstrained MLEs, and the Fisher information matrix is:

I = d i a g (E (- \frac{\partial^{2} l_{1}}{\partial δ_{1}^{2}}), E (- \frac{\partial^{2} l_{1}}{\partial δ_{2}^{2}}), \dots, E (- \frac{\partial^{2} l_{1}}{\partial δ_{J}^{2}})),

and

\begin{matrix} \begin{matrix} E (- \frac{\partial^{2} l_{1}}{\partial δ_{j}^{2}}) ≜ e_{j} = & \frac{m_{+ 2 j}^{(1)} π_{1 j} - m_{+ 2 j}^{(2)} π_{1 j} (γ_{2 j} - 2)}{δ_{j}} - \frac{m_{+ 2 j}^{(1)} π_{1 j}^{2}}{π_{1 j} δ_{j} - 1} + \frac{m_{+ 2 j}^{(2)} π_{1 j}^{2} {(γ_{2 j} - 2)}^{2}}{π_{1 j} δ_{j} (γ_{2 j} - 2) + 1} . \end{matrix} \end{matrix}

Then, the Wald-type test

T_{W}

is asymptotically distributed as a chi-square distribution with

J - 1

degree of freedom. Denote

C_{{e_{1}, e_{2}, \dots, e_{p}}}^{1} = e_{1} e_{2} \dots e_{p - 1} + e_{1} e_{2} \dots e_{p - 2} e_{p} + e_{1} e_{2} \dots e_{p - 3} e_{p - 1} + \dots + e_{2} e_{3} \dots e_{p}

and

e_{p, q} = e_{p + 1} e_{p + 2} \dots e_{q} (p < q)

. If

p = q

, then

e_{p, q} = 1

. Further, the Wald-type test can be rewritten as follows:

\begin{matrix} T_{W} = \sum_{p = 1}^{J - 1} \sum_{q = 1}^{J - 1} ({\hat{δ}}_{p} - {\hat{δ}}_{p + 1}) ({\hat{δ}}_{q} - {\hat{δ}}_{q + 1}) E_{p, q}^{- 1} (\hat{δ}), \end{matrix}

where

\begin{matrix} \begin{matrix} E_{p, q}^{- 1} (\hat{δ}) = \frac{C_{{e_{1}, e_{2}, \dots, e_{p}}}^{1} C_{{e_{q + 1}, e_{q + 2}, \dots, e_{J}}}^{1} e_{p, q}}{C_{{e_{1}, e_{2}, \dots, e_{J}}}^{1}} . \end{matrix} \end{matrix}

The detailed calculation process is given in Appendix A.2.

Score test. Under

H_{1}

, let

θ = {(δ_{1}, \dots, δ_{J}, π_{1}^{T}, γ^{T})}^{T}

denote the full parameter vector. The score vector is

S_{δ} (θ) = {(\partial l_{1} / \partial δ_{1}, \dots, \partial l_{1} / \partial δ_{J})}^{T}

. The Fisher information matrix is written accordingly as follows:

I (θ) = (\begin{matrix} I_{δ δ} & I_{δ π_{1}} & I_{δ γ} \\ I_{π_{1} δ} & I_{π_{1} π_{1}} & I_{π_{1} γ} \\ I_{γ δ} & I_{γ π_{1}} & I_{γ γ} \end{matrix}) .

Let

\tilde{θ} = {(\tilde{δ}, \dots, \tilde{δ}, {\tilde{π}}_{1}^{T}, {\tilde{γ}}^{T})}^{T}

be the constrained MLE under

H_{0}

. The efficient information matrix for

δ

adjusted for the nuisance parameters

π_{1}

and

γ

is:

I_{δ} = I_{δ δ} - (\begin{matrix} I_{δ π_{1}} & I_{δ γ} \end{matrix}) {(\begin{matrix} I_{π_{1} π_{1}} & I_{π_{1} γ} \\ I_{γ π_{1}} & I_{γ γ} \end{matrix})}^{- 1} (\begin{matrix} I_{π_{1} δ} \\ I_{γ δ} \end{matrix}) .

Then, the score test statistic for testing

H_{0}

is:

T_{S} = {(A S_{δ} (\tilde{θ}))}^{T} {(A I_{δ}^{- 1} (\tilde{θ}) A^{t})}^{- 1} (A S_{δ} (\tilde{θ})),

which asymptotically follows a chi-square distribution with

J - 1

degrees of freedom under

H_{0}

. The explicit form of

I (θ)

is given in Appendix A.3.

3.2. Sample Size Determination

Under the alternative hypothesis

H_{1}

, the asymptotic power of a test statistic

T_{h}

(

h = L, W, S

) is given by

P (T_{h} \geq χ_{J - 1, 1 - α}^{2} ∣ H_{1})

. The sample size required to achieve a target power

1 - β

at significance level

α

must satisfy:

P (T_{h} \geq χ_{J - 1, 1 - α}^{2} ∣ H_{1}) = 1 - β .

Since no closed-form solution exists, we propose an iterative algorithm to determine the sample size. The steps are described as follows:

(I): Initialize $M^{(0)} = 0$ , step size $d = 100$ , and flag $f = 1$ for given J, $π_{i j}$ , and $γ_{i j}$ .
(II): Update $M^{(t + 1)} = M^{(t)} + d \times f$ . Under $H_{1}$ , we randomly generate k samples $m_{i j} = {m_{0 i j}^{(1)}, m_{1 i j}^{(1)}, m_{0 i j}^{(2)}, m_{1 i j}^{(2)}, m_{2 i j}^{(2)}}$ for $i = 1, 2$ and $j = 1, \dots, J$ . Compute the approximate power $p^{*} (M^{(t + 1)})$ of $T_{h}$ based on these samples.
(III): If $f \cdot p^{*} (M^{(t + 1)}) < f \cdot (1 - β)$ , return to Step II. Otherwise, set $d = 0.1 \times d$ , $f = - f$ , and return to step (II).

Repeat steps (II)–(III) until

| p^{*} (M^{(t + 1)}) - (1 - β) | \leq 0.0001

. The desired sample size is

M^{(t + 1)}

.

4. Monte Carlo Simulation

In this section, we conduct Monte Carlo simulations to evaluate the performance of the proposed statistical tests and sample size methods. 10,000 samples are simulated under the null or alternative hypothesis. Table 2 summarizes the parameter configurations for different event rates

π_{1}

and the probability

γ_{i} (i = 1, 2)

of the other organ responding if one organ has improvement. In each configuration, we consider the number of each group

m = (m_{i j}^{(1)}, m_{i j}^{(2)}) = (25, 25), (50, 50), (100,

100)

and the stratum

J = 2, 4, 6

.

4.1. The Performance of Test Statistics

The performance of the proposed testing methods is assessed by examining empirical type I error rates (TIEs) and empirical powers. In the simulation studies, all tests are conducted at the significance level

α = 0.05

. 10,000 samples are randomly generated under the null hypothesis, and empirical type I error rates are computed by dividing the number of times the null hypothesis is rejected by 10,000. According to Tang et al. [26], a test is liberal if its empirical TIEs are greater than 0.06, conservative if the TIEs are less than 0.04, and otherwise robust. The results for

J = 4

are shown in Table 3, while those for

J = 2

and

J = 6

are provided in Supplementary Material S1. The findings indicate that tests based on larger sample sizes perform more satisfactorily than those based on smaller sample sizes, across a range of parameter configurations.

T_{W}

tends to be more conservative in the small sample size

m = (25, 25), (50, 50)

. And

T_{L}

is more liberal for the

m = (25, 25)

.

T_{S}

behaves satisfactorily, in the sense that its type I error rate is close to the pre-determined nominal level of 0.05 for any configuration. A total of 1000 parameter configurations are randomly generated to evaluate the test statistics, with empirical TIEs visualized using box plots in Figure 1. The results indicate that

T_{L}

tends to be increasingly liberal as the number of strata grows, while

T_{W}

is even more affected, although its performance improves with larger sample sizes and more strata. Notably,

T_{S}

demonstrates superior robustness across all settings and is thus recommended for testing homogeneity.

Then, the performance of powers is investigated for the proposed test statistics under the same parameter configurations (Table 2). In the simulation studies, the data are generated under the alternative hypothesis. The empirical power is calculated as the proportion of correct rejections of the null hypothesis in 10,000 Monte Carlo repetitions. Empirical power results for

J = 4

are presented in Table 4 and Supplementary Material S2. The powers of all three test statistics increase with larger sample sizes and more strata. For

δ_{1}

and

δ_{2}

, the powers of

T_{L}

and

T_{S}

are greater than that of

T_{W}

. However, the opposite conclusion will be reached for

δ_{3}

. Furthermore, Figure 2 further shows the relationship between the power and parameter

δ

in the alternative hypotheses. Suppose that

δ = 0.6 (0.05) 1.55

, i.e., from 0.6 to 1.55 with step size 0.05. Other parameter configurations are set in case III, a₂ and b₂ (Table 2) for strata

J = 2, 4, 6

and group number M1 = (25, 25), M2 = (50, 50), M3 = (100, 100). The results reflect that the power is greatly influenced by

δ

. Interestingly, the closer the alternative hypothesis is to the null hypothesis, the lower the power. Moreover, the power of

T_{W}

is significantly smaller than the power of

T_{L}

and

T_{S}

with

δ \in (0.6, 1.45)

and M1.

4.2. The Performance of Sample Size Determination

The sample size determination methods are evaluated based on their empirical powers. In the simulation studies, the same parameter settings of

δ

,

π_{1}

and

γ_{i} (i = 1, 2)

are selected as in Table 2 for the stratum

J = 2, 4, 6

. For any given parameters, the sample size and estimated power are computed iteratively at

α = 0.05

for target powers of

80 %

or

90 %

. Results for target powers of

80 %

and

90 %

are presented in Table 5 and Table 6. The estimated powers fluctuate around the desired levels. As expected, the required sample size for

90 %

power is substantially larger than that for

80 %

power. Sample sizes increase with both the stratum size (i.e., the number of subjects per stratum) and the number of strata. They also vary across different parameter configurations. Figure 3 and Figure 4 further illustrate the relationship between sample size and these factors. As the number of strata J increases from 2 to 20, the total sample size increases while the average sample size per stratum decreases. In Figure 4, with

π_{1} = (0.4, π)

and

γ_{1} = (0.5, γ)

, we observe that the sample size is more sensitive to changes in

δ

and

π_{1 j}

than to those in

γ_{1 j}

. And the sample size becomes larger when

δ

is closer to 1. This conclusion coincides with the result of the correlation between power and

δ

in Section 4.1. Moreover, the sample size decreases as

π

increases, which is because the change in

π

will affect the value of

δ

.

5. Two Real Examples

To address the unilateral and bilateral combined data structure, two real examples of otolaryngology and myopathy provide us with the ability to implement our methodology for real-world data. For the otolaryngology study (Table 5 of Reference [20]), the risk ratio is

δ_{j} = π_{2 j} / π_{1 j} (j = 1, 2, 3)

. The homogeneity hypothesis

H_{0} : δ_{1} = δ_{2} = δ_{3} ≜ δ

versus

H_{1} : δ_{r} \neq δ_{s} (r \neq s)

is tested to determine whether children of different ages require distinct antibiotics for improved treatment outcomes. The unconstrained MLEs are

\hat{δ} = (2.1749, 1.5609, 0.9503)

,

{\hat{π}}_{1} = (0.1871, 0.4375, 0.6803)

,

{\hat{γ}}_{1} = (0.6553, 0.8790, 0.9878)

and

{\hat{γ}}_{2} = (0.8688, 0.8199, 0.7633)

. The results of the constrained MLEs can be found in Table 6 of Reference [20].

Table 7 presents the corresponding statistical values and p-values based on the three proposed tests for two examples. The results show that all p-values are above

0.05

under

H_{0}

, failing to reject the null hypothesis of no treatment difference in relative risk ratio across the three age strata between Cefaclor and Amoxicillin.

The parameter estimates are identical to the MLEs given above. Table 8 reports the sample sizes and corresponding estimated powers under target powers of

80 %

and

90 %

at

α = 0.05

. The sample sizes for the three tests are generally accurate, with empirical powers close to the pre-specified levels, and are thus recommended for this example. While

T_{W}

requires a smaller sample size than

T_{L}

and

T_{S}

,

T_{S}

yields estimated powers closer to the nominal values. Consequently, to achieve

80 %

(or

90 %

) power, sample sizes of 504 (or 756) are needed.

Another example is a recorded observational study on myopia patients (Table 9 of Reference [20]). We are interested in comparing the effectiveness of treatments between the VST and CRT groups across genders. Apply the proposed methods to test

H_{0} : δ_{1} = δ_{2} ≜ δ

vs.

H_{1} : δ_{r} \neq δ_{s} (r \neq s; s, r \in {1, 2})

. According to the calculation, the unconstrained MLEs are

\hat{δ} = ({\hat{δ}}_{1}, {\hat{δ}}_{2}) = (0.0737, 0.9467)

,

{\hat{π}}_{1} = (0.4340, 0.3201)

,

{\hat{γ}}_{1} = (0.8189, 0.6354)

and

{\hat{γ}}_{2} = (0.7282, 0.6688)

. Table 8 indicates that

T_{L}

and

T_{S}

have greater estimated power than

T_{W}

. In order to obtain more robust results and the desired power to reach 80% (or 90%), 120 (or 160) samples are needed.

6. Conclusions

We develop three asymptotic tests and three iterative sample size determination methods for the risk ratio based on stratified unilateral and bilateral data within Dallal’s model. The unconstrained and constrained MLEs are derived using the Newton–Raphson procedure and the Fisher scoring algorithm. Simulation results support the recommendation of the score test for evaluating treatment effectiveness under a variety of data-generating scenarios. The sample size methods based on the score test or the likelihood ratio test are also suggested for determining the empirical sample size, because their estimated powers are closer to empirical powers than those based on the Wald-type test. Furthermore, two real-world datasets of acute otitis media and myopic eyes are used to illustrate the application of the proposed tests and sample size determination.

The contributions of this work extend beyond the following: (i) Many current studies tend to analyze bilateral data without considering unilateral data. However, the unilateral data is also obtained in clinical practice, when only one side of the patient’s paired organs is diseased or has received treatment. Our methodologies can be applied not only to the research of bilateral data, but also to the research of unidirectional and bilateral data. In our context, the scenario of bilateral data alone constitutes a special case. (ii) In practice, sample size is one of the essential factors in designing clinical accuracy trials. Through the study of sample size, the statistical test can achieve a specified power at a given nominal level in all paired medical trials. In actual data research, it will lead to inaccurate test results if the sample size is insufficient. When the sample size is too large, it will lead to unnecessary waste of resources. Therefore, the sample size determination is discussed based on the stratified unilateral and bilateral data in this paper.

Despite its satisfactory performance, the proposed method has two limitations. First, the iterative sample size determination process is computationally intensive. Second, the reliability of the method relies on the assumption of large stratum sizes, and its performance in sparse data settings requires further investigation. Therefore, future research should focus on developing exact methods for small-sample data to enhance its applicability.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/axioms15040268/s1, Table S1: The empirical TIEs (%) for J = 2. Table S2: The empirical TIEs (%) for J = 6. Table S3: The empirical powers (%) for J = 2. Table S4: The empirical powers (%) for J = 6.

Author Contributions

Conceptualization, investigation, project administration, writing—original draft, S.S.; Writing—review and editing, supervision, Z.L. All authors have read and agreed to the published version of the manuscript.

Funding

The work was supported by the Natural Science Research Start-up Foundation of Recruiting Talents of Nanjing University of Posts and Telecommunications (XK0070922093), the Central Guidance for Local Science and Technology Development Fund (ZYYD2025ZY20), the National Natural Science Foundation of China (12561047), and the Xinjiang Talent Development Fund (XJRC-2025-KJ-PY-KJLJ-108).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are openly available in [Pediatr. Infect. Dis. J.] at [DOI: https://doi.org/10.1097/00006454-198209000-00006] and in [Stat. Pap.] at [DOI: https://doi.org/10.1007/s00362-024-01532-6].

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Appendix A.1. Information Matrix for Computing MLEs

The Fisher information matrix

I_{1}

is given by:

I_{1} (π_{i j}, γ_{i j}) = [\begin{matrix} E (- \frac{\partial^{2} l_{1}}{\partial π_{i j}^{2}}) & E (- \frac{\partial^{2} l_{1}}{\partial π_{i j} \partial γ_{i j}}) \\ E (- \frac{\partial^{2} l_{1}}{\partial π_{i j} \partial γ_{i j}}) & E (- \frac{\partial^{2} l_{1}}{\partial γ_{i j}^{2}}) \end{matrix}],

and

\begin{matrix} E (- \frac{\partial^{2} l_{1}}{\partial π_{i j}^{2}}) = - \frac{m_{i j}^{(1)}}{π_{i j} - 1} + \frac{m_{i j}^{(1)} - m_{i j}^{(2)} (γ_{i j} - 2)}{π_{i j}} + \frac{m_{i j}^{(2)} {(γ_{i j} - 2)}^{2}}{π_{i j} (γ_{i j} - 2) + 1}, \\ E (- \frac{\partial^{2} l_{1}}{\partial γ_{i j}^{2}}) = \frac{m_{i j}^{(2)} π_{i j}}{γ_{i j}} - \frac{2 m_{i j}^{(2)} π_{i j}}{γ_{i j} - 1} + \frac{m_{i j}^{(2)} π_{i j}^{2}}{π_{i j} (γ_{i j} - 2) + 1}, \\ E (- \frac{\partial^{2} l_{1}}{\partial π_{i j} \partial γ_{i j}}) = - \frac{m_{i j}^{(2)}}{π_{i j} (γ_{i j} - 2) + 1} . \end{matrix}

Differentiating the log-likelihood function (3) with

δ, π_{1 j}, γ_{1 j},

and

γ_{2 j}

are as follows:

\begin{matrix} \frac{\partial l_{0}}{\partial δ} = & \sum_{j = 1}^{J} (\frac{m_{12 j}^{(1)} + m_{12 j}^{(2)} + m_{22 j}^{(2)}}{δ} + \frac{m_{02 j}^{(1)} π_{1 j}}{π_{1 j} δ - 1} + \frac{m_{02 j}^{(2)} π_{1 j} (γ_{2 j} - 2)}{π_{1 j} δ (γ_{2 j} - 2) + 1}), \\ \frac{\partial^{2} l_{0}}{\partial δ^{2}} = & - \sum_{j = 1}^{J} (\frac{m_{12 j}^{(1)} + m_{12 j}^{(2)} + m_{22 j}^{(2)}}{δ^{2}} + \frac{m_{02 j}^{(1)} π_{1 j}^{2}}{{(π_{1 j} δ - 1)}^{2}} + \frac{m_{02 j}^{(2)} π_{1 j}^{2} {(γ_{2 j} - 2)}^{2}}{{(π_{1 j} δ (γ_{2 j} - 2) + 1)}^{2}}), \\ \frac{\partial l_{0}}{\partial π_{1 j}} = & \frac{m_{1 + j}^{(1)} + m_{1 + j}^{(2)} + m_{2 + j}^{(2)}}{π_{1 j}} + \frac{m_{01 j}^{(1)}}{π_{1 j} - 1} + \frac{m_{02 j}^{(1)} δ}{π_{1 j} δ - 1} \\ + \frac{m_{01 j}^{(2)} (γ_{1 j} - 2)}{π_{1 j} (γ_{1 j} - 2) + 1} + \frac{m_{02 j}^{(2)} δ (γ_{2 j} - 2)}{π_{1 j} δ (γ_{2 j} - 2) + 1}, \\ \frac{\partial l_{0}}{\partial γ_{1 j}} = & \frac{m_{21 j}^{(2)}}{γ_{1 j}} + \frac{m_{11 j}^{(2)}}{γ_{1 j} - 1} + \frac{m_{01 j}^{(2)} π_{1 j}}{π_{1 j} (γ_{1 j} - 2) + 1}, \\ \frac{\partial l_{0}}{\partial γ_{2 j}} = & \frac{m_{22 j}^{(2)}}{γ_{2 j}} + \frac{m_{12 j}^{(2)}}{γ_{2 j} - 1} + \frac{m_{02 j}^{(2)} π_{1 j} δ}{π_{1 j} δ (γ_{2 j} - 2) + 1} . \end{matrix}

The Fisher information matrix

I_{0}

is written as follows:

I_{0} (π_{1 j}, γ_{1 j}, γ_{2 j}) = [\begin{matrix} I_{11}^{j} & I_{12}^{j} & I_{13}^{j} \\ I_{21}^{j} & I_{22}^{j} & I_{23}^{j} \\ I_{13}^{j} & I_{23}^{j} & I_{33}^{j} \end{matrix}],

where

\begin{matrix} I_{11}^{j} = & E (- \frac{\partial^{2} l_{0}}{\partial π_{1 j}^{2}}) = \frac{m_{+ 1 j}^{(1)} + m_{+ 2 j}^{(1)} δ - m_{+ 1 j}^{(2)} (γ_{1 j} - 2) - m_{+ 2 j}^{(2)} (γ_{2 j} - 2) δ}{π_{1 j}} \\ - \frac{m_{+ 1 j}^{(1)}}{π_{1 j} - 1} - \frac{m_{+ 2 j}^{(1)} δ^{2}}{π_{1 j} δ - 1} + \frac{m_{+ 1 j}^{(2)} {(γ_{1 j} - 2)}^{2}}{π_{1 j} (γ_{1 j} - 2) + 1} + \frac{m_{+ 2 j}^{(2)} {(γ_{2 j} - 2)}^{2} δ^{2}}{π_{1 j} (γ_{2 j} - 2) δ + 1}, \\ I_{12}^{j} = & E (- \frac{\partial^{2} l_{0}}{\partial π_{1 j} \partial γ_{1 j}}) = - \frac{m_{+ 1 j}^{(2)}}{π_{1 j} (γ_{1 j} - 2) + 1}, \\ I_{13}^{j} = & E (- \frac{\partial^{2} l_{0}}{\partial π_{1 j} \partial γ_{2 j}}) = - \frac{m_{+ 2 j}^{(2)} δ}{π_{1 j} (γ_{2 j} - 2) δ + 1}, \\ I_{22}^{j} = & E (- \frac{\partial^{2} l_{0}}{\partial γ_{1 j}^{2}}) = \frac{m_{+ 1 j}^{(2)} π_{1 j}}{γ_{1 j}} - \frac{2 m_{+ 1 j}^{(2)} π_{1 j}}{γ_{1 j} - 1} + \frac{m_{+ 1 j}^{(2)} π_{1 j}^{2}}{π_{1 j} (γ_{1 j} - 2) + 1}, \\ I_{23}^{j} = & E (- \frac{\partial^{2} l_{0}}{\partial γ_{1 j} \partial γ_{2 j}}) = 0, \\ I_{33}^{j} = & E (- \frac{\partial^{2} l_{0}}{\partial γ_{2 j}^{2}}) = \frac{m_{+ 2 j}^{(2)} π_{1 j} δ}{γ_{2 j}} - \frac{2 m_{+ 2 j}^{(2)} π_{1 j}^{2} δ^{2}}{γ_{2 j} - 1} + \frac{m_{+ 2 j}^{(2)} π_{1 j}^{2}}{π_{1 j} (γ_{2 j} - 2) δ + 1} . \end{matrix}

Appendix A.2. Derivation of Wald-Type Test

The Wald test statistic for

H_{0} : δ_{1} = \dots = δ_{J}

is:

T_{W} = \sum_{p = 1}^{J - 1} \sum_{q = 1}^{J - 1} ({\hat{δ}}_{p} - {\hat{δ}}_{p + 1}) ({\hat{δ}}_{q} - {\hat{δ}}_{q + 1}) E_{p, q}^{- 1} (\hat{δ}),

where

E_{p, q}^{- 1} (\hat{δ})

denotes the

(p, q)

-th element of the inverse of the covariance matrix of the successive differences

{\hat{δ}}_{p} - {\hat{δ}}_{p + 1}

(

p = 1, \dots, J - 1

). To derive its explicit form, we assume that

{\hat{δ}}_{1}, \dots, {\hat{δ}}_{J}

are independent and denote their variances by

e_{1}, \dots, e_{J}

(

e_{j} > 0

). Under this assumption, the covariance matrix

V

of

{({\hat{δ}}_{1} - {\hat{δ}}_{2}, \dots, {\hat{δ}}_{J - 1} - {\hat{δ}}_{J})}^{⊤}

takes the following symmetric tridiagonal form:

V = (\begin{matrix} e_{1} + e_{2} & - e_{2} & 0 & \dots & 0 \\ - e_{2} & e_{2} + e_{3} & - e_{3} & \dots & 0 \\ 0 & - e_{3} & e_{3} + e_{4} & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋱ & ⋮ \\ 0 & 0 & 0 & - e_{J - 1} & e_{J - 1} + e_{J} \end{matrix}) .

To derive the explicit form of its inverse, we introduce the following notation. For positive quantities

e_{1}, \dots, e_{J}

, define

C_{{a_{1}, \dots, a_{m}}}^{1} = \sum_{i = 1}^{m} \prod_{j \neq i} a_{j}

(the sum of all products obtained by deleting one element), with the convention

C_{{a_{1}}}^{1} = 1

. Also define

e_{p, q} = \prod_{k = p + 1}^{q} e_{k}

for

p < q

, and

e_{p, p} = 1

.

The determinant of

V

can be computed via the standard recurrence for tridiagonal matrices and simplifies to:

det (V) = C_{{e_{1}, \dots, e_{J}}}^{1} .

For

1 \leq p, q \leq J - 1

, let

V_{(q, p)}

be the matrix obtained from

V

by deleting the q-th row and the p-th column. By exploiting the tridiagonal structure, one finds that for

p \leq q

,

det (V_{(q, p)}) = {(- 1)}^{p + q} e_{p, q} C_{{e_{1}, \dots, e_{p}}}^{1} C_{{e_{q + 1}, \dots, e_{J}}}^{1},

and the case

p > q

follows by symmetry. The inverse elements are given by

{(V^{- 1})}_{p q} = {(- 1)}^{p + q} det (V_{(q, p)}) / det (V)

. For all

p, q

, substituting the expressions above yields:

{(V^{- 1})}_{p q} = \frac{C_{{e_{1}, \dots, e_{min (p, q)}}}^{1} C_{{e_{max (p, q) + 1}, \dots, e_{J}}}^{1} e_{min (p, q), max (p, q)}}{C_{{e_{1}, \dots, e_{J}}}^{1}} .

Evaluating this at the estimated values

\hat{δ}

(which determine the

e_{i}

) gives

E_{p, q}^{- 1} (\hat{δ})

as stated in the main text.

Appendix A.3. Derivation of Score Statistic

The Fisher information matrix

I (θ)

is:

I (θ) = (\begin{matrix} I_{δ δ} & I_{δ π_{1}} & I_{δ γ} \\ I_{π_{1} δ} & I_{π_{1} π_{1}} & I_{π_{1} γ} \\ I_{γ δ} & I_{γ π_{1}} & I_{γ γ} \end{matrix}),

where

\begin{matrix} I_{δ δ} & = - E (\frac{\partial^{2} l_{0}}{\partial δ \partial δ^{T}}), I_{δ π_{1}} = I_{π_{1} δ}^{T} = - E (\frac{\partial^{2} l_{0}}{\partial δ \partial π_{1}^{T}}), I_{δ γ} = I_{γ δ}^{T} = - E (\frac{\partial^{2} l_{0}}{\partial δ \partial γ^{T}}), \\ I_{π_{1} π_{1}} & = - E (\frac{\partial^{2} l_{0}}{\partial π_{1} \partial π_{1}^{T}}), I_{π_{1} γ} = I_{γ π_{1}}^{T} = - E (\frac{\partial^{2} l_{0}}{\partial π_{1} \partial γ^{T}}), I_{γ γ} = - E (\frac{\partial^{2} l_{0}}{\partial γ \partial γ^{T}}), \end{matrix}

and

\begin{matrix} \begin{matrix} E (- \frac{\partial^{2} l_{0}}{\partial δ \partial π_{1 j}}) = & - \frac{m_{+ 2 j}^{(1)}}{π_{1 j} δ - 1} - \frac{m_{+ 2 j}^{(2)} (γ_{2 j} - 2)}{π_{1 j} δ (γ_{2 j} - 2) + 1}, \\ E (- \frac{\partial^{2} l_{0}}{\partial δ \partial γ_{1 j}}) = & 0, E (- \frac{\partial^{2} l_{0}}{\partial δ \partial γ_{2 j}}) = - \frac{m_{+ 2 j}^{(2)} π_{1 j}}{π_{1 j} δ (γ_{2 j} - 2) + 1} . \end{matrix} \end{matrix}

References

Gu, J.X.; Chen, Q.; Zhang, P.J.; Zhang, T.; Zhou, X.; Zhang, K.; Jiang, T.; Liu, S.; Chen, W.; Zhou, M.; et al. Characteristics of vitreoretinal lymphoma in B-scan ultrasonography: A case-control study. Ophthalmol. Retin. 2024, 8, 264–269. [Google Scholar]
Bomdica, P.R.; MacCumber, M.; Abdel-Hadi, S.; Parker, M.; Minaker, S. Surgical outcomes of rhegmatogenous retinal detachment and fellow eye involvement in adolescent and young adult patients. Ophthalmol. Retin. 2024, 8, 148–154. [Google Scholar] [CrossRef] [PubMed]
Zhang, H.G.; Ying, G.S. Statistical approaches in published ophthalmic clinical science papers: A comparison to statistical practice two decades ago. Brit. J. Ophthalmol. 2018, 102, 1188–1191. [Google Scholar] [CrossRef]
Rosner, B. Statistical methods in ophthalmology: An adjustment for the intraclass correlation between eyes. Biometrics 1982, 38, 105–114. [Google Scholar] [CrossRef]
Dallal, G.E. Paired Bernoulli trials. Biometrics 1988, 44, 253–257. [Google Scholar] [CrossRef]
Donner, A. Statistical methods in opthalmology: An adjusted chi-square approach. Biometrics 1989, 45, 605–611. [Google Scholar] [CrossRef]
Tang, N.S.; Tang, M.L.; Qiu, S.F. Testing the equality of proportions for correlated otolaryngologic data. Comput. Stat. Data Anal. 2008, 52, 3719–3729. [Google Scholar] [CrossRef]
Pei, Y.B.; Tian, G.L.; Tang, M.L. Testing homogeneity of proportion ratios for stratified correlated bilateral data in two arm randomized clinical trials. Stat. Med. 2014, 33, 4370–4386. [Google Scholar] [CrossRef]
Zhuang, T.T.; Tian, G.L.; Ma, C.X. Homogeneity test of ratio of two proportions in stratified bilateral data. Stat. Biopharm. Res. 2019, 11, 200–209. [Google Scholar] [CrossRef]
Tang, N.S.; Qiu, S.F.; Tang, M.L.; Pei, Y.B. Asymptotic confidence interval construction for proportion difference in medical studies with bilateral data. Stat. Methods Med. Res. 2011, 20, 233–259. [Google Scholar] [CrossRef]
Pei, Y.B.; Tang, M.L.; Wong, W.K.; Guo, J.H. Confidence intervals for correlated proportion differences from paired data in a two-arm randomised clinical trial. Stat. Methods Med. Res. 2012, 21, 167–187. [Google Scholar] [CrossRef]
Shen, X.; Ma, C.X.; Yuen, K.C.; Tian, G.L. Common risk difference test and interval estimation of risk difference for stratified bilateral correlated data. Stat. Methods Med. Res. 2018, 28, 2418–2438. [Google Scholar] [CrossRef]
Mandel, E.M.; Bluestone, C.D.; Rockette, H.E.; Blatter, M.M.; Reisinger, K.S.; Wucher, F.P.; Harper, J. Duration of effusion after antibiotic treatment for acute otitis media: Comparison of Cefaclor and Amoxicillin. Pediatr. Infect. Dis. 1982, 1, 310–316. [Google Scholar] [CrossRef]
Liang, S.Y.; Fang, K.T.; Huang, X.W.; Xin, Y.; Ma, C.X. Homogeneity tests and interval estimations of risk differences for stratified bilateral and unilateral correlated data. Stat. Pap. 2024, 65, 3499–3543. [Google Scholar] [CrossRef]
Wang, H.P.; Ma, C.X. Homogeneity test of ratios of two proportions in stratified bilateral and unilateral data. Mathematics 2024, 12, 2719. [Google Scholar] [CrossRef]
Hua, S.C.; Ma, C.X. Testing the homogeneity of odds ratio across strata for combined bilateral and unilateral data. PloS One 2024, 19, e0307276. [Google Scholar] [CrossRef]
Hua, S.C.; Ma, C.X. Common odds ratio test and interval estimation for stratified bilateral and unilateral data. Stat. Methods Med. Res. 2024, 33, 1559–1576. [Google Scholar] [CrossRef]
Sun, S.M.; Li, Z.M.; Mou, K.Y. Interval estimation of common risk difference for stratified unilateral and bilateral data. J. Biopharm. Stat. 2024, 35, 85–105. [Google Scholar] [CrossRef] [PubMed]
Sun, S.M.; Li, Z.M.; Jiang, H.J. Homogeneity test and sample size of risk difference for stratified unilateral and bilateral data. Commun. Stat. Simul. C 2024, 53, 4209–4232. [Google Scholar] [CrossRef]
Sun, S.M.; Li, Z.M. Common test and interval estimation of risk ratio for complex paired data under Dallal’s model. Mathematics 2026, 14, 948. [Google Scholar] [CrossRef]
Sun, S.M.; Li, Z.M.; Ai, M.Y.; Jiang, H.J. Risk difference tests for stratified binary data under Dallal’s model. Stat. Methods Med. Res. 2022, 31, 1135–1156. [Google Scholar] [CrossRef]
Maria, S.; Antonia, Z. Sample size calculation and re-estimation based on the prevalence in a single-arm confirmatory diagnostic accuracy study. Stat. Methods Med. Res. 2020, 29, 2958–2971. [Google Scholar]
Qiu, S.F.; Fu, Q.X. Homogeneity testing for binomial proportions under stratified double-sampling scheme with two fallible classifiers. Stat. Methods Med. Res. 2020, 29, 3547–3568. [Google Scholar] [CrossRef] [PubMed]
Mou, K.Y.; Li, Z.M.; Ma, C.X. Asymptotic sample size for common test of relative risk ratios in stratified bilateral data. Mathematics 2023, 11, 4198. [Google Scholar] [CrossRef]
Sampson, J.N.; Gail, M.H. Confidence intervals for the difference between two relative risks. Stat. Methods Med. Res. 2020, 29, 3048–3058. [Google Scholar] [CrossRef]
Tang, N.S.; Qiu, S.F. Homogeneity test, sample size determination and interval construction of difference of two proportions in stratified bilateral-sample designs. J. Stat. Plan Inference 2012, 142, 1243–1251. [Google Scholar] [CrossRef]

Figure 1. Box-plots of empirical TIEs for the likelihood ratio test (

T_{L}

), Wald-type test (

T_{W}

), and score test (

T_{S}

) when

J = 2, 4, 6

and

m = (25, 25), (50, 50), (100, 100)

.

Figure 1. Box-plots of empirical TIEs for the likelihood ratio test (

T_{L}

), Wald-type test (

T_{W}

), and score test (

T_{S}

) when

J = 2, 4, 6

and

m = (25, 25), (50, 50), (100, 100)

.

Figure 2. The curve graph of empirical powers for

δ

under the parameter settings III, a₂ and b₂. The quantities for each group are selected as M1 = (25, 25), M2 = (50, 50), and M3 = (100, 100).

Figure 2. The curve graph of empirical powers for

δ

under the parameter settings III, a₂ and b₂. The quantities for each group are selected as M1 = (25, 25), M2 = (50, 50), and M3 = (100, 100).

Figure 3. The estimated sample size with each stratum for the desired power 80%.

Figure 4. The estimated sample size with three parameters

δ

,

π

and

γ

for the desired power 80%.

Figure 4. The estimated sample size with three parameters

δ

,

π

and

γ

for the desired power 80%.

Table 1. Data structure of the jth stratum.

Number of Responses	Group (i)		Total
Number of Responses	1	2	Total
0	$m_{01 j}^{(1)}$ ( $p_{01 j}^{(1)}$ )	$m_{02 j}^{(1)}$ ( $p_{02 j}^{(1)}$ )	$m_{0 + j}^{(1)}$
1	$m_{11 j}^{(1)}$ ( $p_{11 j}^{(1)}$ )	$m_{12 j}^{(1)}$ ( $p_{12 j}^{(1)}$ )	$m_{1 + j}^{(1)}$
Total	$m_{1 j}^{(1)}$	$m_{2 j}^{(1)}$	$N_{j}$
0	$m_{01 j}^{(2)}$ ( $p_{01 j}^{(2)}$ )	$m_{02 j}^{(2)}$ ( $p_{02 j}^{(2)}$ )	$m_{0 + j}^{(2)}$
1	$m_{11 j}^{(2)}$ ( $p_{11 j}^{(2)}$ )	$m_{12 j}^{(2)}$ ( $p_{12 j}^{(2)}$ )	$m_{1 + j}^{(2)}$
2	$m_{21 j}^{(2)}$ ( $p_{21 j}^{(2)}$ )	$m_{22 j}^{(2)}$ ( $p_{22 j}^{(2)}$ )	$m_{2 + j}^{(2)}$
Total	$m_{1 j}^{(2)}$	$m_{2 j}^{(2)}$	$N_{j}^{'}$

Table 2. The parameter configuration.

Parameter	Cases	Number of Strata
Parameter	Cases	$J = 2$	$J = 4$	$J = 6$
$π_{1}$	I	(0.3,0.5)	(0.3,0.5,0.3,0.5)	(0.3,0.5,0.3,0.5,0.3,0.5)
	II	(0.4,0.5)	(0.4,0.5,0.4,0.5)	(0.4,0.5,0.4,0.5,0.4,0.5)
	III	(0.4,0.4)	(0.4,0.4,0.4,0.4)	(0.4,0.4,0.4,0.4,0.4,0.4)
$γ_{1}$	a₁	(0.5,0.5)	(0.5,0.5,0.5,0.5)	(0.5,0.5,0.5,0.5,0.5,0.5)
	a₂	(0.5,0.6)	(0.5,0.6,0.5,0.6)	(0.5,0.6,0.5,0.6,0.5,0.6)
$γ_{2}$	b₁	(0.5,0.6)	(0.5,0.6,0.5,0.6)	(0.5,0.6,0.5,0.6,0.5,0.6)
	b₂	(0.6,0.7)	(0.6,0.7,0.6,0.7)	(0.6,0.7,0.6,0.7,0.6,0.7)

Table 3. The empirical TIEs (%) of

T_{L}

,

T_{W}

,

T_{S}

for

J = 4

.

Table 3. The empirical TIEs (%) of

T_{L}

,

T_{W}

,

T_{S}

for

J = 4

.

$δ$	$π_{1}$	$γ_{1}$	$γ_{2}$	$m = (25, 25)$			$m = (50, 50)$			$m = (100, 100)$
$δ$	$π_{1}$	$γ_{1}$	$γ_{2}$	$T_{L}$	$T_{W}$	$T_{S}$	$T_{L}$	$T_{W}$	$T_{S}$	$T_{L}$	$T_{W}$	$T_{S}$
0.7	I	a₁	b₁	5.22	4.72	4.88	5.47	5.21	5.30	5.06	5.11	4.94
		a₁	b₂	5.51	5.30	5.18	5.16	5.13	4.91	4.96	4.78	4.88
		a₂	b₁	5.44	4.70	5.05	5.19	4.87	5.10	4.88	4.70	4.80
		a₂	b₂	5.35	4.88	5.00	5.47	5.35	5.32	5.26	4.98	5.16
	II	a₁	b₁	5.33	4.47	4.86	4.82	4.55	4.75	5.65	5.33	5.55
		a₁	b₂	5.52	4.82	5.33	5.06	4.86	4.91	5.50	5.23	5.40
		a₂	b₁	5.88	4.71	5.62	5.05	4.58	4.91	4.77	4.57	4.73
		a₂	b₂	5.36	4.59	5.01	5.04	4.43	4.91	5.16	4.75	5.08
0.9	I	a₁	b₁	5.73	5.01	5.37	5.33	5.13	5.20	5.30	4.80	5.18
		a₁	b₂	5.32	4.47	4.99	5.04	4.69	4.94	5.28	5.03	5.23
		a₂	b₁	5.58	4.68	5.29	5.50	4.87	5.31	4.88	4.50	4.81
		a₂	b₂	5.52	4.71	5.21	5.39	5.09	5.23	5.06	4.71	4.94
	II	a₁	b₁	5.77	4.61	5.55	5.32	4.74	5.15	5.33	5.08	5.28
		a₁	b₂	5.43	4.65	5.13	4.94	4.19	4.79	5.04	4.72	4.94
		a₂	b₁	5.51	4.14	5.12	5.11	4.26	5.00	5.41	4.99	5.30
		a₂	b₂	5.23	4.17	5.02	5.29	4.57	5.14	5.35	5.08	5.32
1.2	I	a₁	b₁	4.92	4.06	4.46	4.95	4.23	4.78	4.74	4.41	4.70
		a₁	b₂	5.45	4.26	5.18	5.33	4.63	5.21	4.83	4.55	4.74
		a₂	b₁	4.96	4.31	4.67	5.82	4.88	5.65	4.99	4.66	4.87
		a₂	b₂	5.59	4.07	5.31	4.83	3.99	4.61	4.95	4.53	4.86
	II	a₁	b₁	5.14	3.91	4.77	5.21	4.18	5.00	5.07	4.44	4.94
		a₁	b₂	5.62	3.95	5.32	5.22	4.57	5.00	4.83	4.55	4.74
		a₂	b₁	5.49	3.80	5.26	4.77	3.98	4.65	5.25	4.63	5.21
		a₂	b₂	5.66	3.93	5.20	5.06	4.24	4.96	5.13	4.63	5.08

Table 4. The empirical powers (%) for

J = 4

.

Table 4. The empirical powers (%) for

J = 4

.

$δ$	$π_{1}$	$γ_{1}$	$γ_{2}$	$m = (25, 25)$			$m = (50, 50)$			$m = (100, 100)$
$δ$	$π_{1}$	$γ_{1}$	$γ_{2}$	$T_{L}$	$T_{W}$	$T_{S}$	$T_{L}$	$T_{W}$	$T_{S}$	$T_{L}$	$T_{W}$	$T_{S}$
$δ_{1}$	I	a₁	b₁	21.44	9.92	20.85	39.45	26.50	39.20	70.98	62.60	70.94
		a₁	b₂	21.05	9.27	20.52	37.58	26.35	37.54	69.10	61.13	69.14
		a₂	b₁	21.03	9.87	20.53	39.25	27.56	39.12	69.19	62.03	69.19
		a₂	b₂	20.83	9.74	20.61	37.91	26.39	37.86	68.07	60.57	68.10
	II	a₁	b₁	27.81	20.73	26.73	50.31	45.17	49.78	83.48	81.79	83.42
		a₁	b₂	27.65	20.77	27.05	48.67	43.64	48.36	81.04	79.03	80.92
		a₂	b₁	27.55	20.41	26.77	50.67	45.97	50.19	82.46	80.82	82.33
		a₂	b₂	26.60	20.41	26.01	48.65	44.10	48.18	80.13	78.18	79.93
$δ_{2}$	I	a₁	b₁	6.77	3.52	6.35	8.43	4.91	8.22	10.70	7.23	10.55
		a₁	b₂	6.69	3.71	6.43	7.56	4.24	7.42	10.47	7.16	10.48
		a₂	b₁	6.70	3.38	6.42	7.90	5.04	7.82	10.59	7.50	10.49
		a₂	b₂	6.33	3.65	6.03	7.82	4.53	7.73	10.80	7.59	10.76
	II	a₁	b₁	7.51	4.57	7.14	8.86	6.39	8.64	13.00	10.58	12.86
		a₁	b₂	7.37	4.84	6.99	8.41	6.37	8.29	12.54	10.73	12.44
		a₂	b₁	7.57	4.79	7.23	8.26	6.57	8.14	12.67	10.73	12.49
		a₂	b₂	7.09	4.37	6.74	8.51	6.51	8.36	12.52	10.59	12.33
$δ_{3}$	I	a₁	b₁	9.55	13.33	9.33	14.62	19.70	14.49	25.85	31.35	25.84
		a₁	b₂	10.09	13.24	9.61	14.58	19.12	14.38	25.20	30.66	25.18
		a₂	b₁	9.02	12.34	8.65	14.68	18.71	14.54	25.79	30.57	25.74
		a₂	b₂	9.58	13.03	9.17	14.08	17.97	13.88	24.48	29.35	24.38
	II	a₁	b₁	12.10	13.20	11.66	19.67	21.78	19.48	36.06	39.34	36.00
		a₁	b₂	11.37	12.57	10.90	19.39	21.16	19.18	34.66	37.65	34.46
		a₂	b₁	11.94	12.31	11.58	18.74	20.45	18.41	35.38	37.57	35.23
		a₂	b₂	11.54	12.19	11.10	18.68	20.08	18.39	33.54	36.14	33.44

Note:

δ_{1}

= (1,0.7);

δ_{2}

= (1,0.9); and

δ_{3}

= (1,1.2).

T_{L}

,

T_{W}

, and

T_{S}

respectively represent three tests, corresponding to their empirical powers.

Table 5. Sample size (estimated powers %) for the homogeneity test with 80% power.

$δ$	$π_{1}$	$γ_{1}$	$γ_{2}$	$J = 2$			$J = 4$			$J = 6$
$δ$	$π_{1}$	$γ_{1}$	$γ_{2}$	$T_{L}$	$T_{W}$	$T_{S}$	$T_{L}$	$T_{W}$	$T_{S}$	$T_{L}$	$T_{W}$	$T_{S}$
$δ_{1}$	I	a₁	b₁	1424(79.78)	1536(79.71)	1416(79.97)	1984(79.66)	2192(80.00)	1984(79.48)	2328(79.82)	2664(79.75)	2352(80.00)
		a₁	b₂	1488(79.81)	1592(79.53)	1472(79.67)	2080(79.97)	2288(79.76)	2064(79.82)	2424(79.90)	2760(79.84)	2424(80.03)
		a₂	b₁	1448(79.78)	1544(80.10)	1440(79.76)	2000(79.74)	2224(79.93)	2000(79.46)	2352(79.22)	2688(79.99)	2400(80.17)
		a₂	b2	1512(79.89)	1608(79.45)	1512(79.87)	2112(80.08)	2304(79.83)	2096(80.06)	2448(79.98)	2808(79.99)	2448(80.18)
	II	a₁	b₁	1064(80.22)	1096(79.80)	1072(79.88)	1472(79.60)	1552(80.14)	1488(80.25)	1728(79.62)	1872(80.00)	1728(79.41)
		a₁	b₂	1128(79.95)	1144(79.82)	1112(79.64)	1552(80.16)	1632(79.99)	1520(79.48)	1824(80.43)	1944(79.99)	1824(79.95)
		a₂	b₁	1088(80.01)	1112(80.08)	1072(79.73)	1520(79.71)	1552(79.64)	1536(80.00)	1776(79.77)	1872(80.27)	1800(79.85)
		a₂	b₂	1144(80.19)	1168(80.13)	1136(79.96)	1568(80.09)	1648(80.37)	1584(79.57)	1848(79.95)	1944(80.00)	1848(79.40)
$δ_{2}$	I	a₁	b₁	14,920(80.04)	15,408(79.79)	14,912(79.88)	20,560(79.96)	21,424(80.03)	20,816(79.64)	24,336(79.95)	25,440(79.81)	24,432(79.61)
		a₁	b₂	15,424(79.76)	15,712(79.92)	15,512(79.96)	21,440(80.29)	22,304(79.53)	21,344(79.40)	25,008(79.77)	26,232(79.93)	24,936(79.26)
		a₂	b₁	15,024(79.95)	15,608(79.99)	15,136(79.93)	21,232(80.52)	21,808(79.87)	21,024(79.79)	24,816(79.92)	25,728(79.75)	24,624(79.99)
		a₂	b₂	15,920(79.85)	15,912(79.68)	15,720(80.24)	21,840(79.56)	22,800(79.54)	21,824(79.98)	25,824(79.90)	26,640(79.81)	25,728(80.05)
	II	a₁	b₁	10,720(80.18)	10,920(79.70)	10,848(79.76)	14,880(79.69)	15,408(79.73)	14,944(80.50)	17,808(79.77)	18,288(79.68)	17,712(79.92)
		a₁	b₂	11,224(80.11)	11,504(80.13)	11,312(79.68)	15,712(79.99)	16,016(79.96)	15,632(80.20)	18,552(79.84)	18,816(80.31)	18,528(79.70)
		a₂	b₁	10,944(79.78)	11,216(80.18)	11,120(79.87)	15,328(80.28)	15,632(80.29)	15,408(79.80)	18,216(79.67)	18,408(79.81)	18,120(80.11)
		a₂	b₂	11,528(79.81)	11,648(79.67)	11,520(79.92)	16,128(79.96)	16,208(79.92)	16,112(79.77)	18,840(79.84)	19,320(79.97)	18,936(79.91)
$δ_{3}$	I	a₁	b₁	4512(79.81)	4264(79.74)	4480(79.75)	6256(79.90)	5808(79.76)	6224(79.97)	7272(80.11)	6720(80.20)	7344(79.85)
		a₁	b₂	4624(79.90)	4448(79.74)	4712(79.49)	6496(79.99)	6032(80.61)	6432(80.11)	7632(80.10)	7008(79.91)	7632(80.35)
		a₂	b₁	4608(79.66)	4384(79.78)	4608(79.97)	6320(79.73)	5936(79.99)	6320(79.94)	7440(79.88)	6936(80.03)	7392(79.92)
		a₂	b₂	4712(79.93)	4528(80.23)	4808(80.09)	6624(79.70)	6144(80.36)	6528(79.82)	7680(79.43)	7128(79.81)	7656(79.94)
	II	a₁	b₁	3112(80.08)	3032(79.86)	3136(79.80)	4320(79.45)	4208(80.03)	4400(80.41)	5136(80.00)	4848(79.78)	5208(79.80)
		a₁	b₂	3264(79.79)	3224(79.80)	3280(79.95)	4544(80.14)	4352(79.25)	4560(80.05)	5328(79.95)	5088(80.29)	5424(79.84)
		a₂	b₁	3216(79.88)	3072(79.17)	3224(79.95)	4448(80.19)	4352(80.38)	4496(79.96)	5232(80.32)	5040(79.90)	5256(80.19)
		a₂	b₂	3408(79.99)	3320(79.95)	3352(79.76)	4656(79.89)	4480(79.73)	4624(79.16)	5520(80.51)	5280(80.14)	5448(79.93)

Note:

δ_{1}

= (1,0.7);

δ_{2}

= (1,0.9);

δ_{3}

= (1,1.2).

Table 6. Sample size (estimated powers %) for the homogeneity test with 90% power.

$δ$	$π_{1}$	$γ_{1}$	$γ_{2}$	$J = 2$			$J = 4$			$J = 6$
$δ$	$π_{1}$	$γ_{1}$	$γ_{2}$	$T_{L}$	$T_{W}$	$T_{S}$	$T_{L}$	$T_{W}$	$T_{S}$	$T_{L}$	$T_{W}$	$T_{S}$
$δ_{1}$	I	a₁	b₁	1912(89.76)	2016(90.08)	1928(89.88)	2608(89.93)	2816(90.14)	2560(89.49)	2976(89.98)	3336(89.92)	3000(89.89)
		a₁	b₂	1992(90.00)	2064(89.62)	1984(89.99)	2672(89.65)	2896(89.74)	2704(89.92)	3120(89.98)	3432(90.19)	3120(89.87)
		a₂	b₁	1928(89.89)	2048(89.89)	1912(89.72)	2624(89.96)	2816(89.73)	2624(90.11)	3048(89.90)	3312(89.84)	3024(90.04)
		a₂	b₂	2016(89.76)	2120(89.87)	2024(89.75)	2720(90.00)	2912(89.91)	2704(89.72)	3192(89.87)	3480(89.95)	3144(90.25)
	II	a₁	b₁	1416(89.71)	1456(89.90)	1432(89.87)	1904(89.63)	1968(89.60)	1920(89.93)	2232(89.88)	2352(90.09)	2256(89.73)
		a₁	b₂	1512(89.91)	1520(89.93)	1480(89.94)	2016(90.07)	2096(89.87)	2032(89.98)	2328(89.92)	2448(89.85)	2352(90.08)
		a₂	b₁	1464(89.81)	1480(89.85)	1464(89.96)	1952(90.05)	1984(89.24)	1968(89.82)	2280(90.15)	2352(89.92)	2280(89.92)
		a₂	b₂	1520(89.74)	1552(90.22)	1520(90.07)	2048(89.95)	2128(89.78)	2064(89.91)	2376(89.95)	2496(90.04)	2400(89.95)
$δ_{2}$	I	a₁	b₁	19,712(89.95)	20,040(89.99)	20,024(90.03)	26,944(89.87)	27,808(90.18)	26,784(89.66)	31,512(89.88)	32,016(89.82)	31,416(89.90)
		a₁	b₂	20,816(89.95)	20,904(89.95)	20,824(89.95)	27,920(89.69)	28,800(89.86)	27,920(89.90)	32,424(90.19)	33,432(89.90)	32,520(89.81)
		a₂	b₁	20,312(89.57)	20,536(89.91)	20,216(89.98)	27,232(89.93)	28,000(89.90)	27,408(89.74)	31,728(89.76)	32,832(89.86)	31,920(89.54)
		a₂	b₂	21,016(89.99)	21,152(89.69)	21,112(89.98)	28,416(89.97)	28,912(90.34)	28,320(89.99)	32,832(89.79)	33,864(89.66)	32,928(89.98)
	II	a₁	b₁	14,336(89.50)	14,608(90.12)	14,512(90.10)	19,888(89.83)	19,808(90.05)	19,712(89.72)	22,536(89.77)	22,896(90.13)	22,752(90.29)
		a₁	b₂	15,120(89.97)	15,136(89.91)	15,208(89.92)	20,624(89.95)	20,832(89.91)	20,320(89.93)	23,904(89.77)	23,952(89.94)	23,928(89.63)
		a₂	b₁	14,752(89.71)	14,944(89.76)	14,840(89.91)	19,920(89.57)	20,320(89.86)	19,920(89.86)	23,424(89.95)	23,520(90.22)	23,232(89.76)
		a₂	b₂	15,416(89.77)	15,616(89.94)	15,432(89.83)	21,088(89.91)	21,120(90.27)	20,912(89.82)	24,240(89.92)	24,720(89.89)	24,312(90.00)
$δ_{3}$	I	a₁	b₁	6016(90.14)	5760(89.57)	6024(89.96)	8080(89.92)	7712(89.92)	8112(89.81)	9432(90.08)	8832(89.84)	9432(89.80)
		a₁	b₂	6312(89.89)	6008(89.72)	6304(89.89)	8512(90.17)	7984(90.31)	8432(89.99)	9720(89.71)	9144(89.50)	9792(90.10)
		a₂	b₁	6120(89.98)	6000(89.96)	6112(90.24)	8320(89.82)	7824(89.93)	8320(90.00)	9648(89.90)	8976(89.83)	9528(90.00)
		a₂	b₂	6360(89.94)	6096(89.58)	6336(89.93)	8624(89.99)	8112(89.52)	8624(90.08)	9864(89.86)	9312(89.85)	9912(89.55)
	II	a₁	b₁	4232(89.84)	4064(89.76)	4136(89.93)	5712(89.85)	5520(89.98)	5568(89.82)	6624(89.98)	6240(89.70)	6552(89.90)
		a₁	b₂	4360(89.86)	4320(90.08)	4336(89.84)	5920(89.91)	5760(90.19)	5920(89.88)	6840(90.17)	6624(89.97)	6936(89.94)
		a₂	b₁	4224(89.48)	4240(89.82)	4232(89.49)	5856(89.96)	5632(90.12)	5808(89.74)	6672(89.70)	6528(89.96)	6648(89.94)
		a₂	b₂	4472(90.33)	4424(90.34)	4512(90.08)	6112(89.83)	5888(89.94)	6016(89.58)	7032(89.90)	6816(89.58)	6984(89.92)

Note:

δ_{1}

= (1,0.7);

δ_{2}

= (1,0.9);

δ_{3}

= (1,1.2).

Table 7. Statistic values and p-values of the homogeneity test.

Test Statistics	Otolaryngology			Myopathy
Test Statistics	$T_{L}$	$T_{W}$	$T_{S}$	$T_{L}$	$T_{W}$	$T_{S}$
Statistic	4.8107	4.2450	4.6831	3.4108	3.1335	2.8116
p-value	0.0902	0.1197	0.0962	0.0648	0.0767	0.0936

Table 8. Sample size (empirical power) required to achieve the desired power.

Desired Power	Otolaryngology			Myopathy
Desired Power	$T_{L}$	$T_{W}$	$T_{S}$	$T_{L}$	$T_{W}$	$T_{S}$
80%	504(0.8034)	432(0.7954)	504(0.7996)	112(0.7853)	112(0.7550)	120(0.7991)
90%	744(0.8961)	528(0.8962)	756(0.9005)	160(0.8995)	128(0.8889)	160(0.8985)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Sun, S.; Li, Z. Homogeneity Test and Sample Size of Relative Risk Ratios for Complex Paired Data Under Dalla’s Model. Axioms 2026, 15, 268. https://doi.org/10.3390/axioms15040268

AMA Style

Sun S, Li Z. Homogeneity Test and Sample Size of Relative Risk Ratios for Complex Paired Data Under Dalla’s Model. Axioms. 2026; 15(4):268. https://doi.org/10.3390/axioms15040268

Chicago/Turabian Style

Sun, Shuman, and Zhiming Li. 2026. "Homogeneity Test and Sample Size of Relative Risk Ratios for Complex Paired Data Under Dalla’s Model" Axioms 15, no. 4: 268. https://doi.org/10.3390/axioms15040268

APA Style

Sun, S., & Li, Z. (2026). Homogeneity Test and Sample Size of Relative Risk Ratios for Complex Paired Data Under Dalla’s Model. Axioms, 15(4), 268. https://doi.org/10.3390/axioms15040268

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Homogeneity Test and Sample Size of Relative Risk Ratios for Complex Paired Data Under Dalla’s Model

Abstract

1. Introduction

2. Dallal’s Model

3. Asymptotic Tests and Sample Determination

3.1. Asymptotic Tests

3.2. Sample Size Determination

4. Monte Carlo Simulation

4.1. The Performance of Test Statistics

4.2. The Performance of Sample Size Determination

5. Two Real Examples

6. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A

Appendix A.1. Information Matrix for Computing MLEs

Appendix A.2. Derivation of Wald-Type Test

Appendix A.3. Derivation of Score Statistic

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI