Directly and Simultaneously Expressing Absolute and Relative Treatment Effects in Medical Data Models and Applications

Haoyang Teng; Zhengjun Zhang

doi:10.3390/e23111517

and

¹

Department of Mathematics and Statistics, Arkansas State University, P.O. Box 70, Jonesboro, AR 72467, USA

²

Department of Statistics, University of Wisconsin-Madison, 1300 University Ave, Madison, WI 53706, USA

^*

Author to whom correspondence should be addressed.

^†

Equal contribution.

Entropy2021, 23(11), 1517;https://doi.org/10.3390/e23111517

This article belongs to the Special Issue Statistical Methods for Medicine and Health Sciences

Version Notes

Order Reprints

Abstract

Logistic regression is widely used in the analysis of medical data with binary outcomes to study treatment effects through (absolute) treatment effect parameters in the models. However, the indicative parameters of relative treatment effects are not introduced in logistic regression models, which can be a severe problem in efficiently modeling treatment effects and lead to the wrong conclusions with regard to treatment effects. This paper introduces a new enhanced logistic regression model that offers a new way of studying treatment effects by measuring the relative changes in the treatment effects and also incorporates the way in which logistic regression models the treatment effects. The new model, called the Absolute and Relative Treatment Effects (AbRelaTEs) model, is viewed as a generalization of logistic regression and an enhanced model with increased flexibility, interpretability, and applicability in real data applications than the logistic regression. The AbRelaTEs model is capable of modeling significant treatment effects via an absolute or relative or both ways. The new model can be easily implemented using statistical software, with the logistic regression model being treated as a special case. As a result, the classical logistic regression models can be replaced by the AbRelaTEs model to gain greater applicability and have a new benchmark model for more efficiently studying treatment effects in clinical trials, economic developments, and many applied areas. Moreover, the estimators of the coefficients are consistent and asymptotically normal under regularity conditions. In both simulation and real data applications, the model provides both significant and more meaningful results.

Keywords:

asymptotics; enhanced logistic regression; estimator consistency; interpretability; precision medicine; predictability

1. Introduction

Studying treatment effects is central in clinical trials and epidemiology. When response variables are dichotomous, numerous applications of the logistic regression model can be found in the literature. Using the logistic regression model in the analysis of the medical data allows the researchers to understand and estimate the effects of the explanatory variables on the response variable, control the confounding factors and study the interaction effects. The purpose of the analysis using the logistic regression is to identify risk factors that are associated with the response variable of interests and the variables (confounder) that influence the effect of exposure on disease and the risk factors. For instance, if the primary goal is to measure the association between physical inactivity and heart disease with age being a confounding factor, the logistic regression is not only useful to model dichotomous variables (e.g., the values of 0 and 1 represent the status of heart disease, respectively), but it can also be used to explain the effects of physical inactivity on heart disease while controlling for the age variable. Odds ratio, which is often used for interpretations in the logistic regression model, is adjusted to account for other covariates (including confounders). Other applications can be found in genetics, clinical trials, or any studies that involve treatment groups. This statistical model has been a benchmark model due to its easy computability, interpretability, predictability, and stability (CIPS).

Logistic regression is also widely used in classifications, e.g., in cancer [1], diabetes [2], and osteoarthritis [3] among numerous literature publications. Due to the desirable CIPS properties, logistic regression is often used as a baseline/benchmark model in popular machine learning to perform classification, including via Support Vector Machine (SVM) [4] and Naive Bayes Classifier [5]. Applications using machine learning models can be found in acute coronary syndromes [6], heart failure [7], pancreatic cancer [8], text mining [9], COVID-19 and seven subtypes [10], etc.

The studies of logistic regression and related models have been drawing much attention in the literature. For instance, as the number of predictor dimensions increases and exceeds the sample size, direct estimation using the logistic regression model may fail because the matrix inversion can be a problem due to the matrix being singular. In addition, issues such as numerical problems lead to poor convergence, overfitting and low predictive power [11,12]. Regularization is often used to handle high-dimensional data. Popular penalty functions include but are not limited to the least absolute shrinkage and selection operator (Lasso) [13], the smoothly clipped absolute deviation (SCAD) [14] and the minimax concave penalty (MCP) [15]. On the other hand, there have been many model developments in the handling of semi-continuous data in recent years where the response data consist of a substantial portion of single value and positive values. In handling such data, a two-part model was proposed, which handles a combination of binary and continuous data. Since the logistic regression model, which preserves many desirable properties, is suitable to model the binary part, it is included as part of the model. The model development and discussions can be found in recent studies [16,17,18].

In this paper, we focused on developing a more general logistic regression model. Our paper’s contributions to the literature can be concluded in three-fold: (1) The Absolute and Relative Treatment Effects (AbRelaTEs or Abrelates) are directly, explicitly, and simultaneously introduced in our proposed enhanced logistic regression. The AbRelaTEs model incorporates how treatment effects are modeled in the classical logistic regression (absolute treatment effect) and offers a different way of modeling the effects (relative treatment effect). In the model, the absolute treatment effect incrementally measures the treatment effects, while the relative treatment effect accounts for the proportion change in the treatment effects. Additionally, the new model unifies the logistic regression model in a new framework. Parameter estimation can be easily performed using software packages, with the classical logistic regression model being treated as a particular case. (2) The interpretations can be made in two ways, via “between-group” and “within-group” treatments. The former considers all covariate information (attributes) of each patient/participant, which can be regarded as an individualized effect. If the individualized effect is better for a patient/participant in the treatment group than the control group, the treatment is suitable for the patient. The treatment can also be recommended for other patients with similar attributes, making it a potential precision medicine. The latter, on the other hand, individually interprets the effects of each predictor. (3) Simulation examples show that the classical logistic regression will fail to model data when a relative treatment effect exists. In addition, the statistical model might fail to capture the absolute treatment effect if the relative treatment effect exists, resulting in researchers being misled into believing that the treatments are not significant. The AbRelaTEs model, which offers another way of modeling the treatment effects, captures the effects via an absolute or relative or even both. These were shown to be possible as explored using four real datasets. The AbRelaTEs model can be viewed as a new benchmark model for randomized controlled trial studies.

This paper is organized as follows. In Section 2.1, we will present the classical logistic regression model. We then introduce the Absolute and Relative Treatment Effects model and discuss the interpretations in Section 2.2. In Section 3, we will discuss the relative effect term’s estimation procedures and other coefficient parameters of the new model. Moreover, asymptotic theories such as the model’s consistency and asymptotic normality are presented in the same section. The new model’s computational procedures will be discussed in Section 4 and simulation examples and discussions will be presented in Section 4. Furthermore, four real data examples will be explored and discussed in detail in Section 5. Finally, the concluding remarks are given in Section 6. Technical arguments and additional simulation results are presented in the Appendices.

2. Logistic Regression and the Enhanced Absolute and Relative Treatment Effects Model

In this section, we will first present the logistic regression model and then introduce the AbRelaTEs model. We will deliver the features and interpretations of the AbRelaTEs model in a more general aspect, and more detailed interpretations will be discussed using the real data examples in the latter sections.

2.1. Logistic Regression

Before introducing our model, we provide an overview of the ordinary logistic regression model commonly used to model data with binary outcomes. Due to its easy application and high interpretability, the model is often used to analyze data in various fields. One common application can be found in randomized controlled trials to investigate whether the treatment effects are significant in explaining the outcomes. If the treatment effects are significant, meaningful interpretations of the treatment effects and other covariates are often made in the forms of an odds ratio and relative risk.

We now describe the classical logistic regression model in a randomized controlled trial setting. Suppose we consider g treatment groups. Throughout this paper, the g-th treatment group is considered a control group. In addition, the term “treatment group” excludes the control group to distinguish and make comparisons with the control group throughout this paper. In the j-th group, we have

n_{j}

patients with total patients being n. Let

Y_{i j}

be the binary response (0 or 1) of i-th patient in treatment group j. Let

μ

be a constant and

τ_{j}

be the treatment effect of j-th treatment level. Let

X_{i j}

be a

p \times 1

covariate vector where p is the number of predictors and

β

is the corresponding

p \times 1

coefficient vector. Denote by

π_{i j}

the probability

P (Y_{i j} = 1 | X_{i j})

for

i = 1, 2, . . ., n_{j}

and

j = 1, 2, . . ., g

. The classical logistic regression model is given by

logit (π_{i j}) = log (\frac{π_{i j}}{1 - π_{i j}}) = μ + τ_{j} + X_{i j}^{'} β,

(1)

where

i = 1, 2, . . ., n_{j}

and

j = 1, 2, . . ., g

. The observations are i.i.d. samples

(Y_{i j}, X_{i j})

for

i = 1, 2, . . ., n_{j}

,

j = 1, 2, . . ., g

and

Y_{i j} | X_{i j} \sim Bernoulli (π_{i j})

. We note that the model (1) has been widely used as a benchmark model in many classification problems and treatment effects analyses in medical data. In many real applications, if the (absolute) treatment effects

τ_{j}

s in model (1) are found to be significant, the treatment groups can then be recommended to be practiced or adopted by the general public. If the treatment effects result insignificant, the classical logistic regression is not capable of measuring the treatment effects and is deemed to be insufficient to model treatment effects for some clinical trials, but is actually effective. If the treatment effects are tested and result not significant, the logistic regression model not only fails to detect any overall treatment effect but it also has a low predictive power. Furthermore, the treatment effect

τ_{j}

in the classical logistic regression does not detect any individualized effect of the treatment groups. If the treatment group is found to be significant, it is highly questionable that the treatment group will be effective for all patients. It is of interest to many researchers whether the treatment groups can be further interpreted as precision medicine for specific groups of people with the same attributes or characteristics, which is also one of the aspects of this paper. In contrast to the approaches in the literature, we generalize the model (1) which preserves many desirable properties both theoretically and practically to serve the purposes mentioned above but with better and easier interpretations. We will present and discuss our model in the following subsection.

2.2. Logistic Regression with Absolute and Relative Treatment Effects

We will first introduce some additional notations and some motivations before presenting our model. In the literature, both absolute errors, e.g.,

| a - b | = | τ |

, and relative errors, e.g.,

| (a - b) / b | = | δ |

or

a = (1 + δ) b

are useful and powerful measurements for studying changes between two variables a and b. In many applied scientific areas, relative changes are also regarded as an increasing rate or decreasing rate; e.g., in economics, we measure the gross domestic product (GDP) change using the rate; in finance and banking, the changes are also termed returns or interests. Without loss of generality, we shall call

τ

and

τ_{j}

absolute errors or absolute changes, and

δ

and

δ_{j}

relative errors or relative changes throughout the paper.

Motivated by the relative measurements, we propose a model that also considers the relative treatment effects of the treatment groups in addition to the absolute treatment effects

τ_{j}

of the treatment groups in model (1). Moreover, we also include important predictors in our model. Let

δ_{j}

be the relative treatment effect of the j-th treatment level. Then, our newly proposed model, the Absolute and Relative Treatment Effects (AbRelaTEs) Model, is given by

logit (π_{i j}) = log (\frac{π_{i j}}{1 - π_{i j}}) = (μ + τ_{j} + X_{i j}^{'} β) (1 + δ_{j}),

(2)

for

i = 1, 2, . . ., n_{j}

and

j = 1, 2, . . ., g

. In our setting, the parameters

μ

,

τ_{j}

and

β

are similarly defined in the logistic regression model setting. Since the parameter

δ_{j}

in our model measures the relative effect of the treatments, the parameter can take any value between −1 and 1.

It is clear that model (2) will be reduced to model (1) when

δ_{j} = 0

for all j, and that there is no relative treatment effect. We note that when model (1) is the true model, model (2) is also true since

δ_{j}

will be estimated to be 0. Furthermore, it is worth noting that the model (2) is the same as the classical logistic regression when no covariates

(X_{i j})

are available. On the other hand, if model (1) is not a correct/appropriate model for the analysis of a randomized controlled trial, model (2) is still applicable. Therefore, the AbRelaTEs model can serve as a new “benchmark" model for better applicability, more flexibility, and increased interpretability, which can be applied to many fields of medical research.

In our model setting, the term

1 + δ_{j}

, which will always be positive, can be viewed as a multiplier effect on the log-odds depending on the sign and magnitude of the estimated relative treatment effects of

δ_{j}

. In other words, if the relative treatment effect is significant for a randomized controlled trial, there will be an additional multiplier effect on the log-odds for patients receiving treatments compared to the control group. The multiplier effect on the log-odds will depend on the estimated coefficients of the constant

μ

, absolute treatment effect

τ_{j}

, and covariates

X_{i j}

. In fact, the multiplier effect is more interpretable by computing the overall magnitude and sign of the term

μ + τ_{j} + X_{i j}^{'} β

. Since the covariates

X_{i j}

are usually the attributes or characteristics of a patient (e.g., weight, height, age, gender, etc.) in randomized controlled trials, the relative treatment effect will have a different impact for different patients if some attributes are continuous or a group of patients sharing similar attributes if the covariates are all discrete or categorical in a particular treatment group. For example, patients in a specific weight range and age group will benefit more from receiving the treatment than patients in other weight range and age groups. As a result, the relative treatment effect is the key to measure individualized treatment effects, and the model (2) can be viewed as a benchmark model dealing with precision treatments which can be seen as an advantage over the classical logistic regression model.

Model (2) can be expressed as

logit (π_{i j}) = log (\frac{π_{i j}}{1 - π_{i j}}) = (μ + τ_{j}^{*} + X_{i j}^{'} β_{j}^{*}),

(3)

for

i = 1, 2, . . ., n_{j}

and

j = 1, 2, . . ., g

, where

τ_{j}^{*} = τ_{j} + μ δ_{j} + τ_{j} δ_{j}

and

β_{j}^{*} = β (1 + δ_{j})

. We note that the AbRelaTEs model is different from the classical logistic regression where the coefficient vector

β_{j}^{*}

in Equation (3) depends on the treatment group and it is not the case for the classical logistic regression though the form resembles the classical logistic regression. In the model setup, the effects of the coefficient depend on the treatment groups of the patients. For patients receiving the treatments, the coefficient is

β (1 + δ_{j})

for

j = 1, 2, . . ., g - 1

while the coefficient is

β

for patients in the control group. The coefficients are different for patients receiving different treatment. From the construction of the model, the AbRelaTEs model is different from the standard logistic regression model including interactions between variables. Furthermore, the AbRelaTEs model can also be expressed in the following form:

logit (π_{i j}) = log (\frac{π_{i j}}{1 - π_{i j}}) = (μ + τ_{j}^{*} + {\tilde{X}}_{i j}^{'} β),

(4)

for

i = 1, 2, . . ., n_{j}

and

j = 1, 2, . . ., g

, where

τ_{j}^{*} = τ_{j} + μ δ_{j} + τ_{j} δ_{j}

and

{\tilde{X}}_{i j}^{'} = X_{i j}^{'} (1 + δ_{j})

.

At a first glance, model (4) looks like a classical logistic regression model. However, upon closer examination of

{\tilde{X}}_{i j}^{'} = X_{i j}^{'} (1 + δ_{j})

, we see that within the j-th treatment, each component covariate has a multiplier of

1 + δ_{j}

, i.e.,

∣ δ_{j} ∣

is the relative error of

{\tilde{X}}_{i j}^{'}

to

X_{i j}^{'}

. Note that in

X_{i j}^{'}

, some components can be products of other component variables, i.e., interactions, which are also kept in

{\tilde{X}}_{i j}^{'}

. As a result, expressing the logistic regression model as the AbRelaTEs model clearly shows that

δ_{j}

is a relative treatment effect coefficient, and it should not be interpreted as an interaction effect between

τ_{j}

and the covariates.

In the classical logistic regression, after computing the odds ratios or relative risks, the treatment effects can be related to covariates. Conventionally, the interpretations can be made based on each treatment group’s effects and predictors using the coefficients’ magnitude and sign. However, the interpretations of the coefficients in our model are not as straightforward. The interpretations can be made in two ways which are “between-group" and “within-group” treatments. For the “within-group” treatment effect, each predictor’s effect on the log-odds is interpreted. In contrast, all covariates for each patient are considered for the “between-group” treatment effect. Whether or not a treatment group is suitable for all people or a particular subgroup of people depends on the interpretations of the “between” group treatment. If treatment is beneficial for an individual or a subgroup of people with similar attributes, the treatment is viewed as precision medicine.

To illustrate the concepts of the absolute and relative treatment effects, we consider the case of two treatment groups, i.e.,

g = 2

with

τ_{g}

and

δ_{g}

being 0. Additionally, we also assume that the response variable is the event of a patient having a particular disease with the same attributes. We first consider the case where there is an absolute treatment effect without a relative treatment effect. The absolute change in the log odds between the treatments or log odds ratio of a patient contracting the disease is given as (

j = 1

when

g = 2

)

\begin{matrix} log (Odds (Treatment)) - log (Odds (Control)) & = (μ + τ_{j} + X_{i j}^{'} β) - (μ + X_{i j}^{'} β) = τ_{j} . \end{matrix}

(5)

Subsequently, we consider the case where there is a relative treatment effect without an absolute treatment effect. The change in the log odds can be measured in the following way:

\begin{matrix} \frac{log (Odds (Treatment)) - log (Odds (Control))}{log (Odds (Control))} & = \frac{(μ + X_{i j}^{'} β) (1 + δ_{j}) - (μ + X_{i j}^{'} β)}{(μ + X_{i j}^{'} β)} = δ_{j} . \end{matrix}

(6)

It is worth noting that

δ_{j}

under the circumstances measures the relative change in the context of log odds. When considering both treatment effects, the interpretations and forms are not as straightforward. The absolute change in the log odds between the treatments or log odds ratio of a patient contracting the disease is given as (

j = 1

when

g = 2

)

\begin{matrix} log (Odds (Treatment)) - log (Odds (Control)) & = (μ + τ_{j} + X_{i j}^{'} β) (1 + δ_{j}) - (μ + X_{i j}^{'} β) \\ = τ_{j} + (μ + τ_{j} + X_{i j}^{'} β) δ_{j} . \end{matrix}

(7)

If there is no relative treatment effect (

δ_{j} = 0

), the log odds ratio computation only depends on the treatment effect

τ_{j}

, which is the case of the classical logistic regression. When there is a relative treatment effect (

δ_{j} \neq 0

), the log odds ratio of contracting the disease also depends on the attributes

X_{i j}^{'}

of the patient. The treatment group will have varying changes depending on

(μ + τ_{j} + X_{i j}^{'} β)

, e.g., a larger decrease in the log odds ratio for some patients

i = 1, 2, . . ., n

of the same attributes and a smaller decrease for a certain group of people are possible. For example, if the effect of a particular treatment is more prominent for obese patients than patients with a normal body weight holding other attributes constant, this is reflected in the smaller log odds ratio for the former patients than the latter. As a result, the AbRelaTEs model is ideal for interpreting the treatment effect in the context of precision medicine for some patients

i = 1, 2, . . ., n

.

Furthermore, the relative treatment effect in the AbRelaTEs model can be better explained in the context of the percentage increase/decrease, (

(a - b) / b

), in the odds of contracting the disease in a particular treatment group as discussed at the beginning of the section. The relative change in the odds between the treatments is:

\begin{matrix} \frac{Odds (Treatment) - Odds (Control)}{Odds (Control)} & = \frac{exp {(μ + τ_{j} + X_{i j}^{'} β) (1 + δ_{j})}}{exp {μ + X_{i j}^{'} β}} - 1 \\ = exp {τ_{j}} exp {(μ + τ_{j} + X_{i j}^{'} β) δ_{j}} - 1 . \end{matrix}

Since the effect of

τ_{j}

is constant while the effect of

δ_{j}

is proportional based on the absolute change in log odds and relative change in odds, we name the effects of

τ_{j}

and

δ_{j}

the absolute and relative effects in this paper.

In addition, we present additional discussions on different settings of the parameter values of

τ_{g}

and

δ_{g}

which can be set differently under our model setting for

g = 2

. The parameters for the control group effects are

τ_{2} = - τ_{1}

and

δ_{2} = - δ_{1}

using the constraints

\sum_{j = 1}^{2} τ_{j} = 0

and

\sum_{j = 1}^{2} δ_{j} = 0

by the convention. Using the example for Equation (5), the absolute change in the log odds between the treatments or log odds ratio of a patient contracting the disease is given as (

j = 1

when

g = 2

):

\begin{matrix} log (Odds (Treatment)) - log (Odds (Control)) & = (μ + τ_{j} + X_{i j}^{'} β) - (μ - τ_{j} + X_{i j}^{'} β) = 2 τ_{j} . \end{matrix}

(8)

The relative change in log odds for the case of a relative treatment effect without an absolute treatment effect is given in the following:

\begin{matrix} \frac{log (Odds (Treatment)) - log (Odds (Control))}{log (Odds (Control))} & = \frac{(μ + X_{i j}^{'} β) (1 + δ_{j}) - (μ + X_{i j}^{'} β) (1 - δ_{j})}{(μ + X_{i j}^{'} β) (1 - δ_{j})} \\ = \frac{2 δ_{j}}{1 - δ_{j}} . \end{matrix}

(9)

We first note that

τ_{j}

under this constraint can still be viewed as an absolute treatment effect. However, it can be seen from Equation (9) that the relative change in log odds is not equal to

δ_{j}

when

g = 2

, as shown in Equation (6) without using the constraints. Even though this constrained setting does not affect the interpretability aspects of our model as we interpret the additional treatment effects (i.e., relative treatment effect) of our model for every individual discussed above, the impact of

δ_{j}

is not exactly relative when using the constraints. On a further note, the constraints can be applied to

g > 2

for the absolute treatment effects, but are not applicable for the relative treatment effects under our setting. For instance, we consider the case of

g = 3

. If

δ_{1}

and

δ_{2}

take values of 0.5 and 0.7, respectively, then

δ_{3}

will take a value of −1.2 under the constraint which violates our model assumption on

δ_{j}

(i.e.,

- 1 < δ_{j} < 1

for

j = 1, 2, . . ., g

). For

g = 2

, the constrained setting can be applied in our model setting for both

τ_{j}

and

δ_{j}

. It cannot only be viewed as a special setting in our model framework but also provides more flexibility in modeling the treatment groups. For a more general framework

g > 2

, we require that

δ_{g} = 0

and the additional treatment effect is interpreted as a relative treatment effect on the baseline group which is the control group. Subsequently, the previous discussions for

g = 2

can be extended to

g > 2

, which we will not further discuss in this paper.

2.3. Toy Example

In this subsection, we provide some toy examples to better understand the AbRelaTEs model. Two simulated examples from the AbRelaTEs model are presented in Figure 1 and compared to the logistic regression model. The simulated example in the left panel is simulated with two levels of treatments and two covariates and the simulated example in the right panel is simulated with two levels of treatments—one covariate and an interaction effect between the treatment and covariate using model (2). For simplicity, we denote by treatment group 1 patients receiving a specific treatment and by treatment group 0 the control group and the outcomes are whether the patients recover from a particular disease or not. Using the notations introduced above,

τ_{1}

and

δ_{1}

are set to 0.6 and −0.6, respectively, in Figure 1a and

τ_{1}

and

δ_{1}

are set to 0.6 and −0.4, respectively, in Figure 1b.

τ_{2}

and

δ_{2}

are set to 0 for both simulated examples. The log odds for both examples are computed and plotted against treatment groups for models (1) and (2). In Figure 1a, the log odds of the logistic regression model are similar between the treatment group and the control group. The effectiveness of the treatment group is not obvious for the logistic regression model. The log odds are more spread out using the AbRelaTEs model which is reasonable and can be interpreted in our setting. Higher log odds suggest a high probability of recovering from the disease for patients with similar attributes receiving the treatment. Lower log odds, as observed for the treatment group, show that patients with different attributes (different weight range, age group, etc.) have a lower probability of recovering from the disease. These observations indicate that the treatment group can be recommended for patients sharing similar attributes (similar weight range, age group, etc.) using the AbRelaTEs model since the AbRelaTEs model also considers the attributes of the patients as discussed above. Similarly, in panel (b), even though the log odds are generally higher in the treatment group using the logistic regression model, the log odds computed from the logistic regression model are underestimated/overestimated for some patients. In addition, the treatment group can be recommended for patients sharing similar attributes based on the log odds using the AbRelaTEs model.

Figure 1. Log odds of two simulated examples are plotted for the AbRelaTEs model and logistic regression model. The simulated example in panel (a) considers treatment effects and covariate effects. The simulated example in panel (b) considers treatment effects, covariate effects and an interaction effect between the covariate and treatment.

The purpose of the two simulated examples is to show that the AbRelaTEs model provides enhanced interpretations and more significant results that the logistic regression may fail to capture. In addition, it is clear from the toy examples that the term

δ_{j}

is proposed to detect the relative treatment effects. Our model’s applicability and interpretability will be further discussed and presented with some real data examples in the numerical analysis section. In the subsequent section, we will present the theoretical guarantees of the estimation procedure in our model setting.

3. Estimation and Asymptotic Theory

In this section, we provide some additional discussions on the AbRelaTEs model for estimation purposes. Subsequently, we present the maximum likelihood estimation procedure and discuss the asymptotic properties in our model setup.

Different treatment effect representations can be applied to represent whether a patient is in the treatment or control group. For instance, consider the case of two treatment groups; the treatment group can be represented by 1 if the patient is in the treatment group and by −1 if the patient is in the control group. Alternatively, the treatment group can be represented by 1 if the patient is in the treatment group and otherwise 0. Since treatment effects are measured differently in the AbRelaTEs model, the control group’s constraint can be differently set for the absolute and relative treatment effects. However, using the same representation has an advantage. Here, we simply provide some discussions of the parameter

δ_{j}

and show our model’s versatility by different specifications of the treatment variables. In this paper, we only considered and focused on the parameter

δ_{j}

being the relative treatment effect. Additionally, regardless of the choice of representing the treatment groups, the interpretations are similarly made for each treatment group at the patient level as discussed in Section 2.2.

We denote

τ = {(τ_{1}, τ_{2}, . . ., τ_{g - 1})}^{'}

and

δ = {(δ_{1}, δ_{2}, . . ., δ_{g - 1})}^{'}

as

(g - 1) \times 1

vectors of the absolute and relative treatment coefficients and let

θ = {(μ, τ^{'}, β^{'}, δ^{'})}^{'}

be a

(2 g + p - 1) \times 1

parameter vectors. In this paper, we set the gth group as the control group. The theoretical guarantees can be established using the setting discussed in the previous section for the parameters

τ_{j}

and

δ_{j}

. The log-likelihood function

l (θ)

using the model (2) is given by

l (θ) = \sum_{j = 1}^{g} \sum_{i = 1}^{n_{j}} \{Y_{i j} (μ + τ_{j} + X_{i j}^{'} β) (1 + δ_{j}) - log {1 + exp [(μ + τ_{j} + X_{i j}^{'} β) (1 + δ_{j})]}\} .

(10)

The maximum likelihood estimator

\hat{θ}

is obtained by optimizing the log-likelihood function:

\hat{θ} = arg max_{θ} l (θ) .

(11)

For parameter estimation and theoretical purposes, we expressed the model (2) in a matrix form. We denote by

T_{i j} = {(T_{i, 1}, T_{i, 2} . . ., T_{i, (g - 1)})}^{'}

as a

(g - 1) \times 1

vector containing the treatment group information of i-th patient, e.g., if the i-th patient is in treatment group 1, the vector is shown as

{(1, 0, . . ., 0)}^{'}

and

τ = {(τ_{1}, . . ., τ_{g - 1})}^{'}

is the corresponding coefficient vector. Similarly, we let

R_{i j} = {(R_{i, 1}, R_{i, 2} . . ., R_{i, (g - 1)})}^{'}

be a

(g - 1) \times 1

vector containing the treatment group information for the relative term and

δ = {(δ_{1}, . . ., δ_{g - 1})}^{'}

is the corresponding coefficient vector. We define

T_{i j}^{*} (δ) = {(T_{i, 1} (1 + R_{i j}^{'} δ), T_{i, 2} (1 + R_{i j}^{'} δ) . . ., T_{i, (g - 1)} (1 + R_{i j}^{'} δ))}^{'}

and

X_{i j}^{*} (δ) = {(X_{i, 1} (1 + R_{i j}^{'} δ), X_{i, 2} (1 + R_{i j}^{'} δ), . . ., X_{i, p} (1 + R_{i j}^{'} δ))}^{'}

. Additionally, we let

W_{i j} (δ) = {(1 + R_{i j}^{'} δ, T_{i j}^{*^{'}} (δ), X_{i j}^{*^{'}} (δ))}^{'}

be a

(g + p) \times 1

vector and

β^{*} = {(μ, τ^{'}, β^{'})}^{'}

be the corresponding coefficient vector. Let

θ_{0} = {(β_{0}^{*^{'}}, δ_{0}^{'})}^{'} = {(μ_{0}, τ_{0}^{'}, β_{0}^{'}, δ_{0}^{'})}^{'}

be the true parameter vector and

Θ

be the parameter space of

θ_{0}

. We let

ϕ (u)

be defined by

ϕ (u) = exp (u) / (1 + exp (u))

and

Z_{i j} = {(W_{i j}^{'} (δ_{0}), V_{i j}^{'} {β_{0}}^{*} R_{i, 1}, V_{i j}^{'} {β_{0}}^{*} R_{i, 2}, . . ., V_{i j}^{'} {β_{0}}^{*} R_{i, (g - 1)})}^{'}

be a

(2 g + p - 1) \times 1

vector where

V_{i j} = {(1, T_{i j}^{'}, X_{i j}^{'})}^{'}

. To establish the asymptotic properties of the maximum likelihood estimator, we need the following assumptions.

(A1): Define $C = (- 1, 1)$ . $θ_{0}$ is an interior point of an open set in the parameter space $Θ \subseteq R^{g + p} \times C^{g - 1}$ .
(A2): For all i and $l = 1, 2, . . ., p$ , $E | X_{i l} |^{k} < \infty$ for $k = 1, 2, 3, 4$ .
(A3): $E (W_{i j} (δ_{0}) W_{i j}^{'} (δ_{0}))$ and $E {ϕ (W_{i j}^{'} (δ_{0}) β_{0}^{*}) [1 - ϕ (W_{i j}^{'} (δ_{0}) β_{0}^{*})] Z_{i j} Z_{i j}^{'}}$ are positive definite matrices.

The assumptions (A1)–(A3) are commonly seen in the proofs of consistency and asymptotic normality of the maximum likelihood estimator. We adjusted the assumptions to fit our model setup.

Theorem 1.

(Consistency) Under assumptions (A1)–(A3), as

n_{j} \to \infty

and

n \to \infty

, we have

\hat{θ} \to_{p} θ_{0}

.

Theorem 2.

(Asymptotic Normality) Under assumptions (A1)–(A3), as

n_{j} \to \infty

and

n \to \infty

, we have:

\sqrt{n} (\hat{θ} - θ_{0}) \to_{D} N (0, {[I (θ_{0})]}^{- 1}),

where

I (θ_{0})

is the expected Fisher information at

θ_{0}

and the expression is given in the Appendix A.

Remark 1.

In addition, since the AbRelaTEs model is a generalization of the logistic regression, it preserves other desirable properties: it can be shown that the AbRelaTEs model is identifiable and belongs to a full-rank exponential family with the assumptions.

4. Numerical Analysis

We will present the estimation procedure for the simulation and real data analyses in this section. Firstly, the first partial derivatives of the log-likelihood function in (10) with respect to parameters

μ

,

τ

,

β

and

δ

are given by

\frac{\partial l (θ)}{\partial μ} = \sum_{j = 1}^{g} \sum_{i = 1}^{n_{j}} \{Y_{i j} (1 + R_{i j}^{'} δ) - \frac{exp (W_{i j}^{'} (δ) β^{*})}{1 + exp (W_{i j}^{'} (δ) β^{*})} (1 + R_{i j}^{'} δ)\},

(12)

\frac{\partial l (θ)}{\partial τ_{j}} = \sum_{i = 1}^{n_{j}} \{Y_{i j} T_{i, j} (1 + R_{i j}^{'} δ) - \frac{exp (W_{i j}^{'} (δ) β^{*})}{1 + exp (W_{i j}^{'} (δ) β^{*})} T_{i, j} (1 + R_{i j}^{'} δ)\},

(13)

\frac{\partial l (θ)}{\partial β_{k}} = \sum_{j = 1}^{g} \sum_{i = 1}^{n_{j}} \{Y_{i j} X_{i k} (1 + R_{i j}^{'} δ) - \frac{exp (W_{i j}^{'} (δ) β^{*})}{1 + exp (W_{i j}^{'} (δ) β^{*})} X_{i k} (1 + R_{i j}^{'} δ)\},

(14)

\frac{\partial l (θ)}{\partial δ_{j}} = \sum_{i = 1}^{n_{j}} \{Y_{i j} V_{i j}^{'} β^{*} R_{i, j} - \frac{exp (W_{i j}^{'} (δ) β^{*})}{1 + exp (W_{i j}^{'} (δ) β^{*})} V_{i j}^{'} β^{*} R_{i, j}\},

(15)

for

j = 1, 2, . . ., g - 1

and

k = 1, 2, . . ., p

. Based on the Equations (12) and (15), there are no closed form solutions for the MLE

\hat{θ}

. We applied the Newton–Raphson method to obtain the estimates. At

(t + 1)

-th iteration, the estimates

{\hat{θ}}^{(t + 1)}

are computed using the following equation:

{\hat{θ}}^{(t + 1)} = {\hat{θ}}^{(t)} - H^{- 1} ({\hat{θ}}^{(t)}) s ({\hat{θ}}^{(t)}),

(16)

where

s (θ)

is the score function in Equations (12)–(15) and

H (θ)

is the second derivatives of the log-likelihood function (10). The iterations using Equation (16) are performed until convergence is attained.

In some cases, the optimal values of the parameters

δ

might fall outside the interval (−1, 1) in the optimization procedure. To overcome the issue, we conduct a reparameterization as

δ_{j} = \frac{e^{η_{j}} - 1}{1 + e^{η_{j}}}

where

δ_{j}

is a monotone increasing function of

η_{j}

, and we solve

η_{j}

in the optimization.

Furthermore, the estimation procedure above is highly dependent on the initial values of the parameters. If there are two treatment groups, we propose the following estimation procedure. We first split the parameter space of

δ_{1}

, which ranges from −1 to 1, into equally-spaced smaller grids, and we estimate the coefficient parameters

β^{*}

for each grid value of

δ_{1}

. The coefficient parameters are then estimated using the Newton–Raphson method. At

(t + 1)

-th iteration, the estimates

{\hat{β}}^{* (t + 1)}

are computed using the following equation:

{\hat{β}}^{* (t + 1)} = {\hat{β}}^{* (t)} - H^{- 1} ({\hat{β}}^{* (t)}) s ({\hat{β}}^{* (t)}) .

(17)

The iterations using Equation (17) are performed until convergence is attained. Subsequently, the log-likelihood (10) is evaluated at

\hat{θ} = {({\hat{β}}^{*^{'}}, {\hat{δ}}_{1})}^{'}

. The values of

δ_{1}

and

β^{*}

, which maximize the log-likelihood function, are selected as the estimates for

{\hat{δ}}_{1}

and

{\hat{β}}^{*}

.

The proposed estimation procedure not only removes the need to choose an initial value for

δ_{1}

but also searches through a fair number of

δ_{1}

values and selects the solution which maximizes (10). This approach is similar to a grid-search approach that is widely adopted in the threshold or change-point regression literature. It is useful to search for the solution when there is no closed-form solution for the parameter with acceptable computational costs when performing the grid-search approach for one parameter—however, the computational costs for the grid search procedure increase as the number of treatments increases. Therefore, if the number of treatments is more than 2, we apply the estimation procedure as described in Equation (16).

In the next subsection, we will present some simulation examples to evaluate the AbRelaTEs model’s performance.

Simulation

In this section, some simulation studies are conducted to assess the performance of the AbRelaTEs model. We considered a similar data structure as in our real data examples where there are two treatment groups (treatment and control)—each group having a similar number of patients/participants. We compared the performances of the AbRelaTEs model and logistic regression model in terms of their estimation and classification rates.

To compare the classification rates of the AbRelaTEs model and logistic regression model, we produced 1000 data simulated with

n = 1000

using different parameter values. Subsequently, the sensitivity and specificity for 1000 different simulations were computed for each model and the results are displayed using box plots. The first two covariates

x_{i 1}

and

x_{i 2}

are independently simulated from a normal distribution with a mean of 0 and a variance of 1. The third covariate

x_{i 3}

is simulated from a Bernoulli distribution. We also include the interaction term between the treatment effects and the first covariate

t_{i 1} x_{i 1}

. The coefficient parameters are simulated from a uniform distribution from −2.5 to 2.5 (

β_{j, 0} \sim

Uniform(−2.5, 2.5) for

j = 1, 2, 3, 4

). The absolute and relative treatment effect parameters are simulated using

τ_{1, 0} \sim

Uniform(0, 2) and

δ_{1, 0} \sim

Uniform(−0.7, −0.3) with

τ_{2, 0} = - τ_{1, 0}

and

δ_{2, 0} = - δ_{1, 0}

. In addition, we produced another simulation with

δ_{1, 0} \sim

Uniform (0.3, 0.7) and all other settings remain unchanged.

The simulation procedure is similar to the classical logistic regression model. Firstly, the success probability shown below is computed using the specified settings for the parameter values. For each patient/participant i in the treatment group, the success probability is:

\begin{matrix} π_{i j} = \frac{exp [(μ_{0} + τ_{j, 0} + x_{i j}^{'} β_{0}) (1 + δ_{j, 0})]}{1 + exp [(μ_{0} + τ_{j, 0} + x_{i j}^{'} β_{0}) (1 + δ_{j, 0})]} . \end{matrix}

The binary response variable is generated from Bernoulli experiments with success probability

π_{i j}

. Once the binary responses are generated for each patient/participant, the coefficients are estimated using the estimation procedure we described earlier in this section. The sensitivity and specificity for the 1000 data simulated from different parameter values are then computed for each model.

Subsequently, we present the simulation settings for estimation purposes. The number of variables considered in our model setup is

p = 4

. The covariates

x_{i j}

are independently simulated from a normal distribution with a mean of 0 and a variance of 1 (

x_{i j} \sim N (0, 1)

). The coefficients for the covariates are set to

β_{0} = {(- 0.5, 0.5, - 0.5, 0.5)}^{'}

. We considered both absolute and relative treatment effects where the coefficients of the absolute and relative treatment effects are set to

τ_{1, 0} = - 1, τ_{2, 0} = 0

and

δ_{1, 0} = - 0.5, δ_{2, 0} = 0

. Additionally, we also considered

δ_{1, 0} = - 0.3, 0.3, 0.5

as other parameter settings remain unchanged. The number of observations was set to

n = 300, 500, 700, 1000

. The simulation and estimation procedures were similarly performed as described above. In total, 1000 simulation runs were conducted for each of the settings. The averages of the estimated coefficients, standard deviations, standard errors and coverage probabilities were reported for both models. Similar quantities were computed and reported for the classical logistic regression.

We also tested our model performance by simulating data from the logistic regression model with

τ_{1, 0} = - 1, τ_{2, 0} = 0

and

β_{0} = {(- 0.5, 0.5, - 0.5, 0.5)}^{'}

with all other settings remain unchanged. In addition, we also considered the case when the absolute treatment effect was not significant and the relative treatment effect was significant. We set

τ_{1, 0} = 0

as all other settings remain unchanged.

Furthermore, we presented simulation results to demonstrate the performance of the AbRelaTEs model when interaction effects exist. Two covariates and two interaction terms were considered with coefficients set to

β_{0} = {(- 0.5, 0.5, - 0.5, 0.5)}^{'}

. The interaction terms considered are the interaction effects between the treatment effects and covariates, that are

t_{i 1} x_{i 1}

and

t_{i 1} x_{i 2}

using the notations introduced in Section 3. The interaction terms are included in the covariate matrix in model (2) by the design of the matrix. The absolute and relative treatment effects are similarly set to

τ_{2, 0} = - τ_{1, 0}

and

δ_{2, 0} = - δ_{1, 0}

.

Based on Figure 2 and Figure 3, the box plots show that the sensitivity and specificity are overall higher for the AbRelaTEs model based on the first quartiles, medians and third quartiles with similar variabilities between the AbRelaTEs and logistic regression models, suggesting that the AbRelaTEs model produces results with improved sensitivity and specificity when the relative treatment effects exist in the simulated datasets. The findings are reasonable since both models are based on the logistic regression model for binary classification which is the same type of classifier to achieve the optimal separation between two classes. Moreover, the relative treatment effects in the AbRelaTEs model helps improve the results for some data points in a certain range for continuous variables or of similar values for discrete variables (i.e., individualized effects), resulting in generally better sensitivity and specificity rates for the AbRelaTEs model, which were discussed in previous sections.

Figure 2. Box plots are displayed based on the sensitivity in panel (a) and specificity in panel (b) computed for the AbRelaTEs and logistic models over 1000 datasets simulated using different parameter values with

δ_{1, 0}

simulated from a uniform distribution from −0.7 to −0.3.

Figure 3. Box plots are displayed based on the sensitivity in panel (a) and specificity in panel (b) computed for the AbRelaTEs and logistic models over 1000 datasets simulated using different parameter values with

δ_{1, 0}

simulated from a uniform distribution from 0.3 to 0.7.

The results are shown in Table 1 for the case of

τ_{1, 0} = - 1

,

δ_{1, 0} = - 0.3

whereas the results are given in the supplementary file for the cases of

δ_{1, 0} = - 0.5, 0.3, 0.5

. The optimization is mainly based on the Newton–Raphson algorithm in model (17). The code can be obtained from the authors upon request or downloaded from Github. Based on the results in Table 1, the mean estimate for

δ_{1}

improves and approaches

- 0.3

as n increases from 300 to 1000. It was also observed that the standard deviation and standard error for the relative effect term decreases as the sample size increases. Similarly, the average estimate, standard deviation, and standard error improve for

τ_{1}

as n increases. For other coefficients, the average estimates are already closed to the specified coefficients

β_{0} = {(- 0.5, 0.5, - 0.5, 0.5)}^{'}

when

n = 500

whereas the standard deviations and standard errors improve as the sample size increases. On the other hand, the estimates for the coefficients using the logistic regression model are similar for all sample sizes. One interesting finding is that the coverage probability for the absolute effect term

τ_{1}

decreases from 0.690 to 0.279 for the logistic regression model as the sample size increases. This significant observation suggests that the logistic regression model might fail to capture or explain the absolute treatment effect when the relative treatment effect is significant as the sample size increases. We will further explore this aspect in the real data examples. Similar findings were also observed for the cases

δ_{1, 0} = - 0.5, 0.3, 0.5

.

Table 1. Estimate, standard deviation (SD), standard error (SE), and coverage probability (CP) when

τ_{1, 0} = - 1

and

δ_{1, 0} = - 0.3

with 1000 simulation runs for the AbRelaTEs model and logistic regression.

In addition, Table 2 shows that the AbRelaTEs model performance is comparable to that of the logistic regression model when

δ_{1, 0} = 0

(i.e., no relative treatment effects). It was observed that the average estimate for

δ_{1}

significantly improves as the sample size increases with improved standard deviation and standard error. The coefficient estimates obtained from the AbRelaTEs model were seen to be comparable to the logistic regression model even when

n = 300

. The standard deviations and standard errors improve as the sample size increases. Similar findings are observed for the case of

τ_{1, 0} = 0

(i.e., no absolute treatment effects) and

δ_{1, 0} = - 0.5, - 0.3, 0

, which are shown in the supplementary file. The coverage probability of the treatment effect using the logistic regression decreases as the magnitude of the relative treatment effect increases, which suggests that the logistic regression model might fail to capture any treatment effects if the relative treatment effect is significant. These findings suggest that the AbRelaTEs model can also model datasets when the relative treatment effect is not significant. This will also further be shown and discussed using the MEPARI-2 dataset in the real data analysis part.

Table 2. Estimate, standard deviation (SD), standard error (SE), and coverage probability (CP) when

τ_{1, 0} = - 1

and

δ_{1, 0} = 0

with 1000 simulation runs for the AbRelaTEs model and logistic regression.

The performance of the AbRelaTEs model is desirable when interaction effects exist as shown in Table 3. On the other hand, the estimates of the coefficients for the treatment effects, covariates, and interaction effects are similar for varying sample sizes. The coverage probabilities for the treatment effect in the logistic regression model are also similar which are approximately 94% for different sample sizes and the coefficient estimates for the treatment effect are similar to the coefficient estimates for the absolute treatment effect in the AbRelaTEs model. However, as the sample size increases, the coverage probabilities for the covariates and interaction terms substantially decrease from 66% to approximately 20%. In Table 4, the AbRelaTEs model outperforms the logistic regression model when interaction effects exist with the relative treatment effect being 0.5—as observed in Table 4. The coefficient estimates are similar for the treatment effects, covariates, and interaction effects for different sample sizes using the logistic regression model. The coverage probabilities for the parameters decrease as the sample size increases. The coverage probability decreases from approximately 82% to 40% as the sample size increases from 300 to 1000. These suggest that the logistic regression model is able to capture the absolute treatment effect but the performance is poor in capturing the covariates and interaction effects for a larger sample size when

δ_{1, 0} = - 0.5

and the logistic regression model is poor in capturing the absolute treatment effect when

δ_{1, 0} = 0.5

. For a smaller magnitude of the relative treatment effects, the performance of the logistic regression is reasonable.

Table 3. Estimate, standard deviation (SD), standard error (SE), and coverage probability (CP) when

τ_{1, 0} = - 1

and

δ_{1, 0} = - 0.5

with 1000 simulation runs for the AbRelaTEs model and logistic regression with two covariates and two interaction terms.

Table 4. Estimate, standard deviation (SD), standard error (SE), and coverage probability (CP) when

τ_{1, 0} = - 1

and

δ_{1, 0} = 0.5

with 1000 simulation runs for the AbRelaTEs model and logistic regression with two covariates and two interaction terms.

From these simulation examples, we showed that the AbRelaTEs model outperforms the logistic regression under no interaction/with interaction effect settings. We note that

δ_{j}

should not be interpreted as interaction effects as used in the classical logistic regression models based on our theoretical arguments and numerical results (i.e., it is truly a relative effect indicator). In addition, we also demonstrated that the AbRelaTEs model was able to estimate the parameters simulated by the logistic regression (i.e., no relative treatment effect). In addition, the estimates produced by the logistic regression model will result in incorrect log odds and odds ratio as the model is incapable of capturing the relative treatment effects, as shown in the simulation results. Consequently, decision making and developing an optimal treatment plan based on the log odds and odds ratio will be challenging. These simulation examples suggest that the AbRelaTEs model can be used as a new benchmark model, as mentioned in the previous section. In the subsequent section, we will show that the AbRelaTEs model is able to capture significant treatment effects through either the absolute or relative or both ways.

5. Real Data

We present the statistical analyses of four different datasets using our model and the classical logistic regression model. We aimed to show the flexibility and interpretability aspects of the AbRelaTEs model in handling different clinical trials datasets with detailed analyses. In addition, it is also important to note that the AbRelaTEs model is capable of capturing the treatment effects of a randomized controlled trial through either the relative or the absolute treatment effect terms, which we will show through four real data examples in the following subsections. Table 5 shows the three possible outcomes of whether a treatment effect is significant in the AbRelaTEs model.

Table 5. Possible combinations if a treatment effect is significant using the AbRelaTEs model.

5.1. Sepsis Data

This section will explore a randomized controlled trial on the use of synbiotics as a treatment for sepsis. The occurrence of sepsis is due to systemic inflammation and circulatory compromise by means of infection. Sepsis is a leading cause of death in infants with a 5–60% fatality rate [19]. Currently, there are no efficient ways to prevent sepsis. A dataset was obtained from a randomized controlled trial study conducted on 4556 rural Indian newborns [20]. The infants were randomized into the synbiotic group (2278) and placebo (2278). Among the 4556 infants, 4326 completed the study. Synbiotics are combinations of prebiotics and probiotics (Lactobacillus plantarum plus fructooligosaccharide) in the trial. The primary outcome of interest is the combination of sepsis and death.

The covariates that are significant in our analysis are birth weights (in grams) and sex. The weight variable is transformed using a reciprocal transformation. The estimation results based on the AbRelaTEs model and logistic regression model are shown in Table 6. The results show that the variables are all significant for the logistic regression model except for the variable birth weight. On the other hand, only the absolute treatment effect term is not significant in our model, while other covariates are significant. This illustrates that the relative treatment effect is significant for the data. Table 7 displays our model’s estimation results after removing the absolute treatment effect term. The results show that the relative treatment effect term and the covariates are significant. There is one difference in the coefficient sign of the relative treatment effect term we will address in the interpretation part.

Table 6. Estimation results from the AbRelaTEs model and the classical logistic regression for sepsis data.

Table 7. Estimation results from the AbRelaTEs model after removing the absolute treatment effect.

The log-odds of infants having sepsis or death change by

(- 6.755) * (1 + 0.240) = - 8.376

(synbiotic) and

- 6.755

(control) for every unit increase in weight. The odds of having sepsis or death in infants are

exp (0.225 * (1 + 0.240)) = 1.322

(synbiotic) and

exp (0.225) = 1.252

(control) higher for the male infants than the female infants. We now interpret the results by comparing them between treatment effects. After computing the odds ratios for each weight and gender, the odds ratios are consistently smaller than 1, which shows that the treatment is effective for all weight groups and both genders. The interpretation is also consistent with that of the logistic regression model, even though the treatment effect appears as a relative term in our model. Furthermore, after removing the absolute treatment effect, the positive coefficient sign of the relative effect term without the absolute treatment term implies that there is a multiplier effect on the log-odds uniformly for all infants receiving the treatment. Additionally, the sensitivity and specificity for the AbRelaTEs model are 60.5% and 50.7% while the sensitivity and specificity are 39.4% and 75.6% for the logistic regression model. Both interpretations and results for these data show that the AbRelaTEs model not only gives interpretations that are consistent with the logistic regression model but also shows that the birth weight variable is actually a significant predictor under our framework. On the other hand, the sensitivity of 39.4%, which is smaller than 50%, calculated from the logistic regression, is problematic as it leads to conclude that synbiotics are not effective and that the interpretation can be wrong.

5.2. MEPARI-2 Data

In this subsection, we will explore a randomized controlled trial on meditation or exercise for an acute respiratory infection prevention (MEPARI-2) dataset [21]. It is of interest to investigate whether interventions such as meditation and exercise help reduce acute respiratory infection (ARI) outcomes and whether self-reported psychosocial scores from the participants are associated with ARI outcomes. Out of 413 participants enrolled in the study, there were 389 data points after removing the participants with missing information and incomplete data during the study.

Based on the estimation results in Table 8, the exercise group was found to be significant and the meditation group was removed from the model since it was not significant. The results show that the relative treatment effect term was not significant in the AbRelaTEs model with a high p-value. In addition, the coefficient estimates for the treatment group, age, self-reported psychosocial scores, and interaction terms are closed to the estimates from the logistic regression model. This shows that the AbRelaTEs model produces results that are similar to the logistic regression model when the relative treatment effect term is not significant, and the absolute treatment effect is significant. We also note that the coefficients, standard errors, and p-values are the same after we remove the relative treatment effect term from our model. The sensitivity and specificity for both models are the same which are 56.8% and 59.5%, respectively. Moreover, since the interpretations under this scenario will be similar to the interpretations using the logistic regression model by interpreting each predictor’s effects, we will not discuss it further.

Table 8. Estimation results from the AbRelaTEs model and the classical logistic regression using the MEPARI2 dataset.

5.3. Influenza Data

In this subsection, we investigated a flu vaccination dataset [22]. Vaccination is essential in preventing the infection and transmission of influenza viruses. To investigate the effect of vaccinating children in the household environment, 796 households were enrolled in this study and randomized into the vaccination group (479 households) or control group (317 households) with at least one child. Since there are adults who are not vaccinated assigned to the treatment group and adults who are vaccinated in the control group, we focused on the effect of vaccination on children. The response variable of interest is whether the individual is infected or not.

The covariates that we found to be significant and include in our analysis are round (1,2,3) and the HAI titer level (0,1,2). The estimation results for our model and the logistic regression model are given in Table 9. The p-values are not significant for the treatment effect (in logistic regression) and round (in AbRelaTEs model). The results based on the AbRelaTEs model show the relative, absolute treatment effects and HAI titer level are significant. Since there are three rounds of sera collections in the study, we retained the variable as it indicates the period of time the data are collected though it is not significant.

Table 9. Estimation results from the AbRelaTEs model and the classical logistic regression for the flu vaccination data.

For every increase in HAI titer level, the children’s log-odds have an influenza change by

- 0.59 * (1 - 0.242) = - 0.447

for the vaccinated group and decreases by 0.59 for the control group. After computing the overall effects, it was found that the vaccinated treatment was beneficial for all HAI titer levels across different rounds. In addition, the sensitivity and specificity for the AbRelaTEs model are 62.5% and 64.6% while the sensitivity and specificity for the logistic regression model is 33.9% and 74.4%. Therefore, vaccination is highly recommended for all children based on the results. Again, a sensitivity of 33.9% calculated from the logistic regression may be meaningless.

5.4. COVID-19 Data

Our following statistical analysis was to explore a randomized controlled trial on the use of the hydroxychloroquine drug on the novel coronavirus disease (COVID-19). There have been many studies on the novel coronavirus disease 2019 (COVID-19) since its outbreak. To date, there are still many ongoing types of research with continued efforts to find effective antiviral treatments for patients with COVID-19. The dataset considered for our analysis was obtained from one of the studies on hydroxychloroquine [23]. The purpose of the study was to investigate whether hydroxychloroquine can prevent symptomatic infection after SARS-CoV-2 exposure. A total of 821 patients with occupational or household exposure to people with confirmed COVID-19 infection were enrolled in the study. The patients were randomized into hydroxychloroquine and placebo within four days of exposure. The primary outcome of the study was the incidence of laboratory-confirmed COVID-19 infections. The predictors considered for the analysis are treatments (hydroxychloroquine and placebo), age, and weight. Additionally, other independent variables include data on patients having symptoms (cough, shortness of breath, difficulty breathing, fever, chills, rigors, myalgia, headache, sore throat, new olfactory, taste disorders, and diarrhea). After removing patients with missing information, there were 746 patients for the statistical analysis. The number of patients for each variable in each treatment is presented in Table 10.

Table 10. Number of patients with symptoms/outcomes in different treatments.

The estimation results using the classical logistic regression model and the AbRelaTEs model are presented in Table 11. In addition, the weight variable is transformed using a reciprocal transformation (weight

^{*}

= 1/(weight/500)) where weight

^{*}

is the transformed variable. The scaling factor is used here so that the magnitude of the estimated coefficient is not large. BMI is not available as the height data are not available. The results show that the absolute treatment effect is not significant using the classical logistic regression model and all predictors except age and number of symptoms are also not significant. These logistic regression-based results suggest that the hydroxychloroquine treatment is not significant in predicting the probability that a patient who has COVID-19 infection. They are consistent with other earlier and recent studies on the hydroxychloroquine drug [24,25,26] which show that the hydroxychloroquine treatment has no clinical benefits or does not prevent illness compatible with COVID-19 [23]. In contrast to our analysis, the aforementioned studies analyzed the data using statistical methods such as survival models, hazard/risk ratios and Fisher’s exact test which is not directly comparable in our case. However, compared with the fitted AbRelaTEs model, the resulting p-values associated with logistic regression in Table 11 are doubtful; they lack interpretability, which raises questions concerning whether the logistic regression model is correctly specified and has sufficient detecting power to detect the predictors’ effectiveness.

Table 11. Estimation results from the AbRelaTEs model and the classical logistic regression for COVID-19 data.

We will now interpret the results of our model shown in Table 11. The interpretations of the treatment effects can be made in two ways—between and within treatment groups. For within treatment groups, the effect of each covariate is illustrated and discussed using the odds. For a unit increase in age, the log-odds of people having COVID-19 change by

- 0.099 * (1 - 0.513) = - 0.048

(hydroxychloroquine) and

- 0.099 * (1 + 0.513) = - 0.150

(placebo). The log-odds changes are

- 0.947 * (1 - 0.513) = - 0.461

(hydroxychloroquine) and

- 0.947 * (1 + 0.513) = - 1.433

for a unit increase in weight

^{*}

, respectively. Again, the significance of the age and weight is due to the effectiveness of hydroxychloroquine relative to placebo. Furthermore, the log-odds of people contracting COVID-19 increased by

(0.833 + 0.463) * (1 - 0.513) = 0.631

(hydroxychloroquine) and

(0.833 - 0.463) * (1 + 0.513) = 0.560

(placebo) for every additional number of symptoms. The odds presented for each treatment are not directly comparable between two treatments. The effects of between treatment effects are discussed later.

With regard to the between treatment group interpretations, all covariates are considered when making comparisons between the treatment groups. The interpretations are made using the overall effects for a group of patients in certain age and weight groups with or without symptoms (individualized effects). We first made interpretations for patients who did not show any symptoms. The odds ratios for patients were compared between the hydroxychloroquine and placebo groups to identify patients of the age group and weight range which would benefit from the treatment. For instance, the odds ratio for patients with no symptoms can be computed as follows:

\frac{Odds (hydroxychloroquine)}{Odds (placebo)} = \frac{exp {(- 3.279 - 0.099 age - 0.947 {weight}^{*}) (1 - 0.513)}}{exp {(3.279 - 0.099 age - 0.947 {weight}^{*}) (1 + 0.513)}},

(18)

where Odds(hydroxychloroquine) and Odds(placebo) are the odds of having COVID-19 for patients receiving the respective treatments. We note that the results are not evident and certain as to which treatment group consistently outperforms the other for all age and weight groups. It is also worth noting that the hydroxychloroquine treatment is only beneficial for certain age and weight groups which are our goal to identify here. The hydroxychloroquine treatment is more effective if the odds ratio is less than 1 and is less effective if the odds ratio is greater than 1. The odds ratios are shown in Table 12 for selective age, weight variable, and symptoms since the odds ratios of other age, weight, and symptom groups can be similarly computed. For instance, for patients who do not have any symptoms with the age of 30 and weight (pounds) between 139 and 385, the odds ratio is between 0.106 and 0.991. The odds ratio is between 0.114 and 0.921 for patients who have one symptom with the same age and weight between 145 and 385. We will first interpret the results for patients who do not show any symptoms. The odds of having COVID-19 are lower for patients receiving hydroxychloroquine treatment with ages ranging from 18 to 25 and weight above 122 pounds. The hydroxychloroquine treatment has a lower odds than the placebo in contracting the disease for patients who weigh more than 139 pounds and in the age group of 25–30 with no symptoms. Furthermore, patients who are in the age range of 30–40 and weigh at least 198 pounds have a lower odds ratio. Finally, for patients aged between 40 and 50 that weigh more than 335 pounds, the odds of contracting COVID-19 are lower for the hydroxychloroquine treatment group.

Table 12. Computed odds ratio between the hydroxychloroquine treatment and placebo based on different covariate information. The weight variable is shown based on the original scale.

Subsequently, we will interpret the results for patients who show one symptom. Patients who are in the age group of 18–25 and weigh more than 105 pounds have lower odds of contracting COVID-19 in the hydroxychloroquine treatment. The odds of contracting the disease are lower for patients in the age range between 25 and 30 and those with weights above 145 pounds receiving hydroxychloroquine treatment. The odds are lower for the hydroxychloroquine treatment group within the age groups of 30–40 and 40–50, who are at least 202 pounds and 348 pounds in the respective age group. Similar interpretations can be made for patients who show up to ten symptoms (2, 3, ⋯, 10). It is also important to note that a more accurate weight range can be obtained for a given age so that the effects of the hydroxychloroquine treatment can be further explored. We consider a reasonable age range for easier interpretations as a group and identify the corresponding weight range where the hydroxychloroquine treatment is deemed beneficial.

Figure 4 illustrates the estimated probabilities of having COVID-19 computed using the estimated coefficients from the AbRelaTEs model against the covariates in the model (treatments, age, weight and number of symptoms) for each patient in the dataset. The comparisons and discussions made above based on the odds are similarly observed in Figure 4. The interpretations based on the odds of contracting the disease are similar to the estimated probabilities that a patient is infected. However, the figure provides additional insights. It is observed that there are two separate groups of patients undergoing hydroxychloroquine treatment based on the estimated probabilities. The separation is more apparent when looking at the plot for treatments, age, and weight. Further investigation shows that the group of patients with higher estimated probabilities experience all ten symptoms while another group of patients with lower estimated probabilities of contracting COVID-19 show fewer symptoms. These suggest the fact that the hydroxychloroquine treatment helps lower the probability of having COVID-19 with fewer symptoms. Furthermore, the sensitivity and specificity for the AbRelaTEs model are 78.9% and 86.2% while the sensitivity and specificity are 73.7% and 90.1% for the logistic regression model. Based on the significant results and interpretations, since the treatment is beneficial for a certain group of people but not for every patient, they should consult a medical doctor before taking the drug.

Figure 4. Estimated probabilities for each patient for hydroxychloroquine treatment and placebo.

5.5. Discussion

The AbRelaTEs model not only produces significant treatment effects with better interpretability through the real data examples but the model can also be applied to other medical data in epidemiology. When using other medical data in epidemiology such as in the case-control or cohort studies, it is often of interest to model the exposure and the response by including other risk factors. The exposure in such studies can be captured by either the absolute or relative “exposure” effect terms in the AbRelaTEs model. If the absolute exposure effects are significant and relative exposure effects are not significant, the interpretations are similar to the logistic regression. On the other hand, if both terms are significant, the interpretations can be made based on “between” and “within” exposure effects together with the risk factors. Compared to other multivariable methods such as the logistic regression, the main advantage of the AbRelaTEs model is that it allows researchers to interpret results based on each exposure specific to each risk factor so that a subgroup of individuals with exposure and a specific risk factor can be identified as having lower/higher risk in relation to the response of interests.

Similar to the logistic regression, the odds ratio can be reported for the AbRelaTEs model. In addition, a more detailed odds ratio can be computed and tabulated as in Table 12 to report which subgroups of individuals/patients could benefit the most from or be least affected by the exposure/treatments.

With the four real data examples we presented, we summarize the essential findings of the treatment effects that we discussed in the previous subsections in Table 13 and include more details, e.g., covariates, response, treatment effects, to provide an overview of the results of the four real datasets for the AbRelaTEs and logistic regression models in Table 14. This shows that significant treatment effects are better explained in terms of absolute or relative or both ways with increased flexibility in the AbRelaTEs model. In addition, we also showed that the treatment effects can also be interpreted using individualized information for each patient/participant. In contrast, the widely used multivariable methods were not able to detect these features.

Table 13. Summary outcomes of the treatment effects using COVID-19, influenza, sepsis, and MEPARI-2 datasets.

Table 14. Summary outcomes of the treatment effects and covariates using COVID-19, influenza, sepsis, and MEPARI-2 datasets for the AbRelaTEs model and logistic regression model.

The synbiotic treatment was found to be beneficial for all infants with sepsis using the AbRelaTEs model. The birth weights and gender of infants were found to be significant variables in predicting sepsis. It was found that infants receiving the synbiotic treatment have lower odds of having sepsis as compared to the control group as weight increases. Furthermore, the odds were higher for male infants as compared to female infants for the synbiotic and control groups.

Acute respiratory infection can be improved by engaging in more physical activities (exercise group). It was found that the odds of having ARI decrease as age increases and the MASS score increases. On the other hand, the odds of having ARI increase as the SF12 score increases.

Additionally, the flu vaccination is recommended for children based on the AbRelaTEs model. A higher HAI titer level was also found to lower the odds of contracting a flu.

For the COVID-19 dataset, the hydroxychloroquine treatment, symptoms, age, and weight were found to be significant using the AbRelaTEs model. The odds of contracting COVID-19 decrease as the age and weight

^{*}

increase. Furthermore, a higher number of symptoms is related to increased odds of having COVID-19. The hydroxychloroquine treatment for COVID-19 was found to be beneficial for specific groups of patients with certain symptoms, age, and weight, resulting in the treatment being suitable as a precision medicine (see Table 12). Therefore, people should consult a medical doctor before taking the drug.

6. Conclusions

In this paper, a more general logistic regression was proposed to model randomized controlled trials, which allows us to compare different treatment effects absolutely and relatively due to the AbRelaTEs model’s flexibilities. Our model maintains the CIPS properties as mentioned in the introduction and is highly flexible in modeling randomized controlled trials’ data with absolute or relative or both effects. To identify the treatment effects, we observed the absolute and relative treatment effects. The absolute treatment effect

τ_{j}

is an overall treatment effect while the relative treatment effect

δ_{j}

is a treatment effect relative to the baseline control group. If

τ_{j} \neq 0

, there is an absolute treatment effect. There is a relative treatment effect if

δ_{j} \neq 0

. In both cases, the treatment groups are effective. In addition, the signs of the treatment effects are important. If we investigate whether a drug is effective in curing a disease, then significant absolute treatment effect with a positive sign implies that the drug is effective. On the other hand, if we investigate whether a treatment is effective in lowering the likelihood of being infected by a disease, a significant absolute treatment effect with negative sign signifies that the treatment is effective. In both cases,

δ_{j}

can be positive or negative as the effectiveness of the treatment for patients depends on the patients’ attributes which are the individualized effects. Furthermore, the epidemiologists can compute a score based on

(μ + τ_{j} + X_{i j}^{'} β) (1 + δ_{j})

. We can use a score of 0 as a benchmark, i.e., a probability threshold of 0.5. If

(μ + τ_{j} + X_{i j}^{'} β) (1 + δ_{j}) > 0

, then the treatment groups are viewed as effective. If

(μ + τ_{j} + X_{i j}^{'} β) (1 + δ_{j}) < 0

, then the treatment groups are viewed as ineffective. If the probability threshold is taken to a different value other than 0.5, the cut-off value 0 should also be changed accordingly.

Furthermore, the AbRelaTEs model can be interpreted in two ways—“between” and “within” treatment effects. When interpreting the “within” treatment effects, each individual predictor’s effects can be interpreted. Additionally, the “between” treatment effects allow us to make interpretations using the information of all covariates from each patient/participant in the data. The overall effects of a patient or a certain group of people sharing the same attributes known as the individualized effects are then compared between treatments. This enables us to make recommendations if a treatment is suitable for the general public or a specific group of people, allowing us to determine whether or not a treatment can be treated as a precision medicine.

In addition, the AbRelaTEs model has several advantages if we consider using a logistic regression model with treatment-specific coefficients

β_{j}

for

X_{i j}

given in model (19):

logit (π_{i j}) = μ + τ_{j}^{*} + X_{i j}^{'} β_{j}

(19)

for

i = 1, 2, . . ., n_{j}

and

j = 1, 2, . . ., g

.

There will be three additional difficulties for such a general framework (19): (1)

τ_{j}^{*}

may not be significant due to treatment-specific coefficients for

X_{i j}

; (2) for medical data (i.e., clinical trials),

X_{i j}^{'}

s are often measured at the baseline, and

X_{i j}^{'} β

are used as baseline characteristics in order to test whether the treatment indicator

τ_{j}^{*}

is significant or not. In a logistic regression model with treatment-specific coefficients,

β_{j}

s can be very different, and the interpretations of

μ

and

τ_{j}^{*}

can be difficult; and (3) the estimation of

β_{j}

s can be difficult. In addition, it is not feasible to define an overall relative effect for the treatment j. In contrast, in the AbRelaTEs model, we only need to estimate the relative treatment effect

δ_{j}

, and all interpretations presented in this paper are valid. Furthermore, the AbRelaTEs model can be viewed as a bridge between the classical logistic regression model for medical data and the logistic regression model with treatment-specific coefficients for each predictor.

Similarly, the interpretations on the “between” and “within” group effects can be made when analyzing medical data in epidemiological studies (e.g., case-control studies or cohort studies) using the AbRelaTEs model. The groups of individuals or people with different exposure status or degree of exposure in epidemiology are used to study the absolute and relative group effects in the AbRelaTEs model. The main advantage of the AbRelaTEs model in analyzing such data is to better interpret the effects of the exposure levels on the response variable specific to each category in the risk factors, which is known as the “individualized” effect as discussed in the previous sections.

In addition, we showed that our model is capable of modeling the absolute and relative treatment effects through simulation examples. Moreover, it was also shown through four real-world randomized controlled trials data that our model is highly interpretable, resulting in better understandings of the treatment effects. In addition, it is also established that the model preserves desired theoretical properties such as consistency and asymptotic normality under regularity conditions. These properties suggest that the AbRelaTEs model can be used as a new benchmark model for modeling randomized controlled trials. The AbRelaTEs model which considers the treatment effects can be further extended to accommodate two-way effects.

Finally, our model can be extended to response variables being continuous or semi-continuous, and predictors being high dimensional. We can also specify the relative effect indicators

δ_{j}

to be functions of predictors. We will consider these topics in the future research.

Author Contributions

All authors have read and agreed to the published version of the manuscript. Both authors designed the research problems and the writing. The first author did the derivation and computation. The second author proposed the ideas and concepts.

Funding

Zhang thanks the partial support of NSF grant DMS-2012298.

Institutional Review Board Statement

This work does not require the Institutional Review Board Statement and approval.

Informed Consent Statement

This work does not require the written informed consent.

Data Availability Statement

The data used for the statistical analysis can be found in the research articles cited in the real data section.

Acknowledgments

The authors thank the academic editor and four anonymous reviewers for their constructive comments which improved the paper. Zhang also thanks the partial support of NSF grant DMS-2012298.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Appendix A.1. Proofs of Theorem 1

We let the notation

U_{i j}

denote

Y_{i j} | X_{i j}

and

f (U_{i j} | θ)

be the likelihood function for each

i, j

given below:

f (U_{i j} | θ) = {(\frac{exp [W_{i j}^{'} (δ) β^{*}]}{1 + exp [W_{i j}^{'} (δ) β^{*}]})}^{Y_{i j}} {(\frac{1}{1 + exp [W_{i j}^{'} (δ) β^{*}]})}^{1 - Y_{i j}} .

(A1)

Taking log of the likelihood function (A1) gives us:

log f (U_{i j} | θ) = \{Y_{i j} W_{i j}^{'} (δ) β^{*} - log {1 + exp [W_{i j}^{'} (δ) β^{*}]}\} .

(A2)

From the main text, the log-likelihood function can then be rewritten using (A2) as

\begin{matrix} l (θ) = \sum_{j = 1}^{g} \sum_{i = 1}^{n_{j}} log f (U_{i j} | θ) . \end{matrix}

Before we show the consistency results, we let

Q_{n} (θ)

be defined as

Q_{n} (θ) = \frac{1}{n} \sum_{j = 1}^{g} \sum_{i = 1}^{n_{j}} log f (U_{i j} | θ) .

(A3)

Note that dividing the log-likelihood by n does not change the optimization in the main text but this allows us to easily obtain the consistency results. By assumptions (A2) and (A3), parameter identification is satisfied. Using the notations defined in the main text:

\begin{matrix} | log ϕ (W_{i j}^{'} (δ) β^{*}) |} & = | log ϕ (0) + λ (W_{i j}^{'} (\tilde{δ}) {\tilde{β}}^{*}) W_{i j}^{'} (δ) β^{*} | \\ \leq | log ϕ (0) | + λ (W_{i j}^{'} (\tilde{δ}) {\tilde{β}}^{*}) | W_{i j}^{'} (δ) β^{*} | \\ \leq | log 2 | + {1 + C | W_{i j}^{'} (\tilde{δ}) {\tilde{β}}^{*} |} | W_{i j}^{'} (δ) β^{*} | \\ \leq | log 2 | + {1 + C | | W_{i j} {(δ) | |}_{2} | | β^{*} {| |}_{2}} | | W_{i j} {(δ) | |}_{2} | | β^{*} {| |}_{2} \end{matrix}

where

ϕ (u) = e^{u} / (1 + e^{u})

,

λ (u) = ϕ^{'} (u) / ϕ (u)

and

ϕ^{'} (u)

is the first derivative with respect to u. The first equality is by mean value theorem. The second inequality is by triangular inequality. Third inequality is by continuity of

λ (u)

and last inequality is by Cauchy–Schwartz. By assumptions (A1) and (A2), the expectation of the moments and parameters are bounded. Similarly,

log {1 - ϕ (W_{i j}^{'} (δ) β^{*})}

is also bounded and

Y_{i j}

is bounded too. By Lemma 2.2 in Newey and McFadden [27],

Q_{0} (θ)

has a unique maximum at

θ_{0}

.

By weak law of the large number, we have point-wise convergence:

Q_{n} (θ) = \frac{1}{n} \sum_{j = 1}^{g} \sum_{i = 1}^{n_{j}} log f (U_{i j} | θ) \to_{p} E (log f (U_{i j} | θ)) = Q_{o} (θ) .

Additionally, note that

log f (U_{i j} | θ)

is concave. By Theorem 2.7 in Newey and McFadden [27],

\hat{θ} \to_{p} θ_{0}

.

Appendix A.2. Proofs of Theorem 2

Condition (i) of Theorem 3.3 in Newey and McFadden [27] is satisfied by assumption (A1); (ii) is satisfied since the likelihood function is twice continuously differentiable; (iii) is satisfied since the moments and parameters are bounded by assumptions (A1) and (A2); (iv) is satisfied by assumption (A3). By differentiating (A2) twice, we obtain the hessian matrix shown below. The expectations of the moments and parameters are all bounded by assumptions (A1)–(A2) so condition (v) is satisfied. By Theorem 3.3 in Newey and McFadden, [27] we establish

\sqrt{n} (\hat{θ} - θ_{0}) \to_{D} N (0, {[I (θ_{0})]}^{- 1}),

where

I (θ_{0})

is the expected Fisher information at

θ_{0}

.

The first partial derivatives of the log-likelihood are:

\begin{matrix} \frac{\partial l (θ)}{\partial μ} = \sum_{j = 1}^{g} \sum_{i = 1}^{n_{j}} \{Y_{i j} (1 + R_{i j}^{'} δ) - \frac{exp (W_{i j}^{'} (δ) β^{*})}{1 + exp (W_{i j}^{'} (δ) β^{*})} (1 + R_{i j}^{'} δ)\}, \end{matrix}

\begin{matrix} \frac{\partial l (θ)}{\partial τ_{j}} = \sum_{i = 1}^{n_{j}} \{Y_{i j} T_{i, j} (1 + R_{i j}^{'} δ) - \frac{exp (W_{i j}^{'} (δ) β^{*})}{1 + exp (W_{i j}^{'} (δ) β^{*})} T_{i, j} (1 + R_{i j}^{'} δ)\}, \end{matrix}

\begin{matrix} \frac{\partial l (θ)}{\partial β_{k}} = \sum_{j = 1}^{g} \sum_{i = 1}^{n_{j}} \{Y_{i j} X_{i k} (1 + R_{i j}^{'} δ) - \frac{exp (W_{i j}^{'} (δ) β^{*})}{1 + exp (W_{i j}^{'} (δ) β^{*})} X_{i k} (1 + R_{i j}^{'} δ)\}, \end{matrix}

\begin{matrix} \frac{\partial l (θ)}{\partial δ_{j}} = \sum_{i = 1}^{n_{j}} \{Y_{i j} V_{i j}^{'} β^{*} R_{i, j} - \frac{exp (W_{i j}^{'} (δ) β^{*})}{1 + exp (W_{i j}^{'} (δ) β^{*})} V_{i j}^{'} β^{*} R_{i, j}\}, \end{matrix}

for

j = 1, 2, . . ., g - 1

and

k = 1, 2, . . ., p

.

The second partial derivatives of the log-likelihood are:

\begin{matrix} \frac{\partial^{2} l (θ)}{\partial μ^{2}} = \sum_{j = 1}^{g} \sum_{i = 1}^{n_{j}} \{- \frac{exp (W_{i j}^{'} (δ) β^{*})}{{[1 + exp (W_{i j}^{'} (δ) β^{*})]}^{2}} {(1 + R_{i j}^{'} δ)}^{2}\}, \end{matrix}

\begin{matrix} \frac{\partial^{2} l (θ)}{\partial μ \partial τ_{j}} = \sum_{i = 1}^{n_{j}} \{- \frac{exp (W_{i j}^{'} (δ) β^{*})}{{[1 + exp (W_{i j}^{'} (δ) β^{*})]}^{2}} T_{i, j} {(1 + R_{i j}^{'} δ)}^{2}\}, \end{matrix}

\begin{matrix} \frac{\partial^{2} l (θ)}{\partial μ β_{k}} = \sum_{j = 1}^{g} \sum_{i = 1}^{n_{j}} \{- \frac{exp (W_{i j}^{'} (δ) β^{*})}{{[1 + exp (W_{i j}^{'} (δ) β^{*})]}^{2}} X_{i k} {(1 + R_{i j}^{'} δ)}^{2}\}, \end{matrix}

\begin{matrix} \frac{\partial^{2} l (θ)}{\partial μ \partial δ_{j}} = \sum_{i = 1}^{n_{j}} \{Y_{i j} R_{i, j} - [\frac{V_{i j}^{'} β^{*} R_{i, j} (1 + R_{i j}^{'} δ)}{1 + exp (W_{i j}^{'} (δ) β^{*})} + R_{i, j}] \frac{exp (W_{i j}^{'} (δ) β^{*})}{1 + exp (W_{i j}^{'} (δ) β^{*})}\}, \end{matrix}

\begin{matrix} \frac{\partial^{2} l (θ)}{\partial τ_{j}^{2}} = \sum_{i = 1}^{n_{j}} \{- \frac{exp (W_{i j}^{'} (δ) β^{*})}{{[1 + exp (W_{i j}^{'} (δ) β^{*})]}^{2}} T_{i, j}^{2} {(1 + R_{i j}^{'} δ)}^{2}\}, \end{matrix}

\begin{matrix} \frac{\partial^{2} l (θ)}{\partial τ_{j} \partial β_{k}} = \sum_{i = 1}^{n_{j}} \{- \frac{exp (W_{i j}^{'} (δ) β^{*})}{{[1 + exp (W_{i j}^{'} (δ) β^{*})]}^{2}} T_{i, j} X_{i k} {(1 + R_{i j}^{'} δ)}^{2}\}, \end{matrix}

\begin{matrix} \frac{\partial^{2} l (θ)}{\partial τ_{j} \partial δ_{j}} = \sum_{i = 1}^{n_{j}} \{Y_{i j} T_{i, j} R_{i, j} - [\frac{(V_{i j}^{'} β^{*} R_{i, j}) T_{i, j} (1 + R_{i j}^{'} δ)}{1 + exp (W_{i j}^{'} (δ) β^{*})} + T_{i, j} R_{i, j}] \frac{exp (W_{i j}^{'} (δ) β^{*})}{1 + exp (W_{i j}^{'} (δ) β^{*})}\}, \end{matrix}

\begin{matrix} \frac{\partial l (θ)}{\partial β_{k} \partial β_{k^{'}}} = \sum_{j = 1}^{g} \sum_{i = 1}^{n_{j}} \{- \frac{exp (W_{i j}^{'} (δ) β^{*})}{{[1 + exp (W_{i j}^{'} (δ) β^{*})]}^{2}} X_{i k} X_{i k^{'}} {(1 + R_{i j}^{'} δ)}^{2}\}, \end{matrix}

\begin{matrix} \frac{\partial^{2} l (θ)}{\partial β_{k} \partial δ_{j}} = \sum_{i = 1}^{n_{j}} \{Y_{i j} X_{i k} R_{i, j} - [\frac{(V_{i j}^{'} β^{*} R_{i, j}) X_{i k} (1 + R_{i j}^{'} δ)}{1 + exp (W_{i j}^{'} (δ) β^{*})} + X_{i k} R_{i, j}] \frac{exp (W_{i j}^{'} (δ) β^{*})}{1 + exp (W_{i j}^{'} (δ) β^{*})}\}, \end{matrix}

\begin{matrix} \frac{\partial^{2} l (θ)}{\partial δ_{j}^{2}} = \sum_{i = 1}^{n_{j}} \{- \frac{exp (W_{i j}^{'} (δ) β^{*})}{{[1 + exp (W_{i j}^{'} (δ) β^{*})]}^{2}} {(V_{i j}^{'} β^{*} R_{i, j})}^{2}\} . \end{matrix}

The elements of the expected fisher information matrix at

θ

are easily obtained for terms without

Y_{i j}

. Here, we will only show the elements of the expected Fisher information matrix for terms involving

Y_{i j}

:

\begin{matrix} - E (\frac{\partial^{2} l (θ)}{\partial μ \partial δ_{j}}) = E \{\frac{exp (W_{i j}^{'} (δ) β^{*})}{{[1 + exp (W_{i j}^{'} (δ) β^{*})]}^{2}} V_{i j}^{'} β^{*} R_{i, j} (1 + R_{i j}^{'} δ)\}, \end{matrix}

\begin{matrix} - E (\frac{\partial^{2} l (θ)}{\partial τ_{j} \partial δ_{j}}) = E \{\frac{exp (W_{i j}^{'} (δ) β^{*})}{{[1 + exp (W_{i j}^{'} (δ) β^{*})]}^{2}} (V_{i j}^{'} β^{*} R_{i, j}) T_{i, j} (1 + R_{i j}^{'} δ)\}, \end{matrix}

\begin{matrix} - E (\frac{\partial^{2} l (θ)}{\partial β_{k} \partial δ_{j}}) = E \{\frac{exp (W_{i j}^{'} (δ) β^{*})}{{[1 + exp (W_{i j}^{'} (δ) β^{*})]}^{2}} (V_{i j}^{'} β^{*} R_{i, j}) X_{i k} (1 + R_{i j}^{'} δ)\}, \end{matrix}

Using the notations defined in the main text

Z_{i j} = {(W_{i j}^{'} (δ_{0}), V_{i j}^{'} β_{0}^{*} R_{i, 1}, . . ., V_{i j}^{'} β_{0}^{*} R_{i, (g - 1)})}^{'}

be a

(2 g + p - 1) \times 1

, we can rewrite the expected Fisher information in the following matrix form:

I (θ) = E {ϕ (W_{i j}^{'} (δ_{0}) β_{0}^{*}) [1 - ϕ (W_{i j}^{'} (δ_{0}) β_{0}^{*})] Z_{i j} Z_{i j}^{'}} .

(A4)

Table A1. Estimate, standard deviation (SD), standard error (SE), and coverage probability (CP) when

τ_{1, 0} = - 1

and

δ_{1, 0} = - 0.5