Weighted Log-Rank Test for Clinical Trials with Delayed Treatment Effect Based on a Novel Hazard Function Family

Qian, Kaihuan; Zhou, Xiaohua

doi:10.3390/math10152573

Open AccessArticle

Weighted Log-Rank Test for Clinical Trials with Delayed Treatment Effect Based on a Novel Hazard Function Family

by

Kaihuan Qian

¹

and

Xiaohua Zhou

^1,2,3,*

¹

Department of Biostatistics, School of Public Health, Peking University, Beijing 100191, China

²

Beijing International Center for Mathematical Research, Peking University, Beijing 100871, China

³

Pazhou Lab, No. 70 Yuean Road, Haizhu District, Guangzhou 510335, China

^*

Author to whom correspondence should be addressed.

Mathematics 2022, 10(15), 2573; https://doi.org/10.3390/math10152573

Submission received: 30 June 2022 / Revised: 14 July 2022 / Accepted: 20 July 2022 / Published: 25 July 2022

(This article belongs to the Special Issue Mathematics in Biomedicine, 2nd Edition)

Download

Browse Figures

Versions Notes

Abstract

:

In clinical trials with delayed treatment effect, the standard log-rank method in testing the difference between survival functions may have problems, including low power and poor robustness, so the method of weighted log-rank test (WLRT) is developed to improve the test performance. In this paper, a hyperbolic-cosine-shaped (

C H

) hazard function family model is proposed to simulate delayed treatment effect scenarios. Then, based on Fleming and Harrington’s method, this paper derives the corresponding weight function and its regular corrections, which are powerful in test, theoretically. Alternative methods of parameters selection based on potential information are also developed. Further, the simulation study is conducted to compare the power performance between

C H

WLRT, classical WLRT, modest weighted log-rank test and WLRT with logistic-type weight function under different hazard scenarios and simulation settings. The results indicate that the

C H

statistics are powerful and robust in testing the late difference, so the

C H

test is useful and meaningful in practice.

Keywords:

clinical trial; survival analysis; weighted log-rank test; non-proportional hazard; delayed treatment effect

MSC:

62N03

1. Introduction

Recently, with the rapid development of medicine, clinical trials, as important processes in the evaluation of new drugs, have received more attention, and the related biostatistics methods need to be improved. In many clinical trials collecting time-to-event data as the endpoint and conducting data analysis under the framework of survival analysis, one of the most significant problem is how to compare two survival curves, or using a mathematical expression, how to test whether there exists a difference between survival functions from different groups.

The log-rank test [1] and Cox regression [2] are two of the most classical and widely used statistical methods in comparing survival functions. The log-rank test as a non-parametric method is usually used in randomized controlled trials (RCT) which matches covariates between groups through randomization before starting the trial, while Cox regression is often used in population studies to quantify the effects of covariates [3]. Both of these methods are actually based on the proportional hazard assumption, which assumes that the hazard ratio of two groups are constant through time, and the standard log-rank test has the highest power in that case [1]. Most traditional clinical trials satisfy proportional hazard approximately, so subsequent statistical inferences are usually based on the assumption.

However, the advances in immuno-oncology have presented many cases violating the proportional hazard assumption, and that is called the non-proportional hazard (NPH) scenario. The mechanism of immunotherapy, such as the PD-1/PD-L1 inhibitor, is motivating the human body’s immune system to take anti-tumor reactions [4]. Its treatment effects are indirect and need a transition period to reveal and stabilize compared with the traditional treatment methods, such as chemotherapy [5], so the situation of the delayed treatment effect is such that the difference between the treatment and control group is not revealed at once but, after a certain period, it often takes place in related studies. Some examples are nivolumab versus docetaxel in treating advance non-squamous non-small-cell lung cancer [6], pembrolizumab versus docetaxel for treating the same disease [7] and eribulin mesylate versus capecitabine in treating locally advanced or metastatic breast cancer [8]. Besides the immuno-oncology study, the delayed treatment effect is also observed in RCT studies on other diseases, such as nephrosis [9], cardiovascular disease [10], bone marrow transplantation [11] and even infectious disease [12]. The delayed treatment effect is the most common NPH scenario in practice [13] and deserves special methods in order to be dealt with.

Delayed treatment effect scenarios represented by immunotherapy have brought challenges to traditional test methods. Plenty of numeric simulation results have shown that the standard log-rank test has problems, including low power and poor robustness in the existing NPH scenario [13,14,15], while the result of Cox regression is difficult to interpret when the proportional hazard assumption is violated. Therefore, classical test methods need to be improved when there is a potential delayed treatment effect.

There are two main alternative test methods under RCT designs, including the weighted log-rank test (WLRT) [16] and methods based on weighted difference in survival curves, such as restricted mean survival time (RMST) [17,18]. The latter one has not received wide recognition in the field of medicine and clinical trials [19]. The WLRT is an improved method of the standard log-rank test, which is the most common test method in clinical trials and has gained more attention, so this paper is mainly about the WLRT.

The basic idea of WLRT is adding weights to different observations, according to the scenarios of the hazard ratio, and a related theoretical framework was proposed by Fleming and Harrington [20]. Using notations from martingale theory, the general form of WLRT statistics

W_{K}

is as follows:

W_{K} = \sqrt{\frac{1}{n_{1}} + \frac{1}{n_{2}}} \int_{0}^{\infty} W (t) \frac{{\bar{Y}}_{1} (t) {\bar{Y}}_{2} (t)}{{\bar{Y}}_{1} (t) + {\bar{Y}}_{2} (t)} \{\frac{d {\bar{N}}_{1} (t)}{{\bar{Y}}_{1} (t)} - \frac{d {\bar{N}}_{2} (t)}{{\bar{Y}}_{2} (t)}\},

(1)

where

n_{i}

denotes the sample size of the ith group,

{\bar{Y}}_{i} (t)

denotes the number of individuals at risk in the ith group at time t,

{\bar{N}}_{i} (t)

denotes the number of individuals taking events in the ith group by time t, and

W (t)

denotes the weight function. Fleming and Harrington proved that

W_{K}

follows normal distribution asymptotically in a large sample [16], so

Z_{K} = W_{K} / \sqrt{{\hat{σ}}_{W_{K}}} \sim N (0, 1),

(2)

where

{\hat{σ}}_{W_{K}}^{2} = \int (\frac{K^{2}}{{\bar{Y}}_{1} (t)} + \frac{K^{2}}{{\bar{Y}}_{2} (t)}) (1 - \frac{Δ {\bar{N}}_{1} (t) + Δ {\bar{N}}_{2} (t) - 1}{{\bar{Y}}_{1} (t) + {\bar{Y}}_{2} (t) - 1}) \frac{d ({\bar{N}}_{1} (t) + {\bar{N}}_{2} (t))}{{\bar{Y}}_{1} (t) + {\bar{Y}}_{2} (t)},

(3)

and

K = {(\frac{n_{1} n_{2}}{n_{1} + n_{2}})}^{1 / 2} W (t) \frac{{\bar{Y}}_{1} {\bar{Y}}_{2}}{n_{1} n_{2}} \frac{n_{1} + n_{2}}{{\bar{Y}}_{1} + {\bar{Y}}_{2}} .

(4)

By selecting different weight functions

W (t)

, different WLRT statistics can be acquired. A classical weight function called the

G^{ρ, γ}

family proposed by Fleming and Harrington [20] is

W (t) = {(S (t))}^{ρ} {(1 - S (t))}^{γ},

(5)

where

S (t)

is the survival function at time t (usually use Kaplan–Meier estimation [21] to approximate), and

ρ

and

γ

are two parameters that can be selected. When

ρ = 0, γ > 0

, more weight is put on the late observations, thus the corresponding WLRT can have better performance in testing the delayed treatment effect.

G^{0, 1}

and

G^{0, 2}

are two common

G^{ρ, γ}

tests.

Besides the

G^{ρ, γ}

family, other forms of weight function have also been developed in the literature. Magirr et al. proposed the modestly weighted log-rank (mWLRT) test which has the following weight function:

W_{m W L R T} (t) = \frac{1}{m a x {\hat{S} (t), \hat{S} (t^{*})}},

(6)

where

\hat{S} (t^{*})

is the estimation of survival function at some certain time

t^{*}

, and the time

t^{*}

can be calculated using their method and is associated with the expected occurrence time of treatment effect [22]. Magirr et al. proved that their method is non-inferior to the standard log-rank test under almost all scenarios. Unlike other WLRT, Yu et al. considered the weight function based on the time instead of the estimation of survival functions, and a logistic-like weight function (denoted as

w L R T

) proposed by them is as follows:

W_{w L R T} = \frac{e^{a (t - τ)}}{1 + e^{a (t - τ)}},

(7)

where

τ

is associated with the median time of transition period and a is associated with the length of transition period [23]. Moreover, Breslow et al. [24], Xu et al. [25], Zucker and Lakatos [26], Yang and Prentice [27] and others also developed various WLRT methods to deal with NPH scenarios.

However, all of these WLRT are not uniformly most powerful under various delayed treatment effect scenarios. The simulation results from these studies and other review studies [14,28] show that a proper selection of the weight function and corresponding parameters is vital to the performance of the test. If the selection does not meet the real situation well, the power of the test may be very low, and the robustness will be affected a lot. Some methods, such as the

w L R T

proposed by Yu et al., rely on prior information a lot, so they may be not very powerful when such information is lacking. Furthermore, these weight functions may be hard to interpret from the medical perspective and not associated with the expression of survival curves intuitively. Moreover, although there have been many studies on improved WLRT, the references to be compared in these studies are the classical

G^{ρ, γ}

test, and there are rare parallel comparisons between these novel methods.

It should be mentioned that versatile combination tests based on WLRT have been proposed recently. Such tests, such as the “Max-Combo” test, take the maximum or linear combination of some WLRT [14,20], so they are more robust than the single WLRT in general situations and do not require too much prior information [13,26,29,30]. However, the construction of these combination tests does rely on the selection of single WLRT, and their performances are greatly influenced by the properties of single WLRT. Additionally, though combination tests have good robustness, their power in specific scenarios will not exceed the most powerful WLRT or their component test statistics. In that case, our study still focuses on the single WLRT under the specific situation that the delayed treatment effect appears.

This paper aims to establish a parametric model of hazard functions family to fit the delayed treatment effect scenarios and solve its corresponding most theoretically powerful WLRT under the Fleming and Harrington framework. Additionally, this paper compares the novel WLRT with the classical

G^{ρ, γ}

test and other latest WLRT in the simulation study, and two application examples are analyzed to evaluate their practical performance. The rest of this paper is organized as follows: In Section 2, we define the novel

C H

hazard function family to fit the delayed treatment effect scenario and discuss its associated properties. In Section 3, the corresponding

C H

weight function is derived to construct the powerful WLRT, and methods of parameter selection are also demonstrated. We conduct numerical simulation study to evaluate the performance of different WLRT under various settings and report the results in Section 4. In Section 5, we demonstrate two examples with real world data. In Section 6, we discuss some inspirations from our study and give some suggestions on testing the delayed treatment effect in practice. We summarize the content of this paper and give the conclusion in Section 7.

2. $CH$ Hazard Function Family

2.1. Definition

In order to establish the model of hazard function to fit the scenario of the delayed treatment effect and obtain its related weight function, the features of delayed treatment effect in clinical trials should be considered first, and they can be summarized as follows:

In the early stage of trial, the control group which applies traditional treatment has revealed certain treatment effects, while the effect of the treatment group remains invalid or not obvious due to the mechanism of drugs. In that case, the hazard ratio at this time can be considered equivalent to 1 or belonging to a neighborhood containing 1.
As the trial goes on, the effect of the treatment group begins to be revealed, so the hazard ratio decreases gradually and is less than 1 in the end.
In the late stage of the trial, the effect of the treatment group is fully revealed and tends to be stable. The scenario in this stage can be considered approximate to the proportional hazard [31], so the hazard ratio asymptotically converges to a constant less than 1.

Based on such features and inspired by properties of the hyperbolic cosine (

c o s h

) function

f (x) = \frac{1}{2} (e^{- x} + e^{x})

, we propose a hazard function family called the

C H

family, which has the following form:

λ_{θ} (t; ρ, γ) = γ \cdot (\frac{ρ}{e^{t + θ}} + e^{t + θ}), t \geq 0, ρ \geq 0, γ > 0,

(8)

where t is the time,

{ρ, γ}

is defined as the shape parameters, which determines the shape of the hazard function and corresponding survival function, and

θ

is defined as the group parameter which distinguishes different groups. In fact, it should be mentioned that this model can be simplified to two parameters by reparameterization. However, we write the function family in this form for two main reasons: (1) each parameter can have relatively intuitive meaning in this form, which will be illustrated soon; and (2) the most powerful weight function of

C H

family can be easily derived in this form. As the name implies, the differences between the clinical trials are determined by shape parameters, while the difference between groups in one clinical trial is determined by the group parameter. In other words, only group parameters define the specific

C H {ρ, γ}

family and we assume that groups from one trial share the same shape parameters. Therefore, the role of

θ

is reflected in fitting the survival curve, and it is inessential for testing the difference between groups from a family. Knowing the relationship between hazard function

λ (t)

and survival function

S (t) = e x p (- \int_{0}^{t} λ (s) d s)

, the statistical inference on survival functions can be conducted with hazard functions as the medium, and the corresponding

C H

survival function family has the following form:

S_{θ} (t; ρ, γ) = e x p (- \int_{0}^{t} λ (s, θ; ρ, γ) d s) = e x p {(ρ \cdot e^{- t - θ} - e^{t + θ} + e^{θ} - ρ \cdot e^{- θ})}^{γ}

(9)

The clinical trial with hazard functions in the

C H

family can almost fit three features of the delayed treatment effect discussed above, and the details of the proof are shown in the Appendix A. Therefore, the

C H

function family can simulate different scenarios of the delayed treatment effect and help conduct further inference.

2.2. Properties

We consider the properties of the

C H

function family and discuss how it can fit delayed treatment effect scenarios first. Because two hazard functions of groups in a clinical trial share the same shape parameters, we denote them as

{ρ_{0}, γ_{0}}

, and we assume that the group parameters of the control and treatment group are

θ_{C}

and

θ_{T}

, respectively. Without loss of generality, we assume that

θ_{C} > θ_{T}

. Thus, the hazard functions of two groups are

λ_{θ_{C}} (t) = γ_{0} \cdot (ρ_{0} e^{- (t + θ_{C})} + e^{t + θ_{C}}),

(10)

λ_{θ_{T}} (t) = γ_{0} \cdot (ρ_{0} e^{- (t + θ_{T})} + e^{t + θ_{T}}) .

(11)

The role of group parameter

θ

is discussed under the scene of fitting survival functions. It should be noted that all graphs of the

C H

family hazard function are the same when the shape parameters are fixed as

{ρ_{0}, γ_{0}}

, and can be obtained by shifting the graph of

λ (t; ρ, γ) = γ \cdot (ρ e^{- t} + e^{t})

by

θ

horizontally. A larger difference between two group parameters

θ_{C} - θ_{T}

will lead to a larger difference in performance between the two groups. As

t \to + \infty

, the hazard ratio

r (t)

converges to the constant

e^{θ_{C} - θ_{T}}

. Meanwhile, the sum of group parameters

θ_{C} + θ_{T}

is associated with the time when two survival curves separate, and we recommend controlling the range of

θ

in an interval near 0 to have a better fitting effect based on plenty of numerical experiments.

Various combinations of group parameters lead to different initial survival scenarios and hazard ratios in the long term. To illustrate the effect of various parameters selection on the relative relation of survival functions and hazard ratio between groups, we present some graphs. Fixing

ρ = 2

,

γ = 0.1

, the survival curves and the hazard ratio with different alternatives of

θ

are shown in Figure 1. For obviousness, we choose some extreme value of parameters, so the graphs may be not as typical as the delayed treatment effect scenario. When the value of

θ_{C} - θ_{T}

is relatively small, the survival function may cross in the beginning and this situation actually corresponds to the scenario that the experimental drug is ineffective, while the other is effective at that time. If the superiority is slight, this scenario can be considered a special delayed treatment effect case rather than a cross hazard.

When

ρ = 0

,

λ (t; γ) = γ \cdot e^{t}

, the

C H

function family degenerates to the classical exponential function and the survival time follows an exponential distribution. In the beginning of trials, the first term of the

C H

family function outweighs the second term but the difference between the different functions is relatively small. However, the situation becomes opposite as the time increases. In fact, parameter

ρ

is the relative weight between two terms of the

C H

function family, and its influence on the value of the function is from the horizontal dimension. The relatively smaller the

ρ

, the closer the time-to-event data follows the exponential distribution, while the relatively larger the

ρ

, the later the second term

e^{t + θ}

takes effect on the value of the function. Thus, by altering

ρ

, the time for the two survival curves to separate, which is also the initial time of the relative treatment effect, will be different. Fixing

γ = 0.1

,

θ_{C} = - 0.4

and

θ_{T} = 0.4

, the survival curves and hazard ratio with different alternatives of

ρ

are illustrated in Figure 2. Therefore, to increase

ρ

, the expected relative treatment effect of the experiment drug will be postponed, but its long-term effects will remain the same.

As for the parameter

γ

, its influence on the value of the function is from the vertical dimension. Since the value of

ρ

is often determined by the specific scenario of the delayed treatment effect and the value of the hazard function will remain at a high level when

ρ

is large, the role of

γ

is to balance the range of hazard functions. Fixing

ρ = 1

,

θ_{C} = - 0.4

and

θ_{T} = 0.4

, the survival curves and hazard ratio with different alternatives of

γ

are illustrated in Figure 3. Therefore, increasing

γ

, the time when the relative treatment effect occurs will not change, but the expected survival rate at that time will decrease, which means that the trial will reach its designed end earlier.

With proper parameters setting, a pair of

C H

hazard functions can fit the delayed treatment effect scenario intuitively as expected, and the specific pattern can be constructed by adjusting parameters.

3. $CH$ Weight Function and CH Test

3.1. CH Class Weight Function

Since the

C H

function family can simulate the delayed treatment effect scenario, we can solve its corresponding weight function, which is most theoretically powerful, and construct the WLRT for the delayed treatment effect thusly. Using the method that solves the most powerful weight function with the hazard function family known proposed by Fleming and Harrington [20], the corresponding weight function and test statistics of

C H

hazard function family can be derived with the following expressions:

W_{C H} (t) = \frac{8 ρ}{4 ρ + {(\sqrt{m^{2} + 4 ρ} + m)}^{2}} - 1,

(12)

where

m = \frac{1}{γ} l o g \hat{S} (t) + ρ - 1,

(13)

and

\hat{S} (t)

is the pooled estimation of survival function at time t.

However, the result through direct derivation may not meet the regulation conditions of WLRT sometimes, so its asymptotic normality in large sample may have problems. We give two alternative correction forms of the

C H

weight function. The first one is to take the non-negative part of (12) and we have the

C H_{+}

weight function as follows:

W_{C H_{+}} (t) = m a x {0, \frac{8 ρ}{4 ρ + {(\sqrt{m^{2} + 4 ρ} + m)}^{2}} - 1} .

(14)

However, this correction is rough and gives extreme weights to early observations, which may be difficult to interpret. Therefore, we propose the second correction

C H_{C}

which is a smooth version of (12), and the weight function has the expression of

W_{C H_{C}} (t) = \frac{8 ρ^{2}}{4 ρ^{2} + {(\sqrt{m^{2} + 4 ρ} + m)}^{2}} - 1 .

(15)

The m in the (14) and (15) is the same as m in the (12). The WLRT with the

C H_{+}

and

C H_{C}

weight function follows normal distribution asymptotically, and the detailed derivation of these weight functions and the proof of their large sample properties are shown in Appendix B. For simplicity, the

C H

weight function refers to

C H_{+}

and

C H_{C}

in the following part of this paper, and we can derive their

C H

test statistics and construct corresponding

C H

tests. Though the expressions of the

C H

weight functions are more complex than

G^{ρ, γ}

to some degree, it will not increase the computational burden a lot when using statistical software. As for the two-sided hypothesis test that

H_{0} : \forall t, S_{C} (t) = S_{T} (t) v s H_{1} : \exists t, S_{C} (t) \neq S_{T} (t),

(16)

the null hypothesis will be rejected when

\frac{| W_{C H} |}{\sqrt{{\hat{σ^{2}}}_{C H}}} > z_{1 - α / 2},

(17)

where

\hat{σ^{2}}

can be obtained from (3) and (4),

W_{C H}

is the WLRT statistics in (1) with

C H

weight function

W_{C H_{_}} (t)

and

z_{1 - α / 2}

is the

1 - α / 2

quantile of normal distribution. The one-sided test can be constructed similarly.

Though the expressions of the two

C H

weight functions have slight differences from the original version, they share its properties generally. Since the

C H

function family can simulate the delayed treatment effect scenario well, the two

C H

weight functions derived from the original version should have high power in testing the delayed treatment effect, especially for the trial data fitting the

C H

function family.

3.2. Selection of Parameters

There are two undefined parameters

{ρ, γ}

in the

C H

weight functions, so how to select proper parameters in practice is very important. According to the process of derivation, we learn that when the shape parameters of the

C H

statistics

{ρ, γ}

are the same as the actual parameters of the real data, the test will have the highest power. However, the statistical decision must be taken before conducting the clinical trial, so it is impossible to select the best parameters when the data are inaccessible. In that case, we propose two methods of parameter selection for the

C H

test.

3.2.1. Prior Information Method

Because the effects of shape parameters

{ρ, γ}

are relatively obvious and each combination can map to a specific delayed treatment effect scenario, some prior information about hazard functions or treatment effects can help us select better parameters. In fact, in practice, some information about the specific delayed treatment effect scenario is often accessible [23,27], and this information may come from pilot trials, clinical trials of previous phase, parallel studies, the literature, an estimation based on professional experience or knowledge, etc.

Under the scenario of the delayed treatment effect, some significant indices of interest in the clinic include the constant hazard ratio in the late stage of trials, the time when two survival functions or hazard functions separate, the survival rate when the treatment take delayed effects, etc. Using the expression of

C H

weight functions, we can derive parameters with such available prior information. In derivation, to control the range of

θ

, we can let

θ_{C} = - θ_{T} = θ_{0} > 0

. Because the derivatives of the

C H

function family increase as t goes up and the difference between the two functions keeps increasing, we can consider the time

t_{0}

, letting

λ_{θ_{C}} (t_{0}) = λ_{θ_{T}} (t_{0})

as the time when the hazard functions of two groups begin to be different and we have

e x p {(ρ \cdot e^{- t_{0} - θ_{0}} - e^{t_{0} + θ_{0}} + e^{θ_{0}} - ρ \cdot e^{- θ_{0}})}^{γ} = e x p {(ρ \cdot e^{- t_{0} + θ_{0}} - e^{t_{0} - θ_{0}} + e^{- θ_{0}} - ρ \cdot e^{θ_{0}})}^{γ} .

(18)

Meanwhile, having let

θ_{C} = - θ_{T}

, we can use

λ_{0} (t)

to approximate

λ_{θ_{C}} (t)

and

λ_{θ_{T}} (t)

or

S_{0} (t)

to approximate

S_{θ_{C}} (t)

and

S_{θ_{T}} (t)

. Thus, if the approximate total survival rate at time

t_{0}

is

P_{0}

, we can let

P_{0} = S_{0} (t_{0}) = {[e x p (ρ e^{- t_{0}} - e^{t_{0}} + 1 - ρ)]}^{γ} .

(19)

In that case, if we have information about the time

t_{0}

when the hazards of two groups differ and the total survival rate

P_{0}

at time

t_{0}

, by solving (18) and (19), we have

\begin{matrix} \{\begin{matrix} \hat{ρ} = e^{2 t_{0}} \\ \hat{γ} = \frac{l o g P_{0}}{1 - e^{2 t_{0}}} . \end{matrix} \end{matrix}

Thus,

\hat{ρ}

and

\hat{γ}

can be used to in

C H

statistics to test the expected scenario with the prior information. The above derivation is only an example for illustration. If other types of prior information can be acquired, such as information about survival functions, similar derivation of parameters can be conducted.

It should be noted that sometimes when the time scale of trial data is small, e.g., using day as the unit, the value of time data will be large and the shape parameter

γ

cannot control the range of the hazard function as a result. Therefore, estimating parameters through information associated with absolute time directly may cause numerical inflation, and the result may have large errors and be unreliable. In that case, we recommend conducting unit conversion or numerical transformation to decrease the magnitude of time data before analysis. When using weeks or months as the time unit or scaling the data so that

t_{0} = 1

, the method will have better performance. The expected median survival time should be not longer than 15 after the numerical transformation.

3.2.2. Default Selection Method

Just as

G^{0, 1}

is often the default selection from

G^{ρ, γ}

family statistics in testing the delayed treatment effect, there is little or no prior information about hazard functions or survival curves when making the statistical decision sometimes, so we give a pair of shape parameters as the default selection. Through plenty of numerical simulation experiments, we recommend

ρ = 2, γ = 0.1

as the default selection because of its high power and robustness.

By altering

θ

, the

C H {2, 0.1}

survival function family can represent an “average” scenario of the delayed treatment effect. If we consider that the trial enters the late period when the survival rate is below 0.2, the

C H {2, 0.1}

survival functions will separate around the

1 / 3

start time of the late period. The hazard ratio will be similar to the proportional hazard scenario if the treatment effect occurs too early, while the treatment effect may be not so clinically significant if it occurs too late. Figure 4 shows the survival curves of the

C H {2, 0.1}

family with different settings of

θ

(

0.4

,

0.2

, 0,

- 0.2

and

- 0.4

for

θ

from left to right).

Though the default selection method is not so sensitive to the scale of data as the prior information method, we still recommend conducting similar data transformation to improve its performance.

4. Simulation Study

In order to evaluate the performance of

C H

weight functions in practice and compare them with other WLRT dealing with the delayed treatment effect, we conduct comprehensive simulation studies. The simulation is set up as an RCT with two arms, treatment and control separately. The hypothesis test is set as two-sided, and the significant level

α

is

0.05

. The type I error rate under null hypothesis and the empirical power under alternative hypothesis of various test will be calculated and compared to evaluate their performance under different scenarios. The survival functions of the treatment and control group are denoted as

S_{T}

and

S_{C}

, and hazard functions are denoted as

λ_{T}

and

λ_{C}

, respectively. Like other parallel studies, most scenarios in this paper are simulated by generating time-to-event data following piece-wise exponential distribution, so different settings are distinguished by hazard functions.

Two test statistics

C H_{+} {ρ, γ}

and

C H_{C} {ρ, γ}

are proposed in this paper, and each statistic has two methods to select the parameter pair, so the

C H_{+} {ρ, γ}

using the prior information method and default selection method, and

C H_{C} {ρ, γ}

using these two methods are denoted as

C H_{+}^{P}

,

C H_{+}^{D}

,

C H_{C}^{P}

and

C H_{C}^{D}

, respectively. In the

G^{ρ, γ}

family, we choose

G^{0, 1}

and

G^{0, 2}

, which put weights on the late observations and

G^{0, 0}

, which is the standard log-rank test as comparisons. In other types of WLRT, we choose the modestly weighted log-rank test (

m W L R T

) proposed by Magirr et al. [22] and a novel WLRT (

w L R T

) proposed by Yu et al. [23] to make comparisons. Both these two weight functions were introduced briefly in Section 1. We choose these two WLRT for two main reasons: (1) Both them are relatively recent methods and have shown good performance in their simulation studies according to the literature. (2) There are also undetermined parameters that may depend on prior information before trials in these two WLRT, which ensures comparability with the

C H

WLRT.

A total of four statistics have parameters depending on prior information:

C H_{+}^{P}

,

C H_{C}^{P}

,

m W L R T

and

w L R T

. Similar to the example shown in Section 3.2.1, as for

C H

statistics based on the prior information method (

C H_{+}^{P}

and

C H_{C}^{P}

), we use the time

t_{0}

when the two survival curves separate and

P_{0}

, which is the total survival rate at

t_{0}

, to help determine the parameters. We assume we have certain information about

t_{0}

in each scenario and the average hazard function of two groups before

t_{0}

(

λ_{0}

) can be obtained by the pilot study or previous studies to calculate the expected

P_{0}

. The

m W L R T

also depends on the separation time of two curves, so it shares the same

t_{0}

with

C H_{+}^{P}

and

C H_{C}^{P}

in our simulation. The method of

w L R T

has two parameters: a is the length of the transition period that the drug takes to be effective, and

τ

is the center of the transition period. All of this information will be specified in their applicable range later. In order to control the range of parameters, all time-related data will be scaled so that

t_{0} = 1

in dealing with

C H

statistics.

4.1. Type I Error Rate

First, we set

λ_{T} (t) = λ_{C} (t)

to evaluate the type I error rate of these statistics. The total sample size (denoted as n) is 50, 100 and 200, respectively, with a

1 : 1

ratio to the two groups, and the number of repetitions is set as 10,000 to see whether the type I error rate can be controlled well. In this series of simulations,

t_{0} = 5

,

a = 5

,

τ = 7.5

, and

P_{0}

is estimated by the hazard function before

t_{0}

. Three scenarios under a null hypothesis (NH) are shown in Table 1, and hazard functions of two groups are the same in each scenarios.

The simulation result is shown in Table 2. The result shows that all these statistics arise with a type I error rate inflation to a different degree when

n = 50

. As the sample size increases to 100, only

G^{0, 2}

shows a slight type I error rate inflation (

α > 0.06

), while all statistics can generally control the type I error rate when

n = 200

. Therefore, the recommended total sample size should be more than 100 to ensure the reliability of the test in practice. Four

C H

family statistic of interests in this study can control the type I error rate relatively well and show non-inferiority to other WLRT.

4.2. Simple Settings

Then, we consider the simulation of the delayed treatment effect with simple settings and compare powers of various statistics in the test. “Simple” means we generate the time-to-event data based on the designated hazard functions directly using the Monte Carlo method, and there will be no censoring in that case. In this series of simulations,

t_{0} = 0.5

,

a = 0.5

,

τ = 0.75

, and

P_{0}

is estimated by the hazard function of control group before

t_{0}

. Table 3 shows the hazard functions of each scenario. As for the delayed treatment effect scenario (denoted by DT), we copy the classical scenario which was studied by Fleming [32], Lee [33] et al. in

D T_{1}

and set the

D T_{2}

in which the treatment takes a longer time to be effective. We set two scenarios

D T_{3}

and

D T_{4}

with the

C H

family hazard function, so

C H

family test statistics should be the most powerful, theoretically, in that case. Thus, in these two scenarios, we use the exact

{ρ, γ}

for

C H_{+}^{P}

and

C H_{C}^{P}

rather than the estimated. Moreover, we add

P H_{1}

and

P H_{2}

, which are two proportional hazard scenarios to evaluate their robustness. In

P H_{1}

and

P H_{2}

, we assume the prior information is known exactly and set

t_{0} = 0

,

a = 0

, and

τ = 0

. The total sample size is fixed as 200 and the group ratio remains at

1 : 1

, while the repetition number is 5000.

The power result of the simulation is reported in Table 4 and Figure 5. In the classical scenario

D T_{1}

, powers of all statistics are close to 1 because of the larger sample size compared to the literature, so this result is seldom reflected, except the good large sample properties of all statistics. In the

D T_{2}

,

C H_{C}^{P}

is the most powerful one and

G^{ρ, γ}

WLRT has relatively good performance, while powers of other WLRT are relatively low. In

D T_{3}

and

D T_{4}

generated from the

C H

function family, four

C H

test statistics are almost the most powerful as expected. In

D T_{3}

, the

G^{ρ, γ}

WLRT shows great disadvantages compared with other statistics. The performance of

w L R T

is still good in

D T_{3}

but very poor in

D T_{4}

. In

D T_{4}

,

C H

statistics outperform other WLRT overwhelmingly. The powers of

w L R T

and

G^{0, 2}

are lower than 0.5, which is very likely to cause a type II error in practice. As for the same type of

C H

statistics, the statistic using the prior information method are more powerful than using the default selection method, while the latter one can still let the power remain at a relatively high level.

As for the

P H_{1}

and

P H_{2}

, the standard log-rank test

G^{0, 0}

is the most powerful one as expected. The statistics based on prior information actually degenerate to

G^{0, 0}

, so they share the same power. Powers of

C H

statistics using the default selection method are much higher than

G^{0, 1}

and

G^{0, 2}

, which also have default parameters. In particular, the power of

C H_{C}^{D}

is very close to the power of

G^{0, 0}

.

4.3. Clinical Trial Settings

In simple settings, we only consider the ideal situation in which time-to-event data follow the designate distribution completely and the settings of hazard function are relatively simple. However, the data in clinical trials have certain differences from the ideal situation:

The order of magnitude of hazard function will be smaller, while the order of magnitude of time will be larger. The duration will be longer.
Subjects will not be enrolled in the trial simultaneously but during a enrollment period lasting for several months one by one.
Due to the factors including cost and efficiency, the trial will be ended after a certain time, or a planned proportion of subjects will show endpoint events rather than all endpoints observed. In that case, there will be type III censoring in practice.

The existence of censoring may be the most influential factor for power of test statistics, and the features of real trials discussed above should be simulated to evaluate their practical performance. In this part, we do not consider the censoring caused by accidents such as a drop-out or adverse event. In this series of simulations,

t_{0} = 3

,

a = 4

,

τ = 5

and

P_{0}

are estimated by the hazard function of the control group before

t_{0}

in each scenario.

Taking the study of Ray et al. [13] as a reference, we set the time unit as month, and the patient enrollment follows a Poisson process with a parameter equal to 25. The total enrollment period lasts for 12 months, which means about 300 subjects will be enrolled in the trial. The end of the trial is set as

70 %

of the total, which is that about 210 subjects are observed to occur endpoint events. The group ratio is still

1 : 1

and the number of repetition is 5000.

To ensure the comparability, we take the simulation study settings in the study of Ray et al. [13] as references and copy four delayed treatment effect scenarios denoted as

C - D T_{1}

,

C - D T_{2}

,

C - D T_{3}

and

C - D T_{4}

.

C - D T_{1}

and

C - D T_{2}

are two typical delayed treatment effect scenarios, while

C - D T_{3}

and

C - D T_{4}

are more complex with converging tails. To enrich the scenarios and evaluate the performance of statistics when prior information is lacking or unreliable, we set

C - D T_{5}

and

C - D T_{6}

where the delayed treatment effects do not take at

t_{0} = 3

, so the distributions of analysis data are different from the distributions expected by prior information. Furthermore, we add two proportional hazard scenarios

C - P H_{1}

and

C - P H_{2}

to make the comparison, and statistics based on prior information keep the same parameter selection as above rather than the real. The specific hazard function settings are shown in the Table 5.

The empirical power result of the simulation in the clinical trial settings is reported in Table 6 and Figure 6. In the classic scenarios of

C - D T_{1}

and

C - D T_{2}

,

G^{0, 0}

has the lowest power, while other statistics maintain good performance.

C H_{+}^{P}

,

w L R T

,

G^{0, 1}

and

G^{0, 2}

have relative higher powers over 0.9, and

m W L R T

and

C H_{C}^{P}

are in the second tier. In

C - D T_{3}

and

C - D T_{4}

with a converging tail, the

C H

family statistics manifest an obvious advantage over others. In

C - D T_{3}

, only

C H

and

m W L R T

statistics have power over 0.4, while other statistics perform very poorly. In

C - D T_{4}

, the power of

C H

statistics is over 0.1 above the power of the

G^{ρ, γ}

family. The power of

w L R T

and

G^{0, 2}

are the lowest in these two scenarios and have great contrasts with the first two scenarios, which shows relatively poor robustness of these two statistics. In

C - D T_{5}

and

C - D T_{6}

, where the delayed treatment effect occurs later, powers of

C H_{+}^{P}

,

w L R T

,

G^{0, 1}

and

G^{0, 2}

are the highest and over 0.8, while powers of the other three

C H

statistics are in the second tier but higher than

m W L R T

and

G^{0, 0}

obviously. In

C - P H_{1}

and

C - P H_{2}

that belong to the proportional hazard scenarios,

G^{0, 0}

is the most powerful one as expected. Similar to the simple settings, the power of

m W L R T

is very close to the

G^{0, 0}

, and the

C H

family statistics also have robust performance. The powers of

C H_{C}^{P}

and

C H_{C}^{D}

are very close to the highest one with differences less than 0.05. The performance of

w L R T

is relatively poor in the proportional hazard ratio when the prior information is unreliable and is only higher than

G^{0, 2}

.

5. Application in Real Studies

To evaluate and compare the performance of

C H

test statistics in practical application, we conduct an analysis on some simple real world studies that fit the scenario of the delayed treatment effect. Similar to the simulation study, four

C H

statistics,

w L R T

,

m W L R T

and three

G^{ρ, γ}

statistics are used to test the difference in each example.

5.1. Head-and-Neck-Cancer Study

The Northern Oncology Group (NCOG) studied the effects of a different treatment strategy on head-and-neck-cancer patients, and their survival time was collected [34]. Patients who took radiation therapy alone were allocated randomly to Group 1, or Group 2 if they took radiation plus chemotherapy. The Kaplan–Meier curves of these time-to-event data are shown in Figure 7.

It is obvious that two curves are almost the same at the beginning, but Group 2 shows better survival performance later. This example is a typical delayed treatment effect scenario, and we set

t_{0} = 5

,

P_{0} = 0.75

,

a = 2

,

τ = 5

based on the KM plot for statistics depending on prior information. Table 7 gives the p-values of various tests.

The result shows that the p-values of all statistics, including four

C H

statistics, are less than 0.05. Thus, the null hypothesis is rejected and we can draw the conclusion that radiation plus chemotherapy has a better treatment effect on head-and-neck cancer. Even the standard log-rank test

G^{0, 0}

can reject the null hypothesis; the reason may be that the difference is too large between two groups and the performance of Group 2 is always better than Group 1 actually.

5.2. Kidney Infection Study

Nahman et al. conducted a study to assess the time to first exit-site infection in patients with renal insufficiency, and 43 patients utilized a surgically placed catheter (Group 1), while 76 patients utilized a percutaneous placement of their catheter (Group 2) [35]. The Kaplan–Meier curves of the time-to-event data are shown in Figure 8.

We can intuitively find that two curves are close in the beginning of the study, while Group 1 has slight better survival performance. From the 5th to 9th month, two curves cross and separate finally, and Group 2 overwhelming has a survival advantage after. In that case, this study can be considered an example of the delayed treatment effect generally.

The setting of prior information is as following:

t_{0} = 7

,

P_{0} = 0.85

,

a = 4

,

τ = 7

. Table 8 gives the p-values of various tests.

The p-values of

m W L R T

and

G^{0, 0}

are greater than 0.05, while the other statistics have a p-value less than 0.05 and reject the null hypothesis. Since the difference between the two groups is obvious,

m W L R T

and

G^{0, 0}

have relatively poor performance and will cause a type II error in this example.

These two examples show that the

C H

statistics can test the delayed treatment effect correctly while some tests may not, so the

C H

statistics is useful in certain application scenarios.

6. Discussion

How to deal with the delayed treatment effect and conduct a corresponding hypothesis test is a very significant topic in the clinical trial research. There has been rich work about WLRT, but their comparisons and adaptive scenarios remain to be figured out. We propose a novel parametric family of hazard function to fit the delayed treatment effect and derive the corresponding weight function based on the method develop by Fleming and Harrington. The flexibility of the

C H

family parameters in fitting the delayed treatment effect and the good performance of

C H

statistics show that setting up a useful and adaptive hazard function family first and then solving a corresponding weight function may be a feasible way to develop practical WLRT. Theoretically, the most powerful test can always be found through this idea, but how to improve the compatibility of the function family model and whether the result from Fleming and Harrington’s method has an analytical expression and meets the regularity conditions are problems that seem to be contradictory sometimes and need to be solved. Compared with methods defining the weight function directly, deriving the weight function based on a specific hazard function family can connect the statistical inference to the actual survival scenarios closely and thus have better clinical interpretations. This study offers an example, but the application of such ideas should not be limited. It should be mentioned that all WLRT, including

C H

statistics, can be used for constructing combination tests according to the related large sample properties proposed by Fleming and Harrington [20], so a similar Max–Combo test (for example,

m a x {C H_{C}^{R}, G^{0, 0}}

) can also be constructed to improve the robustness.

No test statistic can remain the most powerful uniformly under all scenarios through the result of simulation, so the relatively high power under certain common scenarios and the property of robustness are very important. From this study, the performance of

m W L R T

is poor and only better than the standard log-rank test. Its high power under proportional hazard scenarios reflects that it is a rather conservative weight compared to other WLRT. The performance of

w L R T

is a little extreme in the simulation: very high power in some settings and very low power in others. This phenomenon may be associated with its relatively extreme weight compared to others and the high dependence on the prior information. Apparently, such features will influence its practicability when little information is known before the trial. Generally, the

C H

test is the recommended single WLRT of this paper. Its similar power to the standard log-rank test under proportional hazard and relative high power dealing with the delayed treatment effect ensures its robustness in testing various scenarios. Moreover, different forms of

g (t)

in (A1) can be considered to derive test statistics that are more powerful and generalized. Both

C H_{+}

and

C H_{C}

have their advantages and disadvantages in different settings, but

C H_{C}

has a smoother and simpler expression, which also inspires us to find better correction forms. The power of test using the prior information method is generally higher than the default selection method, but the latter one still remains at a high level. Even when there is little or no information about the pattern of the treatment effect, the

C H

test can ensure a relative high power and increase the efficiency. In practice, the decision on the selection of test methods should be based on the properties of such test statistics and conditions of known information, and data simulation is recommended to be conducted in advance if the conditions permit.

7. Conclusions

The

C H

hazard function has a certain significance for the modeling of delayed treatment effect with its better performance and clear parameter selection, and

C H

tests have good robustness in application. It not only has high power in different delayed treatment effect scenarios, but is also very powerful in proportional hazards scenarios, regardless of whether the prior information is known or not. Because of these good features and stable robustness,

C H

tests are helpful and meaningful in clinical trials when there is delayed treatment effect scenario.

Author Contributions

Conceptualization, K.Q. and X.Z.; Formal analysis, K.Q. and X.Z.; Methodology, K.Q.; Software, K.Q.; Supervision, X.Z.; Validation, K.Q.; Writing—original draft, K.Q.; Writing—review and editing, K.Q. and X.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research did not receive external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Properties of CH Hazard Function Family

Fix shape parameters

{ρ_{0}, γ_{0}}

and consider group parameters

{θ_{C}, θ_{T}}

, we compare the hazard function of two groups. Let

r (t)

be the hazard ratio of the control group to the treatment group:

\begin{matrix} r (t) & = \frac{λ_{C} (t)}{λ_{T} (t)} \\ = \frac{ρ_{0} e^{- (t + θ_{C})} + e^{t + θ_{C}}}{ρ_{0} e^{- (t + θ_{T})} + e^{t + θ_{T}}} . \end{matrix}

(1) When

θ_{C} + θ_{T} < l o g (ρ_{0})

, we have

\begin{matrix} r (t) \{\begin{matrix} < 1, & when 0 < t < l o g \sqrt{\frac{ρ_{0}}{e^{θ_{C} + θ_{T}}}}, \\ = 1, & when t = l o g \sqrt{\frac{ρ_{0}}{e^{θ_{C} + θ_{T}}}}, \\ > 1, & when t > l o g \sqrt{\frac{ρ_{0}}{e^{θ_{C} + θ_{T}}}} . \end{matrix} \end{matrix}

when

θ_{C} + θ_{T} \geq l o g (ρ_{0})

,

r (t) \geq 1

always hold, which is equivalent to

λ (t, θ_{C}) \geq λ (t, θ_{T})

.

The derivation of

r (t)

is

r^{'} (t) = (e^{θ_{C} - θ_{T}} - e^{θ_{T} - θ_{C}}) \frac{2 ρ_{0} e^{2 t + 2 θ_{T}}}{ρ_{0} + e^{2 t + 2 θ_{T}}} < 2 ρ_{0} (e^{θ_{C} - θ_{T}} - e^{θ_{T} - θ_{C}}),

so with certain selection of parameters,

r^{'} (t)

can be controlled within a relative small range and the range of

r (t)

can also be controlled near 1 in the beginning of the trial.

(2) As for the hazard ratio

r (t)

, we have

\begin{matrix} r (t) & = \frac{ρ_{0} e^{- (t + θ_{C})} + e^{t + θ_{C}}}{ρ_{0} e^{- (t + θ_{T})} + e^{t + θ_{T}}} \\ = \frac{ρ_{0} e^{θ_{T} - θ_{C}} - ρ_{0} e^{θ_{C} - θ_{T}}}{ρ_{0} + e^{2 t + 2 θ_{T}}} + e^{θ_{C} - θ_{T}}, \end{matrix}

so when

θ_{C} - θ_{T} > 0

,

r (t)

is monotonically increasing on

[0, + \infty)

and the hazard ratio of the treatment group to the control group monotonically decreases.

(3) When

t \to + \infty

, we have

\begin{matrix} lim_{t \to + \infty} r (t) & = lim_{t \to + \infty} \frac{ρ_{0} e^{- (t + θ_{C})} + e^{t + θ_{C}}}{ρ_{0} e^{- (t + θ_{T})} + e^{t + θ_{T}}} \\ = lim_{t \to + \infty} \frac{e^{t + θ_{C}}}{e^{t + θ_{T}}} \\ = e^{θ_{C} - θ_{T}} . \end{matrix}

With the increase in t, the contribution of term

ρ_{0} e^{- (t + θ_{i})}

to the value of

C H

family hazard function decreases, and the hazard ratio converges to the constant

e^{θ_{C} - θ_{T}}

. The constant is only determined by the group parameter

{θ_{C}, θ_{T}}

, and is independent with

{ρ_{0}, γ_{0}}

.

Properties (1)–(3) correspond to features 1–3 of the delayed treatment effect scenario, respectively, so the

C H

function family can simulate the delayed treatment effect scenario.

Appendix B. Construction of CH Test

Appendix B.1. Derivation of CH Weight Function

Fleming and Harrington proposed the method of deriving the most powerful WLRT weight function with a known survival function hazard [20]. If the distribution function of survival time has the following forms,

F_{θ} (t) = Φ {g (t) + θ}, t \in [0, + \infty)

(A1)

where g is a differentiable non-decreasing function from

[0, + \infty)

to

(- \infty, + \infty)

,

θ

belongs to the parameter function

Θ

on

(- \infty, + \infty)

, and

Φ

is a continuous distribution function with positive density

ϕ

having a derivative

ϕ^{'}

with continuous but finitely many points. Then, the statistics with maximal asymptotic efficacy against alternative hypothesis has a weight function proportional to

\frac{\partial}{\partial θ} l o g λ_{θ} {(t) |}_{θ = θ_{0}} = l^{'} [Φ^{- 1} {F_{θ_{0}} (t)}],

where

l = l o g λ

is the logarithm of the corresponding hazard function of the known survival function family, and

F_{θ_{0}} (t)

is the observed value of distribution function which can be approximated by the KM estimation of the survival function in practice. As for the

C H

hazard function family, the corresponding distribution function follows:

F_{θ} (t) = 1 - S_{θ} (t) = 1 - e x p (- \int_{0}^{t} γ \cdot (\frac{ρ}{e^{s + θ}} + e^{s + θ}) d s) .

Let

g (t) = t

and

Φ (t) = 1 - e x p (- \int_{0}^{t} γ \cdot (\frac{ρ}{e^{s}} + e^{s}) d s)

, so the

C H

distribution function with determined shape parameters

{ρ_{0}, γ_{0}}

and undetermined group parameter

θ

meet the condition of Fleming and Harrington’s method. We have

\begin{matrix} l^{'} (t) & = l o g^{'} λ_{θ} (t) \\ = l o g^{'} [γ \cdot (\frac{ρ}{e^{t}} + e^{t})] \\ = \frac{2}{ρ e^{- 2 t} + 1} - 1 . \end{matrix}

Solve the inverse function of the original distribution function without group parameter

θ

, such that

Φ (t) = 1 - e x p (- \int_{0}^{t} γ \cdot (\frac{ρ}{e^{s}} + e^{s}) d s) = 1 - {[e x p (ρ e^{- t} - e^{t} + 1 - ρ)]}^{γ}

, and we have

\begin{matrix} Φ^{- 1} (F_{θ_{0}} (t)) = l o g [\frac{1}{2} (\sqrt{m^{2} + 4 ρ} - m)] \end{matrix}

where,

m = \frac{1}{γ} l o g (1 - F_{θ_{0}} (t)) + ρ - 1 .

In practice, we replace

F_{θ_{0}} (t)

with the KM estimation:

m = \frac{1}{γ} l o g \hat{S} + ρ - 1 .

Substituting the inverse function in the upper formula, we can acquire the most theoretically powerful weight function under the scenarios with the

C H

hazard function family:

\begin{matrix} W_{C H} (t) & = \frac{{(\sqrt{m^{2} + 4 ρ} - m)}^{2} - 4 ρ}{{(\sqrt{m^{2} + 4 ρ} - m)}^{2} + 4 ρ} \\ = \frac{8 ρ}{4 ρ + {(\sqrt{m^{2} + 4 ρ} + m)}^{2}} - 1, \end{matrix}

where

m = \frac{1}{γ} l o g \hat{S} + ρ - 1 .

Appendix B.2. Asymptotic Distribution of CH Statistics

Fleming and Harrington proved the following theorem [20]:

Theorem A1.

Let

\hat{S} (t)

be the KM estimation in the pooled sample. Let f be a non-negative bounded continuous function with bounded variation on

[0, 1]

. Suppose

W_{K}

is a statistics of the class K with the form

W_{K} = \int K \frac{d {\bar{N}}_{1}}{\bar{Y_{1}}} - \int K \frac{d {\bar{N}}_{1}}{\bar{Y_{2}}} .

where

K (t) = {(\frac{n_{1} n_{2}}{n_{1} + n_{2}})}^{1 / 2} W (t) \frac{\bar{Y_{1}} (t) \bar{Y_{2}} (t)}{n_{1} n_{2}} \frac{n_{1} + n_{2}}{\bar{Y_{1}} (t) + \bar{Y_{2}} (t)}

and where

W (t) = f (\hat{S} (t))

.

Let

F_{1}

and

F_{2}

be the survival time distribution functions of two groups. When

F_{1}^{n} = F_{2}^{n} = F

for all n,

\frac{W_{K}}{\sqrt{{\hat{σ}}_{W_{K}}^{2}}} \overset{D}{\to} N (0, 1), as n \to \infty

where

{\hat{σ}}_{W_{K}}^{2} = \int (\frac{K^{2}}{{\bar{Y}}_{1}} + \frac{K^{2}}{{\bar{Y}}_{2}}) (1 - \frac{Δ {\bar{N}}_{1} + Δ {\bar{N}}_{2} - 1}{{\bar{Y}}_{1} + {\bar{Y}}_{2} - 1}) \frac{d ({\bar{N}}_{1} + {\bar{N}}_{2})}{{\bar{Y}}_{1} + {\bar{Y}}_{2}} .

However, the original version of the

C H

weight function in the (A1) may not meet the regulation conditions of Theorem A1. For example, when

ρ = 3

, as for

W (0)

, we have

m = l o g \hat{S} + ρ - 1 = l o g 1 + 3 - 1 = 2

,

W (0) = - 0.5 < 0

and the condition on function f of the non-negative cannot be met. Then we prove that the two correction forms of the original version

C H

weight function as (14) and (15) can meet the conditions of Theorem A1.

Proof.

As for the

W_{C H_{+}} (t) = m a x {0, 8 ρ / [4 ρ + {(\sqrt{m^{2} + 4 ρ} + m)}^{2}] - 1}

where

m = \frac{1}{γ} l o g \hat{S} + ρ - 1

, because

W_{C H} (t)

is a bounded continuous function on

[0, 1]

, it is easy to find that

W_{C H_{+}} (t) = m a x {0, W_{C H} (t)}

is also a bounded continuous function on

[0, 1]

, and

W_{C H_{+}} (t) \geq 0

always hold on

[0, 1]

. Therefore, the regulation conditions of the Theorem A1 are met. □

Proof.

As for the

W_{C H_{C}} (t) = 8 ρ^{2} / [4 ρ^{2} + {(\sqrt{m^{2} + 4 ρ} + m)}^{2}] - 1

where

m = \frac{1}{γ} l o g \hat{S} + ρ - 1

, because

\hat{S} \in [0, 1]

, we have

m = \frac{1}{γ} l o g \hat{S} + ρ - 1 \leq ρ - 1

. Therefore,

\begin{matrix} W_{C H_{C}} (t) & = \frac{8 ρ^{2}}{4 ρ^{2} + {(\sqrt{m^{2} + 4 ρ} + m)}^{2}} - 1 \\ \geq \frac{8 ρ^{2}}{4 ρ^{2} + {[\sqrt{{(ρ - 1)}^{2} + 4 ρ} + (ρ - 1)]}^{2}} - 1 \\ = \frac{8 ρ^{2}}{4 ρ^{2} + {(2 ρ)}^{2}} - 1 \\ = 0 . \end{matrix}

Obviously,

W_{C H_{C}} (t)

is a bounded continuous function on

[0, + \infty)

of t, and it is non-negative on

[0, 1]

, so

W_{C H_{C}} (t)

is also a weight function that meets the conditions of Theorem A1. □

In summary, the two correction forms of the

C H

weight function meet the conditions of Theorem A1. Thus, their asymptotic normality is ensured and the corresponding

C H

test can be constructed.

References

Peto, R.; Peto, J. Asymptotically Efficient Rank Invariant Test Procedures. J. R. Stat. Soc. Ser. A Gen. 1972, 135, 185–198. [Google Scholar] [CrossRef]
Cox, D.R. Regression Models and Life-Tables. J. R. Stat. Soc. Ser. B Stat. Methodol. 1972, 34, 187–202. [Google Scholar] [CrossRef]
Putter, H.; Sasako, M.; Hartgrink, H.H.; van de Velde, C.J.H.; van Houwelingen, J.C. Long-term survival with non-proportional hazards: Results from the Dutch Gastric cancer Trial. Stat. Med. 2005, 24, 2807–2821. [Google Scholar] [CrossRef]
Maia, M.C.; Hansen, A.R. A comprehensive review of immunotherapies in prostate cancer. Crit. Rev. Oncol. Hematol. 2017, 113, 292–303. [Google Scholar] [CrossRef]
Anagnostou, V.; Yarchoan, M.; Hansen, A.R.; Wang, H.; Verde, F.; Sharon, E.; Collyar, D.; Chow, L.Q.M.; Forde, P.M. Immuno-oncology Trial Endpoints: Capturing Clinically Meaningful Activity. Clin. Cancer Res. 2017, 23, 4959–4969. [Google Scholar] [CrossRef] [Green Version]
Borghaei, H.; Paz-Ares, L.; Horn, L.; Spigel, D.R.; Steins, M.; Ready, N.E.; Chow, L.Q.; Vokes, E.E.; Felip, E.; Holgado, E.; et al. Nivolumab versus Docetaxel in Advanced Nonsquamous Non-Small-Cell Lung Cancer. N. Engl. J. Med. 2015, 373, 1627–1639. [Google Scholar] [CrossRef]
Bellmunt, J.; Sonpavde, G.; De Wit, R.; Choueiri, T.K.; Siefker-Radtke, A.O.; Plimack, E.R.; Lewis, N.M.; Brown, H.; Mai, Y.B.; Gause, C.K.; et al. KEYNOTE-045: Randomized phase 3 trial of pembrolizumab (MK-3475) versus paclitaxel, docetaxel, or vinflunine for previously treated metastatic urothelial cancer. J. Clin. Oncol. 2015, 33, TPS4571. [Google Scholar] [CrossRef] [Green Version]
Kaufman, P.A.; Awada, A.; Twelves, C.; Yelle, L.; Perez, E.A.; Velikova, G.; Olivo, M.S.; He, Y.; Dutcus, C.E.; Cortes, J. Phase III Open-Label Randomized Study of Eribulin Mesylate versus Capecitabine in Patients with Locally Advanced or Metastatic Breast Cancer Previously Treated with an Anthracycline and a Taxane. J. Clin. Oncol. 2015, 33, 594–601. [Google Scholar] [CrossRef]
Eknoyan, G.; Beck, G.J.; Cheung, A.K.; Daugirdas, J.T.; Greene, T.; Kusek, J.W.; Allon, M.; Bailey, J.; Delmez, J.A.; Depner, T.A.; et al. Effect of dialysis dose and membrane flux in maintenance hemodialysis. N. Engl. J. Med. 2002, 347, 2010–2019. [Google Scholar] [CrossRef]
Cannon, C.P.; Braunwald, E. Intensive versus moderate lipid lowering with statins after acute coronary syndromes—Reply. N. Engl. J. Med. 2004, 351, 716–717. [Google Scholar]
Logan, B.R.; Klein, J.P.; Zhang, M.J. Comparing treatments in the presence of crossing survival curves: An application to bone marrow transplantation. Biometrics 2008, 64, 733–740. [Google Scholar] [CrossRef] [Green Version]
Samandari, T.; Agizew, T.B.; Nyirenda, S.; Tedla, Z.; Sibanda, T.; Shang, N.; Mosimaneotsile, B.; Motsamai, O.I.; Bozeman, L.; Davis, M.K.; et al. 6-month versus 36-month isoniazid preventive treatment for tuberculosis in adults with HIV infection in Botswana: A randomised, double-blind, placebo-controlled trial. Lancet 2011, 377, 1588–1598. [Google Scholar] [CrossRef]
Lin, R.S.; Lin, J.; Roychoudhury, S.; Anderson, K.M.; Hu, T.L.; Huang, B.; Leon, L.F.; Liao, J.J.Z.; Liu, R.; Luo, X.D.; et al. Alternative Analysis Methods for Time to Event Endpoints Under Nonproportional Hazards: A Comparative Analysis. Stat. Biopharm. Res. 2020, 12, 187–198. [Google Scholar] [CrossRef] [Green Version]
Roychoudhury, S.; Anderson, K.M.; Ye, J.; Mukhopadhyay, P. Robust Design and Analysis of Clinical Trials with Nonproportional Hazards: A Straw Man Guidance from a Cross-Pharma Working Group. Stat. Biopharm. Res. 2021, 1–15. [Google Scholar] [CrossRef]
Su, Z.; Zhu, M. Is it time for the weighted log-rank test to play a more important role in confirmatory trials? Contemp. Clin. Trials Commun. 2018, 10, A1–A2. [Google Scholar] [CrossRef]
Fleming, T.R.; Harrington, D.P. A class of hypothesis tests for one and two sample censored survival data. Commun. Stat. Theory Methods 1981, 10, 763–794. [Google Scholar] [CrossRef]
Royston, P.; Parmar, M.K.B. The use of restricted mean survival time to estimate the treatment effect in randomized clinical trials when the proportional hazards assumption is in doubt. Stat. Med. 2011, 30, 2409–2421. [Google Scholar] [CrossRef]
Tian, L.; Zhao, L.H.; Wei, L.J. Predicting the restricted mean event time with the subject’s baseline covariates in survival analysis. Biostatistics 2014, 15, 222–233. [Google Scholar] [CrossRef]
Bartlett, J.W.; Morris, T.P.; Stensrud, M.J.; Daniel, R.M.; Vansteelandt, S.K.; Burman, C.F. The Hazards of Period Specific and Weighted Hazard Ratios. Stat. Biopharm. Res. 2020, 12, 518–519. [Google Scholar] [CrossRef]
Fleming, T.R.; Harrington, D.P. Counting Processes and Survival Analysis; Wiley-Interscience: Hoboken, NJ, USA, 2005; pp. 1–545. [Google Scholar]
Kaplan, E.L.; Meier, P. Nonparametric-Estimation from Incomplete Observations. J. Am. Stat. Assoc. 1958, 53, 457–481. [Google Scholar] [CrossRef]
Magirr, D.; Burman, C.F. Modestly weighted logrank tests. Stat. Med. 2019, 38, 3782–3790. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Yu, C.; Huang, X.; Hui, N.A.; He, P.L. A weighted log-rank test and associated effect estimator for cancer trials with delayed treatment effect. Pharm. Stat. 2021, 20, 528–550. [Google Scholar] [CrossRef]
Breslow, N.E.; Edler, L.; Berger, J. A 2-Sample Censored-Data Rank Test for Acceleration. Biometrics 1984, 40, 1049–1062. [Google Scholar] [CrossRef] [PubMed]
Xu, Z.Z.; Zhen, B.G.; Park, Y.; Zhu, B. Designing therapeutic cancer vaccine trials with delayed treatment effect. Stat. Med. 2017, 36, 592–605. [Google Scholar] [CrossRef] [Green Version]
Zucker, D.M.; Lakatos, E. Weighted Log Rank Type Statistics for Comparing Survival Curves When There Is a Time-Lag in the Effectiveness of Treatment. Biometrika 1990, 77, 853–864. [Google Scholar] [CrossRef]
Yang, S.; Prentice, R. Semiparametric analysis of short-term and long-term hazard ratios with two-sample survival data. Biometrika 2005, 92, 1–17. [Google Scholar] [CrossRef]
Callegaro, A.; Spiessens, B. Testing Treatment Effect in Randomized Clinical Trials with Possible Nonproportional Hazards. Stat. Biopharm. Res. 2017, 9, 204–211. [Google Scholar] [CrossRef]
Lee, S.H. On the versatility of the combination of the weighted log-rank statistics. Comput. Stat. Data Anal. 2007, 51, 6557–6564. [Google Scholar] [CrossRef]
Karrison, T.G. Versatile tests for comparing survival curves based on weighted log-rank statistics. Stata J. 2016, 16, 678–690. [Google Scholar] [CrossRef] [Green Version]
Hoos, A. Evolution of end points for cancer immunotherapy trials. Ann. Oncol. 2012, 23, 47–52. [Google Scholar] [CrossRef]
Fleming, T.R.; Harrington, D.P.; Osullivan, M. Supremum Versions of the Log-Rank and Generalized Wilcoxon Statistics. J. Am. Stat. Assoc. 1987, 82, 312–320. [Google Scholar] [CrossRef]
Lee, J.W. Some versatile tests based on the simultaneous use of weighted log-rank statistics. Biometrics 1996, 52, 721–725. [Google Scholar] [CrossRef]
Efron, B. Logistic Regression, Survival Analysis, and the Kaplan-Meier Curve. J. Am. Stat. Assoc. 1988, 83, 414–425. [Google Scholar] [CrossRef]
Nahman, N.S.; Middendorf, D.F.; Bay, W.H.; Mcelligott, R.; Powell, S.; Anderson, J. Modification of the Percutaneous Approach to Peritoneal-Dialysis Catheter Placement under Peritoneoscopic Visualization—Clinical-Results in 78 Patients. J. Am. Soc. Nephrol. 1992, 3, 103–107. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Survival curves with

C H

function family with parameters settings:

ρ = 2

,

γ = 0.1

(a)

θ_{C} = - 0.8, θ_{T} = 0

(b)

θ_{C} = - 0.4, θ_{T} = 0.4

(c)

θ_{C} = 0, θ_{T} = 0.8

(d)

θ_{C} = - 0.8, θ_{T} = 0.8

.

Figure 1. Survival curves with

C H

function family with parameters settings:

ρ = 2

,

γ = 0.1

(a)

θ_{C} = - 0.8, θ_{T} = 0

(b)

θ_{C} = - 0.4, θ_{T} = 0.4

(c)

θ_{C} = 0, θ_{T} = 0.8

(d)

θ_{C} = - 0.8, θ_{T} = 0.8

.

Figure 2. Survival curves with

C H

function family with parameter settings:

γ = 0.1

,

θ_{C} = - 0.4

,

θ_{T} = 0.4

(a)

ρ = 1

(b)

ρ = 2

(c)

ρ = 3

(d)

ρ = 4

.

Figure 2. Survival curves with

C H

function family with parameter settings:

γ = 0.1

,

θ_{C} = - 0.4

,

θ_{T} = 0.4

(a)

ρ = 1

(b)

ρ = 2

(c)

ρ = 3

(d)

ρ = 4

.

Figure 3. Survival curves with

C H

function family with parameters settings:

ρ = 2

,

θ_{C} = - 0.4

,

θ_{T} = 0.4

(a)

γ = 0.05

, (b)

γ = 0.1

, (c)

γ = 0.2

, (d)

γ = 0.4

.

Figure 3. Survival curves with

C H

function family with parameters settings:

ρ = 2

,

θ_{C} = - 0.4

,

θ_{T} = 0.4

(a)

γ = 0.05

, (b)

γ = 0.1

, (c)

γ = 0.2

, (d)

γ = 0.4

.

Figure 4. Survival curves of

C H {2, 0.1}

family.

Figure 4. Survival curves of

C H {2, 0.1}

family.

Figure 5. Empirical power of tests in simple settings.

Figure 6. Empirical power of tests in clinical trial settings.

Figure 7. Kaplan–Meier curves of head-and-neck-cancer patients survival time.

Figure 8. Kaplan–Meier curves of time to infection of kidney dialysis patients.

Table 1. Hazard functions of scenarios under null hypothesis.

Scenarios	$0 \leq t < 5$	$5 \leq t < 10$	$10 \leq t$
$N H_{1}$	0.1	0.1	0.1
$N H_{2}$	0.04	0.08	0.12
$N H_{3}$	0.1	0.08	0.05

Table 2. Type I error rates.

Scenarios	n	${CH}_{+}^{P}$	${CH}_{+}^{D}$	${CH}_{C}^{P}$	${CH}_{C}^{D}$	$wLRT$	$mWLRT$	$G^{0, 1}$	$G^{0, 2}$	$G^{0, 0}$
$N H_{1}$	50	0.064	0.064	0.062	0.062	0.067	0.065	0.072	0.081	0.058
	100	0.054	0.054	0.051	0.051	0.055	0.051	0.057	0.062	0.049
	200	0.050	0.049	0.052	0.052	0.055	0.052	0.049	0.050	0.051
$N H_{2}$	50	0.059	0.058	0.059	0.059	0.057	0.059	0.066	0.079	0.057
	100	0.054	0.053	0.052	0.052	0.053	0.051	0.057	0.063	0.051
	200	0.050	0.049	0.052	0.052	0.055	0.052	0.049	0.050	0.051
$N H_{3}$	50	0.061	0.061	0.059	0.059	0.062	0.062	0.068	0.079	0.058
	100	0.055	0.053	0.055	0.055	0.055	0.053	0.058	0.063	0.054
	200	0.050	0.049	0.052	0.052	0.055	0.052	0.049	0.050	0.051

Table 3. Hazard functions of scenarios in simple settings.

Scenario	Group	$0 \leq t < 0.5$	$0.5 \leq t < 1$	$1 \leq t$
$D T_{1}$	Control	2	4	4
$D T_{1}$	Treatment	2	0.4	0.4
$D T_{2}$	Control	1	2	4
$D T_{2}$	Treatment	1	1.4	1.8
$D T_{3}$	Control	$C H_{θ = 0.1} {3, 0.05$ }
$D T_{3}$	Treatment	$C H_{θ = - 0.1} {3, 0.05$ }
$D T_{4}$	Control	$C H_{θ = 0.11} {4, 0.02$ }
$D T_{4}$	Treatment	$C H_{θ = - 0.11} {4, 0.02$ }
$P H_{1}$	Control	1.4	1.4	1.4
$P H_{1}$	Treatment	1.0	1.0	1.0
$P H_{2}$	Control	1.2	1.5	1.8
$P H_{2}$	Treatment	0.8	1.0	1.2

Table 4. Empirical power in simple settings.

Scenarios	${CH}_{+}^{P}$	${CH}_{+}^{D}$	${CH}_{C}^{P}$	${CH}_{C}^{D}$	$wLRT$	$mWLRT$	$G^{0, 1}$	$G^{0, 2}$	$G^{0, 0}$
$D T_{1}$	1.000	0.997	1.000	0.991	1.000	1.000	1.000	1.0000	0.947
$D T_{2}$	0.663	0.629	0.755	0.582	0.687	0.537	0.705	0.748	0.455
$D T_{3}$	0.868	0.844	0.819	0.845	0.863	0.524	0.704	0.395	0.353
$D T_{4}$	0.872	0.723	0.835	0.831	0.420	0.612	0.570	0.257	0.551
$P H_{1}$	0.803	0.714	0.699	0.768	0.685	0.803	0.691	0.578	0.803
$P H_{2}$	0.792	0.709	0.687	0.749	0.689	0.792	0.679	0.564	0.792

Table 5. Hazard functions of scenarios in clinical trial settings.

Scenario	Group	$0 \leq t < 3$	$3 \leq t < 7$	$7 \leq t$
$C - D T_{1}$	Control	0.104	0.161	0.161
$C - D T_{1}$	Treatment	0.103	0.077	0.077
$C - D T_{2}$	Control	0.226	0.222	0.222
$C - D T_{2}$	Treatment	0.210	0.079	0.079
$C - D T_{3}$	Control	0.104	0.161	0.140
$C - D T_{3}$	Treatment	0.103	0.077	0.168
$C - D T_{4}$	Control	0.104	0.161	0.161
$C - D T_{4}$	Treatment	0.103	0.077	0.137
$C - D T_{5}$	Control	0.072	0.072	0.223
$C - D T_{5}$	Treatment	0.072	0.072	0.112
$C - D T_{6}$	Control	0.097	0.097	0.097
$C - D T_{6}$	Treatment	0.103	0.065	0.049
$C - P H_{1}$	Control	0.121	0.121	0.121
$C - P H_{1}$	Treatment	0.083	0.083	0.083
$C - P H_{2}$	Control	0.080	0.105	0.140
$C - P H_{2}$	Treatment	0.056	0.074	0.098

Table 6. Empirical power in clinical trial settings.

Scenarios	${CH}_{+}^{P}$	${CH}_{+}^{D}$	${CH}_{C}^{P}$	${CH}_{C}^{D}$	$wLRT$	$mWLRT$	$G^{0, 1}$	$G^{0, 2}$	$G^{0, 0}$
$C - D T_{1}$	0.983	0.979	0.965	0.965	0.978	0.947	0.978	0.963	0.916
$C - D T_{2}$	0.972	0.904	0.931	0.867	0.974	0.928	0.951	0.972	0.801
$C - D T_{3}$	0.405	0.416	0.421	0.420	0.223	0.409	0.265	0.118	0.368
$C - D T_{4}$	0.7300	0.726	0.702	0.696	0.602	0.662	0.630	0.457	0.604
$C - D T_{5}$	0.880	0.792	0.796	0.726	0.835	0.618	0.841	0.8672	0.590
$C - D T_{6}$	0.865	0.846	0.796	0.793	0.890	0.724	0.861	0.871	0.655
$C - P H_{1}$	0.623	0.659	0.726	0.721	0.545	0.744	0.647	0.507	0.746
$C - P H_{2}$	0.561	0.595	0.650	0.654	0.547	0.698	0.582	0.456	0.700

Table 7. p-values of the tests for the head-and-neck cancer data.

	${CH}_{+}^{P}$	${CH}_{+}^{R}$	${CH}_{C}^{P}$	${CH}_{C}^{R}$	$wLRT$	$mWLRT$	$G^{0, 1}$	$G^{0, 2}$	$G^{0, 0}$
p-value	0.018	0.021	0.019	0.022	0.017	0.020	0.014	0.012	0.022

Table 8. p-values of the tests for the kidney infection data.

	${CH}_{+}^{P}$	${CH}_{+}^{R}$	${CH}_{C}^{P}$	${CH}_{C}^{R}$	$wLRT$	$mWLRT$	$G^{0, 1}$	$G^{0, 2}$	$G^{0, 0}$
p-value	0.008	0.005	0.019	0.005	0.002	0.067	0.005	0.009	0.112

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Qian, K.; Zhou, X. Weighted Log-Rank Test for Clinical Trials with Delayed Treatment Effect Based on a Novel Hazard Function Family. Mathematics 2022, 10, 2573. https://doi.org/10.3390/math10152573

AMA Style

Qian K, Zhou X. Weighted Log-Rank Test for Clinical Trials with Delayed Treatment Effect Based on a Novel Hazard Function Family. Mathematics. 2022; 10(15):2573. https://doi.org/10.3390/math10152573

Chicago/Turabian Style

Qian, Kaihuan, and Xiaohua Zhou. 2022. "Weighted Log-Rank Test for Clinical Trials with Delayed Treatment Effect Based on a Novel Hazard Function Family" Mathematics 10, no. 15: 2573. https://doi.org/10.3390/math10152573

APA Style

Qian, K., & Zhou, X. (2022). Weighted Log-Rank Test for Clinical Trials with Delayed Treatment Effect Based on a Novel Hazard Function Family. Mathematics, 10(15), 2573. https://doi.org/10.3390/math10152573

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Weighted Log-Rank Test for Clinical Trials with Delayed Treatment Effect Based on a Novel Hazard Function Family

Abstract

1. Introduction

2. $CH$ Hazard Function Family

2.1. Definition

2.2. Properties

3. $CH$ Weight Function and CH Test

3.1. CH Class Weight Function

3.2. Selection of Parameters

3.2.1. Prior Information Method

3.2.2. Default Selection Method

4. Simulation Study

4.1. Type I Error Rate

4.2. Simple Settings

4.3. Clinical Trial Settings

5. Application in Real Studies

5.1. Head-and-Neck-Cancer Study

5.2. Kidney Infection Study

6. Discussion

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A. Properties of CH Hazard Function Family

Appendix B. Construction of CH Test

Appendix B.1. Derivation of CH Weight Function

Appendix B.2. Asymptotic Distribution of CH Statistics

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Weighted Log-Rank Test for Clinical Trials with Delayed Treatment Effect Based on a Novel Hazard Function Family

Abstract

1. Introduction

2. CH Hazard Function Family

2.1. Definition

2.2. Properties

3. CH Weight Function and CH Test

3.1. CH Class Weight Function

3.2. Selection of Parameters

3.2.1. Prior Information Method

3.2.2. Default Selection Method

4. Simulation Study

4.1. Type I Error Rate

4.2. Simple Settings

4.3. Clinical Trial Settings

5. Application in Real Studies

5.1. Head-and-Neck-Cancer Study

5.2. Kidney Infection Study

6. Discussion

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A. Properties of CH Hazard Function Family

Appendix B. Construction of CH Test

Appendix B.1. Derivation of CH Weight Function

Appendix B.2. Asymptotic Distribution of CH Statistics

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

2. $CH$ Hazard Function Family

3. $CH$ Weight Function and CH Test