A Goodness-of-Fit Test for Log-Linearity in Cox Proportional Hazards Model Under Monotonic Covariate Effects

Chen, Huan; Tang, Chuan-Fa

doi:10.3390/math13142264

by Huan Chen † and Chuan-Fa Tang *,†

Department of Mathematical Sciences, University of Texas at Dallas, Richardson, TX 75080, USA

* Author to whom correspondence should be addressed.

† These authors contributed equally to this work.

Mathematics 2025, 13(14), 2264; https://doi.org/10.3390/math13142264

Submission received: 30 December 2024 / Revised: 30 June 2025 / Accepted: 1 July 2025 / Published: 14 July 2025

(This article belongs to the Special Issue Statistical Analysis and Data Science for Complex Data)

Abstract

The Cox proportional hazards (PH) model is widely used because it relates covariates to the hazard through a log-linear effect. However, more flexible effects become desirable within the Cox PH framework when only a monotonic relationship between a covariate and the hazard can be assumed. This work proposes a partial-likelihood-based goodness-of-fit (GOF) test to assess the log-linear effect assumption in a univariate Cox PH model. Rejection of log-linearity suggests the need to incorporate monotonic, non-log-linear covariate effects on the hazard. Our simulation studies show that the proposed GOF test controls type I error rates and exhibits consistency across various scenarios. We illustrate the proposed GOF test with two datasets, breast cancer data and lung cancer data, to assess the presence of log-linear effects in the Cox PH model.

Keywords:

bootstrap; isotonic proportional hazard model; isotonic regression; Kaplan–Meier estimator; likelihood ratio test; partial likelihood

MSC:

92B15

1. Introduction

The Cox proportional hazards (PH) model [1,2] is one of the most commonly applied survival analysis models because it associates covariates with the survival time through the hazard function. The hazard function is the instantaneous risk of experiencing an event of interest, such as death, disease, or failure, given that the individual has survived to that point. Because this quantity is of direct interest in many fields, the Cox PH model is widely used in areas such as biology, medicine, and the social sciences.

The hazard function in the Cox PH model with time-independent covariates has the following functional form:

$$\lambda(t \mid Z) = \lambda_{0}(t) \exp(Z^{\top} \beta), \tag{1}$$

where $Z$ is a $p \times 1$ vector of covariates, $\beta$ is a $p \times 1$ coefficient vector, and $\lambda_{0}(t)$ is a baseline hazard function. In (1), proportional hazards and log-linearity of the hazard are two key assumptions of practical interest. The proportional hazards assumption states that the hazard ratio between any two individuals depends only on their covariate values and the model coefficient $\beta$, and not on time. On the other hand, taking the logarithm of (1) gives

$$\log \lambda(t \mid Z) = \log \lambda_{0}(t) + Z^{\top} \beta.$$

The log-linearity assumption enables researchers to interpret each coefficient as the linear effect of a unit change in the corresponding covariate on the log of the hazard function.

While most of the existing literature focuses on testing the proportional hazards assumption [3,4], our primary focus is on validating the log-linearity assumption. This assumption is critical, as assessing its validity provides a foundation for exploring alternative monotonic effects when the log-linear effect may not hold. To this end, we propose a goodness-of-fit (GOF) test designed to evaluate the log-linearity assumption in the Cox PH model.

In the literature, martingale-based residual methods have been widely used to investigate and test the GOF for log-linearity. Therneau et al. [5] introduced a graphical approach using smoothed martingale residuals to examine the functional form of covariate effects. Lin et al. [6] proposed an analytical test for log-linearity based on partial cumulative sums of martingale residuals, providing asymptotic properties, such as the limiting Gaussian process of the residuals and the null distribution of the test statistics. However, the existing literature has not explicitly addressed testing log-linearity against monotonic effects.

In this work, we consider a univariate covariate $Z$ and a monotonic function $\phi$ to explore the monotonic effect. Relaxing $Z \beta$ in the Cox PH model to $\phi(Z)$, Chung et al. [7] considered the isotonic PH model with the hazard function

$$\lambda(t \mid Z) = \lambda_{0}(t) \exp\{\phi(Z)\},$$

which maintains the proportional hazards assumption. Similar to the estimation process in the traditional Cox PH model, Chung et al. [7] developed a nonparametric partial-likelihood-based estimator of $\phi$. Therefore, a ratio of partial likelihoods from the Cox and isotonic PH models is a natural test statistic for checking log-linearity. Inspired by Xu et al. [8], we propose a bootstrap method conditional on covariates and censoring times to determine the critical values.

The remaining sections are organized as follows. Section 2 reviews estimation in the Cox PH model and the isotonic PH model for a univariate time-independent covariate. In Section 3, we propose the GOF test for the Cox PH model under monotonicity constraints. We provide a numerical study in Section 4 to evaluate the performance of the GOF test. Lastly, in Section 5, we apply the proposed GOF test to two data examples: a lung cancer study conducted by the North Central Cancer Treatment Group [9] and a breast cancer study conducted by the German Breast Cancer Study Group, both available in the R package survival. All R code is available on GitHub at https://github.com/cftang9/LLGOF_UniCoxPH (accessed on 30 June 2025).

2. Partial Likelihood Estimators

We assume that the survival time $T_{i}$ follows a continuous distribution $F(t \mid Z_{i})$, $t \geq 0$, associated with a covariate $Z_{i} \in \mathbb{R}$, for $i = 1, \dots, n$. Let $C_{i}$ denote the censoring time. We observe $X_{i} = \min\{T_{i}, C_{i}\}$ together with the censoring indicator $\Delta_{i} = I(T_{i} \leq C_{i})$, where $I$ is the indicator function. We assume that $T_{i}$ and $C_{i}$ are independent and that $C_{i}$ follows a distribution $G$. Thus, the observed data consist of the triplets $\{(X_{i}, \Delta_{i}, Z_{i})\}_{i = 1}^{n}$.

A univariate Cox PH model relates the hazard of $T_{i}$ and the covariate $Z_{i}$ as follows:

$$\lambda(t \mid Z_{i}) = \lambda_{0}(t) \exp(Z_{i} \beta), \tag{2}$$

where $\beta \in \mathbb{R}$ is a scalar coefficient and $\lambda_{0}(t)$ is a baseline hazard function that does not depend on the covariate. The parameter $\beta$ is estimated by maximizing the partial likelihood $L_{\mathrm{Cox}}$, given by

$$L_{\mathrm{Cox}}(\beta) = \prod_{i = 1}^{n} \prod_{t \geq 0} \left[ \frac{\exp(Z_{i} \beta)}{\sum_{j = 1}^{n} Y_{j}(t) \exp(Z_{j} \beta)} \right]^{d N_{i}(t)}, \tag{3}$$

where $N_{i}(t) = I(X_{i} \leq t, \Delta_{i} = 1)$ are the counting processes indicating whether the event of interest has occurred at or before time $t$, and $Y_{i}(t) = I(X_{i} \geq t)$ are the at-risk processes indicating that neither the event of interest nor censoring has occurred before time $t$. The log partial likelihood, denoted by $\ell_{\mathrm{Cox}}(\beta)$, is given by

$$\ell_{\mathrm{Cox}}(\beta) = \sum_{i = 1}^{n} \int_{0}^{\infty} \left\{ Z_{i} \beta - \log \sum_{j = 1}^{n} Y_{j}(u) \exp(Z_{j} \beta) \right\} d N_{i}(u). \tag{4}$$

The maximum partial likelihood estimator (MPLE) is denoted by $\hat{\beta} = \arg\max_{\beta \in \mathbb{R}} \ell_{\mathrm{Cox}}(\beta)$ and can be obtained numerically by the Newton–Raphson method.
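As a minimal sketch of how the log partial likelihood (4) and a Newton–Raphson search for $\hat{\beta}$ might be coded, the Python below uses only the standard library. It is not the authors' R implementation: the function names, the toy data, the assumption of no tied event times, and the step-halving safeguard are all illustrative choices made here.

```python
import math


def cox_loglik_derivs(beta, x, delta, z):
    """Return the log partial likelihood (4) and its first two derivatives in beta."""
    n = len(x)
    loglik = score = hess = 0.0
    for i in range(n):
        if delta[i] != 1:
            continue  # dN_i is nonzero only for observed events
        # Risk set at time X_i: subjects j with Y_j(X_i) = I(X_j >= X_i) = 1.
        risk = [j for j in range(n) if x[j] >= x[i]]
        w = [math.exp(z[j] * beta) for j in risk]
        s0 = sum(w)
        s1 = sum(wj * z[j] for wj, j in zip(w, risk))
        s2 = sum(wj * z[j] ** 2 for wj, j in zip(w, risk))
        loglik += z[i] * beta - math.log(s0)
        score += z[i] - s1 / s0
        hess -= s2 / s0 - (s1 / s0) ** 2
    return loglik, score, hess


def cox_mple(x, delta, z, tol=1e-8, max_iter=100):
    """Newton-Raphson for the univariate MPLE, started at beta = 0."""
    beta = 0.0
    for _ in range(max_iter):
        loglik, score, hess = cox_loglik_derivs(beta, x, delta, z)
        if hess >= 0.0:
            break  # no curvature information, so no Newton step is available
        step = -score / hess
        # Step halving (an added safeguard): never accept a decrease in (4).
        while abs(step) > tol and cox_loglik_derivs(beta + step, x, delta, z)[0] < loglik:
            step /= 2.0
        beta += step
        if abs(step) < tol:
            break
    return beta


# Hypothetical toy data: observed times, event indicators, covariates.
x = [5.0, 3.0, 8.0, 1.0, 6.0, 2.0]
delta = [1, 1, 0, 1, 1, 0]
z = [0.2, 1.5, 0.1, 1.9, 0.7, 1.0]
beta_hat = cox_mple(x, delta, z)
```

Because (4) is concave in $\beta$, a root of the score found this way is the maximizer; in practice an established routine such as `coxph` in the R package survival would normally be preferred over a hand-written loop.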

When a monotonic covariate effect is preferable to a log-linear effect in (2), it is natural to relax $Z_{i} \beta$ to $\phi(Z_{i})$, where $\phi$ is monotonic. Hence, $\lambda(t \mid Z_{i})$ can be relaxed to

$$\lambda(t \mid Z_{i}) = \lambda_{0}(t) \exp\{\phi(Z_{i})\}, \tag{5}$$

which is also known as the isotonic PH model [7]. Without loss of generality, we assume that $\phi(\cdot)$ is a non-decreasing function. Under the isotonic PH model, the partial likelihood is slightly different. Define $Z_{(i)}^{\star}$ as the $i$th order statistic of the covariates $Z_{i}^{\star}$ of the uncensored subjects. Given a monotonic function $\phi$, define $\phi_{i}^{\star} = \phi(Z_{(i)}^{\star})$ as the value of $\phi$ at $Z_{(i)}^{\star}$ for $i = 1, \dots, n^{\star}$, where $n^{\star}$ is the number of uncensored subjects, so that $\phi_{1}^{\star} \leq \dots \leq \phi_{n^{\star}}^{\star}$. We further denote $\phi^{\star} = (\phi_{1}^{\star}, \dots, \phi_{n^{\star}}^{\star})^{\top}$ as the vector of parameters $\phi_{i}^{\star}$, and define $I_{i}^{\star} = [Z_{(i)}^{\star}, Z_{(i + 1)}^{\star})$ for $i = 1, \dots, n^{\star} - 1$ and $I_{n^{\star}}^{\star} = [Z_{(n^{\star})}^{\star}, \infty)$, which are the intervals formed by the order statistics $Z_{(i)}^{\star}$. Chung et al. [7] proposed the partial likelihood for the isotonic PH model (5) as

$$L_{\mathrm{Iso}}(\phi^{\star}) = \prod_{i = 1}^{n^{\star}} \prod_{t \geq 0} \left[ \frac{\exp(\phi_{i}^{\star})}{\sum_{j = 1}^{n^{\star}} Y_{j}^{\star}(t) \exp(\phi_{j}^{\star})} \right]^{d N_{i}^{\star}(t)}, \tag{6}$$

where the counting process is $N_{i}^{\star}(t) = \sum_{i = 1}^{n} I(Z_{(i)}^{\star} \leq t)$ and the at-risk process is $Y_{j}^{\star}(t) = \sum_{h \in R_{j}} Y_{h}(t)$, with $R_{j} = \{h : Z_{h} \in I_{j}^{\star}, h = 1, \dots, n\}$. Hence, the log partial likelihood can be defined accordingly as

$$\ell_{\mathrm{Iso}}(\phi^{\star}) = \sum_{i = 1}^{n^{\star}} \int_{0}^{\infty} \left\{ \phi_{i}^{\star} - \log \sum_{j = 1}^{n^{\star}} Y_{j}^{\star}(u) \exp(\phi_{j}^{\star}) \right\} d N_{i}^{\star}(u). \tag{7}$$

Note that $\ell_{\mathrm{Iso}}(\phi^{\star}) = \ell_{\mathrm{Iso}}(\phi^{\star} + c)$ for any constant $c$. To resolve this identifiability issue, it suffices to restrict the monotonic function $\phi$ to satisfy $\phi(Z_{K}) = 0$ for some $Z_{K}$. In this work, we take $Z_{K}$ to be the median of the covariates $Z_{i}$, as in Chung et al. [7]. They also developed a computationally efficient pseudo-iterative convex minorant algorithm to obtain the MPLE, denoted by $\hat{\phi}^{\star}$.
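The shift invariance noted above can be checked directly from (7). The short derivation below is a sketch that only adds a constant $c$ to every component of $\phi^{\star}$ and uses $\exp(\phi_{j}^{\star} + c) = e^{c} \exp(\phi_{j}^{\star})$:

```latex
\begin{aligned}
\ell_{\mathrm{Iso}}(\phi^{\star} + c)
 &= \sum_{i=1}^{n^{\star}} \int_{0}^{\infty}
    \Big\{ \phi_{i}^{\star} + c
    - \log \sum_{j=1}^{n^{\star}} Y_{j}^{\star}(u) \exp(\phi_{j}^{\star} + c) \Big\}
    \, dN_{i}^{\star}(u) \\
 &= \sum_{i=1}^{n^{\star}} \int_{0}^{\infty}
    \Big\{ \phi_{i}^{\star} + c - c
    - \log \sum_{j=1}^{n^{\star}} Y_{j}^{\star}(u) \exp(\phi_{j}^{\star}) \Big\}
    \, dN_{i}^{\star}(u) \\
 &= \ell_{\mathrm{Iso}}(\phi^{\star}).
\end{aligned}
```

The constant cancels inside every integrand, so only differences between the $\phi_{i}^{\star}$ are identified, which is why an anchoring constraint such as $\phi(Z_{K}) = 0$ is needed.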

3. Goodness-of-Fit Test

Here, we propose a GOF test for log-linearity in the Cox PH model (2) within the isotonic PH model (5). The null and alternative hypotheses, $H_{0}$ and $H_{1}$, respectively, are defined as follows:

$$\begin{aligned} H_{0} &: \phi(Z) = Z \beta \text{ for some } \beta \geq 0; \\ H_{1} &: \phi(Z) \text{ is increasing in } Z, \text{ but not } H_{0}. \end{aligned} \tag{8}$$

Given a sample $\{(X_{i}, \Delta_{i}, Z_{i})\}_{i = 1}^{n}$, we have the MPLEs $\hat{\beta}$ and $\hat{\phi}^{\star}$, which maximize the log partial likelihoods (4) and (7) under the Cox PH model (2) and the isotonic PH model (5), respectively. Under $H_{0}$, it is natural to consider the restricted MPLE $\hat{\beta}_{+} = \max\{\hat{\beta}, 0\}$. Plugging the MPLEs into their corresponding log partial likelihoods, we propose the partial-likelihood-based test statistic

$$T_{n} = \ell_{\mathrm{Iso}}(\hat{\phi}^{\star}) - \ell_{\mathrm{Cox}}(\hat{\beta}_{+}). \tag{9}$$

Since the linear relationship $Z \beta$ with $\beta \geq 0$ under $H_{0}$ is a special case of the monotone relationship $\phi(Z)$ under $H_{0} \cup H_{1}$, it is expected that $\ell_{\mathrm{Iso}}(\hat{\phi}^{\star}) \geq \ell_{\mathrm{Cox}}(\hat{\beta}_{+})$ and $T_{n} \geq 0$. Therefore, large values of $T_{n}$ suggest rejecting $H_{0}$. On the other hand, if $\phi$ in (8) is decreasing in $Z$, the restricted MPLE $\hat{\beta}_{+}$ is replaced by the restricted MPLE $\hat{\beta}_{-} = \min\{0, \hat{\beta}\}$, and $\hat{\phi}^{\star}$ is computed under the constraint $\phi_{1}^{\star} \geq \dots \geq \phi_{n^{\star}}^{\star}$. The test statistic is then $T_{n} = \ell_{\mathrm{Iso}}(\hat{\phi}^{\star}) - \ell_{\mathrm{Cox}}(\hat{\beta}_{-})$, and log-linearity is again rejected when $T_{n}$ is too large.

We propose a bootstrap method to determine a critical value for $T_{n}$ to reject $H_{0}$. Inspired by Xu et al. [8], we consider a bootstrap approach conditional on covariates and censoring times under the null hypothesis $H_{0}$. To generate the bootstrap survival times, we apply the inverse distribution function method. Note that the survival function $S(t \mid Z_{i}) = 1 - F(t \mid Z_{i})$ of the Cox PH model is

$$S(t \mid Z_{i}) = \exp\{- \Lambda_{0}(t) \exp(Z_{i} \beta)\},$$

where $\Lambda_{0}(t) = \int_{0}^{t} \lambda_{0}(u) \, du$ is the cumulative baseline hazard function. Therefore, provided that it exists, the inverse of $S(t \mid Z_{i})$ is given by

$$S^{-1}(u \mid Z_{i}) = \Lambda_{0}^{-1}\{- \log(u) \exp(- Z_{i} \beta)\}$$

for $u \in (0, 1]$, where $\Lambda_{0}^{-1}$ is the inverse of $\Lambda_{0}$.
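For completeness, the inverse can be obtained by solving $u = S(t \mid Z_{i})$ for $t$. The steps below are a sketch that assumes $\Lambda_{0}$ is continuous and strictly increasing, so that $\Lambda_{0}^{-1}$ is well defined:

```latex
\begin{aligned}
u &= \exp\{- \Lambda_{0}(t) \exp(Z_{i} \beta)\} \\
\Longleftrightarrow \quad - \log(u) &= \Lambda_{0}(t) \exp(Z_{i} \beta) \\
\Longleftrightarrow \quad \Lambda_{0}(t) &= - \log(u) \exp(- Z_{i} \beta) \\
\Longleftrightarrow \quad t &= \Lambda_{0}^{-1}\{- \log(u) \exp(- Z_{i} \beta)\}.
\end{aligned}
```

Consequently, if $U$ is uniform on $(0, 1)$, then $S^{-1}(U \mid Z_{i})$ has survival function $S(\cdot \mid Z_{i})$, which is the basis of the inverse distribution function method used here.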

Since estimators of $\Lambda_{0}$ and $\beta$ are required to estimate $S^{-1}(u \mid Z_{i})$, we first consider the Breslow-type estimator [10]

$$\hat{\Lambda}_{0}(t) = \sum_{i = 1}^{n} \int_{0}^{t} \frac{d N_{i}(u)}{\sum_{j = 1}^{n} Y_{j}(u) \exp(Z_{j} \hat{\beta}_{+})},$$

and the corresponding inverse defined by

$$\hat{\Lambda}_{0}^{-1}(v) = \inf\{t : \hat{\Lambda}_{0}(t) \geq v\},$$

for $0 \leq v < \infty$. Note that $\hat{\Lambda}_{0}^{-1}$ is a step function, so the bootstrapped survival times generated from it may contain ties, potentially reducing the efficiency and stability of the proposed test. To address this issue, we consider a continuous, piecewise linear version of $\hat{\Lambda}_{0}$, denoted by $\tilde{\Lambda}_{0}(t)$, which avoids ties among the bootstrapped survival times and enhances the performance of the test. Specifically, let $t_{0} = 0$ and let $0 < t_{1} < \dots < t_{m} < \infty$ be the jump points of $\hat{\Lambda}_{0}$, such that $\lim_{\delta \to 0^{+}} \hat{\Lambda}_{0}(t_{i} - \delta) < \hat{\Lambda}_{0}(t_{i})$ for $1 \leq i \leq m$. Then, we define

$$\tilde{\Lambda}_{0}(t) = \frac{\hat{\Lambda}_{0}(t_{i}) - \hat{\Lambda}_{0}(t_{i - 1})}{t_{i} - t_{i - 1}} (t - t_{i - 1}) + \hat{\Lambda}_{0}(t_{i - 1})$$

for $t \in [t_{i - 1}, t_{i})$, $i = 1, \dots, m$, and $\tilde{\Lambda}_{0}(t) = \hat{\Lambda}_{0}(t_{m})$ for $t \in [t_{m}, \infty)$. The corresponding inverse is defined by

$$\tilde{\Lambda}_{0}^{-1}(w) = \begin{cases} \dfrac{(t_{i} - t_{i - 1}) \{w - \hat{\Lambda}_{0}(t_{i - 1})\}}{\hat{\Lambda}_{0}(t_{i}) - \hat{\Lambda}_{0}(t_{i - 1})} + t_{i - 1}, & w \in (\hat{\Lambda}_{0}(t_{i - 1}), \hat{\Lambda}_{0}(t_{i})], \ i = 1, \dots, m; \\ t_{m}, & w \in (\hat{\Lambda}_{0}(t_{m}), \infty). \end{cases}$$

Therefore, in the $b$th bootstrapped sample, we generate the survival times

$$T_{i}^{b} = \tilde{\Lambda}_{0}^{-1}\{- \log(U_{i}^{b}) \exp(- Z_{i} \hat{\beta}_{+})\}, \tag{10}$$

for $i = 1, \dots, n$, where $U_{i}^{b}$ are independent and identically distributed random variables from the uniform distribution on $(0, 1)$, denoted by $U(0, 1)$.
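The construction of the bootstrapped survival times in (10) can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the authors' R code: the function names, the toy data, and the value used for $\hat{\beta}_{+}$ are hypothetical, and the uniform draw is taken on $(0, 1]$ so that the logarithm is always finite.

```python
import math
import random


def breslow_jumps(x, delta, z, beta):
    """Jump times t_1 < ... < t_m of the Breslow estimator and its values there."""
    n = len(x)
    times, values, cum = [], [], 0.0
    for t in sorted({x[i] for i in range(n) if delta[i] == 1}):
        d = sum(1 for i in range(n) if delta[i] == 1 and x[i] == t)
        denom = sum(math.exp(z[j] * beta) for j in range(n) if x[j] >= t)
        cum += d / denom
        times.append(t)
        values.append(cum)
    return times, values


def smoothed_inverse(w, times, values):
    """Inverse of the piecewise linear interpolant through (0, 0) and the jumps."""
    knots_t = [0.0] + list(times)
    knots_v = [0.0] + list(values)
    if w <= 0.0:
        return 0.0
    for i in range(1, len(knots_t)):
        if w <= knots_v[i]:
            frac = (w - knots_v[i - 1]) / (knots_v[i] - knots_v[i - 1])
            return knots_t[i - 1] + frac * (knots_t[i] - knots_t[i - 1])
    return knots_t[-1]  # beyond the last jump the inverse is set to t_m


def bootstrap_survival_times(z, beta, times, values, rng):
    """One draw of T_i^b per subject, following (10)."""
    out = []
    for zi in z:
        u = 1.0 - rng.random()  # uniform on (0, 1], avoids log(0)
        w = -math.log(u) * math.exp(-zi * beta)
        out.append(smoothed_inverse(w, times, values))
    return out


# Hypothetical toy data and a hypothetical value for the restricted MPLE.
x = [1.0, 2.0, 3.0, 4.0]
delta = [1, 1, 0, 1]
z = [0.3, 1.2, 0.8, 1.7]
beta_plus = 0.5
times, values = breslow_jumps(x, delta, z, beta_plus)
t_boot = bootstrap_survival_times(z, beta_plus, times, values, random.Random(1))
```

Every generated time lies in $[0, t_{m}]$ because the smoothed inverse is capped at the last jump point, which mirrors the second branch of the piecewise definition.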

Based on the censoring status $\Delta_{i}$, we generate the bootstrapped censoring time for the $i$th subject in the $b$th bootstrapped sample, denoted by $C_{i}^{b}$. If the survival time is censored ($\Delta_{i} = 0$), the bootstrapped censoring time is the original censoring time, $C_{i}^{b} = C_{i}$. If the survival time is observed ($\Delta_{i} = 1$), we again apply the inverse distribution function method, now to the estimated censoring distribution. We estimate $G$ by the Kaplan–Meier estimator, denoted by $\hat{G}$, computed from the modified data $\{(X_{i}, 1 - \Delta_{i})\}_{i = 1}^{n}$. As with the bootstrapped survival times, we linearly interpolate $\hat{G}$ to obtain a continuous version $\tilde{G}$. Then, we obtain the corresponding inverse

$$\tilde{G}^{-1}(u) = \begin{cases} \dfrac{(c_{i} - c_{i - 1}) \{u - \hat{G}(c_{i - 1})\}}{\hat{G}(c_{i}) - \hat{G}(c_{i - 1})} + c_{i - 1}, & u \in (\hat{G}(c_{i - 1}), \hat{G}(c_{i})], \ i = 1, \dots, m'; \\ c_{m'}, & u \in (\hat{G}(c_{m'}), 1), \end{cases}$$

where $c_{0} = 0$ and $0 < c_{1} < \dots < c_{m'} < \infty$ are the jump points of $\hat{G}$, such that $\lim_{\delta \to 0^{+}} \hat{G}(c_{i} - \delta) < \hat{G}(c_{i})$ for $1 \leq i \leq m'$. Hence, we obtain

$$C_{i}^{b} = \tilde{G}^{-1}(V_{i}^{b}) I(\Delta_{i} = 1) + C_{i} I(\Delta_{i} = 0), \tag{11}$$

where $V_{i}^{b}$ are independently and identically generated from $U(0, 1)$ and are independent of $U_{i}^{b}$.
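A rough Python sketch of the censoring-time step (11) follows. It is an illustration rather than the authors' implementation: the function names and toy data are hypothetical, and the Kaplan–Meier computation assumes that no event time coincides with a censoring time.

```python
import random


def km_censoring_cdf(x, delta):
    """Kaplan-Meier estimate of G from (X_i, 1 - Delta_i): jump points and values."""
    times, values, surv = [], [], 1.0
    for c in sorted({xi for xi, di in zip(x, delta) if di == 0}):
        d = sum(1 for xi, di in zip(x, delta) if di == 0 and xi == c)
        at_risk = sum(1 for xi in x if xi >= c)
        surv *= 1.0 - d / at_risk
        times.append(c)
        values.append(1.0 - surv)  # G_hat(c) = 1 - KM survival of censoring
    return times, values


def smoothed_inverse(u, times, values):
    """Inverse of the linear interpolant of G_hat through (0, 0) and its jumps."""
    knots_t = [0.0] + list(times)
    knots_v = [0.0] + list(values)
    if u <= 0.0:
        return 0.0
    for i in range(1, len(knots_t)):
        if u <= knots_v[i]:
            frac = (u - knots_v[i - 1]) / (knots_v[i] - knots_v[i - 1])
            return knots_t[i - 1] + frac * (knots_t[i] - knots_t[i - 1])
    return knots_t[-1]  # above G_hat(c_m') the inverse is set to c_m'


def bootstrap_censoring_times(x, delta, rng):
    """One draw of C_i^b per subject, following (11)."""
    times, values = km_censoring_cdf(x, delta)
    out = []
    for xi, di in zip(x, delta):
        if di == 0:
            out.append(xi)  # censored subject: C_i = X_i is observed and kept
        else:
            out.append(smoothed_inverse(rng.random(), times, values))
    return out


# Hypothetical toy data.
x = [1.0, 2.0, 3.0, 4.0]
delta = [0, 1, 0, 1]
c_boot = bootstrap_censoring_times(x, delta, random.Random(7))
```

Keeping the observed censoring times fixed for censored subjects is what makes the bootstrap conditional on censoring; only subjects whose censoring time was never observed receive a simulated one.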

In summary, given data $\{(X_{i}, \Delta_{i}, Z_{i})\}_{i = 1}^{n}$, we independently generate $B$ sets of bootstrapped survival and censoring times, $T_{i}^{b}$ and $C_{i}^{b}$, from (10) and (11), respectively, and obtain $X_{i}^{b} = \min\{T_{i}^{b}, C_{i}^{b}\}$ and $\Delta_{i}^{b} = I(T_{i}^{b} \leq C_{i}^{b})$ for $b = 1, \dots, B$. Using each bootstrapped sample, $\{(X_{i}^{b}, \Delta_{i}^{b}, Z_{i})\}_{i = 1}^{n}$, we calculate the bootstrapped test statistic $T_{n}^{b}$. The critical value, denoted by $c_{\alpha}$, is approximated by the upper $\alpha$ quantile, that is, the $(1 - \alpha)$ quantile, of $\{T_{n}^{b}\}_{b = 1}^{B}$. We reject the null hypothesis when $T_{n} > c_{\alpha}$.

4. Simulation

4.1. Size and Power Study

We set the significance level to $\alpha = 0.05$ for Sections 4 and 5. To evaluate the size and power of the proposed GOF test, we consider seven different functions $\phi(Z)$ over $Z \in (0.1, 2)$ in the isotonic PH model (5) with the constant baseline hazard $\lambda_{0}(t) = 1$. Three effect functions are linear, $\phi(Z) = 0$, $Z$, and $5 Z$, satisfying $H_{0}$, and the remaining four are nonlinear increasing functions, $\phi(Z) = Z^{2}$, $\exp(Z)$, $\log(Z)$, and $6 \sqrt{Z}$, satisfying $H_{1}$. The covariates $Z_{i}$ were independently generated from scaled and shifted beta distributions over the support $(0.1, 2)$ with densities proportional to $\{(u - 0.1) / 1.9\}^{a - 1} \{1 - (u - 0.1) / 1.9\}^{b - 1}$ for $u \in (0.1, 2)$, with $a, b > 0$. We consider three covariate distribution scenarios, denoted by $C_{c}$, $C_{u}$, and $C_{b}$, corresponding to parameter pairs $(a, b) = (2, 2)$, $(1, 1)$, and $(0.5, 0.5)$, respectively. In scenario $C_{c}$, covariate values are concentrated near the center of the support. In contrast, scenario $C_{b}$ emphasizes values near the boundaries, while scenario $C_{u}$ results in a uniform distribution of covariates across the support.
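The data-generating step of this design can be sketched with the Python standard library. The sketch assumes that $(a, b)$ are the usual beta shape parameters, so that $(1, 1)$ gives the uniform case as stated in the text, and it uses the fact that with $\lambda_{0}(t) = 1$ the survival time is exponential with rate $\exp\{\phi(Z)\}$. The function names and the seed are hypothetical.

```python
import math
import random

# (a, b) for the three scenarios, read as the usual beta shape parameters.
SCENARIOS = {"C_c": (2.0, 2.0), "C_u": (1.0, 1.0), "C_b": (0.5, 0.5)}


def draw_covariates(n, scenario, rng):
    """Draw n covariates from a beta distribution rescaled to the support (0.1, 2)."""
    a, b = SCENARIOS[scenario]
    return [0.1 + 1.9 * rng.betavariate(a, b) for _ in range(n)]


def draw_survival_time(z, phi, rng):
    """Draw T with hazard exp{phi(z)}, i.e. model (5) with lambda_0(t) = 1."""
    u = 1.0 - rng.random()  # uniform on (0, 1], avoids log(0)
    return -math.log(u) * math.exp(-phi(z))


rng = random.Random(2025)
z = draw_covariates(200, "C_b", rng)
# One of the H_1 effect functions from the text: phi(Z) = 6 * sqrt(Z).
t = [draw_survival_time(zi, lambda v: 6.0 * math.sqrt(v), rng) for zi in z]
```

Swapping the scenario key or the effect function reproduces the other design cells; censoring times would then be drawn separately and combined with these survival times.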

For each scenario, 500 Monte Carlo samples were generated to approximate the probabilities of rejecting $H_{0}$. We consider $n \in \{50, 100, 200, 500, 1000\}$ to assess the effect of sample size on test performance. Censoring times are drawn from a uniform distribution $U(0, \theta)$, with $\theta > 0$ chosen such that the censoring rate is about $30\%$. For each sample, critical values for the proposed GOF test are computed using $B = 500$ bootstrap replications conditional on covariates and censoring times. We compare the proposed GOF test with the commonly applied residual-based GOF test proposed by Lin et al. [6], denoted by $R_{n}$, which does not rely on the isotonic PH assumption.

Under $H_{0}$, according to the results in Table 1, all the type I error probabilities are below $0.075$ and thus do not significantly exceed the nominal level $\alpha = 0.05$, given a margin of error of $0.025 \approx z_{0.995} \times \sqrt{0.05 (1 - 0.05) / 500}$ for 500 Monte Carlo samples at a confidence level of $99\%$, where $z_{0.995}$ is the 99.5th percentile of the standard normal distribution. Regarding the proposed test $T_{n}$, the case with $\phi(Z) = Z$ generally exhibits slightly larger type I error probabilities than $\phi(Z) = 0$ and $5 Z$. In contrast, $R_{n}$ does not exhibit systematic patterns. Furthermore, neither $T_{n}$ nor $R_{n}$ demonstrates notable differences across the covariate distributions considered.
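The margin of error quoted above is simple arithmetic and can be reproduced as follows; the variable names are illustrative.

```python
import math
from statistics import NormalDist

alpha = 0.05          # nominal level
n_mc = 500            # number of Monte Carlo samples
z_crit = NormalDist().inv_cdf(0.995)  # 99.5th percentile of N(0, 1)

# Half-width of a 99% normal-approximation interval for a rejection
# proportion whose true value equals the nominal level.
margin = z_crit * math.sqrt(alpha * (1.0 - alpha) / n_mc)
threshold = alpha + margin
```

The resulting margin is close to 0.025, so an empirical rejection rate below about 0.075 is compatible with a true size of 0.05 at this Monte Carlo sample size.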

Under $H_{1}$, both $T_{n}$ and $R_{n}$ exhibit power approaching 1 as the sample size $n$ increases, suggesting that both tests detect deviations from log-linearity in the hazard function with high probability for large sample sizes. Among the considered covariate distributions, $C_{b}$ generally leads to the highest power, followed by $C_{u}$ and then $C_{c}$, since the effect functions have greater curvature near the boundaries, which aligns well with the boundary-focused distribution $C_{b}$. Notably, $T_{n}$ shows greater power for the small sample size $n = 50$ for $\phi(Z) = Z^{2}$, $\exp(Z)$, and $\log(Z)$. As the sample size grows, $T_{n}$ continues to outperform $R_{n}$ under $C_{c}$, where the covariates concentrate around the center of the support and $\phi$ is less curved than near the boundaries. Conversely, $R_{n}$ demonstrates higher power under $C_{b}$, while, under $C_{u}$, the performances of $T_{n}$ and $R_{n}$ are generally comparable, with neither test consistently dominating. Lastly, for $\phi(Z) = 6 \sqrt{Z}$, although the power of $T_{n}$ tends to be lower than that of $R_{n}$, it remains competitive across most of the covariate distributions and sample sizes.

4.2. Evaluation of Robustness Across Baseline Hazards

We evaluate the robustness of the tests $T_{n}$ and $R_{n}$ under different baseline hazard functions, using the same effect functions $\phi$ and the uniform covariate distribution $C_{u}$ as in Section 4.1. Specifically, we consider baseline hazards from Gompertz distributions, denoted by $\mathrm{G}(\eta, b)$, with shape parameter $\eta$ and scale parameter $b$. Two Gompertz models are used: $\mathrm{G}(1, 2)$ and $\mathrm{G}(2, 0.5)$. In contrast to the constant baseline hazard of the exponential distribution with mean 1, both $\mathrm{G}(1, 2)$ and $\mathrm{G}(2, 0.5)$ have increasing hazard functions, with that of $\mathrm{G}(2, 0.5)$ increasing faster. The distinct shapes of these hazard functions are illustrated in Figure 1.

Under $H_{0}$, Table 2 shows that both $T_{n}$ and $R_{n}$ have similar type I error probabilities below $0.075$, maintaining well-controlled type I error rates, as in the size study in Section 4.1. Under $H_{1}$, the test $T_{n}$ is robust across the different baseline hazard functions, with most power differences between $\mathrm{G}(1, 2)$ and $\mathrm{G}(2, 0.5)$ remaining below $0.05$. In contrast, $R_{n}$ is more sensitive to changes in the baseline hazard, with power differences of up to $0.168$ and $0.166$ when $\phi(Z) = 6 \sqrt{Z}$ with $n = 500$ and $1000$, respectively.

5. Illustrations with Real Data

5.1. German Breast Cancer Study

Breast cancer is the most common cancer among women worldwide, and although it predominantly affects women, men can also develop breast cancer. It can profoundly impact a person’s life, but with advances in medical research and treatment, the prognosis for many patients has improved significantly. Early detection, comprehensive treatment plans, and ongoing support can help individuals and their families manage the challenges posed by breast cancer and improve their overall quality of life.

Here, we analyze the gbsg dataset in the R package survival, also known as the German Breast Cancer Study Group (GBSG) dataset, which comes from a clinical trial conducted by that group. The dataset contains information on 686 patients with primary node-positive breast cancer who were treated at 17 centers in Germany between 1984 and 1989. In the dataset, 387 of the 686 patients were censored, so the censoring rate is $56.4\%$. In addition to the survival time and the censoring status, the dataset includes several variables, such as age, tumor size, and the number of positive lymph nodes.

Here, we test the log-linearity assumption for an important prognostic factor in breast cancer: the number of positive lymph nodes. As suggested by Fitzgibbons et al. [11], a higher number of positive lymph nodes is associated with an increased risk of breast cancer recurrence and a higher risk of death from the disease. We assume that the monotonicity constraint is satisfied and conduct the GOF test with the hypotheses

$$\begin{aligned} H_{0} &: \lambda(t \mid Z) = \lambda_{0}(t) \exp(Z \beta) \text{ for some } \beta \geq 0, \\ H_{1} &: \lambda(t \mid Z) = \lambda_{0}(t) \exp\{\phi(Z)\} \text{ for some increasing function } \phi, \text{ but not } H_{0}, \end{aligned}$$

where $Z$ is the number of positive lymph nodes. For the estimation of $\phi$, we choose $Z_{K} = 2$ such that $\hat{\phi}(Z_{K}) = 0$. With $B = 500$ bootstrap samples, we obtain the test statistic $T_{n} = 18.215$ and the bootstrap critical value $c_{\alpha} = 9.288$. Since $T_{n} > c_{\alpha}$, we reject the null hypothesis that the hazard is log-linear in the number of positive lymph nodes.

In Figure 2, we present the shifted log-hazard effects $(Z - Z_{K}) \hat{\beta}_{+}$ and $\hat{\phi}(Z)$, so that $(Z - Z_{K}) \hat{\beta}_{+} = \hat{\phi}(Z) = 0$ at $Z = Z_{K}$. This normalization mitigates identifiability issues and facilitates clearer graphical comparisons. In Figure 2, $\hat{\phi}(Z)$ from the isotonic PH model reveals a faster increase in hazard for smaller numbers of positive nodes and deviates from the log-linear effect $Z \hat{\beta}_{+}$ with constant slope $\hat{\beta}_{+}$. In addition, the shape of the estimate $\hat{\phi}(Z)$ motivates a square-root-transformed covariate $\sqrt{Z}$ in the Cox PH model with hazard $\lambda(t \mid Z) = \lambda_{0}(t) \exp(\sqrt{Z} \beta^{*})$ for some $\beta^{*} \geq 0$. For comparison, we added $(\sqrt{Z} - \sqrt{Z_{K}}) \tilde{\beta}_{+}$, where $\tilde{\beta}_{+}$ is the corresponding restricted MPLE of $\beta^{*}$, to Figure 2. Compared with $(Z - Z_{K}) \hat{\beta}_{+}$, the square-root-transformed effect $(\sqrt{Z} - \sqrt{Z_{K}}) \tilde{\beta}_{+}$ captures both the faster increase in hazard for smaller numbers of positive nodes and the slower increase for larger numbers. To further evaluate the square-root transformation, we apply the proposed GOF test with the following hypotheses:

$$\begin{aligned} H_{0}^{*} &: \lambda(t \mid Z) = \lambda_{0}(t) \exp(\sqrt{Z} \beta^{*}) \text{ for some } \beta^{*} \geq 0, \\ H_{1}^{*} &: \lambda(t \mid Z) = \lambda_{0}(t) \exp\{\phi^{*}(\sqrt{Z})\} \text{ for some increasing function } \phi^{*}, \text{ but not } H_{0}^{*}. \end{aligned}$$

With the test statistic $T_{n} = 8.413$ and an estimated critical value $c_{\alpha} = 9.703$ from $B = 500$ bootstrapped samples, the evidence is not strong enough to reject $H_{0}^{*}$, which supports the appropriateness of $\sqrt{Z}$ in the Cox PH model. This finding is consistent with the approach of Royston and Altman [12], who applied a square-root transformation to the number of nodes when fitting the Cox PH model.

5.2. NCCTG Lung Cancer Data

While progress has been made in recent years, particularly in the areas of early detection, targeted therapies, and immunotherapies, lung cancer remains a challenging problem. Lung cancer holds the top position in cancer-related deaths across the globe, resulting in a higher number of fatalities than the sum of breast, prostate, and colorectal cancer deaths. Compared to early-stage lung cancer, advanced lung cancer, which refers to lung cancer that has progressed to a stage where it has spread beyond the lungs to other parts of the body or has become locally extensive, affecting nearby tissues and structures, typically has a poorer prognosis.

Here, we analyze the cancer dataset in the R package survival, which was collected in a study conducted by the North Central Cancer Treatment Group (NCCTG) on patients with advanced lung cancer. As mentioned by Loprinzi et al. [9], this study aimed to assess whether the prognostic information gathered from a patient-completed questionnaire could offer independent insights beyond those already obtained by the patient’s physician through descriptive data. The Karnofsky Performance Score (KPS), rated by patients, assesses a patient’s ability to perform routine daily tasks and activities. The KPS was developed by Karnofsky [13] as a method for evaluating a patient’s functional status, particularly in assessing the response to chemotherapeutic agents in cancer treatment. The score ranges from 0 to 100, with higher scores indicating better functional ability. We assume that the hazard of failure is decreasing in the KPS. Therefore, we can apply the GOF test for the Cox PH model to check whether the log-linearity assumption is satisfied for the KPS. The hypotheses are as follows:

$$\begin{aligned} H_{0} &: \lambda(t \mid Z) = \lambda_{0}(t) \exp(Z \beta) \text{ for some } \beta \leq 0, \\ H_{1} &: \lambda(t \mid Z) = \lambda_{0}(t) \exp\{\phi(Z)\} \text{ for some decreasing function } \phi(Z), \text{ but not } H_{0}, \end{aligned}$$

where $Z$ represents the KPS rated by the patient. After removing the records with missing values, 63 of the remaining 225 patients were censored. Here, we choose $Z_{K} = 70$ such that $\hat{\phi}(Z_{K}) = 0$. With $B = 500$ bootstrap samples, we obtain the test statistic $T_{n} = 1.308$ and the bootstrap critical value $c_{\alpha} = 4.671$. Since $T_{n} < c_{\alpha}$, we fail to reject the null hypothesis that the hazard of death is log-linear in the Karnofsky Performance Score. The shifted log-linear hazard effect $(Z - Z_{K}) \hat{\beta}_{-}$ and the log-monotonic hazard effect $\hat{\phi}(Z)$ are presented in Figure 3. These two log-hazard effects are close overall, also suggesting that the Cox PH model is a reasonable choice for studying the hazard through the patients’ self-rated KPS.

6. Discussion

In this work, we propose a GOF test for evaluating the log-linear effect of a univariate covariate in the traditional Cox PH model, framed within the isotonic PH model. With bootstrapped critical values, the test demonstrates well-controlled type I error rates and strong power for detecting deviations from log-linearity. In addition, when the proposed GOF test rejects log-linearity, one can use the plot of the estimate $\hat{\phi}(Z)$ to propose a transformation $g(Z)$ and perform the GOF test for

$$\begin{aligned} H_{0}^{*} &: \lambda(t \mid Z) = \lambda_{0}(t) \exp\{g(Z) \beta^{*}\} \text{ for some } \beta^{*} \geq 0, \\ H_{1}^{*} &: \lambda(t \mid Z) = \lambda_{0}(t) \exp[\phi^{*}\{g(Z)\}] \text{ for some increasing function } \phi^{*}, \text{ but not } H_{0}^{*}, \end{aligned}$$

to check whether the transformed covariate $g(Z)$ is appropriate or a further monotonic transformation is needed.

As shown in Section 4.1, the proposed test appears to be conservative, indicating potential for improvement. In addition to the robustness assessment in Section 4.2, we provide further numerical results for a higher censoring rate of $50\%$ and for discrete censoring distributions in the Supplementary Material. As expected, a higher censoring rate leads to more conservative tests; however, the tests remain consistent, since the power still approaches 1 as the sample size increases. On the other hand, we observe that a discrete uniform censoring distribution that mildly discretizes the continuous uniform censoring distribution has a minor impact on the rejection rates. In addition, exploring the asymptotic properties and theoretical justifications is crucial for improving our understanding of the test statistic’s distribution and refining the choice of critical values. Guided by these theoretical advancements, investigating how the shape of the monotonic function affects power could also yield valuable insights.

Despite these challenges, extending the proposed partial-likelihood-based GOF test to handle multiple covariates or a partial linear PH model with monotonic effects [7] is a natural and promising direction. However, such extensions require careful consideration and warrant further investigation. Finally, resolving the open problem of understanding the distribution of the log partial likelihood in (7), even for univariate covariates, remains a challenging and necessary task.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/math13142264/s1.

Author Contributions

Methodology, H.C. and C.-F.T.; Software, H.C.; Writing—original draft, H.C. and C.-F.T.; Writing—review & editing, C.-F.T.; Visualization, H.C.; Supervision, C.-F.T.; Funding acquisition, C.-F.T. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partially supported by the National Science Foundation [DMS 2311292].

Data Availability Statement

The original contributions presented in this study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

CMF	Cyclophosphamide, methotrexate, and fluorouracil
GBSG	German Breast Cancer Study Group
GOF	Goodness-of-fit
KPS	Karnofsky Performance Score
MPLE	Maximum partial likelihood estimator
NCCTG	North Central Cancer Treatment Group
PH	Proportional hazards

References

1. Cox, D.R. Regression models and life-tables. J. R. Stat. Soc. Ser. B (Methodol.) 1972, 34, 187–202.
2. Cox, D.R. Partial likelihood. Biometrika 1975, 62, 269–276.
3. Grambsch, P.M.; Therneau, T.M. Proportional hazards tests and diagnostics based on weighted residuals. Biometrika 1994, 81, 515–526.
4. Therneau, T.M.; Grambsch, P.M. The Cox Model. In Modeling Survival Data: Extending the Cox Model; Springer: New York, NY, USA, 2000.
5. Therneau, T.M.; Grambsch, P.M.; Fleming, T.R. Martingale-based residuals for survival models. Biometrika 1990, 77, 147–160.
6. Lin, D.Y.; Wei, L.J.; Ying, Z. Checking the Cox model with cumulative sums of martingale-based residuals. Biometrika 1993, 80, 557–572.
7. Chung, Y.; Ivanova, A.; Hudgens, M.G.; Fine, J.P. Partial likelihood estimation of isotonic proportional hazards models. Biometrika 2018, 105, 133–148.
8. Xu, G.; Sen, B.; Ying, Z. Bootstrapping a change-point Cox model for survival data. Electron. J. Stat. 2014, 8, 1345.
9. Loprinzi, C.L.; Laurie, J.A.; Wieand, H.S.; Krook, J.E.; Novotny, P.J.; Kugler, J.W.; Bartel, J.; Law, M.; Bateman, M.; Klatt, N.E. Prospective evaluation of prognostic variables from patient-completed questionnaires. North Central Cancer Treatment Group. J. Clin. Oncol. 1994, 12, 601–607.
10. Breslow, N.E. Contribution to discussion of paper by DR Cox. J. R. Stat. Soc. Ser. B 1972, 34, 216–217.
11. Fitzgibbons, P.L.; Page, D.L.; Weaver, D.; Thor, A.D.; Allred, D.C.; Clark, G.M.; Ruby, S.G.; O’Malley, F.; Simpson, J.F.; Connolly, J.L.; et al. Prognostic factors in breast cancer: College of American Pathologists consensus statement 1999. Arch. Pathol. Lab. Med. 2000, 124, 966–978.
12. Royston, P.; Altman, D.G. External validation of a Cox prognostic model: Principles and methods. BMC Med. Res. Methodol. 2013, 13, 1–15.
13. Karnofsky, D.A. The clinical evaluation of chemotherapeutic agents in cancer. In Evaluation of Chemotherapeutic Agents; Columbia University Press: New York, NY, USA, 1949; pp. 191–205.

Figure 1. Hazard functions for the exponential distribution with mean 1 and the Gompertz distributions $G(1, 2)$ and $G(2, 0.5)$.

Figure 2. Shifted log-hazard effect estimates for the breast cancer dataset. The plot includes $(Z - Z_{K}) \hat{\beta}_{+}$ from a Cox PH model, $\hat{\phi}(Z)$ from the isotonic PH model, and $(\sqrt{Z} - \sqrt{Z_{K}}) \tilde{\beta}_{+}$ from a Cox PH model with the square-root transformed covariate $\sqrt{Z}$.

Figure 3. Shifted log-hazard effect estimates for the NCCTG lung cancer data. The plot includes $(Z - Z_{K}) \hat{\beta}_{+}$ from a Cox PH model, $\hat{\phi}(Z)$ from the isotonic PH model, and $(\sqrt{Z} - \sqrt{Z_{K}}) \tilde{\beta}_{+}$ from a Cox PH model with the square-root transformed covariate $\sqrt{Z}$.

Table 1. Rejection rates for the GOF tests $T_{n}$ and $R_{n}$ with covariate distributions $C_{c}$, $C_{u}$, and $C_{b}$.

			$\phi(Z)$ Under $H_{0}$			$\phi(Z)$ Under $H_{1}$
n	Distribution	Test	$0$	$Z$	$5 Z$	$Z^{2}$	$\exp (Z)$	$\log (Z)$	$6 \sqrt{Z}$
50	$C_{c}$	$T_{n}$	0.018	0.056	0.044	0.092	0.140	0.074	0.056
		$R_{n}$	0.044	0.040	0.038	0.044	0.080	0.062	0.070
	$C_{u}$	$T_{n}$	0.026	0.060	0.022	0.130	0.216	0.098	0.058
		$R_{n}$	0.054	0.062	0.046	0.102	0.134	0.072	0.062
	$C_{b}$	$T_{n}$	0.026	0.046	0.028	0.148	0.254	0.190	0.066
		$R_{n}$	0.042	0.032	0.038	0.108	0.166	0.144	0.112
100	$C_{c}$	$T_{n}$	0.028	0.052	0.026	0.222	0.276	0.102	0.070
		$R_{n}$	0.042	0.060	0.040	0.124	0.186	0.090	0.088
	$C_{u}$	$T_{n}$	0.030	0.058	0.028	0.222	0.428	0.202	0.100
		$R_{n}$	0.040	0.050	0.052	0.238	0.372	0.150	0.138
	$C_{b}$	$T_{n}$	0.038	0.060	0.022	0.278	0.534	0.364	0.148
		$R_{n}$	0.060	0.040	0.032	0.332	0.586	0.304	0.138
200	$C_{c}$	$T_{n}$	0.024	0.056	0.018	0.406	0.470	0.152	0.114
		$R_{n}$	0.042	0.044	0.054	0.232	0.408	0.138	0.136
	$C_{u}$	$T_{n}$	0.022	0.050	0.022	0.406	0.782	0.346	0.204
		$R_{n}$	0.052	0.040	0.038	0.512	0.792	0.342	0.266
	$C_{b}$	$T_{n}$	0.038	0.040	0.028	0.602	0.910	0.614	0.342
		$R_{n}$	0.040	0.044	0.038	0.748	0.938	0.596	0.426
500	$C_{c}$	$T_{n}$	0.034	0.060	0.038	0.870	0.886	0.316	0.282
		$R_{n}$	0.038	0.040	0.054	0.570	0.874	0.286	0.314
	$C_{u}$	$T_{n}$	0.046	0.068	0.026	0.870	0.998	0.730	0.508
		$R_{n}$	0.048	0.034	0.026	0.944	0.998	0.684	0.630
	$C_{b}$	$T_{n}$	0.038	0.068	0.030	0.948	1.000	0.950	0.812
		$R_{n}$	0.072	0.068	0.064	0.980	1.000	0.964	0.870
1000	$C_{c}$	$T_{n}$	0.030	0.066	0.046	0.994	0.998	0.624	0.500
		$R_{n}$	0.062	0.064	0.042	0.904	0.998	0.554	0.554
	$C_{u}$	$T_{n}$	0.018	0.052	0.032	0.994	1.000	0.962	0.856
		$R_{n}$	0.054	0.062	0.046	1.000	1.000	0.974	0.928
	$C_{b}$	$T_{n}$	0.026	0.050	0.022	1.000	1.000	1.000	0.986
		$R_{n}$	0.042	0.046	0.044	1.000	1.000	1.000	0.992

Table 2. Rejection rates for the GOF tests $T_{n}$ and $R_{n}$ under baseline hazards $G(1, 2)$ and $G(2, 0.5)$. Power differences of at least 0.05 between the two settings are indicated with an asterisk (*), and differences exceeding 0.12 are further marked with a dagger (^†).

			$\phi(Z)$ Under $H_{0}$			$\phi(Z)$ Under $H_{1}$
n	Test	Baseline hazard	$0$	$Z$	$5 Z$	$Z^{2}$	$\exp (Z)$	$\log (Z)$	$6 \sqrt{Z}$
50	$T_{n}$	G $(1, 2)$	0.028	0.058	0.020	0.146	0.214	0.118	0.062
		G $(2, 0.5)$	0.030	0.056	0.020	0.150	0.230	0.132	0.058
	$R_{n}$	G $(1, 2)$	0.030	0.034	0.034	0.114	0.142	0.078	0.058
		G $(2, 0.5)$	0.032	0.030	0.040	0.090	0.102	0.060	0.058
100	$T_{n}$	G $(1, 2)$	0.026	0.068	0.028	0.234	0.440	0.240	0.094
		G $(2, 0.5)$	0.026	0.058	0.020	0.240	0.430	0.262	0.100
	$R_{n}$	G $(1, 2)$	0.042	0.054	0.052	0.230	0.362 *	0.148	0.132
		G $(2, 0.5)$	0.040	0.040	0.052	0.206	0.274 *	0.128	0.110
200	$T_{n}$	G $(1, 2)$	0.020	0.054	0.022	0.420	0.782	0.380	0.206
		G $(2, 0.5)$	0.020	0.060	0.024	0.436	0.776	0.414	0.198
	$R_{n}$	G $(1, 2)$	0.046	0.056	0.038	0.492 *	0.800 ^†	0.352	0.272 *
		G $(2, 0.5)$	0.052	0.038	0.040	0.384 *	0.672 ^†	0.312	0.222 *
500	$T_{n}$	G $(1, 2)$	0.044	0.074	0.032	0.886	0.998	0.746	0.508
		G $(2, 0.5)$	0.044	0.068	0.034	0.892	1.000	0.794	0.508
	$R_{n}$	G $(1, 2)$	0.050	0.036	0.024	0.948 *	0.996	0.710	0.662 ^†
		G $(2, 0.5)$	0.048	0.054	0.032	0.880 *	0.996	0.676	0.494 ^†
1000	$T_{n}$	G $(1, 2)$	0.024	0.052	0.030	0.994	1.000	0.968	0.872
		G $(2, 0.5)$	0.024	0.046	0.032	0.992	1.000	0.974	0.868
	$R_{n}$	G $(1, 2)$	0.048	0.060	0.052	1.000	1.000	0.980	0.932 ^†
		G $(2, 0.5)$	0.054	0.042	0.040	0.994	1.000	0.974	0.766 ^†
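
The marking rule stated in the caption of Table 2 can be checked with a few lines of Python. The sketch below is illustrative only: the function name `power_gap_marker` is hypothetical and not from the article, and it assumes that the asterisk applies to gaps from 0.05 up to and including 0.12 and the dagger to gaps strictly above 0.12. The rates are compared in integer thousandths to avoid floating-point rounding at the 0.05 boundary.

```python
def power_gap_marker(rate_a: float, rate_b: float) -> str:
    """Return the Table 2 marker for the gap between two rejection rates.

    Hypothetical helper: "†" if the gap exceeds 0.12, "*" if it is at
    least 0.05, and "" otherwise.
    """
    # Rates are reported to three decimals, so compare integer thousandths;
    # this keeps a gap such as 0.272 - 0.222 at exactly 50.
    gap = round(abs(rate_a - rate_b) * 1000)
    if gap > 120:
        return "†"
    if gap >= 50:
        return "*"
    return ""


# R_n rejection rates copied from Table 2: (rate under G(1, 2), rate under G(2, 0.5)).
examples = {
    "n=100, exp(Z)": (0.362, 0.274),
    "n=200, exp(Z)": (0.800, 0.672),
    "n=200, 6*sqrt(Z)": (0.272, 0.222),
    "n=1000, 6*sqrt(Z)": (0.932, 0.766),
}

for label, (g12, g205) in examples.items():
    print(f"{label}: gap marker = {power_gap_marker(g12, g205)!r}")
```

Applied to the $R_{n}$ rows, this rule reproduces the markers shown in the table, while pairs such as the $T_{n}$ rates 0.746 and 0.794 for $\log (Z)$ at $n = 500$ fall below the 0.05 threshold and stay unmarked.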


© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

