Next Article in Journal
Iceberg Model as a Digital Risk Twin for the Health Monitoring of Complex Engineering Systems
Previous Article in Journal
CONFIDE: CONformal Free Inference for Distribution-Free Estimation in Causal Competing Risks
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Symmetric Double Normal Models for Censored, Bounded, and Survival Data: Theory, Estimation, and Applications

by
Guillermo Martínez-Flórez
1,
Hugo Salinas
2 and
Javier Ramírez-Montoya
1,*
1
Departamento de Matemática y Estadística, Facultad de Ciencias Básicas, Universidad de Córdoba, Montería 230001, Colombia
2
Departamento de Matemática, Facultad de Ingeniería, Universidad de Atacama, Copiapó 7500015, Chile
*
Author to whom correspondence should be addressed.
Mathematics 2026, 14(2), 384; https://doi.org/10.3390/math14020384
Submission received: 15 December 2025 / Revised: 19 January 2026 / Accepted: 21 January 2026 / Published: 22 January 2026
(This article belongs to the Section D1: Probability and Statistics)

Abstract

We develop a unified likelihood-based framework for limited outcomes built on the two-piece normal family. The framework includes a censored specification that accommodates boundary inflation, a doubly truncated specification on ( 0 , 1 ) for rates and proportions, and a survival formulation with a log-two-piece normal baseline and Gamma frailty to account for unobserved heterogeneity. We derive closed-form building blocks (pdf, cdf, survival, hazard, and cumulative hazard), full log-likelihoods with score functions and observed information, and stable reparameterizations that enable routine optimization. Monte Carlo experiments show a small bias and declining RMSE with increasing sample size; censoring primarily inflates the variability of regression coefficients; the scale parameter remains comparatively stable, and the shape parameter is most sensitive under heavy censoring. Applications to HIV-1 RNA with a detection limit, household food expenditure on ( 0 , 1 ) , labor-supply hours with a corner solution, and childhood cancer times to hospitalization demonstrate improved fit over Gaussian, skew-normal, and beta benchmarks according to AIC/BIC/CAIC and goodness-of-fit diagnostics, with model-implied censoring closely matching the observed fraction. The proposed formulations are tractable, flexible, and readily implementable with standard software.

1. Introduction

The normal distribution serves as a fundamental element in statistical modeling because of its analytical ease and its essential role in limit theorems. However, real-world data frequently exhibit characteristics such as heavier or lighter tails, asymmetry, and, in certain contexts, multimodality, which a single Gaussian density cannot sufficiently represent [1]. Classes of normal distribution may be found [2]. However, there are flexible symmetric families that fit better and, in particular, may be bimodal. For rates and proportions on the unit interval, beta regression is a standard choice [3].
Within this landscape, two-piece constructions provide an appealing route to flexibility by allowing distinct shapes on either side of a central location while preserving closed-form expressions for key quantities. Building on this idea, the two-piece normal family (symmetric double normal; with an asymmetric extension) offers a tractable kernel with a shape parameter that controls departures from normality, enabling richer behavior than the symmetric Gaussian distribution and supporting likelihood-based inference with standard tools [4,5].
These features are consequential in applications involving censoring and bounded support. We leverage them to develop the following: (i) a censored model for limited outcomes, (ii) a doubly truncated model on ( 0 , 1 ) , and (iii) a survival specification with a log-two-piece baseline and Gamma frailty. In biostatistics and environmental monitoring, detection limits induce mass accumulation at thresholds; in econometrics, corner solutions yield limited dependent variables; and for proportions, support is intrinsically constrained to ( 0 , 1 ) [1]. A coherent framework based on the double normal (two-component normal mixture) family accommodates such constraints while allowing for flexible shapes (including possible bimodality) and straightforward computation [4,5]. Note that the T N ( λ ) specification considered in Section 2 is symmetric by construction; any asymmetry in the observed data distribution may arise from censoring/truncation mechanisms rather than from the latent error distribution itself.
This paper develops two extensions tailored to these scenarios. First, a censored specification of the double normal distribution provides a Tobit-type model that allows for flexible modality while accounting for accumulation at a censoring boundary [1]. Second, a doubly truncated specification on ( 0 , 1 ) offers a natural competitor to beta regression for proportions and rates, with the added ability to represent platykurtic shapes and, when supported by the data-generating mechanism, multimodality through a principled truncation operator [3,6]. We also consider a survival formulation with Gamma frailty, linking heterogeneity modeling to established survival-analysis methodology [7].
Our contributions are as follows. First, we establish formal properties for the proposed models, including stochastic representations, generating functions, moments, and conditions for modality [4,5]. Second, we propose regression structures via suitable link functions for bounded outcomes and survival contexts [1,7]; for completeness and reproducibility, we also provide the corresponding log-likelihood expressions under censoring/truncation and the analytical derivatives (score functions and observed Fisher information) used for numerical maximization. Third, we conduct Monte Carlo experiments to assess finite-sample performance (bias and root mean squared error) across sample sizes, censoring and truncation intensities, and modality regimes, and we compare models using classical information criteria [8,9,10]. Finally, empirical illustrations in biomedical, economic, and labor datasets benchmark the proposed models against Gaussian, skewed, and beta alternatives and include formal tests of unimodality using the Hartigan-Hartigan dip test [11].
Section 2 introduces the symmetric two-piece normal (double normal) distribution and summarizes its main properties (moments, kurtosis, and modality conditions). Section 3 presents the censored two-piece normal (CTN) model, including its distributional properties, the Tobit-type regression extension, and likelihood-based estimation components for CTN. Section 4 develops the doubly truncated two-piece normal (TTN) model on ( 0 , 1 ) and its regression specification. Section 5 introduces the survival formulation with a log-two-piece normal baseline and Gamma frailty and provides the resulting marginal likelihood. Section 6 consolidates general inference and implementation details (asymptotics, standard errors, and boundary issues). Section 7 reports the simulation study, and Section 8 presents the empirical applications. Finally, Section 9 concludes.

2. The Symmetric Two-Piece Normal (Double Normal) Distribution

2.1. Notation

Remark 1.
Vectors are typeset in bold; for example, x i R p and β R p ; scalar components are denoted by x i j and β j . We write ϕ ( · ) and Φ ( · ) for the standard normal pdf and cdf, respectively. Unless stated otherwise, λ 0 is a scalar shape parameter, and n is the sample size. To avoid ambiguity, we reserve θ > 0 for the frailty parameter (when applicable) and denote the full model parameter vector by Θ, with MLE Θ ^ and observed information I ( Θ ) . Superscripts such as * or ** are used only when explicitly defined.

2.2. The Two-Piece Normal Family

Motivated by tractability in likelihood-based analysis, we adopt a representation of the two-piece normal (symmetric “double normal”) distribution that makes its structure transparent: it is the equal-weight mixture of two unit-variance normal distributions centered at ± λ . This representation highlights both the name and the modeling intent (two shifted Gaussian “pieces”) and is more convenient for derivations than alternative kernels.
Definition 1.
A random variable Z follows a two-piece normal distribution with shape parameter λ 0 , denoted Z T N ( λ ) , if its pdf is
f Z ( z ; λ ) = 1 2 ( ϕ ( z λ ) + ϕ ( z + λ ) ) , z R .
Equation (1) admits the equivalent “tilted normal” form
f Z ( z ; λ ) = ϕ ( z ) e λ 2 / 2 cosh ( λ z ) ,
because ϕ ( z λ ) = ϕ ( z ) exp ( ± λ z λ 2 / 2 ) and cosh ( a ) = ( e a + e a ) / 2 . The normalizing factor e λ 2 / 2 follows from E ( cosh ( λ Z ) ) = e λ 2 / 2 for Z N ( 0 , 1 ) . Both (1) and (2) deliver closed-form expressions for common functionals and will be used interchangeably [4,5].
Lemma 1.
Let Z T N ( λ ) . Then
(i) 
The cdf is
F Z ( z ; λ ) = 1 2 ( Φ ( z λ ) + Φ ( z + λ ) ) .
(ii) 
Symmetry holds around zero: F Z ( z ; λ ) = 1 F Z ( z ; λ ) ; hence E ( Z ) = 0 and all odd central moments vanish.
(iii) 
The variance inflates with the separation of the two normals: Var ( Z ) = 1 + λ 2 .
Proposition 1.
Let Z T N ( λ ) ; then for any integer k 0 ,
E ( Z k ) = j = 0 k k j λ k j 1 + ( 1 ) k j 2 I j = 0 j k j k ( mod 2 ) k j λ k j I j .
where I j : = u j ϕ ( u ) d u is the j-th moment of the standard normal distribution (so that I j is zero for odd j and I j = 2 j / 2 1 ( 1 + ( 1 ) j ) Γ 1 + j 2 / π , in general). In particular, E ( Z 2 m + 1 ) = 0 and E ( Z 2 ) = 1 + λ 2 , in agreement with Lemma 1.
Proof. 
Using the mixture representation (1),
E ( Z k ) = 1 2 z k ϕ ( z λ ) d z + 1 2 z k ϕ ( z + λ ) d z .
With the changes of variables u = z λ and u = z + λ , respectively,
E ( Z k ) = 1 2 ( u + λ ) k ϕ ( u ) d u + 1 2 ( u λ ) k ϕ ( u ) d u .
Expanding using the binomial theorem and applying linearity,
E ( Z k ) = j = 0 k k j λ k j 1 2 u j ϕ ( u ) d u + 1 2 ( 1 ) k j u j ϕ ( u ) d u = j = 0 k k j λ k j 1 + ( 1 ) k j 2 I j .
The factor ( 1 + ( 1 ) k j ) / 2 equals 1 if k j is even and 0 otherwise, yielding the parity-restricted sum. For k = 1 , the sum vanishes (symmetry), and for k = 2 , we obtain E ( Z 2 ) = λ 2 I 0 + I 2 = λ 2 + 1 .    □
Remark 2.
Let Z T N ( λ ) . Then Var ( Z ) = 1 + λ 2 and the kurtosis satisfy
κ ( λ ) = E ( Z 4 ) Var ( Z ) 2 = λ 4 + 6 λ 2 + 3 ( 1 + λ 2 ) 2 = 3 2 λ 4 ( 1 + λ 2 ) 2 < 3 , λ > 0 ,
with equality only at λ = 0 (the standard normal case).
Remark 3.
Let Z T N ( λ ) with pdf f Z ( z ; λ ) = 1 2 ( ϕ ( z λ ) + ϕ ( z + λ ) ) . Then f Z ( z ; λ ) = f Z ( z ; λ ) for all z R , so T N ( λ ) is symmetric about 0. In particular, all odd moments are zero (when they exist), and hence the skewness is 0.
The family T N ( λ ) thus preserves the interpretability and computational convenience of the Gaussian law (closed-form pdf, cdf, and simple moments) while introducing a tunable shape mechanism that accommodates platykurtosis and, for sufficiently large separation λ , the emergence of bimodal features that are valuable in applications involving censoring and bounded support. Moreover, asymmetric generalizations (“asymmetric double normal”) extend this construction to skewed settings while retaining many of these analytical advantages [5]. These properties make T N ( λ ) a natural building block for the censored and doubly truncated models developed next [4,5]. From a probabilistic standpoint, increasing λ enhances the contribution of the component associated with larger latent locations, inflating upper-tail probabilities, while decreasing λ emphasizes the lower-tail behavior, making it a key parameter for capturing tail risk and latent subpopulation structure without introducing discrete mixture indicators.
Proposition 2.
Let Z have a density given in Equation (1). Then f Z ( · ) is unimodal if 0 λ 1 and bimodal if λ > 1 . Moreover,
f ( 0 ) = ( λ 2 1 ) ϕ ( λ ) ,
so that z = 0 is a local maximum if λ < 1 and a local minimum if λ > 1 .
Proof. 
By symmetry, f ( 0 ) = 0 . Differentiating twice and evaluating at 0 yields the expression above. For λ > 1 , 0 is a local minimum, and two symmetric modes appear.    □

3. Censored Two-Piece Normal (CTN) Models

3.1. Motivation: Censored Outcomes and Tobit-Type Models

It is common to observe response variables subject to a known bound T (lower, upper, or both), with a substantial fraction of the sample concentrated at that bound. This mixture of a point mass and a continuous component was popularized by [1] and is widely known as the Tobit model. In its standard left-censored form, one posits an unobserved latent outcome y i * = x i β + ε i with ε i N ( 0 , σ 2 ) and observes y i = max { y i * , T } for i = 1 , , n , that is
y i = x i β + ε i , if x i β + ε i > T , T , otherwise ,
which admits well-known likelihood expressions and inferential procedures [1]. In many applications, however, the proportion of censored observations, or the tail behavior of y i * , substantially exceeds what a Gaussian latent error can accommodate, motivating alternatives that relax distributional assumptions and better handle excess mass at the boundary and tail thickness [12]. Examples include detection-limit problems and antibody response analyzes, where log-normal and related specifications are natural baselines, as well as broader families that capture asymmetry and tail behavior beyond the censored normal [12,13]. In this spirit, power-normal, t-based, and other shape-parameter models have been explored under censoring. Our focus, instead, is on two-piece constructions that preserve analytical convenience while offering extra shape flexibility [4,5].

3.2. Censored Two-Piece Normal Model

We consider a left-censoring threshold T R and a latent outcome Y i * = μ + σ Z i with Z i T N ( λ ) , where μ R , σ > 0 , and λ 0 . The observed variable is
Y i = max { Y i * , T } , i = 1 , , n ,
so that observations at the boundary T are censored realizations of Y i * .
Remark 4.
For σ > 0 and λ 0 , the location-scale TN model is identifiable via Y * 1 2 N ( μ σ λ , σ 2 ) + 1 2 N ( μ + σ λ , σ 2 ) (the constraint λ 0 removes sign/label switching). Identification weakens at the boundary λ = 0 (Gaussian submodel) and may be practically compromised under heavy censoring/truncation when only one side of the distribution is effectively observed.
Definition 2.
If Y i * T N ( μ , σ , λ ) and Y i are given by (5), then Y i follows a censored two-piece normal distribution with parameters ( μ , σ , λ ; T ) , denoted Y i C T N ( μ , σ , λ ; T ) .
The distribution of Y i is a mixture of a point mass at T and a continuous component on ( T , ) :
P ( Y i = T ) = F Z T μ σ ; λ ,
f Y ( y i ) = 1 σ f Z y i μ σ ; λ , y i > T .
Equivalently, the cdf of Y i satisfies F Y ( y ) = 0 for y < T , F Y ( T ) = F Z ( T μ ) / σ ; λ , and F Y ( y ) = F Z ( y μ ) / σ ; λ for y > T .
Remark 5.
(i) Right-censoring at a known upper censoring point U (that is, Y i = min { Y i * , U } ) and interval censoring follow from symmetry and standard modifications of (6). (ii) Allowing subject-specific thresholds T i is immediate by replacing T with T i throughout. (iii) The representation in (1) is often more transparent for derivations and interpretation than cosine-hyperbolic tilts, since it explicitly states that the latent kernel is an equal-weight mixture of two unit-variance normals centered at ± λ .
Proposition 3.
Let Y * = μ + σ Z T N ( μ , σ , λ ) with μ R , σ > 0 , and define Y = max { Y * , T } for a fixed left-censoring threshold T R . Let z T = ( T μ ) / σ , Φ λ ( x ) : = F Z ( x ; λ ) = 1 2 ( Φ ( x λ ) + Φ ( x + λ ) ) , ϕ λ ( x ) : = 1 2 ( ϕ ( x λ ) + ϕ ( x + λ ) ) , and S Φ ( x ) : = 1 Φ ( x ) . Then:
(i) 
Mass at the censoring point.
P ( Y T ) = P ( Y = T ) = Φ λ ( z T ) = 1 2 ( Φ ( z T λ ) + Φ ( z T + λ ) ) .
(ii) 
Mixed measure, Y has an atom at T with mass given in (a), and for y > T it has density
f Y ( y ) = 1 σ f Z y μ σ ; λ = ϕ y μ σ λ + ϕ y μ σ + λ 2 σ .
There is no Lebesgue density at y = T ; all mass there is the atom in (a).
(iii) 
Conditional cdf given Y > T . For y > T and z = ( y μ ) / σ ,
P ( Y y | Y > T ) = Φ λ ( z ) Φ λ ( z T ) 1 Φ λ ( z T ) = ( Φ ( z λ ) Φ ( z T λ ) ) + ( Φ ( z + λ ) Φ ( z T + λ ) ) S Φ ( z T λ ) + S Φ ( z T + λ ) .
(iv) 
Raw r-th moment. For any integer r 0 ,
E ( Y r ) = T r Φ λ ( z T ) + 1 2 k = 0 r r k μ r k σ k j = 0 k k j λ k j I j ( z T λ ) + ( 1 ) k j I j ( z T + λ ) ,
where I j ( α ) : = α u j ϕ ( u ) d u .
(v) 
Mean and second moment.
E ( Y ) = T Φ λ ( z T ) + μ 1 Φ λ ( z T ) + σ ϕ λ ( z T ) + σ λ 2 Φ ( z T + λ ) Φ ( z T λ ) ,
and, using I 2 ( α ) = S Φ ( α ) + α ϕ ( α ) ,
E ( Y 2 ) = T 2 Φ λ ( z T ) + 1 2 k { 1 , 1 } σ 2 + ( μ + k σ λ ) 2 S Φ ( z T k λ ) + σ ( T + μ + k σ λ ) ϕ ( z T k λ ) .
Proof. 
(i) Since Y = max { Y * , T } , we have { Y = T } = { Y * T } , and thus P ( Y = T ) = P ( Y * T ) = F Z ( z T ; λ ) = Φ λ ( z T ) .
(ii) For y > T , we have Y = Y * ; hence f Y ( y ) = f Y * ( y ) = ( 1 / σ ) f Z ( ( y μ ) / σ ; λ ) . At y = T , the distribution has an atom with mass given by (a).
(iii) For y > T ,
P ( Y y Y > T ) = P ( T < Y y ) P ( Y > T ) = F Z ( z ; λ ) F Z ( z T ; λ ) 1 F Z ( z T ; λ ) .
Substituting F Z ( · ; λ ) = 1 2 ( Φ ( · λ ) + Φ ( · + λ ) ) and simplification yield the stated form.
(iv) Write
E ( Y r ) = T r P ( Y = T ) + T x r f Y * ( x ) d x , f Y * ( x ) = 1 2 σ ϕ x μ σ λ + ϕ x μ σ + λ .
With t = ( x μ ) / σ and then u = t λ in the first term and u = t + λ in the second term, expand ( μ + σ t ) r = k = 0 r r k μ r k σ k t k and ( u ± λ ) k = j = 0 k k j u j ( ± λ ) k j to obtain (8) in terms of I j ( α ) = α u j ϕ ( u ) d u .
(v) Take r = 1 , 2 in (8) and use I 0 ( α ) = S Φ ( α ) , I 1 ( α ) = ϕ ( α ) , and I 2 ( α ) = S Φ ( α ) + α ϕ ( α ) , together with S Φ ( a ) S Φ ( b ) = Φ ( b ) Φ ( a ) and the definition of ϕ λ ( · ) , to obtain the stated closed forms.    □
Remark 6.
In the standard case μ = 0 and σ = 1 , replace z T with T in all expressions.

3.3. Censored Two-Piece Normal Regression

We extend the classical (Gaussian) Tobit regression to the two-piece normal family. Let
Y i * = x i β + ε i , ε i = σ Z i , Z i T N ( λ ) ,
with μ R absorbed in x i β , scale σ > 0 , and shape λ 0 . The observed response is left-censored at a known threshold T R as follows:
Y i = max { Y i * , T } , i = 1 , , n .
We write z i = ( y i x i β ) / σ and z T , i = ( T x i β ) / σ use the shorthand
Φ λ ( z ) = 1 2 ( Φ ( z λ ) + Φ ( z + λ ) ) , ϕ λ ( z ) = 1 2 ( ϕ ( z λ ) + ϕ ( z + λ ) ) .
The law of Y i is a mixed measure with an atom at T and a continuous component on ( T , ) :
P ( Y i = T x i ) = Φ λ ( z T , i ) , f Y i ( y x i ) = 1 σ ϕ λ y x i β σ , y > T .
Equivalently,
F Y i ( y x i ) = 0 , y < T , Φ λ ( z T , i ) , y = T , Φ λ ( y x i β ) / σ , y > T .
Given Y i > T , the conditional density is
f Y i Y i > T ( y x i ) = σ 1 ϕ λ y x i β σ 1 Φ λ ( z T , i ) , y > T .
From the CTN moment formulas,
E ( Y i x i , Y i > T ) = x i β + σ ϕ λ ( z T , i ) 1 Φ λ ( z T , i ) + σ λ 2 Φ ( z T , i + λ ) Φ ( z T , i λ ) 1 Φ λ ( z T , i ) ,
and the unconditional mean is
E ( Y i x i ) = T Φ λ ( z T , i ) + 1 Φ λ ( z T , i ) E ( Y i x i , Y i > T ) .
Remark 7.
(i) When λ 0 , Z i N ( 0 , 1 ) , the model reduces to Gaussian Tobit regression. (ii) Right-censoring at a known U (for example, Y i = min { Y i * , U } ) or observation-specific thresholds T i is followed by replacing T with U or T i in the formulas above. (iii) A zero-threshold reparameterization is immediate: define y i * * = y i T , x i * * = ( x i , T ) , β * * = ( β , 1 ) , such that y i * * = max { x i * * β * * + ε i , 0 } .

4. Doubly Truncated Two-Piece Normal (TTN) Models on (0, 1)

4.1. Two-Piece Normal Truncated to the Unit Interval

Following the truncation generator in [6], let F be a distribution function on ( a , b ) with a 0 < 1 b . The distribution function of the variable U truncated to ( 0 , 1 ) is
F U ( 0 , 1 ) ( u ) = F ( u ) F ( 0 ) F ( 1 ) F ( 0 ) , 0 < u < 1 .
More generally, the TF-G class uses a baseline cdf F and a transformation cdf G; the induced cdf is
G X ( x ) = F ( G ( x ) ) F ( 0 ) F ( 1 ) F ( 0 ) .
If f and g denote the densities of F and G, then
f X ( x ) = g ( x ) f ( G ( x ) ) F ( 1 ) F ( 0 ) , S X ( x ) = F ( 1 ) F ( G ( x ) ) F ( 1 ) F ( 0 ) , h X ( x ) = g ( x ) f ( G ( x ) ) F ( 1 ) F ( G ( x ) ) .
Definition 3.
Let Y * = μ + σ Z with Z T N ( λ ) . Define Y = Y * | 0 < Y * < 1 . Then the TTN ( μ , σ , λ ) distribution on ( 0 , 1 ) has
f ( y ) = f Z z y ; λ σ ( F Z ( z 1 ; λ ) F Z ( z 0 ; λ ) ) = ϕ ( z y λ ) + ϕ ( z y + λ ) σ ( Φ ( z 1 λ ) + Φ ( z 1 + λ ) + Φ ( z 0 λ ) + Φ ( z 0 + λ ) 2 ) , F ( y ) = F Z ( z y ; λ ) F Z ( z 0 ; λ ) F Z ( z 1 ; λ ) F Z ( z 0 ; λ ) = Φ ( z y λ ) + Φ ( z y + λ ) + Φ ( z 0 λ ) + Φ ( z 0 + λ ) 2 Φ ( z 1 λ ) + Φ ( z 1 + λ ) + Φ ( z 0 λ ) + Φ ( z 0 + λ ) 2 , S ( y ) = F Z ( z 1 ; λ ) F Z ( z y ; λ ) F Z ( z 1 ; λ ) F Z ( z 0 ; λ ) = Φ ( z 1 λ ) + Φ ( z 1 + λ ) Φ ( z y λ ) Φ ( z y + λ ) Φ ( z 1 λ ) + Φ ( z 1 + λ ) + Φ ( z 0 λ ) + Φ ( z 0 + λ ) 2 , h ( y ) = f ( y ) S ( y ) = ϕ ( z y λ ) + ϕ ( z y + λ ) σ ( Φ ( z 1 λ ) + Φ ( z 1 + λ ) Φ ( z y λ ) Φ ( z y + λ ) ) , 0 < y < 1 ,
where z y = ( y μ ) / σ , z 0 = ( 0 μ ) / σ = μ / σ , z 1 = ( 1 μ ) / σ and the normalizing constant is
F Z ( z 1 ; λ ) F Z ( z 0 ; λ ) = 1 2 ( Φ ( z 1 λ ) + Φ ( z 1 + λ ) + Φ ( z 0 λ ) + Φ ( z 0 + λ ) 2 ) .
Remark 8.
(i) Since F Z is symmetric, F Z ( u ; λ ) = 1 F Z ( u ; λ ) ; writing the formulas via z 0 = μ / σ and z 1 = ( 1 μ ) / σ makes the cancelation of the factor 1 / 2 explicit. (ii) As λ 0 , TTN ( μ , σ , λ ) reduces to the doubly truncated Gaussian on ( 0 , 1 ) .

4.2. Covariates in the Unit-Interval

Let the latent model be
Y i * = μ i + σ Z i , Z i T N ( λ ) , σ > 0 , λ 0 ,
and the observed response must be doubly truncated to ( 0 , 1 )
Y i = Y i * | 0 < Y i * < 1 , i = 1 , , n .
We link the location parameter to covariates via a strictly monotonic, twice differentiable link g : ( 0 , 1 ) R :
g ( μ i ) = η i = x i β , x i = ( 1 , x 1 i , , x p i ) , β = ( β 0 , , β p ) .
We adopt the logit link for interpretability,
g ( μ i ) = log μ i 1 μ i = η i , μ i = e η i 1 + e η i , μ i β = μ i ( 1 μ i ) x i .
Under this specification, increasing covariate x k by m units multiplies the odds μ i / ( 1 μ i ) by exp ( m β k ) (holding the others fixed). Hence
Y i T T N ( μ i , σ , λ ) on ( 0 , 1 ) , i = 1 , , n .
Remark 9.
The likelihood below is for the doubly truncated model on ( 0 , 1 ) (TTN), not for censored data: the term N i = Pr ( 0 < Y i * < 1 x i ) is the truncation normalizing constant, which yields the contribution i = 1 n log ( N i ) in the log-likelihood.
Using the tilted TN kernel, the sample log-likelihood is
( Θ ) = n 2 log ( 2 π ) n log ( σ ) n 2 λ 2 1 2 i = 1 n z i 2 + i = 1 n log cosh ( λ z i ) i = 1 n log ( N i ) ,
for Θ = ( β , σ , λ ) . Therefore, the score function is defined as the derivative of the log-likelihood function with respect to each parameter.
λ = i = 1 n λ + z i tanh i i = 1 n ϕ ( A i ) + ϕ ( B i ) ϕ ( C i ) + ϕ ( D i ) N i , β j = 1 σ i = 1 n x i j μ i ( 1 μ i ) z i λ tanh i 1 σ i = 1 n x i j μ i ( 1 μ i ) ϕ ( A i ) ϕ ( B i ) + ϕ ( C i ) + ϕ ( D i ) N i , j = 0 , , p σ = 1 σ i = 1 n 1 + z i 2 λ z i tanh i + 1 σ 2 i = 1 n ( 1 μ i ) ( ϕ ( A i ) + ϕ ( B i ) ) + μ i ( ϕ ( C i ) + ϕ ( D i ) ) N i .
where z i = ( y i μ i ) / σ , A i = ( 1 μ i ) / σ λ , B i = ( 1 μ i ) / σ + λ , C i = μ i / σ λ , D i = μ i / σ + λ , tanh i = tanh ( λ z i ) , and the truncation normalizer N i = Φ ( A i ) + Φ ( B i ) + Φ ( C i ) + Φ ( D i ) 2 .
Remark 10.
MLEs are obtained by solving the score equations numerically. Standard errors follow from the inverse of the observed information at θ ^ ; Wald intervals and likelihood-ratio tests are standard.

5. Survival Models with Log-Two-Piece Normal Baseline and Gamma Frailty

5.1. Log-Two-Piece Normal with Gamma Frailty: Foundations

In survival analysis, unobserved heterogeneity is often modeled via a non-negative random effect (frailty) that multiplies the hazard [14]. Shared frailty in proportional hazards settings was developed in [15,16]; see [7] for an overview. We adopt a frailty specification over a log-two-piece normal kernel. Let Y i = min { T i , C i } and δ i = 1 { T i C i } , i = 1 , , n . Conditioned on V i with E ( V i ) = 1 ,
h ( y V i , x i ) = V i h ( y x i ) , log ( T i ) = μ i + σ Z i , Z i T N ( λ ) , z i = log ( y ) μ i σ , μ i = x i β , σ > 0 , λ 0 ,
where Φ is the standard normal cdf, Φ c ( u ) = 1 Φ ( u ) , and ϕ its pdf.
Baseline (no frailty).
F ( y x i ) = 1 1 2 Φ c ( z i λ ) + Φ c ( z i + λ ) , S ( y x i ) = 1 2 Φ c ( z i λ ) + Φ c ( z i + λ ) .
f ( y x i ) = 1 2 σ y ϕ ( z i λ ) + ϕ ( z i + λ ) , h ( y x i ) = 1 σ y ϕ ( z i λ ) + ϕ ( z i + λ ) Φ c ( z i λ ) + Φ c ( z i + λ ) .
H ( y x i ) = log ( S ( y x i ) ) = log 1 2 Φ c ( z i λ ) + Φ c ( z i + λ ) .
Frailty (conditional and marginal).
Given V i = v i ,
S ( y v i , x i ) = exp ( v i H ( y x i ) ) , h ( y v i , x i ) = v i h ( y x i ) .
If L V ( s ) = E ( e s V ) is the Laplace transform of V, then
S m ( y x i ) = L V H ( y x i ) .
For V Gamma ( 1 / θ , θ ) ,
S m ( y x i ) = 1 + θ H ( y x i ) 1 / θ , h m ( y x i ) = h ( y x i ) 1 + θ H ( y x i ) .
Likelihood. Note that now Θ = ( β , σ , λ , θ ) and
L i ( Θ v i ) = f ( y i v i , x i ) δ i S ( y i v i , x i ) 1 δ i ,
the complete likelihood is
L ( Θ ) = i = 1 n 0 L i ( Θ v i ) f V ( v i ) d v i , ( Θ ) = log ( L ( Θ ) ) .
For each i, using f ( y v i , x i ) = h ( y v i , x i ) S ( y v i , x i ) and S ( y v i , x i ) = exp ( v i H ( y x i ) ) , we can write
L i ( Θ v i ) = v i h ( y i x i ) δ i exp ( v i H ( y i x i ) ) .
Therefore, the marginal contribution is
L i ( Θ ) = 0 L i ( Θ v i ) f V ( v i ) d v i = h ( y i x i ) δ i E V δ i e H ( y i x i ) V .
With S m ( y x i ) = L V ( H ( y x i ) ) , we have E ( e H V ) = L V ( H ) = S m (when δ i = 0 ) and E ( V e H V ) = L V ( H ) (when δ i = 1 ), so that f m ( y x i ) = h ( y x i ) ( L V ( H ( y x i ) ) ) = h m ( y x i ) S m ( y x i ) . Hence, L i ( Θ ) = h m ( y i x i ) δ i S m ( y i x i ) .
Under Gamma frailty, the marginal log-likelihood is
( Θ ) = i = 1 n δ i log ( h m ( y i x i ) ) + log ( S m ( y i x i ) ) .
Other authors, such as [17], incorporate generalized Gamma and Normal random effects into a Weibull distribution.

Left-Censoring with Gamma Frailty: Model and Marginal Likelihood

We analyze left-censored survival data under a log-two-piece normal baseline with Gamma frailty [18,19,20,21]. Let y i be the observed time and let δ i = 1 indicate an exact time; δ i = 0 represents a left-censored observation at y i ( i = 1 , , n ) . Using the marginal functions in Section 5.1
L i ( Θ ) = f m ( y i x i ) δ i F m ( y i x i ) 1 δ i , F m ( y x ) = 1 S m ( y x ) ,
the log-likelihood is
( Θ ) = i = 1 n δ i log ϕ ( z i λ ) + ϕ ( z i + λ ) log ( σ y i ) log Φ c ( z i λ ) + Φ c ( z i + λ ) 1 θ log ( θ ) log ( Γ ( 1 θ ) ) + log Γ ( 1 θ + δ i ) Γ ( 1 θ ) 1 θ + δ i log ( A i ) 1 θ + δ i ( 1 δ i ) Γ ( 1 θ + δ i ) Γ ( 1 θ ) 1 θ + ( δ i + 1 ) log ( A i ) 1 θ + δ i .
Maximization of (13) yields ( β ^ , σ ^ , λ ^ , θ ^ ) . Technical details of the integral derivation (mixing over V), including score functions and the Hessian, are provided in Appendix A.

5.2. Likelihood Inference for CTN

5.2.1. Likelihood

The likelihood splits naturally because the observed response combines a point mass at the censoring threshold and a continuous component on ( T , ) . Throughout, we write z T , i = ( T x i β ) / σ and z i = ( y i x i β ) / σ .

5.2.2. Censored Contribution ( y i = T )

For a censored observation, the individual contribution is the probability that the latent variable falls below the threshold, evaluated under the two-piece normal kernel. Taking logs yields
1 i ( Θ ) = log Φ λ ( z T , i ) = log Φ ( z T , i λ ) + Φ ( z T , i + λ ) log ( 2 ) ,
where the term − log ( 2 ) reflects the equal mixture normalization implicit in Φ λ .

5.2.3. Uncensored Contribution ( y i > T )

For an uncensored outcome, the contribution equals the continuous density at y i . Using the tilted representation f Z ( z ; λ ) = ϕ ( z ) e λ 2 / 2 cosh ( λ z ) of the two-piece kernel, the log-term becomes
2 i ( Θ ) = 1 2 log ( 2 π ) log ( σ ) 1 2 z i 2 1 2 λ 2 + log cosh ( λ z i ) .
Full log-likelihood. Collecting both cases by means of the indicator δ i = 1 { y i > T } , the sample log-likelihood can be written compactly as
( Θ ) = i = 1 n ( 1 δ i ) 1 i ( Θ ) + δ i 2 i ( Θ ) ,
so that δ i simply toggles the discrete (censored) and continuous (uncensored) contributions. This decomposition will underpin the score functions and information matrices derived below.

5.3. Score Functions

5.3.1. Censored Term ( y i = T )

For observations piled up at the censoring point, the contribution to the score stems solely from the probability mass at T. In this setting, the gradient behaves like a probit-type score evaluated at the standardized index z T , i , but under the two-piece kernel, which modulates the tails through λ . The resulting components are
1 i β = ϕ λ ( z T , i ) σ Φ λ ( z T , i ) x i , 1 i σ = ϕ λ ( z T , i ) Φ λ ( z T , i ) z T , i σ , 1 i λ = ϕ + ( z T , i ) ϕ ( z T , i ) Φ ( z T , i ) + Φ + ( z T , i ) .
Here, the score in β pushes the linear predictor upward to reduce the chance of censoring, while the λ -component contrasts the two flanks of the mixture via ϕ ± .

5.3.2. Uncensored Term ( y i > T )

When the outcome is observed, the score follows from the tilted-normal density in the continuous region. The hyperbolic terms arise from differentiating log cosh ( λ z i ) , since λ log cosh ( λ z i ) = z i tanh ( λ z i ) and z i log cosh ( λ z i ) = λ tanh ( λ z i ) . This yields
2 i β = z i λ tanh ( λ z i ) σ x i , 2 i σ = 1 + z i 2 λ z i tanh ( λ z i ) σ , 2 i λ = λ + z i tanh ( λ z i ) .
The correction terms involving tanh ( λ z i ) capture how departures from normality reweight central versus tail residuals.

5.3.3. Compact Form with Indicators

Combining both data types is conveniently handled with an indicator that toggles the appropriate contributions. Letting δ i = 1 { y i > T } , the overall score becomes
( Θ ) β = i = 1 n ( 1 δ i ) ϕ λ ( z T , i ) σ Φ λ ( z T , i ) x i + δ i z i λ tanh ( λ z i ) σ x i , ( Θ ) ) σ = i = 1 n ( 1 δ i ) ϕ λ ( z T , i ) Φ λ ( z T , i ) z T , i σ + δ i 1 + z i 2 λ z i tanh ( λ z i ) σ , ( Θ ) ) λ = i = 1 n ( 1 δ i ) ϕ + ( z T , i ) ϕ ( z T , i ) Φ ( z T , i ) + Φ + ( z T , i ) + δ i λ + z i tanh ( λ z i ) ,
where ϕ ± ( z ) = ϕ ( z ± λ ) and Φ ± ( z ) = Φ ( z ± λ ) .
This representation is compact for implementation and clarifies the limit λ 0 , where the expressions collapse to the Gaussian Tobit score.

5.4. Observed and Fisher Information

Let I ( Θ ) = 2 ( Θ ) be the observed information. Writing H 1 i = 2 1 i ( Θ ) ) and H 2 i = 2 2 i ( Θ ) ) ,
I ( Θ ) ) = i = 1 n ( 1 δ i ) H 1 i + δ i H 2 i .
Closed forms for the blocks of H 2 i (uncensored) follows from differentiating (15), letting t i = tanh ( λ z i ) and s i = sech 2 ( λ z i )
2 2 i β β = 1 + λ 2 s i x i x i σ 2 , 2 2 i β σ = 2 z i + λ t i + λ 2 z i s i σ 2 x i , 2 2 i β λ = t i λ z i s i σ x i , 2 2 i σ 2 = 1 3 z i 2 + 2 λ z i t i + λ 2 z i 2 s i σ 2 , 2 2 i σ λ = z i t i λ z i 2 s i σ , 2 2 i λ 2 = 1 + z i 2 s i .
For censored terms, define R i = ϕ λ ( z T , i ) / Φ λ ( z T , i ) , A i = Φ ( z T , i λ ) , B i = Φ ( z T , i + λ ) , a i = ϕ ( z T , i λ ) , b i = ϕ ( z T , i + λ ) . Then
2 1 i β β = R i σ 2 x i x i , 2 1 i β σ = R i z T , i σ 2 + R i σ 2 x i , 2 1 i σ 2 = R i z T , i 2 + 2 R i z T , i σ 2 ,
with
R i = ϕ λ ( z T , i ) Φ λ ( z T , i ) ϕ λ ( z T , i ) 2 Φ λ ( z T , i ) 2 , ϕ λ ( z ) = 1 2 ( z λ ) ϕ ( z λ ) + ( z + λ ) ϕ ( z + λ ) .
Mixed derivatives with respect to λ can be obtained analogously; in practice, we compute I ( Θ ) analytically for uncensored terms and numerically (via automatic differentiation or finite differences) for censored terms to enhance stability.
The Fisher information is J F ( Θ ) = E ( I ( Θ ) ) . Closed forms are unwieldy; throughout, we use the observed information at the MLE as a consistent estimator Σ ^ Θ = I ( Θ ^ ) 1 .

6. Inference and Practical Implementation

The model-specific likelihoods, score functions, and observed information are presented within each modeling section as follows: CTN in Section 3, TTN in Section 4, and the survival frailty model in Section 5. This section consolidates general inferential remarks (asymptotics, standard errors, and boundary considerations) that apply across the proposed likelihood-based specifications.

Asymptotic Theory

Let Θ denote the model parameter vector with dimension d (for example, d = p + 2 in the case of CTN/TTN and d = p + 3 for the Gamma-frailty survival specification). Under standard regularity conditions (identifiability, interior parameter values for the nuisance/scale components, and a non-degenerate sampling scheme, for instance, for CTN 0 < Pr ( Y = T ) < 1 , for TTN 0 < Pr ( 0 < Y * < 1 ) < 1 , and for survival 0 < Pr ( δ = 1 ) < 1 ), the MLE is consistent and asymptotically normal as follows:
Θ ^ d N d Θ 0 , J F ( Θ 0 ) 1 , Σ ^ Θ I ( Θ ^ ) 1 p J F ( Θ 0 ) 1 .
Wald intervals for component Θ are used:
Θ ^ ± z 1 α / 2 Σ ^ Θ .
Wald tests, likelihood-ratio tests and score tests are standard. When enforcing positivity constraints, we recommend reparameterizations such as η = log ( σ ) and ξ = log ( 1 + λ ) ; for frailty models, one may additionally use ζ = log ( θ ) . Standard errors on the original scale follow from the delta method with the appropriate Jacobian.
Remark 11.
Under H 0 : λ = 0 , the parameter lies on the boundary of the parameter space ( λ 0 ). Consequently, the usual χ 1 2 approximation for the LR statistic may fail; under standard regularity conditions, a mixture 1 2 χ 0 2 + 1 2 χ 1 2 typically arises.

7. Simulation Study

Here, LTN denotes the log-two-piece normal baseline survival model (Section 5); left-censoring at L enters the likelihood through F ( · ) or F m ( · ) , paralleling the boundary contribution induced by censoring in the CTN setting.
We conducted a Monte Carlo study to assess the finite-sample performance of the MLEs for the left-censored LTN regression model. A categorical covariate with three levels was simulated, and inference was based on 1000 replicates with sample sizes n { 100 , 500 , 1000 } . The true parameters were β 0 = 1 , β 1 = 0.5 , β 2 = 0.5 , σ = 1.0 , and λ { 0.8 , 1.6 , 2.0 } . For the Gamma-frailty extension, we considered θ { 0.1 , 0.25 } . We report the mean estimates, absolute bias, RMSE, and Monte Carlo standard error (SE). Results are shown in Table 1, Table 2 and Table 3; NF denotes the LTN model without frailty. Estimation used optim in R with the BFGS method.
We note that we do not include a separate TTN simulation study for proportion data: the TTN model on ( 0 , 1 ) is a purely doubly truncated continuous specification, and its likelihood-based estimation follows the standard truncated-likelihood construction. Our simulations, therefore, focus on the settings that introduce the main inferential challenges of the paper, namely censoring-induced point mass (CTN) and frailty marginalization in survival models.
The data were generated as follows:
(i)
Fix the model parameters: β = ( β 0 , β 1 , β 2 ) , σ > 0 , the LTN shape parameter λ 0 , the frailty variance θ > 0 (when applicable), the sample size n, and the target left-censoring proportion p c .
(ii)
For each i = 1 , , n , draw a group indicator g i { 0 , 1 , 2 } with equal probabilities. Construct x i and set μ i = x i β . In the frailty model, draw v i Gamma ( 1 / θ , θ ) ; otherwise, set v i 1 . Conditional on ( x i , v i ) , draw U i Uniform ( 0 , 1 ) , set s i = U i 1 / v i , and obtain the failure time by inversion as follows:
T i = S 0 1 s i μ i , σ , λ ,
where S 0 is the LTN baseline survival function (no frailty).
(iii)
Given p c , choose a common threshold L such that Pr ( T i L ) p c by solving
E F ( L μ i , σ , λ , θ ) p c = 0 ,
using F 0 ( · ) for the model without frailty and F m ( · ) for the marginal frailty model. Finally, define the observed time and indicator as follows:
Y i = T i , δ i = 1 if T i > L ( exact ) , L , δ i = 0 if T i L ( left-censored ) .
Figure 1, Figure 2 and Figure 3 display MSEs by censoring level (0%, 25%, 50%) and sample size. Panels labeled without frailty correspond to independence. The MSEs of the regression coefficients are most affected by heavier censoring, particularly under independence, and decrease as n grows. The LTN parameters show some sensitivity in λ , while σ is comparatively stable. Overall, with or without frailty, the estimators track the reference values well, even under censoring.

8. Applications

We present two real-data examples to illustrate two-piece normal models for censored data, both with and without covariates, and to highlight their practical value under censoring.

8.1. HIV Dataset

As a first illustration, we analyze HIV-1 RNA measurements from the Colombian SIVIGILA surveillance system (Ministry of Health). The database preserves patient anonymity and records age, sex, date of entry, HAART (highly active antiretroviral therapy) status at different stages, CD4 count, and viral load (copies of HIV-1 RNA). We focus on 106 women with at least one year of HAART. The assay detection limit is L = 50 copies/mL; on the log scale, the censoring threshold is log 10 ( L ) . In this sample, 34.9 % of observations fall below L.
We fit the censored two-piece normal model to the log 10 viral-load data. For the 69 uncensored observations (women on HAART ≥ 1 year), the summary statistics were: mean 3.4101 , variance 1.4249 , skewness 0.3549 , and kurtosis 1.9836 . The mild right skewness and kurtosis below 3 indicate lighter-than-normal tails, a feature that CTN accommodates under censoring.
Hartigan’s dip test [11] yields D = 0.0686 (p-value = 0.0133 ), rejecting unimodality. Alongside CTN, we also fitted a censored bimodal normal (CBN) model and a censored flexible normal (CFN) model (see [22] for a flexible bimodal normal family). Table 4 reports the corresponding MLEs (with standard errors), information criteria, and Anderson–Darling goodness-of-fit results. Maximum likelihood estimation was carried out using the optim function in R. Model comparison relied on AIC [8], BIC [9], and CAIC [10]. Larger p-values in the Anderson–Darling (AD) goodness-of-fit test indicate better agreement with the fitted cdf; we computed the AD statistic using the goftest package in R. Note that in the CFN competitor, the shape parameter is not restricted in sign; hence, negative estimates (for instance, when λ ^ < 0 ) are admissible; by contrast, in our TN-based models, we assume λ 0 by construction.
As shown in Table 4, according to AIC/BIC/CAIC, CTN and CFN provide the best fits (in that order). The AD p-values likewise support CTN/CFN for these data. The observed censoring proportion is 34.9 % . Model-implied expected censoring proportions, computed as p ^ c = n 1 i = 1 n F ^ ( L x i ) , were 32.98 % (CTN), 31.11 % (CFN), and 21.25 % (CBN), indicating that CTN and CFN closely reproduce the observed censoring level.

8.2. FoodExpenditure Dataset

Following [3], we analyze the FoodExpenditure data, where the response is the proportion of household income spent on food ( Y ( 0 , 1 ) ), and the covariates are household income ( X 1 ) and household size ( X 2 ) [23]. The data are distributed using the R package betareg [24].
We fit three regression models for Y as follows: (i) a beta regression as in [3]; (ii) a doubly truncated normal (truncated Gaussian, TG); and (iii) a doubly truncated two-piece normal (TTN), noting that TG is the limit of TTN as λ 0 . Table 5 reports MLEs (SEs) and the information criteria AIC, BIC, and CAIC. Estimation used optim in R; beta regression used betareg [24]. By all three criteria, TTN provides the best fit, followed by TG and then beta.
To check model adequacy and detect outliers, we used the transformed martingale residual r M T i with normal envelopes [25,26]. The QQ envelopes (Figure 4) indicate that TTN adheres more closely to the reference line than TG and beta, corroborating the information–criteria ranking. We also report the Anderson–Darling (AD) statistic with p-values in Table 5; larger p-values favor TTN over TG and beta.

8.3. Mroz Dataset

We analyze 753 married women from the classic labor–supply study: 428 work positive hours, while 325 do not. Let H i be the annual hours worked and define Y i = log H max H i + 1 , where H max = max j H j , so non-workers map to the point T 1 = log ( H max + 1 ) 8.5073 and workers to a continuous range below T 1 . As covariates, we use the number of children aged 6–18 ( X 1 ), the woman’s age ( X 2 ), and years of work experience ( X 3 ). We fit censored normal (CN), censored skew–normal (CSN), and censored two-piece normal (CTN) linear models by maximum likelihood.
Table 6 reports parameter estimates (SEs) and information criteria. According to AIC, BIC, and CAIC, the CTN model provides the best fit, followed by CSN and CN. QQ envelopes of the transformed martingale residuals r M T i (Figure 5) also favor CTN over the competitors.
The r M T i envelopes for CN, CSN, and CTN are shown in Figure 5a–c. The CTN envelopes adhere most closely to the reference, reinforcing the information–criteria ranking.

8.4. Childhood Cancer Dataset

We analyze children in Colombia (2023) with confirmed cancer who were hospitalized, obtained from the National Institute of Health via the SIVIGILA portal [27]. The outcome is the time from symptom onset to hospitalization (in days). Left-censoring arises when the onset date is unknown at the time of admission. Covariates are sex (female/male) and age in years at consultation. Figure 6 indicates a skewed distribution of times.
In total, n = 37 children were analyzed; the event proportion was 0.81 . Sample summaries were as follows: mean = 23 , sd = 33.2 , median = 9 , min = 0.25 , max = 123.64 (days). Additional descriptive statistics by censoring status and sex are reported in Appendix B (Table A1).

Model Estimates

We fitted log-two-piece normal baseline models for the time distribution, both without and with Gamma frailty V Gamma ( 1 / θ , θ ) . The parameters are the regression vector β , scale σ , shape λ , and (for frailty) variance θ . Table 7 reports MLEs (SEs); bold entries denote Wald p < 0.05 . The model without frailty is labeled LTN, and the model with Gamma frailty is labeled LTN–G. Covariates: β 0 (intercept), β 1 (male vs. female), and β 2 (age in years, centered).
Under the model without frailty, covariate effects are not statistically different from zero. In contrast, with Gamma frailty, there is strong evidence of association as follows: β ^ 1 = 0.87 and β ^ 2 = 0.08 , indicating (conditional on frailty) shorter expected times for males and for older children on the log–time scale. The estimated scale is σ ^ = 1.92 (SE 0.051 ), and the frailty variance θ ^ = 0.50 suggests substantial unobserved heterogeneity. A likelihood–ratio comparison favors the frailty model (LR = 450.91 , p < 0.01 ), with fit indices: LTN, = 225.70 , AIC = 461.40 , BIC = 469.45 , CAIC = 474.45 ; LTN–G, = 0.24 , AIC = 12.49 , BIC = 22.15 , CAIC = 28.15 . This specification accommodates baseline bimodality (via λ ) and heterogeneity (via θ ).

9. Conclusions

This paper develops a unified likelihood-based framework for limited outcomes using the two-piece normal construction. The framework covers a censored specification for boundary-inflated responses, a doubly truncated specification on ( 0 , 1 ) for rates and proportions, and a survival formulation with a log-two-piece normal baseline and Gamma frailty. We provide explicit closed-form building blocks (pdf, cdf, survival, hazard, and cumulative hazard) and likelihood expressions that support routine maximum likelihood estimation and reproducible implementation.
Monte Carlo results suggest that the maximum likelihood estimators exhibit a small bias and decreasing RMSE as the sample size increases. Censoring primarily increases the variability of regression coefficients; the scale parameter is comparatively stable, whereas the shape parameter can be more sensitive under heavy censoring. In the frailty setting, the variance component is recoverable at moderate sample sizes in the scenarios considered, and the observed information provides standard errors with satisfactory frequentist behavior.
The empirical analyzes broadly support these findings. For HIV-1 RNA subject to a detection limit, the censored two-piece normal attains competitive information–criterion values and passes goodness-of-fit checks; model-implied censoring proportions closely match the observed fraction. For household food expenditure on ( 0 , 1 ) , the doubly truncated two-piece normal improves fit relative to the truncated Gaussian and beta regression in these datasets. In the labor-supply application, the censored specification improves fit over Gaussian and skew-normal alternatives. For childhood cancer survival data, adding Gamma frailty to a log-two-piece baseline improves fit and highlights covariate effects consistent with unobserved heterogeneity; however, sensitivity to starting values and parameterization should be assessed.
Limitations and future work. While the proposed models retain closed forms and integrate naturally with standard diagnostics (information criteria, Anderson–Darling tests, QQ envelopes, and residual analyzes), caution is warranted under extreme censoring/truncation or weak separation, where the shape parameter may be weakly identified. Practical strategies include warm starts from simpler baselines, multiple initializations, and profile-likelihood checks; moreover, when λ approaches the boundary value 0, standard large-sample approximations for inference and likelihood-ratio testing may require adjustment. Finally, in the doubly truncated setting, regression on the location parameter does not necessarily imply mean regression, and interpretation should be made explicit. Future work includes Bayesian implementations with identifiability-aware priors, penalized and robust estimation in contaminated settings, shared-frailty and multilevel extensions, interval/combined censoring and time-varying covariates, semi-/nonparametric baselines within the two-piece construction, and tailored diagnostics for multimodality; public software and fully reproducible workflows would further support adoption in applied domains.

Author Contributions

Conceptualization, G.M.-F., H.S. and J.R.-M.; methodology, G.M.-F., H.S. and J.R.-M.; software, J.R.-M.; validation, G.M.-F., H.S. and J.R.-M.; formal analysis, G.M.-F., H.S. and J.R.-M.; investigation, G.M.-F., H.S. and J.R.-M.; writing—original draft preparation, G.M.-F., H.S. and J.R.-M.; writing—review and editing, H.S.; visualization, G.M.-F., H.S. and J.R.-M.; project administration, G.M.-F. and J.R.-M.; funding acquisition, G.M.-F. and J.R.-M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Vice-rectorate for Research of the Universidad de Córdoba, Colombia, project grant FCB-02-24: “Modelos estadísticos aplicados a eventos dependientes” (G.M.-F. and J.R.-M.).

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

The authors gratefully acknowledge the Universidad de Córdoba, Colombia for the financial support for this project.

Conflicts of Interest

The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:
RMSERoot Mean Squared Error
TNTwo-piece normal distribution
CTNTwo-piece normal distribution censored
TTNDoubly truncated two-piece normal distribution
LTNTwo-piece Lognormal distribution
LTN-GTwo-piece Lognormal distribution with Gamma frailty
AICAkaike Information Criterion
BICBayesian Information Criterion
CAICConsistent AIC

Appendix A. Derivation of the Left-Censoring Likelihood Under Gamma Frailty

Log-Two-Piece Normal Model with Left-Censoring and Gamma Frailty

We analyze left-censored survival data under a log-two-piece normal baseline with a multiplicative Gamma frailty. Let Y i be the survival time of subject i  ( i = 1 , , n ) , and let δ i = 1 indicate an exact observation and δ i = 0 a left-censored one at y i .
Under the CLTN baseline with covariates,
f ( y i x i ) = 1 2 σ y i ϕ log ( y i ) x i β σ λ + ϕ log ( y i ) x i β σ + λ ,
where ϕ is the standard normal density, σ > 0 is a scale parameter, λ controls the shape, and x i β shifts log ( Y i ) . Write z i = log ( y i ) x i β σ , Φ c ( u ) = 1 Φ ( u ) , with Φ as the standard normal cdf.
Frailty terms are assumed V Gamma ( 1 / θ , θ ) , so E ( V ) = 1 and Var ( V ) = θ > 0 represent unobserved heterogeneity. The marginal survival (integrating out v i ) is
S ( y i x i ) = 1 + θ H 0 ( y i x i ) 1 / θ , H 0 ( y i x i ) = log ( S 0 ( y i x i ) ) ,
where S 0 denotes the CLTN survival without frailty.
For left-censored data, the individual contribution is
L i ( β , σ , λ , θ ) = 0 f ( y i x i , v i ) δ i 1 S ( y i x i , v i ) 1 δ i f ( v i ) d v i ,
with f ( v ) = v 1 / θ 1 e v / θ / θ 1 / θ Γ ( 1 / θ ) . Hence,
L ( Θ ) = i = 1 n L i ( β , σ , λ , θ ) = i = 1 n 0 ( h ( y i x i , v i ) S ( y i x i , v i ) ) δ i 1 S ( y i x i , v i ) 1 δ i f ( v i ) d v i = i = 1 n 0 v i δ i f ( y i x i ) S ( y i x i ) S ( y i x i , v i ) δ i 1 S ( y i x i , v i ) 1 δ i v i 1 θ 1 exp ( v i / θ ) Γ ( 1 θ ) θ 1 / θ d v i = i = 1 n ϕ ( z i λ ) + ϕ ( z i + λ ) σ y i [ Φ c ( z i λ ) + Φ c ( z i + λ ) ] δ i 0 v i 1 θ + δ i 1 e v i / θ 1 2 Φ c ( z i λ ) + Φ c ( z i + λ ) δ i v i 1 1 2 Φ c ( z i λ ) + Φ c ( z i + λ ) v i 1 δ i d v i θ 1 θ Γ 1 θ = i = 1 n ϕ ( z i λ ) + ϕ ( z i + λ ) σ y i [ Φ c ( z i λ ) + Φ c ( z i + λ ) ] δ i 0 v i 1 θ + δ i 1 e ( v i θ v i δ i log 1 2 Φ c ( z i λ ) + Φ c ( z i + λ ) 1 ( 1 δ i ) ( 1 2 ( Φ c ( z i λ ) + Φ c ( z i + λ ) ) ) v i d v i θ 1 θ Γ ( 1 / θ ) .
Note that, S ( y i x i , v i ) = exp v i H ( y i x i ) = 1 2 Φ c ( z i λ ) + Φ c ( z i + λ ) v i . Also, the term 1 S ( y i x i , v i ) 1 δ i = 1 ( 1 δ i ) 1 2 ( Φ c ( z i λ ) + Φ c ( z i + λ ) ) v i for δ i = 1 or 0.
After simplification,
L ( Θ ) = i = 1 n 1 θ 1 θ Γ ( 1 θ ) ϕ ( z i λ ) + ϕ ( z i + λ ) σ y i [ Φ c ( z i λ ) + Φ c ( z i + λ ) ] δ i 0 v i 1 θ + δ i 1 e v i 1 θ δ i log ( A i ) d v i ( 1 δ i ) 0 v i 1 θ + δ i 1 e v i 1 θ ( δ i + 1 ) log ( A i ) d v i ,
where A i = 1 2 Φ c ( z i λ ) + Φ c ( z i + λ ) . These integrals reduce to Gamma functions via the substitutions
u = v i 1 θ δ i log ( A i ) , a n d u = v i 1 θ ( δ i + 1 ) log ( A i ) ,
which yields
L ( Θ ) = i = 1 n ϕ ( z i λ ) + ϕ ( z i + λ ) σ y i [ Φ c ( z i λ ) + Φ c ( z i + λ ) ] δ i 1 θ 1 θ Γ ( 1 θ ) × Γ ( 1 θ + δ i ) 1 θ + δ i log ( A i ) 1 θ + δ i ( 1 δ i ) Γ ( 1 θ + δ i ) 1 θ + ( δ i + 1 ) log ( A i ) 1 θ + δ i .
Finally, the log-likelihood is
( Θ ) = i = 1 n δ i log ϕ ( z i λ ) + ϕ ( z i + λ ) log ( σ y i ) log Φ c ( z i λ ) + Φ c ( z i + λ ) 1 θ log ( θ ) log ( Γ ( 1 θ ) ) + log Γ ( 1 θ + δ i ) Γ ( 1 θ ) 1 θ + δ i log ( A i ) 1 θ + δ i ( 1 δ i ) Γ ( 1 θ + δ i ) Γ ( 1 θ ) 1 θ + ( δ i + 1 ) log ( A i ) 1 θ + δ i .
The parameter vector Θ = ( β , σ , λ , θ ) is estimated by maximum likelihood. The estimator is obtained by numerically solving the score equations U ( Θ ) = ( Θ ) / Θ = 0 using a quasi-Newton algorithm. The expressions for the score components U β ( Θ ) and U θ ( Θ ) are provided below; the derivatives with respect to σ and λ follow analogously.
β = i = 1 n δ i h ( y i x i ) h ( y i x i ) β + log β Γ ( 1 θ + δ i ) Γ ( 1 θ ) 1 θ + δ i log ( A i ) 1 θ + δ i ( 1 δ i ) Γ ( 1 θ + δ i ) Γ ( 1 θ ) 1 θ + ( δ i + 1 ) log ( A i ) 1 θ + δ i θ = i = 1 n 1 θ log ( θ 1 ) log ( Γ ( θ ) ) θ + log θ Γ ( 1 θ + δ i ) Γ ( 1 θ ) 1 θ + δ i log ( A i ) 1 θ + δ i ( 1 δ i ) Γ ( 1 θ + δ i ) Γ ( 1 θ ) 1 θ + ( δ i + 1 ) log ( A i ) 1 θ + δ i
The second derivatives are given by
2 β β = i = 1 n δ i h ( y i x i ) 2 h ( y i x i ) β β + 2 log β β Γ ( 1 θ + δ i ) Γ ( 1 θ ) 1 θ + δ i log ( A i ) 1 θ + δ i ( 1 δ i ) Γ ( 1 θ + δ i ) Γ ( 1 θ ) 1 θ + ( δ i + 1 ) log ( A i ) 1 θ + δ i 2 θ 2 = i = 1 n 1 θ 2 2 log ( θ ) 2 log ( Γ ( θ ) ) θ 2 + 2 log θ 2 Γ ( 1 θ + δ i ) Γ ( 1 θ ) 1 θ + δ i log ( A i ) 1 θ + δ i ( 1 δ i ) Γ ( 1 θ + δ i ) Γ ( 1 θ ) 1 θ + ( δ i + 1 ) log ( A i ) 1 θ + δ i
For inference, under standard regularity conditions, the MLE is asymptotically normal,
n Θ ^ Θ d N p + 3 0 , I ( Θ ) 1 ,
where I ( Θ ) denotes the Fisher information matrix, consistently estimated by the observed information I ^ ( Θ ^ ) = 2 ( Θ ) / Θ Θ | Θ = Θ ^ .

Appendix B. Additional Descriptive Statistics for the Childhood Cancer Dataset

Table A1. Exploratory summary by censoring status and sex for the childhood cancer dataset.
Table A1. Exploratory summary by censoring status and sex for the childhood cancer dataset.
TypeGenderMean_TimeMin_TimeMax_TimeMean_AgeSd_AgeMin_AgeMax_Age
CensoredF0.500.500.5012.334.048.0016.00
CensoredM0.500.500.5014.003.469.0017.00
ObservedF31.910.25124.007.255.182.0017.00
ObservedM26.930.25122.009.596.011.0017.00

References

  1. Tobin, J. Estimation of relationships for limited dependent variables. Econometrica 1958, 26, 24–36. [Google Scholar] [CrossRef]
  2. Martínez-Flórez, G.; Tovar-Falón, R.; Elal-Olivero, D. Some new flexible classes of normal distribution for fitting multimodal data. Statistics 2022, 56, 182–205. [Google Scholar] [CrossRef]
  3. Ferrari, S.L.P.; Cribari-Neto, F. Beta regression for modelling rates and proportions. J. Appl. Stat. 2004, 31, 799–815. [Google Scholar] [CrossRef]
  4. Salinas, H.S.; Bakouch, H.; Qarmalah, N.; Martínez-Flórez, G. A flexible class of two-piece normal distribution with a regression illustration to biaxial fatigue data. Mathematics 2023, 11, 1271. [Google Scholar] [CrossRef]
  5. Salinas, H.S.; Martínez-Flórez, G.; Bakouch, H.S.; Alyami, L.; Caimanque, W.E. Modeling Bimodal and Skewed Data: Asymmetric Double Normal Distribution with Applications in Regression. Symmetry 2025, 17, 942. [Google Scholar] [CrossRef]
  6. Mahdavi, A.; da Oliveira Silva, G. A Method to Expand Family of Continuous Distributions Based on Truncated Variable on (0,1). J. Stat. Res. Iran 2016, 13, 231–247. [Google Scholar]
  7. Klein, J.P.; Moeschberger, M.L. Survival Analysis: Techniques for Censored and Truncated Data, 2nd ed.; Springer: New York, NY, USA, 2003. [Google Scholar]
  8. Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Control 1974, 19, 716–723. [Google Scholar] [CrossRef]
  9. Schwarz, G. Estimating the dimension of a model. Ann. Stat. 1978, 6, 461–464. [Google Scholar] [CrossRef]
  10. Bozdogan, H. Model selection and Akaike’s Information Criterion (AIC): The general theory and its analytical extensions. Psychometrika 1987, 52, 345–370. [Google Scholar] [CrossRef]
  11. Hartigan, J.A.; Hartigan, P.M. The Dip Test of Unimodality. Ann. Stat. 1985, 13, 70–84. [Google Scholar] [CrossRef]
  12. Burke, W.J. Fitting and interpreting Cragg’s Tobit alternative using Stata. Stata J. 2009, 9, 584–592. [Google Scholar] [CrossRef]
  13. Moulton, L.H.; Halsey, N.A. A mixture model with detection limits for regression analyses of antibody response to vaccine. J. Am. Stat. Assoc. 1995, 90, 545–555. [Google Scholar]
  14. Vaupel, J.W.; Manton, K.G.; Stallard, E. The impact of heterogeneity in individual frailty on the dynamics of mortality. Demography 1979, 16, 439–454. [Google Scholar] [CrossRef] [PubMed]
  15. Clayton, D.G. A model for association in bivariate life tables and its application in epidemiological studies of familial tendency in chronic disease incidence. Biometrika 1978, 65, 141–151. [Google Scholar] [CrossRef]
  16. Clayton, D.; Cuzick, J. Multivariate generalizations of the proportional hazards model. J. R. Stat. Soc. Ser. A 1985, 148, 82–108. [Google Scholar] [CrossRef]
  17. Liu, K.; Wang, Y.Q.; Zhu, X.; Balakrishnan, N. Generalized Gamma Frailty and Symmetric Normal Random Effects Model for Repeated Time-to-Event Data. Symmetry 2025, 17, 1760. [Google Scholar] [CrossRef]
  18. Hougaard, P. Analysis of Multivariate Survival Data; Springer: New York, NY, USA, 2000. [Google Scholar]
  19. Aalen, O.O.; Borgan, Ø.; Gjessing, H.K. Survival and Event History Analysis: A Process Point of View; Springer: New York, NY, USA, 2008. [Google Scholar]
  20. Wienke, A. Frailty Models in Survival Analysis; Chapman & Hall/CRC: Boca Raton, FL, USA, 2010. [Google Scholar]
  21. Hanagal, D.D. Modeling Survival Data Using Frailty Models, 2nd ed.; Springer Nature: Singapore, 2019. [Google Scholar]
  22. Elal-Olivero, D. Alpha-skew-normal distribution. Proyecciones 2010, 29, 224–240. [Google Scholar] [CrossRef]
  23. Griffiths, W.E.; Hill, R.C.; Judge, G.G. Learning and Practicing Econometrics; John Wiley & Sons: New York, NY, USA, 1993. [Google Scholar]
  24. Cribari-Neto, F.; Zeileis, A. Beta regression in R. J. Stat. Softw. 2010, 34, 1–24. [Google Scholar]
  25. Barros, M.; Galea, M.; González, M.; Leiva, V. Influence diagnostics in the tobit censored response model. Stat. Methods Appl. 2010, 19, 379–397. [Google Scholar] [CrossRef]
  26. Silva, G.O.; Ortega, E.M.M.; Cordeiro, G.M. Residuals for log–Burr XII regression models in survival analysis. J. Appl. Stat. 2011, 38, 1435–1445. [Google Scholar] [CrossRef]
  27. Instituto Nacional de Salud (INS). Sistema Nacional de Vigilancia en Salud Pública (SIVIGILA), Colombia. Portal SIVIGILA 4.0 y Microdatos. 2023. Available online: https://portalsivigila.ins.gov.co/ (accessed on 19 January 2026).
Figure 1. MSE for the LTN model ( λ = 0.8 ) with and without frailty.
Figure 1. MSE for the LTN model ( λ = 0.8 ) with and without frailty.
Mathematics 14 00384 g001
Figure 2. MSE for the LTN model ( λ = 1.6 ) with and without frailty.
Figure 2. MSE for the LTN model ( λ = 1.6 ) with and without frailty.
Mathematics 14 00384 g002
Figure 3. MSE for the LTN model ( λ = 2.0 ) with and without frailty.
Figure 3. MSE for the LTN model ( λ = 2.0 ) with and without frailty.
Mathematics 14 00384 g003
Figure 4. Envelope graphs: (a) beta model, (b) TG model, and (c) TTN model.
Figure 4. Envelope graphs: (a) beta model, (b) TG model, and (c) TTN model.
Mathematics 14 00384 g004
Figure 5. Confidence–envelope QQ plots of r M T i for the fitted models.
Figure 5. Confidence–envelope QQ plots of r M T i for the fitted models.
Mathematics 14 00384 g005
Figure 6. Distribution of observed times and censoring.
Figure 6. Distribution of observed times and censoring.
Mathematics 14 00384 g006
Table 1. Estimates for LTN model ( λ = 0.8 ) with and without frailty, censored and uncensored.
Table 1. Estimates for LTN model ( λ = 0.8 ) with and without frailty, censored and uncensored.
Censoring θ Parametern = 100n = 500n = 1000
Mean | Bias | RMSESEMean | Bias | RMSESEMean | Bias | RMSESE
0.00NF β 0 0.9830.0170.1850.2161.0010.0010.0800.0981.0000.0000.0550.069
β 1 0.5110.0110.2900.3060.5000.0000.1260.1380.4990.0010.0850.098
β 2 −0.4780.0220.2810.306−0.4930.0070.1290.138−0.4980.0020.0860.098
λ 0.8160.0160.2850.3970.8050.0050.1320.1880.7980.0020.0970.125
σ 0.9650.0350.1270.1770.9950.0050.0630.0901.0000.0000.0470.061
0.10 β 0 0.9710.0290.2040.2520.9930.0070.0910.1160.9990.0010.0640.082
β 1 0.5170.0170.3170.3230.5070.0070.1320.1470.5020.0020.0940.104
β 2 −0.4890.0110.3070.325−0.4980.0020.1290.147−0.5040.0040.0940.104
λ 0.8630.0630.3070.3960.8060.0060.1410.1960.8020.0020.0940.129
σ 0.9290.0710.1530.1990.9940.0060.0780.1090.9970.0030.0550.075
θ 0.1220.0220.0950.1450.0970.0030.0450.0650.0990.0010.0380.047
0.25 β 0 1.0030.0030.2250.2771.0020.0020.1030.1241.0020.0020.0680.088
β 1 0.4980.0020.3310.3490.5040.0040.1480.1580.5010.0010.0980.112
β 2 −0.5020.0020.3380.348−0.5000.0000.1410.158−0.4970.0030.0960.112
λ 0.8360.0360.3130.4170.7790.0210.1680.1900.7980.0020.0920.131
σ 0.9530.0470.1570.2201.0010.0010.0840.1101.0020.0020.0580.077
θ 0.2640.0140.1550.1770.2480.0020.0610.0760.2490.0010.0420.054
0.25NF β 0 0.9850.0150.1940.2321.0030.0030.0840.1041.0010.0010.0580.074
β 1 0.5120.0120.3050.3180.4980.0020.1310.1430.4980.0020.0890.102
β 2 −0.4720.0280.3000.333−0.4910.0090.1360.149−0.4990.0010.0920.105
λ 0.8260.0260.3160.4780.7800.0200.1860.2280.7970.0030.1170.158
σ 0.9580.0420.1570.2290.9950.0050.0830.1171.0010.0010.0630.083
0.10 β 0 0.9900.0100.2060.2630.9960.0040.0940.1221.0000.0000.0670.088
β 1 0.5110.0110.3270.3400.5040.0040.1360.1530.5020.0020.0970.109
β 2 −0.4960.0040.3370.364−0.4980.0020.1400.160−0.5020.0020.1020.114
λ 0.9050.1050.3910.6020.7800.0200.2240.3350.7640.0360.1810.260
σ 0.9220.0780.1980.3510.9950.0050.1290.2231.0090.0090.1060.182
θ 0.1240.0240.1080.1960.1000.0000.0610.1030.0970.0030.0500.081
0.25 β 0 1.0300.0300.2340.2921.0130.0130.1090.1351.0150.0150.0780.096
β 1 0.5060.0060.3560.3790.5080.0080.1580.1680.5030.0030.1060.119
β 2 −0.5160.0160.3760.401−0.4980.0020.1600.177−0.4990.0010.1040.125
λ 0.7800.0200.3780.6650.6830.1170.2710.4580.6730.1270.2530.344
σ 1.0300.0300.2070.4161.0730.0730.1860.2991.0810.0810.1750.228
θ 0.2300.0200.1600.2260.2250.0250.0950.1250.2250.0250.0760.092
0.50NF β 0 0.9730.0270.2380.2901.0040.0040.0960.1201.0000.0000.0670.085
β 1 0.5250.0250.3370.3560.5010.0010.1400.1560.5000.0000.0960.110
β 2 −0.4870.0130.3420.368−0.4950.0050.1500.163−0.5000.0000.1020.116
λ 0.8960.0960.3910.5680.7770.0230.2090.2620.7770.0230.1640.182
σ 0.9480.0520.1640.2440.9940.0060.0840.1231.0010.0010.0670.087
0.10 β 0 0.9770.0230.2440.3240.9960.0040.1100.1400.9970.0030.0780.101
β 1 0.5380.0380.3720.3980.5080.0080.1520.1700.5060.0060.1090.121
β 2 −0.4890.0110.3630.408−0.5010.0010.1600.179−0.5020.0020.1140.127
λ 0.9090.1090.4200.8040.7680.0320.2490.4390.7480.0520.2170.357
σ 0.9220.0780.1850.4711.0030.0030.1340.2911.0270.0270.1250.248
θ 0.1170.0170.0940.2470.0970.0030.0620.1250.0910.0090.0540.101
0.25 β 0 1.0040.0040.2790.3551.0060.0060.1280.1571.0140.0140.0920.111
β 1 0.5280.0280.4190.4390.5140.0140.1830.1920.5060.0060.1210.135
β 2 −0.5110.0110.4210.461−0.4970.0030.1840.203−0.4990.0010.1240.143
λ 0.7980.0020.4070.8320.6520.1480.3020.5630.6390.1610.2860.474
σ 1.0390.0390.2070.5341.1140.1140.2180.3791.1180.1180.2050.318
θ 0.2160.0340.1630.2640.1960.0540.1190.1440.2080.0420.0910.114
Table 2. Estimates for LTN model ( λ = 1.6 ) with and without frailty, censored and uncensored.
Table 2. Estimates for LTN model ( λ = 1.6 ) with and without frailty, censored and uncensored.
Censoring θ Parametern = 100n = 500n = 1000
Mean | Bias | RMSESEMean | Bias | RMSESEMean | Bias | RMSESE
0.00NF β 0 0.9900.0100.2150.2250.9990.0010.0900.1011.0010.0010.0620.072
β 1 0.5040.0040.3180.3200.5010.0010.1350.1430.4960.0040.0920.101
β 2 −0.4830.0170.3130.321−0.4900.0100.1390.143−0.4980.0020.0930.101
λ 1.6570.0570.1940.2001.6150.0150.0790.0861.6050.0050.0560.061
σ 0.9720.0280.0870.0870.9930.0070.0360.0390.9980.0020.0270.028
0.10 β 0 0.9360.0640.2530.2660.9850.0150.1040.1201.0030.0030.0720.085
β 1 0.5090.0090.3460.3330.5120.0120.1390.1510.5010.0010.1000.107
β 2 −0.4960.0040.3340.333−0.4970.0030.1420.151−0.5060.0060.1010.107
λ 1.7160.1160.2210.2171.6200.0200.0810.0901.6070.0070.0570.063
σ 0.9250.0750.1190.1160.9880.0120.0470.0520.9960.0040.0340.037
θ 0.1730.0730.1270.1640.1080.0080.0510.0650.1020.0020.0400.045
0.25 β 0 0.9790.0210.2640.2821.0060.0060.1170.1261.0040.0040.0770.089
β 1 0.5150.0150.3630.3570.5020.0020.1590.1600.4990.0010.1040.114
β 2 −0.4940.0060.3640.356−0.5020.0020.1490.161−0.4980.0020.1050.114
λ 1.6960.0960.2210.2201.6140.0140.0870.0921.6050.0050.0570.065
σ 0.9510.0490.1190.1220.9960.0040.0520.0530.9990.0010.0350.038
θ 0.2780.0280.1410.1730.2450.0050.0670.0710.2480.0020.0470.050
0.25NF β 0 0.9790.0210.2430.2570.9980.0020.0990.1121.0010.0010.0690.080
β 1 0.5120.0120.3420.3470.5000.0000.1440.1520.4950.0050.0990.108
β 2 −0.4810.0190.3570.371−0.4850.0150.1570.163−0.4990.0010.1050.115
λ 1.6650.0650.2160.2241.6180.0180.0880.0941.6060.0060.0620.066
σ 0.9740.0260.1110.1150.9920.0080.0470.0500.9970.0030.0340.036
0.10 β 0 0.9530.0470.2580.2700.9840.0160.1100.1261.0010.0010.0780.090
β 1 0.4990.0010.3650.3520.5100.0100.1490.1620.5020.0020.1070.115
β 2 −0.5190.0190.4070.382−0.4960.0040.1610.175−0.5050.0050.1130.125
λ 1.7860.1860.2800.2611.6320.0320.0980.1081.6090.0090.0650.076
σ 0.8900.1100.1610.1650.9790.0210.0680.0790.9950.0050.0470.056
θ 0.1930.0930.1480.1900.1150.0150.0600.0770.1040.0040.0430.053
0.25 β 0 0.9820.0180.2820.3031.0060.0060.1290.1371.0040.0040.0840.096
β 1 0.5300.0300.4070.3930.5050.0050.1780.1770.4990.0010.1140.125
β 2 −0.5170.0170.4600.431−0.5030.0030.1830.194−0.5000.0000.1220.137
λ 1.7370.1370.2730.3101.6130.0130.1070.1141.6030.0030.0670.077
σ 0.9490.0510.1840.1961.0020.0020.0820.0841.0030.0030.0530.058
θ 0.2800.0300.1540.2040.2420.0080.0790.0840.2450.0050.0550.058
0.50NF β 0 0.8090.1910.6772.1320.9960.0040.1840.2240.9990.0010.1260.140
β 1 0.4970.0030.4550.4210.5000.0000.1830.1910.4960.0040.1240.136
β 2 −0.4860.0140.4400.429−0.4900.0100.1880.193−0.4980.0020.1260.138
λ 1.9030.3030.6572.1281.6320.0320.1240.1601.6120.0120.0870.092
σ 0.9540.0460.1460.1460.9870.0130.0600.0640.9960.0040.0440.046
0.10 β 0 0.7720.2280.7655.2740.9650.0350.2280.4050.9900.0100.1440.164
β 1 0.5410.0410.4800.4220.5040.0040.2000.2100.5020.0020.1420.151
β 2 −0.5050.0050.4840.444−0.4980.0020.1950.213−0.5030.0030.1430.152
λ 2.1710.5710.8656.1471.6800.0800.1820.3581.6210.0210.1050.124
σ 0.8290.1710.2450.2320.9620.0380.0940.1090.9940.0060.0690.079
θ 0.2200.1200.1830.2300.1250.0250.0690.0890.1060.0060.0480.062
0.25 β 0 0.7030.2970.8957.1490.9260.0740.3730.8920.9690.0310.2331.863
β 1 0.5280.0280.5720.5110.5030.0030.2440.2400.4970.0030.1620.170
β 2 −0.4970.0030.5610.527−0.5000.0000.2370.244−0.4960.0040.1600.172
λ 2.0240.4241.0137.9801.6920.0920.3380.8481.6370.0370.1932.014
σ 1.0260.0260.3590.3191.0190.0190.1340.1221.0080.0080.0770.081
θ 0.2450.0050.1530.2330.2300.0200.0880.0960.2410.0090.0630.067
Table 3. Estimates for LTN model ( λ = 2.0 ) with and without frailty, censored and uncensored.
Table 3. Estimates for LTN model ( λ = 2.0 ) with and without frailty, censored and uncensored.
Censoring θ Parametern = 100n = 500n = 1000
Mean | Bias | RMSESEMean | Bias | RMSESEMean | Bias | RMSESE
0.00NF β 0 0.9950.0050.1870.2020.9990.0010.0800.0911.0010.0010.0550.064
β 1 0.4980.0020.2770.2870.5010.0010.1200.1280.4960.0040.0820.091
β 2 −0.4870.0130.2790.287−0.4910.0090.1220.128−0.4990.0010.0830.091
λ 2.0620.0620.1900.1982.0160.0160.0790.0862.0060.0060.0560.061
σ 0.9750.0250.0750.0760.9940.0060.0320.0350.9980.0020.0240.025
0.10 β 0 0.9550.0450.2100.2270.9890.0110.0900.1031.0030.0030.0620.073
β 1 0.5110.0110.3050.2960.5090.0090.1240.1340.5000.0000.0890.095
β 2 −0.4950.0050.2980.297−0.4980.0020.1240.134−0.5060.0060.0910.095
λ 2.1130.1130.2190.2122.0200.0200.0810.0902.0070.0070.0570.063
σ 0.9420.0580.1000.0970.9900.0100.0400.0440.9970.0030.0290.031
θ 0.1590.0590.1120.1450.1050.0050.0470.0590.1020.0020.0370.041
0.25 β 0 0.9870.0130.2270.2401.0040.0040.1000.1081.0020.0020.0660.076
β 1 0.5140.0140.3170.3170.5000.0000.1410.1420.5000.0000.0920.100
β 2 −0.4970.0030.3200.315−0.5030.0030.1300.142−0.4990.0010.0930.101
λ 2.0900.0900.2120.2152.0140.0140.0870.0922.0060.0060.0570.065
σ 0.9610.0390.1000.1010.9960.0040.0430.0450.9990.0010.0300.032
θ 0.2710.0210.1330.1560.2470.0030.0610.0650.2490.0010.0430.046
0.25NF β 0 0.9890.0110.2040.2230.9990.0010.0870.0981.0010.0010.0610.070
β 1 0.5020.0020.2920.3050.5010.0010.1270.1350.4960.0040.0870.096
β 2 −0.4850.0150.3110.324−0.4870.0130.1360.142−0.4990.0010.0920.101
λ 2.0700.0700.2060.2152.0190.0190.0860.0932.0060.0060.0610.065
σ 0.9750.0250.0910.0950.9930.0070.0400.0430.9980.0020.0290.030
0.10 β 0 0.9670.0330.2170.2360.9880.0120.0940.1081.0020.0020.0670.077
β 1 0.5040.0040.3060.3130.5090.0090.1280.1420.5000.0000.0940.101
β 2 −0.5100.0100.3450.337−0.4960.0040.1380.152−0.5060.0060.0990.107
λ 2.1440.1440.2530.2492.0270.0270.0950.1042.0080.0080.0640.073
σ 0.9250.0750.1230.1360.9860.0140.0550.0620.9970.0030.0390.044
θ 0.1710.0710.1220.1680.1100.0100.0530.0680.1020.0020.0400.047
0.25 β 0 0.9860.0140.2420.2601.0040.0040.1070.1151.0020.0020.0710.081
β 1 0.5220.0220.3410.3460.5010.0010.1530.1530.5000.0000.0990.108
β 2 −0.5100.0100.3790.372−0.5050.0050.1510.165−0.5000.0000.1050.116
λ 2.1080.1080.2430.2602.0130.0130.1010.1062.0040.0040.0650.075
σ 0.9590.0410.1300.1491.0000.0000.0640.0651.0010.0010.0410.045
θ 0.2760.0260.1410.1800.2440.0060.0700.0740.2470.0030.0490.052
0.50NF β 0 0.6720.3280.8323.5190.9620.0380.2880.4690.9890.0110.1470.184
β 1 0.4900.0100.4160.3910.5000.0000.1710.1780.4970.0030.1160.127
β 2 −0.4940.0060.4260.405−0.4930.0070.1770.181−0.4990.0010.1160.129
λ 2.4690.4690.7993.4542.0710.0710.2220.3862.0200.0200.1010.127
σ 0.9500.0500.1310.1280.9860.0140.0560.0590.9960.0040.0410.042
0.10 β 0 0.7010.2990.8507.2130.8610.1390.5001.6280.9640.0360.2340.520
β 1 0.5200.0200.4390.4110.5050.0050.1820.1930.5010.0010.1270.139
β 2 −0.5010.0010.4410.434−0.4970.0030.1830.198−0.5050.0050.1330.142
λ 2.5980.5980.9097.8082.1900.1900.4511.5862.0490.0490.1910.460
σ 0.8670.1330.2090.2130.9690.0310.0860.0990.9940.0060.0620.071
θ 0.1960.0960.1580.2150.1180.0180.0640.0820.1040.0040.0450.056
0.25 β 0 0.6620.3380.99711.2980.6910.3090.7435.6090.7040.2960.6252.142
β 1 0.5290.0290.5400.4940.5090.0090.2170.2220.4970.0030.1470.154
β 2 −0.5160.0160.5430.520−0.4960.0040.2220.231−0.4990.0010.1500.162
λ 2.5340.5340.97511.3332.3290.3290.7185.5572.2970.2970.5872.085
σ 0.9780.0220.2620.2731.0050.0050.0990.1051.0030.0030.0710.072
θ 0.2480.0020.1500.2270.2360.0140.0820.0890.2460.0040.0600.062
Table 4. MLEs (SEs) for the censored bimodal normal (CBN), censored flexible normal (CFN), and censored two-piece normal (CTN) models. AD reports the test statistic with a p-value in parentheses.
Table 4. MLEs (SEs) for the censored bimodal normal (CBN), censored flexible normal (CFN), and censored two-piece normal (CTN) models. AD reports the test statistic with a p-value in parentheses.
EstimateCBNCFNCTN
μ 3.1850 (0.0409)2.7058 (0.1378)2.5975 (0.3294)
σ 0.8897 (0.0428)1.0792 (0.2114)1.0771 (0.2864)
λ 0.9872 (0.3798)1.2080 (0.4059)
AIC358.92340.96340.78
BIC364.25348.95348.77
CAIC366.25351.95351.77
AD3.1312 (0.0236)1.3881 (0.2056)1.3069 (0.2301)
Table 5. Parameter estimates (standard errors) for the fitted Beta, TG, and TTN regression models.
Table 5. Parameter estimates (standard errors) for the fitted Beta, TG, and TTN regression models.
EstimateBetaTGTTN
β 0 0.6225 ( 0.2238 ) 0.6483 ( 0.2157 ) 1.5765 ( 0.8913 )
β 1 0.0122 ( 0.0030 ) 0.0134 ( 0.0031 ) 0.0944 ( 0.0207 )
β 2 0.1184 (0.0353)0.1399 (0.0357)0.7401 (0.1350)
σ 35.610 (8.080)0.0728 (0.0085)0.0641 (0.0079)
λ 3.7463 (0.4935)
AIC 82.66 83.68 88.33
BIC 76.10 77.13 80.15
CAIC 72.10 73.13 75.15
AD0.6073 (0.6400)0.5882 (0.6581)0.2835 (0.9497)
Table 6. Parameter estimates (standard errors) for CN, CSN, and CTN models.
Table 6. Parameter estimates (standard errors) for CN, CSN, and CTN models.
EstimateCNCSNCTN
β 0 2.8498 (1.2616) 2.5622 (0.8877)2.9253 (0.5736)
β 1 0.1917 (0.1673)0.1707 (0.0770)0.2091 (0.0671)
β 2 0.1692 (0.0287)0.0741 (0.0184)0.0888 (0.0145)
β 3 0.3344 (0.0281) 0.0616 (0.0149) 0.0602 (0.0112)
σ 5.0712 (0.1961)8.4453 (0.3303)1.6531 (0.1041)
λ 17.8845 (2.5303)2.6569 (0.1473)
AIC3119.022799.402502.20
BIC3142.142827.152529.94
CAIC3147.142833.152535.94
Table 7. Maximum likelihood estimates (standard errors) for childhood cancer.
Table 7. Maximum likelihood estimates (standard errors) for childhood cancer.
EstimateLTNLTN–G
β 0 1.99 (0.686)1.13 (0.049)
β 1 0.47 (0.817)0.87 (0.047)
β 2 0.02 (0.057) 0.08 (0.050)
σ 1.71 (0.282)1.92 (0.051)
λ 0.08 (0.831)0.10 (0.053)
θ 0.50 (0.051)
225.70 0.24
AIC 461.40 12.49
BIC 469.45 22.15
CAIC 474.45 28.15
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Martínez-Flórez, G.; Salinas, H.; Ramírez-Montoya, J. Symmetric Double Normal Models for Censored, Bounded, and Survival Data: Theory, Estimation, and Applications. Mathematics 2026, 14, 384. https://doi.org/10.3390/math14020384

AMA Style

Martínez-Flórez G, Salinas H, Ramírez-Montoya J. Symmetric Double Normal Models for Censored, Bounded, and Survival Data: Theory, Estimation, and Applications. Mathematics. 2026; 14(2):384. https://doi.org/10.3390/math14020384

Chicago/Turabian Style

Martínez-Flórez, Guillermo, Hugo Salinas, and Javier Ramírez-Montoya. 2026. "Symmetric Double Normal Models for Censored, Bounded, and Survival Data: Theory, Estimation, and Applications" Mathematics 14, no. 2: 384. https://doi.org/10.3390/math14020384

APA Style

Martínez-Flórez, G., Salinas, H., & Ramírez-Montoya, J. (2026). Symmetric Double Normal Models for Censored, Bounded, and Survival Data: Theory, Estimation, and Applications. Mathematics, 14(2), 384. https://doi.org/10.3390/math14020384

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop