Next Article in Journal
Hp-Adaptive Dynamic Trust Region Sequential Convex Programming: An Enhanced Strategy for Trajectory Optimization
Previous Article in Journal
Dynamic Behavior of Pseudo Almost Periodic Functions to Stochastic Differential Equations
Previous Article in Special Issue
PT-Symmetric Dirac Inverse Spectral Problem with Discontinuity Conditions on the Whole Axis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

On Doubly-Generalized-Transmuted Distributions

by
Barry C. Arnold
1,
Yolanda M. Gómez
2,
Diego I. Gallardo
2,* and
Héctor W. Gómez
3,*
1
Department of Statistics, University of California, Riverside, CA 92504, USA
2
Departamento de Estadística, Facultad de Ciencias, Universidad del Bío-Bío, Concepción 4081112, Chile
3
Departamento de Estadística y Ciencia de Datos, Facultad de Ciencias Básicas, Universidad de Antofagasta, Antofagasta 1240000, Chile
*
Authors to whom correspondence should be addressed.
Symmetry 2025, 17(10), 1606; https://doi.org/10.3390/sym17101606 (registering DOI)
Submission received: 17 August 2025 / Revised: 23 September 2025 / Accepted: 24 September 2025 / Published: 27 September 2025
(This article belongs to the Special Issue Mathematics: Feature Papers 2025)

Abstract

Many parametric models can be enriched by introducing additional parameters through transmutation, mixing, or compounding techniques. In this paper, we develop the framework of doubly generalized transmutation models (DGTMs), obtained by the repeated application of rank transmutation maps and their generalizations. We show that several flexible families already available in the literature can be reinterpreted as instances of double or multiple transmutation, thus unifying apparently disparate constructions under a common perspective. A key feature of DGTMs is their ability to flexibly control symmetry through parameterization, enabling more accurate modeling of asymmetric or heavy-tailed phenomena. We also discuss the potential extension of these models to the bivariate case. In addition, we introduce the gentransmuted R package, Version 1.0, which provides routines for data generation, parameter estimation, and model comparison for generalized transmutation models. Two real data applications illustrate the practical advantages of this approach, highlighting improved model fit relative to classical alternatives. Our results underscore the value of transmutation-based methods as a systematic tool for generating flexible probability distributions and advancing their computational implementation.

1. Introduction

The concept of rank transmutation maps (RTMs) was first introduced by Shaw and Buckley [1,2], who proposed quadratic rank transmutation map (QRTM) as a mechanism to generate new distributions from a baseline model. Specifically, they considered cumulative distribution functions (cdfs) of the form
( 1 + λ ) F 0 ( x ̲ ; θ ̲ ) λ [ F 0 ( x ̲ ; θ ̲ ) ] 2 , λ [ 1 , 1 ] ,
where F 0 ( x ̲ ; θ ̲ ) denotes the baseline cdf to be extended. This can be rewritten as G ( F 0 ( x ̲ ; θ ̲ ) ; λ ) , where G ( u ; λ ) = ( 1 + λ ) u λ u 2 , 0 < u < 1 , is itself a cdf indexed by λ . Subsequent generalizations have considered replacing G with a broader class of parametric distribution functions supported on [ 0 , 1 ] , leading to what we call general transmutation models, and which was duly documented, for example, by Alzaatreh et al. [3].
Based on this, a doubly generalized transmutation model (DGT) is defined as
G 2 ( G 1 ( F 0 ( x ̲ ; θ ̲ ) ; λ ̲ ) ; τ ̲ ) ,
where F 0 ( · ; θ ̲ ) is the baseline family, and G 1 ( · ; λ ̲ ) , G 2 ( · ; τ ̲ ) are parametric families of distribution functions supported on [ 0 , 1 ] . Extensions to triply or multiply generalized transmutation models naturally follow from further compositions. While these constructions can alternatively be viewed simply as compositions of distributions, the terminology of “transmutation” emphasizes their connection with the seminal work of Shaw and Buckley. Note that the cdf in Equation (1) should be extended to the case where G 2 and G 1 have support different from [ 0 , 1 ] , considering a new cdf as
G 2 ( W 1 ( G 1 ( W 0 ( F 0 ( x ̲ ; θ ̲ ) ) ; λ ̲ ) ) ; τ ̲ ) ,
where W 0 ( · ) and W 1 ( · ) are ad hoc functions defined from [ 0 , 1 ] to the corresponding support of G 1 and G 2 , respectively, which would be a generalization of the idea used to extend distributions used in [3]. However, we will consider the specific case described through Equation (1).
As noted by Tahir and Cordeiro [4], related modeling strategies include scale-mixture constructions, compounding schemes based on probability generating functions, and several models that can be interpreted as special cases of doubly generalized transmutation models. Indeed, most of the flexible families cataloged in their survey can be reinterpreted as instances of double or multiple transmutation.
Recent contributions have reinforced the practical value of QRTM-based approaches. For example, Yedlapalli et al. [5] applied QRTM to the semicircular exponential stereographic distribution, introducing the transmuted semicircular distribution and demonstrating its superior adaptability for modeling geological data. Likewise, Shala and Merovci [6] proposed a three-parameter inverse Rayleigh distribution within the generalized transmuted family, establishing its robustness through extensive theoretical results and empirical applications. These works, together with the foundational studies of [1,2], underscore the ongoing importance of transmutation methods as powerful tools for extending classical probability distributions and improving their empirical performance. Other contributions include the transmuted odd Fréchet-G distribution proposed by Badr et al. [7] and the record-based transmuted Rayleigh distribution of order 3, as presented by Merovci [8]. Together, these works highlight the growing importance of transmutation methods as powerful tools for extending classical probability models and enhancing their practical performance. The primary objective of this paper is to demonstrate that certain generalizations already existing in the literature can be viewed as part of this double transmutation, while also providing a computational alternative associated with these models. The possible generalization of double transmutation to the bivariate case will also be addressed tangentially.
The paper proceeds as follows. In Section 2, we demonstrate the doubly generalized transmutation model with some examples and a selection of examples that can be considered as multiple generalized transmuted distributions. In Section 3, we present multiple transmutations and discuss bivariate transmuting models. In Section 4, we present some computational aspects of the gentransmuted package [9], along with a simulation study. In Section 5, we show two applications to real data. Finally, in Section 6, we will highlight how many of the models listed can be recognized as examples of multiply generalized transmuted models.

2. DGT Distributions

Definition 1 (Doubly Generalized Transmuted Distribution).
Let F 0 ( x ; θ ̲ ) denote a baseline family of univariate distributions that we aim to make more flexible. Consider two parametric families of univariate distributions supported on [ 0 , 1 ] , denoted by G 1 ( u ; τ ̲ ) and G 2 ( u ; δ ̲ ) .
The resulting family of doubly generalized transmuted distributions, which will be denoted as G 1 G 2 f model, is defined as
F ( x ; θ ̲ , τ ̲ , δ ̲ ) = G 2 G 1 F 0 ( x ; θ ̲ ) ; τ ̲ ; δ ̲ .
For this extended family to include F 0 ( x ; θ ̲ ) as a special case, it is necessary that both G 1 ( u ; τ ̲ ) and G 2 ( u ; δ ̲ ) include the standard uniform distribution U ( 0 , 1 ) as a member. As mentioned in the introduction, such models could alternatively be referred to as multiply compounded models, omitting any explicit reference to transmutation.
Remark 1.
Several well-known distribution families arise as particular cases of Definition 1 when specific choices of G 1 and G 2 are made. For example, setting G 1 = G SB , defined below, and G 2 ( u ) = u leads to the classical transmuted distributions of Shaw and Buckley [1]. Likewise, choosing G 1 ( u ) = u α or G 2 ( u ) = u β produces exponentiated-type distributions [10]. In addition, using Marshall–Olkin-type functions, i.e., mappings of the form
G ( u ; θ ) = u θ + ( 1 θ ) u , u [ 0 , 1 ] , θ > 0 ,
as originally introduced by Marshall and Olkin [11], generates Marshall–Olkin extended families. These models are widely used because they allow for tractable survival functions and hazard rate modifications. Finally, notice that if both G 1 and G 2 are chosen as the identity function, then the construction trivially reduces to the baseline distribution F 0 ( x ; θ ̲ ) . Thus, the proposed framework provides a unifying structure that encompasses transmuted, exponentiated, and Marshall–Olkin families, along with many other extensions frequently encountered in the literature.
Definition 2 (Transmuting functions).
A transmuting function is a distribution function G ( u ; η ̲ ) supported on [ 0 , 1 ] , depending on a parameter (or vector of parameters) η ̲ , such that the standard uniform distribution U ( 0 , 1 ) is included as a special case.
Five popular choices for the transmuting functions G 1 and G 2 are:
1.
The original transmuting function [1]:
G SB ( u ; λ ) = ( 1 + λ ) u λ u 2 , λ [ 1 , 1 ] .
Note that if λ = 0 , this reduces to the U ( 0 , 1 ) distribution.
2.
The exponentiating (EXP) function:
G EXP ( u ; α ) = u α , α ( 0 , ) .
If α = 1 , this is the U ( 0 , 1 ) distribution; if α = n N , it corresponds to the distribution of the maximum of n i.i.d. U ( 0 , 1 ) variables.
2A.
The exponentiating function of the second kind (EXP2):
G EXP 2 ( u ; α ) = 1 ( 1 u ) α , α ( 0 , ) .
For α = 1 , this yields the U ( 0 , 1 ) distribution; for α = n , it is the distribution of the minimum of n i.i.d. U ( 0 , 1 ) variables.
3.
The Marshall-Olkin function [11]:
G MO ( u ; θ ) = θ u 1 ( 1 θ ) u , θ ( 0 , ) .
When θ = 1 , this simplifies to the U ( 0 , 1 ) distribution.
3A.
The second kind Marshall-Olkin (MO2) function:
G MO 2 ( u ; θ ) = u 1 ( 1 θ ) ( 1 u ) , θ ( 0 , ) .
Again, θ = 1 yields the U ( 0 , 1 ) distribution.
Remark 2.
The MO and MO2 functions, if θ ( 0 , 1 ) , can be interpreted as arising from the maximum and minimum, respectively, of a sample of geometrically distributed size of U ( 0 , 1 ) variables. If the sample size instead follows a zero-truncated Poisson distribution, we obtain two new transmuting functions:
G ZTP ( u ; θ ) = e θ u 1 e θ 1 , G ZTP 2 ( u ; θ ) = 1 e θ u 1 e θ .
Similarly, using logarithmic sample sizes leads to:
G LOG ( u ; θ ) = ln ( 1 θ u ) ln ( 1 θ ) , G LOG 2 ( u ; θ ) = 1 ln ( 1 θ ( 1 u ) ) ln ( 1 θ ) .
4.
The Kumaraswamy function [12]:
G Kw ( u ; α , β ) = 1 ( 1 u α ) β , α > 0 ,   β > 0
If α = β = 1 , this corresponds to U ( 0 , 1 ) .
It can also be expressed as a composition of two exponentiating functions:
G Kw ( u ; α , β ) = G EXP 2 ( G EXP ( u ; α ) ; β ) .
By reversing the order of composition, we define a second-kind Kumaraswamy function:
G Kw 2 ( u ; α , β ) = G EXP ( G EXP 2 ( u ; β ) ; α ) = [ 1 ( 1 u ) β ] α .
5.
Mixtures. Granzotto and Louzada [13] observed that the SB transmutation function can be expressed as a convex combination of the maximum and minimum of a size-two sample from U ( 0 , 1 ) :
G SB ( u ; λ ) = δ G EXP ( u ; 2 ) + ( 1 δ ) G EXP 2 ( u ; 2 ) ,
where δ = ( 1 λ ) / 2 ( 0 , 1 ) .
They also explored convex combinations of multiple U ( 0 , 1 ) order statistics. More recently, Balakrishnan and He [14] proposed convex combinations of record value distributions (both lower and upper records). In general, convex combinations of any set of transmuting functions can be considered to generate more flexible transformation families.
Figure 1 presents the PDF for the exponential (exp) distribution and some extensions from these models based on the EXP and SB models. Note that the PDF of the exponential model is monotone, whereas the extensions provide non-monotonic PDFs in all the cases.
It is often convenient to restrict attention to transmuting functions with simple closed-form expressions for both the function and its first derivative. This is particularly helpful for inference. For this reason, the Kumaraswamy function may be preferred over the Beta distribution, which lacks a simple closed-form for its CDF.
For simulation purposes, it is especially useful if both the transmuting functions and the baseline distribution have analytical inverse functions.
Note that the MO function with θ ( 0 , 1 ) is the probability generating function (PGF) of a geometric random variable with parameter θ (taking values in { 1 , 2 , } ).
Similarly, the PGF of any positive integer-valued distribution may be used as a transmuting function. However, acceptable transmuting functions are not limited to PGFs with non-negative power series coefficients.
Another useful class of PGFs arises from the negative binomial distribution:
G NB ( u ; α , p ) = p u 1 ( 1 p ) u α ,
where α ( 0 , ) and p ( 0 , 1 ] . This function is a composition of the EXP and MO functions. Hence, the parameter p can be extended to ( 0 , ) .

Examples of Multiple Generalized Transmuted Distributions

The generalized transmutation approach, in which a family of generalized transmutation functions is applied to a parametric baseline family of distributions, yields an enriched extended family. Provided that the transmutation family includes the uniform distribution, this guarantees that the extended family will fit the data at least as well as the baseline, i.e., a non-worse maximized likelihood on the same data, but not necessarily a better predictive performance. However, the question of whether the improvement in fit is statistically significant must be carefully assessed.
Repeated applications of transmutation functions can produce increasingly flexible models, consistently enhancing the fit to the data. Naturally, this process should not be carried to excess. Nevertheless, rather complex models constructed from multiple transmutation functions have already been proposed in the literature. We cite below a selection from the extensive list compiled by Tahir and Cordeiro [4]:
  • Transmuted Complementary Exponentiated Weibull-Geometric (TCEWG) model
  • Transmuted Complementary Weibull-Geometric (TCWG) model
  • Exponentiated Transmuted Weibull-Geometric (ETWG) model
  • Exponentiated Geometric G-Poisson (EGGP) model
  • Complementary Extended Weibull Power Series (CEWPS) models
  • Complementary Exponentiated Weibull-Logarithmic (CEWLn) model
  • Complementary Exponentiated Power Lindley-Poisson (CEPLP) model
  • Poisson Generalized Linear Failure Rate (PGLFR) model

3. Multiple Transmutation

Several transmutation operations can be applied to a given baseline family of distributions to obtain ever more flexible models with more and more parameters. Alternatively, one can observe that compositions of two or more transmutation functions are again (albeit more complicated) transmutation functions. For example, we might consider the composition.
G * ( u ; λ , α , θ ) = G SB ( G EXP ( G MO ( u ; θ ) ; α ) ; λ )
= ( 1 + λ ) θ u 1 ( 1 θ ) u α λ θ u 1 ( 1 θ ) u 2 α .
The three transmutation functions used here can be composed in 3 ! = 6 different orders, leading to six distinct models. Moreover, the composition of more than three transmutation functions could be considered. Clear guidelines on how to select from the myriad of available models are lacking at this time.

Transmuting Bivariate Models

Given the success in generating flexible univariate models documented above, it is natural to consider the possibility of using the transmuting strategy to generate flexible bivariate (and perhaps multivariate) models.
To this end, consider a baseline family of bivariate distributions F 0 ( x , y ; θ ̲ ) that we wish to make more flexible. Let G ( u ; τ ̲ ) be a parametric family of univariate distributions with support [ 0 , 1 ] , i.e., a family of transmutation functions.
In one dimension, it is evident that compounding a transmutation function with any distribution function leads to a valid distribution function. Can the same thing be said in higher dimensions? Leaving aside parameters for the moment, we ask whether, when F ( x , y ) is a valid bivariate distribution function and if G ( u ) is a transmutation function, it is necessarily true that G ( F ( x , y ) ) is a valid bivariate distribution function? Or must some conditions be satisfied for this to happen?
Since F is a valid bivariate distribution function, we know that for any four real numbers x 1 , x 2 , y 1 , y 2 satisfying < x 1 < x 2 < and < y 1 < y 2 < , we have
F ( x 2 , y 2 ) F ( x 1 , y 2 ) F ( x 2 , y 1 ) + F ( x 1 , y 1 ) 0 .
Is it then true that for such x 1 , x 2 , y 1 , y 2 we also have
G ( F ( x 2 , y 2 ) ) G ( F ( x 1 , y 2 ) ) G ( F ( x 2 , y 1 ) ) + G ( F ( x 1 , y 1 ) ) 0 ?
A little investigation will confirm that this will not always be true. For example, suppose that our joint distribution F is such that, for four numbers x 1 * , x 2 * , y 1 * , y 2 * satisfying < x 1 * < x 2 * < and < y 1 * < y 2 * < we have
F ( x 2 * , y 2 * ) = 0.45 , F ( x 2 * , y 1 * ) = 0.3 , F ( x 1 * , y 2 * ) = 0.3 , F ( x 1 * , y 1 * ) = 0.16 ,
so that
F ( x 2 * , y 2 * ) F ( x 2 * , y 1 * ) F ( x 1 * , y 2 * ) + F ( x 1 * , y 1 * ) = 0.45 0.3 0.3 + 0.16 = 0.01 > 0 .
Now consider the particular transmutation function G ( u ) = u . In this case, we will have
G ( F ( x 2 * , y 2 * ) ) G ( F ( x 2 * , y 1 * ) ) G ( F ( x 1 * , y 2 * ) ) + G ( F ( x 1 * , y 1 * ) ) = 0.45 0.3 0.3 + 0.16 0.6708 0.5477 0.5477 + 0.4 = 0.0246 < 0 .
So, in this case, G ( F ( x , y ) ) is not a valid joint distribution function.
A positive result is obtainable if we assume that F is absolutely continuous and that the transmuting function G is a convex distribution function as noted in the following theorem.
Theorem 1.
If the joint distribution function F ( x , y ) is absolutely continuous and if the transmuting function G is a convex distribution function with support (0, 1), then G ( F ( x , y ) ) is a well-defined bivariate distribution function.
Proof. 
Since we assume absolute continuity of F ( x , y ) its density, denoted by f ( x , y ) , can be obtained as
f ( x , y ) = x y F ( x , y ) 0 .
The density of the putative transmuted distribution G ( F ( x , y ) ) denoted by f G ( F ) ( x , y ) will be of the form
f G ( F ) ( x , y ) = x y G ( F ( x , y ) ) = x G ( F ( x , y ) ) y F ( x , y ) = G ( F ( x , y ) ) x F ( x , y ) y F ( x , y ) + G ( F ( x , y ) ) x y F ( x , y ) .
It is this expression which must be shown to be non-negative. However, we know that x y F ( x , y ) is non-negative, as are x F ( x , y ) and y F ( x , y ) , so that in addition to requiring that G ( u ) be non-negative (which we know is true), we must also require that G ( u ) be non-negative, i.e., that G is convex, which was assumed in the hypothesis of the theorem. Using the fact that G is a valid distribution function with support ( 0 , 1 ) , we can conclude that G ( F ( x , y ) has a non-negative density and has total variation equal to 1, i.e., that it is a valid bivariate distribution function.    □
So the conclusion is that there will be no problem as long as G ( u ) is convex. We note that our earlier example, in which a problem was encountered, dealt with a choice for G of the form u which was concave rather than convex.
Note that if G ( u ) is the probability generating function of a non-negative integer-valued random variable, then it will be convex. Thus, for example, MO functions can be used in the bivariate case provided we restrict the parameter θ to the interval ( 0 , 1 ] .
Indeed, there will be no problems in higher dimensions, provided that G is a generating function. This is because we may think of a sequence of independent identically distributed k-dimensional random variables with common distribution function F ( x ̲ ) and an independent integer-valued random variable N with generating function G. Suppose we define Y ̲ = max { X ̲ ( 1 ) , X ̲ ( 2 ) , , X ̲ ( N ) } , then it can be readily verified that the distribution function of Y ̲ is given by G ( F ( x ̲ ) , which is then necessarily a valid k-dimensional distribution function. The required conditions when G is not assumed to be a generating function are more complex.
In general, we can consider parametric families of multivariate distributions that are subjected to parametric families of double transmutations, and these will be guaranteed to be valid models provided that all the transmutation functions involved are generating functions of non-negative integer-valued random variables. Such models will be of the form.
G 2 ( G 1 ( F ( x ̲ ; θ ̲ ) ; τ ̲ ) ; λ ̲ ) ,
where G 1 and G 2 are parametric families of generating functions, and F is a parametric family of k-dimensional distributions.
Although it is conceptually possible to consider triple transmutation (or indeed, k-fold transmutation) applied to multivariate distributions, it is not envisioned that such models would be much used in practice.

4. Computational Implementation and Simulation Study

In this section, we present the computational implementation of the models, as well as a brief simulation study to assess the performance of the maximum likelihood (ML) estimators in finite samples.

4.1. Computational Aspects

The gentransmuted package [9] of R [15] includes the computational implementation for the GDT class of models. To fit a given model in this class using the gentransmuted package, the following commands can be executed
1.
Install and load the necessary packages:
Symmetry 17 01606 i001
2.
Load your data into R (say y).
3.
To estimate a specific combination of distribution (dist) and G 1 and G 2 (comp1 and comp2, respectively), which are optional. For example, for the exponential model as baseline distribution with the G 1 and G 2 equal to EXP and MO models, we use the following sentence
Symmetry 17 01606 i002
The package also includes the choose.compound function, which performs all the combinations among dist, comp1, and comp2 and presents the results in a descendant order based on the Akaike Information Criterion (AIC, [16]) or the Bayesian Information Criterion (BIC, [17]). Depending on the nature of the data, the following options are available.
  • Positive data: exponential, gamma, log-normal, BS, and Pareto II.
  • Unit data: beta and Kumaraswamy.
  • Real data: normal, Cauchy, logistic and Gumbel.
For comp1 and comp2, all the combinations among the EXP, EXP2, MO, MO2, and SB models are considered, in addition to the case where they are specified as NULL. For instance, if the data are positive and the AIC is used as a selection criterion, the function is used as follows
  • Symmetry 17 01606 i003
In addition, functions to compute the PDF, CDF, quantile function, and drawn values are available as dcompound, pcompound, qcompound, and rcompound, respectively, specifying the arguments dist, comp1, comp2, and the corresponding parameters.

4.2. Simulation Study

In this subsection, we present a brief simulation study to assess the performance of the maximum likelihood (ML) estimators obtained with the gentransmuted package. We consider two baseline distributions f: the exponential distribution with parameter 1.5 and the Lomax distribution with parameters 1.5 and 2; two models for G 1 (EXP and MO); two values for the parameter θ (0.75 and 1.2); and sample sizes n = 100 , 200 , 500 , 1000 .
For each combination of f, G 1 , θ , and sample size, we generate 1000 replicates using the rcompound function and estimate parameters with the estimate.compound function. Table 1 summarizes the results based on these replicates: the mean bias (Bias), the mean of the standard errors (SE), the root of the mean squared error (RMSE), and the 95% coverage probability (CP) based on the asymptotic distribution of the ML estimators.
The results suggest that the bias of the estimators is acceptable in most cases considered and decreases as the sample size increases. Additionally, the SE and RMSE values are close, indicating that the variance of the estimators is well estimated. Coverage probabilities are close to nominal levels, especially for n = 1000 . The exception is for the combination of Lomax and MO for f and G 1 , where the estimators need a larger sample size to obtain desirable properties. This is probably because the Lomax distribution is a heavy-tailed distribution, which is inherited by the respective composite model.

5. Applications of a Multiple Transmutation with Real Data

Let X be a continuous random variable with pdf f X ( x ; θ ̲ ) and cdf F X ( x ; θ ̲ ) . The objective is to extend the pdf f X ( x ; θ ̲ ) using multiple transmutation, as follows.

5.1. First Transmutation

G 1 ( x ; θ ̲ , θ , λ ) = G S B G E X P F X ( x ; θ ̲ ) ; θ ; λ = ( 1 + λ ) [ F X ( x ; θ ̲ ) ] θ λ [ F X ( x ; θ ̲ ) ] 2 θ ,
where the first stage of transmutation is denoted as:
H 1 ( x ; θ ̲ , θ ) = G E X P ( F X ( x ; θ ̲ ) ; θ ) = [ F X ( x ; θ ̲ ) ] θ .
The corresponding PDFs of the singly and doubly transmuted distributions are:
h 1 ( x ; θ ̲ , θ ) = θ [ F X ( x ; θ ̲ ) ] θ 1 f X ( x ; θ ̲ ) , g 1 ( x ; θ ̲ , θ , λ ) = ( 1 + λ 2 λ H 1 ( x ; θ ̲ , θ ) ) h 1 ( x ; θ ̲ , θ ) ,
where θ > 0 and 1 λ 1 are shape parameters.

Log-Likelihood Function

Given a random sample X 1 , , X n from g 1 , the log-likelihood function is:
l ( θ ̲ , θ , λ ) = i = 1 n log 1 + λ 2 λ [ F X ( x i ; θ ̲ ) ] θ + i = 1 n log h 1 ( x i ; θ ̲ , θ ) ,
and can be maximized using the gentransmuted package [9].

5.2. Second Transmutation

G 2 ( x ; θ ̲ , λ , θ ) = G E X P G S B F X ( x ; θ ̲ ) ; λ ; θ = ( 1 + λ ) F X ( x ; θ ̲ ) λ [ F X ( x ; θ ̲ ) ] 2 θ ,
with first-stage transmutation:
H 2 ( x ; θ ̲ , λ ) = G S B ( F X ( x ; θ ̲ ) ; λ ) = ( 1 + λ ) F X ( x ; θ ̲ ) λ [ F X ( x ; θ ̲ ) ] 2 .
The corresponding PDFs are:
h 2 ( x ; θ ̲ , λ ) = [ 1 + λ 2 λ F X ( x ; θ ̲ ) ] f X ( x ; θ ̲ ) , g 2 ( x ; θ ̲ , λ , θ ) = θ [ H 2 ( x ; θ ̲ , λ ) ] θ 1 h 2 ( x ; θ ̲ , λ ) ,
with θ > 0 and 1 λ 1 .

Log-Likelihood Function

For a sample X 1 , , X n from g 2 , the log-likelihood is:
l ( θ ̲ , λ , θ ) = n log ( θ ) + ( θ 1 ) i = 1 n log ( 1 + λ ) F X ( x i ; θ ̲ ) λ [ F X ( x i ; θ ̲ ) ] 2 + i = 1 n log ( h 2 ( x i ; θ ̲ , λ ) ) .

5.3. Data I

For this analysis, we use the fatigue life dataset of Kevlar 373/epoxy specimens subjected to 90% of the ultimate stress until failure. The data, originally reported by Andrews et al. [18] and Barlow et al. [19], contain exact failure times: 0.0251, 0.0886, 0.0891, 0.2501, 0.3113, 0.3451, 0.4763, 0.5650, 0.5671, 0.6566, 0.6748, 0.6751, 0.6753, 0.7696, 0.8375, 0.8391, 0.8425, 0.8645, 0.8851, 0.9113, 0.9120, 0.9836, 1.0483, 1.0596, 1.0773, 1.1733, 1.2570, 1.2766, 1.2985, 1.3211, 1.3503, 1.3551, 1.4595, 1.4880, 1.5728, 1.5733, 1.7083, 1.7263, 1.7460, 1.7630, 1.7746, 1.8275, 1.8375, 1.8503, 1.8808, 1.8878, 1.8881, 1.9316, 1.9558, 2.0048, 2.0408, 2.0903, 2.1093, 2.1330, 2.2100, 2.2460, 2.2878, 2.3203, 2.3470, 2.3513, 2.4951, 2.5260, 2.9911, 3.0256, 3.2678, 3.4045, 3.4846, 3.7433, 3.7455, 3.9143, 4.8073, 5.4005, 5.4435, 5.5295, 6.5541, 9.0960.
Table 2 presents descriptive statistics of the dataset, including the sample coefficient of skewness (CS) and coefficient of kurtosis (CK). The choose.compound function is used to select an appropriate combination of distribution, G 1 , and G 2 for modeling these data.

Estimation and Model Selection

We illustrate the use of multiple transmutations with a real data set. We use the choose.compound function, as explained in the previous section, to choose a model for positive data. The output for the first six models is as follows:
  • Symmetry 17 01606 i004
Table 3 presents the results for the selected model. Figure 2 displays the Q-Q plots for the selected model. From them, we calculated the quantile residuals (QRs), which—if the model is appropriate for the data—should behave like a sample from the standard normal distribution (see [20]). This can be validated using traditional tests for normality such as the Anderson–Darling (AD), Cramér–von Mises (CVM), and Shapiro–Wilk (SW) tests.
In Figure 2, the p-values for the normality tests (AD, CVM, and SW) of the QRs for the two best models are shown below the corresponding Q-Q plots. In the g 1 and h 2 models, the QRs appear to behave like standard normal variates. This supports the earlier claim that model g 1 provides the best fit to the data.

5.4. Data II

In this subsection, we illustrate the use of multiple transmutations with another real data set. For this application, we use the Pareto II distribution as the baseline distribution (see Arnold, [21]). A random variable X has a Pareto II distribution if its cumulative distribution function (CDF) is given by
F X ( x ; θ ̲ ) = 1 1 + x α β , x > 0 ,
and its corresponding probability density function (PDF) is
f X ( x ; θ ̲ ) = β α β ( x + α ) β + 1 , x > 0 ,
where θ ̲ = ( α , β ) , α > 0 is a scale parameter, and β > 0 is a shape (or inequality) parameter. We denote this as X Pareto II ( α , β ) .
This dataset originates from the Survey of Consumer Finances (SCF), a nationally representative sample that provides extensive information on the assets, liabilities, income, and demographic characteristics of U.S. households. It includes a random sample of 500 households with positive incomes interviewed in the 2004 survey. The variable of interest is the annual family income (in thousands of U.S. dollars), divided by the number of household members. The data can be accessed from the Federal Reserve’s webpage: https://www.federalreserve.gov/econres/scfindex.htm (accessed on 31 August 2025).
The descriptive statistics corresponding to this dataset are shown in Table 4. The very high kurtosis coefficient indicates that the dataset has a heavy right tail, motivating the use of the Pareto II distribution as the baseline model.

Estimation and Model Selection

Codes to replicate this example using the gentransmuted package are provided in the Appendix A. In Table 5, the maximum likelihood estimates of the parameters of the five competing models are displayed. To compare the models—Pareto II, h 1 , h 2 , g 1 , and g 2 —we use the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC). Table 5 also includes the AIC and BIC values for each model.
In particular, the probability density function (pdf) of h 1 is
h 1 ( x ; α , β , θ ) = θ 1 1 + x α β θ 1 β α β ( x + α ) β + 1 , x > 0 ,
the pdf of g 1 is
g 1 ( x ; α , β , θ , λ ) = 1 + λ 2 λ 1 1 + x α β θ h 1 ( x ; α , β , θ ) , x > 0 ,
the pdf of h 2 is
h 2 ( x ; α , β , λ ) = 1 + λ 2 λ 1 1 + x α β β α β ( x + α ) β + 1 , x > 0 ,
and the pdf of g 2 is
g 2 ( x ; α , β , λ , θ ) = θ ( 1 + λ ) 1 1 + x α β λ 1 1 + x α β 2 θ 1 h 2 ( x ; α , β , λ ) , x > 0 ,
where α > 0 is a scale parameter, and β > 0 , 1 λ 1 , and θ > 0 are shape parameters.
We observe from the AIC and BIC values in Table 5 that each application of generalized transmutation yields an improvement in model fit. Model g 1 demonstrates the best performance among the five models. Finally, Figure 3 presents the Q–Q plot for the QRs, suggesting that the g 1 and g 2 models are appropriated for this data set.

6. Final Remark

The results presented in this work highlight the advantages of using generalized transmutation techniques to construct flexible distribution families. As demonstrated in Section 5, each level of transmutation—whether through exponentiation, convex combinations, or the incorporation of generator functions—tends to enhance the goodness-of-fit to real data. This is evidenced by model g 1 outperforming the others according to AIC, BIC, and residual analysis.
Future work should focus on a more detailed study of specific members within the proposed family that may possess closed-form expressions for key distributional characteristics such as the mean, quantiles, or mode. Additionally, it would be valuable to extend the regression framework to cover these special cases, enabling their application in modeling conditional relationships in practical contexts.

Author Contributions

Conceptualization, B.C.A. and H.W.G.; methodology, B.C.A. and H.W.G.; software, Y.M.G. and D.I.G.; validation, Y.M.G. and D.I.G.; formal analysis, Y.M.G. and H.W.G.; investigation, B.C.A. and D.I.G.; data curation, Y.M.G. and D.I.G.; writing—original draft preparation, B.C.A.; writing—review and editing, Y.M.G., D.I.G. and H.W.G.; visualization, D.I.G. and H.W.G.; supervision, B.C.A. and H.W.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

In this section, we include the codes in R related to the simulation study and the second application.

Appendix A.1. Codes for the Simulation Study

install.packages(‘‘gentransmuted’’)
library(gentransmuted)
set.seed(2100); dist=‘‘exp’’; comp=‘‘EXP’’; n=100; para=1.5; theta1=0.75; replicas=1000
m=1;resultados=c(); real=c(para,theta1)
while(m<=replicas){
y=rcompound(n, dist=dist, comp1=comp1, comp2=comp2, gamma=para[1], theta1=theta1)
aux=estimate.compound(y, dist=dist, comp1=comp1, comp2=comp2,est.var=TRUE)
resultados=rbind(resultados, c(aux$coefficients[,1],aux$coefficients[,2])); m=m+1}
bias.aux=resultados[,1:2]-matrix(real, ncol=length(real), nrow=replicas, byrow=T)
bias=apply(bias.aux, 2, mean); SE=apply(base[,1:2+length(real)],2,mean)
RMSE=sqrt(apply(bias.aux^2,2,mean))
CP.aux=ifelse(bias.aux-1.96*resultados[,sele+length(real)]<0&bias.aux+
1.96*resultados[,sele+length(real)]>0,1,0)
CP=apply(CP.aux,2,mean); c(bias, SE, RMSE, CP)

Appendix A.2. Codes for the First Application

y<-c(0.0251, 0.0886, 0.0891, 0.2501, 0.3113, 0.3451, 0.4763, 0.5650, 0.5671, 0.6566, 0.6748,
0.6751, 0.6753, 0.7696, 0.8375, 0.8391, 0.8425, 0.8645, 0.8851, 0.9113, 0.9120, 0.9836,
1.0483, 1.0596, 1.0773, 1.1733, 1.2570, 1.2766, 1.2985, 1.3211, 1.3503, 1.3551, 1.4595,
1.4880, 1.5728, 1.5733, 1.7083, 1.7263, 1.7460, 1.7630, 1.7746, 1.8275, 1.8375, 1.8503,
1.8808, 1.8878, 1.8881, 1.9316, 1.9558, 2.0048, 2.0408, 2.0903, 2.1093, 2.1330, 2.2100,
2.2460, 2.2878, 2.3203, 2.3470, 2.3513, 2.4951, 2.5260, 2.9911, 3.0256, 3.2678, 3.4045,
3.4846, 3.7433, 3.7455, 3.9143, 4.8073, 5.4005, 5.4435, 5.5295, 6.5541, 9.0960)
choose.compound(y)

Appendix A.3. Codes for the Second Application

y=as.vector(read.table(‘‘Datos2.txt’’,h=F)$V1)
aux1=estimate.compound(y, dist=‘‘paretoII’’)
aux2=estimate.compound(y, dist=‘‘paretoII’’, comp1=‘‘EXP’’)
aux3=estimate.compound(y, dist=‘‘paretoII’’, comp1=‘‘EXP’’, comp2=‘‘SB’’)
aux4=estimate.compound(y, dist=‘‘paretoII’’, comp1=‘‘SB’’)
aux5=estimate.compound(y, dist=‘‘paretoII’’, comp1=‘‘SB’’, comp2=‘‘EXP’’)
aux1;aux2;aux3;aux4;aux5
c(aux1$AIC, aux2$AIC, aux3$AIC, aux4$AIC, aux5$AIC)
c(aux1$BIC, aux2$BIC, aux3$BIC, aux4$BIC, aux5$BIC)

References

  1. Shaw, W.T.; Buckley, I.R.C. The Alchemy of Probability Distributions: Beyond Gram-Charlier & Cornish-Fisher Expansions, and Skew-Normal or Kurtotic-Normal Distributions; Research Report; Wolfgram Memorial Library: Chester, PA, USA, 2007. [Google Scholar]
  2. Shaw, W.T.; Buckley, I.R.C. The alchemy of probability distributions: Beyond Gram-Charlier expansions, and a skew-kurtotic-normal distribution from a rank transmutation map. arXiv 2009, arXiv:0901.0434. [Google Scholar]
  3. Alzaatreh, A.; Lee, C.; Famoye, F. A new method for generating families of continuous distributions. Metron 2003, 71, 63–79. [Google Scholar] [CrossRef]
  4. Tahir, M.H.; Cordeiro, G.M. Compounding of distributions: A survey and new generalized classes. J. Stat. Distrib. Appl. 2016, 3, 13. [Google Scholar] [CrossRef]
  5. Yedlapalli, P.; Kishore, G.N.V.; Boulila, W.; Koubaa, A.; Mlaiki, N. Toward enhanced geological analysis: A novel approach based on transmuted semicircular distribution. Symmetry 2023, 15, 2030. [Google Scholar] [CrossRef]
  6. Shala, M.; Merovci, F. A New Three-Parameter Inverse Rayleigh Distribution: Simulation and Application to Real Data. Symmetry 2024, 16, 634. [Google Scholar] [CrossRef]
  7. El-Morshedy, M.; Alshammari, F.S.; Tyagi, A.; Elbatal, I.; Hamed, Y.S.; Eliwa, M.S. Bayesian and Frequentist Inferences on a Type I Half-Logistic Odd Fréchet Class of Distributions. Entropy 2020, 23, 446. [Google Scholar] [CrossRef]
  8. Merovci, F. A Three-Parameter Record-Based Transmuted Rayleigh Distribution (Order 3): Theory and Real-Data Applications. Symmetry 2025, 17, 1034. [Google Scholar] [CrossRef]
  9. Gómez, Y.M.; Gómez, H.W.; Arnold, B.C.; Gallardo, D.I. Gentransmuted: Estimation and Other Tools for Generalized Transmuted Models. R Package Version 1.0. 2025. Available online: https://CRAN.R-project.org/package=gentransmuted (accessed on 31 August 2025).
  10. Gupta, R.C.; Nadarajah, S. Handbook of Exponential and Related Distributions for Engineers and Scientists; Chapman and Hall: Boca Raton, FL, USA; CRC: Boca Raton, FL, USA, 2004. [Google Scholar]
  11. Marshall, A.W.; Olkin, I. A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika 1997, 84, 641–652. [Google Scholar] [CrossRef]
  12. Kumaraswamy, P. Generalized probability density function for double-bounded random processes. J. Hydrol. 1980, 462, 79–88. [Google Scholar] [CrossRef]
  13. Granzotto, D.C.T.; Louzada, F. The transmuted log-logistic distribution: Modeling, inference, and an application to a polled Tabapua race time up to first calving data. Comm. Statist. Theory Methods 2015, 44, 3387–3402. [Google Scholar] [CrossRef]
  14. Balakrishnan, N.; He, M. A record-based transmuted family of distributions. In Advances in Statistics—Theory and Applications; Emerging Topics in Statistics and Biostatistics; Springer: Cham, Switzerland, 2021; pp. 3–24. [Google Scholar]
  15. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2025; Available online: https://www.R-project.org/ (accessed on 31 August 2025).
  16. Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Control 1974, 19, 716–723. [Google Scholar] [CrossRef]
  17. Schwarz, G. Estimating the dimension of a model. Ann. Statist. 1978, 6, 461–464. [Google Scholar] [CrossRef]
  18. Andrews, D.F.; Herzberg, A.M. Data: A Collection of Problems from Many Fields for the Student and Research Worker; Springer Series in Statistics; Springer: New York, NY, USA, 1985. [Google Scholar]
  19. Barlow, R.E.; Tol, R.H.; Freeman, T. A Bayesian Analysis of Stress-Rupture Life of Kevlar 49/Epoxy Spherical Pressure Vessels; Marcel Dekker: New York, NY, USA, 1984. [Google Scholar]
  20. Dunn, P.K.; Smyth, G.K. Randomized Quantile Residuals. J. Comput. Graph. Stat. 1996, 5, 236–244. [Google Scholar] [CrossRef]
  21. Arnold, B.C. Pareto Distributions, 2nd ed.; Monographs on Statistics & Applied Probability; Chapman and Hall: Boca Raton, FL, USA; CRC: Boca Raton, FL, USA, 2015. [Google Scholar]
Figure 1. Plot of PDFs for different extensions of the exponential distribution: (a) exp ( λ ) ; (b) EXP-exp ( λ , θ 1 ) ; (c) SB-exp ( λ , θ 1 ) ; (d) EXP-SB-exp ( λ , θ 1 , θ 2 ) and (e) SB-EXP-exp ( λ , θ 1 , θ 2 ) .
Figure 1. Plot of PDFs for different extensions of the exponential distribution: (a) exp ( λ ) ; (b) EXP-exp ( λ , θ 1 ) ; (c) SB-exp ( λ , θ 1 ) ; (d) EXP-SB-exp ( λ , θ 1 , θ 2 ) and (e) SB-EXP-exp ( λ , θ 1 , θ 2 ) .
Symmetry 17 01606 g001
Figure 2. Q-Q plot for the exponential-MO model.
Figure 2. Q-Q plot for the exponential-MO model.
Symmetry 17 01606 g002
Figure 3. Q–Q plots for models g 1 (left panel) and g 2 (right panel). p-values of normality tests (AD, CVM, WS) for the QRs are indicated below each plot.
Figure 3. Q–Q plots for models g 1 (left panel) and g 2 (right panel). p-values of normality tests (AD, CVM, WS) for the QRs are indicated below each plot.
Symmetry 17 01606 g003
Table 1. Estimated bias, mean standard error (SE), root mean squared error (RMSE), and 95% coverage probability (CP) for different combinations of baseline distribution f (exponential and Lomax), transmutation function G (EXP and MO), parameter θ (0.75 and 1.2), and sample sizes ( n = 100 , 200 , 500 , 1000 ).
Table 1. Estimated bias, mean standard error (SE), root mean squared error (RMSE), and 95% coverage probability (CP) for different combinations of baseline distribution f (exponential and Lomax), transmutation function G (EXP and MO), parameter θ (0.75 and 1.2), and sample sizes ( n = 100 , 200 , 500 , 1000 ).
n = 100 n = 200 n = 500 n = 1000
f G 1 θ Parameter Bias SE RMSE CP Bias SE RMSE CP Bias SE RMSE CP Bias SE RMSE CP
expEXP0.75 γ 0.02570.21570.21420.96600.02030.15200.15680.94800.00940.09580.09500.94700.00010.06730.06710.9490
θ 0.01200.09550.10250.94800.01170.06750.06860.95900.00180.04200.04160.94600.00060.02960.02920.9560
1.2 γ 0.02950.18950.20490.94600.01800.13360.14070.93900.00650.08400.08780.93300.00450.05940.06230.9460
θ 0.03950.16910.18180.95700.01690.11680.12540.93300.00790.07320.07600.94500.00180.05140.05480.9290
MO0.75 γ 0.04790.27730.28000.96400.03130.19490.19370.95500.01210.12250.12510.94900.00690.08660.08840.9390
θ 0.01900.26360.26260.92900.00130.18110.18420.9270−0.00030.11410.11950.9390−0.00110.08050.08020.9380
1.2 γ 0.07870.32450.33790.95100.02430.22620.22070.95300.00650.14280.14010.95800.01440.10140.10380.9380
θ 0.00210.42910.43730.90400.00930.30340.29740.94100.00980.19120.19050.9530−0.00500.13330.13550.9320
lomaxEXP0.75 γ 0.64231.67292.64140.90400.26810.83871.23630.92700.06520.42710.43580.94600.04070.29300.30090.9500
β 0.34210.83861.36330.95400.14580.43100.66900.94300.03370.22280.22630.95600.02660.15430.15820.9590
θ 0.03350.14680.15680.95600.01510.09810.10080.96100.00670.06030.06170.95700.00290.04200.04210.9530
1.2 γ 0.53031.56242.37360.91900.21770.75830.92060.92400.06410.41790.42920.94300.04840.28940.29540.9580
β 0.22780.65380.86370.94700.10340.34020.39890.95100.03510.19160.19670.95900.01970.13220.13540.9540
θ 0.08300.33480.43780.94600.04750.20600.23150.95800.02030.12260.12570.95300.00100.08400.08490.9380
MO0.75 γ 3.173220.414610.74860.77402.782613.72909.39560.83501.47854.72455.45350.86600.62401.65062.70940.9080
β 1.13416.54813.64150.86100.86243.86262.82220.88900.44581.40211.64130.90900.18980.53190.82470.9270
θ 0.19851.49261.16860.77400.26981.17271.09630.83700.22480.70680.80020.87500.11830.44600.53000.9190
1.2 γ 1.727315.96857.64850.74102.710113.872710.20850.80901.88096.26796.68410.87600.94512.22793.12130.9090
β 0.80006.44713.47780.84200.92624.54383.34130.87600.60131.95532.13560.91400.30260.73421.04470.9300
θ 0.00742.29461.47240.73700.26261.84691.68980.81400.37241.24241.26410.88500.27490.81250.96420.9110
Table 2. Descriptive statistics for fatigue life data.
Table 2. Descriptive statistics for fatigue life data.
nMedianMeanVarianceCSCK
761.7361.9592.4772.0198.600
Table 3. Parameter estimates and standard errors (SE), AIC and BIC values for the exponential-MO-SB model.
Table 3. Parameter estimates and standard errors (SE), AIC and BIC values for the exponential-MO-SB model.
ParameterEstimateSE
γ ^ 1.250 1.339
β ^ 3.884 1.591
θ ^ 0.042 0.050
AIC 246.14
BIC 253.13
Table 4. Descriptive statistics for income data.
Table 4. Descriptive statistics for income data.
nMedianMeanVarianceCSCK
50021.125216.70911,270,00122.176494.470
Table 5. Parameter estimates, AIC, and BIC values for the models.
Table 5. Parameter estimates, AIC, and BIC values for the models.
EstimatesPareto II h 1 g 1 h 2 g 2
α ^ 40.151 ( 5.881 ) 19.230 ( 4.214 ) 25.261 ( 6.696 ) 56.927 ( 10.170 ) 14.734 ( 3.331 )
β ^ 1.590 ( 0.160 ) 1.371 ( 0.126 ) 1.041 ( 0.180 ) 1.209 ( 0.208 ) 1.332 ( 0.114 )
θ ^ 1.583 ( 0.162 ) 1.486 ( 0.140 ) 1.101 ( 0.227 )
λ ^ 0.760 ( 0.135 ) 0.797 ( 0.101 ) 0.802 ( 0.180 )
AIC 4861.623 4840.911 4835.926 4854.416 4837.574
BIC 4870.053 4853.555 4852.784 4867.059 4854.432
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Arnold, B.C.; Gómez, Y.M.; Gallardo, D.I.; Gómez, H.W. On Doubly-Generalized-Transmuted Distributions. Symmetry 2025, 17, 1606. https://doi.org/10.3390/sym17101606

AMA Style

Arnold BC, Gómez YM, Gallardo DI, Gómez HW. On Doubly-Generalized-Transmuted Distributions. Symmetry. 2025; 17(10):1606. https://doi.org/10.3390/sym17101606

Chicago/Turabian Style

Arnold, Barry C., Yolanda M. Gómez, Diego I. Gallardo, and Héctor W. Gómez. 2025. "On Doubly-Generalized-Transmuted Distributions" Symmetry 17, no. 10: 1606. https://doi.org/10.3390/sym17101606

APA Style

Arnold, B. C., Gómez, Y. M., Gallardo, D. I., & Gómez, H. W. (2025). On Doubly-Generalized-Transmuted Distributions. Symmetry, 17(10), 1606. https://doi.org/10.3390/sym17101606

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop