A Three-Parameter Record-Based Transmuted Rayleigh Distribution (Order 3): Theory and Real-Data Applications

Faton Merovci

doi:10.3390/sym17071034

Department of Computer Science and Engineering, University of Mitrovica “Isa Boletini”, 40000 Mitrovica, Kosovo

Symmetry2025, 17(7), 1034;https://doi.org/10.3390/sym17071034

This article belongs to the Special Issue Symmetric or Asymmetric Distributions and Its Applications

Version Notes

Order Reprints

Abstract

This paper introduces the record-based transmuted Rayleigh distribution of order 3 (rbt-R), a three-parameter extension of the classical Rayleigh model designed to address data characterized by high skewness and heavy tails. While traditional generalizations of the Rayleigh distribution enhance model flexibility, they often lack sufficient adaptability to capture the complexity of empirical distributions encountered in applied statistics. The rbt-R model incorporates two additional shape parameters, a and b, enabling it to represent a wider range of distributional shapes. Parameter estimation for the rbt-R model is performed using the maximum likelihood method. Simulation studies are conducted to evaluate the asymptotic properties of the estimators, including bias and mean squared error. The performance of the rbt-R model is assessed through empirical applications to four datasets: nicotine yields and carbon monoxide emissions from cigarette data, as well as breaking stress measurements from carbon-fiber materials. Model fit is evaluated using standard goodness-of-fit criteria, including AIC, AIC_c, BIC, and the Kolmogorov–Smirnov statistic. In all cases, the rbt-R model demonstrates a superior fit compared to existing Rayleigh-based models, indicating its effectiveness in modeling highly skewed and heavy-tailed data.

Keywords:

record-based transmuted-G distribution; Rayleigh distribution; maximum likelihood estimation; moments; order statistics; Rényi entropy

1. Introduction

Standard probability distributions often fail to adequately describe real-world data, particularly when the data exhibit non-standard or complex structural properties. To address this issue, researchers have focused on developing broader families of statistical models that more accurately capture the complexities observed in empirical data. A common and effective approach to achieving greater flexibility involves introducing additional shape parameters into traditional probability distributions.

Shaw and Buckley [1] introduced the Quadratic Rank Transmutation Map (QRTM), a methodology for generating new probability distributions from existing ones through rank-based transformations. This framework has since inspired further generalizations. Merovci, Alizadeh, and Hamedani [2] expanded on this concept by proposing the Exponentiated Transmuted-G family. Subsequently, Moolath and Jayakumar [3] introduced the T-transmuted X family, further enriching this line of research.

Furthermore, Granzotto, Louzada, and Balakrishnan [4] presented a cubic extension of the QRTM, termed the Cubic Rank Transmutation Map (CRTM). More recently, Rahman et al. [5] proposed a modified cubic transmuted-G distribution, adding another layer of adaptability within this class of statistical models.

The Rayleigh distribution is a continuous probability distribution commonly used to model non-negative random variables in probability theory and statistics [6]. It is named after Lord Rayleigh (1842–1919). This distribution often arises when the overall magnitude of a vector is determined by its orthogonal components [6].

The Rayleigh distribution is a special case of the Weibull family and is widely applied in reliability analysis, life-testing, and survival analysis. Specifically, if

X \sim Rayleigh (σ),

then X is equivalent to a Weibull random variable with shape parameter

k = 2

and scale parameter

λ = σ

, i.e.,

X \sim Weibull (k = 2, λ = σ) .

Moreover, the square of a Rayleigh-distributed variable with parameter

σ

has the following well-known interpretations:

$X^{2} \sim χ_{2}^{2}$ , the chi-squared distribution with 2 degrees of freedom;
Equivalently, $X^{2} \sim Exp (θ)$ , the exponential distribution with rate parameter $θ = 1 / (2 σ^{2})$ .

A notable characteristic of the Rayleigh distribution is its increasing hazard function, which makes it especially useful in certain reliability and survival contexts.

The Rayleigh distribution has a rich history, with early foundational contributions by Siddiqui [7,8] and Vickers [9]. Over the years, several authors have proposed generalizations to enhance its flexibility and applicability, including Beckmann [10], Kundu [11], and Voda [12]. More recently, Abd Elfattah et al. [13] explored parameter estimation techniques for the Rayleigh model under various censoring schemes, reflecting continued interest in adapting the model to real-world data scenarios.

However, in many practical situations, the traditional Rayleigh form may not adequately capture emerging data patterns, motivating the development of extended versions. Merovci [14,15] introduced the transmuted Rayleigh and transmuted generalized Rayleigh distributions by applying transmutation techniques to the classical Rayleigh model.

More recently, Mir and Ahmad [16] proposed the MTI Rayleigh distribution, designed to provide improved fit, particularly for datasets such as COVID-19 mortality figures. In a similar vein, Rivera et al. [17] developed the Scale Mixture of Rayleigh (SMR) distribution, which performs well in capturing data with strong skewness and heavy tails.

Definition 1 ([18]).

A continuous random variable X is said to follow a Rayleigh distribution with scale parameter

σ > 0

if its probability density function (PDF) is given by

f (x; σ) = \frac{x}{σ^{2}} e^{- x^{2} / (2 σ^{2})}, x \geq 0,

(1)

and its cumulative distribution function (CDF) is

F (x; σ) = 1 - e^{- x^{2} / (2 σ^{2})}, x \geq 0 .

(2)

Here, x denotes the random variable and σ is the scale parameter.

Despite these advancements, the classical Rayleigh distribution remains limited in its ability to accommodate data exhibiting skewness or heavy tails. To address these shortcomings, recent studies have introduced structural extensions aimed at increasing flexibility and improving tail behavior.

One such advancement was proposed by Santoro et al. (2023) [19], who introduced a modified version of the Lomax–Rayleigh distribution using a Slash-type transformation. This modification was designed to increase kurtosis, thereby enhancing the model’s capacity to capture extreme values.

In a different direction, Haj Ahmad et al. (2024) [20] developed a discrete version of the generalized Rayleigh distribution. Utilizing a survival-based discretization approach, their model was tailored for count data—particularly data characterized by overdispersion. They investigated the model’s properties under both classical and Bayesian frameworks and demonstrated its effectiveness through applications to real datasets.

Further extending the Rayleigh family, Dong and Gui (2024) [21] applied the generalized Rayleigh model to stress–strength reliability analysis. Their focus was on estimating the reliability measure

P (Y < X)

, using a sampling technique based on lower record ranked sets. The estimation procedures, developed under both likelihood and Bayesian paradigms, were enhanced with bootstrap confidence intervals, yielding improved precision over traditional sampling methods.

Motivated by these developments, we introduce a new generalization of the Rayleigh distribution: the record-based transmuted Rayleigh distribution of order 3 (rbt-Rayleigh). By incorporating two additional parameters, the proposed model offers increased flexibility while preserving a key reliability feature—the increasing failure rate (IFR)—under specific conditions. We evaluate the model using four distinct datasets and find that it consistently outperforms existing Rayleigh-type models, as assessed by standard criteria such as AIC, BIC, and the Kolmogorov–Smirnov statistic.

2. The Record-Based Transmuted Rayleigh Distribution of Order 3

Balakrishnan and He [22] introduced the record-based transmuted-G (RBT-G) generator of order 3, a flexible framework for constructing new probability models from any given baseline cumulative distribution. This generator includes two additional shape parameters that allow for better control over the distribution’s skewness and tail behavior. The CDF is expressed as

F (x) = 1 - (1 - G (x)) [1 + (1 - a) {- ln (1 - G (x))} + \frac{1 - a - b}{2} {- ln (1 - G (x))}^{2}],

(3)

subject to the constraints

0 \leq a, b \leq 1

and

a + b \leq 1

.

The corresponding probability density function (PDF) derived from this generator is given by

f (x) = g (x) [a + b {- ln (1 - G (x))} + \frac{1 - a - b}{2} {- ln (1 - G (x))}^{2}],

(4)

where

g (x)

denotes the probability density function (PDF) associated with the cumulative distribution function (CDF)

G (x)

.

By taking the Rayleigh distribution as the baseline, we develop a new and flexible model known as the record-based transmuted Rayleigh distribution of order 3 (rbt-Rayleigh). The corresponding PDF and CDF are obtained by substituting the Rayleigh CDF and PDF, given in Equations (1) and (2), into generator Formulas (3) and (4).

f_{r b t - R} (x, σ, a, b) = \frac{x}{σ^{2}} exp (- \frac{x^{2}}{2 σ^{2}}) (a + \frac{1}{2} b \frac{x^{2}}{σ^{2}} + \frac{1}{8} (1 - a - b) \frac{x^{4}}{σ^{4}})

(5)

F_{r b t - R} (x, σ, a, b) = 1 - exp (- \frac{x^{2}}{2 σ^{2}}) (1 + \frac{1}{2} (1 - a) \frac{x^{2}}{σ^{2}} + \frac{1}{8} (1 - a - b) \frac{x^{4}}{σ^{4}})

(6)

0 \leq a, b \leq 1, a + b \leq 1 .

Remark on the Gamma function. The Gamma function, denoted by

Γ (z)

, is a classical extension of the factorial function to real and complex arguments. For any real number

z > 0

, it is defined by the integral

Γ (z) = \int_{0}^{\infty} t^{z - 1} e^{- t} d t .

One of its key properties is that, for every positive integer n, we have

Γ (n) = (n - 1)!,

and more generally, it satisfies the recurrence relation

Γ (z + 1) = z Γ (z) .

We make use of the following well-known integral identity involving the Gamma function:

\begin{matrix} \int_{0}^{\infty} x^{m} e^{- β x^{n}} d x = \frac{Γ (\frac{m + 1}{n})}{n β^{\frac{m + 1}{n}}}, \end{matrix}

(7)

where

ℜ (β) > 0

,

ℜ (m) > - 1

, and

ℜ (n) > 0

, as given in Gradshteyn and Ryzhik ([23], Eq. 3.326(2), p. 339).

This identity is used in the proof of Proposition 1 and will also be used in Theorem 1 to derive the moment expressions.

Proposition 1.

Let

f_{r b t - R} (x, σ, a, b)

and

F_{r b t - R} (x, σ, a, b)

denote the PDF and CDF of the record-based transmuted Rayleigh distribution of order 3 (rbt-Rayleigh), respectively, as defined in Equations (5) and (6). Then:

1.

The PDF

f_{r b t - R} (x, σ, a, b)

satisfies:

(a): $f_{r b t - R} (x, σ, a, b) \geq 0$ for all $x \geq 0$ .
(b): $\int_{0}^{\infty} f_{r b t - R} (x, σ, a, b) d x = 1 .$

2.

The CDF

F_{r b t - R} (x, σ, a, b)

satisfies:

(a): It is continuous on $[0, \infty)$ and right-continuous on $[0, \infty)$ .
(b): It is non-decreasing on $[0, \infty)$ .
(c): It satisfies the limits:

$lim_{x \to 0^{+}} F_{r b t - R} (x, σ, a, b) = 0 and lim_{x \to \infty} F_{r b t - R} (x, σ, a, b) = 1 .$

Proof.

1a. From the explicit form of the density, we note that it is composed of three factors:

f_{r b t - R} (x, σ, a, b) = (\frac{x}{σ^{2}}) exp (- \frac{x^{2}}{2 σ^{2}}) P (x),

where

P (x) = a + \frac{1}{2} b \frac{x^{2}}{σ^{2}} + \frac{1}{8} (1 - a - b) \frac{x^{4}}{σ^{4}} .

Clearly,

x \geq 0

and

σ > 0

imply

\frac{x}{σ^{2}} > 0

for

x > 0

, and

exp (- x^{2} / 2 σ^{2}) > 0

for all x. From the parameter constraints

0 \leq a, b \leq 1

and

a + b \leq 1

, it follows that

1 - a - b \geq 0 .

Thus,

P (x) \geq 0

for all

x \geq 0

, and therefore:

f_{r b t - R} (x, σ, a, b) \geq 0 for all x \geq 0 .

1b.

\begin{matrix} \int_{0}^{\infty} f_{r b t - R} (x, σ, a, b) d x & = \frac{a}{σ^{2}} \int_{0}^{\infty} x exp (- \frac{x^{2}}{2 σ^{2}}) d x \\ + \frac{b}{2 σ^{4}} \int_{0}^{\infty} x^{3} exp (- \frac{x^{2}}{2 σ^{2}}) d x \\ + \frac{1 - a - b}{8 σ^{6}} \int_{0}^{\infty} x^{5} exp (- \frac{x^{2}}{2 σ^{2}}) d x . \end{matrix}

By applying the integral identity from Equation (7), we obtain:

\int_{0}^{\infty} x exp (- \frac{x^{2}}{2 σ^{2}}) d x = \frac{Γ (1)}{2 (\frac{1}{2 σ^{2}})} = σ^{2},

\int_{0}^{\infty} x^{3} exp (- \frac{x^{2}}{2 σ^{2}}) d x = \frac{Γ (2)}{2 {(\frac{1}{2 σ^{2}})}^{2}} = 2 σ^{4},

\int_{0}^{\infty} x^{5} exp (- \frac{x^{2}}{2 σ^{2}}) d x = \frac{Γ (3)}{2 {(\frac{1}{2 σ^{2}})}^{3}} = 8 σ^{6} .

Therefore:

\int_{0}^{\infty} f_{r b t - R} (x, σ, a, b) d x = a + b + (1 - a - b) = 1 .

2a. The function

F_{r b t - R} (x, σ, a, b)

is a composition of exponential and polynomial terms, both of which are continuous on

[0, \infty)

. Hence,

F_{r b t - R} (x, σ, a, b)

is continuous and right-continuous on

[0, \infty)

. 2b. To verify monotonicity, we differentiate:

\frac{d}{d x} F_{r b t - R} (x) = f_{r b t - R} (x) .

From part (1), we know

f_{r b t - R} (x) \geq 0

for all

x \geq 0

, so

F_{r b t - R} (x)

is non-decreasing on

[0, \infty)

.

2c. We now evaluate the limits:

\begin{matrix} lim_{x \to 0^{+}} F_{r b t - R} (x) & = 1 - lim_{x \to 0^{+}} exp (- \frac{x^{2}}{2 σ^{2}}) (1 + \frac{1}{2} (1 - a) \frac{x^{2}}{σ^{2}} + \frac{1}{8} (1 - a - b) \frac{x^{4}}{σ^{4}}) \end{matrix}

(8)

= 1 - 1 = 0 .

(9)

lim_{x \to \infty} F_{r b t - R} (x) = 1 - lim_{x \to \infty} exp (- \frac{x^{2}}{2 σ^{2}}) (1 + \frac{1}{2} (1 - a) \frac{x^{2}}{σ^{2}} + \frac{1}{8} (1 - a - b) \frac{x^{4}}{σ^{4}}) .

By Theorem 3.20(d) from Rudin [24], which states that if

p > 0

and

α \in R

, then

lim_{n \to \infty} \frac{n^{α}}{{(1 + p)}^{n}} = 0,

we conclude:

exp (- \frac{x^{2}}{2 σ^{2}}) (1 + \frac{1}{2} (1 - a) \frac{x^{2}}{σ^{2}} + \frac{1}{8} (1 - a - b) \frac{x^{4}}{σ^{4}}) \to 0 as x \to \infty .

Hence:

lim_{x \to \infty} F_{r b t - R} (x) = 1 - 0 = 1 .

Thus,

F_{r b t - R} (x)

satisfies:

\{\begin{matrix} F_{r b t - R} (0) = 0, \\ F_{r b t - R} (\infty) = 1, \\ F_{r b t - R} (x) is non-decreasing and right-continuous . \end{matrix}

Therefore,

F_{r b t - R} (x)

is a valid cumulative distribution function. Consequently, the record-based transmuted Rayleigh distribution of order 3 satisfies all the necessary conditions to be a valid probability distribution under the given parameter constraints. □

Figure 1 and Figure 2 illustrate the variability in the shapes of the PDF and CDF for the record-based transmuted Rayleigh distribution of order 3.

Figure 1. The PDFs of various rbt-Rayleigh distributions.

Figure 2. The CDFs of various rbt-Rayleigh- distributions.

The hazard rate function (HRF) of rbt-Rayleigh distribution is given by:

\begin{matrix} h (x; a, b, σ) & = \frac{f (x)}{1 - F (x)} \\ = \frac{(a x + \frac{1}{2} b \frac{x^{3}}{σ^{2}} + \frac{1}{8} (1 - a - b) \frac{x^{5}}{σ^{4}})}{σ^{2} (1 + \frac{1}{2} (1 - a) \frac{x^{2}}{σ^{2}} + \frac{1}{8} (1 - a - b) \frac{x^{4}}{σ^{4}})} \end{matrix}

(10)

Since the baseline distribution

G (x)

in our model is the Rayleigh distribution, whose hazard function is strictly increasing (i.e., the Rayleigh distribution is IFR), it is of interest to examine whether this property is preserved under the record-based transmuted transformation of order 3.

This question has been addressed and rigorously proven by Balakrishnan and He (see Section 3.3 in [22]), who showed that the resulting distribution retains the IFR property of the baseline if the transformation parameters satisfy the condition

b \geq a (1 - a) .

Hence, in our case, the proposed distribution is IFR whenever this condition holds.

3. Quantile Function

The cumulative distribution function (CDF) of the rbt–Rayleigh distribution is given by

F (x; a, b, σ) = 1 - exp (- \frac{x^{2}}{2 σ^{2}}) (1 + \frac{1}{2} (1 - a) \frac{x^{2}}{σ^{2}} + \frac{1}{8} (1 - a - b) \frac{x^{4}}{σ^{4}}) .

To compute the quantile function

x_{p}

, we must solve the nonlinear equation

exp (- \frac{x^{2}}{2 σ^{2}}) (1 + (1 - a) \frac{x^{2}}{2 σ^{2}} + \frac{1}{2} (1 - a - b) {(\frac{x^{2}}{2 σ^{2}})}^{2}) = 1 - p,

which does not admit a closed-form solution for general values of a, b, and

σ

. Therefore, the quantile function is computed numerically.

To address this, we implemented a root-finding algorithm in R that solves the equation above for a given probability level

p \in (0, 1)

. The corresponding R code is provided below and can be used to generate a full quantile table or compute specific quantiles such as the median or quartiles.

R Code for Computing the Quantile Function of the rbt–Rayleigh Distribution

Listing 1 presents the R code that numerically computes the quantile function of the rbt–Rayleigh distribution by solving the nonlinear equation.

exp (- \frac{x^{2}}{2 σ^{2}}) [1 + (1 - a) (\frac{x^{2}}{2 σ^{2}}) + \frac{1}{2} (1 - a - b) {(\frac{x^{2}}{2 σ^{2}})}^{2}] = 1 - p .

Listing 1. R code for computing the quantile function of the rbt–Rayleigh distribution.

Table 1 reports the quantile values

x_{p}

of the rbt–Rayleigh distribution for selected probabilities, while Figure 3 illustrates the quantile function

x_{p}

for

p \in (0, 1)

, using parameters

a = 0.8

,

b = 0.15

, and

σ = 1

.

Table 1. Quantile values

x_{p}

of the rbt-R distribution for selected probabilities

p \in (0, 1)

. Parameters:

a = 0.8

,

b = 0.15

,

σ = 1

.

Figure 3. The plot of the quantile function

x_{p}

of the rbt-Rayleigh distribution for

p \in (0, 1)

, with parameters

a = 0.8

,

b = 0.15

, and

σ = 1

. The function was numerically evaluated using the inverse of the cumulative distribution function and visualized in R.

4. Moments

Theorem 1.

If

X \sim rbt-R (a, b, σ)

, then the

r^{th}

moment of X is given by:

E (X^{r}) = 2^{\frac{r}{2}} σ^{r} Γ (\frac{r}{2} + 1) [a + \frac{b}{2} (r + 2) + \frac{(1 - a - b)}{8} (r + 2) (r + 4)] .

Specifically, the mean and variance are obtained as follows:

E (X) = \frac{σ \sqrt{2 π}}{16} (15 - 7 a - 3 b),

Var (X) = E (X^{2}) - {(E (X))}^{2} = 2 σ^{2} (3 - 2 a - b) - \frac{π σ^{2}}{128} {(15 - 7 a - 3 b)}^{2} .

Proof.

\begin{matrix} E (X^{r}) & = \int_{0}^{\infty} x^{r} f (x) d x \\ = \int_{0}^{\infty} \frac{x^{r + 1}}{σ^{2}} exp (- \frac{x^{2}}{2 σ^{2}}) [a + \frac{b}{2 σ^{2}} x^{2} + \frac{(1 - a - b)}{8 σ^{4}} x^{4}] d x \\ = \frac{a}{σ^{2}} \int_{0}^{\infty} x^{r + 1} e^{- x^{2} / (2 σ^{2})} d x \\ + \frac{b}{2 σ^{4}} \int_{0}^{\infty} x^{r + 3} e^{- x^{2} / (2 σ^{2})} d x \\ + \frac{(1 - a - b)}{8 σ^{6}} \int_{0}^{\infty} x^{r + 5} e^{- x^{2} / (2 σ^{2})} d x \\ = 2^{\frac{r}{2}} σ^{r} a Γ (\frac{r}{2} + 1) \\ + 2^{\frac{r}{2}} σ^{r} b (\frac{r}{2} + 1) Γ (\frac{r}{2} + 1) \\ + 2^{\frac{r}{2}} σ^{r} \frac{1 - a - b}{2} (\frac{r}{2} + 2) (\frac{r}{2} + 1) Γ (\frac{r}{2} + 1) \\ = 2^{\frac{r}{2}} σ^{r} Γ (\frac{r}{2} + 1) [a + \frac{b}{2} (r + 2) + \frac{(1 - a - b)}{8} (r + 2) (r + 4)] . \end{matrix}

The result follows by applying the integral identity given in (7). □

Throughout the remainder of the manuscript, we denote the r-th raw moment of the distribution by

μ_{r} = E (X^{r})

. This notation is used for expressing skewness and kurtosis in terms of the central moments.

Theorem 2.

If

X \sim rbt-R (a, b, σ)

, then the moment generating function of X, denoted by

M_{X} (t)

, is given by:

M_{X} (t) = \sum_{i = 0}^{\infty} \frac{t^{i} 2^{i / 2} σ^{i} Γ (\frac{i}{2} + 1)}{i!} [a + \frac{b}{2} (i + 2) + \frac{(1 - a - b)}{8} (i + 2) (i + 4)] .

Proof.

By definition,

M_{X} (t) = \int_{0}^{\infty} e^{t x} f_{r b t - R} (x) d x .

Since

e^{t x} = \sum_{i = 0}^{\infty} \frac{t^{i} x^{i}}{i!},

one obtains

M_{X} (t) = \int_{0}^{\infty} \sum_{i = 0}^{\infty} \frac{t^{i} x^{i}}{i!} f_{r b t - R} (x) d x .

For any finite interval

[0, A]

, the function

f_{r b t - R} (x)

is continuous on

[0, A]

and hence bounded. Therefore, there exists a constant

M > 0

such that

f_{r b t - R} (x) \leq M for all x \in [0, A] .

Hence,

|\frac{t^{i} x^{i}}{i!} f_{r b t - R} (x)| \leq \frac{{| t |}^{i} A^{i}}{i!} M .

Since

\sum_{i = 0}^{\infty} \frac{{(| t | A)}^{i}}{i!} = e^{| t | A} < \infty,

the series

\sum_{i = 0}^{\infty} \frac{t^{i} x^{i}}{i!} f_{r b t - R} (x)

converges uniformly on

[0, A]

by the Weierstrass M-test. Each term is continuous on

[0, A]

. By the Uniform Convergence Theorem for the Riemann integral, one obtains

\int_{0}^{A} \sum_{i = 0}^{\infty} \frac{t^{i} x^{i}}{i!} f_{r b t - R} (x) d x = \sum_{i = 0}^{\infty} \int_{0}^{A} \frac{t^{i} x^{i}}{i!} f_{r b t - R} (x) d x .

The function

f_{r b t - R} (x)

is integrable on

[0, \infty)

and tends to zero faster than any polynomial as

x \to \infty

. Hence, letting

A \to \infty

yields

M_{X} (t) = \sum_{i = 0}^{\infty} \frac{t^{i}}{i!} E (X^{i}) .

From the explicit expression of the moments

E (X^{i})

one obtains

M_{X} (t) = \sum_{i = 0}^{\infty} \frac{t^{i} 2^{\frac{i}{2}} σ^{i} Γ (\frac{i}{2} + 1)}{i!} [a + \frac{b}{2} (i + 2) + \frac{(1 - a - b)}{8} (i + 2) (i + 4)] .

□

The values presented in Table 2 and Table 3 correspond to the mean and variance of the random variable

X \sim rbt-R (a, b, σ)

computed for selected combinations of the parameters a, b, and

σ

.

Table 2. Mean values of X for selected combinations of a, b, and

σ

.

Table 3. Variance values of X for selected combinations of a, b, and

σ

.

5. Skewness and Kurtosis

In addition to the first two moments, which characterize the location and dispersion of a distribution, the third and fourth central moments provide insight into its shape. These are commonly summarized by the coefficients of skewness and kurtosis.

The coefficient of skewness measures the asymmetry of the distribution around its mean. A positive skewness indicates a longer right tail, whereas a negative value implies a heavier left tail. For a random variable X, the skewness is defined as:

γ_{1} = \frac{μ_{3}}{μ_{2}^{3 / 2}} = \frac{E (X^{3}) - 3 E (X^{2}) E (X) + 2 {[E (X)]}^{3}}{{(E (X^{2}) - {[E (X)]}^{2})}^{3 / 2}} .

The coefficient of kurtosis, on the other hand, quantifies the heaviness of the tails and the sharpness of the peak relative to a normal distribution. It is given by:

γ_{2} = \frac{μ_{4}}{μ_{2}^{2}} = \frac{E (X^{4}) - 4 E (X^{3}) E (X) + 6 E (X^{2}) {[E (X)]}^{2} - 3 {[E (X)]}^{4}}{{(E (X^{2}) - {[E (X)]}^{2})}^{2}} .

For the proposed rbt-Rayleigh distribution, explicit expressions for the moments

E (X^{r})

have been derived in Theorem 1. These can be directly substituted into the formulas above to compute the skewness and kurtosis as functions of the parameters a, b, and

σ

.

6. Harmonic Mean

Theorem 3.

If

X \sim rbt-R (a, b, σ)

, then the harmonic mean of X, defined as

H = E (1 / X)

, is given by:

H = \frac{\sqrt{2 π}}{16 σ} (3 + 5 a + 5 b) .

Proof.

We compute the expected value of the reciprocal:

\begin{matrix} H & = E (\frac{1}{X}) \\ = \int_{0}^{\infty} \frac{1}{x} f (x) d x \\ = \int_{0}^{\infty} \frac{1}{x} \cdot \frac{x}{σ^{2}} exp (- \frac{x^{2}}{2 σ^{2}}) (a + \frac{1}{2} b \frac{x^{2}}{σ^{2}} + \frac{1}{8} (1 - a - b) \frac{x^{4}}{σ^{4}}) d x \\ = \frac{a}{σ^{2}} \int_{0}^{\infty} exp (- \frac{x^{2}}{2 σ^{2}}) d x + \frac{b}{2 σ^{4}} \int_{0}^{\infty} x^{2} exp (- \frac{x^{2}}{2 σ^{2}}) d x \\ + \frac{1 - a - b}{8 σ^{6}} \int_{0}^{\infty} x^{4} exp (- \frac{x^{2}}{2 σ^{2}}) d x . \end{matrix}

Using the Gaussian integrals:

\begin{matrix} \int_{0}^{\infty} exp (- \frac{x^{2}}{2 σ^{2}}) d x & = \frac{σ \sqrt{2 π}}{2}, \\ \int_{0}^{\infty} x^{2} exp (- \frac{x^{2}}{2 σ^{2}}) d x & = \frac{σ^{3} \sqrt{2 π}}{2}, \\ \int_{0}^{\infty} x^{4} exp (- \frac{x^{2}}{2 σ^{2}}) d x & = \frac{3 σ^{5} \sqrt{2 π}}{2}, \end{matrix}

we conclude that:

H = \frac{\sqrt{2 π}}{16 σ} (3 + 5 a + 5 b) .

□

7. Mean Deviations

The mean deviation about the mean and the mean deviation about the median are defined by:

δ_{1} = \int_{0}^{\infty} | x - μ | f (x) d x = 2 μ F (μ) - 2 \int_{0}^{μ} x f (x) d x,

(11)

δ_{2} = \int_{0}^{\infty} | x - M | f (x) d x = 2 \int_{M}^{\infty} (x - M) f (x) d x = μ - 2 \int_{0}^{M} x f (x) d x,

(12)

where

μ = E (X)

is the mean, and M denotes the median of the distribution.

Theorem 4.

For the rbt-R distribution, the mean deviations

δ_{1}

and

δ_{2}

are given by:

\begin{matrix} δ_{1} & = 2 μ [1 - exp (- \frac{μ^{2}}{2 σ^{2}}) (1 + \frac{1}{2} (1 - a) \frac{μ^{2}}{σ^{2}} + \frac{1}{8} (1 - a - b) \frac{μ^{4}}{σ^{4}})] \\ - \frac{2 μ (3 (5 a + b - 5) σ^{4} + ((5 a + b - 5) μ^{2} - 8 a) σ^{2} + μ^{4} (- 1 + a + b)) exp (- \frac{μ^{2}}{2 σ^{2}})}{8 σ^{2}} \\ + \frac{15 σ^{3} (erf (\frac{μ \sqrt{2}}{2 σ}) - 1) \sqrt{π} \sqrt{2} ((a + \frac{b}{5} - 1) σ^{2} - \frac{8 a}{15})}{8 σ^{2}}, \end{matrix}

(13)

and

\begin{matrix} δ_{2} & = μ + \frac{15 σ^{3} (erf (\frac{M \sqrt{2}}{2 σ}) - 1) \sqrt{π} \sqrt{2} ((a + \frac{b}{5} - 1) σ^{2} - \frac{8 a}{15})}{8 σ^{2}} \\ - \frac{2 M (3 (5 a + b - 5) σ^{4} + ((5 a + b - 5) M^{2} - 8 a) σ^{2} + M^{4} (- 1 + a + b)) exp (- \frac{M^{2}}{2 σ^{2}})}{8 σ^{2}} . \end{matrix}

(14)

Proof.

\begin{matrix} \int_{0}^{μ} x f (x) d x = \int_{0}^{μ} x [\frac{x}{σ^{2}} exp (- \frac{1}{2} \frac{x^{2}}{σ^{2}}) (a + \frac{1}{2} b \frac{x^{2}}{σ^{2}} + \frac{1}{8} (1 - a - b) \frac{x^{4}}{σ^{4}})] d x \\ = \frac{a}{σ^{2}} \int_{0}^{\infty} x^{2} exp (- \frac{1}{2} \frac{x^{2}}{σ^{2}}) d x \\ + \frac{b}{2 σ^{2}} \int_{0}^{\infty} x^{4} exp (- \frac{1}{2} \frac{x^{2}}{σ^{2}}) d x \\ + \frac{(1 - a - b)}{8 σ^{4}} \int_{0}^{\infty} x^{6} exp (- \frac{1}{2} \frac{x^{2}}{σ^{2}}) d x \\ = \frac{a}{σ^{2}} [σ^{3} \sqrt{\frac{π}{2}} erfc (\frac{μ}{\sqrt{2} σ}) - μ σ^{2} exp (- \frac{μ^{2}}{2 σ^{2}})] \\ + \frac{b}{2 σ^{2}} [3 σ^{5} \sqrt{\frac{π}{2}} erfc (\frac{μ}{\sqrt{2} σ}) - μ σ^{2} exp (- \frac{μ^{2}}{2 σ^{2}}) (μ^{2} + 3 σ^{2})] \\ + \frac{(1 - a - b)}{8 σ^{4}} [15 σ^{7} \sqrt{\frac{π}{2}} erfc (\frac{μ}{\sqrt{2} σ}) - μ σ^{2} exp (- \frac{μ^{2}}{2 σ^{2}}) (μ^{4} + 5 μ^{2} σ^{2} + 15 σ^{4})] \\ = \frac{2 μ (3 (5 a + b - 5) σ^{4} + ((5 a + b - 5) μ^{2} - 8 a) σ^{2} + μ^{4} (- 1 + a + b)) exp (- \frac{μ^{2}}{2 σ^{2}})}{16 σ^{2}} \\ + \frac{15 σ^{3} (erf (\frac{μ \sqrt{2}}{2 σ}) - 1) \sqrt{π} \sqrt{2} ((a + \frac{b}{5} - 1) σ^{2} - \frac{8 a}{15})}{16 σ^{2}} . \end{matrix}

By substituting into Equations (11) and (12), we obtain the mean deviations. Here, erfc represents the complementary error function. □

8. Entropy

Entropy measures provide a formal means of quantifying the uncertainty inherent in probability distributions. Among them, Shannon entropy is the most widely used, while the Rényi entropy [25], a parametric generalization, offers a broader framework for analyzing distributional characteristics [26].

Let X be a continuous random variable with probability density function

f (x)

. The Rényi entropy of order

α > 0, α \neq 1

, is defined by

H_{α} (X) = \frac{1}{1 - α} log (\int_{- \infty}^{\infty} f {(x)}^{α} d x) .

Theorem 5.

Let

X \sim rbt-R (a, b, σ)

. The Rényi entropy of order α for the rbt-R distribution is given by:

\begin{matrix} H_{α} (X) & = \frac{1}{1 - α} log (\sum_{k = 0}^{\infty} \sum_{r = 0}^{\infty} (\binom{α}{k}) (\binom{k}{r}) \frac{a^{α - k} b^{k - r} {(1 - a - b)}^{r}}{2^{k + 2 r + 1} σ^{2 α + 2 k + 2 r}} \\ \times {(\frac{α}{2 σ^{2}})}^{- \frac{α + 2 k + 2 r + 1}{2}} Γ (\frac{α + 2 k + 2 r + 1}{2})) . \end{matrix}

Proof.

\begin{matrix} H_{α} (X) & = \frac{1}{1 - α} log (\int_{0}^{\infty} f {(x)}^{α} d x) \\ = \frac{1}{1 - α} log (\int_{0}^{\infty} \frac{x^{α}}{σ^{2 α}} exp (- \frac{α x^{2}}{2 σ^{2}}) {(a + \frac{1}{2} b \frac{x^{2}}{σ^{2}} + \frac{1}{8} (1 - a - b) \frac{x^{4}}{σ^{4}})}^{α} d x) \end{matrix}

Applying the binomial expansion yields

\begin{matrix} {(a + \frac{b}{2 σ^{2}} x^{2} + \frac{1 - a - b}{8 σ^{4}} x^{4})}^{α} \\ = \sum_{k = 0}^{\infty} \sum_{r = 0}^{k} (\binom{α}{k}) (\binom{k}{r}) a^{α - k} b^{k - r} {(1 - a - b)}^{r} {(\frac{x^{2}}{2 σ^{2}})}^{k + r} . \end{matrix}

(15)

For any finite interval

[0, A]

, the function

f_{r b t - R} {(x)}^{α}

is continuous and hence bounded on

[0, A]

. Therefore, there exists a constant

M > 0

such that

|\frac{x^{α}}{σ^{2 α}} {(\frac{x^{2}}{2 σ^{2}})}^{k + r} exp (- \frac{α x^{2}}{2 σ^{2}})| \leq \frac{A^{α + 2 k + 2 r}}{σ^{2 α + 2 k + 2 r} 2^{k + r}} M .

(16)

Since

\sum_{k = 0}^{\infty} \sum_{r = 0}^{k} (\binom{α}{k}) (\binom{k}{r}) a^{α - k} b^{k - r} {(1 - a - b)}^{r} {(\frac{A^{2}}{2 σ^{2}})}^{k + r}

(17)

converges absolutely, by the Weierstrass M-test, the double series converges uniformly on

[0, A]

. Each term in the sum is continuous on

[0, A]

. By the Uniform Convergence Theorem for the Riemann integral, one obtains

\int_{0}^{A} \sum_{k = 0}^{\infty} \sum_{r = 0}^{k} g_{k, r} (x) d x = \sum_{k = 0}^{\infty} \sum_{r = 0}^{k} \int_{0}^{A} g_{k, r} (x) d x .

(18)

The function

f_{r b t - R} (x)

tends to zero faster than any polynomial as

x \to \infty

. Thus, taking the limit

A \to \infty

yields

\begin{matrix} H_{α} (X) \\ = \frac{1}{1 - α} log (\sum_{k = 0}^{\infty} \sum_{r = 0}^{k} (\binom{α}{k}) (\binom{k}{r}) \frac{a^{α - k} b^{k - r} {(1 - a - b)}^{r}}{σ^{2 α + 2 k + 2 r} 2^{k + 2 r}} \\ \times \int_{0}^{\infty} x^{α + 2 k + 2 r} exp (- \frac{α x^{2}}{2 σ^{2}}) d x) . \end{matrix}

(19)

Applying the integral identity

\int_{0}^{\infty} x^{ν - 1} e^{- μ x^{2}} d x = \frac{1}{2} μ^{- ν / 2} Γ (\frac{ν}{2})

gives

\begin{matrix} \int_{0}^{\infty} x^{α + 2 k + 2 r} exp (- \frac{α x^{2}}{2 σ^{2}}) d x \\ = \frac{1}{2} {(\frac{α}{2 σ^{2}})}^{- \frac{α + 2 k + 2 r + 1}{2}} Γ (\frac{α + 2 k + 2 r + 1}{2}) . \end{matrix}

(20)

Hence,

\begin{matrix} H_{α} (X) \\ = \frac{1}{1 - α} log (\sum_{k = 0}^{\infty} \sum_{r = 0}^{k} (\binom{α}{k}) (\binom{k}{r}) \frac{a^{α - k} b^{k - r} {(1 - a - b)}^{r}}{2^{k + 2 r + 1} σ^{2 α + 2 k + 2 r}} \\ \times {(\frac{α}{2 σ^{2}})}^{- \frac{α + 2 k + 2 r + 1}{2}} Γ (\frac{α + 2 k + 2 r + 1}{2})) . \end{matrix}

(21)

□

9. Order Statistics

Let

X_{(1)} \leq X_{(2)} \dots X_{(n)}

denote the order statistics from an i.i.d. sample

X_{1}, \dots, X_{n}

drawn from a continuous distribution with probability density function (PDF)

f_{X}

and cumulative distribution function (CDF)

F_{X}

.

The PDF of the

k^{th}

order statistic is given by:

f_{X_{(k)}} (x) = \frac{n!}{(k - 1)! (n - k)!} f_{X} (x) {[F_{X} (x)]}^{k - 1} {[1 - F_{X} (x)]}^{n - k}, k = 1, \dots, n .

If

X \sim rbt-R (a, b, σ)

, then

\begin{matrix} f_{X_{(k)}} (x) = & \frac{n!}{(k - 1)! (n - k)!} \frac{x}{σ^{2}} exp (- \frac{1}{2} \frac{x^{2}}{σ^{2}}) \\ \times (a + \frac{1}{2} b \frac{x^{2}}{σ^{2}} + \frac{1}{8} (1 - a - b) \frac{x^{4}}{σ^{4}}) \\ \times {[1 - exp (- \frac{1}{2} \frac{x^{2}}{σ^{2}}) (1 + \frac{1}{2} (1 - a) \frac{x^{2}}{σ^{2}} + \frac{1}{8} (1 - a - b) \frac{x^{4}}{σ^{4}})]}^{k - 1} \\ \times {[exp (- \frac{1}{2} \frac{x^{2}}{σ^{2}}) (1 + \frac{1}{2} (1 - a) \frac{x^{2}}{σ^{2}} + \frac{1}{8} (1 - a - b) \frac{x^{4}}{σ^{4}})]}^{n - k} . \end{matrix}

When

k = 1

, we obtain the PDF of the smallest observation in the sample:

f_{X_{(1)}} (x) = n f_{X} (x) {[1 - F_{X} (x)]}^{n - 1} .

For the rbt-R distribution, this becomes:

\begin{matrix} f_{X_{(1)}} (x) = & n \frac{x}{σ^{2}} exp (- \frac{1}{2} \frac{x^{2}}{σ^{2}}) (a + \frac{1}{2} b \frac{x^{2}}{σ^{2}} + \frac{1}{8} (1 - a - b) \frac{x^{4}}{σ^{4}}) \\ \times {[exp (- \frac{1}{2} \frac{x^{2}}{σ^{2}}) (1 + \frac{1}{2} (1 - a) \frac{x^{2}}{σ^{2}} + \frac{1}{8} (1 - a - b) \frac{x^{4}}{σ^{4}})]}^{n - 1} . \end{matrix}

When

k = n

, we obtain the PDF of the largest observation in the sample:

f_{X_{(n)}} (x) = n f_{X} (x) {[F_{X} (x)]}^{n - 1} .

For the rbt-R distribution, this becomes

\begin{matrix} f_{X_{(n)}} (x) = & n \frac{x}{σ^{2}} exp (- \frac{1}{2} \frac{x^{2}}{σ^{2}}) (a + \frac{1}{2} b \frac{x^{2}}{σ^{2}} + \frac{1}{8} (1 - a - b) \frac{x^{4}}{σ^{4}}) \\ \times {[1 - exp (- \frac{1}{2} \frac{x^{2}}{σ^{2}}) (1 + \frac{1}{2} (1 - a) \frac{x^{2}}{σ^{2}} + \frac{1}{8} (1 - a - b) \frac{x^{4}}{σ^{4}})]}^{n - 1} . \end{matrix}

10. Maximum Likelihood Estimation

Let

x_{1}, x_{2}, \dots, x_{n}

denote a random sample drawn from the record-based transmuted Rayleigh distribution of order 3, with parameters

σ > 0

and

a, b \in [0, 1]

, subject to the constraint

a + b \leq 1

.

The likelihood function corresponding to this sample is given by:

L (σ, a, b) = \prod_{i = 1}^{n} f (x_{i}; σ, a, b) = \prod_{i = 1}^{n} \frac{x_{i}}{σ^{2}} exp (- \frac{x_{i}^{2}}{2 σ^{2}}) (a + \frac{b x_{i}^{2}}{2 σ^{2}} + \frac{(1 - a - b) x_{i}^{4}}{8 σ^{4}}) .

Taking logarithms, we obtain the log-likelihood function:

\begin{matrix} ℓ (σ, a, b) = & \sum_{i = 1}^{n} ln (x_{i}) - \frac{1}{2 σ^{2}} \sum_{i = 1}^{n} x_{i}^{2} - 2 n ln (σ) \\ + \sum_{i = 1}^{n} ln (a + \frac{b x_{i}^{2}}{2 σ^{2}} + \frac{(1 - a - b) x_{i}^{4}}{8 σ^{4}}) . \end{matrix}

To estimate

(\hat{σ}, \hat{a}, \hat{b})

, we differentiate

ℓ (σ, a, b)

with respect to each parameter and set the score equations to zero:

\begin{matrix} \frac{\partial ℓ}{\partial σ} & = \frac{1}{σ^{3}} \sum_{i = 1}^{n} x_{i}^{2} - \frac{2 n}{σ} + \sum_{i = 1}^{n} \frac{4 x_{i}^{2} (a x_{i}^{2} - 2 b σ^{2} + b x_{i}^{2} - x_{i}^{2})}{σ (8 a σ^{4} - a x_{i}^{4} + 4 b σ^{2} x_{i}^{2} - b x_{i}^{4} + x_{i}^{4})} = 0, \\ \frac{\partial ℓ}{\partial a} & = \sum_{i = 1}^{n} \frac{8 σ^{4} - x_{i}^{4}}{8 a σ^{4} - a x_{i}^{4} + 4 b σ^{2} x_{i}^{2} - b x_{i}^{4} + x_{i}^{4}} = 0, \\ \frac{\partial ℓ}{\partial b} & = \sum_{i = 1}^{n} \frac{x_{i}^{2} (4 σ^{2} - x_{i}^{2})}{8 a σ^{4} - a x_{i}^{4} + 4 b σ^{2} x_{i}^{2} - b x_{i}^{4} + x_{i}^{4}} = 0 . \end{matrix}

In general, this nonlinear system has no closed-form solution, and numerical methods such as the Broyden–Fletcher–Goldfarb–Shanno (BFGS) algorithm or Newton–Raphson are employed to maximize the log-likelihood subject to the parameter constraints.

Under standard regularity conditions, the maximum likelihood estimator is asymptotically normal. Specifically:

\sqrt{n} (\hat{θ} - θ) \overset{D}{\to} N_{3} (0, I^{- 1} (θ)),

where

θ = {(σ, a, b)}^{⊤}

, and

I (θ)

is the Fisher information matrix:

I (θ) = E [- \nabla_{θ}^{2} ℓ (θ)] .

To confirm this result, we verify that the usual regularity conditions are satisfied:

Interior Point: The true parameter $θ_{0}$ lies in the interior of the space $Θ = {(σ, a, b) \in R_{+} \times {[0, 1]}^{2} : a + b \leq 1}$ .
Differentiability: The log-likelihood is continuously differentiable on $Θ$ for all $x_{i} > 0$ .
Identifiability: The model is identifiable since each parameter combination yields a distinct density.
Fisher Information: The matrix $I (θ)$ exists and is positive definite.
Finite Expectations: Expectations involving the first and second derivatives are finite due to the exponential tail of the density.

To clarify notation, the gradient

\nabla_{θ} ℓ (θ)

and the Hessian

\nabla_{θ}^{2} ℓ (θ)

are given by:

\nabla_{θ} ℓ (θ) = (\begin{matrix} \frac{\partial ℓ}{\partial σ} \\ \frac{\partial ℓ}{\partial a} \\ \frac{\partial ℓ}{\partial b} \end{matrix}), \nabla_{θ}^{2} ℓ (θ) = (\begin{matrix} \frac{\partial^{2} ℓ}{\partial σ^{2}} & \frac{\partial^{2} ℓ}{\partial σ \partial a} & \frac{\partial^{2} ℓ}{\partial σ \partial b} \\ \frac{\partial^{2} ℓ}{\partial a \partial σ} & \frac{\partial^{2} ℓ}{\partial a^{2}} & \frac{\partial^{2} ℓ}{\partial a \partial b} \\ \frac{\partial^{2} ℓ}{\partial b \partial σ} & \frac{\partial^{2} ℓ}{\partial b \partial a} & \frac{\partial^{2} ℓ}{\partial b^{2}} \end{matrix}) .

The explicit expressions for the second-order partial derivatives used in this Hessian matrix are provided in Appendix A.

The observed information matrix is:

{\hat{I}}_{i j} = - {\frac{\partial^{2} ℓ}{\partial θ_{i} \partial θ_{j}}|}_{θ = \hat{θ}}, i, j \in {σ, a, b} .

Its inverse gives the estimated variance–covariance matrix:

\hat{Var} (\hat{θ}) = {\hat{I}}^{- 1} .

Approximate

(1 - α) 100 %

confidence intervals are constructed as:

{\hat{θ}}_{j} \pm z_{α / 2} \sqrt{{[{\hat{I}}^{- 1}]}_{j j}}, j \in {σ, a, b},

where

z_{α / 2}

is the standard normal quantile.

For reporting purposes:

\hat{σ} \pm z_{α / 2} \sqrt{{[{\hat{I}}^{- 1}]}_{σ σ}}, \hat{a} \pm z_{α / 2} \sqrt{{[{\hat{I}}^{- 1}]}_{a a}}, \hat{b} \pm z_{α / 2} \sqrt{{[{\hat{I}}^{- 1}]}_{b b}} .

This formulation allows practitioners to assess parameter uncertainty and construct valid confidence intervals. Empirical examples using this procedure are provided in the following sections.

11. Application to Real Data

To evaluate the practical performance of the proposed record-based transmuted Rayleigh distribution of order 3 (rbtR), we fitted it to four real-world datasets. For comparison purposes, we also fitted the Rayleigh distribution (R), the transmuted Rayleigh distribution (T-R), the generalized Rayleigh distribution (G-R) as introduced in Surles and Padgett [27], and the transmuted generalized Rayleigh distribution (TGR). Model comparison was conducted using multiple goodness-of-fit criteria, namely the Akaike Information Criterion (AIC), the corrected Akaike Information Criterion (AIC_c), the Bayesian Information Criterion (BIC), and the Kolmogorov–Smirnov distance (KS). Additionally, overlay plots of the estimated probability density functions and cumulative distribution functions were provided to facilitate a visual assessment of fit quality.

11.1. Dataset 1: Nicotine Yields (FTC, 1994)

Source: Federal Trade Commission (FTC), Cigarette Yields Report, 1994 (EconDataUS).

The first dataset analyzed in this study relates to nicotine yield levels reported in 1994 by the U.S. Federal Trade Commission (FTC) in their widely referenced document titled “Tar, Nicotine, and Carbon Monoxide of the Smoke of 1206 Varieties of Domestic Cigarettes”. This report remains a central source for researchers studying chemical content in cigarettes, and it is freely available online: https://www.ftc.gov/system/files/documents/reports/report-tar-nicotine-carbon-monoxide-smoke-1206-varieties-domestic-cigarettes-year-1994/tarandnico.pdf (accessed on 10 January 2025).

According to the FTC’s documentation, nicotine yields were measured using the Cambridge Filter Method—an approach the agency has endorsed since 1967 to ensure consistency across cigarette brands. The measurements, expressed in milligrams per cigarette, were rounded to the nearest 0.1 mg.

Data were collected from various manufacturers across more than 50 locations in the United States. The report includes results from the five dominant companies at the time: Philip Morris, R. J. Reynolds, Lorillard, Brown & Williamson, and Liggett Group. In the case of lesser-known brands, data were often submitted directly by the manufacturers, following FTC guidelines.

In addition to yield values, the report outlines sample collection protocols and standard smoking conditions, such as the 23 mm smoked butt length, which contribute to the reproducibility of the data. Previous studies by Sloan and Sublett [28] and Schultz and Spears [29] further support the accuracy of the laboratory techniques applied.

This dataset serves as a useful illustration for examining how well the proposed rbt-Rayleigh distribution performs in practice. Descriptive statistics are provided in Table 4, while parameter estimates and model selection criteria—obtained via maximum likelihood estimation—are summarized in Table 5 and Table 6.

Table 4. Descriptive statistics for variable X (N = 346).

Table 5. Parameter estimates and log-likelihood for Rayleigh-type models for Dataset 1.

Table 6. Goodness-of-fit measures for Dataset 1.

A histogram of the observed nicotine yields is presented in Figure 4, while the empirical versus fitted CDFs are displayed in Figure 5. The log-likelihood contour plot, with

σ

fixed at its MLE, is shown in Figure 6.

Figure 4. Nicotine yields (mg per cigarette).

Figure 5. Empirical vs. fitted CDF—nicotine yields.

Figure 6. Log-likelihood contour—nicotine yields (

σ

fixed at MLE).

The observed Fisher information matrix (i.e., the negative of the Hessian matrix of the log-likelihood evaluated at the MLEs) under the rbt-Rayleigh distribution is estimated as:

\hat{I} = (\begin{matrix} 18418.040 & - 2081.0342 & - 1303.6585 \\ - 2081.0342 & 853.5247 & 255.8975 \\ - 1303.6585 & 255.8975 & 137.4793 \end{matrix}) .

The inverse of this matrix, denoted by

{\hat{I}}^{- 1}

, provides the estimated variance–covariance matrix of the maximum likelihood estimators (MLEs):

\hat{Var} (\hat{θ}) = {\hat{I}}^{- 1} = (\begin{matrix} 0.000174 & - 0.000159 & 0.001949 \\ - 0.000159 & 0.002797 & - 0.006720 \\ 0.001949 & - 0.006720 & 0.038266 \end{matrix}) .

Based on this, the approximate 95% confidence intervals for the parameters

σ

, a, and b are computed as:

σ \in [0.378, 0.430], a \in [0.095, 0.302], b \in [0, 0.416] .

11.2. Dataset 2: Carbon Monoxide Emissions (FTC, 2007)

Source: U.S. Federal Trade Commission (FTC), “Nicotine, Tar, and CO Content of Domestic Cigarettes in 2007—Regular Brands, sorted by nicotine, tar, and CO.” Available at: https://www.econdataus.com/cigrs.html, accessed on 23 June 2025.

This dataset reports carbon monoxide (CO) emissions per cigarette, measured in milligrams, as published by the FTC in 2007. The data were extracted from the publicly available table titled “Regular Brands, sorted by CO”, which presents standardized yield values for a wide range of domestic cigarette brands.

Measurements were obtained using the Cambridge Filter Method, a standardized laboratory technique recommended by the FTC to ensure comparability across brands. CO emission values are rounded to the nearest 0.1 mg and cover major tobacco manufacturers in the U.S. market.

The distribution of CO emissions exhibits moderate right skew due to a small number of high-emission brands. The proposed rbt-Rayleigh model fits the observed data closely, capturing the distributional shape more effectively than alternative models. Among the models evaluated, it achieves the lowest Kolmogorov–Smirnov (KS) distance (0.037), indicating superior goodness-of-fit.

Descriptive statistics for CO emissions are presented in Table 7. Parameter estimates and log-likelihood values for the considered models are reported in Table 8, while goodness-of-fit criteria are summarized in Table 9. A histogram of the observed CO emission values is shown in Figure 7, the empirical versus fitted CDFs are displayed in Figure 8, and the log-likelihood contour plot—with

σ

fixed at its MLE—is provided in Figure 9.

Table 7. Descriptive statistics for variable X (N = 816).

Table 8. Parameter estimates and log-likelihood for CO emission models.

Table 9. Goodness-of-fit measures for CO emission models.

Figure 7. CO emissions (mg per cigarette).

Figure 8. Empirical vs. fitted CDF—CO emissions.

Figure 9. Log-likelihood contour—CO emissions (

σ

fixed at MLE).

The estimated variance–covariance matrix of the MLEs under the rbt-R distribution is given by:

\begin{matrix} H^{- 1} = (\begin{matrix} 0.008926 & - 0.000546 & 0.006565 \\ - 0.000546 & 0.000802 & - 0.001975 \\ 0.006565 & - 0.001975 & 0.010555 \end{matrix}) . \end{matrix}

11.3. Dataset 3: Carbon-Fibre Breaking Stress (50 mm Gauge)

The carbon-fibre breaking stress values analyzed in this study correspond to tensile strength measurements (in GPa) collected from fibres with a gauge length of 50 mm, as reported by Lishamol and Jiju [30]. These measurements were obtained under controlled conditions from production samples, in accordance with standard testing procedures used to ensure that the fibres meet the necessary strength requirements for composite applications.

Of particular interest is the lower tail of the strength distribution—especially the first percentile—as reductions in this region may signal declining fibre quality and compromise the structural integrity of the resulting composite material.

This dataset serves as the third case study in our analysis (see Table 10). Descriptive statistics are visualized in Figure 10, and the empirical versus fitted CDFs are displayed in Figure 11. The proposed three-parameter rbt-Rayleigh distribution demonstrates a superior fit, achieving the lowest AIC, AIC_c, BIC, and KS values across competing models (Table 11 and Table 12). The corresponding log-likelihood surface (Figure 12) confirms the presence of a unique optimal solution.

Table 10. Breaking stress values (in GPa) for Dataset 3.

Figure 10. Breaking stress (Dataset 3, GPa).

Figure 11. Empirical vs. fitted CDF—breaking stress (Dataset 3).

Table 11. Parameter estimates and log-likelihood for Dataset 3.

Table 12. Goodness-of-fit measures for Dataset 3.

Figure 12. Log-likelihood contour—breaking stress (Dataset 3) (

σ

fixed).

The estimated variance–covariance matrix of the MLEs under the rbt-R distribution is given by:

\begin{matrix} H^{- 1} = (\begin{matrix} 2.4317 \times 10^{- 3} & 1.0494 \times 10^{- 3} & 4.2358 \times 10^{- 14} \\ 1.0494 \times 10^{- 3} & 3.5987 \times 10^{- 3} & 2.0707 \times 10^{- 14} \\ 4.2358 \times 10^{- 14} & 2.0707 \times 10^{- 14} & 1.1767 \times 10^{- 16} \end{matrix}) . \end{matrix}

11.4. Dataset 4: Carbon-Fibre Breaking Stress (20 mm Gauge)

This dataset consists of tensile strength measurements for carbon fibres tested at a gauge length of 20 mm, as reported by Badar and Priest [31]. The strength values, expressed in gigapascals (GPa), were obtained under controlled laboratory conditions.

Using a shorter gauge length than that in Dataset 3 reduces the likelihood of encountering surface flaws in the tested segment. As a result, the measured strengths in this dataset tend to be slightly higher.

This distinction in testing setup provides a good opportunity to examine how the proposed rbt-Rayleigh distribution performs when the data come from a similar material but under different conditions. The observed breaking stress values for this dataset are presented in Table 13.

Table 13. Observed breaking stress values (GPa) for Dataset 4 (carbon fibre, 20 mm gauge length).

A summary of the descriptive statistics is given in Table 14, while parameter estimates and goodness-of-fit results appear in Table 15 and Table 16. Visual diagnostics are shown in Figure 13, Figure 14 and Figure 15. Once again, the rbt-R model provides the best fit, outperforming all four competing distributions across all evaluation metrics (Table 15 and Table 16). This superiority is further supported by graphical diagnostics shown in Figure 13, Figure 14 and Figure 15.

Table 14. Descriptive statistics for Dataset 4 (

n = 69

).

Table 15. Parameter estimates and log-likelihood for Rayleigh-type distributions—Dataset 4.

Table 16. Goodness-of-fit measures for Rayleigh-type distributions—Dataset 4.

Figure 13. Breaking stress (Dataset 4, GPa).

Figure 14. Empirical vs. fitted CDF—breaking stress (Dataset 4).

Figure 15. Log-likelihood contour—breaking stress (Dataset 4) (

σ

fixed).

The estimated variance–covariance matrix of the MLEs under the rbt-R distribution is given by:

\begin{matrix} H^{- 1} = (\begin{matrix} 7.030 \times 10^{- 4} & 6.526 \times 10^{- 4} & 2.321 \times 10^{- 14} \\ 6.526 \times 10^{- 4} & 4.213 \times 10^{- 3} & 2.478 \times 10^{- 14} \\ 2.321 \times 10^{- 14} & 2.478 \times 10^{- 14} & 1.177 \times 10^{- 16} \end{matrix}) . \end{matrix}

11.5. Summary of Results Across Datasets

The rbt-R distribution provides the best overall fit according to AIC, AIC_c, BIC, and KS criteria.

12. Random Sampling via Inverse Transform and Newton–Raphson

To investigate the finite-sample properties of the rbt-R maximum likelihood estimators, synthetic data are generated using a combination of the inverse-transform method and Newton–Raphson root-finding. The procedure is as follows:

Fix the true parameter vector $(a, b, σ)$ , set the sample size n, and choose an initial value $x^{(0)} > 0$ (we use the Rayleigh quantile approximation below).
For each $i = 1, \dots, n$ , draw $u_{i} \sim U (0, 1)$ .
Compute

$x_{i}^{(0)} = σ \sqrt{- 2 ln (1 - u_{i})},$

which provides a Rayleigh-based initial guess.
Solve the equation

$F (x_{i}; a, b, σ) = u_{i}$

iteratively via

$x_{i}^{(k + 1)} = x_{i}^{(k)} - \frac{F (x_{i}^{(k)}; a, b, σ) - u_{i}}{f (x_{i}^{(k)}; a, b, σ)},$

where f and F denote the rbt-R density and CDF, respectively. Iteration stops when $| x_{i}^{(k + 1)} - x_{i}^{(k)} | < 10^{- 8}$ or after 50 steps, whichever occurs first.
Upon convergence, set $x_{i} = x_{i}^{(k + 1)}$ . Repeat steps 2–5 for all $i = 1, \dots, n$ to obtain the simulated sample ${x_{1}, \dots, x_{n}}$ .

13. Monte Carlo Experiment

We evaluated the performance of the estimator at the true parameter values

(a, b, σ) = (0.3, 0.4, 2)

, in the sample sizes

n \in {10, 20, 30, \dots, 500} .

For each n, we perform

S = 1000

independent replications. In each replication:

Generate a sample of size n using the inverse-Newton method described above;
Obtain the MLEs $(\hat{a}, \hat{b}, \hat{σ})$ via constrained optimization (L–BFGS–B);
Store the estimated triplet.

For each parameter

θ \in {a, b, σ}

, we compute:

\bar{\hat{θ}} = \frac{1}{S} \sum_{s = 1}^{S} {\hat{θ}}^{(s)}, MSE (\hat{θ}) = \frac{1}{S} \sum_{s = 1}^{S} {({\hat{θ}}^{(s)} - θ)}^{2} .

The results by sample size are presented in Table A1 and Table A2; both tables are provided in the Appendix B.

14. Conclusions

In this work, we present a new three-parameter extension of the classical Rayleigh distribution by applying the record-based transmuted-G (RBT-G) distribution of order 3, originally introduced by Balakrishnan and He. The resulting model, referred to as the rbt-Rayleigh distribution of order 3, offers increased flexibility to capture skewness and heavy-tailed behavior while retaining analytical tractability. Several analytical properties of the proposed distribution are derived, including the r-th raw and central moments, the harmonic mean, Shannon entropy, the quantile function, and the order statistics. The model parameters are estimated using the maximum likelihood method, implemented via numerical optimization in the R programming environment.

To assess the model’s performance, the rbt-Rayleigh distribution is applied to four empirical datasets: two related to cigarette composition (nicotine content and carbon monoxide emissions), and two concerning carbon-fibre tensile strength. A comparative analysis is conducted using standard goodness-of-fit criteria: Akaike Information Criterion (AIC), corrected Akaike Information Criterion (AICc), Bayesian Information Criterion (BIC), and the Kolmogorov–Smirnov (KS) statistic. In all cases, the rbt-Rayleigh model demonstrates a superior fit relative to the classical Rayleigh, transmuted Rayleigh, generalized Rayleigh, and transmuted generalized Rayleigh distributions.

Funding

This research received no external funding.

Data Availability Statement

Datasets 1 and 2 are publicly available from the EconDataUS repository https://econdataus.com/smoke.html, (accessed 10 May 2025). Datasets 3 and 4 are included within the main text of the paper.

Conflicts of Interest

The author declares no conflicts of interest.

Appendix A. Second-Order Derivatives of the Log-Likelihood

The second-order partial derivatives of the log-likelihood function

ℓ (σ, a, b)

are given below:

\begin{matrix} \frac{\partial^{2} ℓ}{\partial σ^{2}} & = - 3 \frac{\sum_{i = 1}^{n} x_{i}^{2}}{σ^{4}} + 2 \frac{n}{σ^{2}} \\ - \frac{5}{2} \sum_{i = 1}^{n} \frac{x_{i}^{2} A_{i}}{σ^{2} {[(\frac{1 - a - b}{8}) x_{i}^{4} + \frac{1}{2} b x_{i}^{2} σ^{2} + a σ^{4}]}^{2}}, \end{matrix}

where

A_{i} = - \frac{1}{40} {(1 - a - b)}^{2} x_{i}^{6} + \frac{1}{4} b σ^{2} (1 - a - b) x_{i}^{4} + σ^{4} (a^{2} + a b - \frac{1}{5} b^{2} - a) x_{i}^{2} - \frac{6}{5} a b σ^{6} .

\begin{matrix} \frac{\partial^{2} ℓ}{\partial σ \partial a} & = 8 σ \sum_{i = 1}^{n} \frac{x_{i}^{2} (8 σ^{4} b - 4 b x_{i}^{2} σ^{2} + b x_{i}^{4} + 4 x_{i}^{2} σ^{2})}{{(8 a σ^{4} - a x_{i}^{4} + 4 b x_{i}^{2} σ^{2} - b x_{i}^{4} + x_{i}^{4})}^{2}}, \\ \frac{\partial^{2} ℓ}{\partial σ \partial b} & = - 8 σ \sum_{i = 1}^{n} \frac{x_{i}^{2} (8 a σ^{4} - 4 a σ^{2} x_{i}^{2} + a x_{i}^{4} - x_{i}^{4})}{{(8 a σ^{4} - a x_{i}^{4} + 4 b x_{i}^{2} σ^{2} - b x_{i}^{4} + x_{i}^{4})}^{2}}, \\ \frac{\partial^{2} ℓ}{\partial a^{2}} & = - \sum_{i = 1}^{n} \frac{{(8 σ^{4} - x_{i}^{4})}^{2}}{{(8 a σ^{4} - a x_{i}^{4} + 4 b x_{i}^{2} σ^{2} - b x_{i}^{4} + x_{i}^{4})}^{2}}, \\ \frac{\partial^{2} ℓ}{\partial a \partial b} & = - \sum_{i = 1}^{n} \frac{(4 σ^{2} - x_{i}^{2}) (8 σ^{4} - x_{i}^{4}) x_{i}^{2}}{{(8 a σ^{4} - a x_{i}^{4} + 4 b x_{i}^{2} σ^{2} - b x_{i}^{4} + x_{i}^{4})}^{2}} . \end{matrix}

Appendix B. Simulation Study

Table A1. Empirical means of the MLEs for various n.

n	$a_{true}$	$b_{true}$	$σ_{true}$	${\hat{a}}_{MLE}$	${\hat{b}}_{MLE}$	${\hat{σ}}_{MLE}$
10	0.30	0.40	2.00	0.2776	0.2315	1.9304
20	0.30	0.40	2.00	0.2910	0.2766	1.9527
30	0.30	0.40	2.00	0.3053	0.2947	1.9701
40	0.30	0.40	2.00	0.3052	0.3238	1.9809
50	0.30	0.40	2.00	0.3113	0.3318	1.9909
60	0.30	0.40	2.00	0.3166	0.3291	1.9955
70	0.30	0.40	2.00	0.3158	0.3283	1.9948
80	0.30	0.40	2.00	0.3191	0.3231	1.9957
90	0.30	0.40	2.00	0.3152	0.3378	1.9988
100	0.30	0.40	2.00	0.3160	0.3417	2.0004
110	0.30	0.40	2.00	0.3171	0.3441	2.0024
120	0.30	0.40	2.00	0.3154	0.3472	2.0010
130	0.30	0.40	2.00	0.3148	0.3521	2.0043
140	0.30	0.40	2.00	0.3150	0.3542	2.0043
150	0.30	0.40	2.00	0.3157	0.3563	2.0062
160	0.30	0.40	2.00	0.3172	0.3549	2.0070
170	0.30	0.40	2.00	0.3165	0.3586	2.0087
180	0.30	0.40	2.00	0.3176	0.3548	2.0072
190	0.30	0.40	2.00	0.3177	0.3536	2.0063
200	0.30	0.40	2.00	0.3185	0.3559	2.0079
210	0.30	0.40	2.00	0.3182	0.3564	2.0071
220	0.30	0.40	2.00	0.3175	0.3581	2.0072
230	0.30	0.40	2.00	0.3182	0.3588	2.0076
240	0.30	0.40	2.00	0.3169	0.3602	2.0072
250	0.30	0.40	2.00	0.3170	0.3630	2.0087
260	0.30	0.40	2.00	0.3168	0.3669	2.0109
270	0.30	0.40	2.00	0.3158	0.3707	2.0114
280	0.30	0.40	2.00	0.3161	0.3685	2.0105
290	0.30	0.40	2.00	0.3149	0.3680	2.0092
300	0.30	0.40	2.00	0.3148	0.3706	2.0105
310	0.30	0.40	2.00	0.3149	0.3715	2.0110
320	0.30	0.40	2.00	0.3143	0.3720	2.0108
330	0.30	0.40	2.00	0.3136	0.3740	2.0110
340	0.30	0.40	2.00	0.3146	0.3745	2.0122
350	0.30	0.40	2.00	0.3143	0.3753	2.0116
360	0.30	0.40	2.00	0.3142	0.3744	2.0111
370	0.30	0.40	2.00	0.3142	0.3743	2.0112
380	0.30	0.40	2.00	0.3140	0.3734	2.0107
390	0.30	0.40	2.00	0.3136	0.3740	2.0103
400	0.30	0.40	2.00	0.3138	0.3741	2.0103
410	0.30	0.40	2.00	0.3135	0.3750	2.0101
420	0.30	0.40	2.00	0.3133	0.3748	2.0096
430	0.30	0.40	2.00	0.3127	0.3771	2.0102
440	0.30	0.40	2.00	0.3125	0.3805	2.0116
450	0.30	0.40	2.00	0.3125	0.3805	2.0114
460	0.30	0.40	2.00	0.3128	0.3814	2.0123
470	0.30	0.40	2.00	0.3131	0.3821	2.0133
480	0.30	0.40	2.00	0.3131	0.3823	2.0132
490	0.30	0.40	2.00	0.3130	0.3841	2.0135
500	0.30	0.40	2.00	0.3130	0.3838	2.0132

Table A2. Empirical MSEs of the MLEs for various n.

n	$a_{true}$	$b_{true}$	$σ_{true}$	$MSE (\hat{a})$	$MSE (\hat{b})$	$MSE (\hat{σ})$
10	0.30	0.40	2.00	0.07478	0.15215	0.12203
20	0.30	0.40	2.00	0.05540	0.13649	0.07197
30	0.30	0.40	2.00	0.04825	0.12766	0.05825
40	0.30	0.40	2.00	0.03807	0.12022	0.04425
50	0.30	0.40	2.00	0.03370	0.11448	0.04117
60	0.30	0.40	2.00	0.02918	0.11037	0.03931
70	0.30	0.40	2.00	0.02488	0.10290	0.03669
80	0.30	0.40	2.00	0.02082	0.09507	0.03437
90	0.30	0.40	2.00	0.01795	0.09336	0.03080
100	0.30	0.40	2.00	0.01656	0.09035	0.02928
110	0.30	0.40	2.00	0.01625	0.08864	0.02910
120	0.30	0.40	2.00	0.01422	0.08482	0.02623
130	0.30	0.40	2.00	0.01334	0.08412	0.02624
140	0.30	0.40	2.00	0.01190	0.08094	0.02453
150	0.30	0.40	2.00	0.01177	0.07878	0.02403
160	0.30	0.40	2.00	0.01097	0.07734	0.02290
170	0.30	0.40	2.00	0.01001	0.07396	0.02204
180	0.30	0.40	2.00	0.00985	0.07195	0.02145
190	0.30	0.40	2.00	0.00974	0.07043	0.02107
200	0.30	0.40	2.00	0.00936	0.06925	0.02096
210	0.30	0.40	2.00	0.00903	0.06693	0.02026
220	0.30	0.40	2.00	0.00858	0.06485	0.01939
230	0.30	0.40	2.00	0.00844	0.06370	0.01925
240	0.30	0.40	2.00	0.00814	0.06286	0.01891
250	0.30	0.40	2.00	0.00787	0.06234	0.01902
260	0.30	0.40	2.00	0.00749	0.06179	0.01858
270	0.30	0.40	2.00	0.00706	0.06006	0.01811
280	0.30	0.40	2.00	0.00677	0.05880	0.01824
290	0.30	0.40	2.00	0.00647	0.05755	0.01751
300	0.30	0.40	2.00	0.00614	0.05711	0.01723
310	0.30	0.40	2.00	0.00584	0.05656	0.01689
320	0.30	0.40	2.00	0.00562	0.05625	0.01653
330	0.30	0.40	2.00	0.00538	0.05562	0.01625
340	0.30	0.40	2.00	0.00532	0.05532	0.01625
350	0.30	0.40	2.00	0.00511	0.05436	0.01559
360	0.30	0.40	2.00	0.00505	0.05305	0.01543
370	0.30	0.40	2.00	0.00501	0.05193	0.01530
380	0.30	0.40	2.00	0.00492	0.05084	0.01499
390	0.30	0.40	2.00	0.00480	0.05048	0.01474
400	0.30	0.40	2.00	0.00470	0.04909	0.01454
410	0.30	0.40	2.00	0.00457	0.04835	0.01435
420	0.30	0.40	2.00	0.00439	0.04816	0.01412
430	0.30	0.40	2.00	0.00424	0.04810	0.01405
440	0.30	0.40	2.00	0.00417	0.04736	0.01388
450	0.30	0.40	2.00	0.00410	0.04731	0.01370
460	0.30	0.40	2.00	0.00401	0.04727	0.01368
470	0.30	0.40	2.00	0.00397	0.04656	0.01359
480	0.30	0.40	2.00	0.00386	0.04581	0.01332
490	0.30	0.40	2.00	0.00379	0.04539	0.01314
500	0.30	0.40	2.00	0.00374	0.04449	0.01311

References

Shaw, W.T.; Buckley, I.R. The Alchemy of Probability Distributions: Beyond Gram-Charlier and Cornish-Fisher Expansions, and Skew-Normal or Kurtotic-Normal Distributions. UCL Discovery Repository. 2007. Available online: https://library.wolfram.com/infocenter/Articles/6670/alchemy.pdf (accessed on 15 February 2025).
Merovci, F.; Alizadeh, M.; Hamedani, G.G. Another generalized transmuted family of distributions: Properties and applications. Austrian J. Stat. 2016, 45, 71–93. [Google Scholar] [CrossRef]
Moolath, G.B.; Jayakumar, K. T-transmuted X family of distributions. Statistica 2017, 77, 251–276. [Google Scholar]
Granzotto, D.C.T.; Louzada, F.; Balakrishnan, N. Cubic rank transmuted distributions: Inferential issues and applications. J. Stat. Comput. Simul. 2017, 87, 2760–2778. [Google Scholar] [CrossRef]
Rahman, M.M.; Gemeay, A.M.; Khan, M.A.I.; Meraou, M.A.; Bakr, M.E.; Muse, A.H.; Balogun, O.S. A new modified cubic transmuted-G family of distributions: Properties and different methods of estimation with applications to real-life data. AIP Adv. 2023, 13, 095025. [Google Scholar] [CrossRef]
Rayleigh, L. On the stability, or instability, of certain fluid motions. Proc. Lond. Math. Soc. 1880, 9, 57–70. [Google Scholar] [CrossRef]
Siddiqui, M.M. Some problems connected with Rayleigh distributions. J. Res. Natl. Bur. Stand. D 1962, 66, 167. [Google Scholar] [CrossRef]
Siddiqui, M.M. Statistical inference for Rayleigh distributions. J. Res. Natl. Bur. Stand. Sec. D 1964, 68, 1007. [Google Scholar] [CrossRef]
Vickers, J.W. A Parameter Estimation Technique for the Generalized Rayleigh-Rician Distribution and Laha’s Bessel Distribution; PN: Fort Belvoir, VA, USA, 1976. [Google Scholar]
Beckmann, P. Rayleigh distribution and its generalizations. Radio Sci. J. Res. NBS/USNC-URSI 1964, 68, 927–932. [Google Scholar] [CrossRef]
Kundu, D.; Raqab, M.Z. Generalized Rayleigh distribution: Different methods of estimations. Comput. Stat. Data Anal. 2005, 49, 187–200. [Google Scholar] [CrossRef]
Voda, V.G. A new generalization of Rayleigh distribution. Reliab. Theory Appl. 2007, 2, 47–56. [Google Scholar]
Abd Elfattah, A.M.; Hassan, A.S.; Ziedan, D.M. Efficiency of maximum likelihood estimators under different censored sampling schemes for Rayleigh distribution. Interstat 2006, 1, 1–16. [Google Scholar]
Merovci, F. Transmuted Rayleigh distribution. Austrian J. Stat. 2013, 42, 21–31. [Google Scholar] [CrossRef]
Merovci, F. Transmuted generalized Rayleigh distribution. J. Stat. Appl. Probab. 2014, 3, 9. [Google Scholar] [CrossRef]
Mir, A.A.; Ahmad, S.P. A New Extended Rayleigh Distribution with Applications of COVID-19 Data. Austrian J. Stat. 2025, 54, 69–84. [Google Scholar] [CrossRef]
Rivera, P.A.; Barranco-Chamorro, I.; Gallardo, D.I.; Gómez, H.W. Scale Mixture of Rayleigh Distribution. Mathematics 2020, 8, 1842. [Google Scholar] [CrossRef]
Vodă, V.G. Inferential procedures on a generalized Rayleigh variate. I. Apl. Mat. 1976, 21, 395–412. [Google Scholar] [CrossRef]
Santoro, K.I.; Gallardo, D.I.; Venegas, O.; Cortés, I.E.; Gómez, H.W. A Heavy-Tailed Distribution Based on the Lomax–Rayleigh Distribution with Applications to Medical Data. Mathematics 2023, 11, 4626. [Google Scholar] [CrossRef]
Haj Ahmad, H.; Ramadan, D.A.; Almetwally, E.M. Evaluating the discrete generalized Rayleigh distribution: Statistical inferences and applications to real data analysis. Mathematics 2024, 12, 183. [Google Scholar] [CrossRef]
Dong, Y.; Gui, W. Reliability Estimation in Stress Strength for Generalized Rayleigh Distribution Using a Lower Record Ranked Set Sampling Scheme. Mathematics 2024, 12, 1650. [Google Scholar] [CrossRef]
Balakrishnan, N.; He, M. A record-based transmuted family of distributions. In Advances in Statistics-Theory and Applications: Honoring the Contributions of Barry C. Arnold in Statistical Science; Springer: Cham, Switzerland, 2021; pp. 3–24. [Google Scholar]
Gradshteyn, I.S.; Ryzhik, I.M. Table of Integrals, Series, and Products; Academic Press: Cambridge, MA, USA, 2014. [Google Scholar]
Rudin, W. Principles of Mathematical Analysis, 3rd ed.; McGraw-Hill: New York, NY, USA, 1976. [Google Scholar]
Rényi, A. On measures of entropy and information. In Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, Volume 1: Contributions to the Theory of Statistics; University of California Press: Berkeley, CA, USA, 1961; Volume 4, pp. 547–562. [Google Scholar]
Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
Surles, J.G.; Padgett, W.J. Inference for reliability and stress strength for a scaled Burr type X distribution. Lifetime Data Anal. 2001, 7, 187–200. [Google Scholar] [CrossRef] [PubMed]
Sloan, C.H.; Sublett, B.J. Determination of methyl nitrite in cigarette smoke. Tob. Sci. 1967, 11, 21–24. [Google Scholar]
Schultz, F.J.; Spears, A.W. Determination of moisture in total particulate matter. Tob. Sci. 1966, 10, 75–76. [Google Scholar]
Lishamol, T.; Jiju, G. A generalized Rayleigh distribution and its application. Biom. Biostat. Int. J. 2019, 8, 139–143. [Google Scholar]
Bader, M.G.; Priest, A.M. Statistical aspects of fibre and bundle strength in hybrid composites. In Progress in Science and Engineering of Composites; The Japan Society for Composite Materials: Tokyo, Japan, 1982; pp. 1129–1136. [Google Scholar]

Figure 1. The PDFs of various rbt-Rayleigh distributions.

Figure 2. The CDFs of various rbt-Rayleigh- distributions.

Figure 3. The plot of the quantile function

x_{p}

of the rbt-Rayleigh distribution for

p \in (0, 1)

, with parameters

a = 0.8

,

b = 0.15

, and

σ = 1

. The function was numerically evaluated using the inverse of the cumulative distribution function and visualized in R.

Figure 4. Nicotine yields (mg per cigarette).

Figure 5. Empirical vs. fitted CDF—nicotine yields.

Figure 6. Log-likelihood contour—nicotine yields (

σ

fixed at MLE).

Figure 7. CO emissions (mg per cigarette).

Figure 8. Empirical vs. fitted CDF—CO emissions.

Figure 9. Log-likelihood contour—CO emissions (

σ

fixed at MLE).

Figure 10. Breaking stress (Dataset 3, GPa).

Figure 11. Empirical vs. fitted CDF—breaking stress (Dataset 3).

Figure 12. Log-likelihood contour—breaking stress (Dataset 3) (

σ

fixed).

Figure 13. Breaking stress (Dataset 4, GPa).

Figure 14. Empirical vs. fitted CDF—breaking stress (Dataset 4).

Figure 15. Log-likelihood contour—breaking stress (Dataset 4) (

σ

fixed).

Table 1. Quantile values

x_{p}

of the rbt-R distribution for selected probabilities

p \in (0, 1)

. Parameters:

a = 0.8

,

b = 0.15

,

σ = 1

.

Table 1. Quantile values

x_{p}

of the rbt-R distribution for selected probabilities

p \in (0, 1)

. Parameters:

a = 0.8

,

b = 0.15

,

σ = 1

.

p	$x_{p}$	p	$x_{p}$	p	$x_{p}$	p	$x_{p}$	p	$x_{p}$
0.01	0.159	0.12	0.566	0.23	0.809	0.34	1.021	0.45	1.225
0.02	0.225	0.13	0.590	0.24	0.829	0.35	1.039	0.46	1.243
0.03	0.276	0.14	0.614	0.25	0.849	0.36	1.058	0.47	1.262
0.04	0.320	0.15	0.638	0.26	0.869	0.37	1.076	0.48	1.281
0.05	0.358	0.16	0.661	0.27	0.888	0.38	1.095	0.49	1.300
0.06	0.393	0.17	0.683	0.28	0.907	0.39	1.113	0.50	1.319
0.07	0.426	0.18	0.705	0.29	0.926	0.40	1.132	0.60	1.516
0.08	0.457	0.19	0.726	0.30	0.945	0.41	1.150	0.70	1.738
0.09	0.486	0.20	0.747	0.31	0.964	0.42	1.169	0.80	2.009
0.10	0.513	0.21	0.768	0.32	0.983	0.43	1.187	0.90	2.401
0.11	0.540	0.22	0.789	0.33	1.002	0.44	1.206	0.99	3.372

Table 2. Mean values of X for selected combinations of a, b, and

σ

.

Table 2. Mean values of X for selected combinations of a, b, and

σ

.

a	b	$σ = 1$	$σ = 1.5$	$σ = 2$	$σ = 3$	$σ = 5$	$σ = 7$	$σ = 9$
0.050	0.900	1.872	2.808	3.744	5.616	9.361	13.105	16.849
0.100	0.850	1.841	2.761	3.682	5.522	9.204	12.886	16.567
0.150	0.800	1.809	2.714	3.619	5.428	9.047	12.666	16.285
0.200	0.750	1.778	2.667	3.556	5.334	8.891	12.447	16.003
0.250	0.700	1.747	2.620	3.494	5.240	8.734	12.228	15.721
0.300	0.650	1.715	2.573	3.431	5.146	8.577	12.008	15.439
0.350	0.600	1.684	2.526	3.368	5.052	8.421	11.789	15.157
0.400	0.100	1.864	2.796	3.729	5.593	9.322	13.050	16.779
0.450	0.080	1.819	2.728	3.638	5.457	9.094	12.732	16.370
0.500	0.060	1.773	2.660	3.547	5.320	8.867	12.414	15.961
0.550	0.040	1.728	2.592	3.456	5.184	8.640	12.096	15.552
0.600	0.020	1.683	2.524	3.365	5.048	8.413	11.778	15.143
0.650	0.010	1.632	2.449	3.265	4.897	8.162	11.427	14.692
0.700	0.010	1.578	2.366	3.155	4.733	7.888	11.043	14.198
0.750	0.100	1.480	2.221	2.961	4.441	7.402	10.363	13.324
0.800	0.080	1.435	2.153	2.870	4.305	7.175	10.045	12.915
0.850	0.060	1.390	2.084	2.779	4.169	6.948	9.727	12.507
0.900	0.040	1.344	2.016	2.688	4.033	6.721	9.409	12.098
0.950	0.020	1.299	1.948	2.597	3.896	6.494	9.091	11.689

Table 3. Variance values of X for selected combinations of a, b, and

σ

.

Table 3. Variance values of X for selected combinations of a, b, and

σ

.

a	b	$σ = 1$	$σ = 1.5$	$σ = 2$	$σ = 3$	$σ = 5$	$σ = 7$	$σ = 9$
0.050	0.900	0.495	1.114	1.980	4.456	12.377	24.260	40.103
0.100	0.850	0.511	1.151	2.046	4.603	12.786	25.060	41.426
0.150	0.800	0.526	1.183	2.103	4.732	13.145	25.765	42.591
0.200	0.750	0.538	1.211	2.153	4.844	13.456	26.373	43.596
0.250	0.700	0.549	1.235	2.195	4.938	13.717	26.885	44.442
0.300	0.650	0.557	1.254	2.229	5.014	13.929	27.300	45.129
0.350	0.600	0.564	1.268	2.255	5.073	14.092	27.620	45.657
0.400	0.100	0.724	1.630	2.897	6.519	18.109	35.494	58.674
0.450	0.080	0.732	1.646	2.927	6.585	18.293	35.854	59.268
0.500	0.060	0.735	1.654	2.940	6.614	18.373	36.011	59.528
0.550	0.040	0.734	1.651	2.936	6.606	18.350	35.966	59.453
0.600	0.020	0.729	1.640	2.916	6.560	18.224	35.718	59.044
0.650	0.010	0.715	1.609	2.861	6.436	17.878	35.042	57.926
0.700	0.010	0.691	1.555	2.765	6.220	17.279	33.866	55.983
0.750	0.100	0.608	1.368	2.433	5.474	15.205	29.801	49.263
0.800	0.080	0.581	1.306	2.323	5.226	14.516	28.452	47.032
0.850	0.060	0.549	1.235	2.196	4.941	13.724	26.900	44.467
0.900	0.040	0.513	1.155	2.053	4.619	12.830	25.146	41.568
0.950	0.020	0.473	1.065	1.893	4.259	11.831	23.190	38.334

Table 4. Descriptive statistics for variable X (N = 346).

Statistic	N	Mean	SD	Min	$Q_{1}$	Median	$Q_{3}$	Max
X	346	0.85	0.33	0.10	0.60	0.90	1.10	2.00

Table 5. Parameter estimates and log-likelihood for Rayleigh-type models for Dataset 1.

Model	Parameters	Std. Error	LogLik
Rayleigh	$σ = 0.6478$	$0.0174$	$136.7884$
Transmuted Rayleigh	$\begin{matrix} σ = 0.5506 \\ λ = - 0.7842 \end{matrix}$	$\begin{matrix} 0.0133 \\ 0.0700 \end{matrix}$	$113.5013$
Generalized Rayleigh	$\begin{matrix} α = 1.5784 \\ β = 1.2503 \end{matrix}$	$\begin{matrix} 0.1185 \\ 0.0385 \end{matrix}$	$119.6494$
Transmuted Generalized Rayleigh	$\begin{matrix} α = 1.1731 \\ β = 1.3165 \\ λ = - 0.6810 \end{matrix}$	$\begin{matrix} 0.1410 \\ 0.0394 \\ 0.1205 \end{matrix}$	$112.6435$
Record-Based Transmuted Rayleigh	$\begin{matrix} σ = 0.4042 \\ a = 0.1991 \\ b = 0.0334 \end{matrix}$	$\begin{matrix} 0.0132 \\ 0.0529 \\ 0.1956 \end{matrix}$	$108.6344$

Table 6. Goodness-of-fit measures for Dataset 1.

Model	KS	AIC	AIC_c	BIC
Rayleigh	$0.1867$	$275.5768$	$275.5884$	$279.4232$
Transmuted Rayleigh	$0.1272$	$231.0026$	$231.0376$	$238.6955$
Generalized Rayleigh	$0.1382$	$243.2988$	$243.3337$	$250.9916$
Transmuted Generalized Rayleigh	$0.1189$	$231.2869$	$231.3571$	$242.8263$
Record-Based Transmuted Rayleigh	$0.0838$	$223.2689$	$223.3391$	$234.8082$

Table 7. Descriptive statistics for variable X (N = 816).

Statistic	N	Mean	SD	Min	Q1	Median	Q3	Max
X	816	12.05	4.06	1	9	12	15	21

Table 8. Parameter estimates and log-likelihood for CO emission models.

Model	Parameters	Std. Error	$- ℓ$
Rayleigh	$σ = 8.9938$	$0.1574$	2429.698
Transmuted Rayleigh	$\begin{matrix} σ = 7.4408 \\ λ = - 0.9536 \end{matrix}$	$\begin{matrix} 0.0983 \\ 0.0003 \end{matrix}$	2328.177
Generalized Rayleigh	$\begin{matrix} α = 2.1584 \\ β = 0.0978 \end{matrix}$	$\begin{matrix} 0.1114 \\ 0.0018 \end{matrix}$	2330.157
Transmuted Generalized Rayleigh	$\begin{matrix} α = 1.6236 \\ β = 0.1022 \\ λ = - 0.6556 \end{matrix}$	$\begin{matrix} 0.1838 \\ 0.0019 \\ 0.1238 \end{matrix}$	2316.907
Record-Based Transmuted Rayleigh	$\begin{matrix} σ = 5.4697 \\ a = 0.0815 \\ b = 0.1332 \end{matrix}$	$\begin{matrix} 0.0945 \\ 0.0283 \\ 0.1027 \end{matrix}$	2304.837

Table 9. Goodness-of-fit measures for CO emission models.

Model	AIC	AIC_c	BIC	KS
Rayleigh	4861.397	4861.402	4866.101	0.2154
Transmuted Rayleigh	4660.354	4660.369	4669.763	0.1415
Generalized Rayleigh	4664.314	4664.328	4673.723	0.1368
Transmuted Generalized Rayleigh	4639.814	4639.843	4653.927	0.1223
Record-Based Transmuted Rayleigh	4615.674	4615.704	4629.787	0.1087

Table 10. Breaking stress values (in GPa) for Dataset 3.

0.39	0.85	1.08	1.25	1.47	1.57	1.61	1.61	1.69	1.80	1.84
1.87	1.89	2.03	2.03	2.05	2.12	2.35	2.41	2.43	2.48	2.50
2.53	2.55	2.55	2.56	2.59	2.67	2.73	2.74	2.79	2.81	2.82
2.85	2.87	2.88	2.93	2.95	2.96	2.97	3.09	3.11	3.11	3.15
3.15	3.19	3.22	3.22	3.27	3.28	3.31	3.31	3.33	3.39	3.39
3.56	3.60	3.65	3.68	3.70	3.75	4.20	4.38	4.42	4.70	4.90

Table 11. Parameter estimates and log-likelihood for Dataset 3.

Model	Parameters	Std. Error	$- ℓ$
Rayleigh	$σ = 2.049$	$0.126$	$- 98.208$
Transmuted Rayleigh	$\begin{matrix} σ = 1.696 \\ λ = - 0.959 \end{matrix}$	$\begin{matrix} 0.079 \\ 0.0003 \end{matrix}$	$- 88.874$
Generalized Rayleigh	$\begin{matrix} α = 2.348 \\ β = 0.438 \end{matrix}$	$\begin{matrix} 0.431 \\ 0.028 \end{matrix}$	$- 88.637$
Transmuted Generalized Rayleigh	$\begin{matrix} α = 1.759 \\ β = 0.461 \\ λ = - 0.710 \end{matrix}$	$\begin{matrix} 0.495 \\ 0.029 \\ 0.257 \end{matrix}$	$- 86.920$
Record-Based Transmuted Rayleigh	$\begin{matrix} σ = 1.217 \\ a = 0.083 \\ b = 4.585 \times 10^{- 8} \end{matrix}$	$\begin{matrix} 0.002 \\ 0.004 \\ 1.177 \times 10^{- 16} \end{matrix}$	$- 85.490$

Table 12. Goodness-of-fit measures for Dataset 3.

Model	AIC	AIC_c	BIC	KS
Rayleigh	198.417	198.479	200.607	0.211
Transmuted Rayleigh	181.749	181.939	186.128	0.126
Generalized Rayleigh	181.274	181.464	185.653	0.105
Transmuted Generalized Rayleigh	179.839	180.226	186.408	0.083
Record-Based Transmuted Rayleigh	176.979	177.366	183.548	0.070

Table 13. Observed breaking stress values (GPa) for Dataset 4 (carbon fibre, 20 mm gauge length).

0.312	0.314	0.479	0.552	0.700	0.803	0.861	0.865	0.944	0.958
0.966	0.997	1.006	1.021	1.027	1.055	1.063	1.098	1.140	1.179
1.224	1.240	1.253	1.270	1.272	1.274	1.301	1.301	1.359	1.382
1.382	1.426	1.434	1.435	1.478	1.490	1.511	1.514	1.535	1.554
1.566	1.570	1.586	1.629	1.633	1.642	1.648	1.684	1.697	1.726
1.770	1.773	1.800	1.809	1.818	1.821	1.848	1.880	1.954	2.012
2.067	2.084	2.090	2.096	2.128	2.233	2.433	2.585	2.585

Table 14. Descriptive statistics for Dataset 4 (

n = 69

).

Table 14. Descriptive statistics for Dataset 4 (

n = 69

).

Statistic	N	Mean	SD	Min	$Q_{1}$	Median	$Q_{3}$	Max
Breaking stress (GPa)	69	1.451	0.495	0.312	1.098	1.478	1.773	2.585

Table 15. Parameter estimates and log-likelihood for Rayleigh-type distributions—Dataset 4.

Model	Parameters	Std. Error	$- ℓ$
Rayleigh	$σ = 1.083$	$0.06521$	$59.4183$
Transmuted Rayleigh	$\begin{matrix} σ = 0.894 \\ λ = - 0.961 \end{matrix}$	$\begin{matrix} 0.04055 \\ 0.00028 \end{matrix}$	$50.9525$
Generalized Rayleigh	$\begin{matrix} α = 2.174 \\ β = 0.813 \end{matrix}$	$\begin{matrix} 0.387 \\ 0.0521 \end{matrix}$	$50.9049$
Transmuted Generalized Rayleigh	$\begin{matrix} α = 1.659 \\ β = 0.855 \\ λ = - 0.663 \end{matrix}$	$\begin{matrix} 0.46035 \\ 0.05338 \\ 0.27535 \end{matrix}$	$49.6063$
Record-Based Transmuted Rayleigh	$\begin{matrix} σ = 0.649 \\ a = 0.110 \\ b = 2.45 \times 10^{- 8} \end{matrix}$	$\begin{matrix} 0.02651 \\ 0.06490 \\ 1.08 \times 10^{- 8} \end{matrix}$	$48.5363$

Table 16. Goodness-of-fit measures for Rayleigh-type distributions—Dataset 4.

Model	AIC	AIC_c	BIC	KS
Rayleigh	120.8367	120.8964	123.0708	0.185
Transmuted Rayleigh	105.9050	106.0868	110.3732	0.074
Generalized Rayleigh	105.8098	105.9916	110.2780	0.061
Transmuted Generalized Rayleigh	105.2126	105.5818	111.9149	0.039
Record-Based Transmuted Rayleigh	103.0726	103.4419	109.7750	0.035

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

A Three-Parameter Record-Based Transmuted Rayleigh Distribution (Order 3): Theory and Real-Data Applications

Abstract

1. Introduction

2. The Record-Based Transmuted Rayleigh Distribution of Order 3

3. Quantile Function

R Code for Computing the Quantile Function of the rbt–Rayleigh Distribution

4. Moments

5. Skewness and Kurtosis

6. Harmonic Mean

7. Mean Deviations

8. Entropy

9. Order Statistics

10. Maximum Likelihood Estimation

11. Application to Real Data

11.1. Dataset 1: Nicotine Yields (FTC, 1994)

11.2. Dataset 2: Carbon Monoxide Emissions (FTC, 2007)

11.3. Dataset 3: Carbon-Fibre Breaking Stress (50 mm Gauge)

11.4. Dataset 4: Carbon-Fibre Breaking Stress (20 mm Gauge)

11.5. Summary of Results Across Datasets

12. Random Sampling via Inverse Transform and Newton–Raphson

13. Monte Carlo Experiment

14. Conclusions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A. Second-Order Derivatives of the Log-Likelihood

Appendix B. Simulation Study

References

Article Metrics

Citations

Article Access Statistics