A Lower-Bounded Extreme Value Distribution for Flood Frequency Analysis with Applications

Almuhayfith, Fatimah E.; Kachour, Maher; Daghestani, Amira F.; Rehman, Zahid Ur; Hussain, Tassaddaq; Bakouch, Hassan S.

doi:10.3390/math13213378


by Fatimah E. Almuhayfith ^1,*, Maher Kachour ^2,3, Amira F. Daghestani ^4, Zahid Ur Rehman ^5, Tassaddaq Hussain ^5 and Hassan S. Bakouch ^6,*

^1 Department of Mathematics and Statistics, College of Science, King Faisal University, Alahsa 31982, Saudi Arabia

^2 Department of Mathematics and Natural Sciences, Gulf University for Science and Technology, P.O. Box 7207, Hawally 32093, Kuwait

^3 Center of Applied Mathematics and Bioinformatics (CAMB), Gulf University for Science and Technology, P.O. Box 7207, Hawally 32093, Kuwait

^4 Department of Mathematics, College of Science and Humanities, Imam Abdulrahman Bin Faisal University, Jubail 35811, Saudi Arabia

^5 Department of Mathematics, Mirpur University of Science and Technology (MUST), Mirpur 10250, Pakistan

^6 Department of Mathematics, College of Science, Qassim University, Buraydah 51452, Saudi Arabia

^* Authors to whom correspondence should be addressed.

Mathematics 2025, 13(21), 3378; https://doi.org/10.3390/math13213378

Submission received: 28 August 2025 / Revised: 19 September 2025 / Accepted: 15 October 2025 / Published: 23 October 2025

(This article belongs to the Special Issue Reliability Estimation and Mathematical Statistics)


Abstract

This paper proposes the lower-bounded Fréchet–log-logistic distribution (LFLD), a probability model designed for robust flood frequency analysis (FFA). The LFLD addresses key limitations of traditional distributions (e.g., generalized extreme value (GEV) and log-Pearson Type III (LP3)) by combining bounded support ($α < x < \infty$) to reflect physical flood thresholds, flexible tail behavior via Fréchet–log-logistic fusion for extreme-value accuracy, and maximum entropy characterization, ensuring optimal parameter estimation. Thus, we obtain the LFLD’s main statistical properties (PDF, CDF, and hazard rate), prove its asymptotic convergence to Fréchet distributions, and validate its superiority through simulation studies showing MLE consistency (bias < 0.02 and mean squared error < 0.0004 for $α$) and empirical flood data tests (52- and 98-year AMS series), where the LFLD outperforms 10 competitors (AIC reductions of 15–40%; Vuong test p < 0.01). The LFLD’s closed-form quantile function enables efficient return period estimation, critical for infrastructure planning. Results demonstrate its applicability to heavy-tailed, bounded hydrological data, offering a 20–30% improvement in flood magnitude prediction over LP3/GEV models.

Keywords: flood frequency analysis; bounded distributions; Fréchet distribution; log-logistic model; entropy; extreme value theory; return period estimation

MSC: 60E05; 62E15; 62F10

1. Introduction

The increasing frequency and severity of climate-induced flooding pose unprecedented risks to global infrastructure, economies, and human safety. The Intergovernmental Panel on Climate Change (IPCC) Sixth Assessment Report confirms that rising temperatures amplify rainfall variability, glacier melt, and storm surges, directly intensifying flood hazards (see [1,2,3]). The World Health Organization (WHO) estimates that 80–90% of natural disaster losses stem from floods, droughts, and cyclones, with floods alone displacing millions annually [4,5,6]. Economically, climate-related disasters could reduce global GDP by 18% by 2050, as exemplified by Pakistan’s 2022 floods (USD 18 billion losses and 33 million displaced) [7,8].

Flood frequency analysis is the primary tool for quantifying these risks, relying on probability distributions to model extreme events. Traditional models like generalized extreme value (GEV), Log-Pearson Type III (LP3), and Log-Normal (LN2) are entrenched in regional practices (e.g., GEV in the UK and LP3 in the USA and Australia) yet suffer from critical limitations [9,10,11]:

GEV and LP3 often fail to capture heavy-tailed or multimodal flood data [12].
Unbounded assumptions: Most distributions ignore natural lower bounds (e.g., minimum discharge thresholds), leading to unrealistic predictions [13].
Entropy inefficiency: Conventional models violate the maximum entropy principle, overfitting to sparse data [14,15].

To overcome these challenges, we propose the lower-bounded Fréchet–log-logistic distribution, a probability model merging the tail robustness of the Fréchet distribution [16] with the bounded adaptability of the log-logistic framework [17,18]. The LFLD advances flood frequency analysis through the following:

Bounded support: It explicitly incorporates a lower threshold ( $α$ ), aligning with physical flood constraints [19].
Entropy-optimal design: Parameters are derived via Shannon entropy maximization, ensuring robustness against skewness and outliers [20].
Analytical tractability: Closed-form quantile functions (Section 2.6) enable efficient return period estimation, outperforming nested models in Vuong tests [21,22].

This paper bridges statistical theory and hydrological practice by the following means:

Theoretical innovation: the LFLD’s modes, hazard rates, and asymptotic behavior are characterized (Section 2, Section 3 and Section 4).
Rigorous validation: Simulation studies (Section 7) confirm MLE consistency, while empirical tests on flood data demonstrate superiority over GEV, LP3, and log-logistic models [23,24,25].

This paper is structured as follows: Section 2 derives the LFLD’s properties, Section 3 and Section 4 analyze moments and entropy, Section 5 extends the results to exponential-type families, Section 6 details estimation, Section 7 reports a simulation study, and the LFLD is finally applied to global flood data sets.

2. Model Derivation and Mathematical Properties

This section introduces the mathematical formulation of the lower-bounded Fréchet–log-logistic distribution, a distribution designed to model bounded hydrological extremes such as flood data. We derive its cumulative distribution function (CDF), probability density function (PDF), hazard rate function (HRF) and quantile function, and analyze its asymptotic properties.

2.1. Construction of the Distribution and CDF

To construct a bounded distribution capable of capturing heavy tails, we fuse the log-logistic transformation with a Fréchet baseline distribution. This allows flexibility in shape and tail weight, while preserving physical constraints in hydrological processes, such as a nonzero lower bound on flood magnitudes.

Let the baseline CDF be the two-parameter log-logistic distribution

D (x; α, β) = \frac{{(x / α)}^{β}}{1 + {(x / α)}^{β}}, x > α > 0, β > 0,

with the corresponding log-odds transformation (also called an odd CDF link function)

Q (x; α, β) = log (\frac{D (x)}{1 - D (x)}) = β log (\frac{x}{α}) .

We embed this transformation within the Fréchet distribution’s CDF to obtain

F_{LFLD} (x; α, β, θ) = exp (- {[β log (\frac{x}{α})]}^{- θ}), x > α, θ > 0 .

(1)

This construction provides

A lower-bounded support ( $x > α$ );
Heavy-tailed behavior through the Fréchet structure;
Parametric control over skewness and scale via $β$ and $θ$ .

Relation to Existing Distributions

Now, it is important to clarify the connection between the proposed lower-bounded Fréchet–log-logistic distribution (LFLD) and the extended log-inverse Weibull distribution (ELIWD) introduced by [26,27]. The ELIWD distribution function is given by

F_{ELIWD} (z) = exp (- {[\frac{ln z - a}{b}]}^{- c}), z \geq e^{a}, b > 0, c > 0 .

Based on the above results, one can deduce that by setting

α = e^{a}, β = \frac{1}{b}, θ = c,

we recover

{[β ln (x / α)]}^{- θ} = {[\frac{ln (x / e^{a})}{b}]}^{- c} = {[\frac{ln x - a}{b}]}^{- c},

so that $F_{LFLD} (x) \equiv F_{ELIWD} (x)$. This shows that the LFLD is mathematically equivalent to the ELIWD under a reparameterization. Thus, both share the same functional form. However, they differ in their derivation: the LFLD is obtained using a log-odds link function, while the ELIWD arises from a CDF transformation.
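The reparameterization can also be checked numerically. The following is a minimal sketch in Python (chosen here for illustration only; the function names and parameter values are hypothetical and not taken from the paper) that evaluates both CDFs at a few points under $α = e^{a}$, $β = 1 / b$, $θ = c$.

```python
import math

def lfld_cdf(x, alpha, beta, theta):
    """LFLD CDF, Equation (1): exp(-[beta*log(x/alpha)]^(-theta)) for x > alpha."""
    if x <= alpha:
        return 0.0
    u = beta * math.log(x / alpha)
    return math.exp(-u ** (-theta))

def eliwd_cdf(z, a, b, c):
    """ELIWD CDF: exp(-[(log z - a)/b]^(-c)) for z > e^a."""
    if z <= math.exp(a):
        return 0.0
    return math.exp(-((math.log(z) - a) / b) ** (-c))

# Illustrative parameter values (assumed, not from the paper).
alpha, beta, theta = 1.5, 2.0, 3.0
a, b, c = math.log(alpha), 1.0 / beta, theta  # reparameterization from the text

# Largest absolute difference between the two CDFs over a few test points.
max_gap = max(
    abs(lfld_cdf(x, alpha, beta, theta) - eliwd_cdf(x, a, b, c))
    for x in (1.6, 2.0, 5.0, 50.0)
)
print(max_gap)
```

Any remaining gap is floating-point rounding, since the two expressions are algebraically identical.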

Contribution of the present formulation. While the functional form is equivalent, the parametrization adopted here offers several advantages for both theory and application:

Hydrological relevance: The parameter $α$ serves as a physical lower bound for streamflow, which is directly interpretable in practice. Furthermore, the log-logistic distribution—already recommended in several hydrological guidelines (e.g., in the UK) for modeling flood flows due to its flexibility—is embedded within the current construction.
Extreme-value foundation: The model incorporates the Fréchet distribution, a classical law for block maxima and the limiting distribution for a wide class of parent processes. By integrating the log-logistic and Fréchet distributions within a log-odds framework, the LFLD retains the boundedness and practical applicability of the former while capturing the heavy-tail properties of the latter.
Theoretical development: We provide formal proofs of asymptotic convergence to the two- and three-parameter Fréchet distributions (see Section 2.5), together with detailed derivations of the quantile function (see Section 2.6) and rigorous results on moment existence and entropy (see Section 3 and Section 4).
Applied focus: In contrast to [26,27], whose emphasis was primarily on biomedical applications, this study establishes the LFLD as a valid extreme-value model for hydrological extremes. Simulation experiments (see Section 7) and empirical case studies demonstrate its ability to deliver reliable return level estimates in flood frequency analysis.

In summary, the present work reframes the ELIWD family within a hydrological context, showing that the proposed parametrization is not only algebraically equivalent to the ELIWD but also purpose-built for FFA, supported by novel asymptotic theory, moment characterization, and empirical validation.

2.2. Probability Density Function (PDF)

Differentiating the CDF (1) yields the PDF:

f_{LFLD} (x; α, β, θ) = \frac{β θ}{x} {[β log (\frac{x}{α})]}^{- θ - 1} exp (- {[β log (\frac{x}{α})]}^{- θ}), x > α .

(2)

This function is positive and integrable over $x > α$, confirming that it is a valid probability density. The parameters are interpreted as follows:

$α$ : Scale and lower bound;
$β$ : Controls steepness and mode location;
$θ$ : Governs tail heaviness.
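As a quick sanity check that the density (2) is the derivative of the CDF (1), the sketch below compares the closed-form PDF with a central finite difference of the CDF. It is written in Python for illustration; the function names, the evaluation point, and the parameter values are assumptions made for this example.

```python
import math

def lfld_cdf(x, alpha, beta, theta):
    """LFLD CDF, Equation (1); zero at or below the lower bound alpha."""
    if x <= alpha:
        return 0.0
    u = beta * math.log(x / alpha)
    return math.exp(-u ** (-theta))

def lfld_pdf(x, alpha, beta, theta):
    """LFLD PDF, Equation (2); zero at or below the lower bound alpha."""
    if x <= alpha:
        return 0.0
    u = beta * math.log(x / alpha)
    return (beta * theta / x) * u ** (-theta - 1.0) * math.exp(-u ** (-theta))

# Illustrative values (assumed, not from the paper).
alpha, beta, theta = 1.0, 2.0, 3.0
x, h = 2.0, 1e-6

# Central finite difference of the CDF approximates the density at x.
finite_diff = (lfld_cdf(x + h, alpha, beta, theta)
               - lfld_cdf(x - h, alpha, beta, theta)) / (2.0 * h)
exact = lfld_pdf(x, alpha, beta, theta)
rel_err = abs(finite_diff - exact) / exact
print(exact, finite_diff, rel_err)
```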

2.3. Shape Analysis and Mode

The shape of the PDF is governed by the interaction between $β$ and $θ$. To determine the mode, we differentiate the log-density and set the derivative to zero:

\frac{d}{d x} log f (x) = - \frac{1}{x} - \frac{1 + θ}{x log (x / α)} + \frac{β θ}{x {[β log (x / α)]}^{1 + θ}} = 0 .

(3)

This equation must be solved numerically for given parameters. Empirical behavior (see Figure 1) shows the following:

$θ < 1, β < 1$ : Reverse-J shape;
moderate $θ$ : Right-skewed unimodal;
large $θ$ : Sharper peak and heavier tail.
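One way to solve the mode condition numerically is sketched below. Differentiating the logarithm of the density (2) and multiplying through by $x L^{θ + 1}$, with $L = log (x / α)$, gives $L^{θ + 1} + (θ + 1) L^{θ} = θ β^{- θ}$, whose left-hand side is increasing in $L$, so bisection suffices. The Python code, function names, and parameter values are illustrative assumptions, not the authors' implementation.

```python
import math

def lfld_log_pdf(x, alpha, beta, theta):
    """Logarithm of the LFLD density, Equation (2), for x > alpha."""
    u = beta * math.log(x / alpha)
    return math.log(beta * theta / x) - (theta + 1.0) * math.log(u) - u ** (-theta)

def lfld_mode(alpha, beta, theta, tol=1e-14):
    """Mode via bisection on L = log(x/alpha), solving
    L^(theta+1) + (theta+1) * L^theta = theta * beta^(-theta)."""
    target = theta * beta ** (-theta)

    def g(L):
        return L ** (theta + 1.0) + (theta + 1.0) * L ** theta - target

    lo, hi = 0.0, 1.0
    while g(hi) < 0.0:          # expand the bracket until it contains the root
        hi *= 2.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if g(mid) < 0.0:
            lo = mid
        else:
            hi = mid
        if hi - lo < tol:
            break
    return alpha * math.exp(0.5 * (lo + hi))

# Illustrative values (assumed, not from the paper).
alpha, beta, theta = 1.0, 2.0, 3.0
mode = lfld_mode(alpha, beta, theta)
print(mode)
```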

2.4. Survival and Hazard Rate Function

The survival function is

S (x) = 1 - F_{LFLD} (x) = 1 - exp (- {[β log (\frac{x}{α})]}^{- θ}),

(4)

and the hazard rate function (HRF) is

h (x) = \frac{f (x)}{S (x)} = \frac{β θ {[β log (x / α)]}^{- θ - 1}}{x (exp ({[β log (x / α)]}^{- θ}) - 1)} .

(5)

This HRF may be increasing, decreasing, or unimodal depending on $β$ and $θ$ (see Figure 2), which makes the LFLD adaptable to various risk and failure patterns.

2.5. Asymptotic Behavior

Let $u = β ln (x / α)$. Then

F (x) = exp (- u^{- θ}), f (x) = \frac{β θ}{x} u^{- (θ + 1)} e^{- u^{- θ}} .

(i): Case $x \to \infty$ ( $u \to \infty$ ).

Since $e^{- u^{- θ}} = 1 - u^{- θ} + O (u^{- 2 θ})$ as $u \to \infty$, we obtain

\begin{matrix} S (x) & = 1 - F (x) = u^{- θ} + O (u^{- 2 θ}), \\ f (x) & = \frac{β θ}{x} u^{- (θ + 1)} (1 + O (u^{- θ})) . \end{matrix}

Equivalently,

S (x) \sim {[β ln (x / α)]}^{- θ}, f (x) \sim \frac{β θ}{x} {[β ln (x / α)]}^{- (θ + 1)} .

The hazard function then satisfies

h (x) = \frac{f (x)}{S (x)} \sim \frac{θ}{x ln (x / α)} (x \to \infty) .

Hence the LFLD is Fréchet on the log-scale, producing a log–power tail in x that decays more slowly than any algebraic power.
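The leading terms in case (i) can be illustrated numerically. The sketch below, in Python with assumed parameter values and hypothetical function names, evaluates the survival function (4) and hazard rate (5) at increasingly large $x$ and reports their ratios to the asymptotic expressions; both ratios should approach 1.

```python
import math

def lfld_survival(x, alpha, beta, theta):
    """Survival function, Equation (4), written with expm1 for accuracy at large x."""
    u = beta * math.log(x / alpha)
    return -math.expm1(-u ** (-theta))

def lfld_hazard(x, alpha, beta, theta):
    """Hazard rate, Equation (5). Intended for moderate to large x:
    very close to alpha the term exp(u^(-theta)) overflows."""
    u = beta * math.log(x / alpha)
    return (beta * theta / x) * u ** (-theta - 1.0) / math.expm1(u ** (-theta))

# Illustrative values (assumed, not from the paper).
alpha, beta, theta = 1.0, 2.0, 3.0

ratios = {}
for x in (1e3, 1e6, 1e12):
    u = beta * math.log(x / alpha)
    s_ratio = lfld_survival(x, alpha, beta, theta) / u ** (-theta)
    h_ratio = lfld_hazard(x, alpha, beta, theta) * x * math.log(x / alpha) / theta
    ratios[x] = (s_ratio, h_ratio)
    print(x, s_ratio, h_ratio)
```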

(ii): Case $x ↓ α$ ( $u ↓ 0^{+}$ ).

Let $y = x - α > 0$. Using

ln (1 + \frac{y}{α}) = \frac{y}{α} + O ({(\frac{y}{α})}^{2}),

we obtain $u \sim β y / α$ as $y ↓ 0$. Therefore

f (α + y) \sim \frac{θ}{σ} {(\frac{y}{σ})}^{- (θ + 1)} exp [- {(\frac{y}{σ})}^{- θ}], σ = \frac{α}{β} .

This is precisely the density of a Fréchet distribution with shape $θ$, scale $σ$, and location $α$. Consequently, near the lower bound $α$ the LFLD behaves locally as a three-parameter Fréchet distribution shifted to $α$.

Therefore, as $x \to \infty$, the survival function decays as $S (x) \sim {[β ln (x / α)]}^{- θ}$ and the hazard satisfies $h (x) \sim θ / [x ln (x / α)] \to 0$. As $x ↓ α$, the distribution reduces locally to a shifted Fréchet law in $y = x - α$. These asymptotics justify the tail behavior and provide explicit leading terms useful for numerical approximation of return levels. These properties confirm that the LFLD is consistent with max-stable limits and extreme value convergence, as formally developed in [16,28].

2.6. Quantile Function

The cumulative distribution function (CDF) of the LFLD is

F (x) = exp (- {[β ln (x / α)]}^{- θ}), x > α .

To derive the quantile function $Q (p) = F^{- 1} (p)$ for $0 < p < 1$, we proceed step by step. Let $p = F (x)$. Then, we have

p = exp (- {[β ln (x / α)]}^{- θ}) .

Now, applying the natural logarithm gives

ln p = - {[β ln (x / α)]}^{- θ} .

Moreover, by multiplying both sides by $- 1$ and inverting the exponent, we obtain

{[β ln (x / α)]}^{- θ} = - ln p \Rightarrow β ln (x / α) = {(- ln p)}^{- 1 / θ} .

Solving for $ln (x / α)$ gives

ln (x / α) = \frac{1}{β} {(- ln p)}^{- 1 / θ} .

Exponentiating yields

x = α exp \{\frac{1}{β} {(- ln p)}^{- 1 / θ}\} .

Therefore, the quantile function is

Q (p) = α exp \{\frac{1}{β} {(- ln p)}^{- 1 / θ}\}, 0 < p < 1 .

Properties.

Monotonicity: $Q (p)$ is strictly increasing in p.
Lower quantiles: As $p ↓ 0$ , we have $Q (p) ↓ α$ , showing convergence to the lower bound.
Upper quantiles: As $p ↑ 1$ , $Q (p) \to \infty$ , reflecting the heavy-tailed behavior.
Connection to Fréchet: The form of $Q (p)$ shows that it is an exponential transform of the Fréchet quantile in the latent variable $u = β ln (x / α)$ , consistent with the asymptotics in Section 2.5.

This closed-form quantile expression facilitates the direct computation of

Return levels for flood recurrence intervals;
Thresholds for risk evaluation;
Hydrological design parameters.
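A minimal sketch of how the closed-form quantile can be used for return levels is given below. It assumes the common convention for annual maximum series that the T-year return level is $Q (1 - 1 / T)$; the Python function names and the parameter values are illustrative assumptions and do not come from the fitted models in the paper.

```python
import math

def lfld_cdf(x, alpha, beta, theta):
    """LFLD CDF, Equation (1)."""
    if x <= alpha:
        return 0.0
    u = beta * math.log(x / alpha)
    return math.exp(-u ** (-theta))

def lfld_quantile(p, alpha, beta, theta):
    """Closed-form quantile Q(p) = alpha * exp{(1/beta) * (-ln p)^(-1/theta)}."""
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    return alpha * math.exp((-math.log(p)) ** (-1.0 / theta) / beta)

def return_level(T, alpha, beta, theta):
    """Level exceeded on average once every T years (assumed convention: Q(1 - 1/T))."""
    return lfld_quantile(1.0 - 1.0 / T, alpha, beta, theta)

# Illustrative values (assumed): lower bound 100 in discharge units.
alpha, beta, theta = 100.0, 2.0, 3.0

levels = [return_level(T, alpha, beta, theta) for T in (10, 50, 100)]
round_trip = max(
    abs(lfld_cdf(lfld_quantile(p, alpha, beta, theta), alpha, beta, theta) - p)
    for p in (0.1, 0.5, 0.9, 0.99)
)
print(levels, round_trip)
```

The round-trip check $F (Q (p)) = p$ confirms that the quantile function inverts the CDF up to rounding error.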

3. Derivation of Moments via the Moment Generating Function

This section derives the first- and second-order moments of the LFLD using the moment generating function (MGF), where defined. Let $X \sim LFLD (α, β, θ)$, with PDF given in Equation (2).

3.1. Moment Generating Function

The moment generating function of X, if it exists, is defined by

M_{X} (t) = E [e^{t X}] = \int_{α}^{\infty} e^{t x} f_{LFLD} (x; α, β, θ) d x .

(6)

Substituting the PDF of the LFLD (Equation (2)) into (6), we obtain

\begin{matrix} M_{X} (t) & = \int_{α}^{\infty} e^{t x} \frac{β θ}{x} {[β log (\frac{x}{α})]}^{- θ - 1} exp (- {[β log (\frac{x}{α})]}^{- θ}) d x . \end{matrix}

(7)

This integral does not admit a closed-form solution for general t, but it can be approximated numerically where it converges. Because the survival function decays only like a power of $ln x$ (Section 2.5) while $e^{t x}$ grows exponentially, the integral diverges for every $t > 0$; the MGF is therefore finite only for $t \leq 0$. This aligns with the classical result for the Fréchet distribution and similar heavy-tailed laws [16].

3.2. Derivation of Raw Moments

Alternatively, raw moments $μ_{r}^{'} = E [X^{r}]$ can be computed directly without using the MGF:

μ_{r}^{'} = \int_{α}^{\infty} x^{r} f_{LFLD} (x) d x .

(8)

To simplify, use the change of variable

u = β log (\frac{x}{α}) \Rightarrow x = α \cdot exp (\frac{u}{β}), d u = \frac{β}{x} d x,

under which $f_{LFLD} (x) d x = θ u^{- θ - 1} exp (- u^{- θ}) d u$. Substituting into (8) yields

μ_{r}^{'} = \int_{0}^{\infty} {[α \cdot exp (\frac{u}{β})]}^{r} \cdot [θ u^{- θ - 1} exp (- u^{- θ})] d u .

This simplifies to

μ_{r}^{'} = α^{r} \int_{0}^{\infty} exp (\frac{r u}{β}) θ u^{- θ - 1} exp (- u^{- θ}) d u .

Expand the first exponential as a power series, i.e., $e^{z} = \sum_{m = 0}^{\infty} \frac{z^{m}}{m!}$ with $z = r u / β$, and integrate term by term using $\int_{0}^{\infty} u^{m} θ u^{- θ - 1} exp (- u^{- θ}) d u = Γ (1 - m / θ)$, which implies

μ_{r}^{'} = α^{r} \sum_{m = 0}^{\infty} \frac{1}{m!} {(\frac{r}{β})}^{m} Γ (1 - \frac{m}{θ}),

(9)

where the m-th term is finite only if $m < θ$.
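The Gamma-function coefficients in (9) rest on the Fréchet moment identity $\int_{0}^{\infty} u^{m} θ u^{- θ - 1} exp (- u^{- θ}) d u = Γ (1 - m / θ)$ for $m < θ$. The sketch below checks this identity numerically in Python. It uses the quantile representation $U = {(- ln P)}^{- 1 / θ}$ with $P$ uniform on (0, 1) and a plain midpoint rule; the function name, grid size, and value of $θ$ are assumptions made for this illustration.

```python
import math

def frechet_moment_quadrature(m, theta, n_cells=200_000):
    """Approximate E[U^m] = integral over (0,1) of (-ln p)^(-m/theta) dp
    with the midpoint rule (requires m < theta for a finite value)."""
    h = 1.0 / n_cells
    total = 0.0
    for i in range(n_cells):
        p = (i + 0.5) * h
        total += (-math.log(p)) ** (-m / theta)
    return total * h

theta = 8.0  # illustrative shape value (assumed)
errors = []
for m in (1, 2):
    approx = frechet_moment_quadrature(m, theta)
    exact = math.gamma(1.0 - m / theta)
    errors.append(abs(approx - exact))
    print(m, approx, exact)
```

The midpoint rule is crude near $p = 1$, where the integrand has an integrable singularity, so agreement is only expected to a few decimal places.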

3.3. Approximate First and Second Moments

Using numerical integration or simulation methods, we approximate

E [X] = μ_{1}^{'}, E [X^{2}] = μ_{2}^{'}, Var (X) = μ_{2}^{'} - {(μ_{1}^{'})}^{2} .

The skewness and kurtosis coefficients follow from

γ_{1} = \frac{E [{(X - μ_{1}^{'})}^{3}]}{Var {(X)}^{3 / 2}}, γ_{2} = \frac{E [{(X - μ_{1}^{'})}^{4}]}{Var {(X)}^{2}} .

Figure 3 shows that, depending on the parameter values, the LFLD could have a negative or positive skewness, as well as a symmetric distribution. Moreover, the distribution could have a leptokurtic, mesokurtic, or platykurtic form. However, for distributions with Fréchet-type tails, high-order moments (e.g., $r \geq 3$) may diverge unless the parameters $θ, β$ are sufficiently large. This is consistent with results in [16,28].

3.4. Interpretation and Use

Despite the lack of closed-form expressions, the structure of (9) allows efficient numerical estimation of moments. These are crucial in practice for

Fitting and validating the LFLD model;
Computing risk metrics (mean exceedances, standard deviation);
Comparing distributions using skewness or kurtosis indicators.

Remark 1.

The entropy characterization in the next section complements the moment-based analysis by offering a non-moment-based foundation, particularly useful when higher-order moments diverge.

4. Entropy of the LFLD Distribution

Entropy is a fundamental concept in information theory and probability, measuring the degree of uncertainty or randomness in a distribution. For heavy-tailed distributions like the LFLD, which may not have finite moments of all orders, entropy offers an alternative approach for assessing variability and model informativeness.

4.1. Shannon Entropy Definition

Let $X \sim LFLD (α, β, θ)$. The Shannon entropy $H (X)$ is defined as

H (X) = - \int_{α}^{\infty} f (x) log f (x) d x,

(10)

where $f (x)$ is the probability density function of the LFLD.

Substituting the expression for $f (x)$, we obtain

\begin{matrix} H (X) = - \int_{α}^{\infty} & [\frac{β θ}{x} {(β log (\frac{x}{α}))}^{- θ - 1} exp (- {[β log (\frac{x}{α})]}^{- θ})] \times log f (x) d x . \end{matrix}

(11)

Since the expression for $log f (x)$ is complex, we simplify the calculation using a change of variable:

u = β log (\frac{x}{α}), x = α exp (u / β), d x = \frac{α}{β} exp (u / β) d u .

(12)

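Since $f (x) = \frac{β}{x} g (u)$ with $x = α exp (u / β)$, we have $log f (x) = log g (u) + log (β / α) - u / β$, and substituting into the entropy integral gives

H (X) = - \int_{0}^{\infty} g (u) log g (u) d u + log (\frac{α}{β}) + \frac{1}{β} \int_{0}^{\infty} u g (u) d u,

(13)

where

g (u) = θ u^{- θ - 1} exp (- u^{- θ}) .

(14)

Note that this transformation eliminates explicit dependence on $α$ and $β$ within the entropy kernel $g (u)$, simplifying numerical evaluation: $α$ and $β$ enter only through the additive terms, the last of which equals $Γ (1 - 1 / θ) / β$ and is finite for $θ > 1$. The function $g (u)$ is the standard Fréchet density with shape $θ$, a kernel commonly encountered in entropy analyses of heavy-tailed laws.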

4.2. Numerical Evaluation

The entropy integral over the kernel $g (u)$ is well-behaved and converges for all positive values of $θ$ (see Figure 4). It can be computed numerically using adaptive quadrature or Monte Carlo methods. The resulting entropy

Increases as $θ$ decreases, indicating greater uncertainty;
Decreases as $θ$ increases, reflecting more concentrated distributions.

This behavior is consistent with known properties of entropy across the Fréchet and generalized extreme value families [29,30].
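For the kernel $g (u) = θ u^{- θ - 1} exp (- u^{- θ})$ alone, which is the standard Fréchet density with shape $θ$, the entropy integral $- \int g log g d u$ can be compared against the known value $1 + γ + γ / θ - log θ$, where $γ$ is the Euler–Mascheroni constant (this follows from $U^{- θ}$ being standard exponential). The Python sketch below is illustrative only: the function names and the midpoint-rule quadrature are assumptions, and it covers the kernel term, not the $α$- and $β$-dependent contributions to $H (X)$.

```python
import math

EULER_GAMMA = 0.5772156649015329  # Euler-Mascheroni constant

def frechet_entropy_closed_form(theta):
    """Entropy of the standard Frechet density with shape theta."""
    return 1.0 + EULER_GAMMA * (1.0 + 1.0 / theta) - math.log(theta)

def frechet_entropy_quadrature(theta, n_cells=100_000):
    """Midpoint rule for E[-log g(U)], using U = (-ln P)^(-1/theta), P ~ Uniform(0,1)."""
    h = 1.0 / n_cells
    total = 0.0
    for i in range(n_cells):
        v = -math.log((i + 0.5) * h)   # v = U^(-theta) is standard exponential
        # -log g(U) = -log(theta) + (theta + 1) * log(U) + U^(-theta),
        # with log(U) = -log(v) / theta
        total += -math.log(theta) - (theta + 1.0) / theta * math.log(v) + v
    return total * h

entropies = {}
for theta in (0.5, 1.0, 3.0, 8.0):
    numeric = frechet_entropy_quadrature(theta)
    exact = frechet_entropy_closed_form(theta)
    entropies[theta] = (numeric, exact)
    print(theta, numeric, exact)
```

The values decrease as $θ$ grows, in line with the two bullet points above.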

4.3. Interpretation and Use

Entropy serves as a global measure of dispersion and uncertainty, particularly valuable when higher-order moments (e.g., skewness or kurtosis) are undefined or unstable. In the context of flood modeling, higher entropy indicates greater unpredictability in extreme events, while lower entropy reflects more concentrated flood risks.

Entropy is thus useful for:

Comparing the LFLD with competing models across regions or time periods;
Quantifying the impact of changing parameters (e.g., due to climate shifts);
Developing robust risk scores for hydrological extremes.

Remark 2.

When traditional moments diverge, entropy provides an information-theoretic alternative for comparing distributions. It measures uncertainty directly from the probability density function, often remaining finite and informative even when moment-based characteristics fail.

5. Extension to Exponential-Type Families

The results obtained for the LFLD can be embedded into a broader framework by considering general exponential-type families. Recall that a random variable X belongs to the exponential family if its density can be expressed in the canonical form

f (x; η) = h (x) exp (η^{⊤} T (x) - A (η)), x \in X,

where $T (x)$ is a vector of sufficient statistics, $η$ the natural parameter vector, $A (η)$ the log-partition function, and $h (x)$ a base measure (see, e.g., [31]).

5.1. LFLD as a Special Case

The lower-bounded Fréchet–log-logistic distribution also admits an exponential-type representation after a suitable transformation. Indeed, with the change of variable $u = β ln (x / α)$, the LFLD density can be written as

f (x) = \frac{β}{x} h (u) exp (- u^{- θ}),

where $h (u) = θ u^{- (θ + 1)}$. This shows that the LFLD belongs to the class of exponential-type models with natural statistic $T (u) = - u^{- θ}$, natural parameter $η = 1$, and a base measure proportional to $u^{- (θ + 1)}$. Hence, the LFLD may be viewed as a special case of exponential family formulations after an appropriate reparameterization.

5.2. Moment Existence

For exponential families, the existence of moments is determined by the convexity properties of the cumulant-generating function, which is the log-partition function $A (η)$. In the derivations for the LFLD (Section 3), the moment existence condition $m < θ$ arises naturally from the asymptotic behavior of the transformed variable. This parallels the general principle that moments of order m exist if and only if $η$ lies within the interior of the natural parameter space, ensuring $A (η) < \infty$. Thus, the methodology used to analyze the LFLD can be transferred to other exponential-type families by inspecting their corresponding log-partition functions.

5.3. Asymptotic Behavior

In Section 2.5 we derived detailed asymptotic expansions for the LFLD by considering the behavior of the transformed variable $u = β ln (x / α)$. A similar approach can be applied more generally: for any exponential family distribution, asymptotic behavior is determined by the rate of growth of $T (x)$ relative to $A (η)$. In this sense, the Fréchet-type convergence results established illustrate how the limiting distribution of maxima in the LFLD framework is consistent with the broader asymptotic theory of exponential families.

5.4. Quantile-Based Inference

An additional strength of the LFLD lies in its closed-form quantile function (Section 2.6). While most exponential family distributions do not have quantile functions available in closed form, the preceding derivation illustrates how algebraic transformations of exponential family CDFs may yield tractable quantile representations. This opens the possibility of extending quantile-based methods for return level estimation and risk assessment to a wider set of exponential models.

Thus, by embedding the LFLD within the exponential family framework and recognizing it as a special case under a suitable transformation, we establish that the techniques developed in this paper—moment existence analysis, asymptotic expansions, and quantile-based inference—are transferable beyond the specific form of the LFLD. This positions the current work as not only application-oriented to flood frequency analysis, but also methodologically relevant to the broader theory of exponential families.

6. Parameter Estimation

We estimate the parameters $α$, $β$, and $θ$ of the LFLD using the method of maximum likelihood estimation (MLE) based on a sample $X_{1}, X_{2}, \dots, X_{n}$ drawn independently from the distribution.

6.1. Likelihood and Log-Likelihood Functions

Let $x_{1}, \dots, x_{n}$ be a realization of a random sample of size n from the LFLD. The likelihood function is given by

L (α, β, θ) = \prod_{i = 1}^{n} f (x_{i}; α, β, θ),

(15)

where

f (x_{i}; α, β, θ) = \frac{β θ}{x_{i}} {[β log (\frac{x_{i}}{α})]}^{- θ - 1} exp (- {[β log (\frac{x_{i}}{α})]}^{- θ}), x_{i} > α .

Taking logarithms, the log-likelihood function becomes

\begin{matrix} ℓ (α, β, θ) & = n log β + n log θ - \sum_{i = 1}^{n} log x_{i} \\ - (θ + 1) \sum_{i = 1}^{n} log (β log (\frac{x_{i}}{α})) - \sum_{i = 1}^{n} {[β log (\frac{x_{i}}{α})]}^{- θ} . \end{matrix}

(16)

6.2. Score Functions: First Derivatives

Let $u_{i} = log (\frac{x_{i}}{α})$. The first-order partial derivatives of the log-likelihood (score functions) are as follows.

(i): With respect to $α$ :

$\frac{\partial ℓ}{\partial α} = \frac{1}{α} [(θ + 1) \sum_{i = 1}^{n} \frac{1}{log (x_{i} / α)} - θ β^{- θ} \sum_{i = 1}^{n} {(log (x_{i} / α))}^{- θ - 1}] .$

(17)
(ii): With respect to $β$ :

$\frac{\partial ℓ}{\partial β} = - \frac{n θ}{β} + θ β^{- θ - 1} \sum_{i = 1}^{n} {(log (x_{i} / α))}^{- θ} .$

(18)
(iii): With respect to $θ$ :

$\frac{\partial ℓ}{\partial θ} = \frac{n}{θ} - \sum_{i = 1}^{n} log (β log (x_{i} / α)) + \sum_{i = 1}^{n} {[β log (x_{i} / α)]}^{- θ} log (β log (x_{i} / α)) .$

(19)

These equations define the system to be solved numerically in order to obtain the MLEs.
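The score expressions (17)–(19) can be checked against central finite differences of the log-likelihood (16), as in the Python sketch below. The toy data set, parameter values, and function names are hypothetical and serve only as an illustration. The last helper uses a consequence of (18): for fixed $α$ and $θ$, setting the $β$-score to zero gives $β^{θ} = \frac{1}{n} \sum_{i} {(log (x_{i} / α))}^{- θ}$.

```python
import math

def lfld_loglik(data, alpha, beta, theta):
    """Log-likelihood, Equation (16); requires alpha < min(data)."""
    n = len(data)
    logs = [math.log(x / alpha) for x in data]
    return (n * math.log(beta) + n * math.log(theta)
            - sum(math.log(x) for x in data)
            - (theta + 1.0) * sum(math.log(beta * L) for L in logs)
            - sum((beta * L) ** (-theta) for L in logs))

def lfld_score(data, alpha, beta, theta):
    """Score functions, Equations (17)-(19), in the order (alpha, beta, theta)."""
    n = len(data)
    logs = [math.log(x / alpha) for x in data]
    d_alpha = ((theta + 1.0) * sum(1.0 / L for L in logs)
               - theta * beta ** (-theta) * sum(L ** (-theta - 1.0) for L in logs)) / alpha
    d_beta = (-n * theta / beta
              + theta * beta ** (-theta - 1.0) * sum(L ** (-theta) for L in logs))
    d_theta = (n / theta
               - sum(math.log(beta * L) for L in logs)
               + sum((beta * L) ** (-theta) * math.log(beta * L) for L in logs))
    return [d_alpha, d_beta, d_theta]

def numerical_score(data, params, h=1e-6):
    """Central finite differences of the log-likelihood in each parameter."""
    grads = []
    for k in range(3):
        up, down = list(params), list(params)
        up[k] += h
        down[k] -= h
        grads.append((lfld_loglik(data, *up) - lfld_loglik(data, *down)) / (2.0 * h))
    return grads

def beta_given_alpha_theta(data, alpha, theta):
    """Root of Equation (18) in beta when alpha and theta are held fixed."""
    logs = [math.log(x / alpha) for x in data]
    return (sum(L ** (-theta) for L in logs) / len(logs)) ** (1.0 / theta)

# Hypothetical toy data and parameter values, for illustration only.
data = [1.9, 2.4, 2.7, 3.1, 3.8, 4.6, 6.0, 9.5]
params = (1.0, 1.5, 2.0)  # (alpha, beta, theta)

analytic = lfld_score(data, *params)
numeric = numerical_score(data, params)
max_diff = max(abs(a - b) for a, b in zip(analytic, numeric))

beta_hat = beta_given_alpha_theta(data, params[0], params[2])
score_at_beta_hat = lfld_score(data, params[0], beta_hat, params[2])[1]
print(max_diff, beta_hat, score_at_beta_hat)
```

Holding $α$ and $θ$ fixed is a simplification for this check; the full MLE still requires joint numerical optimization over all three parameters.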

6.3. Numerical Optimization and Implementation

Given the nonlinearity of the likelihood function, closed-form solutions are not available. We maximize the log-likelihood numerically using iterative procedures such as BFGS or Nelder–Mead. Important considerations include

Appropriate initial values for all parameters;
Constraining $α, β, θ > 0$ using bounded optimization;
Monitoring convergence via log-likelihood trace plots and gradient norms.

We implement the estimation in R (≥2.4.0) using the maxLik package, which supports numerical gradients and Hessian approximations. This implementation is consistent with modern practices in computational statistics [32].

6.4. Asymptotic Properties and Inference

The asymptotic behavior of maximum likelihood estimators (MLEs) is fundamental to statistical inference, providing the basis for constructing confidence intervals and hypothesis tests. For the LFLD distribution, we establish the consistency and asymptotic normality of the MLEs. The following theorems show that, under standard regularity conditions [33,34], the MLEs are

Consistent as $n \to \infty$ ;
Asymptotically normal, with variance approximated by the inverse Fisher information;
Asymptotically efficient, attaining the Cramér–Rao bound.

Standard errors and $95 \%$ confidence intervals are computed from the inverse of the observed information matrix (the negative Hessian of the log-likelihood evaluated at the MLE).

Remark 3.

The LFLD model is identifiable under the assumption that $α \leq \min \{x_{1}, \dots, x_{n}\}$. This is a standard and necessary condition for distributions with threshold parameters.

Theorem 1.

The MLEs $\hat{α}$, $\hat{β}$, and $\hat{θ}$ for the parameters α, β, and θ of the LFLD are consistent. That is, as the sample size $n \to \infty$, $(\hat{α}, \hat{β}, \hat{θ}) \to_{p} (α, β, θ)$, where $\to_{p}$ denotes convergence in probability.

Proof.

The proof follows from the general theory of consistency for MLEs in parametric models, assuming the LFLD satisfies the necessary regularity conditions (e.g., the parameter space is compact, the model is identifiable (see above), and the log-likelihood is continuous and differentiable with respect to the parameters). Consider the average log-likelihood

l_{n} (ϕ) = \frac{1}{n} ℓ (ϕ),

where $ϕ = {(α, β, θ)}^{T}$.

Using the law of large numbers, since the $X_{i}$ are i.i.d., we obtain

l_{n} (ϕ) \overset{p}{\to} E [log f (X; ϕ)],

where the expectation is taken under the true parameter $ϕ_{0} = {(α, β, θ)}^{T}$. The true parameter $ϕ_{0}$ is identified as the unique value that maximizes the expected log-likelihood function. This property stems from the fact that the expected log-likelihood attains its supremum precisely when the assumed model distribution coincides with the true data-generating distribution. Any departure from $ϕ_{0}$ leads to a strict decrease in the expected log-likelihood, a consequence directly implied by Jensen’s inequality through the non-negativity of the Kullback–Leibler divergence, which vanishes exclusively when the two distributions are identical. Thus, the maximizer ${\hat{ϕ}}_{n}$ of $l_{n} (ϕ)$ converges in probability to the maximizer $ϕ_{0}$ of the limit (see, e.g., Theorem 5.7 in [35] for a rigorous statement under Wald’s conditions). □

Note: For the threshold parameter $α$, additional care is needed because the support $x > α$ depends on $α$. However, assuming $α$ is in the interior of the possible values and the density approaches infinity as $x \to α^{+}$, the consistency still holds, though the rate may be faster than $O_{p} (n^{- 1 / 2})$ for $\hat{α}$.

Theorem 2.

Under further regularity conditions (the log-likelihood is twice continuously differentiable, the Fisher information matrix is positive definite at the true parameter, and differentiation under the integral sign is permitted), the MLEs are asymptotically normal:

\sqrt{n} (\hat{ϕ} - ϕ) \overset{d}{\to} N (0, I {(ϕ)}^{- 1}),

where $\to_{d}$ denotes convergence in distribution and $I (ϕ)$ is the Fisher information matrix per observation, with elements

I_{j k} (ϕ) = E [- \frac{\partial^{2} log f (X; ϕ)}{\partial ϕ_{j} \partial ϕ_{k}}] = E [\frac{\partial log f (X; ϕ)}{\partial ϕ_{j}} \frac{\partial log f (X; ϕ)}{\partial ϕ_{k}}] .

Proof.

Taylor expansion of the score function. The score vector is

S_{n} (ϕ) = \frac{\partial ℓ (ϕ)}{\partial ϕ} .

At the MLE, $S_{n} (\hat{ϕ}) = 0$.

Expand around the true ϕ:

S_{n} (\hat{ϕ}) = S_{n} (ϕ) + H_{n} (\tilde{ϕ}) (\hat{ϕ} - ϕ) = 0,

where $H_{n} (\tilde{ϕ}) = \frac{\partial^{2} ℓ (\tilde{ϕ})}{\partial ϕ \partial ϕ^{T}}$ is the Hessian at some intermediate point $\tilde{ϕ}$ between $\hat{ϕ}$ and ϕ.

Using the law of large numbers,

\frac{1}{n} H_{n} (ϕ) \overset{p}{\to} - I (ϕ) .

Since $\hat{ϕ} \to_{p} ϕ$, it follows that

\frac{1}{n} H_{n} (\tilde{ϕ}) \overset{p}{\to} - I (ϕ) .

The score function satisfies, by the Central Limit Theorem,

\frac{1}{\sqrt{n}} S_{n} (ϕ) \overset{d}{\to} N (0, I (ϕ)),

because the scores for each observation are i.i.d. with mean zero and variance $I (ϕ)$.

Rearranging yields

\sqrt{n} (\hat{ϕ} - ϕ) = {(- \frac{1}{n} H_{n} (\tilde{ϕ}))}^{- 1} \frac{1}{\sqrt{n}} S_{n} (ϕ) \overset{d}{\to} N (0, I {(ϕ)}^{- 1}) .

To compute $I (ϕ)$, differentiate $log f (x; ϕ)$ twice with respect to $ϕ_{j}$ and $ϕ_{k}$ and then take expectations. For the LFLD, this involves integrals over the distribution, which may not have closed forms but can be evaluated numerically if needed. The observed information matrix, $- \frac{1}{n} H_{n} (\hat{ϕ})$, provides an estimate of $I (ϕ)$; its inverse, divided by n, estimates the variance of the MLE. □

Inference: For inference, approximate $100 (1 - γ) \%$ confidence intervals for $ϕ_{j}$ are given by

{\hat{ϕ}}_{j} \pm z_{γ / 2} \sqrt{\frac{{[I {(\hat{ϕ})}^{- 1}]}_{j j}}{n}},

where $z_{γ / 2}$ is the upper $γ / 2$ quantile of the standard normal distribution.

Hypothesis testing can be performed via the likelihood ratio test. For $H_{0} : ϕ \in Φ_{0}$, the test statistic satisfies, under $H_{0}$,

2 [ℓ (\hat{ϕ}) - ℓ ({\hat{ϕ}}_{0})] \overset{d}{\to} χ_{q}^{2},

where ${\hat{ϕ}}_{0}$ is the MLE restricted to $Φ_{0}$ and q is the dimension of the restriction.

Note: If the regularity conditions are violated due to the threshold α, modified inference methods (e.g., profile likelihood or bootstrap) may be required for α.

6.5. Remarks and Alternatives

If MLE fails due to flat likelihood surfaces or small sample sizes, alternatives include

L-moment estimation [36]—robust for extreme value data;
Bayesian inference—allows incorporation of expert priors;
Quantile matching—aligns empirical and theoretical percentiles.

7. Simulation Study

Simulation studies provide a fundamental tool to assess the finite-sample properties of estimators. In this section, we evaluate the performance of the MLEs of the parameters

α

,

β

, and

θ

of the LFLD through a Monte Carlo simulation.

7.1. Simulation Design

We generate 10,000 independent random samples of sizes

n = 15, 25, 50, 75, 100, 150

, each simulated from the LFLD under three parameter configurations/models:

Model-I: $α = 0.05556$, $θ = 8.52856$, and $β = 7.02178$;
Model-II: $α = 0.03456$, $θ = 2.52856$, and $β = 3.02178$;
Model-III: $α = 1.1145$, $θ = 5.52856$, and $β = 8.02178$.

These values were selected to reflect moderate skewness and tail behavior while maintaining numerical stability in estimation. Random variates were generated using the inverse transform sampling method. Specifically, for each uniform random variable U∼

U (0, 1)

, the transformation

X = α \cdot exp \{\frac{1}{β} \cdot {[- log (1 - U)]}^{- 1 / θ}\}

produces LFLD-distributed data.
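A minimal Python sketch of this sampler is given below. The function names `lfld_quantile` and `rlfld` are hypothetical, and the quantile function is the one implied by the CDF F(x) = exp{−(β ln(x/α))^(−θ)}, which corresponds to the survival function used in Section 8.2. The transformation uses 1 − U, as in the displayed formula; since 1 − U is also uniform on (0, 1), the draws follow the same distribution as F^(−1)(U).

```python
import math
import random


def lfld_quantile(p, alpha, beta, theta):
    """Quantile implied by F(x) = exp{-(beta*ln(x/alpha))^(-theta)}, 0 < p < 1."""
    return alpha * math.exp((-math.log(p)) ** (-1.0 / theta) / beta)


def rlfld(n, alpha, beta, theta, seed=None):
    """Draw n variates via X = alpha * exp{(1/beta) * [-log(1 - U)]^(-1/theta)}."""
    rng = random.Random(seed)
    sample = []
    while len(sample) < n:
        u = rng.random()          # uniform on [0, 1)
        if u == 0.0:              # -log(1 - 0) = 0 cannot be raised to a negative power
            continue
        e = -math.log1p(-u)       # equals -log(1 - u), strictly positive here
        sample.append(alpha * math.exp(e ** (-1.0 / theta) / beta))
    return sample


# Model-I configuration from the simulation design.
draws = rlfld(20000, alpha=0.05556, beta=7.02178, theta=8.52856, seed=1)
```

Because the distribution is heavy-tailed, configurations with small θ can occasionally produce extremely large draws, which a production implementation might want to guard against.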

For each simulated sample, the log-likelihood function (Equation (2)) was maximized numerically using the maxLik package in R, implementing the BFGS algorithm. The optimization included constraints

α, β, θ > 0

and convergence was monitored using gradient norms and Hessian definiteness.
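The objective passed to the optimizer can be sketched in Python as follows. This is an illustration under stated assumptions and not the authors' R code: the density is obtained here by differentiating the CDF F(x) = exp{−(β ln(x/α))^(−θ)}, the positivity constraints are handled through a log-parameterization (an assumption of this sketch), and the names `lfld_logpdf` and `neg_loglik` are hypothetical.

```python
import math


def lfld_logpdf(x, alpha, beta, theta):
    """Log-density derived from F(x) = exp{-(beta*ln(x/alpha))^(-theta)}, x > alpha."""
    if x <= alpha:
        return -math.inf
    t = beta * math.log(x / alpha)
    if t <= 0.0:
        return -math.inf
    log_v = -theta * math.log(t)   # log of (beta*ln(x/alpha))^(-theta)
    if log_v > 700.0:              # density is numerically zero this close to alpha
        return -math.inf
    return math.log(theta * beta / x) - (theta + 1.0) * math.log(t) - math.exp(log_v)


def neg_loglik(log_params, data):
    """Negative log-likelihood in (log alpha, log beta, log theta).

    Working on the log scale keeps alpha, beta, theta positive without
    explicit constraints; alpha must also stay below the sample minimum.
    """
    alpha, beta, theta = (math.exp(v) for v in log_params)
    if alpha >= min(data):
        return math.inf
    return -sum(lfld_logpdf(x, alpha, beta, theta) for x in data)


# Evaluate at the Data Set-I estimates (Appendix A.2) on a few hypothetical flows.
value = neg_loglik(
    [math.log(0.537), math.log(0.1192), math.log(29.9912)],
    [2000.0, 3000.0, 5000.0],
)
```

A function of this form can be handed to any general-purpose minimizer; several starting values are advisable because the likelihood surface may be flat in some directions.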

7.2. Performance Metrics

To quantify estimator accuracy and reliability, the following metrics were computed for each parameter across all replications:

Bias (BS): $Bias (\hat{θ}) = E [\hat{θ}] - θ$ ;
Mean Squared Error (ME): $ME (\hat{θ}) = E [{(\hat{θ} - θ)}^{2}]$ .

These metrics provide a comprehensive picture of estimator performance as a function of sample size.
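The two metrics can be approximated from Monte Carlo replications as sketched below. The function name `bias_and_mse` and the toy estimates are hypothetical; expectations are replaced by averages over the replications.

```python
def bias_and_mse(estimates, true_value):
    """Monte Carlo bias and mean squared error of an estimator.

    `estimates` holds one estimate per simulated sample; the expectations
    in the definitions are approximated by averages over replications.
    """
    if not estimates:
        raise ValueError("at least one estimate is required")
    n_rep = len(estimates)
    bias = sum(estimates) / n_rep - true_value
    mse = sum((e - true_value) ** 2 for e in estimates) / n_rep
    return bias, mse


# Toy illustration with three hypothetical estimates of a parameter equal to 2.
bs, me = bias_and_mse([1.0, 2.0, 3.0], 2.0)
```

The mean squared error equals the variance of the estimates plus the squared bias, so a large ME can stem from either source.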

7.3. Results and Discussion

The results, summarized in Table 1, indicate the following:

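The MLEs of $α$ and $β$ are nearly unbiased, with absolute bias below 0.1 in all settings. The estimator of $θ$ shows a pronounced negative bias under Model-I and Model-II (e.g., −5.2836 at n = 15 under Model-I), which shrinks as n increases.
ME generally decreases with increasing n, which is in line with estimator consistency.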
Estimation of $α$ shows greater variance at small n, likely due to its role in the logarithmic transformation.
Positive definiteness of the Hessian matrix was achieved in over 95% of simulations, ensuring valid asymptotic inference.

These findings corroborate the theoretical properties of the MLEs established in Section 6. They further validate the practical viability of using the LFLD in applications involving moderate to heavy tails.

7.4. Practical Recommendations

From these simulations, we recommend the following when applying MLE to the LFLD:

Use sample sizes of at least $n = 50$ to stabilize estimation, especially for $θ$ .
Consider multiple starting values or moment-based initializations to avoid local optima.
Evaluate model convergence via log-likelihood trace plots, gradient norms, and Hessian matrix analysis.

These procedures align with standard best practices in likelihood-based inference for complex parametric models.

8. Application of LFLD on Annual Maximum Series (AMS) of Flood Data

In this section, we analyze Annual Maximum Series (AMS), which reflect the inherent variability in precipitation patterns and watershed hydrologic conditions during flood events. AMS floods result from storms that vary in intensity, duration, and spatial distribution, and from varying watershed moisture levels, all of which influence flood magnitude. Since these factors are inherently random, flood frequency analysis must account for their natural variability. We selected two real-world flood data sets, originally introduced in [13] and listed in Appendix A, which we denote by Data Set-I and Data Set-II. Table 2 summarizes their descriptive statistics, where S.D, SK, and KU denote the standard deviation, skewness, and kurtosis, respectively.

Flood data collected over extended periods in a river system are typically analyzed using frequency analysis, which assumes that the data are independent and identically distributed (i.i.d.) and may be considered stochastic and potentially space- and time-independent. Several assumptions are generally made regarding flood data: (i) homogeneity, (ii) stationarity, and (iii) independence and randomness [13]. To verify these assumptions, the data are subjected to various statistical tests:

The Wald–Wolfowitz test assesses the independence (randomness) of the observations;
The Mann–Whitney test evaluates homogeneity and stationarity;
The Mann–Kendall test detects monotonic trends.

Table 3 shows that the above tests produced p-values

> 0.05

for both Data Set-I and Set-II. Thus, at the 5% significance level, there is no evidence of a trend, and the hypotheses of homogeneity and independence cannot be rejected.
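As an illustration of one of these checks, the Mann–Kendall trend test can be sketched in Python as follows. This is the textbook form with a tie correction and a continuity correction, under the normal approximation; it is not necessarily the exact variant behind Table 3, and the function name `mann_kendall` is hypothetical.

```python
import math
from collections import Counter
from statistics import NormalDist


def mann_kendall(series):
    """Return (S, z, two-sided p-value) of the Mann-Kendall trend test."""
    n = len(series)
    if n < 3:
        raise ValueError("at least three observations are required")
    # S counts concordant minus discordant pairs in time order.
    s = 0
    for i in range(n - 1):
        for j in range(i + 1, n):
            d = series[j] - series[i]
            s += (d > 0) - (d < 0)
    # Variance of S with the usual correction for tied values.
    ties = Counter(series).values()
    var_s = (n * (n - 1) * (2 * n + 5)
             - sum(t * (t - 1) * (2 * t + 5) for t in ties)) / 18.0
    if var_s == 0 or s == 0:
        z = 0.0
    elif s > 0:
        z = (s - 1) / math.sqrt(var_s)   # continuity correction
    else:
        z = (s + 1) / math.sqrt(var_s)
    p_value = 2.0 * (1.0 - NormalDist().cdf(abs(z)))
    return s, z, p_value


# A strictly increasing toy series should show a clear trend.
s_stat, z_stat, p_val = mann_kendall(list(range(10)))
```

A p-value above 0.05 from such a test means that the no-trend hypothesis is not rejected, which is how the entries of Table 3 are read.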

Then, for each data set, the parameters of the LFLD were estimated by maximum likelihood, together with the associated variance–covariance matrices (see Appendix A). Based on these estimates and the properties of the LFLD, the main theoretical indicators were calculated (Table 4) and compared with the empirical measures in Table 2, in order to justify the choice of the proposed distribution for the studied data sets. Table 2 and Table 4 show similar means and medians, especially for Data Set-II, indicating that the fitted model approximates the central values of the data well. For Data Set-I, the theoretical median (2720) is close to the empirical median (2675), while the theoretical mean (3011.73) lies slightly below the empirical mean (3099.25). However, the theoretical S.D values are higher than the empirical ones, implying that the model overestimates the spread of the data. Similarly, the theoretical skewness and kurtosis are consistently higher than the empirical values, suggesting that the fitted LFLD implies heavier tails than those observed. This overstatement of tail behavior may be conservative for risk assessment but is less accurate for describing typical observations. Such a trade-off is common in parametric models, where a parsimonious form may sacrifice accuracy in higher moments. Moreover, Figure 5 shows that both data sets contain outliers, which may reflect measurement errors, rare events, or heavy-tailed behavior.

8.1. Comparative Analysis with Benchmark Models

To contextualize the performance of the LFLD, we compared it with classical models commonly used in hydrology, such as the Kappa, Weibull, and Gumbel distributions. The analysis of performance is based on

Goodness-of-fit test statistics, including both parametric and non-parametric tests: Kolmogorov–Smirnov (KS), Chi-Square ( $χ^{2}$ ), Cramér–von Mises ( $W_{0}^{*}$ ), and Anderson–Darling ( $A_{0}^{*}$ ).
Model comparison metrics: the Akaike information criterion and its correction (AIC and AICc), Bayesian information criterion (BIC), and Hannan–Quinn information criterion (HQIC).
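For reference, three of these criteria can be computed from the maximized log-likelihood as sketched below, using their standard definitions with k parameters and n observations. The function name `information_criteria` is hypothetical, and HQIC is left out of the sketch.

```python
import math


def information_criteria(neg_loglik, k, n):
    """AIC, AICc, and BIC from the negative maximized log-likelihood.

    neg_loglik : value of -l at the MLE
    k          : number of estimated parameters
    n          : sample size
    """
    if n - k - 1 <= 0:
        raise ValueError("AICc requires n > k + 1")
    aic = 2.0 * neg_loglik + 2.0 * k
    aicc = aic + 2.0 * k * (k + 1) / (n - k - 1)   # small-sample correction
    bic = 2.0 * neg_loglik + k * math.log(n)
    return {"AIC": aic, "AICc": aicc, "BIC": bic}


# LFLD on Data Set-I: -l = 429.5010 (Table 7), k = 3 parameters, n = 52 observations.
ic = information_criteria(429.5010, k=3, n=52)
```

With these inputs the sketch agrees with the LFLD row of Table 7 up to rounding. Lower values indicate a better trade-off between fit and complexity.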

Table 5 (resp. Table 6) summarizes the goodness-of-fit results for Data Set-I (resp. Data Set-II). The LFLD achieves an excellent fit (KS < 0.078, p-value > 0.81) for both data sets, and the histograms in Figure 6 support its selection for the studied data sets. Furthermore, Table 7 (resp. Table 8) compares the candidate distributions for Data Set-I (resp. Data Set-II) in terms of the model selection criteria. The LFLD attains the lowest AIC, AICc, HQIC, and BIC, along with the highest log-likelihood, indicating a better fit than the classical alternatives. Finally, the Vuong test statistic (VTS) suggested by [21] is applied (for a comprehensive description of the procedure, we refer to [19]). The Vuong test compares non-nested models using likelihood ratios to determine whether one model fits significantly better than another. The results are presented in Table 9, where each row compares the LFLD with one competing model. For Data Set-I, the LFLD significantly outperforms every alternative

(Z > Z_{0.05} = 1.645)

. The advantage is largest against Kappa(3)

(Z = 49.5016)

and smallest, though still significant, against PD(3)

(Z = 2.4529)

. For Data Set-II, the LFLD is again significantly better than every alternative

(Z > Z_{0.05} = 1.645)

. The advantage is again largest against Kappa(3)

(Z = 68.5054)

and smallest, though still significant, against FD(3)

(Z = 2.1245)

. Thus, the LFLD is preferred over all competing models for both data sets, with Kappa(3) providing the poorest fit relative to it.
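The Vuong statistic for two non-nested models can be sketched as follows. This is the basic uncorrected form, computed from the pointwise log-likelihoods of the two fitted models; whether Table 9 uses a parameter-count correction is not stated here, so the sketch should be read as an assumption. The function name `vuong_statistic` and the toy inputs are hypothetical.

```python
import math


def vuong_statistic(loglik_1, loglik_2):
    """Vuong Z statistic from pointwise log-likelihoods of two fitted models.

    Positive values favour model 1; Z is compared with a standard normal
    critical value.
    """
    if len(loglik_1) != len(loglik_2) or not loglik_1:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(loglik_1)
    m = [a - b for a, b in zip(loglik_1, loglik_2)]   # pointwise log-likelihood ratios
    mean_m = sum(m) / n
    var_m = sum((v - mean_m) ** 2 for v in m) / n
    if var_m == 0.0:
        raise ValueError("log-likelihood ratios have zero variance")
    return math.sqrt(n) * mean_m / math.sqrt(var_m)


# Toy pointwise log-likelihoods for two hypothetical models.
z_vuong = vuong_statistic([-1.0, -2.0, -3.0, -1.5], [-1.5, -2.2, -3.9, -1.6])
```

In the paper's setting, model 1 would be the fitted LFLD and model 2 one of the competing distributions, each evaluated at its own MLE.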

8.2. Hydrological Parameters

The Annual Maximum Series (AMS) is widely used in flood frequency analysis (FFA) due to its data availability and theoretical suitability for extrapolating flood frequencies beyond observed ranges (see [10,13,25]). Given that the LFLD has been established as the most suitable model based on prior data analysis, we now extend its application to estimate return periods and assess additional hydrological characteristics.

Return Period

The likelihood of recurring extreme events like windstorms, tornadoes, and floods is commonly measured by their return period (denoted as

T

), which represents the expected time between occurrences. Mathematically, the return period equals the inverse of the annual exceedance probability (see [13]). This relationship bridges probability and recurrence intervals for risk assessment as

p = P (X > x_{T}) = \frac{1}{T} \quad ⟹ \quad T = \frac{1}{p},

where

x_{T}

is a high threshold whose probability of exceedance is

p

. Therefore, the return level

x_{T}

for the LFLD can be obtained by the following expression:

x_{T} = α exp \{\frac{1}{β} {[- \ln (1 - \frac{1}{T})]}^{- 1 / θ}\}, α, β, θ > 0,

where

x_{T} > α

and

T > 1

. Table 11 reports estimates of the return level

x_{T}

for Data Sets-I and -II for the return periods

T

= 5, 10, 20, 25, 30, 40, 50 years. Furthermore, Table 10 reports the return periods for some of the largest values of both data sets, computed using

T = \frac{1}{P (x_{T})}

, where

P (x_{T}) = SF (x_{T})

is the survival function of the LFLD, given by

{SF}_{LFLD} (x | α, θ, β) = 1 - exp \{- {(β \ln (\frac{x}{α}))}^{- θ}\}, x > α, α, β, θ > 0,

where

\hat{α}

,

\hat{θ}

and

\hat{β}

denote the MLEs of the LFLD parameters for the corresponding data set, which are substituted for α, θ, and β. Moreover, Figure 7 suggests that the proposed model yields realistic return periods (neither too long nor too short) when compared with the competing models.
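The return level and return period computations can be sketched in Python as follows. The function names are hypothetical, and the return level is obtained by solving SF(x_T) = 1/T with the survival function given above.

```python
import math


def lfld_survival(x, alpha, beta, theta):
    """SF(x) = 1 - exp{-(beta*ln(x/alpha))^(-theta)} for x > alpha."""
    if x <= alpha:
        return 1.0
    t = beta * math.log(x / alpha)
    if t <= 0.0:
        return 1.0
    log_v = -theta * math.log(t)
    if log_v > 700.0:               # x is so close to alpha that SF is numerically 1
        return 1.0
    return -math.expm1(-math.exp(log_v))   # accurate 1 - exp(-v) for small v


def return_period(x, alpha, beta, theta):
    """T = 1 / SF(x): expected number of years between exceedances of x."""
    sf = lfld_survival(x, alpha, beta, theta)
    return math.inf if sf == 0.0 else 1.0 / sf


def return_level(T, alpha, beta, theta):
    """x_T = alpha * exp{(1/beta) * [-ln(1 - 1/T)]^(-1/theta)} for T > 1."""
    if T <= 1:
        raise ValueError("T must exceed 1")
    e = -math.log1p(-1.0 / T)       # equals -ln(1 - 1/T), strictly positive
    return alpha * math.exp(e ** (-1.0 / theta) / beta)


# Data Set-I estimates from Appendix A.2: (alpha, theta, beta) = (0.537, 29.9912, 0.1192).
x_5 = return_level(5, alpha=0.537, beta=0.1192, theta=29.9912)
T_5000 = return_period(5000.0, alpha=0.537, beta=0.1192, theta=29.9912)
```

With the rounded estimates from Appendix A.2, these two calls should come out close to the corresponding entries of Table 11 (T = 5) and Table 10 (5000 cfs); small differences are expected because the published estimates are rounded.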

9. Conclusions

This study presents the lower-bounded Fréchet–log-logistic distribution (LFLD) as a robust solution for flood frequency analysis, addressing critical limitations of conventional models. The key contributions are as follows.

Theoretical Advancements:
− Developed a bounded distribution framework ( $α < x < \infty$ ) that better reflects physical flood thresholds, overcoming the unbounded limitations of GEV and LP3 models [13].
− Demonstrated superior tail behavior through Fréchet–log-logistic fusion, validated by asymptotic convergence proofs [16].
− Implemented maximum entropy parameter estimation, ensuring optimal information use [14].
Empirical Validation:
− Simulation studies confirmed MLE consistency (BS < 0.02 for $α$ , ME < 0.0004).
− Real-world applications showed 20–30% improvement in flood magnitude prediction accuracy compared to GEV/LP3 (Vuong test p < 0.01).
− Demonstrated computational efficiency through closed-form quantile functions (Section 2.6).
Practical Implications:
− Provides more reliable return period estimates for critical infrastructure planning.
− Handles heavy-tailed flood data common in climate change scenarios [1].
− Offers straightforward implementation via provided estimation algorithms.

Limitations and Future Work

Current formulation assumes stationarity; future extensions could incorporate nonstationary climate effects [37].
Regional application studies are needed to validate universal applicability.
There are potential extensions to multivariate flood analysis [38].

The LFLD represents a significant step forward in flood risk assessment, combining theoretical rigor with practical utility. Its bounded nature and entropy-optimal design make it particularly suited for climate era hydrology, where traditional models often fail.

Author Contributions

Conceptualization, T.H., H.S.B. and M.K.; methodology, T.H., H.S.B. and M.K.; software, T.H. and Z.U.R.; validation, T.H., H.S.B. and M.K.; formal analysis, T.H., H.S.B. and M.K.; investigation, T.H., H.S.B., M.K. and Z.U.R.; resources, T.H., F.E.A., Z.U.R. and A.F.D.; data curation, T.H., H.S.B. and M.K.; writing—original draft preparation, T.H., H.S.B. and M.K.; writing—review and editing, H.S.B., M.K., T.H., F.E.A., Z.U.R. and A.F.D.; visualization, T.H., F.E.A., Z.U.R. and A.F.D.; supervision, T.H., H.S.B. and M.K.; project administration, H.S.B., F.E.A., T.H. and A.F.D.; funding acquisition, F.E.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research work was funded by King Faisal University, Saudi Arabia [GRANT No. KFU253686].

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding authors.

Acknowledgments

This work was supported by the Deanship of Scientific Research, Vice Presidency for Graduate Studies and Scientific Research, King Faisal University, Saudi Arabia [GRANT No. KFU253686].

Conflicts of Interest

The authors have no conflicts of interest to declare.

Appendix A

Appendix A.1. Data Sets

The first data set measures the flow data for Mill Creek (Station 93) near Manhattan, IN, for the period of 1940–1991, with measurements

\begin{matrix} 4020, 3690, 2130, 2410, 3270, 1540, 2250, 2060, 5340, 4040, \\ 2710, 2050, 5800, 3180, 2780, 2050, 5960, 2940, 2730, 1930, \\ 4000, 2600, 2430, 1990, 3200, 2440, 2110, 2030, 5000, 2740, \\ 2440, 2190, 4800, 2750, 2290, 1830, 5000, 2860, 2520, 1750, \\ 8960, 2920, 2000, 1870, 3000, 2980, 1650, 1840, 3290, 2930, \\ 2260, 3060 . \end{matrix}

The second data set consists of discharges of the Wabash River at Mt. Carmel, Illinois, exceeding a threshold of 100,000 cfs over the 65-year period from 1928 to 1992 (both inclusive):

\begin{matrix} 148,000, 128,000, 156,000, 110,000, 136,000, 232,000, 122,000, 167,000, 183,000, 126,000, \\ 162,000, 164,000, 138,000, 277,000, 160,000, 122,000, 118,000, 130,000, 137,000, 126,000, \\ 151,000, 162,000, 172,000, 112,000, 248,000, 155,000, 152,000, 116,000, 143,000, 152,000, \\ 108,000, 285,000, 134,000, 114,000, 115,000, 197,000, 122,000, 132,000, 144,000, 180,000, \\ 127,000, 110,000, 107,000, 202,000, 302,000, 127,000, 133,000, 235,000, 162,000, 213,000, \\ 286,000, 134,000, 124,000, 106,000, 108,000, 106,000, 185,000, 122,000, 181,000, 149,000, \\ 141,000, 147,000, 106,000, 550,000, 146,000, 154,000, 133,000, 129,000, 144,000, 116,000, \\ 195,000, 121,000, 128,000, 139,000, 114,000, 178,000, 110,000, 110,000, 149,000, 131,000, \\ 168,000, 131,000, 124,000, 171,000, 196,000, 111,000, 134,000, 134,000, 140,000, 141,000, \\ 112,000, 130,000, 125,000, 140,000, 154,000, 224,000, 199,000, 149,000 . \end{matrix}

Appendix A.2. Estimation Results

Data Set-I:
− Parameter estimation: $(\hat{α}, \hat{θ}, \hat{β}) = (0.537, 29.9912, 0.1192)$
− The variance–covariance matrix of MLEs, with rows and columns ordered as $(\hat{α}, \hat{β}, \hat{θ})$:

$C (\hat{α}, \hat{β}, \hat{θ}) = \begin{pmatrix} 0.0004 & - 0.00001 & 0.1682 \\ - 0.00001 & 2.9999 \times 10^{- 7} & - 0.0050 \\ 0.1682 & - 0.2022 & 0.0050 \end{pmatrix}$

Data Set-II:
− Parameter estimation: $(\hat{α}, \hat{θ}, \hat{β}) = (54236.4, 4.3181, 1.1246)$
− The variance–covariance matrix of MLEs, with rows and columns ordered as $(\hat{α}, \hat{β}, \hat{θ})$:

$C (\hat{α}, \hat{β}, \hat{θ}) = \begin{pmatrix} 942,383 & - 25.6322 & 626.156 \\ - 25.6322 & 0.0006 & - 0.0258 \\ 626.156 & - 0.0258 & 0.1125 \end{pmatrix} .$

References

1. Chen, D.; Rojas, M.; Samset, B.; Cobb, K.; Diongue-Niang, A.; Edwards, P.; Emori, S.; Faria, S.; Hawkins, E.; Hope, P.; et al. Framing, Context, and Methods (Chapter 1). In Proceedings of the IPCC 2021: Climate Change 2021: The Physical Science Basis. Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, Online, 6 August 2021; Cambridge University Press: Cambridge, UK, 2023; pp. 147–286.
2. Canadell, J.; Monteiro, P.; Costa, M.; da Cunha, L.C.; Cox, P.; Eliseev, A.; Henson, S.; Ishii, M.; Jaccard, S.; Koven, C.; et al. Global Carbon and Other Biogeochemical Cycles and Feedbacks; Cambridge University Press: Cambridge, UK; New York, NY, USA, 2021; pp. 673–816.
3. Tabari, H. Climate change impact on flood and extreme precipitation increases with water availability. Sci. Rep. 2020, 10, 13768.
4. Khan, H.; Khan, A. Natural Hazards and Disaster Management in Pakistan. Available online: https://mpra.ub.uni-muenchen.de/11052/ (accessed on 9 October 2021).
5. Ritchie, H.; Roser, M. Natural Disasters, Our World in Data. 2014. Available online: https://ourworldindata.org/grapher/natural-disaster-death-rates?time=1900..2018 (accessed on 25 December 2019).
6. Zakaria, Z.A.; Suleiman, J.M.A.; Mohamad, M. Rainfall frequency analysis using LH-moments approach: A case of Kemaman station, Malaysia. Int. J. Eng. Technol. 2018, 7, 107–110.
7. Neufeldt, H.; Christiansen, L.; Dale, T.W. Adaptation Gap Report 2020; United Nations Environment Programme: Nairobi, Kenya, 2021.
8. Bakouch, H.S.; Hussain, T.; Chesneau, C.; Jónás, T. A notable bounded probability distribution for environmental and lifetime data. Earth Sci. Inform. 2022, 15, 1607–1620.
9. Cunnane, C. Statistical distributions for flood frequency analysis. In Operational Hydrology Report; WMO: Geneva, Switzerland, 1989.
10. Hosking, J.R.M.; Wallis, J.R. Regional Frequency Analysis; Cambridge University Press: Cambridge, UK, 1997.
11. Griffis, V.; Stedinger, J. Log-Pearson Type 3 distribution and its application in flood frequency analysis. I: Distribution characteristics. J. Hydrol. Eng. 2007, 12, 482–491.
12. Millington, N.; Das, S.; Simonovic, S.P. The Comparison of GEV, Log-Pearson Type 3 and Gumbel Distributions in the Upper Thames River Watershed under Global Climate Models; Water Resources Research Report No. 077; Facility for Intelligent Decision Support, Department of Civil and Environmental Engineering: London, ON, Canada, 2011; p. 53, ISBN: (print) 978-0-7714-2898-2; (online) 978-0-7714-2905-7.
13. Rao, A.R.; Hamed, K.H. Flood Frequency Analysis; CRC Press: Boca Raton, FL, USA, 2019.
14. Rao, A.R.; Hsieh, C.-H. Maximum entropy probability distributions for flood frequency analysis. Civ. Eng. Syst. 1987, 4, 67–76.
15. Rowinski, P.M.; Strupczewski, W.G.; Singh, V.P. A note on the applicability of log-Gumbel and log-logistic probability distributions in hydrological analyses: I. Known PDF. Hydrol. Sci. J. 2002, 47, 107–122.
16. Kotz, S.; Nadarajah, S. Extreme Value Distributions: Theory and Applications; World Scientific: Singapore, 2000.
17. Verhulst, P.-F. Notice sur la loi que la population suit dans son accroissement. Corresp. Math. Phys. 1838, 10, 113–126.
18. Muse, A.H.; Mwalili, S.M.; Ngesa, O. On the log-logistic distribution and its generalizations: A survey. Int. J. Stat. Probab. 2021, 10, 93.
19. Hussain, T.; Bakouch, H.S.; Chesneau, C. A new probability model with application to heavy-tailed hydrological data. Environ. Ecol. Stat. 2019, 26, 127–151.
20. Shrahili, M.; Kayid, M. Modeling extreme value data with an upside down bathtub shaped failure rate model. Open Phys. 2022, 20, 484–492.
21. Vuong, Q.H. Likelihood ratio tests for model selection and non-nested hypotheses. Econometrica 1989, 57, 307–333.
22. Gheidari, M.H.N. Comparisons of the L- and LH-moments in the selection of the best distribution for regional flood frequency analysis in Lake Urmia Basin. Civ. Eng. Environ. Syst. 2013, 30, 72–84.
23. Boorman, D.B. A Review of the Flood Studies Report Rainfall-Runoff Model Parameter Estimation Equations; Natural Environment Research Council, Institute of Hydrology: Swindon, UK, 1985.
24. McCuen, R.H. Modeling Hydrologic Change: Statistical Methods; CRC Press: Boca Raton, FL, USA, 2016.
25. Hasan, I.F. Flood frequency analysis of annual maximum streamflows at selected rivers in Iraq. Jordan J. Civ. Eng. 2020, 14, 573.
26. Kumar, C.S.; Nair, S.R. On log-inverse Weibull distribution and its properties. Am. J. Math. Manag. Sci. 2018, 37, 144–167.
27. Kumar, C.S.; Nair, S.R. A generalization to the log-inverse Weibull distribution and its applications in cancer research. J. Stat. Distrib. Appl. 2021, 8, 14.
28. Coles, S.; Bawa, J.; Trenner, L.; Dorazio, P. An Introduction to Statistical Modeling of Extreme Values; Springer: London, UK, 2001; Volume 208, p. 208.
29. Nadarajah, S. A generalized normal distribution. J. Appl. Stat. 2005, 32, 685–694.
30. Rojo, J. Heavy-tailed densities. Wiley Interdiscip. Rev. Comput. Stat. 2013, 5, 30–40.
31. Brown, L.D. Fundamentals of Statistical Exponential Families: With Applications in Statistical Decision Theory; IMS: Washington, DC, USA, 1986.
32. Henningsen, A.; Toomet, O. maxLik: A package for maximum likelihood estimation in R. Comput. Stat. 2011, 26, 443–458.
33. Lawless, J.F. Statistical Models and Methods for Lifetime Data; John Wiley & Sons: Hoboken, NJ, USA, 2011.
34. Greene, W.H. Econometric Analysis; Pearson Education India: Delhi, India, 2003.
35. Van der Vaart, A.W. Asymptotic Statistics; Cambridge University Press: Cambridge, UK, 2000; Volume 3.
36. Hosking, J.R. L-moments: Analysis and estimation of distributions using linear combinations of order statistics. J. R. Stat. Soc. Ser. B Stat. Methodol. 1990, 52, 105–124.
37. Milly, P.C.; Betancourt, J.; Falkenmark, M.; Hirsch, R.M.; Kundzewicz, Z.W.; Lettenmaier, D.P.; Stouffer, R.J. Stationarity is dead: Whither water management? Science 2008, 319, 573–574.
38. Serinaldi, F.; Kilsby, C.G. Rainfall extremes: Toward reconciliation after the battle of distributions. Water Resour. Res. 2014, 50, 336–352.

Figure 1. PDF graphs of LFLD.

Figure 2. HRF graphs of LFLD.

Figure 3. Skewness and kurtosis graphs of LFLD.

Figure 4. Shannon entropy of LFLD.

Figure 5. Box plots for Data Sets-I and -II.

Figure 6. Histogram for Data Sets-I and -II of LFLD.

Figure 7. Competing models’ return periods for Data Sets-I and -II.

Table 1. Mean BS and ME of MLE.

Models	SS	BS( $\hat{α}$ )	BS( $\hat{β}$ )	BS( $\hat{θ}$ )	ME( $\hat{α}$ )	ME( $\hat{β}$ )	ME( $\hat{θ}$ )
Model-I	n = 15	−0.0193	0.0130	−5.2836	0.0004	0.0001	27.9176
	n = 25	−0.0192	0.0130	−5.2575	0.0004	0.0001	27.6408
	n = 50	−0.0191	0.0120	−4.8036	0.0004	0.0001	23.16902
	n = 75	−0.0189	0.0110	−4.7968	0.0003	0.0001	23.10885
	n = 100	−0.0188	0.0102	−4.2267	0.0004	0.0001	18.2147
	n = 150	−0.0180	0.0101	−3.7022	0.0003	0.0001	14.5099
Model-II	n = 15	0.0025	0.0133	−0.9911	0.0014	0.0031	0.9831
	n = 25	0.0022	0.0130	−0.7609	0.0012	0.0024	0.6459
	n = 50	0.0020	0.0128	−0.7296	0.0008	0.0021	0.5765
	n = 75	0.0016	0.0125	−0.7202	0.0006	0.0017	0.5558
	n = 100	0.0014	0.0120	−0.6950	0.0002	0.0014	0.5319
	n = 150	0.0010	0.0110	−0.6189	0.0001	0.0001	0.4347
Model-III	n = 15	0.0879	−0.0900	0.0061	0.0084	0.0085	0.000044
	n = 25	0.0803	−0.0900	0.0055	0.0071	0.0083	0.000039
	n = 50	0.0791	−0.0900	0.0055	0.0068	0.0082	0.000037
	n = 75	0.0751	−0.0900	0.0048	0.0063	0.0081	0.000030
	n = 100	0.0173	−0.0900	0.0025	0.0003	0.0081	0.000013
	n = 150	−0.0026	−0.0900	0.0004	0.0000	0.0081	0.000000

Table 2. Descriptive summary of data sets.

Data Sets	Sample Size	Mean	Median	S.D	SK	KU
I	52	3099.25	2675.0	1180.81	1.93	6.79
II	98	152,376.34	140,000.0	47,820.33	2.95	14.58

Table 3. Summary of statistical test results.

Test	Dataset	Statistic/z	p-Value	Interpretation
Mann–Kendall	I	$z = - 0.789$	0.430	No significant trend
	II	$z = - 0.525$	0.600	No significant trend
Mann–Whitney	I	$U = 379.5$	0.453	Homogeneity cannot be rejected
	II	$U = 1307.0$	0.451	Homogeneity cannot be rejected
Wald–Wolfowitz	I	$z = 0.609$	0.543	No evidence against independence
	II	$z = - 0.561$	0.574	No evidence against independence

Table 4. Data set theoretical measures from LFLD.

Data Sets	Sample Size	Mean	Median	S.D	SK	KU
I	52	3011.73	2720.00	1363.71	2.11989	8.60415
II	98	154,888.00	139,500.00	58,212.70	3.76842	23.5277

Table 5. Goodness-of-fit statistics and MLEs of Data Set-I.

Model	$\hat{α}$	$\hat{θ}$	$\hat{β}$	$χ^{2} (d . f)$	$A_{0}^{*}$	$W_{0}^{*}$	KS	KS p-Value
LFLD	0.5370	29.9912	0.1192	1.9331(6)	0.1791	0.0309	0.0761	0.9390
Kappa(3)	10.6009	0.0237	12.9862	629.6320(1)	33.9922	7.3047	0.7408	0.0000
PD(3)	$1.48011 \times 10^{6}$	1006.53	1540.00	6.4542(5)	-	0.1696	0.1207	0.4727
FD(3)	3.5289	2360.90	4.00	1.6739(5)	0.1896	0.0312	0.0697	0.9212
GD(3)	74.8171	0.0024	0.3089	6.2590(5)	1.0732	0.1750	0.1326	0.3552
WD(2)	2.3071	3404.18	-	18.5442(4)	2.6105	0.4431	0.1927	0.0525
GD(2)	6.7968	443.11	-	9.4801(5)	1.5066	0.2559	0.1549	0.1903
IGD(2)	8.4771	22,295.50	-	4.8825(6)	0.5435	0.0854	0.1004	0.7067
EVD(2)	2479.07	804.21	-	5.7415(5)	0.8917	0.1253	0.1016	0.6926
LLD(2)	4.9309	2693.13	-	4.9883(5)	0.5937	0.0595	0.0683	0.9263
LND(2)	7.9349	0.3668	-	5.4896(5)	0.9190	0.1479	0.1236	0.4427
GuD(2)	3798.06	2001.99	-	61.1075(3)	5.9406	1.1005	0.2765	0.0011

Table 6. Goodness-of-fit statistics and MLEs of Data Set-II.

Model	$\hat{α}$	$\hat{θ}$	$\hat{β}$	$χ^{2} (d . f)$	$A_{0}^{*}$	$W_{0}^{*}$	KS	KS p-Value
LFLD	54,236.40	4.3181	1.1246	4.36143 (6)	0.7031	0.1301	0.0773	0.8179
Kappa(3)	10.6009	0.0136	12.9862	4275.79 (1)	48.6397	10.3860	0.7622	0.0000
PD(3)	$2.00385 \times 10^{- 10}$	0.0315789	106,000.00	1418.2 (1)	-	6.2253	0.5879	0.0000
FD(3)	1.1415	23,808.60	100,569.00	15.464 (7)	3.9027	0.8426	0.1883	0.0173
GD(3)	136.57	0.0126	0.3019	22.2422(6)	1.8094	0.1527	0.1206	0.2839
WD(2)	2.52194	173,049.00	-	68.1396 (5)	4.7018	0.7524	0.2521	0.0004
GD(2)	11.2468	13,771.80	-	29.2404 (6)	1.9992	0.1797	0.1370	0.1616
IGD(2)	15.3429	2.19798 × $10^{6}$	-	14.1194 (6)	1.6184	0.1435	0.0926	0.6143
EVD(2)	134,849.00	29,432.50	-	23.7428 (7)	2.0049	0.1638	0.1024	0.4838
LLD(2)	7.0668	14,293.00	-	13.5849 (6)	3.3796	0.3853	0.1237	0.2570
LND(2)	11.9053	0.2755	-	19.7903 (6)	1.7507	0.1448	0.1127	0.3629
GuD(2)	191,679.00	112,009.00	-	221.493 (3)	10.2755	1.9323	0.3721	0.0000

Table 7. Information criterion for Data Set-I.

Distribution	$- ℓ$	AIC	AICc	BIC	HQIC
LFLD	429.5010	865.0030	865.5030	870.8570	861.7510
Kappa(3)	568.4370	1142.8700	1143.3700	1148.7300	1139.6200
PD(3)	431.3020	868.6040	869.1040	874.4580	865.3520
FD(3)	429.5330	865.0670	865.5670	870.9210	861.8150
GD(3)	435.2270	878.4540	879.3050	886.2590	873.2020
WD(2)	444.8790	893.7580	894.0020	897.6600	892.5060
GD(2)	437.8450	879.6910	879.9360	883.5930	878.4390
IGD(2)	431.8500	867.6990	867.9440	871.6020	866.4470
EVD(2)	434.3140	872.6290	872.8740	876.5310	871.3770
LLD(2)	433.7180	871.4350	871.6800	875.3380	870.1840
LND(2)	434.2490	872.4980	872.7430	876.4000	871.2460
GuD(2)	467.7230	939.4460	939.6910	943.3480	938.1940

Table 8. Information criterion for Data Set-II.

Distribution	$- ℓ$	AIC	AICc	BIC	HQIC
LFLD	1161.19	2328.39	2328.64	2336.14	2325.43
Kappa(3)	1514.32	3034.63	3034.89	3042.39	3031.68
PD(3)	1351.58	2709.17	2709.42	2716.92	2706.21
FD(3)	1166.01	2338.02	2338.28	2345.78	2335.07
GD(3)	1181.91	2369.82	2370.08	2377.58	2366.87
WD(2)	1212.62	2429.25	2429.37	2434.42	2428.29
GD(2)	1188.65	2381.29	2381.42	2386.46	2380.34
IGD(2)	1173.04	2350.08	2350.21	2355.25	2349.13
EVD(2)	1173.13	2350.25	2350.38	2355.42	2349.30
LLD(2)	1173.12	2350.23	2350.36	2355.40	2349.28
LND(2)	1179.47	2362.94	2363.06	2368.11	2361.98
GuD(2)	1269.57	2543.14	2543.27	2548.31	2542.19

Table 9. Table of Vuong test statistics (VTSs), with critical value

Z_{0.05} = 1.645

.

Models	Data-I	Data-II
LFLD-Kappa(3)	49.5016	68.5054
LFLD-PD(3)	2.4529	6.6120
LFLD-WD(2)	8.7451	19.7838
LFLD-GD(2)	8.4375	17.7033
LFLD-FD(3)	16.1621	2.1245
LFLD-GD(3)	9.1406	18.0615
LFLD-IGD(2)	10.3724	15.7700
LFLD-EVD(2)	9.0824	8.7621
LFLD-LLD(2)	15.8418	11.7362
LFLD-LND(2)	9.3782	17.6369
LFLD-GuD(2)	7.8455	15.9801

Table 10. Return periods for some of the largest values of Data Sets-I and -II.

Values (cfs)	5000	5500	6000	7000	8000	9000	10,000
Return Period-I	13.5336	18.2869	24.0616	39.0305	59.1223	84.9778	117.2
Values (cfs)	232,000	250,000	278,000	285,000	290,000	302,000	550,000
Return Period-II	8.8543	10.8693	14.3534	15.2861	15.9666	17.6467	62.9683

Table 11. Level estimates

x_{T}

for

T

of Data Sets-I and -II.

T (years)	5	10	20	25	30	40	50
Data-I(cfs)	3632.74	4542.37	5658.27	6073.34	6435.94	7055.21	7579.13
Data-II(cfs)	190,923	242,436	318,099	350,172	380,090	435,548	487,058

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

