Next Article in Journal
Statistical Mechanics of Linear k-mer Lattice Gases: From Theory to Applications
Previous Article in Journal
An Exergy-Enhanced Improved IGDT-Based Optimal Scheduling Model for Electricity–Hydrogen Urban Integrated Energy Systems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Revised Bimodal Generalized Extreme Value Distribution: Theory and Climate Data Application

by
Cira E. G. Otiniano
*,
Mathews N. S. Lisboa
and
Terezinha K. A. Ribeiro
Statistics Department, University of Brasília, Brasília 70910-900, DF, Brazil
*
Author to whom correspondence should be addressed.
Entropy 2025, 27(7), 749; https://doi.org/10.3390/e27070749
Submission received: 3 June 2025 / Revised: 3 July 2025 / Accepted: 11 July 2025 / Published: 14 July 2025
(This article belongs to the Section Information Theory, Probability and Statistics)

Abstract

The bimodal generalized extreme value (BGEV) distribution was first introduced in 2023. This distribution offers greater flexibility than the generalized extreme value (GEV) distribution for modeling extreme and heterogeneous (bimodal) events. However, applying this model requires a data-centering technique, as it lacks a location parameter. In this work, we investigate the properties of the BGEV distribution as redefined in 2024, which incorporates a location parameter, thereby enhancing its flexibility in practical applications. We derive explicit expressions for the probability density, the hazard rate, and the quantile function. Furthermore, we establish the identifiability property of this new class of BGEV distributions and compute expressions for the moments, the moment-generating function, and entropy. The applicability of the new model is illustrated using climate data.

1. Introduction

The Fréchet, Weibull, and Gumbel extreme value distributions [1,2] are genuine probabilistic models for extreme event data, as they correspond to the asymptotic distribution of statistics extreme of independent and identically distributed random variables. The generalized extreme value (GEV) distribution, presented by [3], summarizes the three extreme distributions. For this reason, the GEV distribution is widely used to model extreme events across various fields, including insurance, finance, and hydrology. The theory and applications of the GEV distribution are thoroughly discussed in the books [4,5,6,7,8,9], among others.
A continuous random variable X has a GEV distribution, X F ( . ; ξ , μ , σ ) , if its cumulative distribution function (CDF) and probability density function (PDF) are given, respectively, by 
F ( y ; ξ , μ , σ ) = exp 1 + ξ y μ σ 1 / ξ , if ξ 0 , exp exp y μ σ , if ξ = 0 ,
and
f ( y ; ξ , μ , σ ) = 1 σ 1 + ξ y μ σ 1 / ξ 1 exp 1 + ξ y μ σ 1 / ξ , if ξ 0 , 1 σ exp y μ σ exp y μ σ , if ξ = 0 ,
with shape parameter ξ R , scale parameter σ > 0 , and location parameter μ R .
The support of the GEV distribution depends on the values of the parameters. It is the set x R : [ 1 + ξ y μ σ ] for ξ 0 and R for ξ = 0 .
The parameter ξ determines the weight of the tail of the distribution. The GEV distribution accommodates heavy-tailed and light-tailed distributions and is characterized by its unimodal shape. Some of the unimodal generalizations of the GEV distribution are the transmuted GEV distribution [10,11], the dual gamma generalized extreme value distribution, the exponentiated generalized extreme value distribution [11], and the blended generalized extreme value distribution [12].
In various applications, extreme climate data, such as wind speed, humidity, and temperature, exhibit heterogeneous (bimodal) densities with rare events and heavy tails. A very promising model for extreme heterogeneous data is the bimodal GEV distribution, as defined in [13].
Following [13], a  random variable X has a bimodal GEV (BGEV) distribution; X F BGEV ( · ; ξ , μ , σ , δ ) if its cumulative distribution function is given by
F BGEV ( x ; ξ , μ , σ , δ ) = F ξ , μ , 1 T σ , δ ( x ) ,
with ξ R , μ R , σ > 0 , δ > 1 , and the transformation T σ , δ ( · ) is defined by
T σ , δ ( x ) = σ x | x | δ , x R ,
is invertible and differentiable. The support of the probability density function associated with the distribution (3) depends on the shape parameter. When ξ = 0 , the support is R , when ξ > 0 , the interval is sgn μ 1 ξ | μ 1 ξ | σ 1 1 δ + 1 , + , and when ξ < 0 , the interval is , sgn μ 1 ξ | μ 1 ξ | σ 1 1 δ + 1 , where sgn ( x ) = | x | x .
The disadvantage of the model (3) is that its four parameters are shape parameters. In this distribution, there are no location and scale parameters. In other words, σ is not a scale parameter and  μ is not a location parameter, as is the case with the GEV distribution. Furthermore, the local minimum of the PDF is always located at zero. This limitation complicates its applicability, as real bimodal data can have local minimum at any value of the real line.
The chief goal of this paper is to examine several properties of the new BGEV distribution, which was redefined by [14] and which includes a location parameter, and to illustrate its applicability. Specifically, this paper complements the work of [14] in three directions. First, it presents the proof of the identifiability of the new bimodal GEV, which is crucial for the practical application of this model. Second, it presents expressions for the moments, the moment-generating function, and the differential entropy of the new BGEV model. Third, it presents a real data application of the BGEV distribution in a scenario where a bimodal model for extreme data is needed.
The remainder of this paper is structured as follows. Section 2 presents the main results of this work. We begin in Section 2.1 with the definition of the main functions related to the new BGEV model. Next, in Section 2.2, we provide a graphical illustration of the new BGEV model. Finally, in Section 2.3, we show the main properties of the new BGEV distribution. Section 3 contains an application of the new bimodal BGEV model to climate data. Finally, Section 4 closes the paper with some concluding remarks.

2. The New BGEV Distribution

Initially, in Section 2.1, we show how the model (3) was redefined by [14], presenting the cumulative distribution function, probability density function, failure rate function, and quantile function. The versatility of the BGEV distribution is illustrated through the graphical representations in Section 2.2. The main results of this work are the properties of the new BGEV distribution, which are in Section 2.3.

2.1. The Redefined BGEV Distribution

Definition 1. 
A random variable X has a bimodal GEV distribution with location parameter X BGEV ( ξ , μ , σ , δ ) if  CDF is given by
F ( x ; ξ , μ , σ , δ ) = F ( T ( x ; μ , δ ) ; ξ , 0 , σ ) ,
where
T ( x ; μ , δ ) = x μ x μ δ , δ > 1 , μ R .
The inverse and derivative functions of T ( · ) are, respectively, given by
T 1 ( x ; μ , δ ) = sgn ( x ) | x | 1 / ( δ + 1 ) + μ
and
T ( x ; μ , δ ) = ( δ + 1 ) | x μ | δ .
For simplicity, we write T ( x ; μ , δ ) = T ( x ) , where the notation with parameters is used only to show the role of the parameters σ and μ .
The expressions (7) and (8) allow us to obtain the following PDF of X BGEV ( ξ , μ , σ , δ ) , given by
f ( x ; ξ , μ , σ , δ ) = 1 σ 1 + ξ T ( x ) σ ( 1 / ξ ) 1 exp 1 + ξ T ( x ) σ 1 / ξ T ( x ) , ξ 0 , 1 σ exp T ( x ) σ exp exp T ( x ) σ T ( x ) , ξ = 0 ,
whose support is
Support ( f ( · ; ξ , μ , σ , δ ) ) = μ σ ξ 1 / ( δ + 1 ) , + , if ξ > 0 , , μ + σ ξ 1 / ( δ + 1 ) , if ξ < 0 , ( , + ) , if ξ = 0 .
In the BGEV model, ξ , δ , and  σ are shape parameters, while μ is a location parameter. It is important to note that in the GEV distribution (1), σ is a scale parameter; however, in (5), σ is not a scale parameter, because does not satisfy the condition
f ( x ; ξ , μ , σ , δ ) = 1 σ f x σ ; ξ , μ , 1 , δ ,
since 1 σ T ( x ; μ , δ ) T ( x σ ; μ , δ ) .
On the other hand, the parameter μ in (5) is a location parameter. To prove this, it suffices to observe that T ( x ; μ , δ ) = T ( x μ ; 0 , δ ) and
F ( x ; ξ , μ , σ , δ ) = F ( T ( x μ ; 0 , δ ) ; ξ , 0 , σ ) = F ( T ( x ; μ , δ ) ; ξ , μ , σ ) .
Thus, f ( x ; ξ , μ , σ , δ ) = f ( x μ ; ξ , 0 , σ , δ ) .
The model (5) is a generalization of the GEV distribution, because when δ = 0 the BGEV distribution returns to GEV distribution. That is, X BGEV ( ξ , μ , σ , 0 ) = GEV ( ξ , μ , σ ) .
From the expressions in (5) and (9), it is simple to obtain the survival and hazard functions. These functions are useful in the area of reliability and for calculating risk measures in other areas. The survival and hazard functions are given by the expressions
S ( y ) = 1 exp 1 + ξ T ( y ) σ 1 / ξ , ξ 0 1 exp exp T ( y ) σ , ξ = 0
and
h ( y ) = 1 σ 1 + ξ T ( y ) σ ( 1 / ξ ) 1 exp 1 + ξ T ( y ) σ 1 / ξ T ( y ) 1 exp 1 + ξ T ( y ) σ 1 / ξ , ξ 0 exp T ( y ) σ exp exp T ( y ) σ T ( y ) 1 exp exp T ( y ) σ , ξ = 0 ,
respectively. The support of the survival and hazard functions is the set (10).
An important property of the new BGEV model is that its quantile function has a simple closed-form expression. This feature is extremely useful for simulation procedures and the calculation of risk measures in various applied fields.
From (1), (5), and (7), we have that the quantile function of BGEV model given by
Q ( y ) = sgn σ ξ ( log ( y ) ) ξ 1 σ ξ [ ( log ( y ) ) ξ 1 ] 1 / ( δ + 1 ) + μ , if ξ 0 sgn σ log log y σ log log y 1 / δ + 1 + μ , if ξ = 0 .

2.2. Graphic Illustrations of New BGEV Distribution

The versatility of the new PDF, defined in (9), is illustrated in Figure 1, Figure 2, Figure 3 and Figure 4. Depending on the combination of parameters, the PDF can be unimodal or bimodal, symmetric or asymmetric, and have a heavy or light tail. To better understand the role of each of the four parameters in the PDF, we consider four scenarios. In each scenario, we fix three parameters and let the fourth parameter vary to understand its effect on the curves. In each of the Figure 1, Figure 2, Figure 3 and Figure 4 are the graphs of the PDF (f), CDF (F), survival (S), and hazard functions (h) of X BGEV ( ξ , μ , σ , δ ) .
In Figure 1, Figure 2 and Figure 3, the graphs of the four functions change as the values of ξ , δ , and  σ vary. This illustrates our comment above that the parameters ξ , δ , and  σ are shape parameters. Figure 1 shows the effect of the parameter δ on the curves. When δ = 0 , the PDF is unimodal, whereas it is bimodal for δ > 0 . Furthermore, the larger the value of δ , the further apart and larger the modes are and the heavier the tails. The effect of the parameter ξ on the curves is shown in Figure 2. As  ξ increases, the density tails are heavier and the asymmetry becomes more evident. In Figure 3, one can see that the parameter σ also modifies the PDF. The parameter σ is not a scale parameter, since the PDF remains fixed at the local minimum. This confirms our proof above that σ is not a scale parameter. In Figure 4, the PDF only moves with the variation in μ . This also confirms that μ is a location parameter. Regarding the hazard function h, by depending on the combination of model parameters, the h function is increasing, decreasing, unimodal, N-shaped, or M-shaped. In other words, the BGEV distribution is quite flexible for modeling data regarding survival/reliability.
As previously demonstrated, the mode of distribution is governed by the parameter δ > 0 . In practical applications, extreme data from heterogeneous populations can be appropriately modeled using the BGEV distribution with δ > 0 , reflecting its ability to capture bimodal patterns. For instance,  [13] fitted this distribution to maximum wind speed and maximum temperature data, obtaining δ values of 0.9434 and 0.4201 , respectively, which clearly indicate the presence of bimodality in the extremes. This characteristic is consistent with the seasonal behavior of the studied region, which exhibits two well-defined climatic seasons (wet and dry), each associated with distinct regimes of extreme event occurrences, resulting in two distinct modes throughout the year.

2.3. Properties

In statistics, identifiability is an important property that a family of distributions must satisfy for accurate inference. A distribution of a family is identifiable if different values of the parameters produce different probability distributions. In other words, the parameter of the distribution is unique. The following shows that the family of distributions of F ( · ; ξ , 0 , σ , δ ) in (5) is identifiable. In addition, we derive other properties, including formulas for the moments and quantile functions.

2.3.1. Identifiability

Let F = { F = F ( · ; θ ) } be a family of CDFs. This class F is identifiable if and only if for any F 1 = F ( · ; θ 1 ) , F 2 = F ( · ; θ 2 ) F , the equality F 1 = F 2 implies θ 1 = θ 2 .
Proposition 1. 
The family of BGEV distributions with known parameter δ; F B G E V = { F ( x ; ξ , 0 , σ , δ ) : F ( x ; ξ , 0 , σ , δ )   a s   ( 5 ) } is identifiable.
Proof. 
The authors of [15] demonstrated the identifiability of the finite mixture of GEV distributions, particularly that the family of a GEV component is identifiable. That is, the family F G = { F ( · ; ξ , μ , σ ) : F ( · ; ξ , μ , σ )   as   ( 1 ) } is identifiable. Thus, to prove that for any F ( · ; ξ 1 , 0 , σ 1 , δ ) ) , F ( · ; ξ 2 , 0 , σ 2 , δ ) F B G E V equality
F ( x ; ξ 1 , 0 , σ 1 , δ ) = F ( x ; ξ 2 , 0 , σ 2 , δ ) , x Support ( F ) ,
implies ξ 1 = ξ 2 , σ 1 = σ 2 . From (5), we have that (11) is equivalent to
F ( x | x | δ ; ξ 1 , 0 , σ 1 ) = F ( x | x | δ ; ξ 2 , 0 , σ 2 ) .
Since the function F G is identifiable, it follows from (12) that the equality necessarily implies ξ 1 = ξ 2 and σ 1 = σ 2 .    □

2.3.2. Moments and Moment Generating Function

To calculate the moments of X BGEV ( ξ , μ , σ , δ ) , first consider the gamma function, the upper incomplete gamma function, and the lower incomplete gamma function, defined according to [16], respectively, by
Γ ( a ) : = 0 t a 1 e t d t
Γ ( a , x ) : = x t a 1 e t d t
and
γ ( a , x ) : = 0 x t a 1 e t d t ,
where a R + .
Proposition 2. 
Let X BGEV ( ξ , μ , σ , δ ) with ξ 0 , then the k-th integer moment of X is given by
E ( X k ) = j = 0 k k j ( 1 ) ( k j ) ( δ + 2 ) δ + 1 σ ξ k j δ + 1 i = 0 [ | k j δ + 1 | ] [ | k j δ + 1 | ] i ( 1 ) i γ 1 ξ [ | k j δ + 1 | ] i , 1 + j = 0 k k j σ ξ k j δ + 1 i = 0 [ | k j δ + 1 | ] [ | k j δ + 1 | ] i ( 1 ) i Γ 1 ξ [ | k j δ + 1 | ] i , 1 ,
when ξ > 0 , whenever ξ < δ + 1 k and
E ( X k ) = j = 0 k k j ( 1 ) ( k j ) ( δ + 2 ) δ + 1 σ ξ k j δ + 1 i = 0 [ | k j δ + 1 | ] [ | k j δ + 1 | ] i ( 1 ) i Γ 1 ξ [ | k j δ + 1 | ] i , 1 + j = 0 k k j σ ξ k j δ + 1 i = 0 [ | k j δ + 1 | ] [ | k j δ + 1 | ] i ( 1 ) i γ 1 ξ [ | k j δ + 1 | ] i , 1 ,
when ξ < 0 , whenever ξ > δ + 1 k .
Proof. 
By definition
E ( X k ) = + x k f ( T ( x ) ; ξ , 0 , σ ) T ( x ) d x ,
where f ( · ; ξ , 0 , σ ) is defined in (2), T as in (6), and  T given in (8). By substituting y = T ( x ) into (17), the moments are expressed as follows:
E ( X k ) = + [ sng ( y ) | y | 1 δ + 1 + μ ] k f ( y ; ξ , 0 , σ ) d y .
As k Z + , the Newton Binomial formula is used, so (18) is updated by the integral
E ( X k ) = j = 0 k k j μ j + [ sng ( y ) ] k j | y | k j δ + 1 f ( y ; ξ , 0 , σ ) d y = j = 0 k k j μ j ( 1 ) ( k j ) ( δ + 2 ) δ + 1 E Y k j δ + 1 I [ Y < 0 ] + j = 0 k k j μ j E Y k j δ + 1 I [ Y 0 ] ,
where Y GEV ( ξ , 0 , σ ) and I A is the indicator function of the set A; I A ( ω ) = 1 if ω A and I A ( ω ) = 0 otherwise. Now, we need to analyze the cases where ξ > 0 and ξ < 0 .
Case ξ > 0 : By replacing (2) in (19) and changing the variable t = 1 + ξ σ y 1 / ξ for  r = [ | k j δ + 1 | ] , it follows that
E ( Y k I [ Y 0 ] ) = 0 + y r 1 σ 1 + ξ σ y 1 ξ 1 exp 1 + ξ σ y 1 ξ d y = 1 + σ ξ t ξ σ ξ r e t d t .
Newton’s Binomial is used in (20) and we obtain
E ( Y k I [ Y 0 ] ) = i = 0 r r i ( 1 ) i σ ξ r 1 + t ξ ( r i ) e t d t .
In the same way, the k-th moment of Y truncated in the negative part is
E ( Y k I [ Y < 0 ] ) = σ ξ 0 y r 1 σ 1 + ξ σ y 1 ξ 1 exp 1 + ξ σ y 1 ξ d y = 0 1 σ ξ t ξ σ ξ r e t d t = i = 0 r r i ( 1 ) i σ ξ r 0 1 t ξ ( r i ) e t d t .
The lower incomplete gamma function (15) and the upper incomplete gamma function (14) are used to represent the integrals of (21) and (22), respectively. Consequently, the proof of (16) follows by substituting these updates into Equation (19).
Case ξ < 0 : The same procedure as in the case where ξ > 0 is repeated, respecting the support { y : y ( σ ξ ] } of Y GEV ( ξ , 0 , σ ) .    □
Remark 1. 
From Proposition 2, we have that for ξ > 0 , E ( X k ) is finite for k < δ + 1 ξ . That is, the two shape parameters ξ and δ influence the weight of the tail of the new distribution. Consequently, the tail index of the new BGEV distribution is δ + 1 ξ . That is, the right tail of the BGEV distribution can be heavier than the tail of the GEV distribution.
In the following corollary from Proposition 2, we obtain a known result.
Corollary 1. 
Let X BGEV ( ξ , μ , σ , 0 ) = GEV ( ξ , μ , σ ) where ξ 0 . The k-th moment of X is given by
E ( X k ) = j = 0 k k j μ j σ ξ k j Γ ( 1 ξ ( k j ) ) .
Proof. 
From (19), when δ = 0 , we obtain
E ( X k ) = j = 0 k k j μ j E Y k j = E j = 0 k k j μ j Y k j = E ( Y + μ ) k ,
where Y GEV ( ξ , 0 , σ ) and Y + μ GEV ( ξ , μ , σ ) . The proof ends with the use of the expressions (21) and (22) and the fact that Γ ( x , s ) + γ ( x , s ) = Γ ( x ) .    □
The mean of a random variable X BGEV ( ξ , μ , σ , δ ) exists when ξ < δ + 1 . It is given in the following corollary.
Corollary 2. 
Let X BGEV ( ξ , μ , σ , δ ) with ξ 0 . Then, for ξ > 0
E ( X ) = ( 1 ) δ + 2 δ + 1 σ ξ 1 δ + 1 i = 0 1 1 i ( 1 ) i γ 1 ξ 1 i , 1 + σ ξ 1 δ + 1 i = 0 1 1 i ( 1 ) i Γ 1 ξ 1 i , 1 , f o r   ξ > 0 ,
and
E ( X ) = ( 1 ) δ + 2 δ + 1 σ ξ 1 δ + 1 i = 0 1 1 i ( 1 ) i Γ 1 ξ 1 i , 1 + σ ξ 1 δ + 1 i = 0 1 1 i ( 1 ) i γ 1 ξ 1 i , 1 , f o r   ξ < 0 .
For ξ = 0 , an expression of the moment generating function was obtained. It is given in the following proposition.
Proposition 3. 
Let X BGEV ( ξ , μ , σ , δ ) with ξ = 0 . The moment generating function of X is given by
M X ( t ) = e μ t k = 0 t k k ! ( 1 ) k ( δ + 2 ) δ + 1 E Y k δ + 1 I [ Y < 0 ] + E Y k δ + 1 I [ Y 0 ] ,
where Y GEV ( 0 , 0 , σ ) .
Proof. 
By definition, we have
M X ( t ) = + 1 σ exp { t x } exp T ( x ) σ exp T ( x ) σ d x .
When using the substitution y = T ( x ) in (24) and the fact that
x = sgn ( ln y σ ) | ln y σ | 1 δ + 1 + μ
we have that
M X ( t ) = e μ t 0 + exp sgn ( ln y σ ) | ln y σ | 1 δ + 1 t exp { y } d y .
The new substitution s = ln ( y σ ) allows you to update (25) by
M X ( t ) = e μ t σ + exp sgn ( s ) | s | 1 δ + 1 t exp s σ exp s σ d y .
Finally, the series representation of the exponential function is used. Thus, (26) is rewritten by the equation
M X ( t ) = e μ t k = 0 t k k ! ( 1 ) k ( δ + 2 ) δ + 1 0 1 σ s k δ + 1 exp s σ exp s σ d s + e μ t k = 0 t k k ! 0 + 1 σ s k δ + 1 exp s σ exp s σ d s .
   □
The following result is a particular case of (24). It coincides with the moment-generating function of the Gumbel distribution [6].
Corollary 3. 
Let X BGEV ( ξ , μ , σ , 0 ) = GEV ( ξ , μ , σ ) . The moment generating function of X is given by
M X ( t ) = e μ t Γ ( 1 σ t ) .
Proof. 
When δ = 0 , the expression (23) reduces to
M X ( t ) = e μ t k = 0 t k k ! E Y k δ + 1 = e μ t E k = 0 ( t Y ) k k ! = e μ t E e t Y = e μ t Γ ( 1 σ t ) ,
where Y GEV ( 0 , 0 , σ ) .    □
The mean of X BGEV ( 0 , μ , σ , δ ) always exists. It is given in the following corollary.
Corollary 4. 
Let X BGEV ( 0 , μ , σ , δ ) . The expectation of X is given by
E ( X ) = μ E Y 1 / ( δ + 1 ) + ( 1 ) δ + 2 δ + 1 E Y 1 / ( δ + 1 ) I [ Y < 0 ] + E Y 1 / ( δ + 1 ) I Y 0 ,
where Y GEV ( 0 , 0 , σ ) .
Proof. 
The proof follows from the derivative of (23) at t = 0 .    □

2.3.3. Entropy

The differential entropy of the BGEV distribution is given in the following proposition.
Proposition 4. 
Let X BGEV ( ξ , μ , σ , δ ) with ξ 0 , then the entropy of X is given by
H ( X ) = 1 + ( 1 + ξ ) γ + ln σ δ + 1 1 δ + 1 E [ ln | Y | ] ,
where γ is the Euler constant and Y GEV ( ξ , 0 , σ ) .
Proof. 
By definition
H ( X ) = f ( x ; ξ , μ , σ , δ ) ln [ f ( x ; ξ , μ , σ , δ ) ] d x .
Case ξ > 0 . From (9), Equation (29) becomes
H ( X ) = ln ( σ ) f ( x ; ξ , μ , σ , δ ) d x + 1 ξ + 1 ln 1 + ξ T ( x ) σ f ( x ; ξ , μ , σ , δ ) d x + 1 + ξ T ( x ) σ 1 / ξ f ( x ; ξ , μ , σ , δ ) d x ln ( T ( x ) ) f ( x ; ξ , μ , σ , δ ) d x .
With the substitution y = T ( x ) in (30), it follows that
H ( X ) = ln σ δ + 1 + E ( 1 + ξ Y ) 1 / ξ + 1 ξ + 1 E [ ln ( 1 + ξ Y ) ] 1 δ + 1 E [ ln | Y | ] ,
where Y GEV ( ξ , 0 , 1 ) is as (1). Due to the fact that E ( 1 + ξ Y ) 1 / ξ = 1 and E [ ln ( 1 + ξ Y ) ] = ξ γ , the proof of (28) is complete.
Case ξ = 0 . Again, from (9), the Equation (29) becomes
H ( X ) = ln ( σ ) f ( x ; 0 , μ , σ , δ ) d x + T ( x ) σ f ( x ; 0 , μ , σ , δ ) d x = e T ( x ) σ f ( x ; 0 , μ , σ , δ ) d x ln ( T ( x ) ) f ( x ; 0 , μ , σ , δ ) d x .
With the substitution y = T ( x ) , the Equation (31) is updated by
H ( X ) = ln σ δ + 1 + E ( Y ) + E ( e Y ) 1 δ + 1 E [ ln | Y | ] .
where Y GEV ( 0 , 0 , 1 ) . Since E ( Y ) = γ and E ( e Y ) = 1 , the Equation (32) proves (28).    □

3. Application

In this section, to demonstrate the applicability of the bimodal GEV model, F ( · ; ξ , μ , σ , δ ) with PDF (9), we use data on the minimum humidity and wind gust speed of Goiânia. It is the second most populous city in the Central-West region of Brazil, surpassed only by Brasilia, the capital of Brazil. The city is an important economic hub in the region and is considered a strategic center for areas such as industry, medicine, fashion, and agriculture. In Goiânia, the climate is tropical, with a dry season with two well-defined seasons: rainy (from October to April) and dry (from May to September). In the dry season, relative humidity reaches critical levels and can be close to 10 % , which constitutes a state of emergency.
The data used here correspond to the period from 1 January 2011 to 31 December 2022 and come from the automatic weather station A002 in Goiânia. Data recording is hourly. The data are available from the National Institute of Meteorology on the website https://portal.inmet.gov.br/, accessed on 15 January 2023. The relative humidity (HUM) is calculated as the percentage of water vapor in the atmosphere and the wind speed (WS) is measured in meters per second (km/h).
Table 1 shows the descriptive statistics of HUM. In the period corresponding to the data used here, the minimum humidity recorded was 21.83%.
Since the original HUM and WS data exhibit temporal dependence and the model F ( · ; ξ , μ , σ , δ ) is for independent and identically distributed (i.i.d.) data, we first applied the minimum block technique [17] to obtain a subsample of the minimum values of HUM.
Table 2 presents a summary of the p-values obtained for different block sizes N, based on the Ljung–Box test [18] applied to the subsample of block minimum. It is observed that, from  60 towards, the p-values exceed the 5 % significance level, indicating that for block sizes greater than or equal to 60, the null hypothesis of independence among observations is not rejected. Therefore, N = 60 (1440 h) is adopted as the smallest block size that ensures serial independence within the subsample of minimum.
The left panels of Figure 5 and Figure 6 display the histograms of the HUM and WS variables, respectively. The right panels of these same figures present the histograms of the subsamples composed of the minimum values of HUM and WS. The histograms of the subsamples exhibit a bimodal behavior, which suggests the use of the BGEV distribution as an appropriate model for these data. Given the extreme nature of the observations, both sets of minima were fitted using the GEV and BGEV distributions to assess the modeling ability of each distribution to capture the empirical characteristics observed.
To estimate the parameters of the GEV and bimodal GEV distributions F ( · ; ξ , μ , σ ) and F ( · ; ξ , μ , σ , δ ) , we use the maximum likelihood technique that is implemented in the EVD [19] and bgev packages [14] in the R Project for Statistical Computing [20]. The maximum likelihood estimation algorithm of the bgev package is described in the Appendix A.
Table 3 shows the estimates and standard errors for the minimum subsamples of the HUM and WS.
The parameter δ > 0 in the BGEV distribution is associated with the presence of bimodality, as previously discussed in Section 2.2. From Table 3, we observe that the estimates of δ for HUM and WS are 0.54 and 0.36, respectively. These values indicate the presence of an inherent bimodal structure in the data.
To assess the goodness of fit of the BGEV and GEV distributions to the minima of HUM and WS, we used the Akaike Information Criterion (AIC) [21]. The AIC values obtained for the HUM variable were 611.2 (BGEV) and 2550.6 (GEV), while for WS they were 834.2 (BGEV) and 3452.2 (GEV). These results indicate that the BGEV distributions provide a significantly better fit than the GEV distribution to model the minimum values of relative humidity and wind gust speed observed in Goiânia.
The Shannon entropy of a continuous random variable X F , denoted by H ( X ) and defined in Equation (29), quantifies the average uncertainty associated with the distribution F. In general, the higher the entropy, the greater the uncertainty of F in representing the observations of X. In the context of extreme value modeling, let H 1 ( x ) be the entropy of GEV ( ξ , μ , σ ) = BGEV ( ξ , μ , σ , 0 ) and let H 2 ( x ) be the entropy of BGEV ( ξ , μ , σ , δ ) . When both the GEV and BGEV distributions are used to model extreme data, Equation (28) shows that, for  δ > 0 , the following inequality holds:
H 2 ( x ) < H 1 ( x ) , whenever σ δ + 1 > 1 .
Applying this condition to the parameter estimates obtained to model the minimum HUM and WS, as presented in Table 3, we find
σ ^ δ ^ + 1 = 41.75974 and σ ^ δ ^ + 1 = 14.16 ,
for HUM and WS, respectively. This result implies that the entropy of the BGEV distribution is lower than that of the GEV, indicating that the BGEV model provides a more efficient fit to the data, in the sense of lower uncertainty associated with the variability of the observations.
The better performance of the BGEV distribution is further illustrated in the right panels of Figure 5 and Figure 6, which present the histograms of the adjusted GEV and BGEV densities for the minimum relative humidity and wind gust speed data, respectively.

4. Conclusions

The GEV distribution is a crucial tool for modeling extreme data. However, this distribution is not well suited to datasets that exhibit bimodal behavior. In this work, we examine a recent extension of the GEV distribution that accommodates bimodal data, known as the BGEV distribution, which was introduced by [13] and later redefined by [14].
In short, the main contributions of this paper are as follows. First, it presents a detailed explanation of the redefinition of the BGEV model. Second, the versatility of the BGEV distribution is illustrated through graphical representations of its PDF, which can be highly flexible, exhibiting unimodal or bimodal characteristics, as well as being symmetric or asymmetric and possessing either heavy or light tails. Third, it provides a comprehensive proof of the key properties of the new BGEV distribution, including identifiability, moments, the moment-generating function, and differential entropy. Fourth, it illustrates the usefulness of the new BGEV distribution through the application of climate data. Overall, the BGEV distribution is more effective than the GEV distribution when bimodality is inherent in the data.
A natural extension of the present work is the development of a regression model based on the BGEV distribution. The authors of this paper are currently developing a new class of regression model based on a median reparameterization of the redefined BGEV distribution discussed here. This work is in progress and the results will be reported elsewhere. Another promising extension of this work consists in developing time series models with innovations following the BGEV distribution.
A relevant limitation of the BGEV distribution arises from the fact that its support depends directly on the parameters themselves. This dependence presents challenges in optimizing the likelihood function. In the bgev package, specific strategies have been built to overcome these restrictions and ensure the numerical convergence of the algorithm, as described in Step 3 (error treatment of input parameters) of the algorithm in Appendix A. In addition, the adjustments of ξ and δ exert a strong influence on the behavior of the tails. In contexts with small sample sizes, these adjustments tend to be unstable and difficult.   

Author Contributions

Conceptualization, C.E.G.O. and M.N.S.L.; methodology, C.E.G.O.; software, M.N.S.L.; validation, M.N.S.L. and T.K.A.R.; formal analysis, C.E.G.O.; investigation, C.E.G.O.; resources, T.K.A.R.; data curation, M.N.S.L.; writing—original draft preparation, C.E.G.O.; writing—review and editing, T.K.A.R.; visualization, M.N.S.L.; supervision, C.E.G.O.; project administration, C.E.G.O.; funding acquisition, C.E.G.O. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by DPI/BCE/UnB through Call for Proposals No. 001/2025 DPI/BCE/UnB.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The data presented in this study are available in https://bdmep.inmet.gov.br/, accessed on 10 July 2025.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

    The following abbreviations are used in this manuscript:
AICAkaike Information Criterion
BGEVBimodal Generalized Extreme Value
CDFCumulative Distribution Function
GEVGeneralized Extreme Value
HUMRelative Humidity
PDFProbability Density Function

Appendix A. MLE Procedure for BGEV Parameters

In the bgev package, the maximum likelihood estimation of the parameters ξ , μ , σ , and δ is carried out through the following steps:

Algorithm

  • Compute the log-likelihood:
    Entropy 27 00749 i001
  • Calculate the GEV MLE (used as initial values):
    Entropy 27 00749 i002
  • Perform BGEV MLE estimation:
    Entropy 27 00749 i003

References

  1. Fisher, R.A.; Tippett, L.H.C. Limiting forms of the frequency distribution of the largest and smallest member of a sample. Proc. Cambridge Philos. Soc. 1928, 24, 180–190. [Google Scholar] [CrossRef]
  2. Gnedenko, B.V. Sur la distribution limite du terme maximum d’une serie aleatoire. Ann. Math. 1943, 44, 423–453. [Google Scholar] [CrossRef]
  3. Jenkinson, A.F. The frequency distribution of the annual maximum (or minimum) values of meteorological elements. Q. J. R. Meteorol. Soc. 1955, 81, 158–171. [Google Scholar] [CrossRef]
  4. Embrechts, P.; Klüppelberg, C.; Mikosch, T. Modelling Extremal Events: For Insurance and Finance; Springer: Berlin, Germany, 1997. [Google Scholar]
  5. Haan, L.; Ferreira, A. Extreme Value Theory: An Introduction; Springer: New York, NY, USA, 2006. [Google Scholar]
  6. Kotz, S.; Nadarajah, S. Extreme Value Distributions: Theory and Applications; Imperial College Press: London, UK, 2000. [Google Scholar]
  7. Reiss, R.-D.; Thomas, M. Statistical Analysis of Extreme Values with Applications to Insurance, Finance, Hydrology and Other Fields; Birkhäuser: Basel, Switzerland, 2007. [Google Scholar]
  8. Resnik, S. Extreme Values, Regular Variation and Weak Convergence; Springer: New York, NY, USA, 1987. [Google Scholar]
  9. Rudd, E.M.; Jain, L.P.; Scheirer, W.J.; Boult, T.E. The extreme value machine. IEEE Trans Pattern Anal. Mach. Intell. 2017, 40, 762–768. [Google Scholar] [CrossRef] [PubMed]
  10. Aryal, G.R.; Tsokos, C.P. On the transmuted extreme value distribution with application. Nonlinear Anal. 2009, 71, 1401–1407. [Google Scholar] [CrossRef]
  11. Nascimento, F.; Bourguignon, M.; Leão, J. Extended generalized extreme value distribution with applications in environmental data. Hacettepe. J. Math. Stat. 2016, 45, 1847–1864. [Google Scholar] [CrossRef]
  12. Krakauer, N.Y. Extending the blended generalized extreme value distribution. Discov. Civ. Eng. 2024, 1, 97. [Google Scholar] [CrossRef]
  13. Otiniano, C.E.G.; Paiva, B.S.; Vila, R.; Bourguignon, M. A bimodal model for extremes data. Environ. Ecol. Stat. 2023, 30, 261–288. [Google Scholar] [CrossRef]
  14. Otiniano, C.E.G.; Oliveira, Y.L.S.; Sousa, T.R. Bimodal GEV Distribution with Location Parameter. 2024. Available online: https://CRAN.R-project.org/package.bgev (accessed on 15 January 2023).
  15. Gonçalves, C.R.; Otiniano, C.E.G.; Crivinel, E.C. Estimation of a nonlinear discriminant function from a mixture of two GEV distributions. J. Stat. Comput. Sim. 2018, 88, 1147–1171. [Google Scholar] [CrossRef]
  16. Abramowitz, M.; Stegun, I.A. Handbook of Mathematical Functions: With Formulas, Graphs, and Mathematical Tables; Dover Publications: New York, NY, USA, 1965. [Google Scholar]
  17. Jondeau, E.; Poon, S.H.; Rockinger, M. Financial Modeling Under Non-Gaussian Distributions; Springer: New York, NY, USA, 2007. [Google Scholar]
  18. Box, G.E.P.; Pierce, D.A. Distribution of residual correlations in autoregressive-integrated moving average time series models. J. Am. Stat. Assoc. 1970, 65, 1509–1526. [Google Scholar] [CrossRef]
  19. Stephenson, A.G. evd: Extreme value distributions. R News 2002, 2, 31–32. [Google Scholar]
  20. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2022. [Google Scholar]
  21. Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Control. 1974, 19, 716–723. [Google Scholar] [CrossRef]
Figure 1. Graphs of X BGEV ( 0 , 0 , 1 , δ ) with δ varying: PDF (top left), CDF (top right), Survival (bottom left), and Hazard (bottom right).
Figure 1. Graphs of X BGEV ( 0 , 0 , 1 , δ ) with δ varying: PDF (top left), CDF (top right), Survival (bottom left), and Hazard (bottom right).
Entropy 27 00749 g001
Figure 2. Graphs of X BGEV ( ξ , 0 , 1 , 1 ) with ξ varying: PDF (top left), CDF (top right), Survival (bottom left), and Hazard (bottom right).
Figure 2. Graphs of X BGEV ( ξ , 0 , 1 , 1 ) with ξ varying: PDF (top left), CDF (top right), Survival (bottom left), and Hazard (bottom right).
Entropy 27 00749 g002
Figure 3. Graphs of X BGEV ( 0 , 0 , σ , 1 ) with σ varying. PDF (top left), CDF (top right), Survival (bottom left), and Hazard (bottom right).
Figure 3. Graphs of X BGEV ( 0 , 0 , σ , 1 ) with σ varying. PDF (top left), CDF (top right), Survival (bottom left), and Hazard (bottom right).
Entropy 27 00749 g003
Figure 4. Graphs of X BGEV ( 0 , μ , 1 , 1 ) with μ varying: PDF (top left), CDF (top right), Survival (bottom left), and Hazard (bottom right).
Figure 4. Graphs of X BGEV ( 0 , μ , 1 , 1 ) with μ varying: PDF (top left), CDF (top right), Survival (bottom left), and Hazard (bottom right).
Entropy 27 00749 g004
Figure 5. Histogram of the original HUM data (left panel) and histogram versus adjusted GEV and BGEV densities for HUM data (right panel).
Figure 5. Histogram of the original HUM data (left panel) and histogram versus adjusted GEV and BGEV densities for HUM data (right panel).
Entropy 27 00749 g005
Figure 6. Histogram of the original WS data (left panel) and histogram versus adjusted GEV and BGEV densities for WS data (right panel).
Figure 6. Histogram of the original WS data (left panel) and histogram versus adjusted GEV and BGEV densities for WS data (right panel).
Entropy 27 00749 g006
Table 1. Descriptive statistics of HUM and WS.
Table 1. Descriptive statistics of HUM and WS.
DataMeanStandard DeviationMedianMaximumMinimum
HUM 64.00 14.54 66.56 91.12 21.83
WS 21.3 9.7 22.5 43 8.6
Table 2. Block size and p-value of Ling–Box test.
Table 2. Block size and p-value of Ling–Box test.
Np-Value (HUM)p-Value (WS)
55<0.0001<0.0001
560.0011<0.0001
570.00100.0021
580.00230.0034
590.00390.0051
600.00590.0057
610.00500.0089
620.00710.0128
630.01010.0241
640.01340.0301
650.02220.0312
Table 3. Estimates and standard errors under the GEV and BGEV distributions for the minimum HUM and WS.
Table 3. Estimates and standard errors under the GEV and BGEV distributions for the minimum HUM and WS.
BGEVGEV
DataParametersEstimatesStd. ErrorEstimatesStd. Error
ξ 0.37 0.09 0.58 0.08
HUM μ 41.88 0.49 43.07 1.85
σ 64.31 2.92 14.66 1.53
δ 0.54 0.18 --
ξ −0.060.14−0.030.21
WS μ 18.470.2514.781.48
σ 19.263.8615.432.59
δ 0.360.56--
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Otiniano, C.E.G.; Lisboa, M.N.S.; Ribeiro, T.K.A. A Revised Bimodal Generalized Extreme Value Distribution: Theory and Climate Data Application. Entropy 2025, 27, 749. https://doi.org/10.3390/e27070749

AMA Style

Otiniano CEG, Lisboa MNS, Ribeiro TKA. A Revised Bimodal Generalized Extreme Value Distribution: Theory and Climate Data Application. Entropy. 2025; 27(7):749. https://doi.org/10.3390/e27070749

Chicago/Turabian Style

Otiniano, Cira E. G., Mathews N. S. Lisboa, and Terezinha K. A. Ribeiro. 2025. "A Revised Bimodal Generalized Extreme Value Distribution: Theory and Climate Data Application" Entropy 27, no. 7: 749. https://doi.org/10.3390/e27070749

APA Style

Otiniano, C. E. G., Lisboa, M. N. S., & Ribeiro, T. K. A. (2025). A Revised Bimodal Generalized Extreme Value Distribution: Theory and Climate Data Application. Entropy, 27(7), 749. https://doi.org/10.3390/e27070749

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop