Next Article in Journal
A Study of Adaptive Fractional-Order Total Variational Medical Image Denoising
Next Article in Special Issue
Ergodic Stationary Distribution and Threshold Dynamics of a Stochastic Nonautonomous SIAM Epidemic Model with Media Coverage and Markov Chain
Previous Article in Journal
Fractional-Order Financial System and Fixed-Time Synchronization
Previous Article in Special Issue
A Model Based on Fractional Brownian Motion for Temperature Fluctuation in the Campi Flegrei Caldera
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Modelling the Frequency of Interarrival Times and Rainfall Depths with the Poisson Hurwitz-Lerch Zeta Distribution

1
Department of Agricultural, Food and Forest Sciences, Università di Palermo, Viale delle Scienze, 13, 90128 Palermo, Italy
2
Department of Mathematics “G. Peano”, Università degli Studi di Torino, Via Carlo Alberto, 10, 10123 Torino, Italy
3
Interuniversity Department of Regional and Urban Studies and Planning, Università degli Studi di Torino, Politecnico di Torino, Viale Pier Andrea Mattioli, 39, 10125 Torino, Italy
*
Author to whom correspondence should be addressed.
Fractal Fract. 2022, 6(9), 509; https://doi.org/10.3390/fractalfract6090509
Submission received: 25 July 2022 / Revised: 6 September 2022 / Accepted: 7 September 2022 / Published: 11 September 2022
(This article belongs to the Special Issue Stochastic Modeling in Biological System)

Abstract

:
The Poisson-stopped sum of the Hurwitz–Lerch zeta distribution is proposed as a model for interarrival times and rainfall depths. Theoretical properties and characterizations are investigated in comparison with other two models implemented to perform the same task: the Hurwitz–Lerch zeta distribution and the one inflated Hurwitz–Lerch zeta distribution. Within this framework, the capability of these three distributions to fit the main statistical features of rainfall time series was tested on a dataset never previously considered in the literature and chosen in order to represent very different climates from the rainfall characteristics point of view. The results address the Hurwitz–Lerch zeta distribution as a natural framework in rainfall modelling using the additional random convolution induced by the Poisson-stopped model as a further refinement. Indeed the Poisson contribution allows more flexibility and depiction in reproducing statistical features, even in the presence of very different climates.

1. Introduction

Analysis of rainfall data, and the subsequent modelling of the many variables concerning rainfall, is fundamental to many areas such as agricultural, ecological and engineering disciplines. From assessing hydrological risk to both crop and hydropower plannings, rainfall modelling is of the utmost importance. Moreover, being able to provide reliable rainfall modelling is essential in the well known issue of climate change. Due to the complexity of hydrological systems, their analysis and modelling rely heavily on historical records. Rainfall historical records are of various time scales, from hourly data to annual data. However, daily rainfall series are arguably the most used information in environmental, climate, hydrological, and water resources studies [1]. Rainfall manifests one peculiar characteristic which is common to many other geophysical processes: intermittence [2]. Intermittence is found in variables which are related to the internal and external structure of rainfall. The most commonly seen for the external structure are the Wet Spells ( W S ) and Dry Spells ( D S ), meaning the sequences of rainy days and non-rainy days, respectively. A way of studying the alternance of W S and D S is through the Interarrival Times ( I T ), that is the time elapsed between two consecutive days of rain. If we suppose that I T observations are independent and identically distributed (i.i.d.), one natural way to model them is through the well known renewal processes [3]. Many examples can be found in the literature. The simplest renewal process, the Bernoulli process, has been used in [4] for example. In this case, the I T ’s are geometrically distributed. Its continuous counterpart, the Poisson process, has been used for its simpler mathematical tractability, but requires dealing with the I T random variable (r.v.) as continuous, despite its discrete nature. The need to suppose a non-constant probability of rain requires slightly more sophisticated models.
The challenge of this paper is to propose a suitable discrete distribution to fit I T at the daily scale. It is on this time scale that the intermittent character of precipitation can be appreciated and at the same time most practical applications are possible. The proposed distribution must be able to model both the numerous occurrences of the value equal to one, which represent the sequence of rainy days, and some large values scattered over time and responsible for drought phenomena. Our starting point is the three parameter family Hurwitz–Lerch Zeta distribution (HLZD), successfully proposed in [5]. Such a distribution represents a step forward with respect to other commonly used I T modelling distributions, such as the logarithmic one. In Section 3, we summarize the main properties of the HLZD and state new results on its log-concavity and convolution. As a step forward, in this paper we propose to model the I T r.v. using the Poisson-stopped HLZD (PSHLZD). This discrete distribution presents excess zeroes (paralleling the excess of I T = 1 ) and a long tail [6]. The PSHLZD has been used in [6] for comparisons with the negative binomial distribution, a popular model for fitting over-dispersed data. Indeed the PSHLZD can be seen both as a Poisson-stopped sum of HLZD’s as well as a generalization of a negative binomial distribution. The Poisson contribution allows us to model the superposition of i.i.d. HLZD’s in the observed time series as rare event. In Section 4, we summarize its main properties using the combinatorics of exponential Bell polynomials. It is noteworthy to mention that Bell polynomials are used within fractional calculus, see for example [7,8] and within fractal models [9]. Moreover, new results are added on the the PSHLZD, as for example on log-concavity.
A second goal of this paper is to show that the PSHLZD is also a suitable model for a different feature strictly related to the internal structure of rainfall: the depth (or the intensity) of the rainy days [10]. In the literature, refs. [11,12] rainfall depths are more often treated as continuous despite that sometimes these models fail to account for the time discreteness of the sample process [13]. Daily rainfall depth measurements are almost always performed by automatically counting how many times a small bucket corresponding to 0.2 mm is filled. This led use to treat them as discrete, because of the abundance of ties in the data. Finally, in Section 5 we have also considered a third modelling distribution: the One Inflated HLZD (OIHLZD). Such a distribution mixes two generating processes: the first generates one’s and the latter is governed by a HLZD. This stochastic structure takes into account the dominance of one’s in the rainfall depth or interarrival time series.
In Section 6, we discuss the results for fitting all these models to rainfall data, proving that the PSHLZD provides a very general framework for rainfall modelling. Indeed the PSHLZD replicates the fitting features of the OIHLZD and outperforms the fitted HLZD in some cases. The PSHLZD has a limited number of parameters and at the same time can adapt very well to data collected in very different climates, from England to Sicily. Let us underline that the analyzed dataset has never been considered in the literature and consists of measures sampled along 70 years at 5 different stations. These stations were chosen in order to represent different climates from the rainfall characteristics point of view. In fact, the interarrival data examined are very different from each other, with a regular pattern of many rainy days in England, and a winter rainy season alternating with long periods in summer without rain, typical of the Sicily Mediterranean climate. The same is for the rainfall depth, namely many small depths in England, and few big storms in Sicily. This made it possible to confirm the great utility of the proposed statistical models within rainfall modelling. Some concluding remarks and future developments are addressed at the end of the paper.

2. Bell Polynomials in a Nutshell

The partial exponential Bell polynomials are usually written as [14]
B n , j ( z 1 , , z n j + 1 ) = n ! ( 1 ! ) r 1 r 1 ! ( 2 ! ) r 2 r 2 ! z 1 r 1 z n j + 1 r n j + 1 n N , j n
where the summation is over all the solutions in non-negative integers r 1 , , r n j + 1 of r 1 + 2 r 2 + + ( n j + 1 ) r n j + 1 = n and r 1 + r 2 + + r n j + 1 = j . A lighter expression is obtained using partitions of the integer n with length j . Recall that a partition of an integer n is a sequence π = ( π 1 , π 2 , ) of weakly decreasing positive integers, named parts of π , such that π 1 + π 2 + = n . A different notation is π = ( 1 r 1 , 2 r 2 , ) , where r 1 , r 2 , , named multiplicities of π , are the number of parts of π equal to 1 , 2 , respectively. The length of the partition is l ( π ) = r 1 + r 2 + and the vector of multiplicities is m ( π ) = ( r 1 , r 2 , ) . We write π n to denote that π is a partition of n . Thus the partial exponential Bell polynomials (1) can be rewritten as [15]
B n , j ( z 1 , , z n j + 1 ) = π n , l ( π ) = j d π z π
where the sum is over all the partitions π n with length l ( π ) = j and
z π = z 1 r 1 z 2 r 2 d π = i ! ( 1 ! ) r 1 r 1 ! ( 2 ! ) r 2 r 2 ! .
Using integer partitions, the explicit expression of the partial exponential polynomials can be recovered in R using the kStatistics package [16]. A useful property used in the following is
B n , j ( a b z 1 , , a b n j + 1 z n j + 1 ) = a j b n B n , j ( z 1 , , z n j + 1 )
with a , b constants. Equation (4) follows from (2) since from (3) we have
( a b z 1 ) r 1 ( a b 2 z 2 ) r 2 = a r 1 + r 2 + b r 1 + 2 r 2 + z π = a j b n z π
taking into account that l ( π ) = r 1 + r 2 + = j and r 1 + 2 r 2 + = n .
The n-th complete exponential Bell polynomials in the indeterminates z 1 , , z n is defined as [14]
B n ( z 1 , , z n ) = j = 0 n B n , j ( z 1 , , z n j + 1 )
with { B n , j } the partial exponential Bell polynomials (1). Note that n is the positive integer corresponding to the maximum degree of the monomials in (5). This polynomial sequence satisfies the following recurrence [14]
B n + 1 ( z 1 , , z n + 1 ) = j = 0 n n j z j + 1 B n j ( z 1 , , z n j )
with the initial value B 0 = 1 . The generating function of { B n } is the formal power series composition [14]
exp [ h z ( t ) z 0 ] = n 0 t n n ! B n ( z 1 , , z n ) R [ [ t ] ]
where R [ [ t ] ] is the ring of formal power series in t and h z ( t ) is the generating function of { z k } k 0 , that is
h z ( t ) = k 0 t k k ! z k .
A different expression of the n-th complete exponential Bell polynomial involves integer partitions [15] as follows
B n ( z 1 , , z n ) = π n d π z π
where the sum is over all the partitions π n , d π and z π are given in (3). In particular we have
B n ( λ z 1 , , λ z n ) = π n λ l ( π ) d π z π
with λ a constant. Now, suppose to replace λ l ( π ) in (9) with a numerical sequence { a l ( π ) } . Thanks to this device, the complete exponential Bell polynomials results as a special case of a wider class of polynomial families, the generalized partition polynomials [16]
B n ( a 1 , , a n ; z 1 , , z n ) = π n d π a l ( π ) z π
where the sum is again over all the partitions π n . A different expression of (10) involves the partial exponential Bell polynomials { B n , j } in (1)
B n ( a 1 , , a n ; z 1 , , z n ) = j = 1 n a j B n , j ( z 1 , , z n j + 1 ) .
An example of a well known polynomial family, arising from (11) is the logarithmic one [14]
L n ( z 1 , , z n ) = j = 0 n ( 1 ) j 1 ( j 1 ) ! B n , j ( z 1 , , z n j + 1 ) .

3. The Hurwitz-Lerch Zeta Distribution

Definition 1.
A discrete random variable Y = d HLZD ( a , θ , s ) if
q y : = P ( Y = y ) = θ y T ( θ , s , a ) ( y + a ) s + 1 , θ ( 0 , 1 ) , a > 1 , s R
for y = 1 , 2 , , with T ( θ , s , a ) = θ Φ ( θ , s + 1 , a + 1 ) , where
Φ ( θ , s , a ) = n = 0 θ n ( n + a ) s
is the Lerch Transcendent function.
The probability generating function (pgf) of Y = d HLZD ( a , θ , s ) is
G Y ( z ) = θ Φ ( z θ , s + 1 , a + 1 ) Φ ( θ , s , a ) , | z | 1
with G Y ( 0 ) = 0 .

3.1. Moments and Cumulants

HLZD moments have a closed form expression involving the Lerch Transcendent function. Differently from [17], we find this closed form expression using (13).
Proposition 1.
If Y = d HLZD ( a , θ , s ) , then
ξ k : = E [ Y k ] = j = 0 k ( a ) k j k j Φ ( θ , s + 1 j , a ) Φ ( θ , s + 1 , a + 1 ) .
Proof. 
Using the binomial expansion of y k = ( y a + a ) k , we have
ξ k = y = 1 y k P ( Y = y ) = y = 1 y k θ y 1 Φ ( θ , s + 1 , a + 1 ) ( y + a ) s + 1 = y = 1 j = 0 k k j ( y + a ) j ( a ) k j θ y 1 Φ ( θ , s + 1 , a + 1 ) ( y + a ) s + 1 = j = 0 k k j ( a ) k j 1 Φ ( θ , s + 1 , a + 1 ) y = 1 θ y 1 ( y + a ) s + 1 j
from which (16) follows by taking into account (14).    □
As a corollary, the mean and the variance are respectively:
E [ Y ] = T ( θ , s 1 , a ) T ( θ , s , a ) a Var [ Y ] = T ( θ , s 2 , a ) T ( θ , s , a ) T ( θ , s 1 , a ) T ( θ , s , a ) 2 .
More generally, the k-th central moment can be recovered as
ξ k : = E [ ( Y ξ 1 ) k ] = j = 0 k k j ξ 1 j ξ k j
and the factorial moments as
( ξ ) k = E [ Y ( Y 1 ) ( Y k + 1 ) ] = j = 0 k s ( k , j ) ξ k
with s ( k , j ) Stirling numbers of the first kind [14]. HLZD cumulants are such that [14]
κ n ( Y ) = L n ( ξ 1 , , ξ n ) n = 1 , 2 ,
where { ξ j } are the moments of Y = d HLZD ( a , θ , s ) , given in (16), and L n is the n-th logarithmic polynomial (12). Let us recall that, if the moment generating function (mgf) M Y ( t ) of Y is well defined in a suitable neighborhood of 0 , then the coefficients { κ n ( Y ) } in the expansion
M Y ( t ) = exp n 1 t n n ! κ n ( Y )
are the cumulants of Y . The first cumulant is the mean E [ Y ] , the second cumulant is the variance Var ( Y ) , the skewness and the kurtosis of Y can be recovered using the third and the fourth cumulant of Y respectively.

3.2. Mode

The HLZ distribution is a particular case of a wider class of distributions called the Modified Power Series Distributions (MPSD) [18].
Definition 2.
A discrete random variable Y = d MPSD ( a , g , f ) if
p y : = P ( Y = y ) = a ( y ) g ( θ ) y f ( θ ) , y T N
where a ( y ) , g ( θ ) and f ( θ ) are positive, bounded, and differentiable functions of y and θ respectively with f ( θ ) = y T a ( y ) g ( θ ) y .
Using this wider class of distributions, we will prove that Y = d HLZD ( a , θ , s ) is unimodal for all s R . To this aim, let us recall that a discrete distribution with support T Z is said to be strongly unimodal if and only if the sequence { p y } y T , with p y P ( Y = y ) , is a logarithmically concave sequence [19], that is if and only if
p y 2 p y 1 p y + 1 , y T .
Proposition 2.
Suppose Y = d HLZD ( a , θ , s ) .
  • If s 1 , the sequence { q y } y 1 is monotonically decreasing and the mode is y = 1 .
  • If s < 1 , Y is strongly unimodal.
Proof. 
Similarly to what stated in Section 2.3 of [20], we have
q y q y 1 = θ 1 1 a + y s + 1 y = 2 , 3 , .
Since θ ( 0 , 1 ) , a > 1 and s 1 , the rhs of (20) is always between 0 and 1 , thus (i) follows. For (ii) we have to prove that { q y } y 1 is log-concave, that is it satisfies (19). Using (13), (19) reduces to ( 1 ( y + a ) 2 ) s + 1 1 , which is always true if s < 1 .    □

3.3. Convolution

The family of HLZ distributions is not closed under convolution. Nevertheless, as a subclass of MPS distributions, the HLZD convolution still returns a MPSD. Indeed we will prove that the family of MPS distributions is closed under convolution.
Theorem 1.
If Y 1 , , Y j are independent r.v.’s identically distributed to Y = d HLZD ( a , θ , s ) , then Y 1 + + Y j = d MPSD ( a j , g , f j ) with f j ( θ ) = [ T ( θ , s , a ) ] j and
a j ( y ) = j ! y ! π y , l ( π ) = j d π ( a π ) s + 1 with a π = ( a + 1 ) r 1 ( a + 2 ) r 2
and d π given in (3).
Proof. 
Observe that if Y 1 , , Y j are r.v.’s i.i.d. to Y = d MPSD ( a , g , f ) , then Y 1 + + Y j = d MPSD ( a j , g , f j ) with f j ( θ ) = f j ( θ ) and
a j ( y ) = j ! y ! B y , j [ a ( 1 ) , , a ( y j + 1 ) ] , y T j 0 , y T j
with T j = { y 1 + + y j N | y 1 , , y j T } . Indeed in (18), set a ( y ) = 0 if y T and consider the sequence { p y } y 1 such that p y = 0 if y T . By using Lemma 1 in [21], we have
P ( Y 1 + + Y j = y ) = j ! y ! B y , j ( p 1 , , p y j + 1 )
where { B y , j } are the partial exponential Bell polynomials (1). From (23) with p i replaced by a ( i ) g ( θ ) i / f ( θ ) for i = 1 , , y j + 1 and using (4) we have
P ( Y 1 + + Y j = y ) = j ! y ! g ( θ ) y f j ( θ ) B y , j [ a ( 1 ) , , a ( y j + 1 ) ] .
Thus Y 1 + + Y j = d MPSD ( a j , g , f j ) with f j ( θ ) = f j ( θ ) and a j ( y ) given in (22). From (24) note that a j ( y ) = 0 if 1 , 2 , , y j + 1 T . By replacing g ( θ ) = θ , f ( θ ) = T ( θ , s , a ) and a ( k ) = ( k + a ) ( s + 1 ) for k = 1 , , y j + 1 in (24) we have
a j ( y ) = j ! y ! B y , j ( a + 1 ) ( s + 1 ) , , ( a + y j + 1 ) ( s + 1 ) y = 1 , 2 , .
The result follows after some manipulations, rewriting the partial Bell exponential polynomials as in (2).    □

3.4. Maximum Likelihood Estimation

Consider a vector y = ( y 1 , , y n ) of independent observations of Y = d HLZD ( a , θ , s ) . The maximum likelihood estimation (MLE) of ( θ , s , a ) is
( θ ^ , s ^ , a ^ ) = argmax ( θ , s , a ) Θ n ( θ , s , a , y ) ,
with Θ = ( 0 , 1 ) × ( , + ) × ( 1 , ) , n ( θ , s , a , y ) = ln L n ( θ , s , a , y ) the log-likelihood function and
L n ( θ , s , a , y ) = i = 1 n P ( Y = y i ) .
The MLE of the HLZD parameters ( θ , s , a ) has been studied by Gupta in [20]. He showed that the three likelihood equations arising from maximizing the log-likelihood correspond to the equations of the method of moments. In particular we have
i = 1 n y i n = E [ Y ] , i = 1 n log ( a + y i ) n = E [ log ( a + Y ) ] , i = 1 n 1 n ( a + y i ) = E 1 a + Y .
Unfortunately, closed form solutions of the above equations are not available and also the moments E [ log ( a + Y ) ] and E [ 1 / ( a + Y ) ] must be numerically approximated. As noted in [20], the likelihood equations may be solved by standard numerical methods to obtain the MLE. However, it is well known that this does not guarantee that global maxima of the likelihood have been achieved. In order to avoid this problem, a global optimization method can be employed to solve (25). The global optimization method takes advantage of the bounds of the parameters. More specifically, the MLE of the parameters can be obtained through a global optimization algorithm known as Simulated Annealing [22]. Simulated annealing is a stochastic global optimisation technique applicable to a wide range of discrete and continuous variable problems. It makes use of Markov Chain Monte Carlo samplers, to provide a means to escape local optima by allowing moves which worsen the objective function, with the aim of finding a global optimum. Technical details can be found in [22], a variant of which is the algorithm implemented in the Optim function in the base Stats R-package.

4. The Poisson-Stopped Hurwitz-Lerch Zeta Distribution

Definition 3.
A discrete random variable X = d PSHLZD ( λ , a , θ , s ) if its pgf is
G X ( t ) = exp λ θ Φ ( z θ , s + 1 , a + 1 ) Φ ( θ , s , a ) 1 λ > 0 ,
where Φ is the Lerch Transcendent function (15).
According to Definition 3, X = d PSHLZD ( λ , a , θ , s ) takes non-negative integer values and belongs to the class of generalized r.v.’s [23]. Indeed given two independent r.v.’s Z and Y , with pgf G Z ( t ) and G Y ( t ) respectively, the generalized r.v. X has pgf
G X ( t ) = G Z [ G Y ( t ) ] .
The composition (26) matches (27) when Y = d HLZD ( a , θ , s ) and Z is a Poisson r.v. (PS) of parameter λ > 0 , independent of Y , since G Z ( t ) = exp [ λ ( t 1 ) ] .
In the following we analyse in detail the properties of the PSHLZD using the complete exponential Bell polynomials. Some of the properties given in [6] will also be briefly recalled.
Proposition 3.
If X = d PSHLZD ( λ , a , θ , s ) then
p x : = P ( X = x ) = e λ , x = 0 , e λ x ! B x ( λ q 1 , , λ x ! q x ) , x = 1 , 2 ,
where B x is the complete exponential Bell polynomial (5) of degree x .
Proof. 
Observe that G Y ( 0 ) = 0 and G Y ( t ) = y 1 y ! q y t y / y ! . The result follows from (7) with z 0 = 0 and h z ( t ) = G Y ( t ) , since from (26) we have
exp ( λ ) exp [ λ G Y ( t ) ] = x 0 t x x ! e λ B x ( λ 1 ! q 1 , , λ x ! q x ) .
Corollary 1.
If X = d PSHLZD ( λ , a , θ , s ) then p 0 = P ( X = 0 ) = e λ and
p x : = P ( X = x ) = θ x e λ π x λ T ( θ , s , a ) l ( π ) ( a π ) s + 1 m π ! x = 1 , 2 , .
where the sum is over all the partitions π x , m π ! = r 1 ! r 2 ! and a π is given in (21).
Proof. 
From (28), using (8) and (3), we have
p x = e λ x ! π x x ! ( 1 ! ) r 1 r 1 ! ( 2 ! ) r 2 r 2 ! ( λ 1 ! q 1 ) r 1 ( λ 2 ! q 2 ) r 2 = e λ π x λ r 1 + r 2 + r 1 ! r 2 ! [ ( a + 1 ) ( s + 1 ) ] r 1 [ ( a + 2 ) ( s + 1 ) ] r 2 θ r 1 + 2 r 2 + T ( θ , s , a ) r 1 + r 2 +
by which (29) follows observing that r 1 + r 2 + = l ( π ) and r 1 + 2 r 2 + = x .
As a corollary of Proposition 3 and recursion (6), the sequence { p x } in (28) satisfies the following equations.
Corollary 2.
If X = d PSHLZD ( λ , a , θ , s ) then
p x + 1 = λ x + 1 j = 0 x ( j + 1 ) q j + 1 p x j , x = 1 , 2 , with p 0 = e λ .
Proof. 
The result follows using (6) since we have
p x + 1 = e λ ( x + 1 ) ! B x + 1 ( λ q 1 , , λ ( x + 1 ) ! q x + 1 ) = e λ ( x + 1 ) ! j = 0 x x j B x j ( λ q 1 , , λ ( x j ) ! q x j ) ( j + 1 ) ! q j + 1 = e λ ( x + 1 ) ! j = 0 x x ! ( x j ) ! j ! p x j ( x j ) ! e λ ( j + 1 ) ! q j + 1 .
The PSHLZD is unimodal if s 0 and 1 < a 0 ([6], Property 1).

4.1. Log-Concavity

Under suitable conditions, the PSHLZD is log-concave.
Proposition 4.
If X = d PSHLZD ( λ , θ , a , s ) and s 1 , then X has a log-concave cumulative distribution function (cdf), that is
[ P ( X x ) ] 2 P ( X x 1 ) P ( X x + 1 ) , x = 0 , 1 , 2 , .
Proof. 
According to ([24], Theorem 1), a random sum i = 1 Z Y i of i.i.d. r.v.’s has a log-concave cdf if Z is strongly unimodal and the distribution of { Y i } has a decreasing pdf. Thus, the result follows as X = d i = 1 Z Y i with Z = d PS ( λ ) , which has a log-concave pdf (strongly unimodal), and Y = d HLZD ( θ , a , s ) with a decreasing pdf when s 1 (see Proposition 2). □
Proposition 4 gives a sufficient condition to get cdf log-concavity. A different way is to consider the sequence { p x } . Indeed, if X has a log-concave pdf (19), then its cdf is also log-concave [24]. In the more general setting of generalized r.v.’s, X has a log-concave pdf if and only if the sequence
p x : = P ( X = x ) = 1 x ! B x ( 1 ! q ˜ 1 , , x ! q ˜ x ; 1 ! q 1 , , x ! q x ) x = 1 , 2 ,
is log-concave with p 0 = P ( X = 0 ) = G Z [ G Y ( 0 ) ] and q ˜ x = P ( Z = x ) , q x = P ( Y = x ) . Equation (30) follows from Equation (2.3) in [23] using the general partition polynomials (8). When Z = d P S ( λ ) a necessary and sufficient condition to recover strong unimodality is related to the magnitude of q 1 and q 2 , as the following theorem shows.
Theorem 2.
If X is a generalized r.v. with Y strongly unimodal and Z = d P S ( λ ) , then X is strongly unimodal if and only if λ q 1 2 q 2 .
Note that a similar result is proved in ([25], Theorem 4). We provide a different proof using the following lemma.
Lemma 1.
If { z j } j 1 [ 0 , ) is a log-concave sequence, then the sequence { 1 n ! B n ( z 1 , , z n ) } n 1 is log-concave if and only if z 1 2 z 2 , with { B n } given in (5).
Proof. 
If { z j } j 0 with z 0 = 1 is a log-concave sequence of non-negative real numbers and the sequence { a ( n ) } n 0 is defined by
n = 0 a ( n ) n ! y n = exp j = 1 z j j ! y j
then the sequence { a ( n ) n ! } n is log-concave [26]. Equation (31) parallels (7). Therefore, the sequence { 1 n ! B n ( z 1 , , z n ) } n 1 results as log-concave if the sequence { z j } j 0 is log-concave. Note that for j 2 we have
z j 2 [ ( j 1 ) ! ] 2 z j 1 ( j 2 ) ! z j + 1 j ! ,
which easily reduces to j z j 2 ( j 1 ) z j 1 z j + 1 always satisfied when { z j } j 1 is log-concave. Now let j = 1 . We have { z j } j 0 is log-concave if and only if z 1 2 z 2 and the result follows. □
Proof 
(Proof of Theorem 2). Following the same arguments of Proposition 3, for a generalized r.v. with Z = d P S ( λ ) , (30) reduces to
P ( X = x ) = e λ x ! B x ( λ 1 ! q 1 , , λ x ! q x )
with q x = P ( Y = x ) for x = 1 , 2 , . The sequence { e λ x ! B x ( λ 1 ! q 1 , , λ x ! q x ) } x 1 is log-concave if and only if the sequence { 1 x ! B x ( λ 1 ! q 1 , , λ x ! q x ) } x 1 is log-concave. The result follows using Lemma 1. □
Corollary 3.
If s < 1 , X = d PSHLZD ( λ , θ , a , s ) is strongly unimodal if and only if λ q 1 2 q 2 .

4.2. Moments and Cumulants

PSHLZD moments and cumulants have closed form expressions in terms of moments of Y = d HLZD ( a , θ , s ) .
Proposition 5.
If X = d PSHLZD ( λ , a , θ , s ) then
μ k : = E [ X k ] = B k ( λ ξ 1 , , λ ξ k ) , k = 1 , 2 , ,
with B k the k-th complete exponential Bell polynomial (5) and ξ 1 , , ξ k the first k moments of Y = d HLZD ( a , θ , s ) given in (16).
Proof. 
If M X ( t ) and M Y ( t ) are the mgf’s of X = d PSHLZD ( λ , a , θ , s ) and Y = d HLZD ( a , θ , s ) respectively, then
M X ( t ) = G X ( e t ) = e λ [ G Y ( e t ) 1 ] = e λ [ M Y ( t ) 1 ]
from (27). Equation (32) follows as the rhs of (33) can be written as (7), with h z ( t ) = λ M Y ( t ) and z 0 = λ .
Remark 1.
Taking into account (33), if X = d PSHLZD ( λ , a , θ , s ) then X = d Y 1 + + Y Z with Y = d HLZD ( a , θ , s ) and Z = d P S ( λ ) , that is X is a compound Poisson r.v. Therefore the PSHLZD is an infinitely divisible distribution [27].
Moments (32) can be explicited written using (9). A straightforward corollary of recursion (6) is the following.
Corollary 4.
μ k + 1 = λ j = 0 k k j μ k j ξ j + 1 , k = 1 , 2 , .
If μ k E [ ( X μ 1 ) k ] denotes the k-th central moment of X = d PSHLZD ( λ , a , θ , s ) then
μ k = k = 0 n n k ( λ ξ 1 ) n k B k ( λ ξ 1 , , λ ξ k ) , k = 1 , 2 ,
Proposition 6.
If X = d PSHLZD ( λ , a , θ , s ) then
( μ ) k : = E [ X ( X 1 ) ( X k + 1 ) ] = B k ( λ ( ξ ) 1 , , λ ( ξ ) k ) , k = 1 , 2 ,
where ( ξ ) 1 , , ( ξ ) k are the first k factorial moments of Y = d HLZD ( a , θ , s ) given in (17).
Proof. 
Let us recall that, if Q X ( t ) is the factorial mgf of { ( μ ) k } , then Q X ( t ) = G X ( t + 1 ) with G X the pgf of X . Therefore we have
Q X ( t ) = G X ( t + 1 ) = exp λ [ G Y ( t + 1 ) 1 ] = exp λ [ Q Y ( t ) 1 ] ,
with Q Y ( t ) the generating function of the factorial moments { ( ξ ) k } . Equation (34) follows as the rhs of (35) can be written as (7), with z 0 = λ and h z ( t ) = λ Q Y ( t ) .
Proposition 7.
If κ n ( X ) is the n-th cumulant of X = d PSHLZD ( λ , a , θ , s ) then κ n ( X ) = λ ξ n , for n = 1 , 2 , where ξ n is the n-th moment of Y = d HLZD ( a , θ , s ) given in (16).
Proof. 
The result follows since
log M Y ( t ) = log [ e λ ( M X ( t ) 1 ) ] = λ [ M x ( t ) 1 ] = n 1 t n n ! λ E [ X k ] .

4.3. Maximum Likelihood Estimation

Suppose to have x = ( x 1 , , x n ) independent observations of X = d PSHLZD ( λ , a , θ , s ) . The MLE of ( λ , θ , s , a ) is
( λ ^ , θ ^ , s ^ , a ^ ) = argmax ( λ , θ , s , a ) Θ n ( λ , θ , s , a , x ) ,
with Θ = ( 0 , ) × ( 0 , 1 ) × ( , + ) × ( 1 , ) , n ( λ , θ , s , a , x ) = ln L n ( λ , θ , s , a , x ) the log-likelihood function and
L n ( λ , θ , s , a , x ) = i = 1 n P ( X = x i ) .
The MLE of the PSHLZD parameters in this case must be directly tackled with the global optimization method described in Section 3.4, since n ( λ , θ , s , a ) is not analytically tractable referring to (29).

5. The One Inflated Hurwitz-Lerch Zeta Distribution

Definition 4.
A discrete random variable Z = d OIHLZD ( p , a , θ , s ) if
P ( Z = 1 ) = p + ( 1 p ) P ( Y = 1 ) , P ( Z = x ) = ( 1 p ) P ( Y = x ) , x = 2 , 3 ,
with p [ 0 , 1 ] and Y = d HLZD ( a , θ , s ) .
This definition parallels the definition of the Zero Inflated Modified Power Series Distribution given by Gupta [28]. If G Z ( t ) denotes the pgf of Z = d OIHLZD ( p , a , θ , s ) then
G Z ( t ) = p t + ( 1 p ) G Y ( t )
and the HLZD is retrieved by setting p = 0 .

5.1. Moments and Cumulants

If Z = d OIHLZD ( p , a , θ , s ) then M Z ( t ) = G Z ( e t ) = p e t + ( 1 p ) G Y ( e t ) = p e t + ( 1 p ) M Y ( t ) from (37). Thus
ν k : = E [ Z k ] = p + ( 1 p ) ξ k k = 0 , 1 ,
with ξ k the k-th moment of Y = d HLZD ( a , θ , s ) given in (16). For example, we have E [ Z ] = p + ( 1 p ) E [ Y ] and Var [ Z ] = ( 1 p ) Var [ Y ] + p ( 1 + E [ Y ] 2 2 E [ Y ] ) .
Similarly, if Q Z ( t ) is the factorial mgf of Z = d OIHLZD ( p , a , θ , s ) , since Q Z ( t ) = G Z ( t + 1 ) = p ( t + 1 ) + ( 1 p ) G Y ( t + 1 ) = p ( t + 1 ) + ( 1 p ) Q Y ( t ) , with Q Y ( t ) the factorial mgf of Y = d HLZD ( a , θ , s ) , then
( ν ) k : = E [ Z ( Z 1 ) ( Z k + 1 ) ] = p + ( 1 p ) ( ξ ) 1 k = 1 ( 1 p ) ( ξ ) k k = 2 , 3 ,
with ( ξ ) k the k-th factorial moment of Y = d HLZD ( a , θ , s ) given in (17).
OIHLZD cumulants are κ n ( Z ) = L n ( ν 1 , , ν n ) , n = 1 , 2 , with { ν j } moments of Z = d OIHLZD ( p , a , θ , s ) given in (38), and L n the n-th logarithmic polynomial (12).

5.2. Maximum Likelihood Estimation

To estimate the OIHLZD parameters using the MLE, let us first rewrite (36) using (18), that is
P ( Z = 1 ) = 1 w , P ( Z = x ) = ( 1 p ) a ( x ) g ( θ ) x f ( θ ) , x = 1 , 2
where w = ( 1 p ) [ 1 P ( Y = 1 ) ] , g ( θ ) = θ , a ( x ) = ( a + x ) ( s + 1 ) and f ( θ ) = T ( θ , a , s ) . Rewrite (39) as
P ( Z = 1 ) = 1 w P ( Z = x ) = w P ( W = x )
where W has a One Truncated Hurwitz-Lerch Zeta Distribution (OTHLZD) [29], that is
P ( W = x ) : = 1 1 a ( 1 ) g ( θ ) f ( θ ) a ( x ) g ( θ ) x f ( θ ) , x = 1 , 2
Suppose z = ( z 1 , , z n ) is a vector of independent observations of Z = d OIHLZD ( p , a , θ , s ) and l ( θ , a , s , w , z ) = ln L n ( θ , a , s , w , z ) is the log-likelihood function with
L n ( θ , a , s , w , z ) = i = 1 n P ( Z = z i ) .
If n j is the number of times the integer j appears in the vector z for j = 1 , 2 , , then the log-likelihood l ( θ , a , s , w , z ) can be written as
l ( θ , a , s , w , z ) = n 1 log ( 1 w ) + ( n n 1 ) log ( w ) + j = 2 n j log P ( Y = j ) 1 P ( Y = 1 ) .
Now set
l 1 ( w , z ) = n 1 log ( 1 w ) + ( n n 1 ) log ( w )
and
l 2 ( θ , a , s , z ) = j = 2 n j P ( Y = j ) 1 P ( Y = 1 ) .
From (41) and (42), the parameters ( θ , a , s , w ) can be estimated separately, that is the estimation w ^ can be recovered from l 1 ( w , z ) and the estimations ( θ ^ , a ^ , s ^ ) from l 2 ( θ , a , s , z ) . The latter ones give the MLE of the parameters of W = d OTHLZD ( θ , s , a ) in (40) using the vector z restricted to the observations which are greater than 1 . As a consequence, the estimation p ^ of p can be recovered from w ^ as
p ^ = 1 w ^ T ( θ ^ , a ^ , s ^ ) T ( θ ^ , a ^ , s ^ ) a ( 1 ) θ ^ .

6. Data-Fitting

6.1. Rainfall Depths and Interarrival Times

With rainfall depth we indicate to what depth liquid precipitation would cover a horizontal surface in an observation period if nothing could drain, evaporate or percolate from this surface. Let a time series of rainfall data be defined as h = { h 1 , , h n } , where h (mm) is the rainfall depth recorded at a fixed uniform unit τ of time (e.g., a day). A day k is considered rainy if the rainfall depth h k h * , where h * is a fixed rainfall threshold. The sub-series of h of the rainy days can be defined as the event series E = { t 1 , , t n r } , where n r n is an integer multiple of the time-scale τ . The sequence built with the times elapsed between each element of E (except the first one) and the immediately preceding one is defined as the interarrival time series I T = { I T 2 , , I T n r } . In order to select an appropriate distribution for I T , some statistical characteristics usually observed in I T samples have to be considered: very high variance and skewness, relatively high frequency associated to the observation I T = 1 , monotonically decreasing frequencies with a slowly decaying tail and a drop in the passage from the frequency at I T = 1 to the one at I T = 2 . The HLZD in (13) has been fitted to rain I T in [5] for stations in Sicily and in [30] for stations in Piedmont, with good results. However, it has not yet been considered for rainfall depths. Recall that in the following we assume to model rainfall depths with a discrete r.v.

6.2. The Data

In this paper, the I T series analyzed were obtained from the recorded rainfall observations, using the rainfall threshold h * = 1 mm, which is the conventional threshold stated by the World Meteorological Organization in order to discriminate between rainy and non rainy days. This dataset has not been previously considered in the literature and consists of both I T and h measured over 70 years at the following five stations: Floresta, Trapani, Torino, Oxford, Ceva. They were chosen in order to represent different climates from the rainfall characteristics point of view. Floresta and Trapani represent the Mediterranean climate with a very wet and a very dry situation respectively. For both stations, the rainfall is concentrated in the colder part of the year, as typical of the Mediterranean climate. Torino and Ceva are more continental, but Ceva is more influenced by the Ligurian sea. Therefore, Torino has its maximum rain in Spring, while that of Ceva is in Autumn, because of the heating of the sea in the Summer. Finally, Oxford is a northern Europe station with rainfall homogeneously spread across the whole year. The recordings start in 1947 and end in 2017, for a total of 70 years. Moreover, the time series were further subdivided. Thus for each station we considered for a total of 33 samples. Note that we did not consider wet and dry seasons for Oxford station due to its climate.
More specifically let station_name∈ {Floresta, Trapani, Torino, Oxford, Ceva} and season_name∈ {wet, dry, spring, summer, autumn, winter}. Then the samples tagged with station_name year span the whole length of the series for the station_name station, while the samples tagged with station_name season_name are the union of all the season_name seasons in the whole time series for the station_name station, omitting all the other seasons from the dataset. The MLE was used to fit the HLZD (Section 3), the PSHLZD (Section 4) and the OIHLZD (Section 5) to the dataset (Note that the PSHLZD has support k = 0 , 1 , and the r.v. I T naturally has support k = 1 , 2 , so we had to consider the shifted r.v. I T = I T 1 .). In all cases, the MLE has been tackled with the method described in Section 3.4. The addressed global optimization procedure was further simplified by the previously mentioned statistical characteristics of the data allowing to work on a subset of the parameter space Θ .
  • the whole time series, without subdividing the different seasons;
  • all the wet seasons and all the dry seasons;
  • the standard meteorological seasons,

6.3. Results

In the following we summarize the results of the distribution fitting for I T and h data. The fitting was satisfactory for both the PSHLZD and the HLZD. The assessment of the goodness-of-fit was obtained by following the methodology suggested by [31]. In the case of long tailed distributions, the goodness-of-fit through the classical χ 2 test might be biased, because if there are several small classes, strong asymmetry might occur [31] and some problems of inaccuracy might appear if the classes are grouped [32]. The alternative procedure used to test the goodness-of-fit relies on Monte Carlo simulation to numerically reconstruct the null hypothesis of the χ 2 test to compute the p-values [33].
To further inspect the differences between the distributions, we have measured the fitting errors whose magnitude is strictly related to the discrepancy between the empirical frequencies and the fitted ones. Since many empirical frequencies are zero (in the tail), the cdf has been considered. In particular we considered the mean absolute error (MAE) and the mean relative absolute error (MRAE). Let us recall that, if x = ( x 1 , , x n ) is an ordered sample, then MAE ( x ) = j = 1 n 1 n | F N ( x j ) F ^ ( x j ) | and MRAE ( x ) = j = 1 n 1 n | F n ( x j ) F ^ ( x j ) | / F n ( x j ) with F n the empirical cdf and F ^ the fitted cdf.

6.3.1. Interarrival Times

We have compared the fitted PSHLZD with the fitted HLZD and the fitted PSHLZD with the fitted OIHLZD. To summarize the results, we have selected 4 of the 33 available samples since they have been considered particularly meaningful with respect to the whole dataset. The selected samples were Floresta Summer, Trapani Wet, Trapani Dry and Torino Winter.
Figure 1 is an example of I T empirical frequencies: they usually range from a high peak located at I T = 1 to a multitude of rather smaller values in the slow decaying tails. Therefore, to perform comparisons, a log-log scale for all the plots has been adopted.
Figure 2 shows plots of the fitted PSHLZD (solid line) and HLZD (dotted line) compared with the empirical frequencies (dot line) for the 4 selected samples. The fitting in both cases is very good. In particular, in the cases of Floresta Summer, Torino Winter and Trapani Dry the PSHLZD succeeds in fitting the drop from I T = 1 to I T = 2 whereas the HLZD fails. This happens in the drier periods, where this drop is more prominent.
Moreover, Figure 1 shows the dominance of the frequencies corresponding to I T = 1 and I T = 2 , which are particularly meaningful in hydrology. Figure 3 shows the plots of MAE and MRAE obtained by comparing the fitted cdf’s of the PSHLZD (circle) and the HLZD (triangle) with the empirical I T cdf’s. Note that the MAE and the MRAE are in general lower for the PSHLZD. Due to the dominance of the frequency corresponding to I T = 1 , we explored modelling I T with the OIHLZD for all the samples. In all cases, the fitted OIHLZD and the PSHLZD one have minimal differences and are almost indistinguishable (see Figure 4 for an example), confirming the great flexibility of the latter distribution.
To conclude the validation analysis, we compared sample means and sample variances with the same theoretical moments of the HLZD and the PSHLZD computed in Section 3 and Section 4 respectively. In Table 1, we show the results for the 4 selected samples. In all cases, the fitted distributions agree with the sample means. For the variances, the PSHLZD performs better in many cases. In Table 1, an exception is Trapani Wet because the data are highly dispersed.

6.3.2. Rainfall Depths

In this section, we summarize the fitting of the rainfall depth time series using both the PSHLZD and the HLZD. We omit the comparison with the OIHLZD since this distribution does not add more insights on the fitting nor what happens for the I T datasets.
Figure 5 shows again an empirical frequency histogram ranging from a high peak in h = 1 to a multitude of rather smaller values in the slowly decaying tails. As in the previous section, we employed a log-log scale for all the plots. The selected samples were Ceva Winter, Torino Winter, Floresta Dry and Trapani Summer.
In Figure 6 we have plotted the fitted PSHLZD and HLZD compared with the empirical frequencies. As with I T samples, the fitting is very good, even better that in the I T case. Moreover there is less difference between the performances of the PSHLZD and the HLZD.
Figure 7 shows the plots of the MAE and the MRAE obtained by comparing the fitted cdf’s of the PSHLZD (circle) and the HLZD (triangle) with the empirical h cdf’s. Even though both errors are smaller for the PSHLZD, there is less difference between the two distributions and they are generally lower than for the I T case.

7. Conclusions

The first part of this paper focuses on a class of discrete distributions useful to describe very high one counts and long tails. We have reviewed the main properties using the combinatorics of exponential Bell polynomials. This device has permitted the derivation of closed form expressions for the pdf’s and their convolutions, as well as moments and cumulants. Moreover, new results on log-concavity have been presented. We have also considered the OIHLZD to compare its features with the HLZD and the PSHLZD. This deep analysis was aimed of investigating how to use these models to find a better fit for rainfall data. Indeed, the PSHLZD and the HLZD were fitted on Interarrival Times I T and rainfall depths h data coming from 5 different stations, which composed a dataset never previously analyzed in the literature. The h data were treated as observations of a discrete r.v., which is not the usual practice in the literature, but seems reasonable when taking into account how they are measured. The fitting was performed with the classical MLE method, but the likelihood was maximized using the Simulated Annealing procedure, which turns out to be fundamental since there are no closed forms of the likelihood equations. The fit was very good for both distributions, with the PSHLZD performing slightly better than the HLZD. This mostly happens for the I T data. Moreover, the PSHLZD was also able to replicate the fit of the OIHLZD further validating its flexibility.
From the modelling point of view, let us underline two final remarks. Firstly, the fit was excellent for both the I T and the h data, suggesting that the PSHLZD can be proposed as a general framework in rainfall modelling. Secondly, it is noteworthy to underline that these models capture the variability of rainfall stochastic phenomena, even though the 5 considered stations represent very different climates: a case study not yet considered in the literature that deals with previous applications of HLZD. Future works will consider modelling the dependence (inter-correlation) between I T and h. Given the remarkable performance of these distribution families in the univariate modelling, a first step would be to consider bivariate modified power series distributions [34] and the methods to estimate their parameters on a rainfall time series. This is in the agenda for our future developments.

Author Contributions

Conceptualization, C.A., G.B., S.F., E.D.N. and T.M.; methodology, E.D.N. and T.M.; software, T.M.; validation, E.D.N. and T.M.; formal analysis, E.D.N. and T.M.; data curation, C.A., G.B. and S.F.; writing—original draft preparation, E.D.N. and T.M.; writing—review and editing, C.A., G.B., S.F., E.D.N. and T.M.; funding acquisition, S.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by MIUR—Dipartimento di Eccellenza- DIST department funds, and by the “PRIN MIUR 2017SL7ABC_005 WATZON Project”.

Data Availability Statement

Data are available upon request to the Società Italiana di Meteorologia.

Acknowledgments

The authors would like to thank the anonymous referees and the associate editor for carefully reading this manuscript and giving valuable comments to improve the previous version of this paper. The authors would like to thank Nicholas Howden of University of Bristol and Daniele Cat Berro and Luca Mercalli of Società Meteorologica Italiana to provide the English and North-Italy rainfall time series. The authors would like to thank the “Risk Responsible Resilience Interdepartmental Centre” (R3C).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Serinaldi, F. A multisite daily rainfall generator driven by bivariate copula-based mixed distributions. J. Geophys. Res. Atmos. 2009, 114, D10103. [Google Scholar] [CrossRef]
  2. Davis, A.; Wiscombe, W.; Cahalan, R.; Marshak, A. Multifractal characterizations of nonstationary and intermittency in geophysical fields: Observed, retrieved, or simulated. J. Geophys. Res. 1994, 99, 8055–8072. [Google Scholar] [CrossRef]
  3. Lawrance, A.J. Stochastic Modelling of Daily Rainfall Sequences. J. R. Stat. Soc. Ser. C Appl. Stat. 1979, 28, 84. [Google Scholar] [CrossRef]
  4. Chatfield, C. Wet and dry spells. Weather 1966, 21, 308–310. [Google Scholar] [CrossRef]
  5. Agnese, C.; Baiamonte, G.; Cammalleri, C. Modelling the occurrence of rainy days under a typical Mediterranean climate. Adv. Water Resour. 2014, 64, 62–76. [Google Scholar] [CrossRef]
  6. Liew, K.W.; Ong, S.H.; Toh, K.K. The Poisson-stopped Hurwitz–Lerch zeta distribution. Commun. Stat.—Theory Methods 2022, 51, 5638–5652. [Google Scholar] [CrossRef]
  7. Liu, M.; Zhang, L.; Fang, Y.; Dong, H. Exact solutions of fractional nonlinear equations by generalized bell polynomials and bilinear method. Therm. Sci. 2021, 25, 1373–1380. [Google Scholar] [CrossRef]
  8. Taghavian, H. The use of partition polynomial series in Laplace inversion of composite functions with applications in fractional calculus. Math. Methods Appl. Sci. 2019, 42, 2169–2189. [Google Scholar] [CrossRef]
  9. Fathizadeh, F.; Kafkoulis, Y.; Marcolli, M. Bell polynomials and Brownian bridge in spectral gravity models on multifractal Robertson–Walker cosmologies. In Annales Henri Poincaré; Springer: Berlin/Heidelberg, Germany, 2020; Volume 21, pp. 1329–1382. [Google Scholar]
  10. Bernardara, P.; De Michele, C.; Rosso, R. A simple model of rain in time: An alternating renewal process of wet and dry states with a fractional (non-Gaussian) rain intensity. Atmos. Res. 2007, 84, 291–301. [Google Scholar] [CrossRef]
  11. Yang, L.; Franzke, C.L.E.; Fu, Z. Power-law behaviour of hourly precipitation intensity and dry spell duration over the United States. Int. J. Climatol. 2020, 40, 2429–2444. [Google Scholar] [CrossRef]
  12. Porporato, A.; Vico, G.; Fay, P.A. Superstatistics of hydro-climatic fluctuations and interannual ecosystem productivity. Geophys. Res. Lett. 2006, 33, L15402. [Google Scholar] [CrossRef]
  13. Foufoula-Georgiou, E.; Lettenmaier, D.P. Continuous-time versus discrete-time point process models for rainfall occurrence series. Water Resour. Res. 1986, 22, 531–542. [Google Scholar] [CrossRef]
  14. Charalambides, C.A. Enumerative Combinatorics; CRC Press Series on Discrete Mathematics and its Applications; Chapman &amp; Hall/CRC: Boca Raton, FL, USA, 2002; Volume 16, p. 609. [Google Scholar]
  15. Di Nardo, E.; Guarino, G.; Senato, D. A unifying framework for k-statistics, polykays and their multivariate generalizations. Bernoulli 2008, 14, 440–468. [Google Scholar] [CrossRef]
  16. Di Nardo, E.; Guarino, G. kStatistics: Unbiased Estimates of Joint Cumulant Products from the Multivariate Faà Di Bruno’s Formula. arXiv 2022, arXiv:2206.15348. [Google Scholar]
  17. Aksenov, S.V.; Savageau, M.A. Some properties of the Lerch family of discrete distributions. arXiv 2005, arXiv:math/0504485. [Google Scholar]
  18. Gupta, R.C. Modified power series distribution and some of its applications. Sankhyā Ser. B 1974, 36, 288–298. [Google Scholar]
  19. Keilson, J.; Gerber, H. Some Results for Discrete Unimodality. J. Am. Stat. Assoc. 1971, 66, 386–389. [Google Scholar] [CrossRef]
  20. Gupta, P.L.; Gupta, R.C.; Ong, S.H.; Srivastava, H. A class of Hurwitz–Lerch Zeta distributions and their applications in reliability. Appl. Math. Comput. 2008, 196, 521–531. [Google Scholar] [CrossRef]
  21. Eger, S. Identities for partial Bell polynomials derived from identities for weighted integer compositions. Aequationes Math. 2016, 90, 299–306. [Google Scholar] [CrossRef]
  22. Bélisle, C.J.P. Convergence theorems for a class of simulated annealing algorithms on Rd. J. Appl. Probab. 1992, 29, 885–895. [Google Scholar] [CrossRef]
  23. Charalambides, C.A. On the generalized discrete distributions and the Bell polynomials. Sankhyā Ser. B 1977, 39, 36–44. [Google Scholar]
  24. Badía, F.G.; Sangüesa, C.; Federgruen, A. Log-concavity of compound distributions with applications in operational and actuarial models. Probab. Eng. Inf. Sci. 2021, 35, 210–235. [Google Scholar] [CrossRef]
  25. Yu, Y. On the entropy of compound distributions on nonnegative integers. IEEE Trans. Inform. Theory 2009, 55, 3645–3650. [Google Scholar] [CrossRef]
  26. Bender, E.A.; Canfield, E.R. Log-concavity and related properties of the cycle index polynomials. J. Comb. Theory Ser. A 1996, 74, 57–70. [Google Scholar] [CrossRef] [Green Version]
  27. Sato, K.i. Lévy processes and infinitely divisible distributions. In Cambridge Studies in Advanced Mathematics; Translated from the 1990 Japanese original, Revised edition of the 1999 English translation; Cambridge University Press: Cambridge, UK, 2013; Volume 68, pp. 14, 521. [Google Scholar]
  28. Gupta, P.L.; Gupta, R.C.; Tripathi, R.C. Inflated modified power series distributions with applications. Commun. Stat.-Theory Methods 1995, 24, 2355–2374. [Google Scholar] [CrossRef]
  29. Conceição, K.S.; Louzada, F.; Andrade, M.G.; Helou, E.S. Zero-modified power series distribution and its Hurdle distribution version. J. Stat. Comput. Simul. 2017, 87, 1842–1862. [Google Scholar] [CrossRef]
  30. Baiamonte, G.; Mercalli, L.; Berro, D.C.; Agnese, C.; Ferraris, S. Modelling the frequency distribution of inter-arrival times from daily precipitation time-series in North-West Italy. Hydrol. Res. 2019, 50, 339–357. [Google Scholar] [CrossRef]
  31. Martínez-Rodríguez, A.M.; Sáez-Castillo, A.; Conde-Sánchez, A. Modelling using an extended Yule distribution. Comput. Stat. Data Anal. 2011, 55, 863–873. [Google Scholar] [CrossRef]
  32. Spierdijk, L.; Voorneveld, M. Superstars without talent? The Yule distribution controversy. Rev. Econ. Stat. 2009, 91, 648–652. [Google Scholar] [CrossRef]
  33. Hope, A.C. A simplified Monte Carlo significance test procedure. J. R. Stat. Soc. Ser. B Methodol. 1968, 30, 582–598. [Google Scholar] [CrossRef]
  34. Shoukri, M.M.; Consul, P.C. Bivariate Modified Power Series Distribution Some Properties, Estimation and Applications. Biom. J. 1982, 24, 787–799. [Google Scholar] [CrossRef]
Figure 1. Histogram of the empirical I T frequency for the Trapani station over the whole year. The range is up to 133 . The mode is I T = 1 with relative frequency 0.44 . The mean and the standard deviation are 5.89 and 11.97 respectively.
Figure 1. Histogram of the empirical I T frequency for the Trapani station over the whole year. The range is up to 133 . The mode is I T = 1 with relative frequency 0.44 . The mean and the standard deviation are 5.89 and 11.97 respectively.
Fractalfract 06 00509 g001
Figure 2. Log-log plots of the fitted HLZD (green dotted line), the fitted PSHLZD (black solid line) and the I T empirical frequencies (red dot line) for the 4 selected samples Floresta Summer, Torino Winter, Trapani Wet and Trapani Dry.
Figure 2. Log-log plots of the fitted HLZD (green dotted line), the fitted PSHLZD (black solid line) and the I T empirical frequencies (red dot line) for the 4 selected samples Floresta Summer, Torino Winter, Trapani Wet and Trapani Dry.
Fractalfract 06 00509 g002
Figure 3. Dot plots of MAE and MRAE taking as reference the cdf of the PSHLZD (black circle) and of the HLZD (red triangle) for all the I T samples. The maximum MAE as well as the mean MAE are given in the top left for both the fitting distributions.
Figure 3. Dot plots of MAE and MRAE taking as reference the cdf of the PSHLZD (black circle) and of the HLZD (red triangle) for all the I T samples. The maximum MAE as well as the mean MAE are given in the top left for both the fitting distributions.
Fractalfract 06 00509 g003
Figure 4. (a), the fitted PSHLZD (black solid line) is plotted together with the fitted HLZD (green dotted line) and the I T empirical frequencies (red dot line) for the sample of Trapani Dry. (b) the fitted OIHLZD (black solid line) is plotted together with the fitted HLZD (green dotted line) and the I T empirical frequencies (red dot line) for the same sample.
Figure 4. (a), the fitted PSHLZD (black solid line) is plotted together with the fitted HLZD (green dotted line) and the I T empirical frequencies (red dot line) for the sample of Trapani Dry. (b) the fitted OIHLZD (black solid line) is plotted together with the fitted HLZD (green dotted line) and the I T empirical frequencies (red dot line) for the same sample.
Fractalfract 06 00509 g004
Figure 5. Plot of the empirical h frequency histogram for the Trapani station over the whole year. 111 is the maximum registered depth. The mode is h = 1 with relative frequency 0.22 . The mean and the standard deviation are 6.81 and 9.16 respectively.
Figure 5. Plot of the empirical h frequency histogram for the Trapani station over the whole year. 111 is the maximum registered depth. The mode is h = 1 with relative frequency 0.22 . The mean and the standard deviation are 6.81 and 9.16 respectively.
Fractalfract 06 00509 g005
Figure 6. Log-log plots of the fitted HLZD (green dotted line), the fitted PSHLZD (black solid line) and the h empirical frequencies (red dot line) for the 4 selected samples Ceva Winter, Torino Winter, Floresta Dry and Trapani Summer.
Figure 6. Log-log plots of the fitted HLZD (green dotted line), the fitted PSHLZD (black solid line) and the h empirical frequencies (red dot line) for the 4 selected samples Ceva Winter, Torino Winter, Floresta Dry and Trapani Summer.
Fractalfract 06 00509 g006
Figure 7. Dot plots of MAE and MRAE taking as reference the cdf of the PSHLZD (black circle) and of the HLZD (red triangle) for all the h samples. The maximum MAE as well as the mean MAE are given in the top left for both the fitting distributions.
Figure 7. Dot plots of MAE and MRAE taking as reference the cdf of the PSHLZD (black circle) and of the HLZD (red triangle) for all the h samples. The maximum MAE as well as the mean MAE are given in the top left for both the fitting distributions.
Fractalfract 06 00509 g007
Table 1. The sample means and the sample variances for the 4 selected samples are given in the first column. The means and the variances of the fitted HLZD and of the fitted PSHLZD are given in the second and in the third column respectively.
Table 1. The sample means and the sample variances for the 4 selected samples are given in the first column. The means and the variances of the fitted HLZD and of the fitted PSHLZD are given in the second and in the third column respectively.
Trapani Dry
SampleHLZPSHLZ
Mean3.7583.7583.758
Var26.17326.66826.588
Trapani Wet
SampleHLZPSHLZ
Mean12.78812.78812.788
Var461.938461.168529.513
Floresta Summer
SampleHLZPSHLZ
Mean8.938.938.93
Var161.344189.699153.808
Torino Winter
SampleHLZDPSHLZD
Mean6.5946.5946.594
Var99.561130.709107.723
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Agnese, C.; Baiamonte, G.; Di Nardo, E.; Ferraris, S.; Martini, T. Modelling the Frequency of Interarrival Times and Rainfall Depths with the Poisson Hurwitz-Lerch Zeta Distribution. Fractal Fract. 2022, 6, 509. https://doi.org/10.3390/fractalfract6090509

AMA Style

Agnese C, Baiamonte G, Di Nardo E, Ferraris S, Martini T. Modelling the Frequency of Interarrival Times and Rainfall Depths with the Poisson Hurwitz-Lerch Zeta Distribution. Fractal and Fractional. 2022; 6(9):509. https://doi.org/10.3390/fractalfract6090509

Chicago/Turabian Style

Agnese, Carmelo, Giorgio Baiamonte, Elvira Di Nardo, Stefano Ferraris, and Tommaso Martini. 2022. "Modelling the Frequency of Interarrival Times and Rainfall Depths with the Poisson Hurwitz-Lerch Zeta Distribution" Fractal and Fractional 6, no. 9: 509. https://doi.org/10.3390/fractalfract6090509

APA Style

Agnese, C., Baiamonte, G., Di Nardo, E., Ferraris, S., & Martini, T. (2022). Modelling the Frequency of Interarrival Times and Rainfall Depths with the Poisson Hurwitz-Lerch Zeta Distribution. Fractal and Fractional, 6(9), 509. https://doi.org/10.3390/fractalfract6090509

Article Metrics

Back to TopTop