
A Stochastic Weather Model for Drought Derivatives in Arid Regions: A Case Study in Qatar

Department of Mathematics, Trent University, Peterborough, ON K9L 0G2, Canada
Author to whom correspondence should be addressed.
Current address: Department of Statistics, Chonnam National University, Gwangju 61186, Republic of Korea.
Mathematics 2023, 11(7), 1628;
Submission received: 31 January 2023 / Revised: 24 March 2023 / Accepted: 24 March 2023 / Published: 28 March 2023
(This article belongs to the Special Issue Advances in Financial Modeling)


In this paper, we propose a stochastic weather model consisting of temperature, humidity, and precipitation, which is used to calculate a reconnaissance drought index (RDI) in Qatar. The temperature and humidity models include stochastic differential equations and utilize an adjusted Ornstein–Uhlenbeck (O–U) process. For the precipitation model, a first-order Markov chain is used to differentiate between wet and dry days and the precipitation amount on wet days is determined by a probability distribution. Five different probability distributions were statistically tested to obtain an appropriate precipitation amount. The evapotranspiration used in the RDI calculation incorporates crop coefficient values, depends on the growth stages of the crops, and provides a crop-specific and more realistic representation of the drought conditions. Five different evapotranspiration formulations were investigated in order to obtain the most accurate RDI values. The calculated RDI was used to assess the intensity of drought in Doha, Qatar, and could be used for the pricing of financial drought derivatives, a form of weather derivative. These derivatives could be used by agricultural producers to hedge against the economic effects of droughts.

1. Introduction

The motivation for this paper was to develop stochastic weather models to simulate weather for use in pricing financial drought derivatives [1], which are a form of weather derivative designed to protect agricultural producers from the economic effects of drought. In recent years, weather derivatives have become popular and effective tools for minimizing financial losses associated with weather impacts in many industries, including energy, transportation, and agriculture. Before the use of weather derivatives, the most common approach to reducing weather risks was the use of insurance contracts. As insurance contracts are designed to pay out for specific weather events named in the contracts, they are effective only when these events occur. Usually, these events are disasters, such as typhoons or floods, which happen with low probability but cause large losses. However, weather insurance is subject to moral hazard, as it may not incentivize policyholders to mitigate damages. Additionally, because claims can be highly positively correlated across space, it can be difficult for insurers to meet their obligations. Due to these limitations of insurance contracts and the high variability of weather, weather derivatives are becoming more popular than insurance contracts for hedging weather risks [2].
Weather derivatives are similar to traditional derivatives on financial assets, but as their values are based on weather indices, there is no underlying traded asset to hedge against. This means that risk-neutral pricing models cannot be applied. Thus, the main approach to pricing them is to use Monte Carlo simulation. However, this presents difficulties due to limited quantities of historical data. There are three Monte Carlo approaches to modeling the price of weather derivatives: historical burn analysis (HBA), index modeling, and daily modeling.
HBA, a classical method, uses historical price data to simulate future prices, under the assumption that historical weather data contain all the characteristics needed to value derivatives in the future [3]. HBA therefore requires historical prices of weather derivatives. An alternative is index modeling, which models the index values that directly determine the price of a weather derivative; this approach requires a different model for each index [4]. A third approach, suggested by recent studies, is daily modeling, which uses weather models to simulate potential future weather data. It can lead to more accurate results than the other two approaches because it uses weather data directly to price weather derivatives [5].
Literature on weather derivatives has primarily focused on alternative models for pricing weather risks [6,7,8,9]. However, while weather derivatives are often used to protect agricultural producers from droughts, there is a basis risk in that weather does not perfectly correlate with drought risks. Thus, our interest is to improve the use of stochastic weather models to simulate drought indices, enabling the pricing of drought-specific derivatives to better capture potential risks. Furthermore, in the literature, studies have not focused on weather modeling in arid climates, where droughts are prevalent. Thus, in this paper, we assess our stochastic weather models by calculating a reconnaissance drought index (RDI) in Qatar for use in index-based weather derivatives. Modeling weather to calculate RDI provides not only an opportunity to price drought derivatives but also could be used to potentially understand the distribution of losses in traditional drought insurance contracts, thus providing more accurate pricing.
Developing stochastic models that simulate realistic daily weather scenarios and preserve the statistical characteristics of the historical weather data is very important for obtaining accurate prices of weather derivatives [2]. In this study, based on daily modeling, we propose stochastic weather models for temperature, humidity, and precipitation to determine the prices of weather derivatives.
In discrete temperature modeling, an autoregressive (AR) model for residuals was used to model daily temperature [10]. As an improved autoregressive model, reference [11] suggested a k-lag autocorrelation model. Continuous-time temperature models usually contain a mean-reverting term, and most of the temperature models suggested for weather derivatives are mean-reverting processes. Reference [12] uses an O–U process for the temperature at Heathrow Airport in the UK and concludes that it fits the temperature well. The study in [12] proposes an adjusted O–U process, pointing out that the expected value of the O–U process suggested in [13,14] does not approximately equal the mean that the process should approach. In early studies of precipitation models, references [15,16] suggested a Markov chain for modeling precipitation occurrence. A first-order Markov chain is a well-known and widely used model for precipitation occurrence [17,18,19]. Higher-order Markov chains have also been applied to precipitation occurrence to overcome the short memory length of a first-order Markov chain model [20,21]. A Markov chain model with jumps for precipitation data in Chongqing, China, shows better results for locations with frequent excess precipitation [22].
This paper is structured as follows. In Section 2, we describe the reconnaissance drought index (RDI) as a measure of drought intensity, evapotranspiration, and crop evapotranspiration used to calculate drought index values; we also propose stochastic weather models for temperature, humidity, and precipitation. In Section 3, we calculate the RDI values for four crops in Qatar, i.e., carrots, maize, tomatoes, and wheat, using historical and simulated weather data. In Section 4, we discuss our conclusions and future work.

2. Materials and Methods

2.1. Study Area and Data

In this study, daily weather data from Doha, Qatar, are used. Qatar is located at latitude 25.35° N and longitude 51.03° E, on the eastern side of the Arabian Peninsula, bordering Saudi Arabia. It has a dry climate with low precipitation and exceedingly hot and humid summers. The temperature during the winter months, from December to February, remains above 10 °C, and most of the precipitation occurs during this period. The region receives a small amount of precipitation, as shown in Figure 1, which is highly unpredictable in both time and space.
Weather data from 1983 to 2013 were used to develop a stochastic weather model. To compare calculation methods for evapotranspiration, data over the period from 1985 to 2013 were used. The data included basic weather factors, such as daily values of the mean, minimum, and maximum temperature (°C), precipitation (mm), relative humidity (%), and wind speed (m/s). More detailed weather factors for calculating evapotranspiration were included from 1985.
To ignore the effects of leap years, we remove 29 February from each leap year. This gives a simple model structure with 365 days in every year.
In this study, missing values in the temperature data were replaced with the existing values of the same day in the previous year. This is because the missing values in our data were mostly consecutive and extended over long periods; in the worst case, no data were available for an entire month. For precipitation, days with missing values were treated as non-rainy days because Qatar has few rainy days. All data processing and calculations were done with the statistical software package R.
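The two cleaning rules described above (dropping 29 February and filling a missing temperature from the same day of the previous year) can be sketched as follows. This is an illustrative Python sketch, not the R code used in the study; the function names, the dictionary layout, and the sample values are assumptions made for the example.

```python
from datetime import date

def drop_leap_days(records):
    """Return a copy of {date: value} without any 29 February entries."""
    return {d: v for d, v in records.items()
            if not (d.month == 2 and d.day == 29)}

def fill_from_previous_year(records):
    """Replace None with the value from the same calendar day one year earlier.

    Assumes leap days were already dropped, so the same day always exists
    in the previous year. A gap stays None if the earlier value is missing too.
    """
    filled = dict(records)
    for d in sorted(filled):  # oldest first, so earlier fills can propagate
        if filled[d] is None:
            filled[d] = filled.get(d.replace(year=d.year - 1))
    return filled

# Hypothetical daily mean temperatures (degrees C); None marks a missing value.
temps = {
    date(2011, 3, 1): 21.4,
    date(2012, 2, 29): 20.0,   # leap day, removed below
    date(2012, 3, 1): None,    # missing, filled from 1 March 2011
}
clean = fill_from_previous_year(drop_leap_days(temps))
```

Processing the dates in ascending order means that a value filled in one year can itself be reused for the following year, which matters when gaps span the same days in consecutive years.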

2.2. Reconnaissance Drought Index (RDI)

In the last few decades, many drought indices have been developed as tools for assessing the severity of droughts. By incorporating the effects of relevant weather variables on the occurrence and intensity of droughts into a single numerical value, drought indices make it easy to establish strategies that could reduce the associated risks. Through the drought index, droughts could be classified as hyper-arid, arid, semi-arid, sub-humid, and humid. In this paper, we used a more recent drought index called the reconnaissance drought index (RDI), introduced by Tsakiris et al. (2007) [23]. The RDI is a physically based, universal, and comprehensive index that depends on cumulative precipitation and potential evapotranspiration. Most drought indices are precipitation-based only and are not sufficiently effective at capturing the effects of droughts on crops and agricultural production. When assessing the severity of droughts, evapotranspiration is also required in order to provide the most realistic information on water scarcity and agricultural conditions. Therefore, the RDI is a preferred index for use in agriculture in drought severity assessment and monitoring. It is also sensitive to changing climatic environments and flexible for different growing periods (Tsakiris et al. (2007) [23]).
RDI is defined as the ratio of accumulated precipitation to potential evapotranspiration [23]:
$$RDI^{(i)} = \frac{\sum_{j=1}^{n} P_{ij}}{\sum_{j=1}^{n} PET_{ij}}, \tag{1}$$
where $P_{ij}$ and $PET_{ij}$ are the precipitation and potential evapotranspiration ($PET$) for the $j$th month of the $i$th year and $n$ is the period in which we are interested. Based on this equation, we can calculate RDI for any period in each year. Potential evapotranspiration ($PET$) is the evaporation that occurs where there is a sufficient water supply, but it is very hard to obtain. Therefore, in practice, we use an adjusted RDI, which uses actual evapotranspiration ($ET$) instead of potential evapotranspiration ($PET$):
$$RDI_{adj}^{(i)} = \frac{\sum_{j=1}^{n} P_{ij}}{\sum_{j=1}^{n} ET_{ij}}, \tag{2}$$
where $ET_{ij}$ is the actual evapotranspiration $ET$ for the $j$th month of the $i$th year.
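As a small worked illustration of the ratio defined above, the following Python sketch computes the RDI for one year from monthly precipitation and evapotranspiration totals. The function name and the numbers are hypothetical and serve only to show the arithmetic.

```python
def rdi(precip, evapotranspiration):
    """Accumulated precipitation divided by accumulated evapotranspiration."""
    if len(precip) != len(evapotranspiration):
        raise ValueError("both series must cover the same months")
    total_et = sum(evapotranspiration)
    if total_et <= 0:
        raise ValueError("accumulated evapotranspiration must be positive")
    return sum(precip) / total_et

# Hypothetical monthly totals (mm) over a four-month growing period.
precip_mm = [12.0, 8.0, 0.0, 5.0]    # sums to 25 mm
et_mm = [50.0, 60.0, 70.0, 70.0]     # sums to 250 mm
value = rdi(precip_mm, et_mm)        # 25 / 250
```

Passing potential evapotranspiration gives the RDI; passing actual (crop) evapotranspiration gives the adjusted RDI.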

2.3. Evapotranspiration

Evapotranspiration consists of two processes that account for water loss to the atmosphere: evaporation and transpiration. While evaporation represents the water loss due to movement from the soil to the atmosphere, transpiration represents the water loss through plants. Since these two processes occur simultaneously, it is difficult to measure their effect separately. Therefore, we use evapotranspiration as a quantity that measures the combined effect of the two processes. Since the weather is the only factor that affects evapotranspiration, there are calculation methods using weather data. Several calculation methods have been developed and proposed to estimate evapotranspiration from weather data such as temperature, humidity, wind speed, and other weather parameters, such as solar radiation, pressure, and so on.
The FAO-56 Penman–Monteith (PM) method [24] is recommended by the Food and Agriculture Organization of the United Nations (FAO) as a standardized method to calculate the reference evapotranspiration $ET_0$. It has the following form:
$$ET_0 = \frac{0.408\,\Delta\,(R_n - G) + \gamma\,\dfrac{900}{T + 273}\,U_2\,(e_s - e_a)}{\Delta + \gamma\,(1 + 0.34\,U_2)}, \tag{3}$$
where $ET_0$ is the reference evapotranspiration (mm/day), $R_n$ is the net radiation at the crop surface (MJ/m$^2$ day), $G$ is the soil heat flux density (MJ/m$^2$ day), $T$ is the mean air temperature at a 2 m height (°C), $U_2$ is the wind speed at a 2 m height (m/s), $e_s$ is the saturation vapor pressure (kPa), $e_a$ is the actual vapor pressure (kPa), $\Delta$ is the slope of the vapor pressure curve (kPa/°C), and $\gamma$ is the psychrometric constant (kPa/°C).
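The PM expression can be evaluated directly once its eight inputs are known. The sketch below is a plain Python transcription of the formula; the function name and the input values are hypothetical and chosen only to show the order of magnitude of the result.

```python
def penman_monteith_et0(delta, rn, g, gamma, t, u2, es, ea):
    """Reference evapotranspiration (mm/day) from the FAO-56 PM formula.

    delta: slope of the vapor pressure curve (kPa/degC)
    rn, g: net radiation and soil heat flux density (MJ/m^2 day)
    gamma: psychrometric constant (kPa/degC)
    t:     mean air temperature at 2 m (degC)
    u2:    wind speed at 2 m (m/s)
    es, ea: saturation and actual vapor pressure (kPa)
    """
    radiation_term = 0.408 * delta * (rn - g)
    aerodynamic_term = gamma * 900.0 / (t + 273.0) * u2 * (es - ea)
    return (radiation_term + aerodynamic_term) / (delta + gamma * (1.0 + 0.34 * u2))

# Hypothetical daily inputs; the result is roughly 3.85 mm/day.
et0 = penman_monteith_et0(delta=0.122, rn=13.28, g=0.14, gamma=0.0666,
                          t=16.9, u2=2.078, es=1.997, ea=1.409)
```

The number of inputs makes the data requirement of the PM method visible: each argument has to be measured or derived from other measurements.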
The limitation of using the PM formulation is that it requires extensive climatic data that are not easily available. Therefore, we test other simpler evapotranspiration formulations for compatibility with the PM formulation for Qatar, including those developed by Blaney and Criddle [25], Hargreaves and Samani [26], Jensen and Haise [27], Linacre [28], and Turc [29]. This will be crucial for other arid regions where the available weather data are limited. The formulations of each of these methods are provided in Appendix A.
In order to select the appropriate formulation, we compared the compatibility of each of these methods with the PM formulation by computing several statistical quantities: Pearson's correlation ($R^2$), root-mean-square error (RMSE), mean absolute error (MAE), and maximum absolute error (MAXE). The computed values of these quantities based on the data from 1985 to 2014 are presented in Table 1.
With the highest correlation $R^2$ value and lowest RMSE, MAE, and MAXE error values, the Turc method shows the closest agreement with the PM method. The Turc method is given by the following piecewise continuous function of relative humidity $RH$:
$$ET_0 = \begin{cases} 0.013\,\dfrac{T_{mean}}{15 + T_{mean}}\,(R_s + 50), & RH > 50\% \\[4pt] 0.013\,\dfrac{T_{mean}}{15 + T_{mean}}\,(R_s + 50)\left(1 + \dfrac{50 - RH}{70}\right), & RH < 50\% \end{cases}, \tag{4}$$
and requires only mean temperature, $T_{mean}$ (°C), relative humidity, $RH$ (%), and mean solar radiation, $R_s$ (W/m$^2$ day), which are variables that are easily obtainable over an arid region, such as Qatar, making the method suitable for common use. Therefore, the Turc method is selected as the evapotranspiration method for the RDI calculation in this paper.
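A minimal Python sketch of the Turc calculation follows, assuming the inputs are already in the units stated above. The function name and the sample values are hypothetical. The two branches coincide at $RH = 50\%$, so the treatment of that boundary does not affect the result.

```python
def turc_et0(t_mean, rs, rh):
    """Turc reference evapotranspiration.

    t_mean: mean temperature (degC), rs: mean solar radiation,
    rh: relative humidity (%). Units follow the text above.
    """
    et0 = 0.013 * t_mean / (15.0 + t_mean) * (rs + 50.0)
    if rh < 50.0:
        # Dry-air correction, which grows as humidity drops below 50%.
        et0 *= 1.0 + (50.0 - rh) / 70.0
    return et0

humid_day = turc_et0(t_mean=15.0, rs=450.0, rh=60.0)  # 0.013 * 0.5 * 500
dry_day = turc_et0(t_mean=15.0, rs=450.0, rh=15.0)    # same, times 1.5
```

For the same temperature and radiation, the dry day yields a value 50% higher than the humid day, which is the behavior the humidity correction is meant to capture.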
Once we obtain $ET_0$, the actual evapotranspiration specific to a certain crop can be easily calculated using the crop coefficient. While most of the effects from relevant weather conditions are reflected in $ET_0$, the effect of the crop type on evapotranspiration is incorporated by adjusting $ET_0$ with the crop coefficient to obtain the crop evapotranspiration, denoted by $ET_c$, which is given by:
$$ET_c = ET_0 \times k_c, \tag{5}$$
where $k_c$ is the crop coefficient. Every crop has its own crop coefficient values based on the growth environment, water requirements, and growth stages. Crop coefficients for specific crops at specific growth stages are provided by the Food and Agriculture Organization of the United Nations (FAO) [24]. Table 2 shows the growth period and crop coefficients for Qatar. The growth period is separated into four growth stages: initial, development, mid-season, and late season. Typically, crop coefficients in mid-season have the largest values. Most crops in Qatar are grown in the winter months and harvested before the start of the summer months because the summer has very few rainy days.
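The crop adjustment is a single multiplication per day, with the coefficient chosen by growth stage. The Python sketch below illustrates this; the coefficient values are placeholders invented for the example and are not the values in Table 2.

```python
# Hypothetical stage coefficients for one crop (placeholders, not Table 2).
KC_BY_STAGE = {
    "initial": 0.70,
    "development": 0.90,
    "mid-season": 1.15,
    "late season": 0.80,
}

def crop_et(et0, stage, kc_by_stage=KC_BY_STAGE):
    """Crop evapotranspiration: reference value scaled by the stage coefficient."""
    return et0 * kc_by_stage[stage]

# Daily reference evapotranspiration (mm/day) paired with the growth stage.
days = [(3.0, "initial"), (4.0, "mid-season"), (4.0, "late season")]
seasonal_etc = sum(crop_et(et0, stage) for et0, stage in days)
```

Summing the daily values over the growth period gives the accumulated crop evapotranspiration that enters the denominator of the adjusted RDI.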
Crop evapotranspiration for six crops, including alfalfa, bean, carrot, maize, tomato, and wheat, was calculated using observed and simulated weather data. Observed weather data from 1985 to 2013 and simulation data with the same length as the observation data were used. The mean, standard deviation, minimum, and maximum of crop evapotranspiration are presented in Table 3. The mean crop evapotranspiration from tomatoes shows the largest value, while alfalfa shows the smallest value.

2.4. Temperature Model

We propose using a mean-reverting process to develop the temperature model, accounting for its seasonality, which repeats annually. We use an adjusted Ornstein–Uhlenbeck (O–U) process, suggested by [13,14] and given by
$$dX_t = \left[\lambda\,(\mu - X_t) + \frac{d\mu}{dt}\right]dt + \gamma\,dW_t, \tag{6}$$
where $X_t$ represents the daily temperature, $\lambda$ is the speed of mean reversion, $\mu$ is the mean to which the process reverts, $\gamma$ is the volatility of the model, and $W_t$ is a Wiener process, so that $W_t$ is normally distributed with a mean of 0 and a variance of $t$. The solution to Equation (6), a stochastic differential equation (SDE), is derived from Itô's Lemma and is given by [13]:
$$X_t = \mu + e^{-\int_0^t \lambda\,ds}\,(X_0 - \mu) + e^{-\int_0^t \lambda\,ds}\int_0^t e^{\int_0^s \lambda\,du}\,\gamma\,dW_s. \tag{7}$$
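To make the simulation step concrete, here is a minimal Python sketch that advances the process one day at a time using the exact one-step transition of a mean-reverting O–U process. It assumes that $\lambda$, $\mu$, and $\gamma$ are constant over the simulated period (for instance within one month), so the $d\mu/dt$ term vanishes; the function name and parameter values are hypothetical.

```python
import math
import random

def simulate_ou(x0, lam, mu, gamma, n_days, delta=1.0, seed=0):
    """Simulate daily values of a mean-reverting O-U process.

    Uses X_t = mu + b * (X_{t-1} - mu) + sd * eps with b = exp(-lam * delta),
    sd = gamma * sqrt((1 - b^2) / (2 * lam)), and eps ~ N(0, 1).
    Assumes constant parameters over the whole path.
    """
    rng = random.Random(seed)
    b = math.exp(-lam * delta)
    sd = gamma * math.sqrt((1.0 - b * b) / (2.0 * lam))
    path = [x0]
    for _ in range(n_days):
        path.append(mu + b * (path[-1] - mu) + sd * rng.gauss(0.0, 1.0))
    return path

# Hypothetical parameters: start 10 degrees below a mean of 30 degrees C.
path = simulate_ou(x0=20.0, lam=0.5, mu=30.0, gamma=2.0, n_days=365)
```

In a monthly parameterization, the same step would be applied with the parameter set of the month that each simulated day falls in.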
In this process, λ , μ , and γ are parameters that need to be estimated from the data. Here, we use the least-squares method to estimate these parameters, where we assume that consecutive observations have a linear relation with normally distributed error. The following linear equation is applied to the solution of the SDE [30]:
$$X_t = a + b\,X_{t-1} + \epsilon. \tag{8}$$
The relationship between the parameters of the linear equation and the solution of the SDE [30] is then derived and described below.
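$$a = \mu\left(1 - e^{-\lambda\delta}\right), \tag{9}$$
$$b = e^{-\lambda\delta}, \tag{10}$$
$$sd(\epsilon) = \gamma\sqrt{\frac{1 - e^{-2\lambda\delta}}{2\lambda}}. \tag{11}$$
Here, $\delta$ represents the time step between $t-1$ and $t$, so $\delta$ is 1. Thus, rewriting with respect to the parameters in the SDE, we have
$$\lambda = -\frac{\ln b}{\delta}, \tag{12}$$
$$\mu = \frac{a}{1 - b}, \tag{13}$$
$$\gamma = sd(\epsilon)\sqrt{\frac{-2\ln b}{\delta\,(1 - b^2)}}. \tag{14}$$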
We can calculate the parameters of the least-squares fit as follows:
$$a = \frac{S_y - b\,S_x}{n}, \tag{15}$$
$$b = \frac{n\,S_{xy} - S_x S_y}{n\,S_{xx} - S_x^2}, \tag{16}$$
$$sd(\epsilon) = \sqrt{\frac{n\,S_{yy} - S_y^2 - b\,(n\,S_{xy} - S_x S_y)}{n\,(n - 2)}}, \tag{17}$$
where $S_x = \sum_{t=1}^{n} X_{t-1}$, $S_y = \sum_{t=1}^{n} X_t$, $S_{xx} = \sum_{t=1}^{n} X_{t-1}^2$, $S_{xy} = \sum_{t=1}^{n} X_{t-1} X_t$, and $S_{yy} = \sum_{t=1}^{n} X_t^2$.
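The estimation procedure above can be sketched in Python as follows: compute the five sums from consecutive observations, obtain $a$, $b$, and $sd(\epsilon)$, and map them to $\lambda$, $\mu$, and $\gamma$. The function name is hypothetical, and the synthetic series exists only to check that known parameters are approximately recovered. The sketch assumes $0 < b < 1$, since otherwise the logarithm does not give a valid mean-reversion speed.

```python
import math
import random

def fit_ou_least_squares(x, delta=1.0):
    """Estimate (lambda, mu, gamma) from a series by regressing X_t on X_{t-1}."""
    prev, curr = x[:-1], x[1:]
    n = len(prev)
    sx, sy = sum(prev), sum(curr)
    sxx = sum(v * v for v in prev)
    syy = sum(v * v for v in curr)
    sxy = sum(p * c for p, c in zip(prev, curr))
    b = (n * sxy - sx * sy) / (n * sxx - sx ** 2)
    a = (sy - b * sx) / n
    sd = math.sqrt((n * syy - sy ** 2 - b * (n * sxy - sx * sy)) / (n * (n - 2)))
    lam = -math.log(b) / delta          # requires 0 < b < 1
    mu = a / (1.0 - b)
    gamma = sd * math.sqrt(-2.0 * math.log(b) / (delta * (1.0 - b * b)))
    return lam, mu, gamma

# Synthetic daily series with known (hypothetical) parameters.
rng = random.Random(1)
true_lam, true_mu, true_gamma = 0.3, 25.0, 1.5
b_true = math.exp(-true_lam)
sd_true = true_gamma * math.sqrt((1.0 - b_true ** 2) / (2.0 * true_lam))
series = [true_mu]
for _ in range(20000):
    series.append(true_mu + b_true * (series[-1] - true_mu)
                  + sd_true * rng.gauss(0.0, 1.0))

lam_hat, mu_hat, gamma_hat = fit_ou_least_squares(series)
```

Applying the function separately to the observations of each calendar month would give one parameter set per month, in the spirit of the monthly values reported in the text.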
The mean values of the parameters in the SDE are calculated for each month of the year and are presented in Table 4. The highest value of $\gamma$ is found in June and the smallest value is found in October. For the speed of mean reversion, August has a large value of $\lambda$, which means that the temperature is drawn very strongly back to its mean value. Moreover, the value of $\mu$ is close to the monthly mean temperature.
We also used the maximum likelihood estimation method to estimate these parameters and very similar results were obtained. (See Appendix B for details).

2.5. Humidity Model

In addition to temperature, relative humidity is needed to calculate evapotranspiration using our selected Turc method. Similar to the temperature model, a mean-reverting O–U process is used to model daily humidity data. The SDE given by Equation (6) and its solution given by Equation (7) are applied, where $X_t$ represents humidity. The parameters are then determined using Equations (12)–(14). The mean values of the parameters in the SDE were calculated using the least-squares method for each month of the year and are presented in Table 5. Similar results were obtained using the maximum likelihood estimation method (see Appendix B for details). As one may expect, the volatility values for humidity are larger than those for the temperature model.

2.6. Precipitation Model

We now present the precipitation model, which is required along with evapotranspiration for RDI calculation.
The most commonly used stochastic models for precipitation consist of a two-process formulation that models precipitation occurrence and amount. In the two-process model, a Markov chain is used to model precipitation occurrence, and a probability distribution is used to determine the precipitation amount on a wet day [15,16,17,18,19,31,32]. In the following sections, we describe the first-order Markov chains used to model precipitation occurrence and consider several probability distributions for estimating the precipitation amount.

2.6.1. Precipitation Occurrence Model

The first-order Markov chain implicitly assumes that the probability of rain tomorrow depends only on whether it rained today or not and is described by the Markov property:
$$P(X_{t+1} = s_{t+1} \mid X_t = s_t, X_{t-1} = s_{t-1}, \ldots, X_0 = s_0) = P(X_{t+1} = s_{t+1} \mid X_t = s_t),$$
where time $t \in \{0, 1, 2, \ldots, T\}$ and the state space is $\{1, 2, 3, \ldots, S\}$. The Markov chain transition matrix, which defines the probability that each event occurs, is composed of transition probabilities, which are conditional probabilities of future state $j$ given state $i$. The transition matrix, denoted by $\mathbf{P}$, is given by
$$\mathbf{P} = \begin{pmatrix} p_{11} & \cdots & p_{1j} \\ \vdots & \ddots & \vdots \\ p_{i1} & \cdots & p_{ij} \end{pmatrix} \quad \text{for } i, j \in \{1, \ldots, S\},$$
where $p_{ij} = P(X_{t+1} = j \mid X_t = i)$. A property of a transition matrix is that each row must sum to 1, i.e.,
$$\sum_{j=1}^{S} p_{ij} = \sum_{j=1}^{S} P(X_{t+1} = j \mid X_t = i) = 1.$$
The precipitation occurrence has two states: dry and wet. Therefore, the transition matrix consists of four conditional probabilities, which are
$$p_{dd} = P(\text{dry on day } t+1 \mid \text{dry on day } t),$$
$$p_{dw} = P(\text{wet on day } t+1 \mid \text{dry on day } t),$$
$$p_{wd} = P(\text{dry on day } t+1 \mid \text{wet on day } t),$$
$$p_{ww} = P(\text{wet on day } t+1 \mid \text{wet on day } t).$$
Since there are only two states, the transition probabilities from the same given state are complementary. It is therefore not necessary to estimate all four transition probabilities; we only need to estimate one of each pair. For instance, the probability of a dry day following a dry day is calculated from the probability of a wet day following a dry day, i.e., $p_{dd} = 1 - p_{dw}$. The probability transition matrix is defined as
$$\mathbf{P} = \begin{pmatrix} p_{dd} & p_{dw} \\ p_{wd} & p_{ww} \end{pmatrix}, \quad 0 \le p_{ij} \le 1, \quad i, j \in \{d, w\}.$$
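The two-state chain can be turned into a daily precipitation generator by drawing the wet or dry state from the transition probabilities and then drawing an amount on wet days. The Python sketch below illustrates the structure only: it uses an exponential distribution for the amounts as a simple stand-in for a fitted distribution, and the function name and all parameter values are hypothetical.

```python
import random

def simulate_precipitation(p_dw, p_wd, mean_wet_mm, n_days, seed=0):
    """Daily precipitation (mm) from a first-order two-state Markov chain.

    p_dw: P(wet tomorrow | dry today); p_wd: P(dry tomorrow | wet today).
    Wet-day amounts are exponential with the given mean (a stand-in choice).
    The day before the first simulated day is assumed to be dry.
    """
    rng = random.Random(seed)
    wet = False
    amounts = []
    for _ in range(n_days):
        p_wet = (1.0 - p_wd) if wet else p_dw
        wet = rng.random() < p_wet
        amounts.append(rng.expovariate(1.0 / mean_wet_mm) if wet else 0.0)
    return amounts

# Hypothetical transition probabilities for a dry climate.
daily_mm = simulate_precipitation(p_dw=0.05, p_wd=0.6, mean_wet_mm=8.0,
                                  n_days=50000)
wet_fraction = sum(1 for a in daily_mm if a > 0.0) / len(daily_mm)
```

Only the line that draws the amount would change if a different wet-day distribution were used; the occurrence logic stays the same.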
Using the transition matrix, we can calculate the stationary state vector $\pi$ such that $\pi = \pi\mathbf{P}$. It gives the long-run relative frequency of precipitation occurrence and satisfies $\sum_{i=1}^{S} \pi_i = 1$, where $\pi_i \ge 0$ for all $i$. Each element, $\pi_i$, denotes the probability of being in state $i$. If this state vector is given by
$$\pi = \begin{pmatrix} \pi_d & \pi_w \end{pmatrix},$$
then it must satisfy
$$\begin{pmatrix} \pi_d & \pi_w \end{pmatrix} = \begin{pmatrix} \pi_d & \pi_w \end{pmatrix} \begin{pmatrix} 1 - p_{dw} & p_{dw} \\ p_{wd} & 1 - p_{wd} \end{pmatrix}.$$
Therefore, by solving for the stationary probabilities $\pi_d$ and $\pi_w$, we obtain:
$$\pi_d = \frac{p_{wd}}{p_{wd} + p_{dw}},$$
$$\pi_w = \frac{p_{dw}}{p_{wd} + p_{dw}}.$$
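The closed-form stationary probabilities can be checked numerically by applying one step of the chain and confirming that the vector does not change. The Python sketch below does this for hypothetical transition probabilities; the function names are illustrative.

```python
def stationary(p_dw, p_wd):
    """Stationary (dry, wet) probabilities of the two-state chain."""
    total = p_dw + p_wd
    if total <= 0.0:
        raise ValueError("at least one transition probability must be positive")
    return p_wd / total, p_dw / total

def one_step(pi, p_dw, p_wd):
    """Multiply the row vector pi by the two-state transition matrix."""
    pi_d, pi_w = pi
    return (pi_d * (1.0 - p_dw) + pi_w * p_wd,
            pi_d * p_dw + pi_w * (1.0 - p_wd))

pi = stationary(p_dw=0.05, p_wd=0.6)        # hypothetical probabilities
pi_next = one_step(pi, p_dw=0.05, p_wd=0.6) # should equal pi
```

When a wet day never follows a dry day ($p_{dw} = 0$), the formula gives a stationary dry probability of one, which is the situation in months with no observed wet days.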
The stationary probabilities are calculated for each month and are shown in Table 6. We can see that most of the stationary probabilities for dry days are quite large, showing that Qatar does not have many wet days. Since there are no wet days in June and July from the data, stationary probabilities for dry days are one, as expected.

2.6.2. Distribution of Precipitation Amount

Once we have the precipitation occurrence sequence, the next step is to determine the precipitation amounts on wet days. Since the precipitation amount is generally small, we use a right-skewed probability distribution for the precipitation amount [19]. Previously, many right-skewed probability distributions, including the exponential, log-normal, and gamma, were used to describe the distribution of the precipitation amount [18,33]. Because the data contain extreme precipitation amounts of over 100 mm, we expect distributions with thicker tails to perform better. In addition to these three distributions, we therefore consider two extreme-value distributions: the generalized extreme value distribution (GEVD) and the four-parameter kappa distribution (K4D). Their thicker tails can better capture extremely large precipitation amounts in the simulation. The functions for the probability distributions are given in Table 7. The parameters of the distributions are estimated using maximum likelihood estimation (MLE), and the results are shown in Table 8.
It would be preferable to fit probability distributions for each month. However, some months have very few or no rainy days in our data, making it challenging to find suitable probability distributions. Therefore, we use all the available precipitation data to estimate the probability distribution of precipitation amount in Qatar.
We determine which probability distribution best fits the precipitation amount using three model validation methods: the Kolmogorov–Smirnov test (KS test), the Akaike information criterion (AIC), and the Bayesian information criterion (BIC). As described below, the KS test provides p-values, while both AIC and BIC provide values based on likelihood functions [33].
  • Kolmogorov–Smirnov test (KS test [33])
    The Kolmogorov–Smirnov test is used to determine if a dataset comes from a specified distribution. It measures the differences between the empirical distribution of the sample and the cumulative distribution of the specified distribution, providing a test statistic, D, and p-values that can be used as criteria for hypothesis testing.
  • Akaike information criterion (AIC [35])
    The AIC, developed by Hirotugu Akaike, was first used to evaluate the performance of a model in simple linear regression. It was created to select the model that has the smallest loss of information from the given data. It measures the loss based on a likelihood function and is defined by:
    $$AIC = -2\ln(L) + 2K,$$
    where $L$ is the likelihood function and $K$ is the number of parameters. In the formula, the negative log-likelihood term represents the loss of information and $2K$ is a penalty corresponding to the number of parameters in the model. The penalty is needed because the fit generally improves as parameters are added, even when the extra parameters are not warranted. When comparing models, the one with the smallest AIC is considered to perform best.
  • Bayesian information criterion (BIC [36])
    Similar to AIC, BIC evaluates the model performance by using the likelihood, and the model with the smallest BIC is preferred. Its penalty term depends on both the number of parameters $K$ and the number of observations $n$, and it is larger than the AIC penalty whenever $\ln(n) > 2$. It is defined by
    $$BIC = -2\ln(L) + K\ln(n).$$
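Both criteria are simple functions of the maximized log-likelihood. The Python sketch below computes them for an exponential distribution fitted by maximum likelihood, chosen because its log-likelihood has a closed form; the function names and the sample of wet-day amounts are hypothetical.

```python
import math

def aic(log_lik, k):
    """Akaike information criterion from a maximized log-likelihood."""
    return -2.0 * log_lik + 2.0 * k

def bic(log_lik, k, n):
    """Bayesian information criterion; n is the number of observations."""
    return -2.0 * log_lik + k * math.log(n)

def exponential_log_likelihood(sample):
    """Maximized log-likelihood of an exponential fit (rate = 1 / sample mean)."""
    n = len(sample)
    rate = n / sum(sample)
    return n * math.log(rate) - rate * sum(sample)

# Hypothetical wet-day precipitation amounts (mm).
wet_day_mm = [0.5, 1.2, 3.0, 4.8, 10.5]
ll = exponential_log_likelihood(wet_day_mm)
aic_value = aic(ll, k=1)
bic_value = bic(ll, k=1, n=len(wet_day_mm))
```

Repeating the calculation with the log-likelihood and parameter count of each candidate distribution gives the values that are compared across models.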
The results of the KS test, AIC, and BIC for all of the distributions are presented in Table 9.
The p-values of all distributions except K4D are much smaller than the significance level of 0.05, so the KS test rejects each of them, and K4D is the only distribution that the test does not reject. Furthermore, since the AIC and BIC values of K4D are among the smaller values, we adopt K4D for the amount of precipitation on wet days.

3. Results

We use the solution of the SDE to simulate daily temperatures and compare them with the observed temperature data. Summary statistics of the observation and simulation data are given in Table 10. The mean daily temperature from observation, 27.67, is consistent with the simulation result of 27.57. The standard deviations of the observation and simulation data, 6.8975 and 6.9036, respectively, are close to each other. The 1st, 10th, 25th, 50th, 75th, 90th, and 99th percentiles are similarly consistent. Therefore, the temperature model with the adjusted O–U process adequately reproduces the properties of the observed temperature.
Similarly, the summary statistics for the humidity simulation are presented in Table 11. The mean humidity of the simulation is very close to that of the observation, and the simulated percentiles are very similar to the observed ones. The humidity simulation shows a smaller standard deviation than the observation. The main discrepancy is in the minimum values: the minimum observed humidity is 8, but the simulated minimum is 2.252. Overall, the performance of the humidity model with the mean-reverting O–U process is adequate.
In terms of precipitation, 30 years of simulation data were generated using the stochastic precipitation model with a Markov chain. The number of wet days in the observed and simulated data is presented in Figure 2. The two are very similar, indicating that a first-order Markov chain works well for generating precipitation occurrence. The simulation generates more wet days than observed in January, March, and August, and generally fewer than observed in February, April, May, October, November, and December.
In order to compare the precipitation amounts between observation and simulation data, percentiles and means are calculated using the selected probability density K4D. The minimum, 25th percentile (Q1), median (Q2), mean, 75th percentile (Q3), and the maximum simulated precipitation data are presented in Table 12. It is evident that a right-skewed distribution is appropriate for the distribution of precipitation amount since the median of the observed precipitation amount is smaller than the mean. Most of the simulated percentiles are very similar to those from the observation data.
Figure 3 illustrates the histogram of the precipitation amounts obtained with K4D. The dashed line represents the simulation, and both the data and the simulation clearly show that the distribution of the precipitation amount is strongly right-skewed.
We can calculate RDI using the observed and simulated weather data. To obtain simulation data, temperature and humidity data are simulated from an adjusted O–U process and precipitation data are simulated using a first-order Markov chain with K4D. RDI values for four crops (carrots, maize, tomatoes, and wheat) were calculated. Mean, standard deviation, and maximum values are presented in Table 13. No minimum values are presented here since all of the crops considered yield a minimum value of 0.
As expected, RDI in Qatar is very small, which implies that Qatar is very dry. It is hard to grow crops relying only on water from precipitation in Qatar, so effective irrigation systems are necessary. From the observation data, the smallest mean value is found for tomatoes, while the largest mean value is found for carrots, which also have the largest standard deviation. For both the mean and the maximum values, the RDI from the simulation is higher than that from the observation data. We therefore conclude that the simulated weather is less dry than the observed weather.

4. Discussion

We developed a stochastic weather model for temperature, humidity, and precipitation in Qatar. We used an adjusted O–U process to simulate temperature and humidity, while for the precipitation model, we employed a Markov chain with a probability distribution. A first-order Markov chain was used to determine whether a day was wet or dry, and K4D was used to determine the precipitation amount on wet days.
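The precipitation part of this model can be illustrated with a short sketch. It is not the authors' implementation: the transition probabilities and the K4D parameters below are hypothetical placeholders, and the function names are ours. The quantile function is obtained by inverting the K4D distribution function of [34], $F(x) = \{1 - h[1 - k(x-\xi)/\alpha]^{1/k}\}^{1/h}$, for $h \neq 0$ and $k \neq 0$.

```python
import random


def k4d_cdf(x, xi, alpha, h, k):
    """K4D distribution function for h != 0 and k != 0 (form given in [34])."""
    return (1.0 - h * (1.0 - k * (x - xi) / alpha) ** (1.0 / k)) ** (1.0 / h)


def k4d_quantile(u, xi, alpha, h, k):
    """Inverse of k4d_cdf: the amount x such that F(x) = u, with 0 < u < 1."""
    return xi + (alpha / k) * (1.0 - ((1.0 - u ** h) / h) ** k)


def simulate_precipitation(n_days, p_dry_to_wet, p_wet_to_wet, k4d_params, rng):
    """Daily amounts: first-order Markov chain for occurrence, K4D for wet-day amounts."""
    amounts = []
    wet = False  # assumption: the simulation starts from a dry day
    for _ in range(n_days):
        # The probability of a wet day depends only on the previous day's state.
        p_wet = p_wet_to_wet if wet else p_dry_to_wet
        wet = rng.random() < p_wet
        if wet:
            # Inverse-transform sampling; u is clamped away from 0 and 1.
            u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)
            amounts.append(k4d_quantile(u, *k4d_params))
        else:
            amounts.append(0.0)
    return amounts


# Hypothetical parameter values, chosen only so that the amounts are positive.
params = (1.5, 1.0, 0.5, -0.3)  # (xi, alpha, h, k)
series = simulate_precipitation(20000, 0.03, 0.40, params, random.Random(0))
wet_fraction = sum(a > 0 for a in series) / len(series)
```

In practice the transition probabilities would be estimated separately for each month, since rainfall occurrence in Doha is strongly seasonal.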
Moreover, we calculated RDI values from the observed and simulated weather data and compared their means, standard deviations, and maxima. The RDI from the observation data is small, as expected, since Qatar is generally dry. The mean and maximum RDI values from the simulation are greater than those from the observation data, which indicates that the simulated data are less dry than the observed data.
To improve stochastic weather models and obtain better simulation data, we suggest a combined weather model. In this study, we constructed the temperature and humidity models separately. Because humidity is related to temperature, a combined model of the two could be developed. With such a model, we expect the simulated data to agree more closely with the real data, which would reduce the differences in the drought index values between observation and simulation.

Author Contributions

Conceptualization, M.P. and K.A.; methodology, J.P. and M.P.; software, J.P.; validation, J.P.; investigation, J.P.; resources, K.A.; data curation, J.P.; writing—original draft preparation, J.P.; writing—review and editing, M.P.; supervision, M.P. and K.A.; project administration, K.A.; funding acquisition, K.A. and M.P. All authors have read and agreed to the published version of the manuscript.


Funding

This publication was funded by NPRP grant no. NPRP6-064-4-001 from the Qatar National Research Fund (a member of the Qatar Foundation). The statements made herein are solely the responsibility of the authors.

Data Availability Statement

Publicly available datasets were analyzed in this study. This data can be found here:

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Appendix A. Evapotranspiration Formulations

In this Appendix, we provide the details of the formulations developed by Blaney and Criddle [25], Hargreaves and Samani [26], Jensen and Haise [27], Linacre [28], and Turc [29]. Each method has different data requirements, but all of them are simpler and require fewer data variables than the FAO-56 PM method.
  • Blaney–Criddle:
    The Blaney–Criddle equation can be written as follows:
    $$ET_0 = P\,(0.46\,T_{mean} + 8.13)$$
    where $T_{mean}$ is the mean temperature (°C) and $P$ is the percentage of daylight hours associated with latitude and longitude.
  • Hargreaves:
    The Hargreaves equation can be written as follows:
    $$ET_0 = 0.0023 \times R_a\,(T_{mean} + 17.80)\,\sqrt{T_{max} - T_{min}}$$
    where $T_{mean}$, $T_{max}$, and $T_{min}$ are the mean, maximum, and minimum temperatures (°C), respectively, and $R_a$ is the extraterrestrial radiation (MJ/m² day) computed from the latitude in radians, sunset hour angle, distance between the sun and Earth, and solar declination.
  • Jensen–Haise:
    The method developed by Jensen and Haise for arid and semi-arid regions has the following equation:
    $$PET = \frac{1}{38 - \dfrac{2 \times Elevat}{305} + 7.6 \times \dfrac{50}{e_s(T_{max}) - e_s(T_{min})}} \times \left[ T_{mean} - \left( -2.5 - 0.14\,\bigl(e_s(T_{max}) - e_s(T_{min})\bigr) - \frac{Elevat}{550} \right) \right] R_a$$
    where $Elevat$ is the altitude (m), $e_s(T_{min})$ is the saturation vapor pressure (kPa) at the minimum temperature, $e_s(T_{max})$ is the saturation vapor pressure (kPa) at the maximum temperature, $e_a(T_{min})$ is the actual vapor pressure (kPa) at the minimum temperature, and $e_a(T_{max})$ is the actual vapor pressure (kPa) at the maximum temperature. This method is known to overestimate $ET_0$ in humid areas and underestimate it in arid and semi-arid regions.
  • Linacre:
    The Linacre method can be written as follows:
    $$ET_0 = \frac{\dfrac{700\,(T_{mean} + 0.0006\,Z)}{100 - L} + 15\,(T_{mean} - T_d)}{80 - T_{mean}}$$
    where $Z$ is the altitude (m), $L$ is the latitude in degrees, and $T_d$ is the dew point.
  • Turc:
    Finally, the Turc method can be written as follows:
    $$PET = \begin{cases} 0.013\,\dfrac{T_{mean}}{15 + T_{mean}}\,(R_s + 50), & RH > 50\% \\[2ex] 0.013\,\dfrac{T_{mean}}{15 + T_{mean}}\,(R_s + 50)\left(1 + \dfrac{50 - RH}{70}\right), & RH < 50\% \end{cases}$$
    where $R_s$ is the mean solar radiation (W/m² day) and $RH$ is the relative humidity (%).
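As an illustration, four of these formulations can be sketched as plain Python functions. This is a minimal sketch, not the authors' code: the function and argument names are ours, the Hargreaves temperature-range term is assumed to enter as a square root (the usual Hargreaves–Samani form), and the constants are copied from the formulas as printed above. The Jensen–Haise equation is omitted because it additionally requires saturation vapor pressures.

```python
import math


def et0_blaney_criddle(t_mean, p):
    """Blaney-Criddle: p is the percentage of daylight hours, t_mean in deg C."""
    return p * (0.46 * t_mean + 8.13)


def et0_hargreaves(t_mean, t_max, t_min, ra):
    """Hargreaves: ra is the extraterrestrial radiation, temperatures in deg C."""
    return 0.0023 * ra * (t_mean + 17.80) * math.sqrt(t_max - t_min)


def et0_linacre(t_mean, t_dew, altitude_m, latitude_deg):
    """Linacre: uses mean temperature, dew point, altitude and latitude."""
    # Altitude coefficient 0.0006 is taken as printed in this appendix.
    t_adjusted = t_mean + 0.0006 * altitude_m
    numerator = 700.0 * t_adjusted / (100.0 - latitude_deg) + 15.0 * (t_mean - t_dew)
    return numerator / (80.0 - t_mean)


def pet_turc(t_mean, rs, rh):
    """Turc: rs is the mean solar radiation, rh the relative humidity in percent."""
    pet = 0.013 * t_mean / (15.0 + t_mean) * (rs + 50.0)
    if rh < 50.0:
        # Humidity correction applied only in dry air (RH below 50%).
        pet *= 1.0 + (50.0 - rh) / 70.0
    return pet


# Hypothetical inputs for a hot day; not data from the study.
example = {
    "blaney_criddle": et0_blaney_criddle(t_mean=30.0, p=0.3),
    "hargreaves": et0_hargreaves(t_mean=30.0, t_max=38.0, t_min=22.0, ra=40.0),
    "linacre": et0_linacre(t_mean=30.0, t_dew=15.0, altitude_m=0.0, latitude_deg=25.0),
    "turc": pet_turc(t_mean=30.0, rs=250.0, rh=30.0),
}
```

The Turc correction factor equals 1 at a relative humidity of exactly 50%, so the two branches join continuously and the treatment of the boundary value does not matter.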

Appendix B. Maximum Likelihood Estimation (MLE)

Maximum likelihood estimation is a common statistical method used to estimate the parameters of a statistical model. Conditional maximum likelihood estimation (CMLE) is a type of MLE that takes into account the presence of conditional information in the data. In CMLE, the likelihood function is maximized with respect to the model parameters under the condition that certain variables in the model are held constant. We define our conditional likelihood function as
$$L(\mu, \lambda, \hat{\gamma}) = \prod_{t=1}^{n} f(X_t \mid X_{t-1}; \mu, \lambda, \hat{\gamma}).$$
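Applying conditional maximum likelihood estimation to the solution of the SDE, the conditional probability density of $X_t$ given $X_{t-1}$ is
$$f(X_t \mid X_{t-1}; \mu, \lambda, \hat{\gamma}) = \frac{1}{\sqrt{2\pi\hat{\gamma}^2}} \exp\left[ -\frac{\left( X_t - X_{t-1} e^{-\lambda\delta} - \mu\left(1 - e^{-\lambda\delta}\right) \right)^2}{2\hat{\gamma}^2} \right],$$
where
$$\hat{\gamma}^2 = \gamma^2\,\frac{1 - e^{-2\lambda\delta}}{2\lambda}.$$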
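The log-likelihood function is derived from the conditional probability function,
$$l(\mu, \lambda, \hat{\gamma}) = \sum_{t=1}^{n} \ln f(X_t \mid X_{t-1}; \mu, \lambda, \hat{\gamma}) = -\frac{n}{2}\ln 2\pi - n \ln \hat{\gamma} - \frac{1}{2\hat{\gamma}^2} \sum_{t=1}^{n} \left[ X_t - X_{t-1} e^{-\lambda\delta} - \mu\left(1 - e^{-\lambda\delta}\right) \right]^2.$$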
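The maximum likelihood estimates are the parameter values at which all partial derivatives of the log-likelihood (with respect to each parameter) are zero. The partial derivatives and the resulting solutions are presented below.
$$\frac{\partial l(\mu, \lambda, \hat{\gamma})}{\partial \mu} = \frac{1}{\hat{\gamma}^2} \sum_{t=1}^{n} \left[ X_t - X_{t-1} e^{-\lambda\delta} - \mu\left(1 - e^{-\lambda\delta}\right) \right] \left(1 - e^{-\lambda\delta}\right) = 0,$$
$$\mu = \frac{\sum_{t=1}^{n} \left[ X_t - X_{t-1} e^{-\lambda\delta} \right]}{n\left(1 - e^{-\lambda\delta}\right)},$$
$$\frac{\partial l(\mu, \lambda, \hat{\gamma})}{\partial \lambda} = -\frac{\delta e^{-\lambda\delta}}{\hat{\gamma}^2} \sum_{t=1}^{n} \left[ (X_t - \mu)(X_{t-1} - \mu) - e^{-\lambda\delta}(X_{t-1} - \mu)^2 \right] = 0,$$
$$\lambda = -\frac{1}{\delta} \ln \frac{\sum_{t=1}^{n} (X_t - \mu)(X_{t-1} - \mu)}{\sum_{t=1}^{n} (X_{t-1} - \mu)^2},$$
$$\frac{\partial l(\mu, \lambda, \hat{\gamma})}{\partial \hat{\gamma}} = -\frac{n}{\hat{\gamma}} + \frac{1}{\hat{\gamma}^3} \sum_{t=1}^{n} \left[ X_t - X_{t-1} e^{-\lambda\delta} - \mu\left(1 - e^{-\lambda\delta}\right) \right]^2 = 0,$$
$$\hat{\gamma}^2 = \frac{1}{n} \sum_{t=1}^{n} \left[ X_t - X_{t-1} e^{-\lambda\delta} - \mu\left(1 - e^{-\lambda\delta}\right) \right]^2.$$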
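Note that the solutions for $\mu$ and $\lambda$ depend on each other: the solution for $\mu$ contains $\lambda$, and the solution for $\lambda$ contains $\mu$, so neither can be evaluated without knowing the other. To overcome this problem, the expression for $\lambda$ is substituted into the expression for $\mu$. Once $\mu$ and $\lambda$ are determined, $\gamma$ can be found. Writing $S_x = \sum_{t=1}^{n} X_{t-1}$, $S_y = \sum_{t=1}^{n} X_t$, $S_{xx} = \sum_{t=1}^{n} X_{t-1}^2$, $S_{xy} = \sum_{t=1}^{n} X_{t-1} X_t$, and $S_{yy} = \sum_{t=1}^{n} X_t^2$, the parameters are given as follows.
$$\mu = \frac{S_y - e^{-\lambda\delta} S_x}{n\left(1 - e^{-\lambda\delta}\right)},$$
$$\mu = \frac{S_y S_{xx} - S_x S_{xy}}{n\,(S_{xx} - S_{xy}) - \left(S_x^2 - S_x S_y\right)},$$
$$\lambda = -\frac{1}{\delta} \ln \frac{S_{xy} - \mu S_x - \mu S_y + n\mu^2}{S_{xx} - 2\mu S_x + n\mu^2},$$
$$\hat{\gamma}^2 = \frac{1}{n} \left[ S_{yy} - 2 e^{-\lambda\delta} S_{xy} + e^{-2\lambda\delta} S_{xx} - 2\mu\left(1 - e^{-\lambda\delta}\right)\left(S_y - e^{-\lambda\delta} S_x\right) + n\mu^2\left(1 - e^{-\lambda\delta}\right)^2 \right],$$
$$\gamma^2 = \hat{\gamma}^2\,\frac{2\lambda}{1 - e^{-2\lambda\delta}}.$$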
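The closed-form estimators can be checked numerically with a short round trip: simulate a path with known parameters, then recover them. The sketch below is illustrative only. It uses a plain O–U process (the monthly adjustments used in the paper are not reproduced), the function names are ours, the parameter values are hypothetical, and the sums $S_x$, $S_y$, $S_{xx}$, $S_{xy}$, $S_{yy}$ are assumed to be taken over the lagged values $X_{t-1}$ and the current values $X_t$.

```python
import math
import random


def simulate_ou(n, x0, mu, lam, gamma, delta, rng):
    """Exact discretisation: X_t given X_{t-1} is normal with mean
    X_{t-1} e^{-lam*delta} + mu (1 - e^{-lam*delta}) and variance gamma_hat^2."""
    decay = math.exp(-lam * delta)
    gamma_hat = gamma * math.sqrt((1.0 - decay ** 2) / (2.0 * lam))
    path = [x0]
    for _ in range(n):
        mean = path[-1] * decay + mu * (1.0 - decay)
        path.append(mean + gamma_hat * rng.gauss(0.0, 1.0))
    return path


def fit_ou_mle(path, delta):
    """Closed-form conditional maximum likelihood estimates (mu, lam, gamma)."""
    x, y = path[:-1], path[1:]  # lagged values X_{t-1} and current values X_t
    n = len(x)
    sx, sy = sum(x), sum(y)
    sxx = sum(a * a for a in x)
    sxy = sum(a * b for a, b in zip(x, y))
    syy = sum(b * b for b in y)

    mu = (sy * sxx - sx * sxy) / (n * (sxx - sxy) - (sx * sx - sx * sy))
    ratio = (sxy - mu * sx - mu * sy + n * mu * mu) / (sxx - 2.0 * mu * sx + n * mu * mu)
    lam = -math.log(ratio) / delta

    decay = math.exp(-lam * delta)
    gamma_hat_sq = (
        syy - 2.0 * decay * sxy + decay ** 2 * sxx
        - 2.0 * mu * (1.0 - decay) * (sy - decay * sx)
        + n * mu * mu * (1.0 - decay) ** 2
    ) / n
    # Convert the one-step variance back to the diffusion parameter gamma.
    gamma = math.sqrt(gamma_hat_sq * 2.0 * lam / (1.0 - decay ** 2))
    return mu, lam, gamma


# Hypothetical parameters: long-run mean 30, reversion speed 0.2, volatility 1.5.
path = simulate_ou(20000, 25.0, 30.0, 0.2, 1.5, 1.0, random.Random(1))
mu_hat, lam_hat, gamma_hat = fit_ou_mle(path, delta=1.0)
```

The logarithm in the estimator for $\lambda$ requires a positive lag-one ratio, so the function fails on series with no positive autocorrelation around the estimated mean.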


References

  1. Zhu, J.; Pollanen, M.; Abdella, K.; Cater, B. Modeling Drought Option Contracts. ISRN Appl. Math. 2012, 2012, 251835.
  2. Alexandridis, K.A.; Zapranis, A.D. Weather Derivatives: Modeling and Pricing Weather-Related Risk; Springer: New York, NY, USA, 2013.
  3. Dischel, B. Shaping history for weather risk management. Energy Power Risk 1999, 12, 13–15.
  4. Dorfleitner, G.; Wimmer, M. The pricing of temperature futures at the Chicago Mercantile Exchange. J. Bank Financ. 2010, 34, 1360–1370.
  5. Jewson, S.; Brix, A.; Ziehmann, C. Weather Derivative Valuation: The Meteorological, Statistical, Financial and Mathematical Foundations; Cambridge University Press: Cambridge, UK, 2005.
  6. Brody, D.; Syroka, J.; Zervos, M. Dynamical pricing of weather derivatives. Quant. Financ. 2002, 2, 189–198.
  7. Richards, T.J.; Manfredo, M.R.; Sanders, D.R. Pricing weather derivatives. Am. J. Agric. Econ. 2004, 86, 1005–1017.
  8. Geman, H.; Leonardi, M. Alternative approaches to weather derivatives pricing. Manag. Financ. 2005, 31, 46–72.
  9. Taylor, J.W.; Buizza, R. A comparison of temperature density forecasts from GARCH and atmospheric models. J. Forecast. 2004, 23, 337–355.
  10. Carmona, R. Calibrating degree day options. In Proceedings of the 3rd Seminar on Stochastic Analysis, Random Field and Applications, École Polytechnique de Lausanne, Ascona, Switzerland, 23 September 1999.
  11. Cao, M.; Wei, J. Pricing the weather. Risk 2000, 13, 67–70.
  12. McIntyre, R.; Doherty, S. Weather Risk—An example from the UK. Energy Power Risk Manag. 1999.
  13. Dornier, F.; Queruel, M. Caution to the wind. Weather risk special report. Energy Power Risk Manag. 2000, 13, 30–32.
  14. Bhowan, A. Temperature Derivatives. Ph.D. Thesis, University of Witwatersrand, Johannesburg, South Africa, 2003.
  15. Williams, C.B. Sequences of wet and of dry days considered in relation to the logarithmic series. Q. J. R. Meteorol. Soc. 1952, 78, 91–96.
  16. Longley, R.W. The length of dry and wet periods. Q. J. R. Meteorol. Soc. 1953, 79, 520–527.
  17. Chin, E.H. Modeling daily precipitation occurrence process with Markov chain. Water Resour. Res. 1977, 13, 949–956.
  18. Richardson, C.W. Stochastic simulation of daily precipitation, temperature, and solar radiation. Water Resour. Res. 1981, 17, 182–190.
  19. Wilks, D.S. Interannual variability and extreme-value characteristics of several stochastic daily precipitation models. Agric. For. Meteorol. 1999, 93, 153–169.
  20. Dennett, M.D.J.; Rodgers, J.A.; Keatinge, J.D.H. Simulation of a rainfall record for a new site of a new agricultural development: An example from northern Syria. Agric. Meteorol. 1983, 29, 247–258.
  21. Jones, P.G.; Thornton, P.K. Spatial and temporal variability of rainfall related to a third-order Markov model. Agric. For. Meteorol. 1997, 86, 127–138.
  22. Göncü, A. Modeling and pricing precipitation-based weather derivatives. Financ. Math. Appl. 2011, 1, 9–18.
  23. Tsakiris, G.; Pangalou, D.; Vangelis, H. Regional drought assessment based on the Reconnaissance Drought Index (RDI). Water Resour. Manag. 2007, 21, 821–833.
  24. Allen, R.G.; Pereira, L.S.; Raes, D.; Smith, M. Crop Evapotranspiration-Guidelines for Computing Crop Water Requirements-FAO; Irrigation and Drainage Paper 56; Food and Agriculture Organization of the United Nations: Rome, Italy, 1998.
  25. Allen, R.G.; Pruitt, W.O. Rational Use of The FAO Blaney-Criddle Formula. J. Irrig. Drain. Eng.-ASCE 1986, 112, 139–155.
  26. Hargreaves, G.H.; Samani, Z.A. Reference Crop Evapotranspiration from Temperature. Appl. Eng. Agric. 1985, 1, 96–99.
  27. Jensen, M.E.; Haise, H.R. Estimating Evapotranspiration from Solar Radiation. Proc. Am. Soc. Civ. Eng. J. Irrig. Drain. Div. 1963, 89, 15–41.
  28. Linacre, E.T. A Simple Formula for Estimating Evaporation Rates in Various Climates, using Temperature Data Alone. Agricult. Meteorol. 1977, 1, 409–424.
  29. Turc, L. Estimation of irrigation water requirements, potential evapotranspiration: A simple climatic formula evolved up to date. Ann. Agron. 1961, 12, 13–49.
  30. Van den Berg, T. Calibrating the Ornstein-Uhlenbeck (Vasicek) Model. Available online: (accessed on 8 March 2023).
  31. Gabriel, K.R.; Neumann, J. A Markov chain model for daily rainfall occurrence at Tel Aviv. Q. J. R. Meteorol. Soc. 1962, 88, 90–95.
  32. Katz, R.W. Computing probabilities associated with the Markov chain model for precipitation. J. Appl. Meteorol. Climatol. 1974, 13, 953–954.
  33. Hui, W.; Xuebin, Z.; Elaine, M.B. Stochastic Modeling of Daily precipitation for Canada. Atmospheric 2005, 43, 23–32.
  34. Hosking, J.R.M. The four-parameter Kappa distribution. IBM J. Res. Dev. 1994, 38, 251–258.
  35. Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Control. 1974, 19, 716–723.
  36. Schwarz, G. Estimating the Dimension of a Model. Ann. Stat. 1978, 6, 461–464.
Figure 1. Histogram of precipitation amount in Doha (mm).
Figure 2. Comparison of rainy days from the observation and simulation data.
Figure 3. Comparison of the precipitation histogram and K4D.
Table 1. Statistics for the comparison of $ET_0$ methods.

| | Blaney and Criddle | Hargreaves and Samani | Jensen and Haise | Linacre | Turc |
|---|---|---|---|---|---|
| $R^2$ | 0.5107 | 0.7283 | 0.5765 | 0.4367 | 0.7623 |
Table 2. Growing periods and crop coefficients in Qatar (FAO 56, 1998).

| Crop | Growing Period | Growth Stage | $k_c$ (Crop Coefficient) |
|---|---|---|---|
Table 3. Summary statistics of $ET_c$.
Table 4. Parameter values of the mean reversion process for the temperature.

| Month | $\mu$ | $\lambda$ | $\gamma$ |
|---|---|---|---|

Table 5. Parameter values of the mean reversion process for humidity.

| Month | $\mu$ | $\lambda$ | $\gamma$ |
|---|---|---|---|
Table 6. Steady-state probability vectors for precipitation occurrence in Doha.

| Month | Probability Vector | Month | Probability Vector |
|---|---|---|---|
| January | (0.965, 0.035) | July | (1, 0) |
| February | (0.971, 0.029) | August | (0.999, 0.001) |
| March | (0.97, 0.030) | September | (0.999, 0.001) |
| April | (0.976, 0.024) | October | (0.998, 0.002) |
| May | (0.993, 0.007) | November | (0.976, 0.024) |
| June | (1, 0) | December | (0.960, 0.040) |
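For reference, the steady-state vector of a two-state chain follows directly from its transition probabilities. The worked equation below is a sketch in our own notation, which is an assumption rather than the paper's: $p_{01}$ denotes the probability that a dry day is followed by a wet day, and $p_{11}$ the probability that a wet day is followed by a wet day.

$$P = \begin{pmatrix} 1 - p_{01} & p_{01} \\ 1 - p_{11} & p_{11} \end{pmatrix}, \qquad \pi P = \pi, \qquad \pi_{dry} + \pi_{wet} = 1,$$
$$\pi_{wet} = \pi_{dry}\,p_{01} + \pi_{wet}\,p_{11} \;\;\Longrightarrow\;\; \pi_{wet} = \frac{p_{01}}{1 + p_{01} - p_{11}}, \qquad \pi_{dry} = \frac{1 - p_{11}}{1 + p_{01} - p_{11}}.$$

A month with no observed wet days has $p_{01} = 0$, which gives the vector $(1, 0)$.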
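Table 7. Functions for the probability distributions.

| Distribution | Function |
|---|---|
| Exponential | $f(x; \lambda) = \lambda \exp(-\lambda x)$, $x \geq 0$; $\lambda > 0$ |
| Log-normal | $f(x; \mu, \sigma) = \dfrac{1}{x \sigma \sqrt{2\pi}} \exp\left( -\dfrac{(\ln x - \mu)^2}{2\sigma^2} \right)$, $x > 0$; $\sigma > 0$ |
| Gamma | $f(x; \alpha, \beta) = \dfrac{\beta^{-\alpha} x^{\alpha - 1} e^{-x/\beta}}{\Gamma(\alpha)}$, $x > 0$; $\alpha, \beta > 0$ |
| GEVD | $F(x; \xi, \mu, \alpha) = \exp\left\{ -\left[ 1 + \xi\,\dfrac{x - \mu}{\alpha} \right]^{-1/\xi} \right\}$, where $x \in \{ z \mid 1 + \xi (z - \mu)/\alpha > 0 \}$; $\alpha > 0$ |
| K4D | $F(x; \xi, \alpha, h, k) = \left\{ 1 - h \left[ 1 - \dfrac{k (x - \xi)}{\alpha} \right]^{1/k} \right\}^{1/h}$, $k \neq 0$, $h \neq 0$ (see [34] for a description of the support of K4D) |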
Table 8. Estimation of parameters in probability distributions.

| Distribution | Estimation of Parameters |
|---|---|
| Exponential | $\hat{\mu} = 0.108$ |
| Log-normal | $\hat{\mu} = 0.834$, $\hat{\sigma} = 1.510$ |
| Gamma | $\hat{\alpha} = 0.463$, $\hat{\beta} = 0.050$ |
| GEVD | $\hat{\mu} = 1.060$, $\hat{\alpha} = 1.322$, $\hat{\xi} = 1.364$ |
| K4D | $\hat{\xi} = 1.225$, $\hat{\alpha} = 1.424$, $\hat{h} = 0.375$, $\hat{k} = 1.329$ |
Table 9. Results of the KS test, AIC, and BIC.

| Distribution | KS Test (p-Value) | AIC | BIC |
|---|---|---|---|
| Exponential | $2.2 \times 10^{-16}$ | 1174.589 | 1177.793 |
| Log-normal | $9.937 \times 10^{-4}$ | 973.921 | 980.329 |
| Gamma | $1.467 \times 10^{-6}$ | 1075.677 | 1082.085 |
| GEVD | $1.467 \times 10^{-6}$ | 953.895 | 963.507 |
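The two information criteria differ only in their penalty terms, which gives a quick consistency check on the table. The sketch below assumes the standard definitions, AIC $= 2k - 2\ln\hat{L}$ [35] and BIC $= k \ln n - 2\ln\hat{L}$ [36], and the usual parameter counts $k$ (1 for the exponential, 2 for the log-normal and gamma, 3 for the GEVD). Under those assumptions BIC $-$ AIC $= k(\ln n - 2)$, so each row implies a sample size $n$. The function names are ours, and the implied $n$ is an inference from the table, not a figure stated in the text.

```python
import math


def aic(log_likelihood, k):
    """Akaike information criterion for a model with k fitted parameters."""
    return 2.0 * k - 2.0 * log_likelihood


def bic(log_likelihood, k, n):
    """Bayesian information criterion for k parameters and n observations."""
    return k * math.log(n) - 2.0 * log_likelihood


def implied_sample_size(aic_value, bic_value, k):
    """Solve BIC - AIC = k (ln n - 2) for n."""
    return math.exp((bic_value - aic_value) / k + 2.0)


# (AIC, BIC, assumed number of parameters) for the rows of Table 9.
table9 = {
    "Exponential": (1174.589, 1177.793, 1),
    "Log-normal": (973.921, 980.329, 2),
    "Gamma": (1075.677, 1082.085, 2),
    "GEVD": (953.895, 963.507, 3),
}

implied_n = {name: implied_sample_size(a, b, k) for name, (a, b, k) in table9.items()}
```

All four rows imply the same sample size of about 182 observations, which suggests that the criteria were computed on a common set of wet-day amounts. Lower AIC and BIC values indicate a better trade-off between fit and complexity, so among these four rows the GEVD ranks best.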
Table 10. Comparison of the summary statistics of the temperature (°C).
Table 11. Comparison of the summary statistics of humidity (%).
Table 12. Comparison of observation and simulation precipitation data.

| | Observation | Simulation 2 (K4D) |
|---|---|---|
| Q2 (Median) | 1.78 | 1.53 |
Table 13. Summary statistics of RDI in Doha.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Paek, J.; Pollanen, M.; Abdella, K. A Stochastic Weather Model for Drought Derivatives in Arid Regions: A Case Study in Qatar. Mathematics 2023, 11, 1628.
