Drought Prediction for Areas with Sparse Monitoring Networks: A Case Study for Fiji

Rhee, Jinyoung; Yang, Hongwei

doi:10.3390/w10060788

Open AccessArticle

Drought Prediction for Areas with Sparse Monitoring Networks: A Case Study for Fiji

by

Jinyoung Rhee

^*

and

Hongwei Yang

Climate Services and Research Department, APEC Climate Center, Busan 48058, Korea

^*

Author to whom correspondence should be addressed.

Water 2018, 10(6), 788; https://doi.org/10.3390/w10060788

Submission received: 18 May 2018 / Revised: 11 June 2018 / Accepted: 11 June 2018 / Published: 14 June 2018

(This article belongs to the Special Issue Statistical Analysis and Stochastic Modelling of Hydrological Extremes)

Download

Browse Figures

Versions Notes

Abstract

:

Hybrid drought prediction models were developed for areas with limited monitoring gauges using the APEC Climate Center Multi-Model Ensemble seasonal climate forecast and machine learning models of Extra-Trees and Adaboost. The models provide spatially distributed detailed drought prediction data of the 6-month Standardized Precipitation Index for the case study area, Fiji. In order to overcome the limitation of a sparse monitoring network, both in-situ data and bias-corrected dynamic downscaling of historical climate data from the Weather Research Forecasting (WRF) model were used as reference data. Performance measures of the mean absolute error as well as classification accuracy were used. The WRF outputs reflect the topography of the area. Hybrid models showed better performance than simply bias corrected forecasts in most cases. Especially, the model based on Extra-Trees trained using the WRF model outputs performed the best in most cases.

Keywords:

drought prediction; APCC Multi-Model Ensemble; seasonal climate forecast; machine learning; sparse monitoring network; Fiji

Graphical Abstract

1. Introduction

Islands in the South Pacific are vulnerable to climate change [1]. The climate in the South Pacific has become drier by 15% and warmer by 0.8 °C, compared to the earlier 20th century [2]. Fiji, one of the key Pacific Island countries, experiences easterly trade winds on most calendar days. The easterly trade winds or the northeasterly monsoon, when lifted by high mountains, causes moisture condensation and produces heavy rainfall on the windward eastern side of Fiji. The subsidence of the relatively dry air produces less rainfall on the leeward western side.

From a large-scale viewpoint, the El Nino Southern Oscillation (ENSO) is the main cause of climate variability over this region at interannual timescales. La Nina events dominated the interannual sea surface temperature (SST) anomaly (SSTA) over the central Equatorial Pacific during 1950 and 1975; after that time, El Nino events became more frequent [3]. The Pacific Decadal Oscillation (PDO) dominates the climate variability at decadal timescales [4]. PDO was mostly positive prior to 1998 and then shifted to a strong negative phase [5]. Positive PDO is characterized by the similar SSTA of El Nino over the Equatorial Pacific, and thus shifts the weather systems northeastward, but on a decadal timescale. The South Pacific Convergence Zone (SPCZ) is a reverse-oriented monsoon trough with strong low-level convergence and a rainfall band that extends from the Warm Pool southeastward to French Polynesia [6,7]. The interferential impact of ENSO and PDO on the SPCZ is complex [8,9]. El Nino events weaken the strength of the Walker Circulation and shift the dominant weather systems over the Equatorial Pacific toward areas in the northeast such as the SPCZ. When El Nino takes place during the positive PDO, the SPCZ moves northeast towards the equator, and its intensity becomes stronger [8]. The large-scale convection departure decreases precipitation over Fiji and leads to droughts [10].

Fiji has observed more frequent dry conditions since the 1950’s compared to previous decades in the western and northern areas based on analysis performed using the Standardized Precipitation Index (SPI). Analysis of observed monthly rainfall for Fiji over the period 1949–2008 showed downward trends at a 99% confidence level with decreases in rainfall of approximately 13–47 mm per year [11]. Although no significant long-term trends were observed in annual rainfall [12], there were more frequent dry seasons during the last 50 years compared to the first 50 years when the nearly 100 years of data since 1900 were examined [13]. The local temperature also increased due to the effects of climate change [14]. The most impacted stations were located in western and northern Fiji, where deficiency in rainfall from 1969–1988 caused an increase in moderate and severe droughts [11]. Risbey et al. [15] projected an increase in rainfall of approximately 3.3% by 2025 and 9.7% by 2100 using a global climate model (GCM). Feresi et al. [16] and Agrawala et al. [17] did not project a definitive change in rainfall. IPCC [18] projected that Fiji will experience an intensified seasonal cycle, i.e., a rainfall decrease in the dry season and a rainfall increase in the wet season. The shift towards extended periods of dry spells causes loss of soil fertility, which could impact negatively on agriculture [1].

Since 1940, severe droughts have occurred in 1942, 1958, 1969, 1978, 1983, 1987, 1992, 1997–1998, 2003, and 2010 [16]. Severe droughts can cause serious socio-economic loss as well as physical damages as drought conditions persist. The ENSO event of 1997–1998 caused a severe drought with damages of up to Fiji $100 million. Rainfall failure occurred across two successive dry seasons, and more significantly during the intervening wet season when precipitation is normally reliable [16]. Since many rural communities are reliant on rainwater, streams, and shallow wells for domestic use, watering crop gardens, and livestock, these communities are especially vulnerable to periods of drought when surface water resources are at a minimum [19]. Schools and businesses were forced to close and caused disruption to residential areas. Such impacts made extreme difficulties for Fiji since the resources of an island country are limited. External aid and governmental assistance were required to ensure supply of sustenance and facilitate recovery in the worst-hit parts of Fiji, which included the western and northern divisions and outer islands.

Drought conditions in Fiji are currently monitored using the 3-, 6-, and 12-month SPI calculated for weather stations with long historical data [20]. The monitoring network over Fiji with long data is quite sparse though, resulting in considerable uncertainty in the estimates of extreme wet and dry events. Evidence shows that estimation of the historical trends has a large noise-to-signal ratio over regions with sparse data networks [21]. Furthermore, most Fiji weather stations with long data are located along the coastline, so the sparse network cannot capture small-scale convective precipitation over land and precipitation from orographic lifting at mountains. Rainfall variability in the high mountains is greater than the variability in cities.

The limited variables and inconsistency in duration of satellite observation introduces difficulties and uncertainties in methods and analysis. For example, the Climate Prediction Center Morphing Technique (CMORPH) data is only available from 1998 onward. Due to the limited number or variables being observed, it is difficult to prepare for droughts because the response of rainfall distribution to large-scale dynamics is unclear. In addition, unlike other types of disasters, the onset and termination of droughts is not always clear. The increase in uncertainty of climate variability makes the reduction of drought impacts even more difficult.

Drought outlook of Fiji is also provided based on SPI: SPI predictions for weather stations are based on the statistically downscaled seasonal forecast data from the Seasonal Climate Outlooks for Pacific Island Countries developed by the Bureau of Meteorology of Australia. If spatially distributed drought prediction is available, possibly reflecting the orographic effect of the main island, it would be helpful to prevent and minimize the adverse impacts of droughts in Fiji. Drought prediction data only available for weather stations or obtained based on low-resolution bias-corrected seasonal forecast data are not sufficient for effective decision making.

This study aims to develop a drought prediction model that can be used for areas with sparse monitoring networks. Fiji is a case study area. By providing spatially detailed drought prediction data, vulnerability to droughts may be reduced while resiliency may be increased. Multi-Model Ensemble seasonal climate forecast data from APEC Climate Center (APCC MME) are used to provide up to 6 months-lead climate forecasting. Machine learning models are used to provide spatially distributed drought information for ungauged areas. In order to overcome the limitation of sparse monitoring networks, dynamically downscaled historical climate data from the Weather Research and Forecasting (WRF) model are used to train machine learning models instead of in-situ data as reference data.

This study ultimately targets national, provincial, and regional officials whose main duties include water resources and agricultural management. The final beneficiaries of the output are residents of the area; water users and farmers for whom decision-making can be helped by drought prediction information with finer spatial resolution.

2. Study Area

Fiji has a total area of about 194,000 km² of which approximately 10% is land. Fiji consists of 332 islands. The two largest islands are Viti Levu and Vanua Levu, which account for about three-quarters of the total land area of Fiji [22]. Figure 1 shows the topography of Fiji’s main islands. The largest island, Viti Levu, which has an area of 10,388 km², is covered with thick tropical forest. The island has a considerable area higher than 500 m in elevation with the peak of Mount Tomanivi at 1324 m above sea level. Viti Levu hosts the capital city of Suva, which contains about three-quarters of the population. Other important towns include Nadi, where the international airport is located, and Lautoka.

Fiji has a tropical marine climate and is warm year-round with minimal extremes. The warm season lasts from November to April and the cool season lasts from May to October. Temperatures in the cool season average 22 °C. Winds are moderate, though cyclones occur about once a year (10–12 times per decade). Viti Levu is a mountainous volcanic island with a wet-dry tropical climate. The southeast side of the island faces the predominant trade winds and therefore receives more precipitation than the northwest side, which is rain-shadowed by interior highlands. The volcanic mountains force orographic lifting of the saturated air, which can produce extremely heavy rainfall on the windward side of the mountain. Rainfall on the leeward side is much lighter due to the subsidence of the dry air, which largely influences agriculture in those areas. In the dry season, the uneven distribution of rainfall can cause a prolonged lack of moisture on the leeward side. The leeward side only receives 20% of the annual total rainfall in the dry season, compared to 33% received on the windward side [23].

Sugar export is an important source of foreign exchange for Fiji, as sugar cane processing makes up one-third of industrial activity. Coconut, ginger, and copra are also significant industries. These agricultural products are highly influenced by climate extremes; the sugar industry was damaged by drought in 1998.

3. Materials

3.1. In-Situ Data

Figure 2a shows the location of rainfall gauges of the two main islands used in this study (Table 1). In-situ rain-gauge hourly precipitation data for 1981–2010 were obtained and daily data for the period were used for the bias-correction of the WRF model. Monthly data were also used for calculating drought index values for the training of machine learning models. Some data were missing during a short period of time from gauges at Udu Point and Nabouwalu.

3.2. WRF Model Outputs

Dynamic downscaling of historical climate through the WRF model forced by the European Centre for Medium-Range Weather Forecasts Reanalysis (ERA)-Interim reanalysis dataset in a double nested framework with spectral nudging in the parent domain was used in this study [24]. Many validations show that the WRF outputs are pretty reliable. Precipitation data with 8 km spatial resolution for 1981–2010 were used in this study. Centroids of the 227 grid cells are shown in Figure 2b.

3.3. SPI

The SPI is widely used to characterize meteorological drought on a range of timescales [25,26] (Table 2). It quantifies observed precipitation as a standardized departure from a selected probability distribution function that models the raw precipitation data. The raw precipitation data are fitted to a gamma distribution, for example, and then transformed to a normal distribution. The SPI values can be interpreted as the number of standard deviations by which the observed anomaly deviates from the long-term mean. The SPI can be created for differing periods of 1 to 36 months, using monthly input data. The SPI can be compared across regions with markedly different climates. In this study, 6-month SPI (SPI6) was used to examine the performance of the drought prediction model developed, which is based on APCC MME up to 6 months-lead forecast data. SPI6 is also used by the Fiji Meteorological Service (FMS) to examine agricultural (soil moisture) and hydrological droughts because the 6-month droughts affect deeper rooted plants and medium-sized water bodies [27].

3.4. APCC MME Seasonal Climate Forecast

APCC produces the future 6-month global climate forecast using the MME technique, by collecting, standardizing, and utilizing climate prediction data from 17 different climate prediction organizations from all round the world. The MME technique collates data from different high quality climate models resulting in a better forecast than each climate model’s independent forecast. For this study, 6-month MME data produced by the Simple Composite Method (SCM) based on six individual models were obtained from the APEC Climate Data Service System [28]. The six individual climate models were APCC model, the Centro Euro-Mediterraneo sui Cambiamenti Climatici model, the Meteorological Service of Canada (MSC) model, the National Aeronautics and Space Administration (NASA) model, the National Centers for Environmental Prediction (NCEP) model, Pusan National University (PNU) model, and the Predictive Ocean Atmosphere Model for Australia.

3.5. Remote Sensing Data

3.5.1. PERSIANN-CDR

The drought prediction model developed in this study relies on remote sensing based precipitation data in order to compensate for the low spatial coverage of weather stations. To secure precipitation data covering a large enough area, the Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks (PERSIANN)-Climate Data Record (CDR) was used [29]. PERSIANN-CDR data were created based on infrared sensor data for the period with no microwave sensor data. The data cover 60° S–60° N, 180° W–180° E, with a spatial resolution of 0.25° × 0.25°. Daily data were obtained and converted to monthly total precipitation data.

3.5.2. TRMM

The tropical rainfall measuring mission (TRMM) was developed jointly by the United States (US) NASA and the Japan Aerospace Exploration Agency. The TRMM 3B42 product with 3-h data collection intervals was obtained from the NASA Goddard Earth Sciences Data and Information Service Center and converted to monthly total precipitation data. The TRMM data cover 50° S–50° N, 180° W–180° E, and have a spatial resolution of 0.25° × 0.25°. The data are in equirectangular (or geographic) projection with WGS84 datum.

3.5.3. GPM

The Integrated Multi-Satellite Retrievals for the Global Precipitation Measurement Mission (GPM) data were used as remote sensing based precipitation data from April 2014 onward. The data were obtained from the Precipitation Measurement Missions of NASA, and cover 90° S–90° N, 180° W–180° E, and have a spatial resolution of 0.1° × 0.1°. The data are also in equirectangular (or geographic) projection with WGS84 datum. The data were converted to monthly total precipitation data.

3.5.4. MODIS Land Surface Temperature

Daytime and nighttime land surface temperature (LST) data from the Level-3 standard product of the Moderate Resolution Imaging Spectroradiometer (MODIS) onboard the Aqua satellite, MYD11A2 LST and Emissivity 8-day L3 Global 1 km, were obtained from the Earth Observing System Data and Information System EARTHDATA of NASA from July 2002 to December 2016. MYD11A2 data are the average of daily MYD11A1 data of cloud-free days. Temporal and spatial resolutions of the data are 8-day and approximately 1 km × 1 km, respectively. The data are projected in Sinusoidal projection.

Since the time scale of the developed drought prediction model is monthly, the 8-day data were converted into monthly data using the number of days of the 8-day period for each month as weights. Mean LST (LST_MEAN) was also calculated from daytime LST (LST_DAY) and nighttime LST (LST_NIGHT).

3.5.5. MODIS Vegetation Indices

Vegetation indices of the Normalized Difference Vegetation Index (NDVI) and the Enhanced Vegetation Index (EVI) data were obtained from the Level-3 data of MODIS onboard Aqua, MYD13A3 Vegetation Indices Monthly L3 Global 1 km, from EARTHDATA of NASA from July 2002 to December 2016. Temporal and spatial resolutions are monthly and approximately 1 km × 1 km, respectively. The data are also projected in Sinusoidal projection.

The NDVI can be calculated using the changes in reflectance in red and near infrared (NIR) channels (Equation (1)) and has been widely used as an indicator of vegetation vigor [30]. The EVI uses the blue band in addition to red and NIR bands, minimizing the influence of the background effect of soil, snow, and water (Equation (2)). The EVI retains sensitivity to vegetation vitality, which is often shown saturated in the NDVI. The blue band helps to remove the atmospheric effect caused by air and clouds.

NDVI = \frac{NIR - RED}{NIR + RED}

(1)

EVI = 2 \times \frac{NIR - RED}{L + NIR + C 1 \times RED + C 2 \times BLUE}

(2)

where NIR, RED, and BLUE are reflectance values of NIR, RED, and BLUE channels, respectively; L is a parameter for reducing the background effect of canopy; C1 and C2 are weighting parameters to correct the influence of the aerosol effect of the red band when the blue and red bands are used together [31].

3.5.6. Elevation Data

Global 30 Arc-Second Elevation (GTOPO30) data with 1 km × 1 km spatial resolution were obtained from the US Geological Survey and used for the study area.

3.6. Large-Scale Climate Index

3.6.1. SPCZ

The SPCZ, a reverse-oriented monsoon trough, is a band of low-level convergence, cloudiness, and precipitation extending from the Western Pacific Warm Pool at the maritime continent southeastward toward French Polynesia and as far as the Cook Islands (160° W, 20° S). The SPCZ occurs where the southeast trade winds from transitory anticyclones to the south meet with the semi-permanent easterly flow from the eastern South Pacific anticyclone.

To study the SPCZ and its impacts on weather and climate over the South Pacific islands, previous studies suggested several SPCZ indices [8,32,33,34,35]. Here, we adopted the SPCZ strength index from Kidwell et al. [34] to quantify the impact of the SPCZ on rainfall over Fiji. The SPCZ region was encompassed in 0°–30° S, 130° E–110° W. The strength of the SPCZ is defined by the surface wind convergence in this region derived from the ERA-Interim. Divergence was calculated with Equation (3):

D (x, y) = \frac{\partial u}{\partial x} + \frac{\partial v}{\partial y}

(3)

where u and v are the zonal and meridional components of the surface winds. Positive D corresponds to surface divergence, and a negative value corresponds to surface convergence. The SPCZ strength is defined by the monthly mean area-weighted average of convergence within the SPCZ region:

s = \sum^{​} D (x, y) a (x, y) / \sum^{​} a (x, y)

(4)

where a(x,y) is the area of a grid cell centered at location (x,y), and the spatial summation

\sum^{​}

is performed over grid cells with D(x,y) < 0 within the SPCZ region. The anomaly of the SPCZ strength is defined as SPCZ index.

3.6.2. MEI

The ENSO is an irregularly periodic variation in winds and SST over the tropical eastern Pacific Ocean, affecting much of the tropics and subtropics. The warming phase is known as El Nino and the cooling phase as La Nina. Southern Oscillation is the accompanying atmospheric component, coupled with the sea temperature change; El Nino is accompanied with high air surface pressure while La Nina with low in the tropical western Pacific. The two periods last several months each (typically occurring every few years) and their effects vary in intensity. The Multivariate ENSO Index (MEI) from the National Oceanic and Atmospheric Administration (NOAA) were used as a measure of ENSO.

4. Methods

4.1. Drought Modeling

Mishra and Singh [36] reviewed a variety of drought modeling methods and described the components of drought modeling as hydro-meteorological variables, drought indices, climate indices, methodologies, and outputs. Among hydro-meteorological variables, rainfall is the most important variable for meteorological drought forecasting, soil moisture and crop yield are the key variables for agricultural drought forecasting, and stream flow and reservoir level are the most important variables for hydrological drought forecasting. Sometimes many variables are combined to obtain drought characteristics such as drought severity, duration, and spatial extent. Large-scale climate indices such as ENSO or the Arctic Oscillation (AO) index are used to forecast longer droughts. There can be many methods used, including regression models, time-series models, probability models, neural networks models, and statistical-dynamic models [36,37,38,39,40,41].

Recently, drought prediction methods using machine learning have been developed [42,43]. Rules required by expert systems can be developed either by human experts or derived by machines based on data provided by human beings; this training process is called machine learning [44]. Tadesse et al. [42] developed a rule-based regression tree model forecasting drought conditions and crop yield based on remotely sensed vegetation conditions, SPI, land use, available water capacity of soil, and irrigation areas. Rhee and Im [43] tested decision tree models, random forest models, and extra-trees models to forecast drought indices of the SPI and the Standardized Precipitation-Evapotranspiration Index in South Korea.

4.2. Machine Learning Model Design

As an indicator representing true drought conditions, the target variable was set as SPI6_OBS, which is reference SPI6 calculated either using in-situ precipitation data from four rainfall gauges or using the WRF model outputs from 227 pixel locations (Figure 3).

If we were to monitor current drought conditions, we may rely on SPI6_RS, which is SPI6 calculated from remote sensing based rainfall, since reference SPI6 is only available for the past or for some limited locations. However, there are usually gaps between SPI6_RS and SPI6_OBS. In order to explain or reduce the discrepancy, drought-affected input variables of LST_DAY, LST_NIGHT, LST_MEAN, NDVI, and EVI can be included to the model (Figure 3). Elevation (ELEV) can also be included to consider the topographical effect on rainfall, complementing the coarse spatial resolution of remotely sensed rainfall data (Figure 3).

Since the purpose of the model is drought prediction, long-range climate forecasting can be used to estimate the effect of synoptic and large-scale atmospheric circulation. While SPI6_RS was used for training machine learning models assuming perfect climate forecast, SPI6_FCST was used for test; SPI6_FCST is SPI6 calculated from bias-corrected precipitation data combining the percent increment of the rainfall anomaly of APCC MME and the climatology of remote sensing based rainfall [45] (Figure 3). A 6-month period of accumulated rainfall was divided into two periods according to the lead-time of the forecast; months with observed rainfall and months with forecasted rainfall. Remote sensing-based precipitation data were used as the observed rainfall, and bias-corrected precipitation forecast data were used as the forecasted rainfall. Parameters for the gamma probability distribution functions were pre-fitted based on remote sensing-based precipitation data and used for SPI6_FCST calculations.

Month of the data (MONTH) was also included for temporal information, and large-scale circulation indices of MEI and SPCZ strength (SPCZ) were also included (Figure 3).

Time points of data vary for 1 to 6-month lead drought prediction; initial points of data were used for remote sensing data and large-scale indices (for example, January 2017 values were used for 3-month lead predictions for April 2017), while target points of data were used for MONTH, SPI6_RS (training), and SPI6_FCST (test).

As the machine learning models, the Extra-Trees (ERT hereafter) [46] and the Adaboost [47] models were used in this study. The implementation was done using the Python library scikit-learn 0.18.1. ERT is known to produce stable results against outliers and noise in training data, and had excellent performance in drought forecasting [43]. Adaboost is a weak learner; it enables the model to simulate minor characteristics of training data by assigning higher weights to the subsets that are less reflected during its iteration processes.

The training of the models can be done either using in-situ data or using the WRF model outputs for SPI6_OBS. The models trained using SPI6_OBS based on in-situ data may not be appropriate to be used for other areas because data from only four weather stations are used and the models are trained specific to the locations. Two cases were compared; in one case, the models were trained using 80% of the WRF model outputs and evaluated using 20% of the data. In the other case, the models were trained using all in-situ data and evaluated using the same test dataset of the previous case. Numbers of data samples are shown in Table 3.

Although a three-tier approach of training, validation, and testing is often used to optimize parameters for some artificial intelligence models, we used a two-tier approach of training and testing with the fixed number of trees for ERT and Adaboost of 100 and the maximum depth of tree growth of 15 levels. Various numbers of trees and levels of maximum depth of tree growth had been tested using cross-validation of training data; the number of trees larger than 100 did not produce much difference. Although larger levels of maximum depth of tree growth tend to produce better results, the retrieval of the trained model with larger than 15 levels of maximum depth of tree growth including full development was very demanding of computational resources.

4.3. Data Pre-Processing

Remote sensing-based variables of LST_DAY, LST_NIGHT, LST_MEAN, NDVI, EVI, and ELEV were all subset to the extent of 176.5° E–178° W, 21.5° S–12.0° S and then resampled to have 0.01° × 0.01° spatial resolution. Since many machine learning models tend to be sensitive to the magnitudes of input variables, these data were scaled using maximum and minimum values of each month for each pixel [48].

Since SPI is inherently Gaussian, the numbers of input data for each drought category of Table 2 are not even. Because some machine learning models are known to be sensitive to the distribution of samples, the following process was performed when preparing input data: additional input data were created with added noise by multiplying the standard deviation of the variable for the location and month with a random number between 0 and 1, so that all drought categories have the same sample numbers during training.

The thirty-year period from 1981 to 2010 was used for calculating SPI. Due to the short history of MODIS, the input data from July 2002 to 2016 were used for the machine learning models.

4.4. Performance Measures

Information on drought index values or corresponding drought categories indicating the severity of drought can be more useful to users than just having binary information of drought or non-drought. Performance measures used in this study include: Total Accuracy, which is the producer’s accuracy, and mean absolute error (MAE) for all drought categories in Table 2 (total MAE hereafter). Although there may not be enough serious drought events during the short study period from July 2002 to 2016, performance measures only for the three drier categories of Extreme Drought, Severe Drought, and Moderate Drought were also used: Drought Accuracy, which is a modified producer’s accuracy in Rhee and Im [43] focusing on the three drier categories, and MAE for the three drier categories (Drought MAE hereafter).

Total or Drought Accuracy = \frac{\sum^{​} C}{\sum^{​} N}

(5)

Total or Drought MAE = \frac{\sum^{​} | SPI 6_{obs} - SPI 6_{pred} |}{Total Number of Samples}

(6)

where N is the number of samples for each category, and C is the number of correctly categorized samples for each category. All categories are considered for Total Accuracy and Total MAE, while the three drier categories are considered for Drought Accuracy and Drought MAE.

5. Results and Discussion

5.1. Training of the Models

The machine learning models of ERT and Adaboost were trained using 80% of the WRF model outputs (ERT_WRF and Adaboost_WRF hereafter) or using 100% of the in-situ data (ERT_INSITU and Adaboost_INSITU hereafter). The performance of SPI6 predictions from simply bias-corrected precipitation forecast (FCST_ONLY hereafter) based on the same training dataset of the WRF model outputs was compared to the performance of ERT and Adaboost (Figure 4). Differences in MAE between methods were also statistically tested using two-sided or one-sided Welch’s t-test for both Total MAE and Drought MAE.

Both ERT_WRF and Adaboost_WRF outperformed FCST_ONLY in most cases, and Total MAE and Drought MAE values of ERT_WRF were especially small (Figure 4a,b). The differences were all statistically significant based on two-tailed p-values with a confidence level of 0.01 (data not shown). Only Adaboost_WRF with 1-month lead predictions showed larger Drought MAE than FCST_ONLY based on one-sided t-test (Figure 4b). ERT_WRF outperformed FCST_ONLY and Adaboost_WRF based on one-sided t-test (data not shown).

In terms of Total Accuracy and Drought Accuracy, ERT_WRF was much higher compared to FCST_ONLY for all lead times (Figure 4c,d). However, Adaboost_WRF could not perform better than FCST_ONLY in terms of Total Accuracy of 2-month lead predictions and Drought Accuracy of 1- and 2-month lead predictions (Figure 4c,d).

We could see that ERT_INSITU is overly fitted based on zero or near-zero Total MAE and Drought MAE values and perfect Total Accuracy and Drought Accuracy, despite the large number of trees (Figure 4). It is not very surprising since the numbers of samples for all categories and the three drier categories are not large; smaller than 270 and 40, respectively (Table 3). Both ERT_INSITU and Adaboost_INSITU outperformed FCST_ONLY in all cases (Figure 4). Total MAE and Drought MAE of ERT_WRF were larger than ERT_INSITU because of possible overfitting based on one-sided t-test, no difference in MAE was found between ERT_WRF and Adaboost_INSITU.

Scatter plots of reference SPI6 vs. 1-month lead SPI6 predictions for training are shown in Figure 5.

5.2. Test of the Models

The performance of SPI6 predictions of the machine learning models (ERT_WRF, Adaboost_WRF, ERT_INSITU and Adaboost_INSITU) as well as FCST_ONLY was evaluated based on the remaining 20% of the WRF model outputs (Figure 6). Differences in MAE between methods were also statistically tested.

ERT_WRF showed the smallest Total MAE, and the differences between ERT_WRF and all other methods were statistically significant based on one-sided t-test with the confidence interval of 0.01 (Figure 6a; p-values are not shown). Adaboost_WRF also produced smaller Total MAE compared to FCST_ONLY for 1- to 4-month lead predictions, while the differences were not statistically significant for 5- and 6-month lead predictions (two-tailed p-values are 0.031 and 0.026, respectively). Even ERT_INSITU and Adaboost_INSITU produced significantly smaller Total MAE than FCST_ONLY for 1- to 3-month lead predictions (Figure 6a). Cases that failed to reject the null hypothesis of equal mean error with FCST_ONLY are shaded (Figure 6a).

In contrast to training where Drought MAE of FCST_ONLY was mostly the largest (Figure 4c), Drought MAE of FCST_ONLY was mostly the smallest for all lead times with the test dataset (Figure 6c). Cases that failed to reject the null hypothesis of equal or larger mean error with FCST_ONLY are shaded based on two-tailed and one-tailed p-values, meaning only these cases produce comparable Drought MAE to FCST_ONLY (Figure 6c; data not shown). The one-sided t-test with the null hypothesis of larger error of FCST_ONLY in all other cases was rejected, meaning that they produced larger Drought MAE in most cases (Figure 6c).

There were no obvious differences observed in Total Accuracy between the methods; Total Accuracy of ERT_WRF was the highest for all lead times (Figure 6b). FCST_ONLY produced higher Drought Accuracy for 1-month lead SPI6 predictions, while ERT_WRF performed the best for longer-term predictions (Figure 6d). The selection of training data (WRF model outputs versus in-situ data), the selection of a prediction model (FCST_ONLY versus machine learning models of ERT and Adaboost), and the lead time had the greatest effect on Drought Accuracy (Figure 6d).

Scatter plots of reference SPI6 vs. 1-month as well as 3-month lead SPI6 predictions for testing are shown in Figure 7 and Figure 8, respectively.

5.3. Spatial Distribution Maps of SPI6 Predictions

Spatially distributed maps of 1- to 6-month lead SPI6 predictions based on FCST_ONLY and ERT_WRF were created. Some examples are shown in Figure 9; in order to provide the WRF-based SPI6 map used for training machine learning models as well as in-situ SPI6 map with available data from all four weather stations, 21 months with all data available were identified. Although no extreme drought events were observed in the 21 months, Nadi (91680) station experienced severe droughts in March, June, July, and October 2010.

The WRF-based SPI6 (Figure 9a,b), 1-month lead SPI6 predictions based on FCST_ONLY (Figure 9c,d), and 1-month lead SPI6 predictions based on ERT_WRF (Figure 9e,f) for March and June of 2010 are shown. Four weather stations are also shown with SPI6 based on observation data for March and June, 2010 (Figure 9). Only Nadi station was in severe drought in March and June of 2010 (SPI6 = −1.51 and −1.94, respectively). In March 2010, Udu Point and Suva stations were in moderate drought (SPI6 = −1.37 and −1.05, respectively) while Nabouwalu station was in near normal condition (SPI6 = −0.67). In June, Udu Point station was in moderate drought (SPI6 = −1.48) while Nabouwalu and Suva stations were in near normal condition (SPI6 = −0.45 and −0.52, respectively).

5.4. Relative Importance of Input Variables to Machine Learning Models

Python modules for machine learning models provide information on the relative importance of input variables. The importance of the most important variable is set to 100% and relative importance scores of other input variables are determined. In all cases, the most important variable was SPI6_RS in this study, and only the scores of other input variables are shown in Figure 10.

When in-situ precipitation data were used for reference data, the relative importance of all other input variables was quite low; the score of the second important variable MEI only ranges between 4% and 8% for ERT_INSITU (Figure 10c). For Adaboost_INSITU, the scores of input variables vary with lead time, but all were below 20% (Figure 10d). The importance of temporal (MONTH) and topographical (ELEV) information as well as large-scale climate indices (SPCZ, MEI) were more obvious when the WRF model outputs were used for reference data (Figure 10a,b). For ERT_WRF, the scores of MONTH, MEI, and SPCZ were higher than other input variables, mostly over 20% (Figure 10a). The scores of those three variables as well as ELEV were higher for Adaboost_WRF; the score for MONTH even reached about 55% (Figure 10b).

Differences in the relative importance of the input variables between the sources of reference data indicate that temporal characteristics of drought occurrences and the effect of ENSO, SPCZ strength, as well as topography of the region could not be adequately applied to the models when in-situ data were used for reference data, because in-situ data from only few stations are available. The use of the WRF model output precipitation data, on the other hand, enabled the use of diverse information from those variables.

6. Conclusions

We developed hybrid drought prediction models using APCC MME seasonal climate forecasts and machine learning models and examined their performance for the case study area of Fiji. The purpose of the models is to provide spatially distributed detailed drought prediction data of SPI6 for the area. The APCC MME provides up to 6-month lead precipitation forecast data. Remote sensing data were used to bias-correct the forecast data as well as to train machine learning models; machine learning models of ERT and Adaboost were used to provide spatially distributed drought information for ungauged areas. In order to overcome the limitation of sparse monitoring network, dynamic downscaling of historical climate with the WRF model was used to produce reference data.

When compared to the performance of the hybrid models trained based on different reference data, the models trained using the WRF model outputs performed better than the models trained using in-situ data: ERT_WRF outperformed ERT_INSITU in all cases, and Adaboost_WRF outperformed Adaboost_INSITU except for Drought MAE and Drought Accuracy of 1-month lead predictions, Total MAE and Total Accuracy of 2-month lead predictions, and Total Accuracy of 3-month lead predictions. The superiority of the models trained based on the WRF model outputs indicates that the spatial extent of the training data is important because in-situ data are from only four weather stations. The added value caused by the topography is clear, especially in the convergence/divergence field over the islands; this crucially impacted inland and coastal precipitation and caused greater detail in precipitation to be found in the WRF model outputs [24].

The use of the ERT_WRF model produced better results compared to Adaboost_WRF in terms of Total MAE, Total Accuracy, and Drought Accuracy for all lead times, as well as in terms of Drought MAE of 1-month lead predictions. For other lead times, no statistical difference between ERT_WRF and Adaboost_WRF were found (2- to 4-month lead predictions) or ERT_WRF showed larger error than Adaboost_WRF (5- to 6-month lead predictions) in terms of Drought MAE. It shows that the choice of the machine learning model matters; the use of simulated input data with added noise to attain the same numbers of samples between drought categories may have improved the performance of ERT and surpassed the advantage of Adaboost, supporting weak learners.

Compared to FCST_ONLY, ERT_WRF performed better in terms of Total MAE and Total Accuracy for all lead times as well as in terms of Drought Accuracy for 2- to 6-month lead predictions. Although there was no statistically significant difference for 1-month and 3-month lead predictions in terms of Drought MAE and the error of ERT_WRF was larger for 2-month and 4- to 6-month lead predictions, Drought Accuracy of ERT_WRF for 2- to 6-month lead predictions was higher than FCST_ONLY. The hybrid model, especially ERT_WRF, showed good performance compared to simply bias corrected forecasts.

Hybrid models with better performance than simply bias corrected forecasts in most cases for areas with sparse monitoring networks were successfully developed. It should be noted that the performance of the compared methods may be evaluated differently according to the purpose of the study with the appropriate choice of a performance measure. In future studies, the use of more diverse input variables related to drought for machine learning models need to be investigated. Only SPI based on precipitation data was examined in this study; drought prediction based on drought indices considering the effect of evapotranspiration, such as the Standardized Precipitation-Evapotranspiration Index [49] or the Standardized Evapotranspiration Deficit Index [50], may also help to reduce vulnerability to droughts.

Author Contributions

J.R. designed this study, led data analysis and manuscript writing, and served as the corresponding author. H.Y. contributed to the selection of variables, data production, discussion of results, and manuscript writing.

Funding

This research received no external funding.

Acknowledgments

This research was supported by the APEC Climate Center. Authors are thankful to the Fiji Meteorological Service for the provision of data, information, and constructive comments.

Conflicts of Interest

The authors declare no conflict of interest.

References

Intergovernmental Panel on Climate Change. Climate Change 2007: The Physical Science Basis: Working Group I Contribution to the Fourth Assessment Report of the Intergovernmental Panel on Climate Change; Solomon, S., Qin, D., Manning, M., Chen, Z., Marquis, M., Averyt, K.B., Tignor, M., Miller, H.L., Eds.; Cambridge University Press: Cambridge, UK; New York, NY, USA, 2007; 996p. [Google Scholar]
Hay, J.E.; Mimura, N.; Campbell, J.; Fifita, S.; Koshy, K.; McLean, R.F.; Nakalevu, T.; Nunn, P.; de Wet, N. Climate Variability and Change and Sea-Level Rise in the Pacific Islands Region: A Resource Book for Policy and Decision Makers, Educators and Other Stakeholders; South Pacific Regional Environment Programme: Apia, Samoa, 2003; 108p. [Google Scholar]
Trenberth, K.E.; Hoar, T.J. The 1990–1995 El Niño-Southern Oscillation event: Longest on record. Geophys. Res. Lett. 1996, 23, 57–60. [Google Scholar] [CrossRef]
Mantua, N.J.; Hare, S.R.; Zhang, Y.; Wallace, J.M.; Francis, R.C. A Pacific interdecadal climate oscillation with impacts on salmon production. Bull. Am. Meteorol. Soc. 1997, 78, 1069–1079. [Google Scholar] [CrossRef]
Mantua, N.J.; Hare, S.R. The Pacific decadal oscillation. J. Oceanogr. 2002, 58, 35–44. [Google Scholar] [CrossRef]
Bergeron, T. Richtlinien einer dynamischen Klimatologie. Meteorol. Z. 1930, 47, 246–262. [Google Scholar]
Trenberth, K.E. Spatial and temporal variations of the Southern Oscillation. Q. J. R. Meteorol. Soc. 1976, 102, 639–653. [Google Scholar] [CrossRef]
Folland, C.K.; Renwick, J.A.; Salinger, M.J.; Mullan, A.B. Relative influences of the interdecadal Pacific oscillation and ENSO on the South Pacific convergence zone. Geophys. Res. Lett. 2002, 29, 1643. [Google Scholar] [CrossRef]
Hu, Z.-Z.; Huang, B. Interferential impact of ENSO and PDO on dry and wet conditions in the US Great Plains. J. Clim. 2009, 22, 6047–6065. [Google Scholar] [CrossRef]
Nicholls, N.; Wong, K.K. Dependence of rainfall variability on mean rainfall, latitude, and the Southern Oscillation. J. Clim. 1990, 3, 163–170. [Google Scholar] [CrossRef]
Deo, R.C. On meteorological droughts in tropical Pacific Islands: Time-series analysis of observed rainfall using Fiji as a case study. Meteorol. Appl. 2011, 18, 171–180. [Google Scholar] [CrossRef]
Mataki, M.; Koshy, K.C.; Lal, M. Baseline climatology of Viti Levu (Fiji) and current climatic trends. Pac. Sci. 2006, 60, 49–68. [Google Scholar] [CrossRef]
Kumar, R.; Stephens, M.; Weir, T. Rainfall trends in Fiji. Int. J. Climatol. 2014, 34, 1501–1510. [Google Scholar] [CrossRef]
Kumar, R.; Stephens, M.; Weir, T. Temperature trends in Fiji: A clear signal of climate change. S. Pac. J. Nat. Appl. Sci. 2013, 31, 27–38. [Google Scholar]
Risbey, J.S.; Lamb, P.J.; Miller, R.L.; Morgan, M.C.; Roe, G.H. Exploring the structure of regional climate scenarios by combining synoptic and dynamic guidance and GCM output. J. Clim. 2002, 15, 1036–1050. [Google Scholar] [CrossRef]
Feresi, J.; Kenny, G.J.; de Wet, N.; Limalevu, L.; Bhusan, J.; Ratukalou, I. Climate Change Vulnerability and Adaptation Assessment for Fiji; Technical Report; The International Global Change Institute, University of Waikato: Hamilton, New Zealand, 2000; 135p. [Google Scholar]
Agrawala, S.; Ota, T.; Risbey, J.; Hagenstad, M.; Smith, J.; van Aalst, M.; Koshy, K.; Prasad, B. Development and Climate Change in Fiji: Focus on Coastal Mangroves; Organisation for Economic Co-operation and Development: Paris, France, 2003; 56p. [Google Scholar]
Intergovernmental Panel on Climate Change. Climate Change 2013: The Physical Science Basis: Working Group I Contribution to the Fifth Assessment Report of the Intergovernmental Panel on Climate Change; Stocker, T.F., Qin, D., Plattner, G.-K., Tignor, M., Allen, S.K., Boschung, J., Nauels, A., Xia, Y., Bex, V., Midgley, P.M., Eds.; Cambridge University Press: Cambridge, UK; New York, NY, USA, 2013; 1535p. [Google Scholar]
Terry, J.P.; Raj, R. The 1997–98 El Niño and drought in the Fiji Islands. In Hydrology and Water Management in the Humid Tropics, Proceedings of the Second International Colloquium, Panama, Republic of Panama, 22–26 March 1999; UNESCO: Paris, France, 2002; pp. 80–93. [Google Scholar]
ENSO Update. Available online: http://www.met.gov.fj/ENSO_Update.pdf (accessed on 3 November 2017).
Yin, H.; Donat, M.G.; Alexander, L.V.; Sun, Y. Multi-dataset comparison of gridded observed temperature and precipitation extremes over China. Int. J. Climatol. 2015, 35, 2809–2827. [Google Scholar] [CrossRef]
Derrick, R.A. The Fiji Islands: A Geographical Handbook; Fiji Government Printing Department: Suva, Fiji, 1957; 68p.
Terry, J.P.; Raj, R. Hydrological drought in western Fiji and the contribution of tropical cyclones. In Climate and Environmental Change in the Pacific; Terry, J.P., Ed.; School of Social and Economic Development, University of the South Pacific: Suva, Fiji, 1998; pp. 73–85. [Google Scholar]
Rhee, J.; Yang, H. Development of a Drought Forecast Model for Fiji Based on High-Resolution Dynamic Downscaling of Climate Data and Machine Learning of Long-Range Climate Forecast and Remote Sensing Data; APEC Climate Center Research Report 2017-04; APEC Climate Center: Busan, South Korea, 2018; 63p. [Google Scholar]
Guttman, N.B. Accepting the standardized precipitation index: A calculation algorithm. J. Am. Water Resour. Assoc. 1999, 35, 311–322. [Google Scholar] [CrossRef]
McKee, T.B.; Doesken, N.J.; Kleist, J. The relationship of drought frequency and duration to time scales. In Proceedings of the 8th Conference on Applied Climatology, Anaheim, CA, USA, 17–22 January 1993; American Meteorological Society: Boston, MA, USA, 1993; pp. 179–183. [Google Scholar]
Fiji Meteorological Service. Personal communication, 2017.
APEC Climate Data Service System (ADSS). Available online: http://adss.apcc21.org (accessed on 7 July 2017).
Ashouri, H.; Hsu, K.-L.; Sorooshian, S.; Braithwaite, D.K.; Knapp, K.R.; Cecil, L.D.; Nelson, B.R.; Prat, O.P. PERSIANN-CDR: Daily precipitation climate data record from multisatellite observations for hydrological and climate studies. Bull. Am. Meteorol. Soc. 2015, 96, 69–83. [Google Scholar] [CrossRef]
Tucker, C.J. Red and photographic infrared linear combinations for monitoring vegetation. Remote. Sens. Environ. 1979, 8, 127–150. [Google Scholar] [CrossRef] [Green Version]
Liu, H.Q.; Huete, A. A feedback based modification of the NDVI to minimize canopy background and atmospheric noise. IEEE Trans. Geosci. Remote 1995, 33, 457–465. [Google Scholar]
Borlace, S.; Santoso, A.; Cai, W.; Collins, M. Extreme swings of the South Pacific Convergence Zone and the different types of El Niño events. Geophys. Res. Lett. 2014, 41, 4695–4703. [Google Scholar] [CrossRef] [Green Version]
Cai, W.; Lengaigne, M.; Borlace, S.; Collins, M.; Cowan, T.; McPhaden, M.J.; Timmermann, A.; Power, S.; Brown, J.; Menkes, C.; et al. More extreme swings of the South Pacific convergence zone due to greenhouse warming. Nature 2012, 488, 365–369. [Google Scholar] [CrossRef] [PubMed]
Kidwell, A.; Lee, T.; Jo, Y.-H.; Yan, X.-H. Characterization of the variability of the South Pacific convergence zone using satellite and reanalysis wind products. J. Clim. 2016, 29, 1717–1732. [Google Scholar] [CrossRef]
Vincent, E.M.; Lengaigne, M.; Menkes, C.E.; Jourdain, N.C.; Marchesiello, P.; Madec, G. Interannual variability of the South Pacific Convergence Zone and implications for tropical cyclone genesis. Clim. Dyn. 2011, 36, 1881–1896. [Google Scholar] [CrossRef]
Mishra, A.K.; Singh, V.P. Drought modeling–A review. J. Hydrol. 2011, 403, 157–175. [Google Scholar] [CrossRef]
Leilah, A.A.; Al-Khateeb, S.A. Statistical analysis of wheat yield under drought conditions. J. Arid Environ. 2005, 61, 483–496. [Google Scholar] [CrossRef]
Steinemann, A.C. Using climate forecasts for drought management. J. Appl. Meteorol. Climatol. 2006, 45, 1353–1361. [Google Scholar] [CrossRef]
Morid, S.; Smakhtin, V.; Bagherzadeh, K. Drought forecasting using artificial neural networks and time series of drought indices. Int. J. Climatol. 2007, 27, 2103–2111. [Google Scholar] [CrossRef] [Green Version]
Han, P.; Wang, P.X.; Zhang, S.Y.; Zhu, D.H. Drought forecasting based on the remote sensing data using ARIMA models. Math. Comput. Model. 2010, 51, 1398–1403. [Google Scholar] [CrossRef]
Ribeiro, A.; Pires, C.A. Seasonal drought predictability in Portugal using statistical-dynamical techniques. Phys. Chem. Earth 2016, 94, 155–166. [Google Scholar] [CrossRef]
Tadesse, T.; Brown, J.F.; Hayes, M.J. A new approach for predicting drought-related vegetation stress: Integrating satellite, climate, and biophysical data over the US central plains. ISPRS J. Photogramm. 2005, 59, 244–253. [Google Scholar] [CrossRef]
Rhee, J.; Im, J. Meteorological drought forecasting for ungauged areas based on machine learning: Using long-range climate forecast and remote sensing data. Agric. For. Meteorol. 2017, 237, 105–122. [Google Scholar] [CrossRef]
Jensen, J.R.; Lulla, K. Introductory Digital Image Processing: A Remote Sensing Perspective; Taylor & Francis: Milton Park, UK, 1987; 65p. [Google Scholar]
Quan, X.-W.; Hoerling, M.P.; Lyon, B.; Kumar, A.; Bell, M.A.; Tippett, M.K.; Wang, H. Prospects for dynamical prediction of meteorological drought. J. Appl. Meteorol. Clim. 2012, 51, 1238–1252. [Google Scholar] [CrossRef]
Geurts, P.; Ernst, D.; Wehenkel, L. Extremely randomized trees. Mach. Learn. 2006, 63, 3–42. [Google Scholar] [CrossRef] [Green Version]
Freund, Y.; Schapire, R.E. A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 1997, 55, 119–139. [Google Scholar] [CrossRef]
Rhee, J.; Im, J.; Carbone, G.J. Monitoring agricultural drought for arid and humid regions using multi-sensor remote sensing data. Remote Sens. Environ. 2010, 114, 2875–2887. [Google Scholar] [CrossRef]
Vicente-Serrano, S.M.; Beguería, S.; López-Moreno, J.I. A Multiscalar drought index sensitive to global warming: The Standardized Precipitation Evapotranspiration Index. J. Clim. 2010, 23, 1696–1718. [Google Scholar] [CrossRef]
Kim, D.; Rhee, J. A drought index based on actual evapotranspiration from the Bouchet hypothesis. Geophys. Res. Lett. 2016, 43, 10277–10285. [Google Scholar] [CrossRef]

Figure 1. Topography of Fiji’s main islands (color shades are in units of meters).

Figure 2. Location of (a) the rainfall gauges; and (b) the centroids of the Weather Research and Forecasting (WRF) model outputs.

Figure 3. Flow diagram of the drought prediction model.

Figure 4. Training performance (a) Total MAE; (b) Drought MAE; (c) Total Accuracy; and (d) Drought Accuracy of SPI6 predictions from simply bias-corrected precipitation forecast (FCST_ONLY), Extra-Trees (ERT) and Adaboost trained using 80% of the WRF model outputs (ERT_WRF and Adaboost_WRF), and ERT and Adaboost trained using 100% of in-situ data (ERT_INSITU and Adaboost_INSITU).

Figure 5. Scatter plots of reference SPI6 vs. 1-month lead SPI6 predictions for training based on (a) FCST_ONLY; (b) ERT_WRF; (c) Adaboost_WRF; (d) ERT_INSITU; and (e) Adaboost_INSITU. Reference SPI6 are based on 80% of the WRF model outputs from (a) to (c) and 100% of in-situ data for (d,e).

Figure 6. Test performance (a) Total MAE; (b) Drought MAE; (c) Total Accuracy; and (d) Drought Accuracy of SPI6 predictions from simply bias-corrected precipitation forecast (FCST_ONLY), ERT and Adaboost trained using 80% of the WRF model outputs (ERT_WRF and Adaboost_WRF), and ERT and Adaboost trained using 100% of in-situ data (ERT_INSITU and Adaboost_INSITU). Test was performed using the 20% remaining WRF model outputs.

Figure 7. Scatter plots of reference SPI6 vs. 1-month lead SPI6 predictions for testing based on (a) FCST_ONLY; (b) ERT_WRF; (c) Adaboost_WRF; (d) ERT_INSITU; and (e) Adaboost_INSITU. Reference SPI6 are based on 20% of the WRF model outputs.

Figure 8. Scatter plots of reference SPI6 vs. 3-month lead SPI6 predictions for testing based on (a) FCST_ONLY; (b) ERT_WRF; (c) Adaboost_WRF; (d) ERT_INSITU; and (e) Adaboost_INSITU. Reference SPI6 are based on 20% of the WRF model outputs.

Figure 9. Spatial distribution maps of 1-month lead SPI6 predictions for March 2010 and June 2010 and WRF-based SPI6.

Figure 10. Relative importance scores of input variables to machine learning models for (a) ERT_WRF; (b) Adaboost_WRF; (c) ERT_INSITU; and (d) Adaboost_INSITU.

Table 1. Fiji rainfall gauges used in the analysis.

Observation Sites	Latitude	Longitude
Udu Point (91652)	16.13° S	180.02° E
Nabouwalu (91659)	16.98° S	178.70° E
Nadi (91680)	17.75° S	177.45° E
Suva (91690)	18.15° S	178.45° E

Table 2. Drought categories based on Standardized Precipitation Index (SPI) [26].

Classification	Index Value
Extremely wet (EW)	≥2.00
Very wet (VW)	1.50 to 1.99
Moderately wet (MW)	1.00 to 1.49
Near Normal (NN)	0.99 to −0.99
Moderate drought (MD)	−1.00 to −1.49
Severe drought (SD)	−1.50 to −1.99
Extreme drought (ED)	≤−2.00

Table 3. Numbers of data samples used for training and testing.

Source	Type	Lead Time (Month)	Number of Samples
Source	Type	Lead Time (Month)	All Categories	Three Drier Categories
WRF model output	Train (80%)	1	16,693	1767
		2	16,545	1787
		3	16,379	1776
		4	16,211	1762
		5	16,043	1761
		6	15,875	1792
	Test (20%)	1	4169	470
		2	4132	445
		3	4091	445
		4	4049	447
		5	4006	456
		6	3964	424
In-situ data	All	1	266	37
		2	264	37
		3	262	37
		4	260	37
		5	258	37
		6	256	36

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Rhee, J.; Yang, H. Drought Prediction for Areas with Sparse Monitoring Networks: A Case Study for Fiji. Water 2018, 10, 788. https://doi.org/10.3390/w10060788

AMA Style

Rhee J, Yang H. Drought Prediction for Areas with Sparse Monitoring Networks: A Case Study for Fiji. Water. 2018; 10(6):788. https://doi.org/10.3390/w10060788

Chicago/Turabian Style

Rhee, Jinyoung, and Hongwei Yang. 2018. "Drought Prediction for Areas with Sparse Monitoring Networks: A Case Study for Fiji" Water 10, no. 6: 788. https://doi.org/10.3390/w10060788

APA Style

Rhee, J., & Yang, H. (2018). Drought Prediction for Areas with Sparse Monitoring Networks: A Case Study for Fiji. Water, 10(6), 788. https://doi.org/10.3390/w10060788

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Drought Prediction for Areas with Sparse Monitoring Networks: A Case Study for Fiji

Abstract

1. Introduction

2. Study Area

3. Materials

3.1. In-Situ Data

3.2. WRF Model Outputs

3.3. SPI

3.4. APCC MME Seasonal Climate Forecast

3.5. Remote Sensing Data

3.5.1. PERSIANN-CDR

3.5.2. TRMM

3.5.3. GPM

3.5.4. MODIS Land Surface Temperature

3.5.5. MODIS Vegetation Indices

3.5.6. Elevation Data

3.6. Large-Scale Climate Index

3.6.1. SPCZ

3.6.2. MEI

4. Methods

4.1. Drought Modeling

4.2. Machine Learning Model Design

4.3. Data Pre-Processing

4.4. Performance Measures

5. Results and Discussion

5.1. Training of the Models

5.2. Test of the Models

5.3. Spatial Distribution Maps of SPI6 Predictions

5.4. Relative Importance of Input Variables to Machine Learning Models

6. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI