Using a Hierarchical Approach to Calibrate SWAT and Predict the Semi-Arid Hydrologic Regime of Northeastern Brazil

The Paraguaçu watershed in northeastern Brazil faces increasing water scarcity, with water resources unable to meet the increasing demand. Accurate assessment of water availability is thus essential for efficient planning and management of local resources. In this work, the potential of the SWAT model for predicting daily and monthly variability of the hydrologic regime of the Paraguaçu River was assessed. Model calibration/validation followed: (i) A hierarchical framework; (ii) the assessment of maximum, average and minimum streamflow based on paired t-test and linear regression analysis; and (iii) the definition of permanence curves for streamflow with a probability of occurrence of 90% (Q90) and 95% (Q95). The goodness-of-fit indicators revealed a “satisfactory” model performance (model efficiency ranged from 0.42 to 0.83) when predicting streamflow in monitored sub-basins using a unique set of parameters for wet and dry conditions. The flow duration curves also showed that the model underestimated higher flows resulting from extreme events but performed well for flows with exceedance probabilities of <90%. The regression analysis and paired t-test demonstrated that the SWAT model can be used for estimating maximum, average and minimum monthly streamflow in a region where information is insufficient to support water authorities in the decision-making process. The SWAT model can thus be considered adequate for simulating monthly streamflow in the Paraguaçu watershed.


Introduction
The Paraguaçu watershed in northeastern Brazil faces increasing water scarcity due to population growth, agricultural consumption and climate change. The watershed is an essential part of Bahia's fluvial system, with reservoirs supplying water to approximately 3.9 million people. Accurate assessment of streamflow at the watershed scale is of vital importance since the current seasonal and interannual discharge of the Paraguaçu River has not been able to meet the existing demand for water [1]. In this context, a quantitative understanding of hydrological parameters (rainfall and streamflow) and their spatial and temporal variability is essential for efficient planning and management of local water resources [2].
Hydrological modeling is normally considered a powerful tool for accurately estimating water availability at the watershed scale. Physically based hydrological models have been shown to be particularly suitable for simulating basin response to changing conditions, including climate variability, land use and management practices [3-5]. The Soil and Water Assessment Tool (SWAT) [6] is a good example, being widely applied to assess the impact of climate and land management on hydrological processes, including streamflow and sediment and nutrient dynamics at the watershed scale [7-9]. However, its application to the Brazilian semiarid region has been rather scarce, although interest has been rising in recent years.
The application of the SWAT model is often particularly challenging due to the need to calibrate a large number of parameters that describe the processes under analysis. The most common approach to model calibration is to optimize the parameters until deviations between predictions and historical observations are minimized. Validation then follows, with predictions being compared with an independent dataset to assess the reliability of the optimized model parameters. Klemeš [10] discussed different validation tests and proposed a hierarchical testing framework in which different approaches are ordered according to the possibility of transferring model results to different land uses, climates and other forms of environmental change. These calibration/validation tests are divided into four major categories: (i) the Split-Sample test, in which the model is calibrated using data from one period and then applied with the same calibrated parameters to simulate a different period (validation); (ii) the Differential Split-Sample test, where the same approach is followed but the data are divided into two periods with contrasting climate characteristics (e.g., high/low average precipitation); (iii) the Proxy-Basin test, where the model is calibrated and validated at different sites, which may or may not be in the same basin but have similar edapho-climatic characteristics; and (iv) the Proxy-Basin Differential Split-Sample test, which is applied in cases where the model is supposed to be both geographically and climatically (or land-use-wise) transposable.
While the SWAT model has been applied in many Brazilian watersheds, model calibration/validation has followed simple approaches that have limited its use in improving water resources management and planning at the watershed scale, namely in the Paraguaçu watershed. The choice of the hydrological model and of the calibration/validation process is fundamental for the reliability of model results and for improving the decision-making process at the basin scale. It should be emphasized that the longer the time interval used for calibrating/validating the model, the more reliable the results are [11]. This is especially true for studies of land use/cover and climate change conducted in semiarid regions, where water resources management is even more critical and necessary. Few reliable studies exist for the Paraguaçu basin that have carried out a comprehensive calibration and validation of the modeled processes using different techniques, especially over periods of more than ten years.
The objective of this study was thus to verify the potential of the SWAT model for predicting the hydrologic regime and its possible use in the planning and management of streamflow in the Paraguaçu River basin. In this research, the SWAT model was calibrated/validated using different approaches: (i) the hierarchical scheme proposed by Klemeš [10]; (ii) the assessment of the maximum, average and minimum streamflow based on the paired t-test and linear regression analysis [12]; and (iii) the analysis of the permanence curve for the Q90 and Q95 flows. The evaluation techniques were tested in the three sub-basins of the upstream Paraguaçu region (Andaraí, Fortém and Iguaçu), considering calibration/validation periods of no less than ten years and both daily and monthly timescales. The results are expected to support improved water resources management and planning in the Paraguaçu watershed, with the calibration approach being transferable to other semiarid regions of the world.

Study Area
The Paraguaçu watershed is located in the Bahia state, northeastern Brazil (Figure 1). The basin covers an area of 55,317 km² (10.14% of the territory) and has an altitude ranging from 0 to 1800 m (11 direction towards "Todos os Santos" bay. According to the Köppen classification, the climate in the Paraguaçu River Basin varies from semi-arid (BSh) in the central part (67% of the area) to equatorial (Af) in the west and southeast areas. As a result, annual rainfall varies from less than 600 mm in the former area to 1400 mm in the latter regions [13]. The Paraguaçu River further constitutes the most important fluvial system in the Bahia state and includes five reservoirs: Pedra do Cavalo (with a storage capacity of 5330 hm³), Bandeira de Melo (318 hm³), Apertado (204 hm³), França (59 hm³) and São José do Jacuípe (355 hm³).

Soil and Water Assessment Tool
The SWAT (Soil and Water Assessment Tool) model version 2012 [6] was used to simulate the soil water balance between 1993 and 2012. SWAT is a continuous, time-dependent, long-term, physically based, semi-distributed model, developed to assess the impacts of climate and land management on hydrological processes, sediment loading and pollution transport. The hydrologic cycle is based on the computation of the daily water balance, as follows:

$$SW_t = SW_0 + \sum_{i=1}^{t} \left( R_{day} - Q_{surf} - ET_a - Q_{deep} - Q_{gw} \right)$$

where $SW_t$ is the final soil water content (mm), $SW_0$ is the initial soil water content (mm), $t$ is the simulation period (days), $R_{day}$ is the precipitation (mm), $Q_{surf}$ is the surface runoff (mm), $ET_a$ is the actual evapotranspiration (mm), $Q_{deep}$ is the water drained into the deep aquifer (mm) and $Q_{gw}$ is the return flow (mm), all referring to day $i$. The overall water balance is generated at the level of hydrologic response units (HRUs), that is, areas with the same land-use and soil characteristics [6]. In this study, potential evapotranspiration ($ET_p$) rates were estimated using the Penman-Monteith method [15] and partitioned into potential plant transpiration ($T_p$) and potential soil evaporation ($E_p$) rates based on Ritchie [16]. Surface runoff was computed from daily precipitation using a modification of the Soil Conservation Service Curve Number (SCS-CN) method [17]. Groundwater recharge was estimated by combining a storage routing technique and a crack-flow model. Lateral flow was simulated using a kinematic storage method [6].
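As a rough illustration of the daily water balance described above (final soil water content equals the initial content plus the accumulated daily precipitation minus surface runoff, actual evapotranspiration, deep-aquifer drainage and return flow), the bookkeeping can be sketched in a few lines of Python. This is only a sketch of the accounting, not SWAT's implementation: the `DailyFluxes` container, the `soil_water_content` function and the numbers in the usage lines are hypothetical and were chosen purely for illustration.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class DailyFluxes:
    """Hypothetical container for the fluxes of one day in one HRU (all in mm)."""
    r_day: float   # precipitation
    q_surf: float  # surface runoff
    et_a: float    # actual evapotranspiration
    q_deep: float  # water drained into the deep aquifer
    q_gw: float    # return flow


def soil_water_content(sw0: float, days: Iterable[DailyFluxes]) -> float:
    """Accumulate the daily balance: SW_t = SW_0 + sum(R - Qsurf - ETa - Qdeep - Qgw)."""
    sw = sw0
    for d in days:
        sw += d.r_day - d.q_surf - d.et_a - d.q_deep - d.q_gw
    return sw


# Illustrative values only (not taken from the study).
example_days = [
    DailyFluxes(r_day=12.0, q_surf=1.5, et_a=3.0, q_deep=0.5, q_gw=1.0),
    DailyFluxes(r_day=0.0, q_surf=0.0, et_a=2.5, q_deep=0.2, q_gw=0.8),
]
final_sw = soil_water_content(100.0, example_days)
```

In SWAT itself each flux is produced by its own sub-model (SCS-CN runoff, Penman-Monteith evapotranspiration, and so on); the sketch only shows how those daily terms are combined.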

Model Data Inputs
The digital elevation model (DEM) was obtained from the Shuttle Radar Topography Mission (SRTM), with a spatial resolution of 30 m (Figure 3). Daily precipitation, maximum and minimum temperature, relative humidity, solar radiation and wind speed were used in model simulations for computing streamflow and evapotranspiration rates. Precipitation data were obtained from Agência Nacional de Águas (ANA), while the remaining weather variables were taken from the Instituto Nacional de Meteorologia (INMET) (Table 2). Quality control assessment for precipitation and streamflow data can be found in ANA [18]. The grid of meteorological variables from Xavier et al. [19] was also used due to the high spatial variability of rainfall and the small number of pluviometric stations in the region. The daily discharge data at the Andaraí, Iguaçu and Fortém hydrometric stations (Table 3) were obtained from ANA, while the volumes of water extracted from the river channel and reservoirs were taken from the Instituto de Meio Ambiente e Recursos Hídricos da Bahia [20]. The soil map (Figure 2) was downloaded from Empresa Brasileira de Pesquisa Agropecuária [14], while the soil's physical and chemical properties were collected from local literature [21-24]. The land cover map was obtained from Globcover 2009 [25]. The Brazilian Institute of Geography and Statistics (IBGE) [26] provided the crop information for the Mucugê and Ibicoará irrigated areas (Andaraí basin) between 2004 and 2012. These regions represent the largest irrigated area of the Paraguaçu watershed.

Calibration and Validation
The SWAT model calibration/validation was carried out for daily and monthly timesteps by comparing simulated and observed streamflows following the hierarchical approach proposed by Klemeš [10]. The following tests were performed:
(1) Split-Sample test: This test was carried out in the Andaraí basin. Two time periods were selected for this analysis: a calibration period (1989-2001) and a validation period (2002-2013). Beforehand, the model was warmed up using the data set from 1985-1988. This test included wet and dry years, defined as calendar years with annual precipitation above and below the average for the 1989-2013 period (866 mm·year⁻¹), respectively.
(2) Differential Split-Sample test: This test was also performed in the Andaraí basin and consisted of calibrating streamflow during wet years and validating it during dry years, and vice versa. Two data periods were selected: the wet period from 2004-2006 and the dry period from 2007-2013. This test is required whenever a model is to be used for simulating the effect of climate change [27]. If the climatic change corresponds to a transition to a warmer, wetter scenario, the model should be calibrated on a dry, cool data set and then validated for the other extreme. Due to the temporal variation of the precipitation regime (Figure 4), the dataset chosen for the dry period included two years in which precipitation was above the average (866 mm·year⁻¹). A comparison between the first (2004-2006) and second (2007-2013) parts of the data series showed that the mean monthly precipitation decreased from 96 to 61 mm (Figure 4), corresponding to a reduction of 36%.
(3) Proxy-Basin test: The Iguaçu and Fortém sub-basins were selected as having land use characteristics similar to those in Andaraí. The model was first calibrated in the Andaraí sub-basin and validated in the Iguaçu and Fortém sub-basins. This test made it possible to evaluate the model's ability to predict the impacts of land use changes in a region where flow data do not exist [10].
For each test, the SWAT model was calibrated by manually modifying one factor at a time with the aim of finding the set of factors that produced the best results in terms of the global water balance in the basin (Table 4). The hydrographs were analyzed, and the parameters that affected flow peaks and baseflow (that is, the behavior of the flow curve) were selected and modified until deviations between model outputs and measured flow data were minimized. The calibrated parameters were then validated by comparing simulation results with an independent measurement set.
Four commonly used goodness-of-fit indicators were used to evaluate model performance: the determination coefficient (R²), the percent bias (PBIAS), the ratio of the root mean square error to the standard deviation of measured data (RSR) and the Nash-Sutcliffe model efficiency (NSE) [28], as follows:

$$R^2 = \left[ \frac{\sum_{i=1}^{n} (O_i - \bar{O})(P_i - \bar{P})}{\sqrt{\sum_{i=1}^{n} (O_i - \bar{O})^2}\,\sqrt{\sum_{i=1}^{n} (P_i - \bar{P})^2}} \right]^2$$

$$PBIAS = \frac{\sum_{i=1}^{n} (O_i - P_i)}{\sum_{i=1}^{n} O_i} \times 100$$

$$RSR = \frac{\sqrt{\sum_{i=1}^{n} (O_i - P_i)^2}}{\sqrt{\sum_{i=1}^{n} (O_i - \bar{O})^2}}$$

$$NSE = 1 - \frac{\sum_{i=1}^{n} (O_i - P_i)^2}{\sum_{i=1}^{n} (O_i - \bar{O})^2}$$

where $P_i$ are the model simulated values, $O_i$ are the observed data, the overbar denotes the mean for the entire evaluation period, $n$ is the number of data pairs and $i = 1, 2, \ldots, n$. R² is an indicator of the strength of the relationship between the observed and simulated values and varies between 0 and 1. R² values >0.5 are regarded as acceptable for model simulation [29]. The PBIAS measures the tendency of the simulated data to be larger or smaller than their observed counterparts. Absolute PBIAS values <10% are considered very good; values between 10% and 15% are considered good; values between 15% and 25% are considered satisfactory; and values ≥25% are unsatisfactory [29]. RSR incorporates the benefits of error index statistics and includes a scaling/normalization factor, so that the resulting statistic and reported values can apply to various constituents. RSR varies from the optimal value of 0, which indicates zero RMSE or residual variation and therefore perfect model simulation, to a large positive value. For monthly values, RSR values ≤0.50 are considered very good; values of 0.50 < RSR ≤ 0.60 are considered good; values of 0.60 < RSR ≤ 0.70 are considered satisfactory; and values >0.70 are considered unsatisfactory [29]. The NSE is a normalized statistic that determines the relative magnitude of the residual variance compared to the measured data variance, ranging from −∞ to 1. An NSE value of 1 corresponds to the best fit between simulated and measured data; values >0.75 are considered excellent; values between 0.36 and 0.75 are considered satisfactory; and values <0.36 are unsatisfactory [29].
Additionally, streamflow data were submitted to the Kolmogorov-Smirnov normality test (p < 0.05). Considering that the data presented a normal distribution, the parametric paired t-test was used to verify the hypothesis that the observed and simulated flow data did not differ significantly (p < 0.05) [12]. The data analysis further consisted of comparing simulated and observed streamflow values within each quartile to assess the ability of the model to describe the maximum, average and minimum streamflow. Governmental institutions generally use the first quartile (the lowest 25% of monthly streamflow) as a reference in assessing water resource availability for multiple uses.
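The four goodness-of-fit indicators named above (R², PBIAS, RSR and NSE) can be computed from paired observed and simulated series with a short script. The sketch below uses the definitions that are common in the SWAT evaluation literature; in particular, it assumes the PBIAS sign convention in which positive values indicate underestimation by the model. The function names, including `classify_nse`, are hypothetical and are not part of SWAT; `classify_nse` only encodes the NSE classes quoted in the text.

```python
import math
from typing import Sequence


def _mean(values: Sequence[float]) -> float:
    return sum(values) / len(values)


def r_squared(obs: Sequence[float], sim: Sequence[float]) -> float:
    """Squared Pearson correlation between observed and simulated values."""
    o_bar, p_bar = _mean(obs), _mean(sim)
    cov = sum((o - o_bar) * (p - p_bar) for o, p in zip(obs, sim))
    ss_o = sum((o - o_bar) ** 2 for o in obs)
    ss_p = sum((p - p_bar) ** 2 for p in sim)
    return cov ** 2 / (ss_o * ss_p)


def pbias(obs: Sequence[float], sim: Sequence[float]) -> float:
    """Percent bias; positive means underestimation (assumed sign convention)."""
    return 100.0 * sum(o - p for o, p in zip(obs, sim)) / sum(obs)


def rsr(obs: Sequence[float], sim: Sequence[float]) -> float:
    """Root mean square error divided by the standard deviation of observations."""
    o_bar = _mean(obs)
    sq_err = sum((o - p) ** 2 for o, p in zip(obs, sim))
    sq_dev = sum((o - o_bar) ** 2 for o in obs)
    return math.sqrt(sq_err) / math.sqrt(sq_dev)


def nse(obs: Sequence[float], sim: Sequence[float]) -> float:
    """Nash-Sutcliffe efficiency; 1 is a perfect fit, values can be negative."""
    o_bar = _mean(obs)
    sq_err = sum((o - p) ** 2 for o, p in zip(obs, sim))
    sq_dev = sum((o - o_bar) ** 2 for o in obs)
    return 1.0 - sq_err / sq_dev


def classify_nse(value: float) -> str:
    """NSE classes as quoted in the text: >0.75, 0.36 to 0.75, <0.36."""
    if value > 0.75:
        return "excellent"
    if value >= 0.36:
        return "satisfactory"
    return "unsatisfactory"


# Illustrative series only (not data from the study).
observed = [1.0, 2.0, 3.0, 4.0]
simulated = [0.9 * o for o in observed]
summary = {
    "R2": r_squared(observed, simulated),
    "PBIAS": pbias(observed, simulated),
    "RSR": rsr(observed, simulated),
    "NSE": nse(observed, simulated),
}
```

With these definitions NSE equals 1 − RSR², so the two indicators rank simulations identically; they are usually reported together because their published rating thresholds differ.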

Model Calibration and Validation
Table 4 lists the default and calibrated parameters used in model simulations of the water balance in the Paraguaçu River basin. The SWAT model was shown to be highly sensitive to surface runoff parameters (CN2, SOL_AWC, SOL_Z and SOL_K), in line with Neitsch et al. [6], who stated that these were some of the most sensitive parameters for river discharge simulations. The available water capacity of the soil layer (SOL_AWC), saturated hydraulic conductivity (SOL_K) and soil depth (SOL_Z) influenced soil moisture and thus surface runoff. The CN2 default values of 35-89 were calibrated to the range 45-78, which is still in line with USDA-SCS [17]. The greatest uncertainties were found in soil parameters due to the large spatial variability observed in the basin as well as the difficulties in obtaining accurate and reliable soil information, a common limitation in most regions of the world. For example, SOL_K was made to vary between 2 and 35 mm/h, while SOL_AWC underwent a relatively large change, with inputs nonetheless being considered reliable as soils in the region have coarse to medium textures in the top layers [14]. The calibration of groundwater flow and baseflow ended up being controlled by the groundwater delay (GW_DELAY), revaporation coefficient (GW_REVAP) and deep aquifer recharge (GW_RCHRG_DP) parameters. Reducing GW_DELAY affected both the width of the peak discharge and the quantity of water available for baseflow [6]. GW_REVAP controlled the amount of water moving from the shallow aquifer to the root zone as a result of soil moisture depletion and the amount of direct groundwater uptake by deep-rooted trees and shrubs [6]. GW_RCHRG_DP controlled percolation from the root zone to the deep aquifer. Finally, the effective hydraulic conductivity of the main channel (CH_K2) governed the movement of water from the streambed to the subsurface for ephemeral or transient streams, which, according to Veith et al. [30], is a highly sensitive parameter when the watershed is characterized by intense and inconsistent precipitation events, as in the Andaraí sub-basin. Despite a relatively extensive model calibration, all parameters were adjusted within their physically meaningful ranges, indicating that the model inputs were reasonable.
Overall, the performance of the SWAT model was satisfactory, although streamflow peaks were underestimated during both calibration and validation. The largest difference was found in March 2011, when the model simulated values of 16.0 m³/s while the observed ones reached 68.0 m³/s, that is, producing an error of 328% for that day. The underestimation of streamflow peaks was associated with the type of rainfall and the representation of its spatial and temporal distribution. According to Vieira et al. [31], the precipitation that occurs in the region is orographic due to the high elevation observed in some areas of the watershed, while Cho et al. [32] note that the sparse and heterogeneous spatial distribution of rain gauges often results in inaccurate precipitation inputs for SWAT, especially when modeling large river basins or those with complex terrain such as mountainous regions, where the assumption of spatially uniform rainfall is not valid.
For the Andaraí sub-basin, the NSE values calculated from daily results were 0.49 and 0.42, while the monthly data produced values of 0.86 and 0.76 for the calibration and validation periods, respectively (Table 5). These values suggest a satisfactory model performance for daily predictions and an excellent performance for monthly estimates [29]. The R² values were above 0.5, indicating that the model explained the variability of the observed daily and monthly data. The PBIAS indicated slight deviations between the mean simulated and observed streamflow, showing that the model underestimated the daily and monthly observations by 10% and 3-6% during the calibration and validation periods, respectively. The RSR resulted in very good indicators (RSR < 0.5) for monthly flows, whereas for daily simulations RSR values were higher than 0.7. Thus, the performance of the SWAT model in estimating the mean streamflow proved to be excellent for monthly flows, with all goodness-of-fit indicators also being within the range of values reported in the literature for similar applications of this model [33,34]. As such, the model could be considered fit for improving the management of streamflow in the Paraguaçu watershed.
The primary requirement of the Differential Split-Sample test is a long record of data from which periods with different historical conditions can be chosen. For this test, the model adequately represented the behavior of the observed flows, although it underestimated peaks during both the calibration and validation periods (Figure 6), in line with existing literature [35,36]. The NSE values reached 0.47 and 0.50 for daily results and 0.83 and 0.63 for monthly outputs during the calibration and validation periods (Table 6), respectively, which means that the model was again found appropriate for simulating streamflow. Concerning the PBIAS, the model overestimated streamflow by 2-3% during wetter years (calibration) and underestimated streamflow by 17-20% during drier years (validation). Likewise, the RSR produced very good results for the monthly wet period but only satisfactory ones for the drier period. Nonetheless, the model was considered to operate under wet and dry conditions within acceptable margins of error. Muleta [37] reported that SWAT-calibrated parameters for the headwaters of the Little River watershed were largely different between dry and wet periods. Li et al. [38], using another model, found that model efficiency in the dry period was consistently lower than that in the wet period. Finally, Lévesque et al. [39] followed a seasonal calibration scheme in which winter and summer data were used to calibrate the model in two watersheds in southeastern Canada. Those authors reported that the best performance was obtained in the dry period using summer observations to calibrate the model; however, when the wet period observations were used, no advantage was achieved compared with the traditional calibration method based on all available data. Regarding the Andaraí sub-basin, calibrated parameters were found to be conservative when applied to wet or dry periods.
The Proxy-Basin test returned NSE values (Table 7) ranging from 0.44 (Iguaçu) to 0.36 (Fortém) for daily outputs, while monthly results produced NSE values ranging from 0.75 (Iguaçu) to 0.80 (Fortém). The daily and monthly PBIAS values ranged from 10-12% and 8.5-9.1% in Fortém and Iguaçu, respectively, indicating an underestimation of observed data in both basins. The RSR values again showed very good simulations of monthly flows. Thus, the model had an overall satisfactory performance in Iguaçu and Fortém concerning monthly calibration data and an excellent performance for monthly validation data [29].
The Proxy-Basin Differential Split-Sample test was performed in the same catchments as the previous test. Table 8 shows the satisfactory performance of the SWAT model for simulating streamflow in dry and wet periods. The monthly PBIAS values ranged from 8-21% in wet and dry periods, respectively, always indicating an underestimation of observed data. RSR values oscillated between very good and satisfactory results. On the other hand, the NSE values varied between 0.50 and 0.80, indicating an acceptable model performance.
After model calibration, the soil water balance was assessed to confirm the reliability of model estimates in the three simulated sub-basins (Table 9). Annual actual evapotranspiration ($ET_a$) was the main component of the soil water balance due to the characteristics of the semi-arid climate, varying between 69 and 86% of the total annual precipitation, in line with other semi-arid regions of the world [5,40-42]. $ET_a$ values further varied according to land use (Table 10) and water availability. The highest $ET_a$ values were observed in Andaraí and Fortém, where the highest precipitation values were registered. Yet, large variability was observed in $ET_a$ values, in line with precipitation. Runoff varied between 3.6 and 11.2% of the total annual precipitation, being registered mainly during storm events. Baseflow and lateral flow reached 7.4 to 14.3% of the total annual precipitation, being essential for maintaining river flow throughout the year. Similar results can be found in Bouraoui et al. [43].

Estimate of Maximum, Average and Minimum Monthly Streamflow
Figure 7 shows the results of the regression analysis between monthly simulated and observed streamflow for the 1989-2013 period, with data separated by quartiles. The regression coefficient showed that the model tended to overestimate minimum streamflow by 4-7% (Figure 7(A1,B1,C1), the first quartile). The model also underestimated the average (Figure 7(B1,B2,B3), the second quartile) and maximum (Figure 7(C1,C2,C3), the third quartile) streamflow at all stations by 4-17%. Likewise, Table 11 shows the results of the paired t-test and regression analysis applied to observed and simulated monthly streamflow, also separated by quartiles. Differences between observations and simulations were not significant for the first, second and third quartiles, except for the second and third quartiles at the Fortém station. These results showed that the SWAT model can be used for the estimation of maximum, average and minimum monthly streamflow in this sub-basin, with potential use in hydrological studies and water management.
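A quartile-wise comparison of this kind can be sketched as follows. The text does not specify how the quartile classes were built or which regression form was fitted, so the sketch makes two assumptions that should be read as such: pairs are grouped by the quartile of the observed flow (using `statistics.quantiles`), and the regression is a line through the origin, whose slope is read as the mean over- or underestimation ratio. All function names are hypothetical. The paired t statistic is returned without a p-value; it would still need to be compared against a critical value of the t distribution with n − 1 degrees of freedom.

```python
import math
import statistics
from typing import List, Sequence, Tuple

Pair = Tuple[float, float]  # (observed, simulated)


def split_by_observed_quartiles(obs: Sequence[float],
                                sim: Sequence[float]) -> List[List[Pair]]:
    """Group (observed, simulated) pairs into four classes by observed-flow quartile."""
    q1, q2, q3 = statistics.quantiles(obs, n=4)
    groups: List[List[Pair]] = [[], [], [], []]
    for o, s in zip(obs, sim):
        idx = 0 if o <= q1 else 1 if o <= q2 else 2 if o <= q3 else 3
        groups[idx].append((o, s))
    return groups


def slope_through_origin(pairs: Sequence[Pair]) -> float:
    """Least-squares slope of sim = b * obs; b > 1 suggests overestimation."""
    return sum(o * s for o, s in pairs) / sum(o * o for o, _ in pairs)


def paired_t_statistic(pairs: Sequence[Pair]) -> float:
    """t statistic of the mean paired difference (sim - obs).

    Raises ZeroDivisionError if all differences are identical.
    """
    diffs = [s - o for o, s in pairs]
    std_err = statistics.stdev(diffs) / math.sqrt(len(diffs))
    return statistics.mean(diffs) / std_err


# Illustrative monthly flows only (not data from the study).
obs_flows = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
sim_flows = [1.1, 2.1, 2.8, 3.9, 4.6, 5.5, 6.2, 7.0]
quartile_groups = split_by_observed_quartiles(obs_flows, sim_flows)
slopes = [slope_through_origin(g) for g in quartile_groups if g]
```

Grouping by the observed series keeps each simulated value paired with the observation it is meant to reproduce, which is what the paired t-test requires.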
The analysis of SWAT model results for improving management and planning of streamflow is usually also carried out by analyzing maximum, minimum and reference flows based on historical series [44]. Hydrological data in the Paraguaçu watershed and other semi-arid regions of Brazil are quite limited or even nonexistent, which makes the definition of a reference flow a difficult task when models are not used, and thus increases the interest of the outcomes of this study as a valuable alternative. The permanence curve is one of the available tools to determine water availability in a river basin. Figure 8 shows the observed and simulated permanence curves in the calibration and validation stages for the period 1989 to 2013, on a monthly scale.
The flow duration curves show that the model underestimated higher flows more than lower ones in Andaraí, Iguaçu and Fortém. For example, the curve for simulated monthly data closely matched the observed curve for exceedance probabilities of <90%. However, for streamflows with exceedance probabilities >90% (i.e., low flows), differences in predictive capability were evident. In all the sub-basins, the Q90 and Q95 flows were overestimated by at most 24% and 16%, respectively (Table 12). The maximum error for Q90 was found in the Andaraí sub-basin, which has the largest irrigated area; farmers' irrigation management at the plot scale remains partially unknown there, which adds uncertainty to the Q90 estimate. In general, errors were of small magnitude for other areas. The SWAT model thus adequately simulated flow duration curves for the three sub-basins of the Paraguaçu watershed, demonstrating its value as an auxiliary tool in water resources management. The model errors observed while simulating minimum and reference flows were within a range considered acceptable in hydrology [45]. However, errors in simulating maximum extreme values, which came close to 50%, indicated that the model is not suitable for estimating extreme events, precluding its use in dimensioning flood control infrastructure. Pereira et al. [45] stated that errors up to 30% in simulated flow are considered satisfactory for the concession of water use in irrigated areas of Brazil. Therefore, the SWAT model results in this study can be used for improving management and planning of streamflows at the watershed scale.
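The reference flows read from a permanence curve, and the relative errors of the kind reported in Table 12, can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the paper does not specify how the exceedance probabilities were computed, so linear interpolation between ranked flows is assumed here, and the function names and flow values are hypothetical.

```python
def exceedance_flow(flows, exceedance_pct):
    """Flow equalled or exceeded `exceedance_pct` percent of the time.

    Q90 is therefore the 10th percentile of the series. Linear
    interpolation between ranked values is an assumption of this sketch.
    """
    if not flows:
        raise ValueError("flows must not be empty")
    if not 0 <= exceedance_pct <= 100:
        raise ValueError("exceedance_pct must be between 0 and 100")
    ordered = sorted(flows)  # ascending: low flows first
    position = (len(ordered) - 1) * (100 - exceedance_pct) / 100
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = position - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


def relative_error_pct(simulated_value, observed_value):
    """Signed error in percent; positive means overestimation."""
    return 100.0 * (simulated_value - observed_value) / observed_value


# Hypothetical monthly flows (m3/s), for illustration only.
observed_flows = [2.1, 3.4, 1.2, 8.9, 15.3, 4.4, 0.9, 6.7, 2.8, 1.6, 11.0, 5.2]
simulated_flows = [2.4, 3.1, 1.5, 7.8, 12.9, 4.6, 1.1, 6.0, 3.0, 1.9, 9.5, 5.0]

for pct in (90, 95):
    q_obs = exceedance_flow(observed_flows, pct)
    q_sim = exceedance_flow(simulated_flows, pct)
    print(f"Q{pct}: error = {relative_error_pct(q_sim, q_obs):.1f}%")
```

A positive error for Q90 or Q95 corresponds to the overestimation of low flows described above, and can be compared against the 30% tolerance cited from Pereira et al. [45].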

Conclusions
The SWAT model was able to reproduce long-term daily and monthly streamflow in the Paraguaçu watershed, northeastern Brazil, including nonstationary conditions and seasonality between wet and dry years. The goodness-of-fit indicators resulted in NSE values higher than 0.42 and 0.76 for daily and monthly streamflow, respectively, indicating a satisfactory performance at the daily scale and an excellent result at the monthly scale. The hierarchical calibration approach further showed that the calibrated parameters were transferable to other sub-basins with similar land use, and that the model is thus capable of predicting the impact of land use change at the watershed scale. Likewise, the regression analysis and paired t-test showed that the SWAT model was fit for estimating the maximum, average and minimum monthly streamflow in the studied sub-basins, with potential use in hydrological studies and water management plans. The flow duration curves showed that the model can be used as an auxiliary tool in the management of streamflow: the model underestimated the higher flows that result from extreme events but performed well for flows with exceedance probabilities of <90%. As such, the flows associated with different exceedance probabilities, especially Q90, attested to the applicability of the model as a tool for management and planning of streamflow in the three sub-basins of the Paraguaçu watershed.

Figure 1. Location of the Paraguaçu watershed and study area.

Figure 2. Soil and land use maps of the High Paraguaçu region.

of Geography and Statistics-IBGE [26]-provided the crop information for the Mucugê and Ibicoará irrigated areas (Andaraí basin) between 2004 and 2012. These regions represent the largest irrigated area of the Paraguaçu watershed.

Figure 3. Digital elevation model of the Paraguaçu watershed. Location of reservoirs and hydrometric stations.


Figure 4. Annual precipitation values for the Andaraí sub-basin during 1989-2013. Wet and dry periods chosen for the differential split-sample test.

(3) Proxy-Basin Test: The Iguaçu and Fortém sub-basins were selected as having land use characteristics similar to those in Andaraí. The model was first calibrated in the Andaraí sub-basin and validated in the Iguaçu and Fortém sub-basins. This test made it possible to evaluate the model's ability to predict the impacts of land use changes in a region where flow data do not exist [10].

(4) Proxy-Basin differential split-sample Test: The model was first calibrated for wet conditions in the Andaraí sub-basin and then used to simulate streamflow for dry conditions in the Iguaçu basin and vice versa. Both watersheds share similar characteristics. The wet and dry years were identified from historical records, with the model being calibrated on three wet years (2004-2006) and validated on seven drier years (2007-2013), as before.

Water 2018, 10, 1137

Table 2. Location of pluviometric and weather stations and sensors used for monitoring weather variables.

Table 3. Location of the hydrometric station and sensors used for monitoring river discharge.

Table 4. Default and calibrated parameters used in model simulations.

Table 5. Goodness-of-fit indicators obtained after comparison of model simulations and measured values in the Andaraí sub-basin based on the split-sample test (NSE, model efficiency; PBIAS, percent bias; R², coefficient of determination).

Table 6. Goodness-of-fit indicators obtained after comparison of model simulations and measured values in the Andaraí sub-basin based on the differential split-sample test (NSE, model efficiency; PBIAS, percent bias; R², coefficient of determination).

Table 7. Goodness-of-fit indicators obtained after comparison of model simulations and measured values in the Iguaçu and Fortém sub-basins (validation period 2002-2013) based on the proxy-catchment test (NSE, model efficiency; PBIAS, percent bias; R², coefficient of determination).

Table 8. Goodness-of-fit indicators obtained after comparison of model simulations and measured values in the Andaraí, Iguaçu and Fortém sub-basins based on the proxy-catchment differential split-sample test (NSE, model efficiency; PBIAS, percent bias; R², coefficient of determination).

Table 10. Minimum and actual annual actual evapotranspiration values in Andaraí, Iguaçu and Fortém per land use.

Table 11. Results of the regression analysis and paired t-test on observed and simulated average monthly streamflow (n, number of sample values; a, regression coefficient; t_cal, paired t-test, calculated value; t_crit, paired t-test, two-tailed critical value; ns, not significant at the 5% level of significance; *, significant at the 5% level of significance).

Table 12. Comparison of Q90 and Q95 exceedance values between observed and simulated monthly flow for the period 1989-2013.