Comparison of NCEP-CFSR and CMADS for Hydrological Modelling Using SWAT in the Muda River Basin, Malaysia

Identification of reliable alternative climate input data for hydrological modelling is important to manage water resources and reduce water-related hazards in ungauged or poorly gauged basins. This study aims to evaluate the capability of the National Centers for Environmental Prediction Climate Forecast System Reanalysis (NCEP-CFSR) and China Meteorological Assimilation Driving Dataset for the Soil and Water Assessment Tool (SWAT) model (CMADS) for simulating streamflow in the Muda River Basin (MRB), Malaysia. The capability was evaluated in two perspectives: (1) the climate aspect—validation of precipitation, maximum and minimum temperatures from 2008 to 2014; and (2) the hydrology aspect—comparison of the accuracy of SWAT modelling by the gauge station, NCEP-CFSR and CMADS products. The results show that CMADS had a better performance than NCEP-CFSR in the climate aspect, especially for the temperature data and daily precipitation detection capability. For the hydrological aspect, the gauge station had a “very good” performance in a monthly streamflow simulation, followed by CMADS and NCEP-CFSR. In detail, CMADS showed an acceptable performance in SWAT modelling, but some improvements such as bias correction and further SWAT calibration are needed. In contrast, NCEP-CFRS had an unacceptable performance in validation as it dramatically overestimated the low flows of MRB and contains time lag in peak flows estimation.


Introduction
The water cycle is a complex system that involves the movement of water between the atmosphere, earth and ocean. Understanding of the water cycle for a particular river basin is essential to manage water resources and reduce water-related hazards. The hydrological model is commonly used to quantify the changes of the water cycle components so that a better river basin management plan can be developed. The Soil and Water Assessment Tool (SWAT) is one of the most widely used hydrological models as indicated with more than 4000 publications [1][2][3]. The model is mainly applied by water scientists to understand the impact of land use and climate changes on water quantity and quality [4]. assess the capability of the gauge station, NCEP-CFSR, CMADS in simulating streamflow using SWAT in MRB. The findings help to understand the performance of CMADS and SWAT in this tropical basin.

Study Area
MRB is located between the latitudes of 5 • 20 N-6 • 20 N and longitudes of 100 • 20 E-101 • 20 E, as shown in Figure 1. As a tropical basin, MRB receives a very high amount of annual precipitation (up to 3000 mm/year), particularly during the inter-monsoon period between September and October [25]. MRB is covered by the Titingwangsa range and is close to the mountain ranges in Sumatra, Indonesia that dramatically reduced the heavy precipitation that brought by the northeast monsoon (November to mid-March) and southeast monsoon (May to August), respectively. Therefore, a high amount of monthly precipitation of MRB is normally found during the inter-monsoon seasons, i.e., April and October. The dry season of MRB is commonly occurred in January to March. Sometime, due to the El Nino effect, the dry season may continue for a longer period. In 2020, the dry season in this region lasts until June and causes many reservoirs to hit their lowest water level. Local authorities spent more than MYR one hundred thousand for cloud seeding to increase the effective capacity of the dams within or around MRB.
Water 2020, 12, x FOR PEER REVIEW 3 of 16 CMADS in simulating streamflow using SWAT in MRB. The findings help to understand the performance of CMADS and SWAT in this tropical basin.

Study Area
MRB is located between the latitudes of 5°20′ N-6°20′ N and longitudes of 100°20′ E-101°20′ E, as shown in Figure 1. As a tropical basin, MRB receives a very high amount of annual precipitation (up to 3000 mm/year), particularly during the inter-monsoon period between September and October [25]. MRB is covered by the Titingwangsa range and is close to the mountain ranges in Sumatra, Indonesia that dramatically reduced the heavy precipitation that brought by the northeast monsoon (November to mid-March) and southeast monsoon (May to August), respectively. Therefore, a high amount of monthly precipitation of MRB is normally found during the inter-monsoon seasons, i.e., April and October. The dry season of MRB is commonly occurred in January to March. Sometime, due to the El Nino effect, the dry season may continue for a longer period. In 2020, the dry season in this region lasts until June and causes many reservoirs to hit their lowest water level. Local authorities spent more than MYR one hundred thousand for cloud seeding to increase the effective capacity of the dams within or around MRB. Muda River is the longest river of the Kedah state with a total length of ~180 km and a basin size of ~4111 km 2 . The 30 km downstream part of MRB covers both the Kedah and Penang states. Figure  1 shows that there are two major reservoirs that are located in the upper part of MRB. The Muda Reservoir was constructed in 1969 under the Muda Irrigation Scheme for irrigating about 96,000 hectares of paddy field, which also known as the "rice bowl" of Malaysia [25]. Water from the Muda Reservoir (160 × 10 6 m 3 ) is transferred to the Pedu Reservoir (1073 × 10 6 m 3 ) that located out of MRB using a 6.8 km tunnel [26]. The Beris Reservoir (122.4 × 10 6 m 3 ) is the second water supply reservoir that located in the southern part of the Muda Reservoir. It was constructed by Department of Irrigation and Drainage (DID) in 2004 with a total cost of ~RM 360 million (~USD 86.68 millions) [27]. Muda River is the longest river of the Kedah state with a total length of~180 km and a basin size of~4111 km 2 . The 30 km downstream part of MRB covers both the Kedah and Penang states. Figure 1 shows that there are two major reservoirs that are located in the upper part of MRB. The Muda Reservoir was constructed in 1969 under the Muda Irrigation Scheme for irrigating about 96,000 hectares of paddy field, which also known as the "rice bowl" of Malaysia [25]. Water from the Muda Reservoir (160 × 10 6 m 3 ) is transferred to the Pedu Reservoir (1073 × 10 6 m 3 ) that located out of MRB using a 6.8 km tunnel [26]. The Beris Reservoir (122.4 × 10 6 m 3 ) is the second water supply reservoir that located in the southern part of the Muda Reservoir. It was constructed by Department of Irrigation and Drainage (DID) in 2004 with a total cost of~RM 360 million (~USD 86.68 millions) [27].

Observations
Observed daily precipitation, maximum and minimum temperature data from 2008 to 2014 at nine climate stations as listed in Table 1 were collected from Malaysian Meteorological Department (MMD). Most of these stations are generally known as good quality stations with less missing value and free from inhomogeneous condition [25]. The missing values were filled with the data from nearest station before applying into SWAT. As shown in Figure 1, the stations distributed quite well which cover both the upper and lower parts of MRB. Besides that, monthly streamflow measured from the average daily flows at the Jambatan Syed Omar discharge gauge was collected from DID for SWAT calibration and validation. The comparison between the observed and SWAT-simulated streamflow is further explained in Section 2.6. The Jambatan Syed Omar station was selected because it has extremely low missing values during the evaluation period and is located in the downstream part ( Figure 1) that able to capture the streamflow pattern of major river networks within MRB.

NCEP-CFSR
NCEP-CFSR is the third-generation global coupling seasonal forecast reanalysis data with a spatial resolution of 0.3125 • (~38 km). NCEP-CFSR was created to understand the interaction between ocean, atmosphere and land of the Earth as a coupled atmosphere-ocean-land surface-sea ice model [12,28]. To promote the usage of SWAT, a set of daily NCEP-CFSR climate variables from 1979 to 2014 including precipitation, wind, relative humidity, solar, maximum and minimum temperatures were converted to SWAT climate input format. The SWAT-based NCEP-CFSR is free of charge where users can download the climate data based their study area from the global weather data for the SWAT webpage [29].

CMADS
CMADS was developed to provide a set of high-quality climate data for meteorology and hydrology studies in East Asia [13]. It contains climate variables of precipitation, relative humidity, specific humidity, average wind, solar radiation, average atmospheric pressure, soil temperature, soil moisture, average temperature, maximum temperature and minimum temperature at a daily scale. Similar to NCEP-CFSR, CMADS was converted to SWAT format to allow modeler to utilize the data directly. The capability of CMADS has been widely used in China for hydrology applications [20,23,[30][31][32], but such studies are relatively few in other East Asian countries. In this study, daily precipitation, maximum and minimum data of CMADS v1.1 from 2008 to 2016 at 0.25 • × 0.25 • spatial resolution for MRB region ( Figure 1) were collected from the CMADS webpage [33]

SWAT Model
SWAT is a semi-distributed hydrological model that was designed for studying the best management practices on streamflow and water quality of a specific river basin [34]. SWAT has been proven to be a reliable model for simulating the water cycle in various parts of the world [1,2,4,18,[34][35][36][37][38]. Most importantly, SWAT is a public domain model with a series of tutorials available on the SWAT main webpage. SWAT includes three major sub-modules under the hydrological, soil erosion and pollution load applications. Among them, the hydrological module is the most basic sub-module. SWAT version 2012 (Revision 635) with the interface of the ArcSWAT version 2012.10_216 was applied in this study.
This study described the hydrological module since the evaluation involves the hydrological aspect. In the model setup, SWAT first divides a main basin into several sub-basins and then smaller hydrological response units (HRUs) according to the different types of land use, soil and slope. The flow is firstly calculated at the HRU level, then aggregated to the correspond sub-basin and, lastly, to the basin outlet. The hydrological cycle within SWAT is calculated based on the water balance equation [39] as follows: where SW t is the soil water content (mm) at the end of day t, SW 0 is the initial soil water content (mm) on day i, t is time (day), R day is the precipitation (mm) on day i, Q surf is the surface runoff (mm) on day i, E a is the evapotranspiration (mm) on day I, w seep is the amount of water entering the saturated zone from the soil (mm) on day i and Q gw is the underground runoff (mm) on day i. For detailed theoretical formulas within SWAT one can refer to the SWAT theoretical manual that was prepared by Neitsch et al. [39]. Muda and Beris Reservoirs that are located in the upper part of MRB were added in the SWAT model development. Basic information of Muda Reservoir was extracted from the MADA webpage [26]. Whereas, monthly average water usage for freshwater supply and irrigation purposes was extracted from the official website of National Water Balance Management System (NAWABS) [40] as input into SWAT to improve the capability in the monthly streamflow simulation. The Hargreaves method [41] within SWAT was applied to measure the potential evapotranspiration since it requires only the temperature data for the calculation.
Selection of a set of appropriate parameters for SWAT calibration can be conducted easily and effectively through the sensitivity analysis, thus greatly improving the efficiency of SWAT modelling. Sensitivity analysis can provide guidance in determining which parameters to adjust in SWAT calibration. Therefore, sensitivity analysis of parameters is essential before the calibration of the model. In this study, the global sensitivity analysis method was used to conduct sensitivity analysis of parameters. The parameters' sensitivity was ranked based on the t-stat values, where the higher the value, the higher the sensitivity. The model was then calibrated based on a sensitivity analysis of 13 parameters.

Statistical Measures
The evaluation of NCEP-CFSR and CMADS was divided into two main parts: (1) the climate aspect-validation of precipitation, maximum and minimum temperature data with climate gauge; and (2) the hydrological aspect-comparison of the monthly streamflow simulated by SWAT using a gauge station, NCEP-CFSR and CMADS with an observed streamflow. In general, gridded products provide areal climate information for each respective grid, while the gauge station measures the climate variable at a specific point. Grid-to-grid and point-to-grid are two commonly used approaches to do comparisons between gridded products and gauges. The grid-to-grid approach requires a high density of gauges within a specific grid for a robust comparison. In fact, lack of gauge stations is a major issue in different parts of the world [7,42], including MRB. Besides that, some uncertainties may arise during the interpolation of multiple gauge stations into gridded-based observed data. Hence, the climate aspect comparison was conducted using the point-to-grid approach [43], where the gauge stations were directly compared to their respective grids' values. The assessment is limited to the grids that contain at least one gauge as shown in Figure 1. The capability of NCEP-CFSR and CMADS in estimating precipitation, maximum and minimum temperatures from 2008 to 2014 at daily and monthly scales was evaluated using three widely applied statistical measures such as Correlation Coefficient (CC), Root Mean Square Error (RMSE) and Relative Bias (RB). CC, RMSE and RB are commonly used to evaluate the level of agreement, average error magnitude and systematic bias, respectively, between the gridded and gauge station data. A reliable gridded product can be declared if CC value more than 0.7 and RB value from −10% to 10% [6,44]. Categorical statistic measures such as Probability of Detection (POD), False Alarm Ratio (FAR) and Critical Success Index (CSI) are mainly used to detect the daily precipitation detection capability of NCEP-CFSR and CMADS. The formula of these statistical measures is listed in Table 2. POD and FAR show the ratio of correctly and falsely detected precipitation to the total precipitation, respectively [6,45]. Meanwhile, CSI indicates the overall ratio of precipitation and non-precipitation days detected by the gridded data.   The hydrological aspect involved the comparison of streamflow simulated by the observed climate data, NCEP-CFSR and CMADS. Uncertainty in Sequential Uncertainty Fitting (SUFI-2) algorithm that available in the SWAT-CUP tool was used to calibrate and validation the SWAT model. SUFI-2 was selected because it was relatively simple to apply, less time consuming, more efficient and can flexibly select optimization targets [46,47]. Coefficient of Determination (R 2 ), Nash-Sutcliffe Efficiency (NSE) and RB were used to evaluate the performance of SWAT in simulating streamflow at the Jambatan Syed Omar discharge station as shown in Figure 1. Table 3 shows the performance rating to measure the SWAT performance that recommended by Moriasi, et al. [48], which is widely applied in SWAT studies. The streamflow comparison only focused on the monthly scale assessment due to the difficulty in obtaining reservoir management data for both the reservoirs. A similar situation was actually found in other river basins of Malaysia as well [49,50], where the reservoir management data is considered as highly sensitive data to public in Malaysia. Table 3. SWAT performance rating for streamflow simulations recommended by Moriasi [48].

Climate Aspect
Three major climate variables (pcp, tmax and tmin) extracted from NCEP-CFSR and CMADS over MRB from 2008 to 2014 were compared with climate gauge for both the daily ( Figure 2) and monthly ( Figure 3) scales. This study focused solely on the assessment of daily and monthly data due to the relatively short annual series as NCEP-CFSR only available until July 2014. Besides that, daily and monthly are the major time scales for hydrological studies and model accuracy assessment [1,2]. efficient and can flexibly select optimization targets [46,47]. Coefficient of Determination (R 2 ), Nash-Sutcliffe Efficiency (NSE) and RB were used to evaluate the performance of SWAT in simulating streamflow at the Jambatan Syed Omar discharge station as shown in Figure 1. Table 3 shows the performance rating to measure the SWAT performance that recommended by Moriasi, et al. [48], which is widely applied in SWAT studies. The streamflow comparison only focused on the monthly scale assessment due to the difficulty in obtaining reservoir management data for both the reservoirs. A similar situation was actually found in other river basins of Malaysia as well [49,50], where the reservoir management data is considered as highly sensitive data to public in Malaysia.

Climate Aspect
Three major climate variables (pcp, tmax and tmin) extracted from NCEP-CFSR and CMADS over MRB from 2008 to 2014 were compared with climate gauge for both the daily ( Figure 2) and monthly (Figure 3) scales. This study focused solely on the assessment of daily and monthly data due to the relatively short annual series as NCEP-CFSR only available until July 2014. Besides that, daily and monthly are the major time scales for hydrological studies and model accuracy assessment [1,2].    [24] who found the RMSE values of CMADS over the Han River Basin in Korean Peninsula varying from 2.05 mm/day to 17.47 mm/day. Interestingly, NCEP-CFSR overestimated the daily precipitation by 11.84%, whereas a significant underestimation of 27.64% was found for CMADS. For the precipitation detection capability assessment, NCEP-CFSR has a better average POD value of 0.92, but a poorer average FAR value of 0.50. On the other hand, the average POD and FAR values of CMADS are 0.74 and 0.34, respectively. With respect to CSI, CMADS shows a slightly better performance with an average value of 0.55 compared to NCEP-CFSR (average CSI = 0.48). This finding was also reported by Tan [17] in another two river basins in Malaysia, where NCEP-CFSR correctly estimated around 50% of the precipitation and non-precipitation days. The outstanding performance of NCEP-CFSR in POD can be explained by the fact that the data only detected about 8% of non-precipitation days (0 mm/day) over MRB, but actually there are around 45.42% to 65.93% non-precipitation days were detected by the precipitation gauges. Therefore, NCEP-CFSR overestimated the observed daily precipitation data in this basin.
The results of the statistical analysis for daily maximum and minimum temperatures variables are presented in Figure 2. Similarly, CMADS shows a better performance compared to NCEP-CFSR in the measurement of daily maximum and minimum temperatures over MRB. CMADS shows a good correlation with observed data in daily maximum temperature measurement with an average CC value of 0.84, whereas NCEP-CFSR (average CC = 0.57) only correlated moderately with observed data. The average RMSE values of daily maximum temperature for CMADS and NCEP-CFSR are 1.74 °C and 2.62 °C, respectively, showing the difference between the reanalysis and temperature  [24] who found the RMSE values of CMADS over the Han River Basin in Korean Peninsula varying from 2.05 mm/day to 17.47 mm/day. Interestingly, NCEP-CFSR overestimated the daily precipitation by 11.84%, whereas a significant underestimation of 27.64% was found for CMADS. For the precipitation detection capability assessment, NCEP-CFSR has a better average POD value of 0.92, but a poorer average FAR value of 0.50. On the other hand, the average POD and FAR values of CMADS are 0.74 and 0.34, respectively. With respect to CSI, CMADS shows a slightly better performance with an average value of 0.55 compared to NCEP-CFSR (average CSI = 0.48). This finding was also reported by Tan [17] in another two river basins in Malaysia, where NCEP-CFSR correctly estimated around 50% of the precipitation and non-precipitation days. The outstanding performance of NCEP-CFSR in POD can be explained by the fact that the data only detected about 8% of non-precipitation days (0 mm/day) over MRB, but actually there are around 45.42% to 65.93% non-precipitation days were detected by the precipitation gauges. Therefore, NCEP-CFSR overestimated the observed daily precipitation data in this basin.
The results of the statistical analysis for daily maximum and minimum temperatures variables are presented in Figure 2. Similarly, CMADS shows a better performance compared to NCEP-CFSR in the measurement of daily maximum and minimum temperatures over MRB. CMADS shows a good correlation with observed data in daily maximum temperature measurement with an average CC value of 0.84, whereas NCEP-CFSR (average CC = 0.57) only correlated moderately with observed data. The average RMSE values of daily maximum temperature for CMADS and NCEP-CFSR are 1.74 • C and 2.62 • C, respectively, showing the difference between the reanalysis and temperature gauge estimated maximum temperature is around 2 • C. Both the CMADS and NCEP-CFSR products tended to underestimate the daily temperature by 4.25 and 5.01%, respectively. Similarly, CMADS shows a better correlation (average CC = 0.71) and a smaller bias (average RMSE = 1.01 • C) in daily minimum temperature estimation. By contrast, NCEP-CFSR had a much lower average CC value (0.43) and a higher average RMSE value (1.89 • C) compared to CMADS, which indicates CMADS is a better alternative data for temperature analysis in Muda River Basin. An underestimation of daily minimum temperature was found for NCEP-CFSR at the rate of 3%, whereas CMADS slightly overestimated by 0.76%. Figure 3 shows the statistical results of NCEP-CFSR and CMADS against observed data in monthly precipitation, maximum and minimum temperatures estimations over Muda River Basin from 2008 to 2014. Monthly maximum and minimum temperatures are averaging from daily maximum and minimum temperatures, respectively, for a specific month. From the figure, it can be seen that both the NCEP-CFSR and CMADS products had a better performance in temperature estimation as compared to precipitation. The average CC value for both the NCEP-CFSR and CMADS products is 0.67, showing a good correlation between gridded and observed data in monthly precipitation estimation. NCEP-CFSR and CMADS showed average RMSE values of 110.72 and 108.81 mm/month, respectively, which is consistent with the findings reported in other river basins of Malaysia [9,17]. Interestingly, NCEP-CFSR overestimated the monthly precipitation by 11.84%, while CMADS had an opposite direction of 27.64%.

Monthly Climate Assessment
Overall, CMADS outperformed NCEP-CFSR in monthly temperature estimations in MRB, where CMADS correlated well with the observed monthly maximum and minimum temperatures, with the average CC values of 0.90 and 0.91, respectively. This finding was also reported by Liu et al. [31] in the Qinghai-Tibet Plateau, where the CC values of CMADS in maximum and minimum temperatures validation are close to 1. Besides that, CMADS also has a smaller RMSE value compared to NCEP-CFSR as shown in Figure 3. NCEP-CFSR underestimated the monthly maximum and minimum temperatures by 4.99 and 2.99%, respectively. Similarly, CMADS underestimated the monthly maximum temperature by 4.25%, but an overestimation of 0.76% was found in monthly minimum temperature estimation. Table 4 shows the sensitivity of 13 parameters used for SWAT modelling of MRB under various climate inputs. Similar to many other Malaysian SWAT studies [50][51][52], SCS runoff curve number f (CN2) is the most sensitive parameter for SWAT modelling in MRB as shown by all the gauge station, NCEP-CFSR and CMADS products. Soil evaporation compensation factor (ESCO) is the second most sensitive parameter as simulated by all the three products. By contrast, groundwater delay (GW_Delay) and available water capacity of the soil layer (SOL_AWC) are among the less sensitive parameters, and, therefore, they can be considered to exclude future SWAT modelling in this basin.

Hydrological Aspect
In general, MRB was divided into 52 sub-basins and 318 HRUs. The year 2008 was used for reaching steady state condition, while the Jan 2009 to December 2011 and Jan 2012 to July 2014 periods were set as the calibration and validation periods, respectively. Table 5 indicates the range of parameters and their optimal value for each evaluated product. The R 2 , NSE and RB values as simulated by climate gauge for calibration are 0.82, 0.81 and 3.6%, respectively, as listed in Table 6. This indicates that the SWAT model is a reliable tool to simulate monthly streamflow in MRB, which can be used for further studies the climate or land use impact assessment on water balance in this basin.   According to Table 6, CMADS had a satisfactory performance compared to NCEP-CFSR during the calibration period. The R 2 , NSE and RB values for CMADS are 0.60, 0.59 and −3.7%, respectively. The R 2 and NSE values of NCEP-CFSR are 0.58 and 0.21, respectively, which are slightly lower than NCEP-CFSR. Interestingly, NCEP-CFSR dramatically overestimated the monthly streamflow by 40%, indicating NCEP-CFSR overestimated the precipitation amount in this basin. During the validation period, CMADS outperformed NCEP-CFSR in monthly streamflow simulation, with the R 2 , NSE and RB values of 0.39, 0.21 and −13.1%, respectively. By contrast, the performance of NCEP-CFSR is considered as unacceptable for the validation period due to the negative NSE (−0.23) and extremely high RB (44.7%) values. A similar finding has been reported by Tan [17] where NCEP-CFSR also performed poorly in another two Malaysian river basins, called Kelantan River Basin and Johor River Basin. Therefore, Tan [17] suggested to use a combination of Asian Precipitation-Highly-Resolved Observational Data Integration towards Evaluation (APHRODITE) precipitation and NCEP-CFSR temperature data for SWAT modelling in Malaysia.

Discussion
In general, CMADS had an acceptable performance for monthly streamflow in MRB using SWAT. However, when compared to other similar studies as shown in Table 7 that were mainly conducted in China, CMADS had a poorer performance in this tropical basin. The R 2 and NSE values of more than 0.8 were mainly reported in Jing River and Bo River Basins [53], Fuhe River Basin [22], Xiang River Basin [32] and Lijiang River Basin [54]. Meanwhile, Zhang et al. [30] found CMADS performed moderately at the Shenyang station in the Hunhe River Basin, with the NSE values of 0.63 and 0.54 during the calibration and validation periods, respectively. However, the reported RB values of MRB are slightly better than in the Fuhe River Basin [22] and Hun River Basin [55] that show systematic bias up to 23%. A possible explanation for the poorer performance of CMADS in MRB might be that the precipitation gauges for calibrating or assimilation CMADS are mainly located within China [56]. For regions out of China, the Climate Prediction Center Morphing Technique (CMORPH)'s integrated precipitation products were mainly used as precipitation inputs [24,56]. In fact, CMORPH underestimated monthly precipitation over Malaysia by~13% [45], which match with the underestimation of precipitation and streamflow as simulated by CMADS in this study.  Figure 4 shows the monthly observed and SWAT-simulated streamflow at the Jambatan Syed Omar discharge station from 2009 to 2014. As can be seen from the figure, the climate gauge simulated matches the observed streamflow, except for in November 2010. As reported by DID, a big flood occurred in November 2010, which inundated this region for two to five days. A total of 188 flood evacuation was operated to accommodate local people that affected by this second worst flood for the past 20 years. It can be found that NCEP-CFSR overestimated the low flow for most of the time. Meanwhile, CMADS had a better performance in low flow estimation as compared to NCEP-CFSR. A lag peak flow simulation was also found for NCEP-CFSR in several time periods, e.g., August-  CMADS was found to be reliable in temperature data estimation for MRB, especially minimum temperature. The superior performance of CMADS temperature data compared to NCEP-CFSR, shows it is applicable to study the temperature in tropical regions such as Malaysia. For example, the CMADS temperature may be used to study the relationship between temperature and paddy yield within and nearby MRB. Besides that, temperature is important for data to calculate the evapotranspiration process within SWAT. Therefore, the reanalysis products' temperature data, particularly maximum temperature may contain a consideration impact on the evapotranspiration calculation. The poorer performance of NCEP-CFSR in the maximum temperature estimation than CMADS might be one of the reasons that contributed to the lower performance in streamflow simulation. Future research should be undertaken to investigate the performance of the reanalysis products in more aspects including the calculation of the extreme value, trend and evapotranspiration.
As mentioned by Tan [42], more than 20% missing values in climate data may significantly affect hydrological modelling in tropical region. Therefore, most of the gauge stations used in this study contain no or less than 3% missing values, except for the Pusat Pertanian Charok Padang (8.53%) and Pusat Pertanian Batu Seketol (10.13%) stations. Moreover, the Butterworth station is classified as a principal station that receives more attention by MMD, with no missing values for all the three climate variables. Since most of the stations contain no missing values, so the impact of the nearest station filling approach on hydrological modelling is expected to be minimal. However, a more comprehensive comparison of different imputation techniques on tropical streamflow simulation is needed in the future, including the impact of filling missing with reanalysis products.
Calibration of different parameters to better represent the local conditions is essential for proper hydrological modelling. Deterministic approaches, e.g., trial and error, were applied in the   CMADS was found to be reliable in temperature data estimation for MRB, especially minimum temperature. The superior performance of CMADS temperature data compared to NCEP-CFSR, shows it is applicable to study the temperature in tropical regions such as Malaysia. For example, the CMADS temperature may be used to study the relationship between temperature and paddy yield within and nearby MRB. Besides that, temperature is important for data to calculate the evapotranspiration process within SWAT. Therefore, the reanalysis products' temperature data, particularly maximum temperature may contain a consideration impact on the evapotranspiration calculation. The poorer performance of NCEP-CFSR in the maximum temperature estimation than CMADS might be one of the reasons that contributed to the lower performance in streamflow simulation. Future research should be undertaken to investigate the performance of the reanalysis products in more aspects including the calculation of the extreme value, trend and evapotranspiration.
As mentioned by Tan [42], more than 20% missing values in climate data may significantly affect hydrological modelling in tropical region. Therefore, most of the gauge stations used in this study contain no or less than 3% missing values, except for the Pusat Pertanian Charok Padang (8.53%) and Pusat Pertanian Batu Seketol (10.13%) stations. Moreover, the Butterworth station is classified as a principal station that receives more attention by MMD, with no missing values for all the three climate variables. Since most of the stations contain no missing values, so the impact of the nearest station filling approach on hydrological modelling is expected to be minimal. However, a more comprehensive comparison of different imputation techniques on tropical streamflow simulation is needed in the future, including the impact of filling missing with reanalysis products.
Calibration of different parameters to better represent the local conditions is essential for proper hydrological modelling. Deterministic approaches, e.g., trial and error, were applied in the calibration process of earlier studies [58]. However, the deterministic approaches resulted in only a single set of parameters that were insufficient to characterize the errors and uncertainties in modelling [59]. Recently, stochastic algorithms such as SUFI-2, parallel solution (ParaSol), Markov chain Monte Carlo (MCMC), generalized likelihood uncertainty estimation (GLUE) and Particle Swarm Optimization (PSO) have been developed to capture the parameter uncertainties [59]. In future investigations, it might be possible to test how the selection of different stochastic algorithms could impact on the model calibration and accuracy as simulated by various gridded products such as NCEP-CFSR and CMADS.

Conclusions
NCEP-CFSR and CMADS are potentially good to be used for hydrological modelling in ungauged or poorly gauged river basins. This study compared the reliability of NCEP-CFSR and CMADS that were freely available on the SWAT webpage in simulating streamflow in a tropical river basin that is located in the north-eastern part of Peninsular Malaysia. The evaluation was divided into the climate and hydrological aspects from 2008 to 2014. In general, CMADS performed better than NCEP-CFSR in the daily and monthly climate variables estimations. NCEP-CFSR overestimated the daily precipitation, while an underestimation was found for CMADS. The differences between observed and NCEP-CFSR or CMADS in temperature estimation is about 2 • C. Both the CMADS and NCEP-CFSR products tended to underestimate the daily maximum temperature by 4.25 and 5.01%, respectively.
The gauge station, NCEP-CFSR and CMADS products were incorporated into SWAT for the hydrological aspect assessment. Parameter sensitivity analysis shows that CN2 and ESCO are among the most sensitive parameters in the SWAT calibration of MRB, while GW_Delay and SOL_AWC are less sensitive than other parameters. The performance of gauge station in SWAT-based streamflow simulation can be considered as "very good" based on the performance rating that recommended by Moriasi [48], showing that SWAT can be a reliable tool to study the water balance in MRB. Basically, CMADS underestimated the monthly streamflow simulation, while NCEP-CFSR showed a significant overestimation. The performance of CMADS in monthly streamflow simulation is acceptable, but more improvement is needed before applying the model for further impact assessment. In contrast, NCEP-CFSR is not suitable to be applied for SWAT modelling in MRB due to the fact that it is unable to capture the streamflow pattern effectively.