Evaluating Bias Correction Methods Using Annual Maximum Series Rainfall Data from Observed and Remotely Sensed Sources in Gauged and Ungauged Catchments in Uganda

Okirya, Martin; Du Plessis, JA

doi:10.3390/hydrology12050113

Open AccessArticle

Evaluating Bias Correction Methods Using Annual Maximum Series Rainfall Data from Observed and Remotely Sensed Sources in Gauged and Ungauged Catchments in Uganda

by

Martin Okirya

^*

and

JA Du Plessis

Department of Civil Engineering, Stellenbosch University, Private Bag X1, Matieland 7602, South Africa

^*

Author to whom correspondence should be addressed.

Hydrology 2025, 12(5), 113; https://doi.org/10.3390/hydrology12050113

Submission received: 12 April 2025 / Revised: 1 May 2025 / Accepted: 5 May 2025 / Published: 6 May 2025

Download

Browse Figures

Versions Notes

Abstract

This research addresses the challenge of bias in Remotely Sensed Rainfall (RSR) datasets used for hydrological planning in Uganda’s data-scarce, ungauged catchments. Four bias correction methods, Quantile Mapping (QM), Linear Transformation (LT), Delta Multiplicative (DM), and Polynomial Regression (PR), were evaluated using daily rainfall data from four gauged stations (Gulu, Soroti, Jinja, Mbarara). QM consistently outperformed other methods based on statistical metrics, e.g., for National Oceanic and Atmospheric Administration Climate Prediction Center (NOAA_CPC) RSR data at Gulu, Root-Mean-Square Error (RMSE) was reduced from 29.20 mm to 19.00 mm, Mean Absolute Error (MAE) reduced from 22.44 mm to 12.84 mm, and Percent Bias (PBIAS) reduced from −19.23% to 1.05%, and improved performance goodness-of-fit tests (KS = 0.03, p = 1.00), while PR, though statistically strong, failed due to overfitting. A bias correction framework was developed for ungauged catchments, using predetermined bias factors derived from observed station data. Validation at Arua (tropical savannah) and Fort Portal (tropical monsoon) demonstrated significant improvements in RSR data when the bias correction framework was applied. At Arua, bias correction of Climate Hazards Group InfraRed Precipitation with Station (CHIRPS) data reduced RMSE from 49.14 mm to 21.41 mm, MAE from 45.74 mm to 17.38 mm, and PBIAS from −59.83% to −8.18%, while at Fort Portal, bias correction of the CHIRPS dataset reduced RMSE from 28.35 mm to 15.02 mm, MAE from 25.28 mm to 11.35 mm, and PBIAS from −46.2% to 4.74%. Our research concludes that QM is the most effective method, and that the framework is a tool for improving RSR data in ungauged catchments. Recommendations for future work includes machine learning integration and broader regional validation.

Keywords:

quantile mapping; linear transformation; delta multiplicative; polynomial regression; remote sensing rainfall; climate zone; annual maximum series

1. Introduction

Rainfall plays a critical role in water resource modeling, management, agricultural planning, and the design of hydrological infrastructure such as culverts, roadside channels, drainage systems, and irrigation networks as noted by Kimani et al. [1] and other authors [2,3,4]. Reliable rainfall data are globally of critical importance for assessing water availability, predicting flood risks, and addressing the challenges posed by climate variability and change according to Katiraie [4] and Maheswaran [5]. In regions with robust observational networks, ground-based rain gauges provide accurate and reliable rainfall measurements. However, according to Nkunzimana et al. [6] and other authors [7,8], in some African regions with sparse and unevenly distributed gauge networks such as Uganda, there are often significant uncertainties in rainfall estimates.

Remotely sensed rainfall (RSR) products offer a promising alternative, providing spatially continuous and near-real-time precipitation estimates as noted by Kimani et al. [1] and Mekonnen et al. [9]. Despite their potential, RSR products are not without flaws. Mekonnen et at. note that biases stemming from sensor calibration, retrieval algorithms, and the complexities of converting satellite signals into accurate rainfall rates limit their reliability [9]. Correcting these biases is essential before RSR data can be confidently applied to hydrological infrastructure design, especially in ungauged catchments where traditional gauge data are scarce or absent.

In recent years, flood-related disasters have caused widespread economic damage, infrastructure destruction, population displacement, and, in extreme cases, loss of life. In Uganda, Onyutha [10] and Ngoma et al. [11] in their research have reported the severe impacts of such flood events, while other researchers like Li et al. [12] predict that flood frequency and intensity will continue to rise. The approaches to design and build resilient hydrological infrastructure such as culverts, drainage channels, and bridges, particularly in ungauged catchments, need to be explored more. At least, it is among those strategies that minimize disruptions to economic development as floods grow more frequent and severe due to climate change and other factors. A key parameter in the design process is the design discharge, often derived from Intensity–Duration–Frequency (IDF) curves according to Andre et al. [13]. These curves relate rainfall duration and intensity to specific return periods, enabling engineers to design infrastructure capable of withstanding floods of a given magnitude as suggested by Galiatsatou [13] and others [14,15,16]. Subramanya [17] notes that constructing IDF curves requires an Annual Maximum Series (AMS), a record of the highest daily rainfall values for each year over an extended period, ideally spanning at least 25 years for hydrological purposes. AMS is a key component of extreme value analysis and plays a critical role in hydrological infrastructure design, including flood control structures, bridges, and drainage systems according to Gupta [18]. In data-scarce regions like Uganda, where rainfall monitoring stations are sparse or nonexistent, analyzing the applicability of the AMS of RSR data becomes essential. Therefore, the inherent biases in RSR products must be assessed and corrected according to Gumindoga et al. [19] and Mekonnen et al. [9] to ensure their suitability for the estimation of design discharge.

Previous studies have explored the performance of RSR products in Uganda. For instance, Okirya and Du Plessis [3] evaluated the AMS of seven RSR products across different climate zones. They identified top performers like Global Precipitation Climatology Center (GPCC) (at the Gulu, Jinja, and Soroti stations) and National Oceanic and Atmospheric Administration Climate Prediction Center (NOAA_CPC) (at the Mbarara station) based on statistical metrics and goodness-of-fit performance tests. While their work highlighted variations in RSR performance tied to product type and location, it did not address the correction of inherent biases. Similarly, Onyutha [20] used AMS from observed data to evaluate Coordinated Regional Climate Downscaling Experiment (CORDEX) Regional Climate Model (RCM) simulations of extreme rainfall in East Africa, constructing IDF curves using data for the period 1961–1990 for the Lake Victoria Basin. However, the focus of his study was to evaluate the performance of CORDEX Africa RCMs, driven by Coupled Model Inter-comparison Project Phase 5 (CMIP5) General Circulation Models (GCMs) in reproducing Extreme Rainfall Indices (ERIs) and not bias correction. Other studies in Uganda and East Africa by Macharia [7] and others [21,22] have evaluated RSR products, with Climate Hazards Group InfraRed Precipitation with Station data (CHIRPS) being the most extensively studied. Further research is still needed to explore and correct the biases in the AMS of RSR products. The biases need to be systematically evaluated and corrected to ensure the accuracy and reliability of RSR products for hydrological applications in ungauged or sparsely gauged catchments.

Previous studies have investigated various Bias Correction Techniques (BCTs) to address the challenge of inherent biases in RSR products. For instance, Ajaaj et al. [23] evaluated five BCTs to adjust GPCC rainfall data in Iraq, finding that Quantile Mapping and mean bias removal methods outperformed others, with performance varying by season and climate zone. Similarly, Ouatiki et al. [24] evaluated five bias correction techniques (BCTs) across eight satellite-based rainfall datasets in Morocco. The study revealed improvements in bias correction for the RSR dataset using methods such as Random Forest, with effectiveness varying based on local climatology.

In Uganda, Nakkazi et al. [8] used Local Intensity Scaling (LOCI) and Linear Scaling (LS) to correct precipitation data for the Soil and Water Assessment Tool (SWAT) model in the Manafwa catchment. In their study, the LS method failed to capture extreme events, while LOCI only partially addressed heavy rainfall. Their study highlighted the need for further research into extreme-value correction. These studies demonstrate that while bias correction is effective in regions with dense gauge networks, its application in ungauged or poorly gauged areas remains challenging. Uganda’s diverse climate, from western highlands to eastern lowlands, further complicates matters, as correction parameters may not be transferable across regions with different climatology.

Emerging research has turned to machine learning (ML) approaches for bias correction, with studies like those by Nguyen et al. [25] and others [26,27,28,29,30] showing improved RSR accuracy. However, ML methods often rely on historical gauge rainfall data, incur high computational costs, and lack transferability, posing obstacles for data-scarce regions like Uganda. Addressing these challenges requires a flexible, locally tailored bias correction framework that can enhance RSR data for hydrological infrastructure design in ungauged catchments. Several studies, including those by Dao et al. [26] and others [27,28,29], have applied ML-based bias correction frameworks to RSR datasets and registered promising results. For example, Chen et al. [28] developed a deep Convolutional Neural Network (CNN) framework and successfully reduced biases in the NOAA Climate Prediction Center Morphing Technique (CMORPH) rainfall product. While ML approaches yield promising results, they also introduce uncertainties related to assumptions of regional homogeneity and transferability of bias correction parameters across different climate zones. In Uganda, Nakkazi et al. [8] applied a bias correction framework based on the Soil and Water Assessment Tool (SWAT) model to validate bias-corrected RSR datasets in the Manafwa catchment. The SWAT model was calibrated using the bias-corrected RSR datasets (Climate Forecasting System Reanalysis (CFSR), MERRA-2, and TRMM3B42) as input rainfall data. The effectiveness of RSR data bias correction was assessed by how well SWAT-simulated streamflow matched observed streamflow, using performance metrics such as Nash–Sutcliffe Efficiency (NSE), PBIAS, and RMSE. The bias correction framework by Nakkazi et al. [8] was applied in a small catchment and considered a monthly temporal scale and not daily scale, which is relevant to hydrological infrastructure designs and planning.

Reliable rainfall data underpin effective hydrological infrastructure design, flood risk management, water supply, and agricultural planning. In Uganda, where many catchments lack sufficient ground-based observations, refining RSR data through bias correction offers viable alternatives to observed measurements. This research addresses a key scientific question: how can bias correction methods, effective in gauged catchments, be extended to ungauged catchments while maintaining the accuracy and reliability of RSR data for hydrological applications? To address this question, we propose a framework that evaluates bias correction techniques in gauged environments and adapts them for ungauged contexts using regionalized parameters and transferable predetermined bias correction factors. This research introduces a novel regionalized bias correction framework for RSR data in Uganda, specifically targeting ungauged catchments. Unlike previous studies, our approach uses predetermined bias correction factors derived from observed biases in mean rainfall and standard deviation at representative gauged stations and applies them to ungauged catchments. The findings provide a practical, scalable approach for improving the reliability of RSR datasets in hydrological modeling and infrastructure design in data-poor environments.

1.1. Objectives

The research’s main and specific objectives are outlined below.

1.1.1. Main Objective

The primary goal is to evaluate and compare bias correction methods, then develop and validate a framework for correcting remotely sensed rainfall data in ungauged catchments in Uganda.

1.1.2. Specific Objectives

The specific objectives are as follows:

To evaluate and compare the performance of four bias correction methods, Linear Transformation, Quantile Mapping, Delta Multiplicative, and Polynomial Regression, in gauged catchments.
To develop a bias correction framework for ungauged catchments by adapting the best-performing methods from gauged catchments.
To validate the framework using an independent dataset from selected gauged stations.

1.2. Research Questions

The research addresses the following questions:

How do bias correction methods compare in their ability to adjust RSR data in gauged catchments in Uganda?
How can the optimal bias correction methods from gauged catchments be adapted for use in ungauged catchments?
How effective is the bias correction framework in ungauged catchments in Uganda?

2. Materials and Methods

2.1. Description of the Study Area

The study area was Uganda, which is located in the East African region (Figure 1). The country is distinguished by its diverse climatic zones and varied topography. According to the Köppen–Geiger global climate classification raster data file by Beck et al. [31] for the 1991–2020 period, Uganda features nine distinct climatic zones. The country predominantly experiences a bimodal rainfall regime, characterized by two rainy seasons—March to May and September to November—though regional variations exist, as noted by Ngoma et al. [11] and Jury [32]. According to Ngoma et al. [11], the annual rainfall across Uganda ranges widely from 750 to 2500 mm, with mean annual precipitation typically falling between 800 and 1500 mm. Higher rainfall is observed in the highland areas, while semi-arid regions in the east receive lower amounts. Temperatures in Uganda remain relatively moderate, with average annual values ranging from 20 °C to 27 °C, shaped by altitude and local weather patterns [3,8]. The country’s terrain is equally diverse, encompassing towering mountains and low-lying plains. Notable peaks include the Rwenzori Mountains in the southwest, rising to about 5109 m, and Mount Elgon in the east, reaching about 4321 m as noted by Ngoma et al. [2,11]. These climatic and topographical variations make Uganda an ideal region for studying RSR bias correction in both gauged and ungauged catchments.

2.2. Research Data

This research utilized the AMS of observed rainfall and RSR datasets covering the period of 1991–2020. The observed rainfall data were obtained from six gauging stations across Uganda (Figure 1) and seven different RSR datasets. The AMS comprised the highest observed daily rainfall total for each year over a 30-year period, as described by Subramanya [17] and Maity [33].

2.2.1. Observed Rainfall Data

The observed daily rainfall data from six stations, Gulu, Soroti, Jinja, Mbarara, Arua, and Fort Portal, were obtained from the Uganda National Meteorological Authority (UNMA). The dataset spanned a 30-year period, from 1 January 1991, to 31 December 2020. Before being used in the analysis, the data underwent standard quality checks, as detailed in subsequent sections.

2.2.2. Remotely Sensed Rainfall (RSR) Data

In this research, RSR data products, widely recognized in the literature as gridded precipitation products, referred to three categories of remotely sensed rainfall data: (a) gauge-only-derived products, (b) products combining satellite and gauge data, and (c) numerical weather prediction products. The gauge-only-derived product comprised the Global Precipitation Climatology Centre (GPCC), operated by the Deutscher Wetterdienst (DWD) under the World Meteorological Organization (WMO), which offers gridded precipitation datasets from quality-controlled station data as noted by S. E. Nicholson and D. A. Klotter [34]. The satellite-gauge products, which integrate gauge and satellite data through various bias corrections, comprised (a) the Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks-Climate Data Record (PERSIANN_CDR) from the University of California, Irvine, which has used neural networks for global daily rainfall estimates since 1983 according to Omonge et al. [9] and others [22,35]; (b) the Climate Hazards Group InfraRed Precipitation with Station data (CHIRPS) from the University of California, Santa Barbara, and the United States Geological Survey (USGS), integrating satellite and in situ data for high-resolution rainfall monitoring [7,9,35,36]; and (c) the Climate Prediction Center (CPC) Unified Gauge-Based Analysis of Global Daily Precipitation from the National Oceanic and Atmospheric Administration (NOAA_CPC), which provides gauge-based global rainfall estimates supporting climate studies [3]. The numerical weather prediction products derived from atmospheric models (the Reanalysis products) included (a) the Modern-Era Retrospective Analysis for Research and Applications, Version 2 (MERRA-2) from the National Aeronautics and Space Administration (NASA), a reanalysis dataset incorporating advanced assimilation techniques since 1980 [8,34]; (b) the European Centre for Medium-Range Weather Forecasts (ECMWF) Reanalysis v5 (ERA5); and (c) ECMWF Reanalysis v5 for Agriculture (ERA5_AG) datasets, providing daily and hourly global climate estimates from 1940 onward with a focus on land surface applications [9].

2.3. Data Preprocessing

Okirya and Du Plessis [3] provided a detailed description of data preprocessing for rainfall data from four stations: Gulu, Soroti, Jinja, and Mbarara. This research utilized the same data from the same stations, along with additional data from the Arua and Fort Portal stations. The preprocessing steps outlined in the study by Okirya and Du Plessis [3] were applied to rainfall datasets obtained for the Arua and Fort Portal stations. The preprocessing steps, among others, included outlier detection using box plots and time series plots, followed by rainfall gap-filling techniques for missing values. Two key rainfall data quality issues were addressed in this research: (a) data completeness (gaps in raw rainfall data) and (b) the presence of outliers.

2.3.1. Data Outlier Detection and Removal

The outlier detection was conducted using both visual and statistical methods. Time series plots were used to visualize extreme values that deviated significantly from the general trend. Graphical tools such as box plots were also employed to identify isolated rainfall values that occurred out of line with the majority of observations. Following the Interquartile Range (IQR) approach, rainfall values that fell outside the range Q1 − 1.5 IQR and (Q3 + 1.5 IQR) were classified as outliers. The IQR method, defined as the difference between the first (25th percentile) and third (75th percentile) quartiles, is widely used for detecting extreme values as suggested by Maity [33] and others [37,38]. However, as demonstrated by Okirya and Du Plessis [3], the IQR method tends to identify high extreme values as outliers. Consequently, it was applied with additional verification of extreme values using time series plots. The outliers were replaced with the corresponding 95th percentile rainfall value for the respective years.

2.3.2. Gap Filling of Rainfall Data

For the observed rainfall data, gaps with missing rainfall values lasting between one to four days were filled using the linear interpolation method. For longer gaps, extending up to 30 days, the long-term mean approach was applied. The long-term mean method is simple, quick, and preserves historical trends. The method is mostly applicable for datasets with less than 10% of missing values over the period under consideration as noted by Chinasho et al. [39]. Unlike Chinasho et al. [39] and Phan et al. [39,40] who used neighboring station averages to fill gaps in rainfall data at the station of interest, this research applied the long-term mean of historical rainfall values from the same station. Although the long-term mean method preserves historical trends, it may not account for climate-induced variability in rainfall data which deviate from historical trends. Consequently, any extreme weather patterns occurring within the missing period may not be accurately represented by a long-term mean value. Nevertheless, this approach remains an effective strategy for ensuring data completeness in rainfall time series analysis. For the RSR data (specifically NOAA_CPC), rainfall data gaps of one to four days were similarly addressed using linear interpolation. However, for longer gaps of up to 31 days, particularly in the PERSIANN_CDR dataset, missing rainfall values were filled using station regression coefficients. These coefficients (1.38 for Arua and 1.56 for Fort Portal) were derived from Double Mass Curve plots comparing cumulative rainfall data from PERSIANN-CDR with NOAA-CPC, both satellite-gauge-derived rainfall products. The coefficient of determination (R²) for the PERSIANN-CDR versus NOAA-CPC plots was 99.62% at Arua station and 99.76% at Fort Portal station. The remaining RSR datasets (CHIRPS, GPCC, MERRA2, ERA5, and ERA5-AG) showed no gaps in their rainfall time series after undergoing data quality checks.

2.4. Evaluating and Comparing Bias Correction Methods in Gauged Catchments

The evaluation process involved applying each bias correction method to the RSR data and assessing the bias-corrected outputs against observed rainfall records using both statistical metrics, goodness-of-fit test, and graphical visualizations.

2.4.1. Bias Correction Methods

Each bias correction method was applied to all seven RSR dataset products from four gauged stations, with validation extended to two additional stations. Below, the methods are described, highlighting their applicability.

The LT method corrects biases by aligning the mean and variance of the RSR dataset with those of the observed rainfall data, as described by Gado et al. [41] and Ajaaj et al. [23]. The adjustment follows Equation (1):

R_{adj} = μ_{obs} + \frac{σ_{obs}}{σ_{RSR}} (R_{RSR} - μ_{RSR})

(1)

where R_adj is the bias-corrected RSR data, R_RSR is the original (uncorrected) RSR data, σ_obs and μ_obs are the standard deviations and the mean of the observed data, while μ_RSR and σ_RSR are the mean and standard deviation of the original (uncorrected) RSR data.

The LT method is simple and computationally efficient, making it a widely used method for bias correction. However, it only adjusts the first two statistical moments (mean and variance), without addressing higher-order moments such as skewness or kurtosis. This means that if the bias between observed and RSR data is non-linear, particularly at the extremes, the method may not adequately capture these differences as noted by Ajaaj et al. [23] and Ehret et al. [42].

The QM method is one of the most popular and widely used bias correction techniques, as noted by several authors including Enayati et al. and others [4,12,43]. The QM approach corrects biases by matching the cumulative distribution functions (CDFs) of the observed rainfall and RSR datasets. The approach follows Equation (2), as described by Xiaomeng et al. and others [12,44,45]:

R_{adj} = F_{obs}^{- 1} (F_{RSR} (R_{RSR}))

(2)

where F_RSR (R_RSR) is the cumulative distribution function (CDF) of the RSR data, and

F_{obs}^{- 1}

is the inverse CDF (or the quantile function) of the observed data [24].

The QM method is generally effective because it adjusts the entire distribution, preserving extreme rainfall values and improving accuracy across different rainfall intensities as noted by Ehret et al. [42]. However, QM requires a sufficiently long time series to estimate quantiles accurately, and its effectiveness may be limited in datasets with short records or missing values as noted in a study by Koutsouris et al. [46].

The DM method, as defined in Equation (3), scales the RSR dataset using the ratio of the observed mean to the uncorrected RSR mean as suggested by Nakkazi [8] and others [24,41,47]:

R_{adj} = R_{RSR} \times \frac{{\bar{R}}_{obs}}{{\bar{R}}_{RSR}}

(3)

where R_adj is the adjusted (bias-corrected) RSR data, R_RSR is the original (uncorrected) RSR data, while

{\bar{R}}_{obs}

and

{\bar{R}}_{RSR}

denote the means of observed and RSR data, respectively.

The DM approach, like LT, is simple and computationally efficient [23]. The method preserves the original shape of the RSR distribution by applying a uniform scaling factor across all values. However, DM only corrects the mean and does not adjust the variance or other statistical properties of the dataset [41,42]. As a result, if biases vary across different rainfall magnitudes, this method may be insufficient in capturing variability and extreme events as noted by Ehret et al. [42].

The PR method extends linear correction techniques by introducing a non-linear relationship between the RSR and observed datasets. A commonly used form is the quadratic (second-degree) polynomial model, as presented in Bluman and other publications [33,48,49]. The method follows Equation (4):

R_{adj} = a + b (R_{RSR}) + c (R_{RSR}^{2})

(4)

where R_adj is the adjusted (bias corrected) RSR data, R_RSR is the original (uncorrected) RSR data, while a, b, and c are coefficients determined through regression analysis.

The PR is particularly beneficial when the bias between observed and RSR data is non-linear, as it can capture curved relationships that linear methods fail to model. However, the approach comes with several limitations. According to Ajaaj et al. [23], polynomial models are prone to overfitting, especially when higher-degree polynomials are used with limited data. Additionally, polynomial regression models can behave erratically when extrapolating beyond the calibration range and are sensitive to outliers, which may distort the fitted curve.

2.4.2. Statistical Performance Metrics

The performance of each of the bias correction methods were quantitatively assessed using several statistical metrics including RMSE, MAE, PBIAS, NSE, and goodness-of-fit tests (KS statistic and p-value). These methods are very popular and have been widely used in a number of studies by several authors, including Kimani et al. and others [1,36,44,50,51], to assess the performance of bias correction methods. The equations for these statistical metrics are presented in Table 1.

In hydrological applications, PBIAS values of less than ±10% are considered very good performance, ±10% to ±15% indicate good performance, while ±15% to ±25% indicate fair performance [8,50]. For the NSE metric, a value of 1 indicates a perfect match between the bias-corrected data and the observed values while negative values imply poor performance as noted by Diem et al. [52]. The NSE method is usually used for models that simulates the hydrological variables by measuring the model efficiency in terms of relative variance in simulation error compared to the variance of observed variables [33].

2.4.3. Goodness-of-Fit Tests

Among the most widely used goodness-of-fit tests for comparing Probability Density Functions (PDFs) and CDFs are (i) the Chi-Squared test and (ii) the Kolmogorov–Smirnov (KS) test. As noted by Karamouz et al. [53], the Chi-Squared Test is more effective when the sample size is large, as it relies on frequency distributions that require a sufficient number of observations for accurate assessment. However, in cases where the sample size is small, such as in this research (30 data entries), the KS test is more appropriate. The KS test is a non-parametric test, meaning it does not assume any specific distribution for the dataset. It measures the maximum difference between the empirical cumulative distribution functions (CDFs) of the observed data and the bias-corrected data, providing an indication of how well the corrected data align with the original observations [33]. The test statistic (KS) is mathematically defined in Equation (5):

D_{n} = Max |F_{obs, n} (x) - F_{adj, n} (x)|

(5)

where F_obs,n(x) and F_adj,n(x) are the empirical CDFs of the observed and the RSR bias-corrected data, respectively, and Max indicates the maximum absolute difference across all values. A smaller D_n value (KS statistics) and p-value greater than a 5% significance level indicates a closer match between the two empirical CDFs [33].

2.4.4. Visual Assessment

The visual assessment of bias correction methods uses the time series plots and PDF/CDF curves to compare observed rainfall, uncorrected RSR data, and bias-corrected RSR datasets. These plots enable visual assessments on how well the bias corrected data reflect temporal patterns, yearly changes, and extreme events relative to actual observations, revealing any major deviations. The PDF comparison evaluates bias correction performance by showing how RSR data are distributed and how different the distribution is from that of observed rainfall data. A good method aligns the RSR-corrected PDF’s shape, spread, and peak with the observed PDF, while deviations, shifted peaks, or distorted spread indicate under- or over-correction, potentially skewing variability or extremes. The CDF comparison examines the cumulative probability of rainfall amounts, assessing how well the corrected data matches the observed frequency distribution. An effective method keeps the corrected CDF close to the observed one, whereas shifts or slope differences suggest issues like missing extremes or misrepresented frequencies.

2.5. Development of RSR Bias Correction Framework for Ungauged Catchments

This research proposes a flexible regionalized framework to bias-correct RSR data in ungauged catchments, using bias correction factors derived from gauged catchments, as well as predetermined bias correction factors. The predetermined bias correction factors represented the highest or lowest systematic biases (consistent overestimation or underestimation of rainfall values compared to observed data) identified across the four stations. The regionalization approach involved transferring bias correction parameters (such as mean and standard deviation) from the nearest gauged station within the same climatic zone as the ungauged catchment. This approach assumed that catchments within the same climatic zone exhibited similar bias characteristics, as suggested by Yang et al. [54,55,56].

The RSR bias correction framework adapted the best-performing bias correction methods tested in gauged catchments considering two scenarios:

In ungauged catchments located near a gauged station within the same climatic zone, LT, QM, DM, and PR methods were applied using regionalized parameters (mean and standard deviation) derived from the closest gauged station. The predetermined bias correction factors could also be applied using the LT and DM methods.
If the ungauged catchment was in a climate zone with no observed data, the framework relied on predetermined bias correction factors (mean and standard deviation) derived from the four gauging stations to bias-correct the RSR datasets.

Each RSR product’s bias was examined across the four reference gauged stations of Gulu, Soroti, Jinja, and Mbarara. If systematic errors (consistently overestimated or underestimated rainfall values) were identified across the four stations, the highest or lowest biases (predetermined bias correction factors) from these stations were applied to the ungauged catchments. The RSR datasets with inconsistent bias patterns (overestimated and underestimated for the same product across gauging stations) were excluded from further analysis.

The bias correction framework was then implemented following the bias correction expressions shown in Equations (1)–(4). To validate the framework, the optimal bias correction methods identified in the four gauged catchments were adopted for implementation at the two additional catchments for further assessment. The performance of the bias correction methods were then assessed on how well the corrected RSR datasets matched observed data using metrics like RMSE, MAE, PBIAS, NSE, and the K-S test. In addition to these metrics, visual assessments, including the use of time series, PDF, and CDF plots, were deployed to compare how the bias corrected RSR datasets aligned with observed rainfall data.

2.5.1. Biases in RSR Data Products

Biases in each RSR dataset were estimated as the difference between observed rainfall and RSR data at each station [23,24].

The over- or underestimation in biases of the RSR datasets across the four gauged stations were identified by comparing the bias values at each of the stations. If a RSR dataset consistently overestimated or underestimated rainfall across multiple stations, it indicated a systematic bias that could be corrected using the proposed framework.

2.5.2. Regionalized Parameters for Bias Correction

The regionalized parameters applied when an ungauged catchment lay in a climatic zone with a nearby gauged station with observed rainfall data. Here, observed data from similar zones provided means and standard deviations, which were transferred to the ungauged site. In climate zones lacking gauging stations, these parameters (mean and standard deviations) were estimated from predetermined bias correction factors using Equations (6) and (7), for mean and standard deviation, respectively. The predetermined bias correction factors were applied to RSR parameters, which were used in the LT and DM bias correction methods.

μ_{obs} = μ_{RSR} + ∆ μ

(6)

where μ_obs is the estimated mean of what would have been observed data at the ungauged catchment, μ_RSR is the mean of RSR data at the ungauged catchment, and ∆μ is the predetermined bias correction factor for the mean value.

For the standard deviation:

σ_{obs} = σ_{RSR} + ∆ σ

(7)

where σ_obs is the estimated standard deviation of what would have been observed data at the ungauged catchment, σ_RSR is the standard deviation of RSR data at the ungauged catchment, and ∆σ is the predetermined bias correction factor.

3. Results

3.1. Data Quality Control and Preprocessing Results

The data quality issues and preprocessing results for the observed and RSR rainfall datasets at the Gulu, Soroti, Jinja, and Mbarara stations are presented in the publication by Okirya and Du Plessis for reference [3].

3.1.1. Identified Gaps in Rainfall Data

At Arua station, visual inspection and checking mismatches in dates revealed some gaps in the observed rainfall data. In particular, the entire month of November 1993 and the entire month of October 1998 had missing data, along with a missing value on 29 February 2020. Overall, the percentage of missing rainfall data at the Arua station was approximately 0.557% for observed station data. For the RSR datasets at the Arua station, the NOAA_CPC dataset exhibited isolated one-day gaps on several dates, amounting to a missing rainfall data percentage of about 0.046%. In contrast, the PERSIANN_CDR product showed a larger gap, with the entire month of February 1992 (29 days) missing, along with other gaps ranging from 1 to 15 days, resulting in an overall missing percentage of 0.949%. The rest of the RSR products at this station did not have missing rainfall data values.

At the Fort Portal station, the observed rainfall data exhibited relatively small gaps, with one-day missing entries recorded on four separate dates, resulting in a missing rainfall data percentage of 0.037%. The NOAA_CPC RSR product at Fort Portal had a slightly higher missing rainfall data percentage of 0.046%, while the PERSIANN-CDR product displayed an even larger gap, with a missing rainfall data percentage of 0.949%.

Figure 2 shows the identified gaps in both the observed and RSR datasets at the Fort Portal and Arua stations.

3.1.2. Outlier Detection and Removal

The Interquartile Range (IQR) box plots identified numerous rainfall values as potential outliers. However, time series plots were used to distinguish genuine extreme rainfall events from outlier data points. At the Fort Portal station, outliers were detected only in the observed data, with extreme values of 337 mm on 28 September 2006, and 183.3 mm on 12 November 1994. These were replaced by the 95th percentile values of 20.34 mm and 25.01 mm, respectively, computed specifically for the respective years.

At the Arua station, for the GPCC product, extreme outliers of 264.74 mm on 17 October 1992, and 209.39 mm on 8 August 2013, were identified. For each of these years, the 95th percentiles of the rainfall data were computed as 34.36 mm and 34.56 mm, respectively, and these values were used to replace the outliers. Similarly, in the MERRA2 dataset at the Arua station, outliers of 209.13 mm on 22 August 2017, 226.84 mm on 10 March 2018, and 205.74 mm on 9 December 2020, were replaced with the corresponding 95th percentile values of 29.38 mm, 30.34 mm, and 24.33 mm, respectively.

3.2. Evaluation and Comparison of Bias Correction Methods in Gauged Catchments

3.2.1. Visual Comparison of Time Series Plot Results

A representative time series plot for the Gulu station (Figure 3) visually illustrates the comparative performance of the four bias correction methods alongside the original (uncorrected) RSR data. Although not shown here, similar patterns were observed in the plots for the other stations; Soroti, Jinja, and Mbarara. More importantly, the quantitative evaluation of these methods is presented in Table 2 and Table 3, which provide detailed performance statistics. These metrics offer a more objective and precise assessment of each method’s effectiveness in aligning RSR data with observed rainfall. The observed rainfall, represented by a black line, serves as the benchmark for assessing each method’s ability to align RSR data patterns, capture inter-annual variability, and accurately reflect extreme rainfall events. From the time series plots two main observations are drawn. First, among the evaluated methods, Quantile Mapping (depicted in green) consistently emerged as the top performer across all stations and RSR datasets. It closely mirrors the observed rainfall peaks and troughs, demonstrating better alignment. Following the Quantile Mapping method, both Linear Transformation (orange) and Delta Multiplicative (red) methods showed moderate improvements over the uncorrected data. They aligned better with observed trends and successfully adjusted the RSR data to preserve inter-annual variability and capture extreme rainfall peaks to some extent. However, they occasionally underestimated peaks, as seen with the CHIRPS dataset at the Gulu station, where these methods failed to reach the observed maximum rainfall values. Second, the Polynomial Regression method (purple) performed poorly across all four stations, exhibiting significant limitations.

Across all stations and datasets, the original (uncorrected) RSR data, represented by a blue line, consistently underestimated observed rainfall. This underestimation was particularly pronounced in certain cases, such as with CHIRPS, ERA5, PERSIANN, MERRA2, and ERA5_AG datasets at the Jinja and Soroti stations, where the gap between observed and uncorrected data was substantial. Similarly, at the Gulu and Mbarara stations, the underestimation was evident with CHIRPS, ERA5, PERSIANN, and ERA5_AG.

3.2.2. Results of Statistical Performance Metrics

The statistical performance test results for the RSR datasets across the four stations are presented in Table 2 (Gulu and Soroti stations) and Table 3 (Jinja and Mbarara stations).

The statistical performance results revealed that the original (uncorrected) Remote Sensing Rainfall (RSR) datasets consistently exhibited high RMSE, MAE, and PBIAS values, coupled with higher negative NSE values, signifying a severe underestimation of observed rainfall. All the bias correction methods, Linear Transformation, Quantile Mapping, Delta Multiplicative, and Polynomial Regression, generally reduced RMSE, MAE, and absolute PBIAS compared to the original datasets, though their success varied.

Among the four bias correction methods evaluated, Quantile Mapping (QM) emerged as the most effective, particularly at the Gulu and Jinja stations. At Gulu, applying QM to the NOAA_CPC dataset significantly improved performance, reducing the RMSE from 29.20 mm to 19.00 mm (a 35% improvement), MAE from 22.44 mm to 12.84 mm (a 43% improvement), and PBIAS from −19.23% to 1.05% (a 95% improvement). Similarly, at Jinja, QM applied to the GPCC dataset achieved an RMSE of 17.96 mm, down from 22.22 mm (a 19% improvement), MAE of 14.36 mm from 17.56 mm (an 18% improvement), and PBIAS of 1.25% from −14.64% (a 91% improvement).

The Linear Transformation (LT) method also delivered strong performance, particularly at the Mbarara station, where it was applied to the GPCC dataset. The method reduced the RMSE from 22.62 mm to 20.66 mm (a 9% improvement), the MAE from 16.60 mm to 14.98 mm (a 10% improvement), and achieved a PBIAS of 0.00% from −5.93% (100% improvement). The Delta Multiplicative (DM) method, though generally exhibiting higher errors than QM and LT at most stations, proved most effective at Soroti. Applied to the CHIRPS dataset, it significantly reduced the RMSE from 37.67 mm to 20.89 mm (a 45% improvement) and the MAE from 33.07 mm to 15.68 mm (a 53% improvement), while correcting the PBIAS from −45% to 0.00% (a 100% correction).

The Polynomial Regression (PR) method consistently recorded the lowest RMSE (ranging from 15.34 mm for NOAA_CPC at Mbarara to 16.75 mm for CHIRPS at Gulu) and MAE values (from 10.94 mm at Soroti to 13.06 mm at Gulu). It also yielded a PBIAS of 0.00% across all RSR datasets and achieved the highest Nash–Sutcliffe Efficiency (NSE) values, including 0.19 for NOAA_CPC and GPCC at Jinja and Mbarara, and 0.01 at Gulu. While these metrics initially suggested superior performance, further inspection revealed limitations. The analysis of time series plots (Figure 3) and goodness-of-fit tests (KS statistics and p-values) indicated that PR tended to overfit the data, producing overly smoothed trends that failed to represent inter-annual variability and extreme rainfall events. This overfitting, despite strong statistical performance, compromises the method’s practical utility, rendering it less reliable for real-world hydrological applications.

3.2.3. Results Based on the Visualization of PDF and CDF Plots

To illustrate the visual performance of the bias correction methods, Figure 4 presents a representative example from the Gulu station. The figure displays the PDFs and CDFs of the original (uncorrected) RSR data and the bias-corrected datasets, compared against the observed rainfall distribution. The visual comparison shows how effectively each correction method aligned the RSR data with observed rainfall patterns.

Quantile Mapping stood out as the most effective bias correction method, demonstrating near-perfect alignment between the corrected RSR data and observed rainfall distributions across all stations. This is clearly illustrated in Figure 4 for the Gulu station, where the PDFs and CDFs of the Quantile Mapping-corrected data (green line) closely follow the observed distributions (black dashed line), accurately capturing the full range of rainfall values, including peaks, spread, and tails. In comparison, Linear Transformation and Delta Multiplicative methods showed moderate improvements but fell short, particularly at higher rainfall thresholds, where their PDFs flattened and CDFs diverged from the observed patterns. The Polynomial method performed the poorest, as evidenced by its sharply peaked PDFs and nearly vertical CDFs across stations, indicating overfitting and poor representation of rainfall variability. While it brought the mean closer to observed values, it failed to accurately model extremes, limiting its reliability for hydrological applications. Overall, the bias-corrected RSR data significantly outperformed nearly all original (uncorrected) RSR datasets. This improvement was particularly pronounced for the CHIRPS, ERA5, PERSIANN, ERA5_AG, and MERRA2 datasets across all stations. However, an exception occurs with the GPCC data at the Gulu station, where the uncorrected data demonstrated relatively better performance.

3.2.4. Results Based on the Goodness-of-Fit Test (KS and p-Values)

Table 4 presents the goodness-of-fit test results, detailing the KS statistics and corresponding p-values for four bias correction methods, alongside the original (uncorrected) RSR data.

Among the bias correction methods, Quantile Mapping consistently delivered the best goodness-of-fit results, achieving KS statistics as low as 0.03 and p-values of 1.00 across all stations and datasets. This indicated an excellent distributional match between the RSR products and observed rainfall data. Linear Transformation also improved the fit over the original RSR data, with KS statistics typically ranging from 0.10 to 0.30 and p-values often between 0.39 and 1.00. For example, at Mbarara, the Linear Transformation method for the CHIRPS dataset achieved a KS statistic of 0.10 with a p-value of 1.00, indicating a near-perfect fit. The Delta Multiplicative method exhibited mixed results, with KS statistics varying widely from 0.10 (for ERA5_AG at Gulu) to 0.33 (for MERRA2 at Mbarara) and p-values ranging from 0.03 (for ERA5_AG at Jinja and Soroti) to 1.00 (for CHIRPS at Mbarara).

In contrast, Polynomial Regression performed poorly, characterized by high KS statistics, ranging from 0.27 (for NOAA_CPC at Gulu) to 0.60 (for GPCC at Gulu), and p-values frequently at 0.00. These results highlighted significant overfitting issues and a notable distributional mismatch, confirming the inadequacy of this method for effective bias correction. The original RSR data also performed poorly, with high KS statistics, such as 0.97 for CHIRPS at Gulu, and p-values often at 0.00, further confirming a substantial distributional mismatch with observed rainfall.

3.3. Bias Correction Framework for Ungauged Catchments

3.3.1. Description of the Developed Bias Correction Framework

As illustrated in the flowchart (Figure 5), our framework offers a flexible approach to bias correction, leveraging both observed rainfall data from gauged stations and predetermined bias correction factors.

The process begins with the acquisition, quality control, and preprocessing of rainfall data, including the generation of AMS of rainfall datasets. During preprocessing, the framework also prepares the observed rainfall data for Quantile Mapping by estimating key statistical parameters, such as the mean and standard deviation, which serve as a reference for subsequent adjustments. Following preprocessing, the framework conducts an initial screening to analyze the RSR data for consistent errors, such as systematic overestimation or underestimation across stations. If the RSR data lack consistent biases or exhibit mixed errors (over- and underestimations across stations), they are rejected, and the process terminates. For datasets that pass this screening, the framework standardizes the RSR data into Z-scores (Z-values) to normalize the data, facilitating the bias correction process. Additionally, the framework estimates the RSR data’s statistical parameters (mean and standard deviation) and determines predetermined bias correction factors by comparing RSR data with observed data across the four stations. These factors provide a baseline for adjusting RSR data in ungauged catchments where observed data may not be directly available. The framework then adjusts the RSR data to align with observed rainfall patterns, considering two scenarios based on the availability of observed data in the ungauged catchment’s climate zone. The final stage involves validating the framework by comparing the bias-corrected RSR datasets with observed rainfall data, where available. This validation assesses the performance of the selected bias correction method and RSR product through a combination of statistical metrics, goodness-of-fit tests (KS and p-values), and distributional analyses (PDFs and CDFs). Based on these validation results, the framework identifies the most effective bias correction method and RSR product, producing bias-corrected RSR data tailored for ungauged catchments.

3.3.2. Predetermined Bias Correction Factors

The results of this analysis are presented in Table 5, Table 6, Table 7 and Table 8. Table 5 displays the calculated standard deviations of the dataset, and Table 6 provides the estimated biases (the maximum and minimum predetermined bias correction factors) in the standard deviations of RSR products.

For the standard deviation (Table 6), the CHIRPS, PERSIANN, and ERA5 datasets consistently underestimated observed rainfall variability, while NOAA_CPC and GPCC overestimated it, showing systematic bias patterns suitable for uniform correction. However, the ERA5_AG and MERRA2 datasets displayed inconsistent biases (both over- and underestimation), leading to their exclusion from bias correction and validation for ungauged areas.

Table 7 lists the computed mean values and Table 8 shows the estimated biases (the maximum and minimum predetermined bias correction factors) in the mean values of RSR datasets. For the mean rainfall, all five selected RSR datasets (CHIRPS, PERSIANN, NOAA_CPC, GPCC, and ERA5) consistently underestimated observed rainfall with positive biases, varying only in magnitude but uniform in direction, making them amenable to scaled bias corrections. The mean rainfall biases, being uniform, could be systematically corrected across stations, whereas standard deviation biases, varying by RSR datasets and stations, required more localized, dataset-specific adjustments due to greater inconsistency.

3.4. Application and Validation of the Bias Correction Framework

3.4.1. Validation Results Based on Comparison of Time Series Plots

Figure 6, a time series plot for Arua station, provides a visual illustration of the bias correction framework’s performance by comparing observed rainfall data with both original (uncorrected) and bias-corrected rainfall data. A visual assessment of the time series plots revealed the following with regard to the performance of the bias correction framework across the two evaluated stations:

The original (uncorrected) RSR data, represented by blue lines, consistently underestimated the observed rainfall (depicted by black lines) across both stations. This underestimation was particularly evident in the ERA5, PERSIANN, and CHIRPS RSR datasets, where the uncorrected data clearly showed the underestimation and failed to capture the full range of rainfall variability patterns.
The Quantile Mapping bias correction approach, shown in green lines, outperformed the other two methods by providing the closest alignment with observed rainfall. It effectively captured peaks, and the overall variability, demonstrating its robustness across diverse rainfall patterns at both stations.
The Linear Transformation (red lines) and Delta Multiplicative (purple lines) methods showed significant improvements over the original RSR data (especially ERA5, PERSIANN, and CHIRPS). However, they fell short of Quantile Mapping’s performance, often underestimating peak rainfall values and failing to fully replicate the observed variability across the stations.

Figure 6. Observed, original, and bias-corrected rainfall at the Arua station.

For the Arua station (Figure 6), located in a tropical savannah climate zone, the nearest gauged station with observed data is Gulu. The predetermined bias correction factors considered were the maximum biases of RSR data across the four gauged stations (Gulu, Soroti, Jinja, and Mbarara) for each RSR product. Unlike the Arua station, the Fort Portal station did not share its climate zone with a gauged station. The research therefore utilized observed rainfall data from the Mbarara station, which, while geographically close, lies in a different climate zone (tropical savannah). This adaptation allowed the research to assess the performance of RSR data at Fort Portal by applying bias correction methods using Mbarara’s observed data, in addition to the application of the predetermined bias correction factors. The application of Linear Transformation (LT), Quantile Mapping (QM), and Delta Multiplicative (DM) methods, leveraging observed data from Mbarara, enhanced the performance of RSR data at Fort Portal the same way the use of predetermined bias correction factors did. When relying on predetermined bias correction factors, the methods performed best when minimum bias values were applied at the Fort Portal station, markedly outperforming results obtained with maximum bias values. It was evident at the Fort Portal station that four out of the five top performing methods were based on the predetermined bias correction factors for the CHIRPS, PERSIANN, and NOAA_CPC data sets.

3.4.2. Validation Results Based on Statistical Performance Metrics

Table 9 presents the statistical performance validation results for the bias correction framework, evaluating the performance of three bias correction methods.

The following observations, based on the statistical performance metrics in Table 9, highlight the effectiveness of the bias correction methods and demonstrate the overall applicability of the proposed framework.

The original (uncorrected) RSR data, particularly for the ERA5, CHIRPS, and PERSIANN datasets, consistently underestimated observed rainfall at both stations. This was reflected in the high RMSE, MAE, and PBIAS values, alongside negative NSE values. For instance, at Arua, the PERSIANN original data exhibited an RMSE of 50.16 mm, MAE of 47.19 mm, PBIAS of −61.74%, and NSE of −7.13, indicating severe underperformance compared to the observed data. Similarly, at Fort Portal, the PERSIANN original data recorded an RMSE of 31.73 mm, MAE of 28.62 mm, PBIAS of −52.51%, and NSE of −4.16. After bias corrections, the metrics significantly reduced, with RMSE dropping to 23.13 m, MAE to 18.09 mm, and PBIAS to −2.74% for PERSIANN data at Arua station considering the LT method. Similarly, at the Jinja station, RMSE reduced to 16.10 mm, MAE to 12.94 mm, and PBIAS to 14.16% for PERSIANN data considering the LT method using the bias correction factors.

The effectiveness of the bias correction methods varied depending on the specific RSR dataset being adjusted at the station locations. At Arua, for instance, Quantile Mapping outperformed other methods with the GPCC dataset, by delivering the lowest errors: RMSE reduced from 31.48 mm to 19.94 mm (37% improvement), MAE from 22.57 mm to 15.44 mm (32% better), and PBIAS from 11.63% to −4.32% (63% gain). For the CHIRPS dataset at Arua, Delta2 (using predetermined bias correction factors) emerged among the top performers. The CHIRPS-Delta2 method reduced RMSE from 49.14 mm to 21.41 mm (56% improvement), MAE from 45.74 mm to 17.38 mm (62% better), and PBIAS from −59.83% to −8.18% (86% improvement). At Fort Portal, the CHIRPS-Delta2 method based on predetermined bias correction factors was the most effective, reducing RMSE from 28.35 mm to 15.02 mm (47% improvement), MAE from 25.28 mm to 11.35 mm (55% better), and PBIAS from −46.2% to 4.74% (90% improvement).

3.4.3. Validation Results Based on PDF and CDF Shapes

Figure 7, an illustrative plot for the Arua station, displays the PDFs and CDFs for observed rainfall, original (uncorrected) RSR, and bias-corrected rainfall data. The PDFs and CDFs of the original RSR data (blue lines) consistently showed narrower peaks and lower cumulative probabilities compared to the observed rainfall (black dashed lines), indicating significant underestimation across both stations. The original ERA5, CHIRPS, and PERSIANN RSR datasets exhibited distinctly different distributional shapes compared to the observed data. Among the bias correction methods, Quantile Mapping (green lines) produced PDFs and CDFs that most closely matched the observed distributions, confirming its superior effectiveness across validation stations.

3.4.4. Validation Results Based on the Goodness-of-Fit Tests (KS and p-Values)

Table 10 presents goodness-of-fit test validation results for the bias correction framework, specifically the Kolmogorov–Smirnov (KS) statistics and corresponding p-values, for three bias correction methods.

From the results presented in Table 10, the following can be observed:

The original (uncorrected) RSR data consistently exhibited high KS statistics (0.40–1.00) and low p-values (0.00–0.07), indicating significant distributional differences between RSR and observed rainfall. For example, at Arua, the CHIRPS original data had a KS statistic of 1.00 with a p-value of 0.00, and at Fort Portal, the PERSIANN original data had a KS statistic of 0.87 with a p-value of 0.00, confirming the poor fit and the need for bias correction.

At the Arua station, Quantile Mapping generally emerged as the top-performing method, achieving the lowest KS statistics for most RSR products. For instance, across the CHIRPS, ERA5, GPCC, and PERSIANN datasets, Quantile Mapping consistently recorded a KS statistic of 0.17 with a p-value of 0.81, indicating an excellent distributional fit with observed rainfall data. This suggests that Quantile Mapping effectively aligned the corrected RSR data with the observed distribution at this tropical savannah station. In contrast, at the Fort Portal station, located in a tropical monsoon climate zone, the Linear Transformation method took the lead, delivering the lowest KS values and the highest p-values. For example, the GPCC, ERA5, and CHIRPS datasets (using predetermined bias correction factors) adjusted with the Linear Transformation method achieved a KS statistic of 0.17 and a p-value of 0.81. These results demonstrate an outstanding alignment with the observed rainfall distributions, highlighting the method’s effectiveness in this distinct climatic context.

4. Discussion

4.1. Discussion on the Performance of Bias Correction Methods

All bias correction methods significantly improved the original RSR datasets by reducing the RMSE, MAE, and absolute PBIAS compared to the original data, though NSE remained mostly negative. Aside from the negative NSE values, bias correction significantly improved the original RSR data as also demonstrated in previous research by several authors [1,4,8,24]. All seven RSR products evaluated registered improvement in terms of statistical performance metrics after application of bias correction methods. For example, at the Gulu station and for the CHIRPS data, the DM bias correction method improved the RMSE by 43% (23.75 mm vs. 41.64 mm), the MAE by 50% (18.2 mm vs. 36.61 mm), and the PBIAS by nearly 100% (−0.01% vs. −50.59%). The poor performance of the original RSR data products was also observed from the goodness-of-fit test results, clearly illustrated in the time series plot (Figure 3), as well as the PDF and CDF plots (Figure 4).

The Quantile Mapping method emerged as the most effective bias correction method in gauged catchments, achieving the lowest RMSE and MAE values at Gulu and Jinja stations. At Gulu, the QM method applied to the NOAA_CPC data improved RMSE by 35% (19 mm from 29.2 mm), MAE by 43% (12.84 mm from 22.44 mm), and PBIAS by 95% (1.05% from −19.23%), significantly improving upon the original NOAA_CPC dataset. Similarly, at Jinja, QM produced an RMSE of 17.96 mm (improvement by 35%), MAE of 14.36 mm (improvement by 39%), and PBIAS of 1.25% (improvement by 90%). The good performance of the Quantile Mapping method was further supported by the goodness-of-fit test results and the graphical plots (time series and the PDF/CDF). QM outperformed other methods in aligning the bias-corrected RSR data closely with observed rainfall patterns, as illustrated in time series and PDF/CDF plots, and demonstrating near-perfect distributional fits (with a KS statistic of 0.03 and a p-value of 1.00) across all stations. The good performance of the QM method is consistent which previous research such as that of [23], who noted its effectiveness in reducing biases in GPCC RSR datasets in gauged catchments. Many other researchers, including Xiaomeng Li [12,19,57], also reported promising results after testing the effectiveness of QM in reducing biases in RSR. However, the good performance as reported from those previous research works was mostly based on the evaluation of RSR products at monthly and seasonal temporal scales.

The LT bias correction method ranked as a strong secondary option overall, outperforming other methods at the Mbarara station in terms of statistical metrics (RMSE, MAE, and PBIAS). The DM method followed as the third-best method overall, offering moderate effectiveness. The DM method showed higher errors than QM and LT at most stations but outperformed other methods at the Soroti station, suggesting situational strengths. The good performance of LT and DM, outperforming QM at the Mbarara and Soroti stations, respectively, could be attributed to local climate conditions. On the basis of PBIAS statistical metrics, the LT and DM methods outperformed QM at all stations and for all RSR datasets as the methods improved the metric nearly by 100%. As Ajaaj [23] noted, the performance of these bias correction methods vary in space and temporal resolution and therefore recommended testing multiple methods to determine the best product. Although the research by Ouatiki et al. [24] was based on the monthly and seasonal resolutions, they noted instances where LT and DT methods outperformed the QM method when correcting CHIRPS, PERSIANN, and other RSR datasets. Gado et al. [41], also in evaluating the CHIRPS, PERSIANN, and other RSR datasets in the upper Blue Nile basin, noted that DT outperformed the LT bias correction method. Meanwhile, although the Polynomial Regression (PR) approach consistently recorded the lowest RMSE (ranging from 15.34 for NOAA_CPC at Mbarara to 16.75 for CHIRPS at Gulu) and MAE (ranging from 10.94 for CHIRPS at Soroti to 13.06 at Gulu), alongside a PBIAS of 0.00% across all RSR datasets, the graphical plots and goodness-of-fit KS and p-values showed that it was the least effective bias correction method. The time series analysis revealed that the PR approach overfitted the data, producing overly smoothed trends that failed to capture inter-annual variability and extreme rainfall events. This overfitting and poor performance of the PR method has also be noted by previous researchers such as [23,58], which undermines its practical accuracy, making it less reliable despite its favorable statistical metrics.

4.2. Discussion on the Developed Bias Correction Framework

Frameworks using similar bias correction methods have been widely used in previous research, including works by Koutsouris et al. and others [19,23,24,41,46], to correct RSR data in gauged catchments. However, these studies were limited to gauged catchments and did not explore testing of their frameworks in ungauged catchments. This research explored possibilities of bias-correcting RSR data in ungauged catchments by using predefined bias correction factors. Several authors, including [42,58,59], have expressed reservations on applying bias correction using parameters derived from other regions, their primary concern being the assumption of the spatial stationarity of biases, which they rightly argue may not hold under changing climatic conditions or across different physiographic environments. They caution that transferring regionalized parameters without local observations can introduce significant uncertainty, potentially distorting key hydrological signals or feedback. However, in data-scarce environments or ungauged catchments, especially those within the same climatic zones with observed data, there remains a practical and arguably justifiable case for using regional climatological parameters from nearby gauged stations, as proxies. When bias patterns are shown to be systematic and consistent across stations within the same climate zone, these parameters can serve as a correction tool for RSR data. While not without limitations, this approach provides a flexible and scalable framework to enhance the utility of RSR datasets in ungauged catchments, offering a viable interim solution where ground observations are lacking and hydrological infrastructure planning or climate adaptation decisions must proceed.

4.3. Discussion on Framework Validation Findings and Results

The validation results based on statistical performance metrics, goodness-of-fit tests, and graphical plots at the Arua and the Fort Portal stations demonstrated the effectiveness and flexibility of the developed framework in adjusting the RSR datasets. The results aligned with previous research, indicating that bias correction significantly improves RSR data accuracy [1,4,8]. A key contribution of this research is the use of predetermined bias correction factors to adjust RSR datasets in ungauged catchments. The findings indicated that this approach was highly effective, particularly for CHIRPS, PERSIANN, and NOAA_CPC datasets. The statistical improvements seen at Arua (56% RMSE reduction, 86% PBIAS improvement) and Fort Portal (47% RMSE reduction, 90% PBIAS improvement) demonstrated the potential of this approach in providing reliable corrections without local observed data. This concept of utilizing regionalized parameters aligns with studies on regional parameter transferability in hydrology by Xue Yang and others [54,55,56], suggesting that systematic bias patterns can be used to estimate correction factors for ungauged regions. However, the effectiveness of predetermined bias correction factors varied depending on the station locations and climate zones. At Arua (tropical savannah), maximum bias values produced the best results, whereas at Fort Portal (tropical monsoon), minimum bias values were more effective. This finding emphasizes the role of regional climatic differences in shaping RSR biases, as previously noted by [23], whose dataset performance varied across climate zones.

5. Conclusions and Recommendations

5.1. Conclusions

This research aimed at evaluating and comparing the performance of four bias correction methods, QM, LT, DM, and PR, on seven RSR datasets (CHIRPS, ERA5, ERA5_AG, MERRA2, PERSIANN_CDR, NOAA_CPC, and GPCC), with the goal of developing and validating a bias correction framework for ungauged catchments in Uganda.

The first objective, to evaluate and compare the bias correction methods in gauged catchments, was successfully achieved. Among all methods assessed, QM consistently emerged as the most effective, delivering strong statistical improvements across all stations and RSR datasets. For instance, at Gulu, applying QM to the NOAA_CPC dataset reduced the RMSE from 29.20 mm to 19.00 mm (a 35% improvement), MAE from 22.44 mm to 12.84 mm (a 43% improvement), and PBIAS from −19.23% to 1.05% (a 95% improvement). At Jinja, QM applied to the GPCC dataset achieved an RMSE of 17.96 mm, down from 22.22 mm (a 19% improvement), MAE of 14.36 mm from 17.56 mm (an 18% improvement), and PBIAS of 1.25% from −14.64% (a 91% improvement). In contrast, Polynomial Regression, despite favorable statistical performance, showed poor time series alignment and overfitting, misrepresenting inter-annual variability and extremes. The strong performance of the QM method demonstrated its ability to adjust not only mean biases but also the entire distribution of rainfall, preserving variability and capturing extremes more effectively than the other techniques. This is in agreement with findings from previous studies such as those by [24,43] which highlighted the robustness of QM in bias correction. The LT demonstrated strong performance, particularly at Mbarara, where it effectively corrected bias in the GPCC dataset, achieving improvements of 9% in RMSE, 10% in MAE, and eliminating systematic bias with a PBIAS of 0.00%. The DM method, although less consistent overall, was most effective at Soroti for the CHIRPS dataset, where it reduced the RMSE and MAE by 45% and 53% respectively, and corrected PBIAS from −45% to 0.00%. Polynomial Regression (PR) yielded the best statistical performance across all stations, with the lowest RMSE and MAE and the highest NSE values. However, as revealed from the time series plots and goodness-of-fit test results, it overfitted the data, producing smoothed outputs that failed to capture rainfall variability and extremes, as also noted by [23].

For the second objective, our research successfully developed a flexible bias correction framework that adapted the better-performing methods in gauged catchments to ungauged catchments. This was achieved through the use of predetermined bias correction factors (mean and standard deviation biases) derived from four gauged stations. These factors are particularly valuable in regions with no observed data, supporting flexible bias correction across different climate zones. Validation at the Arua station, located in a tropical savannah climate zone, showed that applying the maximum observed biases yielded the best performance. In contrast, at the Fort Portal station, situated in a tropical monsoon climate zone, minimum bias values led to better results. These contrasting outcomes reveal the importance of considering regional climate variability in bias correction, an insight supported by previous research such as that by [54,55], who noted similar regional effects in hydrological modeling.

The third objective focused on validating the framework using independent stations, Arua (in a tropical savannah climate zone) and Fort Portal (in a tropical monsoon climate zone). The results confirmed the framework’s adaptability and effectiveness. For example, at Arua, validation using CHIRPS data, the Delta2 method (using predetermined factors) performed strongly, reducing the RMSE from 49.14 mm to 21.41 mm (56% improvement), MAE from 45.74 mm to 17.38 mm (62% better), and PBIAS from −59.83% to −8.18% (86% improvement). At Fort Portal, CHIRPS-Delta2 proved most effective, lowering the RMSE from 28.35 mm to 15.02 mm (47% improvement), MAE from 25.28 mm to 11.35 mm (55% better), and PBIAS from −46.2% to 4.74% (90% improvement), indicating substantial improvement. Furthermore, goodness-of-fit tests confirmed distributional alignment. At Arua, Quantile Mapping recorded KS statistics as low as 0.17 with p-values of 0.81, while Linear Transformation outperformed others at Fort Portal using minimum predetermined bias values. These results demonstrate the importance of climate zone-specific corrections, with maximum bias values performing better in savannah zones and minimum values in monsoon zones.

5.2. Recommendations

Based on this research’s findings and results, the following recommendations are proposed:

Quantile Mapping should be the primary bias correction method where sufficient historical data are available or can be inferred from nearby stations. It consistently delivered the lowest RMSE and best distributional fits across all datasets and gauging stations (e.g., KS = 0.03, p = 1.00 across multiple locations).
Where QM is not feasible due to data limitations, LT and DM methods offer a viable alternative, especially in ungauged areas where predetermined bias correction factors can be used. LT and DM methods offer useful alternatives in ungauged catchments, particularly when combined with predetermined bias correction factors. Their performance was notable in the absence of local observed data, such as at the Arua station.
Polynomial Regression should be avoided despite its statistically favorable metrics in most cases. It demonstrated a poor visual fit, over-smoothed time series, and failed to capture rainfall extremes.
The developed bias correction framework can be implemented in ungauged catchments across Uganda, particularly in remote areas where observed data are sparse. Option 1, which uses the closest station’s observed data in the same climatic zone, should be prioritized where feasible, as it provides more accurate results. Option 2, which relies on predefined bias correction factors, remains a valuable alternative in the absence of nearby stations.
For ungauged catchments, it is recommended to use predetermined bias correction factors tailored to specific climate zones. This research showed that maximum bias factors worked best in the tropical savannah (e.g., Arua), while minimum bias values were more effective in the tropical monsoon (e.g., Fort Portal). Practitioners should therefore adjust correction strategies based on regional climatic conditions.
From a policy and infrastructure planning perspective, institutions such as the Ministry of Water and Environment and the Ministry of Works and Transport of the Republic of Uganda should explore the use of bias-corrected RSR datasets into flood risk assessment and infrastructure design frameworks in data scarce or ungauged catchments. These corrected RSR datasets are viable alternatives to observed rainfall data as inputs for models used in designing culverts, bridges, and drainage systems. Additionally, policymakers and research institutions should consider prioritizing investments in expanding ground-based rainfall monitoring networks, as improved observational data will strengthen the calibration and validation of bias correction models and support better water resource planning under changing climatic conditions.

5.3. Research Assumptions and Limitations

Despite the promising results reported earlier, the research has certain limitations. In terms of scope, the research focused on bias correction for Annual Maximum Series (AMS) of daily RSR datasets (CHIRPS, GPCC, NOAA_CPC, ERA5, PERSIANN, MERRA2, and ERA5_AG) across the four gauged stations (Gulu, Soroti, Jinja, and Mbarara), with validations at the Arua and Fort Portal stations in Uganda. It analyzed RSR data from 1991 to 2020, evaluating bias correction methods for their ability to align with AMS of observed rainfall patterns. Some of the key assumptions and limitations of the research are as follows:

The research focused on Annual Maximum Series (AMS) rainfall data, which may not adequately capture short-duration extreme rainfall events that are critical for flood modeling.
The predetermined bias correction factors, derived from four gauged stations and regionalized parameters (mean and standard deviation), proved effective at the validation sites. However, their generalizability to other ungauged locations remains uncertain without further testing. The reliance on four stations may not fully capture spatial variability across Uganda.
Stationarity of RSR bias in time and space: It was assumed that the nature and magnitude of biases in RSR data remained stable over time and across stations. This assumption may not hold in the context of climate change or evolving land-use patterns as also noted by Maraun [59]. In addition, while the framework was validated at two additional stations, further testing across more ungauged catchments in various climate zones is necessary to generalize its application.

5.4. Areas for Future Research

While this research made contributions to the evaluation, development, and validation of RSR bias correction framework, there are several areas that warrant further investigation:

Future research could explore other RSR products not tested in this study to evaluate the applicability and effectiveness of the developed framework across a broader range of RSR datasets.
Future research could build on this research by examining short-duration RSR products or exploring methods to disaggregate daily rainfall data. Such disaggregation would facilitate the generation of sub-daily rainfall estimates, enhancing flood risk modeling, hydrological infrastructure design, and water resource management.
Application of the developed bias correction framework to larger scale hydrological models. Future studies should test the application of the bias correction framework on large-scale hydrological models to assess its impact on water resource management and planning.
Further research could also explore the integration of machine learning models to enhance the adaptability and precision of bias correction, especially in complex terrains or data-sparse regions. In addition, the development of multivariate bias correction frameworks, which account for interactions between rainfall and other climatic variables like temperature or humidity, could significantly improve the physical realism of corrected datasets.
Finally, validating this framework in more gauged and ungauged catchments across climate zones in Uganda is important to help determine its scalability and reliability in other regions.

5.5. Principal Conclusions

This research concludes that the bias correction of RSR datasets is essential for improving rainfall estimation in ungauged catchments, particularly in regions like Uganda, where observational data are limited. The proposed framework, anchored on the use of predetermined bias correction factors, including regionalized parameters, was validated through independent gauged stations. The framework proved effective in reducing systematic errors and aligning RSR data more closely with observed rainfall. Quantile Mapping emerged as the most effective method, while the Delta Multiplicative and Linear Transformation approaches also offered substantial improvements to RSR datasets at some stations and in some climatic zones. However, dataset-specific and climate zone-specific adjustments remain necessary, pointing to the need for tailored bias correction strategies.

The findings of this research can be broadly applied to other regions in Uganda with similar climatic and hydrological conditions, particularly in tropical and data-scarce areas that rely on RSR products. The bias correction framework, centered on regionalization and climate zone-based correction factors, is transferable to ungauged catchments within tropical savannah, monsoon, and rainforest zones, assuming similar RSR bias patterns. However, its applicability may be limited by regional variations in topography, rainfall mechanisms, and land use, which can influence both the magnitude and nature of RSR biases. Therefore, while the methodology is broadly adaptable, local recalibration and validation against representative stations in the target region are necessary to ensure accuracy in different settings.

The research contributes practical tools for hydrological applications in data-scarce environments and lays the groundwork for more adaptive, data-driven approaches to bias correction of AMS of daily RSR datasets in East Africa and beyond.

Author Contributions

Conceptualization, M.O.; methodology, M.O.; software, M.O.; validation, M.O. and J.D.P.; formal analysis, M.O. and J.D.P.; investigation, M.O. and J.D.P.; resources, M.O. and J.D.P.; data curation, M.O.; writing—original draft preparation, M.O.; writing—review and editing, M.O. and J.D.P.; visualization, M.O. and J.D.P.; supervision, J.D.P.; project administration, J.D.P.; funding acquisition, M.O. and J.D.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The Uganda National Meteorological Authority (UNMA) provided the observed daily rainfall data of the four rain gauge stations. The observed daily rainfall raw data supporting the conclusions of this article will be made available by the authors on request. The Remote Sensing Rainfall dataset presented in this research is freely available at https://app.climateengine.com/climateEngine, accessed on 16 December 2022.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

AMS	Annual Maximum Series
CDF	Cumulative distribution function
CHIRPS	Climate Hazards Group InfraRed Precipitation with Station data
DM	Delta Multiplicative
ECMWF	European Centre for Medium-Range Weather Forecasts
ERA5	ECMWF Reanalysis v5
ERA5_AG	ECMWF Reanalysis v5 for Agriculture
GPCC	Global Precipitation Climatology Center
IDF	Intensity Duration Frequency
KS	Kolmogorov–Smirnov
LT	Linear Transformation
MAE	Mean Absolute Error
MERRA-2	Modern-Era Retrospective Analysis for Research and Applications, Version 2
NOAA_CPC	National Oceanic and Atmospheric Administration Climate Prediction Center
NSE	Nash–Sutcliffe Efficiency
PBIAS	Percent Bias
PDF	Probability Density Functions
PERSIANN-CDR	Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks-Climate Data Record
PR	Polynomial Regression
QM	Quantile Mapping
RMSE	Root-Mean-Square Error
RSR	Remote Sensing Rainfall

References

Kimani, M.W.; Hoedjes, J.C.B.; Su, Z. Bayesian Bias correction of satellite rainfall estimates for climate studies. Remote Sens. 2018, 10, 1074. [Google Scholar] [CrossRef]
Ngoma, H.; Wen, W.; Ojara, M.; Ayugi, B. Assessing current and future spatiotemporal precipitation variability and trends over Uganda, East Africa, based on CHIRPS and regional climate model datasets. Meteorol. Atmos. Phys. 2021, 133, 823–843. [Google Scholar] [CrossRef]
Okirya, M.; Du Plessis, J.A. Trend and Variability Analysis of Annual Maximum Rainfall Using Observed and Remotely Sensed Data in the Tropical Climate Zones of Uganda. Sustainability 2024, 16, 6081. [Google Scholar] [CrossRef]
Katiraie-Boroujerdy, P.S.; Naeini, M.R.; Asanjan, A.A.; Chavoshian, A.; Hsu, K.-L.; Sorooshian, S. Bias correction of satellite-based precipitation estimations using quantile mapping approach in different climate regions of Iran. Remote Sens. 2020, 12, 2102. [Google Scholar] [CrossRef]
Venkatesh, K.; Maheswaran, R.; Devacharan, J. Framework for developing IDF curves using satellite precipitation: A case study using GPM-IMERG V6 data. Earth Sci. Inform. 2022, 15, 671–687. [Google Scholar] [CrossRef]
Nkunzimana, A.; Bi, S.; Alriah, M.A.A.; Zhi, T.; Kur, N.A.D. Comparative analysis of the performance of satellite-based rainfall products over various topographical unities in Central East Africa: Case of Burundi. Earth Space Sci. 2020, 7, e2019EA000834. [Google Scholar] [CrossRef]
Ageet, S.; Fink, A.H.; Maranan, M.; Diem, J.E.; Hartter, J.; Ssali, A.L.; Ayabagabo, P. Validation of Satellite Rainfall Estimates over Equatorial East Africa. J. Hydrometeorol. 2022, 23, 129–151. [Google Scholar] [CrossRef]
Nakkazi, M.T.; Sempewo, J.I.; Tumutungire, M.D.; Byakatonda, J. Performance evaluation of CFSR, MERRA-2 and TRMM3B42 data sets in simulating river discharge of data-scarce tropical catchments: A case study of Manafwa, Uganda. J. Water Clim. Change 2022, 13, 522–541. [Google Scholar] [CrossRef]
Mekonnen, K.; Velpuri, N.M.; Leh, M.; Akpoti, K.; Owusu, A.; Tinonetsana, P.; Hamouda, T.; Ghansah, B.; Paranamana, T.P.; Munzimi, Y. Accuracy of satellite and reanalysis rainfall estimates over Africa: A multi-scale assessment of eight products for continental applications. J. Hydrol. Reg. Stud. 2023, 49, 101514. [Google Scholar] [CrossRef]
Onyutha, C. Geospatial trends and decadal anomalies in extreme rainfall over Uganda, East Africa. Adv. Meteorol. 2016, 2016, 6935912. [Google Scholar] [CrossRef]
Ngoma, H.; Ayugi, B.; Onyutha, C.; Babaousmail, H.; Lim Kam Sian, K.T.C.; Iyakaremye, V.; Mumo, R.; Ongoma, V. Projected changes in rainfall over Uganda based on CMIP6 models. Theor. Appl. Climatol. 2022, 149, 1117–1134. [Google Scholar] [CrossRef]
Li, X.; Wu, H.; Nanding, N.; Chen, S.; Hu, Y.; Li, L. Statistical Bias Correction of Precipitation Forecasts Based on Quantile Mapping on the Sub-Seasonal to Seasonal Scale. Remote Sens. 2023, 15, 1743. [Google Scholar] [CrossRef]
Andre, S.; Abhishek, G.; Slobodan, P.S.; Sandink, D. Computerized Tool for the Development of Intensity-Duration- Frequency Curves Under a Changing Climate, Computerized Tool for the Development of Intensity-Duration- Frequency Curves Under a Changing Climate, Technical Manual; Version 3; The University of Western Ontario, Department of Civil and Environmental Engineering and Institute for Catastrophic Loss Reduction: London, ON, Canada, 2018; Available online: www.idf-cc-uwo.ca (accessed on 23 May 2021).
Galiatsatou, P. Intensity-Duration-Frequency Curves at Ungauged Sites in a Changing Climate for Sustainable Stormwater Networks. Sustainability 2022, 14, 1229. [Google Scholar] [CrossRef]
Raes, D. Frequency Analysis of Rainfall Data. College on Soil Physics—30th Anniversary (1983–2013). 2013, p. 42. Available online: http://indico.ictp.it/event/a12165/session/21/contribution/16/material/0/0.pdf (accessed on 24 May 2021).
Van Wageningen, A.; Du Plessis, J. Are rainfall intensities changing, could climate change be blamed and what could be the impact for hydrologists? Water SA 2007, 33, 571–574. [Google Scholar]
Subramanya, K. Engineering Hydrology; Tata McGraw-Hill: New Delhi, India, 2008. [Google Scholar]
Gupta, S.R. Hydrology and Hydraulic Systems, 4th ed.; Waveland Press, Inc.: Long Grove, IL, USA, 2017. [Google Scholar]
Gumindoga, W.; Rientjes, T.H.M.; Tamiru Haile, A.; Makurira, H.; Reggiani, P. Performance of bias-correction schemes for CMORPH rainfall estimates in the Zambezi River basin. Hydrol. Earth Syst. Sci. 2019, 23, 2915–2938. [Google Scholar] [CrossRef]
Onyutha, C. Analyses of rainfall extremes in East Africa based on observations from rain gauges and climate change simulations by CORDEX RCMs. Clim. Dyn. 2020, 54, 4841–4864. [Google Scholar] [CrossRef]
Macharia, J.M.; Ngetich, F.K.; Shisanya, C.A. Comparison of satellite remote sensing derived precipitation estimates and observed data in Kenya. Agric. For. Meteorol. 2020, 284, 107875. [Google Scholar] [CrossRef]
Omonge, P.; Feigl, M.; Olang, L.; Schulz, K.; Herrnegger, M. Evaluation of satellite precipitation products for water allocation studies in the Sio-Malaba-Malakisi river basin of East Africa. J. Hydrol. Reg. Stud. 2021, 39, 100983. [Google Scholar] [CrossRef]
Ajaaj, A.A.; Mishra, A.K.; Khan, A.A. Comparison of BIAS correction techniques for GPCC rainfall data in semi-arid climate. Stoch. Environ. Res. Risk Assess. 2016, 30, 1659–1675. [Google Scholar] [CrossRef]
Ouatiki, H.; Boudhar, A.; Chehbouni, A. Accuracy assessment and bias correction of remote sensing–based rainfall products over semiarid watersheds. Theor. Appl. Climatol. 2023, 154, 763–780. [Google Scholar] [CrossRef]
Lee, G.; Nguyen, D.H.; Le, X.H. A Novel Framework for Correcting Satellite-Based Precipitation Products for Watersheds with Discontinuous Observed Data, Case Study in Mekong River Basin. Remote Sens. 2023, 15, 630. [Google Scholar] [CrossRef]
Dao, V.; Arellano, C.J.; Nguyen, P.; Almutlaq, F.; Hsu, K.; Sorooshian, S. Bias Correction of Satellite Precipitation Estimation Using Deep Neural Networks and Topographic Information Over the Western U.S. J. Geophys. Res. Atmos. 2025, 130, e2024JD042181. [Google Scholar] [CrossRef]
Wang, F.; Tian, D.; Carroll, M. Customized deep learning for precipitation bias correction and downscaling. Geosci. Model Dev. 2023, 16, 535–556. [Google Scholar] [CrossRef]
Chen, H.; Sun, L.; Cifelli, R.; Xie, P. Deep Learning for Bias Correction of Satellite Retrievals of Orographic Precipitation. IEEE Trans. Geosci. Remote Sens. 2022, 60, 1–11. [Google Scholar] [CrossRef]
Tao, Y.; Gao, X.; Hsu, K.; Sorooshian, S.; Ihler, A. A deep neural network modeling framework to reduce bias in satellite precipitation products. J. Hydrometeorol. 2016, 17, 931–945. [Google Scholar] [CrossRef]
Le, X.H.; Kim, Y.; Van Binh, D.; Jung, S.; Hai Nguyen, D.; Lee, G. Improving rainfall-runoff modeling in the Mekong river basin using bias-corrected satellite precipitation products by convolutional neural networks. J. Hydrol. 2024, 630, 130762. [Google Scholar] [CrossRef]
Beck, H.E.; McVicar, T.R.; Vergopolan, N.; Berg, A.; Lutsko, N.J.; Dufour, A.; Zeng, Z.; Jiang, X.; van Dijk, A.I.J.M.; Miralles, D.G. High-resolution (1 km) Köppen-Geiger maps for 1901–2099 based on constrained CMIP6 projections. Sci. Data 2023, 10, 724. [Google Scholar] [CrossRef]
Jury, M.R. Uganda rainfall variability and prediction. Theor. Appl. Climatol. 2018, 132, 905–919. [Google Scholar] [CrossRef]
Maity, R. Statistical Methods in Hydrology, 2nd ed.; Springer Nature Singapore Pte Ltd.: Kharagpur, India, 2022. [Google Scholar] [CrossRef]
Nicholson, S.E.; Klotter, D.A. Assessing the reliability of satellite and reanalysis estimates of rainfall in equatorial Africa. Remote Sens. 2021, 13, 3609. [Google Scholar] [CrossRef]
Rachidi, S.; Houssine, E.L.; Mazoudi, E.; El Alami, J.; Jadoud, M.; Er-raki, S. Assessment and Comparison of Satellite-Based Rainfall Products: Validation by Hydrological Modeling Using ANN in a Semi-Arid Zone. Water 2023, 15, 1997. [Google Scholar] [CrossRef]
Du Plessis, J.; Kibii, J. Applicability of CHIRPS-based satellite rainfall estimates for South Africa. J. S. Afr. Inst. Civ. Eng. 2021, 63, 43–54. [Google Scholar] [CrossRef]
Maity, R. Springer Transactions in Civil and Environmental Engineering Statistical Methods in Hydrology and Hydroclimatology. 2018. Available online: http://www.springer.com/series/13593 (accessed on 17 August 2021).
Zhao, C.; Yang, J. A Robust Skewed Boxplot for Detecting Outliers in Rainfall Observations in Real-Time Flood Forecasting. Adv. Meteorol. 2019, 2019, 1795673. [Google Scholar] [CrossRef]
Chinasho, A.; Bedadi, B.; Lemma, T.; Tana, T.; Hordofa, T.; Elias, B. Evaluation of Seven Gap-Filling Techniques for Daily Station-Based Rainfall Datasets in South Ethiopia. Adv. Meteorol. 2021, 2021, 9657460. [Google Scholar] [CrossRef]
Phan, Q.T.; Wu, Y.K.; Phan, Q.D.; Lo, H.Y. A Study on Missing Data Imputation Methods for Improving Hourly Solar Dataset. In Proceedings of the 2022 8th International Conference on Applied System Innovation, ICASI 2022, Nantou, Taiwan, 21–23 April 2022; pp. 21–24. [Google Scholar] [CrossRef]
Gado, A.T.; Zamzam, H.D.; Guo, Y.; Zeidan, A.B. Evaluation of satellite-based rainfall estimates in the upper Blue Nile basin. Indian Acad. Sci. 2024, 133, 27. [Google Scholar] [CrossRef]
Ehret, U.; Zehe, E.; Wulfmeyer, V.; Liebert, J. HESS Opinions “Should we apply bias correction to global and regional climate model data?”. Hydrol. Earth Syst. Sci. 2012, 16, 3391–3404. [Google Scholar] [CrossRef]
Enayati, M.; Bozorg-Haddad, O.; Bazrafshan, J.; Hejabi, S.; Chu, X. Bias correction capabilities of quantile mapping methods for rainfall and temperature variables. J. Water Clim. Change 2021, 12, 401–419. [Google Scholar] [CrossRef]
Ayugi, B.; Tan, G.; Ruoyun, N.; Babaousmail, H.; Ojara, M.; Wido, H.; Mumo, L.; Ngoma, N.H.; Nooni, I.K.; Ongoma, V. Quantile mapping bias correction on rossby centre regional climate models for precipitation analysis over Kenya, East Africa. Water 2020, 12, 801. [Google Scholar] [CrossRef]
Ringard, J.; Seyler, F.; Linguet, L. A quantile mapping bias correction method based on hydroclimatic classification of the Guiana shield. Sensors 2017, 17, 1413. [Google Scholar] [CrossRef]
Koutsouris, A.J.; Seibert, J.; Lyon, S.W. Utilization of global precipitation datasets in data limited regions: A case study of Kilombero Valley, Tanzania. Atmosphere 2017, 8, 246. [Google Scholar] [CrossRef]
Mirones, Ó.; Bedia, J.; Herrera, S.; Iturbide, M.; Baño Medina, J. Refining remote sensing precipitation datasets in the South Pacific with an adaptive multi-method calibration approach. Hydrol. Earth Syst. Sci. 2025, 29, 799–822. [Google Scholar] [CrossRef]
Bluman Allan, G. Elementary Statistics, a Step by Step Approach, 9th ed.; McGraw-Hill Education: New York, NY, USA, 2014. [Google Scholar]
Triola, M.F. Elementary Statistics, 12th ed.; Pearson Education Limited: Edinburgh Gate, UK, 2014. [Google Scholar]
Bamweyana, I.; Musinguzi, M.; Kayondo, L.M. Evaluation of CHIRPS Satellite Gridded Dataset as an Alternative Rainfall Estimate for Localized Modelling over Uganda. Atmos. Clim. Sci. 2021, 11, 797–811. [Google Scholar] [CrossRef]
Saber, M.; Yilmaz, K.K. Evaluation and bias correction of satellite-based rainfall estimates for modelling flash floods over the Mediterranean region: Application to Karpuz River Basin, Turkey. Water 2018, 10, 657. [Google Scholar] [CrossRef]
Diem, J.E.; Hartter, J.; Ryan, S.J.; Palace, M.W. Validation of satellite rainfall products for Western Uganda. J. Hydrometeorol. 2014, 15, 2030–2038. [Google Scholar] [CrossRef]
Karamouz, M.; Nazif, S.; Falahi, M. Hydrology and Hydroclimatology: Principles and Applications; CRC Press: Boca Raton, FL, USA, 2012. [Google Scholar] [CrossRef]
Yang, X.; Li, F.; Qi, W.; Zhang, M.; Yu, C.; Xu, C.Y. Regionalization methods for PUB: A comprehensive review of progress after the PUB decade. Hydrol. Res. 2023, 54, 885–900. [Google Scholar] [CrossRef]
Beck, H.E.; van Dijk, A.I.J.M.; De Roo, A.; Miralles, D.G.; McVicar, T.R.; Jaap, S.; Adrian, B.L. Global-scale regionalization of hydrologic model parameters. Water Resour. Res. 2016, 52, 3599–3622. [Google Scholar] [CrossRef]
Singh, S.K.; Bárdossy, A.; Götzinger, J.; Sudheer, K.P. Effect of spatial resolution on regionalization of hydrological model parameters. Hydrol. Process. 2012, 26, 3499–3509. [Google Scholar] [CrossRef]
Vigna, I.; Bigi, V.; Pezzoli, A.; Besana, A. Comparison and bias-correction of satellite-derived precipitation datasets at local level in northern Kenya. Sustainability 2020, 12, 2896. [Google Scholar] [CrossRef]
Gudmundsson, L.; Bremnes, J.B.; Haugen, J.E.; Engen-Skaugen, T. Technical Note: Downscaling RCM precipitation to the station scale using statistical transformations—A comparison of methods. Hydrol. Earth Syst. Sci. 2012, 16, 3383–3390. [Google Scholar] [CrossRef]
Maraun, D. Bias Correcting Climate Change Simulations—A Critical Review. Curr. Clim. Change Rep. 2016, 2, 211–220. [Google Scholar] [CrossRef]

Figure 1. Map of Uganda showing the 6 rainfall gauging stations.

Figure 2. Gaps in rainfall data at the Arua and the Fort Portal rainfall stations.

Figure 3. Observed, original, and bias-corrected rainfall at the Gulu Station.

Figure 4. PDFs and CDFs of observed and bias-corrected data for the Gulu station.

Figure 5. A flowchart of the proposed RSR data bias correction framework.

Figure 7. PDF and CDF of observed and bias-corrected data for the Arua station.

Table 1. Equations for statistical performance metrics.

Equation	Range (Remarks)	Optimum Value (Units)
$RMSE = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(R_{RSR, i} - R_{obs, i})}^{2}}$	0 to ∞ (smaller is better)	0 (mm)
$MAE = \frac{1}{n} \sum_{i = 1}^{n} \|R_{obs, i} - R_{RSR, i}\|$	0 to ∞ (smaller is better)	0 (mm)
$PBIAS = \frac{\sum_{i = 1}^{n} (R_{RSR, i} - R_{obs, i})}{\sum_{i = 1}^{n} R_{obs, i}} \times 100 %$	−∞ to ∞ (closer to 0 is better)	0 (%)
$NSE = 1 - \frac{\sum_{i = 1}^{n} {({R_{RSR, i} - R}_{obs, i})}^{2}}{\sum_{i = 1}^{n} {(R_{obs, i} - {\bar{R}}_{obs, i})}^{2}}$	−∞ to 1 (closer to 1 is better)	1 ( )

where R_obs,i is the observed rainfall values, R_adj,i is the bias-corrected RSR values, and

{\bar{R}}_{obs, i}

is the mean of the observed rainfall data.

Table 2. Statistical performance test results at the Gulu and Soroti stations.

Dataset/Bias Correction Method	Gulu Station				Soroti Station
Dataset/Bias Correction Method	RMSE	MAE	PBIAS	NSE	RMSE	MAE	PBIAS	NSE
NOAA_CPC_Orig	29.20	22.44	−19.23	−1.68	25.24	19.03	−14.79	−0.97
NOAA_CPC_Linear	21.03	13.98	0.00	−0.39	21.78	16.60	0.00	−0.47
NOAA_CPC_Quantile	19.00	12.84	1.05	−0.14	21.79	14.99	0.90	−0.47
NOAA_CPC_Delta	30.33	18.80	0.00	−1.89	25.12	18.99	0.00	−0.95
NOAA_CPC_Poly	16.43	13.06	0.00	0.15	17.25	11.73	0.00	0.08
CHIRPS_Orig	41.64	36.61	−50.59	−4.45	37.67	33.07	−45.00	−3.39
CHIRPS_Linear	26.74	20.45	0.00	−1.25	22.27	17.09	0.00	−0.54
CHIRPS_Quantile	26.12	19.97	1.05	−1.14	23.18	15.58	0.90	−0.66
CHIRPS_Delta	23.75	18.20	−0.01	−0.77	20.89	15.68	0.00	−0.35
CHIRPS_Poly	17.62	14.33	0.00	0.02	16.75	10.94	0.00	0.13
ERA5_Orig	43.29	38.66	−53.41	−4.89	45.03	38.76	−52.74	−5.28
ERA5_Linear	26.52	19.66	0.00	−1.21	26.59	18.97	0.00	−1.19
ERA5_Quantile	25.53	19.31	1.05	−1.05	26.46	17.69	0.90	−1.17
ERA5_Delta	23.32	17.95	0.00	−0.71	33.57	24.68	0.00	−2.49
ERA5_Poly	17.70	14.18	0.00	0.02	17.61	11.35	0.00	0.04
GPCC_Orig	27.46	21.15	−3.05	−1.37	26.08	18.90	−9.66	−1.10
GPCC_Linear	24.16	18.64	0.00	−0.84	24.88	17.11	0.00	−0.92
GPCC_Quantile	23.66	18.41	1.05	−0.76	26.19	16.32	0.90	−1.12
GPCC_Delta	27.91	21.36	0.00	−1.45	26.50	18.36	−0.01	−1.17
GPCC_Poly	17.77	14.46	0.00	0.01	17.58	11.61	0.00	0.04
PERSIANN_Orig	47.30	43.28	−54.50	−6.03	47.80	44.02	−59.89	−6.07
PERSIANN_Linear	27.35	18.46	0.00	−1.35	24.18	17.43	0.00	−0.81
PERSIANN_Quantile	26.20	21.24	1.05	−1.16	24.27	17.00	0.90	−0.82
PERSIANN_Delta	42.49	25.29	0.00	−4.68	23.77	17.14	0.00	−0.75
PERSIANN_Poly	17.51	13.97	0.00	0.04	17.74	11.86	0.00	0.03
ERA5_AG_Orig	43.28	37.14	−51.32	−4.89	43.30	35.02	−47.65	−4.80
ERA5_AG_Linear	27.94	22.49	0.00	−1.45	27.21	20.42	0.00	−1.29
ERA5_AG_Quantile	28.50	23.51	1.05	−1.55	26.80	18.41	0.90	−1.22
ERA5_AG_Delta	29.74	23.88	0.00	−1.78	37.00	29.34	0.00	−3.24
ERA5_AG_Poly	16.90	13.09	0.00	0.10	16.98	10.23	0.00	0.11
MERRA2_Orig	40.75	36.07	−30.01	−4.22	46.41	41.27	−55.16	−5.67
MERRA2_Linear	24.07	17.62	0.00	−0.82	26.13	18.29	0.00	−1.11
MERRA2_Quantile	23.55	17.32	1.05	−0.74	26.03	17.43	0.90	−1.10
MERRA2_Delta	46.43	31.70	0.00	−5.78	34.37	24.41	0.00	−2.66
MERRA2_Poly	17.32	13.99	0.00	0.06	17.81	11.55	0.00	0.02

Table 3. Statistical performance test results at the Jinja and Mbarara stations.

Dataset/Bias Correction Method	Jinja Station				Mbarara Station
Dataset/Bias Correction Method	RMSE	MAE	PBIAS	NSE	RMSE	MAE	PBIAS	NSE
NOAA_CPC_Orig	27.63	23.42	−12.93	−1.59	25.77	19.51	−11.63	−1.30
NOAA_CPC_Linear	20.30	15.79	0.00	−0.40	22.93	15.97	0.00	−0.82
NOAA_CPC_Quantile	21.13	16.33	1.25	−0.52	23.11	15.05	1.43	−0.85
NOAA_CPC_Delta	28.91	22.04	0.00	−1.84	26.79	18.59	0.00	−1.48
NOAA_CPC_Poly	16.35	12.82	0.00	0.09	15.34	11.20	0.00	0.19
CHIRPS_Orig	45.09	39.64	−53.18	−5.90	34.56	28.61	−46.99	−3.13
CHIRPS_Linear	28.52	21.79	0.00	−1.76	26.77	19.33	−0.01	−1.48
CHIRPS_Quantile	28.24	21.38	1.25	−1.71	26.41	18.43	1.43	−1.41
CHIRPS_Delta	29.17	22.33	0.00	−1.89	25.63	18.44	−0.01	−1.27
CHIRPS_Poly	15.63	12.40	0.00	0.17	16.18	12.90	0.01	0.09
ERA5_Orig	50.04	46.03	−61.57	−7.50	37.48	32.59	−52.53	−3.86
ERA5_Linear	23.32	16.98	0.00	−0.85	26.54	17.90	0.00	−1.44
ERA5_Quantile	22.45	17.23	1.25	−0.71	25.07	16.97	1.43	−1.17
ERA5_Delta	35.35	22.34	0.00	−3.24	28.45	18.86	0.00	−1.80
ERA5_Poly	16.95	13.61	0.00	0.02	16.52	11.74	0.00	0.06
GPCC_Orig	22.22	17.56	−14.64	−0.68	22.62	16.60	−5.93	−0.77
GPCC_Linear	18.60	14.80	0.00	−0.17	20.66	14.98	0.00	−0.48
GPCC_Quantile	17.96	14.36	1.25	−0.10	22.72	15.85	1.43	−0.79
GPCC_Delta	21.42	16.65	0.00	−0.56	23.21	16.50	0.00	−0.86
GPCC_Poly	15.42	12.53	0.00	0.19	16.34	11.94	0.00	0.08
PERSIANN_Orig	48.88	45.10	−60.76	−7.11	41.10	36.33	−61.51	−4.84
PERSIANN_Linear	26.36	20.30	0.00	−1.36	26.66	18.71	0.00	−1.46
PERSIANN_Quantile	25.68	19.59	1.25	−1.24	26.35	18.55	1.43	−1.40
PERSIANN_Delta	23.68	18.24	0.00	−0.90	25.26	17.76	0.00	−1.21
PERSIANN_Poly	16.73	13.33	0.00	0.05	16.39	12.13	0.00	0.07
ERA5_AG_Orig	41.30	37.04	−47.51	−4.79	41.36	36.94	−60.85	−4.92
ERA5_AG_Linear	20.32	16.57	0.00	−0.40	25.65	17.79	0.00	−1.28
ERA5_AG_Quantile	18.87	15.36	1.25	−0.21	24.80	17.29	1.21	−1.13
ERA5_AG_Delta	35.24	27.79	0.00	−3.22	31.01	21.32	0.00	−2.33
ERA5_AG_Poly	16.11	13.30	0.00	0.12	16.32	11.77	0.00	0.08
MERRA2_Orig	38.52	34.87	−44.57	−4.04	33.35	28.83	−39.85	−2.85
MERRA2_Linear	20.25	14.77	0.00	−0.39	23.00	16.99	0.00	−0.83
MERRA2_Quantile	19.63	14.21	1.25	−0.31	22.51	16.35	1.43	−0.75
MERRA2_Delta	29.12	19.69	0.00	−1.88	33.02	24.28	0.00	−2.77
MERRA2_Poly	16.03	12.90	0.00	0.13	15.98	11.53	0.00	0.12

Table 4. Goodness-of-fit test results for the four gauged stations.

Dataset/Bias Correction Method	Gulu Station		Soroti Station		Jinja Station		Mbarara Station
Dataset/Bias Correction Method	KS	p-Value	KS	p-Value	KS	p-Value	KS	p-Value
NOAA_CPC_Orig	0.40	0.02	0.40	0.02	0.50	0.00	0.27	0.24
NOAA_CPC_Linear	0.13	0.96	0.13	0.96	0.13	0.96	0.13	0.96
NOAA_CPC_Quantile	0.03	1.00	0.03	1.00	0.03	1.00	0.03	1.00
NOAA_CPC_Delta	0.27	0.24	0.17	0.81	0.30	0.14	0.23	0.39
NOAA_CPC_Poly	0.27	0.24	0.50	0.00	0.47	0.00	0.27	0.24
CHIRPS_Orig	0.97	0.00	0.90	0.00	0.90	0.00	0.80	0.00
CHIRPS_Linear	0.13	0.96	0.20	0.59	0.13	0.96	0.10	1.00
CHIRPS_Quantile	0.03	1.00	0.03	1.00	0.03	1.00	0.03	1.00
CHIRPS_Delta	0.17	0.81	0.17	0.81	0.13	0.96	0.10	1.00
CHIRPS_Poly	0.53	0.00	0.30	0.14	0.30	0.14	0.37	0.03
ERA5_Orig	0.97	0.00	0.87	0.00	0.93	0.00	0.87	0.00
ERA5_Linear	0.13	0.96	0.17	0.81	0.20	0.59	0.23	0.39
ERA5_Quantile	0.03	1.00	0.03	1.00	0.03	1.00	0.03	1.00
ERA5_Delta	0.23	0.39	0.27	0.24	0.27	0.24	0.23	0.39
ERA5_Poly	0.53	0.00	0.40	0.02	0.50	0.00	0.47	0.00
GPCC_Orig	0.20	0.59	0.37	0.03	0.43	0.01	0.30	0.14
GPCC_Linear	0.10	1.00	0.20	0.59	0.10	1.00	0.20	0.59
GPCC_Quantile	0.03	1.00	0.03	1.00	0.03	1.00	0.03	1.00
GPCC_Delta	0.17	0.81	0.23	0.39	0.20	0.59	0.20	0.59
GPCC_Poly	0.60	0.00	0.40	0.02	0.27	0.24	0.47	0.00
PERSIANN_Orig	0.93	0.00	0.97	0.00	1.00	0.00	0.97	0.00
PERSIANN_Linear	0.30	0.14	0.10	1.00	0.10	1.00	0.13	0.96
PERSIANN_Quantile	0.03	1.00	0.03	1.00	0.03	1.00	0.03	1.00
PERSIANN_Delta	0.27	0.24	0.10	1.00	0.17	0.81	0.13	0.96
PERSIANN_Poly	0.57	0.00	0.47	0.00	0.43	0.01	0.37	0.03
ERA5_AG_Orig	0.93	0.00	0.83	0.00	0.73	0.00	0.93	0.00
ERA5_AG_Linear	0.13	0.96	0.20	0.59	0.17	0.81	0.13	0.96
ERA5_AG_Quantile	0.03	1.00	0.03	1.00	0.03	1.00	0.03	1.00
ERA5_AG_Delta	0.10	1.00	0.37	0.03	0.37	0.03	0.20	0.59
ERA5_AG_Poly	0.37	0.03	0.30	0.14	0.40	0.02	0.43	0.01
MERRA2_Orig	0.70	0.00	0.93	0.00	0.73	0.00	0.67	0.00
MERRA2_Linear	0.20	0.59	0.10	1.00	0.10	1.00	0.13	0.96
MERRA2_Quantile	0.03	1.00	0.03	1.00	0.03	1.00	0.03	1.00
MERRA2_Delta	0.30	0.14	0.20	0.59	0.27	0.24	0.33	0.07
MERRA2_Poly	0.37	0.03	0.53	0.00	0.30	0.14	0.30	0.14

Table 5. Standard deviations of rainfall data at the four stations.

Stations	Observed	CHIRPS	ERA5_AG	MERRA2	NOAA	PERSIANN	ERA5	GPCC
Soroti	17.97	8.59	15.61	12.68	19.57	6.96	12.62	18.29
Gulu	17.84	6.73	9.79	31.14	24.69	16.18	6.18	22.28
Mbarara	17.00	8.23	9.27	17.91	19.70	5.84	9.21	19.63
Jinja	17.16	8.40	19.08	16.25	25.23	5.31	12.39	18.54

Table 6. Biases and predetermined bias correction factors in the standard deviation data values.

Stations	CHIRPS	ERA5_AG	MERRA2	NOAA	PERSIANN	ERA5	GPCC
Soroti	9.39	2.36	5.29	−1.60	11.01	5.35	−0.32
Gulu	11.11	8.05	−13.31	−6.85	1.65	11.66	−4.44
Mbarara	8.77	7.73	−0.91	−2.70	11.16	7.79	−2.63
Jinja	8.76	−1.92	0.91	−8.07	11.85	4.77	−1.38
Maximum bias	11.11			−1.60	11.85	11.66	−0.32
Minimum bias	8.76			−8.07	1.65	4.77	−4.44

Table 7. Mean values of rainfalls at the four stations.

Station	Observed	CHIRPS	ERA5_AG	MERRA2	NOAA	PERSIANN	ERA5	GPCC
Soroti	73.49	40.42	38.47	32.95	62.62	29.48	34.74	66.39
Gulu	72.38	35.76	35.23	50.66	58.46	32.93	33.72	70.17
Mbarara	59.07	31.31	23.13	35.53	52.20	22.73	28.04	55.56
Jinja	74.23	34.75	38.97	41.15	64.63	29.13	28.53	63.36

Table 8. Biases and predetermined bias correction factors in the mean rainfall values.

Station	CHIRPS	ERA5_AG	MERRA2	NOAA	PERSIANN	ERA5	GPCC
Soroti	33.07	35.02	40.54	10.87	44.02	38.76	7.10
Gulu	36.62	37.14	21.72	13.92	39.44	38.66	2.21
Mbarara	27.76	35.94	23.54	6.87	36.33	31.03	3.50
Jinja	39.48	35.26	33.08	9.60	45.10	45.70	10.87
Maximum bias	39.48	37.14	40.54	13.92	45.10	45.70	10.87
Minimum bias	27.76	35.02	21.72	6.87	36.33	31.03	2.21

Table 9. Statistical performance validation results at the Arua and the Fort Portal stations.

Dataset/Bias Correction Method	Arua Station				Fort Portal Station
Dataset/Bias Correction Method	RMSE	MAE	PBIAS	NSE	RMSE	MAE	PBIAS	NSE
NOAA_CPC_Orig	29.12	20.92	−19.74	−1.74	23.49	20.40	−7.25	−1.83
NOAA_CPC_Linear	24.90	18.23	−5.32	−1.00	20.18	15.40	8.39	−1.09
NOAA_CPC_Quantile	25.16	19.07	−4.32	−1.04	18.91	14.64	9.94	−0.83
NOAA_CPC_Delta	27.63	20.58	−5.32	−1.47	26.58	20.07	8.39	−2.62
NOAA_CPC_Linear2	23.82	17.76	−1.53	−0.83	17.56	14.08	5.36	−0.58
NOAA_CPC_Delta2	28.02	21.14	−1.53	−1.54	25.75	20.04	5.36	−2.40
CHIRPS_Orig	49.14	45.74	−59.83	−6.80	28.35	25.28	−46.20	−3.12
CHIRPS_Linear	24.48	19.73	−5.32	−0.94	18.17	13.11	8.39	−0.69
CHIRPS_Quantile	25.83	21.00	−4.34	−1.15	18.24	12.64	9.94	−0.71
CHIRPS_Delta	21.07	17.23	−5.31	−0.43	15.68	11.89	8.39	−0.26
CHIRPS_Linear2	23.94	19.26	−8.18	−0.85	16.53	12.24	4.73	−0.40
CHIRPS_Delta2	21.41	17.38	−8.18	−0.48	15.02	11.35	4.74	−0.16
ERA5_Orig	48.14	43.67	−56.72	−6.49	28.75	24.84	−41.17	−3.24
ERA5_Linear	27.99	21.07	−5.32	−1.53	25.64	19.75	8.39	−2.37
ERA5_Quantile	28.12	22.47	−4.32	−1.55	25.37	18.84	9.94	−2.30
ERA5_Delta	27.83	20.99	−5.32	−1.50	23.33	18.06	8.39	−1.79
ERA5_Linear2	29.29	21.93	3.06	−1.77	23.15	17.36	15.77	−1.75
ERA5_Delta2	28.87	21.73	3.06	−1.69	25.19	18.65	15.77	−2.25
GPCC_Orig	31.48	22.57	11.63	−2.20	20.05	16.10	13.95	−1.06
GPCC_Linear	21.46	16.56	−5.32	−0.49	22.27	17.29	8.39	−1.54
GPCC_Quantile	19.94	15.44	−4.32	−0.28	22.51	16.83	9.94	−1.60
GPCC_Delta	26.82	20.94	−5.32	−1.32	18.72	15.25	8.39	−0.80
GPCC_Linear2	35.87	26.22	25.85	−3.16	18.75	15.85	18.00	−0.80
GPCC_Delta2	38.87	27.75	25.85	−3.88	21.24	17.15	18.00	−1.31
PERSIANN_Orig	50.16	47.19	−61.74	−7.13	31.73	28.62	−52.51	−4.16
PERSIANN_Linear	20.13	16.43	−5.32	−0.31	19.33	14.03	8.39	−0.92
PERSIANN_Quantile	22.90	18.34	−4.32	−0.69	19.80	14.69	9.84	−1.01
PERSIANN_Delta	27.73	20.49	−5.32	−1.48	18.13	13.49	8.39	−0.68
PERSIANN_Linear2	23.47	18.09	−2.74	−0.78	16.10	12.94	14.16	−0.33
PERSIANN_Delta2	28.14	20.29	−2.74	−1.56	19.65	14.54	14.15	−0.98

Table 10. Goodness-of-fit test validation results at Arua and Fort Portal.

Dataset/Bias Correction Method	Arua		Fort Portal
Dataset/Bias Correction Method	KS	p-Value	KS	p-Value
NOAA_CPC_Orig	0.40	0.02	0.40	0.02
NOAA_CPC_Linear	0.17	0.81	0.23	0.39
NOAA_CPC_Quantile	0.17	0.81	0.20	0.59
NOAA_CPC_Delta	0.23	0.39	0.20	0.59
NOAA_CPC_Linear2	0.17	0.81	0.27	0.24
NOAA_CPC_Delta2	0.20	0.59	0.27	0.24
CHIRPS_Orig	1.00	0.00	0.83	0.00
CHIRPS_Linear	0.23	0.39	0.20	0.59
CHIRPS_Quantile	0.17	0.81	0.20	0.59
CHIRPS_Delta	0.27	0.24	0.23	0.39
CHIRPS_Linear2	0.27	0.24	0.17	0.81
CHIRPS_Delta2	0.30	0.14	0.20	0.59
ERA5_Orig	0.97	0.00	0.77	0.00
ERA5_Linear	0.27	0.24	0.17	0.81
ERA5_Quantile	0.17	0.81	0.20	0.59
ERA5_Delta	0.27	0.24	0.27	0.24
ERA5_Linear2	0.20	0.59	0.43	0.01
ERA5_Delta2	0.23	0.39	0.33	0.07
GPCC_Orig	0.20	0.59	0.33	0.07
GPCC_Linear	0.17	0.81	0.17	0.81
GPCC_Quantile	0.17	0.81	0.20	0.59
GPCC_Delta	0.23	0.39	0.23	0.39
GPCC_Linear2	0.33	0.07	0.50	0.00
GPCC_Delta2	0.37	0.03	0.37	0.03
PERSIANN_Orig	0.97	0.00	0.87	0.00
PERSIANN_Linear	0.30	0.14	0.20	0.59
PERSIANN_Quantile	0.17	0.81	0.20	0.59
PERSIANN_Delta	0.30	0.14	0.27	0.24
PERSIANN_Linear2	0.23	0.39	0.50	0.00
PERSIANN_Delta2	0.23	0.39	0.33	0.07

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Okirya, M.; Du Plessis, J. Evaluating Bias Correction Methods Using Annual Maximum Series Rainfall Data from Observed and Remotely Sensed Sources in Gauged and Ungauged Catchments in Uganda. Hydrology 2025, 12, 113. https://doi.org/10.3390/hydrology12050113

AMA Style

Okirya M, Du Plessis J. Evaluating Bias Correction Methods Using Annual Maximum Series Rainfall Data from Observed and Remotely Sensed Sources in Gauged and Ungauged Catchments in Uganda. Hydrology. 2025; 12(5):113. https://doi.org/10.3390/hydrology12050113

Chicago/Turabian Style

Okirya, Martin, and JA Du Plessis. 2025. "Evaluating Bias Correction Methods Using Annual Maximum Series Rainfall Data from Observed and Remotely Sensed Sources in Gauged and Ungauged Catchments in Uganda" Hydrology 12, no. 5: 113. https://doi.org/10.3390/hydrology12050113

APA Style

Okirya, M., & Du Plessis, J. (2025). Evaluating Bias Correction Methods Using Annual Maximum Series Rainfall Data from Observed and Remotely Sensed Sources in Gauged and Ungauged Catchments in Uganda. Hydrology, 12(5), 113. https://doi.org/10.3390/hydrology12050113

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Evaluating Bias Correction Methods Using Annual Maximum Series Rainfall Data from Observed and Remotely Sensed Sources in Gauged and Ungauged Catchments in Uganda

Abstract

1. Introduction

1.1. Objectives

1.1.1. Main Objective

1.1.2. Specific Objectives

1.2. Research Questions

2. Materials and Methods

2.1. Description of the Study Area

2.2. Research Data

2.2.1. Observed Rainfall Data

2.2.2. Remotely Sensed Rainfall (RSR) Data

2.3. Data Preprocessing

2.3.1. Data Outlier Detection and Removal

2.3.2. Gap Filling of Rainfall Data

2.4. Evaluating and Comparing Bias Correction Methods in Gauged Catchments

2.4.1. Bias Correction Methods

2.4.2. Statistical Performance Metrics

2.4.3. Goodness-of-Fit Tests

2.4.4. Visual Assessment

2.5. Development of RSR Bias Correction Framework for Ungauged Catchments

2.5.1. Biases in RSR Data Products

2.5.2. Regionalized Parameters for Bias Correction

3. Results

3.1. Data Quality Control and Preprocessing Results

3.1.1. Identified Gaps in Rainfall Data

3.1.2. Outlier Detection and Removal

3.2. Evaluation and Comparison of Bias Correction Methods in Gauged Catchments

3.2.1. Visual Comparison of Time Series Plot Results

3.2.2. Results of Statistical Performance Metrics

3.2.3. Results Based on the Visualization of PDF and CDF Plots

3.2.4. Results Based on the Goodness-of-Fit Test (KS and p-Values)

3.3. Bias Correction Framework for Ungauged Catchments

3.3.1. Description of the Developed Bias Correction Framework

3.3.2. Predetermined Bias Correction Factors

3.4. Application and Validation of the Bias Correction Framework

3.4.1. Validation Results Based on Comparison of Time Series Plots

3.4.2. Validation Results Based on Statistical Performance Metrics

3.4.3. Validation Results Based on PDF and CDF Shapes

3.4.4. Validation Results Based on the Goodness-of-Fit Tests (KS and p-Values)

4. Discussion

4.1. Discussion on the Performance of Bias Correction Methods

4.2. Discussion on the Developed Bias Correction Framework

4.3. Discussion on Framework Validation Findings and Results

5. Conclusions and Recommendations

5.1. Conclusions

5.2. Recommendations

5.3. Research Assumptions and Limitations

5.4. Areas for Future Research

5.5. Principal Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI