Multi-Decadal Variability and Future Changes in Precipitation over Southern Africa

: The future planning and management of water resources ought to be based on climate change projections at relevant temporal and spatial scales. This work uses the new regional demarca-tion for Southern Africa (SA) to investigate the spatio-temporal precipitation variability and trends of centennial-scale observation and modeled data, based on datasets from the sixth phase of the Coupled Model Intercomparison Project (CMIP6). The study employs several statistical methods to rank the models according to their precipitation simulation ability. The Theil–Sen slope estimator is used to assess precipitation trends, with a Student’s t -test for the signiﬁcance test. The comparison of observation and model historical data enables identiﬁcation of the best-performing global climate models (GCMs), which are then employed in the projection analysis under two Shared Socioeconomic Pathways (SSPs): SSP2-4.5 and SSP5-8.5. The GCMs adequately capture the annual precipitation variation but with a general overestimation, especially over high-elevation areas. Most of the models fail to capture precipitation over the Lesotho-Eswatini area. The three best-performing GCMs over SA are FGOALS-g3, MPI-ESM1-2-HR and NorESM2-LM. The sub-regions demonstrate that precipitation trends cannot be generalized and that localized studies can provide more accurate ﬁndings. Overall, precipitation in the wet and dry seasons shows an initial increase during the near future over western and eastern SA, followed by a reduction in precipitation during the mid- and far future under both projection scenarios. Madagascar is expected to experience a decrease in precipitation amount throughout the twenty-ﬁrst century.


Introduction
Climate change is among the top global concerns at present. In recent decades, climate shift and its associated climate dynamics have resulted in increased instability of weather and climate patterns [1][2][3]. Sustainable development requires up-to-date knowledge to make informed decisions and develop strategies based on possible future climate scenarios. Southern Africa (SA) is one of the regions that are very vulnerable to the effects of climate variability and change. The region has experienced large climatic fluctuations during the past century, with distinct variability at different time scales [4,5]. According to Barros et al. [6], SA is highly susceptible to climate change due to its large population, The black lines delineate the WSAF, ESAF and MDG regions in line with the latest IPCC regional demarcations [32]. The surface elevation data are from ETOPO2 [46].

Data
This study employs three rain gauge-based (observation) gridded datasets and 23 CMIP6 datasets to understand precipitation variability and trends over SA.

Observation Data
Gridded rain gauge-based data are used in place of observational data that is of limited quality and time span. This study uses the Global Precipitation Climatology Centre (GPCC) [47,48] monthly precipitation data version 2018 [49] to compare the observed and simulated precipitation over the region. It has a period of 1891-2016 and a spatial resolution of 0.5° × 0.5°. To assess the observational uncertainties in the GPCC data, we use two other monthly precipitation datasets developed by different centers: Climatic Research Unit (CRU) and University of Delaware (UDEL). The CRU version 4.04 [50,51] provides monthly precipitation data produced at a spatial resolution of 0.5° × 0.5° from 1901 to 2019. We also utilize the UDEL v5.01 monthly precipitation data [52,53], which have a 0.5° × 0.5° horizontal grid resolution and span from 1900 to 2017.

Model Data
We use daily precipitation data from 23 CMIP6 GCMs [36,54]. These datasets run from 1850 to 2100 but with varying spatial resolutions. Table 1 provides the model names, institutions and countries, and horizontal resolutions of all the models. Since the CMIP6 models have different numbers of ensemble members, this work considers all the models' first member variation rli1p1f1 (r1: realization index, i1: initialization index, p1: physics index, and f1: forcing index). The precipitation projection from the 23 CMIP6 GCMs is analyzed under two representative SSP scenarios: intermediate (SSP2-4.5) and high (SSP5-8.5) forcing scenarios. The SSP2-4.5 is the closest representation of the current efforts being put in place by nations to curb the spiraling emissions of greenhouse gases in the climate system [37]. We also chose SSP5-8.5, the worst possible scenario. Eyring et al. [36] provide details on CMIP6 data. The black lines delineate the WSAF, ESAF and MDG regions in line with the latest IPCC regional demarcations [32]. The surface elevation data are from ETOPO2 [46].

Data
This study employs three rain gauge-based (observation) gridded datasets and 23 CMIP6 datasets to understand precipitation variability and trends over SA.

Observation Data
Gridded rain gauge-based data are used in place of observational data that is of limited quality and time span. This study uses the Global Precipitation Climatology Centre (GPCC) [47,48] monthly precipitation data version 2018 [49] to compare the observed and simulated precipitation over the region. It has a period of 1891-2016 and a spatial resolution of 0.5 • × 0.5 • . To assess the observational uncertainties in the GPCC data, we use two other monthly precipitation datasets developed by different centers: Climatic Research Unit (CRU) and University of Delaware (UDEL). The CRU version 4.04 [50,51] provides monthly precipitation data produced at a spatial resolution of 0.5 • × 0.5 • from 1901 to 2019. We also utilize the UDEL v5.01 monthly precipitation data [52,53], which have a 0.5 • × 0.5 • horizontal grid resolution and span from 1900 to 2017.

Model Data
We use daily precipitation data from 23 CMIP6 GCMs [36,54]. These datasets run from 1850 to 2100 but with varying spatial resolutions. Table 1 provides the model names, institutions and countries, and horizontal resolutions of all the models. Since the CMIP6 models have different numbers of ensemble members, this work considers all the models' first member variation rli1p1f1 (r1: realization index, i1: initialization index, p1: physics index, and f1: forcing index). The precipitation projection from the 23 CMIP6 GCMs is analyzed under two representative SSP scenarios: intermediate (SSP2-4.5) and high (SSP5-8.5) forcing scenarios. The SSP2-4.5 is the closest representation of the current efforts being put in place by nations to curb the spiraling emissions of greenhouse gases in the climate system [37]. We also chose SSP5-8.5, the worst possible scenario. Eyring et al. [36] provide details on CMIP6 data.
We evaluate the GCMs' ability to replicate the annual and seasonal precipitation variability and trends. The Taylor diagram, Taylor skill score (TSS) and inter-annual variability score (IVS) are used to provide a quantitative performance measure for the data to be used in projection. The data are also ranked using those metrics, thus increasing the confidence of the climate projection results.
The projected monthly precipitation is investigated under two scenarios: SSP2-4.5 and SSP5-8.5 [58]. Three time slices are defined for the projection trend analysis: near future (2015-2044), mid-future (2045-2074) and far future (2075-2100). Precipitation trends are also examined for a continuous period from 2015 to 2100.

Model Performance and Ranking
This study uses the Taylor diagram [59] to give a statistical summary of the model datasets. The Taylor diagram graphically portrays the relationship between the reference (observation) and model datasets, and quantifies the temporal agreement of the different datasets. We select the following statistical metrics: correlation coefficient (R), unbiased root-mean-square difference (ubRMSD) and standard deviation (STD). R measures the similarities in the variability of the reference and simulated data. Its value ranges from −1 to 1, where −1 represents an opposite relationship, 0 means no correlation, and 1 indicates perfect correlation. The RMSD shows how far the model data are from the reference, and portrays uncertainties in both spatial pattern and annual cycle [60]. By totaling the degree of the difference between model and reference, RMSD is a measure of accuracy. The smaller the RMSD value, the closer the model data are to the reference, where 0 is the perfect score. STD portrays the variability in a dataset by showing the distance between each value and the mean of the data. The smaller the STD value, the smaller the dispersion of the values in the dataset is.
We also employ the TSS (Equation (1) [59,61]) to rank the model datasets in their ability to simulate precipitation over SA relative to the reference data. The TSS is a statistical summary of R, RMSD and STD, thus providing a single-value metric. The robustness of the abovementioned technique has been ascertained in previous studies (e.g., [57,62,63]).
where R is the correlation coefficient between reference and model data, σ m and σ r are the model and reference standard deviations, respectively, and R 0 is the highest achievable correlation coefficient (we use a threshold of 0.999). The ideal TSS score is 1. We calculate the IVS (Equation (2) [60]) to evaluate the model temporal performance relative to the reference data. The IVS complements the TSS by showing the model's inter-annual variation simulation capability. Past studies have utilized this technique to rank model performance [62,64,65].
The smaller the IVS value, the closer the model's temporal variability is to the reference data.

Trend Analysis
The Theil-Sen slope estimator (Equation (3) [66]) is applied to the model and reference datasets to quantify the precipitation trend.
where ∆σ is the gradient between two data points in the time series. x j and x k are the corresponding data values for time points j and k (j > k), respectively. We use the median instead of the mean to minimize the influence of outliers on the results. If ∆σ > 0 (∆σ < 0), the trend is increasing (decreasing). A Student's t-test is applied to test the significance of Figure 2 shows the annual cycle of the observed and modeled precipitation, averaged from 1901 to 2014, over WSAF, ESAF, MDG and SA. The four regions generally show a similar precipitation pattern. MDG records the highest precipitation amount compared with WSAF and ESAF, while ESAF is wetter than WSAF. The precipitation amount gradually increases from July (August) for ESAF and SA (WSAF), and then peaks in January before decreasing again. MDG displays a slight difference in the precipitation minimum, increasing sharply from September, peaks in January and then reduces gradually. The annual variation points to December to March (DJFM) as the wet season over SA [71], while May to October (MJJASO) are the dry months. This work focuses on the two seasons: DJFM and MJJASO. The movement of the TRB modulates this unimodal precipitation pattern.
where ∆σ is the gradient between two data points in the time series. xj and xk are the corresponding data values for time points j and k (j > k), respectively. We use the median instead of the mean to minimize the influence of outliers on the results. If ∆σ > 0 (∆σ < 0), the trend is increasing (decreasing). A Student's t-test is applied to test the significance of the results relative to the reference data. Several recent studies (e.g., [67][68][69][70]) have adopted this kind of technique to analyze the trend of hydrometeorological variables based on its robustness in handling the outliers in a time series. Figure 2 shows the annual cycle of the observed and modeled precipitation, averaged from 1901 to 2014, over WSAF, ESAF, MDG and SA. The four regions generally show a similar precipitation pattern. MDG records the highest precipitation amount compared with WSAF and ESAF, while ESAF is wetter than WSAF. The precipitation amount gradually increases from July (August) for ESAF and SA (WSAF), and then peaks in January before decreasing again. MDG displays a slight difference in the precipitation minimum, increasing sharply from September, peaks in January and then reduces gradually. The annual variation points to December to March (DJFM) as the wet season over SA [71], while May to October (MJJASO) are the dry months. This work focuses on the two seasons: DJFM and MJJASO. The movement of the TRB modulates this unimodal precipitation pattern. The CMIP6 ensemble consistently overestimates precipitation as compared to the observation for WSAF, ESAF and SA, while the annual cycles from model and observation over MDG are very close. Moreover, CMIP6 shows a greater degree of deviation, especially in the wet season, portrayed by the large spread of the data around the The CMIP6 ensemble consistently overestimates precipitation as compared to the observation for WSAF, ESAF and SA, while the annual cycles from model and observation over MDG are very close. Moreover, CMIP6 shows a greater degree of deviation, especially in the wet season, portrayed by the large spread of the data around the ensemble mean. The three rain gauge-based datasets present an almost comparable amount of precipitation throughout the year, except for MDG, which shows a larger deviation during the dry season than the other three regions. A slight spread is noted in the observation data during the wet months. Figure A1 shows the spatial distribution of accumulated precipitation over SA. The annual precipitation distribution ( Figure A1a-d) shows that northern SA, central to northern Mozambique, and Madagascar's eastern coast receive a larger amount of precipitation. In contrast, the southwestern region receives a very low amount of precipitation. Relatively high precipitation coincides with the TRB position, affirming that this phenomenon domi- nates SA's wet season. Madagascar shows a distinct west-east precipitation distribution, with precipitation amounts multiple times larger on the east than on the west [72,73]. This is due to the orographic effect [43] from the higher elevation along the island's center ( Figure 1). The three rain gauge-based datasets present very similar spatial distributions with slight maximum precipitation amount differences. The CMIP6 models generally overestimate precipitation over the whole of SA, especially over WSAF ( Figure 2). Moreover, a high-precipitation area exists around the Lesotho-Eswatini region and northwestern SA, while the precipitation amount on the eastern side of Madagascar is underestimated.

Climatology
The spatial precipitation distribution for the wet season ( Figure A1e-h) shows that this season dominates the region's precipitation climatology, supporting a similar observation in Figure 3. Similar to the annual climatology, there are high-precipitation areas over northern SA, central to northern Mozambique and eastern Madagascar. The observation data show higher precipitation (500-700 mm) around the Lesotho-Eswatini area, which is overestimated in the model data (750-1000 mm). Moreover, Madagascar's northwestern area receives more precipitation than the eastern coast, which is different from the observation. The Madagascar region has a complex system with the interaction between ocean and land, and the models cannot accurately reproduce this system. deviation during the dry season than the other three regions. A slight spread is noted in the observation data during the wet months. Figure A1 shows the spatial distribution of accumulated precipitation over SA. The annual precipitation distribution ( Figure A1a-d) shows that northern SA, central to northern Mozambique, and Madagascar's eastern coast receive a larger amount of precipitation. In contrast, the southwestern region receives a very low amount of precipitation. Relatively high precipitation coincides with the TRB position, affirming that this phenomenon dominates SA's wet season. Madagascar shows a distinct west-east precipitation distribution, with precipitation amounts multiple times larger on the east than on the west [72,73]. This is due to the orographic effect [43] from the higher elevation along the island's center ( Figure 1). The three rain gauge-based datasets present very similar spatial distributions with slight maximum precipitation amount differences. The CMIP6 models generally overestimate precipitation over the whole of SA, especially over WSAF ( Figure 2). Moreover, a high-precipitation area exists around the Lesotho-Eswatini region and northwestern SA, while the precipitation amount on the eastern side of Madagascar is underestimated.
The spatial precipitation distribution for the wet season ( Figure A1e-h) shows that this season dominates the region's precipitation climatology, supporting a similar observation in Figure 3. Similar to the annual climatology, there are high-precipitation areas over northern SA, central to northern Mozambique and eastern Madagascar. The observation data show higher precipitation (500-700 mm) around the Lesotho-Eswatini area, which is overestimated in the model data (750-1000 mm). Moreover, Madagascar's northwestern area receives more precipitation than the eastern coast, which is different from the observation. The Madagascar region has a complex system with the interaction between ocean and land, and the models cannot accurately reproduce this system. The spatial distribution of precipitation in the dry season ( Figure A1i-l) shows similar patterns in the three observation datasets. The whole region is generally dry, with a higher amount of precipitation received along the eastern coast of Madagascar, the eastern side of SA and the southern coast of South Africa. In contrast, the model data show The spatial distribution of precipitation in the dry season ( Figure A1i-l) shows similar patterns in the three observation datasets. The whole region is generally dry, with a higher amount of precipitation received along the eastern coast of Madagascar, the eastern side of SA and the southern coast of South Africa. In contrast, the model data show an overestimation over the whole of SA, especially over eastern SA and South Africa. The considerable amount of precipitation along the eastern coast of Madagascar during the MJJASO season is due to the easterly wind flowing towards the island, rising because of the island's high elevation and rain on this side of the island. This high precipitation amount on the east coast of Madagascar is underestimated in the CMIP6 data. The Lesotho-Eswatini area still presents a higher amount of precipitation from the model as compared to the observation. During both the dry and wet seasons, a considerable amount of precipitation can be observed around Angola. The precipitation over this region is influenced by moisture influx from the Congo Rainforest. In the CMIP6 DJFM distribution, this large precipitation area extends further southward over the Kalahari Desert.
We further evaluate the models' ability to represent the spatial features of precipitation ( Figure 3) during the dry and wet seasons, as well as annually. For the annual spatial distribution (Figure 3a-d), the three observation datasets are generally in agreement. GPCC and UDEL present similar spatial distributions with differences in the magnitude of the anomalies. They have an anomaly range between −45 and 57 mm, and −54 and 48 mm, respectively. CRU shows a more homogeneous distribution with positive anomalies (>9 mm) over most of the study domain. There is a negative anomaly (>−13 mm) area over central South Africa, central Madagascar and Angola. The CMIP6 ensemble spatial distribution shows a remarkable overestimation over the whole region, which is more pronounced over the Lesotho-Eswatini area. In contrast, a considerable underestimation is noted along Madagascar's east coast.
Analyzing the DJFM seasonal anomaly spatial distribution (Figure 3e-h), a similar pattern to that of the annual spatial distribution is observed. This means that the wet season signal dominates the region's precipitation variability. The CMIP6 DJFM seasonal anomaly shows an even larger magnitude. Also, the underestimation along Madagascar's east coast still exists. The positions of the large biases in the DJFM spatial distribution co-locate with high-elevation areas in SA (such as the Lesotho-Eswatini area, west Angola, Zambia and Zimbabwe). This suggests that the observed biases might be attributed to the representation of orographic processes in the models.
During the MJJASO season ( Figure 3i-l), GPCC and UDEL show more and larger anomaly areas than CRU. The CRU data present mostly positive anomalies, except over Angola and a few other areas. The CMIP6 ensemble still overestimates precipitation over most of SA, with underestimation observed along Madagascar's east coast and the Cape of Good Hope. Also, the overestimation is more pronounced around the Lesotho-Eswatini area. This area is sensitive and not well simulated by the GCMs during both the wet and dry seasons.
It is notable that the CMIP6 ensemble barely displays negative anomalies over SA. Furthermore, the observation data do not agree over Madagascar during both seasons, implying that this region has a complex system that is not easily captured by the observation datasets.
Overall, the three observation datasets have similar precipitation patterns (Figures 3 and A1), giving more confidence in the results and enabling more robust conclusions about precipitation trends during the past century. The model data match observation to some extent but generally overestimate precipitation, especially during the dry season. Moreover, in both seasons, the Lesotho-Eswatini area is a challenge for the GCMs.
The spatial distribution of seasonal precipitation anomaly from individual models is displayed in Figures A2 and A3 for wet and dry seasons, respectively. The results show that several models have large wet biases over the whole SA region. These models influence the spatial distribution of the CMIP6 ensemble to simulate such large wet anomalies.

Trend Analysis
The precipitation trends over SA during the DJFM and MJJASO seasons are presented in Figures 4 and 5, respectively. First, the observation and CMIP6 ensembles present similar spatial patterns during the wet season ( Figure 4). There is a positive trend over western South Africa, along the Namibian coast, and over northern Mozambique and central Madagascar. The precipitation trend over the rest of the study domain is negative. The individual models show varying spatial distributions, some of which agree with the observation to some extent, such as CNRM-CM6-1, CNRM-ESM2-1, EC-Earth3-Veg, HadGEM3-GC31-LL, GFDL-CM4 and UKESM1-0-LL. The statistically significant areas vary among different models. The reduction in precipitation over the study region has been attributed to the expansion of the sub-tropical dry zone [25]. The results demonstrate the difficulties of simulating the precipitation variability over SA accurately. The influence of the Atlantic and Indian Oceans and forcing from the St. Helena High and Mascarene High influence the model outputs. According to Chen and Xu [74], areas with the same precipitation trend are possibly teleconnected through oceanic forcing. The drying trend over the Eastern Cape region of South Africa corroborates recent research findings over the area [28]. The model ensemble displays an improved spatial trend that closely matches that of observation. Overall, the observation and model ensembles show areas of mixed trends over SA.
The spatial trend of MJJASO precipitation over SA is shown in Figure 5. A comparison of the CMIP6 and the observation ensemble shows that the models do not reproduce the spatial tendency of MJJASO precipitation well. The CMIP6 ensemble shows a significant negative trend over the study domain, while the observation ensemble has mixed trends. Further, the observations do not totally agree with each other over some areas, such as WSAF. The spatial distribution in GPCC is the closest to the observation ensemble, while CRU (UDEL) displays positive trends over WSAF (central SA). The CRU precipitation dataset contains data inhomogeneities that can affect the trend analysis result [50]. None of the models display a pattern close to the observations, highlighting the models' inability to capture the trend in the dry season.   The spatial trend of MJJASO precipitation over SA is shown in Figure 5. A comparison of the CMIP6 and the observation ensemble shows that the models do not reproduce the spatial tendency of MJJASO precipitation well. The CMIP6 ensemble shows a significant negative trend over the study domain, while the observation ensemble has mixed trends. Further, the observations do not totally agree with each other over some areas, such as WSAF. The spatial distribution in GPCC is the closest to the observation ensemble, while CRU (UDEL) displays positive trends over WSAF (central SA). The CRU precipitation dataset contains data inhomogeneities that can affect the trend analysis result [50]. None of the models display a pattern close to the observations, highlighting the models' inability to capture the trend in the dry season.
DJFM and MJJASO precipitation trends is calculated over WSAF, ESAF, MDG and SA (table not shown  Overall, the observation and model spatial distributions show varying tendencies. Some models show positive trends while others exhibit negative patterns over different regions. The results demonstrate the challenges for models in simulating precipitation, especially during the dry season. Numerous studies have pointed out this challenge of simulating precipitation compared with temperature over different parts of Africa [75][76][77]. This is mainly associated with issues related to model parameterization, especially in regions with complex physiographical features such as the SA region, or challenges related to local mesoscale features such as lakes, vegetation cover and extensive coastline that cause regional heterogeneity [78]. Furthermore, studies that examined moderate to extreme precipitation noted the observational uncertainties presented herein [79,80]. In fact, Alexander et al. [81] demonstrated that discrepancies between observation datasets are intensified in estimating precipitation extremes, such as consecutive dry days over regional and global domains. In conclusion, Bador et al. [55] and Alexander et al. [81] recommend the use of an observational ensemble derived from various gauge-based sources to assess observational uncertainties better. In this study, the performance demonstrated by GPCC gives the needed confidence to employ this dataset as the reference data in model evaluation to identify the best-performing CMIP6 models for projection analysis.

Model Performance and Ranking
We use the Taylor diagram, TSS and IVS to assess the capability of individual GCMs in simulating monthly precipitation over the study region and to identify the best-performing GCMs by ranking the models relative to GPCC data.
From the Taylor diagram (Figure 6), the models are clustered in a correlation coefficient range of 0.87 to 0.94, with a ratio of standard deviation between 0.96 and 1.67, and an RMSD of approximately 0.4 to 0.8. Although the temporal variability of precipitation matches GPCC well, the modeled precipitation is usually overestimated. The large overestimation shown by most GCMs confirms the excess precipitation observed in Section 3.1. Out of the 23 GCMs, only Centre National de Recherches Météorologiques' two models, CNRM-CM6-1 and CNRM-ESM2-1, display an underestimation of precipitation. Further, it is observed that the CMIP6 ensemble presents an improved result with higher R and smaller RMSD.  Since the Taylor diagram is a pictorial representation of three (3) statistical metrics, we employ the TSS, which combines those three metrics and presents a single-value criterion for model performance evaluation and ranking. Moreover, the IVS is also implemented to complement the information provided by the TSS.
We rank the precipitation datasets for each of the following five criteria: STD, R, RMSD, TSS and IVS (Figure 7a). In principle, datasets that portray more blueish color for the five criteria mean that they have good performance for all the statistical metrics  Since the Taylor diagram is a pictorial representation of three (3) statistical metrics, we employ the TSS, which combines those three metrics and presents a single-value criterion for model performance evaluation and ranking. Moreover, the IVS is also implemented to complement the information provided by the TSS.
We rank the precipitation datasets for each of the following five criteria: STD, R, RMSD, TSS and IVS (Figure 7a). In principle, datasets that portray more blueish color for the five criteria mean that they have good performance for all the statistical metrics considered in this section. Models such as ACCESS- A scatter plot of TSS against IVS (Figure 7b) displays the different models' skills in simulating spatial and temporal precipitation patterns over SA. Models with lower IVS and higher TSS are the best-performing. For the projections, models with a TSS (IVS) of less (more) than 0.87 (0.2) are not considered. This assessment provides the confidence for future climate results. The three best-and worst-performing models in our case are the FGOALS-g3, MPI-ESM1-2-HR and NorESM2-LM, and CanESM5, IPSL-CM6A-LR and NESM3, respectively. In addition to the three best-performing models, all GCMs that satisfy the 0.87 and 0.2 thresholds for TSS and IVS, respectively, are used for projection analysis. The models that are kept for projection analysis are the CNRM-CM6-1, CNRM-ESM2-1, FGOALS-g3, GFDL-CM4, MPI-ESM1-2-HR, MPI-ESM1-2-LR, NorESM2-LM, NorESM2-MM and UKESM1-0-LL, which agrees with the ranking in Figure 7a. Moreover, the CMIP6 ensemble ranks highly for the five metrics and shows a clear improvement from the 23 GCMs in the scatter plot. Therefore, the ensemble mean can amply represent the precipitation variability and trend in the projection analysis.

Projected Precipitation Changes
The nine best-performing models over the study area are used in the projection analysis. We first display the spatial distribution of seasonal precipitation anomalies. We also present the temporal and spatial precipitation trends and the probability density function.
Seasonal precipitation anomaly projections over 2015-2100 under the SSP2-4.5 and SSP5-8.5 scenarios are given in Figure 8. In DJFM (Figure 8a-h  A scatter plot of TSS against IVS (Figure 7b) displays the different models' skills in simulating spatial and temporal precipitation patterns over SA. Models with lower IVS and higher TSS are the best-performing. For the projections, models with a TSS (IVS) of less (more) than 0.87 (0.2) are not considered. This assessment provides the confidence for future climate results. The three best-and worst-performing models in our case are the FGOALS-g3, MPI-ESM1-2-HR and NorESM2-LM, and CanESM5, IPSL-CM6A-LR and NESM3, respectively. In addition to the three best-performing models, all GCMs that satisfy the 0.87 and 0.2 thresholds for TSS and IVS, respectively, are used for projection analysis. The models that are kept for projection analysis are the CNRM-CM6-1, CNRM-ESM2-1, FGOALS-g3, GFDL-CM4, MPI-ESM1-2-HR, MPI-ESM1-2-LR, NorESM2-LM, NorESM2-MM and UKESM1-0-LL, which agrees with the ranking in Figure 7a. Moreover, the CMIP6 ensemble ranks highly for the five metrics and shows a clear improvement from the 23 GCMs in the scatter plot. Therefore, the ensemble mean can amply represent the precipitation variability and trend in the projection analysis.

Projected Precipitation Changes
The nine best-performing models over the study area are used in the projection analysis. We first display the spatial distribution of seasonal precipitation anomalies. We also present the temporal and spatial precipitation trends and the probability density function.
Seasonal precipitation anomaly projections over 2015-2100 under the SSP2-4.5 and SSP5-8.5 scenarios are given in Figure 8. In DJFM (Figure 8a-h), the region exhibits mostly negative values, with positive values moving northwestward. The distribution shows pronounced positive values over the coast of Angola, the Lesotho-Eswatini area and Madagascar's east coast, while areas such as Mozambique and southeastern South Africa show strong negative anomalies. Dry anomalies are also evident over the entire area during MJJASO (Figure 8i-p) under the SSP2-4.5 and SSP5-8.5 scenarios. Similar to the DJFM distribution, Madagascar also presents positive anomalies over its eastern coast, extending further inland during the dry season than in the wet season. The persistent dry anomaly over ESAF and MDG compared with WSAF could be attributed to higher pressure, warmer SSTs in the Pacific Ocean, and Indian Ocean offshore winds that lead to decreased precipitation [22]. Drying patterns over the region can have adverse effects on several sectors of the economy, such as agriculture.
Precipitation trends over WSAF, ESAF, MDG and SA during the near, mid-and far future are presented in Figure 9. Notably, the nine GCMs used for the projection analysis have a large spread about the ensemble mean. However, according to Milinski et al. [82], the ensemble helps to reduce uncertainties in the data. The area-averaged projected precipitation variation does not accurately represent projected precipitation over the different sub-regions (WSAF, ESAF and MDG). Therefore, we examine precipitation changes over each of SA's three sub-regions. The temporal trend depicts periods of positive and negative anomalies. Analyzing the DJFM trends for WSAF (Figure 9a  The persistent dry anomaly over ESAF and MDG compared with WSAF could be attributed to higher pressure, warmer SSTs in the Pacific Ocean, and Indian Ocean offshore winds that lead to decreased precipitation [22]. Drying patterns over the region can have adverse effects on several sectors of the economy, such as agriculture. Precipitation trends over WSAF, ESAF, MDG and SA during the near, mid-and far future are presented in Figure 9. Notably, the nine GCMs used for the projection analysis have a large spread about the ensemble mean. However, according to Milinski et al. [82], the ensemble helps to reduce uncertainties in the data. The area-averaged projected precipitation variation does not accurately represent projected precipitation over the different sub-regions (WSAF, ESAF and MDG). Therefore, we examine precipitation changes over each of SA's three sub-regions. The temporal trend depicts periods of positive and negative anomalies. Analyzing the DJFM trends for WSAF (Figure 9a . The trends over WSAF describe an overall decrease in precipitation in the twenty-first century, except for the mid-future under the moderate forcing scenario. The negative trends in SSP2-4.5 might point to the possibility that WSAF is more significantly influenced by natural variability than anthropogenic factors. However, the SSP5-8.5 trends show that strong anthropogenic forcing promotes precipitation decrease, especially in the dry season.  Under the intermediate forcing scenario, ESAF shows an accelerated wetting tendency (0.683) during the DJFM's near future followed by a drying tendency during the consecutive two time slices (−0.235 and −0.097) (Figure 9c). On the other hand, the strong forcing scenario (Figure 9d) shows a drying trend (−0.809) during the near future followed by wetting trends during the mid-and far future (0.095 and 0.113). The trends of both scenarios during the near future are quite large relative to the other two time sections. Also, the precipitation trends are opposite under the two scenarios. The trends for MJJASO over the same region (Figure 9d) show a decreasing pattern during the three time slices for the SSP2-4.5 scenarios (−0.023, −0.031 and −0.081). In contrast, the SSP5-8.5 shows an upward trend during the near future (0.041), followed by downward trends during the successive two time slices (−0.063 and −0.0157). It is also observed that the magnitude of the trends for the SSP5-8.5 is about twice that of SSP2-4.5.
Over MDG, the DJFM precipitation trend is upwards (Figure 9e) in the near future, followed by negative trends during the mid-future (−1.294) and positive trends during the far future (0.145). The SSP5-8.5 shows a downward trend during the near future (−0.047), followed by relatively larger negative trends during the mid-and far future (−1.351 and −0.684). The MJJASO precipitation trend over MDG (Figure 9f) shows a drying tendency during the three time slices for both the SSP2-4.5 and SSP5-8.5 scenarios. Moreover, the magnitude of the trends is nearly the same. This observation might indicate that the precipitation over MDG during its dry season is not significantly modulated by anthropogenic forcing. As previously reported, this region is strongly influenced by the ocean [19,23], and thus the effect of climate shift on the area is minimal.
An overview of the changes in precipitation trend over the whole SA study domain under the SSP2-4.5 and SSP5-8.5 scenarios is given in Figure 9g,h. The DJFM season is projected to get wetter in the near (0.362) and mid-future (0.046), followed by a drying tendency during the far future (−0.155). The trends under the strong forcing scenario show a drying pattern throughout the three time slices (−0.548, −0.067 and −0.046). The MJJASO season also displays an overall drying trend during the near, mid-and far future (−0.037, −0.031 and −0.068) under the SSP2-4.5 scenario. The SSP5-8.5 shows a very insignificant downward trend in the near future (0.007), followed by more significant drying trends during the mid-(−0.067) and far future (−0.129). The findings are in agreement with Cook et al. [31], who used 13 CMIP6 models to project droughts over SA in a global study. According to the study, the region is expected to experience drying patterns in precipitation. The study equally projected a reduction in total column soil moisture over SA, which threatens agriculture in the region.
In summary, the DJFM precipitation is projected to decrease under the SSP2-4.5, but is depressed under the SSP5-8.5 over areas such as WSAF and MDG. On the other hand, MJJASO presents a mostly unanimous display of decreasing trends over the three subregions and under both scenarios, which means that the natural variability of the region is already in the dry phase. Also, SA's trends and magnitudes highlight the importance of conducting region-based analysis as it is clear that the SA values do not accurately explain the trends over the different sub-regions.
The spatial distribution of DJFM and MJJASO projected precipitation trend is displayed for the near, mid-and far future, under the SSP2-4.5 and SSP5-8.5 scenarios (Figure 10). The DJFM spatial trends under the SSP2-4.5 scenario (Figure 10a-d) show a nearly opposite distribution during the near and mid-future. Most of SA displays a drying tendency during the far future, with a few wetting areas over northern Madagascar, southern South Africa and southern Mozambique. Overall, the continuous period shows a general drying trend, except for the Eswatini area, western Angola and the south tip of Madagascar. The SSP5-8.5 (Figure 10e-h) shows an opposite pattern as compared to SSP2-4.5 during the near future. ESAF displays a general drying trend, and Angola shows a wetting tendency. During the mid-future, the trend changes to positive over ESAF and mostly negative over WSAF. MDG has a positive (negative) trend on its west (east) side. During the far future, most of the SA region is covered with negative trends. A few areas, such as eastern South Africa, southern MDG and western Angola, show a positive trend. The continuous period shows a nearly similar distribution to that of SSP2-4.5. The MJJASO season (Figure 10i-l) shows a general drying trend during the near future, except for the Lesotho-Eswatini area. During the mid-future, central SA and regions such as northern MDG show positive trends. During the far future, the study domain is again mostly covered with negative trends. Overall, there is a decrease in MJJASO precipitation during the twenty-first century under the moderate forcing scenario. Under the SSP5-8.5 scenario, the MJJASO dry season (Figure 10m-p) shows an overall statistically significant drying trend, especially during the mid-and far future. The southeastern coast of SA proves to be an interesting area as it is the one region showing positive trends under the strong forcing scenario. Overall, there is a drying trend over the whole of SA under the SSP5-8.5. For the analysis of spatial trends, the Lesotho-Eswatini area is shown to be a sensitive region during the next few decades. The precipitation over the entire projection period is consistently statistically significant over WSAF areas, while in the other time slices, the statistical significance varies from region to region.
The probability density function (PDF) of projected precipitation during four reference time sections averaged over WSAF, ESAF, MDG and SA, under the SSP2-4.5 scenario is shown in Figure 11. The DJFM precipitation over WSAF (Figure 11a) shows an increase (~25 mm) during the near future, followed by a slight decrease of ~5 mm during the mid-future, and the far future maintains nearly the same amount of precipitation. The ESAF (Figure 11b) also shows an initial increase during the near future (~15 mm), followed by a drop of ~5 mm during the mid-future, before picking up again by a few mm during the far future. Over MDG (Figure 11c), there is a decrease during the near future (~10 mm), followed by an increase (~5 mm), before decreasing again during the far future. Overall, SA (Figure 11d) displays an initial increase during the near future, before decreasing during the mid-future. The far future presents a nearly similar distribution to the midfuture. The subsequent decrease in precipitation amount during the mid-and far future The MJJASO season (Figure 10i-l) shows a general drying trend during the near future, except for the Lesotho-Eswatini area. During the mid-future, central SA and regions such as northern MDG show positive trends. During the far future, the study domain is again mostly covered with negative trends. Overall, there is a decrease in MJJASO precipitation during the twenty-first century under the moderate forcing scenario. Under the SSP5-8.5 scenario, the MJJASO dry season (Figure 10m-p) shows an overall statistically significant drying trend, especially during the mid-and far future. The southeastern coast of SA proves to be an interesting area as it is the one region showing positive trends under the strong forcing scenario. Overall, there is a drying trend over the whole of SA under the SSP5-8.5. For the analysis of spatial trends, the Lesotho-Eswatini area is shown to be a sensitive region during the next few decades. The precipitation over the entire projection period is consistently statistically significant over WSAF areas, while in the other time slices, the statistical significance varies from region to region.
The probability density function (PDF) of projected precipitation during four reference time sections averaged over WSAF, ESAF, MDG and SA, under the SSP2-4.5 scenario is shown in Figure 11. The DJFM precipitation over WSAF (Figure 11a) shows an increase (~25 mm) during the near future, followed by a slight decrease of~5 mm during the mid-future, and the far future maintains nearly the same amount of precipitation. The ESAF (Figure 11b) also shows an initial increase during the near future (~15 mm), followed by a drop of~5 mm during the mid-future, before picking up again by a few mm during the far future. Over MDG (Figure 11c), there is a decrease during the near future (~10 mm), followed by an increase (~5 mm), before decreasing again during the far future. Overall, SA (Figure 11d) displays an initial increase during the near future, before decreasing during the mid-future. The far future presents a nearly similar distribution to the midfuture. The subsequent decrease in precipitation amount during the mid-and far future can be detrimental to rain-fed agriculture. This effect will be amplified over MDG as the precipitation amount is anticipated to decline in the near future. can be detrimental to rain-fed agriculture. This effect will be amplified over MDG as the precipitation amount is anticipated to decline in the near future. During the MJJASO season, the distribution over WSAF (Figure 11e) shows an increase relative to the baseline period during the near future, followed by a decrease during the mid-future and a further decrease during the far future. The ESAF (Figure 11f) also shows a similar pattern to that of WSAF. That is, an increase during the near future, followed by a decrease during the successive time sections. MDG (Figure 11g), on the other hand, shows a continuous decline throughout the three time sections. Overall, MJJASO precipitation under the SSP2-4.5 shows an initial increase over SA during the near future, followed by a decline during the mid-and far future. Figure 12 shows the PDF for DJFM and MJJASO under the SSP5-8.5 scenario. During the near future, there is an increase in DJFM precipitation over WSAF, ESAF and SA, while MDG experiences a precipitation reduction. WSAF (Figure 12a) shows a decreased precipitation amount during the subsequent time slices. In contrast, ESAF (Figure 12b) displays a further increase from the near to mid-future. From the mid-to far future, there is a decrease in precipitation. Over MDG (Figure 12c), precipitation experiences a continuous decrease during the three time sections. Overall, SA (Figure 12d) illustrates an initial increase in precipitation during the near future relative to the baseline period, followed by a reduction during the mid-and far future. During the MJJASO season, the distribution over WSAF (Figure 11e) shows an increase relative to the baseline period during the near future, followed by a decrease during the mid-future and a further decrease during the far future. The ESAF (Figure 11f) also shows a similar pattern to that of WSAF. That is, an increase during the near future, followed by a decrease during the successive time sections. MDG (Figure 11g), on the other hand, shows a continuous decline throughout the three time sections. Overall, MJJASO precipitation under the SSP2-4.5 shows an initial increase over SA during the near future, followed by a decline during the mid-and far future. Figure 12 shows the PDF for DJFM and MJJASO under the SSP5-8.5 scenario. During the near future, there is an increase in DJFM precipitation over WSAF, ESAF and SA, while MDG experiences a precipitation reduction. WSAF (Figure 12a) shows a decreased precipitation amount during the subsequent time slices. In contrast, ESAF ( Figure 12b) displays a further increase from the near to mid-future. From the mid-to far future, there is a decrease in precipitation. Over MDG (Figure 12c), precipitation experiences a continuous decrease during the three time sections. Overall, SA (Figure 12d) illustrates an initial increase in precipitation during the near future relative to the baseline period, followed by a reduction during the mid-and far future. can be detrimental to rain-fed agriculture. This effect will be amplified over MDG as the precipitation amount is anticipated to decline in the near future. During the MJJASO season, the distribution over WSAF (Figure 11e) shows an increase relative to the baseline period during the near future, followed by a decrease during the mid-future and a further decrease during the far future. The ESAF (Figure 11f) also shows a similar pattern to that of WSAF. That is, an increase during the near future, followed by a decrease during the successive time sections. MDG (Figure 11g), on the other hand, shows a continuous decline throughout the three time sections. Overall, MJJASO precipitation under the SSP2-4.5 shows an initial increase over SA during the near future, followed by a decline during the mid-and far future. Figure 12 shows the PDF for DJFM and MJJASO under the SSP5-8.5 scenario. During the near future, there is an increase in DJFM precipitation over WSAF, ESAF and SA, while MDG experiences a precipitation reduction. WSAF (Figure 12a) shows a decreased precipitation amount during the subsequent time slices. In contrast, ESAF ( Figure 12b) displays a further increase from the near to mid-future. From the mid-to far future, there is a decrease in precipitation. Over MDG (Figure 12c), precipitation experiences a continuous decrease during the three time sections. Overall, SA (Figure 12d) illustrates an initial increase in precipitation during the near future relative to the baseline period, followed by a reduction during the mid-and far future. For the MJJASO season, precipitation over WSAF (Figure 12e) shows an increase during the near future from the baseline period. There is a decrease in precipitation during the mid-future, which is further accentuated during the far future, reaching an amount slightly less than that of the baseline period. The ESAF (Figure 12f) shows a continuous precipitation decrease during the three consecutive time slices after an initial increase during the near future. As observed in the DJFM precipitation, the MJJASO precipitation over MDG (Figure 12g) also experiences a constant decrease throughout the three time slices. Overall, SA ( Figure 12) displays an initial increase in precipitation during the near future, followed by precipitation reduction during the mid-and far future.
The results point to a general increase in DJFM and MJJASO precipitation over WSAF and ESAF during the near future, followed by a slight decrease during the subsequent two time sections. In contrast, MDG displays an apparent reduction in precipitation during both seasons. Moreover, the severity of the precipitation reduction is accentuated throughout the twenty-first century.

Summary and Conclusions
Climate change is of great interest to the scientific community, policy makers and the general public. Understanding shifts in precipitation variability is crucial for explaining the hydrological cycle response to a natural or forced change in the climate system. A disparity in precipitation could heavily impact SA, where the agricultural sector is a primary source of livelihood. Policy makers require detailed, in-depth information on regional changes in precipitation to devise suitable adaptation and mitigation plans to minimize the devastating effect of such changes in precipitation amount.
By using the new regional demarcations for SA, this study investigates the changes in precipitation variability and trends during the twentieth and twenty-first centuries. We compare the historical data of 23 CMIP6 GCMs against rain gauge-based data (GPCC, CRU and UDEL) to assess the individual models' performance in simulating annual precipitation variation, seasonal inter-annual variability and trends. We then select the models with good simulation capabilities for the projection analysis. The evaluation statistical metrics are the STD, R, RMSD, TSS and IVS, and the Theil-Sen slope estimator is used for the trend analysis. For the future precipitation projection, two scenarios, SSP2-4.5 and SSP5-8.5, represent the possible current state of the atmosphere and the worst possible situation, respectively.
From the comparison of observation data, it is found that the three datasets agree with each other. GPCC, however, shows more resemblance to the observation ensemble during the dry season. Therefore, we use GPCC as the reference data for model performance evaluation. By comparing the observation data with the model datasets, we note that no single GCM is the most consistent for all three sub-regions. Most of the models overestimate precipitation over the study region and underestimate precipitation along the east coast of Madagascar. The CMIP6 ensemble gives a better representation of precipitation than individual models, which agrees with Pincus et al. [83] and Pierce et al. [84], who reported that ensemble means limit spatial uncertainty and inconsistency. Moreover, GCMs inevitably introduce uncertainties into the projections due to model parameterizations, among other factors. The CMIP6 GCMs cannot well simulate areas of fast-changing topography or high elevation, such as the Lesotho-Eswatini area, the Angola coast and the high topography in the center of Madagascar. ESAF and MDG are also complex areas that the GCMs cannot accurately simulate. The models have limitations in fully representing specific physical processes [35]. Unknown variations in the climate system and inadequate understanding of the climate response over longer timescales [85] also present huge uncertainties. Therefore, it is crucial to interpret these data in light of their uncertainties. Moreover, it is assumed that the current observed shift in climate regime is attributed to human factors. Another school of thought argues that these changes are caused by natural variability in the climate system, thus adding more uncertainties into future projections [86].
Precipitation variability is highly inhomogeneous. Unlike temperature, which displays uniform variation, precipitation shows considerable spatial variability. While past studies described a generalized trend over the whole of SA, this study reports precipitation over WSAF, ESAF and MDG. The results point to a pattern that is not reflected over the whole study region average. A general increase in DJFM and MJJASO precipitation over WSAF and ESAF is expected during the near future, followed by a slight decrease during the subsequent two time sections. Some localized areas are likely to experience increasing trends during the mid-and far future. In contrast, MDG displays a clear decrease in precipitation during both seasons. Moreover, the severity of the precipitation reduction is accentuated throughout the twenty-first century. Overall, the findings on projected precipitation over SA reaffirm the results of previous studies [87][88][89] that were based on CMIP5. Although strategies have been put in place to limit carbon dioxide emissions and to "reverse" the devastating effect of global warming/climate change, the projected trends over SA point to an overall drying pattern over the three sub-regions under the SSP2-4.5. It is therefore imperative to properly manage the available water resources through sound planning.
The introduction of the latest phase of the CMIP data provides an excellent source for studying future precipitation trends over SA. The CMIP6 is an improvement of its predecessors, with a better representation of several physical processes. The SSPs also contribute to a more realistic interpretation of different world situation scenarios, thus giving more confidence in the results presented in this study. In contrast to previous studies that applied a general domain for SA, this study delineates SA into WSAF, ESAF and MDG, providing a more accurate understanding of the precipitation trends over the three regions. It is important to emphasize that the statistical metrics used in this study quantify the relative changes in climate patterns during the twentieth and twenty-first centuries. The results are not to be explained as the magnitude of climate change impact on SA.
This work establishes the trend during the past and future centuries. However, contributing factors such as dynamic mechanisms and processes have not been explored in detail. A further study aims to explore the dynamic aspect of precipitation trends over the different SA regions. Also, the large wet biases displayed by most models require further investigations to understand the models' physical representations that led to such results.    Figure A3. The same as Figure A2, but for the MJJASO season. Figure A3. The same as Figure A2, but for the MJJASO season.