1. Introduction
Regional climate models (RCMs) are increasingly being adopted in studies and applications at the regional and local scale to provide quality spatial-temporally refined weather and climate-related variables [
1]. The RCM concept, also known as dynamical downscaling, applies high-resolution numerical weather prediction (NWP) models over regional model domains to resolve the sub-grid variability in the climate fields derived from general circulation models (GCMs). The climate fields from the GCM, which describe the global atmospheric circulation response to large-scale forcings related to the Earth’s differential heating, are used as the initial and lateral boundary conditions in RCMs. On the other hand, a better representation of the physics in the NWP model is used to describe the processes’ influence at the local scale related to orography, inland water bodies, vegetation, and land use [
2]. Cognizant to the importance of local-scale processes, especially for land applications, GCM derived datasets with a relatively high resolution, e.g., the land component of the fifth generation of European ReAnalysis (ERA5) dataset [
3] referred to as ERA5-land, have been developed. The ERA5-land dataset has the highest spatial resolution (9 km) currently available of any dataset of its kind. However, the dataset is still too coarse to effectively capture the local-scale processes, especially in regions characterized by complex terrain, emphasizing the need for RCMs.
Generally, an RCM utilizes a cascade of nested model domains of increasing spatial resolutions in the inner model domains at predefined nesting ratios, allowing for a physically-based interpolation of the GCM-derived climate fields to the regional and local scales. A major challenge in the application of RCMs is that they are not readily transferable due to the sensitivity of models to the initial and lateral boundary conditions, differences in the physical processes characteristic to a region, and the choice of physics schemes [
1,
3,
4,
5]. The representation of the unresolved physical processes, popularly referred to as parameterization, has been cited as a major source of uncertainties in NWP models [
6]. As such, scores of studies have been carried out to analyze the effects of various parameterizations on the prediction skill of NWP models, e.g., cumulus parameterization [
7,
8,
9], cloud microphysics [
10,
11,
12], planetary boundary layer (PBL) parameterization [
13,
14,
15], among other studies. The forecast skill of NWP models has been shown to be highly sensitive to PBL parameterizations [
16,
17]. For instance, a study by Li and Pu [
13] found that the parameterization of the PBL was as influential to the development of hurricane intensity and associated precipitation in a similar manner to the parameterization of cloud microphysics, further emphasizing the importance of PBL parameterization in NWP models.
The PBL is defined as that portion of the troposphere that responds directly to forcings emanating from the Earth’s surface at timescales of one hour or less [
18]. The PBL layer is characterized by a strong vertical mixing and a marked diurnal variation in depth. The PBL plays an important role in the development of weather phenomena due to the exchange of heat and momentum fluxes between the land surface and the atmosphere through turbulence transport. However, the complex, chaotic nature of turbulent flow due to eddies of disproportionate sizes and time lengths within the PBL limits the explicit solution of the governing equations of atmospheric flow. Thus, PBL parameterizations suffice as efficient approximate solutions for turbulence in the atmospheric flow. PBL parameterizations are used to describe the background (time-averaged) atmospheric state and the perturbations (deviations) from the mean state associated with turbulent motions within the PBL based on the Reynolds averaging technique [
6]. PBL parameterization schemes are classified into first-order closure or higher-order turbulent kinetic energy (TKE) closure schemes. First-order closure schemes do not require additional prognostic equations to describe the effects of turbulence on the mean flow variables. In addition, first-order closure schemes allow for the exchange of fluxes between all layers in the PBL through large convective eddies and are thus also referred to as nonlocal closure schemes. On the other hand, higher-order TKE closure schemes include an additional prognostic equation for the TKE. TKE closure schemes are also known as local closure schemes since they only allow for the exchange of fluxes between adjacent model layers. Generally, local order closure schemes are suited for the simulation of stable boundary layers, whereas nonlocal schemes are better at simulating unstable boundary layers characterized by strong convection [
6,
15]. However, as reported in García-Díez et al. [
3], PBL schemes are highly sensitive to seasonality, forecast length, and geographic location and thus may not necessarily produce consistent results across a spectrum of atmospheric conditions.
Several studies aimed at evaluating the performance of PBL parameterization schemes across regions and under varying weather phenomena have been carried out. Among them is the study by Shin and Hong [
15], who compared the performance of five PBL parameterization schemes available in the Weather Research and Forecasting (WRF) model under stable and unstable atmospheric conditions in Kansas, USA. They found that PBL parameterization schemes based on local mixing approaches generally performed better under stable conditions than those based on a nonlocal mixing approach. The opposite was found to be true for unstable conditions. Cohen et al. [
18], analyzed the ability of PBL schemes in the WRF model to simulate two tornado events in the southeastern USA. They reported a better representation of the lapse rates within the first three kilometers using nonlocal schemes as opposed to the local schemes. In addition, they observed that the nonlocal schemes were less biased in the retrieval of temperature fields, which they attributed to a better representation of entrainment processes in nonlocal schemes. Similar findings have been reported in Avolio et al. [
14], in southern Italy, during an experimental campaign; Banks et al. [
19], over Athens in Greece; Hazra and Pattnaik [
20], over Odisha state in India; and Boadh et al. [
21], over a tropical station in India. Contrastingly, in a study by Chaouch et al. [
22], local schemes were found to represent PBL variables and structure relatively well compared to nonlocal schemes under dry fog conditions in the United Arab Emirates. Better performance with local parameterization schemes has also been reported in Ooi et al. [
23], over Kuala Lumpur, Malaysia; Tyagi et al. [
24], in Southern Italy; Madala et al. [
25], in a region in India; Gbode et al. [
26], in a monsoon regime over West Africa; and in de Lange et al. [
27], in the Highveld region of South Africa. However, as reported in Hazra and Pattnaik [
20] and Yang et al. [
28], the skill of a PBL scheme varies widely depending on the target variable. The findings by García-Díez et al. [
3], Hazra and Pattnaik [
20], Yang et al. [
28], and Li and Pu [
13], among others, emphasize the need to evaluate the impacts of PBL parameterization schemes on RCMs.
The Kenyan highlands rank among the most vulnerable regions to climate change and variability globally [
29]. According to projections presented in Gebrechorkos et al. [
30], the region is expected to witness a significant rise in extreme climate-related events, such as floods and droughts. On the other hand, the region is among the most meteorologically poorly gauged regions in the world [
31]. As such, RCM provides an alternative source of meteorological data to support the monitoring and sustainable use of natural resources and the implementation of climate adaptation and mitigation strategies in the region. Although several studies related to RCM have been conducted in this region [
7,
32,
33], most of them have focused on the sensitivity of RCMs to the parameterization of cumulus and cloud microphysics schemes in simulating precipitation. This study evaluates how the various PBL schemes in the Weather Research and Forecasting (WRF) model influence the retrieval of surface meteorological variables over the Kenyan highlands.
3. Results
Figure 2 shows a comparison between maps of T2 derived from the YSU scheme (
Figure 2a) versus T2 derived from ERA5-Land (
Figure 2b). More spatial variations consistent with the variations in elevation in
Figure 1b are observed in the contour map for YSU compared to ERA5. Besides, the YSU scheme-derived T2 map shows a broader range of values compared to ERA5.
The statistical performance for the four meteorological variables considered is presented as probability density function (PDF) plots, as shown in
Figure 3 and
Figure 4. The results for the R (
Figure 3) indicate that overall, the performance of the WRF model in the retrieval of T2 and RH2 is less sensitive to the choice of the PBL scheme than the retrieval of WS10 and rainfall. Relatively higher R values with relatively lower variation are observed in the retrieval of T2 and RH2 compared to WS10 and rainfall. Some very low and even negative correlations were observed for WS10 (
Figure 3c) and rainfall (
Figure 3d) with some of the schemes.
The distribution of the RMSE for each of the four variables is shown in
Figure 4. In general, the RMSE obtained with all the schemes in the retrieval of T2 (
Figure 4a) ranged between approximately 1.3 to 3.3 degrees Celsius. Apart from the MYNN, QNSE scheme, and to a lesser extent the ACM2 scheme, RMSE values for T2 with the other schemes mostly ranged between approximately 1.3 and 2.5 degrees Celsius. The MYNN scheme produced relatively higher RMSE values in the retrieval of T2 compared to the rest of the schemes, followed by the QNSE and ACM2 schemes. On the other hand, the MYJ scheme appeared to be the best suited for simulating T2, marginally edging out the BouLac scheme. YSU and SHIN, with similar distributions of RMSE values, ranked third in the retrieval of T2. Regarding the retrieval of RH2 (
Figure 4b), the MYJ and the QNSE schemes showed a relatively lower range of RMSE values, ranging between approximately 10% and 16%. However, the MYJ scheme showed a marginally better distribution of RH2 RMSE values than the QNSE scheme. The SHIN, YSU, and MYNN schemes rank third, fourth, and fifth, respectively, in the retrieval of RH2, with their RMSE values ranging between approximately 10.5% and 19.5%. In contrast, BouLac, with RMSE values ranging between approximately 11% and 18%, ranked sixth. The ACM2 scheme showed significantly lower performance than the other schemes, with RMSE values ranging between approximately 12.5% and 24.5%. The distribution of the RMSE values obtained in the simulation of WS10 is shown in
Figure 4c. The SHIN scheme showed the least RMSE values compared to the rest of the schemes, ranging from 2.5 km/h to approximately 10 km/h. MYNN, BouLac, YSU, and ACM2 showed similar distributions, with RMSE values ranging between approximately 4.5 km/h and 12 km/h. The lowest performance was observed with the QNSE and MYJ schemes, with their RMSE values ranging between approximately 6 km/h and 13 km/h, respectively, for QNSE and 14 km/h for MYJ. In terms of the ability of the schemes to simulate rainfall (
Figure 4d), it was observed that the MYNN scheme showed lower RMSE values compared to the rest of the schemes. The RMSE values obtained with the MYNN scheme ranged between 1mm/day and approximately 6 mm/day. YSU and the ACM2 schemes followed in second and third place, respectively. The YSU scheme showed a marginally better range of RMSE values, between approximately 2 mm/day and 6 mm/day, compared to the ACM2 scheme, whose RMSE value range was between approximately 2 mm/day and 7 mm/day. BouLac showed the fourth-best performance with RMSE values ranging between approximately 3 mm/day and 9 mm/day. SHIN, whose RMSE values ranged between 2 mm/day and 13.5 mm/day, ranked fifth, owing largely to the distribution of its RMSE values, whereas the MYJ scheme with RMSE values ranging between 3 mm/day and 10 mm/day, ranked sixth. The QNSE scheme showed an RMSE value of approximately 3 mm/day to 11 mm/day, which was close to the ranges observed with the MYJ scheme. However, based on the shape of the distribution, it was observed that the QNSE showed the lowest performance at simulating rainfall.
The spatial distribution of the RMSE values for RH2 (
Figure 5) revealed some spatial patterns in the performance observed with the various PBL schemes. It is evident from
Figure 5 that higher RMSE values were mainly observed in stations located to the western side of the study area. Although some spatial patterns were observed in RH2 RMSE maps, no discernible spatial patterns were observed in the RMSE maps for T2, WS10, and rainfall in
Figure A2,
Figure A3, and
Figure A4, respectively, in the
Appendix A.
The time-averaged hourly diurnal variation in T2 is shown in
Figure 6. It is observed that some slight differences exist between the diurnal variations in T2 during the dry season (
Figure 6a) and the wet season (
Figure 6b). During the wet season, the schemes showed a slightly better match to the observed T2 than during the dry season, especially over daytime hours. Generally, the schemes tend to underestimate the observed T2 for the better part of the day in both seasons. Notably, the MYNN and the QNSE schemes showed significant underestimation of the observed T2 during the night in both seasons. On the other hand, the BouLac scheme tended to overestimate the nighttime T2 during the dry season. However, BouLac showed a better match to the nighttime T2 during the wet season than most other schemes. The rest of the schemes mostly matched the observed nighttime T2 during the dry season and underestimated it during the wet season. Although all the schemes consistently underestimated the daytime T2, the MYNN showed more underestimation of the T2 during the dry season than the rest of the schemes.
The differences in the nighttime T2, especially with the MYNN and the QNSE schemes, are apparent in the time-averaged T2 maps at 03:00 UTC (06:00 LT) in
Figure 7. A greater spatial extent of lower T2 values was observed in the MYNN and QNSE scheme-derived T2 maps compared to the T2 maps derived from the rest of the schemes. Unlike for the T2 maps at 03:00 UTC (06:00 LT), not many spatial differences were observed in the time-averaged T2 maps at 11:00 UTC (14:00 LT) in
Figure A5 in the
Appendix A.
To further understand the influence of the PBL parameterization schemes on the diurnal variations observed in
Figure 6, time-averaged vertical profiles of potential temperature (
Figure 8) were analyzed. It is observed from
Figure 8 that the simulated PBL from all the schemes was slightly warmer during the wet season than in the dry season. Apart from the slight differences in temperature between the dry and wet seasons, not many differences were observed between the PBL temperature profiles at 03:00 UTC (06:00 LT) during the dry season (
Figure 8a) and the wet season (
Figure 8c). On the other hand, some differences were observed in the PBL temperature profiles at 11:00 UTC (14:00 LT) between the dry season (
Figure 8b) and the wet season (
Figure 8d). The enhanced differences observed in the PBL temperature profiles at 11:00 UTC (14:00 LT) during the wet season compared to the dry season were also observed in the T2 plots in
Figure 6. In general, it was observed from
Figure 8 that the local closure schemes tended to simulate a relatively cooler PBL than the nonlocal schemes. Notably, the MYNN scheme simulated a much cooler PBL than the rest of the schemes at 11:00 UTC (14:00 LT), especially during the dry season (
Figure 8b). However, it was observed that the BouLac scheme, despite being a local scheme, tended to simulate a warmer PBL than the rest at 03:00 UTC (06:00 LT) in both seasons. On the other hand, QNSE, a local closure scheme, simulated a warmer boundary layer than the rest at 11:00 UTC (14:00 LT) during the dry season. In the wet season, QNSE simulated 11:00 UTC (14:00 LT) PBL temperature profiles in the same range as the nonlocal schemes, which were relatively warmer than the other local schemes. The trend observed in the profiles of potential temperature at 03:00 UTC (06:00 LT) was consistent with the trend observed in T2 plots at 03:00 UTC (06:00 LT) in
Figure 6, especially for BouLac and QNSE. However, although the MYJ scheme simulated a cool PBL at 03:00 UTC (06:00 LT) similar to QNSE and MYNN, the corresponding T2 from the MYJ scheme was relatively higher than the one simulated with the QNSE and MYNN schemes. The trend observed in the vertical profiles of potential temperature for 11:00 UTC (14:00 LT) was consistent with the trend observed in the T2 plots at 11:00 UTC (14:00 LT) in
Figure 6 but less apparent than the trend observed at 03:00 UTC (06:00 LT). Just like at 03:00 UTC (06:00 LT), the MYJ scheme showed relatively higher T2 values at 11:00 UTC (14:00 LT) despite a cool PBL profile at 11:00 UTC.
Figure 9 shows the time-averaged diurnal variation in RH2. No significant differences were observed between the variations in RH2 in both the dry (
Figure 9a) and wet (
Figure 9b) seasons. It was also observed that the schemes tended to mostly underestimate the observed RH2 throughout the day in both seasons. Notably, the ACM2 scheme significantly underestimated the observed RH2. Intra-scheme differences and the differences between the schemes and the observed RH2 were more pronounced during the nighttime than the daytime.
Figure 10 shows the time-averaged RH2 maps at 03:00 UTC (06:00 LT) over the study area. The intra-scheme differences observed in
Figure 9 were also apparent in the RH2 maps, particularly with the QNSE scheme, and to a lesser extent, the ACM2 scheme compared to the rest of the schemes. The lower values of RH2 observed with the ACM2 scheme were also discernible in the time-averaged RH2 maps at 11:00 UTC (14:00 LT) in
Figure A6 in the
Appendix A. The rest of the schemes depicted minimal intra-scheme variations in RH2 at 11:00 UTC (14:00 LT), a trend that is consistent with the trend observed in
Figure 9.
The time-averaged vertical profiles of RH at 03:00 UTC (06:00 LT) in
Figure 11 show that the local schemes tended to simulate a moister PBL than the nonlocal schemes. However, the local schemes tended to dry up drastically along the vertical atmospheric column compared to the nonlocal schemes. This behavior was more pronounced with the MYNN scheme. It was also observed that at 03:00 UTC (06:00 LT) in
Figure 11a (dry season) and
Figure 11c (wet season), the BouLac scheme tended to simulate a relatively drier PBL, similar to the nonlocal schemes, compared to the rest of the local schemes. On the other hand, a somewhat drier PBL was simulated by the QNSE scheme at 11:00 UTC (14:00 LT) compared to the rest of the local schemes. The trend observed in the RH profiles was consistent with the trend observed in the RH2 in
Figure 9 for most of the schemes across both the wet and dry seasons. One notable difference was that the MYJ scheme simulated a lower RH2 than the QNSE scheme at 03:00 UTC (06:00 LT) despite a relatively more moist PBL at 03:00 UTC (06:00 LT). The significantly lower RH2 observed with ACM2 in
Figure 9 was also clearly observed in the RH profiles.
Figure 12 shows the time-averaged diurnal variations in WS10 simulated by the PBL schemes versus the observations. Higher wind speeds were observed during the dry season (
Figure 12a) compared to the wet season (
Figure 12b) based on both the observations and the WRF simulations. In general, all the PBL schemes overestimated the nighttime WS10 and underestimated the daytime WS10 in both seasons. It was also observed that the SHIN scheme tended to simulate relatively lower daytime WS10 compared to the rest of the schemes. Furthermore, it failed to capture the diurnal variation depicted by the observations, unlike the rest of the schemes. The QNSE and MYJ schemes consistently showed the largest overestimation of the observed WS10 during the day (08:00 (11:00)–14:00 (17:00) UTC (LT)) in both seasons. On the other hand, the ACM2 scheme consistently showed a significant overestimation of the WS10 between 15:00 UTC (18:00 LT) and 20:00 UTC (23:00 LT) in both seasons.
Figure 13 shows the time-averaged maps of WS10 and wind direction at 03:00 UTC (06:00 LT) during the wet season. Just like in
Figure 12b, no significant intra-scheme differences were observed in WS10 and the wind direction. Higher wind speeds were observed mainly along mountain slopes. The maps also indicate that wind flow at night is influenced primarily by topography, with no general dominant nighttime wind direction over the study area. Similar patterns were observed in the time-averaged maps of WS10 and wind direction at 03:00 UTC (06:00 LT) during the dry season (
Figure A7) in the
Appendix A.
Unlike in
Figure 13, significant intra-scheme differences consistent with the differences in
Figure 12 were observed in the time-averaged maps of WS10 and wind direction at 11:00 UTC (14:00 LT) during the wet season (
Figure 14). Strong easterly winds were observed in
Figure 14 in most parts of the study area apart from the north, where the winds assumed a more northeasterly direction. On the western side of the study area, some westerly and northwesterly winds, likely blowing from Lake Victoria, were also observed. The large region of low wind speeds observed in the western side of the study area indicates a convergence zone between the prevailing easterly winds and the westerly, northwesterly lake breeze. However, as observed in
Figure A8 in the
Appendix A, during the dry season, the daytime wind direction predominantly blows from the northeast with the lake breeze, assuming a northerly flow in the western part of the study area for most of the schemes.
Figure 15 shows the spatial distribution of WRF simulated rainfall using the seven PBL schemes during the wet season. It was observed that the schemes could capture the topography-driven spatial variations in rainfall over the study area. Although the schemes capture the spatial distribution of rainfall depicted by the observations, they tended to overestimate and underestimate the observed rainfall across stations indiscriminately. Additionally, the MYNN and the ACM2 schemes showed slightly lower amounts of rainfall compared to the rest of the schemes. Similar trends were also observed during the dry season in
Figure A9 in the
Appendix A. However, during the dry season, besides the significantly lower rainfall amounts irrespective of the scheme, the spatial extent of rainfall was mainly restricted to the mountains and the western side of the study area close to Lake Victoria.
Figure 16 shows the probability of occurrence of rain events based on simulations with the various PBL schemes versus the observations. It was observed that the schemes were capable of simulating rainfall events of up to 10 mm/h fairly well. Beyond rainfall intensities of 10 mm/h, the schemes tended to underestimate the intensity of the observed rainfall. The PBL schemes produced similar probability curves, with the main difference being the maximum intensity simulated with each scheme. The MYNN scheme simulated the lowest maximum intensity, approximately 22 mm/h, followed by ACM2 at 24 mm/h. SHIN and BouLac simulated a maximum intensity of approximately 26 mm/h each. The QNSE scheme simulated a maximum intensity of roughly 32 mm/h. The YSU scheme simulated a maximum intensity of approximately 48 mm/h, which is approximately equal to the maximum rainfall event of the observations. The MYJ, on the other hand, simulated a maximum intensity of approximately 54 mm/h, which is beyond the maximum intensity of the observations.
Figure 17 shows the distribution of the ETS values obtained using the seven PBL schemes over the study area during the study period. It was observed that better performances (higher ETS values) were obtained with the YSU scheme followed by the QNSE and BouLac schemes. The MYNN scheme ranked fourth in performance in terms of the ETS. The lowest performance (lower ETS values) was observed with the MYJ scheme, followed by the ACM2 and SHIN schemes, respectively.
4. Discussion
The WRF model has been used in various studies and applications, as reported in Powers et al. [
34]. The ability of RCMs such as WRF to add details related to local-scale processes to climate variables derived from GCMs is vital for local climate studies. In addition, it is beneficial for other applications that require high spatial and temporal resolution information on weather and climate, such as energy balance modeling, hydrological applications, among others. As observed in
Figure 2, the WRF-simulated T2 shows enhanced spatial details compared to the T2 obtained from ERA5-Land, the highest spatial resolution (9 km) global ReAnalysis dataset currently available for land applications. At the relatively higher spatial resolution of 1 km compared to the 9 km of ERA5-land, the WRF model was able to capture the local scale physical processes related to topography and land use with more details. The refined spatial details captured with the WRF-simulated T2 in
Figure 2 imply lower spatial resolution-related biases in the WRF-derived surface meteorological variables than ERA5-land, especially in regions characterized by complex topography. Besides the improved spatial resolution, WRF can simulate variables at higher temporal resolutions than the hourly time scale currently available in ERA5-land.
However, as observed in the statistical results presented in
Figure 2 and
Figure 3, it is evident that despite the improved spatial resolution achieved through WRF, uncertainties exist in the simulated surface meteorological variables irrespective of the PBL scheme. For instance, it was observed in
Figure 6 and
Figure 9 that the WRF model generally tends to underestimate T2 and RH2 irrespective of the PBL scheme. Similar findings have been reported in [
3,
19,
21] for T2 and [
14,
21] RH2. Various other studies have also shown that the WRF model tends to overestimate the observed wind speed [
5,
19], which agrees with the results presented in
Figure 12. One source of uncertainties could be related to uncertainties in the forcing data that might propagate into the WRF simulations. The other source of uncertainties in the simulated surface meteorological variables could be related to the spatial representativeness of the point observations with respect to the 1 km model grid size, especially in complex topography. For instance, the tendency of the WRF model to overestimate wind speed is mainly attributed to the representation of surface roughness in the model, especially in complex environments and at coarser model grid sizes [
5], which tends to be smoother, resulting in less resistance. However, it should also be noted that the original observation height of 2 meters might be below the surface roughness height for some of the locations, which might have contributed to lower wind speed at 10 meters upon interpolation using the wind power law. Moreover, the wind power law neglects the effects of surface roughness and atmospheric stability on wind speed. Thus, uncertainties in the observed wind data and the interpolation might have impacted the results.
The choice of a PBL scheme is largely influenced by the weather phenomenon to be studied. Weather phenomena are highly dependent on the state of atmospheric stability. For instance, convective storms occur under extremely unstable atmospheric regimes in which mixing is largely driven by large eddies. On the other hand, under relatively stable atmospheric regimes, mixing is mainly localized and occurs between adjacent layers in the PBL. Local mixing is conducive to the formation of weather phenomena such as fog. Nonlocal PBL schemes are generally better suited for simulating unstable atmospheric regimes, whereas local PBL schemes are more suitable for simulating stable atmospheric regimes. As observed in
Figure 8 and
Figure 11, the PBL parameterization directly impacts the evolution of the heat and moisture fluxes in the PBL. In general, the local schemes tended to simulate colder, more moist PBLs, especially near the surface relative to the nonlocal schemes. However, the local schemes showed drastic warming and drying with height in the atmospheric column, compared to nonlocal schemes. This implies that the local PBL schemes simulated a more stratified PBL than the nonlocal schemes. The stratification observed with the local schemes was a result of less mixing between the warmer, drier air entrained from the free atmosphere and the cooler, more moist air emanating from the surface. On the other hand, the profiles simulated by the nonlocal schemes in
Figure 8 and
Figure 11 indicate more mixing between the entrained air mass from the top of the PBL and the air emanating from the surface.
The statistical results presented in
Figure 3 and
Figure 4 show that choosing a PBL scheme is crucial in retrieving surface meteorological variables using WRF. Although the simulation of T2 and RH2 appear less sensitive to the choice of the PBL scheme, the choice of the PBL scheme will have an impact on the ability of WRF to simulate WS10 and rainfall. These results generally agree with results from other studies [
3,
19,
22,
24], which have reported better performances and lower sensitivity to the choice of PBL scheme in WRF in the simulation of T2 and RH2 compared to other surface meteorological variables. The influence of the PBL scheme appears more pronounced during the day when the atmosphere is more convective and unstable than during the night when the atmosphere is relatively stable. The trend observed in the simulated profiles of temperature and humidity (
Figure 8 and
Figure 11) is propagated to the surface as observed in the plots of T2 and RH2 in
Figure 6 and
Figure 9, respectively. However, it is observed that in some instances, the trend observed in the profiles of temperature and humidity is not consistent with the trend observed at the surface. An example is at 03:00 UTC (06:00 LT) in
Figure 6, where MYJ simulates a relatively higher T2 than QNSE despite similar temperature profiles in
Figure 8. These discrepancies may be attributed to differences in the surface layer schemes accompanying each PBL scheme. The influence of the surface layer schemes might also explain the reduced sensitivity observed in T2 and RH2 since the 2-meter height will be mostly within the surface layer, especially under a deep convective PBL. The reduced intra-scheme differences observed in
Figure 6 and
Figure 9 during the day compared to nighttime further supports this argument.
On the other hand, WS10 and especially rainfall are mainly influenced by the choice of PBL scheme since the heights at which processes related to these variables occur are within the PBL. As observed in
Figure 8 and
Figure 11, the ACM2 consistently produced a warmer and significantly drier PBL than the rest of the schemes. On the other hand, the MYNN and MYJ schemes simulated a cool and consistently more moist PBL than the rest. The behavior exhibited by the ACM2 and MYJ schemes is reflected in the rainfall probability plots in
Figure 16. The ACM2 tends not to capture the higher rainfall intensities, whereas the MYJ scheme simulates some rainfall events that are sometimes way higher than the observations. The behavior observed with the ACM2 and the MYJ schemes in
Figure 16 might partly explain the relatively lower ETS values obtained with the two PBL schemes in
Figure 17. The results in
Figure 16 and
Figure 17 imply that the ACM2 scheme might be missing more observed rainfall events than the rest of the schemes. On the other hand, the MYJ scheme might be simulating more false rainfall events than the rest of the schemes. As argued in Coniglio et al. [
40] and Cohen et al. [
18], nonlocal schemes often tend to simulate too dry of air near the surface owing to PBLs that are too deep and overmixed. Thus, the behavior observed with the ACM2 scheme might be due to a tendency by the scheme to overmix the PBL. On the other hand, local PBL schemes have suffered from undermixing, especially in deep convective environments, mainly due to a poor representation of entrainment processes at the top of the PBL [
18], resulting in a shallow, cool, and moist PBL. Thus, MYJ may be susceptible to undermixing in simulating the PBL in this area. Based on the temperature and humidity profiles (
Figure 8 and
Figure 11) and the rainfall plots in
Figure 16 and
Figure 17, it appears that the moderately warm and moist PBL profiles simulated by the YSU scheme are more suited for simulating rainfall in this area.
The results presented in
Figure 5 also suggest that the model performance might be impacted by the ability of the PBL schemes to represent the interactions between the large-scale and local-scale processes. As observed in
Figure 14 and
Figure A8, during the day, processes in the western side close to Lake Victoria are influenced by the interaction between the more large-scale wind flows from the eastern side and the lake breeze. Thus, improper representation of the interactions between the two air masses might explain the higher RMSE values for the RH2 observed in the western side of the study area in
Figure 5. For instance, the location of the convergence zone observed in
Figure 14 will be highly influenced by the ability of the schemes to properly simulate the direction and strength of the lake breeze and its interaction with the prevailing wind flow. In addition, the ability of the PBL scheme to represent local-scale turbulence related to land use and topography and its interaction with the large-scale processes, such as the prevailing wind flow, will have a significant impact on the performance of the model. As observed in
Figure 14 and
Figure A8, intra-scheme differences in wind speed are quite pronounced during the day, as opposed to during the night (
Figure 13 and
Figure A7). The daytime intra-scheme differences in wind speed signify the differences in the ability of the various PBL schemes to represent buoyancy and vertical wind shear related to the interaction between the local scale turbulence and the larger-scale wind flow in the PBL. All these interactions influence the wind speed and direction and the distribution of moisture within the atmosphere. Consequently, other processes such as convection will be affected by these interactions impacting the simulated rainfall. As such, the effectiveness of the parameterizations in the PBL scheme in representing such interactions will determine the scheme’s success in representing the local-scale processes.
Although the 2-meter variables appear less sensitive to the choice of the PBL scheme in this study, the choice of a PBL scheme for studies involving 2-meter variables is still essential. As observed in
Figure 8 and
Figure 11, the representation of fluxes along the vertical profile can vary significantly depending on the PBL scheme. Under extreme conditions, such as heatwaves, large eddy-driven turbulent mixing will be dominant across the PBL, significantly influencing the 2-meter variables as well. As such, modeling of the PBL should be carefully considered even for studies involving 2-meter variables.
5. Conclusions
This study evaluated the influence of PBL parameterization schemes in WRF in the retrieval of surface meteorological variables over the Kenyan highlands. The experimental setup was configured into three one-way nested model domains, with the innermost domain being at a spatial resolution of 1 km. The model was driven by forcing data from ERA5 at six hourly intervals. The study evaluated seven PBL schemes, namely: ACM2, BouLac, MYJ, MYNN, QNSE, SHIN, and YSU. The evaluation was only carried out for the innermost model domain. The surface meteorological variables considered are the T2, RH2, WS10, and rainfall. The surface meteorological variables were validated against observations from TAHMO.
In general, WRF was found to simulate surface meteorological variables that exhibited spatial variability consistent with the complex topography within the Kenyan highlands. A comparison between T2 simulated by WRF and T2 obtained from ERA5-Land revealed more spatial details with WRF than ERA5. Thus, WRF-derived surface meteorological variables are better suited for land applications within the Kenyan highlands than ERA-5 land.
On the influence of PBL schemes on surface meteorological variables, it was observed that T2 and RH2 are less sensitive to the PBL scheme’s choice than WS10 and rainfall. T2 and RH2 are less sensitive to the choice of the PBL owing partly to the influence of the surface layer parameterization on these variables. Since the processes related to rainfall mainly occur in the PBL, we conclude that rainfall better reflects the influence of the PBL schemes. Thus, based on the results of the simulated rainfall, the YSU scheme provides a more realistic depiction of the PBL dynamics within the study area.
One of the limitations to this study is the lack of observations of fluxes along the vertical profile, which hinders a comprehensive evaluation of PBL dynamics within the study area. Additionally, the study considered only a period of four months owing to limitations in computation resources. This period may be inadequate, especially in the evaluation of the seasonal dynamics within the study area.
Future studies should incorporate observations on the atmospheric profile. The studies should also consider simulations over more extended periods to determine the influence of seasonality on the model performance within the study area.