3.1. Temperature Validation
We can see in
Figure 3 the time series of the forecasted and observed temperature corresponding to the six validation points, from June to October 2010. The WRF model forecasted temperature as expected, i.e., it reflected the variability of both diurnal and nocturnal temperatures (
Table 6). The model was able to forecast significant thermal oscillations, such as the low temperatures that occurred late in October (
Figure 3; green box). However, on 1 July 2010, Hurricane Alex made landfall in Tamaulipas, moving inland and downgrading over the following days to a tropical storm and depression, before reaching the edge of the western coast of Mexico. According to the Servicio Meteorológico Nacional (SMN, México City, Mexico, by its Spanish acronym), this tropical depression left a low-pressure zone that generated sudden changes in air flows, first from the north and northeast and then from the west and northwest of the country. These flows caused reduced temperatures in the first days of July (
Figure 3; black box), and the model, which had forecasted normal flows, could not predict these unexpected interactions.
The validation points show a bias between the forecasted and observed temperatures that did not exceed
(
Table 6). This is minor, according to Jolliffe and Stephenson [
22], since it represents only between 10% and 25% of the standard deviation of temperature, except at the Asfalto station, in which the BIAS is 0.79, representing 30.6% of stdD-O. Nevertheless, the standard deviations of the forecasts (stdD-F) are higher in all cases than the observed values, indicating that the model overestimated higher temperatures and underestimated low temperatures. The model projected higher temperatures at four stations (Asfalto, CUC, Prepa PV, and Tomatlán River) and less high temperatures in two stations (Tec Ny and Coapinole). There is, on average, a good correlation between the forecast and the observation
of the six validation points. The errors (
and
) show values lower than
, indicating that the forecast errors are small and that the forecast presents adequate accuracy. In comparison, other studies show similar trends. In a validation analysis for the temperature of a WRF model in a subtropical region in India, the
was
for the nine simulations of summer conducted at two stations; meanwhile,
was
, and the
ranged from
to
[
27]. Challa et al. [
43] reported a graphical analysis of the sensitivity of atmospheric dispersion simulations, where the WRF underestimated daily temperatures. When evaluating temperature for an air quality analysis in the northeast of the Iberian Peninsula, errors were found in the 2 km domain, the
was 1.
, and the
ranged from
to
[
44]. In an analysis of the weather event of 2007 related to the flood in Tabasco, Mexico, the errors (
and
) in the two sampling points were
and
, respectively, while the
was
and
, and
was 0.97 and
, respectively, for the 24 h forecasts [
25].
For the elaboration of the average observed temperature map, the equation used was
with
and
. The thermal gradient for every 1000 m of altitude was
on average during the five months of validation. In other studies, the results were similar. Gomez et al. [
35], for the estimation of the average temperature in Tepehuanes, Durango, Mexico, obtained an
on average for June to October, while Ninyerola et al. [
34], for the climatological modeling of the air temperature in Catalonia, Spain, obtained an
on average for the five months of validation with the corrected temperature maps.
The spatial distribution of temperature predicted by the WRF model (
Figure 4a) presented spatial thermal variability, similar to the observed temperature (
Figure 4b). The model also accurately determined the warm zones, on the coast
and the temperate zones in the mountains
. We observed that the determining variable in the temperature in the study region is the type of topography. This corroborates what Magaña et al. [
7], Ninyerola et al. [
34] and Jorba et al. [
44] reported. Spatially, the temperature followed a bi-modal distribution that was reasonably well projected by the model in its histogram when compared with the observed temperature (
Figure 4).
The predicted spatial temperature distribution shows the same spread as the observed temperature, with a standard deviation value of
(
Figure 4). The
was
, indicating that the model slightly underestimated the observed temperature, with a correlation of
, while the
was
, showing a slight reduction in forecast accuracy. As a background to these error results, a comparison can be made with the research of Flaounas et al. [
26], where they performed a simulation of the West African monsoon in 2006 using the WRF, and the simulated temperatures in comparison with those observed for all of West Africa had a strong positive BIAS that reached
.
3.2. Precipitation Validation
For binary (yes/no) events, a “yes” event is defined for precipitation equal to or higher than a specified reference (in this study, it is 1 mm of accumulated rain in 24 h); otherwise, it is defined as a “no event”. The distribution of the observed days with rain and the days in which rain was not observed compared to those forecasted are shown in the next contingency table (
Table 7).
In contrast to the temperature analysis, the model showed poor performance in predicting the number of days with precipitation compared to what was observed. In contrast to the temperature analysis. The period of rain under assessment covered 144 days at six verification points, a total of 864 forecasts. From this total, the rain was observed on 358 days, which means that the model failed to predict 266 days, or 74.3% (
Table 7 and
Table 8). The correct proportion of days with rain forecasted
corresponds to 25.7%, with a
of 95% (21.4%, 30.5%) (
Table 8). The proportion of days with rain that represented false alarms
corresponds to 25.8%, with a
of 95% (18.9%, 34.1%). Only PC shows a favorable performance with 65.5%, with a CI of 95% (62.3%, 68.6%). However, this index was influenced by the 474 days on which no rain occurred (
Table 8).
Severe events that occurred suddenly and could involve rainfall greater than 100 mm were not accurately forecasted (
Figure 5). The temporal distribution of precipitation showed a very marked bias in the accumulated precipitation at the beginning of September (
Figure 5, green rectangle).
According to the SMN report, on 4 September 2010, a low-pressure convergence line formed over the western Sierra Madre and the Mesa Central (
Figure 6). This generated the propagation of cloudy days with a high potential for rain and thunderstorms that caused precipitation exceeding 100 mm at the meteorological stations Prepa PV and Coapinole.
Table 9 shows the results of the statistical indices of the six validation points, suggesting the poor performance of the model, since the model underestimated the accumulated precipitation in 24 h periods, by between 2.3 mm and 10.24 mm. The correlation coefficients in the six sampling points were
. The
and
values in all cases were greater than 15 and 6 mm, respectively, showing a low performance in terms of forecast accuracy. In comparison, in an analysis for validating forecasts of accumulated rain to 24 h with WRF for all of Mexico, Corrales et al. [
3] reported errors (
and
) of 6.25 mm and 2.26 mm, respectively, while the correlation was 0.35.
For the elaboration of the accumulated precipitation map during the study period, we used the equation
, where
and
. After the precipitation map was corrected, the
and
. In contrast, Ninyerola et al. [
34], for the climatological modeling of precipitation in Catalonia, Spain, obtained an
for June to October. First, we calculated multiple correlations including latitude and altitude; these variables were also used in the equation of Ninyerola et al. [
34], but in our results
and
; therefore, we used altitude and evaporation in our equation, and thus we obtained the highest value of
.
The spatial distribution of precipitation forecasted by the model was poor compared to that observed in the verification (
Figure 7). Mean precipitation for the entire period (10 June–31 October 2010) was forecasted at 503 mm, but 1355 mm was observed. WRF underestimated the accumulated precipitation during the entire study period, with a
of
mm; meanwhile, the average degree of correspondence between individual pairs of predicted and observed values
was
mm and produced the histograms shown in
Figure 7. Despite the above, the model (
Figure 7a) was able to predict areas of high precipitation (the Tuito Mountains, La Bufa Hill, and Cacoma Mountain) and of lower precipitation (the coastal zone and the Ameca River Valley), very similar to what was observed. The value of the
index is very close to 2 (
), indicating that the WRF was able to predict where higher and lower precipitation occurred; however, it underestimated the accumulated precipitation for the entire period, with a
of 813 mm (
Table 9) and produced the histograms shown in
Figure 7. Other researchers found similar results—for example, Flaounas et al. [
26] found, in a simulation of the West African monsoon in 2006 using the WRF, that the precipitation pattern was generally modeled to an adequate level, although these authors acknowledged that convective rainfall is a constantly changing variable that is difficult to model. Casati [
28], in his thesis for the evaluation of the NIMROD model for spatial rain forecasting in the United Kingdom, found values of
and
that were
, while the value of correlation was less than 0.4 in the eight case studies and the
index was between 1.75 mm and 9.27 mm. Magaña et al. [
7], in their MM5 model evaluation for rain forecasting in periods of 24 h for Mexico City, found large spatial differences in August during the period of 2008–2010. They reported that in the south and southwest of Mexico City, the rain was overestimated by approximately 50 mm, while in the western area where greater precipitation was observed, the model underestimated the precipitation by approximately 200 mm from a total of 300 mm and, on the east side of the city, the difference was approximately 50 mm.
Regarding short-term forecasting in the BB region, relatively little is known about the quality of predictions from models that are run on a national or regional scale. When high-resolution spatial forecasts are performed in cases where ground effects are important, the quality of the forecasted rainfall is limited by different spatial patterns [
7]. Another limiting factor is working at a low resolution (50 km) to save computational resources, for example in the analysis of the African monsoon reported by Flaounas et al. [
26]. In this context, Flores [
29] pointed out that the differences between the predicted and observed variables are because the model simplifies and smoothes the surface processes by representing them in a grid of 18 km resolution. Meanwhile, Jorba et al. [
44] found that the simulation with a high horizontal resolution (2 km grid) in the geographic area of Catalonia, Spain, can allow the analysis of the possible atmospheric circulation that occurs. In correspondence with the above and taking into account these limitations in the quality of the forecasts, we have evaluated the output results of the WRF at the working resolution (7 km).