3.2. Analysis of ERA5 ZTD Bias Characteristics
To quantitatively assess the consistency and deviation between the ERA5 ZTD and the GNSS ZTD within the study area, we calculated the annual mean bias, defined as the difference between GNSS ZTD and ERA5 ZTD for the 10 modeling stations. The statistical results are presented in
Table 2.
As evidenced in
Table 2, the bias values for all 10 modeling stations are consistently negative. The ubiquity of these negative values indicates a prevalent systematic discrepancy across the study region, suggesting that the ERA5 ZTD tends to systematically overestimate the ZTD values compared to the GNSS ZTD. Specifically, the values range from −8.8 mm at the APEL station to −3.5 mm at the STAV station.
To further demonstrate the temporal characteristics of this prevalent systematic bias, this study selected the EBRG station for a detailed time series analysis. This station was chosen for its statistical representativeness, as its RMSE and Bias values closely approximate the average levels of all validation stations.
Figure 3 visually illustrates the time series comparison between the GNSS ZTD and the ERA5 ZTD at the representative EBRG station. As shown in the upper panel of
Figure 3, the ERA5 ZTD demonstrates a high degree of consistency in its overall annual trend with the GNSS ZTD, which is used as the reference value. Distinct seasonal patterns are evident in both datasets, with peaks occurring in summer and autumn.
However, a clear and systematic deviation exists between those two time series. Over the study period, the ERA5 ZTD values are systematically higher than those of the GNSS ZTD. The residual sequence in the bottom panel of
Figure 3 further quantifies this difference. This residual sequence remains below the zero line for most of the year, consistent with the negative bias shown in
Table 2, confirming a prevalent systematic overestimation in the ERA5 ZTD within this region.
Furthermore, this residual sequence is not a constant offset but rather exhibits complex temporal fluctuations. Particularly during the summer, which is characterized by more active water vapor, and during partial extreme weather events, the residual amplitude and short-term oscillations are markedly amplified. This fully demonstrates that the systematic bias between the ERA5 ZTD and the GNSS ZTD is a time-varying and complex variable closely related to meteorological conditions, rather than a simple constant error. Therefore,
Figure 3 clearly reveals the necessity of constructing a refined and spatiotemporal residual correction model, which serves as the core basis for the subsequent work in this study.
3.3. Performance of the RK ZTD
To quantify the accuracy improvement gained from incorporating ERA5 grid ZTD and to provide an objective performance assessment of the proposed RK ZTD, the GNSS-Kriging ZTD model was applied, which is generated entirely from GNSS observations using Kriging interpolation and is intended to simulate the ZTD modeling accuracy achievable without relying on external meteorological products.
The procedure for this GNSS-Kriging ZTD model is as follows. First, it employs the identical set of 10 modeling stations used by the RK ZTD model. The high-precision GNSS ZTD time series from these stations are used directly as the input samples for interpolation. Subsequently, Ordinary Kriging is applied to spatially interpolate ZTD values directly to the validation station locations. The resulting ZTD estimates are derived exclusively from the interpolation of discrete GNSS station data, independent of any external model data.
To quantitatively assess the overall performance of the proposed Kriging-based residual correction model, this study conducted a year-long, comprehensive accuracy evaluation in comparison with two models, ERA5 ZTD and GNSS-Kriging ZTD, at 26 validation stations.
Figure 4 clearly illustrates the scatter density distributions comparing the estimates from the three models with the GNSS ZTD reference values.
As shown in
Figure 4, the ERA5 ZTD product exhibits the lowest accuracy. Its scatter points show significant dispersion and a 5.2 mm systematic bias, with an RMSE of 13.7 mm and an R
2 of only 0.9319. The coefficient of determination R
2 serves as a crucial metric to quantify the goodness of fit, reflecting the proportion of the variance in the reference GNSS ZTD that is predictable from the model. A higher R
2 value signifies a stronger capability of the model to capture the complex temporal variations and dynamic trends of the actual tropospheric delay. In comparison, the GNSS-Kriging ZTD model, constructed solely by interpolating from sparse modeling stations, demonstrates a significant improvement in accuracy. Its data points are more concentrated, with the RMSE decreasing to 8.8 mm and the R
2 increasing to 0.9728, which proves the fundamental advantage of using high-precision GNSS observations as a data source.
The proposed RK ZTD demonstrates the optimal performance. Its scatter distribution is highly concentrated, exhibiting the minimal dispersion among others. Statistical results show that the RMSE of RK ZTD is only 5.7 mm, representing an accuracy improvement of 58.4% and 35.4% compared to the original ERA5 and GNSS-Kriging models, respectively. Simultaneously, its absolute Bias is only 0.4 mm, indicating the systematic bias is nearly eliminated, and its R2 reaches 0.9865, showing extremely high consistency with the GNSS reference values. This demonstrates that the RK ZTD correction model proposed in this study substantially improves the accuracy of ERA5 ZTD.
To further compare the accuracy performance of the proposed RK ZTD with the ERA5 ZTD and GNSS-Kriging ZTD,
Figure 5 presents the probability density functions of the Bias values for the 26 validation stations. As illustrated, the ERA5 ZTD exhibits a distinct positive systematic deviation. Its distribution is mainly concentrated in the positive range of 2 to 9 mm, with the peak shifted significantly to the right around 5.5 mm and reaching a probability of over 30%. This quantitative shift confirms a consistent overestimation of ZTD by the ERA5 model across the region. The results are presented in
Figure 5.
Conversely, the GNSS-Kriging model, while derived from observations, displays the poorest stability. Its distribution curve is notably broad and flat, spanning a wide interval from approximately −12 mm to 4 mm. The peak density is relatively low and shifts negatively to around −6 mm. These characteristics indicate significant station-to-station fluctuations and large error dispersion caused by interpolation uncertainties from the sparse modeling stations.
In contrast, the proposed RK ZTD demonstrates the optimal performance with the most robust error control. Its probability distribution is highly concentrated within a narrow range of ±2.5 mm. The curve is sharply peaked, reaching a probability of approximately 34% while centering almost perfectly on the zero line. This confirms that the RK model effectively eliminates the systematic bias of the ERA5 ZTD while significantly reducing the error dispersion observed in the GNSS-Kriging ZTD. Consequently, the proposed method achieves high-precision, unbiased ZTD estimation across all validation stations, verifying the effectiveness of the residual correction strategy.
To further analyze the performance of the RK ZTD model at different geographical locations, this section calculates the per-station RMSE for the 26 validation stations. Their spatial distribution is presented in
Figure 6. The figure simultaneously presents the RMSE distribution of the GNSS-Kriging model for comparison, and both subplots utilize a unified color scale to facilitate direct comparison.
A comparison of the overall color tones in
Figure 6 reveals that the RMSE values of the RK ZTD model in panel a of
Figure 6 are significantly lower than those of the GNSS-Kriging model in panel b of
Figure 6 at nearly all stations, demonstrating the universal superiority of the proposed residual Kriging strategy. Specifically, the RMSE distribution of RK ZTD is more uniform across the stations, with values primarily concentrated between 4 and 8 mm. In contrast, the GNSS-Kriging model exhibits not only higher RMSE values but also greater spatial variability, with errors exceeding 12 mm at some central-eastern stations.
These spatial distribution results indicate that the introduction of ERA5 grid data effectively mitigates the decrease in interpolation accuracy caused by the uneven distribution of GNSS stations. This significantly enhances the spatial consistency and robustness of the ZTD product across the entire study region.
To visually quantify the accuracy gains achieved by the proposed RK ZTD model relative to the comparative methods,
Figure 7 presents the spatial distributions of the RMSE improvement rates against both ERA5 ZTD and GNSS-Kriging ZTD.
Panel a of
Figure 7 reveals that the improvement of RK ZTD over the original ERA5 ZTD is extremely significant. At all 26 validation stations, the accuracy improvement rates are positive, with the majority of stations exceeding 40% and some even surpassing 60%. This demonstrates that the residual correction strategy proposed in this paper possesses a universal and highly efficient capability for correcting the systematic bias of the original ERA5 ZTD.
The comparison in panel b of
Figure 7 more clearly reveals the accuracy gain brought by fusing ERA5 data. Compared to the GNSS-Kriging model, which relies solely on the interpolation of 10 modeling stations, RK ZTD likewise exhibits an RMSE reduction at nearly all stations. Notably, the regions where the GNSS-Kriging model performed worst are shown in panel b of
Figure 7, specifically the southwestern coastal stations between 51°N and 52°N and the northeastern stations north of 52.5°N, all with RMSEs exceeding 10 mm, which corresponds precisely to the regions where the RK ZTD model demonstrates the most significant improvement rates in panel b of
Figure 7. This spatial correlation provides strong evidence that the proposed residual correction method can effectively utilize the continuous spatial information provided by the ERA5 grid data to precisely compensate for the interpolation shortfalls caused by the uneven distribution of GNSS stations, particularly in regions where modeling stations are sparse.
To examine the performance stability of the models under different seasonal conditions, this section presents the RMSE distributions for the three models across the four seasons. The results are presented as box plots in
Figure 8 and summarized numerically in
Table 3.
Two primary features can be observed in
Figure 8. First, the accuracy of all models exhibits distinct seasonal characteristics, with RMSE values in summer generally being higher than in spring and winter. This aligns with the physical characteristics of the region, which experiences more active water vapor dynamics during the summer. Second, the proposed RK ZTD consistently demonstrates the highest accuracy. As detailed in
Table 3, the RK ZTD maintains the lowest RMSE throughout the year, ranging from 4.4 mm in winter to 6.8 mm in summer, significantly outperforming both the GNSS-Kriging ZTD and the ERA5 ZTD.
Specifically, during summer, the season with the most active water vapor, the RK ZTD effectively suppresses the error growth, keeping the RMSE at a low level of 6.8 mm, whereas the ERA5 ZTD reaches 11.8 mm. Concurrently, as shown in
Figure 8, the interquartile range of RK ZTD remains narrow in all seasons, indicating high consistency in performance across the different validation stations. It is particularly noteworthy that in the autumn statistics shown in
Table 3, the RMSE of the ERA5 ZTD surges abnormally to 19.9 mm, the worst performance among all seasons. In contrast, the RK ZTD remains stable with an RMSE of 6.6 mm, correcting the error by nearly 67%. This anomaly in autumn is closely related to an extreme weather event during the study period.
However, a significant anomaly is observed in the autumn statistics. As shown in
Table 3, the RMSE of the ERA5 ZTD surges abruptly to 19.9 mm, which is the worst performance among all seasons and far exceeds the typical error range. In sharp contrast, the RK ZTD remains stable with an RMSE of 6.6 mm, effectively correcting this large deviation. This unusual degradation in ERA5 accuracy suggests the presence of specific meteorological factors or extreme events during this period that the reanalysis data failed to capture accurately. To investigate the physical mechanism behind this autumn anomaly, a detailed analysis of a specific extreme weather event occurring in November will be presented in the following section.
To further investigate the spatial distribution characteristics of the seasonal differences observed in
Figure 8,
Figure 9 presents the per-station RMSE spatial distribution results for the three models across the four seasons. This figure provides richer spatial dimension information for the box plots in
Figure 8 and validates the seasonal performance of the models.
A column-wise comparison clearly shows that, regardless of the season, the RMSE of the proposed RK ZTD model is significantly lower than that of ERA5 ZTD and GNSS-Kriging ZTD at all stations. Based on the legend, RK ZTD presents as dark blue or blue-green at the vast majority of stations in all four seasons, indicating an RMSE below 10 mm. Especially during summer and autumn, when water vapor is most active and the errors of ERA5 ZTD and GNSS-Kriging ZTD are generally high, RK ZTD still effectively controls the RMSE at a low level, demonstrating superior performance.
A row-wise comparison allows for the analysis of each model’s seasonal performance and spatial dependency. The error of ERA5 ZTD is worst in autumn, and its high-error stations show clear spatial clustering, mainly distributed in the central-eastern inland region of the Netherlands, which is highly consistent with the statistical results from the box plots in
Figure 8. The GNSS-Kriging model also exhibits a significant accuracy decrease in summer and autumn, but its spatial characteristics differ from ERA5, with its high-error areas appearing more dispersed along the coast and in the southwest.
In contrast, the performance of the RK ZTD model is the most outstanding. It not only maintains the lowest RMSE level in all seasons but also features the most uniform spatial distribution, avoiding the obvious high-error “clusters” exhibited by the other two models. This result again confirms that the RK ZTD model proposed in this study possesses high accuracy and high spatial robustness under different seasonal and complex meteorological conditions.
3.4. Impact of Storm Ciarán on ZTD Accuracy
To investigate the physical mechanism behind the significant accuracy degradation of the ERA5 ZTD observed in November, this study conducted a detailed meteorological analysis of the extreme weather event, Storm Ciarán, which impacted the study area from 1 to 4 November.
Figure 10 illustrates the temporal variations of four key meteorological parameters including relative humidity, temperature, atmospheric pressure, and wind speed at the three representative stations from 26 October to 9 November.
As shown in panel c of
Figure 10, the pressure variation serves as the most direct indicator of the trajectory of the storm. Prior to 31 October, the pressure remained stable above 990 hPa. However, commencing on 1 November, a rapid and precipitous drop was recorded across all stations, hitting a deep trough on 2 November. The Deelen station (52°N, 6°E) recorded a minimum pressure of approximately 966 hPa, while the coastal Rotterdam station (52°N, 4.5°E) dropped even lower. This sharp V-shaped pressure curve signifies the rapid passage of a deep low-pressure system. Following the departure of the storm, pressure rebounded sharply from 3 November, recovering to normal levels above 1000 hPa by 7 November.
The wind dynamics exhibited a strong correlation with the pressure drop as depicted in panel d of
Figure 10. Wind speeds remained relatively low before the event but surged continuously starting 31 October. On 2 November, coinciding with the pressure minimum, wind speeds peaked across all stations. Notably, observations at the Rotterdam station were consistently higher than those at the inland stations, with a peak speed exceeding 50 km/h, reflecting the severe impact of the storm on coastal areas. Additionally, the passage of the storm induced significant fluctuations in hydrothermal conditions. Panel a of
Figure 10 shows that relative humidity experienced violent oscillations during the storm, dropping sharply to nearly 60% on 3 November after the rainband passed, indicating a rapid air mass exchange. Similarly, temperature exhibited irregular diurnal patterns during the event, as shown in panel b of
Figure 10, further confirming the atmospheric instability.
These extreme meteorological conditions provide a physical explanation for the large systematic bias and RMSE observed in the ERA5 ZTD during this period. The Zenith Hydrostatic Delay is strictly proportional to surface pressure. The drastic pressure drop of over 30 hPa within 48 h caused a massive variation in the ZHD component. ERA5, as a reanalysis model with limited temporal resolution and spatial grid averaging, often fails to perfectly synchronize with such rapid local pressure plunges, leading to significant residuals in the hydrostatic component. Furthermore, the violent fluctuations in wind and humidity imply intense turbulent water vapor transport. Under such dynamic conditions, the moisture distribution becomes highly heterogeneous. The ERA5 model tends to smooth out these local extremes, whereas the GNSS stations sensitively capture the real-time and high-frequency water vapor variations.
Consequently, the inability of the ERA5 background field to accurately replicate these extreme physical changes resulted in the abnormal bias observed in the autumn statistics. In contrast, the RK ZTD model successfully corrected these background errors by incorporating high-precision GNSS observations, which inherently contain this real-time meteorological information. This demonstrates the superior robustness of the proposed method even under such extreme weather conditions.
To further visually analyze the impact of the extreme weather event on the tropospheric delay,
Figure 11 depicts the time series of GNSS ZTD for the 10 modeling stations alongside the background validation stations from 26 October to 9 November. In this visualization, the time series of the 10 modeling stations are highlighted with distinct colored lines, whereas the data from the 26 validation stations are plotted in grey to form a background envelope. The distinct phases of the storm approach and departure are marked with vertical dashed lines. In addition, the mean ERA5 ZTD derived from the ERA5 ZTD grids at the 10 modeling station locations is plotted as a dashed blue curve to facilitate a direct comparison with the GNSS ZTD time series during the storm period. This specific plotting strategy serves to validate the spatial representativeness of the modeling network. By superimposing the trajectories of the modeling stations onto the background variability range defined by the validation stations,
Figure 11 demonstrates that the variation patterns of the modeling stations are highly consistent with the overall regional trend. This confirms that the selected 10 modeling stations effectively capture the dominant atmospheric dynamics and ZTD evolution characteristics of the entire study area, even under the volatile conditions of an extreme weather event.
As illustrated in
Figure 11, the GNSS ZTD time series exhibited a highly consistent variation pattern across all stations, indicating that the region was controlled by a large-scale weather system. Prior to the arrival of the storm, the GNSS ZTD values fluctuated at a relatively high level between 2360 mm and 2430 mm. However, coinciding with the approach of Storm Ciarán on 2 November, the ZTD values at all stations experienced a precipitous decline. Within a short period, the ZTD plummeted from approximately 2430 mm to a trough of around 2300 mm. This sharp V-shaped drop in the ZTD sequence is physically consistent with the drastic reduction in atmospheric pressure observed during the storm passage, as the Zenith Hydrostatic Delay component of ZTD is strictly proportional to the surface pressure. The dashed blue curve in
Figure 11 further shows that the ERA5 ZTD is generally higher than the GNSS-derived ZTD throughout this period, which is consistent with the systematic ERA5 overestimation identified at the EBRG station in
Figure 3 and the corresponding negative residuals. During the storm peak, the offset between the ERA5 mean curve and the GNSS station envelope becomes more evident at certain hours, indicating that the ERA5 ZTD grids do not fully reproduce the intensity and rapid evolution of the extreme event captured by the GNSS observations.
This extreme atmospheric variability provides the fundamental explanation for the large systematic bias and RMSE observed in the ERA5 ZTD during this specific period. The reanalysis data, limited by their spatial and temporal resolution, often fail to fully capture the extreme depth of the low-pressure center and the rapid rate of the pressure drop. Since the hydrostatic delay constitutes the majority of the total delay, even a small discrepancy in the modeled surface pressure by ERA5 leads to a significant systematic error in the calculated ZTD. Furthermore, the ZTD time series in
Figure 11 exhibits intensified high-frequency oscillations during the storm duration from 2 November to 4 November. These fluctuations reflect the intense turbulent transport of water vapor and the unstable atmospheric stratification. The ERA5 model tends to smooth out these local and rapid variations, whereas the GNSS observations sensitively record these real-time atmospheric dynamics. Consequently, the inability of the ERA5 background field to accurately replicate these extreme pressure drops and moisture anomalies resulted in the significant deviation from the GNSS truth values. This analysis confirms that the high-precision GNSS ZTD serves as a crucial reference for correcting numerical weather prediction models during extreme weather events.