1. Introduction
Coastal systems worldwide face significant threats from erosion and flooding caused by storm surges and rising sea levels [
1]. These hazards are projected to intensify in the coming decades as a consequence of climate change [
2,
3,
4,
5], posing severe risks to ecosystems, urban settlements, infrastructure, and economic activities [
6].
The maritime economy depends on a range of activities that manage and protect the ocean. Rising sea levels and increasingly intense meteorological events are direct consequences of climate change [
5]. From an oceanographic perspective, these phenomena result in higher wind waves occurring over a rising sea level. Coastal areas are the most vulnerable and stressed by this combination of factors. Collaborative efforts are urgent among governments, academia, and social and economic stakeholders to develop adaptation and resilience strategies that safeguard and promote the sustainable use of coastal regions.
On one hand, ports are among the most critical infrastructures exposed to these hazards. Ports serve as major hubs for international trade and are particularly sensitive to extreme water levels, which can compromise docking operations, flood protective barriers, and damage equipment [
7,
8,
9,
10]. Notable adaptation initiatives have been developed for New York [
11,
12] and London [
13], while in Spain, the Port Authority of the Balearic Islands has led pioneering efforts [
14]. Undoubtedly, there are numerous policy challenges associated with implementing effective resilience strategies in the port sector and the maritime industry [
15,
16].
On the other hand, shipyards are equally strategic, supporting ship construction, repair, and maintenance. Often located in low-lying coastal zones, they are highly vulnerable to sea level rise and storm-induced flooding. The Navantia shipyard in Puerto Real (Cádiz) is one of the most important shipbuilding facilities in southern Spain. Spanning more than one million square meters, it has the capacity to construct large bulk carriers (up to one million deadweight tons) in a privileged location. It is equipped with Europe’s widest dry dock (500 m long and 100 m wide), cranes capable of lifting 600 tons (190 m high), highly specialized workshops, and a robust surrounding industry (see [
17] and
https://www.navantia.es/ accesed on 1 July 2025). Nevertheless, the entire site contains a high concentration of critical infrastructure and is exposed to risks from storm surges, high tides, and long-term sea level rise.
The Extreme Value Analysis (EVA) is a fundamental tool for estimating long-term maximum sea levels and wave heights, aiding in the design of coastal defenses and adaptation strategies. According to [
18], EVA was originally introduced by Fisher and Tippett in 1928 [
19], with subsequent contributions from Fréchet [
20] and Gumbel [
21]. Fréchet established two of the probability density functions used in EVA, and Gumbel advanced the field by codifying and systematizing EVA methods. Three main distributions are typically considered in EVA: Fisher–Tippett I, Fisher–Tippett II (Fréchet), and Gumbel. These will be detailed in
Section 3.1.
Beyond the mathematical assessments of goodness of fit, two open questions remain. The first concerns identifying the most appropriate distribution for long-term projections, and the second relates to the physical reliability of estimating the parameters of extreme value distributions for a given model. Although some criteria exist to evaluate whether parameter estimates are plausible, the theoretical and numerical complexities involved are substantial. Among these, the three most easily implemented criteria are the Cramér–Rao Lower Bound (CRLB), the Akaike Information Criterion, and the Bayesian Information Criterion (BIC).
The CRLB was initially derived by [
22], with its proof detailed in [
23]; it was also independently obtained by [
20,
24]. The theorem states that if the estimation error of a parameter in a model attains its CRLB, then the estimator exists and can be both unbiased and of minimum variance. Conversely, if this bound is not met, the estimator is unreliable, and the model may be physically implausible despite the mathematical validity of the solution. Deriving the CRLB for model parameters provides an important link between statistical estimation and physical or engineering principles. The correct application of the CRLB involves computing the norm of the difference between the variance of the estimators and their respective CRLBs. The AIC was derived by Akaike [
25], and it has been applied in many fields with many recommendations [
26,
27]. The BIC was derived by Schwarz [
28], with the same applications as the AIC. While the AIC and BIC test the feasibility of a whole model, the CRLB tests every parameter. However, the AIC and BIC can always be computed, but not all models support a CRLB [
29].
This study investigates the risk of sea level rise and extreme waves affecting ports and shipyards along the Gulf of Cádiz and the Alborán Sea on the Spanish coasts, focusing on the return periods of 10, 25, 50, 75, and 100 years. We analyze time series sea level stations in Cádiz, Tarifa, Málaga, and Almería, as well as data from offshore wave buoys (
Figure 1). The sea level station at Cádiz port is operated by the Instituto Español de Oceanografía, and the other stations are managed by Puertos del Estado.
After conducting the EVA and computing the associated risk levels, the results were mapped onto a high-resolution digital elevation model developed by the authors using a 3D LiDAR scanner for the Navantia Puerto Real, the most significant shipyard in southern Spain. This model highlights the most vulnerable areas to flooding caused by sea level rise and storm surges, enabling the proposal of countermeasures aimed at extending the operational life of the shipyard in the face of rising sea levels.
The main contributions and findings of this study include: (i) the Extreme Value Analysis of sea level and wave heights; (ii) the derivation and computation of the CRLB, AIC, and BIC for the relevant models and parameters; (iii) the prediction of extreme values for different return periods based on physically plausible extreme value distributions; and (iv) the identification of potentially flooded areas within the Navantia Puerto Real shipyard, along with strategic recommendations for risk mitigation.
4. Results and Discussion
4.1. Extreme Value Analysis of Monthly Maxima
The time series of sea level and wave height measurements were processed following the methodology described in
Section 3.1. The resulting parameter estimates for the three probability density functions (PDFs)—Fisher–Tippett I (FTI), Fisher–Tippett II (FTII), and Gumbel (G)—are summarized in
Table 2 and
Table 3 for sea level stations (SLS) and WANA nodes (WN), respectively. Across all locations, correlation coefficients are high (typically above 0.95), and the standard errors of parameter estimates are one to three orders of magnitude smaller than the parameter values themselves. At first glance, this suggests that all three distributions could provide reliable models for predicting extreme values.
However, a closer examination reveals inconsistencies that make model selection non-trivial. For example, at the Almería sea level station, correlations are 0.9830 (FTI), 0.9637 (FTII), and 0.9806 (G). Based on correlation alone, the FTI or Gumbel distributions might appear preferable. Yet, when considering parameter errors and their physical plausibility, FTII could be equally valid. This highlights the need for more robust selection criteria beyond linear fit quality.
Similarly, for wave height data (
Table 3), several stations yield very high correlations for multiple distributions with low deviation. This complicates the decision of which model to adopt for long-term risk projections.
The questions raised earlier remain pertinent: should the analyst automatically select the distribution with the highest correlation? Or could an alternative criterion be more appropriate? In this context, it is essential to consider practical tools for deciding which distribution should be used to compute long-term risk levels. The CRLB, the AIC, and the BIC play a key role here.
4.2. Application of Selection Criteria to Sea Level Data
The Cramér–Rao Lower Bound (CRLB) for the parameters of the linear model in Equation (6) was derived in
Section 3.2, along with the AIC and the BIC for the same model. The model was adapted by treating the x-data as a numerical variable, following the EVA methodology. The application of the CRLB, AIC, and BIC is straightforward once the relevant quantities in Equations (10)–(13) are properly identified. Specifically,
N represents the number of data pairs in each analysis; the white noise variance
corresponds to the variance of the residuals obtained from the linear fit; and the variance of the estimators is calculated by squaring their standard deviations, as listed in
Table 2.
The CRLBs for the parameters of all regressions, along with the AIC and BIC values across the three sea level distributions, are presented in
Table 4. These results clearly show that the optimal distribution is not always the one with the highest correlation (see
Table 2). The CRLB highlights that an estimator, while mathematically valid, may not be physically feasible, and the AIC and BIC evaluate the overall quality of the model, not just the fit.
For Almería, both FTII and Gumbel distributions satisfy the CRLB, but only FTII achieves lower AIC and BIC values, suggesting it as the most appropriate choice despite its slightly lower correlation coefficient. For Málaga, none of the distributions fully meet the CRLB conditions (
Table 4). Nevertheless, the coefficient of variation, also referenced in
Table 2, exhibits the smallest coefficient of variation and lowest AIC and BIC values aligning with both criteria. A similar situation occurs at the Tarifa station with FTII. Conversely, at the Cádiz station, all three distributions conform to the CRLB, making it impossible to select a definitive best fit based solely on this criterion, but the BIC and AIC point to Gumbel as the preferred model.
These results demonstrate that selecting the distribution with the highest correlation coefficient may lead to physically implausible or suboptimal predictions. Incorporating the CRLB, AIC, and BIC improves the reliability of model selection, although some ambiguity may remain when the criteria are inconsistent.
4.3. Application to Wind Waves Data
The same procedure was applied to wave height data from the WANA nodes. Now the values for the computation of the CRLB, AIC, and BIC are taken from
Table 3, and the results are presented in
Table 5 similarly to the above.
In several cases, more than one distribution meets the CRLB, making the choice ambiguous. However, FTI generally provides the smallest difference between the variance of slope estimators and their CRLB, suggesting it as the most physically robust distribution.
Interestingly, in some cases, Gumbel distributions produce unrealistic extrapolated values despite having high correlations. This reinforces the importance of evaluating physical plausibility, not just statistical fit.
4.4. Predicted Extreme Sea Levels
The CRLB cannot be applied as usual, as in [
29], for comparison with the AIC and BIC. While AIC and BIC focus on the model, the CRLB is on each parameter. In this study, the authors propose evaluating the CRLB by considering the distance for the most sensitive parameter, the slope, defined as ||(
-CRLB[B])||. The preferred distribution is the one with the largest value of this difference. For sea level data, the largest deviation from the CRLB is observed for the Gumbel distribution at the Málaga, Tarifa, and Cádiz stations and for the FTII distribution at Almería. Based on the results in
Table 4, which indicate whether each distribution satisfies the CRLB condition, it is evident that selecting an inappropriate distribution can lead to significantly erroneous predictions. According to the AIC and BIC, FTII is selected for Almería and Tarifa, while Gumbel is selected for Málaga and Cádiz.
The computation of risk levels across all distributions provides valuable insight into the consequences of choosing an unsuitable model.
Table 6 presents the predicted extreme values for the return periods of 10, 25, 50, 75, and 100 years for sea level. For example, if the FTII distribution is selected for the Almería station (
Table 6), it yields nearly flat estimates of approximately 52 cm. The FTI distribution produces similarly flat results, whereas the Gumbel distribution provides more realistic increasing estimates and is therefore the most appropriate. Likewise, at Málaga, both FTI and FTII predict decreasing extreme values, while Gumbel offers more plausible projections. Tarifa shows a similar behavior, while at Cádiz, the FTI distribution produces the best estimates.
It is noteworthy that the AIC and BIC criteria provide correct model selection for Málaga and Cádiz, while the CRLB favors Gumbel for Málaga, Tarifa, and Cádiz. For Almería, none of the criteria correctly identifies the most appropriate distribution, highlighting the limitations of existing selection methods.
4.5. Predicted Extreme Wave Heights
The same criteria were applied to the predictions of extreme wave heights at the WANA nodes (
Table 7). For example, at the Alborán station (located between Málaga and Tarifa),
Table 5 suggests that the Gumbel distribution should be chosen. However, the extreme value predictions presented in
Table 7 are inconsistent, indicating that the FTI distribution provides more realist results. This demonstrates that the criterion based on the minimum difference ||(
-CRLB[B])|| is more reliable than the approaches outlined in Equations (10) and (11).
Our analysis shows that the most appropriate distribution is the one that minimizes this difference. This behavior is observed consistently for the FTI distribution across all WN stations (
Table 5). Once again, computing the risk levels for all distributions and return periods proves to be essential for understanding the potential consequences of selecting an unsuitable model. The exact reason why these inconsistencies arise remains unclear and constitutes an important subject for future research.
4.6. Potential Flooded Areas: An Application on Navantia Puerto Real
The authors used a LiDAR 3D scanner to build a digital elevation model (DEM) of the Navantia Puerto Real shipyard. Height data were referenced to the Spanish National Geodetic Network and subsequently to the mean sea level.
Figure 3a shows a modified panoramic Google Maps image highlighting (1) the 500 m long and 100 m wide dry dock equipped with two heavy-duty overhead cranes of 190 m in height; (2) the historical area containing the oldest facilities, including a museum that preserves valuable cultural and industrial heritage; and (3) the auxiliary workshops located along the dock.
Figure 3b presents the plan view of the shipyard.
Considering that (i) the inverted barometer effect—caused by a difference of approximately 40 mb in atmospheric pressure—can generate an increase of about 40 cm in the water column [
36]; (ii) the highest recorded wave height in nearby ports is approximately 1.80 m [
37]; and (iii) spring tides can reach up to 3 m [
38], adding these contributions to the predicted sea levels from
Table 6 allows for the estimation of potential flood levels.
Figure 4 illustrates the resulting flooded areas for return periods of 10, 25, and 75 years.
From an operational perspective, the most affected area is the dry dock and its surroundings, where specialized workshops and module assembly facilities are concentrated (Areas 1 and 3,
Figure 3 and
Figure 4). Flooding in these areas would likely occur within the first 10 years (
Figure 4a). Over time, sea level rise would expand the flooded area to approximately 23% of the shipyard’s total surface, also affecting the historical area, which holds significant technological heritage.
To ensure the continuity of shipyard operations in the coming decades, several mitigation measures are proposed. While relocating the entire facility to a non-flood-prone site would eliminate long-term risks, this solution is economically and logistically unfeasible and could lead to client losses. A more practical approach involves the construction of perimeter flood protection walls set back slightly from the pier edge (e.g., 0.2 m). The proposed walls would be approximately 0.5 m high and of similar width, allowing for pedestrian seating while resting on the dock wall and incorporating waterproofing to minimize leaks.
Additionally, a stormwater collection pool should be built to capture overtopping waves, rainfall, or seepage. The pool should be designed to handle a flow rate of at least 60 mm of rainfall over one hour, with reinforced concrete walls and a depth not exceeding 3 m to avoid siphoning risks. A pumping system would discharge collected water back to the sea, powered by the electrical grid with an emergency generator as backup. The cost of constructing the wall, including waterproofing, is estimated at approximately EUR 110 per linear meter. For the pool and pumping system, the estimated cost is about EUR 23,000 per hectare (EUR 15,000 for construction, EUR 5000 for the pump, and EUR 3000 for the generator).
Finally, elevating critical installations, such as generators and electrical systems, to at least + 1.50 m above the current ground level using structural platforms is recommended. This measure is relatively inexpensive given Navantia’s consolidated turnover of approximately EUR 1528 million in 2024 (with ~EUR 584 M from shipbuilding, ~EUR 553 M from repairs, and ~EUR 248 M from offshore wind projects). Considering that mitigation costs are projected to represent only 10–20% of annual profits, the proposed measures are economically feasible while ensuring long-term operational resilience.
5. Conclusions
The long-term analysis of oceanographic variables, such as sea level and wave heights, is a complex task typically addressed through EVA. Several extreme value distributions must be considered, and linear fits often produce very high correlation coefficients and small parameter estimation errors. However, selecting the most appropriate distribution for predicting extremes requires scientific and engineering judgment rather than relying solely on statistical fit quality.
The main contributions of this work can be summarized as follows: (1) The study demonstrates that the distribution with the highest correlation coefficient is not necessarily the most suitable for extreme value predictions; (2) an additional statistical tool is needed to support the automatic identification of the most appropriate extreme value distribution. In this study, the Cramér–Rao Lower Bound (CRLB), Akaike Information Criterion (AIC), and Bayesian Information Criterion (BIC) have been evaluated; (3) applying the Cramér–Rao Lower Bound (CRLB) in its traditional form does not always lead to conclusive results; instead, the distance between the variance of the estimators and the CRLB threshold offers more insight; (4) discrepancies were observed between the CRLB and the well-established AIC and BIC criteria, indicating that none of them can yet be considered a fully robust method for selecting the optimal distribution; and (5) the methodological analysis of the digital elevation model of the Navantia Puerto Real shipyard has identified its most vulnerable areas, supporting the development of a practical resilience plan. Rather than relocating the entire facility, constructing a flood protection wall with a water collection pool and pumping system appears to be the most feasible solution.
Finally, future work will extend the digital elevation model to include ports, small shipyards, and beaches along the southern Spanish coast (Andalusia), aiming to provide comprehensive risk assessments for rising sea levels and extreme wave events in the region.