Optimizing Intermodal Port–Inland Hub Systems in Spain: A Capacitated Multiple-Allocation Model for Strategic and Sustainable Freight Planning

Retamero, José Moyano; Orive, Alberto Camarero

doi:10.3390/jmse13071301

Open AccessArticle

Optimizing Intermodal Port–Inland Hub Systems in Spain: A Capacitated Multiple-Allocation Model for Strategic and Sustainable Freight Planning

by

José Moyano Retamero

^1,2

and

Alberto Camarero Orive

^2,*

¹

Málaga Port Authority, 29016 Málaga, Spain

²

Department of Transport, Territorial and Urban Planning Engineering, Technical University of Madrid, 28040 Madrid, Spain

^*

Author to whom correspondence should be addressed.

J. Mar. Sci. Eng. 2025, 13(7), 1301; https://doi.org/10.3390/jmse13071301

Submission received: 28 May 2025 / Revised: 29 June 2025 / Accepted: 29 June 2025 / Published: 2 July 2025

(This article belongs to the Section Coastal Engineering)

Download

Browse Figures

Versions Notes

Abstract

This paper presents an enhanced hub location model tailored to port–hinterland logistics planning, grounded in the Capacitated Multiple-Allocation Hub Location Problem (CMAHLP). The formulation incorporates nonlinear cost structures, hub-specific operating costs, adaptive capacity constraints, and a feasibility condition based on the Social Net Present Value (NPVsocial) to support the design of intermodal freight networks under asymmetric spatial and socio-environmental conditions. The empirical case focuses on Spain, leveraging its strategic position between Asia, North Africa, and Europe. The model includes four major ports—Barcelona, Valencia, Málaga, and Algeciras—as intermodal gateways connected to the 47 provinces of peninsular Spain through calibrated cost matrices based on real distances and mode-specific road and rail costs. A Genetic Algorithm is applied to evaluate 120 scenarios, varying the number of active hubs (4, 6, 8, 10, 12), transshipment discounts (α = 0.2 and 1.0), and internal parameters. The most efficient configuration involved 300 generations, 150 individuals, a crossover rate of 0.85, and a mutation rate of 0.40. The algorithm integrates guided mutation, elitist reinsertion, and local search on the top 15% of individuals. Results confirm the central role of Madrid, Valencia, and Barcelona, frequently accompanied by high-performance inland hubs such as Málaga, Córdoba, Jaén, Palencia, León, and Zaragoza. Cities with active ports such as Cartagena, Seville, and Alicante appear in several of the most efficient network configurations. Their recurring presence underscores the strategic role of inland hubs located near seaports in supporting logistical cohesion and operational resilience across the system. The COVID-19 crisis, the Suez Canal incident, and the persistent tensions in the Red Sea have made clear the fragility of traditional freight corridors linking Asia and Europe. These shocks have brought renewed strategic attention to southern Spain—particularly the Mediterranean and Andalusian axes—as viable alternatives that offer both geographic and intermodal advantages. In this evolving context, the contribution of southern hubs gains further support through strong system-wide performance indicators such as entropy, cluster diversity, and Pareto efficiency, which allow for the assessment of spatial balance, structural robustness, and optimal trade-offs in intermodal freight planning. Southern hubs, particularly in coordination with North African partners, are poised to gain prominence in an emerging Euro–Maghreb logistics interface that demands a territorial balance and resilient port–hinterland integration.

Keywords:

intermodal hub location; maritime–terrestrial logistics; capacitated hub networks; sustainable freight networks; nearshoring; Euro–Maghreb trade; genetic algorithm

1. Introduction

Hub location problems arise when large volumes of traffic—whether goods, postal and parcel services, telecommunications, or passengers—must be transported from origin to destination. In such contexts, establishing a direct connection for every origin–destination (O–D) pair is rarely feasible due to cost, operational, and environmental constraints. Logistics hubs offer a structural solution: they act as spatial and functional anchors, where flows are consolidated, redirected, and in many cases transferred between modes. Their core function is not simply transshipment but the orchestration of freight movements to enable economies of scale, reduce duplication, and support modal shift. In intermodal systems, this involves configuring the network so that long-haul segments—ideally rail or maritime—are maximally utilized while last-mile access is managed through flexible road services. This logic is particularly critical in port–hinterland systems, where maritime gateways and inland hubs must operate as a coherent logistics unit to absorb increasing demand, facilitate modal rebalancing, and reduce congestion at seaports. The spatial configuration and operational performance of these hubs—rather than their mere presence—ultimately determine the system’s efficiency and sustainability.

The Hub Location Problem (HLP) has evolved from its early theoretical formulation by [1] into a cornerstone of transportation network design. Over the last two decades, a new wave of the literature has extended its scope by incorporating capacity constraints, multimodal allocation rules, and socio-environmental performance metrics. Recent works, such as [2], reflect this shift from purely structural optimization toward more applied and sustainability-oriented approaches, particularly suited to intermodal and maritime–terrestrial systems.

In this article, we develop and test an enhanced Capacitated Multiple-Allocation Hub Location Problem (CMAHLP) model to determine the optimal positioning of inland consolidation mega-hubs within Spain’s intermodal logistics network. The model captures both road and rail connectivity from four key maritime ports—Barcelona, Valencia, Málaga, and Algeciras—which act as gateways for inland freight distribution. Compared to previous CMAHLP approaches, our formulation introduces nonlinear congestion penalties, dynamic operating cost functions, and a feasibility constraint based on the Social Net Present Value (NPVsocial), ensuring long-term viability and consistency with EU Green Deal objectives. To achieve this, the model integrates structural innovations in cost representation, intermodal assignment logic, and sustainability evaluation.

While the transportation cost component in our model draws on the general structure of classical hub location formulations [3,4], this work introduces several novel elements that have not been integrated into previous CMAHLP approaches. First, the model incorporates an intermodal transport cost function based on Janic’s framework, which evaluates both internal and external costs per unit of freight for road and rail transport. This enables a realistic assessment of road–rail competitiveness using a discrete mode selection mechanism for each O–D pair. Second, it integrates original functions for dynamic hub operating costs and investment penalties that depend on land footprint, flow volumes, and scale effects. Finally, a feasibility constraint based on the Social Net Present Value (NPVsocial) filters out configurations that do not deliver long-term socio-environmental returns. Together, these features define a hybrid CMAHLP formulation explicitly aligned with the goals of strategic and sustainable freight planning in intermodal networks.

The model allows each origin or destination node to connect to multiple hubs, distributing flows and reducing the risk of overload at any single node. This structure increases operational flexibility and network resilience. Crucially, the model incorporates the full cost structure of intermodal logistics. It includes (i) transport costs based on Janic’s method, accounting for internal and external costs of road and rail; (ii) investment costs for infrastructure deployment; (iii) operating and management costs associated with day-to-day terminal use; and (iv) a feasibility constraint based on NPVsocial > 0, which discards solutions that do not generate positive socio-environmental returns over time. These elements define the scope and logic of the CMAHLP model.

To ground the model in real conditions, we built an intermodal cost matrix using observed distances and calibrated mode-specific cost functions. This quantitative foundation supports hub assignment and enables the testing of plausible network scenarios under current and projected freight patterns.

Hub location models are fundamentally flow-based. Their goal is to optimize hub placement while determining O–D movements [5], independently of vehicle routing or detailed operations. This focus allows for high-level strategic network design that minimizes global costs while maximizing infrastructure performance. In practical terms, the consolidation of shipments reduces partially loaded trips, lowers the number of direct O–D links required, and improves scale economies. It also enables route flexibility, as cargo can be dynamically rerouted in response to changing conditions—traffic, constraints, or resource availability. While intermediate hubs add a reorganization step, they often reduce total travel time by enabling more direct routing for the final leg. Furthermore, hubs facilitate intermodal transfers—especially among road, rail, and maritime modes—supporting operational efficiency and environmental benefits.

From a graph-theoretic standpoint, an HLP is modeled as a network of nodes (Ns) connected by arcs (As), where each spoke represents a demand or supply node, and arcs correspond to multimodal transport infrastructure. In our case, the network includes ports, inland cities, and provinces as nodes linked by road, rail, and maritime connections. This representation provides the spatial and functional basis for optimizing hub placement and flow assignments within the CMAHLP framework.

The analysis integrates hub allocation and network flow optimization. Hubs are selected not only to reduce the total number of required links but to structure consolidated flows along high-capacity corridors, under modal and capacity constraints. Our study builds on classical CMAHLP logic while addressing operational realism and sustainability gaps. While earlier efforts like [4] incorporated capacity and infrastructure costs, recent works such as [6] still fall short in balancing territorial equity, economic viability, and environmental performance.

The model employs a disaggregated and measurable cost structure. Intermodal transport costs are quantified using calibrated mode-specific functions for road and rail, incorporating both internal and external components. Infrastructure investment is derived from observed parameters such as land availability, existing land connectivity, and construction unit costs. Operating costs are driven by three factors: economies of scale, congestion penalties based on convex functions, and storage or inventory capacity. Feasibility is enforced through a Social Net Present Value constraint (NPV_social > 0, which discards configurations lacking long-term socio-environmental returns based on monetized externalities and territorial performance.

Embedded in the CMAHLP framework, the model serves as a decision-support tool for infrastructure planning under real-world economic, spatial, and environmental constraints. A tailored genetic algorithm explores feasible configurations, combining guided mutation, adaptive elitism, and local search. Beyond cost minimization, it applies filters aligned with operational and policy criteria, eliminating scenarios that generate excessive hub concentration, modal imbalance, or inefficient land use. This ensures that only robust and spatially coherent solutions are retained—those consistent with long-term infrastructure viability and sustainable network development. The model supports a transition toward regionalized, proximity-based logistics systems, reinforcing Spain’s position as a core logistics hub in the EU and a strategic interface with North Africa within the evolving Euro–Maghreb corridor.

2. Literature Review: The Capacitated Multiple-Allocation Hub Location Problem (CMAHLP)

This section presents a cohesive literature review on the Capacitated Multiple-Allocation Hub Location Problem (CMAHLP), outlining its evolution, key constraints, and relevance to intermodal freight planning. It highlights the methodological advances and limitations of previous models, setting the foundation for the new formulation developed in this study.

The Capacitated Multiple-Allocation Hub Location Problem (CMAHLP) builds upon the classical family of hub location models, extending their scope to incorporate capacity restrictions at hubs, multi-hub assignments, and a more realistic cost structure that includes fixed infrastructure investments and variable operational expenditures.

2.1. From Single-Allocation to Multiple-Allocation Hub Models

The earliest hub formulations assumed a single allocation, where each spoke is connected to a single hub [7]. Early models focused primarily on minimizing total transport cost, introducing fixed costs for hub establishment and basic binary decision variables for hub selection and spoke–hub assignments. Extensions by [8] in the 1990s introduced the multiple-allocation structure, allowing spokes to route through several hubs and leveraging inter-hub economies of scale via discount factors.

These classical models led to the family of p-hub median problems, which became benchmarks in location science. They assumed idealized-routing constant-hub performance and mostly ignored implementation issues such as infrastructure investment or dynamic operating costs. While mathematically elegant, their real-world utility remained limited to conceptual insights.

2.2. Incorporating Capacity and Investment Constraints

The transition to capacity-constrained models marked a turning point. Studies [4,9,10,11,12,13,14] introduced formulations that incorporated hub capacity limits and infrastructure investment, adding practical realism. Subsequent studies, such as [13], focused on defining maximum throughput per hub and adjusted objective functions to capture infrastructure costs.

However, these models still treated operating costs and efficiency as static. They also failed to address route congestion, multimodal coordination, or socioeconomic feasibility factors critical to intermodal infrastructure design.

2.3. Realism Through Congestion and Operating Costs

In real freight systems, terminals do not perform uniformly. When they are saturated, delays, inefficiencies, and extra costs pile up. To reflect this, ref. [15] introduced a nonlinear congestion penalty into hub location models—marking one of the first serious attempts to bring operational realism into the picture. Their formulation linked cost not just to distance or allocation patterns but to the actual stress a hub experiences when demand exceeds capacity.

Still, despite that advance, most CMAHLP models remain focused on abstract cost minimization. They rarely include criteria like environmental feasibility, regional equity, or investment-grade constraints—all essential today when infrastructure planning must align with long-term sustainability goals.

2.4. Dynamic and Metaheuristic CMAHLP Formulations: Limits and Developments

Dynamic models brought time into the picture; [16] proposed multi-period hub location formulations that allowed the opening and closing of hubs across planning horizons. Though temporally flexible, these models were often uncapacitated and ignored operational realism.

Metaheuristic approaches, especially Genetic Algorithms (GAs) and NSGA-II variants have become prevalent for solving complex instances; [6] introduced a bi-objective load-balancing CMAHLP using evolutionary optimization. While effective in balancing network loads, many such models still lack realism in cost structures, congestion, or modal coordination.

These gaps are increasingly relevant in the context of the Green Deal, which emphasizes long-term sustainability and modal shift. Most dynamic and metaheuristic CMAHLP formulations fail to fully address the multi-objective nature of current transport planning, including emissions reduction and inclusive territorial coverage.

2.5. Recent Contributions and Computational Benchmarks

Several recent studies have focused on benchmarking and comparing MILP formulations for hub location problems. Among them, ref. [17] evaluated both flow-based and path-based approaches using CPLEX, focusing on computational performance and linear relaxation properties.

Flow-based models define variables over origin–destination–hub combinations and minimize aggregate cost. Path-based models rely on feasible routing paths and emphasize relaxation strength. While both formulations are relevant, neither adequately incorporates investment logic or sustainability metrics, limiting their use in policy-sensitive environments like Spain’s port–hinterland system.

Recent work by [18] also underlines how most formulations, even when robust computationally, still fall short in integrating socio-environmental and territorial equity dimensions—especially critical in multimodal corridors subject to Green Deal goals. These limitations hinder their usefulness for guiding real-world infrastructure investment aligned with EU sustainability frameworks.

2.6. Summary and Rationale

Progress in CMAHLP research has been substantial in terms of modeling logic and computational power. However, no single model yet integrates all key dimensions needed for policy-grade logistics design: dynamic costs, congestion effects, intermodal coverage, and sustainability constraints.

The model proposed in this study addresses these deficits by incorporating real logistics costs, dynamic hub operations, congestion penalties, and a feasibility condition based on Social Net Present Value (NPV_social). It also considers coordination across maritime and inland modes, supporting applications under the EU Green Deal and Euro–Maghreb trade strategies. Table 1 summarizes the main features of each CMAHLP approach and situates this study’s proposal in that evolution.

Table 1 provides a comparative summary of key CMAHLP-based formulations, highlighting how each model addresses capacity, cost, congestion, balancing, and sustainability criteria. The proposed model is positioned as a hybrid framework that explicitly integrates operational and socio-environmental dimensions into maritime–terrestrial hub network planning.

Section 3 introduces a hybrid framework grounded in classical hub location structures, adapted to the specific realities of maritime–terrestrial intermodal planning. Building on these developments, a revised CMAHLP formulation is proposed to address current challenges related to sustainability, intermodality, and territorial resilience.

3. Advancing CMAHLP for Maritime–Terrestrial Logistics: A Multi-Criteria and Sustainability-Based Perspective

3.1. Methodological Enhancements to Classical CMAHLP Models

The methodology developed in this study expands upon classical CMAHLP formulations by introducing key improvements that address the complexity of real-world maritime–terrestrial logistics systems. Foundational contributions, such as those by [8,19] and subsequently [4,20], incorporated fixed infrastructure costs and basic capacity constraints, offering critical theoretical grounding. However, such approaches remain limited in their ability to account for spatial heterogeneity, dynamic operational behavior, and socio-environmental feasibility in national-scale planning. The model is specifically designed for port–hinterland systems, where freight flows are increasingly shaped by intermodal corridors, saturation risks, and policy-driven sustainability objectives.

To overcome these limitations, the proposed model adopts a multi-criteria framework that simultaneously optimizes economic efficiency, operational realism, and long-term sustainability. Specifically, it includes the following:

Door-to-door transport costs disaggregated into collection, transfer, and distribution stages, capturing the full logistics chain from unimodal to intermodal operations.
Variable operating costs per hub reflect congestion, scale effects, and inventory management.
Investment costs are adjusted by territorial factors such as land value, connectivity, and urban density.
A hard constraint based on the Social Net Present Value (NPV_social), ensuring the socio-environmental viability of each hub alternative.

This formulation supports intermodal decision-making under regional asymmetries, emerging logistics corridors, and sustainability mandates, as found in European initiatives like the EU Green Deal and the TEN-T network.

In our case, we aim to spatially locate inland hubs that connect the main terrestrial corridors with four key maritime gateways—Algeciras, Málaga, Valencia, and Barcelona—which channel international freight flows into the hinterland. These port–interiors interactions are central to the model’s intermodal logic. Unlike traditional CMAHLP schemes, such as the center-of-mass approach used by [4], which assume symmetric systems where the infrastructure cost depends primarily on centrality, our model reflects a more complex reality. Hub viability here is driven by factors such as land availability, multimodal connectivity, local economic context, environmental constraints, and territorial policy incentives.

Similarly, alternative cost structures based on flow-dependent infrastructure needs, while more flexible, still neglect economies of scale, nonlinear operating behaviors, and environmental impact. For instance, they fail to model phenomena such as hub saturation or the cost implications of managing large volumes of cargo over long timeframes.

To address these omissions, our model introduces a dynamic cost term in the objective function that reflects hub operations. At the same time, the classical fixed cost structure from [4,7,21] is redefined through a territorial investment analysis structured around hub-specific land pricing, infrastructure complexity, and projected flow.

Most importantly, we embed a sustainability constraint based on the Social Net Present Value [22], which evaluates the long-term balance of environmental, social, and economic returns associated with each hub configuration. This allows us to formally include sustainability as a viability condition—not merely as an external indicator.

\min \sum_{i} \sum_{k} \sum_{l} (χ \cdot C_{i k} + α \cdot C_{k l}) \cdot Y_{k l}^{i} + \sum_{i} \sum_{j} \sum_{l} δ \cdot C_{l j} \cdot X_{l j}^{i} + \sum_{k} I_{k} \cdot H_{k} + \sum_{k} F_{k} \cdot H_{k}

(1)

Net Present Social Value of Hub Selection

{N P V}_{s o c i a l, k} \geq 0 \forall k

(2)

The flow allocation constraint ensures that each hub “l” receives traffic from at least one hub “k” for every origin node “I”.

\sum_{k} Y_{k l}^{i} \geq 1 \forall i, l \in N

(3)

Flow Capacity Constraint (Non-Saturation Condition)

\sum_{i} \sum_{j} Y_{k l}^{i} \leq λ_{s c a l e} λ_{o v e r} λ_{m a r} Γ_{k} Y_{k} \forall k \in N

(4)

Flow Balancing Constraint

\sum_{i} \sum_{k} Y_{k l}^{i} \leq \sum_{i} \sum_{j} W_{i j} (\frac{Γ_{k} H_{k}}{\sum_{m} Γ_{m} H_{m}}) \forall l \in N

(5)

Binary Variable Constraint for Hubs

H_{k} \in {0,1} \forall k \in N

(6)

Flow Positivity Constraint

X_{l j}^{i}, Y_{k l}^{i} \geq 0 \forall i, j, k, l \in N

(7)

Flow Conservation Constraint

\sum_{k} Y_{k l}^{i} = \sum_{j} X_{l j}^{i} \forall i, l

(8)

This methodological proposal constitutes a significant advancement over traditional CMAHLP frameworks by bridging operational, territorial, and sustainability dimensions into a unified formulation. Its novelty lies not in discarding classical structures but in reinterpreting them through the lens of intermodal realism, regional diversity, and long-term feasibility. By embedding environmental and social constraints directly into the optimization logic and calibrating costs from real infrastructure, logistics, and territorial data, the model becomes a practical decision-support tool capable of guiding public and private investment strategies in complex maritime–terrestrial systems like Spain’s. Unlike in most applications, NPV_social is not evaluated post-optimization but embedded as a feasibility constraint, acting as a hard filter within the solution space.

3.2. Intermodal Freight Cost Formulation for Inland and Port-Connected Nodes

The transport cost formulation implemented in this study builds on the classical three-leg decomposition typical of Capacitated Multiple-Allocation Hub Location Problems (CMAHLP), as defined by [19] and extended by [8]. In our case, however, the model is adapted to the operational conditions of a real-world maritime–terrestrial logistics system that spans both inland urban areas and coastal seaports, interconnected via road and rail infrastructure.

Freight Interaction Matrix and Node Typology. The system comprises 47 major nodes across Spain, including inland metropolitan areas, dry ports, and commercial seaports. It also includes strategic nodes like Algeciras, Málaga, Valencia, and Barcelona—ports directly connected to high-capacity rail freight motorways. Nonetheless, the system also includes other key maritime gateways (e.g., A Coruña, Bilbao, Santander, Tarragona, Almeria, Huelva, etc.), ensuring intermodal and territorial diversity. All nodes, however, are treated as logistically equivalent in terms of origin–destination flow potential, regardless of port status. This ensures that the model does not introduce structural biases favoring coastal over inland regions.

Three-Leg Decomposition of Generalized Transport Cost. The cost for freight flow from origin node “I” to destination node “j” through hubs “k” and “m” is decomposed into collection, inter-hub transfer, and distribution legs. This formulation maintains classical structure while enabling calibration under real territorial and modal conditions.

C_{i j k m} = χ {\cdot C}_{i k} + α {\cdot C}_{k m} + δ {\cdot C}_{m j} \forall i, j, k, m \in N

(9)

where

C_ik, C_km, C_mj: cost of each leg (collection, inter-hub transfer, and distribution).

χ, α, δ: Cost multipliers for collection, transfer, and distribution legs, respectively.

Mode Selection Logic for Road–Rail Competitiveness. To accurately represent inland mode competition, the model includes a binary selection mechanism that compares generalized costs of road and rail transport for each O–D pair. The transport cost C_ij between any two nodes is calculated as:

C_{i j} = \min \{{C o s t}_{r o a d}, {C o s t}_{r a i l}\} + C_{t r a n s f e r}^{(p)}

(10)

where

Cost_road = D_ij ∙c_r.

Cost_rail = D_ij ∙c_ra + rail terminal fee.

C_{t r a n s f e r}^{(p)}

: accounts for port-side handling and pre-haulage, capturing operations such as container loading, storage, and administrative processing.

This formulation does not merely reflect cost differences; it reveals strategic corridors where rail may outperform the road, especially when connected to ports with dedicated freight infrastructure.

Figure 1 shows that beyond 550–600 km, intermodal transport becomes cost-competitive with full-road haulage, reinforcing the strategic role of long-distance corridors connected to rail-accessible ports. Parameter values and assumptions used in cost matrices are provided in Appendix A.

This structure is based on the generalized intermodal cost models proposed by [23,24], both of which have been validated for capturing trade-offs in freight mode selection under cost-competitive conditions. The transport cost matrix is derived from:

Real intercity distances D_ij for all node pairs.
Mode-specific cost coefficients C_r and C_ra and port-specific terminal charges.
Fixed handling charges for each seaport node with multimodal infrastructure.

This cost formulation, while not explicitly modeling maritime legs, is particularly relevant for port authorities and logistics planners seeking to optimize inland freight distribution from coastal gateways under real intermodal conditions. By focusing on terrestrial transport costs—including both road and rail—from a full set of national logistics nodes (comprising cities and ports), the model captures the inland continuation of seaborne freight flows without distorting the relative accessibility of inland hubs. This is especially applicable in cases such as Algeciras, Málaga, Valencia, and Barcelona, which are connected to key hinterland hubs via dedicated rail corridors.

This transport cost structure enables a robust evaluation of inland and coastal hubs based on their multimodal performance rather than geographic centrality alone. It supports comparative scenario testing on modal shifts, intermodal competitiveness, and port–hinterland integration. In this way, it aligns with the key objectives of the EU Green Deal and TEN-T corridors, reinforcing the relevance of intermodal planning for Spain’s southern logistics arc.

While the model remains generalizable, its structure is particularly tailored for intermodal logistics systems with dense port–hinterland interactions—such as those found in Spain—where maritime gateways and inland hubs are increasingly integrated through rail corridors, pre-haulage services, and logistics activity zones. Its full calibration and national application, including port–hinterland dynamics, are described in the Results section.

3.3. Dynamic Operating Costs ( $I_{k}$ )

A new term is introduced to incorporate operating costs for each hub in the network (

I_{k}

), allowing for a more realistic representation of the relationship between hub capacity and its effective utilization. It is important to note that operating and maintenance costs can exceed the annual amortization of infrastructure in most dry ports or logistics activity zones. These costs depend on factors such as economies of scale, overcapacity analysis, and storage and inventory management costs, which vary according to the operational characteristics of each hub.

This dynamic formulation captures the nonlinear cost behavior arising from freight flows and their spatial concentration, improving the model’s accuracy compared to traditional cost models based on fixed infrastructure parameters [4,8,13].

Economies of scale are represented by a logarithmic function that adjusts costs based on hub capacity and processed demand. As a hub manages a higher traffic volume, it benefits from lower unit costs due to operational efficiency and better infrastructure utilization. The relationship between hub capacity and managed demand introduces an adjustment coefficient, reflecting the accumulated experience of the node, ensuring that costs are lower for high-capacity hubs with a more extended operational history [25].

λ_{K}^{s c a l e} = 1 + θ_{s c a} \cdot l o g (\frac{Γ_{k}}{d_{k} + ϵ})

(11)

where

θ_{s c a}

is an adjustment coefficient, ϵ is a stability parameter that prevents indeterminacies, and d_k represents the demand processed by hub “k”.

The overload penalty is introduced to capture the negative impact of hub saturation. In this sense, when demand is close to exceeding the assigned capacity, the system imposes an additional nonlinear cost, progressively increasing the penalty as congestion intensifies. This pattern reflects the rise in waiting times, resource congestion, etc., and reduced efficiency when a hub operates beyond its optimal capacity. The penalty function is convex, incorporating a sensitivity parameter that controls the severity of the penalty, allowing for calibration based on the actual impact of congestion on logistics performance. The effect of congestion on hubs has been extensively studied in optimization models, where it has been demonstrated that including nonlinear congestion costs improves system stability and efficiency, preventing the oversaturation of key nodes [15].

λ_{k}^{o v e r} = \{\begin{matrix} 1, d_{k} \leq Γ_{k} \\ 1 + θ_{o v e r} \cdot {(\frac{d_{k}}{Γ_{k}} - 1)}^{γ_{o v e r}}, d_{k} > Γ_{k}, λ_{k}^{o v e r} > 1 \end{matrix}

(12)

This method penalizes already congested hubs and discourages the selection of hubs that are close to their capacity limit. Anticipating congestion issues before they occur enhances model stability and prevents network collapse due to congestion in future iterations [26].

Storage, management, and inventory maintenance costs reflect the impact of accumulated flow in hubs and the necessity of efficiently managing logistics resources. These costs can represent a considerable proportion of total logistics expenses [27]. Due to the operational nature of logistics hubs, storage costs increase with the volume of cargo processed. However, the relationship between these costs and demand is not linear, as hub infrastructure allows for the absorption of a specific volume without proportionally increasing expenses [28]. To capture this dynamic, we propose an adjustment coefficient that models the impact of storage as a function of hub capacity and managed cargo volume.

λ_{K}^{s t o r e} = (1 + θ_{s t o r e} \cdot \frac{β_{s t o r e} \cdot Γ_{k}}{d_{m a x}})

(13)

where

θ_{s t o r e}

is an adjustment coefficient that captures the effect of storage on operational costs.

Unlike the geographic centrality model of [4], the proposed approach defines hub roles based on operational management principles, consistent with recent formulations that incorporate dynamic capacity management and load-balancing approaches [6,16]. The cost structure reflects capacity utilization, operational efficiency, economies of scale, and marginal cargo handling costs within the intermodal logistics network; but rather on the operational management of hubs, we redefine the concept to align it with the actual cost structure in a system based on capacity, operational efficiency, economies of scale, and the marginal costs of the cargo handled at each hub within the intermodal logistics network.

Where

I_{0}

represents the economic impact of hub allocation within the network and allows for the adjustment of logistics costs by considering transportation, storage, and efficient distribution.

I_{0} = \frac{\sum_{i, j} W_{i, j} \cdot C_{i, j} - \sum_{k, l} Y_{k l} \cdot C_{k l} + \sum_{k} S_{k} \cdot H_{k}}{N_{h}}

(14)

I₀ represents the average infrastructure-related cost per hub across the network. It aggregates total transport costs, inter-hub transfer penalties, and fixed storage investments, normalized by the number of active hubs (N_h). This value adjusts operational costs at the hub level. The numerical values and calibration parameters used for the operating cost are listed in Appendix A.

I_{k} = I_{0} \cdot λ_{k}^{s c a l e} \cdot λ_{k}^{o v e r} \cdot λ_{k}^{s t o r e}

(15)

3.4. Investment Costs ( $F_{k}$ )

A realistic analysis of investment costs is crucial when evaluating hub location alternatives within an intermodal transport network. Many decisions are influenced by logistical needs and budgetary constraints, land availability, environmental and social factors, and return on investment considerations.

In line with [29], the model defines investment cost as a combination of two key components: the market value of land at each candidate location and the associated infrastructure construction costs. The base cost is then adjusted by a pricing structure that reflects both the hub’s projected capacity and its level of territorial accessibility, following the approaches of [30,31].

F_{k} = A_{k} \cdot {(P}_{0} \cdot e^{μ D_{k}}) + {β_{i n f r a} \cdot C}_{b a s e, k} \cdot {(\frac{d_{k}}{C_{k}})}^{γ_{i n f a}}

(16)

where

A_{k} :

Land area required for hub k.

P_{0} : B

ase land price in the reference region where hub k is located.

D_k: Accessibility and urban development coefficient for hub k.

μ

: Coefficient that adjusts the influence of location on land price.

β_{i n f r a}

: Scaling factor for hub size.

C_{b a s e, k}

: Base infrastructure cost.

d_{k}

: Total expected flow at hub k.

C_{k}

: Capacity of hub k.

This structure reflects the dual nature of hub-related investment: it penalizes sites with poor accessibility or high land cost while also incorporating economies of scale in infrastructure construction, ensuring that cost estimations are adapted to real spatial conditions. The model enables both the technical assessment of candidate hubs and their economic prioritization for national infrastructure investment planning. This approach is particularly suited to contexts with significant regional asymmetries and multimodal transport constraints, such as Spain. The numerical values and calibration parameters used for the investment cost are listed in Appendix A.

3.5. Analysis of Environmental and Social Aspects (NPV_social)

Traditionally, hub location models have assumed fixed costs or simplified initial investment considerations, which may induce structural biases during the optimization process. Since hubs require significant initial investments and have long-term impacts, adopting a methodology that incorporates economic, social, and environmental factors into their evaluation is essential.

The Social Net Present Value (NPV_social), as proposed by [22], is an essential tool for evaluating infrastructure and transportation projects. It differs from Financial NPV, as defined by [32], in that it considers not only financial costs and revenues but also social, environmental, and economic costs and benefits, discounted over time. Its objective is to determine whether a project generates a net positive benefit for society, ensuring that decisions align with social welfare and sustainability.

This procedure allows for selecting hubs with greater long-term sustainability, integrating macroeconomic factors into optimizing logistics infrastructure. Since long-term social and economic benefits can offset operational and investment costs, we have not directly incorporated the Social Net Present Value into the cost minimization function (1). Instead, we introduce a constraint that ensures that the selected hubs have a net positive impact (2).

Thus, as a design criterion for evaluating each set of candidate hubs, we incorporate investment, operational, and transportation costs and environmental and social impacts. This allows for decision-making that aligns with market realities and public policies on sustainable development, ensuring that hub location and operation are optimized in terms of cost efficiency and their long-term impact on the population. Our approach treats sustainability not as a post hoc metric but as a binding criterion that guides the solution process, embedded directly into the optimization logic. It serves as a feasibility condition that shapes the solution space and redefines what is considered an admissible logistics network.

{N P V}_{s o c i a l, K} = - \frac{F_{k}}{T} + \sum_{t = 1}^{T} \frac{B_{t} - {(C}_{t, k} + I_{t, k})}{{(1 + r)}^{t}}

(17)

where

F_{k}

: Investment in hub k infrastructure, including land acquisition and construction costs.

T

: Time horizon in years.

B_{t}

: Expected annual benefits.

C_{t, k}

: Annual environmental and social costs of hub k.

I_{t, k}

: Annual operational costs of hub k.

r

: Discount rate, derived from Ramsey’s Rule.

In the analysis of environmental and social costs associated with hub management, several impact factors are considered, including greenhouse gas emissions, land use intensity, ecological conservation, and effects on population quality of life, the numerical values and calibration parameters used for the analysis of environmental and social aspects are listed in Appendix A:

C_{t, k} = {(β}_{a m b} \cdot E_{k} + γ_{a m b} \cdot S_{k}) + (λ_{s o c} \cdot {P o b}_{k} + μ \cdot C_{v i d a, k})

(18)

where

E_{k}

: CO₂ emissions generated by the construction of hub k.

S_{k}

: Ecosystem impact and biodiversity loss in the area where hub k is located.

β_{a m b}

,

γ_{a m b}

: Environmental penalty coefficients.

{P o b}_{k}

: Population affected by the operations of hub k.

C_{v i d a, k}

: Cost associated with the quality of life of the population affected by hub k.

Each hub candidate is required to achieve a positive social return, assessed through a net present value calculation that accounts for environmental penalties, operating expenses, and externalities over a 20-year horizon, as defined in Equation (17). In practice, each configuration is evaluated using independent indicators: unit logistics cost (EUR/t), environmental impact coefficients, NPV_social, and flow distribution entropy. This approach supports comparative scenario testing under spatial and operational constraints rather than relying solely on aggregate cost minimization.

3.6. Genetic Algorithm for Solving the CMAHLP with Sustainability Constraints

Hub location problems involve optimizing the structure of transport and distribution networks to minimize cost and improve connectivity. Due to their combinatorial nature, these problems are NP-hard, and solving them exactly becomes unfeasible in real-world scenarios with many nodes and constraints. For this reason, heuristic and metaheuristic approaches are used to obtain good solutions within reasonable computation times.

The model applies a genetic algorithm (GA) to identify suitable hub locations, minimizing not only transport and infrastructure costs but also operating expenses. In parallel, it evaluates the social and environmental implications of each configuration over the long term. During the initial phase of this work, we evaluated alternative solution strategies, including Harmony Search and Tabu Search, to test the robustness and versatility of different metaheuristics. Harmony Search produced fast but repetitive and structurally limited outputs, while Tabu Search, although more flexible, showed high sensitivity to initialization and instability under capacity constraints. Based on these preliminary tests, the Genetic Algorithm was selected as the core optimization engine due to its consistent convergence behavior, structural diversity, and adaptability to capacity, intermodality, and sustainability constraints. This choice is further detailed in the extended thesis version, which includes a comparative analysis of all three methods.

The selection of K-Means clustering over alternatives such as Density-Based Spatial Clustering of Applications with Noise (DBSCAN) or hierarchical clustering was based on three critical factors. First, its computational scalability ensures efficient initialization even in medium-sized national networks like Spain’s 47-node topology, where hierarchical approaches show combinatorial growth. Second, K-Means enables reproducible centroid initialization using fixed random seeds—an essential feature for institutional benchmarking and auditability in EU impact assessments. Third, its explicit minimization of intra-cluster distances enhances topological coherence, a key determinant of early-stage algorithm stability and convergence efficiency. In test scenarios, this initialization strategy consistently reduced the number of generations required to reach viable configurations, validating its suitability for CMAHLP applications where spatial structure and policy alignment are equally relevant.

Complementarily, a random initialization procedure was implemented to diversify the initial population. This routine ensures that each individual contains exactly the predefined number of hubs by randomly selecting a subset of nodes and enforcing binary constraints. While less topologically refined than clustering-based initialization, this mechanism enhances structural diversity and avoids early convergence—particularly relevant when exploring large or multimodal solution spaces.

The evolutionary cycle begins with an initial population of hub configurations derived from K-Means clustering, ensuring spatial consistency. Each generation proceeds by evaluating solution fitness, selecting the best individuals, applying crossover and targeted mutation, and conducting a local search on the top-performing solutions. Feasibility corrections ensure that all configurations respect cost, capacity, and NPV_social thresholds. Periodic reintroduction of diversity helps escape local optima. The cycle iterates until convergence criteria are met or the maximum number of generations is reached.

These techniques have been successfully applied to a wide range of problems, including combinatorial optimization for packing goods of different sizes into the fewest possible containers; task scheduling with limited resources over a time horizon; vehicle routing optimization to minimize transportation costs and delivery times; multi-criteria analysis for inventory classification; and logistics network design without capacity constraints, where transportation costs are based on the distance between customers and service centers [33,34,35,36,37]. In this research, the genetic algorithm is adapted to the CMAHLP problem, integrating multiple objectives and logistical constraints that require complex solution structures and a high evolutionary exploration capacity.

According to [38], a genetic algorithm consists of a series of key steps that enable its function as an iterative optimization mechanism. In this study, evolutionary configurations with a number of generations ranging from 100 to 800 were tested, adapting the population size to 50% of the total generations. This proportional relationship guarantees a balance between initial diversity and the convergence capacity. The most efficient configuration was identified at 300 generations and 150 individuals. The process begins with generating an initial population [39], where each individual represents a specific hub configuration within the network. This initial selection is performed randomly, based either on clustering strategies (K-means) or prioritization according to the intensity of logistic flow [40]. This initialization phase allows the population to be structured from the outset in a topologically coherent manner, avoiding the random dispersal of inefficient solutions.

Each individual is evaluated using a fitness function, which measures its quality as a solution. According to [41], the fitness function indicates the suitability for deciding which individuals will be selected for reproduction and selection processes. This function integrates a multi-criteria objective, composed of operational costs (I_k); investment costs (F_k); transportation costs (C_i,j); and an analysis of Social Net Present Value (NPV_social), incorporating Ramsey’s Rule (1928) and the improvements of [42] to reflect the intertemporal impact of environmental and social costs and benefits.

To initialize the population, the K-Means clustering algorithm, as formalized by [43], was employed to pre-select initial hub locations. The K-Means algorithm was applied to the spatial coordinates of the nodes in two benchmark scenarios: the classical dataset of CAB [4,7] and the Spanish case with 47 nodes.

For both the reference dataset and the Spanish case, the number of clusters was fixed to match the number of hubs specified in each scenario. A constant random seed was applied to guarantee reproducibility in centroid initialization, and several independent K-Means runs were conducted to verify the consistency of node allocation across cluster partitions. This ensured a coherent and unbiased initialization of the population for the genetic algorithm. Beyond reproducibility, this spatial clustering approach allows the algorithm to start from a set of territorially consistent solutions, improving early-stage stability and reducing the number of generations required to reach competitive results, as noted by [44].

Previous studies by [45] have demonstrated that the appropriate selection of initial centers in K-Means improves algorithm stability and prevents convergence to suboptimal solutions. The initialization process begins with extracting the spatial coordinates of the network nodes. K-Means is then applied to cluster the nodes into a predefined number of groups, determined by the number of hubs to be selected. A fixed random seed and multiple replications are used to improve the stability of cluster assignments, following the methodology recommended by [46].

Once the clusters are defined, each initial hub is selected as the node closest to the centroid of its corresponding group. This selection is carried out by computing the Euclidean distance between each node and its cluster centroid, choosing the one that minimizes the distance. This method, commonly used in clustering-based optimization, has been validated in previous works such as [44]. This technique ensures that each initial hub occupies a geographically central position within its cluster—maximizing topological representativeness; its application has demonstrated stable and efficient results both in the Spanish national network and in Ebery’s theoretical model, validating its suitability for initializing logistics networks with a heterogeneous territorial distribution.

This way of selecting initial hubs based on cluster centroids makes the first solutions more consistent with the real structure of the network. In both models, it helped reduce the number of generations needed to reach stable configurations and kept computation times under control. This benefit, highlighted by [45], reinforces the role of centroid-based initialization in accelerating convergence.

The evolutionary process begins with a dynamic tournament selection mechanism, in which solutions with lower objective function values are more likely to be retained. Tournament size varies across generations: large tournaments are used in the early stages to promote exploration, and progressively smaller tournaments are used to intensify exploitation in later stages [47]. This scheme allows selective pressure to be controlled and has proven effective in the evolution of CMAHLP configurations with multiple hubs.

Next, a uniform crossover operator is applied, allowing the exchange of information between selected solutions. This operator assigns each offspring gene the value of one of the parents with equal probability, promoting a diverse combination of characteristics [48]. The crossover probability has been calibrated in the range 0.75 and 0.85, selecting the 0.85 value as optimal due to its capacity to generate diversity without compromising the stability of solutions in intermediate phases of the evolutionary process.

To prevent a common issue in this type of crossover, where solutions might have too many or too few active hubs, a verification step ensures that each resulting solution has exactly the required number of active hubs. If there are too many, they are deactivated randomly; if there are too few, nodes with high connectivity are activated instead. This technique maintains the structural validity of the generated solutions [49].

Subsequently, a directed mutation process is introduced, replacing hubs to evaluate alternative configurations (genetic diversity) while maintaining feasibility. This prevents premature convergence by maintaining variability within the population, enabling the exploration of previously uncovered regions in the search space and correcting missing hubs [50]. During the calibration phase, mutation probabilities between 0.20 and 0.40 were analyzed, the 0.40 value showing the best performance in terms of topological diversity without compromising the quality or stability of the solutions.

After applying the selection, crossover, and mutation operators, the fittest individuals are chosen to form the next generation. This replacement phase is critical for the gradual improvement of the population, ensuring that solutions tend to improve over time [51]. To improve the quality of solutions, an intensive local search is applied to the top 1% of performing individuals in each generation. This search consists of carrying out controlled exchanges of active hubs and analyzing their impact on the objective function, allowing promising configurations to be fine-tuned without altering the global population. This exploitation phase allows the algorithm to fine-tune promising configurations, improving their fitness without altering the population globally.

To mitigate premature convergence, a diversification mechanism is introduced; in this sense, every 50 generations, the bottom 10% of the population is discarded and replaced with entirely new individuals generated through random initialization. This mechanism has been useful for avoiding deadlocks in scenarios with a larger number of hubs. At the same time, it introduces additional evolutionary pressure without sacrificing structural diversity. Its periodic execution allows the algorithm to escape local optima. This injection of novel genetic material promotes structural diversity and helps the algorithm escape local optima, as suggested by the foundational principles of evolutionary computation [38].

Some of the mechanisms, including directed mutation, local refinement, or hub correction, help the algorithm converge faster in most scenarios. The model keeps operational constraints under control while exploring better cost configurations step by step. If any hub gets close to capacity, the system shifts part of the flow to nearby nodes with room to absorb it. This prevents saturation and helps keep the network stable, especially when demand is not evenly distributed.

Finally, a stopping criterion is implemented [39], where the process continues for a predetermined number of generations or until improvements in the objective function become marginal. The primary stopping criterion is set at 300 generations, although the algorithm considers an additional convergence condition when no changes in the selected hubs or significant cost improvements are detected over several iterations.

At the end of the process, the best solution represents the optimal hub configuration within the logistics network, ensuring a positive social net present value (NPV_social) while minimizing transportation, operational, and construction costs. Local search is applied to the top 15% of individuals in each generation, evaluating controlled swaps between active and inactive hubs to refine solutions without altering the overall population structure. To prevent premature convergence, every 30 generations, about a third of the population is replaced with new individuals generated at random. This refresh allows the algorithm to keep exploring alternative configurations and avoid falling into local optima, which contributes to generating more balanced and robust solutions, especially in scenarios with many hubs. Parameter values and assumptions used are provided in Appendix A.

Figure 2 outlines the main evolutionary cycle of the proposed algorithm, including initialization, adaptive selection, scenario-based crossover, feasibility-enforcing mutation, local search, and stopping criteria.

3.7. Validation and Comparison of the Proposed Genetic Model Against the Ebery et al. Model

This section presents the implementation results of the proposed Genetic Algorithm (GA) applied to the Capacitated Multiple-Allocation Hub Location Problem (CMAHLP). To evaluate its performance under standard conditions, we developed a specific GA version calibrated to reproduce the cost logic and capacity rules of the formulation by [4], as used in classical benchmarks such as the CAB dataset. Although this heuristic has become a widely cited reference, its structure presents notable limitations in terms of capacity realism, flow flexibility, and sustainability integration. As discussed in Section 2, we do not adopt it as a theoretical foundation but rather as a comparative baseline. Our complete model presented in Section 3 incorporates a more advanced GA design, including dynamic operating costs, guided mutation, flow redistribution, and a feasibility constraint based on the Social Net Present Value (NPV_social), aligning the hub selection process with broader public policy and territorial balance goals.

The dataset employed corresponds to the classical CAB model, originally proposed by [7] based on intercity traffic data from the U.S. Civil Aeronautics Board. This benchmark is widely recognized for evaluating hub location formulations and remains valid for comparative purposes. In this study, we solved instances of n = 10, 15, 20, and 25 nodes, exploring different values of the transshipment discount factor α ∈ {0.2, 0.4, 1.0}. The GA was configured with a population of 150–400 individuals, 300–800 generations, 85% crossover, 40% mutation, and periodic diversity reintroduction (30% every 30 generations). A local search phase was also included for the top 15% of individuals per generation. Unlike exact methods, whose application becomes computationally limited at larger scales or when nonlinearities are introduced, this heuristic framework supports scalable, constraint-aware optimization under realistic operational behavior.

Table 2 summarizes the comparative results between the original heuristic proposed by [4] and the Genetic Algorithm developed for this study, applied to the CAB dataset. For each instance (n = 10, 15, 20, 25), and under different values of the transshipment discount factor α, both the total cost and selected hubs are reported. The percentage variation column quantifies the cost deviation—or gap— of the GA relative to Ebery’s results, calculated as:

G A P = \frac{{C o s t}_{G A} - {C o s t}_{E b e r y}}{{C o s t}_{e b e r y}} \cdot 100

Additionally, the table includes the elapsed computation time for each model. The GA shows a significant advantage, particularly in medium and large instances. For cases with 15, 20, or 25 nodes, the GA consistently reduces runtime by more than 70%, even when accounting for its increased complexity. For example, in case 25d2 (α = 0.2), the GA solves the problem in 7.27 s versus 493.77 s in the reference model—an improvement of over 98%. These results highlight the GA’s capacity to offer efficient, scalable optimization under real-world constraints.

Based on the results obtained, the Genetic Algorithm (GA) achieves solutions that are, in most cases, equivalent in terms of cost efficiency, with a deviation margin generally below ±4% compared to the values reported by [4].

Beyond these quantitative comparisons, a structural analysis of the results is presented in Figure 3, offering visual insights into hub location behavior, flow distribution, and solution robustness.

Figure 3 presents the structural analysis of the hub configurations derived by our Genetic Algorithm when solving the classical CAB dataset under different scenarios. Contrary to Table 2—which offers a numerical comparison with the benchmark by [4]—this figure focuses entirely on original results obtained through our model.

The figure includes multiple visual components to facilitate the interpretation of the behavior and outcomes of the proposed algorithm. The topological maps (left column) display the spatial distribution of active hubs (highlighted in red) and the dominant connections from each demand node (in blue). These connections correspond to the hub that receives the highest volume of flow from each spoke and thus allow us to identify nodes with High Local Centrality—an original concept introduced in this work. This represents an understanding of how logistical influence zones form and evolve under different scenarios.

Additionally, flow heatmaps (top right) visualize the transshipment volume processed by each hub, indicating their load and strategic relevance within the network. In parallel, the histograms (middle section) present the distribution of total cost values observed across generations, with fitted curves to capture the statistical behavior of convergence. Finally, diversity plots (bottom right) show the spread of top-performing solutions in the principal component space, highlighting the ability of the GA to maintain exploratory capacity and avoid local optima.

The graphical outputs in Figure 3 serve as more than just visual complements to the numerical results—they provide direct insight into the algorithm’s behavior. By representing the most frequent node-to-hub assignments, the volume handled by each hub, the cost distribution across generations, and the diversity of top solutions, we can verify whether the model behaves consistently with the expected logic: balancing flows, adapting to capacity, and maintaining exploratory capacity throughout the process. This perspective is essential not just to assess efficiency but to evaluate the robustness and consistency of the proposed configurations.

The results discussed here complement the numerical outcomes summarized in Table 2 and give structure to the discussion of how the Genetic Algorithm performs in benchmark settings under real-world conditions.

In most instances, both models arrive at very similar hub configurations. When differences do appear, they are usually linked to how the GA reallocates flows and manages capacity limits, which makes the system more flexible under uneven demand conditions.

Looking at the spatial distribution of hubs where the two models diverge, a clear pattern emerges. Ebery’s heuristic often selects nodes with high historical assignment frequencies, which reinforces positions previously considered dominant. This behavior corresponds with a selection process based on fixed costs and probability-driven prioritization.

In contrast, the GA determines hub locations through a territorially adaptive process that integrates cluster-based initialization and capacity-aware evaluation. Rather than favoring conventionally dominant hubs, the algorithm identifies alternative nodes that may yield superior performance in terms of both cost and flow distribution, particularly in less saturated areas.

The selection phase in the GA leverages K-Means clustering to partition the network space and assign hubs based on proximity to cluster centroids. This differs from Ebery’s approach, which builds hub assignments primarily through fixed-cost heuristics and capacity filtering, followed by a local exchange routine aimed at refining the solution quality through marginal cost improvements.

In case 20c4, the GA selects Baltimore, Memphis, and Phoenix instead of the more central hubs identified by [4]. This configuration distributes flows more evenly across the network and avoids excessive concentration. In case 10d4, Ebery assigns node 4 as a hub, whereas the GA selects node 9. Since node 4 has a higher historical selection probability, this difference reflects Ebery’s tendency to favor statistically dominant nodes, while the GA emphasizes territorial balance. A similar pattern appears in case 15d2 (α = 0.2), where the GA includes node 5 (Denver), while Ebery again selects node 4. The proximity between both nodes confirms that the GA refines hub positioning within local ranges rather than proposing disruptive or inefficient alternatives.

The genetic algorithm reallocates flows based on available capacity, preventing overload at critical hubs. Unlike Ebery’s heuristic, which applies local exchange rules for incremental adjustments, this approach recalculates load distributions globally during the optimization process. In case 20d4, both models select the same hubs (4–12–17), but the GA shows slightly higher costs (+2.42%), indicating that in this particular instance, Ebery’s simpler rerouting approach may be more efficient under fixed conditions.

One of the strengths of the GA is its ability to adjust flow distribution across the network based on hub capacity. This helps prevent overloads and improves infrastructure efficiency, especially during high demand or significant transshipment penalties. These conditions are common in large-scale systems or in networks exposed to changes in flow patterns, where rigid assignment rules often fail to respond effectively.

The behavior of the model also changes depending on the value of the discount factor α. As α increases, the cost gap between the GA and the heuristic tends to widen in favor of the GA. This is consistent with what we observe in scenarios where transshipment costs become more influential. In high-demand scenarios, the genetic algorithm reallocates flows to minimize global costs and mitigate hub saturation. It achieves more balanced flow distributions without increasing operational costs, maintaining solution robustness under shifting conditions, as observed in large scale networks [52,53].

In summary, the GA represents a robust and scalable approach to hub location problems. Beyond reducing costs, it improves overall network efficiency by balancing flows, managing capacity adaptively, and reflecting real operational behavior [40]. Compared to traditional heuristics, it offers similar or better results, with the added value of integrating flexible decision logic aligned with realistic logistics conditions.

3.8. Results of the Proposed Model

This section presents the results of applying the genetic algorithm to the Spanish freight transport network across a wide range of scenarios. Our model minimizes the total cost of the intermodal system by integrating three main components: transportation costs (covering collection, transfer, and distribution), dynamic hub operational costs (Iₖ), and fixed investment costs (Fₖ). In addition, a long-term sustainability constraint is introduced through the Social Net Present Value (NPV_social), acting as a constraint (NPV_social ≥ 0), ensuring that only those configurations with a structurally positive and long-term socioeconomic impact are considered valid.

Unlike traditional approaches focused solely on cost, this model introduces a set of strategic performance indicators—such as economic efficiency, territorial coverage, and intermodal connectivity. These indicators address a critical gap in hub network design literature, which traditionally prioritizes cost minimization over systemic resilience and territorial balance. They are applied ex post to assess and rank feasible configurations from a strategic, economic, and social perspective.

The analysis is based on the core optimization objective (1) and the sustainability framework NPV_social (2) from Section 3, and utilizes data from the 2023 freight statistics published by the Spanish Ministry of Transport, specifically, from the Observatorio del Transporte y la Logística en España (OTLE), which provides official multimodal flow data at the national scale. The dataset includes maritime, rail, and road modes, encompassing both national flows (intra- and inter-regional) and international trade. Key port terminals and border crossings are explicitly incorporated as origin–destination nodes in the intermodal network.

The simulated logistics network consists of 47 candidate nodes spread across the Spanish mainland, including the provincial capitals and four key maritime ports—Algeciras, Valencia, Barcelona, and Málaga—that serve as critical gateways for international freight, particularly along the southern and eastern coasts (see Table S1 in Supplementary Materials for spoke ID mapping). This configuration guarantees comprehensive territorial coverage and captures both inland and maritime dimensions of the national logistics system (Figure 4).

The demand matrix was constructed from the OTLE dataset, and a 47×47 cost matrix was developed using the methodologies of [23,24], incorporating a mode-sensitive function that distinguishes between unimodal (road-only) and intermodal (road–rail) transport, where adequate infrastructure exists. This formulation captures all internal logistics costs—collection, main haulage, transshipment, and distribution—as well as externalities such as congestion, emissions, noise, and accident risk. To represent the real freight dynamics of Spain, the model explicitly includes key rail freight corridors such as Valencia–Madrid, Zaragoza–Barcelona, and Algeciras–Zaragoza. These high-density corridors serve as strategic levers for modal shift, directly supporting Spain’s sustainable infrastructure goals.

As introduced in Section 3, the evolution of average door-to-door costs (Figure 1) confirms the structural advantage of intermodal transport—understood here as the integration of road and rail—over long distances. This pattern is consistently reflected in the model’s outputs: scenarios that activate a greater number of hubs and apply lower values of the transshipment discount factor α achieve lower per-ton costs while significantly expanding territorial coverage and enhancing intermodal connectivity across the network.

To examine the trade-offs between network structure and system performance, 120 optimization scenarios were defined using the previously calibrated genetic algorithm. Each combines a number of active hubs (4, 6, 8, 10, or 12) with a specific value of the transshipment discount factor (α = 0.2 or 1.0), simulating the presence or absence of inter-hub economies of scale. Notably, α = 0.2 aligns with current EU policies promoting rail consolidation along priority corridors (e.g., TEN-T). These combinations enable comparative testing between highly centralized systems and territorially distributed configurations. Sustainability is evaluated across its three strategic dimensions—economic, social, and environmental—under a long-term infrastructure planning lens.

The design space was explored using a structured matrix of 120 scenarios, defined by variations in key genetic algorithm settings: number of hubs, α values, generations (100–800), population sizes (50–400), crossover probabilities (0.75–0.85), and mutation rates (0.20–0.40). An elitist selection mechanism retained the top 30% of individuals per generation, complemented by a 15% local search intensification to refine high-quality solutions. This experimental design overcomes a key limitation in freight GA literature: the neglect of structured diversity for policy-transferable solutions. It ensures balanced exploration of distinct configurations while avoiding premature convergence and reinforcing solution robustness.

The discount factor α (0 ≤ α ≤ 1) reflects the extent to which economies of scale apply to inter-hub freight flows. Lower values simulate consolidated, cost-efficient intermodal corridors—such as those involving rail—while α = 1.0 represents purely road-based logistics without inter-hub cost savings. These two extremes define the system’s behavioral boundaries and support robust comparisons between highly centralized networks and territorially distributed models.

The cartographic outputs—especially the flow maps showing optimal hub locations and their assigned zones—offer a crucial spatial interpretation. The blue assignment lines do not depict physical routes but illustrate each hub’s sphere of influence, its “logistical gravity”, over the nodes it consolidates. This visualization enables the detection of consistent regional patterns, enhancing the territorial interpretation of each configuration. Madrid consistently anchors the central structure; Zaragoza and Valencia extend their influence along east–west axes, while Córdoba, Jaén, and Málaga gain prominence in socially weighted or equity-driven solutions.

These spatial maps reflect the distribution logic of the most cost-efficient solutions (subject to NPV_social ≥ 0) and lay the foundation for the next phase of analysis based on economic efficiency and structural diversity indicators.

Rather than repeating identical runs, each hub–α configuration was executed under variable genetic settings. This induces structured stochasticity into the optimization process, generating diverse convergence dynamics, solution topologies, and evolutionary paths. As empirically validated in section “Results of the model” (Figure 5), this strategy prevents premature convergence and enables the identification of robust, transferable configurations suitable for long-term strategic freight planning.

The following section presents the output obtained by the model, integrating total transport costs with the construction and operational expenses of the activated hubs. Only configurations meeting the sustainability constraint (NPV_social ≥ 0) are retained, ensuring that the resulting networks are not only cost-efficient but also socially and environmentally viable in the long term.

To clarify the analytical scope of this section, it is important to note that the configurations shown correspond to the most cost-efficient solutions—subject to the constraint NPVsocial ≥ 0. These results do not represent the full range of high-performing scenarios in terms of social or territorial return, nor do they reflect the highest-scoring layouts under composite indicators. Rather, they form the first analytical layer—benchmarking network performance under strict economic rationality and sustainability viability, which sets the baseline for deeper multi-criteria comparisons presented in later sections. These results are summarized in Table 3, which lists the most cost-efficient hub configurations that satisfy the NPVₛₒcᵢₐₗ ≥ 0 constraint.

The results show a clear trend: as the number of hubs increases, the network evolves from a compact, centralized configuration to a more spatially distributed architecture. With only four hubs, flows concentrate around central high-capacity nodes such as Madrid, Valencia, and Zaragoza. These locations consistently appear in cost-efficient solutions due to their robust infrastructure, modal connectivity, and topological centrality.

When the transshipment discount factor α is set to 0.2, the model simulates consolidated, cost-efficient corridors (e.g., rail), reducing the penalty for inter-hub distances and encouraging the inclusion of peripheral nodes. These scenarios tend to achieve greater spatial reach without significantly increasing total costs. In contrast, when α = 1.0, the absence of economies of scale leads the model to favor centralized solutions, concentrating flows through a small set of dominant hubs—a behavior typical of dense, unimodal logistics systems.

Comparing both α values reveals a key planning trade-off: α = 0.2 supports territorial equity and decentralization, while α = 1.0 maximizes cost efficiency under centralization. This balance is particularly relevant in intermodal systems where inland and maritime hubs coexist, and long-term resilience depends on avoiding excessive concentration.

In scenarios with 8, 10, or 12 hubs, the model activates additional nodes such as Lugo, Palencia, Córdoba, Cartagena, and Málaga—each selected for their geographic positioning, connectivity, or role in balancing hinterland access. Meanwhile, cities like Barcelona and Zaragoza, although structurally advantageous, occasionally show lower utilization, especially when peripheral hubs absorb regional flows. This confirms that optimal hub selection is context-dependent and evolves with the network’s spatial and operational parameters.

Beyond identifying cost-optimal configurations, the model incorporates a second analytical layer that explores the structural logic and strategic potential of each solution. This perspective shifts the focus from pure economic cost to broader indicators of resilience, spatial equity, and adaptive capacity—allowing for a more nuanced interpretation of the algorithm’s outputs. Rather than evaluating networks solely based on total expenditure, this approach examines how different configurations perform under complexity, identifying those that offer stronger trade-offs between efficiency, territorial coverage, and long-term viability.

To operationalize this perspective, the model integrates a set of structural and behavioral indicators:

Economic Efficiency: Ratio of NPV_social to total system cost.
Solution Entropy: Measures structural diversity and resistance to convergence.
Euclidean Diversity: Captures dispersion of solutions within the search space.
Cluster Diversity: Assesses whether hubs form coherent territorial groupings.
Pareto Frontier: Identifies non-dominated trade-offs between cost, sustainability, and geographic equity.

These indicators populate the Strategic Solution Space Explorer (Figure 5), which provides an interactive dashboard to compare evolutionary behaviors across scenarios. This framework reveals how mutation rates, hub counts, or flow structures influence convergence dynamics, making it a powerful tool for strategic logistics planning.

Graphical outputs confirm a progressive decline in total cost per ton as the number of hubs increases (Figure 5), driven by better flow allocation and inter-hub accessibility—especially under α = 0.2, where spatial decentralization becomes more viable.

In parallel, NPV_social increases consistently with the number of hubs (Figure 6), reflecting social returns from emissions reduction, accident avoidance, noise mitigation, and time savings. This increase becomes markedly steeper beyond the 8-hub threshold, where each additional hub starts to generate broader territorial benefits, beyond logistics alone.

This dual behavior—declining cost and increasing social return—defines the Pareto frontier, where optimal configurations align along a convex curve. Economic efficiency (Figure 7), defined as NPVsocial per unit cost, captures this trade-off: although each hub adds investment, the network gains in social and operational performance—up to a limit. Beyond 12 hubs, the marginal returns diminish, and the model transitions from efficiency-seeking to resilience-building—an inflection point where additional hubs reflect policy priorities rather than strict economic logic (Figure 8).

Computational metrics—entropy, cluster diversity, and Pareto dominance—validate the model’s robustness. Networks with six and eight hubs exhibit optimal trade-offs between cost-efficiency and structural diversity, ensuring adaptability under uncertainty. Configurations with 10 and 12 hubs extend geographic reach and improve access in peripheral regions, bolstering system resilience against congestion—albeit with a slight deterioration in aggregate efficiency metrics.

Overall, the model highlights a core triad—Madrid, Valencia, and Barcelona—emerging across all configurations as strategic backbones. Other cities like Málaga, Córdoba, Jaén, Palencia, León, and Zaragoza exhibit strong territorial performance and operational balance.

Under distributed configurations (α = 0.2), the model prioritizes nodes like Lugo, Cartagena, Pontevedra, Huelva, and Seville—locations with modal versatility and strategic geography, despite lacking fully developed logistics infrastructure.

Rather than proposing a single optimal blueprint, the model offers a curated spectrum of viable designs—flexible, scalable, and aligned with different policy agendas. These scenarios serve as a prelude to the concluding strategic insights that follow.

4. Conclusions

This work presents an advanced formulation of the Capacitated Multiple-Allocation Hub Location Problem (CMAHLP), tailored to support strategic intermodal logistics planning in Spain. The model integrates mode-sensitive transport costs, variable operational expenditures, investment requirements, hub capacity constraints, and a mandatory feasibility condition based on the Social Net Present Value (NPV_social). These components allow the model to represent the real constraints of national freight infrastructure while informing public policy design for the development of intermodality.

Unlike previous approaches reviewed in Section 3, the objective function has been broadened to include modal economies of scale, congestion penalties, storage-related costs, and infrastructure investment linked to land availability and terrestrial connectivity. Additionally, a temporal dimension is incorporated using a discount rate adapted from Ramsey’s Rule, which allows present-day investments to be weighed against long-term social and environmental returns.

Results from over 120 optimization scenarios confirm the model’s consistent performance across varying structural and cost conditions. As the number of hubs increases, the average cost per ton shows a steady downward trend. This reduction is driven not by increased investment but by more balanced distribution of flows and a more efficient use of available capacity—particularly in network structures that avoid over centralization.

The model consistently selects core logistics hubs such as Madrid, Valencia, and Barcelona across scenarios with 6, 8, 10, or 12 hubs, especially when the transshipment discount factor is high (α = 1). These nodes appear in nearly all high-performing configurations. They are frequently accompanied by inland hubs like Málaga, Córdoba, Palencia, Jaén, and León, which help redistribute flows and improve territorial balance thanks to their relative efficiency and intermodal access.

When the number of hubs is limited or the discount factor is low (α = 0.2), the model leans toward more distributed layouts. Under these conditions, nodes such as Lugo, Cartagena, Huelva, Pontevedra, or Ourense appear more frequently. While their infrastructure is more limited, they present intermodal potential that makes them strategically viable. This reflects the model’s flexibility in adapting to different spatial configurations and identifying high-performing alternatives beyond the traditional corridors.

Across all simulations, the algorithm produced a wide range of valid network designs. This diversity—measured through entropy and Euclidean dispersion—shows that the model avoids early convergence and explores alternatives with different balances between cost and coverage. For planners, this is useful: it provides a robust set of options adaptable to different future conditions. Pareto analysis confirms that some slightly more expensive layouts outperform others in terms of social return and territorial balance, reminding us that cost alone is not enough when designing intermodal networks.

The variation in genetic algorithm parameters—across 120 cases combining population size (50–400), crossover rates (0.75–0.85), mutation rates (0.20–0.40), and generations (100–800)—reinforced solution diversity. This controlled randomness introduces flexibility, enabling the model to adapt to a wide range of planning scenarios.

The scenarios with six and eight hubs deliver a balanced outcome between cost performance and network structure. They moderate logistics costs while offering a range of viable configurations, making them well suited to short- and medium-term planning. In contrast, networks with 10 or 12 hubs extend the system’s coverage by incorporating peripheral nodes—currently less active but with strong potential under evolving North Africa trade flows. These larger networks merit consideration for long-term strategies aimed at strengthening cohesion and resilience.

Figure 9 shows the spatial distribution of selected hubs and the freight flows they consolidate in three representative configurations. The red dots mark the hubs chosen by the model, while the blue lines indicate the catchment links—routes from each origin to its assigned hub. The heatmaps on the right reflect the total volume handled by each hub: darker shades indicate higher activity levels.

Six Hubs Scenario: The configuration strikes a solid balance between cost efficiency and national coverage. With six hubs well distributed across the territory, it offers a compact yet effective backbone—particularly suitable for medium-range planning under stable demand.
Eight Hubs Scenario: Introducing two additional hubs allows the network to reach into less connected areas, especially in the north and southeast. This improves accessibility and intermodal integration in regions traditionally underserved.
Twelve Hubs Scenario: With twelve active hubs, the system extends into peripheral zones with high latent potential. This layout aligns well with long-term strategies, particularly in the context of trade realignments and growing connectivity with North Africa.

The cross-analysis of hub selection frequency, economic efficiency, and flow performance supports a clear functional typology of nodes. This classification reveals not only which hubs are consistently active but also their strategic role—as structural backbones, tactical enhancers, or regional balancers. The following categories summarize the distinctive contribution each group makes within the optimized network configurations (Figure 10).

Essential hubs: Madrid, Barcelona and Valencia form the system’s core structure. They appear in virtually all high-efficiency scenarios, reflecting not only their existing capacity but also their established role in coordinating maritime and inland freight. As high-connectivity nodes, they absorb large volumes without diminishing network-wide efficiency.
High-performance tactical hubs: Córdoba, Málaga, Jaén, Palencia, León, and Zaragoza act as tactical enhancers with strong operational value. Though not present in every configuration, they often provide critical support—either by reinforcing key freight corridors or by relieving pressure from overloaded hubs. Their role is especially prominent in distributed or resilient layouts. Notably, Córdoba and Zaragoza consistently balance central positioning with territorial coverage, making them compelling candidates for investment within national infrastructure agendas.
Support hubs. Lugo, Vitoria, and Cartagena operate as secondary hubs with moderate but consistent involvement across scenarios. They feature most often in configurations designed to expand territorial coverage while keeping logistics costs under control. Their strategic value lies in strengthening regional capillarity, especially in underserved areas of the northwest and southeast. With adequate policy support—such as intermodal incentives or selective infrastructure investments—these nodes could fully realize their logistics potential and overcome existing constraints. Public policy interventions—such as targeted incentives for intermodal services or selective infrastructure upgrades—could help unlock their full logistics potential and address current structural limitations.
Territorially balanced hubs. Finally, nodes like Seville, Alicante, and San Sebastián fulfill a compensatory territorial function. Although their average economic efficiency is somewhat lower, their value lies in the geographic coverage and modal balance they introduce. These hubs are more often included in network designs that emphasize sustainability, regional cohesion, and congestion relief, particularly in southern regions and along the Mediterranean arc.

The model offers strong empirical support for the idea that an intermodal logistics network can be both efficient and geographically equitable. Instead of relying solely on cost as the guiding criterion, the model brings in key planning factors—public investment returns, environmental impact, and long-term system coherence. This shift moves the model beyond a technical optimizer and into the realm of practical planning, closely aligned with policy objectives and based on observed patterns.

Rather than prescribing a single optimal layout, the model generates a structured set of tailored solutions. Each configuration reflects a different set of planning priorities—economic, social, or territorial—and includes enough detail to support strategic decisions about where and how to allocate infrastructure resources. This versatility makes the model particularly valuable for national infrastructure planning, especially in the current context of evolving trade flows and reconfigured corridors across the Mediterranean and North Africa.

The methodology developed in this study aligns with both technical optimization goals and the strategic imperatives of the European Green Deal and EU cohesion policy. By integrating intermodal nodes—especially maritime terminals connected to inland logistics platforms—the model can support targeted investment priorities aimed at promoting modal shift and reducing carbon dependency in long-haul freight corridors. Incorporating the Social Net Present Value (NPV_social) criterion ensures that operational feasibility is assessed in conjunction with environmental impacts and spatial equity, establishing a planning framework that goes beyond simple cost minimization.

This approach is particularly relevant for Spain, given its marked regional disparities, strong port dependency, and the limited integration of rail freight—currently under 5% of modal share. Similar structural imbalances are observed in Portugal, Italy, and Greece, and even in France, where rail accounts for less than 10% of freight transport. In all these countries, multipolar logistics systems with major port regions must decarbonize and rebalance flows. By combining infrastructure configuration, flow dynamics, and socio-environmental metrics, the model provides a scientifically robust basis for integrated planning. This aligns directly with EU funding instruments such as CEF, FEDER, and the Recovery and Resilience Facility, which prioritizes sustainable, resilient, and intermodal corridor development.

This study supports a balanced and coherent model for Spain’s intermodal freight planning, demonstrating that ports are not just entry points but strategic territorial nodes. It offers concrete guidance for public policymakers to align infrastructure investment with core objective territorial cohesion, sustainable logistics, the Green Deal, and TEN-T integration. The model underpins actionable institutional proposals, including a National Intermodality Strategy (echoing the 2023 European Court of Auditors’ recommendation) and the creation of a national observatory for monitoring modal share and hub performance. It also enables the design of differentiated terminal charges, fiscal incentives for intermodal operations, and environmental clauses for funding approval. Rooted in CMAHLP outputs, these instruments bridge model-based diagnosis and policy action, fostering a logistics governance framework that is more coordinated, adaptive, and geographically equitable.

In summary, this study validates the use of the CMAHLP framework for maritime terrestrial logistics planning, enriched by sustainability-based feasibility constraints. The model aligns with EU strategic agendas—the Green Deal and TEN-T—as operationalized in Table 4. This synthesis highlights three sources of European relevance: (1) consolidation of Spain-specific insights, (2) confirmation of policy alignment with CEF and the Fit for 55 package, and (3) presentation of transferable implementation templates, demonstrated through case examples in Italy, Portugal, Greece, and France.

Looking ahead, future work will explore dynamic network adjustments to better capture the evolution of freight flows and the impact of shifting cost structures and pricing regimes.

5. Future Research and Model Improvements

This study lays the foundation for a robust and adaptive hub location framework under real-world intermodal logistics conditions. However, several areas emerge as relevant extensions to refine the model further and adapt it to evolving planning scenarios:

Real-Time Disruption Scenarios. The current formulation operates under static flow assumptions. Future developments will incorporate disruption-responsive modules to simulate volatility in real-time operations. This includes stochastic modeling of traffic fluctuations using Poisson-based distributions calibrated to freight delays along key axes (e.g., AP-7, A-4), as well as port-level disruptions—such as strikes or terminal outages—parameterized through historical data from Spanish port authorities.
In addition, weather induced disruptions—such as Mediterranean fog or Atlantic storm fronts—could be incorporated through AI-based early warning systems. Notably, the use of ensemble Generative Adversarial Networks (GANs) for maritime detection under low-visibility conditions, as demonstrated by [54] in port monitoring applications, offers a promising path for anomaly detection in high-risk operational contexts. Embedding similar architectures into the CMAHLP disruption module would enhance the model’s anticipatory capabilities and responsiveness under the NIS2 Directive thresholds for critical infrastructure resilience.
Policy Implications: These extensions will support the application of the CMAHLP framework within DG-MOVE’s Resilience and Capacity Tool (RACT), particularly under the scope of Commission Delegated Regulation (EU) 2022/436 on the protection of critical transport infrastructure.
Traffic Growth and Hub Reconfiguration. As freight volumes expand, flow patterns and the strategic weight of specific nodes will inevitably evolve. Part of this growth stems from nearshoring trends in North Africa and persistent Red Sea disruptions—redirecting trade flows through the Western Mediterranean. These shifts intensify pressure on ports around the Strait of Gibraltar and trigger inland logistics adjustments. The key challenge is assessing whether emerging hubs such as Málaga, Córdoba, Jaén, Lugo or Palencia consolidate their role or whether mainstays like Madrid, Valencia, Barcelona and Zaragoza must absorb additional demand. These dynamics may also reveal latent intermodal value in secondary nodes, conditional on infrastructure upgrades and flow redistribution.
EU Policy Transfer: this reconfiguration logic is not unique to Spain. It mirrors challenges across the TEN-T Mediterranean Corridor, where ports in France, Italy, and Greece also face traffic asymmetries tied to shifting trade patterns. The CMAHLP model provides a structured method to simulate such scenarios under demand volatility. It enables EU infrastructure programs—such as CEF Transport and DG-MOVE evaluations—to prioritize investment in pressure nodes and validate network adjustments consistent with Green Deal and Fit for 55 objectives.
Capacity Optimization and Congestion Mitigation. The inclusion of congestion-sensitive algorithms and real-time rerouting logic would enable the model to preempt bottlenecks by dynamically reallocating flows toward secondary hubs. Madrid, Barcelona, and Valencia appear in nearly all configurations—an indicator of systemic centrality but also a warning of future saturation risks. Incorporating rolling-horizon structures would allow the system to adapt progressively to freight growth and shifting modal costs in high-pressure scenarios.
EU Policy Transfer: these enhancements are especially relevant for DG-REGIO and CEF Transport programs aiming to mitigate congestion in urban nodes along the Mediterranean arc. The ability to simulate flow reallocation under traffic stress supports proactive funding decisions in multimodal corridors and aligns with AI-enabled logistics strategies in the European Smart Mobility framework (EU Urban Mobility Initiative, Digital Europe Programme, and Digital Transport and Logistics Forum (DTLF)).
Intermodal Infrastructure Expansion and Resilience. Future developments should simulate the inclusion of new intermodal corridors, dry ports, and inland terminals—especially in the Mediterranean arc. This would allow testing the performance and resilience gains of planned investments. Tailored simulations could also incorporate specific modes such as inland waterways or air cargo, in port–airport systems, or fluvial corridors, with proper reparametrization of cost and operational structures. These simulations could inform the prioritization of infrastructure investments under CEF Transport calls, particularly those targeting modal shift and resilience within TEN-T Mediterranean and Atlantic corridors.
Environmental Externalities and Sustainability Metrics. Freight growth intensifies emissions, land use, and urban congestion. Incorporating ecological metrics into the core optimization—rather than treating them as external filters—would align the model with the EU Green Deal framework and enable scenario ranking based on total environmental footprint, not just economic cost, in line with EU Taxonomy criteria for sustainable infrastructure and Fit for 55 emission benchmarks.
This logic can be further enhanced by integrating lifecycle emissions directly into the hub location decision process, as demonstrated in the Yangtze River port case study [55], where environmental criteria were embedded into procurement models for container trucks to meet China’s Phase V emission standards—a policy benchmark comparable to Euro VI regulations under the EU framework. That study’s cost–emission trade-off methodology validates our approach to internalizing carbon externalities through the NPVsocial function, structured around Ramsey-based intertemporal evaluation.
Integration with Nearshoring Dynamics and Euro–Maghreb Trade Flows. Nearshoring and supply chain volatility (COVID-19, Red Sea crisis) are shifting trade toward the Mediterranean. Spain’s interface role between Europe and North Africa will intensify. Future scenarios should simulate demand shifts tied to Euro–Maghreb dynamics, assessing the strategic elevation of hubs in Andalusia and the Levante corridor. Such scenarios are particularly relevant for CEF Transport resilience planning and can inform DG-MOVE strategies aimed at reinforcing Mediterranean corridors under shifting Euro–Maghreb trade patterns.
Cross-Border Integration with Portugal and Atlantic Corridors. Future extensions should explicitly simulate Iberian-scale integration via the Sines–Madrid–Valencia axis, enabling coordinated infrastructure planning between Spain and Portugal. The inclusion of cross-border corridors within the CMAHLP framework allows for joint capacity allocation, specialization of terminal functions, and the evaluation of cost-sharing scenarios under CEF Transport’s Cohesion Envelope. These simulations can be structured using FEDER interoperability standards and aligned with the objectives of the TEN-T Atlantic Corridor, particularly in terms of modal shift and territorial cohesion.
Beyond the Iberian Peninsula, this approach provides a transferable template for other EU border regions—such as the France–Belgium or Italy–Slovenia corridors—where multimodal convergence and infrastructure asymmetries require coordinated planning. By identifying priority nodes for strategic co-investment, the model delivers a practical toolkit for DG-REGIO and CEF managers seeking to reduce fragmentation and reinforce spatial equity under the EU Cohesion Policy.
Parameter Sensitivity and Metaheuristic Design. Future refinements should include a systematic sensitivity analysis of the genetic algorithm parameters—specifically mutation rates, crossover probabilities, and selection depth—to identify robust and replicable configurations. This process would enable calibration of the algorithm for different corridor typologies and freight dynamics across the EU. Incorporating variance-based diagnostics and adaptive parameter tuning enhances algorithm transparency, a key requirement for institutional deployment. By making the inner logic of the solution process auditable and explainable, the CMAHLP framework becomes compatible with standardized assessment procedures used by European agencies—facilitating its use in preliminary project screening, scenario comparison, and investment prioritization under DG-MOVE or DG-REGIO criteria.
Alignment with EU Impact Assessment Logic. Ensuring compatibility with EU impact assessment procedures is essential for the model’s uptake by institutional stakeholders. The CMAHLP framework structures its outputs around standardized criteria—connectivity gains, modal shift potential, and territorial cohesion—aligning directly with the objectives set out in the EU Cohesion Policy 2021–2027 (Article 9) and Green Deal targets. By mirroring the logic of ex ante evaluation workflows managed by DG-MOVE and DG-REGIO, the model facilitates early-stage project screening and eligibility checks for EU funding instruments such as CEF Transport, the Recovery and Resilience Facility (MRR), and FEDER. It supports quantitative benchmarking against Eurostat NUTS-3 corridor baselines and enables Member States to meet assessment requirements under the Common Provisions Regulation (CPR) 1303/2013. This operational alignment reinforces the CMAHLP as a technical support tool in transnational infrastructure governance and strategic corridor development.

This research agenda consolidates intermodality as a core design principle for infrastructure planning—moving beyond modal coordination to structure territorial logistics systems. It strengthens the analytical and operational foundations for resilient freight corridors in Spain and supports scalable deployment across Mediterranean and Atlantic axes under shared European policy frameworks.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/jmse13071301/s1, Table S1: Spoke ID mapping for the 47 × 47 network (node and spoke index).

Author Contributions

Conceptualization, J.M.R.; Methodology, J.M.R.; Formal analysis, J.M.R.; Investigation, J.M.R.; Data curation, J.M.R.; Resources, J.M.R.; Software, J.M.R.; Writing—original draft, J.M.R.; Supervision, A.C.O.; Validation, A.C.O.; Writing—review and editing, A.C.O. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The raw data supporting the conclusions of this article are available from the corresponding author upon request.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Variable Values

Table A1. Input data and genetic optimization parameters.

Variable	Description	Typical Value/Range	Unit
Number of Nodes	Total number of candidate nodes.	47 (Spain case)	Integer
Number of Hubs	Number of hubs to be selected.	4–12	Integer
Cost Matrix	Transportation cost matrix between nodes.	-	EUR/t
Flow Matrix	Freight flows between nodes.	-	t/year
Node Coordinates (X, Y)	Node spatial coordinates (UTM projection).	-	Meters
Alpha (Discount Factor)	Discount factor for inter-hub transportation costs.	0.2–1.0	Dimensionless
Generations	Number of evolutionary generations.	300–800	Integer
Population Size	Initial population size in the genetic algorithm.	150–300	Integer
Mutation Probability	Probability of individual mutation per generation.	0.2–0.4	Dimensionless
Crossover Probability	Probability of crossover between individuals.	0.75–0.85	Dimensionless
Reinsertion Interval	Interval (in generations) for diversity reinsertion.	50–75	Integer
Reinsertion Proportion	Proportion of population replaced during diversity reinsertion.	10–30%	Dimensionless

Table A2. Derived metrics and performance indicators.

Metric	Description	Unit
Best Total Cost	Best total logistics cost achieved.	EUR/t
Best NPVsocial	Best socio-environmental Net Present Value (NPVsocial).	MEUR
Economic Efficiency	Economic efficiency (NPVsocial/Cost).	Dimensionless
Euclidean Diversity	Euclidean diversity measure of the solution space.	Dimensionless
Solution Entropy	Structural entropy of network configurations.	Dimensionless
Cluster Diversity	Diversity across clusters of hub groupings.	Dimensionless
Infeasibility Rate	Average infeasibility rate across solutions.	%

Table A3. Economic, infrastructure, and environmental constants.

Parameter	Description	Value	Unit
chi	Weighting factor for distance normalization.	1.0	Dimensionless
delta	Penalty factor for over-congestion impacts.	1.0	Dimensionless
alpha	Transshipment discount factor between hubs.	0.2–1.0	Dimensionless
escalaFactor	Scaling coefficient for logistic flow adjustments.	0.1	Dimensionless
sobrecargaFactor	Overload penalty factor on saturated hubs.	0.3	Dimensionless
gamma_over	Multiplier for congestion penalties.	2	Dimensionless
marginalFactor	Marginal contribution factor for secondary connections.	0.2	Dimensionless
epsilon	Convergence tolerance for optimization.	1 × 10⁻⁶	Dimensionless
theta_store	Base storage cost penalty.	0.1	Dimensionless
beta_store	Scaling factor for cumulative storage costs.	0.05	Dimensionless
P_0	Base construction cost per unit area.	300	EUR/m²
mu	Efficiency decay factor over time.	0.05	Dimensionless
beta_infra	Scaling factor for infrastructure fixed costs.	0.8	Dimensionless
C_base	Base installation cost per hub node.	300	EUR
gamma_infra	Infrastructure cost escalation factor.	1.3	Dimensionless
T	Analysis horizon for NPV calculations.	20	Years
delta_r	Inflation correction rate.	0.011	Dimensionless
eta	Economies of scale parameter.	1.5	Dimensionless
g	Annual growth rate of freight demand.	0.017	Growth rate (per year)
r	Discount rate for financial projections.	0.0365	Discount rate (per year)
beta_CO2	Cost per ton of CO₂ emissions.	40	EUR/t CO₂
beta_social	Cost per social impact unit.	40	EUR/impact
beta_ingresos	Revenue per ton transported.	150	EUR/ton
gamma_base	Base scaling factor for infrastructure costs.	1.2	Dimensionless
sigma_random	Standard deviation for synthetic noise generation.	0.02	Dimensionless
factor_emisiones	Adjustment factor for CO₂ externalities.	1.0–1.1	Dimensionless
phi_penalty	Penalty amplification factor beyond congestion threshold.	1.5	Dimensionless
omega_time	Penalty factor for inter-hub time increases.	0.05	Dimensionless

Note: All monetary figures are expressed in constant 2024 EUR. Net Present Values (NPV) have been calculated over a 20-year period, applying an annual discount rate of 3.65% and a freight demand growth rate of 1.7%.

Table A4. Functional classification of urban logistics facilities and qualitative calibration of feasibility adjustment parameter Dk.

Facility Type	Location	Size (m²)	Logistics Function	Dk
Microhub/pickup point	Dense urban	<2000 (XS–S)	Cross-docking, pickup	>1.0
Urban consolidation center	Urban periphery	2000–5000 (M–L)	Consolidation and distribution	0.4–0.7
Fulfilment center	Peri-urban	5000–30,000 (L–XL)	Storage and last-mile	0.2–0.4
XXL distribution hub	Peri-urban/rural	>30,000 (XL–XXL)	Fulfilment, cross-docking	<0.2

References

Hakimi, S.L. Optimum Locations of Switching Centers and the Absolute Centers and Medians of a Graph. Oper. Res. 1964, 12, 450–459. [Google Scholar] [CrossRef]
Gelareh, S.; Nickel, S. Hub location problems in transportation networks. Transp. Res. E Logist. Transp. Rev. 2011, 47, 1092–1111. [Google Scholar] [CrossRef]
Ernst, A.T.; Krishnamoorthy, M. Efficient algorithms for the uncapacitated single allocationp-hub median problem. Locat. Sci. 1996, 4, 139–154. [Google Scholar] [CrossRef]
Ebery, J.; Krishnamoorthy, M.; Ernst, A.; Boland, N. Capacitated multiple allocation hub location problem: Formulations and algorithms. Eur. J. Oper. Res. 2000, 120, 614–631. [Google Scholar] [CrossRef]
Ishfaq, R.; Sox, C.R. Intermodal logistics: The interplay of financial, operational and service issues. Transp. Res. E Logist. Transp. Rev. 2010, 46, 926–949. [Google Scholar] [CrossRef]
Monemi, R.N.; Gelareh, S.; Nagih, A.; Jones, D. Bi-objective load balancing multiple allocation hub location: A compromise programming approach. Ann. Oper. Res. 2021, 296, 363–406. [Google Scholar] [CrossRef]
O’Kelly, M.E. A quadratic integer program for the location of interacting hub facilities. Eur. J. Oper. Res. 1987, 32, 393–404. [Google Scholar] [CrossRef]
Campbell, J.F. Integer programming formulations of discrete hub location problems. Eur. J. Oper. Res. 1994, 72, 387–405. [Google Scholar] [CrossRef]
Skorin-Kapov, D.; Skorin-Kapov, J.; O’Kelly, M. Tight linear programming relaxations of uncapacitated p-hub median problems. Eur. J. Oper. Res. 1996, 94, 582–593. [Google Scholar] [CrossRef]
Kara, B.Y.; Tansel, B.C. The single-assignment hub covering problem: Models and linearizations. J. Oper. Res. Soc. 2003, 54, 59–64. [Google Scholar] [CrossRef]
Yaman, H. Star p-hub median problem with modular arc capacities. Comput. Oper. Res. 2008, 35, 3009–3019. [Google Scholar] [CrossRef]
Yaman, H.; Elloumi, S. Star p-hub center problem and star p-hub median problem with bounded path lengths. Comput. Oper. Res. 2012, 39, 2725–2732. [Google Scholar] [CrossRef]
da Graça Costa, M.; Captivo, M.E.; Clímaco, J. Capacitated single allocation hub location problem-A bi-criteria approach. Comput. Oper. Res. 2008, 35, 3671–3695. [Google Scholar] [CrossRef]
García, S.; Landete, M.; Marín, A. New formulation and a branch-and-cut algorithm for the multiple allocation p-hub median problem. Eur. J. Oper. Res. 2012, 220, 48–57. [Google Scholar] [CrossRef]
de Camargo, R.S.; Miranda, G.; Ferreira, R.P.M.; Luna, H.P. Multiple allocation hub-and-spoke network design under hub congestion. Comput. Oper. Res. 2009, 36, 3097–3106. [Google Scholar] [CrossRef]
Contreras, I.; Cordeau, J.F.; Laporte, G. The dynamic uncapacitated hub location problem. Transp. Sci. 2011, 45, 18–32. [Google Scholar] [CrossRef]
Setiawan, F.; Bektaş, T.; Iris, Ç. The role of hubs and economies of scale in network expansion. Omega 2025, 131, 103220. [Google Scholar] [CrossRef]
Setiawan, F.; Bektaş, T.; Iris, Ç. The hub location problem with comparisons of compact formulations: A note. Transp. Res. E Logist. Transp. Rev. 2025, 194, 103902. [Google Scholar] [CrossRef]
O’Kelly, M.E. Hub facility location with fixed costs. Pap. Reg. Sci. 1992, 71, 293–306. [Google Scholar] [CrossRef]
Rodríguez-Martín, I.; Salazar-González, J.J. Solving a capacitated hub location problem. Eur. J. Oper. Res. 2008, 184, 468–479. [Google Scholar] [CrossRef]
Campbell, J. A survey of network hub location. Stud. Locat. Anal. 1994, 6, 31–49. [Google Scholar]
Stern, N. The Economics of Climate Change: The Stern Review; Cambridge University Press: Cambridge, UK, 2007; p. 9780521877251. [Google Scholar] [CrossRef]
Janic, M. Modelling the full costs of an intermodal and road freight transport network. Transp. Res. D Transp. Environ. 2007, 12, 33–44. [Google Scholar] [CrossRef]
Braekers, K.; Janssens, G.K.; Caris, A. Review on the comparison of external costs of intermodal transport and unimodal road transport. In Proceedings of the BIVEC-GIBET Transportation Research Day 2009, Liège, Belgium, 28 May 2009. [Google Scholar]
Lemoine, O.W.; Skjoett-Larsen, T. Reconfiguration of supply chains and implications for transport: A Danish study. Int. J. Phys. Distrib. Logist. Manag. 2004, 34, 793–810. [Google Scholar] [CrossRef]
Crainic, T.G.; Laporte, G. Planning models for freight transportation. Eur. J. Oper. Res. 1997, 97, 409–438. [Google Scholar] [CrossRef]
Rushton, A.; Croucher, P.; Baker, P. The Handbook of Logistics and Distribution Management, 5th ed.; Kogan Page: London, UK, 2014; ISBN 978-0-7494-6754-9. [Google Scholar]
Stefansson, G. Collaborative logistics management and the role of third-party service providers. Int. J. Phys. Distrib. Logist. Manag. 2006, 36, 76–92. [Google Scholar] [CrossRef]
Melo, M.T.; Nickel, S.; Saldanha-da-Gama, F. Facility location and supply chain management—A review. Eur. J. Oper. Res. 2009, 196, 401–412. [Google Scholar] [CrossRef]
Alumur, S.; Kara, B.Y. Network hub location problems: The state of the art. Eur. J. Oper. Res. 2008, 190, 1–21. [Google Scholar] [CrossRef]
Farahani, R.Z.; Hekmatfar, M.; Arabani, A.B.; Nikbakhsh, E. Hub location problems: A review of models, classification, solution techniques, and applications. Comput. Ind. Eng. 2013, 64, 1096–1109. [Google Scholar] [CrossRef]
Chen, X. Theoretical Analysis of Net Present Value. BCP Bus. Manag. 2022, 30, 683–686. [Google Scholar] [CrossRef]
Reeves, C. Hybrid genetic algorithms for bin-packing and related problems. Ann. Oper. Res. 1996, 63, 371–396. [Google Scholar] [CrossRef]
Yamada, T.; Nakano, R. Genetic Algorithms for Job-Shop Scheduling Problems. In Proceedings of the Modern Heuristic for Decision Support, UNICOM Seminar, London, UK, 18–19 March 1997; pp. 67–81. [Google Scholar]
Ombuki-Berman, B.M.; Runka, A.; Hanshar, F.T. Waste collection vehicle routing problem with time windows using multi-objective genetic algorithms. In Proceedings of the 3rd IASTED International Conference on Computational Intelligence; ACTA Press: Calgary, AB, Canada, 2007. [Google Scholar]
Tsai, F.-C.; Yeh, C.-H.; Yang, C.-C. A genetic algorithm approach to the multiple criteria ABC analysis. Omega 2007, 36, 715–726. [Google Scholar] [CrossRef]
Tien, F.C.; Hsieh, K.H.; Cheng, C.Y.; Liu, C.S. Using hybrid genetic algorithms to solve discrete location allocation problems with rectilinear distance. J. Chin. Inst. Ind. Eng. 2007, 24, 1–19. [Google Scholar] [CrossRef]
Holland, J.H. Adaptation in Natural and Artificial Systems; University of Michigan Press: Ann Arbor, MI, USA, 1975; Volume 1. [Google Scholar]
Goldberg, D.E. Genetic Algorithms in Search, Optimization, and Machine Learning; Addison-Wesley: Reading, MA, USA, 1989; pp. 1–412. [Google Scholar]
Kanungo, T.; Mount, D.M.; Netanyahu, N.S.; Piatko, C.D.; Silverman, R.; Wu, A.Y. An efficient k-means clustering algorithms: Analysis and implementation. IEEE Trans. Pattern Anal. Mach. Intell. 2002, 24, 881–892. [Google Scholar] [CrossRef]
de Jong, K. Learning with Genetic Algorithms: An Overview. Mach. Learn. 1988, 3, 121–138. [Google Scholar] [CrossRef]
Ferrara, M.; Guerrini, L. The Ramsey model with logistic population growth and benthamite felicity function revisited. WSEAS Trans. Math. 2009, 8, 97–106. [Google Scholar]
Lloyd, S.P. Least Squares Quantization in PCM. IEEE Trans. Inf. Theory 1982, 28, 129–137. [Google Scholar] [CrossRef]
Blömer, J.; Lammersen, C.; Schmidt, M.; Sohler, C. Theoretical analysis of the k-means algorithm—A survey. In Algorithm Engineering: Selected Results and Surveys; Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Springer: Berlin/Heidelberg, Germany, 2016; Volume 9220. [Google Scholar] [CrossRef]
Arthur, D.; Vassilvitskii, S. K-means++: The advantages of careful seeding. In Proceedings of the Annual ACM-SIAM Symposium on Discrete Algorithms, New Orleans, LA, USA, 7–9 January 2007. [Google Scholar]
Ostrovsky, R.; Rabani, Y.; Schulman, L.J.; Swamy, C. The effectiveness of Lloyd-type methods for the k-means problem. In Proceedings of the Annual IEEE Symposium on Foundations of Computer Science, FOCS, Berkeley, CA, USA, 21–24 October 2006. [Google Scholar] [CrossRef]
Blickle, T.; Thiele, L. A Comparison of Selection Schemes Used in Genetic Algorithms, TIK Report, Version 2. 1995. Available online: https://tik-old.ee.ethz.ch/file/6c0e384dceb283cd4301339a895b72b8/TIK-Report11.pdf (accessed on 1 June 2025).
Syswerda, G. Uniform Crossover in Genetic Algorithms. In Proceedings of the Third International Conference on Genetic Algorithms, Fairfax, VA, USA, 4–7 June 1989; pp. 2–9. [Google Scholar]
Deb, K.; Goldberg, D.E. Analyzing Deception in Trap Functions. In Foundations of Genetic Algorithms; Elsevier: Amsterdam, The Netherlands, 1993. [Google Scholar] [CrossRef]
Xing, Y.X.; Wang, J.S.; Zhang, S.H.; Bao, Y.Y.; Zheng, Y.; Zhang, Y.H. Mutation transit search algorithm introducing black hole swallowing strategy to solve p-hub location allocation problem. J. Intell. Fuzzy Syst. 2023, 45, 12213–12232. [Google Scholar] [CrossRef]
Andrew, A.M. INTRODUCTION TO EVOLUTIONARY COMPUTING, by, A.E. Eiben and, J.E. Smith (Natural Computing Series), Springer, Berlin, 2003, hardback, xv + 299 pp., ISBN 3-540-40184-9 (£30.00). Robotica 2004, 22, 345–349. [Google Scholar] [CrossRef]
Yamada, T.; Nakano, R. Proceedings of Modern Heuristic for Decision Support. 2000. Available online: https://www.kecl.ntt.co.jp/as/members/yamada/unicom.pdf (accessed on 1 June 2025).
Contreras, I.; Díaz, J.A.; Fernández, E. Lagrangean relaxation for the capacitated hub location problem with single assignment. OR Spectr. 2009, 31, 483–505. [Google Scholar] [CrossRef]
Chen, X.; Wei, C.; Xin, Z.; Zhao, J.; Xian, J. Ship Detection under Low-Visibility Weather Interference via an Ensemble Generative Adversarial Network. J. Mar. Sci. Eng. 2023, 11, 2065. [Google Scholar] [CrossRef]
Li, S.; Wu, W.; Ma, X.; Zhong, M.; Safdar, M. Modelling medium- and long-term purchasing plans for environment-orientated container trucks: A case study of Yangtze River port. Transp. Saf. Environ. 2023, 5, tdac043. [Google Scholar] [CrossRef]

Figure 1. Cost–distance relationship for tuck vs. intermodal transport, adapted from [23].

Figure 2. The evolutionary process of the proposed Genetic Algorithm.

Figure 3. Benchmark results using CAB Dataset (USA). Cases 15d2, 20d4, and 25d4. Hubs location; Flow heatmap; Probability density function; Solution diversity [4].

Figure 4. Left: Freight transport structure. Source: OTLE Annual Report. INECO, 2023. Right: Node locations in Spain. Rail freight motorways (Red, Blue, and Green Colors).

Figure 5. Minimum-cost hub configurations with NPV_social ≥ 0. Each row corresponds to a fixed number of active hubs (4, 6, 8, 10, or 12), while each column contrasts the effect of different transshipment discount factors (α = 0.2 on the left, α = 1.0 on the right). Red dots indicate selected hubs, and blue lines denote the catchment connections from each origin to its assigned hub.

Figure 6. Strategic solution space explorer for intermodal freight hub planning.

Figure 7. Trade-off between logistics cost efficiency and social return across hub configurations (α = 0.2–1.0).

Figure 8. Economic efficiency as a function of the number of hubs.

Figure 9. (a) CMAHLP solution with 6 hubs. The left panel shows the dominant freight assignment from origin to hub, with red nodes denoting selected hubs and blue lines indicating main routing paths. The right panel displays a heatmap of hub utilization, where total processed flow highlights operational demand across the selected nodes. (b) CMAHLP solution with 8 hubs. Additional hubs improve accessibility and intermodal connectivity in traditionally underserved areas. (c) CMAHLP solution with 12 hubs. The network expands into peripheral zones with strategic potential, supporting long-term cohesion and Euro–Maghreb connectivity.

Figure 10. Classification of hubs based on their economic efficiency vs. hub selection frequency.

Table 1. CMAHLP comparative framework overview.

Author	Allocat	Solution
O’Kelly [7]	Single	MILP
Campbell [8]	Multiple	MILP
Ebery et al. [4]	Multiple	MILP
Camargo et al. [15]	Multiple	MILP
Contreras et al. [16]	Multiple	MILP (dynamic)
García et al. [14]	Multiple	MILP + Branch and Cut
Monemi et al. [6]	Multiple	NSGA-II metaheuristic
Setiawan et al. [17]	Multiple	CPLEX
Moyano et al. (2025)	Multiple	GA (guided mutation, elitism, and local search)

Legend:

= Included or fully developed, Jmse 13 01301 i003

= Partially considered or simplified, Jmse 13 01301 i001

= Not included.

Table 2. CAB Dataset: Results of [4] and Genetic Algorithm (2025).

Case	α	Cost Ebery (1)	Hubs Ebery	Elapse Time Ebery (s)	Cost Genetic A. (2)	Hubs Genetic A.	Elapse Time GA (s)	$G A P = \frac{{C o s t}_{G A} - {C o s t}_{E b e r y}}{{C o s t}_{e b e r y}}$
10d2	0.2	1.224	7-9	2.94	1.226	6-7	3.41	+0.16
10d2	0.4	1184	7-9	2.27	1.184	7-9	3.20	+0.00
10d2	1.0	969	7-9	2.30	971	7-9	4.05	+0.20
10d4	0.2	888	3-4-7-8	2.08	895	3-7-8-9	21.69	+0.78
10d4	0.4	889	3-4-7	2.56	921	3-7-9	14.50	+3.60
10d4	1.0	839	3-4-7	1.60	855	7-8-9	13.60	+1.90
15d2	0.2	1.776	4-12	16.81	1.736	5-12	5.75	−2.25
15d2	0.4	1.720	4-12	12.89	1.681	4-12	3.56	−2.26
15d2	1.0	1.509	4-12	14.60	1.460	4-12	5.67	−3.25
20d4	0.2	1.319	4-12-17	75.40	1.351	4-12-17	6.07	+2.42
20d4	0.4	1.319	4-12-17	71.62	1.350	4-12-17	8.44	+2.35
20d4	1.0	1.243	4-12-17	42.87	1.270	4-12-17	15.25	+2.17
25d2	0.2	1.945	17-22	493.77	1.896	22-25	7.27	−2.51
25d2	0.4	1.919	12-17	460.35	1.865	12-25	7.20	−2.81
25d2	1	1.670	12-25	386.50	1.608	12-20	8.00	−3.71
25d4	0.2	1.510	4-12-17	502.31	1.504	12-17-21	10.47	−0.40
25d4	0.4	1.517	4-12-17	389.95	1.515	12-17-21	9.59	−0.13
25d4	1	1.428	4-12-17	113.91	1.421	12-17-21	10.58	−0.50

Table 3. Genetic algorithm results for the intermodal transport network in Spain with 2023 freight traffic. Summary of minimum-cost hub configurations with NPV_social ≥ 0 for discount factors α = 0.2 and α = 1.0.

Case	α	Num Hub	Cost (EUR/t)	NPS_social (mEUR)	Economic Efficiency	Solution Entropy	City Hubs	Hubs IDs
11	0.2	4	374	2.428	6.49	1.35	Zaragoza–Madrid–Jaén–León	14, 15, 44, 18
23	0.2	6	269	5.486	20.41	3.07	Madrid–Valencia–Antequera–Vitoria–Barcelona–Lugo	15, 38, 25, 7, 32, 2
25	0.2	8	232	9.157	40.00	4.55	Madrid–Barcelona–Antequera–Valencia–Palencia–Bilbao–Cartagena–Pontevedra	15, 32, 25, 38, 19, 9, 47, 4
87	0.2	10	209	11.210	52.63	5.38	Madrid–Barcelona–Vitora–Palencia–Antequera–Alicante–Orense–Castellón–Huelva–Cacerés	15, 32, 7, 19, 25, 36, 3, 37, 43, 31
89	0.2	12	190	18.059	90.91	6.52	Madrid–Valencia–Barcelona–Cartagena–Antequera–San Sebastián–Córdoba–León–Burgos–Lugo–Bilbao–Cáceres	15, 38, 32, 47, 25, 8, 41, 18, 17, 2, 9, 31
22	1	4	402	2.429	6.06	0.99	Zaragoza– Madrid– Jaén–León	14, 15, 44, 18
24	1	6	316	5.922	18.87	2.63	Valencia–Madrid–León–Málaga–Barcelona–Pamplona	38, 15, 18, 45, 32, 10
56	1	8	281	9.935	35.71	2.97	Madrid–Logroño–Valencia–Barcelona–Sevilla–Cartagena–Lugo–Málaga	15, 11, 38, 32, 46, 47, 2, 45
58	1	10	261	13.530	52.63	5.21	Madrid–Barcelona–Córdoba–Valencia–Valladolid–Bilbao–Alicante–Huelva–Zaragoza–Pontevedra	15, 32, 41, 38, 23, 9, 36, 43, 14, 4
50	1	12	251	17.016	66.67	7.11	Madrid–Barcelona–Logorño–Valencia–Antequera–Palencia–Alicante–Orense–Cordoba–Zaragoza–Bajadoz–Zamora	15, 32, 11, 38, 25, 19, 36, 3, 41, 14, 30, 24

Table 4. Synthesis of key findings and cross-border applicability.

Key Finding (Spain)	Relevance for EU (Green Deal/TEN-T)	Application in Other Countries
Optimal Hub count: 8–12	Supports efficiency in TEN-T Core Network Corridors. Enables phased investment aligned with Green Deal modal shift priorities.	Italy: Strategic integration of Gioia Tauro and Trieste with inland hubs along the Scandinavian–Mediterranean Corridor.
NPV_social > 0 as feasibility threshold	Aligns with the CEF Transport Program and Next Generation EU by applying socioeconomic viability filters to infrastructure planning.	Portugal: Cost–benefit validation of the Sines–Lisbon–Madrid rail axis, bridging the Atlantic and Mediterranean corridors.
Intermodality favors distributed networks	Encourages intermodal logistics and rail prioritization under the Fit for 55 and EU ETS emissions reduction framework.	Greece: Support optimization of the Piraeus–Thessaloniki corridor by reinforcing decentralization with scalable hub incentives.
Logistic cost benchmark: EUR232/t (8 hubs)	Offers reference points for carbon-efficient freight flows and pricing under EU Climate Policy targets.	France: Enables replicability in the Marseille–Lyon–Rhône-Alpes rail corridor, linking with the Mediterranean TEN-T spine.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Retamero, J.M.; Orive, A.C. Optimizing Intermodal Port–Inland Hub Systems in Spain: A Capacitated Multiple-Allocation Model for Strategic and Sustainable Freight Planning. J. Mar. Sci. Eng. 2025, 13, 1301. https://doi.org/10.3390/jmse13071301

AMA Style

Retamero JM, Orive AC. Optimizing Intermodal Port–Inland Hub Systems in Spain: A Capacitated Multiple-Allocation Model for Strategic and Sustainable Freight Planning. Journal of Marine Science and Engineering. 2025; 13(7):1301. https://doi.org/10.3390/jmse13071301

Chicago/Turabian Style

Retamero, José Moyano, and Alberto Camarero Orive. 2025. "Optimizing Intermodal Port–Inland Hub Systems in Spain: A Capacitated Multiple-Allocation Model for Strategic and Sustainable Freight Planning" Journal of Marine Science and Engineering 13, no. 7: 1301. https://doi.org/10.3390/jmse13071301

APA Style

Retamero, J. M., & Orive, A. C. (2025). Optimizing Intermodal Port–Inland Hub Systems in Spain: A Capacitated Multiple-Allocation Model for Strategic and Sustainable Freight Planning. Journal of Marine Science and Engineering, 13(7), 1301. https://doi.org/10.3390/jmse13071301

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Optimizing Intermodal Port–Inland Hub Systems in Spain: A Capacitated Multiple-Allocation Model for Strategic and Sustainable Freight Planning

Abstract

1. Introduction

2. Literature Review: The Capacitated Multiple-Allocation Hub Location Problem (CMAHLP)

2.1. From Single-Allocation to Multiple-Allocation Hub Models

2.2. Incorporating Capacity and Investment Constraints

2.3. Realism Through Congestion and Operating Costs

2.4. Dynamic and Metaheuristic CMAHLP Formulations: Limits and Developments

2.5. Recent Contributions and Computational Benchmarks

2.6. Summary and Rationale

3. Advancing CMAHLP for Maritime–Terrestrial Logistics: A Multi-Criteria and Sustainability-Based Perspective

3.1. Methodological Enhancements to Classical CMAHLP Models

3.2. Intermodal Freight Cost Formulation for Inland and Port-Connected Nodes

3.3. Dynamic Operating Costs ( $I_{k}$ )

3.4. Investment Costs ( $F_{k}$ )

3.5. Analysis of Environmental and Social Aspects (NPV_social)

3.6. Genetic Algorithm for Solving the CMAHLP with Sustainability Constraints

3.7. Validation and Comparison of the Proposed Genetic Model Against the Ebery et al. Model

3.8. Results of the Proposed Model

4. Conclusions

5. Future Research and Model Improvements

Supplementary Materials

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A. Variable Values

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Optimizing Intermodal Port–Inland Hub Systems in Spain: A Capacitated Multiple-Allocation Model for Strategic and Sustainable Freight Planning

Abstract

1. Introduction

2. Literature Review: The Capacitated Multiple-Allocation Hub Location Problem (CMAHLP)

2.1. From Single-Allocation to Multiple-Allocation Hub Models

2.2. Incorporating Capacity and Investment Constraints

2.3. Realism Through Congestion and Operating Costs

2.4. Dynamic and Metaheuristic CMAHLP Formulations: Limits and Developments

2.5. Recent Contributions and Computational Benchmarks

2.6. Summary and Rationale

3. Advancing CMAHLP for Maritime–Terrestrial Logistics: A Multi-Criteria and Sustainability-Based Perspective

3.1. Methodological Enhancements to Classical CMAHLP Models

3.2. Intermodal Freight Cost Formulation for Inland and Port-Connected Nodes

3.3. Dynamic Operating Costs ( I k )

3.4. Investment Costs ( F k )

3.5. Analysis of Environmental and Social Aspects (NPVsocial)

3.6. Genetic Algorithm for Solving the CMAHLP with Sustainability Constraints

3.7. Validation and Comparison of the Proposed Genetic Model Against the Ebery et al. Model

3.8. Results of the Proposed Model

4. Conclusions

5. Future Research and Model Improvements

Supplementary Materials

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A. Variable Values

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

3.3. Dynamic Operating Costs ( $I_{k}$ )

3.4. Investment Costs ( $F_{k}$ )

3.5. Analysis of Environmental and Social Aspects (NPV_social)