1. Introduction
Multimodal freight transportation networks are the critical backbone of modern economies, enabling the efficient movement of goods across local, national, and international supply chains. These networks support industrial production, trade, and consumption by integrating multiple transportation modes, such as roadways, railways, waterways, and airways, into interconnected freight corridors. A multimodal freight transportation network involves the coordinated use of two or more transportation modes to move goods from origin to destination, emphasizing efficient modal transitions, standardized containers, and transfer facilities [
1]. In freight transportation research, the terms “multimodal” and “intermodal transport” describe related but distinct concepts. Multimodal freight transport refers to the movement of goods using two or more transportation modes within an integrated transport chain, whereas intermodal transport represents a specific operational form in which each leg of the journey is managed by different carriers under separate contracts, while the freight unit remains unchanged [
2]. Although intermodal systems are widely applied in practice, particularly in rail–truck corridors, this study adopts the broader term multimodal freight transportation to encompass interactions across all connected modes and enable a comprehensive assessment of network resilience at the corridor level.
The United States hosts the world’s largest freight transportation system in terms of traffic volume, transporting goods valued at over
$14 billion annually [
3]. Highways, railways, ports, and airports form the core infrastructure supporting multimodal freight movement, with rail–truck multimodal transport accounting for approximately 75% of the total multimodal freight ton-miles [
3]. Multimodal freight transport offers significant advantages over single-mode transport, including reduced costs, improved logistics efficiency, lower congestion, and decreased environmental impact [
4]. Consequently, freight shippers have increasingly adopted multimodal solutions to enhance system flexibility and performance.
Beyond economic efficiency, the integration of multiple transportation modes is crucial for enhancing the resilience of freight networks. By enabling alternative routing and modal substitution, multimodal freight corridors can adapt better to changing operational conditions and disruptive events. However, this interdependence among modes also introduces complexity, as disruptions affecting one mode can propagate across the entire network. The infrastructure supporting multimodal freight corridors, including highways, railways, waterways, ports, airports, and intermodal terminals, forms a tightly coupled system in which the functionality of individual components is mutually dependent [
5].
Freight transportation networks are increasingly exposed to a wide range of disruptions, including natural hazards (e.g., earthquakes, floods, and hurricanes), operational failures, congestion, infrastructure deterioration, and human-induced events [
6,
7]. Recent large-scale disruptions, such as the COVID-19 pandemic, have demonstrated the vulnerability of freight systems, with passenger traffic declining and freight demand increasing owing to the rapid expansion of e-commerce. Infrastructure failure further illustrates this vulnerability. For example, the collapse of the Francis Scott Key Bridge in Baltimore, Maryland (see
Figure 1), resulted in substantial traffic redistribution across the regional network, increased truck volumes on alternative corridors, longer diversion routes, particularly for hazardous materials, and heightened congestion and safety risks [
8,
9]. These events underscore the need for freight networks that can withstand disruptions, maintain acceptable service levels, and recover efficiently. Given the critical role of freight transportation in economic stability and supply chain continuity, resilience has emerged as a key performance attribute of multimodal freight network.
Although a growing body of literature has examined transportation network resilience, research specifically addressing disruptions in multimodal freight transportation networks remains fragmented and inconsistent. Existing studies employ diverse analytical and computational techniques, represent disruptions using heterogeneous assumptions (e.g., node/link failures, capacity degradation, and demand shocks), and vary considerably in their capture of intermodal interdependencies, cascading effects, and recovery dynamics. In addition, validation and empirical testing practices differ widely across studies, which limits the comparability and transferability of findings across multimodal freight corridors. Accordingly, this systematic review seeks to answer the following research questions:
RQ1: What analytical and computational modeling techniques have been applied to study disruptions in multimodal freight networks?
RQ2: To what extent do existing models dynamically represent cross-modal interdependencies, cascading failures, and recovery processes?
RQ3: What validation, calibration, and empirical testing strategies are employed, and how robust are these approaches?
To address these questions, this study presents a systematic review of resilience modeling in multimodal freight transportation networks. Before reviewing the modeling approaches and resilience assessment methods, it is essential to establish how resilience is defined in the context of freight transportation networks. Existing studies adopt a wide range of definitions, reflecting different system boundaries, analytical objectives and disruption contexts. Therefore, the following subsection reviews the definitions of resilience used in the literature to provide a common conceptual foundation for subsequent analysis.
Definitions of Resilience in Transportation Systems
The term resilience originates from the Latin word resiliere, which means to rebound or spring back. Following the influential work of Holling [
10], who introduced resilience in the context of ecological systems and distinguished it from system stability, the concept has been widely adopted across multiple disciplines, including organizational systems [
11], economics [
12], social sciences [
13], supply chains [
14,
15,
16,
17] and engineering. Within the engineering domain, particularly in transportation systems, resilience has received increasing attention in response to the growing frequency, scale, and complexity of disruptive events.
In transportation research, resilience has been examined across multiple modes, including aviation [
18], roadway networks [
19,
20,
21], waterborne transport systems [
22,
23], and railway networks [
24,
25]. Despite this growing body of literature, no single, universally accepted definition of transportation resilience has been established. The definitions vary depending on the transportation mode, system scale, disruption type, and performance objectives. Nevertheless, a widely accepted and overarching definition is provided by the United Nations Office for Disaster Risk Reduction (UNDRR) [
26], which defines resilience as the ability of a system, community, or society exposed to hazards to resist, absorb, accommodate, adapt to, and recover from the effects of a hazard in a timely and efficient manner while preserving and restoring essential structures and functions. This definition emphasizes resilience as a dynamic process rather than as a static system property. Building on this general understanding of resilience,
Table 1 synthesizes the definitions of resilience that have been specifically developed for multimodal freight transportation networks.
Resilience in transportation networks is inherently time-dependent because disruption impacts and recovery processes evolve over time rather than occurring instantaneously. A widely adopted way to conceptualize this behavior is a performance–time representation, where system functionality is tracked before, during, and after a disruptive event.
Figure 2 illustrates this relationship for freight transportation systems.
In
Figure 2,
denotes system performance (functionality) at time
t and
is the baseline performance level at the reference time
(stable original state,
). A disruptive event, denoted by
e, occurs at time
and triggers a decline in performance governed by system vulnerability, until the disrupted state is reached at time
with performance level
(disrupted state,
). The lowest portion of the curve represents the disrupted steady state, where network performance and serviceability are most constrained. Recovery is then reflected by the increasing segment of
after
, with the system entering the recovery phase around
and reaching the recovered state at time
with performance level
(stable recovered state,
). In this framework, vulnerability is captured by the magnitude of performance loss following
e (from
toward
), whereas recoverability is reflected by the rate of performance restoration during the recovery interval (from
to
).
Within this performance–time framework, vulnerability describes the susceptibility of a freight transportation network to disruptions that result in significant performance loss, capturing both the magnitude of degradation immediately following the event and the system’s exposure to disturbance impacts [
36,
37]. In multimodal freight networks, vulnerability is often driven by failures of critical components (e.g., terminals, hubs, intermodal transfer facilities, or high-betweenness links), where the loss of capacity or connectivity can propagate through tightly coupled mode-transfer operations and reduce system-wide serviceability. Vulnerability can be amplified when redundancy is limited (i.e., few alternative routes or terminals are available), when capacity is highly concentrated at specific nodes, or when modal substitution is operationally constrained by schedule coordination, transfer times, or terminal handling limits. From a planning perspective, vulnerability therefore reflects not only the immediate performance drop but also the extent to which disruptions trigger broader bottlenecks and operational spillovers across modes.
Recoverability represents the ability of a freight transportation network to restore performance after reaching the disrupted steady state; as illustrated in
Figure 2, higher recoverability corresponds to a steeper recovery slope and faster restoration of freight movement and service levels, thereby reducing the duration and severity of disruption impacts [
38]. In practice, recoverability depends on the availability and effectiveness of response actions such as rerouting, mode switching, temporary capacity expansion, resource reallocation, and repair/restoration scheduling. It is also influenced by organizational readiness and coordination among stakeholders (e.g., carriers, terminal operators, and public agencies), which affects how quickly recovery actions can be implemented. Consequently, the same initial disruption may yield markedly different recovery trajectories depending on operational flexibility, resource constraints, and the degree of intermodal coordination, highlighting the importance of modeling both vulnerability and recoverability when assessing resilience in multimodal freight systems.
Figure 2.
System performance related to disruptive event [
35,
39].
Figure 2.
System performance related to disruptive event [
35,
39].
2. Methods
The methodology employed in this systematic review followed the PRISMA guidelines [
40] to ensure transparency, reproducibility and consistency throughout the review process. A systematic review approach was adopted to synthesize peer-reviewed evidence on disruption modeling in multimodal freight transportation networks, motivated by the increasing frequency of disruptive events and rapid growth of methodological contributions in this area. In line with the study objectives and research questions, the review focused on three elements extracted from each included record: (i) the analytical and computational techniques used to model disruptions (e.g., optimization, simulation, network science, machine learning, and hybrid frameworks), (ii) the extent to which models represent cross-modal interdependencies, cascading effects, and recovery dynamics, and (iii) the empirical testing practices used to support model robustness (e.g., uncertainty representation, calibration, validation type, and use of observed disruption events).
2.1. Search Strategy
To ensure comprehensive and diverse literature coverage, four major academic databases were systematically searched: Web of Science, Scopus, ScienceDirect, and TRID (Transportation Research International Documentation). Web of Science and Scopus were selected as the primary multidisciplinary indexing databases because of their extensive coverage of peer-reviewed journal articles and citation records across engineering, transportation, and infrastructure research. Scopus, in particular, provides broad international coverage and advanced indexing of technical and applied research, enabling comprehensive retrieval of studies related to multimodal freight transportation and disruption modeling.
ScienceDirect was included as a complementary full-text platform to reduce the likelihood of missing relevant Elsevier-hosted studies in transportation, civil engineering, and infrastructure systems that may be inconsistently indexed across databases. TRID was incorporated as a transportation-specific database because it indexes a wide range of transportation research outputs, including Transportation Research Record and related transportation publications. Overlap across sources was expected, and duplicate records were removed during the screening process. The combined use of these databases ensured broad disciplinary coverage while maintaining a transparent and reproducible search process. Other databases were not searched as standalone databases because most transportation journals relevant to multimodal freight disruption modeling are indexed in Web of Science and Scopus, and preliminary scoping indicated limited incremental yield relative to added duplication. Google Scholar was not used due to limited reproducibility of searches and a high proportion of duplicate and non-peer-reviewed records.
Specific keywords were selected to target studies that are highly relevant to multimodal freight transportation network resilience. The search strategy was structured around four main categories of terms: (1) transportation modes, (2) disruptive events, (3) network and system characteristics, and (4) resilience-related concepts. Transportation mode keywords included roadway, railway, waterway, airway, multimodal, intermodal, trucks, and freight transportation to capture the diverse modes and operational contexts of freight systems. Disruption-related terms, such as disruption, delay, natural disasters, hurricanes, earthquakes, landslides, and manmade disasters, were incorporated to reflect a broad spectrum of hazard scenarios affecting multimodal freight networks. To represent network characteristics and system performance dimensions, terms such as recovery, mobility, survivability, sustainability, vulnerability, resourcefulness, connectivity, and supply chain were used. Additional contextual terms such as economy, rural, agriculture, manufacturing, industry, stakeholders, and freight networks were included to ensure coverage of studies addressing infrastructure impacts and broader freight system interactions.
To further enhance relevance, resilience-focused terms including resilience, resiliency, resilient, redundancy, robustness, and rapidity were required to appear in the title or abstract in combination with freight- and multimodality-related terms. Freight-related terms included freight, cargo, goods movement, logistics, and supply chain, while multimodality terms included multimodal, intermodal, multi-modal, multi-mode, and mode shift. The Boolean structure followed the form (resilience terms) AND (freight terms) AND (multimodal terms), with database-specific syntax applied as needed. The resulting database-specific query strings are summarized in
Table 2. The search was limited to English-language publications published between January 2015 and December 2025 to ensure contemporary relevance and methodological consistency of the included evidence. The start year (2015) was selected to capture recent methodological developments and contemporary data environments in multimodal freight resilience research, including broader adoption of large-scale network datasets, increased computational capability for robust/stochastic optimization and simulation, and the emergence of data-driven and hybrid modeling approaches. This time window also improves comparability across studies by focusing on more consistent reporting practices and disruption contexts in the post-2015 literature.
2.2. Screening Strategy
The database search yielded 185 records across four major databases: 62 from the Web of Science, 42 from Scopus, 20 from ScienceDirect, and 61 from the Transportation Research Information Database (TRID). Following the initial retrieval, 65 duplicate records were identified and removed from the study. This resulted in 120 unique records remaining for screening. A two-stage screening process was used. In the first stage, titles and abstracts were reviewed to assess their relevance to the research objectives and predefined inclusion criteria. A total of 75 records were excluded during this stage because of a lack of relevance to multimodal freight transportation network resilience or the absence of modeling and disruption-related analysis. Consequently, 45 studies were included in the full-text assessment.
In the second stage, the full texts of the 45 potentially eligible studies were examined in detail. Studies were included if they: (1) investigated disruptions and resilience-related outcomes in the context of multimodal (intermodal) freight transportation networks; (2) explicitly modeled at least two freight transportation modes and their interaction through terminals, transfers, or shared infrastructure; (3) defined a quantitative evaluation basis, such as resilience indices/proxies (e.g., robustness, service level, time-to-recovery, vulnerability), and/or model performance metrics (e.g., cost, delay, throughput, unmet demand, reliability); and (4) applied the approach to a freight network representation using either empirical data (e.g., public or proprietary datasets) or a clearly specified simulated/synthetic network instance. Editorials, commentaries, news reports, opinion pieces, and non-peer-reviewed preprints were excluded to ensure methodological rigor and reliability of the results. Following full-text screening, 24 studies were excluded for being out of scope, primarily due to single-mode focus, absence of resilience modeling, or lack of network-level analysis. Ultimately, 21 studies met the inclusion criteria and were included in the final review. Although the final synthesis includes 21 studies, this number reflects the specificity of the review scope and inclusion criteria rather than a lack of research activity. We intentionally focused on studies that (i) model disruptions in multimodal/intermodal freight transportation networks and (ii) provide sufficient quantitative detail to be systematically coded (e.g., explicit disruption representation, performance/resilience metrics, and evaluation/validation strategy).
A substantial portion of the broader resilience literature was excluded because it is single-mode, conceptual, supply-chain oriented without an explicit network model or does not report the methodological elements required for consistent cross-study comparison. In particular, many borderline records discussed multimodal or intermodal freight at a conceptual level but did not explicitly model cross-modal transfers (e.g., terminal/transfer representation) or quantify disruption impacts using network-based performance or resilience measures. Other borderline studies modeled disruptions quantitatively but were limited to a single mode (e.g., road-only or rail-only), which prevents consistent assessment of cross-modal interdependencies and mode-switching mechanisms central to RQ2. While relaxing the multimodality requirement would increase the number of included studies, it would reduce methodological comparability and dilute the ability to synthesize cross-modal interaction and recovery mechanisms; therefore, we retained a strict multimodal/intermodal criterion and note this scope decision as a trade-off.
Importantly, the included studies span multiple regions and modeling paradigms (optimization, simulation, network science, machine learning, and hybrid frameworks) and cover diverse disruption types, providing adequate breadth to identify dominant practices and recurring methodological gaps. Nevertheless, we acknowledge the sample size as a limitation and expect that periodic updates will be beneficial as the multimodal freight resilience literature continues to expand. The full-text studies that met the inclusion criteria were subsequently coded using a standardized data extraction protocol described in the following subsection.
Figure 3 presents the PRISMA flow diagram summarizing the identification, screening, eligibility, and inclusion processes adopted in this study.
2.3. Data Extraction and Coding Protocol
After final inclusion, all 21 full-text studies were systematically coded using a standardized data extraction matrix aligned with the research questions (RQ1–RQ3). For each study, the extracted fields included: (i) bibliographic information (publication year and study region), (ii) network and modal scope (modes included and multimodal context), (iii) disruption representation (disruption type and how disruptions were modeled, such as node failure, link failure, capacity reduction, and demand shock), and (iv) modeling approach characteristics (primary modeling paradigm and, where applicable, optimization and simulation types).
To address RQ2, studies were additionally coded for the presence of cascading or interdependent failure representations, temporal representation (static, multi-period, or operational/continuous-time), and whether recovery actions were explicitly modeled. To address RQ3, we extracted evaluation and robustness information, including uncertainty representation, empirical data usage and data sources, calibration and validation type, whether a real disruption event was modeled, and the performance and resilience metrics reported. The coding matrix was used to generate the synthesis tables presented in the Results and Discussions section.
4. Conclusions
This systematic review examined disruption modeling in multimodal freight transportation networks through three questions: (RQ1) analytical and computational techniques; (RQ2) the representation of interdependencies, cascading processes, and recovery dynamics; and (RQ3) validation, calibration, and empirical testing practices. Although the reviewed studies demonstrate substantial methodological innovation, the evidence base remains fragmented, and several recurring gaps limit generalizability, comparability, and operational transferability. The discussion below synthesizes the key gaps and outlines a sequenced research agenda that moves from near-term improvements that are straightforward to implement to longer-term developments requiring richer data and more complex modeling.
A first and foundational gap is the lack of standardization in how disruptions, performance, and resilience are specified and reported, including how conceptual definitions are translated into operational measures. While
Table 1 synthesizes resilience definitions spanning infrastructure, user, and organizational perspectives, the reviewed studies often operationalize resilience using a narrower set of quantitative proxies (e.g., cost/time increases, throughput loss, unmet demand, or service-level constraints), with limited consistency in linking these measures back to an explicit definition. Many studies represent disruptions as exogenous shocks via node/link removals or capacity reductions, but disruption severity, spatial footprint, and duration are rarely documented in a consistent way, making cross-study comparisons difficult. In addition, demand shocks are underrepresented relative to capacity and connectivity degradation, despite their practical importance during major disruptions. Future studies would benefit from reporting a minimum disruption specification (hazard/trigger type, affected assets, spatial extent, duration, and severity) and from explicitly distinguishing baseline performance, disruption impact, and recovery outcomes. Clearer reporting standards that state the adopted resilience definition and justify the selected operational metric(s)—together with transparent parameter sources, even when literature-based—would strengthen comparability, transparency, and replication.
A second gap concerns evidence strength and validation practice. While most studies rely on real network data or realistic network structures, formal calibration procedures are often not reported outside machine learning contexts, and validation is dominated by case study demonstrations and sensitivity tests. Historical testing against observed disruption events remains uncommon, which limits confidence in external validity and complicates technology transfer. Addressing this gap requires more systematic evaluation designs, including explicit calibration documentation, held-out testing where feasible, consistent sensitivity analysis protocols, and benchmarking against observed disruptions when data are available. Community benchmark datasets for intermodal disruptions (with documented assumptions and standardized outputs) would substantially improve reproducibility and allow fair method-to-method comparisons.
A third gap is the limited modeling of propagation mechanisms and cascading effects. Despite frequent acknowledgement that disruptions propagate through intermodal systems, explicit representations of operational delay propagation (e.g., missed connections), spatially correlated multi-component hazards, and overload cascades remain relatively rare. The dominant single-event framing can underestimate systemic risk in tightly coupled intermodal networks where congestion spillovers, terminal queues, and service cancellations amplify initial shocks. Future work should advance models that represent propagation more explicitly, including congestion-mediated cascades (assignment under degraded capacity), schedule-based cascades (connection feasibility and dwell-time dynamics), and multi-layer dependency cascades driven by shared infrastructure and terminal coupling.
A fourth gap relates to recovery modeling. Many studies include recovery actions in a decision-based sense (rerouting, transshipment, mode switching, capacity rental, or repair choices), but comparatively few represent recovery as a time-evolving restoration process under realistic constraints. As a result, resilience is often measured using robustness-style performance degradation rather than recovery trajectories and time-to-recovery behavior. Future research should incorporate restoration dynamics, resource constraints (crews, budgets, access), and restoration sequencing to enable recovery-curve assessment and to support more actionable restoration planning. Broader adoption of recovery-trajectory metrics would also improve alignment between conceptual resilience definitions and how resilience is quantified in practice.
A fifth gap involves uncertainty quantification. Scenario-based analysis is prevalent, but probabilistic grounding and systematic uncertainty propagation are less common, and robust formulations with decision guarantees are still limited. This matters because disruption likelihood and impact are often uncertain, especially for low-frequency/high-consequence hazards and compounding events. Future work should calibrate scenario probabilities when data permit, propagate uncertainty through performance measures (including terminal throughput and queueing), and benchmark scenario-based solutions against probabilistic and robust counterparts to quantify the trade-offs between expected performance and worst-case protection.
Finally, the evidence base is skewed toward U.S. case studies and road–rail intermodal systems, with fewer studies incorporating inland waterways, maritime layers, or air freight. This concentration likely reflects data availability and the operational prominence of rail–truck terminals, but it constrains transferability to regions with different modal shares, port and inland-waterway integration, and regulatory or cross-border operating contexts. Future work should broaden modal coverage—particularly maritime and inland waterways—and test transferability by replicating methods across regions using standardized benchmark scenarios and comparable reporting frameworks.
Despite these limitations, the reviewed studies provide actionable insights for research and practice. Optimization-based models support prescriptive disruption response and recovery decisions, network-science approaches enable rapid criticality screening when detailed flow data are limited, and simulation and data-driven models improve operational realism for real-time management and early warning. Strengthening disruption specification, empirical evaluation, propagation modeling, recovery dynamics, uncertainty treatment, and transferability testing will bridge the gap between academic prototypes and decision-relevant tools for agencies and freight operators.
In summary, the literature provides a strong methodological foundation for analyzing disruptions in multimodal freight networks. However, advancing the field will require more consistent reporting standards, stronger empirical validation and benchmarking, broader adoption of dynamic propagation and recovery trajectory modeling, improved uncertainty quantification, and systematic transferability testing across modal compositions and governance contexts.