1. Introduction
Driven by the United Nations’ Net-Zero Coalition to cut greenhouse gas emissions to near-zero levels [
1], the global transportation sector is transitioning from fossil fuel dependence to zero-emission technologies. For example, in the maritime sector, the International Maritime Organization (IMO) has established decarbonization targets to reduce greenhouse gas emissions from ships. This strategy mandates a reduction in the carbon intensity of international shipping by an average of 40 percent by 2030 and 70 percent by 2040, while aiming to reach net-zero greenhouse gas emissions by or around 2050 [
2,
3]. These net-zero regulations represent significant policy shifts in the maritime sector and are driving the rapid adoption of alternative fuels, including hydrogen, ammonia, methanol, battery-electric, and fuel cell systems, across the shipping industry [
4]. Similar decarbonization policies are reshaping the automotive and multimodal transportation sectors, where electric vehicles and hydrogen fuel cell vehicles are emerging as viable alternatives to conventional internal combustion engines [
5].
However, the widespread transition to zero-emission transportation systems presents new technical challenges [
6]. Each alternative fuel technology carries distinct hazards and risk profiles that differ substantially from those of conventional hydrocarbon fuels [
7]. For instance, hydrogen presents risks associated with rapid flame propagation and invisible flame characteristics [
8,
9,
10], ammonia exhibits toxic hazards and corrosive properties [
11,
12,
13], lithium-ion battery systems face thermal runaway and fire propagation challenges [
14,
15], and methanol presents flammability and toxicity concerns [
16,
17].
Risks emerge alongside the operational deployment of zero-emission vessels such as the liquefied hydrogen carrier Suiso Frontier [
18] and the liquid hydrogen-fueled ferry MF Hydra [
19]. The urgency of addressing these hazards is evidenced by recent major accidents. In 2022, the car carrier Felicity Ace sank after a fire involving lithium-ion battery electric vehicles, which proved difficult to extinguish using conventional methods due to thermal runaway [
20]. Similarly, in 2019, the hybrid ferry Ytterøyningen experienced a battery room explosion, highlighting the severe risks of gas buildup and thermal propagation in enclosed maritime spaces [
21]. These incidents serve as a warning that the introduction of alternative fuels creates a complex and evolving risk landscape that the industry is struggling to manage [
6,
22].
Beyond individual vessel accidents, the potential for large-scale disasters in shared infrastructure is a significant concern when considering the specific hazards of fuels like ammonia [
23]. Unlike conventional hydrocarbons, ammonia poses a severe toxicity risk. For instance, a leakage of liquid ammonia at a refrigeration plant in Shanghai (2013) resulted in 15 fatalities [
24,
25], and a pipe failure in Alabama (2010) affected over 800 people in the vicinity [
26]. While these incidents occurred in non-transport industrial contexts, they serve as essential proxies for risk assessment because ammonia-powered vessels and vehicles have not yet reached widespread commercial operation. The intrinsic chemical properties and toxic hazards of ammonia remain constant across applications; therefore, these industrial disasters share an identical physical risk profile with potential leakage scenarios in maritime bunkering operations or port-side storage infrastructure. These accidents illustrate the catastrophic potential of ammonia leaks and highlight that the transition to zero-emission technologies like ammonia must be managed through continuous research and rigorous safety efforts. In this context, it is important to evaluate whether scientific knowledge and safety practices are advancing quickly enough to deal with the emerging challenges brought by the newly introduced technologies in the transportation sector.
To address this need, this study provides a comprehensive review of safety and risk research regarding zero-emission transportation. The primary objective is to systematically analyze the current research to identify knowledge gaps and outline a clear pathway for future investigations. Specifically, this review limits its scope to safety risks associated with onboard alternative fuel storage, including hydrogen, ammonia, and lithium-ion batteries. We therefore exclude electrified railway systems, as they rely on external power via catenary infrastructure and present a risk profile distinct from onboard storage concerns. Our analysis primarily focuses on the maritime and automotive sectors, as these industries are undergoing a rapid commercial transition driven by regulatory pressures such as the IMO Strategy and global electric vehicle policy initiatives. While zero-emission aviation remains in a nascent stage, it is worth noting that early aviation safety research, particularly concerning high-reliability requirements, could offer valuable transferable insights for energy storage safety in the maritime and automotive domains. However, as alternative fuel technologies are not yet actively deployed in the aviation sector, the specific extent of these cross-sectoral insights remains uncertain. Consequently, investigating the potential synergies between aviation safety standards and other zero-emission transport modes represents a significant opportunity for future research and should be addressed as a sub-research question in subsequent studies.
Given this scope, we adopted a bibliometric approach to systematically assess research growth, thematic developments, and intellectual structures that traditional narrative reviews may overlook. Ultimately, this review aims to help establish an evidence-based foundation for the safe operation of next-generation transportation systems. To achieve these objectives, this research addresses five research questions (RQ):
RQ1. What are the research focuses and evolutionary status of this field? This involves analyzing publication trajectories, growth patterns, and citation performance to determine if the domain constitutes an established and expanding field.
RQ2. Who are the primary contributors to this domain? This identifies the most influential authors, institutions, and countries, alongside the collaboration patterns that characterize the field.
RQ3. What is the intellectual structure of this domain? This examines the foundational research clusters and seminal works that define the field’s organization.
RQ4. What research gaps exist? This highlights specific areas where future research directions merit priority investigation.
RQ5. What are the emerging research trends? This explores dominant conceptual and methodological frameworks within each cluster to assess whether current trends are addressing identified gaps.
To answer these questions, the present bibliometric review systematically maps the scientific literature on safety and risk for zero-emission transportation systems published between 2000 and 2025. This study combines quantitative bibliometric analysis with qualitative content analysis to provide a comprehensive overview of the research landscape. The bibliometric analysis employs citation metrics, publication trend analysis, and Lotka’s law validation to assess field maturity and productivity patterns. Network analysis techniques identify the most influential authors, institutions, and countries contributing to this domain. Science mapping methodologies reveal the underlying intellectual structure and identify foundational research clusters. Finally, thematic content analysis of full-text publications within each cluster provides detailed insight into current research trends, dominant methodological approaches, and conceptual frameworks. This dual quantitative-qualitative approach enables not only objective assessment of the field’s bibliographic characteristics but also substantive understanding of its knowledge content and architecture.
This paper is structured as follows.
Section 2 presents comprehensive research methodology, including data collection and curation procedures, bibliometric analysis techniques, and content analysis approaches.
Section 3 presents the results of bibliometric analysis addressing the performance characteristics, contributor networks, and intellectual structure of the research field.
Section 4 provides an in-depth thematic content review of each identified research cluster, synthesizing findings and identifying research gaps and future research directions.
Section 5 discusses the integrated findings from both bibliometric and thematic analyses, synthesizing key insights to reveal limitations in current safety frameworks and proposing a structured research agenda to bridge these gaps. Finally,
Section 6 concludes with key contributions, practical implications, and recommendations for advancing safety and risk research in zero-emission transportation systems.
4. Thematic Area Review
This chapter presents an in-depth thematic review of the four primary clusters identified in the conceptual structure analysis (
Section 3.3.2). Specifically, these clusters encompass Lithium-ion battery safety and failure mechanisms, Hydrogen and fuel cell safety management, Energy system design and optimization, and Fire and emergency response systems. These clusters represent distinct, yet interconnected, research domains. This review systematically examines the most prominent publications from each cluster, which were selected based on their top NGC scores. In bibliometric research, sampling the most influential documents is a standard practice to effectively characterize the intellectual core and boundary of a specific research front. We limited the selection to the top six publications per cluster to maintain an optimal balance between qualitative analytical depth and broad thematic representation. This sample size was considered sufficient to capture the dominant methodologies, research trends, and limitations within each cluster, ensuring that the qualitative review remains focused on the most impactful contributions. Furthermore, as these top-ranked documents define the primary conceptual and theoretical foundations of their respective clusters, the resulting thematic conclusions are expected to have low sensitivity to minor variations in the sample size.
4.1. Lithium-Ion Battery Safety and Failure Mechanisms
Table 9 presents the most influential publications within the lithium-ion battery safety cluster, selected based on their NGC.
4.1.1. Current Research Overview and Trends in Lithium-Ion Battery Safety Cluster
The research within this cluster exhibits three dominant methodological trends. The first is computational simulation and optimization. Finite Element (FE) modeling is the primary methodology for assessing mechanical safety and crashworthiness. This approach is utilized to simulate high-impact events that are difficult or costly to reproduce experimentally, such as side pole impacts. This simulation-based approach is also coupled with multi-objective optimization algorithms, including NSGA-II, and surrogate models like SVR and RBF. These methods are used to systematically improve structural designs by maximizing Specific Energy Absorption (SEA) and minimizing Peak Crushing Force (PCF).
The second major trend is the application of data-driven Machine Learning (ML) models for fault diagnosis and prognosis. This involves training algorithms on battery performance data such as voltage, current, and temperature to predict failures. A key methodology is the development of specialized neural network architectures, such as the two-tower spatio-temporal Transformer (BERTtery), to analyze real-world field data from large-scale packs and identify pre-failure anomalies. An emerging trend is the use of high-fidelity computational models to generate large datasets that are otherwise impossible to acquire physically. Li et al. (2019) [
64] used a detailed FE model to generate over 2500 simulation data points, which then served as training and testing data for ML algorithms (ANN, SVM) to create a data-driven safety envelope. Physical experiments are used for two specific purposes: to validate the accuracy of foundational FE models before optimization and to validate specific sensor-based detection concepts, such as using a Nondispersive Infrared (NDIR) sensor to measure CO
2 release.
Analysis of the research content reveals a consensus on the nature of the problem but a divergence in proposed solutions. There is a strong consensus that most real-world EV field accidents are not caused by abuse conditions, such as overcharging or impact, that are tested by current standards. Instead, they are self-triggered by reliability issues, which include internal manufacturing defects, material degradation, or other “hidden killers”, that incubate over time. All papers agree that the primary goal is the early detection of these internal failures before they cascade into catastrophic thermal runaway. This detection can be prognostic or diagnostic.
The papers diverge into three distinct solution paths. The first path is structural mitigation, which focuses on mechanically protecting cells from failure during an accident. A key finding is that the crashworthiness of a battery box can be significantly improved using aluminum foam-filled structures. Multi-objective optimization of this structure’s design yielded a 50.71% increase in SEA and an 11.56% reduction in PCF in a side-pole impact simulation. The second path is internal detection using new sensors. A literature review of vent-gas composition reveals that CO2 is the most suitable target gas for detection. It is consistently produced in high concentrations, appears early during first venting, and can indicate slow leaks. An NDIR CO2 sensor demonstrated a fast and clear response in an overcharging experiment, detecting high concentrations within 26 s of the cell venting. The third path is prognostic prediction using ML models and existing sensor data. A specialized two-tower Transformer (BERTtery) model can detect early-warning signals in large-scale packs 24 h to 7 days before a fault occurs by analyzing spatio-temporal correlations. Additionally, a data-driven safety envelope can be created by training ML models on thousands of FE simulation data points, enabling the model to instantly classify mechanical loading conditions as safe or short-circuit.
4.1.2. Limitations and Future Research Directions in Lithium-Ion Battery Safety Cluster
The papers collectively identify several gaps in the existing research landscape. The most prominent gap is that most research focuses on abuse thermal runaway, while self-triggered thermal runaway from reliability issues is the root cause of most field accidents. Current safety standards fall short in guaranteeing product safety because they do not test for this reliability component. Another gap exists in mechanical safety analysis, which is described as a “blind spot” for the EV community. Specifically, the crashworthiness of battery boxes under side pole impact is a less-explored scenario compared to axial crushing.
A prevailing challenge is the gap in data availability, described as a “daunting challenge” and the “biggest challenge”. Acquiring a large data bank of battery failure tests is difficult, and performing thousands of physical experiments is noted as impossible. This data scarcity is a major barrier to developing data-driven models. Gaps in detection methodology also exist. Fault signals from voltage and temperature monitoring can be suppressed in packs with many parallel-connected cells. Furthermore, there is still some debate about which vent gas is the most suitable for detection. A final gap exists in scaling models from the cell to the system level. Insights from cell-level electrochemical models are hardly translated to the system level, and these models are still far from being applied in practice.
The studies in this cluster also acknowledge specific limitations in their own methodologies. The models presented lack universality. The data-driven safety envelope is not universal and was developed for only one type of pouch cell. Similarly, the BERTtery model was trained on NCM/graphite cell data. Computational models also rely on idealized simulation. The FE model of the battery box used an idealized geometry and was isolated from the surrounding vehicle structure. The FEA data used for the safety envelope always carries errors as compared to the real world, which is described as a needed compromise. Several studies also explicitly state the exclusion of key physical parameters. The mechanical safety envelope simulations did not account for the battery’s State of Charge (SoC) or the wetting of the electrolyte. The crashworthiness simulation did not model the dynamic effects and thermal properties of individual battery cells. A major consolidated limitation is the insufficient experimental validation for the final optimized or predictive models. The crashworthiness study acknowledges the need for further experimental studies to validate its optimized design. The safety envelope study states that dynamic tests were not performed as validation for its simulation data, noting such tests are “extremely challenging”.
These limitations and gaps converge on several clear directions for future research. The most urgent, universally acknowledged direction is the need for physical experimental validation of simulation-based findings. This includes dynamic impact tests to confirm simulation-predicted loading angles and tests of optimized structural designs. Future models must also integrate more physics to better reflect reality. This includes incorporating the SoC and the presence of electrolyte into mechanical failure models and modeling the battery box within the full vehicle structure. There is also a strong call to action to shift the research focus toward reliability. Future work should focus on a more aggressive research effort into the reliability of lithium-ion batteries and on establishing testing protocols for the self-triggered safety probability. Finally, future prognostics and health management research should develop models that account for the critical factors in the operation condition on battery uncertainty, such as dynamic, real-world factors like road slopes.
The emphasis on reliability-driven, self-triggered thermal runaway is justified by real-world forensic evidence. For instance, the extensive Chevrolet Bolt EV recalls (2020–2021) [
69] highlighted that manufacturing defects, such as the simultaneous presence of a torn anode tab and a folded separator, could trigger catastrophic fires in parked, unattended vehicles without any external abuse. These incidents underscore critical regulatory shortcomings, particularly regarding the temporal and environmental scope of current safety standards. For instance, Clause 23A.2 of United Nations Economic Commission for Europe (UNECE) Global Technical Regulations (GTR) No. 20 [
70] defines the success criteria for thermal propagation at both pack and vehicle levels as the absence of external fire, explosion, or smoke entering the passenger cabin within 5 min after a thermal event warning is activated.
Although this ‘5-min rule’ is designed to allow for basic occupant egress in standard road scenarios, it is increasingly viewed as insufficient for the complex, integrated environments identified in this review. In shared infrastructures such as ferry car decks, subsea tunnels, or multimodal bunkering hubs, a 5-min window is fundamentally inadequate for large-scale emergency response or for preventing a self-triggered battery fire from cascading into nearby high-risk assets, such as hydrogen storage tanks. Furthermore, current standards rely heavily on visual inspection without disassembly, which may fail to identify latent internal reliability risks that lead to self-triggered events under normal operating conditions. Consequently, there is a clear regulatory gap between individual vehicle safety requirements and the system-level safety needs of shared zero-emission infrastructures.
While the current body of high-impact research within this cluster is predominantly focused on automotive applications, the fundamental safety challenges identified, particularly regarding internal reliability and thermal runaway propagation, are equally critical for the maritime sector. The transition to large-scale battery energy storage systems on vessels introduces unique risks that are often exacerbated by the enclosed nature of maritime environments. For instance, the explosion on the hybrid ferry Ytterøyningen (2019) [
21] and the fire on the sightseeing vessel MS Brim (2021) [
71] highlight the severe consequences of gas accumulation and subsequent ignition within confined battery rooms. These incidents demonstrate that while the root causes of failure may be similar to those in electric vehicles, such as internal short circuits or manufacturing defects, the maritime context presents distinct challenges in terms of emergency response and fire suppression. Therefore, the prognostic models and structural mitigation strategies developed for the automotive industry must be adapted to account for maritime-specific stressors, such as salt-mist corrosion and mechanical vibrations, to ensure the robust operation of zero-emission vessels.
4.2. Hydrogen and Fuel Cell Safety Management
Table 10 presents the foundational studies within the hydrogen and fuel cell safety management cluster, highlighting key works on risk assessment and leakage modeling.
4.2.1. Current Research Overview and Trends in Hydrogen and Fuel Cell Safety Cluster
The research methodologies in this cluster are primarily computational and analytical, focused on modeling risk, consequences, and structural integrity. A dominant trend is Probabilistic Risk Assessment (PRA). Li et al. (2025) [
72] employs a hybrid PRA approach, combining a Bow-tie model to identify faults, a Bayesian Network (BN) to model probabilities, and Fuzzy Set Theory to compensate for the lack of complete failure case datasets by quantifying expert judgments. This is complemented by Deterministic Risk Assessment (DRA), which evaluates worst-case scenarios using tools like Computational Fluid Dynamics (CFD) to create 3D simulations of hydrogen dispersion and deflagration. Other methods include formal safety and structural analysis, such as FE modeling for structural failure modes, Failure Modes and Effects Analysis (FMEA) to identify and qualitatively rank failures, and reviews of Safety, Codes, and Standards (SCS). Finally, comparative hazard analysis is used to benchmark the relative hazards of fuels like ammonia against conventional fuels.
The studies investigate safety across the hydrogen value chain, including ammonia as a carrier, liquid hydrogen (LH2) onboard storage, and gaseous hydrogen (GH2) leakage. All papers agree that hydrogen and its carriers pose unique safety risks, such as wide flammability, low ignition energy, embrittlement, and toxicity, that are distinct from hydrocarbon fuels. There is also a consensus that these risks are manageable through proper engineering design and safety protocols.
Key findings show divergences based on the specific application. The safety profile of ammonia is identified as a trade-off, where its fire hazard is low, but its health hazard is large due to high toxicity. For liquid hydrogen, LH2 is identified as a promising solution for heavy-duty trucks, as their large tanks and frequent use mitigate the boil-off losses that make it impractical for passenger cars. A formal FMEA on a conceptual LH2 tank design identified no high-risk failures, and an SCS review confirmed its compliance.
Regarding gaseous hydrogen leakage, human factors are identified as the primary drivers of risk in ship-based systems. A BN model calculated that improper maintenance procedures, inadequate operational protocols, and insufficient operator training are the key risk factors. In a semi-confined Hydrogen Refueling Station (HRS), the primary hazard is not fire, as CFD modeling showed that thermal radiation from a flame is insignificant. Instead, the most significant hazard is overpressure from a deflagration, which poses an indirect risk to human life via structural collapse or flying debris. Proposed mitigation solutions include zirconium phosphate as a novel ammonia remover, Aluminum 2219 for LH2 tank structural integrity, and addressing the design of large roofs at HRSs, which promotes possible hydrogen accumulation.
4.2.2. Limitations and Future Research Directions in Hydrogen and Fuel Cell Safety Cluster
A dominant trend across the cluster is the challenge of performing safety assessments for emerging technologies where historical data is unavailable. The most significant and prevailing gap is the lack of complete failure case datasets for hydrogen systems. This data scarcity forces researchers to use methodologies like Fuzzy Set Theory to translate expert linguistic judgments into quantitative probabilities or to rely on deterministic CFD modeling of worst-case scenarios. Further gaps are identified in specific applications, including the safety of hydrogen-powered internal combustion engine vessels, the risk of fuel cell trucks in real-scale HRSs with large roofs, and the viability of LH2 storage for heavy-duty trucks.
Gaps also exist in mitigation techniques and hazard understanding. Standard water sprinklers are noted as insufficient for large ammonia leaks in confined spaces, creating a need for new techniques for the safe removal of escaped ammonia. Wang et al. (2023) [
49] argues that a common feature of past accidents is a limited understanding of the actual hydrogen hazard. They use the DRA methodology to show that the intuitive fear of fire is insignificant, while the less-obvious hazard of overpressure is the critical risk.
The studies acknowledge several limitations. Probabilistic models are dependent on the quality and completeness of the input data and subject to expert judgment bias. Deterministic CFD models are time-consuming and require specialized knowledge, which can introduce human errors and biases. Furthermore, analyses are often based on conceptual storage system configurations for which dynamic tests were not performed as validation. These models are simplified, excluding real-world variables such as equipment degradation, extreme environmental conditions, or the effect of the wind.
These gaps and limitations converge in several key directions for future research. A primary priority is to refine the model by gathering more real-world data from hydrogen dual-fuel ships. A second direction is to optimize the physical design of infrastructure. Future work should study the effect of roof inclination and configuration at refueling stations, with the goal of developing standards that limit the maximum roof coverage. There is also a call to develop and test new mitigation technologies, such as the use of zirconium phosphate as an emergency ammonia remover. Given that human factors were identified as a primary cause of leaks, an important direction is to focus on strengthening both operator training and the enforcement of stringent maintenance protocols and to design more automated systems and fail-safe mechanisms. Finally, future risk assessments should be expanded to include additional risk scenarios, such as extreme weather events or equipment failures.
4.3. Energy System Design and Optimization
Table 11 presents the prominent research contributions within the energy system design and optimization cluster, focusing on reliability and complex system integration.
4.3.1. Current Research Overview and Trends in Energy System Design and Optimization Cluster
The research in this cluster demonstrates a clear shift toward simulation and advanced analytical modeling to solve complex, multi-domain system design and reliability challenges. Methodological trends include accelerated reliability testing, where components like motor winding insulation are subjected to stressors beyond their nominal ratings to predict long-term durability. This experimental data is then used to validate reliability models. A second trend is physics-based analytical modeling, which uses established physical laws such as rate theory and the Arrhenius law to extrapolate experimental data. Ji et al. (2025) [
73] uses this approach to model the degradation of Partial Discharge Inception Voltage (PDIV) as a function of both temperature and time.
For large-scale systems, system-level simulation is the dominant methodology. This includes using platforms like MATLAB/Simulink/Stateflow for control strategy validation to test rule-based strategies under normal and failure scenarios. It also includes Time Sequential Monte Carlo simulation for reliability assessment, such as modeling the probabilistic interactions between the power grid and EV battery exchange stations. A further trend is coupled multi-physics modeling to address design trade-offs. Asgari et al. (2024) [
75] uses a coupled electromagnetic-thermal network model to analyze how changes in the Electrical Insulation System (EIS) simultaneously affect the machine’s electrical losses and thermal performance.
The core content reveals a unanimous consensus that traditional, single-objective design and reliability methods are insufficient for modern electrified systems. This insufficiency is driven by new hardware, such as 800V architectures and SiC inverters, new operational environments like maritime applications, and new system paradigms including battery exchange and complex hybrids. The key findings focus on identifying and resolving the conflicts that arise from this new complexity.
The first conflict identified is between reliability and performance in electric machine design. A more reliable EIS, using thicker insulation to resist Partial Discharge (PD) at 800V, is necessary. However, this reduces the copper fill factor, which worsens machine performance by increasing losses and temperature. This conflict can be resolved through a reliability-oriented design methodology. By using thicker insulation and simultaneously increasing the slot width, a designer can maintain the copper fill factor, achieving both high reliability and performance with negligible change in losses.
A second conflict exists between grid reliability and user reliability in EV charging. EV Battery Exchange (BE) stations can act as dispatchable Energy Storage Systems (ESS) to support the grid. This creates a conflict: discharging batteries to help the grid improves power system reliability, measured by Expected Energy Not Served (EENS), but it depletes the inventory of charged batteries, harming EV user reliability, measured by User Demand Not Satisfied (UDNS). Cheng et al. (2013) [
53] finds that an optimal balance can be achieved by adjusting operating strategies, such as implementing a reservation of batteries for users.
A third conflict is identified between normal operation and failure modes in hybrid electric vehicle (HEV) control. Most hybrid control strategies are optimized for normal conditions but neglect component failures. An Integral Power Management Strategy (IPMS) can provide fault tolerance by pre-defining rule-based actions for abnormal conditions. In a simulated engine failure, the IPMS successfully initiated a limp-home operation, allowing the vehicle to continue on battery power. A final conflict relates to inadequate reliability standards. Existing IEC standards are insufficient due to the thermal aging factor for PD being a rough estimation that ignores aging time. An improved thermal aging enhancement factor is proposed that incorporates both temperature and time based on Arrhenius law. These standards are also only for land-based applications and fail to address unique maritime stressors.
4.3.2. Limitations and Future Research Directions in Energy System Design and Optimization Cluster
The cluster is defined by a prevailing trend of addressing the failures of existing design paradigms and standards. The most prominent gap identified is that existing HEV control strategies only consider normal operations and neglect component failures, creating a need for fault-tolerant strategies. There is also a clear gap in existing IEC standards for insulation systems, which are identified as inaccurate for lacking an aging time component and incomplete for lacking methods for maritime-specific stressors like humidity and vibration. Further, a conflict of objectives exists between performance-oriented and reliability-oriented design, as current methods are based on rules of thumb and experience, leading to over-engineering. A final gap exists in the research focus, with limited investigation into BE mode and minimal public literature regarding its impact on power system reliability.
The papers acknowledge several limitations, primarily related to the scope and completeness of their models. The proposed control strategies are still oversimplified for implementation. They do not yet consider multiple failures occurring simultaneously or complex “postfailure charging strategies”. The reliability testing frameworks also do not capture all real-world stressors. Kim and Kim (2025) [
74] did not address the long-term effects of inverter surges and did not include rapid temperature fluctuations or saltwater spray. Ji et al. (2025) [
73] notes that high-humidity and high-temperature conditions do not typically coexist simultaneously in its test, and states that analytical models are also based on limited experimental data, with only three data points used to fit the linear line for its Arrhenius plot. Finally, discrepancies exist regarding how well test objects like “motorettes” represent the reliability of a full machine.
The identified limitations and gaps point to several convergent paths for future research. A clear next step is to develop advanced, optimized control strategies. This involves optimizing the rule-based strategy by using ECMS or DP methodologies and expanding grid reliability strategies to model controlled charging strategies and “postfailure” recovery. The most important direction for reliability testing is to integrate multifactor stressors. Future work must expand the work to include high-stress conditions, explicitly investigating the simultaneous effects of inverter surges, thermal stress, and environmental factors. Another direction is the development of real-time monitoring systems to track insulation degradation during operation. Finally, future work is needed to develop fault-tolerant strategies that can handle multiple failures at once.
4.4. Fire and Emergency Response Systems
Table 12 presents the literature within the fire and emergency response systems cluster, addressing early warning strategies and suppression tactics. As this cluster is comparatively minor in size and can be regarded as a functional subset of the lithium-ion battery cluster, this section reviews only the three most significant papers identified by their NGC scores.
4.4.1. Current Research Overview and Trends in Fire and Emergency Response Systems Cluster
The methodologies in this cluster are split between proactive, data-driven prognostics and reactive, experimental mitigation. A dominant trend is advanced signal-based fault detection for early warning. Researchers are developing new signal-processing techniques to analyze real-world EV operating data. These methods include the Longitudinal Outlier Average (LOA), a statistical method that amplifies anomalous battery voltage signals, and Discrete Wavelet Decomposition (DWD), a frequency-domain method used to extract early hidden fault signals from high-frequency detail wavelet components of voltage data. The second methodological trend is physical experimentation for fire suppression, which involves full-scale testing to evaluate emergency response tactics after thermal runaway has begun. This includes measuring the effectiveness of different fire suppression systems, such as external water spray versus internal water mist. A third methodology is qualitative and literature-based risk assessment, using tools like bow-tie models to identify hazards and establish procedures for safely handling damaged EVs.
The cluster investigates the full lifecycle of a thermal runaway event, from early warning to post-fire extinguishment and handling. There is a strong consensus that traditional methods for both detection and response are failing. Standard monitoring systems provide delayed warning time, and conventional firefighting tactics are hard to extinguish battery fires. EV battery failures are highly concealed and pose unique risks, such as reignition due to stranded energy and the release of flammable, toxic gases that create a gas explosion risk.
Key findings demonstrate that prognostic warning is possible. By analyzing subtle voltage anomalies, the LOA method can provide a week-level early warning. The NDWD method was able to detect and locate fault cells seven days before the thermal runaway in a real-world accident case. In fire suppression, findings show that external suppression, like sprinklers, is insignificant for cooling the battery itself. However, tests confirm that internal fire suppression, applying water mist or spray inside the battery pack, has a positive effect on fire safety. It successfully limits peak temperatures and delays propagation of thermal runaway, giving first responders a chance to gain control of a thermal runaway event. Finally, safe handling procedures for damaged EVs are identified as a key preventive measure, including open perimeter isolation and proper de-energizing procedures.
4.4.2. Limitations and Future Research Directions in Fire and Emergency Response Systems Cluster
The prevailing trend in this cluster is responding to the failure of existing safety and emergency protocols to manage EV-specific hazards. The most critical gap is that traditional thermal runaway detection methods provide delayed warning time. A gap in fault diagnosis also exists, as simple time-domain analysis is insufficient to detect highly concealed or subtle voltage changes that are precursors to failure. For emergency response, a major gap exists in firefighting tactics. EV fires require more suppressant and time than conventional fires, and methods for handling damaged EVs are underdeveloped. A recurring challenge is the gap in data availability, as the confidential nature of real-world EV accident data significantly hinders the development of high-safety battery systems.
The studies acknowledge several limitations. The data-driven early warning models were validated on a very small number of actual thermal runaway accidents. A key limitation of these warning systems is that while they can identify a risk, they are unable to predict the remaining time until thermal runaway. The fire suppression tests were also performed on a specific battery pack for heavy vehicles, and the authors caution that these results may not apply universally. It is recommended that tests be performed on each unique battery installation. Additionally, internal suppression systems create a secondary gas explosion risk from the venting of flammable gas, which is not fully resolved.
The papers converge on a clear set of next steps. The most urgent, universally cited need is to collect more data from accident vehicles to verify the robustness and universality of the proposed early warning models. Future work must also move beyond just detecting risk to developing true prognostic models that can predict the remaining time until thermal runaway. Research should also explore the contribution of other sensor data and combine analytical methods to create multidomain and multiscale battery fault prognosis. Finally, there is a clear need to optimize internal fire suppression systems, including studying the minimum amount of suppressant needed and the optimal nozzle placement for different battery pack designs.
4.5. Cross-Cluster Analysis
The thematic review in Chapter 4 provides a qualitative confirmation of the fragmented intellectual structure identified in the bibliometric analysis. The research landscape is dominated by two large, technology-specific, and intellectually insular domains: Cluster 1 (Lithium-ion Battery Safety) and Cluster 2 (Hydrogen Safety). These clusters demonstrate minimal overlap in their core problems, methodologies, and foundational literature. Cluster 1 is primarily concerned with internal reliability issues, thermal runaway propagation, and prognostic model development. In contrast, Cluster 2 focuses on managing distinct physical hazards like toxicity and embrittlement, quantifying risk via PRA/DRA, and modeling leakage dispersion and overpressure. This deep specialization confirms that the field advances along parallel, non-interacting trajectories.
Despite this thematic separation, the clusters are unified by a methodological challenge: a profound scarcity of real-world failure data. This gap is explicitly identified as the biggest challenge in Cluster 1, the primary justification for using expert judgment models in Cluster 2, and a major barrier to prognostic model validation in Cluster 4. This shared data deficit has in turn shaped the field’s research methodologies. It has forced a heavy reliance on computational simulation, such as FE modeling and CFD, to generate data and explore worst-case scenarios. It also drives the development of advanced data-driven and analytical techniques designed to extract maximum insight from limited available data.
A third cross-cluster consensus is the field’s unified response to the perceived inadequacy of existing safety paradigms. Research in Cluster 1 is motivated by the consensus that standard abuse testing falls short of addressing the root cause of field accidents, which are self-triggered reliability issues. Similarly, Cluster 3 research is explicitly driven by the failure of existing IEC standards, which are inaccurate for new high-voltage applications and incomplete for maritime environments. This trend is mirrored in Cluster 4, where the development of new detection and suppression systems is a direct response to the failure of traditional methods in managing EV-specific hazards like reignition.
Finally, this cross-cluster analysis provides validation for the central research gap identified in this paper. The thematic review confirms that the field’s fragmentation is not only structural but also conceptual. While Clusters 1, 2, and 4 address the failure of individual technologies, and Cluster 3 addresses their integration at an electrical and control level, no cluster addresses their interaction at a physical safety level. The risk of hazardous interactions between different fuel systems operating in shared infrastructure, such as a battery thermal runaway event occurring in proximity to a hydrogen storage tank, remains largely unaddressed. This confirms that cross-modal and multi-fuel safety is a critical, virtually unexplored domain that must be the focus of a future research agenda.
5. Discussion
This bibliometric review has provided a comprehensive understanding of the safety and risk research for zero-emission transportation systems. This analysis was guided by five research questions, with the fifth aiming to identify research gaps and outline a future research agenda. This section synthesizes the findings from the performance and science mapping analyses to answer this final question. The synthesis highlights an unaddressed domain that requires a new, systems-oriented research direction.
5.1. Synthesis of a Fragmented and Nascent Field
The performance analysis and thematic review have illustrated the research landscape. This is a nascent and rapidly expanding field, with a distinct inflection point in 2020 after which publication output accelerated dramatically. This recency is reflected in a productivity structure dominated by a wide array of “occasional authors” rather than a stable, dedicated core of specialists.
Moreover, the science mapping and cross-cluster analysis reveal that this field is not only new but also fragmented. This fragmentation is not merely social, with co-authorship networks showing numerous disconnected “research islands”, but it is fundamentally intellectual and conceptual. The research domain is divided into two intellectual streams.
Lithium-ion battery safety: This functions as a mature “motor theme”, primarily concerned with internal reliability issues, prognostic models, and managing thermal runaway propagation.
Hydrogen and fuel cell safety: This acts as a “basic theme”, focusing on external physical hazards like leakage, dispersion, overpressure, and toxicity, often quantified using traditional PRA or DRA.
These two communities are advancing along separate, parallel intellectual trajectories, citing different foundational works and solving different problems. The underlying causes of this fragmentation are multifaceted, spanning institutional, regulatory, technological, and disciplinary dimensions.
First, institutional separation between the maritime and automotive sectors has hindered knowledge transfer. Research is often conducted by specialized national laboratories or universities with narrow mandates. For instance, the Korea Maritime and Ocean University focuses on maritime bunkering, while the Beijing Institute of Technology centers on road electric vehicles. Second, the regulatory frameworks are fundamentally divergent. Maritime safety is governed by the IMO through ship-specific strategies and safety codes, whereas the automotive sector is driven by the UNECE World Forum for Harmonization of Vehicle Regulations (WP.29) through distinct global electric vehicle policies and road transportation standards. Third, the technological characteristics of the hazards differ significantly. Battery safety is rooted in electrochemistry and internal reliability, while hydrogen safety is defined by fluid dynamics and external dispersion hazards. Finally, academic disciplinary boundaries further entrench this divide. In most institutions, marine engineering and automotive engineering are managed by separate departments with distinct academic backgrounds, leading to a lack of cross-disciplinary collaboration in addressing the common goal of safe zero-emission transportation.
5.2. The Critical Gap: Unexplored Risk
This documented fragmentation provides an answer to the research inquiry regarding the field’s capacity to address complex, system-level hazards. While the introduction highlighted the severe safety risks associated with individual alternative fuels, such as the toxicity of ammonia and the thermal runaway of lithium-ion batteries, the bibliometric results reveal that the academic response remains fundamentally siloed. The performance and mapping analyses demonstrate that the field’s two largest clusters are largely technology-specific, operating with minimal intellectual crossover. Furthermore, the thematic review confirms that no cluster currently addresses the potential physical safety interactions that may arise when these different fuel systems operate in proximity.
The urgency of addressing this gap is underscored by the reality that zero-emission technologies will co-exist in shared infrastructures, such as ferry terminals and multimodal hubs. A concrete example of such hazardous interaction was investigated by Lee et al. (2025) [
80], where the safety of LH2 bunkering was analyzed in a shared port environment. The study identified a critical scenario where a thermal runaway event in a battery-electric vehicle loading onto a ferry could act as a continuous high-temperature ignition source for a localized hydrogen leak from nearby bunkering infrastructure. Their analysis revealed that thermal radiation from a battery fire could significantly accelerate the pressure build-up in cryogenic LH2 tanks, potentially triggering a Boiling Liquid Expanding Vapor Explosion (BLEVE).
Furthermore, this study highlighted a mitigation conflict where traditional fire suppression tactics, such as water mist used to cool a battery fire, could inadvertently interfere with the natural buoyancy of a hydrogen cloud. This interaction potentially creates trapped explosive pockets in semi-confined spaces. These findings suggest that numerous unidentified risks likely exist, including interactions between ammonia plumes and battery venting gases. Consequently, the critical real-world risk of a secondary failure triggered by the interaction of different fuel types remains unaddressed and “virtually unexplored” within the scientific literature.
This paper has also highlighted a significant methodological gap regarding the limited application of system-theoretic approaches like System-Theoretic Process Analysis (STPA), which are specifically designed to handle emergent hazards in complex systems. The thematic review confirms that the dominant methodologies remain traditional frameworks such as FMEA, PRA/DRA, and CFD modeling. However, using STPA allowed for the identification of Unsafe Control Actions (UCAs) and control feedback errors that arise from the operational coupling of diverse zero-emission technologies. These represent risks that traditional, component-based failure models often fail to capture. This analysis, therefore, answers the fifth research question by identifying a critical research gap characterized by the lack of integrated, cross-modal, and multi-fuel safety analysis. The field’s current trajectory is insufficient to ensure safety in the complex, shared infrastructure where these technologies will co-exist.
5.3. Policy and Regulatory Implications
The policy and regulatory implications of the identified research fragmentation are significant, particularly as the global transportation sector strives to meet the ambitious decarbonization targets set by the IMO and the vehicle safety standards harmonized by the UNECE WP.29. The bibliometric analysis reveals that the scientific community has established robust knowledge bases for individual fuels. However, these insights remain siloed and serve as a reflection of the current fragmented regulatory landscape.
At present, international safety standards are fundamentally technology-specific and sectorally divided. The maritime sector is governed by the IMO through instruments such as the IGF Code for gas-fueled ships and various class society guidelines for battery installations, focusing on the vessel as a closed system. Simultaneously, the automotive sector is regulated through the UNECE WP.29 framework, such as GTR No. 13 (Hydrogen and Fuel Cell Vehicles) [
81] and GTR No. 20 (Electric Vehicle Safety) [
70]. While these standards ensure high safety levels for individual transport modes, this study’s finding of “intellectual insularity” highlights a critical blind spot in shared environments where these regimes intersect, such as ferry decks and multi-fuel ports. In these spaces, a vehicle complying with UNECE WP.29 standards operates within an infrastructure governed by IMO codes, yet there is currently limited regulatory correspondence to manage the risk of cross-system hazards, such as a vehicle-level battery fire impacting maritime fuel storage.
To bridge this gap, policy-makers must transition from sector-specific safety protocols to integrated, risk-informed governance frameworks. It is recommended that international bodies like the IMO and UNECE WP.29 establish collaborative mechanisms to harmonize safety protocols for multimodal interfaces. This coordination should focus on developing an “Integrated Safety Certification Framework” for shared infrastructures, ensuring that emergency response tactics and safety distances are evaluated based on the interaction of diverse fuel systems. Ultimately, the safety of the zero-emission transition depends on a regulatory shift that mirrors the intellectual integration proposed in this review, moving from managing isolated components to governing complex, cross-modal transportation ecosystems.
5.4. Scope and Generalizability of Findings
The scope of this bibliometric review was intentionally centered on the maritime and automotive sectors. This decision is justified by the current technological landscape, where these two industries have demonstrated a significantly higher adoption rate and a more mature deployment of onboard alternative fuel storage compared to the aviation and railway sectors. Consequently, the safety literature and regulatory frameworks for zero-emission ships and road vehicles are more substantially developed, providing a robust dataset for bibliometric analysis.
However, it is important to acknowledge that this targeted focus limits the direct generalizability of our findings to all transportation modes. For instance, electrified railways primarily rely on external catenary infrastructure, and zero-emission aviation is still in an early, experimental stage with distinct weight and energy density constraints. While this study captures the intellectual structure of the most dominant and rapidly transitioning domains, the identified safety gaps and trends may not fully represent the unique operational risks of air and rail transport. Therefore, as these sectors continue to evolve, integrating aviation and railway systems into future bibliometric and thematic studies will be an essential next step in developing a truly holistic, cross-modal safety governance framework for the entire zero-emission transportation ecosystem.
5.5. Methodological Limitations and Database Selection
Despite the systematic mapping provided in this study, several methodological limitations regarding the data source and selection criteria must be acknowledged. First, the exclusive reliance on the WoS Core Collection as the primary data source may introduce a potential database bias. However, this choice was driven by the need for high-quality, standardized metadata and consistent citation metrics, which are essential for reliable science mapping and are consistent with established precedents in transportation safety research.
Second, while industry-led safety reports, international regulatory standards, and accident investigation data were not directly indexed in the primary dataset, their critical technical insights were indirectly incorporated. This is because these sources are frequently cited and extensively discussed within the 151 peer-reviewed journal articles analyzed, thereby ensuring that industrial safety practices are reflected in the thematic synthesis. Furthermore, the decision to prioritize journal articles over conference proceedings was a strategy to ensure the methodological maturity and theoretical rigor of the findings.
Finally, the inherent citation lag associated with recent publications, particularly those from 2024 and 2025, remains an unavoidable constraint in bibliometric analysis. To mitigate this effect, this study utilized NGC for document selection, which partially compensates for the citation disadvantage of newer works by accounting for their specific publication year. Given the rapid technological evolution in zero-emission transport, future research should consider a multi-database approach and the integration of diverse regulatory datasets to capture the full spectrum of industrial and academic safety developments as the field matures.
5.6. A Proposed Future Research Agenda
Bridging this gap requires a future research agenda that pivots from the current fragmented, single-technology focus to an integrated, empirical systems approach. The findings justify a future research direction structured in three phases.
First, an empirical foundation must be established. A recurring challenge identified in the thematic review was a profound scarcity of real-world failure data, which currently forces a heavy reliance on simulation. Therefore, a priority is to collect and analyze accident cases involving ships and automobiles that use alternative fuels. This effort would build a comprehensive database of accident cases, providing an empirical basis to move beyond isolated models and understand real-world failure progression, especially the role of fuel interactions. To move beyond a generic approach, this phase should employ data-mining and natural language processing (NLP) to systematically extract failure precursors and “hidden” reliability issues from disparate industrial safety reports and insurance data that are not currently indexed in academic databases.
Second, this empirical data should be used to analyze hazardous interactions using system-theoretic methods. This phase confronts the primary conceptual and methodological gaps identified in this review. Future research should examine the interactions between different alternative fuel systems operating in proximity, such as hydrogen-fueled ships and battery-powered vehicles. This analysis would tackle the central, unexplored gap identified in
Section 4.5. A critical use case for this phase is provided by Lee et al. (2025) [
80], who demonstrated how a battery-electric vehicle fire can act as an external ignition source, triggering a secondary BLEVE in liquid hydrogen bunkering infrastructure through cross-modal thermal radiation. Crucially, this work should employ the methodologies this review identified as underutilized: STPA and dynamic risk analysis. This would address the methodological gap by applying tools designed to identify UCAs and emergent hazards in complex, multi-fuel systems that traditional, component-based failure models often overlook.
Third, these analytical insights must be translated into practical, integrated safety protocols. The findings from the interaction analysis should be used to develop safety protocols and guidelines for the safe operation of integrated zero-emission transportation systems. The current literature provides safety guidance in “clusters,” focusing on areas like battery storage or hydrogen bunkering rather than addressing both simultaneously. This research would produce a set of safety protocols explicitly designed to mitigate the risks of interactions between fuel types. For instance, these protocols would resolve the conflict between battery fire suppression and hydrogen dispersion, ensuring the work has a practical impact on the transportation industry. To guide this integration, we propose two research questions for priority investigation:
RQ 1: How do the divergent failure mechanisms of different types of zero-emission energy carriers interact within shared, semi-confined infrastructures, and what modeling frameworks are required to capture these cross-platform hazards?
RQ 2: How can system-theoretic frameworks be utilized to harmonize fragmented, sector-specific safety standards and effectively manage integrated risks within evolving transportation ecosystems?
In summary, this bibliometric review has mapped the current state of zero-emission safety research and, in doing so, has illuminated a blind spot. The proposed research agenda provides a necessary path forward, shifting the field from its current fragmented state to a unified, system-level approach.
6. Conclusions
This bibliometric review systematically mapped the intellectual landscape of safety and risk for zero-emission transportation systems, analyzing 151 core documents published between 2006 and 2025. The objective was to provide a comprehensive overview of the field’s performance, structure, and thematic evolution, with a specific aim to identify critical research gaps. The science mapping and subsequent thematic review confirmed the paper’s hypothesis. The research landscape is fragmented, advancing along parallel, non-interacting trajectories.
This study contributes to the field in several key ways. First, to our knowledge, this is the first comprehensive bibliometric review to holistically map the zero-emission transportation safety field, moving beyond the numerous existing reviews that focus on a single fuel type or a single transport mode. Second, by integrating quantitative science mapping with a qualitative thematic review, this paper provides objective, data-driven evidence of the field’s intellectual fragmentation. It demonstrates the “intellectual insularity” of the battery and hydrogen safety communities, identifying this as the primary barrier to addressing complex, system-level safety challenges. Third, beyond a descriptive mapping, this study offers a critical theoretical indicator for the advancement of integrated safety governance. It highlights the urgent necessity of adopting frameworks like the Systems-Theoretic Accident Model and Processes (STAMP). Within this paradigm, modern zero-emission transportation must be viewed as a “system of systems” rather than a collection of separate components. By highlighting the limitations of traditional, siloed methodologies, this paper provides the conceptual justification for a shift toward system-theoretic approaches that treat safety as an emergent property of the entire system architecture. Fourth, this review synthesizes cross-cutting methodological challenges that unify the fragmented clusters, most notably the profound scarcity of real-world failure data and a universal consensus that existing safety standards are inadequate to address the novel reliability challenges of these technologies. Finally, this paper translates these identified gaps into a structured, actionable research agenda, as detailed in the discussion, providing a pathway from the field’s current state to its necessary future state.
The global transition to zero-emission transportation is an irreversible and essential imperative. This study demonstrates that the scientific community has successfully built a foundational knowledge base for managing the risks of individual alternative fuels. However, it also reveals that this research is developing in isolated clusters. The field’s next phase of research must be one of integration. In practical terms, this study suggests that stakeholders and regulators must transition from fuel-specific safety protocols to integrated risk management frameworks capable of addressing the complex interactions in future multi-fuel environments. The safety of our future transportation ecosystems, where diverse zero-emission technologies operate in close proximity, depends on our ability to bridge the intellectual and collaborative gaps identified in this review.