1. Introduction
The Annual Statistical Report on Aviation Safety in 2019 released by the International Civil Aviation Organization (ICAO) shows that with the rapid development of air transportation, civil aviation accidents show an increasing trend [
1]. Aviation accidents cause an enormous loss of lives and massive monetary costs worldwide. In 2002–2011 alone, there were a total of 250 worldwide fatal accidents, which resulted in 7148 fatalities [
2]. Aviation administrations conduct accident investigations to learn how the systems fail and why accidents happen, which may lead to the review and revision of related regulations, standards, and management. In order to improve the safety of air transportation, the National Institute of Aerospace and National Aeronautics and Space Administration (NASA) analyzed historical accident data to develop a high-level airline organizational hierarchy for tracing and identifying the deficiency propagation [
3].
The occurrence of a civil aviation accident is a multi-stage dynamic system, and it may have more than one causal factor (such as aircraft manufacturing, aircraft maintenance, air traffic control, weather, and human factors). Although modern aircrafts are mostly equipped with advanced technologies, human factors present a major contribution to accidents [
4,
5]. Sant’Anna et al. found that over 50% of aviation accidents were caused by human errors at a node in the causal chain, especially human factors related to pilots [
6]. Hence, this research focuses specifically on the contribution of pilot-related human factors to civil aviation risk.
Pilot factors generally include the physical conditions, experience, psychological conditions, decision-making, and task-handling performance of pilots [
7]. To analyze pilot factors, the aviation accident database is used as one of the most valuable sources to obtain the primary data, including the Flight Safety Foundation database, Federal Aviation Administration (FAA) database, Aviation Safety Network database and National Transportation Safety Board (NTSB) aviation accident database. McKay and Groff examined trends in the prevalence of over-the-counter, prescription, and illicit drugs identified in toxicology tests of fatally injured pilots between 1990 and 2012 using matched data from FAA’s Civil Aerospace Medical Institute toxicology database and NTSB aviation accident database [
8]. For analyzing civil aviation accidents, 163 commercial airliner accident reports extracted from the NTSB aviation accident database during 2008–2020 were reviewed in this paper. According to such reports, pilot factors are derived.
Quantitative risk analysis is a common approach to identifying risk factors and developing effective risk mitigation countermeasures, including fault tree analysis (FTA), event tree analysis (ETA), logistic regression (LR), and the Bayesian network (BN). Compared with FTA and ETA, BN has a significant advantage that allows for propagating such information in both forward propagation and backward inference when additional information on some random variables is available [
9]. Additionally, Zhou et al. found that the LR model is more concentrated on statically qualitative analysis, and quantitative analysis is not sufficient [
10]. For comparing the effects of LR and BN in small sample data, Leite et al. analyzed the data of 119 patients to realize the auxiliary diagnosis of obstructive sleep apnea [
11]. The results showed that the specificity of BN was higher than that of LR. In addition, BN can represent the dependencies between the indicators and accident consequences, revealing that the accident consequences were the most sensitive to the position where the accidents occurred. Thus, in the domain of aviation, BN-based causal models have been extensively applied for pilot risk management and enhancing aviation safety. For example, Zhang and Mahadevan constructed a Bayesian network to analyze the historical passenger airline accidents reported in the NTSB aviation accident database and capture the causal relationships embedded in the sequences of these accidents [
12].
According to previous studies, expert knowledge is still an important data source for aviation accident modeling [
13]. Because BN requires a relatively low number of parameters and a small-size conditional probability table, it can utilize expert knowledge and/or data-driven methods to achieve quantitative aviation risk analysis. However, compared to the studies using expert knowledge in BN construction, a data-driven BN in aviation risk analysis that is less subjective has better robustness and certainty [
14]. For the data-driven BN approach, there are several variants, such as naive Bayesian networks (NBN), and tree augmented naive Bayes networks (TAN-BN). Among them, TAN-BN constructs a quantitative BN representing RIFs’ interactive dependencies, which helps generate insights on critical pilot factors contributing to different casualty types of accidents. Li et al. pointed out that TAN-BN learning not only maintains the robustness and computational complexity of NBN learning but also provides better accuracy [
15]. Li and Cheng used the I-880 field date to develop NBN and TAN-BN for traffic incident duration prediction [
16]. By evaluating the results of NBN and TAN-BN under different sample sizes, they found that the prediction accuracy of TAN-BN was higher when the number of training samples was between 70 and 150. TAN-BN is proposed for identifying the interactions between the attribute variables by using a tree structure [
17]. To date, few studies have used a data-driven TAN-BN to analyze pilot factors in civil aviation accidents.
This study aims at investigating how pilot factors interact with non-pilot factors, how they jointly affect air transportation risk, and how different risk factors generate an impact on different casualty types of pilot-related civil aviation accidents in an individual or combined manner. Therefore, this study used the aviation accident database from NTSB between 2008 and 2020 to conduct a data-driven TAN Bayesian Network to generate the structure of risk influencing factors (RIFs), which will provide new insights on the differentiation among critical pilot factors contributing to different “casualty types” of aviation accidents.
4. Discussion
After training of the TAN-BN model with 163 individual pilot-related accidents under Part 121 in the NTSB aviation accident database from 2008 to 2020, it was found that most of the civil aviation accidents resulted in no injury, supporting the argument by Waycaster et al. that civil aviation is the safest way to travel [
32]. On the other hand, if there are casualties in an accident, the most probable casualty type is serious injury, the second is minor injury, and the least is fatal injury.
Based on the MI values of the calibrated TAN-BN model, the top five factors (MI value > 0.05) in order of importance are: weather condition, flight phase, equipment condition, pilot action and decision, and maintenance quality. This ranking confirms that weather condition has the greatest impact on the casualty types in civil aviation accidents, which is consistent with the findings from the literature [
33], which also used the NTSB data. This finding is also in line with those from Jenamani and Kumar, who used a different data source, namely, the Geneva-based Aircraft Crashes Record Office (ACRO) [
34]. According to the statistics, it is not difficult to find that convective turbulence is the most frequent weather condition causing accidents in these 163 individual pilot-related accidents. However, convective turbulence in flight is not easy to forecast at present, and accidents are often caused by a sudden convective surge.
The second most important factor is the flight phase. On the different flight phases, the number of accidents was different. Li found that in the route and approach phase, the number of accidents was the most, and the number of casualties was higher than other flight phases, accounting for approximately 50% of the whole flight phases, indicating that the flight phase is highly correlated with aviation accidents, which is also confirmed by the results of this study [
35].
Next, the third most important influencing factor is the equipment condition. This finding highlights the reliability of the equipment, and it is consistent with the results of several past accident analysis studies [
36]. More importantly, this study finds that weather condition and flight phase play a more critical role in civil aviation accidents than equipment condition.
Pilot action and decision is ranked fourth in the importance of effect, but it ranks first among all pilot factors listed in
Table 3 (pilot action and decision > pilot psychological condition > pilot task performance > pilot physical condition > pilot experience and skills), which is a new finding by this study and is partially supported by Kelly and Efthymiou’s research [
37]. This finding can help airlines or flight training schools optimize their pilot training programs to reduce risks in these areas. Furthermore, McClernon et al. found that stress training during the acquisition of flight skills might serve to enhance pilot performance in stressful operations, which indicated that pilot psychological condition could improve pilot task performance [
38]. However, some researchers suggested that pilot’s psychological pressure affects their decision-making, thus, it is more crucial [
39].
Finally, the maintenance quality ranks fifth. To achieve a deeper understanding of maintenance quality, Insley and Turkoglu analyzed the aircraft maintenance-related accidents and serious incidents which occurred between 2003 and 2017 in the Aviation Safety Network’s accident database and SKYbrary’s accidents and incidents database [
40]. They suggested that the greatest maintenance factors causing the accidents were “inadequate maintenance procedures” and “inspections not identifying defects”, which is partly corresponding to this study.
Similarly, from the results of
Table 5, average values of TRI are compared and ranked as follows: flight phase > weather condition > equipment condition > maintenance quality > pilot action and decision. Compared with the results of
Table 3, it is not difficult to see that the order of these five RIFs is different, but weather condition and flight phase are still the top two, which can indicate their higher correlation with the casualty type of civil aviation accidents than pilot action and decision. Thus, we suggest that airlines and aviation administrations should focus on monitoring flights in high-risk flight phases and bad weather, such as landing with windshear, approach with low-visibility, etc., and improve the ability of pilots to execute such flights.
An empirical sensitivity analysis of the proposed TAN-BN model has shown that the model satisfies the two axioms, partially indicating its validity for application. The reasonableness of the model could also be illustrated through a retrospective analysis of a past accident, which is not included in the above training database. For example, from the event ID “20200714 × 42039” in the NTSB database, Envoy Airlines flight 3880, an Embraer 175 aircraft, encountered turbulence during the descent into Chicago O’Hare International Airport from Texas on 9 July 2020, and one person was seriously injured on the flight. Except for the target variable, some parameter settings for the proposed BN model can be obtained based on the descriptions, including:
- (1)
Due to encountering turbulence during the descent of flight, the weather condition was poor, and the flight phase was descent.
- (2)
Because of the flight No. 3880, the flight carried passengers.
- (3)
Because the flight was from Texas to Chicago, the flight was a domestic flight.
- (4)
Because of the Embraer 175 aircraft, the number of seats on aircraft for passengers was fewer than 100, and the number of engines was 2.
Along with the above information, there is no other information recorded in the accident. The other factors (nodes) maintain their generic original probabilities given no updated evidence is collected from the accident. Based on the above parameter settings, it reveals a very high probability of 94.4% for the serious injury in this accident, which further validates the proposed model, as shown in
Figure 4.
Table 9 illustrates the MPE for all casualty types. According to the results of
Table 9, comparing with the situation of no injury in civil aviation accidents, the poor equipment condition and the bad weather condition have a strong relation to the minor injury and fatal injury, and the bad weather condition, such as convective turbulence and low visibility, also has a strong relation to serious injury. Hence, flight safety in bad weather is of great concern. Furthermore, weak supervision and carry category such as cargo are strongly related to fatal injury, which indicates that cargo airlines’ supervision on safety is insufficient and should be strengthened. Regarding the flight phase, this study finds that phases, including cruise, descent, approach, and landing, are strongly associated with these four casualty types, while phases such as standing, pushback/towing, taxi, takeoff, and climb are not, which is in accordance with Moriarty and Jarvis’s views [
41], showing that cruise, descent, approach, and landing have higher accident risks. Moreover, there is a high probability for serious injury and fatal injury to happen during the airframe of hours less than 20,000 and a high probability for no injury and minor injury to happen during the airframe of hours between 20,000 and 50,000. This is a significant finding, indicating that the new aircraft is not necessarily more reliable than the older aircraft, which can help optimize the reliability management of airline fleets. In addition, minor injury becomes probable when the number of seats on the aircraft is between 100 and 200. Additionally, the certified max gross weight of aircraft less than 100,000 has a strong association with no injury and serious injury, the certified max gross weight of aircraft between 100,000 and 200,000 has a hard relation to minor injury, and the certified max gross weight of aircraft more than 200,000 is strongly related to fatal injury, which suggests that the accidents on aircraft with a certified max gross weight of more than 200,000 carry a high risk of fatality. In terms of pilot factors, poor pilot physical and psychological conditions and weak pilot task performance only contribute to fatal injury, while incorrect pilot action and decision is not correlated with these four casualty types. Thus, this finding verifies that it is very important to study the physiological, psychological, and behavioral performance of pilots to ensure flight safety. Likewise, Yue et al. suggested that the pilot factor is the most important causative factor among the human factors for civil aviation accidents, which is in accordance with the results of this study [
42]. On the basis of Netica, we have successfully obtained the MPE for all casualty types. However, in addition, there are some other methods worth trying to obtain the MPE in the future, such as the deletion algorithm and the approximate deletion algorithm [
43]. On the other hand, it is worth further study why pilot factors and casualty types show such a relationship. We plan to analyze the impact of pilot factors on flight safety through physiological signal monitoring.
While our study provides insight into the interdependency and causal effects of pilot factors on the outcome of civil aviation accidents using past accident data, important limitations are noted. In this study, we identify RIFs of the model according to the classified information in the NTSB database, which is not undisputed. To a certain extent, some RIFs cannot be isolated simply because they may have a strong causal relationship with other RIFs. For example, Fabre et al. believed that pilots’ risky decisions are related to the change in their psychological state caused by the external environment [
44]. Therefore, further exploring the influence of the relationship between various RIFs on the results of the model is our future research content. Furthermore, we excluded the accident data not related to or not involving pilots for mainly considering pilot factors. Although the results of data filtering were checked by civil aviation domain experts, there is still some subjectivity. Future work may consider more RIFs and reduce the degree of accident data filtering. Furthermore, this study only focuses on civil commercial flights, which account for approximately 3% of the NTSB aviation accident database, resulting in a small total amount of accident data. In future, we will introduce the accident data of general aviation and conduct a comparative study.
5. Conclusions
To analyze the possible risks introduced by pilot factors to civil aviation safety, this study uses a data-driven TAN-BN approach to investigate how different risk factors contribute to different casualty types of civil aviation accidents with a focus on pilot factors. For identifying RIFs, this study utilizes a total number of 163 individual pilot-related accidents under Part 121 in the NTSB aviation accident database from January 2008 to March 2020. A TAN-BN model is calibrated to identify and analyze the effects of various RIFs incorporating pilot factors in civil aviation accidents. Finally, a sensitivity analysis is conducted, including model validation and scenario simulation.
Based on the statistical strength-of-influence measure—mutual information (MI) from the resulting TAN-BN model, the top five critical RIFs for casualty types (MI value > 0.05) are: “weather condition”, “flight phase”, “equipment condition”, “pilot action and decision”, and “maintenance quality”, successively. Based on another risk influence measure—TRI, these crucial RIFs are ranked again as follows: “flight phase”, “weather condition”, “equipment condition”, “maintenance quality”, and “pilot action and decision”. Furthermore, the scenario analysis provides a plausible causal explanation for the observed accidents, revealing the most probable scenario concerning a particular casualty type.
This study has provided insight into the interdependency and causal effects of various factors, including pilot factors, on the outcome of civil aviation accidents, contributing to the development of improved regulations, management, and safety countermeasures for the improved understanding of civil aviation industry.