Dynamic Decision-Making Process of Evacuees during Post-Earthquake Evacuation near an Automatic Flap Barrier Gate System: A Broken Windows Perspective

: The automatic ﬂap barrier gate system (AFBGS) plays a critical role in building security, but it is more vulnerable to natural hazards than common exits (including power failure, due to earthquakes, and delayed evacuation, due to safety certiﬁcation, etc.). This article considers a dynamic decision-making process of evacuees during post-earthquake evacuation near an AFBGS. An interesting metaphor, broken windows (BW), is utilized to interpret people’s actual behavior during evacuation. A multi-stage decision-making mechanism of evacuees is developed to characterize the instantaneous transition among three deﬁned stages: Habitual, mild, and radical states. Then, we build a modiﬁed three-layer social force model to reproduce the interaction between evacuees based on an actual post-earthquake evacuation. The simulations reveal that BW provides a contextualized understanding of emergency evacuation with a similar effect to the traditional metaphor. An earlier appearance of a mild rule breaker leads to a higher crowd evacuation efﬁciency. If evacuees maintain the state of broken windows behavior (BWB), the crowd evacuation efﬁciency can be improved signiﬁcantly. Contrary to the criminological interpretation, the overall effect of mild BWB is positive, but the radical BWB is encouraged under the command of guiders.


Introduction
The transport infrastructure in congested regions often presents a vulnerability under natural disasters, such as an earthquake, fire, hurricane, etc. As a common, but serious, natural disaster, earthquakes principally result in shaking, ground rupture, structure collapse, etc., and destroy the stability of transport systems [1]. Many incidents with serious consequences occurred near extremely crowded areas, such as exits. Especially as an automatic multi-exit solution for metro stations, stadiums, universities, and commercial buildings, the automatic flap barrier gate system (AFBGS) is more vulnerable to hazards than common exits (e.g., a power outage caused by structural damage, time delay, due to access control, etc.). Post-incident analysis of the exit choice revealed that pedestrians attempted to evacuate through the same exit, ignoring less crowded points of egress during an emergency evacuation [2,3]. In addition to the characteristics of evacuees near common exits, a more complex and stepped decision-making process is witnessed near an AFBGS and has attracted considerable attention.
The decision-making process and evacuation behavior of evacuees are highly correlated to the group psychology and the structure of transport facilities under earthquakes [4,5]. Numerous previous studies are consistent with this observation. Scholars argue that pedestrians may not choose an efficient evacuation route after an earthquake [6]. The dynamic decision-making process and psychological state of evacuees are quite difficult to be incorporated in a physical pedestrian movement model. Some scholars have still made some progress in mathematical frameworks for pedestrian evacuation in a variety of cases (e.g., fire accident [21,22], earthquake, terrorist attack [23,24], flood disaster [25], etc.). The related works can be generally divided into two categories: The macroscopic continuum model and the microscopic discrete model. The former [26,27] characterizes the crowd motion as the homogeneous continuum flow, deducing a series of rigorous mathematical theories and numerical algorithms. The microscopic model can be further divided as: Social force model (SFM) [28,29], cellular automata model [30,31], lattice gas model [32,33], etc. Especially, SFM flexibly considers the dynamic interactions between individuals, deriving more reliable and realistic results compared to the macroscopic model [34]. Li et al. [35] modify the panic SFM and find that an evacuation leader is beneficial to maintain the calm and order of the crowd. Especially, a two-layer model is proposed to reproduce a real-life grouping situation in an earthquake evacuation [36].
In this paper, a BW perspective is introduced to help us to understand the evolutionary process of mild and radical evacuation behaviors. A three-layer SFM is developed to study the dynamic decision-making process of evacuees during post-earthquake evacuation near an AFBGS. The paper is organized as follows. Section 2 introduces the framework of the three-layer model, which includes the trigger mechanism of BW effect, individual decision, and crowd dynamics. In Section 3, we show that the new model can reproduce the realistic BW phenomenon recorded by the video. The validation and sensitivity analysis are conducted to explore the impact of the parameters. In Section 4, the utilities of evacuation strategies are compared to derive the policy suggestions. Section 5 concludes the influence mechanism of BW under post-earthquake evacuations. (a) all evacuees followed the daily habit and chosen the right door as the exit at stage 1; (b) an evacuee (the red circle at the top left corner) suddenly passed the left door, which was noticed by other evacuees (the red circles at the center); (c) more and more evacuees noticed the left door and chosen it as the exit, the daily habit was broken mildly at stage 2; (d) some impatient evacuees (the red circles) suddenly climbed over the closed winged gates, and this radical behavior was imitated by some other evacuees nearby at stage 3.
The dynamic decision-making process and psychological state of evacuees are quite difficult to be incorporated in a physical pedestrian movement model. Some scholars have still made some progress in mathematical frameworks for pedestrian evacuation in a variety of cases (e.g., fire accident [21,22], earthquake, terrorist attack [23,24], flood disaster [25], etc.). The related works can be generally divided into two categories: The macroscopic continuum model and the microscopic discrete model. The former [26,27] characterizes the crowd motion as the homogeneous continuum flow, deducing a series of rigorous mathematical theories and numerical algorithms. The microscopic model can be further divided as: Social force model (SFM) [28,29], cellular automata model [30,31], lattice gas model [32,33], etc. Especially, SFM flexibly considers the dynamic interactions between individuals, deriving more reliable and realistic results compared to the macroscopic model [34]. Li et al. [35] modify the panic SFM and find that an evacuation leader is beneficial to maintain the calm and order of the crowd. Especially, a two-layer model is proposed to reproduce a real-life grouping situation in an earthquake evacuation [36].
In this paper, a BW perspective is introduced to help us to understand the evolutionary process of mild and radical evacuation behaviors. A three-layer SFM is developed to study the dynamic decision-making process of evacuees during post-earthquake evacuation near an AFBGS. The paper is organized as follows. Section 2 introduces the framework of the three-layer model, which includes the trigger mechanism of BW effect, individual decision, and crowd dynamics. In Section 3, we show that the new model can reproduce the realistic BW phenomenon recorded by the video. The validation and sensitivity analysis are conducted to explore the impact of the parameters. In Section 4, the utilities of evacuation strategies are compared to derive the policy suggestions. Section 5 concludes the influence mechanism of BW under post-earthquake evacuations.

Materials and Methods
In accordance with the existing metaphor [17][18][19][20], the BW effect is defined as visible signs of abnormal/noticeable behavior in the crowd, which forms dominant strategies and encourages more followers. Metaphorically speaking, the first broken windows behavior (BWB) [37] refers to the trigger that changes the original state of the crowd. Involved in stress and chaos [38], the effect of BWB on individuals will be strengthened continuously. As shown in Figure 1, mild BWB (using the left door) and radical BWB (climbing over the gates) appear in the different stages of post-earthquake evacuation, and a three-layer SFM based on the staged BW effect is developed.

Calculation Paradigm
In terms of the surveillance video, the evacuation scenario is near an AFBGS. The schematic illustration of the scenario is shown in Figure 2. The typical evacuation strategies at such three stages are mainly divided into: Habitual evacuation, mild BWB, and radical BWB. Each evacuee i considers three candidate strategies of exit choice: Passing the right door, passing the left door, and climbing over a closed barrier gate m. The choice of the right door represents the habitual evacuation strategy, and evacuees select the usual path. While the choice of the left door and that of closed barrier gate refer to the mild BWB and radical BWB, respectively. The latter two are triggered and influenced by the rule breakers (RB). Based on the perception of the dynamic environment, each evacuee i compares the disutility U i right door , U i le f t door , U i g m (g m denotes gate m) and chooses the strategy with the minimum disutility. The disutility U i k , k ∈ {right door, left door, g m } is set as follows.
where µ i k is the perceived preference coefficient of strategy k; T i→k denotes the estimated travel time for individual i to complete the strategy k; ε > 0 is the adjustment coefficient to prevent the denominator µ i k + ε from going to zero during the simulation. The computational details of µ i k and T i→k are presented in Section 2.2. The mild and radical RBs (MRB, RRB) are set to trigger different states: The first MRB breaks the habit mildly and selects the left door as the exit; the RRB climbs over a gate and escapes from the room. The whole evacuation process is divided into three stages:  Figure 2. Calculation paradigm of the three-layer SFM.

Multi-Stage BW Effect Model
A new three-layer SFM is developed to simulate the crowd motion of pedestrians after earthquakes. The classical SFM [29] was introduced that an individual kept reasonable distances to other individuals and obstacles during the evacuation. For a pedestrian i of mass i m (70 kg), the actual velocity i v at time t is governed by the following equation: where ij f and iw f , respectively, denote the action forces that evacuee j and wall w exert on evacuee i . So that i tries to keep a reasonable distance from evacuee j and wall is self-driven force meaning that evacuee i tends to move with the desired speed in the desired direction by adjusting his actual velocity i v within the characteristic time τ . The action force ij f that evacuee j exerts on evacuee i is calculated as follows.

Multi-Stage BW Effect Model
A new three-layer SFM is developed to simulate the crowd motion of pedestrians after earthquakes. The classical SFM [29] was introduced that an individual kept reasonable distances to other individuals and obstacles during the evacuation. For a pedestrian i of mass m i (70 kg), the actual velocity v i at time t is governed by the following equation: where f ij and f iw , respectively, denote the action forces that evacuee j and wall w exert on evacuee i. So that i tries to keep a reasonable distance from evacuee j and wall w. J i is the index set of the evacuees remaining in the room except evacuee i, W is the index set of the walls inside the room. The coefficient v 0 i is the desired speed, the unit vector e i (t) pointing from i to the target exit is the desired moving direction, τ > 0 is the characteristic is self-driven force meaning that evacuee i tends to move with the desired speed in the desired direction by adjusting his actual velocity v i within the characteristic time τ. The action force f ij that evacuee j exerts on evacuee i is calculated as follows.
where n ij is the unit vector pointing from the position of evacuee j to that of evacuee n ij represents the psychological tendency between i and j to stay away from each other. r ij is the sum of their body radius. d ij is the distance between two evacuees' centers of mass. kg r ij − d ij n ij and κg r ij − d ij ∆v t ji t ij are the body force and the sliding friction force, respectively. They will arise when i and j touch each other, i.e., d ij < r ij . The function g(x) = max{x, 0}, t ij = (−n 2 ij , n 1 ij ) is the tangential direction and ∆v t ji = v i − v j · t ij is the tangential velocity difference. Similarly, the action force f iw in Equation (2) between evacuee i and wall w is given by For more details of f ij and f iw , we refer to [29]. Finally, the desired direction e i (t) is calculated as where p i (t) and p exit-target (t) are the positions of evacuee i and the target exit chosen by i at time t, respectively. Clearly, the desired direction depends on the exit choice strategy. The desired direction e i is used to describe different exit choice strategies in the three stages. Each strategy represents a target exit (right door, left door, or one gate). Integrating the multi-stage exit choice pattern with the SFM is essential to reveal the mechanism and effect of BWB during post-earthquake evacuation.

Stage 1: Habitual Evacuation
Under the impact of daily habits, as shown in Figure 1, the left door is overlooked, and the right door is selected as the unique exit. Then the desired direction e i is set to point to the position of the right door at this stage. The motion of individual i is calculated according to Equation (2).

Stage 2: Habitual or Mild BWB Evacuation
Referring to previous literature [39], an agent (MRB) is set to break the rules and select the left door as the exit at T le f t . Some evacuees in the crowd may capture this abnormal phenomenon and suddenly notice the availability of the left door. Then the right door, together with the left door become the candidate exits. The daily habit (only using the right door) is mildly broken. It should be noted that, by this time, the radical behavior (climbing over barrier gates) does not appear because nearly all of evacuees are still patient. It takes a while for evacuees to get impatient and carry out radical actions. Therefore, in stage 2, evacuee i compares the preference µ i k to decide the target exit and related desired direction. That is: where DH i k (t) is a 0-1 binary variable, which represents the habitual constraint of exit k for pedestrian i and characterizes the instantaneous transition from the habitual state to the MRB's state; a continuous variable FS i k (t) indicates the stimulation of favorable strategy (SFS) on individual's decision making. The details are listed as follows.
(1) Daily habit DH i k (t) Traffic in some countries and regions, including China, keeps right [29]. Following the basic behavior standard, the right door is regarded as the conventional exit and DH i right (t) ≡ 1. The MRB, as a visible sign of QBW stimulation, contributes to the deviation of daily habits-especially for using the left exit during the evacuation. Agent i is affected by the MRB at t i le f t (t i le f t ≥ T le f t ). Then, the daily habit coefficient of the left door DH i le f t (t) can be defined as: However, not all individuals are influenced by the MRB immediately. The transition from E 1 to E 2 must satisfy at least one of the following two conditions. There are evacuees (MRB or individuals with mild BWBs) near the left door in the semi-circular area Q 3 (see in Figure 3). Meanwhile, the vision of evacuee i towards the left door is not covered by other pedestrians. That is: where Q 1 is the edge area of vision towards the left door. N Q 1 denotes numbers of agents in Q 1 . S Q 1 is the area of Q 1 calculated by visual angle θ corner and the radius r vision . β is the shielding factor. When other agents' bodies shade the view, the individual cannot detect the situation near the left door (shown in Figure 3).
However, not all individuals are influenced by the MRB immediately. The transition from 1 E to 2 E must satisfy at least one of the following two conditions.

Condition 1: The evacuee directly observes that the left door is available.
There are evacuees (MRB or individuals with mild BWBs) near the left door in the semi-circular area 3 Q (see in Figure 3). Meanwhile, the vision of evacuee i towards the left door is not covered by other pedestrians. That is: where 1 Q is the edge area of vision towards the left door.

Condition 2:
The evacuee realizes that the individual nearby holds an unconventional decision.

Condition 2:
The evacuee realizes that the individual nearby holds an unconventional decision.
There is at least one individual j with an unconventional decision in Q 2 . Motion attributes of the agent j is defined as follows.
where e 0 j → (x le f t , y le f t ) denotes the direction of desired speed e 0 j points to the left door and v j is the actual speed of agent j. The second condition in Equation (9) is that the angle between the actual moving and desired speed directions of agent j is less than ω. Meanwhile, the moving speed of agent j is large enough (greater than the threshold α). Then, j's intention to choose the left door is observable for individual i.
(2) Stimulation of the partially-favorable strategy FS i k (t) of door k The herding effect [31,40] is highlighted in the existing literature regarding seismic evacuation, which stresses that an agent often tries to keep the same moving direction and speed of the crowd. However, in a homogeneous environment, anomalies are always more likely to be observed and imitated [41,42]. As shown in Figure 1b, a small group was stimulated and quickly separated from the "sheep". They imitated the MRB while he ran away from the room quickly. The SFS is defined as a phenomenon that individuals are affected by a few predominant individuals. When the desired direction is limited within the crowd, individuals tend to be stimulated by a few abnormal evacuees nearby or who hold more favorable or efficient evacuation strategy.
The right door is always noticed by any evacuee during the evacuation and FS i right (t) ≡ 1. For t < t i le f t , the evacuation advantage of the left door is not detected, so FS i le f t (t) is set to 0 initially. At the time t i le f t , the availability of the left door is instantaneously perceived by individual i, resulting in a sudden and unexpected emotional stress. Prati et al. [43] stated that the emotional stress could reinforce the attraction of a strategy, if its utility increased suddenly from one time to another. Thus, to characterize the trigger effect, due to the sudden change of the perceived availability. FS i le f t (t) is define as: where N Q2 t i le f t is the total number of pedestrians in Q 2 at time t i le f t , N le f t Q2 t i le f t is the number of pedestrian q in Q 2 whose desired speed and moving speed point to the left door at time t, i.e., satisfying Equation (9). d iq t i le f t denotes the distance between i and q. v q t i le f t is the velocity of evacuee q. δ 1 and ε are the adjustment coefficients. Particularly, d ij (t)+ε means the herding effect of the crowd heading the left door [43,44]. δ 1 presents individual difference of the psychological stimulation to SFS. This measures the abnormality degree of crowd behavior (left door strategy is abnormal and arresting for evacuee i) within the area of Q 2 . λ 1 is the degradation coefficient of individual's SFS to the left door.
According to Equation (10), at t = t i le f t , the value of FS i le f t (t) instantly jumps from zero to a real number greater than one. After t i le f t , the left door is included in the decision set. Individuals actively choose the optimal exit. However, since the right door and the left door have the same natural availability, the attraction of the left door declines gradually and reaches the same level as that of the right door eventually. The mathematical operation "max" of the second item on the right-hand side ensures FS i le f t (t) is not less than 1 when t ≥ t i le f t . Therefore, FS i le f t (t) is larger than 1 at t i le f t , and then continues to decrease until equals to 1 (FS i right (t)).  Figure   4).

Gate
Right Door Left Door  The estimated passing time of Q 2 for individuals is based on the velocity field and spatial distribution of evacuees moving to different exits. Assume that agent i selects door k (k ∈ {right door, left door}) as the target exit. Then, individuals with different decisions in their vision are divided into two categories. In addition to passing through the unobstructed area, agent i may come into conflict with individuals (A ¬ k i→k ) who are moving to another exit, then evolves into overtaking or waiting behavior. The probabilities of overtaking and waiting are γ and 1 − γ. Then the expected time to pass area Q 2 equals the sum of waiting, overtaking, and straight-line walking time.
where r i is the radius of individual i, v k i→k is the average speed of agents moving to the same exit with i in area Q 2 , N ¬ k i→k and v ¬ k i→k denote the number and average speed of agents moving to the different exit from i in Q 2 . ϑ is the adjustment coefficient. r vision is the radius of Q 2 . The three terms on the right-hand side of Equation (11) are the expected waiting, overtaking, and straight-line walking time, respectively. The numerical maximum value of the last term ensures that the item will not be less than 0 when too many agents (N ¬ k i→k ) choose the different exit compared to i. The rest two regions tend to be a little far from i and the evacuation time is roughly estimated. In area Q 3 , pedestrians swarm towards the same exit, and the speed of the crowd tends to be homogeneous. T des i→k is defined as: where R door is the radius of Q 3 . v Q 3 is the average speed of agents in the Q 3 . P k is the set of agents in Q 3 . N i→k is the total number of agents in the Q 3 . T path i→k is defined as: where d i−k is the distance between agent i and door k. v 0 i is the desire speed. ϕ is the conservative coefficient of the individual actual velocity. Considering the possibility of overlap of areas Q 2 , Q 3 (see in Figure 4), the estimated time T i→k is defined as: (4) Disutility and movement Compare the disutility of two doors with Equation (15). The exit with the minimum disutility is chosen. The desired direction e i in Equation (5) is set to this exit and the state of i is updated to E 2 right or E 2 le f t . Then the motion is calculated based on Equation (2).

Stage 3: Habitual, Mildly BWB, or Radical BWB Evacuation
At T gate (T gate > T le f t ), a RRB near the AFBGS is set to climb over a gate. Then, evacuees satisfying both conditions below would be affected by the RRB: 1 FS i le f t (t) ≥ 1. Transition to state E 2 is the basic condition for individuals to shift to stage 3. 2 Individuals near the RRB can transfer to the radical state and start to climb over a gate immediately ( P i − P g m ≤ R climb ). The above settings are based on the video (see in Figure 1). It should be noted that before entering Stage 3, the evacuee must have entered stage 2. When the RBB appears, the effect of radical BWB may be triggered, then passing the left/right door and climbing over winged gates are the candidate strategies for evacuees. Therefore, the target exit for agent i in stage 3 is determined by comparing the exit disutility U i k , k ∈ {left door, right door, gate m}. The perceived preference µ i k for the left and right doors has been studied in Section 2.2.2 (Stage 2), and that for the winged gate in Stage 3 can be defined as: where g m , m ∈ {1, 2, 3, 4} denotes the different winged gates of the AFBGS. MN i g m (t) denotes the legal or moral constraint of individual i about climbing over gate m. Similar to FS i le f t door (t), FS i g m (t) is set as a part of overall preference µ i g m which changes continuously. (1) Moral norm MN i g m (t) Each individual needs to swipe a card to enter the gate (as a one-way entrance) in daily life. Before the RRB appears, evacuees follow the regulations and laws (MN i g m (t) = 0). After t i g m , the time point that individual i satisfies the trigger conditions, they tries to climb over the nearest gate to gain a competitive advantage. Their decision adds the option of climbing over the gate and MN i g m (t) = 1. (2) Stimulation of the partially-favorable strategy FS i g m (t) of gate g m Walking on the left is only about habits in our hypothesis. FS i le f t (t) is always no less than 1 after t i le f t . However, climbing over the gate is illegal or immoral behavior in daily life (e.g., a maximum fine of 1000 yuan for crossing the automatic ticket gate in Beijing Metro). In terms of the video, only a few students, which broke the regulations to choose the climbing strategy. Thus, FS i gate (t) is set to stay low and decrease after t i g m , even close to 0. The radical BWB needs continuous "stimulation" to maintain a large positive value.
Meanwhile, only when approaching the gate ( P i − P g m ≤ R climb ), pedestrian i may consider the strategy of climbing over a gate. For each empty gate m, FS i g m (t) is defined as: where NG denotes the number of gates. As RRB does not exist at every time step, the memory effect is considered that individual can remember the behavior of climbing over a gate (radical BWB) for the last K times. At the last s-th time (s ∈ {1, 2, 3, . . . , K}), t i s is the moment when individual i observes that other evacuees start to climb over the gate. NG s is the number of gates occupied by individuals at time t i s . In accordance with the Formula (8), δ 2 and λ 2 are the psychological stimulation and the degradation coefficient of individual's SFS to the gates of AFBGS. δ 3 denotes the basic value of FS i g m (t) which is less than 1. It indicates that climbing over the gate is a sudden BWB, and the priority of which is lower than the evacuation strategy to use the left/right door, while the basic value of FS i k (t) is 1 according to the Formula (8). Due to the moral constraint, for t > t i g m , the psychological attraction FS i g m (t) tends to decrease gradually and become smaller than 1 eventually, i.e., FS i g m (t) < FS i le f t (t), FS i right (t). P i , P g m are the coordinates of the individual and gate m. The function H(x) is 0, if the pedestrians are far away from the gates (R climb < P i − P g m ), and is otherwise equal to 1.
(3) Estimated crossing time Similarly, the estimation time for an evacuee to climb over a gate is defended as: where ϕ(0 < ϕ < 1) is the conservative coefficient. P i − P g m /ϕv i denotes the walking time of individual i to the exit. TC is the estimated time to climb over a gate.
(4) Disutility and movement In stage 3, the movement of individuals is determined by comparing the disutility of two exits (U i k , k ∈ {left door, right door}) and the four gates (U i g m , m = 1, 2, 3, 4), which are not occupied by other pedestrians currently. The disutilities of the left and right doors are presented in Section 2.2.2, and that of the gates is defined as The exit with the minimum disutility is chosen by evacuee i, and the desired direction e i in Equation (5) is set to this exit.
Based on the discrete evacuation model proposed in previous research [9,45,46], the movement of the individual near a gate is simplified. The set I (i ∈ I) represents individuals with the target g m . Firstly, random numbers are generated for individuals, and the coordinate of the individual i with the largest number is set to P g m (P i = P g m ) and the state is updated to E 3 . Secondly, the individual i is frozen for TC seconds to simulate the climbing behavior. Thirdly, remove the individual i from the simulation environment and g m is remarked as empty.

Model Validation
In this part, we focus on validating the three-layer SFM model developed in Section 2 and whether the model can reproduce the real post-earthquake evacuation process shown by Figure 1. The parameters of SFM are in accordance with Helbing's works [29,47,48]. Other parameters (refer to Appendix A) are properly set to match the realistic scenario. The shielding factor β is set to 0.35, and the perceptual threshold ω is set to 20 0 , which is according to the diopter of human eyes [49]. The threshold speed α = 0.3 m/s, probability of overtaking γ = 0.9, conservative coefficient ϕ = 0.9 of actual velocity, estimated time TC = 0.5 s to climb over the gate, and the radius of climbing R climb = 1.5 m are estimated by the statistical information of realistic earthquake evacuation video [12,35,50]. Additionally, the coefficients of PS i le f t door , i.e., δ 1 = 0.3, λ 1 = 0.2 and those of PS i g m , i.e., δ 2 = 0.9, δ 3 = 0.9, λ 2 = 0.2, are estimated according to the simulation results (the sensitivity analysis will be conducted in Sections 3.1 and 3.2).
The configuration for the simulation scenario is as follows, using NetLogo as an agentbased programming language. As shown in Figure 5, a total of 150 pedestrians evacuates a room near an AFBGS. The AFBGS contains two doors and four gates (entrance using a swipe card), which is about 300 m 2 . The width of the two doors (pink and red), four gates (orange), and two pots of plant (green) are set to 1 m (see Figure 5). Then the time step is set to 1 s. In each experiment, according to previous researches [51][52][53], 100 independent simulations were performed, and the average value was taken as the output result. Before the simulation began, 150 individuals were random placed away from AFBGS. An MRB and RRB were programmed exogenously into the simulation process. The configuration for the simulation scenario is as follows, using NetLogo as an agent-based programming language. As shown in Figure 5, a total of 150 pedestrians evacuates a room near an AFBGS. The AFBGS contains two doors and four gates (entrance using a swipe card), which is about 300 m 2 . The width of the two doors (pink and red), four gates (orange), and two pots of plant (green) are set to 1 m (see Figure 5). Then the time step is set to 1 s. In each experiment, according to previous researches [51][52][53], 100 independent simulations were performed, and the average value was taken as the output result. Before the simulation began, 150 individuals were random placed away from AF-BGS. An MRB and RRB were programmed exogenously into the simulation process.
We respectively record the evacuees with state 1 E (choose the right door), with state 2 E and choose the left door, with state 2 E and choose the right door, with state 3 E and climb over one-winged gate. The colors are set to white, red, blue, and grey, respectively, (as shown in Figure 5) for further comparison. Continuous spatial-temporal patterns are presented through the program. Comparing with the actual situation (see Figure 5a-d), the simulation scenarios of dynamic decision-making process of evacuees (see Figure 5fh) show similar distributions. This means that the BW evacuation model developed in Section 2 can reproduce the real collective behavior in post-earthquake evacuation recorded by the video. The variation movement tactics and rules of individuals influenced by the MRB or RRB are analyzed in the following section.   We respectively record the evacuees with state E 1 (choose the right door), with state E 2 and choose the left door, with state E 2 and choose the right door, with state E 3 and climb over one-winged gate. The colors are set to white, red, blue, and grey, respectively, (as shown in Figure 5) for further comparison. Continuous spatial-temporal patterns are presented through the program. Comparing with the actual situation (see Figure 5a-d), the simulation scenarios of dynamic decision-making process of evacuees (see Figure 5f-h) show similar distributions. This means that the BW evacuation model developed in Section 2 can reproduce the real collective behavior in post-earthquake evacuation recorded by the video. The variation movement tactics and rules of individuals influenced by the MRB or RRB are analyzed in the following section.

Numerical Simulation of the MRB's Behaviors
(1) Effect of trigger time T le f t The trigger time T le f t indicates the moment that the MRB appears, and it show great importance to the appearance of mild BWBs. As shown in Figure 6, the decrease of T le f t makes the total evacuation time (TET) shorter and improves the evacuation efficiency. The appearance of MRB at the beginning of evacuation shortens the TET (near 30 s) compared with the control group (the actual evacuation). Meanwhile, the earlier detection of MRB increases the utilization of the left door for evacuees. The number of evacuees using the left door shows a linear growth with the decrease of T le f t , which is stable at about half of the total number of evacuees.  λ (see in Figure 7).  (2) Effect of the stimulation of favorable strategy FS i k (t) The SFS is particularly obvious in the homogeneous system, and affects the individual decision-making process. It is regarded as the stimulation of the MRB or individuals choosing the left door. To investigate the influence of FS i le f t (t) after the appearance of the MRB, the usage of the left door is measured when changing the value of stimulation coefficient δ 1 and degradation coefficient λ 1 (see in Figure 7).  λ (see in Figure 7).  δ 1 refers to the psychological stimulation of SFS (the initial value is 0.3). As shown in Figure 7a, the evacuation scenarios are similar. With a larger δ 1 , more individuals transfer to state E 2 and utilize the left door. The TET becomes shorter, while the efficiency of the two exits (especially the left door) are improved. When δ 1 is 0.9, the growth rate of the usage of left door is the largest. More individuals in state E 1 are affected by the MRB, and then transfer to E 2 and choose the left door as the exit. The process of transition is greatly accelerated. λ 1 refers to the decline of an individual's BW psychology to MRB. The change of λ 1 has a reverse effect on the evacuation process. Figure 7b shows that the numbers of evacuees using the left door is higher with a lower value of λ 1 . The cumulative increase of evacuees choosing the left door strengthens the mutual psychological stimulation in the crowd. More evacuees ignore the daily habits and make the optimal decision by comparing the disutility. Therefore, the evacuation time is reduced.

Numerical Simulation of the RRB's Stimulation
(1) Effect of trigger time T gate and crossing time TC Here in this section, we focus on the effect of the time RRB appears (trigger time T gate ) and the time evacuees climb over the gates of AFBGS (crossing time TC) on evacuees' radical BWB. The impact of trigger time T gate (T gate > T le f t ) on the TET is generally unnoticeable (as shown in Figure 8). For T gate ≥ 70 s (after the scenario one), as T gate increases, the total number of evacuees climbing over the gates in the whole evacuation (TEG) decreases significantly and TET slightly increases. In addition, TEG continues to increase, while T gate increases from 70 s to 90 s, and reaches the maximum value at T gate = 90 s. After that, TEG presents a steady decline eventually. It can be seen from Figure 8b that the impact of estimated crossing time (TC) on TET is relatively small. Moreover, as TC increases, the individual cost of climbing over a gate will also increase, and then fewer evacuees will choose gates as the exit. has a reverse effect on the evacuation process. Figure 7b shows that the numbers of evacuees using the left door is higher with a lower value of 1 λ . The cumulative increase of evacuees choosing the left door strengthens the mutual psychological stimulation in the crowd. More evacuees ignore the daily habits and make the optimal decision by comparing the disutility. Therefore, the evacuation time is reduced. . After that, TEG presents a steady decline eventually. It can be seen from Figure 8b that the impact of estimated crossing time ( TC ) on TET is relatively small. Moreover, as TC increases, the individual cost of climbing over a gate will also increase, and then fewer evacuees will choose gates as the exit.  (2) Effect of the stimulation of favorable strategy FS i g m (t) Here in this section, we focus on the effect of the psychological effect and the degradation coefficient of individual's SFS to the gates of AFBGS (δ 2 , δ 3 , and λ 2 ) on the SFS, TET, and the evacuation strategies of evacuees. The simulation results show that the impact of δ 2 is rather small, so we only report the results about δ 3 to study the system performance after trigger time T gate , as shown in Figure 9a. As δ 3 increases, TET varies from 260 s to 270 s, implying that the transition is moderate. Furthermore, the number of evacuees using the gates during simulation process (NEUG) changes significantly with different values of δ 3 . The changing curves of NEUG is unimodal and declines rapidly if δ 3 is small (δ 3 = 0.1 or 0.3), and it is multimodal and fluctuates repeatedly when δ 3 is large (δ 3 = 0.6 or 0.9). A higher level of δ 3 (for example, comparing δ 3 = 0.9 and δ 3 = 0.1) generally results in a larger NEUG, as well as a longer time for using gates during the evacuation process. This implies that δ 3 has a noticeable effect on the exit strategies of evacuees. δ has a noticeable effect on the exit strategies of evacuees.

Discussion
People will not take the initiative to violate the rules or laws to obtain convenience in daily life [39,54]. Even in the classical "broken windows" experiments, the cars offered was also slightly damaged (not new)-which then caused a deterioration [17,37]. Pedestrians are told to regulate their own behaviors under the influence of moral or legal norms, for example do not walk on the left and do not climb over the gate. It is a kind of habitual action, especially for the college students in the video, which hold the basic principle of "walking on the right" at the beginning of evacuation. External stimulation of the MRB or RRB is necessary for individuals to form a psychological hint of a better evacuation decision. Then, all possibilities of exit will be tested and identified.
Previous searchers put forward many psychological theories to describe evacuees' herb behavior as "the need to behave in the same way as everyone else does" [40,41], which is also found in economics and business fields. Similar phenomena occur during the evacuation, evacuees prefer the crowded door as the exit. However, the video indicates people follow the minorities with an advantage strategy than their original one. In addition to the empirical studies, this kind of BW psychology spreads quickly after the breaker appears.
The BWT is highly influential. The impact of some abnormal behaviors will lead to more followers [17][18][19][20], no matter it is positive or negative. In this study, although the BWB is an abnormal or even immoral behavior in daily life, it is encouraged as the evacuation efficiency improves to a certain extent. The mild BWBs significantly reduce the TET and the environmental risk is controllable, while the two doors can be regarded as oneway exits to eliminate the influence of daily habits under emergencies. In contrast, the The effect of degradation coefficient λ 2 is presented after trigger time T gate in Figure 9b. As λ 2 increases from 0.1 to 0.9, TET increases from 255 s to 277 s. Thus, the effect is relatively apparent and the decrease of λ 2 can improve the total evacuation efficiency. Meanwhile, as λ 2 increases, the number of evacuees using gates also increases and the growth rate is positively related with λ 2 . In addition, a lower level of λ 2 (for example, comparing λ 2 = 0.9 and λ 2 = 0.1) generally results in a longer time for using gates during the whole evacuation process. The decrease of λ 2 can rise the utilization rate of gates.

Discussion
People will not take the initiative to violate the rules or laws to obtain convenience in daily life [39,54]. Even in the classical "broken windows" experiments, the cars offered was also slightly damaged (not new)-which then caused a deterioration [17,37]. Pedestrians are told to regulate their own behaviors under the influence of moral or legal norms, for example do not walk on the left and do not climb over the gate. It is a kind of habitual action, especially for the college students in the video, which hold the basic principle of "walking on the right" at the beginning of evacuation. External stimulation of the MRB or RRB is necessary for individuals to form a psychological hint of a better evacuation decision. Then, all possibilities of exit will be tested and identified.
Previous searchers put forward many psychological theories to describe evacuees' herb behavior as "the need to behave in the same way as everyone else does" [40,41], which is also found in economics and business fields. Similar phenomena occur during the evacuation, evacuees prefer the crowded door as the exit. However, the video indicates people follow the minorities with an advantage strategy than their original one. In addition to the empirical studies, this kind of BW psychology spreads quickly after the breaker appears.
The BWT is highly influential. The impact of some abnormal behaviors will lead to more followers [17][18][19][20], no matter it is positive or negative. In this study, although the BWB is an abnormal or even immoral behavior in daily life, it is encouraged as the evacuation efficiency improves to a certain extent. The mild BWBs significantly reduce the TET and the environmental risk is controllable, while the two doors can be regarded as one-way exits to eliminate the influence of daily habits under emergencies. In contrast, the radical BWBs do not significantly improve the overall evacuation efficiency, due to the time and ability of climbing behavior. Meanwhile, there are secondary risks when climbing over the gate for evacuees, e.g., tripped over the winged door, building collapsed, due to earthquake, etc. [1,4] Contrary to the criminological metaphor, the BWB under emergencies is not just about catastrophic consequences. A mild or radical breaker may lead to different evacuation results. The positive BWBs refer to breaking away from convention under the constraints of rules, e.g., staying calm and proactive about optimizing the environment, moving obstacles, offering assistance, etc. The negative BWBs refer to the violations mentioned above. Considering the effect of BWB, optimization strategies are proposed. The corresponding evacuation results are shown in Figure 10.
• Strategy 1: Conducting the mild BWB initially. An MRB appears initially, and the individuals imitate and follow the MRB. • Strategy 2: Improving the ability to break the routine (mild BWB) under evacuation. Through safety education, evacuation exercises, slogans, etc., increase the ability of individuals to continuously and actively break the routine (mild BWB) to promote evacuation efficiency. But preventing them from exhibiting radical behaviors (climbing over the gates). The efficiency of Strategy 2 is demonstrated in Figure 7. • Strategy 3: Conducting the radical BWB initially with a guider. A guider (see in Figure  10) is set near the gate to help individuals to swipe the card.
radical BWBs do not significantly improve the overall evacuation efficiency, due to the time and ability of climbing behavior. Meanwhile, there are secondary risks when climbing over the gate for evacuees, e.g., tripped over the winged door, building collapsed, due to earthquake, etc. [1,4] Contrary to the criminological metaphor, the BWB under emergencies is not just about catastrophic consequences. A mild or radical breaker may lead to different evacuation results. The positive BWBs refer to breaking away from convention under the constraints of rules, e.g., staying calm and proactive about optimizing the environment, moving obstacles, offering assistance, etc. The negative BWBs refer to the violations mentioned above. Considering the effect of BWB, optimization strategies are proposed. The corresponding evacuation results are shown in Figure 10.
• Strategy 1: Conducting the mild BWB initially. An MRB appears initially, and the individuals imitate and follow the MRB. • Strategy 2: Improving the ability to break the routine (mild BWB) under evacuation. Through safety education, evacuation exercises, slogans, etc., increase the ability of individuals to continuously and actively break the routine (mild BWB) to promote evacuation efficiency. But preventing them from exhibiting radical behaviors (climbing over the gates). The efficiency of Strategy 2 is demonstrated in Figure 7.  The results in Figure 10 tell an interesting story. The mild BWB only or the radical behavior under the command of certain guiders is recommended. Firstly, extreme behavior is not encouraged in real evacuation. However, effective guidance can reduce the negative impacts (e.g., risk of falls and stampede) of individual radical behaviors and increase evacuation efficiency. Secondly, if there is no guider, it is valuable for setting the slogans near the AFBGS and reinforcing the safety education to extend the time of mild BWB of evacuees. Low risk resources (e.g., new exits or paths) in the circumstance are encouraged to be rapidly integrated. The manager can stimulate the evacuees to cause the thought of breaking the routine slightly (generating MRBs) through the environmental stimulations (such as slogans and signs). Even in non-seismic zone, managers should be alert of natural hazards, and ensure the availability and connectivity of facilities.

Conclusions
In this work, we describe a dynamic decision-making process of evacuees during post-earthquake evacuation near an AFBGS. The BWT is utilized to provide a contextualized understanding of the pedestrian evacuation behavior. A multi-stage individual exit choice mechanism is developed to characterize the instantaneous transition among The results in Figure 10 tell an interesting story. The mild BWB only or the radical behavior under the command of certain guiders is recommended. Firstly, extreme behavior is not encouraged in real evacuation. However, effective guidance can reduce the negative impacts (e.g., risk of falls and stampede) of individual radical behaviors and increase evacuation efficiency. Secondly, if there is no guider, it is valuable for setting the slogans near the AFBGS and reinforcing the safety education to extend the time of mild BWB of evacuees. Low risk resources (e.g., new exits or paths) in the circumstance are encouraged to be rapidly integrated. The manager can stimulate the evacuees to cause the thought of breaking the routine slightly (generating MRBs) through the environmental stimulations (such as slogans and signs). Even in non-seismic zone, managers should be alert of natural hazards, and ensure the availability and connectivity of facilities.

Conclusions
In this work, we describe a dynamic decision-making process of evacuees during postearthquake evacuation near an AFBGS. The BWT is utilized to provide a contextualized understanding of the pedestrian evacuation behavior. A multi-stage individual exit choice mechanism is developed to characterize the instantaneous transition among different states (habitual state, mild BWB state, and radical BWB state). A three-layer social force model is proposed to reproduce the pedestrian evacuation process under different scenarios. Simulation results show that: (1) An earlier appearance of the first MRB leads to a higher crowd evacuation efficiency. While the impact of the trigger time of RRB is not significant.
(2) If evacuees maintain the state of BWB, e.g., increasing the psychological perception of mild BWB, decreasing the psychological decay of mild and radical BWBs, the crowd evacuation efficiency can be improved significantly. (3) The mild BWB and radical BWB under the command of guiders are recommended. Whereas, effective guidance can reduce the negative impacts (e.g., risk of falls and stampede) and increase evacuation efficiency. Informed Consent Statement: Informed consent was obtained from all subjects involved in the study.

Data Availability Statement:
The data presented in this study are available on request from the corresponding author. The videos are not publicly available as informed consent obtained from all subjects in the video specifically applies to this study. We need their permissions for further studies.

Conflicts of Interest:
The authors declare no conflict of interest.

Appendix A Parameters
Reference Notation Initial Value

Normal evacuation
The parameters of SFM are in accordance with [29] Number of agents N 150 Video-based fitting δ 2 0.9 Video-based fitting δ 3 0.9 Video-based fitting λ 2 0.2