1. Introduction
The appeal of shopping, dining, leisure, and other activities in urban pedestrian-only spaces is growing. However, the high concentration of pedestrians has led to overcrowding, resulting in challenges such as diminished leisure experiences and reduced travel comfort [
1]. The pedestrian comfort within urban infrastructure can be evaluated by level of service (LOS). The Highway Capacity Manual (HCM) is a widely accepted standard for assessing LOS based on three indicators: flow, speed, and density [
2]. Level of Service (LOS) is categorized into six levels, from A to F, representing a range of walking experiences. As LOS decreases, walking experience declines due to reduced speed and increased density. Initially, pedestrian flow increases as LOS decreases, reaching its peak at a moderate level (level D) before declining further with additional decreases in LOS. Among the various indicators, pedestrian density is particularly significant and widely utilized [
3]. By employing urban design strategies to passively manage pedestrian density, the functionality of urban spaces can be enhanced while addressing comfort issues for pedestrians [
4]. This study focuses on urban pedestrian-only spaces—outdoor public areas within a city designed specifically to exclude motorized vehicle traffic. These spaces prioritize safe and enjoyable movement for pedestrians, encompassing streets, squares, and pathways. Various urban environmental factors shape the pedestrian environment, influencing walking behavior and the distribution of pedestrian density [
5].
Predicting pedestrian density relies on models of pedestrian behavior, which can be categorized into macroscopic and microscopic approaches. Macroscopic models offer faster computational speeds but consider limited kinds of influencing factors, while microscopic models provide greater prediction accuracy [
6]. Since this study explores a broad spectrum of influencing factors, a microscopic model is employed. These models can predict individual walking trajectories [
7] and provide insights into patterns of collective self-organization [
8].
Microscopic models generally fall into two categories: knowledge-driven and data-driven models [
9]. Data-driven models, particularly Long Short-Term Memory (LSTM) models, offer high accuracy in predicting pedestrian movements [
10,
11,
12]. However, they typically require input from previous steps to forecast future behaviors. While these models excel at predicting individual walking trajectories, they are less effective for simulating collective behavior and establishing relationships between environmental factors and collective behavior [
13]. This study focuses on knowledge-driven models, specifically agent-based models (ABM) and social force models (SFM), which are well-suited for examining how environmental factors influence collective behavior.
ABM is effective for investigating individual walking trajectories [
14]. ABM models can be derived from cellular automata, but the discretization of space and time in cellular automata can limit the speed and direction of pedestrian flow [
15]. Research by Lee, Lee & Kang [
16] demonstrated that ABM can effectively simulate pedestrian behavior in the presence of corners and obstacles, by accurately modeling corresponding behavior and properly setting random behavior. Hussein & Sayed [
17] validated the accuracy of predicting personal walking trajectories in crowded environments. Furthermore, Ma, Brandt, Seipel & Ma [
18] highlighted the importance of incorporating visual parameters in trajectory predictions. Bossowski, Szandala & Mazurkiewicz [
19] emphasized the need for a walking dynamics model that considers factors such as route length, obstacle avoidance, visibility, and grouping behavior. ABM is suitable for studying individual pedestrian walking behavior influenced by various kinds of environmental factors and collective behavior shaped by environmental factors without subdivided attributes.
SFM is widely utilized for predicting collective self-organization patterns. According to Helbing & Molnár [
20], a pedestrian’s motion can be influenced by their internal motivation to move in a specific direction at a desired speed, along with interactions with other pedestrians and environmental boundaries, such as walls and obstacles. Initial validations of these models focused on the accuracy of individual pedestrian walking behavior [
21]. Moussaïd, Helbing, Garnier, Johansson, Combe & Theraulaz [
22] introduced an SFM that was successfully validated against collective crowd patterns. Subsequent research has incorporated the effects of corners [
23], obstacles [
24], and visibility [
25] to refine the model further. However, these influencing factors have been analyzed separately, leaving the interaction effects among them still unclear. SFM is suitable for studying collective behavior, but the scenarios are simple compared with urban walking condition.
In summary, the limitations of current research can be identified in two main areas. First, while individual pedestrian trajectory prediction models emphasize the importance of incorporating various influencing factors, they lack the capability to simulate collective patterns. Many models from previous studies, while essential for predicting collective patterns, have focused on a single type of influencing factor, thus falling short of a comprehensive framework. Although some integrate Agent-Based Model (ABM) and Social Force Model (SFM), they still struggle to clarify the relationship between pedestrian density distribution and multiple urban design factors. This limitation reduces their effectiveness in supporting urban design practices. However, recent studies indicate that a comprehensive model is now feasible to be developed. Research has shown that pedestrian density distribution is correlated with the characteristics of walking spaces, including the layout of traversable spaces, non-traversable spaces, and visual restriction spaces. However, the mechanisms underlying these differences remain insufficiently explored [
26].
The purpose of this study is to develop a two-layer prediction model for pedestrian walking behavior and density distribution in urban pedestrian-only spaces. The tactic layer utilizes an agent-based model to describe behaviors such as approaching aim behavior, turning behavior, visual restriction behavior, and traversing restriction behavior. In the operational layer, a social force model (SFM) is employed to capture group behavior dynamics and random behavior. By applying this model to simulate crowd behavior, pedestrian density distribution in urban environments can be accurately predicted. With the goal of improving comfort by optimizing density distribution, it provides a basis for proposing urban design strategies to optimize the pedestrian environment.
2. Materials and Methods
The walking behavior prediction model can be structured into three layers: the strategic layer, the tactic layer, and the operational layer [
27]. In the strategic layer, travel goals are defined, focusing primarily on overall urban analysis rather than specifically on pedestrian-only spaces. The tactic layer determines walking aims through path planning, while the operational layer predicts movement at a specific moment, influenced by the goals established in the tactic layer. The tactic and operational models can be combined to create a local walking behavior prediction model [
28]. The walking aim identified in the tactic layer sets the desired direction, which influences the operational layer’s predictions of pedestrian movement [
29]. By integrating these two layers, we can better understand the relationship between urban pedestrian-only environments and collective walking behavior.
The framework of the model proposed in this study is illustrated in
Figure 1. The process begins with the development of the tactic layer model, which predicts walking aims based on various environmental factors. Next, the operational layer model is employed to forecast pedestrian walking behavior. In the third step, agent-based simulation is used to predict pedestrian density distribution. Finally, the resulting data allows us to determine the relationship between the influencing factors and pedestrian density distribution.
2.1. Tactic Layer Path Planning Model
In the tactic layer (TL) of path planning, research indicates that pedestrians consider walking time (path length), passing difficulty, and risk factors to make decisions [
27]. In urban pedestrian-only spaces, passing difficulty can be perceived as uniform. The walking destination and turning space influence path length, while physical boundaries and collision risks affect risk factors. Consequently, TL path planning behavior is shaped by these elements: walking destination, turning space, collision risk, and physical boundaries. The walking distance prompts approaching aim (AA) behavior, reflecting the tendency to reach the destination via the shortest path. Turning space leads to turning (TN) behavior, which describes adjustments in direction when direct routes are unavailable. Collision risk results in visual restriction (VR) behavior in areas with limited visual information, increasing the likelihood of potential collisions with other pedestrians. Lastly, physical boundaries lead to traversing restriction (SR) behavior, which anticipates and avoids collisions with environmental obstacles.
This study employs Maslow’s hierarchy of needs to organize the logic behind the four behaviors discussed, as illustrated in
Figure 2. Maslow [
30] identified five levels of human needs, ranging from basic to advanced: physiological, safety and security, love and belonging, self-esteem, and self-actualization. Generally, lower-level needs have a greater influence on human behavior, and with the rise of demand level, the influence of demand on people gradually decreases. In this context, AA and TN behaviors are associated with efficiently reaching one’s destination, which aligns with self-actualization needs. Conversely, VR and SR behaviors are linked to the risks of collisions with the physical environment or other pedestrians, representing the safety needs of individuals.
According to Maslow’s hierarchy of needs, AA and TN behaviors, as well as VR and SR behaviors, form two distinct pairs at the same level. Although AA and TN do not occur simultaneously, VR and SR may act concurrently and influence the walking direction determined by AA and TN. The respective influence weights of VR and SR on walking direction are denoted as
and
. Let the walking direction determined by AA and TN be
, and the direction suggested by VR behavior be
. The resulting direction influenced by AA, TN, and VR (
) is
. Similarly, if
represents the direction determined by SR behavior, the final walking aim (
) is
. The overall logic of path planning is illustrated in
Figure 2.
In AA behavior, pedestrians direct their movement toward their destination, which represents their walking aim. The behaviors of TN, VR, and SR are more complex, and the methods for determining the walking aim will be discussed in the following section. All these TL path planning principles are informed by existing research findings.
2.1.1. Path Planning for Turning (TN) Behavior
Dias & Loverglio [
31], Wu, Yue, Liu, Zhang & Shao [
32] categorize the TN behavior into four distinct stages, as illustrated in
Figure 3. Phase 1 involves a straight-walking stage before the turn, where pedestrians move parallel to the corridor boundary. Phase 2 occurs when pedestrians approach a set distance upstream of the turning area, prompting them to move closer to the inner curve of the corner. Phase 3 begins when the turning region falls within the pedestrian’s field of attention; during this phase, they gradually adjust their direction before entering the intended corridor. Phase 4 sees pedestrians unaffected by the turning factors, at which point they engage in AA behavior.
When
, the pedestrian is in Phase 1. Here,
represents the x-coordinate of the current position
, The pedestrian’s walking aim is
[
32]. The x-coordinate of the inner curve is
, with
positioned a certain distance
upstream of
. The distance
varies according to walking habits and is modeled as a normal distribution, with a mean of 5.82 m and a standard deviation of 2.40 m [
33]. Additionally,
is the y-coordinate of the current position
.
When
, the pedestrian is in Phase 2. In this phase, the walking aim is denoted as
. The position
is located at a distance equal to the radius of the attention field (
= 4.0 m) upstream of
[
34].
, where
is the y-axis coordinate of the inner curve, and
is the y-axis coordinate of the first position in Phase 2 (
), when the behavior of approaching the inner curve starts [
33].
When
and
, the pedestrian is in Phase 3. During this phase, the pedestrians gradually adjusts their desired direction and align their walking aim with the turning target, denoted as
. The coordinate
is located at a distance of
downstream from
. The pedestrian’s walking aim can be expressed as
, where
represents the x-coordinate of the destination
. The calculation method for
is detailed in Equation (
1).
where,
and
are the x-axis coordinates (m) and y-axis coordinates (m) of the initial position (
) in Phase 3.
When , the pedestrian enters Phase 4. In this phase, pedestrians are not influenced by TN behavior and instead engage in AA behavior. The walking aim in this phase is the destination, represented as ).
2.1.2. Path Planning for Visual Restriction (VR) Behavior
Sun, Sun, Duives & Hoogendoorn [
35] demonstrates that when pedestrians approach an area with a limited field of vision, the lack of visible information encourages them to adjust their trajectories to avoid potential collisions, as shown in
Figure 4. If the pedestrian’s current position, denoted as
, satisfies
, they begin to deviate due to visual restriction (VR) behavior. The point
is located a specific distance
upstream from the boundary of the restricted field of vision (
). The average value of
is 7.55 m, with a standard deviation of 5.65 m [
35]. The position where this deviation first occurs is identified as
. The pedestrian’s intended walking direction, influenced by VR behavior, is represented as
. The calculation for
is provided in Equation (
2), and the calculation for
is detailed in Equation (
3) [
35].
where,
represents the y-axis coordinate (m) of the position where VR behavior begins. The variable
denotes the y-axis coordinate (m) of the visual restriction boundary. The flow rate of pedestrians entering and exiting the building entrance, denoted as
f (persons/m
2), is measured in the vicinity of the entrance under the condition that
falls within the range
. When
, the pedestrian density
.
where,
is the x-axis coordinate (m) of the center of visual restriction area.
The expected direction of VR behavior is denoted as , with representing the angle between and the boundary of the visual restriction area. Independently of VR behavior, the expected direction of pedestrian movement is represented as , and denotes the angle between and the boundary of the visual restriction area. When , the pedestrian’s walking goal is . Conversely, when , the walking target is . The VR behavior concludes when either or .
2.1.3. Path Planning for Traversing Restriction (SR) Behavior
Fajen & Warren [
34] demonstrates that when the distance between a pedestrian’s expected path and an obstacle is less than 0.5 m, and the distance from the pedestrian to the obstacle is less than 4 m, pedestrians are likely to deviate from their path to maintain a safe distance of 0.5 m from the obstacle’s boundary. Obstacles that influence pedestrian behavior can be categorized into bulge obstacles and point obstacles, as illustrated in
Figure 5.
A bulge obstacle occurs when the obstacle is connected to the boundary of the corridor. Taking the condition shown in
Figure 5 as an example, the pedestrian’s walking target during SR behavior is
, where
is the x-coordinate of the left boundary of the obstacle, and
, with
representing the y-coordinate of the upper boundary of the obstacle. The SR behavior concludes when
.
A point obstacle is defined as an obstacle that is not connected to the corridor boundary, prompting pedestrians to detour either left or right. The typical path-planning logic involves selecting the shortest route to minimize total walking distance, leading pedestrians to choose the side with the smaller detour distance [
27]. For instance, as illustrated in
Figure 5, when pedestrians decide on a detour, there are two scenarios to consider: first, when pedestrians are positioned between the upper and lower boundaries, that is,
; and second, when pedestrians are outside these boundaries, meaning
or
. In both scenarios, pedestrians may opt for either
or
as their bypass route. The lengths of these bypass paths, denoted as
and
, must be calculated. If
, the walking target is
; if
, the walking target is
.
2.2. Operational Layer Velocity Prediction Model
The social force model (SFM) is a widely used velocity prediction model at the operational layer (OL) and can be integrated with TL models to predict pedestrian flow in complex environments. However, the original SFM has not been validated for its ability to predict collective behavior [
22] and does not adequately account for group dynamics and walking randomness [
20]. To address these limitations, the modified SFM proposed by Zanlungo, Ikeda & Kanda [
36] (SF-Z) will be used as the prediction model for OL velocity in this study.
The SF-Z model retains the fundamental structure of the original SFM while enhancing the consideration of group behavior and randomness in pedestrian behavior. Additionally, its ability to predict density distribution has been validated. The calculation method for pedestrian acceleration (
a) is presented in Equation (
4).
where:
represents the influence of the pedestrian’s walking aim;
denotes the interactions among group members;
accounts for collision avoidance behaviors;
reflects the impact of other attractive elements; and
signifies random behavior.
The model posits that the desired direction is jointly determined by the influence of the walking aim and other attractive elements. Consequently, and can be consolidated into . Additionally, when pedestrian density is low, the randomness inherent in collision avoidance behavior allows and to be simplified to .
2.3. Abnormal Prediction Results Modification
This study categorizes pedestrian interactions with boundaries into four scenarios based on their relative positions: crossing upper boundaries, lower boundaries, left boundaries, and right boundaries. According to Ma, Song, Fang, Lo & Liao [
37], individuals maintain a safe distance of approximately 0.4 m from space boundaries to ensure safe movement, resulting in an effective walking space of 0.4 m from the boundary. For example, when predicting a scenario involving crossing the upper boundary, let the pedestrian’s current position be
, the predicted speed at the next time step be
V, the direction of motion be
, and the prediction time interval be
, as illustrated in
Figure 6. The y-coordinate of the upper boundary is represented as
. If
, it is assumed that the pedestrian will cross the upper boundary, necessitating an adjustment in the prediction results. During this process, the pedestrian’s walking direction remains unchanged, and the method for calculating walking speed (
V) is detailed in Equation (
5). The modification for crossing the lower, left, and right boundaries is analogous to that of crossing the upper boundary, as illustrated in
Figure 6.
6. Conclusions
This study develops a pedestrian flow prediction model tailored for urban pedestrian-only spaces. The walking direction is determined by a tactic layer (TL) model that integrates multiple influencing factors. This direction is a crucial input for velocity prediction in the operational layer (OL), creating a collective behavior model for complex urban walking scenarios. When applied to Harbin Central Street, the model achieves a pedestrian density distribution prediction error of 0.0029 person/m2, which is lower than that of conventional TL models, models lacking TL behaviors, or those employing different OL models. Compared to the TL-R-OL-M model, this represents a 46.3% improvement.
For the tactic layer, a comprehensive approach that incorporates multiple factors significantly enhances prediction accuracy. Key behaviors—such as turning, visual restriction, and spatial restriction—are crucial for reflecting collective patterns. Omitting any of these factors impairs accuracy, as the environmental elements triggering them exert a broad influence on collective behavior.
For the operational layer, under natural walking conditions in pedestrian-only spaces dominated by leisurely pedestrians, incorporating group and random behaviors can delay changes in walking behavior caused by shifts in desired direction and reduce the density gradient. An operational layer that fits the application scenario can significantly enhance the prediction of collective behaviors.
In urban pedestrian-only spaces, both the tactic and operational layers are essential for accurate behavior prediction. Omitting path planning settings or using models not validated in urban environments compromises prediction accuracy.
7. Application and Future Work
This paper presents a collective behavior prediction model designed to effectively assess the use of urban pedestrian-only spaces, which are increasingly vital in daily life. The model can identify areas that are prone to overcrowding or potential hazards and explore appropriate solutions.
The model can be applied to identify critical regions for designing walking environments. When a preliminary design plan for urban pedestrian-only spaces is proposed, this model can predict pedestrian density distribution, highlighting overcrowded areas that require intervention and underutilized spaces that can be activated. The interaction principles identified in this study can assist in optimizing overcrowded conditions, while underutilized areas may be suitable for static activities [
44]. Furthermore, when temporary obstacles need to be introduced, this model can simulate various scenarios to identify solutions that minimally disrupt the original walking conditions. Additionally, the model facilitates an analysis of unsuitable application scenarios, enabling the identification of methods to address issues by leveraging the interactions between environmental factors and collective behavior uncovered in this research.
In the future, the findings can be expanded in several ways. First, due to the limited availability of pedestrian walking data in urban pedestrian-only spaces and the challenges in extracting walking trajectories, this model has only been validated in a commercial environment. If methods for extracting walking trajectories improve or if pedestrian behavior data from other environments become available, the parameters can be adjusted to more accurately describe pedestrian density distribution in various walking environments. Second, this study aims to develop a model for predicting walking behavior without considering the function of the space, focusing solely on geometric conditions. If future research can gather sufficient data from specific types of walking environments, additional factors—such as the distribution of facilities, environmental comfort, and user characteristics—can be integrated according to the space’s function, further enhancing the model. Third, in response to sudden crowds and severe weather, the flexibility of this model enables the operational layer to be adjusted or replaced with corresponding models specifically designed to address these conditions.