1. Introduction
Bus passenger flow serves as the basis for bus operation management departments to formulate network planning and vehicle scheduling plans. By establishing the relationship between bus passenger flow and its influencing factors, and exploring the fluctuation patterns of randomness, time variation, and imbalance in passenger flow, an accurate prediction model of passenger flow fluctuations can be constructed. This enables reasonable forecasting of fluctuation trends and spatial distribution of passenger flow, thereby providing a theoretical foundation for developing bus operation scheduling plans. Meanwhile, it also facilitates passengers in planning travel routes and times more effectively, improves operational efficiency, and ensures the service level and quality of bus services.
Mei Z et al. [
1] pointed out that factors such as weather conditions, bus stop information, bus passenger flow change rate, and time and date have significant impacts on bus passenger flow. Chen D et al. [
2] established a nonlinear relationship between bus passenger flow and factors such as the number of bus stops, road network density, and the number of companies, revealing the influencing mechanisms of passenger flow changes. Zhu C et al. [
3] found that the land-use characteristics surrounding bus stops have a significant impact on bus passenger flow. Fang X et al. [
4] examined the correlation between bus IC card passenger flow attributes and travel time, and proposed a passenger flow forecasting method based on the classification of travel attributes. Chen E et al. [
5] analyzed the time-series characteristics of bus IC card passenger flow data and developed an ARIMA-based forecasting model. To further analyze the seasonal periodic patterns of bus passenger flow, Milenković M et al. [
6] proposed a Seasonal Autoregressive Integrated Moving Mean (SARIMA)-based bus passenger flow forecasting method. Cheng G et al. [
7] developed a Short Time Series Clustering–Seasonal Autoregressive Integrated Moving Mean (STSC-SARIMA) model for bus passenger flow forecasting. However, the above studies primarily fit linear regression relationships between bus passenger flow and influencing factors using historical data samples, without adequately addressing the spatiotemporal dynamics and nonlinear characteristics of bus passenger flow. This may result in the omission of certain spatiotemporal dynamic influencing factors and their relationships during parameter calibration, thereby reducing the accuracy and generalization capability of bus passenger flow forecasting results [
8].
To investigate the spatiotemporal dynamic complexity and nonlinear relationships between bus passenger flow and factors such as weather conditions, commercial land-use ratio, points of interest (POI), bus stop density, and characteristics of bus and road networks under large-scale data samples, and to explore the characteristic patterns of influencing factors and the spatiotemporal distribution of bus passenger flow to improve forecasting accuracy and robustness, Yu L et al. [
9] examined the impact of land-use characteristics on bus passenger flow and proposed a bus passenger flow forecasting model based on the least squares support vector machine. Xu X et al. [
10] developed a combined forecasting model integrating time series analysis and support vector machines for bus passenger flow. Li Y et al. [
11] proposed a bus passenger flow forecasting method based on the kernel extreme learning machine (KELM), which effectively improved the accuracy of forecasting results. Yazıcıoğlu C et al. [
12] introduced an LSTM-based bus passenger flow forecasting method, and the results showed that incorporating weather conditions helps enhance the accuracy and applicability of the forecasts. Zhang Y et al. [
13] compared the forecasting accuracy of existing machine learning models and found that bidirectional forecasting models, such as those based on BiLSTM and BiGRU, outperform unidirectional models in both accuracy and robustness.
To further integrate the spatiotemporal distribution features of the built environment, bus stop density, and bus and road networks, and to improve the accuracy and applicability of bus passenger flow forecasting, Cao B et al. [
14] proposed a hybrid forecasting method based on convolutional neural network–long short-term memory (CNN-LSTM) by incorporating factors such as commercial land-use ratio, points of interest (POI), and bus stop density. This method effectively extracts the spatiotemporal features of bus passenger flow, reduces model computational complexity, and achieves higher forecasting accuracy compared to LSTM networks. Baghbani A et al. [
15] developed a traffic-aware multi-step graph neural network (TMS-GNN)-based method for bus passenger flow forecasting, which considers bus stop connectivity and urban traffic operation characteristics to significantly improve both the accuracy and applicability of forecasts. Lv W et al. [
16] constructed an extreme gradient boosting (XGBoost) model for bus passenger flow forecasting by incorporating POI information. Yu J et al. [
17] proposed an interpretable bus passenger flow forecasting method based on CatBoost, which not only improves forecasting accuracy but also reveals the interaction mechanisms between influencing factors and forecast accuracy. These studies have effectively enhanced the accuracy and robustness of bus passenger flow forecasting by exploring the complex spatiotemporal dynamic relationships and distribution characteristics between bus passenger flow and influencing factors. However, they often rely on manually setting parameters for machine learning models, which can limit the precision and efficiency of parameter optimization.
To further improve the accuracy and efficiency of bus passenger flow forecasting, Nagaraj N et al. [
18] examined factors such as bus type, the locations of route terminals, passenger numbers, and headways, and proposed a forecasting method based on a greedy hierarchical algorithm combined with a long short-term memory (LSTM) neural network. Sun F et al. [
19] explored the spatiotemporal distribution patterns of bus passenger flow using IC card data and developed a forecasting model based on a nonlinear autoregressive network with exogenous inputs (NARX) optimized by a genetic algorithm. Li C et al. [
20], considering the volatility, nonlinearity, and periodicity of bus passenger flow, applied a clustering algorithm to segment passenger flow data and constructed a forecasting model based on a particle swarm optimization–support vector machine (PSO-SVM). Bus passenger flow forecasting models based on GWO-SVM [
21] and whale optimization-Attention-Bidirectional Gated Recurrent Unit (BiGRU) [
22] have also demonstrated high prediction accuracy and strong generalization capability, as intelligent optimization algorithms are used to fine-tune machine learning hyperparameters.
The whale optimization algorithm (WOA) in intelligent optimization exhibits advantages such as simple structure, few parameters, and high accuracy and convergence speed in optimization solving [
23], and has been widely applied in areas including path planning [
24] and machine learning parameter optimization [
25]. However, due to the use of an initialization strategy in the optimization process, WOA often suffers from uneven population distribution. Moreover, the linearly decreasing convergence factor during population position updating tends to cause an imbalance between global search and local exploitation, leading to problems such as being trapped in local optima, insufficient solution accuracy, and slow convergence speed. To address these issues, Prasad D [
26] and Duan Y et al. [
27] improved the population initialization of WOA by adopting Logistic and Circle chaotic mappings, thereby enhancing population diversity and quality and improving optimization speed and convergence accuracy. Similarly, Darvish Falehi A [
28] and Paul C et al. [
29] applied chaotic mappings to improve WOA population initialization, balancing the relationship between global search and local exploitation, enhancing optimization accuracy and speed, and strengthening WOA’s capability in solving complex engineering problems.
To address the issue that the Whale Optimization Algorithm (WOA) is prone to premature convergence to local optima, Li M [
30] and Chakraborty S et al. [
31] improved WOA by employing an elite opposition-based learning strategy. This approach enhanced population diversity and local exploration capability, increased the probability of feasible solutions approaching the optimal solution during the optimization process, and avoided premature convergence caused by limited exploration ability.
Regarding the problem that the linearly decreasing convergence factor in WOA cannot effectively balance the relationship between global search and local exploitation, thereby reducing solution accuracy and convergence speed, Anitha J [
32] and Yue Y et al. [
33] introduced cosine functions to improve convergence factors and inertia weights. This effectively balanced global and local search abilities, enhancing both solution accuracy and convergence speed. Chakraborty S et al. [
34] suggested that improving the coefficient vector in the position update mechanism could significantly enhance the exploration capability in the early stage and the exploitation capability in the later stage. Wang H [
35] and Yang W et al. [
36] used nonlinear functions and dynamic convergence factors to improve the linear convergence factor of WOA, strengthening global and local search capabilities while reducing the risks of premature convergence and insufficient accuracy. Ju C et al. [
37] integrated chaotic mapping, nonlinear convergence factors, and Cauchy mutation as a hybrid strategy to improve WOA. This increased population diversity and quality, effectively balanced global and local search abilities, and overcame the issues of insufficient accuracy, slow convergence, and susceptibility to local optima.
Existing studies mainly focus on examining the impacts of factors such as weather conditions, built environment, and the characteristics of bus and road networks on bus passenger flow, and on developing models aimed at improving the point-in-time forecasting accuracy of bus passenger flow, while neglecting its randomness, time variation, and imbalance. The use of intelligent optimization algorithms to tune machine learning parameters can avoid the limitations of manual parameter setting, which often results in insufficient accuracy and applicability of forecasting results.
Therefore, to address the problems of WOA being easily trapped in local optima, insufficient solution accuracy, and slow convergence speed, this study adopts a Logistic–Tent combined chaotic mapping to initialize the population of WOA, thereby increasing population diversity and quality; an elite opposition-based learning and Cauchy mutation hybrid mechanism is adopted to overcome premature convergence and the tendency of WOA to fall into local optima. Moreover, improved convergence factors and inertia weights are introduced to enhance the population position update mechanism, effectively balancing the relationship between global search and local exploitation, and improving both solution accuracy and convergence speed of WOA. Based on bus passenger flow data from Harbin, the study analyzes the correlations between passenger flow, weather conditions, and passenger attributes, applies fuzzy information granulation (FIG) to the passenger flow data, and employs the improved WOA (IWOA) to optimize BiGRU parameters, thereby developing a FIG-IWOA-BiGRU model for forecasting bus passenger flow fluctuation trends and spatial distribution. By considering the impacts of passenger travel randomness and uncertainty on bus passenger flow, the model extends forecasting results from point predictions to spatial predictions. Meanwhile, the use of a hybrid-strategy-improved WOA prevents the algorithm from falling into local optima and suffering from slow convergence, which could otherwise lead to poor BiGRU prediction performance and long computation times, thus addressing the limitations in bus passenger flow data forecasting applications.
The remainder of this paper is organized as follows:
Section 1 analyzes the fluctuation characteristics of bus passenger flow and its influencing factors.
Section 2 briefly introduces fuzzy information granulation and the whale optimization algorithm.
Section 3 presents in detail the strategies and functions of the improved whale optimization algorithm, and constructs the FIG-IWOA-BiGRU model for forecasting bus passenger flow fluctuation trends and spatial distribution.
Section 4 provides the results of numerical simulation experiments based on 21 benchmark test functions and discusses the advantages of the improved whale optimization algorithm compared with other metaheuristic algorithms. The results of the FIG-IWOA-BiGRU bus passenger flow fluctuation trend and spatial forecasting model are then presented, followed by a comparative analysis of the accuracy and applicability of the proposed model and existing models.
6. Conclusions
The main conclusions of this study are as follows:
(1) This paper applies a fuzzy information granulation method, which reduces the spatiotemporal complexity of bus passenger flow data while preserving its valid information, and constructs a FIG-IWOA-BiGRU-based model for forecasting bus passenger flow fluctuation trends and spatial distribution. The MAPE values of the forecasting results for the fuzzy-granulated LOW, R, and UP window data are 7.8%, 8.1%, and 8.5%, respectively. Compared with existing forecasting models, these values improve by 1.3–3.5%, 1.5–3.7%, and 1.7–4.0%, demonstrating good prediction accuracy and robustness. The findings contribute to the reasonable prediction of bus passenger flow fluctuation trends and spatial patterns, providing data support and decision-making references for bus operation scheduling plans and for passengers in planning travel routes and times.
(2) This paper improves the whale optimization algorithm (WOA) with hybrid strategies, including Logistic–Tent combined chaotic mapping, an elite opposition-based learning and Cauchy mutation hybrid mechanism, and modified convergence factor and inertia weight. These strategies enhance population diversity and quality while preserving the strengths of WOA, and improve the algorithm’s capability to balance global search and local exploitation. Comparative analysis of numerical simulation results and convergence characteristics on 21 benchmark test functions shows that IWOA outperforms WOA, GWO, PSO, and GA in solution accuracy, convergence speed, and the ability to escape local optima, further validating the effectiveness of the hybrid strategy-improved WOA proposed in this study.
In addition, due to the privacy of bus data, this study did not obtain operational data from other cities, which imposes certain limitations. In future research, the following aspects will be further explored:
(1) The main research content of this study is based on Harbin city’s bus IC card and weather condition data. Future research should further collect and organize bus IC card and weather data from other cities to comparatively analyze the impact of different built environments on the spatiotemporal distribution, fluctuation characteristics, and trends of bus passenger flow.
(2) Based on the analysis of bus passenger flow data in different urban regions, by considering modifications to the whale optimization algorithm’s search mechanism and algorithm structure and integrating the advantages of other intelligent optimization algorithms, more efficient intelligent optimization algorithms with improved search and convergence speed can be proposed, thereby enhancing the accuracy and applicability of models predicting bus passenger flow fluctuation trends and spatial distribution.