Research on Check-In Baggage Flow Prediction for Airport Departure Passengers Based on Improved PSO-BP Neural Network Combination Model
Abstract
1. Introduction
2. The Current Research Status on Airport Baggage Flow
2.1. Baggage Flow
2.2. Traffic Prediction Algorithms
3. Baggage Flow Prediction Based on the PCC-PCA-PSO-BP Combination Model
3.1. Baggage Flow: Construction of the PCC-PCA-PSO-BP Combinatorial Model
3.2. The Principle of the PCC Algorithm
3.3. The Principle of the PCA Algorithm
- (a)
- Data standardization. In a specific complex system, the original dataset consisting of n samples influenced by m variables (referred to as influencing factors or variables in this article) is expressed as follows:
- (b)
- Determination of principal components. The standardized matrix contains all the information of the original dataset . The correlation coefficient matrix R is calculated using the standardized matrix, expressed as follows:
- (c)
- The eigenvalue λ of R and its corresponding feature vector μ are calculated. Among them, ; feature vectors .
- (d)
- The variance contribution rate and cumulative variance contribution rate of the principal components are calculated. The eigenvalues of the correlation coefficient matrix are equal to the variances of the corresponding principal components, explaining the proportion of the information in each principal component to the total information in the original dataset. The expressions are as follows:
- (e)
- Principal component selection. PCA aims to weaken the coupling between the input vectors of the BP neural network, remove redundant information, and fully retain the original data information. Generally, the number of selected principal components should not exceed six, and the cumulative variance contribution rate should be as large as possible (usually not less than 80%).
- (f)
- Determine input variables. The influencing factors are transformed into uncorrelated variables through linear transformation to reduce the data dimensionality. The variables after reduction can still reflect most information in the original dataset. The input variables for the BP neural network are set as . The principal components can be expressed as follows:
3.4. The Principle of the PSO Algorithm
3.5. The Principle of the BP Algorithm
3.6. The Specific Implementation Process for the Combined Model
- (a)
- Determine the main influencing factors of the checked baggage flow for airport departing passengers.
- (b)
- Data collection and preprocessing.
- (c)
- Determine whether data are missing. If there are data missing, calculate the data missing rate. It is generally believed that when the ratio of missing data exceeds 20%, the analytical value of the data will drop remarkably [23]. Otherwise, proceed to step (e).
- (d)
- If missing data exists, perform numerical interpolation. According to different interpolation methods, the errors between the interpolation values and true values differ. This article uses four interpolation methods (regression interpolation, EM interpolation, multiple interpolation, and mean interpolation) for comparison and selection. The root mean square error (RMSE) and mean absolute percentage error (MAPE) are employed as metrics for the evaluation of the interpolation’s effectiveness:
- (e)
- PCC is adopted in the correlation analysis of the original dataset or interpolated dataset to extract the core influencing factors of baggage flow, reduce the dimensionality of the input vector of the BP neural network, and effectively abate redundant input vector information.
- (f)
- PCA is conducted on the dataset to weaken the coupling between the input variables of the BP neural network, remove the correlation interference between input vectors, and further lower the dimensionality of the input vectors of the BP neural network while retaining sufficient original data information.
- (g)
- Determine the topology of the BP neural network. This article uses a three-layer BP neural network structure, including one input layer, one hidden layer, and one output layer. The number of neurons in the input layer is determined by the number of principal components; the number of neurons in the output layer is one, which is the checked baggage flow of departing passengers at airports; the number of neurons in the hidden layer is obtained from the empirical Formula (14):
- (h)
- Normalize the input vectors of the BP neural network to eliminate the effects of the dimensionality and order of magnitude between input vectors. The normalization formula is as follows:
- (i)
- Initialize the BP neural network and PSO algorithm, setting the parameters (number of swarms, learning factors, inertia weight, etc.) empirically. The particle dimensionality depends on the number of nodes in the BP network’s input, hidden, and output layers.
- (j)
- Calculate the fitness function value for each particle. By introducing weights and thresholds as particles into the PSO, the fitness function value is a key indicator of the particle quality. The minimum objective function can be derived by minimizing the fitness function. This paper uses the mean square error (MSE) of the BP neural network as the fitness function. The smaller the fitness function, the less the network error, and the better the adaptability of particles.
- (k)
- Determine the individual extremum and global extremum of particles. If the current particle fitness , update the current particle fitness; otherwise, remains unchanged. If the current particle fitness , update the current particle fitness; or else, keeps the same.
- (l)
- Update the velocity and position of each particle.
- (m)
- Assign weights and thresholds to the BP neural network based on the obtained global optimal solution.
- (n)
- Train the dataset using the BP neural network with optimal weights and thresholds and identify whether the training results meet the preset error. The mean absolute error (MAE) and (coefficient of determination) are used for judging the training effectiveness as follows:
- (o)
- Apply the constructed PCC-PCA-PSO-BP model to predict the flow of checked baggage of departing passengers at airports.
4. Case Study
4.1. Analysis of the Main Factors That Influence Baggage Flow
4.2. Data Collection and Preprocessing
4.3. PCC Is Conducted on the Dataset
4.4. PCA Is Conducted on the Dataset
4.5. The PCC-PCA-PSO-BP Model Is Used to Predict Baggage Flow
5. Conclusions
- Univariate missing data exist in the dataset of factors affecting baggage flow, which belongs to the completely missing at random pattern. Under low missing rates, compared with mean interpolation, regression interpolation, and EM interpolation, multiple interpolation presents superior numerical interpolation performance.
- Unlike the factors that affect the airport departure passenger flow, the total retail sales of consumer goods have a tenuous relationship with baggage flow, and the two variables do not exhibit a reciprocal relationship. The departure passenger flow and flight takeoff and landing sorties play a dominant role in baggage flow; the railway passenger flow, highway passenger flow, and months have significant impacts on baggage flow changes; and holiday and weekend factors also contribute to baggage flow changes.
- In terms of the performance of the baggage flow prediction, the combination PCC-PCA-PSO-BP model designed in this article is compared with four models: BP, PCA-BP, PSO-BP, and PCA-PSO-BP. PCC-PCA-PSO-BP achieves a faster convergence speed and higher accuracy, which can evidently improve on the shortcomings of traditional BP neural networks. This verifies the effectiveness and feasibility of the algorithm in predicting the baggage flow of departing passengers at airports.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Society International de Telecommunicatioan Aeronautiques. Baggage IT Insights in 2019. 2023. Available online: https://www.sita.aero/resources/surveys-reports/baggage-it-insights-2019/ (accessed on 3 April 2024).
- [R1] Society International de Telecommunicatioan Aeronautiques. Baggage IT Insights in 2024. 2024. Available online: https://www.sita.aero/resources/surveys-reports/sita-baggage-it-insights-2024/ (accessed on 15 November 2024).
- Cavada, J.P.; Cortes, C.E.; Rey, P.A. A Simulation Approach to Modelling Baggage Handling Systems at an International Airport. Simul. Model. Pract. Theory 2017, 75, 146–164. [Google Scholar] [CrossRef]
- Yfantis, E.A. An Intelligent Baggage-Tracking System for Airport Security. Eng. Appl. Artif. Intell. 1997, 10, 603–606. [Google Scholar] [CrossRef]
- Brunettal, L.; Romanin-Jacu, J.D.; San, N.A.S. Passenger and Baggage Flow in an Airport Terminal: A Flexible Simulation Model. J. Air Traffic Manag. 1999, 6, 361–363. [Google Scholar]
- Takakuwa, S.; Oyama, T. Modeling People Flow: Simulation Analysis of International-Departure Passenger Flows in an Airport Terminal. In Proceedings of the 35th Conference on Winter Simulation: Driving Innovation, New Orleans, LA, USA, 7–10 December 2003; pp. 1627–1634. [Google Scholar]
- Yang, Z.C. The Demand Forecasting for the Checked Baggage of the Departing Passengers the Airport Terminal. Master’s Thesis, Harbin University of Technology, Harbin, China, 2013. [Google Scholar]
- Li, Z.; Bi, J.; Zhang, J.; Li, Q. Analysis of Airport Departure Baggage Check-in Process Based on Passenger Behavior. In Proceedings of the 2017 10th International Symposium on Computational Intelligence and Design (ISCID), Hangzhou, China, 9–10 December 2017; pp. 204–207. [Google Scholar]
- Liu, X.; Li, L.; Liu, X.; Zhang, T.; Rong, X.; Yang, L.; Xiong, D. Field Investigation on Characteristics of Passenger Flow in a Chinese Hub Airport Terminal. Build. Environ. 2018, 133, 1536–1545. [Google Scholar] [CrossRef]
- Li, Z.Y. Forecast Research on the Demand for Checked Baggage of Departing Passengers at the Airport Terminal Based on Data Driven. Master’s Thesis, Beijing Jiaotong University, Beijing, China, 2018. [Google Scholar]
- Chandra, S.R.; Al-Deek, H. Cross-Correlation Analysis and Multivariate Prediction of Spatial Time Series of Freeway Traffic Speeds. Transp. Res. Rec. J. Transp. Res. Board 2008, 2089, 64–76. [Google Scholar] [CrossRef]
- Tsai, T.H.; Lee, C.K.; Wei, C.H. Neural Network Based Temporal Feature Models for Short-Term Railway Passenger Demand Forecasting. Expert Syst. Appl. 2009, 36, 3728–3736. [Google Scholar] [CrossRef]
- Liu, H.; Li, B.; Liu, C.; Zu, M.; Lin, M. Research on Yield Prediction Technology for Aerospace Engine Production Lines Based on Convolutional Neural Networks-Improved Support Vector Regression. Machines 2023, 11, 875–897. [Google Scholar] [CrossRef]
- Lou, J.; Li, W. Forecasting Model for the Scale of New-Built Airport Logistics Demand Based on the Back Propagation Artificial Neural Network. In Proceedings of the 2010 International Conference on E-Product E-Service and E-Entertainment, Henan, China, 7–9 November 2010; pp. 3021–3027. [Google Scholar]
- Filipovska, M.; Mahmassani, H.S. Traffic Flow Breakdown Prediction using Machine Learning Approaches. Transp. Res. Rec. J. Transp. Res. Board 2020, 2674, 560–570. [Google Scholar] [CrossRef]
- Lu, W.X.; Dai, Y.R.; Li, C.; Li, K.Q. Tourist Traffic Flow Forecasting Method Based on Improved PSO-BP Neural Network. J. Syst. Sci. Math. Sci. Chin. Ser. 2020, 40, 1407–1419. [Google Scholar]
- Hauke, J.; Kossowski, T. Comparison of Values of Pearson’s and Spearman’s Correlation Coefficients on the Same Sets of Date. Quaest. Geogr. 2011, 30, 87–93. [Google Scholar]
- Hinton, G.E.; Salakhutdinov, R. Reducing the Dimensionality of Data with Neural Networks. Science 2006, 313, 504–507. [Google Scholar] [CrossRef] [PubMed]
- Shi, Y.H.; Eberhart, R.C. Empirical Study of Particle Swarm Optimization. In Proceedings of the IEEE Congress on Evolutionary Computation, Washington, DC, USA, 12–17 May 2002; pp. 1945–1950. [Google Scholar]
- Li, Z.J.; Liu, X.D.; Duan, X.D.; Huang, F.X. Comparative Research on Particle Swarm Optimization and Genetic Algorithm. Comput. Inf. Sci. 2010, 3, 120–127. [Google Scholar] [CrossRef]
- Vlahogianni, E.I.; Karlaftis, M.G. Testing and Comparing Neural Network and Statistical Approaches for Predicting Transportation Time Series. Transp. Res. Rec. J. Transp. Res. Board 2013, 2399, 9–22. [Google Scholar] [CrossRef]
- Rumelhart, D.E.; Hinton, G.E.; Williams, R.J. Learning Representations by Back-Propagating Errors. Nature 1986, 232, 533–536. [Google Scholar] [CrossRef]
- Chen, X.B.; Chen, C.; Chen, L.; Wei, Z.J.; Cai, Y.F.; Zhou, J.J. Interpolation Method of Traffic Volume Missing Data Based on Improved Low-Rank Matrix Completion. J. Traffic Transp. Eng. 2019, 19, 180–190. [Google Scholar]
- Wai, H.K.T.; Hatice, O.B.; Andrew, G.; Hamish, G. Forecasting of Hong Kong Airport’s Passenger Throughput. Tour. Manag. 2014, 42, 62–76. [Google Scholar]
- Shu, Y.J. Research on the Forecast Method of Airport Passenger Throughput. Master’s Thesis, Nanjing University of Aeronautics and Astronautics, Nanjing, China, 2008. [Google Scholar]
- Silva, P.; Ribeiro, D.; Mendes, J.; Seabra, E.A.R.; Postolache, O. Railways Passengers Comfort Evaluation through Motion Parameters: A Systematic Review. Machines 2023, 11, 465–495. [Google Scholar] [CrossRef]
- Li, C.P. Analysis of the Influencing Factors of China’s Civil Aviation Passenger Volume. Sci. Technol. Ind. 2011, 11, 59–61. [Google Scholar]
- Zhang, Z.D.; Xu, J.H. An Analysis of Major Factors on Airport Passenger Volumes. Urban Transp. China 2007, 5, 54–57. [Google Scholar]
- Gao, W.; Xiao, X.M. Prediction of Airport Passenger Throughput Based on Entropy BP Neural Network. Comput. Simul. 2021, 38, 67. [Google Scholar]
- Wang, A. Research of Logistics and Regional Economic Growth. Ibusiness 2010, 2, 395–400. [Google Scholar] [CrossRef]
- Liu, S.; Wan, Y.; Ha, H.K. Impact of High-Speed Rail Network Development on Airport Traffic and Traffic Distribution: Evidence from China and Japan. Transp. Res. Part A Policy Pract. 2019, 127, 115–135. [Google Scholar] [CrossRef]
- Zuidberg, J. Exploring the Determinants for Airport Profitability: Traffic Characteristics, Low-Cost Carriers, Seasonality and Cost Efficiency. Transp. Res. Part A-Policy Pract. 2017, 101, 61–72. [Google Scholar] [CrossRef]
- Strand, S. Airport-Specific Traffic Forecasts: The Resultant of Local and Non-Local Forces. J. Transp. Geogr. 1999, 7, 17–29. [Google Scholar] [CrossRef]
- Deng, J.X.; Shan, L.B.; He, D.Q.; Tang, Y. Processing Method of Missing Data and Its Developing Tendency. Stat. Decis. 2019, 23, 28–34. [Google Scholar]
- Zhang, J.P.; Wang, F.Y.; Wang, K.F.; Ling, W.H.; Xu, X.; Chen, C. Data-Driven Intelligent Transportation Systems: A Survey. IEEE Trans. Intell. Transp. Syst. 2011, 12, 1624–1639. [Google Scholar] [CrossRef]
- Li, Y.B.; Li, Z.H.; Li, L. Missing Traffic Data: Comparison of Imputation Methods. IET Intell. Transp. Syst. 2014, 8, 51–57. [Google Scholar] [CrossRef]
- Donders, A.R.; Van-Der-Heijden, G.J.; Stijnen, T.; Review, K.G. A Gentle Introduction to Imputation of Missing Values. J. Clin. Epidemiol. 2006, 59, 1087–1091. [Google Scholar] [CrossRef]
- Qin, Y.; Rao, J.N.K.; Ren, Q. Confidence Intervals for Marginal Parameters under Fractional Linear Regression Imputation for Missing Data. J. Multivar. Anal. 2008, 99, 1232–1259. [Google Scholar] [CrossRef]
- Moon, T.K. The Expectation-Maximization Algorithm. Signal Process. Mag. IEEE 1996, 13, 47–60. [Google Scholar] [CrossRef]
- Wang, J.; Loong, B.; Westveld, A.H.; Welsh, A.H. A Copula-Based Imputation Model for Missing Data of Mixed Type in Multilevel Data Sets. arXiv 2017, arXiv:1702.08148. [Google Scholar]
- Yan, L.K. Application of Correlation Coefficient and Biased Correlation Coefficient in Related Analysis. J. Yunnan Financ. Trade Inst. 2003, 3, 78–80. [Google Scholar]
- Geoffrey, U.N.; Chiu, S.F.; Biona, J.B.M.; Lopez, N.S. Comparison of Driving Forces to Increasing Traffic Flow and Transport Emissions in Philippine Regions: A Spatial Decomposition Study. Sustainability 2021, 13, 6500. [Google Scholar] [CrossRef]
- Hong, J.; Chu, Z.F.; Wang, C.Q. Transport Infrastructure and Regional Economic Growth: Evidence from China. Transportation 2011, 38, 737–752. [Google Scholar] [CrossRef]
- Hakim, M.M.; Merkert, R. The Causal Relationship Between Air Transport and Economic Growth: Empirical Evidence from South Asia. J. Transp. Geogr. 2016, 56, 120–127. [Google Scholar] [CrossRef]
- Zhou, X.; Chen, T.; Qiu, T. BP Neural Network Forecast of Flight “Estimated Final Arrival Time” Based on Levenberg-Marquardt Algorithm Optimization. In Proceedings of the 2020 5th International Conference on Electromechanical Control Technology and Transportation (ICECTT), Nanchang, China, 15–17 May 2020; pp. 317–320. [Google Scholar]
Variables | Physicochemical Factors |
---|---|
X1 | Different months |
X2 | Departure passenger flow |
X3 | Flight takeoff and landing frequency |
X4 | Total retail sales of consumer goods |
X5 | Non-working days |
X6 | Railway passenger flow |
X7 | Highway passenger flow |
Variables | Physicochemical Factors | Number of Valid Data | Missing Rate (%) |
---|---|---|---|
X1 | Different months | 70 | 0 |
X2 | Departure passenger flow | 70 | 0 |
X3 | Flight takeoff and landing frequency | 70 | 0 |
X4 | Total retail sales of consumer goods | 70 | 0 |
X5 | Non-working days | 70 | 0 |
X6 | Railway passenger flow | 70 | 0 |
X7 | Highway passenger flow | 70 | 0 |
Y1 | Baggage flow | 59 | 15.71 |
Methods | Years | |||
---|---|---|---|---|
2019 | 2020 | |||
Missing Rate (%) | ||||
16.67 | 16.67 | |||
RMSE | MAPE | RMSE | MAPE | |
Mean imputation | 57,555.01 | 0.0825 | 25747.71 | 0.0367 |
Regression imputation | 56,797.71 | 0.0933 | 40,088.34 | 0.0490 |
Expectation maximization | 55,999.58 | 0.0743 | 25,747.71 | 0.0734 |
Multiple imputation | 37,891.81 | 0.0649 | 7442.74 | 0.0201 |
Variables | Variables | ||||||
---|---|---|---|---|---|---|---|
X1 | X2 | X3 | X4 | X5 | X6 | X7 | |
Coefficient (r) | −0.321 | 0.793 | 0.759 | −0.135 | 0.308 | 0.544 | 0.585 |
Significant difference (Sig.) | 0.019 | 0.000 | 0.000 | 0.333 | 0.025 | 0.000 | 0.000 |
Control Variables | Variables | Coefficient (r) | Significant Difference (Sig.) |
---|---|---|---|
X2 | Y1 & X4 | −0.024 | 0.844 |
X6 | |||
X7 |
Components | Eigenvalue (λ) | Proportion of the Initial Eigenvalue’ Variance (%) | Cumulative Contribution Rate (%) |
---|---|---|---|
1 | 2.631 | 43.856 | 43.856 |
2 | 1.266 | 21.105 | 64.961 |
3 | 1.029 | 17.142 | 82.103 |
4 | 0.749 | 12.484 | 94.587 |
5 | 0.311 | 5.178 | 99.765 |
6 | 0.014 | 0.235 | 100.000 |
Variables | Components | ||
---|---|---|---|
1 | 2 | 3 | |
X1 | 0.097 | −0.763 | 0.169 |
X2 | 0.970 | 0.064 | 0.063 |
X3 | 0.973 | 0.010 | 0.012 |
X5 | 0.133 | 0.292 | −0.868 |
X6 | 0.414 | 0.662 | 0.444 |
X7 | −0.738 | 0.395 | 0.214 |
Prediction Model | MAE (1 × 104) | R2 |
---|---|---|
BP | 7.0526 | 0.81057 |
PCA-BP | 5.2446 | 0.87192 |
PSO-BP | 5.8072 | 0.83231 |
PCA-PSO-BP | 4.8698 | 0.91228 |
PCC-PCA-PSO-BP | 4.3621 | 0.94633 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Jiang, B.; Zhang, J.; Fu, J.; Ding, G.; Zhang, Y. Research on Check-In Baggage Flow Prediction for Airport Departure Passengers Based on Improved PSO-BP Neural Network Combination Model. Aerospace 2024, 11, 953. https://doi.org/10.3390/aerospace11110953
Jiang B, Zhang J, Fu J, Ding G, Zhang Y. Research on Check-In Baggage Flow Prediction for Airport Departure Passengers Based on Improved PSO-BP Neural Network Combination Model. Aerospace. 2024; 11(11):953. https://doi.org/10.3390/aerospace11110953
Chicago/Turabian StyleJiang, Bo, Jian Zhang, Jianlin Fu, Guofu Ding, and Yong Zhang. 2024. "Research on Check-In Baggage Flow Prediction for Airport Departure Passengers Based on Improved PSO-BP Neural Network Combination Model" Aerospace 11, no. 11: 953. https://doi.org/10.3390/aerospace11110953
APA StyleJiang, B., Zhang, J., Fu, J., Ding, G., & Zhang, Y. (2024). Research on Check-In Baggage Flow Prediction for Airport Departure Passengers Based on Improved PSO-BP Neural Network Combination Model. Aerospace, 11(11), 953. https://doi.org/10.3390/aerospace11110953