A Novel Approach for Spatially Controllable High-Frequency Forecasts of Park Visitation Integrating Attention-Based Deep Learning Methods and Location-Based Services

Lu, Ming; Xie, Qian

doi:10.3390/ijgi12030098

Open AccessArticle

A Novel Approach for Spatially Controllable High-Frequency Forecasts of Park Visitation Integrating Attention-Based Deep Learning Methods and Location-Based Services

by

Ming Lu

^† and

Qian Xie

^*,†

School of Architecture, Harbin Institute of Technology; Key Laboratory of Cold Region Urban and Rural Human Settlement Environment Science and Technology, Ministry of Industry and Information Technology, Harbin 150006, China

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

ISPRS Int. J. Geo-Inf. 2023, 12(3), 98; https://doi.org/10.3390/ijgi12030098

Submission received: 11 December 2022 / Revised: 21 February 2023 / Accepted: 24 February 2023 / Published: 26 February 2023

Download

Browse Figures

Versions Notes

Abstract

Forecasting tourism volume can provide helpful information support for decision-making in managing tourist attractions. However, existing studies have focused on the long-term and large-scale prediction and scarcely considered high-frequency and micro-scale ones. In addition, the current approaches are limited regarding forecasting the visitor volume of a designated sub-area in a tourist attraction. This sub-area forecast can assist local-scaled managing decisions of tourist attractions, particularly for large-scale tourist attractions. Therefore, to achieve high-frequency forecasts of tourist volume for finer scale areas such as parks and their sub-areas and generate more controllable and flexible forecasts, this study developed a novel method that incorporates a forecasting model composed of multiple deep learning components and a designed control mechanism. The control mechanism produces high-temporal-resolution sequences of tourist volume for designated sub-areas, and the forecasting model is built on an attention-based deep-bidirectional neural network to better capture the long-range dependencies of the sequence and enhance the forecasting accuracy and robustness. The experimental research was performed at Taiyangdao Park and its two designated sub-areas to validate the effectiveness and superiority of the proposed method compared to other widely used deep-learning methods; three types of performance evaluations were adopted including fitting methods, error measures, and Diebold–Mariano tests. The results demonstrated that the proposed method provided outstanding performance in high-frequency forecasts and yielded more desired forecasting outcomes than other widely used forecasting methods. Furthermore, the comparison with the performances of various other deep learning models provide insights concerning their forecasting capacity; for instance, bidirectional RNN models tend to achieve better forecasts than general RNN models in the high-frequency forecasts. The proposed method has significant practical applicability in aiding short-term micro-scale management decisions and can also serve as an alternative approach in the field of tourist volume forecasting.

Keywords:

park visitation; tourist volume forecasting; high-frequency forecasting; micro-scale forecasting; spatially controllable forecasting; deep learning; location-based services; attention mechanism

1. Introduction

Tourist volume forecasting is an emerging research area in tourism management and tourism geography. With the development and application of AI algorithms and techniques in tourism volume forecasting, forecast accuracy has been significantly enhanced. Meanwhile, its practical implications in informing tourism-related decision-making have gained significant recognition by many researchers and practitioners [1]. Concretely, the practical value of accurate tourism volume forecasting for tourism attractions is embodied in aiding three aspects of tourism-related decisions, which are management decisions, e.g., crowd control, staffing, and scheduling arrangement in the tourist attraction [2,3]; economic decisions, e.g., formulating pricing strategies [3,4]; planning decisions, e.g., subsequent infrastructure planning for the tourist attraction [5]. The temporal characteristics can be highly differentiated according to the type of decision. For instance, planning decisions such as infrastructure plans can last for years, in contrast, management decisions about crowd control and staffing arrangement usually occur intra-day. Therefore, to better support related decision-making, the temporal characteristics of tourist volume forecast should be in accordance with the decision to be made.

Tourism volume forecasts can be divided into long-term, medium-term, and short-term from the perspective of the forecasting horizon. Long-term forecasts have a relatively long-time span of prediction that can be equal to or more than a year [6,7,8]; medium-term forecasts have a shorter forecasting horizon compared to the long-term, approximately six-months or a quarter [9,10]; the horizon of short-term forecasts can be as short as one month [11,12]. In terms of temporal resolution, the most common resolution for tourism volume forecasting is monthly [8,13], weekly [14,15,16,17], and daily [9,11,12].

Existing studies of tourist volume forecasts focused on long- or medium-term predictions, capturing the long-running trend of the tourist volume variation. These studies are crucial for understanding the long-term tourist behavior and supporting strategic decision-making. However, such approaches are not competent for extracting high-temporal-resolution information over a short period to advise short-run decision-making. Although a few researchers have realized the importance of short-term forecasts, and studies on such predictions have emerged [18,19], short-term forecasts, particularly the ones that can reflect intraday fluctuations, are still considerably scarce in the field of tourism volume forecasts. The importance of understanding the time-of-day fluctuation of tourist volume has been emphasized by UNESCO World Heritage Sustainable Tourism Toolkit (http://whc.unesco.org/sustainabletourismtoolkit (accessed on 4 May 2021)), which holds that the information about such fluctuations can help enhance the quality of tourist experience and mitigate pressure, etc. Therefore, the temporal resolution of tourist volume forecasting needs to be improved to guide some short-term management operations. In particular, the prediction time granularity must be increased to a higher level than daily to reflect the intraday variation of tourist volume. In this study, tourist volume forecasts with frequencies that are finer than hourly are denoted as high-frequency (or high-temporal-resolution) forecasts. Such forecasts can provide more details regarding upcoming visits to a tourist attraction, i.e., the variation of tourist volume during a minimal period, and allow the forecasting horizon to be as short as a single day, providing crucial information support to advise short-term decision making.

Spatial scale is another critical aspect related to tourist volume forecasting, where forecasts of a more specific area at a micro scale can provide more valuable information in managerial decisions such as crowd control. However, most related studies have focused on forecasting at a national [20], provincial [21], and urban scale [13]. While studies on site scale are mostly focused on large-sized tourist attractions such as Jiuzhaigou (650.7 km²) [10] and Huangshan Mountain Area (160.6 km²) [11]. These forecasts for the entire tourist attraction can certainly contribute to estimating the visitation variation of the entire site. However, actual managerial decisions such as crowd control are more dependent on specific forecasting information of finer local areas, particularly, the forecasting of crowd-prone sub-areas, which can provide well-targeted support to local crowd management.

In general, most related studies rarely consider high-frequency and small-scale forecasts, and the existing approaches are not competent for achieving such predictions. In particular, forecasting for a micro-scale area such as a sub-park area can provide valuable information for local management decisions such as local crowd control, accident prevention, and staff arrangement. Therefore, to address this gap, we proposed a novel method for such forecasts that incorporates a proposed control mechanism for the forecasting scope and a forecasting model built of multiple deep-learning components. We designed the control mechanism for actualizing tourist volume forecasts of designated sub-area in the park making the forecast more controllable and flexible. The forecasting model receives a high-frequency sequence of tourist volume generated from the control mechanism and completes the forecasting. The model comprises an attention mechanism and a deep bidirectional GRU to identify relevant features of high-temporal-resolution tourist volumes and improve the model’s capability of learning long-term dependency in the tourist volume sequence.

The remainder of this article is structured as follows. Section 2 (literature review) reviews the related works. Section 3 (materials and methods) introduces the proposed control mechanism and forecasting model for high-frequency forecast of tourist volume. Section 4 (experiment) introduces the experimental procedures, study areas, data preparation, model setting, and performance evaluations. Section 5 (results and discussion) analyzes and discusses the experimental results. Finally, the conclusion section summarizes the study and discusses its contributions and practical implementations.

2. Literature Review

2.1. Approaches for Tourist Volume Forecasting

Tourism volume forecasting methods are commonly divided into time-series, econometrics, and artificial intelligence models [22]. The time-series models generate predictions based on past values (e.g., ARIMA and its variants, etc.,) and the econometrics analysis models explore the causal relationship between tourist volume and relevant factors (e.g., ADL, VAR, TVP and PDR, etc.,). As of now, the time-series and econometrics models are dominantly used in the field of tourist volume forecasting [23], and many of the forecasts achieved desired outcomes [24,25,26,27,28,29]. However, a limitation of these models is that they are sensitive to the stability of past time-series patterns and economic structures, thus they are sometimes challenging to model a complicated non-linear variation in tourist volume data [18,22].

AI models are the computing techniques that simulate the intelligence process of the human brain [30,31,32], and the models such as ANN and support vector regression (SVR) have been applied to tourist volume forecasting. Many empirical results show that AI models show better performance compared to traditional time-series and econometrics models [33,34,35]; however, there is still evidence demonstrating that ARIMA families can outperform some AI techniques (i.e., ANN) in many cases [36,37]. Another AI method, SVR, is the regression version of support vector machine. Its effectiveness in capturing the non-linear trend of time-series data has been proved by many studies [8,38]. However, SVR is prone to overfitting and unqualified for training based on a large dataset. Moreover, the method pertains to shallow machine-learning techniques, which depend on manual operations for learning rather than automatically discovering the regularity in information from data [39,40].

Owing to the rapid development of AI, several prior research about tourist volume forecasts have been done based on deep learning approaches [9,11,13,41]. The deep learning models contain deep neural networks, which have a more complicated structure with more hidden layers than conventional (shallow) neural networks. Therefore, they can simulate complex non-linear variations and extract useful features from data with minimal manual intervention while learning.

Particularly, deep learning techniques that are mainly used for sequence data (e.g., tourist volume) forecasting such as recurrent neural network (RNN) and long short-term memory neural network (LSTM NN) have been proposed for the field of tourist volume forecasting. For example, Law et al. (2019) predicted monthly tourist arrival volume in Macau based on LSTM and attention mechanism [13]. Bi, Liu, and Li (2020) applied LSTM to forecast the daily tourist volume of Jiuzhaigou scenic spot and Huangshan Mountain Area [11]. Kulshrestha, Krishnaswamy, and Sharma (2020) used bidirectional LSTM to forecast the quarterly volume of tourist arrival in Singapore [41]. Deep learning methods always show a better forecast performance than the traditional modeling approaches, and owing to its effectiveness and adaptability, the usage of deep learning has become a new trend in tourist volume forecasting.

2.2. Data for Tourist Volume Forecasting

As this study is an AI-based tourist volume forecasting, this subsection will focus on reviewing the studies about forecasting data from an AI modeling perspective. Most AI-based tourist volume forecasts are data-driven approaches, and temporal characteristics of the forecasts (e.g., temporal resolution of the forecasting) are mostly dictated by data. The data for AI-based tourist volume forecasting can be divided into tourist volume data and auxiliary data, referred to as the time-series data of tourist volume and the data that corresponds with correlated variables to tourist volume (e.g., web search data), respectively.

The most common tourist volume data used by the related studies are from official statistics that are recorded and released by the related statistics departments of scenic spot or the government, which is usually collected from the official website [9,11], official publications [12] or directly provided by the official agency [10,42]. The frequency of tourist volume data from official statistics mainly ranges from daily to monthly. Majority of the studies use monthly data, and generally, the corresponding forecasting horizon is equal to or longer than a year. Hitherto, a day is recognized as the highest temporal resolution for the official statistics data, and a month is considered the minimum forecasting horizon based on such data; web search data are the most commonly used auxiliary data provided by Baidu index or Google trends. The correlation between travel information search and tourist behavior has been mentioned in many studies [43,44], and incorporating such data into training dataset helps enhance the forecasting performance [2,7,16]. However, daily is also the upper frequency limit for web search data. Additionally, whether there is a correlation between web search and ultra-short-term (e.g., less than a day) tourist flow has not been verified. Therefore, at least to date, the conventional dataset combined with the official statistics and web search data is incapable of conducting the high-temporal-resolution forecast for tourist volume.

In terms of the application of high-frequency forecasting, two types of data with high temporal granularity have been applied in related fields such as crowd forecasting and vehicle volume forecasting, which are mobile location data and traffic sensor data. The application of such data can serve as a reference to high-frequency tourism forecasting.

Mobile location data (or mobile signaling data) becomes available owing to the GNSS positioning techniques and wide usage of mobile Internet. The data have recently been applied to many short-term crowd forecasts. For example, Wang et al. (2019) forecasted crowd flow based on cellular data with a temporal resolution of 30 min [45]; Cecaj, Lippi, Mamer and Zambonelli (2020) predicted crowd distribution using aggregated mobile data with a frequency of 15 min [46]; Fan, Song, Shibasaki, and Adachi (2015) used an anonymous GPS data with time granularity of 5 min to forecast crowd movement [47]. Traffic sensor data are mostly applied to vehicle volume forecasting, and according to the related studies, such data can achieve high temporal resolution of 5 min [48,49]. However, the data collection strongly relies on tremendous sensors such as inductive-loop traffic detector, and similar sensors for crowd monitoring or pedestrian counting has not been pervasively equipped in tourist attractions at present. However, with the rapid development of technologies regarding crowd monitoring and visual crowd analysis, such data availability in near future with high reliability and frequency for tourism forecasting is expected. Then, many spatiotemporal forecasting techniques such as Conv-LSTM [50], PredRNN [51], and traffic-flow forecasting approaches [52,53] can be introduced to promote the development of the tourist flow forecasting field.

3. Materials and Methods

3.1. Control Mechanism for Forecasting Scope Based on Location-Based Services

Location-based services (LBS) data are a type of mobile location data that records real-time geographical location of mobile Internet users through wireless communication network or GNSS. In this process, each smartphone device is considered as a mobile sensor reflecting its user’s location information. This allows the collection of a large amount of individual location data in real-time. The advantages of LBS data are, it is timely, objective, massive in volume, and more importantly, offers a finer spatial and temporal resolution compared to the traditional data such as the official statistics data. According to the studies that adopted such data to conduct high-frequency crowd forecasting, temporal resolutions of the data can reach minute level [46,47].

Based on the aforementioned characteristics of LBS data, this study proposed a control mechanism to manipulate the spatial scope of the forecasts. Specifically, after delimiting the forecasting sub-area in park, the mechanism is able to produce high-frequency tourist-volume sequence of the designated area, thereby supporting the subsequent forecasts. The control mechanism illustration is shown in Figure 1. The mechanism consists of the following steps. First, at each time note (t₁, t₂, t₃…), the mechanism iterates over all the LBS data, based on which it creates LBS-based matrices and then aligns these matrices to the rasterized park area. After delimiting the spatial scope for forecasts in the park, the mechanism extracts corresponding data from the matrices and converts it to tourist volumes. This process is repeated at each time note to produce a time series sequence of tourist volume for the designated area, which is utilized as training data for deep-learning forecasting techniques. This approach can be regarded as a transfer tool that effectively connects LBS data to forecasting models to achieve maneuverable scope control for forecasts, which is able to contrapuntally assist crowd control of the local area in park.

3.2. The Proposed Deep Learning Model for High-Frequency Forecasting

3.2.1. The Neural Network Architecture

The proposed network, referred to as attention-based deep bidirectional gated recurrent unit (Att-DBGRU), integrates multiple deep-learning components to form a deeper and more effective model for the high-frequency tourist-volume forecasts. The two main components adopted in the network were deep bidirectional GRU (DBGRU) and attention mechanism. The former is composed of bidirectional GRUs (BGRUs), which can learn the tourist volume sequence data from both past and future directions to better model the sequential dependency, and its superiority compared to unidirectional RNN based models have been tested by related studies [41]. Thus, we used multiple BGRUs and stacked them to a deeper DBGRU network to further improve its accuracy and robustness. The latter enables the network to differentiate feature importance of high-frequency tourist-volume sequence and enhance the performance on capturing its long-range dependencies.

More concretely, Att-DBGRU is composed of four layer types: BGRU, attention, dropout, and fully connected layers (Figure 2). BGRUs can fully learn features from high-temporal-resolution tourist volume. The information is received from the previous BGRU layer to yield outputs, which are then transmitted to the next BGRU layer as inputs. The process is repeated in the network to fully extract valuable information from the tourist volume. Moreover, the attention layer focuses on important information in the tourist-volume sequence; dropout layers are added to the network to alleviate over-fitting, which is always considered a serious problem in machine learning and deep learning [54]. The layers randomly discarded a specific portion of the neurons in a deep learning layer to regularize and enhance the robustness of the deep network; a fully connected layer is disposed to reshape the outputting information and accomplish the modeling process of the deep network.

According to the characteristics of high-temporal-resolution forecast for tourist volume, the proposed forecasting models are many-to-one models, which use multiple inputs to yield a single output. The input features contained historical tourist volume and time-series variables correlated to tourist attraction visits, such as weather and vacation variables, and the predicted tourist volume was the only output feature.

3.2.2. Deep Bidirectional Gated Recurrent Units

The forecasting model involved multiple bidirectional RNNs to manage high-temporal-resolution sequence of tourist volume, the network proposed by Schuster and Paliwal (1997) [55], which had been applied to many time-series forecasts and always yielded outstanding performance [56,57]. With the same or similar setup parameters, a bidirectional network achieved more desired results than a unidirectional network in many fields [58].

Different from the typical unidirectional RNNs, bidirectional network consists of two separated hidden RNN layers (the forward and backward layer) in its architecture, and each of these layers is connected to an input and an output layer, respectively. The bidirectional architecture allows its network to learn the tourist volume sequence from both past and future directions. A forward and a backward layer in the network read input sequence x

(x_{1}, x_{2}, \dots x_{t} \dots, x_{n - 1}, x_{n})

from two opposite directions, in which

x^{f o r w a r d} = (x_{1}, x_{2}, \dots x_{t} \dots, x_{n - 1}, x_{n})

and

x^{b a c k w a r d} = (x_{n}, x_{n - 1}, \dots x_{t} \dots, x_{2}, x_{1})

, then obtain a forward hidden state

{\vec{h}}_{t} ({\vec{h}}_{1}, {\vec{h}}_{2}, \dots, {\vec{h}}_{n - 1}, {\vec{h}}_{n})

and a backward hidden state

{\overset{\leftarrow}{h}}_{t} ({\overset{\leftarrow}{h}}_{1}, {\overset{\leftarrow}{h}}_{2}, \dots, {\overset{\leftarrow}{h}}_{n - 1}, {\overset{\leftarrow}{h}}_{n})

, respectively (see Equations (1) and (2)). Subsequently, forward and backward sequences are concentrated and generate the final output sequence y

(y_{1}, y_{2}, \dots y_{t} \dots, y_{n - 1}, y_{n})

, which is calculated using Equation (3).

{\vec{h}}_{t} = f (W_{x \vec{h}} \cdot x_{t} + W_{\vec{h} \vec{h}} \cdot {\vec{h}}_{t - 1} + b_{\vec{h}})

(1)

{\overset{\leftarrow}{h}}_{t} = f (W_{x \overset{\leftarrow}{h}} \cdot x_{t} + W_{\overset{\leftarrow}{h} \overset{\leftarrow}{h}} \cdot {\overset{\leftarrow}{h}}_{t + 1} + b_{\overset{\leftarrow}{h}})

(2)

y_{t} = W_{y \vec{h}} \cdot {\vec{h}}_{t} + W_{y \overset{\leftarrow}{h}} \cdot {\overset{\leftarrow}{h}}_{t} + b_{y}

(3)

where

{\vec{h}}_{t}, {\overset{\leftarrow}{h}}_{t}

, are forward and backward propagation, respectively; f (

\cdot

) denotes a non-linear activation function (i.e., sigmoid function);

W_{x \vec{h}}

,

W_{\vec{h} \vec{h}}

,

W_{x \overset{\leftarrow}{h}}

,

W_{\overset{\leftarrow}{h} \overset{\leftarrow}{h}}

,

W_{y \vec{h}}

, and

W_{y \overset{\leftarrow}{h}}

represent the corresponding weight coefficients and

b_{\vec{h}}

,

b_{\overset{\leftarrow}{h}}

, and

b_{y}

represent corresponding bias vectors.

GRU cells were adopted to add to the aforementioned bidirectional network, which is known as an enhanced version of RNN cell and a variant of LSTM cell. Even GRU was rarely applied to tourist volume prediction, it achieved the same desired forecast effect as LSTM in many other time-series forecasts [59]. GRU simplifies the gating mechanism from three LSTM gates—forget, input, and output—to two gates—reset gate and update gate—to reduce the calculation cost. A typical GRU cell has two gates: reset gate and update gate. Reset gate determines what information the current step can access from

h_{t - 1}

and

x_{t}

, which is calculated using Equation (4). Subsequently, a candidate vector,

{\tilde{h}}_{t}

, is produced by tanh function, during which outputs of reset gate,

r_{t}

, only influences

h_{t - 1}

(see Equation (5)). Furthermore, update gate is used to control the influence of the previous state,

h_{t - 1}

, and candidate vector,

{\tilde{h}}_{t}

, on state vector,

h_{t}

, calculated using Equations (6) and (7).

r_{t} = σ (W_{r} \cdot [h_{t - 1}, x_{t}] + b_{r})

(4)

{\tilde{h}}_{t} = t a n h (W_{h} \cdot [r_{t} ⨀ h_{t - 1}, x_{t}] + b_{h})

(5)

u_{t} = σ (W_{u} \cdot [h_{t - 1}, x_{t}] + b_{u})

(6)

h_{t} = (1 - u_{t}) ⨀ h_{t - 1} + u_{t} ⨀ {\tilde{h}}_{t}

(7)

where

r_{t}

and

u_{t}

represent the output of reset gate and update gate, respectively;

h_{t - 1}

and

h_{t}

represent previous cell state and current cell state, namely;

{\tilde{h}}_{t}

denotes a candidate activation vector;

W_{r}

,

W_{h}

, and

W_{u}

represent weight matrixes, and

b_{r}

,

b_{h}

, and

b_{u}

represent bias vectors; ⨀ denotes the Hadamard product.

3.2.3. Attention Mechanism in the Network

The attention mechanism mimics the way the human brain processes information and concentrates only on the important parts of the information while ignoring the irrelevant parts. We utilized the mechanism to highlight the key parts of time-series sequence of high-frequency tourist volume. Specifically, our attention mechanism was added after the DBGRU layers to assign probability weights to the output vectors from the last DBGRU layer (Figure 2). Thus, the neural network can focus more on the important content of the vector to yield highly effective forecasts. This study used Luong’s multiplicative attention to construct our custom attention layer, and the calculating processes are listed as follows (see Equations (8)–(11)).

a_{t} = \frac{\exp (s c o r e (h_{t}, {\bar{h}}_{s}))}{\sum_{s^{'} = 1} \exp (s c o r e (h_{t}, {\bar{h}}_{s'}))}

(8)

s c o r e (h_{t}, {\bar{h}}_{s}) = h_{t}^{T} W_{a} {\bar{h}}_{s}

(9)

c_{t} = \sum_{s = 1}^{T} a_{t} {\bar{h}}_{s}

(10)

{\tilde{h}}_{a t t} = \tan h (W_{c} [c_{t}; h_{t}])

(11)

where

h_{t}

and

{\bar{h}}_{s}

denote current hidden state and source hidden states, respectively;

a_{t}

represents attention weight;

c_{t}

represents context vector;

{\tilde{h}}_{a t t}

denotes attention vector.

4. Experiment

4.1. Overview

The objective of the experiment is to verify the effectiveness of the proposed method. Specifically, we delimit three forecasting areas and produce their high-frequency tourist volumes based on the proposed scope control mechanism. We then use Att-DBGRU model to forecast a 3-day tourist volume of the designated areas in a temporal resolution of 30 min, as we believe this forecasting provides sufficient information about the short-term high-frequency variation of the tourist volumes for the park managers to conduct crowd control decision-making. Finally, the performance of the proposed method is evaluated based on multiple measurements, which is then compared with other forecasting techniques. More specific procedures are illustrated in Figure 3.

4.2. Study Area

Taiyangdao Scenic Spot is located in Harbin—the capital of Heilongjiang province, China. It has been given a 5A rating—the highest level in China’s tourist attraction rating system according to the Ministry of Culture and Tourism of the People’s Republic of China (https://zwfw.mct.gov.cn/scenicspot/ (accessed on 10 May 2021)). This study focuses on one of its well-known sub-area, the Taiyangdao Park. The three forecasting areas are the entire Taiyangdao Park and its two sub-regions (Area 1 and Area 2). Area 1 and Area 2 are crowd-prone areas that are always highly congested during the peak hours of the day, therefore short-term forecasts of the areas help practically in making crowd control decisions.

The spatial layout of Taiyangdao Park is presented in Figure 4. The area in yellow represents the entrance to the park (Area 1), which has four entrance gates (represented by the red dots). Furthermore, the area in red represents the Taiyang lake area (Area 2), which is also known as the central area. It contains the main recreational facilities such as the Taiyang lake (the largest lake in the park), amusement park, and central square. Figure 4 also contains aerial photographs which provide more information on these areas.

4.3. Data Preparation and Model Setting

China’s mobile Internet users have reached 1.319 billion by the end of 2019 according to the 2020 China Mobile Internet Development Report. With the widespread use of mobile Internet in China and advanced mobile positioning techniques, mobile location data such as LBS data can reflect real-time variations in tourist volume in Chinese tourist attractions, and this becomes highly valuable in applying relevant time-series or spatio-temporal forecasting. This study procured LBS data from the Tencent Company—one of the well-known Internet service companies with the largest user base in China. Tencent LBS provides various big data-related products, such as the platform of Tencent location big data (https://heat.qq.com/bigdata/index.html (accessed on 3 March 2021)). The regional heatmap (https://heat.qq.com/heatmap.php (accessed on 3 March 2021)) is one of the services offered by Tencent location big data, allowing the collection of high-resolution location data of the tourists in Taiyangdao Park. We collected data in a 30-day interval (from 21 September 2020 at 5:00 AM to 21 October 2020 at 4:30 AM) to set up a dataset for each of the study area (i.e., Taiyangdao park, Area 1 and Area 2). Each dataset had a sample size of 1440 and a 30-min temporal resolution.

Many studies note that variables related to weather and vacation influence the travel behavior [60,61], and adding these features into inputs of model training can enhance the accuracy of tourist volume prediction [62,63]. Hence, we considered these variables as parts of our input features. First, we collected the weather data from Worldweatheronline (https://www.worldweatheronline.com/ (accessed on 11 March 2021)); the data show that there is no very extreme weather condition in our study area during the period, and primary weather types are sunny, cloudy, and rainy. We converted the nominal weather variable to dummy variables by binarization. As tourist activities in Taiyangdao park mainly occur outdoors and can be more impacted by rainy weather, we used different dummy variables to represent rainy and others, where 0 represents rainy and 1 represents sunny and cloudy. Moreover, the nominal vacation variable were also converted to a binary value where 1 represented vacation days and 0 represented the rest. In addition, we split the total samples to set up the training and test dataset. The former was used for model training, and the latter was prepared to evaluate forecast performance. The data for the last three days was used to set up the test dataset with a size of 144, and the previous data were used for model training.

Our dataset contained multi-dimensional variables, including tourist volume, weather, and vacation. Therefore, the data was normalized before being sent into the deep learning models. We used the min-max normalization to normalize the original data, transforming the value of features to dimensionless values within [0,1] range to reduce the sensitivity of the model to the scale of features. A de-normalization operation was also conducted after the training to inverse transform all the normalized data to its original scale.

With regards to the forecasting models, this study along with verifying the effectiveness of the proposed Att-DBGRU validates the superiority of Att-DBGRU compared to other commonly used deep-learning models in this high-frequency forecasting, which is also considered an objective of this study. The benchmark models adopted are divided into two general-RNN based models, (general-RNN model) based on typical RNN network, and bidirectional-RNN based models (bi-RNN models) based on bidirectional RNN networks. The general-RNN includes some widely used model in tourist volume forecasting and tourist demand forecasting field (such as LSTM), the bi-RNN models we adopted are DBGRU and DBLSTM, the latter were also used in other studies of tourist demand forecasting [41].

For all the general- and bi-RNN models, the activation function is used a rectified linear unit (ReLU)—this function can reduce the probability of vanishing gradient and generate sparse networks to prevent overfitting [64]. Additionally, we employed a regularization for each hidden layer to reduce the overfitting based on a dropout method. The dropout rate was set to 0.1, meaning that 10% of the neurons were randomly discarded. Furthermore, we adopted adaptive moment estimation (Adam) as the optimization method, which has been widely applied in many related studies [65,66,67]. The predicting experiments and the training were executed using Python 3.9, Anaconda, TensorFlow, and PyTorch.

4.4. Performance Evaluation

This study adopted the following three measures to evaluate the performance of the forecasting models: fitting methods, error measures, and Diebold–Mariano tests. The fitting methods including the fitting curves and scatters can visualize the forecasting results and real observations of tourist volume, which directly show the fitting effects of different periods; the error measures include the root means square error (RMSE), the mean absolute error (MAE), and the coefficient of determination (R²). Their mathematical formulations are presented in Equations (12)–(14).

RMSE = \sqrt{\frac{1}{N} \cdot \sum_{i = 1}^{N} {(y_{i} - {\hat{y}}_{i})}^{2}}

(12)

MAE = \frac{1}{N} \cdot \sum_{i = 1}^{N} | y_{i} - {\hat{y}}_{i} |

(13)

R^{2} = \frac{\sum_{i = 1}^{N} (y_{i} - \bar{y}) ({\hat{y}}_{i} - \bar{Y})}{\sqrt{\sum_{i = 1}^{N} {(y_{i} - \bar{y})}^{2}} \sqrt{\sum_{i = 1}^{N} {({\hat{y}}_{i} - \bar{Y})}^{2}}}

(14)

where N represents the value of the sample size;

y_{i}

and

{\hat{y}}_{i}

denote observed tourist volume and forecasted tourist volume, respectively;

\bar{y}

and

\bar{Y}

represent the mean of

y_{i}

and

{\hat{y}}_{i}

, respectively. The smaller the values of RMSE and MAE are, the better the performance of the forecasting model is. The closer the value of R² to 1 is, the more accurate the forecast.

Diebold–Mariano test (D-M test) is adopted to validate if the performances of predictions are significantly different, which can also imply the difference of model capacity in such high-frequency forecasting for the park and its sub areas.

5. Results and Discussion

5.1. Fitting Effects of the Forecasting Techniques

The empirical study was focused on Taiyangdao Park and its two designated sub-areas using the proposed Att-DBGRU method and other five deep learning methods—DBGRU, DBLSTM, GRU, LSTM, and RNN. Figure 5 demonstrates the fitting effects for a visual comparison between the forecasts based on the six models and the real observation. The figure illustrates the actual and forecasted tourist volume of the entire Taiyangdao Park on the last three days of the entire 30-day time horizon with a temporal resolution of 30 min, where the black line represents the actual 144 observations of tourist volume.

In the forecast results for the entire Taiyangdao Park, Att-DBGRU generated ideal fitting effects. Its forecasting curve fit the real observations well even in peaks and valleys where the curves of many other methods deviated from the actual values to some extent. The scatter of the Att-DBGRU also illustrates that its forecasts fit the real observation well in both high- and low-volume intervals. The good fitting effects indicate the superiority of Att-DBGRU to the high-frequency forecast of tourist volume of the park. Many other deep-learning methods also display the desired effects on fitting the actual observation during the period, particularly DBGRU and DBLSTM, which yielded well-fitted values in the low-volume interval; however, the values deviated slightly from the real observation line in the high-volume interval (800–1500). Generally, the bi-RNN models perform more accurate forecasts than the general-RNN models. The latter generated well-fitted values in the 200–600 interval but did not fit well in the high-volume interval and extremely low-volume (0–100) interval according to the scatter.

With regards to the forecasts at the sub areas in the park, the results for the two sub areas are illustrated in Figure 6. The results are comparable to the previous results, showing that Att-DBGRU makes accurate forecasts for both the sub areas, although its scatter in area 1 deviated slightly from the actual value in approximately 800–1000 interval, but the general results are more well-fitted than the result of other methods, particularly in the high-volume interval. This indicated the obvious advantages of Att-DBGRU in peak period forecasting of the tourist volumes compared to the other deep learning models. In terms of the Bi-RNN models, although the curves and scatters display relatively significant deviations than Att-DBGRU, the overall performances are still satisfying in the sub-areas forecasting tests and more precise than the performance generated by the general-RNN models. Particularly, in the area 2, the forecasts by bi-RNN models slightly deviated from the actual value, but the forecasts by some general-RNN models such as GRU and RNN shows more significant deviations compared to the bi-RNN models.

5.2. Results of the Performance Evaluations

Three performance measures—RMSE, MAE, and R2—were used to evaluate the performance of the forecasting models. Table 1 shows the results of these measures for the forecasts of Taiyangdao park. According to the table, Att-DBGRU made the best forecast with the lowest RMSE and MAE scores (62.289 and 44.482, respectively) with a highest R2 score of 0.970. This not only indicated the effectiveness of Att-DBGRU in such high-frequency forecasts, but it also outperformed the bi-RNN and general-RNN methods. Additionally, bi-RNN models outperformed general-RNN models, particularly DBGRU, which also achieved desired outcomes; although its result (the RMSE, MAR, and R2 score are 79.326, 57.003, and 0.939, respectively) is less accurate than Att-DBGRU, it is distinctly more accurate compared to DBLSTM and the general-RNN models. In terms of the general-RNN models, the RMSE, MAE, and R2 scores of GRU and LSTM are approximately equivalent to each other, and both demonstrated much higher performance accuracy compared to that of RNN.

The performances for Area 1 and Area 2 are shown in Table 2, which shows that Att-DBGRU also achieved ideal outcomes in forecasting the sub areas in the park, with the highest R2 score of 0.959 and 0.981, respectively. The lowest RMSE and MAE scores of Att-DBGRU also demonstrated it had the best result among all the models in both of the sub areas. This illustrated the desired forecasting capacity of the proposed method for such high-frequency forecasts of tourist volumes for the designated sub-areas in the park. Similar to the results of the evaluation in the entire Taiyangdao Park, the bi-RNN models provide better predicting performance than the general-RNN models based on the three evaluation measures, and the performances of the two bi-RNN models are approximately comparable with each other. In addition, GRU and LSTM have minimal forecasting errors among the general-RNN models in Area 1 and Area 2, respectively, and both of them outperform RNN.

5.3. Comparison of Forecasting Methods with Diebold–Mariano Test

This study used D-M test to validate each model with every other model in turn, to compare if one significantly outperforms the other in the high-frequency forecasting. The utilized D-M test is based on MAE and MSE error statistic, respectively (Table 3). A negative DM value denotes the corresponding model tends to outperform the benchmark model, and vice versa if the DM value is positive.

According to the results of the D-M tests, the proposed Att-DBGRU outperforms all the other models at 5% significance level in the two sub areas, which indicates its significant superiority in high-frequency forecast of the sub areas compared to other deep-learning methods. However, in the entire Taiyangdao Park, although Att-DBGRU significantly outperforms DBLSTM and the general-RNN models at the 10% or 5% significance level, the superiority of the method to DBGRU is not significant. This implies DBGRU model also has satisfactory capacity in such forecasting and can be considered an alternative to Att-DBGRU in some cases.

In terms of comparison of the other five models, DBGRU outperforms almost all the general-models at 10% or 5% significance level, and DBLSTM also performs significantly accurate forecasts compared to the general-RNN models in most cases, the only exception occurs in Area 1, both the MAD- and MSE-based D-M tests show that the performance difference between DBLSTM and GRU is not significant, which also implies that forecasting model can be differentiated depending on the region that is being forecasted. Generally, bi-RNN models provide better performance than general-RNN models, which demonstrates the former have significant superiority in such high-frequency forecasts of park visitation compared to the frequently used general-RNN models (such as LSTM) by the related works.

5.4. Discussion

Most related studies focus on improving the forecasting model. Earlier, they adopt linear methods (i.e., ARIMA and its variants) [20,29] and econometrics methods [12], and then shallow learning methods [8,38]. Recently, deep learning models have achieved outstanding performances in many tourist volume forecasting [9,11,13]. The application of more advanced model largely enhances the forecasting accuracy, which provides vast contribution to the field. However, improving the controllability and flexibility of the forecasting method is of equal importance, which is neglected all along. Particularly, the control of the forecasting scope can make the forecasting more focalized, and can provide the forecasting information of tourist volume at a finer sub-site level.

An example of how the results of spatially controllable forecasts can contribute to the actual crowd management is provided to emphasize its practicality. Based on the forecasting information at sub-site scale, we can show density comparisons between the entire site and its two sub-areas (Figure 7). According to the hourly illustration of forecasted crowd density, we can observe that the two sub-areas experience a much higher density than the entire site during a specific period of the day (Area 1 at 11:00, 12:00, and 15:00 and Area 2 at 11:00–15:00). This information regarding high density period can contribute to the short-term decision in preventing crowding at local scale, and this information cannot be reflected by forecast of the entire site. The highly specific information can support the discovery of the disparity in crowd density between the sub-regions of the park within a specific period and advise park management to shift their focus from one sub-region to another.

With regards to the forecasting model, this study also proposed a novel deep-learning model (Att-DBGRU) for the high-frequency forecasting, and compared its effectiveness with other deep learning models used by related works. According to [9,11,13,68,69], deep learning techniques always outperform the shallow learning methods and linear methods in the forecasting. Therefore, we adopted only the deep learning models as our benchmark models. Notably, the deep learning algorithms used in this study were categorized into general-RNN and bi-RNN models, to allow a comparison between different models. These models are commonly used by other tourist volume forecasts, such as LSTM [9,11] and BLSTM [41]. The results of this study validated the significant forecasting capacity and superiority of the proposed Att-DBGRU in high-frequency forecasts, and demonstrated that such method could be a new forecasting approach of tourist volume. Furthermore, the findings provide more evidence in the differences between widely used deep learning methods in such high-frequency forecasting performance. Generally, bi-RNN models outperformed the general-RNN models in forecasting, and such result is consistent with the works from Kulshrestha, Krishnaswamy, and Sharma (2020) [41], who conducted a long-term tourism demand forecasting based on bidirectional LSTM.

6. Conclusions

This study proposed a novel method based on a control mechanism and a forecasting model named Att-DBGRU to achieve spatially controllable high-frequency forecasts of park visitation. To validate the effectiveness of the method in high-frequency forecasting and test the superiority of Att-DBGRU compared to other widely used deep learning methods, we experimentally evaluated the method considering Taiyangdao park and its two subareas. Moreover, we adopted fitting methods, error measures, and D-M tests to evaluate the forecasting performances.

The experimental results reveal that the proposed method could provide outstanding performance in high-frequency forecasts of tourist volume for the three designated areas, indicating that this method is suitable for implementation for the forecast of the park and its sub-areas. Furthermore, the proposed Att-DBGRU model significantly outperformed other commonly used deep learning methods in tourist volume forecasting. Moreover, the findings also involved more evidence about the forecasting accuracy of other deep-learning techniques, for example, bi-RNN models outperformed unidirectional general-RNN models in such high-frequency forecasting. Particularly DBGRU, which also obtained satisfying outcomes in the forecasting experiments, and can be an alternative model for such high-temporal-resolution forecasts. In addition, GRU demonstrated the most accurate prediction among those of the general-RNN models and outperformed LSTM, the most commonly used deep learning model in the tourist volume forecasting field.

The study’s contributions to the field of tourist volume forecasting are: First, it achieved the forecast for designated sub-areas in the park, and the design of the control mechanism of the proposed method provided a more controllable and flexible forecast, actualizing the forecast at a finer spatial scale. Second, the study performed high-frequency forecasts of tourist volume, with a temporal resolution up to a minute-level. Third, an advanced deep learn model (Att-DBGRU) was proposed, which significantly outperformed the widely used bi-RNN and general-RNN models in high-frequency forecasting.

The practical implications of this study are as follows: First, the high-frequency forecast predictions can contribute to multiple short-term decision-making related to tourist attraction management, such as intraday crowd control, staffing arrangement, and accident prevention. Second, regarding the spatial scale of the forecast, forecasting only for an entire tourist attraction provides limited support in operational management. Therefore, the proposed method further focused on the sub-areas and their forecasting information, which is of practical use in local crowd control.

Author Contributions

Conceptualization, Qian Xie and Ming Lu; methodology, Qian Xie; software, Qian Xie; validation, Qian Xie; formal analysis, Qian Xie; writing—riginal draft, Qian Xie; writing—review and editing, Qian Xie; visualization, Qian Xie; supervision, Ming Lu; project administration, Ming Lu. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Not applicable.

Acknowledgments

This study was supported by the National Natural Science Foundation of China (Grant no. 52078160).

Conflicts of Interest

The authors declare no conflict of interest.

References

Palmer, A.; Montaño, J.; Sesé, A. Designing an artificial neural network for forecasting tourism time series. Tour. Manag. 2006, 27, 781–790. [Google Scholar] [CrossRef]
Li, S.; Chen, T.; Wang, L.; Ming, C. Effective tourist volume forecasting supported by PCA and improved BPNN using Baidu index. Tour. Manag. 2018, 68, 116–126. [Google Scholar] [CrossRef]
Jiao, E.X.; Chen, J.L. Tourism forecasting: A review of methodological developments over the last decade. Tour. Econ. 2019, 25, 469–492. [Google Scholar] [CrossRef]
Divino, J.A.; McAleer, M. Modelling and forecasting daily international mass tourism to Peru. Tour. Manag. 2010, 31, 846–854. [Google Scholar] [CrossRef]
Chu, F.L. Forecasting tourism demand with ARMA-based methods. Tour. Manag. 2008, 30, 740–751. [Google Scholar] [CrossRef]
Wen, L.; Liu, C.; Song, H. Forecasting tourism demand using search query data: A hybrid modelling approach. Tour. Econ. 2019, 25, 309–329. [Google Scholar] [CrossRef]
Li, X.; Pan, B.; Law, R.; Huang, X. Forecasting tourism demand with composite search index. Tour. Manag. 2017, 59, 57–66. [Google Scholar] [CrossRef]
Sun, S.; Wei, Y.; Tsui, K.L.; Wang, S. Forecasting tourist arrivals with machine learning and internet search index. Tour. Manag. 2019, 70, 1–10. [Google Scholar] [CrossRef]
Zhang, B.; Li, N.; Shi, F.; Law, R. A deep learning approach for daily tourist flow forecasting with consumer search data. Asia Pac. J. Tour. Res. 2020, 25, 323–339. [Google Scholar] [CrossRef]
Lu, W.; Jin, J.; Wang, B.; Li, K.; Liang, C.; Dong, J.; Zhao, S. Intelligence in Tourist Destinations Management: Improved Attention-based Gated Recurrent Unit Model for Accurate Tourist Flow Forecasting. Sustainability 2020, 12, 1–20. [Google Scholar]
Bi, J.-W.; Liu, Y.; Li, H. Daily tourism volume forecasting for tourist attractions. Ann. Touris. Res. 2020, 83, 102923. [Google Scholar] [CrossRef]
Huang, X.; Zhang, L.; Ding, Y. The Baidu Index: Uses in predicting tourism flows –A case study of the Forbidden City. Tour. Manag. 2017, 58, 301–306. [Google Scholar] [CrossRef]
Law, R.; Li, G.; Fong, D.K.C.; Han, X. Tourism demand forecasting: A deep learning approach. Ann. Touris. Res. 2019, 75, 410–423. [Google Scholar] [CrossRef]
Pan, B.; Wu, D.C.; Song, H. Forecasting hotel room demand using search engine data. J. Hosp. Tour. Technol. 2012, 3, 196–210. [Google Scholar] [CrossRef]
Pan, B.; Yang, Y. Forecasting destination weekly hotel occupancy with big data. J. Travel Res. 2017, 56, 957–970. [Google Scholar] [CrossRef]
Bangwayo-Skeete, P.F.; Skeete, R.W. Can Google data improve the forecasting performance of tourist arrivals? Mixed-data sampling approach. Tour. Manag. 2015, 46, 454–464. [Google Scholar] [CrossRef]
Yang, Y.; Pan, B.; Song, H. Predicting hotel demand using destination marketing organization’s Network traffic data. J. Travel Res. 2014, 53, 433–447. [Google Scholar] [CrossRef]
Chen, R.; Liang, C.Y.; Hong, W.C.; Gu, D.X. Forecasting holiday daily tourist flow based on seasonal support vector regression with adaptive genetic algorithm. Appl. Soft Comput. 2015, 26, 435–443. [Google Scholar] [CrossRef]
Li, K.; Liang, C.; Lu, W.; Li, C.; Zhao, S.; Wang, B. Forecasting of Short-Term Daily Tourist Flow Based on Seasonal Clustering Method and PSO-LSSVM. ISPRS Int. Geo-Inf. 2020, 9, 676. [Google Scholar] [CrossRef]
Park, S.; Lee, J.; Song, W. Short-term forecasting of Japanese tourist inflow to South Korea using Google trends data. J. Travel Tour. Mark. 2017, 34, 357–368. [Google Scholar] [CrossRef]
Zhang, B.; Huang, X.; Li, N.; Law, R. A novel hybrid model for tourist volume forecasting incorporating search engine data. Asia Pac. J. Tour. Res. 2017, 22, 245–254. [Google Scholar] [CrossRef]
Song, H.; Li, G. Tourism demand modelling and forecasting. A review of recent research. Tour. Manag. 2008, 29, 203–220. [Google Scholar] [CrossRef]
Li, X.; Law, R.; Xie, G.; Wang, S. Review of tourism forecasting research with internet data. Tour. Manag. 2021, 83, 104245. [Google Scholar] [CrossRef]
Lee, C.K.; Song, H.J.; Mjelde, J.W. The forecasting of International Expo tourism using quantitative and qualitative techniques. Tour. Manag. 2008, 29, 1084–1098. [Google Scholar] [CrossRef] [PubMed]
Smeral, E.; Wüger, M. Does complexity matter? Methods for improving forecasting accuracy in tourism: The case of Australia. J. Travel Res. 2005, 44, 100–110. [Google Scholar] [CrossRef]
Assaf, A.G.; Li, G.; Song, H.; Tsionas, M.G. Modeling and forecasting regional tourism demand using the Bayesian Global Vector Autoregressive (BGVAR) model. J. Travel Res. 2018, 58, 383–397. [Google Scholar] [CrossRef]
Dogru, T.; Sirakaya-Turk, E.; Crouch, G.I. Remodeling international tourism demand: Old theory and new evidence. Tour. Manag. 2017, 60, 47–55. [Google Scholar] [CrossRef]
Smeral, E. Seasonal Forecasting Performance Considering Varying Income Elasticities in Tourism Demand. Tour. Econ. 2019, 25, 355–374. [Google Scholar] [CrossRef]
Andrew, W.P.; Cranage, D.A.; Lee, C.K. Forecasting hotel occupancy rates with time series models: An empirical analysis. J. Hosp. Tour. Res. 1990, 14, 173–182. [Google Scholar] [CrossRef]
Law, R. Back-propagation learning in improving the accuracy of neural network-based tourism demand forecasting. Tour. Manag. 2000, 21, 331–340. [Google Scholar] [CrossRef]
Law, R.; Au, N. A neural network model to forecast Japanese demand for travel to Hong Kong. Tour. Manag. 1999, 20, 89–97. [Google Scholar] [CrossRef]
Uysal, M.; El Roubi, M.S. Artificial neural networks versus multiple regression in tourism demand analysis. J. Travel Res. 1999, 38, 111–118. [Google Scholar] [CrossRef]
Li, J.; Xu, L.; Tang, L.; Wang, S.; Li, L. Big data in tourism research: A literature review. Tour. Manag. 2018, 68, 301–323. [Google Scholar] [CrossRef]
Song, H.; Qiu, R.T.; Park, J. A review of research on tourism demand forecasting. Ann. Touris. Res. 2019, 75, 338–362. [Google Scholar] [CrossRef]
Kon, S.C.; Turner, W.L. Neural network forecasting of tourism demand. Tour. Econ. 2005, 11, 301–328. [Google Scholar] [CrossRef]
Volchek, K.; Liu, A.; Song, H.; Buhalis, D. Forecasting tourist arrivals at attractions: Search engine empowered methodologies. Tour. Econ. 2019, 25, 425–447. [Google Scholar] [CrossRef]
Claveria, O.; Torra, S. Forecasting tourism demand to Catalonia: Neural networks vs. time series models. Econ. Model. 2014, 36, 220–228. [Google Scholar] [CrossRef]
Pai, P.F.; Hung, K.C.; Lin, K.P. Tourism demand forecasting using novel hybrid system. Expert Syst. Appl. 2014, 41, 3691–3702. [Google Scholar] [CrossRef]
Zhang, B.; Pu, Y.; Wang, Y.; Li, J. Forecasting hotel accommodation demand based on LSTM model incorporating Internet search index. Sustainability 2019, 11, 4708. [Google Scholar] [CrossRef]
Luo, X.; Li, D.; Yang, Y.; Zhang, S. Spatiotemporal Traffic Flow Prediction with KNN and LSTM. J. Adv. Transp. 2019, 4145353. [Google Scholar] [CrossRef]
Kulshrestha, A.; Krishnaswamy, V.; Sharma, M. Bayesian BILSTM approach for tourism demand forecasting. Ann. Touris. Res. 2020, 83, 1–19. [Google Scholar] [CrossRef]
Li, K.; Lu, W.; Liang, C.; Wang, B. Intelligence in tourism management: A hybrid FOA-BP method on daily tourism demand forecasting with web search data. Mathematics 2019, 7, 531. [Google Scholar] [CrossRef]
Xiang, Z.; Pan, B. Travel queries on cities in the United States: Implications for search engine marketing for tourist destinations. Tour. Manag. 2011, 32, 88–97. [Google Scholar] [CrossRef]
Fesenmaier, D.R.; Xiang, Z.; Pan, B.; Law, R. A framework of search engine use for travel planning. J. Travel Res. 2011, 50, 587–601. [Google Scholar] [CrossRef]
Wang, X.; Zhou, Z.; Zhao, Y.; Zhang, X.; Xing, K.; Xiao, F.; Yang, Z.; Liu, Y. Improving Urban Crowd Flow Prediction on Flexible Region Partition. IEEE Trans. Mob. Comput. 2019, 19, 2804–2817. [Google Scholar] [CrossRef]
Cecaj, A.; Lippi, M.; Mamei, M.; Zambonelli, F. Comparing Deep Learning and Statistical Methods in Forecasting Crowd Distribution from Aggregated Mobile Phone Data. Appl. Sci.-Basel 2020, 10, 6580. [Google Scholar] [CrossRef]
Fan, Z.; Song, X.; Shibasaki, R.; Adachi, R. City Momentum: An online approach for crowd behavior prediction at a citywide level. In Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing, Osaka, Japan, 7–11 September 2015; pp. 559–569. [Google Scholar]
Zhao, Z.; Chen, W.; Wu, X.; Chen, P.C.Y.; Liu, J. LSTM network: A deep learning approach for short-term traffic forecast. IET Intell. Transp. Syst. 2017, 11, 68–75. [Google Scholar] [CrossRef]
Cui, Z.; Ke, R.; Pu, Z.; Wang, Y. Stacked Bidirectional and Unidirectional LSTM Recurrent Neural Network for Forecasting Network-wide Traffic State with missing values. Transp. Res. Part C Emerg. 2020, 118, 102674. [Google Scholar] [CrossRef]
Shi, X.; Chen, Z.; Wang, H.; Yeung, D.Y.; Wong, W.K.; Woo, W. Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting. Adv. Neural Inf. Process. Syst. 2015, 28, 802–810. [Google Scholar]
Wang, Y.; Long, M.; Wang, J.; Gao, Z.; Yu, P.S. PredRNN: Recurrent Neural Networks for Predictive Learning using Spatiotemporal LSTMs. Adv. Neural Inf. Process. Syst. 2017, 1, 879–888. [Google Scholar]
Zhang, C.; Zhou, H.-Y.; Qiu, Q.; Jian, Z.; Zhu, D.; Cheng, C.; He, L.; Liu, G.; Wen, X.; Hu, R. Augmented Multi-Component Recurrent Graph Convolutional Network for Traffic Flow Forecasting. ISPRS Int. J. Geo-Inf. 2022, 11, 88. [Google Scholar] [CrossRef]
Zhao, J.; Chen, C.; Liao, C.; Huang, H.; Ma, J.; Pu, H.; Luo, J.; Zhu, T.; Wang, S. 2F-TP: Learning Flexible Spatiotemporal Dependency for Flexible Traffic Prediction. IEEE Trans. Intell. Transp. Syst. 2022. [CrossRef]
Wielgosz, M.; Skoczeń, A.; Mertik, M. Using lstm recurrent neural networks for monitoring the LHC superconducting magnets. Nucl. Instrum. Methods Phys. Res. Sect. A-Accel. Spectrom. Dect. Assoc. Equip. 2017, 867, 40–50. [Google Scholar] [CrossRef]
Schuster, M.; Paliwal, K.K. Bidirectional recurrent neural networks. IEEE Trans. Signal Process. 1997, 45, 2673–2681. [Google Scholar] [CrossRef]
Zhang, B.; Zhang, H.; Zhao, G.; Lian, J. Constructing a PM2.5 concentration prediction model by combining auto-encoder with Bi-LSTM neural networks. Environ. Model. Softw. 2020, 124, 104600. [Google Scholar] [CrossRef]
Shahid, F.; Zameer, A.; Muneeb, M. Predictions for COVID-19 with deep learning models of LSTM, GRU and Bi-LSTM. Chaos Soliton Fract. 2020, 140, 110212. [Google Scholar] [CrossRef]
Graves, A.; Schmidhuber, J. Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Networks 2005, 18, 602–610. [Google Scholar] [CrossRef]
Ravanelli, M.; Brakel, P.; Omologo, M.; Bengio, Y. Light gated recurrent units for speech recognition. IEEE Trans. Emerg. Topics Comput. 2018, 2, 92–102. [Google Scholar] [CrossRef]
Becken, S. Measuring the effect of weather on tourism: A destination-and activity-based analysis. J. Travel Res. 2013, 52, 156–167. [Google Scholar] [CrossRef]
Gössling, S.; Scott, D.; Hall, C.M.; Ceron, J.P.; Dubois, G. Consumer behavior and demand response of tourists to climate change. Ann. Touris. Res. 2012, 39, 36–58. [Google Scholar] [CrossRef]
Mourad, L.; Qi, H.; Shen, Y.; Yin, B. ASTIR: Spatio-Temporal Data Mining for Crowd Flow Prediction. IEEE Access 2019, 7, 175159–175165. [Google Scholar] [CrossRef]
Yuan, X.; Han, J.; Wang, X.; He, Y.; Xu, W.; Zhang, K. A Novel Learning Approach for Citywide Crowd Flow Prediction. In Proceedings of the 2019 Computing, Communications and IoT Applications (ComComAp), Shenzhen, China, 26–28 October 2019; IEEE: New York, NY, USA, 2009; pp. 341–346. [Google Scholar]
Singh, U.; Determe, J.-F.; Horlin, F.; Doncker, P.D. Crowd Forecasting Based on WiFi Sensors and LSTM Neural Networks. IEEE Trans. Instrum. Meas. 2020, 69, 6121–6131. [Google Scholar] [CrossRef]
Ali, A.; Zhu, Y.; Zakarya, M. A data aggregation based approach to exploit dynamic spatio-temporal correlations for citywide crowd flows prediction in fog computing. Multimed. Tools Appl. 2021, 80, 31401–31433. [Google Scholar] [CrossRef]
Li, H.; Wang, Y.; Xu, X.; Qin, L.; Zhang, H. Short-term passenger flow prediction under passenger flow control using a dynamic radial basis function network. Appl. Soft Comput. 2019, 83, 105620. [Google Scholar] [CrossRef]
Wu, Y.; Yuan, M.; Dong, S.; Lin, L.; Liu, Y. Remaining useful life estimation of engineered systems using vanilla LSTM neural networks. Neurocomputing 2018, 275, 167–179. [Google Scholar] [CrossRef]
Lu, H.; Zhang, J.; Xu, Z.; Shi, R.; Wang, J.; Xu, S. Prediction of tourist flow based on multi-source traffic data in scenic spot. Trans. GIS 2020, 25, 1082–1103. [Google Scholar] [CrossRef]
Peng, L.; Wang, L.; Ai, X.-Y.; Zeng, Y.-R. Forecasting Tourist Arrivals via Random Forest and Long Short-term Memory. Cogn. Comput. 2020, 13, 125–138. [Google Scholar] [CrossRef]

Figure 1. Illustration of the control mechanism for forecasting scope.

Figure 2. Structural illustration of the proposed Att-DBGRU architecture.

Figure 3. Illustration of the experimental procedures.

Figure 4. Spatial layout and aerial photographs of Taiyangdao Park.

Figure 5. Real and forecasted tourist volume of the entire Taiyangdao Park.

Figure 6. Real and forecasted tourist volumes of the two sub areas (Area 1 and Area 2).

Figure 7. Hourly crowd density of the three areas based on the forecast.

Table 1. Results of the performance evaluations (Taiyangdao Park).

Model		RMSE	MAE	R²
Name	Category of Network	RMSE	MAE	R²
RNN	Unidirectional	146.138	122.510	0.655
LSTM	Unidirectional	118.329	95.419	0.816
GRU	Unidirectional	118.395	93.851	0.839
DBLSTM	Bidirectional (Stacked)	102.478	72.101	0.896
DBGRU	Bidirectional (Stacked)	79.326	57.003	0.939
Att-DBGRU	Bidirectional (with attention)	62.289	44.482	0.970

Table 2. Results of the performance evaluations (Area 1 and Area 2).

	Model		RMSE	MAE	R²
	Name	Category of Network	RMSE	MAE	R²
Area 1	RNN	Unidirectional	94.887	71.858	0.714
	LSTM	Unidirectional	87.392	64.002	0.780
	GRU	Unidirectional	79.978	57.057	0.808
	DBLSTM	Bidirectional (Stacked)	72.413	49.038	0.897
	DBGRU	Bidirectional (Stacked)	66.724	45.475	0.908
	Att-DBGRU	Bidirectional (with attention)	45.583	34.008	0.959
Area 2	RNN	Unidirectional	39.929	30.804	0.824
	LSTM	Unidirectional	37.071	33.436	0.862
	GRU	Unidirectional	45.108	38.811	0.859
	DBLSTM	Bidirectional (Stacked)	30.440	23.608	0.929
	DBGRU	Bidirectional (Stacked)	28.881	19.354	0.926
	Att-DBGRU	Bidirectional (with attention)	16.622	9.761	0.981

Table 3. Results of Diebold–Mariano test.

Area	Model	DMCriteria	Benchmark Model
Area	Model	DMCriteria	DBGRU	DBLSTM	GRU	LSTM	RNN
	Att-DBGRU	(MAE)	−1.114	−1.881 *	−3.194 **	−3.628 **	−5.073 **
	Att-DBGRU	(MSE)	−1.118	−1.878 *	−2.560 **	−3.087 **	−3.400 **
	DBGRU	(MAE)		−2.672 **	−7.297 **	−4.462 **	−7.585 **
	DBGRU	(MSE)		−2.962 **	−4.060 **	−4.452 **	−4.309 **
Taiyangdao	DBLSTM	(MAE)			−3.867 **	−1.775 *	−5.513 **
	DBLSTM	(MSE)			−5.031 **	−1.825 *	−4.177 **
	GRU	(MAE)				−0.165	−3.756 **
	GRU	(MSE)				0.008	−2.871 **
	LSTM	(MAE)					−3.067 **
	LSTM	(MSE)					−2.610 **
	Att-DBGRU	(MAE)	−2.709 **	−2.724 **	−3.235 **	−4.200 **	−4.033 **
	Att-DBGRU	(MSE)	−2.936 **	−2.444 **	−2.976 **	−3.145 **	−2.966 **
	DBGRU	(MAE)		−1.320	−1.663 *	−3.720 **	−3.831 **
	DBGRU	(MSE)		−1.227	−1.864 *	−2.819 **	−2.613 **
Area 1	DBLSTM	(MAE)			−1.194	−3.319 **	−3.722 **
	DBLSTM	(MSE)			−1.405	−3.108 **	−2.854 **
	GRU	(MAE)				−1.642	−1.893*
	GRU	(MSE)				−1.780 *	−2.044 **
	LSTM	(MAE)					−1.685 *
	LSTM	(MSE)					−1.678 *
	Att-DBGRU	(MAE)	−3.565 **	−8.862 **	−6.526 **	−5.577 **	−7.648 **
	Att-DBGRU	(MSE)	−2.267 **	−3.501 **	−4.886 **	−5.610 **	−3.366 **
	DBGRU	(MAE)		−2.065 **	−2.972 **	−2.202 **	−3.462 **
	DBGRU	(MSE)		−0.525 *	−2.375 **	−1.441	−2.516 **
Area 2	DBLSTM	(MAE)			2.610 **	−1.910 *	−2.770 **
	DBLSTM	(MSE)			−2.225 **	−1.345	−2.623 **
	GRU	(MAE)				1.221	1.352
	GRU	(MSE)				1.532	0.686
	LSTM	(MAE)					0.523
	LSTM	(MSE)					−0.436

* and ** denotes the significance level of 10% and 5%, respectively.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Lu, M.; Xie, Q. A Novel Approach for Spatially Controllable High-Frequency Forecasts of Park Visitation Integrating Attention-Based Deep Learning Methods and Location-Based Services. ISPRS Int. J. Geo-Inf. 2023, 12, 98. https://doi.org/10.3390/ijgi12030098

AMA Style

Lu M, Xie Q. A Novel Approach for Spatially Controllable High-Frequency Forecasts of Park Visitation Integrating Attention-Based Deep Learning Methods and Location-Based Services. ISPRS International Journal of Geo-Information. 2023; 12(3):98. https://doi.org/10.3390/ijgi12030098

Chicago/Turabian Style

Lu, Ming, and Qian Xie. 2023. "A Novel Approach for Spatially Controllable High-Frequency Forecasts of Park Visitation Integrating Attention-Based Deep Learning Methods and Location-Based Services" ISPRS International Journal of Geo-Information 12, no. 3: 98. https://doi.org/10.3390/ijgi12030098

APA Style

Lu, M., & Xie, Q. (2023). A Novel Approach for Spatially Controllable High-Frequency Forecasts of Park Visitation Integrating Attention-Based Deep Learning Methods and Location-Based Services. ISPRS International Journal of Geo-Information, 12(3), 98. https://doi.org/10.3390/ijgi12030098

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Novel Approach for Spatially Controllable High-Frequency Forecasts of Park Visitation Integrating Attention-Based Deep Learning Methods and Location-Based Services

Abstract

1. Introduction

2. Literature Review

2.1. Approaches for Tourist Volume Forecasting

2.2. Data for Tourist Volume Forecasting

3. Materials and Methods

3.1. Control Mechanism for Forecasting Scope Based on Location-Based Services

3.2. The Proposed Deep Learning Model for High-Frequency Forecasting

3.2.1. The Neural Network Architecture

3.2.2. Deep Bidirectional Gated Recurrent Units

3.2.3. Attention Mechanism in the Network

4. Experiment

4.1. Overview

4.2. Study Area

4.3. Data Preparation and Model Setting

4.4. Performance Evaluation

5. Results and Discussion

5.1. Fitting Effects of the Forecasting Techniques

5.2. Results of the Performance Evaluations

5.3. Comparison of Forecasting Methods with Diebold–Mariano Test

5.4. Discussion

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI