1. Introduction
Tourism plays an important role in building the economy, socio-culture, and image of many counties around the world, and it has a remarkable effect on global economic development [
1]. According to Statista’s global tourism industry report, in 2023, the global tourism revenue reached approximately USD 9.9 trillion [
2]. The contribution of the tourism industry led to an increase in a destination’s competitiveness in the long run [
3]. Therefore, improving tourism destination development and infrastructure may generate tourist volume and tourism demand, which subsequently require planning and policy making for predicting future economic development [
4]. Specifically, improvement in a touristic destination, such as transportation and hospitality services [
5], can optimize resource allocation and create pricing strategies.
The transportation system is recognized as a critical component of the tourism industry that influences touristic destinations. Railways, airlines, highways, and cruise ships, among others, are key to the accessibility of transportation systems [
6]. According to Wang, Niu, and Qian (2018) [
7], appropriate transportation can promote destination marketing and strategies that influence the attractiveness of touristic destinations. Moreover, different modes of transportation services offer diverse characteristics such as economy, speed, convenience, and safety, which leads to a variety of effects on tourism. In particular, railway transportation is considered essential to trade openness, which directly and indirectly affects tourism development in urban and station locations [
8]. In other words, traveling by train has a lower environmental effect and reduces emissions compared with other forms of transport. In addition, this mode of transport has been related to tourism since the nineteenth century and early years of the twentieth century, and it is a popular mode of transport for travelers [
9]. Thus, understanding railway transportation for tourism can assist in predicting sustainable performance parameters such as economic and environmental perspectives, which catalyze improvement in the quality of railways. This is important for the future and marketing perspectives.
Prior research demonstrates that traveling by train is an important form of transportation in Asian countries such as China, Taiwan, and Thailand [
9,
10]. In particular, Thailand is acknowledged as an emerging country, with a strong and growing service sector (50% of the GDP) mainly focused on tourism. Specifically, Thailand’s sector is projected to reach approximately USD 5.3 billion in 2025, accounting for a 33% year-on-year increase and contributing 9.4% to 11.6% national GDP growth from 2023 to 2033 [
11]. Therefore, the Thai government seeks to encourage the tourism sector by enhancing the performance of various transportation options, including railway transportation. Railway transportation is one of the most attractive modes for both general passengers and tourists because it is the most affordable and offers frequent trips between cities across the country. In Thailand, railway tourism has been developed to improve the touristic experience, relate travel to nostalgia, and provide a taste of history, such as the history of the Thai–Burma railway. This route represents a period of the Second World War and features a bridge over the river [
12]. In improving the experience of tourists, the government is especially interested in promoting train tourism programs throughout all regions. Specifically, developing infrastructure, including railway transportation for tourism, is one of the Thai government’s principal goals.
The purpose of developing and upgrading transportation infrastructure such as the State Railway of Thailand (SRT) is to increase the number of train tourists and revenue by seamlessly connecting the railway system with other modes of transport [
13]. The objective is to enhance the convenience of multiple-city sightseeing journeys for train tourists. However, to ensure the success of this initiative, it is essential to understand the trends in both passengers and tourists in each region and nationwide. One promising approach to this challenge is to forecast passenger trends by leveraging novel forecasting techniques such as machine learning (ML) or deep learning (DL). This introduces a new way to analyze tourist behavior for the SRT. Forecasting passenger trends could help the SRT to better understand tourist behavior and develop appropriate policies to enhance train tourism for both local and foreign passengers. The rest of this paper is divided into four sections.
Section 2 describes relevant theories, such as the concept of train tourism and various forecasting techniques. Additionally, this section identifies the research gap.
Section 3 presents the conceptual framework and provides a detailed description of research methodologies.
Section 4 analyzes and synthesizes the experimental results from the previous section, focusing on both forecasting and economic performance. Finally,
Section 5 summarizes the overall findings of this research and highlights potential future perspectives.
2. Literature Reviews
This section establishes three topics: the concept of train tourism, forecasting techniques, and relevant prior work. All details are described below.
2.1. Concept of Train Tourism
The State Railway of Thailand (SRT) has adopted a policy to encourage local and foreign passengers to travel on trains by implementing an attractive tourism program. The SRT aims for this vision “
To become logistics and connectivity platform to fulfill stakeholders’ expectations” [
13], which means that all stakeholders, including passengers, will be fulfilled in their travel expectations when traveling by train in Thailand. In addition, the SRT has a rail transport network that covers all regions of Thailand and connects with neighboring countries, such as Laos and Malaysia. The details above demonstrate that all tourists traveling by train will feel comfortable and enjoy their journeys.
However, to achieve success in train tourism, it is not enough to focus solely on establishing a seamless rail network across the country; it is equally important to understand passenger trends and behavior when using trains for travel. Some previous studies have proposed research methodologies for analyzing passenger behavior. First, one study [
14] proposed an economic/mathematical model to optimize rail route planning in a way that serves passenger demand and maximizes profit from tourism. The model considered all relevant risks that could occur on tourist routes. The results revealed that the proposed models could suggest appropriate tourist routes based on existing infrastructure. Developing tourism for economic diversification in regions not suited to this sector requires fundamental changes to previous ways of operating, including new approaches to business creation, capacity building, education and knowledge exchange, networking, and public/private interactions [
15]. Another study [
16] developed a novel model combining the Theory of Planned Behavior and quality factors. The model assessed the relationship between passengers’ service quality and their intention for future trips. The results illustrated that train service quality and travel quality significantly impact the intentions of high-speed railway tourists. These two studies are good examples of understanding and synthesizing passenger behavior using different tools.
Even though there have been some studies on train tourist behavior, few have focused on the impact of passenger trends on future tourism programs. Passenger trends can help the SRT propose appropriate tourism programs for each rail route. One interesting solution to capturing passenger trends during different periods is the implementation of forecasting techniques. The next section will describe, in more detail, the concept of relevant forecasting techniques and explain why forecasting is important for understanding passenger trends, specifically for train passengers.
2.2. Forecasting Techniques
This study focuses on time-series forecasting techniques, analyzing the pattern of historical monthly data from the SRT, which exhibits both linear and nonlinear trends, and seasonal variations during certain periods. Based on the performance of forecasting techniques in prior studies [
17,
18,
19], we are interested in developing hybrid forecasting models that combine traditional statistical techniques with machine learning techniques. We begin by providing more details about some traditional statistical techniques that are frequently used to forecast time-series data, followed by a description of the characteristics of the machine learning technique. For traditional statistical techniques, Double Moving Average (DMA), Double Exponential Smoothing (DES), and Holt–Winters Exponential Smoothing (ES3) are implemented. Additionally, long short-term memory (LSTM) is utilized as one of the most popular machine learning techniques for forecasting future demand.
2.2.1. Traditional Statistical Techniques
Double Moving Average (DMA)
This technique is similar to the Simple Moving Average (SMA) technique but involves calculating a second average using the first set of moving averages. The DMA technique performs well with linear trends in time-series data. The structure of the DMA technique is outlined in Equations (1)–(5) [
20].
where
K = considered period k;
Ft+n = forecast value at time t with n steps.
Double Exponential Smoothing (DES)
This technique is an extension of the Simple Exponential Smoothing (SES) technique. However, there are a few key differences between the two. SES is suitable for time-series data characterized by high uncertainty, meaning the data can undergo drastic changes within short periods. Conversely, the Double Exponential Smoothing (DES) technique is designed to handle both linear trends and uncertainty in time-series data, without accounting for seasonal patterns. Additionally, DES requires two smoothing constants, alpha (α) and beta (β). This technique is also commonly referred to as “Holt’s Linear method”. The structure of the DES technique is presented in Equations (6)–(9) [
21].
α = smoothing constant for Lt;
β = smoothing constant for St;
Ft+n = forecast value at time t with n steps.
Holt–Winters Trend and Seasonality (ES3)
This model is an extension of the DES technique, but its structure differs from those of other Exponential Smoothing (ES) techniques. Commonly known as Triple Exponential Smoothing (ES3), this technique incorporates three main parameters: level (α), trend (β), and seasonal (γ) [
22,
23]. This technique is particularly effective for univariate input factors [
23,
24]. Furthermore, the level and seasonal components can be modeled using either additive or multiplicative data patterns. The details of the ES3 structure [
21] are presented in Equations (10)–(17).
Multiplicative combination:
Ft+n = forecast value at time t with n steps;
p = considered period length;
n = forecast step;
St−p+n = estimated seasonal.
2.2.2. Machine Learning Technique
Long Short-Term Memory (LSTM)
One of the most efficient techniques in the machine learning (ML) category for forecasting time-series data is the long short-term memory (LSTM) model [
25,
26]. LSTM, a variant of recurrent neural networks (RNNs), utilizes short-term and long-term memory cells to enhance forecasting performance [
18,
27]. This design allows LSTM to retain only the necessary input data in short-term memory for accurate predictions while capturing long-term dependencies in both linear and nonlinear data. Additionally, LSTM addresses the issues of vanishing and exploding gradient descents during training. By continuously updating weights in its memory cells throughout the training process, LSTM reduces noise and improves forecast accuracy, making predictions more aligned with actual data. The structure and details of the LSTM model are illustrated in Equations (18)–(23) [
26].
First step: The forget gate (
decides which information must be rejected from the cell.
Second step: The input gate (
determines the chosen inputs for updating the LSTM cell state. Additionally, the hyperbolic tangent layer generates a new vector for the candidate cell state (
.
Third step: The existing cell state (
is updated by combining it with the candidate cell state (
to produce the new state (
.
Fourth step: The output gate (
determines which information from the cell state should be produced as the output.
Last step: The value of the hidden state (
is constructed based on the output gate (
and the update cell state (
, typically using an activation function such as the hyperbolic tangent.
where
= the input at time t and represents the external factors;
= the hidden state at time t − 1;
= the hidden state at time t;
= input weights at each gate;
= recurrent weights at each gate;
= bias factors at each gate;
= the sigmoid function;
= the hyperbolic tangent function.
2.3. Relevant Prior Works
Existing studies have explored the use of ML and DL techniques to predict train passenger trends across various scenarios. For example, some researchers [
28,
29] applied artificial neural networks (ANNs) and related ML techniques to forecast passenger train delays, considering multiple constraints and input factors, including varying portions of training and testing datasets. The results of these studies revealed that the ANN technique outperformed other approaches. Another study [
30] proposed a real-time prediction model for train and platform crowding using Random Forest and Gradient-Boosted Trees techniques. The results demonstrated that the predictive data could reduce passenger boarding refusal rate and improve train capacity utilization. Lastly, a researcher [
31] developed a novel forecasting model based on the theory of random utility and a multinomial logit model to predict passenger distribution flow during holidays. This proposed model also outperformed other ML models in comparative analyses.
Although some studies have implemented ML and DL techniques to forecast train passenger trends, few have examined how such forecasts influence tourist behavior in the context of rail tourism. Moreover, most existing studies rely on single forecasting techniques to analyze passenger behavior. The problem with single forecasting techniques is that they cannot learn and capture all behavior contexts. Some techniques perform well when only identifying behavior trends, while others rely solely on the seasonality of demand during certain periods. To address these research gaps, this study aims to enhance the forecasting performance of passenger trends by utilizing hybrid forecasting models. These models are effective in capturing tourist behaviors, including linear and nonlinear trends, as the seasonal patterns of specific periods. Additionally, we will evaluate the performance of proposed hybrid models using historical train passenger data from all regions covered by the SRT. The details of the research methodology will be presented in the next section.
5. Conclusions
This paper proposes hybrid LSTM-DES models that combine traditional forecasting methods with recurrent neural network techniques. We experiment with the proposed models using historical passenger datasets from four regions of Thailand. Additionally, we compare the performance of the proposed hybrid models with several individual forecasting models, including DMA, DES, and Holt–Winter methods (ES3), and long short-term memory (LSTM) recurrent neural networks, using MAPE, MASE, and CV scores as performance metrics.
Our proposed hybrid model builds upon previous work, incorporating improvements in hyperparameter tuning through the GRG nonlinear optimization method. The results demonstrate that the hybrid LSTM-DES model outperforms all individual models in terms of both accuracy and demand variation. For the MAPE, the hybrid LSTM-DES model achieves the lowest score, with less than 3.50 percent in all regions. Similarly, the hybrid model provides the lowest MASE score, which is below 0.5 in all regions. Furthermore, the forecasted passenger numbers show stability and reliability, with a CV score consistently below 0.25. Using the forecasted data, we calculate the incoming passenger revenue for each region and compare it to actual data. The deviation percentage between forecasted and actual passenger numbers is very small, with less than 1 percent deviation across all regions. This suggests that the forecasted passenger numbers can be used to reliably estimate incoming revenue across the four regions. From a managerial perspective, the forecasted monthly train passenger numbers can also serve as key input factors for developing train tourism strategic plans. Moreover, the experimental results prove that the hybrid forecasting approach is applicable for resource allocation planning, both regionally and globally in different areas.
For future work, we recommend conducting tourism cluster analyses using the forecasted train passenger data for each region. Factors such as train station locations, available facilities, and the number of departure/arrival trains at each station could serve as valuable inputs for designing sustainable train tourism policies. This approach would further enhance the managerial framework proposed in this paper and contribute to more effective and sustainable train tourism strategies.