Next Article in Journal
A Novel Method for Establishing an Efficiency Map of IPMSMs for EV Propulsion Based on the Finite-Element Method and a Neural Network
Next Article in Special Issue
Reputation System for Increased Engagement in Public Transport Oriented-Applications
Previous Article in Journal
A Versatile Analog Electronic Interface for Piezoelectric Sensors Used for Impacts Detection and Positioning in Structural Health Monitoring (SHM) Systems
Previous Article in Special Issue
A Data Mining and Analysis Platform for Investment Recommendations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Two Advanced Models of the Function of MRT Public Transportation in Taipei

1
Department of Information Management, Hwa Hsia University of Technology, New Taipei City 235, Taiwan
2
Department of Business Management, Hsiuping University of Science and Technology, Taichung City 412, Taiwan
3
Department of Computer Science and Information Management, Hungkuang University, Taichung City 433304, Taiwan
4
National Museum of Marine Science & Technology, Keelung City 202010, Taiwan
*
Authors to whom correspondence should be addressed.
Electronics 2021, 10(9), 1048; https://doi.org/10.3390/electronics10091048
Submission received: 2 April 2021 / Revised: 23 April 2021 / Accepted: 26 April 2021 / Published: 29 April 2021

Abstract

:
Tour traffic prediction is very important in determining the capacity of public transportation and planning new transportation devices, allowing them to be built in accordance with people’s basic needs. From a review of a limited number of studies, the common methods for forecasting tour traffic demand appear to be regression analysis, econometric modeling, time-series modeling, artificial neural networks, and gray theory. In this study, a two-step procedure is used to build a predictive model for public transport. In the first step of this study, regression analysis is used to find the correlations between two or more variables and their associated directions and strength, and the regression function is used to predict future changes. In the second step, the regression analysis and artificial neural network methods are assessed and the results are compared. The artificial neural network is more accurate in prediction than regression analysis. The study results can provide useful references for transportation organizations in the development of business operation strategies for managing sustainable smart cities.

1. Introduction

Taipei has the highest population density and traffic capacity in Taiwan. The construction of the Mass Rapid Transit (MRT) [1] system has relieved long-standing traffic problems in Taipei’s urban area. In addition to its safety and convenience, the public MRT may also control the growth of private vehicles, reduce carbon emissions, and save energy, allowing for the creation of a low-carbon and green energy-based city. The preliminary road network of the public MRT system in the urban area of Taipei was approved by the Executive Yuan in 1986. The Taipei City Government established the Department of Rapid Transit Systems in 1987 and launched the construction of a preliminary road network or extension, following a revision. Taipei Rapid Transit Corporation was incorporated in 1994. The Muzha Line, the first driverless medium-capacity rapid transit line in Taiwan, was opened in March 1996, turning over a new leaf for public transportation in Taiwan. In March 1997, the first high-passenger-capacity system, the Tamsui Line, was opened, with a service scope that extended from Taipei City to New Taipei City. Following the continuous opening of road networks, 21 administrative regions across Taipei (with 12) and New Taipei City (with 9) all came to be included in the MRT routes after the opening of the Songshan Line in November 2014. According to the statistics of the department of the account of the Ministry of Transportation and Communications (MOTC), for traffic indicators from January to June 2015, the daily passenger capacity was 1.943 million on average, indicating that the MRT system is frequently used now and poses an interesting/important positive issue relating to traffic volume forecasting. However, autonomous vehicles (or self-driving cars) pose another challenging public transport-related issue, and they have been the subject of much research in recent years. According to the study of Wiseman [2], although it is noteworthy that the overall consequences of autonomous vehicles are still unknown, their impacts on other means of transportation and on the transportation infrastructure remain unclear. In particular, the point at which autonomous vehicles penetrated the transportation market needs to be determined, and more information about them needs to be available, as this information will enable more profound studies about the future. More seriously, Wiseman [3] indicated that public transportation systems will soon be obsoleted due to the benefits of autonomous vehicles. While this serious topic is valuable for sustainability studies, it is not the focus of this study. This study is concerned with traffic forecasting.
It is problematic that the rising number of passengers affects the passenger traffic of each MRT station, resulting in the spread or concentration of some passengers in certain transit stations. Therefore, passenger traffic forecasting and analysis are required. In addition to providing a train control and transportation plan, the data on MRT passenger traffic, with the e-ticket or passenger traffic information about the surrounding High-Speed Rail (HSR) [4], Taiwan Railway, and city bus, may also be used as the basis of the public transportation system plan and operational management to enhance the efficiency and service quality of all public transportation tools and derive the maximum benefits from passenger traffic forecasting. Since the volume of passenger traffic has a significant impact on the transportation industry, past passenger traffic trends may be used as the basis for future operational plans and decisions in order to derive maximum benefit. The forecasting of MRT passenger traffic is correlated with the positive or negative economic development of the greater Taipei area in the future, while a precise forecast depends on a full understanding of the environment of urban areas, factors affecting demand, and the proper forecasting tool selection. The primary purpose is to collect past research data and sort out the best variables affecting the passenger traffic demand forecast from a review of the literature. The aims of this study are as follows: (1) to analyze important variables that affect the selection and forecasting of passenger traffic and develop a forecast model using regression analysis [5]; (2) to select important variables affecting MRT passenger traffic using an artificial neural network [6] and create a forecast model; and (3) to compare the results of the regression analysis and artificial neural network research, evaluate the forecast performance of the two projection methods, and then select the best forecasting model.
In this study, a two-step procedure is used to build a predictive model for public transport. In the first step of this study, regression analysis is used to find the correlations between two or more variables and their associated directions and strength, and the regression function is used to predict future changes. In the second step, the regression analysis and artificial neural network methods are assessed, and the results are compared.

2. Literature Review

This section introduces the passenger traffic forecast and application, selection of variables, artificial neural network, regression analysis, and mean absolute percentage error (MAPE) evaluation indicators.

2.1. Passenger Traffic Forecasts and Application

Passenger traffic forecasts are an important basis for developing a traffic capacity and transportation plan for a transportation organization, as well as the foundation for constructing and modifying various forms of transportation equipment. The transportation demand and qualitative and quantitative features determine the transportation supply plan. Some common approaches to forecasting passenger traffic demand include regression analysis, econometric modeling, time-series modeling, artificial neural networks, and gray theory. This study adopts an artificial neural network to create the model. The artificial neural network is more flexible in terms of its parameter design without limitations relating to statistical assumptions, it is highly capable of learning, and it is available to precisely forecast passenger traffic under the conditions of a sufficient number of training samples and appropriate parameter design. The other research model used is regression analysis. The approach adopts mathematical statistics to create independent and dependent variables based on massive observation data in order to understand whether two or more variables are correlated and their correlative direction and strength before establishing a regression function to predict future movement.
The literature related to passenger traffic discussed in this study includes the following: Mazanec [7] suggested that the artificial neural network model is better than discriminate analysis. Furthermore, the artificial neural network can also classify a group of input variables (such as the social, economic, demographic, or behavioral attributes) of train passengers, including output data. Law and Au [8] predicted the passenger traffic of the air route between Hong Kong and Japan, using the GDP, foreign exchange, population, marketing expenses, service price, and hotel rate as input variables. The authors randomly picked 20 trained datasets among the data from the period of 1967–1997 and another 10 datasets for comparison. In the meantime, the MAPE, normalized correlation coefficient, and acceptable output percentage were used as the standards to compare the actual value with the forecast value derived from an artificial neural network, exponential smoothing, the moving average, and regression analysis. As a result, the MAPE value of an artificial neural network is only 10%, meaning that it has the best forecast result.
Kulendran and Witt [9] used the leading indicator transfer function (TF) to predict the needs for travel from England to six other countries (including the U.S., Germany, the Netherlands, Spain, Portugal, and Greece) in the period of 1993–1995. The authors first selected the six most common economic indicators for travel testing—relative price, exchange rate, relative exchange, domestic real income, and GDP—to find the correlation between the need to travel to these six countries and all variables. Next, they found the formula, using data from 1978 to 1992 as the input variable and the passenger number as the output variable to forecast the demand in the period of 1993–1995. Meanwhile, the MAPE was used as the criteria to compare the results of the autoregressive integrated moving average (ARIMA) and error correlation models (ECMs). The results show that ARIMA is more precise than TF when the quarter unit is one, four, and eight, while TF is more precise under other circumstances. On the other hand, the ECMs are more precise than TF in terms of long-term conditions (by a unit of four quarters and eight quarters). Grosche et al. [10] used two gravity models to predict the passenger traffic between two cities. The input variables of the first gravity model were population, consumption ability, GDP, distance, and average travel time. The second one was the extended gravity model, which added the number of competitive airports and average distances of competitive airports as input variables. They were verified by the data on Germany and 29 other European countries, and the results show that both gravity models were able to precisely predict the passenger traffic volume.

2.2. Variable Selection

The input variables (independent variables) adopted in this study are summarized in Table 1 based on a literature review and an analysis of the demand for public transportation. X1–X20 are independent variables based on the total passenger number of the Taipei MRT, i.e., the output variables. Among them, each datum shall be based on the “year”.
The input variables of the study are shown in Table 1, and their definitions are as follows:
  • Y1: The training samples are from the passenger traffic data of the Muzha Line from 1996 to 2015, and the artificial neural network and regression analysis are adopted to forecast the Taipei MRT passenger traffic.
  • X1: The “nominal market value” of “final products and services” produced by “all people in a specific region” within a “certain period”. A prosperous economy will lead the passenger traffic.
  • X2: The total market value of all the “services” and “final products” produced by a country within a certain period. In general, a higher number means higher productivity in the country, which means it is more prosperous and helpful in relation to passenger traffic.
  • X3: The most important economic indicator to determine the macroeconomic situation, which may reflect the change in the scale of the total economic output, and the most physical symbol of economic prosperity, which may affect the capacity of public transportation tools.
  • X4: The higher the density of a financial industry, the more prosperous the business development, which may increase the demand for MRT.
  • X5: The tourism business is an important service industry that contributes much to the national income, employment, and foreign exchange earnings. Meanwhile, the number of tourists is also helpful in evaluating the increase in passenger traffic.
  • X6: The good operation of listed companies with sufficient working capital may promote business activities and increase the demand for transportation.
  • X7: A local employee population may reflect the level of local economic prosperity. The people of a country with greater employment opportunities may have a certain consumption ability and will affect the public transportation volume.
  • X8: The price variation indicator, calculated based on the prices of products and services relating to living in a residence, is one of the major indicators for measuring inflation. The increase in price will affect the public transportation volume.
  • X9: The total population density of Taipei and New Taipei City, obtained using the land area of Taipei and New Taipei City divided by the population, is adopted as the variable. The population will affect the consumption ability of the public. The consumption ability will automatically increase with an increasing population. Therefore, the population may affect the demand for transportation in a certain area.
  • X10: Personal income affects the consumption ability of the public. The higher the income, the greater the consumption ability. Therefore, personal income may affect the transportation demand.
  • X11: This factor measures the change in prices of imported and exported products. The higher the consumption amount of the public, the stronger the consumption ability of the nationals in the country, which may affect the passenger traffic.
  • X12: The consumption of the public in the country could be observed from the total amount imported to Taiwan. A higher amount means a stronger consumption ability of the people in the country, which may affect the passenger traffic.
  • X13: The density of the total MRT train trips and time spent waiting for mass transportation are important factors affecting the willingness of commuters to take public transportation. The shorter the waiting time, the higher the willingness of commuters to take public transportation.
  • X14: The accessibility of the transportation network to administrative regions may likely affect the willingness of the public to take public transportation. If different mass transportation tools can be effectively integrated, commuters will be more willing to take public transportation.
  • X15: The total travel mileage of all MRT trains within a specific time and period. The higher the passenger traffic is, the more train trips and travel mileage of trains there will be.
  • X16: Operation income status is critical for the sustainable operation of a transportation organization. If the organization has operation losses, its transportation supply efficiency must be affected.
  • X17: The actual number of stops in all the routes of the MRT operation and the expansion of the transportation network will increase the accessibility of the expected destinations. If different mass transportation tools can be effectively integrated, commuters will be more willing to take public transportation.
  • X18: Shorter distances of transportation routes will increase the accessibility of the planned destinations. If different mass transportation tools can be effectively integrated, commuters will be more willing to take public transportation.
  • X19: The total number of passengers multiplied by the mileage traveled. The mileage for the calculation of passenger kilometers will be based on the mileage for the freight calculation.
  • X20: The changes in MRT, bus transportation, and transfer volume are used to analyze the effects of decreases in transfers. The time spent waiting for public transportation tools is an important factor affecting the willingness of commuters to take public transportation. The shorter the waiting time, the higher the willingness of the commuters to take public transportation.

2.3. Artificial Neural Network

An artificial neural network [11] is a parallel calculation model, which is similar to the human neural structure. It is an information-processing technology inspired by brain and nervous system research, and it is also usually referred to as a parallel distributed processing model or a connectionist model. An artificial neural network uses a lot of simply connected artificial neurons [12] to simulate the ability of the biological neural network. An artificial neuron is a simple simulation of a biological neuron, which acquires information from the external environment or other artificial neurons. It performs very simple calculations and then exports the results to the external environment or other artificial neurons for further action. The basic structure of a general neural network is divided into the neuron, layer, and network parts. The layer consists of basic neurons, and the network is constituted by layers. The neuron is also called the processing unit and the basic unit of the artificial neural network. The operation model of an artificial neural network is mainly divided into the training phase and the recall phase: the training phase refers to learning from training to adjust the weight of the network so that the network can become stable. The recall phase involves determining the output value induced by the network to test whether the output is close to the target output value. The artificial neural network adopted by this study is the back-propagation network [13] in a supervised learning network [14], which is applicable to classification, forecasting, system control, noise filtering, sample identification, and data compression. The input layer units are different in each step. The number of processing units in the hidden layer is determined by the number of input and output layers, and the final output result will be either one or two. The back-propagation network is the most representative and popular model among the currently available artificial neural networks. The basic theory of artificial neural networks involves minimizing the error function, using the gradient steepest descent method (GSDM) [15] to achieve the learning purpose.

2.4. Regression Analysis Method

Regression analysis [16] is a method involving the analysis of data in statistics. It is mainly used to analyze whether there is a specific relationship between one or more independent variables and dependent variables. Regression analysis is a model for establishing the relationship between a response variable Y and independent variables X. The purpose is to understand whether two or more variables are related and their correlative direction and strength, as well as to establish a mathematical model that allows for the observation and prediction of specific variables. Since the purpose of prediction regression is not to clarify but rather to establish the best formula, the primary consideration in variable selection is whether there is a maximum practical value, as opposed to the theoretical appropriateness. The theory primarily explains the value of the regression model in practical applications and its mechanism for solving problems in prediction regression. It is expected to achieve the maximum practical value with the lowest cost. The first job of explanation regression is to carefully review the features and relationships among all variables, that is, to examine the correlation among the variables.

2.5. Mean Absolute Percentage Error

MAPE [17,18,19] refers to the average absolute percentage error, which is the evaluation indicator for whether a prediction model is good or bad. Since MAPE is a relative value that is not affected by the measurement value and estimated value, it can observe the difference between the estimated and evaluated values objectively. The estimation effect is better if the MAPE value is closer to 0. The standards to evaluate the precision of a forecast are shown in Table 2.

3. Materials and Methods

This section introduces the research structure and research design and steps.

3.1. Research Structure

The flow of this study is shown in Figure 1, below, and the relevant steps are as follows:
  • The literature and selected variables that may affect the MRT passenger traffic are collected via a literature review.
  • The artificial neural network and regression analysis are adopted as research methods for training and establishing a prediction model of the MRT passenger traffic: (1) the years of training are selected; (2) the input variables of the MRT passenger traffic are deleted one by one, and the best variable is found; (3) the possible input variables that are likely to affect the MRT passenger traffic in the training are added, and the best variable is found; and (4) the MRT passenger traffic forecast model is established via an artificial neural network and regression analysis.
  • The MRT passenger traffic forecast model is established.
  • The advantages and disadvantages of the input variables selected are compared and analyzed, considering the results of the artificial neural network and regression analysis.
  • The research conclusions and suggestions based on the research results are provided.

3.2. Research Design and Steps

We first collected the relevant literature and selected variables that could potentially affect the MRT passenger traffic via a literature review, as shown in Table 1, and then selected the years of training, as shown in Table 3.
The main structure of this study is established by the idea of an artificial neural network structure, including the input layer, hidden layer, and output layer. X1 to X10 are variables that may affect the Taipei MRT passenger traffic, while Y1 is the Taipei MRT passenger traffic. We first tested independent variables X1 to X10 and then carried out the deletion test, before adding independent variables X11 to X20 one by one. The purpose of this step was to screen the input variables with a lower correlation to the Taipei MRT passenger traffic to avoid interfering with the prediction results. The regression analysis is primarily used to discuss the causal result relationship among the variables and conduct the prediction via a line chart [20,21], a scatter chart [22], correlation analysis [23,24,25], the enter method [26,27], and stepwise regression [28,29,30].

3.2.1. Line Chart and Scatter Chart

The line chart demonstrates that the growth trend of the Taipei MRT passenger traffic (Figure 2) is relatively similar to the variable X1 GNP (Figure 3), and its effect is therefore assumed to be more important. From the scatter chart (Figure 4), if the variable X1 GNP and the Taipei MRT passenger traffic have a linear distribution, they should be more correlated, and the effect of variable X1 GNP on the Taipei MRT passenger traffic will be obvious.

3.2.2. Correlation Analysis

In correlation analysis, if two variables are significantly correlated, this only means that the strength and direction between the variables are significant. When the coefficient is significant, it only explains that the two variables are correlated at a certain level, including the strength and direction, instead of indicating the existence of a causal relationship. They may both be the cause and result at the same time, or a causal relationship may actually exist.

3.2.3. Enter Method

The enter method means that all the prediction variables must be entered into the regression formula, regardless of the significance of the individual variable.

3.2.4. Stepwise Regression

When variables are entered into the regression equation, the backward selection method is used to eliminate unimportant variables.

4. Empirical Analysis

This study uses regression analysis and class neural network methods to select and predict the variables affecting passenger traffic using the Taipei Metro passenger traffic trends from March 1996 to December 2015. Finally, we compare the results of the neural network and regression analysis prediction methods, evaluate and analyze the prediction performance of the two methods, and then select the best prediction model.

4.1. Regression Analysis

We adopted statistical software to analyze 20 parametric variables from March 1996 to December 2015 to explore the causal relationship between the variables as well as to make predictions through line graphs, scatter plots, correlation analysis, forced entry, and stepwise regression analysis to explore the causal relationship between the variables.

4.1.1. Line Chart

If the line graphs of the variables are found to be more similar to the growth trend line graphs of Taipei Metro’s passenger volume, it is speculated that the effect should be more important (Figure 5).

4.1.2. Scatter Chart

From the scatter plot, we can see that the correlation between the variables and Taipei Metro’s passenger volume is greater if they are clearly distributed in a straight line (Figure 6).

4.1.3. Correlation Analysis

The correlation coefficient primarily refers to the degree of correlation between the variables, while it does not verify the influence of the “independent variable” on the “dependent variable”. Therefore, the obtained correlation coefficient (R value) can only indicate that the two variables are positively correlated, negatively correlated, or independent. It cannot be interpreted as the effect of the independent variable on the dependent variable. In the interpretation of the correlation coefficient, positive and negative indicate the direction of the correlation, not the degree of the correlation. If the degree of the correlation is between the R values, i.e., between plus or minus 0.3 (i.e., between 0.3 and −0.3), it is considered a low correlation; it is considered a moderate correlation if the value is between plus or minus 0.3 to 0.6 (that is, between 0.3 and 0.6, or between −0.3 and −0.6); and if it is between plus or minus 0.6 to 0.9 (i.e., 0.6 to 0.9, or −0.6 to −0.9), it is considered a high correlation. If the R value is plus or minus 1, this indicates a complete correlation.

4.1.4. Entry Method

All the predictive variables are incorporated into the regression equation at the same time, and the least squares method and statistical software are used to calculate the complex regression model. The analysis of the explanatory variables is as follows. Table 4, showing the entry variables inputted/removed, provides a list of variables that are subjected to regression analysis, including a total of 16 independent variables.
The variance analysis result of the forced entry method shows the overall verification of the model significance to verify the significance of the overall regression model. The quadratic sum of the regression model is 830,852,340,051,751,420, the total sum of squares is 830,902,477,461,923,070, the F value = 3107.157, and the p value = 0.000 < 0.05, which reaches significance. Furthermore, F is used to verify the regression model, F (α, K, N-K-1) = F(0.05, 16, 3) ≒ 8.7, and the F value of this regression equation = 3107.157 > critical value F value of 8.7, thus rejecting the null hypothesis and indicating that the explanatory power of the overall regression model reaches a significant level.
In Table 5, the coefficients of the forced entry method are the verification results of the multicollinearity of the output independent variables using statistical software. This model has a total of 16 independent variables and 17 eigenvalues, and the minimum tolerance is 0.000. If the tolerance is lower than 0.2, it is determined that there is collinearity between this independent variable and other independent variables. The highest VIF is 4167.205. In general, if the VIF is greater than 4, this means that the independent variable is collinear with other independent variables.
The variables excluded by the forced entry method demonstrate that there are four variables, including the total GNP, GDP, MRT mileage, and MRT extended passenger mileage, that did not enter the regression model. To reduce the collinearity problem of the multiple regression model, we adopted the stepwise method. The advantage of this method is that the variables are selected one by one according to the influence of the independent variable on the dependent variable, thus eliminating the collinearity problem.

4.1.5. Stepwise Method

To reduce the collinearity problem of multiple regression models, we adopted the stepwise method and performed calculations using statistical software. The following Table 6 was obtained.
Table 6 shows the stepwise regression variables inputted/removed and a list of stepwise method variables. The criteria for selection are: (F-to-enter probability ≤ 0.050, F-to-remove probability ≥ 0.100). In total, two variables are selected for the regression equation in two steps (models). In Model 1, MRT passenger revenue is selected. In Model 2, two-way transfer preferential volume is added. Therefore, the two variables of the MRT passenger revenue and two-way interchange preferential volume are selected for the model.
The analysis result of stepwise regression variance using Model 1 of the MRT passenger revenue variables shows that the F value of the overall regression model is 15,598.145, p = 0.000 < 0.05, reaching significance and indicating a significant correlation between the independent variable (the MRT passenger revenue) and the dependent variable (the Taipei MRT passenger traffic). Taking Model 2 as an example, the F value of the regression model of Model 2 is 12,741.962, p = 0.000 < 0.05, which reaches significance and indicates a significant correlation between the independent variable (the preferential volume of two-way interchange) and the dependent variable (the Taipei MRT passenger traffic). The two variables can effectively predict the passenger traffic of the Taipei MRT.
Table 7 demonstrates a positive relationship between the MRT passenger revenue and the preferential volume of the two-way passenger transfer—two independent variables in the table of stepwise regression coefficients, both of which were consistent with the prediction and significant.
Table 8 demonstrates the stepwise mode analysis results of the variable table excluded by stepwise regression, with two variables remaining: the MRT passenger revenue and two-way interchange preferential volume. From Table 8, it can also be understood that the variables may enter the regression model because the correlation value of the dependent variables is the highest in Model 1, where the preferential volume of two-way interchange is 0.649. In Model 2, the correlation value of the total amount of goods imported into Taiwan is −0.310, which is the highest.
We used statistical software to analyze the inputting of 20 parameter variables from 1996 to 2013 and outputting of the predictive value of the passenger traffic of the Taipei MRT (Table 9). The predicted value was calculated according to the predicted value table of the progressive regression analysis using the MAPE formula. The result obtained using the regression analysis prediction method was MAPE = 6.47%.

4.2. Artificial Neural Network

Using the training data from 1996 to 2013 as the artificial neural network training materials and those from 2014 to 2015 as the sample data, we were able to use a total of 20 parameters as the input variables of the artificial neural network; the output variables were the passenger traffic prediction values of the Taipei MRT. Statistical software was used to analyze 20 parametric variables from 1996 to 2015, before the input variables were converted into an artificial neural network to investigate the causal relationship between the variables and make predictions based on a line chart, scatter chart, correlation analysis, forced entry method, and stepwise method. Therefore, the artificial neural network does not temporarily delete any variables and inputs all of them into the artificial neural network. Then, the optimal combination of input variables was analyzed.

4.2.1. Year of Selection

Basic information on the year of selection for variables X1 to X20 and the output variable Y1 is shown in Table 3, where Y1 stands for the Taipei MRT passenger traffic. Using the backward propagation artificial neural network algorithm model, the average absolute percentile error was used to analyze 7 years/2 years of training, 6 years/2 years of training, and 5 years/2 years of training from 1996 to 2015, and the training results are shown in Table 10. This study uses the training model of 5 years/2 years of training prediction for empirical research.

4.2.2. Deleted Variables

First, we used the 5-year and 2-year training modes (2009 to 2013 and 2014 to 2015) and then deleted the X1 to X10 variables in order, selecting the variables with the highest MAPE value for deletion. The purpose of this was to screen out variables that have little influence on the passenger traffic of the Taipei MRT to avoid interference with the predicted results. After the deletion was complete, we then started the training process to determine the best variables and improve the accuracy of the prediction results.

4.2.3. The Output Results of the Artificial Neural Network

Using the training data from 1996 to 2013 as the artificial neural network training materials and the data from 2014 to 2015 as the sample data, we used a total of 20 parameters as the input variables of the artificial neural network; the output variables were the passenger traffic prediction values of the Taipei MRT (Table 11).
According to the prediction values, shown in Table 10, of the artificial neural network, calculated by the MAPE formula, the result obtained by the neural network prediction method is MAPE = 4.82%. To compare the predicted values of regression analysis with those of the artificial neural network, the results showed that the MAPE value of the regression analysis was 6.47%. The MAPE value of the artificial neural network was 4.82%, so the artificial neural network has the best prediction outcomes.

4.3. Comparison of Regression Analysis and Artificial Neural Networks

4.3.1. Deletion of the Forced Entry Method of Regression Analysis to Exclude Four Variables

Using the 5-year training for 2 years (2009–2013/2014–2015) model, we deleted the four eliminated variables of the regression analysis, which were the GNP, GDP, MRT mileage, and MRT extended passenger mileage. Using a total of 16 parameters as the input variables for the regression analysis and artificial neural network comparison, it was found that the MAPE value predicted by regression analysis was 0.94%, and the MAPE value of the artificial neural network was 0.54%. Both methods showed the best prediction results.

4.3.2. Deletion of Four Variables with High MAPE Values in Artificial Neural Networks

Using the model of 5-year training for 2 years (2009–2013/2014–2015), we deleted the four variables with the highest MAPE values in the artificial neural network, which included the economic growth rate, personal income, total amount of goods imported into Taiwan, and number of MRT stations. In total, 16 parameters were used as the input variables for the regression analysis and artificial neural network comparison, and it was found that the MAPE value predicted by regression analysis was 0.62%. The MAPE value of the artificial neural network was 0.31%. Both methods showed the best prediction results.

4.3.3. Collinearity Verification

In order to verify whether the independent variable of the MRT passenger revenue is collinear with the dependent variable of the MRT passenger traffic, statistical software was used to analyze the inputting of 19 parameter variables from 1996 to 2013 and outputting of the passenger traffic prediction value of the Taipei MRT. Again, the training data from 1996 to 2013 were used as the artificial neural network training materials, and the data from 2014 to 2015 were used as the sample data. In total, 19 parameters were used as the input variables of the artificial neural network, while the output variable was the passenger traffic prediction value of the Taipei MRT. Twenty variables and 19 parameter variables were used to compare the predictive values of the regression analysis and artificial neural network after removing the independent variable, the MRT revenue. The results showed that the MAPE residual value of the regression analysis was 1.28%, and the MAPE residual value of the artificial neural network was 0.19%. We observed no significant difference between the two, which proves that there was no collinearity problem after deleting the independent variable, the MRT passenger revenue.

4.3.4. Monthly Passenger Traffic Experiment

In this study, we collected data on passenger traffic (month) and related variables (month); however, because some data were recorded at different times, 167 regression analyses were conducted from January 2000 to December 2015 (excluding the incomplete data on Cyclone Nari in September 2001). The neural network used the mode of 2000–2013 training 2014–2015 and the following seven parameter variables: the MRT operation mileage, MRT station number, MRT train number, MRT extended vehicle mileage, MRT extended passenger mileage, MRT passenger revenue, and two-way transit preferential volume. The output variable is the passenger traffic prediction value of the Taipei MRT. The results showed that the MAPE value of the regression analysis was 1.45%, and the MAPE value of the artificial neural network was 0.42%. Both methods showed the best prediction results.

4.4. Summary of the Empirical Results

The results of this study are shown in Table 12, which summarizes the empirical results of the regression analysis and artificial neural network.

5. Analysis of Empirical Results

The results and research contributions of this paper in relation to passenger traffic prediction provide a valuable reference for both academics and practitioners. They are summarized in this section.

5.1. Analysis of Results

In this study, we used an artificial neural network and regression analysis to construct the Taipei MRT passenger traffic prediction model. Seven findings of this study are worth summarizing:
  • A total of 20 parameters from 1996 to 2013 were used as the input variables of the regression analysis, and the output value was the passenger traffic prediction value of the Taipei MRT. The stepwise regression of the regression analysis, predicting the MAPE value of the Taipei MRT passenger traffic, is 6.47%, which demonstrates an excellent predictive performance.
  • Using the training data from 1996 to 2015 as artificial neural network training materials and the data from 2013 to 2015 as the sample data, a total of 20 parameters were used as the input variables of the artificial neural network; the output value was the passenger traffic prediction value of the Taipei MRT. Using an artificial neural network to predict the MAPE value of the Taipei MRT passenger traffic, the result was 4.82%, which demonstrates an excellent predictive performance.
  • We used the 5-year training for 2 years (2009–2013/2014–2015) model and deleted the already eliminated four variables of regression analysis, which were the GNP, GDP, MRT mileage, and MRT extended passenger mileage. In total, 16 parameters were used as the input variables for regression analysis and artificial neural network comparison. The MAPE value predicted by regression analysis was found to be 0.94%, and the MAPE value of the artificial neural network was 0.54%. Both methods had the best prediction results.
  • We used the model of 5-year training for 2 years (2009–2013/2014–2015) and deleted the four variables with the highest MAPE values in the artificial neural network, including the economic growth rate, personal income, total amount of goods imported into Taiwan, and number of MRT stations. In total, 16 parameters were used as the input variables for regression analysis and artificial neural network comparison. The MAPE value predicted by regression analysis was 0.62%, and the MAPE value of artificial neural networks was 0.31%. Both methods had the best prediction results.
  • In order to verify whether the independent variable MRT passenger revenue is collinear with the dependent variable MRT passenger traffic, we applied statistical software to analyze the inputting of 19 parameter variables from 1996 to 2013 and outputting of the passenger traffic prediction value of the Taipei MRT. Again, we used the training data from 1996 to 2015 as the artificial neural network training materials and the data from 2013 to 2015 as the sample data. We used a total of 19 parameters as the input variables of the artificial neural network and the passenger traffic prediction value of the Taipei MRT as the output value in order to compare the predictive values of the regression analysis and artificial neural network. The result showed that the MAPE value of the regression analysis was 5.19%, and the MAPE value of the artificial neural network was 4.63%. Twenty variables and 19 parameter variables were used to compare the predictive values of the regression analysis and artificial neural network after removing the independent variable, the MRT revenue. The results showed that the MAPE residual value of regression analysis was 1.28%, and the MAPE residual value of the artificial neural network was 0.19%. No significant difference was observed between them, which proves that there is no collinearity problem after deleting the independent variable, the MRT passenger revenue.
  • We collected data on the passenger traffic (month) and related variables (month); however, because some data were recorded at different times, 167 regression analyses were conducted from January 2000 to December 2015 (excluding the incomplete data on Cyclone Nari in September 2001). The neural network used the mode of 2000–2013 training 2014–2015 and the following seven parameter variables: the MRT operation mileage, MRT station number, MRT train number, MRT extended vehicle mileage, MRT extended passenger mileage, MRT passenger revenue, and two-way transit preferential volume. The output value is the passenger traffic prediction value of the Taipei MRT. The results show that the MAPE value of regression analysis was 1.45%, and the MAPE value of the artificial neural network was 0.42%. Both methods had the best prediction results.
  • This study investigated the passenger traffic prediction of the Taipei MRT and analyzed and constructed the prediction model based on two prediction methods: regression analysis and artificial neural networks. The results are presented to allow transportation organizations to maximize their profits by making plans and decisions relating to future operations based on past passenger traffic trends.

5.2. Research Contributions

To improve the utilization rate of the metro area’s transportation system and reduce environmental pollution by combining it with the public transportation system of the MRT station, the following research contributions are provided:
1.
Choosing the right forecasting tools for business planning and decision making.
The amount of passenger traffic significantly impacts the transportation industry, and accurate prediction depends on a full understanding of the metropolitan environment and analysis of the factors that influence demand. The commonly used methods for forecasting passenger traffic demand include statistical regression analysis, econometric modeling, time-series modeling, neural-network modeling, gray theory, etc.; selecting the appropriate forecasting tools can allow for the maximization of profits by aiding in planning and decision making relating to future operations based on past passenger traffic trends.
2.
Construction and planning of transport systems.
Passenger traffic prediction is a very important factor in the construction and planning of transportation systems and serves as the essential basis for putting forward requirements for the construction and expansion of transportation equipment. Therefore, the results of this study can be used as a reference for transportation organizations, such as operation management, manpower allocation, shift distance, transportation demand, etc.
3.
Curbing the growth of private vehicles.
The excessively increased use of private vehicles will cause such problems as air pollution, noise, road congestion, traffic accidents, etc., and the social costs shall be shared by the public. Therefore, strengthening the control on the increase of private vehicles is necessary, and passenger traffic prediction cooperates perfectly with public transportation transfer system construction, encouraging people to transfer from private vehicles to public transport systems and then improving ridership on public transportation systems.
4.
Public transportation leads to urban development.
With the public transport system as the backbone of urban development, passenger traffic prediction can establish a different planning method and procedure from traditional urban development, implement the priority concept of public transportation, look toward the future, and actively promote subsequent MRT-related construction based on the existing construction. In addition to being safe and convenient, public transportation systems can also inhibit the growth of private transportation, reduce carbon while promoting energy saving, and build low-carbon and green energy cities.
5.
Improvement of the efficiency of all public transport vehicles.
In this study, the data on MRT passenger traffic prediction not only provide traffic dispatching and transport strategic planning data but also combine them with the electronic ticket or passenger traffic data of public transport vehicles, such as the high-speed railways, Taiwan’s railways, urban buses, and UBikes around the MRT stations, to conduct the overall passenger traffic analysis. This is used as the basis for the planning and operation management of the public transport system to improve the efficiency and service quality of each public transport vehicle and thus maximize the benefit of passenger traffic prediction.

6. Conclusions

This study investigated the passenger traffic prediction of the Taipei MRT and analyzed and constructed the prediction model based on two prediction methods: regression analysis and artificial neural networks. The results are presented as a reference for transportation organizations to allow them to maximize profits by making plans and decisions relating to future operations based on past passenger traffic trends.
During this research, we discussed with senior executives of MRT companies and found that accurate passenger traffic prediction can effectively reduce costs. The management implications of this study are as follows:
  • The forecasting of passenger traffic has a great influence on the transportation industry, and transportation organizations can make plans and decisions for future operations based on past passenger traffic trends. Therefore, it is very important for transportation organizations to have a clear and objective forecasting method.
  • Based on past research data and a literature review, we determined the variable that affects the forecasting of passenger traffic demand the most. The forecasting of the MRT passenger traffic affects the future economic development of Taipei City and New Taipei City, so it is necessary to select the appropriate passenger traffic forecasting tool.
  • The data on passenger traffic forecasting in this study not only allow for the planning of traffic dispatching, operation management, manpower allocation, shift distance, transportation demand, etc., but also serve as an essential basis for transportation organizations, enabling them to put forward requirements for the construction and expansion of transportation equipment.
Despite the limitations of this study, we believe that the findings and management implications of our study are intriguing enough to invite future research on the topic of passenger traffic forecasting, as well as future research on other traffic-related topics.

Author Contributions

Conceptualization, Y.-S.C. and S.-H.C.; methodology, Y.-S.C.; software, S.-H.C.; validation, S.-H.C.; writing—original draft preparation, S.-H.C.; writing—review and editing, Y.-S.C., S.-F.C. and C.-K.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was partially supported by the Ministry of Science and Technology of Taiwan, grant number MOST 109-2221-E-146-003.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Verbavatz, V.; Barthelemy, M. Access to mass rapid transit in OECD urban areas. Sci. Data 2020, 7, 1–6. [Google Scholar] [CrossRef]
  2. Wiseman, Y. Autonomous vehicles. In Encyclopedia of Information Science and Technology, 5th ed.; IGI Global: Hershey, PA, USA, 2021; pp. 1–11. [Google Scholar]
  3. Wiseman, Y. Driverless cars will make passenger rail obsolete [Opinion]. IEEE Technol. Soc. Mag. 2019, 38, 22–27. [Google Scholar] [CrossRef]
  4. Yang, Z.; Li, C.; Jiao, J.; Liu, W.; Zhang, F. On the joint impact of high-speed rail and megalopolis policy on regional economic growth in China. Transp. Policy 2020, 99, 20–30. [Google Scholar] [CrossRef]
  5. Lio, W.; Liu, B. Uncertain maximum likelihood estimation with application to uncertain regression analysis. Soft Comput. 2020, 24, 9351–9360. [Google Scholar] [CrossRef]
  6. Toraman, S.; Alakus, T.B.; Turkoglu, I. Convolutional capsnet: A novel artificial neural network approach to detect COVID-19 disease from X-ray images using capsule networks. Chaos Solitons Fractals 2020, 140, 110122. [Google Scholar] [CrossRef]
  7. Mazanec, J.A. Classifying tourists into market segments: A neural network approach. J. Travel Tour. Mark. 1992, 1, 39–60. [Google Scholar] [CrossRef]
  8. Law, R.; Au, N. A neural network model to forecast Japanese demand for travel to Hong Kong. Tour. Manag. 1999, 20, 89–97. [Google Scholar] [CrossRef]
  9. Kulendran, N.; Witt, S.F. Leading indicator tourism forecasts. Tour. Manag. 2003, 24, 503–510. [Google Scholar] [CrossRef]
  10. Grosche, T.; Rothlauf, F.; Heinzl, A. Gravity models for airline passenger volume estimation. J. Air Transp. Manag. 2007, 13, 175–183. [Google Scholar] [CrossRef]
  11. Shams, S.R.; Jahani, A.; Moeinaddini, M.; Khorasani, N. Air carbon monoxide forecasting using an artificial neural network in comparison with multiple regression. Model. Earth Syst. Environ. 2020, 6, 1467–1475. [Google Scholar] [CrossRef]
  12. Liu, S.; Mocanu, D.C.; Matavalam, A.R.R.; Pei, Y.; Pechenizkiy, M. Sparse evolutionary deep learning with over one million artificial neurons on commodity hardware. Neural Comput. Appl. 2020, 1–16. [Google Scholar] [CrossRef]
  13. Sun, W.; Huang, C. A carbon price prediction model based on secondary decomposition algorithm and optimized back propagation neural network. J. Clean. Prod. 2020, 243, 118671. [Google Scholar] [CrossRef]
  14. Liu, R.; Mancuso, C.A.; Yannakopoulos, A.; Johnson, K.A.; Krishnan, A. Supervised learning is an accurate method for network-based gene classification. Bioinformatics 2020, 36, 3457–3465. [Google Scholar] [CrossRef] [PubMed]
  15. Shirokanev, A.S.; Kirsh, D.V.; Kupriyanov, A.V. Research of an algorithm for crystal lattice parameter identification based on the gradient steepest descent method. Comput. Opt. 2017, 41, 453–460. [Google Scholar] [CrossRef]
  16. Hemilä, H.; Chalker, E. Vitamin C may reduce the duration of mechanical ventilation in critically ill patients: A meta-regression analysis. J. Intensive Care 2020, 8, 15. [Google Scholar] [CrossRef] [Green Version]
  17. Şahin, U.; Şahin, T. Forecasting the cumulative number of confirmed cases of COVID-19 in Italy, UK and USA using fractional nonlinear grey Bernoulli model. Chaos Solitons Fractals 2020, 138, 109948. [Google Scholar] [CrossRef]
  18. Prado, F.; Minutolo, M.C.; Kristjanpoller, W. Forecasting based on an ensemble autoregressive moving average-adaptive neuro-fuzzy inference system–neural network-genetic algorithm framework. Energy 2020, 197, 117159. [Google Scholar] [CrossRef]
  19. Longo, G.A.; Righetti, G.; Zilio, C.; Ortombina, L.; Zigliotto, M.; Brown, J.S. Application of an Artificial Neural Network (ANN) for predicting low-GWP refrigerant condensation heat transfer inside herringbone-type Brazed Plate Heat Exchangers (BPHE). Int. J. Heat Mass Transf. 2020, 156, 119824. [Google Scholar] [CrossRef]
  20. Feng, Y.; Yang, T.; Niu, Y. Subpixel computer vision detection based on wavelet transform. IEEE Access 2020, 8, 88273–88281. [Google Scholar] [CrossRef]
  21. Sohn, C.; Choi, H.; Kim, K.; Park, J.; Noh, J. Line Chart understanding with convolutional neural network. Electronics 2021, 10, 749. [Google Scholar] [CrossRef]
  22. Shibzukhov, Z.M. Robust method for finding the center and the scatter matrix of the cluster. J. Phys. Conf. Ser. 2020, 1479, 012045. [Google Scholar] [CrossRef]
  23. Peng, Q.; Wen, F.; Gong, X. Time-dependent intrinsic correlation analysis of crude oil and the US dollar based on CEEMDAN. Int. J. Financ. Econ. 2021, 26, 834–848. [Google Scholar] [CrossRef]
  24. Shi, J.; Liu, B.; Qin, J.; Jiang, J.; Wu, X.; Tan, J. Experimental study of performance of repair mortar: Evaluation of in-situ tests and correlation analysis. J. Build. Eng. 2020, 31, 101325. [Google Scholar] [CrossRef]
  25. Ibrahim, M.S.; Sidiropoulos, N.D. Reliable detection of unknown cell-edge users via canonical correlation analysis. IEEE Trans. Wirel. Commun. 2020, 19, 4170–4182. [Google Scholar] [CrossRef]
  26. Di Tella, M.; Romeo, A.; Benfante, A.; Castelli, L. Mental health of healthcare workers during the COVID-19 pandemic in Italy. J. Eval. Clin. Pract. 2020, 26, 1583–1587. [Google Scholar] [CrossRef]
  27. Moskowitz, S.; Dewaele, J.M. Is teacher happiness contagious? A study of the link between perceptions of language teacher happiness and student attitudes. Innov. Lang. Learn. Teach. 2021, 15, 117–130. [Google Scholar] [CrossRef]
  28. Liu, B.; Zhao, Q.; Jin, Y.; Shen, J.; Li, C. Application of combined model of stepwise regression analysis and artificial neural network in data calibration of miniature air quality detector. Sci. Rep. 2021, 11, 1–12. [Google Scholar]
  29. Hornby, T.G.; Henderson, C.E.; Holleran, C.L.; Lovell, L.; Roth, E.J.; Jang, J.H. Stepwise regression and latent profile analyses of locomotor outcomes poststroke. Stroke 2020, 51, 3074–3082. [Google Scholar] [CrossRef]
  30. Żogała-Siudem, B.; Jaroszewicz, S. Fast stepwise regression based on multidimensional indexes. Inf. Sci. 2021, 549, 288–309. [Google Scholar] [CrossRef]
Figure 1. Research structure.
Figure 1. Research structure.
Electronics 10 01048 g001
Figure 2. The line chart of the total Taipei MRT passenger traffic.
Figure 2. The line chart of the total Taipei MRT passenger traffic.
Electronics 10 01048 g002
Figure 3. The line chart of the annual GNP.
Figure 3. The line chart of the annual GNP.
Electronics 10 01048 g003
Figure 4. The scatter chart of the MRT passenger traffic and GNP.
Figure 4. The scatter chart of the MRT passenger traffic and GNP.
Electronics 10 01048 g004
Figure 5. Line chart of the MRT passenger traffic.
Figure 5. Line chart of the MRT passenger traffic.
Electronics 10 01048 g005
Figure 6. Scatter chart of GNP and MRT passenger traffic.
Figure 6. Scatter chart of GNP and MRT passenger traffic.
Electronics 10 01048 g006
Table 1. Variable selection.
Table 1. Variable selection.
VariableName of VariableVariable Code
Y1MRT passenger trafficPassengers
X1Gross National ProductGNP
X2Gross Domestic ProductGDP
X3Economic growth rateEconomic
X4Number and density of registered financial businessesInFinanden
X5Number of major touristsVisitors
X6Number of listed companiesCompanies
X7Employment populationEmployed
X8Consumer Price IndexCPI
X9Population densityInPopden
X10Personal incomePCI
X11Import Price IndexIPI
X12Total cargo input amountCargoInput
X13MRT train tripsTrains
X14Number of administrative regionsAdministrative
X15MRT train kilometersTrainsKM
X16MRT passenger traffic incomeIncome
X17Number of MRT stationsStations
X18MRT operation kilometersOperationsKM
X19MRT passenger kilometersPassengersKM
X20Round trip transfer discount volumeTransfers
Table 2. Standards for the precision of MAPE evaluation.
Table 2. Standards for the precision of MAPE evaluation.
MAPE ValueStandard
MAPE < 10%Excellent prediction power (the closer to 0, the better)
10% < MAPE < 20%Good prediction power
20% < MAPE < 50%Reasonable prediction power
50% < MAPEPoor prediction power
Table 3. Annual information of the variables in relation to passenger traffic.
Table 3. Annual information of the variables in relation to passenger traffic.
VariableX1X2X3X4...X17X18X19X20Y1
Unit/YearMillionMillion%Km2 (Train)...StationKmPersons/KmPersons (Thousand)Total Persons
19968,146,0928,036,5906.18500...1210.557,226,81010211,174,359
19978,806,8528,717,2416.11509...3232.4243,676,517328231,081,395
19989,449,6929,381,1414.21501...3940.3512,282,67812,22960,737,782
19999,906,1139,815,5956.72495...5656.41,031,342,47221,203126,952,122
200010,490,81810,351,2606.42491...6265.12,042,303,17138,138268,716,740
200110,350,23310,158,209−1.26448...6265.12,223,486,59644,368289,642,714
200210,923,38510,680,8835.57437...6265.12,469,037,31253,093324,433,557
200311,294,73910,965,8664.12433...6265.12,440,488,93472,399316,189,128
200412,021,74411,649,6456.51428...63672,680,355,529125,350350,141,956
200512,383,12012,092,2545.42422...63672,742,372,258127,424360,729,803
200612,952,50212,640,8035.62415...6974.43,002,988,957130,916384,003,220
200713,739,82813,407,0626.52417...6974.43,298,870,463140,044416,229,685
200813,465,59613,150,9500.70418...7075.83,513,969,060152,643450,024,415
200913,375,65012,961,656−1.57428...8290.63,720,991,244153,679462,472,351
201014,548,85214,119,21310.63425...93100.84,123,189,518162,098505,466,450
201114,700,57214,312,2003.80425...94101.94,607,801,411170,963566,404,489
201215,141,10814,686,9172.06427...102112.84,973,666,983177,918594,864,715
201315,646,21115,221,2012.23428...109121.35,234,160,050182,193634,961,083
201416,621,37816,081,7983.74428...116129.25,589,414,250180,133679,506,401
201517,485,37516,881,6143.78429...117131.15,880,980,257173,877717,511,809
Table 4. Forced entry method variables inputted/removed.
Table 4. Forced entry method variables inputted/removed.
ModelVariable InputtedVariable RemovedMethod
1X3, X4, X5, X6, X7, X8, X9, X10, X11, X12, X13, X14, X15, X16, X17, X20 Entry
Table 5. Coefficients of the entry method.
Table 5. Coefficients of the entry method.
Collinearity Statistical Material
Model TSignificancePermissible DeviationVIF
1(Constant)−2.3740.098
X10.6850.5430.09410.601
X32.7870.0690.004263.957
X41.5000.2310.002457.889
X51.4290.2480.0011047.998
X63.5840.0370.0004167.205
X8−1.3090.2820.001835.575
X9−3.5090.0390.0011728.353
X10−1.1150.3460.007143.916
X112.9190.0620.005206.473
X121.1510.3330.007142.958
X13−3.0060.0570.003293.111
X14−1.9130.1520.003313.741
X150.0300.9780.002617.120
X1611.9900.0010.003374.019
X17−0.0980.9280.002656.926
X20−2.3250.1030.003348.165
Table 6. Stepwise regression variables inputted/removed.
Table 6. Stepwise regression variables inputted/removed.
ModelVariable InputtedMethod
1X16Step by step (criteria: F-to-enter probability ≤ 0.050, F-to-remove probability ≥ 0.100)
2X20Step by step (criteria: F-to-enter probability ≤ 0.050, F-to-remove probability ≥ 0.100)
Table 7. Stepwise regression coefficients.
Table 7. Stepwise regression coefficients.
ModelUnstandardized CoefficientsStandardized CoefficientsTSignificance
BStandard ErrorBeta
1(Constant)−12,082,965.6673,520,486 −3.4320.003
X160.04700.999124.8930
2(Constant)−9,989,402.5752,818,390 −3.5440.002
X160.0440.0010.94253.4820
X20193.09954.850.0623.520.003
Table 8. Variables excluded by the stepwise method.
Table 8. Variables excluded by the stepwise method.
ModelBetaTSignificancePartial Correlation
1X10.0310.7990.4360.190
X20.0240.6410.5300.154
X30.0070.7800.4460.186
X4−0.031−2.7050.015−0.549
X5−0.020−1.2340.234−0.287
X60.1002.9490.0090.582
X70.0852.6680.0160.543
X80.0090.3710.7150.090
X90.0801.4110.1760.324
X100.0020.0710.9440.017
X110.0251.9810.0640.433
X120.0160.7820.4450.186
X13−0.008−0.2040.841−0.049
X140.0120.4810.6370.116
X15−0.039−0.8310.418−0.197
X17−0.058−1.7880.092−0.398
X18−0.063−1.8730.078−0.414
X190.2201.4730.1590.336
X200.0623.5200.0030.649
2X1−0.026−0.7500.464−0.184
X2−0.030−0.9000.382−0.219
X30.0040.5620.5820.139
X4−0.015−1.2320.236−0.294
X5−0.007−0.5190.611−0.129
X60.0501.2390.2330.296
X70.0210.4710.6440.117
X8−0.017−0.8040.433−0.197
X9−0.076−1.1580.264−0.278
X10−0.018−0.7080.489−0.174
X11−0.007−0.4470.661−0.111
X12−0.026−1.3060.210−0.310
X13−0.032−1.0150.325−0.246
X14−0.011−0.5190.611−0.129
X15−0.024−0.6370.533−0.157
X17−0.027−0.9450.359−0.230
X18−0.033−1.0930.290−0.264
X190.0900.6980.4950.172
Table 9. Predictive value of regression analysis.
Table 9. Predictive value of regression analysis.
Year of Observation Actual Number of MRT Passengers Predictive Value of Regression AnalysisResidual Value
199611,174,3593,305,1177,869,242
199731,081,39536,987,970−5,906,575
199860,737,78268,054,760−7,316,978
1999126,952,122129,264,711−2,312,589
2000268,716,740270,052,789−1,336,049
2001289,642,714287,079,3952,563,319
2002324,433,557319,060,6115,372,946
2003316,189,128312,874,2543,314,874
2004350,141,956351,632,592−1,490,636
2005360,729,803359,697,2721,032,531
2006384,003,220385,978,346−1,975,126
2007416,229,685421,166,927−4,937,242
2008450,024,415449,687,066337,349
2009462,472,351457,533,2014,939,150
2010505,466,450495,603,1269,863,324
2011566,404,489560,822,8985,581,591
2012594,864,715607,214,292−12,349,577
2013634,961,083638,210,638−3,249,555
Table 10. Year selection of the artificial neural network.
Table 10. Year selection of the artificial neural network.
Evaluation Indicators7 Years/2 Years of Training (1996–2002/2003–2004)6 Years/2 Years of Training (2003–2008/2009–2010)5 Years/2 Years of Training (2009–2013/2014–2015)
MAPE8.26%4.35%2.45%
Table 11. Predictive values of the artificial neural network.
Table 11. Predictive values of the artificial neural network.
Year of Observation Actual Number of MRT Passengers Predictive Values of the Artificial Neural NetworkResidual Value
199611,174,35915,320,8134,146,454
199731,081,39530,226,981−854,414
199860,737,78260,188,236−549,546
1999126,952,122126,305,437−646,685
2000268,716,740267,922,813−793,927
2001289,642,714289,012,656−630,058
2002324,433,557324,181,085−252,472
2003316,189,128327,612,73711,423,609
2004350,141,956349,893,508−248,448
2005360,729,803360,534,282−195,521
2006384,003,220383,858,990−144,230
2007416,229,685415,952,914−276,771
2008450,024,415285,870,926−164,153,489
2009462,472,351462,125,697−346,654
2010505,466,450505,270,408−196,042
2011566,404,489565,977,673−426,816
2012594,864,715616,442,41821,577,703
2013634,961,083630,207,775−4,753,308
Table 12. Summary of the empirical results of the regression analysis and artificial neural network.
Table 12. Summary of the empirical results of the regression analysis and artificial neural network.
Experiment ModuleRegression Analysis MAPE ValueArtificial Neural Network MAPE ValueResults
20 variables6.47%4.82%The artificial neural network is better.
The four variables excluded in the regression analysis were deleted: the GNP, GDP, MRT mileage, and MRT extended passenger mileage.0.94%0.54%Both had the best prediction results.
The four variables with the highest MAPE values were deleted: the economic growth rate, personal income, total amount of goods imported into Taiwan, and number of MRT stations.0.62%0.31%Both had the best prediction results.
19-variable collinearity verification (deletion of the MRT passenger revenue).5.19%4.63%There is no collinearity problem.
Monthly passenger traffic experiment.1.45%0.42%Both had the best prediction results.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Chen, Y.-S.; Lin, C.-K.; Chen, S.-F.; Chen, S.-H. Two Advanced Models of the Function of MRT Public Transportation in Taipei. Electronics 2021, 10, 1048. https://doi.org/10.3390/electronics10091048

AMA Style

Chen Y-S, Lin C-K, Chen S-F, Chen S-H. Two Advanced Models of the Function of MRT Public Transportation in Taipei. Electronics. 2021; 10(9):1048. https://doi.org/10.3390/electronics10091048

Chicago/Turabian Style

Chen, You-Shyang, Chien-Ku Lin, Su-Fen Chen, and Shang-Hung Chen. 2021. "Two Advanced Models of the Function of MRT Public Transportation in Taipei" Electronics 10, no. 9: 1048. https://doi.org/10.3390/electronics10091048

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop