Determination of the Peak Hour Ridership of Metro Stations in Xi’an, China Using Geographically-Weighted Regression

: The ridership of a metro station during a city’s peak hour is not always the same as that during the station’s own peak hour. To investigate this inconsistency, this study introduces the peak deviation coe ﬃ cient to describe this phenomenon. Data from 88 metro stations in Xi’an, China, are used to analyze the peak deviation coe ﬃ cient based on the geographically weighted regression model. The results demonstrate that when the land around a metro station is mainly land for work, primary and middle schools, and residences, its station’s peak hour is consistent with the city’s peak hour. Additionally, the station’s peak hour is more likely to deviate from the city’s peak hour for suburban stations. There are two ridership options when designing stations, namely the extra peak hour ridership during a city’s peak hour and that during a station’s peak hour, and the larger of the two is used to design metro stations. The mixed land use ratio must be considered in urban land use planning, because although non-commuting land can mitigate the tra ﬃ c pressure of a city’s peak hour, it may cause the deviation of the station’s peak hours from that of the city.


Introduction
According to the Transit Capacity and Quality of Service Manual, the calculation procedure for a desired station's platform size includes choosing the corresponding average number of passengers, adjusting for passenger characteristics as appropriate, estimating the maximum passenger demand for the platform at a given time, and calculating the required waiting space by multiplying the average space per person by the maximum passenger demand [1]. Estimating the maximum passenger demand for a platform at a given time is one of the most important steps for station design. In China, the Code for Design of Metro (the most recent edition of which is from 2003) states that the capacity of a metro station must be determined by the extra peak hour passenger flow, which is the predicted peak hour passenger flow multiplied by the extra peak hour factor, which is between 1.1 and 1.4 [2]. The predicted result for a metro station is extracted from the predicted result of the metro network, as the total passenger flow volume of the metro network must be controlled to reduce forecasting errors [3]. Today, macroscopic passenger flow forecasting still uses the four-step model [4], which was primarily developed for the prediction of traffic on a regional scale [5] and the evaluation of large-scale infrastructure projects [6]. Thus, the predicted ridership of a station is always in the city's peak hour, and it determines the station design [7]. However, surveys of urban rail transit operations have shown that, in many areas, a station's peak hours are not always the same as the city's peak hours [8]. When designing stations, designers are clearly biased toward adopting station ridership during the city's

Data Sources
The object of this study is the 88 metro stations in Xi'an, China. It should be noted that the station names consist of two parts. The first component is the number of the line that the station belongs to, so for example, 1# represents Line 1, and the number of the transfer station is the line that was operated earlier. The second part is the initials of the station name. The data includes: 1) The location of metro stations in Xi'an and whether the station is a transfer station or not, according to information collected by the Baidu coordinate pickup system in 2018; 2) The automatic fare collection system (AFC) credit card data of the Xi'an metro in March 2019, provided by Xi'an Metro Co., Ltd.; 3) Land use and building density around stations, which were collected using Baidu Maps. The land use of Xi'an metro stations is shown in Figure 1.

Dependent Variable
A station's peak hour refers to the single hour of the day with the highest hourly ridership of the urban rail transit station, whereas a city's peak hour refers to the single hour of the day that has the highest number of trips throughout the city. For example, the time distributions in the morning of three metro stations are presented in . 1#CLP is in the urban area of Xi'an, 2#BKZ is co-built with a high-speed rail station, and 3#DYT is near a popular tourist spot, the Greater Wild Goose Pagoda Square. The boarding or alighting peak hours of these stations are incongruent with the city's peak morning hour.

Dependent Variable
A station's peak hour refers to the single hour of the day with the highest hourly ridership of the urban rail transit station, whereas a city's peak hour refers to the single hour of the day that has the highest number of trips throughout the city. For example, the time distributions in the morning of three metro stations are presented in . 1#CLP is in the urban area of Xi'an, 2#BKZ is co-built with a high-speed rail station, and 3#DYT is near a popular tourist spot, the Greater Wild Goose Pagoda Square. The boarding or alighting peak hours of these stations are incongruent with the city's peak morning hour.

Dependent Variable
A station's peak hour refers to the single hour of the day with the highest hourly ridership of the urban rail transit station, whereas a city's peak hour refers to the single hour of the day that has the highest number of trips throughout the city. For example, the time distributions in the morning of three metro stations are presented in . 1#CLP is in the urban area of Xi'an, 2#BKZ is co-built with a high-speed rail station, and 3#DYT is near a popular tourist spot, the Greater Wild Goose Pagoda Square. The boarding or alighting peak hours of these stations are incongruent with the city's peak morning hour.     The predicted ridership of metro stations during a city's peak hour is provided in the forecasting of passenger flow. The peak deviation coefficient (PDC) is used in this study to facilitate the conversion of ridership during a city's peak hour to that during a station's peak hour; the coefficient is the ratio between the predicted ridership of a metro station during its own peak hour and the predicted ridership of this metro station during the city's peak hour. The calculation formula of the PDC is as follows: where Ps is the ridership of a metro station during its own peak hour and Pc is the ridership of this metro station during the city's peak hour, and PDC is the peak deviation coefficient. The closer the value of the PDC is to 1, the more similar the ridership at the station's peak hour is to the ridership at the city's peak hour.
In this study, the boarding and alighting PDC values in the morning were calculated separately. PDC b denotes the boarding PDC, and PDC a denotes the alighting PDC; both are dependent variables. The boarding and alighting ridership in a city's peak hour, the difference between ridership during a station's peak hour and the city's peak hour, and the PDC values of Xi'an metro stations in the morning are presented in Figure 5 and Figure 6. Most PDC values are near 1, indicating that these stations' peak hours align with the city's peak hour. However, there are also some stations with a PDC value greater than 1, and some with a PDC value even greater than 1.4. Table 1 presents the morning and evening PDC values of 52 stations of Chongqing metro lines 1, 3, and 6 [41]. It can be seen that although most PDC values are in the range of 1-1.2, 7.69% and 13.46% of stations' morning and evening PDCs, respectively, are found to be greater than 1.2. Thus, the phenomenon of the ridership of a metro station during a city's peak hour not always being the same as that during a station's own peak hour is not a unique case in Xi'an. For the sake of the meticulous design and sustainable development of metro systems, it is important to study the PDC.   The predicted ridership of metro stations during a city's peak hour is provided in the forecasting of passenger flow. The peak deviation coefficient (PDC) is used in this study to facilitate the conversion of ridership during a city's peak hour to that during a station's peak hour; the coefficient is the ratio between the predicted ridership of a metro station during its own peak hour and the predicted ridership of this metro station during the city's peak hour. The calculation formula of the PDC is as follows: where Ps is the ridership of a metro station during its own peak hour and Pc is the ridership of this metro station during the city's peak hour, and PDC is the peak deviation coefficient. The closer the value of the PDC is to 1, the more similar the ridership at the station's peak hour is to the ridership at the city's peak hour.
In this study, the boarding and alighting PDC values in the morning were calculated separately. PDC b denotes the boarding PDC, and PDC a denotes the alighting PDC; both are dependent variables. The boarding and alighting ridership in a city's peak hour, the difference between ridership during a station's peak hour and the city's peak hour, and the PDC values of Xi'an metro stations in the morning are presented in Figure 5 and Figure 6. Most PDC values are near 1, indicating that these stations' peak hours align with the city's peak hour. However, there are also some stations with a PDC value greater than 1, and some with a PDC value even greater than 1.4. Table 1 presents the morning and evening PDC values of 52 stations of Chongqing metro lines 1, 3, and 6 [41]. It can be seen that although most PDC values are in the range of 1-1.2, 7.69% and 13.46% of stations' morning and evening PDCs, respectively, are found to be greater than 1.2. Thus, the phenomenon of the ridership of a metro station during a city's peak hour not always being the same as that during a station's own peak hour is not a unique case in Xi'an. For the sake of the meticulous design and sustainable development of metro systems, it is important to study the PDC. The predicted ridership of metro stations during a city's peak hour is provided in the forecasting of passenger flow. The peak deviation coefficient (PDC) is used in this study to facilitate the conversion of ridership during a city's peak hour to that during a station's peak hour; the coefficient is the ratio between the predicted ridership of a metro station during its own peak hour and the predicted ridership of this metro station during the city's peak hour. The calculation formula of the PDC is as follows: where P s is the ridership of a metro station during its own peak hour and P c is the ridership of this metro station during the city's peak hour, and PDC is the peak deviation coefficient. The closer the value of the PDC is to 1, the more similar the ridership at the station's peak hour is to the ridership at the city's peak hour. In this study, the boarding and alighting PDC values in the morning were calculated separately. PDC b denotes the boarding PDC, and PDC a denotes the alighting PDC; both are dependent variables. The boarding and alighting ridership in a city's peak hour, the difference between ridership during a station's peak hour and the city's peak hour, and the PDC values of Xi'an metro stations in the morning are presented in Figures 5 and 6. Most PDC values are near 1, indicating that these stations' peak hours align with the city's peak hour. However, there are also some stations with a PDC value greater than 1, and some with a PDC value even greater than 1.4. Table 1 presents the morning and evening PDC values of 52 stations of Chongqing metro lines 1, 3, and 6 [41]. It can be seen that although most PDC values are in the range of 1-1.2, 7.69% and 13.46% of stations' morning and evening PDCs, respectively, are found to be greater than 1.2. Thus, the phenomenon of the ridership of a metro station during a city's peak hour not always being the same as that during a station's own peak hour is not a unique case in Xi'an. For the sake of the meticulous design and sustainable development of metro systems, it is important to study the PDC. It can be determined from Figures 5 and 6 that, when the PDC is large, there may be two conditions. The first condition is that the difference between the ridership during a station's peak hour and the city's peak hour is very large compared to that of other stations, such as is the case for 1#KFL and 2#BKZ. The second condition is that the difference between the ridership during a station's peak hour and the city's peak hour is small compared to that of other stations, but the station ridership itself is small, and as a result the PDC becomes large, such as is the case for 3#WZ and 4#SZDD.

Independent Variable Selection
Research on the peak hours of metro stations is still in the initial stage, and is classified as the forecasting of the temporal distribution of station ridership. Therefore, this paper uses the direct station ridership model as a reference, and determines the influence factors of the PDC. The direct station ridership model generally categorizes the influence factors of station ridership into four classes: the built environment, population, social economy, and station characteristics [42].

Built Environment
The peak hours of trips are always different. People's trips can be classified into two kinds according to the consistency of the city's peak hour. The peak hour of the first kind of trip is consistent with the city's peak hours, such as going to work and going to school. The peak hour of the second kind of trip is not consistent with the city's peak hours, such as traveling for shopping. Because different land uses attract different purposes of trips, the ratio of land use ultimately determines the peak hour of the station. Thus, the proportion of the commuting area has an impact on the PDC.
The proportion of the commuting area refers to the proportion of land area that can generate commuter travel to the total land area. According to a 2015 resident travel survey in Xi'an, the city's peak hours are 7:30-8:30 in the morning and 18:00-19:00 in the evening. The temporal distribution of trip purposes is presented in Figure 7. The peak hours of going to work, going to school, and going home fall within the city's peak hours; however, the peak hours of other purposes do not fall within the city's peak hours. In particular, the land for education consists of two kinds of land, namely land for primary and middle schools and land for colleges, which both generate different passenger flows. Most primary-and-middle-school students go to school and go back home every school day, and the temporal distribution of their trip is labeled "Going to school" in Figure 7. Other scholars have also found that home-based school trips are another important part of morning-peak ridership [43], and this peak hour falls within the city's peak hours. However, most college students live and learn at their schools, and only leave occasionally. When they return, their destinations are their colleges, and the temporal distribution is labeled "Going back to college" in Figure 7. Thus, in this paper, the land areas for work, primary and middle schools, and residences (WPR) are used as the commuting area. The proportion of WPR refers to the sum of the land areas for work, primary and middle schools, and residences, which is then divided by the total land area around the station.

Independent Variable Selection
Research on the peak hours of metro stations is still in the initial stage, and is classified as the forecasting of the temporal distribution of station ridership. Therefore, this paper uses the direct station ridership model as a reference, and determines the influence factors of the PDC. The direct station ridership model generally categorizes the influence factors of station ridership into four classes: the built environment, population, social economy, and station characteristics [42].

Built Environment
The peak hours of trips are always different. People's trips can be classified into two kinds according to the consistency of the city's peak hour. The peak hour of the first kind of trip is consistent with the city's peak hours, such as going to work and going to school. The peak hour of the second kind of trip is not consistent with the city's peak hours, such as traveling for shopping. Because different land uses attract different purposes of trips, the ratio of land use ultimately determines the peak hour of the station. Thus, the proportion of the commuting area has an impact on the PDC.
The proportion of the commuting area refers to the proportion of land area that can generate commuter travel to the total land area. According to a 2015 resident travel survey in Xi'an, the city's peak hours are 7:30-8:30 in the morning and 18:00-19:00 in the evening. The temporal distribution of trip purposes is presented in Figure 7. The peak hours of going to work, going to school, and going home fall within the city's peak hours; however, the peak hours of other purposes do not fall within the city's peak hours. In particular, the land for education consists of two kinds of land, namely land for primary and middle schools and land for colleges, which both generate different passenger flows. Most primary-and-middle-school students go to school and go back home every school day, and the temporal distribution of their trip is labeled "Going to school" in Figure 7. Other scholars have also found that home-based school trips are another important part of morning-peak ridership [43], and this peak hour falls within the city's peak hours. However, most college students live and learn at their schools, and only leave occasionally. When they return, their destinations are their colleges, and the temporal distribution is labeled "Going back to college" in Figure 7. Thus, in this paper, the land areas for work, primary and middle schools, and residences (WPR) are used as the commuting area. The proportion of WPR refers to the sum of the land areas for work, primary and middle schools, and residences, which is then divided by the total land area around the station. In China, only large cities have built up urban rail transit. The floating populations of these cities are large, and the populations around the stations are difficult to count. The high density of residential buildings can be attributed to more residents living near the stations [44]. Therefore, in this paper, the built environment is used in place of the population. In a study by Sung [45], the best In China, only large cities have built up urban rail transit. The floating populations of these cities are large, and the populations around the stations are difficult to count. The high density of residential buildings can be attributed to more residents living near the stations [44]. Therefore, in this paper, the built environment is used in place of the population. In a study by Sung [45], the best catchment area for predicting station-level ridership is 500 m; thus, the statistical range of the current built environment is considered to be 500 m. The proportion of the commuting area as calculated by the built environment in a 500 m station catchment area is then used as an independent variable. The spatial distribution of the proportions of the commuting areas of Xi'an metro stations is presented in Figure 8.
Sustainability 2020, 2, x FOR PEER REVIEW 9 of 21 catchment area for predicting station-level ridership is 500 m; thus, the statistical range of the current built environment is considered to be 500 m. The proportion of the commuting area as calculated by the built environment in a 500 m station catchment area is then used as an independent variable. The spatial distribution of the proportions of the commuting areas of Xi'an metro stations is presented in Figure 8. Yu [41] found that the area of undeveloped land around stations is related to the PDC, but no definite results were obtained. Thus, the undeveloped land area refers to the area of the undeveloped land around a station, and is used as an independent variable. The spatial distribution of undeveloped land areas of Xi'an metro stations is presented in Figure 9.   Yu [41] found that the area of undeveloped land around stations is related to the PDC, but no definite results were obtained. Thus, the undeveloped land area refers to the area of the undeveloped land around a station, and is used as an independent variable. The spatial distribution of undeveloped land areas of Xi'an metro stations is presented in Figure 9.
Sustainability 2020, 2, x FOR PEER REVIEW 9 of 21 catchment area for predicting station-level ridership is 500 m; thus, the statistical range of the current built environment is considered to be 500 m. The proportion of the commuting area as calculated by the built environment in a 500 m station catchment area is then used as an independent variable. The spatial distribution of the proportions of the commuting areas of Xi'an metro stations is presented in Figure 8. Yu [41] found that the area of undeveloped land around stations is related to the PDC, but no definite results were obtained. Thus, the undeveloped land area refers to the area of the undeveloped land around a station, and is used as an independent variable. The spatial distribution of undeveloped land areas of Xi'an metro stations is presented in Figure 9.

Distance to the City Center
García [46] determined that the location of a house (close to the sea or city center and located in a certain area) influenced housing prices. Therefore, properties in the same land-use class in different locations will have different prices, reflecting socioeconomic conditions in the area. If the metro station is far from the city center, people have to depart earlier because of the longer travel time. Thus, the distance to the city center is considered as an independent variable in this study.

Betweenness Centrality
If the land use is the same at two stations, the ridership volume of a transfer station is higher than that of a normal station [47]. Existing studies have used these two types of stations as discrete dummy variables [48] when analyzing the differences between transfer stations and normal stations. However, this approach does not consider the differences between transfer stations in different locations. Therefore, the concept of betweenness centrality (BC), which is used in graph theory, is used in the present study to determine the locations and properties of stations in the urban rail transit network; the dummy variable indicates whether the station is a transfer station or not. There is one shortest path to every starting station s to terminal station t, but not all shortest paths pass through station i. The BC value reflects the situation of all shortest paths in the metro network that pass one station, and the formula is as follows: where BC i is the betweenness centrality of station i, ε sit is the number of shortest paths from station s to station t via station i, and ε st is the number of shortest paths in the metro network from station s to station t. The spatial distribution of the BC values of Xi'an metro stations is presented in Figure 10.

Distance to the City Center
García [46] determined that the location of a house (close to the sea or city center and located in a certain area) influenced housing prices. Therefore, properties in the same land-use class in different locations will have different prices, reflecting socioeconomic conditions in the area. If the metro station is far from the city center, people have to depart earlier because of the longer travel time. Thus, the distance to the city center is considered as an independent variable in this study.

Betweenness Centrality
If the land use is the same at two stations, the ridership volume of a transfer station is higher than that of a normal station [47]. Existing studies have used these two types of stations as discrete dummy variables [48] when analyzing the differences between transfer stations and normal stations. However, this approach does not consider the differences between transfer stations in different locations. Therefore, the concept of betweenness centrality (BC), which is used in graph theory, is used in the present study to determine the locations and properties of stations in the urban rail transit network; the dummy variable indicates whether the station is a transfer station or not. There is one shortest path to every starting station s to terminal station t, but not all shortest paths pass through station i. The BC value reflects the situation of all shortest paths in the metro network that pass one station, and the formula is as follows: where BCi is the betweenness centrality of station i, εsit is the number of shortest paths from station s to station t via station i, and εst is the number of shortest paths in the metro network from station s to station t. The spatial distribution of the BC values of Xi'an metro stations is presented in Figure 10.

Summary of Variables
The variables used in this paper are described in Table 2.

Methodology
Based on the ordinary least square (OLS) model, the GWR model introduces local parameters to measure the spatial position, and the spatial characteristics of data are added into the model to show the local spatial characteristics and the instability of the spatial distribution in the research area [49]. The equation of the GWR model can be expressed as follows: where PDC i is the PDC of metro station i, (u i , v i ) denotes the location of station i, β k (u i , v i ) indicates the kth regression parameter at station i, which is a function of the geographical position, and x ik is the independent variable, of which there are four used in this study (the proportion of WPR, the undeveloped land area, the distance to the city center, and BC); their definitions and formulas were introduced in the previous chapter. Additionally, ε i is the normally distributed error term of station i, and p is the total number of stations. After omitting the spatial position term, the resulting equation is as follows: Formula (4) can be expressed as: The weighted least-squares method is used to minimize Formula (6) to calculate the regression parameters of station i:β where W(i) is the weight matrix of the geographically weighted regression at station i, and X is a matrix of explanatory variables. If the sample points of each station remain homogeneous in space, i.e., β(i) is a constant, then the GWR is equivalent to the OLS model. However, in actuality, the assumption of spatial independence or homogeneity is difficult to hold, especially in the related research of urban rail passenger flow; although the land use situation around each station is similar, due to the different spatial locations of each station, their boarding and alighting ridership are very different. It is clear that determining how to choose the desired spatial weight matrix is the key to accurately describing the spatial interaction, which largely determines the quality of the model fitting effect [50]. According to Hanham et al. [51], the Gaussian distance function was chosen in this study to be the basic form of space weight, and the expression is as follows: where d ij is the distance between station i and station j, and b is the kernel bandwidth parameter. Point-by-point regression is performed by using the above method to obtain the regression parameter estimation matrix containing sample points: Thus, the value of the dependent variable can be estimated as follows:

Spatial Autocorrelation and Local Collinearity Test
Before using the GWR model to analyze the relationship between PDC and the independent variables, the collinearity test of the independent variables must be done and one of the collinear variables must be deleted first. Table 3 shows the results of the collinearity test. According to the Mathematical dictionary [52], there are three methods to judge the collinearity of the independent variables. The condition index is more than 10, at least one of the variance ratios is close to 1, and the value of eigenvalue is 0. In Table 3, the condition index in the second line is more than 10, the variance ratio of the 'distance to city center' is 0.86, and the value of eigenvalue is 0.03. So this independent variable is deleted. The last line in Table 3 is the collinearity test result after deleting the 'distance to city center'. Before establishing the GWR model, the Moran I test must be conducted to determine if spatial autocorrelation exists. The results are shown in Table 4. The P-value of the variable is less than 0.05, and the Moran I is between −1 and 1. Thus, spatial autocorrelation exists, and the GWR model is suitable to setting up.

Summary Statistics
The summary statistics of the local coefficients are shown in Table 5. If the Akaike information criterion (AIC) difference between the two models is greater than 3, the goodness of fit is considered to be significantly improved [44], and AIC is converted to corrected Akaike information criterion (AICc) in the case of a small sample. In Table 5, the AICc of GWR are decreased by more than 67 compared with the AICc of the OLS model. The adjusted R 2 values are larger for the GWR (more than 0.8 for the PDCs) than for the OLS model. Cardozo [5] used the GWR model to forecast ridership at the metro station level. He used nine variables and the adjusted R 2 value was 0.7. Thus, the adjusted R 2 values in this paper are acceptable. The values of the undeveloped land area and BC are higher than 0, indicating that these variables have positive influences on the PDC. For the PDC a , the value of proportion of WPR is less than 0, demonstrating that the variable has a negative influence on the PDC, whereas for the PDC b , the value is near 0.

Spatial Distributions of the Coefficients
The spatial distributions of the regression coefficients are presented in Figure 11, from which it is evident that the spatial distributions of the same coefficients exhibit differences for different PDCs.
For the proportion of WPR, the regression coefficients of PDC b have both positive and negative numbers, and the values are small, ranging from −2.5 to 1.0. This demonstrates that this variable does not have a dominant influence on the PDC. For the spatial distribution of the proportion of WPR, the coefficients are close to 0.0 in the center and the northeast of the city, the coefficients are less than 0.0 in the north, south, and east of the city, and the coefficients are greater than 0.0 in the west of the city. In the northeast of the city, the land has not been maturely developed. In the north, south, and east of the city, there are three sub-centers near the stations 2#FCWL, 2#XZ, and 1#FZC. To the northwest of Xi'an, there is another city-Xianyang. The two cities are so close that the linear distance from 1#HWZ to the city center of Xianyang is only about 10 km. The office space in this area is uncertain, and the influences of the proportions of WPR on these stations are small. The regression coefficients of PDC a are all negative, indicating that the proportion of WPR has a negative influence on the PDC. The regression coefficients are higher in the city center and lower in the periphery. Because the regression coefficients are all negative, the influence increases from the center to the periphery.
For the undeveloped land area, the range of the PDC a values is slightly larger than that of the PDC b values. The spatial distributions are approximately the same, and the larger values occur in the periphery. By comparing Figures 9 and 11c,d, it is clear that there are few stations in Metro Line 2 (the north-south line) that have undeveloped land, but the regression coefficients are substantially different. For the stations in the southeast and northeast of the city with large undeveloped land areas, the regression coefficients are still substantially different. This indicates that undeveloped land areas have uncertain influences on both the PDC b and PDC a values, which is consistent with the fact that trips to undeveloped land occur relatively seldom.

Station Classification
Because the undeveloped land area has an uncertain influence on the PDC, the k-means method was used to classify stations with undeveloped land areas of 0. The classifying variables are the proportion of WPR, BC, PDC b , and PDC a , and the results are presented in Table 6 and Figure 12. The PDC b and PDC a values in the 1st, 3rd, and 5th kinds of stations are all less than 1.17, and the For the BC, the range of values is similar to those of the PDC b and the PDC a . Most values are positive, indicating that this variable has a positive influence on the PDC. The coefficients of the variables are smaller in the city center and larger in the periphery, which means that the BC has a greater influence on diverging a station's peak hour from the city's peak hour in suburban areas.

Station Classification
Because the undeveloped land area has an uncertain influence on the PDC, the k-means method was used to classify stations with undeveloped land areas of 0. The classifying variables are the proportion of WPR, BC, PDC b , and PDC a , and the results are presented in Table 6  There are two stations of the 2nd kind, namely 1#KYM and 4#DMG. 1#KYM has a large amount of industrial land and many residential districts established by a factory manager, and these lands do not produce medium-or long-distance travel. 4#DMG is near cultural relics and historic sites, including Daming Palace National Heritage Park, but this area is not as developed as the Greater Wild Goose Pagoda Square, and the station only sees 276 people during the city's peak hour and 340 people during the station's peak hour.
Thus, the proportion of WPR has the greatest influence on the PDC. If the proportion of WPR is less than 0.5, the PDC b and PDC a values are both very large; if the proportion of WPR is greater than 0.5, most PDC b and PDC a values are close to 1.

Discussion-Future Station Design and Policy
The enlargement coefficient put forward in this paper, PDC, can be used as a simple way to convert the ridership during a city's peak hour to the ridership during a station's peak hour. The proportion of the WPR was found to have a negative influence on the PDC a value. In other words, the larger the lands for work, primary and middle schools, and residences, the smaller the deviation of ridership between a station's peak hour and the city's peak hour, as the commuting trip during workdays constitutes the city's peak hour. This result is consistent with the results of previous studies [8,53] that investigated the metro ridership in Osaka, Shanghai, and Zhengzhou, and found that trips of going to work and going to school make the station's peak hour earlier, while shopping and traveling trips delay the station's peak hour.
If the proportion of WPR of a station is greater than 0.5, it can be considered that the ridership during the city's peak hour is the highest ridership of the whole day; if it is less than 0.5, the highest ridership is the ridership during the city's peak hour multiplied by the PDC. This result is consistent with the findings of Yu [41], who examined two cities-Xi'an and Chongqing-and found that the PDC value of most metro stations is close to 1 when the proportion of WPR is greater than 0.5.
In the morning, the proportion of WPR has more influence on the alighting ridership than on the boarding ridership. For stations with proportions of WPR of greater than 0.5, if it is a special type of land, such as the 1#KYM and 4#DMG stations, the PDC b value is close to 1, but the PDC a value is greater among the PDC a values of all stations. Moreover, the regression coefficients of PDC a are negative, but the regression coefficients of PDC b are both positive and negative numbers. This means that the proportion of WPR results in the alighting ridership occurring during a city's peak hour, but does not have a clear effect on the boarding ridership. It indicates that the lands for work, primary and middle schools, and residences has more explanatory power regarding the peak hour deviation in the alighting during morning peak hours. The lands for work, primary and middle schools mostly attract office workers and students who need to arrive on time or ahead of schedule. Compared with their boarding behavior, their alighting behavior has a relatively clear arrival time. For different enterprises or schools, the time is almost the same, and is the same as the city's peak hour. However, the boarding times will present large differences because of the distance between home and work. In China, the administrative land is more concentrated than residential land [54],

Discussion-Future Station Design and Policy
The enlargement coefficient put forward in this paper, PDC, can be used as a simple way to convert the ridership during a city's peak hour to the ridership during a station's peak hour. The proportion of the WPR was found to have a negative influence on the PDC a value. In other words, the larger the lands for work, primary and middle schools, and residences, the smaller the deviation of ridership between a station's peak hour and the city's peak hour, as the commuting trip during workdays constitutes the city's peak hour. This result is consistent with the results of previous studies [8,53] that investigated the metro ridership in Osaka, Shanghai, and Zhengzhou, and found that trips of going to work and going to school make the station's peak hour earlier, while shopping and traveling trips delay the station's peak hour.
If the proportion of WPR of a station is greater than 0.5, it can be considered that the ridership during the city's peak hour is the highest ridership of the whole day; if it is less than 0.5, the highest ridership is the ridership during the city's peak hour multiplied by the PDC. This result is consistent with the findings of Yu [41], who examined two cities-Xi'an and Chongqing-and found that the PDC value of most metro stations is close to 1 when the proportion of WPR is greater than 0.5.
In the morning, the proportion of WPR has more influence on the alighting ridership than on the boarding ridership. For stations with proportions of WPR of greater than 0.5, if it is a special type of land, such as the 1#KYM and 4#DMG stations, the PDC b value is close to 1, but the PDC a value is greater among the PDC a values of all stations. Moreover, the regression coefficients of PDC a are negative, but the regression coefficients of PDC b are both positive and negative numbers. This means that the proportion of WPR results in the alighting ridership occurring during a city's peak hour, but does not have a clear effect on the boarding ridership. It indicates that the lands for work, primary and middle schools, and residences has more explanatory power regarding the peak hour deviation in the alighting during morning peak hours. The lands for work, primary and middle schools mostly attract office workers and students who need to arrive on time or ahead of schedule. Compared with their boarding behavior, their alighting behavior has a relatively clear arrival time. For different enterprises or schools, the time is almost the same, and is the same as the city's peak hour. However, the boarding times will present large differences because of the distance between home and work. In China, the administrative land is more concentrated than residential land [54], which will lead to the concentrated distribution of commuting-alighting passenger flow.
The local coefficient of BC is greater than 0, meaning it has an influence on the deviation of the peak hours of the station and city. For the spatial distribution, the BC is greater in suburban areas, indicating that the BC has a greater influence on diverging a station's peak hour from the city's peak hour in suburban areas. This is because commuters who live in suburban areas and work in the city center need to leave earlier. This finding is reasonable, as housing prices in suburban areas are lower; people are more willing to live in these areas, thus increasing their time spent on the metro and affecting the station's peak hour. This evidence is consistent with the results of previous studies [43], which found that more people want to live in suburban areas, resulting in the increased metro travel demands of these areas.
In China, station design must consider the extra peak hour passenger flow, which is the predicted peak hour passenger flow multiplied by the extra peak hour factor, which is between 1.1 and 1.4 [2]. The extra peak hour factor (EPHF) is the highest fifteen-minute ridership during the city's peak hour multiplied by 4, and then divided by the ridership during the city's peak hour [55]. The EPHF is the expanded threshold of the station's capacity in the city's peak hour. However, this study shows that some stations' own peak hours are inconsistent with the city's peak hour because of various land use and BC around the stations, and the peak load shifting is formed. This results in that the EPHF does not have constraints to these peak load shifting stations' capacities. Thus, this paper put forward the PDC to depict this inconsistent phenomenon of the station's peak hour. Although the EPHF and the PDC both reflect the temporal distribution of metro stations, they are totally different. The EPHF reflects the concentration of passenger flow in a city's peak hour, while the PDC reflects the inconsistency between a station's peak hour and the city's peak hour. There is no comparability between the two coefficients. Moreover, the thinking methods about the EPHF and the PDC are completely different from each other. For example, stations with large proportions of administrative land usually have high EPHF values because of the instantaneous gathering of commuting ridership [56]. But the greater commuting ridership results in the high consistency between the station's peak hour and that of the city, leading to a PDC value close to 1. By contrast, stations with large proportions of commercial land usually have low EPHF values because of the randomness of the shopping flow [57]. However, the peak hour of shopping flow lags behind the city's peak hour, leading to the station's peak hour being highly inconsistent with the city's, and this increases the PDC value. Thus, there are two ridership options when designing stations, namely the extra peak hour ridership during a city's peak hour and ridership during a station's peak hour.
The morning boarding and alighting PDC and EPHF values of Xi'an metro stations are shown in Figures 13 and 14 . Therefore, the size relationship between the extra peak hour passenger flow during a city's peak hour and the ridership during a station's peak hour cannot be determined. If only one kind of coefficient is used to design the station, the scale of some of the station will be small. Thus, when a station is designed, both types of ridership (the extra peak hour ridership during a city's peak hour and ridership during a station's peak hour) must be calculated, and the larger of the two is used to design the station. Therefore, the size relationship between the extra peak hour passenger flow during a city's peak hour and the ridership during a station's peak hour cannot be determined. If only one kind of coefficient is used to design the station, the scale of some of the station will be small. Thus, when a station is designed, both types of ridership (the extra peak hour ridership during a city's peak hour and ridership during a station's peak hour) must be calculated, and the larger of the two is used to design the station.  The land development in the center of Xi'an has been completed, and the city is now faced with land replacement and land development in urban areas. Land replacement refers to moving industrial land from the city center to urban areas and changing the land to other types of property. Non-commuting land can mitigate the traffic pressure of a city's peak hour, because its travel peak is different from that of commuting land. However, the mixed land use ratio must be considered. More non-commuting land around metro stations will result in the deviation of their peak hours from those of the city. This will increase the difficulty of both passenger flow forecasting in the planning stage and train departure intervals and station management in the operation stage. Because there is only one train departure interval in a period of time in a section of a metro line, if a station's peak hour is different from that of other stations, the train departure interval will be greater in this station's peak hour, and passengers will pile up. However, less non-commuting land around metro stations will result in a large extra peak hour passenger flow and a small off-peak hour passenger flow. Although a significant amount of money may be spent on building a large station, it may be nearly empty for most of the day. Thus, the proportion of commuting land around metro stations must be considered in land planning.

Conclusion
In this study, we investigated the differences in the ridership of metro stations between the stations' peak hours and the city's peak hours for 88 metro stations in Xi'an. The enlargement coefficient put forward in this paper, PDC, can be used as a simple way to convert the ridership during a city's peak hour to the ridership during a station's peak hour. GWR was used to determine the influencing factors on the ridership volume and specifically on the PDC. The key findings are as follows: The proportion of WPR is found to have a negative influence on the PDC a value. In the morning, the proportion of WPR has more influence on the alighting ridership than on the boarding ridership. If the proportion of WPR of a station is greater than 0.5, it can be considered that the The land development in the center of Xi'an has been completed, and the city is now faced with land replacement and land development in urban areas. Land replacement refers to moving industrial land from the city center to urban areas and changing the land to other types of property. Non-commuting land can mitigate the traffic pressure of a city's peak hour, because its travel peak is different from that of commuting land. However, the mixed land use ratio must be considered. More non-commuting land around metro stations will result in the deviation of their peak hours from those of the city. This will increase the difficulty of both passenger flow forecasting in the planning stage and train departure intervals and station management in the operation stage. Because there is only one train departure interval in a period of time in a section of a metro line, if a station's peak hour is different from that of other stations, the train departure interval will be greater in this station's peak hour, and passengers will pile up. However, less non-commuting land around metro stations will result in a large extra peak hour passenger flow and a small off-peak hour passenger flow. Although a significant amount of money may be spent on building a large station, it may be nearly empty for most of the day. Thus, the proportion of commuting land around metro stations must be considered in land planning.

Conclusions
In this study, we investigated the differences in the ridership of metro stations between the stations' peak hours and the city's peak hours for 88 metro stations in Xi'an. The enlargement coefficient put forward in this paper, PDC, can be used as a simple way to convert the ridership during a city's peak hour to the ridership during a station's peak hour. GWR was used to determine the influencing factors on the ridership volume and specifically on the PDC. The key findings are as follows: The proportion of WPR is found to have a negative influence on the PDC a value. In the morning, the proportion of WPR has more influence on the alighting ridership than on the boarding ridership. If the proportion of WPR of a station is greater than 0.5, it can be considered that the ridership during the city's peak hour is the highest ridership of the whole day; if it is less than 0.5, the highest ridership is the ridership during the city's peak hour multiplied by the PDC.
The BC has an influence on the deviation of the peak hours of the station and city, and the BC is greater in suburban areas, indicating that the BC has a greater influence on diverging a station's peak hour from the city's peak hour in suburban areas.
Thus, if a metro station is primarily surrounded by commuting land (such as if the proportion of commuting land area is greater than 0.5 in Xi'an), does not have special land (such as an external transportation hub), and is located in the city center, its PDC is close to 1.
When designing a metro station, there are two ridership options, namely the extra peak hour ridership during a city's peak hour and ridership during a station's peak hour. The size relationship between the extra peak hour passenger flow during a city's peak hour and the ridership during a station's peak hour cannot be determined. The larger of the two is used to design the station. However, for the stations to accord with the conditions in the previous paragraph, they only need to consider the extra peak hour factor, as their values of PDC are close to 1.
When performing urban land use planning, the mixed land use ratio must be considered. Non-commuting land can mitigate the traffic pressure of a city's peak hour, but at the same time more non-commuting land around metro stations will result in the deviation of their peak hours from those of the city. This will increase the difficulty of both passenger flow forecasting in the planning stage and train departure intervals and station management in the operation stage.
The transfer passenger volume cannot be counted by the automatic fare collection system directly, and its correctness cannot be verified. Thus, this research only considered the boarding and alighting volumes, and did not account for transfer passengers in interchange stations; an inconsistent peak hour phenomenon also occurs for transfer passengers, which would influence the design of transfer channels. Therefore, this topic will be investigated in future research.