Interpreting Regional Functions Around Urban Rail Stations by Integrating Dockless Bike Sharing and POI Patterns: Case Study of Beijing, China

Liu, Siyang; Rong, Jian; Zhou, Chenjing; Guo, Miao; Sun, Haodong

doi:10.3390/urbansci10010001

Open AccessArticle

Interpreting Regional Functions Around Urban Rail Stations by Integrating Dockless Bike Sharing and POI Patterns: Case Study of Beijing, China

by

Siyang Liu

¹

,

Jian Rong

²,

Chenjing Zhou

^2,*,

Miao Guo

³ and

Haodong Sun

⁴

¹

Key Laboratory of Highway Engineering of Ministry of Education, Changsha University of Science and Technology, Changsha 410114, China

²

School of Civil Engineering and Transportation, Guangzhou University, Guangzhou 510006, China

³

Faculty of Transportation Engineering, Kunming University of Science and Technology, Kunming 650550, China

⁴

Beijing Municipal Institute of City Planning and Design, Beijing 100045, China

^*

Author to whom correspondence should be addressed.

Urban Sci. 2026, 10(1), 1; https://doi.org/10.3390/urbansci10010001

Submission received: 5 August 2025 / Revised: 10 December 2025 / Accepted: 17 December 2025 / Published: 19 December 2025

(This article belongs to the Special Issue Transit-Oriented Land Development and/or 15-Minute Cities)

Download

Browse Figures

Versions Notes

Abstract

Identifying area functions around urban rail transit (URT) stations is crucial for optimizing urban planning and infrastructure allocation. Traditional methods relying on static land-use data fail to capture dynamic human–environment interactions, while emerging mobility datasets suffer from spatial granularity limitations. This study bridges this gap by integrating spatiotemporal patterns of dockless bike sharing (DBS) with Point of Interest (POI) configurations to characterize station functions. Taking Beijing as a case study, we develop a cluster analysis framework that synthesizes DBS density fluctuations, parking distribution shifts between day/night periods, and POI features. Cluster results reveal functionally distinct station groups with statistically significant differences in both DBS usage patterns and POI distributions. Critically, high-density urban cores exhibit concentrated bicycle usage aligned with mixed POI agglomerations, while suburban zones demonstrate commuter-oriented fluctuations with evening residential surges. This alignment between DBS-derived activity signatures and POI-based land-use features provides actionable insights: planners can optimize bicycle parking in residential clusters, calibrate last-mile connections in employment cores, and adapt infrastructure to localized functional transitions—ultimately enhancing URT-integrated sustainable development.

Keywords:

urban rail transit; land-use; regional function; transit-oriented development

1. Introduction

Urban rail transit (URT) has become a prevailing travel mode in major cities worldwide, contributing to compact urban forms, relieving traffic congestion, and reducing emissions [1,2]. The global URT infrastructure has undergone rapid development in past decades, a trend markedly exemplified by the expansion in Beijing, China, where 27 URT lines and over 400 URT stations are now in operation [3,4]. Meanwhile, with the construction of URT lines and stations, station areas gradually develop distinct functions like residential, employment, or commercial clusters. Station areas are designed to deliver specific urban services, and their functionality is shaped by the interplay between daily human activities and the built environment [5,6]. However, as these functions evolve with planning adjustments and behavioral shifts, accurately identifying their current functional characteristics is crucial for effective urban planning and management.

In the context of global efforts toward sustainable urban development, transit-oriented development (TOD) has emerged as a dominant paradigm, which emphasizes high-density, mixed-use development to curb urban sprawl and reduce carbon emissions. Identifying station-area functions is not merely an academic exercise; it is fundamental to realizing the goals of TOD [7,8]. It enables the precise optimization of the last mile, a critical yet often inefficient segment of urban travel, ensuring that URT systems truly function as the backbone of a compact urban form. However, traditional methods for identifying functional areas are mostly based on static land-use classifications, which fail to capture functional dynamics as they ignore facility density and human activity patterns [9]. A commonly employed framework is the Node-Place (N-P) model developed by Bertolini [10] in 1996, which has since been enriched through the incorporation of new dimensions and indicators in various contexts [11,12,13]. Nevertheless, the N-P model primarily focuses on built environment metrics reflected by point of interest (POI) data, often neglecting dynamic human-environment interactions. While traditional data sources have limitations, emerging mobility data offer potential solutions. Recent studies have used cellular data and smart card records to capture these interactions [14,15,16,17]. However, the accuracy of cellular signaling data is constrained by the coarse spatial resolution of cell towers, which fails to capture fine-grained activities within station areas. Similarly, smart card data are typically confined to transactions within the station; their integration with multi-source heterogeneous datasets remains challenging, particularly for tracing multi-modal journeys beyond the station gates.

In recent years, the rise of dockless bike sharing (DBS) has set a new path for discovering station-area functions through travel behavior. Due to their flexibility and low cost, DBS systems have expanded globally, serving millions of daily trips and providing station-free, accessible first/last-mile connections to public transport [18,19,20]. Users generate detailed spatiotemporal data by renting and returning bikes near their trip origins and destinations. These data can help infer the functional characteristics of an area based on the spatiotemporal patterns of trip origins and destinations. For example, a station area with high DBS density in the evening and low density during the day is likely residential, whereas an area with the opposite pattern is likely employment-oriented [9,21].

To bridge the abovementioned gaps, this study aims to develop an integrated framework that integrates the dynamic spatiotemporal patterns of DBS (specifically day–night variations) with static POI configurations to identify and characterize the functional types of URT station areas in Beijing. In contrast to prior studies that primarily used DBS for city-scale land use classification or generic travel pattern recognition, we focus on station-level functional heterogeneity by quantifying the day–night differentials in DBS usage. Our key contribution is the use of day–night DBS density and distribution ratios to infer functional shifts. This dynamic dimension is often overlooked in traditional models that rely on static built-environment indicators or coarse-grained mobility data, which lack the spatiotemporal resolution to capture station-area specific dynamics. Accordingly, we formulate the following research questions (RQs) and hypotheses (H) to guide our inquiry. The novelty of this research lies in its integrated approach that leverages day–night DBS dynamics to infer functional shifts at the station level, a dimension often overlooked in static models or coarse-grained mobility data analyses.

RQ1:

Can the spatiotemporal patterns of dockless bike sharing (DBS), specifically day–night variations in density and parking distribution, be used to classify urban rail transit (URT) stations into functionally distinct types?

H1:

URT stations can be classified into distinct functional types based on their spatiotemporal DBS usage patterns (e.g., nighttime density, daytime parking distance, and their day–night ratios). This hypothesis is testable via cluster analysis and is policy-relevant for identifying stations with unique mobility signatures that require tailored infrastructure planning.

RQ2:

What are the characteristic differences in DBS usage patterns (e.g., intensity, concentration, and temporal variation) across the different station clusters identified?

H2:

There will be statistically significant differences in DBS usage metrics among the different station clusters. We hypothesize directional patterns, such as urban core clusters exhibiting higher DBS density and more concentrated parking than suburban clusters. Confirming this helps calibrate last-mile services according to local demand patterns.

RQ3:

Do the station types identified by DBS data exhibit significant differences in their static built environment, as represented by POI configurations?

H3:

The clusters derived from DBS data will show statistically significant differences in their POI profiles. Specifically, we hypothesize that clusters with high daytime DBS density will correlate with higher office POI density, while those with high nighttime density will correlate with higher residential POI density. Validating these bridges the gap between dynamic mobility behavior and static land use, enabling a more holistic understanding of station area functions.

The answers to these questions are not merely academic; they provide actionable insights for optimizing bicycle infrastructure planning, calibrating last-mile connections, and fostering TOD principles, ultimately enhancing URT-integrated sustainable urban development.

The structure of this paper is outlined below. Section 2 reviews relevant literature and provides a formal statement of the research problem. Section 3 describes the data and method used in this paper. The results and discussion are presented in Section 4 and Section 5, followed by the conclusion in Section 6.

2. Literature Review

Various researchers attempt to capture the station area function through the analysis of built environment or activities performance. As part of our research, related work pertaining to all aspects of function identification was conducted as follows.

2.1. The N-P Model and Its Improvements

Since the N-P model was first proposed by Bertolini in 1996 [10], it has been continuously optimized and improved and has been widely used for interpreting regional functions, particularly near URT stations around the world [12,22]. As presented in Figure 1 (left), the traditional N-P model classifies URT stations into five statuses, i.e., dependence, stress, balance, unbalanced node, and unbalanced place, according to their node (transportation conditions) features and place (land-use) features. Due to the interplay of transportation and land use in practice, most URT stations fall into the balanced zone. Stressed URT stations have higher utilization of both transport and land resources. Further development of stressed stations can lead to conflicting resources. The competition for space from land-use and transportation conditions in the dependent URT stations is minimal. Regional transportation system of unbalanced node-type URT stations is more developed compared to urban activities, which can further increase land development to bring more travel activities. The unbalanced place-type URT stations have significantly higher regional activity but relatively weak transportation infrastructure [6].

Although the traditional N-P model explains the relative interrelationship between urban land use and transportation development, there are no integrated and universally accepted indicator systems that have been applied to classify URT stations. Consequently, as presented in Figure 1 (right), a large number of scholars optimize the indicator system to adapt to local development characteristics [6,23]. In addition to optimizing the indicators to adapt to regional features, some scholars have attempted to incorporate new dimensions into the N-P model to achieve a more comprehensive classification [22,24]. Despite these improvements in indicators and analytical dimensions, Bertolini’s N-P model was originally proposed as an empirical classification framework lacking precise delineation criteria, and subsequent scholars have basically given suggestions along these lines. The upgraded algorithms, clustering methods [25,26], and potential class methods [27] have been applied to solve the problem of unclear division criteria of the N-P models.

2.2. The Usage of Mobility Data for Identifying Functional Areas

Most of the studies mentioned above rely on static indicators for functional identification, overlooking the dynamic interactions between human activities and the built environment. The emerging mobility data based on location-acquisition technologies offers solutions for identifying functional areas. Smart card data, mobile phone data, taxi data are commonly applied for the identification of functional areas [9]. For instance, Zhang et al. [28] use taxi trajectory data to construct a spatially embedded network model, revealing that overlapping urban structures—particularly short-distance trips—better reflect daily life functions than administrative divisions. Similarly, Zhang et al. [29] integrate metro passenger flow and POI data via a heterogeneous graph neural network, achieving 81.56% classification accuracy and demonstrating how crowd mobility data enhances functional identification. Further advancing this field, Sun et al. [30] leverage a spatio-temporal Transformer to fuse remote sensing and social media data, outperforming traditional methods by explicitly modeling cross-modal interactions.

DBS data, with its high spatiotemporal resolution and ability to reflect detailed mobility patterns, offers unique advantages in capturing human–environment interactions within station areas [21,31]. While Chang [5] demonstrated its utility in identifying six travel behavior patterns through POI correlations, and Zhao [21] proved its standalone capability in land-use characterization at city scale, few studies have specifically explored its potential for station-level function identification. Particularly, the combination of DBS’s granular movement data (e.g., precise origin-destination tracking) and temporal dynamics (e.g., peak/off-peak variations) remains underutilized for delineating station-area functions.

2.3. The Methods Used for Identification

With the development of computer technology, many studies took tentative steps towards the function discovery based on advanced algorithms, including hierarchical cluster [32,33], two steps cluster [26,34] and latent class method [27,35]. For example, Carol [33] employed a hierarchical clustering approach to categorize station-area functions in metropolitan Phoenix, Arizona, identifying five functional types based on transportation infrastructure, socio-demographic attributes, and land use. Similarly, Kumar [34] introduced a two-step clustering technique incorporating six variables, such as network density, intersection density, and proximity to transit, to classify 47 neighborhoods in the Delhi metropolitan region into six types, including low-density suburban zones and peri-urban residential areas with limited job access. In addition, Higgins [35] further applied a model-based latent class clustering method to 372 current and planned rapid transit stations in Toronto, distinguishing ten station functions, such as urban commercial cores and mixed-use centers.

Despite these contributions, three critical gaps persist. First, while N-P model extensions and clustering methods enable station classification, they rely heavily on static built-environment indicators that cannot capture dynamic human–environment interactions. Second, although mobility data address this limitation, their coarse spatial resolution or confinement to station interiors hinders fine-grained analysis of station-area functions. Third, existing DBS studies focus primarily on city-scale land use or generic travel patterns, neglecting station-level functional heterogeneity. While previous studies have indeed pioneered the integration of DBS and POI data for urban function recognition, their focus has largely been on spatial patterns at a macro scale. However, leveraging DBS’s high temporal resolution, particularly its capacity to capture day–night variations, to reveal dynamic functional shifts at the station level is an area that remains significantly underexplored. To bridge these gaps, we integrate DBS spatiotemporal dynamics (with a particular emphasis on day–night differentials) with POI features, applying cluster analysis to classify and interpret functional zones across Beijing’s rail stations, directly addressing the research questions that how DBS spatiotemporal patterns and POI data jointly identify and characterize URT stations.

3. Data and Methodology

3.1. Study Area

We take Beijing, the capital of China, as a case for research. As a representative major city in a developing country, Beijing has undergone rapid urbanization in recent decades, marked by substantial expansion and intensification of its built-up areas. According to projections, by the end of 2023, the city’s permanent population is expected to reach 21.86 million, distributed across 16 districts that cover a total administrative area of 16,410 km² [36]. As of 2023, there are 27 urban rail transit operating lines in Beijing, with 490 stations and an operating mileage of 836 km. The planned rail transit stations achieve 800 m radius coverage encompassing 50% of the city’s total residential population and 56% of employment positions, while delivering 71% residential population coverage and 76% employment coverage within central urban areas. This study focuses on 335 operational URT stations in Beijing due to data availability (Figure 2).

3.2. Data Description

This study integrates two data types to identify station-area functions through web crawling and subsequent analysis. Data on urban rail transit (URT) and points of interest (POIs) were collected via the Amap API (http://ditu.amap.com/). Specifically, we acquired the locations of three POI categories in December 2019, including residential, office, and entertainment facilities such as internet cafes, bars, and clubs. DBS has been growing rapidly since it entered Beijing in September 2016. By the end of 2018, nine DBS companies launched 1.91 million dockless bikes with 610 million orders in Beijing. Among all companies, Mobike is reported as the biggest DBS enterprise with 839,000 average daily riders. Given its dominant market position and data availability, we selected Mobike as the source for DBS data and obtained the data on 24–25 May 2020. We acknowledge that a two-day snapshot does not capture weekly or seasonal variations in DBS usage. However, for the purpose of establishing a methodological framework and revealing the relative spatial differences in DBS patterns across stations, data from consistent, representative days are sufficient. The analysis dates (24–25 May 2020) were selected to represent typical operational conditions: both days were clear weekdays without extreme weather or major holidays, thus minimizing atypical behavioral shocks. It should be noted that the data period falls within the post-lockdown phase of the COVID-19 pandemic in Beijing. While travel patterns may have been in a state of flux, the core objective of this study, to classify stations based on their relative functional signatures, remains valid. The potential influence of residual pandemic effects constitutes a context for interpreting the results rather than invalidating the methodological approach.

All data sources are transformed into the variables shown in Table 1 for the analysis. The DBS night density (N_den) and daytime density (D_den) are calculated as the ratios of total number dockless bikes for each area to the buffer area at night (23:00) and daytime (14:00), respectively. The DBS night distance (N_dis) and daytime distance (D_dis) are valued as the average distance from the DBS’s location to the station center at night (23:00) and during the daytime (14:00), respectively. The ratio of D_den to N_den (R_D2N_1) and the ratio of D_dis to N_dis (R_D2N_2) reflect the shifts in DBS usage from the daytime to the night time.

Figure 3, for example, shows the changes in DBS density from daytime to night around two stations, inferring potential functions for the stations. In station A, the DBS density is higher during the daytime than at night. One possible explanation is that users ride dockless bikes from their origins to the station area and park them near the station during the daytime, and they take dockless bikes from the station area to the destination at night, which indicates that station A is a probable daily activity-oriented area (e.g., office area). Meanwhile, station B probably is a night activity-oriented area (e.g., residential area) due to the higher density at night than during the daytime.

Note that N_dis and D_dis measure the degree of concentration or dispersion of DBS around the station. For example, as shown in Figure 4, the DBS in station A is dispersed while it is concentrated in station B, suggesting that N_dis is longer in station A than in station B.

Similarly to measuring the distribution of data values around the statistical mean through standard deviation, the N_dis and D_dis values are given by Equation (1).

N_d i s (D_d i s) = \sqrt{\frac{{\sum_{i = 1}^{n} (x_{i} - \bar{X})}^{2}}{n} + \frac{{\sum_{i = 1}^{n} (y_{i} - \bar{Y})}^{2}}{n}}

(1)

where x_i and y_i are the coordinates of dockless bike i,

\bar{X}

and

\bar{Y}

are the average centers of dockless bikes in the station area, and n is the total number of dockless bikes.

3.3. Methodology

The overall analytical framework for identifying and interpreting station area functions is illustrated in Figure 5, which consists of three main steps: (1) data preparation and variable selection, (2) cluster analysis and interpretation, and (3) validation and functional interpretation.

After data collection, six DBS-derived variables (N_den, N_dis, D_den, D_dis, R_D2N1, R_D2N2) were calculated for each station. To avoid multicollinearity in the subsequent cluster analysis, Pearson correlation analysis was conducted. Variables with a correlation coefficient greater than 0.8 were considered highly redundant. Based on this analysis, all selected variables were then standardized using the z-score method to eliminate the influence of different units.

The K-means algorithm was employed for clustering due to its interpretability, simplicity, and proven effectiveness in generating distinct, policy-relevant typologies from urban mobility data, similar to prior studies [37,38,39]. Compared to other clustering methods such as hierarchical clustering (which is computationally intensive for large samples) or DBSCAN (which is sensitive to parameter tuning and performs poorly with clusters of varying densities), K-means offers a robust and scalable solution suitable for our study’s objective of partitioning stations into distinct functional types [40,41]. A critical preliminary step is determining the optimal number of clusters (k). We utilized the Elbow method by plotting the Sum of Squared Errors (SSE) and the Average Silhouette Coefficient Score (ASCS) for a range of k values to identify the k value where the rate of decrease in SSE sharply slows down and the ASCS remains relatively high and stable [42]. After determining the optimal k, the K-means algorithm was executed to group the stations into distinct clusters.

The derived clusters were validated and interpreted in two ways. First, the statistical significance of differences in POI densities across clusters was tested using a one-way Analysis of Variance (ANOVA). Then, the functions of each cluster were interpreted by synthesizing the characteristics of both DBS usage patterns and POI configurations (e.g., density, diversity, and the ratio of core to periphery POIs). In particular, the POI density is measured by the average number of POIs located within a unit buffer area. The diversity indicator of the station area is estimated by Equation (2) [43,44]. The diversity index ranges from 0 to 1, with values closer to 1 representing greater land-use diversity. To explore the change in diversity in each 100 m, the values were z-score standardized for each cluster.

D i v e r s i t y_{j} = \frac{- \sum_{i \in R} p_{i} L N (p_{i})}{L N (n_{j})}

(2)

where

D i v e r s i t y_{j}

refers to the diversity of the given area

j

;

p_{i}

is the proportion of POI

i

in all POI types and

R

is the POI typeset, including resident, office, and entertainment facility;

L N

is the natural logarithm;

n_{j}

is the total POI number.

4. Results

4.1. Variable Analysis

To identify key variables for cluster analysis and avoid multicollinearity, we first examined the Pearson correlations among the six DBS-derived metrics (N_den, N_dis, D_den, D_dis, R_D2N_1, R_D2N_2). The Pearson correlation coefficients (Corr.) are presented in Figure 6. Most of the variable-pairs show weak correlations or no correlation, while N_den and D_den, N_dis, D_dis are correlated with each other. N_den is correlated with D_den with a significant Corr. of 0.98. Similarly, N_dis is correlated with D_dis with significant Corr. of 0.84. To further validate these strong linear relationships, we plotted the scatter plots with linear regression fits for N_den vs. D_den and N_dis vs. D_dis (Figure 7a,b). The results confirm an almost perfect linear positive correlation between N_den and D_den (Slope = 0.95, R² = 0.95), and a strong linear positive correlation between N_dis and D_dis (Slope = 0.81x, R² = 0.70). These robust linear dependencies statistically confirm the high redundancy within each variable pair.

Consequently, for the subsequent cluster analysis, we selected only one representative variable from each highly correlated pair as input features to avoid multicollinearity issues. The selection was guided by the principle of minimizing the overall correlation with variables outside the pair. For each candidate variable, we calculated the sum of its absolute Pearson correlation coefficients with all variables not in its own highly correlated pair. The variable with the smaller sum was retained. Between N_den and D_den, N_den was chosen; between N_dis and D_dis, D_dis was chosen. These two variables, together with the ratio variables, constituted the final set of four input features (N_den, D_dis, R_D2N1, R_D2N2) for cluster analysis.

In particular, to quantitatively assess the day–night variations in DBS usage patterns, paired-samples t-tests were conducted to compare the nighttime and daytime metrics. It revealed a statistically significant difference in the spatial distribution of bikes, with the average parking distance at night (N_dis) being significantly greater than during the daytime (D_dis) (t = 3.149, p = 0.002, Mean Difference = 32.59 m). This indicates that bicycles were parked more dispersed around stations at night. Conversely, no significant difference was found between nighttime and daytime density across all stations (t = 0.721, p = 0.471). This lack of a global density shift suggests a complex interplay of urban functions, where the evening influx of bikes into residential areas is counterbalanced by the outflow from employment-centric areas, resulting in no net change on an aggregate level.

4.2. Cluster Result

The optimal number of clusters was determined using the Elbow method. Figure 8 illustrates the model performance for cluster numbers (k) ranging from 2 to 14. The Elbow method showed that the rate of decrease in the SSE slowed considerably after k = 5 (SSE = 162.10). Concurrently, the ASCS for this solution was 0.43, representing a stable and relatively high value before a general trend of decreasing volatility was observed at higher k values. Therefore, k = 5 was selected as the optimal number, providing a good balance between model fit and cluster interpretability.

The outputs of classification are presented in Table 2, which details the cluster profiles using the four input variables. For the density and distance variables (N_den and D_dis), the table reports the mean z-scores to facilitate comparison across variables with different units; positive values indicate above-average levels and negative values indicate below-average levels. The ratio variables (R_D2N1 and R_D2N2) are presented in their original, non-standardized form. This is because these ratios are inherently dimensionless and centered around 1.0, which naturally signifies no change between day and night. Standardizing these ratios would obscure this intuitive interpretation without providing additional analytical benefit. The nighttime DBS density (N_den) exhibits a clear decreasing gradient from Cluster 1 (C1) to Cluster 5 (C5). The average park distance during the daytime increases from C1 to Cluster 3 (C3) and reaches the peak in Cluster 4 (C4) and drops to the lowest in Cluster 5 (C5). The two ratios of the DBS density and parking distance from daytime to night are close to 1 except C5. This indicates that day–night DBS usage patterns are relatively stable across all clusters except C5. To test the robustness of the classification, we performed alternative clustering using the variables that were excluded due to high correlation (D_den and N_dis) in place of the selected ones (N_den and D_dis). The results remained largely consistent, which is expected given the high linear correlation within each variable pair, ensuring that both sets convey nearly identical information.

The spatial distribution of each cluster is shown in Figure 9. These five clusters are distinct, with each being marked by a unique combination of index values and spatial distribution. Of the five clusters, C1 is located around Beijing’s Central Business District (CBD). It is characterized by the highest N_den (2.93) and a R_D2N1 close to 1 (0.96), consistent with sustained, high-intensity DBS usage throughout the day. The low average D_dis (−0.80) and the R_D2N2 close to 1 (1.01) indicate that individuals in C1 are parking dockless bikes at several subcenters instead of evenly distributed in the whole station area. Cluster 2 (C2) (e.g, Jingsong Station, Jiulongshan Station) and Cluster 3 (C3) (e.g., Fangzhuang Station, Jishuitan Station) are mostly located inside the 5th Ring Road of the urban areas, while most station areas in C2 are located in the northeast of the urban areas, and most of the station areas in C3 are scattered in the southwest of the urban and suburban areas. The parking distributions in both C2 and C3 are more concentrated than average; however, obvious differences in DBS usage are observed between them. C2 has higher N_den (0.83) than average, whereas N_den in C3 (−0.39) is below the average. In addition, the R_D2N1 (1.17) is slightly larger in C3 than in C2 (0.99). In other words, dockless bikes are more available in C2 than in C3 both during the daytime and at night.

Cluster 4 (C4) and Cluster 5 (C5) show almost the opposite trend in terms of the DBS usage except for the low N_den for both (−0.77 for C4, −0.88 for C5). The highest D _dis value (1.53) and the normal R_D2N1 (0.99) and R_D2N2 (1.04) values in C4 indicate that dockless bikes in the C4 area are widely distributed both during the daytime and at night. The lowest R_D2N1 values in C5 (0.55) implies users ride dockless bikes from stations to the outside of the buffer area during the daytime, and ride from the periphery to the buffer area at night. In addition, the lowest D_dis (−1.89) for C5 also shows dockless bikes parked around the station during the daytime, and the lowest R_D2N2 (0.14) indicates that dockless bikes are much more dispersed at night than during the daytime.

Figure 10 presents the relationship between the parking distance and density of dockless bikes. As illustrated in Section 3.2, N_dis and D_dis values represent the degree of concentration or dispersion of DBS locations around the station. We found a negative correlation between parking distance and density both during the daytime and at night. Remarkably, the centralization of parking distance was also observed as the density was increased. For example, the D_dis value in C1, which has the highest parking density, is centralized to approximately 300 m, whereas it is widely distributed from 600 m to 1400 m in C4.

From a temporal perspective, the majority of R_D2N1 and R_D2N2 are scattered between 0.8 and 1.2, as shown in Figure 11. Especially, all R_D2N1 and R_D2N2 values of C1 are in the range of 0.8 to 1.2, indicating that the station areas in C1 have compound functions. It is noted that, as for the parking distance, although the ratio of daytime to night was in the range from 0.8 to 1.2, the parking distance in C1 was between 200 m and 400 m, which is longer than C5.

4.3. POIs Configuration and Clusters’ Function Interpretion

4.3.1. POIs Configuration

As presented in Figure 12, C1 and C2 are high-density development areas with the maximum densities that occur in 200 m, but the poor diversity value within 300 m indicates the homogenized land-use type, whereas the increasing diversity value indicates the various land use in the periphery. The density curve of C3 is slightly below the average and the diversity reaches the top in the first 100 m and then drops to a medium level in the second 100 m where it has remained ever since. Hence, the types of land reduced sharply in the second 100 m and tended to be homogeneous from 200 m to 800 m ring. The density of C4 and C5 is the lowest, but the diversity within 200 m is relatively high, after which the diversity continued to fluctuate. It is therefore concluded that, for well-developed station areas, the lower diversity within the core represents the simple land-use type, and the increased diversity in the periphery diversifies the land use; for low-density developed station areas, the diversity decreases from the top in the first 100 m to a lower value and then stays at the same level. The land-use type in these areas decreases to a certain range and remains relatively stable.

4.3.2. Clusters’ Function Interpreting

The POI configuration reflects area functions (e.g., a station area that contains a variety of POIs has mixed functions) [5] and we attempt to annotate the function for each cluster through POIs configuration. Notably, as presented in Figure 12, POI density typically increases within 0–200 m from the station (peaking at 200 m), then declines to levels approaching station-proximate densities by 300 m, beyond which it stabilizes or decreases gradually. This inflection point at 300 m captures the transition from station-influenced functional intensity to the ambient urban areas. Hence, we use residential, office, and entertainment facility distributions within 300 m buffer to interpret functions for each cluster.

Prior to comparing POI densities across clusters, we tested the assumption of homogeneity of variances. The Levene’s test indicated a violation of this assumption for all three POI types (Res_300, Off_300, Ent_300; all with p < 0.01). Consequently, the robust Welch’s ANOVA was employed. As presented in Table 3, the results confirmed statistically significant differences in residential, office, and entertainment POI densities among the five clusters.

To delineate the specific pairwise differences between clusters, we conducted Games-Howell post hoc tests, which is the recommended procedure following a significant Welch’s ANOVA. The results are summarized in Figure 13. Values in parentheses are means, red connectors indicate statistically significant differences, with the corresponding p-values labeled above, and gray connectors denote non-significant results. For residential density (Res_300), C1 demonstrated significantly higher density than C3, C4, and C5. C2 was also significantly higher than C3, C4, and C5. No significant difference was found between C1 and C2 or between C4 and C5. Regarding office density (Off_300), C1 was significantly higher than C4 and C5. C2 was significantly higher than C3, C4, and C5. The difference between C1 and C3 was non-significant. For entertainment density (Ent_300), C1 was significantly higher than C4 and C5. C2 was significantly higher than C4 and C5, and C3 was significantly higher than C5.

This granular statistical evidence, derived from the post hoc analysis, strongly validates that the clusters derived from DBS data possess distinct built-environment characteristics. For example, the residential density is 353.58 per sq.km in C1, which is approximately three times as high as in C3 (116.20 per sq.km). The distribution of the average parking distance can be well explained by the POI configuration. The well-developed C1 has the highest POI density in terms of residential, office, and entertainment facilities, and the DBS users might have to park dockless bikes in a particular place (e.g., the entrance of a neighborhood and a rail station), hence forming one or several parking centers and resulting in low D_dis value or N_dis value.

To further decipher the spatial organization of functions within each cluster, we employed two key ratios: the job-to-residence ratio (R_J2R) and the core-to-periphery density ratio (R_3/8). The R_J2R illuminates the local functional balance at a specific spatial scale. More critically, the R_3/8 ratio operationalizes the concept of ‘inner-outer’ functional layering around a station. A value greater than 1 indicates that the specific POI type is more concentrated in the immediate 300 m core area compared to the 300–800 m periphery, suggesting a station-centered agglomeration. Conversely, a value less than 1 implies that the function is more dispersed in the broader station area. This allows us to move beyond aggregate density and understand how different urban functions are spatially structured in relation to the transit station.

As described in Table 4, most clusters have similar R_J2R values in the two layers except C4. That is, the two layers have the same balance degree of job and resident, while C4 has larger R_J2R in the inner layer than the outer layer, indicating that more jobs are in the inner layer, relatively. From the perspective of R_3/8, the R_3/8 values of residential from C1 to C5 are greater than 1 and show the increasing trend, indicating that all residential buildings are concentrated in the inner layer. The R_3/8 values of office are quite special, mainly located in the periphery of the compact developed zones (C1 and C2), and the inner layer of the low-density zone (C3, C4, and C5).

Note that both C4 and C5 have close and high R_3/8 values of residential, whereas the R_3/8 values of offices are significantly large in C4 (1.73) than it is in C5 (1.17), hence the inner layer of C4 provides more job opportunities than the inner layer of C5. Given the POIs condition as shown in Table 4, the various in R_D2N1 of C4 and C5 may be well explained by the POIs configuration. Compared to the C4, more users leave from inner layers to the outer layer for working in C5, hence the DBS in inner layer are reduced, and they come into the inner layer at night, therefore increasing the DBS density. This commute behavior results in the DBS density change from daytime to night.

In summary, according to the differences in built environments (POIs) and activities (DBS), we can interpret these clusters as presented in Table 5. C1 represents a Central Business District with High-Intensity Mixed Functions. This compact, well-developed urban core exhibits extremely high and concentrated DBS usage, aligned with its agglomeration of office, residential, and entertainment POIs, sustaining vibrant activity both day and night. C2 is characterized as an Urban Residential District with Comprehensive Services. While similar in urban form to C1, its most distinct feature is the dominant residential function and comprehensive local services, resulting in strong but slightly less intense DBS usage. C3 is identified as a Balanced Employment-Residential District. As the most common station type, it features moderate DBS density and a notable balance between employment and residential POIs within the station area, indicating a well-integrated live-work environment. C4 is classified as a Suburban Employment Center with Commuter Hub Functions. Its low-density development and dispersed daytime DBS parking reflect its role as a job concentration point and a key destination for suburban commuters. C5 is defined as a Low-Density Commuter Origin with Limited Local Services. This cluster has the lowest POI density and exhibits the most pronounced day–night DBS variations, underscoring its primary function as a residential origin for commuters with minimal local employment opportunities.

5. Discussion

5.1. Key Findings and Theoretical Implications

This study developed the typology of Beijing’s URT station areas by integrating the spatiotemporal dynamics of DBS with the static built environment represented by POIs. Our findings provide strong support for the proposed hypotheses and offer a dynamic, station-level perspective that is absent in traditional static models like the N-P model and prior DBS studies focused on aggregate, city-scale land use patterns.

First, the cluster analysis revealed five functionally distinct station groups based on DBS usage patterns, confirming our hypothesis (H1) that URT stations can be classified into distinct functional types using DBS data. This demonstrates that DBS data alone can effectively capture functional differences between station areas.

Second, significant and directional differences in DBS usage characteristics were identified across the five clusters, fully supporting hypothesis (H2). Specifically, stations in urban core areas (C1) exhibited high-density, around-the-clock usage with concentrated parking distributions, while suburban stations (C4, C5) showed clear commuter-oriented patterns with distinctive evening surges and more dispersed parking. These systematic variations in DBS metrics provide empirical evidence that different functional areas generate distinctive mobility signatures detectable through bike-sharing patterns.

Third, the station types identified by DBS data exhibited statistically significant differences in their POI configurations, thereby validating hypothesis (H3). Critically, the alignment between DBS-derived clusters and POI-based land-use features provides empirical evidence that DBS patterns effectively reflect underlying urban functions. This validation confirms that the mobility-based classification corresponds to meaningful functional differences in the built environment, bridging the gap between dynamic mobility data and static land-use models.

Our work moves beyond prior research that used DBS data for coarse land-use classification by establishing a station-level classification framework based on fine-grained DBS usage patterns. While extended N-P models incorporate new dimensions, they remain largely reliant on static indicators. Our approach complements these models by introducing a dynamic, behavior-based component (day–night DBS ratios) that directly captures the temporal functional rhythms of station areas, providing a more nuanced understanding that aligns with the theoretical interplay between ‘node’ and ‘place’ but is grounded in observed human activity.

5.2. Practical Implications for Planning and Policy

The functional typology derived from our integrated framework offers actionable insights for urban planners and policymakers. The identified clusters can guide targeted infrastructure investments and policy interventions based on station-specific characteristics. For bicycle parking management, stations in the Central Business District with High-Intensity Mixed Functions (C1) require high-capacity, organized bicycle parking facilities near station entrances to manage the extreme density. In contrast, the dispersed usage pattern in Job-Concentrated Suburban Commuter Hubs (C4) necessitates a network of smaller, flexible parking solutions distributed across the broader catchment area to effectively serve commuters arriving from all directions. Furthermore, the pronounced evening surge in Commuter Residential Areas (C5) highlights the critical need for dynamic parking redistribution strategies, where shared bikes are systematically repositioned to these stations in the afternoon to meet returning commuter demand.

The DBS-based classification system enables a precise understanding of commuter patterns, which directly informs last-mile service calibration. The prominent evening surge in residential clusters (C4, C5) underscores the need for reliable last-mile services from stations to homes, potentially informing the scheduling of shuttle services. Specifically, for the Low-Density Commuter Origin (C5), fixed-route or on-demand micro-transit services in the evening could be prioritized to bridge the connectivity gap. Conversely, stations identified as employment centers (C1, C2) may require enhanced morning arrival facilities and evening departure services. This could translate into designated drop-off zones and optimized traffic flow at C1 and C2 stations during peak hours.

Our findings reinforce TOD principles through data-driven station classification and provide a basis for targeted land-use and zoning policies [45]. The well-developed, mixed-use nature of C1 and C2 stations exemplifies successful TOD. To sustain this, zoning in C1 should continue to encourage high-density, vertical mixed-use development. For stations in C3, C4, and C5, planners can use these insights to encourage more balanced development. In the Balanced Employment-Residential District (C3), policies should protect existing job-housing balance and incentivize infill development of both types. For the Suburban Employment Center (C4), strategic densification and the introduction of supportive residential and retail land uses around the station core could help create a more vibrant, round-the-clock environment and reduce reliance on long-distance commuting. Conversely, in the Commuter Residential Area (C5), the primary goal should be to introduce local employment opportunities and basic services through relaxed zoning and incentives, thereby reducing its mono-functional dependence and associated cross-city travel demands. The methodology developed in this study provides a transferable framework for other cities to develop similarly nuanced station area typologies for targeted planning interventions.

5.3. Limitations and Future Research

This study has several limitations that offer directions for future work. First, the temporal scope of DBS data is a limitation. Our analysis relied on data from two consecutive days in May 2020. While the selected days were representative of typical spring weekdays in terms of weather and overall trip volume, the two-day snapshot cannot account for seasonal variations or the potential long-term impacts of exceptional events like the COVID-19 pandemic on commuting habits. Future studies should incorporate multi-season and multi-year data to validate the temporal stability of the identified clusters and explore the dynamics of functional patterns across different timescales.

Methodologically, while informative, our metrics may be influenced by operator interventions like bike repositioning [46], and the clustering results could be further validated through sensitivity analyses using different algorithms or variable selection schemes. Future research could integrate repositioning data or apply filtering algorithms to isolate user behavior more precisely, employ non-parametric correlation methods to strengthen variable selection, and conduct a quantitative comparison with traditional models like the N-P framework. Beyond these methodological refinements, the transferability of the proposed framework warrants consideration. The core analytical approach, classifying stations by integrating dynamic DBS patterns with static POI data, is highly transferable, as the required data are increasingly accessible globally. The principle of leveraging day–night DBS ratios to infer functional shifts is universally applicable. However, the specific typology (e.g., the clear dichotomy between a dominant urban core and sprawling suburbs) is contextual, shaped by Beijing’s monocentric urban structure and mature DBS market. In cities with different urban forms (e.g., polycentric) or DBS market structures, the manifestation of clusters may differ. Therefore, future research should test this framework in diverse urban contexts to refine the typology and distinguish universal from context-specific station-area patterns. The proposed typology, though statistically sound, would also benefit from ground-truthing through field surveys or interviews to verify cluster-assigned functions.

6. Conclusions

This study established a data-driven framework for interpreting the functional characteristics of URT station areas by integrating dynamic DBS spatiotemporal patterns—specifically day–night density and distribution ratios—with static POI-based built environment indicators. The analysis classified Beijing’s stations into five distinct clusters, each exhibiting unique signatures in terms of DBS usage intensity, concentration, and day–night variation. These mobility-based clusters were further validated by significant differences in their underlying POI compositions, confirming a strong alignment between observed travel behavior and urban function.

The key contribution of this work lies in its demonstration that the day–night dynamics of DBS data serve as a powerful proxy for delineating station-area functions and inferring functional shifts, thereby addressing a critical gap in traditional static models and coarse-grained mobility analyses. The derived typology offers a nuanced understanding of the functional hierarchy and spatial organization around transit stations, providing actionable insights for targeted infrastructure investment and policy interventions tailored to specific station types. Ultimately, this integrated method enhances our ability to plan for more efficient last-mile connections and promote sustainable urban development centered on rail transit.

Author Contributions

Conceptualization, S.L. and J.R.; methodology, S.L. and C.Z.; data curation, S.L. and M.G.; writing—original draft preparation, S.L.; writing—review and editing, J.R. and H.S.; visualization, S.L.; supervision, J.R. and C.Z. All authors have read and agreed to the published version of the manuscript.

Funding

The research was supported by the National Natural Science Foundation of China (Grant No. 52402380) and the Key Laboratory of Highway Engineering of Ministry of Education (Changsha University of Science & Technology) (Grant No. kfj230303).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Zhu, T.; Yang, X.; Wei, Y.; Chen, A.; Wu, J. Urban Rail Transit Resilience under Different Operation Schemes: A Percolation-Based Approach. Commun. Transp. Res. 2025, 5, 100177. [Google Scholar] [CrossRef]
Rong, W.; Zhang, Z.; Liu, Y.; Yang, Y.; Qu, X. Spatiotemporal Heterogeneity Exploration in the Effects of TOD Structural Characteristics on Metro Ridership: Evidence from Shanghai. J. Transp. Geogr. 2025, 128, 104339. [Google Scholar] [CrossRef]
Han, B.M.; Yu, Y.R.; Xi, Z.; Sun, Y.J.; Lu, F.; Li, S.W.; Li, Z.; Huang, S.; Hu, J.; Sang, Y.; et al. Statistical Analysis of Urban Rail Transit Operations Worldwide in 2023: A Review. Urban Rapid Rail Transit 2024, 37, 1–9. [Google Scholar] [CrossRef]
2023 National Passenger Volume in Urban Transportation. Available online: https://xxgk.mot.gov.cn/2020/jigou/zhghs/202404/t20240416_4128383.html (accessed on 16 April 2024).
Chang, X.; Wu, J.; He, Z.; Li, D.; Sun, H.; Wang, W. Understanding User’s Travel Behavior and City Region Functions from Station-Free Shared Bike Usage Data. Transp. Res. Part F Traffic Psychol. Behav. 2020, 72, 81–95. [Google Scholar] [CrossRef]
Liu, S.; Rong, J.; Zhou, C.; Bian, Y. Probability -Based Typology for Description of Built Environments around Urban Rail Stations. Build. Environ. 2021, 205, 108193. [Google Scholar] [CrossRef]
Osman, T.A.A.; Paquet, T.N.K.; Furlan, R.; Tadi, M.; Al-Matwi, R.S.; Isaifan, R.J. Transit-Oriented Development in Middle Eastern Cities: The Urban Renewal Framework for the Souq Waqif in Doha, Qatar. Urban Sci. 2024, 8, 182. [Google Scholar] [CrossRef]
Chai, T.; Lu, F.; Gao, J.; Deng, X.; Gao, R.; He, Q. TOD Zoning Planning: Floor Area Ratio Attenuation Rate and Center Migration Trajectory. Land 2025, 14, 1200. [Google Scholar] [CrossRef]
Liu, S.; Zhou, C.; Rong, J.; Bian, Y.; Wang, Y. Concordance between Regional Functions and Mobility Features Using Bike-Sharing and Land-Use Data near Metro Stations. Sustain. Cities Soc. 2022, 84, 104010. [Google Scholar] [CrossRef]
Bertolini, L. Nodes and Places: Complexities of Railway Station Redevelopment. Eur. Plan. Stud. 1996, 4, 331–345. [Google Scholar] [CrossRef]
Kang, H.; Miwa, T. Systematic Review and Methodological Comparison of TOD Typologies Based on the Node-Place Model. J. Transp. Geogr. 2025, 128, 104373. [Google Scholar] [CrossRef]
Haque, A. Transit Oriented Development (TOD) in Dhaka: Measuring Spatial Potential from Node Place Perspective. Transp. Res. Procedia 2025, 82, 2172–2187. [Google Scholar] [CrossRef]
Liu, J.; Yang, X.; Huang, X.; Li, J.; Li, J.; Chen, H.; An, R.; Song, Y. Introducing Street View into Node-Place Model for Evaluating Transit-Oriented Development. J. Transp. Geogr. 2025, 128, 104334. [Google Scholar] [CrossRef]
Alexander, L.; Jiang, S.; Murga, M.; González, M.C. Origin-Destination Trips by Purpose and Time of Day Inferred from Mobile Phone Data. Transp. Res. Part C Emerg. Technol. 2015, 58, 240–250. [Google Scholar] [CrossRef]
Calabrese, F.; Diao, M.; Di Lorenzo, G.; Ferreira, J.; Ratti, C. Understanding Individual Mobility Patterns from Urban Sensing Data: A Mobile Phone Trace Example. Transp. Res. Part C Emerg. Technol. 2013, 26, 301–313. [Google Scholar] [CrossRef]
Pei, T.; Sobolevsky, S.; Ratti, C.; Shaw, S.L.; Li, T.; Zhou, C. A New Insight into Land Use Classification Based on Aggregated Mobile Phone Data. Int. J. Geogr. Inf. Sci. 2014, 28, 1988–2007. [Google Scholar] [CrossRef]
Dong, S.; Wang, Y.; Dou, M.; Wang, C.; Gong, J. Uncovering the Similarity and Heterogeneity of Metro Stations: From Passenger Mobility, Land Use, and Streetscapes Semantics. Appl. Geogr. 2025, 174, 103450. [Google Scholar] [CrossRef]
Ma, J.; Zheng, C.; Yu, M.; Shen, J.; Zhang, H.; Wang, Y. The Analysis of Spatio-Temporal Characteristics and Determinants of Dockless Bike-Sharing and Metro Integration. Transp. Lett. 2024, 16, 182–195. [Google Scholar] [CrossRef]
Liu, Y.; Li, L.; Liu, K.; He, M.; Shi, Z. Investigating User Preferences for Dockless Bike- and Electric Bike-Sharing through Tracking Usage Patterns. Transp. Policy 2025, 169, 41–55. [Google Scholar] [CrossRef]
Song, K.; Diao, M. Nonlinear and Spatially-Varying Effects of the Built Environment on Dockless Bike-Sharing Usage. Transp. Res. Part D Transp. Environ. 2025, 145, 104807. [Google Scholar] [CrossRef]
Zhao, J.; Fan, W.; Zhai, X. Identification of Land-Use Characteristics Using Bicycle Sharing Data: A Deep Learning Approach. J. Transp. Geogr. 2020, 82, 102562. [Google Scholar] [CrossRef]
Pezeshknejad, P.; Monajem, S.; Mozafari, H. Evaluating Sustainability and Land Use Integration of BRT Stations via Extended Node Place Model, an Application on BRT Stations of Tehran. J. Transp. Geogr. 2020, 82, 102626. [Google Scholar] [CrossRef]
Amini Pishro, A.; Yang, Q.; Zhang, S.; Amini Pishro, M.; Zhang, Z.; Zhao, Y.; Postel, V.; Huang, D.; Li, W.Y. Node, Place, Ridership, and Time Model for Rail-Transit Stations: A Case Study. Sci. Rep. 2022, 12, 16120. [Google Scholar] [CrossRef] [PubMed]
Olaru, D.; Moncrieff, S.; McCarney, G.; Sun, Y.; Reed, T.; Pattison, C.; Smith, B.; Biermann, S. Place vs. Node Transit: Planning Policies Revisited. Sustainability 2019, 11, 477. [Google Scholar] [CrossRef]
Kong, X.; Xia, F.; Ma, K.; Li, J.; Yang, Q. Discovering Transit-Oriented Development Regions of Megacities Using Heterogeneous Urban Data. IEEE Trans. Comput. Soc. Syst. 2019, 6, 943–955. [Google Scholar] [CrossRef]
Jeffrey, D.; Boulangé, C.; Giles-Corti, B.; Washington, S.; Gunn, L. Using Walkability Measures to Identify Train Stations with the Potential to Become Transit Oriented Developments Located in Walkable Neighbourhoods. J. Transp. Geogr. 2019, 76, 221–231. [Google Scholar] [CrossRef]
Huang, R.; Grigolon, A.; Madureira, M.; Brussel, M. Measuring Transit-Oriented Development (TOD) Network Complementarity Based on Tod Node Typology. J. Transp. Land Use 2018, 11, 304–324. [Google Scholar] [CrossRef]
Zhang, Z.; Hu, S.; Wu, L.; Liu, Y.; Xu, Y.; Cui, H. Revealing the Hierarchical Overlapping Urban Structure Using Taxi Trajectory Data. Cities 2025, 166, 106290. [Google Scholar] [CrossRef]
Zhang, P.; Yang, M.; Wang, Y.; Yang, T.; Yu, H.; Yan, X. Integrating Metro Passenger Flow Data to Improve the Classification of Urban Functional Regions Using a Heterogeneous Graph Neural Network. Int. J. Digit. Earth 2024, 17, 613–630. [Google Scholar] [CrossRef]
Sun, R.; Su, X.; Yuan, Q.; Jiao, H.; He, J.; Zheng, L. Urban Region Function Classification via Fusing Optical Imagery and Social Media Data: A Spatio-Temporal Transformer Interaction Approach. Inf. Fusion 2025, 121, 103140. [Google Scholar] [CrossRef]
Guo, Y.; He, S.Y. Built Environment Effects on the Integration of Dockless Bike-Sharing and the Metro. Transp. Res. Part D Transp. Environ. 2020, 83, 102335. [Google Scholar] [CrossRef]
Vale, D.S. Transit-Oriented Development, Integration of Land Use and Transport, and Pedestrian Accessibility: Combining Node-Place Model with Pedestrian Shed Ratio to Evaluate and Classify Station Areas in Lisbon. J. Transp. Geogr. 2015, 45, 70–80. [Google Scholar] [CrossRef]
Atkinson-Palombo, C.; Kuby, M.J. The Geography of Advance Transit-Oriented Development in Metropolitan Phoenix, Arizona, 2000–2007. J. Transp. Geogr. 2011, 19, 189–199. [Google Scholar] [CrossRef]
Phani Kumar, P.; Ravi Sekhar, C.; Parida, M. Identification of Neighborhood Typology for Potential Transit-Oriented Development. Transp. Res. Part D Transp. Environ. 2020, 78, 102186. [Google Scholar] [CrossRef]
Higgins, C.D.; Kanaroglou, P.S. A Latent Class Method for Classifying and Evaluating the Performance of Station Area Transit-Oriented Development in the Toronto Region. J. Transp. Geogr. 2016, 52, 61–72. [Google Scholar] [CrossRef]
Beijing 2023 National Economic and Social Development Statistical Bulletin. Available online: https://tjj.beijing.gov.cn/tjsj_31433/tjgb_31445/ndgb_31446/202403/t20240321_3595860.html (accessed on 21 March 2024).
Zhang, X.; Li, W.; Zhang, F.; Liu, R.; Du, Z. Identifying Urban Functional Zones Using Public Bicycle Rental Records and Point-of-Interest Data. ISPRS Int. J. Geo-Inf. 2018, 7, 459. [Google Scholar] [CrossRef]
Seo, M.; Lee, D. Typological Differences in Railway Station Areas According to Locational Characteristics: A Nationwide Study of Korea. Sustainability 2021, 13, 4310. [Google Scholar] [CrossRef]
Feng, H.; Chen, Y.; Wu, J.; Zhao, Z.; Wang, Y.; Wang, Z. Urban Rail Transit Station Type Identification Based on “Passenger Flow—Land Use—Job-Housing”. Sustainability 2023, 15, 15103. [Google Scholar] [CrossRef]
Seidenfus, M.; Zacher, T.; Balke, G.; Lienkamp, M. Out of Alignment: Fixing Overlapping Segments in German Car Classification Through Data-Driven Clustering. Futur. Transp. 2025, 5, 132. [Google Scholar] [CrossRef]
Schubert, E.; Sander, J.; Ester, M.; Kriegel, H.P.; Xu, X. DBSCAN Revisited, Revisited. ACM Trans. Database Syst. 2017, 42, 1–21. [Google Scholar] [CrossRef]
Zhong, H.; Zhang, H.; Jia, F. Analysis and Improvement of Evaluation Indexes for Clustering Results. EAI Endorsed Trans. Collab. Comput. 2020, 4, 163211. [Google Scholar] [CrossRef][Green Version]
Phani Kumar, P.; Ravi Sekhar, C.; Parida, M. Residential Dissonance in TOD Neighborhoods. J. Transp. Geogr. 2018, 72, 166–177. [Google Scholar] [CrossRef]
Tian, G.; Park, K.; Ewing, R.; Watten, M.; Walters, J. Traffic Generated by Mixed-Use Developments—A Follow-up 31-Region Study. Transp. Res. Part D Transp. Environ. 2020, 78, 102205. [Google Scholar] [CrossRef]
Calthorpe, P. The Next American Metropolis: Ecology, Community, and the American Dream; Princeton Architectural Press: New York, NY, USA, 1993. [Google Scholar]
Xing, Y.; Wang, K.; Lu, J.J. Exploring Travel Patterns and Trip Purposes of Dockless Bike-Sharing by Analyzing Massive Bike-Sharing Data in Shanghai, China. J. Transp. Geogr. 2020, 87, 102787. [Google Scholar] [CrossRef]

Figure 1. Traditional N-P model (left) and its improvement (right).

Figure 2. Study area.

Figure 3. Density changes in two stations.

Figure 4. The various of N_dis in two stations.

Figure 5. The analytical framework of this study.

Figure 6. Pearson correlation coefficient matrix (* indicates statistical significance at the 0.10 level, ** at the 0.05 level, and *** at the 0.01 level).

Figure 7. Linear relationships between daytime and nighttime (a) DBS density, and (b) parking distance.

Figure 8. The optimal number of clusters.

Figure 9. Cluster result of the station areas in Beijing.

Figure 10. Relationships between the parking density and distance. (a) the N_dis and N_den values, (b) the D_dis and D_den values.

Figure 11. Distribution of R_D2N1 and R_D2N2 values. (a) R_D2N1; (b) R_D2N2.

Figure 12. POI density (top) and diversity (bottom) for each cluster.

Figure 13. Games-Howell post hoc tests of the Res_300, Off_300, and Ent_300 for each cluster. (a) Res_300, (b) Off_300, (c) Ent_300.

Table 1. Descriptive statistics of data.

Data	Description	Unit	Mean	Std.dev
N_den	the ratio of the total number of dockless bikes collected at night (23:00) to the buffer area.	pcu/sq.km	49.210	54.105
N_dis	The average distance from locations of dockless bikes to the center at night, referring to the degree of concentration or dispersion.	m	509.741	253.384
D_den	the ratio of the total number of dockless bikes collected during the daytime (10:00) to the buffer area	pcu/sq.km	49.891	52.528
D_dis	The average distance from locations of dockless bikes to the center at daytime, referring to the degree of concentration or dispersion.	m	477.148	245.148
R_D2N1	the ratio of D_den to N_den.	-	1.044	0.370
R_D2N2	the ratio of D_dis to N_dis.	-	0.943	0.310
Res_300	Residential POI densities in 300 m buffer around station.	/sq.km	150.888	222.192
Off_300	Office POI densities in 300 m buffer around station.	/sq.km	24.177	35.038
Ent_300	Entertainment POI densities in 300 m buffer around station.	/sq.km	8.499	14.068
Inte_300	The sum of all kinds of POI densities in 300 m buffer around station.	/sq.km	183.56	248.789
R_J2R	Ratio of the office density to the resident density in a given area	-	-	-
R_3/8	ratio of a certain POI density in 300 m buffer to the density in the area with the buffer radius between 300 m and 800 m	-	-	-

Table 2. Cluster description summary.

Cluster	Number	Examples	N_den	D_dis	R_D2N1	R_D2N2
C1	18	Wangjing Station	2.93	−0.80	0.96	1.01
C2	88	Xiyuan Station	0.82	−0.60	0.99	1.00
C3	139	Guanzhuang Station	−0.39	−0.14	1.17	0.93
C4	75	Huilongguan Station	−0.77	1.53	0.99	1.04
C5	15	Shahe Station	−0.88	−1.89	0.53	0.14

Table 3. Welch’s ANOVA Results for POI Densities Across Clusters.

Variable	Welch’s F	df 1	df 2	p-Value
Res_300	13.481	4	77.652	<0.01
Off_300	17.489	4	69.693	<0.01
Ent_300	19.697	4	65.683	<0.01

Note: The results show a statistically significant difference in POI densities among the five station clusters at the p < 0.01 level. df = degrees of freedom.

Table 4. R_J2R and R_3/8 in each cluster.

Cluster	R_J2R		R_3/8
Cluster	300 m Buffer	300 m–800 m Buffer	Residential	Office	Entertainment
C1	0.22	0.37	1.57	0.86	2.03
C2	0.36	0.47	1.96	0.93	2.26
C3	0.46	0.34	1.50	1.23	2.49
C4	0.62	0.35	2.00	1.73	2.51
C5	0.34	0.36	2.20	1.17	1.21

Table 5. The POIs and DBS usage features in clusters.

Clusters		C1	C2	C3	C4	C5
Functional type		Central Business District with High-Intensity Mixed Functions	Urban Residential District with Comprehensive Services	Balanced Employment-Residential District	Suburban Employment Center with Commuter Hub Functions	Low-Density Commuter Origin with Limited Local Services
Typical profiles
DBS feature	N_den	2.93	0.82	−0.39	−0.77	−0.88
	R_D2N1	0.96	0.99	1.17	0.99	0.53
	D_dis	−0.80	−0.60	−0.14	1.53	−1.89
	R_D2N2	1.01	1.00	0.93	1.04	0.14
POIs configuration	Inte_300	433.84	319.15	142.79	67.58	45.51
	Res_300	353.48	264.70	116.20	55.50	38.43
	Off_300	61.89	41.64	18.73	8.44	5.66
	Ent_300	18.47	12.82	7.86	3.63	1.41
	R_J2R (300)	0.22	0.36	0.46	0.62	0.34
	R_J2R (300–800)	0.37	0.47	0.34	0.35	0.36
	Res: R_3/8	1.57	1.96	1.50	2.00	2.20
	Off: R_3/8	0.86	0.93	1.23	1.73	1.17
	Ent: R_3/8	2.03	2.26	2.49	2.51	2.21
Case number		18	88	139	75	15

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, S.; Rong, J.; Zhou, C.; Guo, M.; Sun, H. Interpreting Regional Functions Around Urban Rail Stations by Integrating Dockless Bike Sharing and POI Patterns: Case Study of Beijing, China. Urban Sci. 2026, 10, 1. https://doi.org/10.3390/urbansci10010001

AMA Style

Liu S, Rong J, Zhou C, Guo M, Sun H. Interpreting Regional Functions Around Urban Rail Stations by Integrating Dockless Bike Sharing and POI Patterns: Case Study of Beijing, China. Urban Science. 2026; 10(1):1. https://doi.org/10.3390/urbansci10010001

Chicago/Turabian Style

Liu, Siyang, Jian Rong, Chenjing Zhou, Miao Guo, and Haodong Sun. 2026. "Interpreting Regional Functions Around Urban Rail Stations by Integrating Dockless Bike Sharing and POI Patterns: Case Study of Beijing, China" Urban Science 10, no. 1: 1. https://doi.org/10.3390/urbansci10010001

APA Style

Liu, S., Rong, J., Zhou, C., Guo, M., & Sun, H. (2026). Interpreting Regional Functions Around Urban Rail Stations by Integrating Dockless Bike Sharing and POI Patterns: Case Study of Beijing, China. Urban Science, 10(1), 1. https://doi.org/10.3390/urbansci10010001

Article Menu

Interpreting Regional Functions Around Urban Rail Stations by Integrating Dockless Bike Sharing and POI Patterns: Case Study of Beijing, China

Abstract

1. Introduction

2. Literature Review

2.1. The N-P Model and Its Improvements

2.2. The Usage of Mobility Data for Identifying Functional Areas

2.3. The Methods Used for Identification

3. Data and Methodology

3.1. Study Area

3.2. Data Description

3.3. Methodology

4. Results

4.1. Variable Analysis

4.2. Cluster Result

4.3. POIs Configuration and Clusters’ Function Interpretion

4.3.1. POIs Configuration

4.3.2. Clusters’ Function Interpreting

5. Discussion

5.1. Key Findings and Theoretical Implications

5.2. Practical Implications for Planning and Policy

5.3. Limitations and Future Research

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI