Next Article in Journal
Improving the Transport and Logistic Infrastructure of a City Using the Graph Theory Method: The Case of Astana, Kazakhstan
Next Article in Special Issue
How Does the “Civilized City” Selection Affect Environmental Governance Performance? A Spatial DID Approach Based on Prefecture-Level Cities
Previous Article in Journal
Recycling Post-Consumed Polylactic Acid Waste Through Three-Dimensional Printing: Technical vs. Resource Efficiency Benefits
Previous Article in Special Issue
Demographic Change and the Housing Stock of Large and Medium-Sized Cities in the Context of Sustainable Development
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Geographically Weighted Nonlinear Regression for Cost-Effective Policies to Enhance Bus Ridership

Department of Civil Engineering, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India
*
Author to whom correspondence should be addressed.
Sustainability 2025, 17(6), 2485; https://doi.org/10.3390/su17062485
Submission received: 5 February 2025 / Revised: 2 March 2025 / Accepted: 4 March 2025 / Published: 12 March 2025
(This article belongs to the Special Issue Spatial Analysis for the Sustainable City)

Abstract

:
This paper introduces a new geographically weighted nonlinear regression (GWNR) model to predict bus boarding more accurately. The proposed model, based on empirical data from selected bus routes in Chennai city, India, simultaneously accounts for spatial variations and non-linear relationships. The proposed GWNR model improves boarding forecast accuracy by increasing R2 by 18.5% and reducing MAE by 15% compared to linear models. The results are used to identify best-fitting non-linear transformations for key variables such as bus and train station density, scheduled headway, and occupancy, thereby providing deeper insights and better interpretability. Unlike existing aggregate models, bus consideration probability is identified as a key predictor of bus boarding, thus reflecting non-users’ behavior. Without this effect, the influences of nearby bus and train stations show counterintuitive trends. Upon incorporating consideration probability, the presence of a single nearby train station increases bus boarding by improving accessibility, whereas multiple stations nearby reduce it due to competition effects. Finally, an illustrative policy application demonstrates the ability of the model to identify priority locations where scheduled headway changes are needed and to determine the optimal magnitude of adjustments. Such a targeted policy intervention is found to be twice as effective in increasing the ridership gain index compared to uniform area-wide policies.

1. Introduction

The motorized transportation system significantly contributes to global oil consumption (64%), energy usage (27%), and CO2 emissions (23%) [1]. This, in turn, prompts cities worldwide to prioritize mass transit for sustainable mobility, particularly amidst intensifying climate change concerns. Developing nations are turning to public bus transit due to its greater affordability, ease of implementation, and energy efficiency. Nonetheless, despite initiatives to promote bus travel, globally declining bus demand is impeding these moves towards sustainable development goals in transport [2,3]. Traditional demand forecasting models are designed to predict demand and identify key influencing factors so that transit agencies can implement policies to maximize ridership. However, these models often have potential deficiencies in adequately analyzing critical factors, which lead to underestimated demand projections, inaccurate assessments of factor influence, or an unrealistic representation of the relationship between transit demand and its determinants. As a result, transit agencies struggle to implement effective policies that can enable them to achieve maximum ridership. Therefore, to address this challenge, a comprehensive analysis of bus demand using advanced modeling techniques is crucial for developing targeted interventions that can reverse this declining trend and guide cities to increase bus ridership as a step toward sustainability.
Boarding is a key transit aggregate demand indicator, alongside alighting, ridership, and footfall. It offers an understanding of where passengers enter the transit system, that can inform decisions regarding service planning, scheduling, improving amenities, and providing first-mile services at high-demand locations. Current bus boarding models have two main shortcomings that can cause over or underestimation of demand: (i) they often assume a normal distribution [4], and (ii) focus only on the linear effects of influencing factors on the boarding [5]. To address these issues, some studies explored nonlinear relationships, primarily based on log-linear or log-log transformation. However, these models may not be appropriate in all cases. For instance, the log-linear model assumes the marginal effect to be proportional to the response variable (boarding), whereas the elasticity remains constant for each independent variable (influencing factor) in the log-log model. Furthermore, the literature lacks a comprehensive understanding of nonlinearity due to the independent variables. For instance, the time gap between bus runs could potentially have a nonlinear effect on boarding. Very small gaps may lead to fewer passengers boarding per run, while long gaps may prompt passengers to consider alternative modes due to prolonged waiting times. The existing literature identifies the primary factors influencing bus boarding. However, it does not adequately address an important research question about the nature and suitability of alternative nonlinear functional forms that can realistically and practically explain bus boarding outcomes. To advance the current knowledge, this study proposes accounting for non-normality by transforming the response variable scale and capturing the nonlinearity in the covariates by adopting a wider range of nonlinear functions such as power, polynomial, log, and spline.
Current transit demand models solely rely on transit users’ data. They neglect how the behavior or perception of passengers who do not consider the bus as a viable option affects bus boarding. This negligence may introduce self-selection bias into the model, and lead to over or underestimation of key factors. Factors contributing to low consideration (the process of deciding to include a mode in an individual’s choice set) of buses may involve users’ poor perceptions of accessibility, service quality, and several other factors [6,7]. Therefore, the second research issue of interest is to investigate whether and to what extent an individual’s propensity to consider bus transit in his/her choice set affects the relationship between other influencing factors and bus boarding.
The impact of influencing factors on transit demand may vary across geographic areas, depending on unique land use, built environment, and transportation system characteristics. Spatial models such as spatial lag, spatial error, spatial Durbin, and geographically weighted regression (GWR) analyze this spatial variability [8,9]. When other spatial models mostly use global coefficients with spatially lagged terms, GWR employs locally varying coefficients to help in customizing policies based on the specific needs of different locations. However, the GWR literature often assumes a normal distribution for response variables and linear functional forms for influencing factors. This approach could not only potentially misrepresent transit demand data but also fail to identify the quantum of improvements needed in influencing factors to enhance the response variable. As a result, traditional GWR can prioritize locations but is unable to quantify the necessary improvements adequately. Hence, this paper explores how much ridership gains can be obtained from interventions using a nonlinear GWR model compared to those achieved with a traditional GWR model. Furthermore, past GWR studies often simplify real-world impediments by utilizing Euclidean distance for geographical weights and neglecting physical barriers [9]. However, some studies reported that results using Euclidean and network distance metrics are almost similar, possibly due to a lack of significant physical barriers [10,11]. Despite this, network distance is a widely accepted approach for capturing spatial heterogeneity.
In response to the above limitations in practical advancements, this paper aims to accomplish the following objectives:
(i)
To investigate the effect of non-normality and nonlinearity in key factors influencing bus transit boarding.
(ii)
To examine the impact of including non-bus users’ behavior on the responsiveness of other key influencing factors for bus boarding.
(iii)
To illustrate the effectiveness of interventions derived from the nonlinear GWR model by identifying intervention locations and the quantum of improvement required to enhance bus ridership.
This paper develops a geographically weighted nonlinear regression (GWNR) model to predict bus boarding using aggregate data from electronic ticketing machines (ETM) on specific bus routes in Chennai, India. This model applies square root transformation for boarding and nonlinear functions (polynomial, power, etc.) for different influencing factors. The weight matrix is calculated using pedestrian and driving network distances derived from OpenStreetMap to assess the neighborhood influence on coefficient estimation. To predict bus consideration probability, a binary logit model is developed using disaggregated household survey data from Chennai. Finally, the predicted consideration probability is incorporated into the boarding model as an instrumental variable.
In contrast to existing models that captured either nonlinearity or geographical variation alone, this study introduces a novel GWNR model for bus boarding, which accounts for nonlinearity together with spatial variation simultaneously for enhancing prediction accuracy, understanding behavioral relationships, and informing policy implications. Specifically, this paper presents three novelties: (a) a square root transformation of the response variable is applied to represent the empirical distribution of data, (b) nonlinear functions of explanatory factors are employed to explore the realistic intricacies of their influences, and (c) bus consideration probability is utilized to account for possible self-selection in bus boarding data. The proposed model provides detailed area-specific insights into the effects of station, route, run, land-use characteristics, and consideration propensity on bus boarding. Finally, the proposed GWNR guides targeted policy interventions that are nearly twice as effective as those obtained from traditional GWR. The findings will empower transit operators to make effective decisions about bus boarding, ultimately contributing to more sustainable and efficient transportation systems.
The rest of the paper is structured as follows: Section 2 briefs the literature review. Section 3 details the study area and data characteristics. Section 4 discusses the methodological development. Section 5 and Section 6 present the results, discussion, and practical applications. Section 7 concludes with key findings and contributions.

2. Literature Review

This section briefly overviews the relevant literature on bus boarding models, examining the effect of non-normality, nonlinearity of influencing factors, consideration probability, and spatial heterogeneity.

2.1. Non-Normality and Nonlinear Influencing Factors in Bus Boarding Models

The literature suggests that assuming a normal distribution may not adequately predict bus boarding. Various methodologies, such as log-log regression [4,5,12], generalized linear multilevel negative binomial model [13], and log sum regression [14], were employed to address this issue. However, these studies were primarily conducted based on the stop or route-level boardings measured over annual, daily, or hourly periods, possibly due to challenges associated with finer-resolution datasets. Hence, currently, there is a research gap in exploring how to handle non-normality at a fine-grained resolution (run level) of boarding data.
Existing models explored how stop and route characteristics, service characteristics, transit facilities, and land use primarily influenced bus aggregate demand. Stop and route level variables, such as bus route coverage and number of stops, were found to positively influence passenger miles and boarding after log transformation [4,14]. However, factors like upstream overlaps with interacting routes led to competition and negatively affected boarding [5,13]. Regarding service characteristics, headway showed either a linear or logarithmic negative effect on boarding or passenger miles, respectively [4,15]. Additionally, Deepa et al. (2022) captured the diminishing benefit of increasing bus frequency on bus boarding through a piecewise linear model [14]. Log-transformed bus supply enhanced passenger miles, while log-transformed fare and accidents reduced it [4]. For transit facilities, more bus stops increased boarding either linearly [15] or logarithmically [13], whereas more metro stations linearly decreased it [15]. In terms of land use, residential areas exerted a linear positive influence on morning bus boarding, but public service and commercial floor areas had a linear or logarithmic negative impact [14,15]. Alam et al. (2018) showed that log-transformed population demographics, such as neighborhoods with colleges and low-income immigrant populations, positively influenced passenger miles of travel [4]. Thus, the literature shows that influential factors may have a nonlinear impact on bus boarding. Even though log transformation was extensively used to explore such nonlinearity, the other potential nonlinear functions, such as logistic, power transformation, and polynomial functions, remained largely unexplored.

2.2. Inclusion of Consideration Probability into Bus Boarding Model

As the existing bus boarding models focus only on aggregate data of transit users, there is a lack of understanding of how individual inclinations towards considering bus transit can shape bus boarding patterns. Neglecting non-transit users’ behavior can lead to a biased estimation of aggregate demand, and over or underestimate the influence of primary factors on aggregate demand. In simpler terms, commuters who have not used bus transit for a while (a month, three months, or six months), are presumed to not consider bus in their choice set. They might be reluctant to consider buses due to concerns about poor accessibility, lack of information, or poor service [6,7,16]. By improving these factors, the probability of considering buses could increase, and subsequently lead to more commuters actually getting on buses. Hence, further research is needed to explore not only the direct impact of bus consideration probability on bus boarding but also the effect of bus consideration probability on the responsiveness of other variables.

2.3. Advanced Demand Forecasting Models Addressing Spatial Variability

Recent advancements in demand forecasting have introduced deep learning, Bayesian models, and ensemble methods, which provide strong predictive performance. Deep learning techniques, such as long short-term memory (LSTM) networks, capture complex temporal patterns in transportation demand [17]. Bayesian models, like Bayesian model averaging (BMA), incorporate probabilistic approaches to handle uncertainty and enhance forecast accuracy [18]. Ensemble methods further improve reliability by combining multiple models for robust predictions [19]. Despite their predictive strength, spatial models are preferred in this study due to their ability to explicitly capture spatial dependencies and heterogeneity in transit data, which help to understand the unobserved influence of factors such as land use, built environment, and transportation system characteristics on transit demand that global or non-spatial models may overlook.
Commonly used spatial models include spatial lag or autoregressive (SAR) model, spatial error (SEM) model, spatial Durbin model (SDM), and geographically weighted regression (GWR) [9]. Each model captures different aspects of spatial relationships. SAR model captures the influence of direct neighbors; SEM captures exogenous interactions through error terms, whereas SDM permits spatial state dependence between dependent and independent variables. In contrast, GWR captures spatial heterogeneity across observations by allowing coefficients to vary locally. By mapping spatial variation in localized coefficient estimates of different explanatory factors, GWR helps decision-makers to implement location-specific policy actions, unlike SAR, SEM, and SDM, which assume uniform coefficients across space with spatially lag term.
However, GWR is computationally demanding, which makes it challenging to apply it to large datasets. Consequently, GWR was found to be primarily utilized to study spatial heterogeneity in factors affecting rail transit demand, rather than to forecast bus boarding. This is likely because bus networks and datasets are larger and more complex than those of rail systems. Shakespeare and Srinivasan (2024) found that factors influencing transit demand, such as transit frequency, the percentage of Hispanic residents, and rent burden, vary significantly across neighborhoods [20]. They employed hierarchical linear regression models to analyze these relationships but did not explore neighborhood contributions using methods like GWR.
GWR could provide more precise insights into spatial variations to better understand how different factors influence transit ridership in different areas. For example, Ma et al. (2018) showed that the presence of bus stops significantly increase transit ridership in large residential, business, and hub areas beyond the fourth ring road in Beijing, China, with some high-demand areas extending beyond the fifth ring road in business clusters. Residential buildings also influence ridership, with higher values in suburban areas near subway lines (where buses facilitate transfers) and in downtown areas (where people commute short distances) [21]. Wang et al. (2022) demonstrated that in Xi’an, China, suburban stations with higher population density and commercial facilities experienced increased ridership, while stations in the northeast with low land-use density benefited from strong bus connections [22]. Chen et al. (2019) observed in Nanjing, China, that metro ridership is higher near parks and plazas, with more people alighting in the morning. They also highlighted that residential, business, and workplace areas have a strong impact on ridership, especially in the CBD where commercial and office developments increase metro use [23]. Tu et al. (2018) emphasized that metro accessibility is crucial for metro usage in densely populated urban areas, but bus accessibility plays a key role in suburban ridership in Shenzhen, China [24]. Hence, these factors influence transit demand differently based on geographic and infrastructural context. However, using linear functional forms for all these influential factors limits these studies in determining the optimal number of facilities needed to maximize transit usage, particularly for bus ridership.

2.4. Gaps in the Earlier Literature

Table 1 summarizes the gaps identified in the earlier literature. Although some global models attempt to capture the nonlinear relationships between bus boarding and its influencing factors, they often rely heavily on log transformation, which can be restrictive. Moreover, these global models apply uniform coefficients across regions, and cannot estimate the location-specific importance of influencing factors. Previous GWR studies offer insights into which factors are crucial in specific locations. However, they cannot assess the quantum of improvement needed in these factors adequately to enhance ridership due to the linearity assumption. To effectively enhance ridership through targeted policies, it is not sufficient to identify areas for improvement but also essential to determine the required magnitude of these improvements. Furthermore, even though past models focus on boarding data at the stop or route level across various timeframes, they do not thoroughly investigate boarding at the run level, likely due to complexities arising from finer resolution datasets. Additionally, current models for aggregated bus transit demand may exhibit self-selection bias, as they only focus on data from current bus users, overlooking the impact of incorporating bus consideration probability on the sensitivity of other variables. In light of these literature gaps, a more robust approach is necessary that can provide realistic and effective insights into bus boarding models. This approach should simultaneously account for nonlinearity and spatial variations, while also accounting for both the bus users’ and non-users’ behavior.

3. Study Area and Data Characteristics

3.1. Study Area

This paper combines both aggregate and disaggregate data to analyze bus boarding on selected routes from Chennai City, the capital of Tamil Nadu, and India’s fourth-largest urban agglomeration. Chennai is chosen as the study area (Figure 1b) for being a metropolitan city with a diverse population and a well-connected mass transit system, including a fleet of about 3500 buses operated by Metropolitan Transport Corporation Ltd. (MTC), Chennai, India’s one of the oldest suburban railways, and a newly constructed metro network with two routes and 32 stations. Despite these transit options, Chennai has over 5 million private vehicles and the highest number of two-wheelers across different states in the country [25]. Consequently, an efficient bus transit system is crucial to address the city’s growing demand and promote sustainable urban mobility.
Four MTC bus routes (M70, 519, D70, M1) were selected, spanning 39 bus stages (groups of stops with uniform fares) in this study based on their geographic coverage, linking urban, suburban, and peri-urban regions (Figure 1). These routes serve major transit hubs like Broadway, CMBT bus terminus, and key attractions such as T Nagar, Broadway, Saidapet, and Egmore. The sample routes’ average daily ridership (4816) closely matches the overall average (4789) across all 691 MTC routes operating in Chennai city [26]. Thus, these four routes are considered representative of all MTC routes. A bus stop is defined as a designated location where passengers board or alight from a bus, whereas a bus stage comprises a sequence of consecutive stops that fall within a uniform fare zone. Since electronic ticketing machines (ETMs) issue tickets at the stage level rather than for individual stops, this study is conducted at the stage level to align with the ETM fare system and passenger boarding data.

3.2. Description of Aggregate Data

Electronic ticketing machine (ETM) data for weekdays from 6 a.m. to 10 p.m. from the four selected MTC bus routes are used to develop the bus boarding model at the run-stage level. ETMs issue tickets based on stages (groups of stops with uniform fares), and the records are used to derive passenger boarding and alighting counts at each stage during each run. Additional details recorded include the date, trip start time, service type, route, and trip direction. MTC offers three different bus services in Chennai: ordinary (cheapest), deluxe (most expensive and comfortable), and express.
Figure 1c illustrates the spatial variation in average daily boardings and the number of bus stations within a 500 m buffer for each stage along bus routes. Bus stages are classified as low (yellow, <33rd percentile), medium (red, 33rd–67th percentile), and high (blue, >67th percentile) based on both boarding and bus stop density values. Each bus stage location is represented by a circular marker for boarding classification and a square marker for bus stop density classification.
Urban areas have a higher average boarding level (blue) compared to medium (red) in suburban and lower (yellow) boardings in peri-urban areas. Suburban and peri-urban areas generally have fewer bus stops than urban areas, although this trend cannot be generalized everywhere. For instance, urban stages like Thirumanga and Concorde have lower boarding despite having many bus stops. Conversely, peri-urban stages like Kelambakkam and Thirupporur experience medium to high boarding despite having few bus stops. Hence, this spatial variation highlights the need for a spatial model to capture the effect of unobserved factors and neighborhoods affecting bus boarding patterns.
Boarding per run per stage is the dependent variable considered in this study. Independent variables, such as stage characteristics (SC), run characteristics (RNC), service type (ST), trip direction (TD), day of the week (DW), and time window of one hour (TW), are derived from the ETM data. Under route characteristics (RTC), observed headway is computed from ETM data, while deviation between bus and driving route distance and scheduled headway are obtained from Google Maps and the MTC website, respectively. Land-use characteristics (LC) are extracted from OpenStreetMaps. Table 2 provides descriptive statistics for all variables.

3.3. Description of Disaggregate Data

Disaggregate data collected from a face-to-face household survey of 803 work commuters in Chennai show that the average age is 35 years in the sample. The average monthly household income is INR 28,000, and the sample comprises 82% male workers. These demographics align with other Chennai studies [27,28]. The survey captures commuting modes and respondent characteristics over a three-month period. Individuals who used the bus for work commutes at least once in the last three months are assumed to consider the bus. Approximately 53% of commuters considered buses in this sample. The independent variables used in this study are categorized into four groups, as detailed in Table 3. Most of the variables within accessibility (ACC), network integration and information (NII), transit service characteristics (TSC), and personal and work characteristics (PWC) are collected using a Likert scale. Local and collector street density near the respondents’ homes is determined by the ratio of the area of these streets within a 1 km buffer to the total road area within the buffer region, sourced from OpenStreetMap data. Additionally, access time by walking to the nearest train station is calculated using Google Maps. Table 3 presents the relative frequency of each categorical variable. A bus consideration model is developed using all independent variables from Table 3. This predicted consideration probability is then used as an instrumental variable in the bus boarding model.

4. Methodology and Model Formulation

The proposed model formulation involves the following steps (Figure 2): (i) transforming boarding data for normality, (ii) applying nonlinear forms of key influencing factors, (iii) incorporating consideration probability, and (iv) deriving mode-specific spatial impedance metrics.

4.1. Transformation of the Response Variable for Normality

The response variable ( Y r s ) in this paper is boarding per run (r) per stage (s). Logarithmic transformation is commonly used in the literature [4,12,14]. Therefore,   Y r s is transformed using both logarithmic and power functions, with the optimal power parameter determined through the Box-Cox transformation, to evaluate the empirical distribution of the data (Equation (1)). Equation (2) presents the boarding model, where β 0 is the intercept, β 1 to β 8 are coefficients of the influencing factors (X) listed in Table 2, and ε s r is the error term with a zero mean. The best model is determined by goodness-of-fit metrics such as adjusted R2 and mean absolute error (MAE), and also the significance, signs, and magnitude of key influencing factors’ coefficients (β).
Y r s = Y r s ,   i n   o r i g i n a l   s c a l e   u s e d   i n   O L S   a s   B e n c h m a r k   log Y r s ,   f o r   l o g   t r a n s f o r m a t i o n   ( B o x C o x   w i t h   λ = 0 ) Y r s λ ,   f o r   p o w e r   t r a n s f o r m a t i o n   ( B o x C o x   w i t h   λ 0 )
Y r s = β 0 + β 1 S C s + β 2 R T C r t + β 3 R N C r + β 4 L C s + β 5 S T r + β 6 T D r + β 7 D W r + β 8 T W r + ε s r  

4.2. Nonlinear Functional Forms for the Influencing Factors

Nonlinear transformations such as log, logistic, square root, cube root, spline, and polynomial are applied to the key influencing factors (X) with continuous variables, as defined in Table 2. These transformations are used to capture the practical complexity of the relationship between   Y r s and its key influencing factors. The model’s adjusted R2, MAE, and Y s r versus βx plot are used to determine the most appropriate function, denoted by f in Equation (3). The final transformation is selected to ensure the best predictive accuracy with easy interpretability and practical decision-making for transit operators by reflecting the real-world relationship between bus boardings per run per stage and its influencing factors.
Y r s = β 0 + β 1 f 1 ( S C s ) + β 2 f 2 ( R T C r t ) + β 3 f 3 ( R N C r ) + β 4 f 4 ( L C s ) + β 5 S T r + β 6 T D r + β 7 D W r + β 8 T W r + ε s r  

4.3. Inclusion of Bus Consideration Probability

A global binary logit model predicts the probability of bus consideration ( Y c o n _ n ) for nth individual (Equation (4)). The utility of bus consideration ( U c o n _ n ) comprises a systematic component V c o n _ n and random component ϵ c o n _ n (Equation (5)). V c o n _ n is explained by four sets of variables defined in Section 3 (Table 3), and γ denotes the set of coefficient vectors.
Y c o n _ n = 1 ,   i f   b u s   i s   c o n s i d e r e d 0 ,   o t h e r w i s e    
U c o n _ n = V c o n _ n + ϵ c o n _ n = γ 0 + γ 1 A C C n + γ 2 N I I n + γ 3 T S C n + γ 4 P W C n + ϵ c o n _ n
The corresponding log-likelihood to estimate γ:
L L = 1 n Y c o n _ n log P Y c o n _ n = 1 + ( 1 Y c o n _ n ) log P Y c o n _ n = 0  
Disaggregate consideration probability is used as an instrumental variable in the boarding model to overcome self-selection bias and capture the effect of all individuals’ perceptions and behaviors regardless of bus usage (Equation (7)). This instrumental variable (   P s ¯ ) is the average of predicted bus consideration probabilities only among those individuals who consider buses within a 500 m buffer around their home or work locations.
P s ¯ Y c o n = 1 = 1 n δ s n P Y c o n _ n = 1 n
where δ s n =   1 ,   i f   i n d i v i d u a l   n   b e l o n g s   t o   500   m   b u f f e r   o f   b u s   s t a g e s 0 ,   o t h e r w i s e   .

4.4. Mode and Distance-Specific Spatial Impedance Metric

Global regression models cannot effectively capture spatial variation because they assign uniform coefficients across the entire study area. In contrast, the GWR model addresses this limitation by computing geographical weights ( w i j ) from neighboring data points to estimate coefficients for the k t h independent variable of a reference data point i at location u i , v i (Equation (8)). In this way, GWR addresses spatial heterogeneity and autocorrelation. In the proposed GWNR, the best transformation of the dependent variable ( Y r s i ) (log vs. power) is used with the independent variables listed in Table 2 along with the average predicted bus consideration probability (   P s ¯ ).
Y r s i = β 0 ( u i ,   v i ) +   β 1 u i ,   v i f 1 ( S C s i ) + β 2 u i ,   v i f 2 ( R T C r t i ) + β 3 u i ,   v i f 3 ( R N C r i ) + β 4 u i ,   v i f 4 ( L C s i ) +   β 5 u i ,   v i S T r i + β 6 u i ,   v i T D r i + β 7 u i ,   v i D W r i + β 8 u i ,   v i T W r i + β 9 u i ,   v i P s i ¯ Y c o n = 1 +   ε s r i
Calibration of this model involves optimizing the bandwidth, i.e., determining the extent of neighbor contributions to the estimation of the ith data point. Weights follow the distance–decay concept, and kernel functions quantify the contribution of neighboring data points to coefficient estimation. This allows for a local calibration of the regression model by moving the regression point spatially. The matrix β ^ u i , v i estimates the GWR parameter at location u i , v i (Equation (9)).
β ^ u i ,   v i = x t w u i ,   v i x 1 x t w u i ,   v i
Four kernel functions namely fixed Gaussian, fixed bi-square, adaptive Gaussian, and adaptive bi-square, are commonly used for differential weighting. The adaptive kernel adjusts its bandwidth size θ i ( K N N ) based on data point density, using the K t h nearest neighbor distance, while the fixed kernel maintains a constant bandwidth regardless of data density. The bi-square function employs a clear cut-off for neighboring contributions, whereas Gaussian function considers contributions from all neighbors with a diminishing effect based on distance. In the bandwidth selection process, neighboring data points are chosen based on the distance-decay concept within a defined range. If all data points are included, the model behaves like a global model. Conversely, if too few data points are included, the estimation may not converge properly. In this study, bandwidth selection is based on the cross-validation minimization technique, which identifies the lowest prediction error. Among the kernels tested, the adaptive Gaussian performed the best, and its adaptability has also been noted in previous studies [29].
A d a p t i v e   G a u s s i a n = e x p d ij 2 θ i ( KNN ) 2 2
Traditional GWR often uses Euclidean distance for geographical weights in regression models, which may oversimplify real-world impediments and neglect physical barriers [9]. Although some studies use metrics such as Minkowski, Manhattan, or shortest path distances [23,30], these often overlook actual geographic distances and mode feasibility for accessing bus stops. Walking may be more practical for shorter distances than driving, and pedestrian network distance may differ from driving network distance. Therefore, neighborhood contributions to estimation can also vary based on mode feasibility. In the proposed GWNR framework, the distance ( d i j ) between data points i and j (Equation (10)) is assessed as network distance based on mode feasibility to offer a more realistic representation of spatial heterogeneity and autocorrelation. The household survey in Chennai revealed that the 75th percentile access distance from home to bus stop for pedestrians is 0.75 km. Therefore, for distances under 0.75 km, the pedestrian network (local and collector streets) is used, while the driving network is applied for longer distances (Equation (11)). The weight contributed by neighbors in the estimation of reference points varies based on this mode-specific distance matrix.
d i j = Euclidean   distance   between   i   and   j If   driving   distance < 0.75   km ,   walking   distance ,   else   driving   distance   between   i   and   j

4.5. Model Execution

A runtime analysis was conducted to evaluate the computational feasibility of the GWNR model, as shown in Figure 3. The model was tested on a system with an Intel Core i7 (2.92 GHz, x64), 16 GB RAM, running Windows 11, using Python 3.9.13. The results show that runtime increases with dataset size but remains within practical limits even for large data sets and offline applications. With a multi-threaded setup using 16 threads, the model’s total runtime increased with dataset size: approximately 10 min for 5000 observations, 40 min for 15,000, 100 min for 25,000, and 300 min for 35,000 observations. The computational load increases primarily due to localized parameter estimation. However, the ‘SearchGWRParameter’ function facilitates parallel processing techniques using multi-threading and optimizes bandwidth, which significantly reduces computation time. Thus, even if GWNR is computationally intensive, it remains a feasible approach for larger transit networks with appropriate optimizations.
The above-discussed advancements in GWNR methodology are summarized in Figure 2. These advancements can enhance the responsiveness of the proposed boarding model to both aggregate and disaggregate characteristics and jointly address nonlinearity and spatial heterogeneity. The subsequent section explores the results and inferences drawn from this model.

5. Results and Discussion

5.1. Effect of Non-Normality of Boarding

Logarithmic and power transformations of bus boarding per run per stage were tested to evaluate empirical distribution and model fit. The Box-Cox transformation identified an optimal exponent (λ) of 0.494, which was rounded to 0.5 (square root) for ease of back-transformation. Among the tested transformations, the square root (   Y r s ) provided the best model fit, ensuring residual normality and achieving the highest adjusted R2 (0.57), lowest MAE (2.22), and RMSE (3.32). This outperformed both the original scale   Y r s (adjusted R2 = 0.54, MAE = 2.31, RMSE = 3.40) and the logarithmic scale l o g (   Y r s ) (adjusted R2 = 0.35, MAE = 3.61, RMSE = 4.20). Additionally, Q-Q plots and density plots of   Y r s confirm improved residual normality, with closer alignment to theoretical quantiles and more symmetrical density curves, validating this choice of transformation (   Y r s ) as the most suitable.
Figure 4 compares the normality and goodness-of-fit of six OLS global models in original versus square root scale, using Q-Q plots and density plots of standardized residuals. The Q-Q plots show how closely the standardized residuals follow a theoretical normal distribution, with better fits indicated by the yellow points/line closer to the red dashed line. The overlaid blue density curves on the histograms illustrate the standardized residuals’ distribution, with a bell-shaped curve indicating normality.
The plots indicate that models using the square root transformation of the response variable   Y r s (bus boardings per run per stage) generally produce residuals that follow a normal distribution much more closely when compared to models using the original   Y r s . This is evident from improved alignment with theoretical quantiles and more symmetrical density curves in the bottom row plots. Specifically, the nonlinear model employing   Y r s (Figure 4f) achieves the highest adjusted R2 (0.59), lowest MAE (2.16), and RMSE (3.23), indicating the best fit and most accurate predictions among all OLS global models.
The adjusted R2 improved by 9.3% in the proposed nonlinear OLS model (0.59 in Figure 4f) from the traditional OLS model (0.54 in Figure 4a), indicating a substantially better fit. This shows that transforming the response variable, applying nonlinear functional forms, and including consideration probability as an instrumental variable can significantly improve model performance and normality of residuals with more robust statistical inferences.
From a behavioral perspective, further analysis reveals that the deviation between scheduled and observed headways ( S C H _ H W O B S _ H W ) s a m e   r o u t e becomes significant at a 90% confidence level after applying a square root transformation to Y r s , unlike the original scale (Table 4). Additionally, applying nonlinear functional forms to explanatory variables and incorporating consideration probability makes the influence of some key explanatory variables, such as the number of bus stops and train stations, more logical and practically relevant, as discussed in the following Section 5.2. The key results from all models presented in Figure 5 are summarized in Table 4. To improve clarity and highlight significant improvements after transformation, the summary table presents only the most influential coefficients along with their interpretations. This summary table provides a clearer insight into the key relationships identified in the model, and the full numerical details are available in the Appendix A (Table A1).

5.2. Effect of Nonlinear Influencing Factors on Bus Boarding

Various nonlinear transformations were tested to find the best fit for key continuous independent variables, including logarithmic, logistic, square root, cubic root, quadratic, cubic, and spline functions, as summarized in Table A2 in the Appendix A. The final transformation for each variable was selected primarily based on adjusted R2 and MAE, ensuring logical reasoning and practical decision-making for transit operators also played a role in choosing transformations that accurately represent the relationship between bus boardings per run per stage and its influencing factors.
For example, although both the cubic and spline transformations provided the best fit, the cubic transformation was chosen instead of the spline function for scheduled headway. The cubic function provided a better fit with a simpler mathematical form, making interpretation easier. Spline functions can introduce unnecessary complexity because they allow for sudden shifts in the relationship between variables. The cubic transformation avoids this issue by ensuring smooth and continuous behavior. The cubic function also allows transit agencies to find optimal values by setting the second derivative to zero. This helps to identify points where changes in headway have the greatest impact on boardings.

5.2.1. Nonlinear Effect of Scheduled Headway on Boarding

In Figure 5a, the linear function of scheduled headway S C H H W s a m e   r o u t e shows a negative impact on boarding per run per stage. The dark red dot indicates higher boarding values, and a yellowish-red dot indicates lower values, suggesting an impractical continuous increase in boarding as more buses arrive. In reality, with more buses, passengers tend to be distributed themselves more evenly across routes or services to avoid overcrowding, while too few buses can increase waiting times and potential mode shifts. Conversely, the cubic polynomial function provides a more realistic interpretation, showing an initial increase in boarding per run per stage (greenish-purple to dark purple dots in Figure 5a) with increased headway up to an optimal 7 min. Beyond this point, boarding declines for the given dataset. This pattern aligns with past studies by Alam et al. (2018) and Deepa et al. (2022), who found negative effects of bus headway on boarding or passenger miles, and a decrease in boarding benefits at high-frequency levels [4,14]. However, this study’s proposed cubic polynomial function finds an optimal scheduled headway of 7 min at the run level, which is not highlighted in past studies.

5.2.2. Nonlinear Effect of Headway Deviation on Boarding

When buses arrive earlier than scheduled, i.e., observed headway is shorter than scheduled headway ( S C H _ H W O B S _ H W ) s a m e   r o u t e , it can be beneficial up to a point. Early arrivals can accommodate passengers who missed the previous bus or arrived ahead of schedule. However, if buses arrive too early, boarding per run per stage may decrease due to fewer waiting passengers. In Figure 5b, the red dots represent the linear function, which misleadingly indicates a positive effect on boarding. In contrast, the polynomial function shows that an early arrival of up to 7 min (Figure 5b) can increase boarding (dark purple). Beyond this limit, a sharp decline in boarding occurs (greenish purple) for the given dataset. Thus, the cubic polynomial function is more practical as it identifies the optimal headway to enhance boarding.

5.3. Effect of Bus Consideration Probability on Bus Boarding

A binary logit bus consideration model with an AICc of 1012 and a ρ2 of 0.13 is developed, with coefficients presented in Table A3 in the Appendix A. A detailed interpretation of these coefficients is provided by Roy et al. (2024) [7]. In the square root nonlinear bus boarding model, the average predicted consideration probability from the binary logit model (within each stage’s 500 m buffer) is used as an instrumental variable. Its significant positive coefficient (0.70) indicates a beneficial impact on bus boarding (Table 4). This highlights the self-selection bias in the boarding model developed without this instrumental variable. Another notable difference between the models with (purple dots) and without (red dots) consideration probability is the relationship between land-use factors and boarding (Figure 6). The coefficients of the following two factors behave differently under each approach.

5.3.1. Effect of Bus Station Count Within a Buffer Area

Bus boarding per run per stage is expected to increase with the count of bus stations ( B S s ) upto a certain limit, after which passengers disperse to other nearby stops to avoid overcrowding. However, the boarding model without consideration probability produces an unrealistic decreasing trend. In contrast, the model incorporating consideration probability shows an increase in boarding (dark purple) with up to four bus stations within a 500 m buffer of a stage (Figure 6a), after which boarding declines (greenish purple). Previous research found a positive effect of the bus stops within a buffer on stop-route level hourly or daily boarding [13,22]. However, the proposed model stands out by focusing on run-stage level boarding results, showing different passenger distribution patterns beyond a certain number of bus stops.

5.3.2. Effect of Train Station Count Within a Buffer Area

Train stations within a 500 m buffer ( T S s ) can either complement or compete with bus services. Rayaprolu and Levinson (2024) found that in Sydney, Australia, trains complement bus services by improving network connectivity, with well-planned transfers enhancing accessibility and reducing travel times [31]. On the other hand, Pei and Wang (2024) stated that in Harbin, China, rail transit competes strongly with conventional buses, attracting more passengers and causing bus ridership to decline. This is because trains have a larger capacity, higher speed, and are not affected by traffic congestion, making them a more convenient choice for many commuters [32]. Therefore, this relationship could be location-specific and is unpredictable beforehand. Despite this variability, there should be an optimal number of train stations to ensure complementarity. The model, without consideration of probability, suggests that bus boarding continuously increases with more train stations, which is illogical (yellowish to dark red dots in Figure 6b). Conversely, the model incorporating probability demonstrates that having one train station within the buffer enhances boarding (dark purple), by acting as a transfer point or increasing accessibility. Beyond this, additional train stations may compete with bus services (greenish purple), as rail systems are generally more reliable in terms of travel time.
The above observations highlight the importance of incorporating consideration probability in boarding models. The consideration-inclusive model more accurately reflects passenger behavior, suggesting optimal numbers of nearby bus and train stations to maximize boardings. Beyond these points, additional stations become less effective or reduce boardings due to passenger dispersion and increased competition from other transit modes.
Incorporating the square root transformation of boarding, nonlinear functional forms of influencing factors, and consideration probability as an instrumental variable makes the bus boarding OLS model statistically robust and behaviorally insightful. Thus, these features are also integrated into the GWR model for a better local behavioral perspective, as discussed below.

5.4. Local Effect of Influential Factors on Bus Boarding in GWR

Unlike past studies, the proposed model incorporates both pedestrian and driving networks to determine the distance metric in GWR. Specifically, a threshold of 0.75 km is used to select the network used to access bus stops. For distances under 0.75 km, the pedestrian network is used, while for longer distances, the driving network is applied. Despite the weight variation in neighbors’ contributions, the GWR model with Euclidean distance and the mode-specific network distance model produce nearly equivalent results due to the minimal difference between Euclidean and network distances in the study area. Similar findings are also reported by Guo and Bhat (2007) and Schirmer et al. (2014) [10,11]. Nevertheless, since using network distance is generally correct practice to better reflect real-world spatial heterogeneity compared to Euclidean distance, this study uses mode-specific network distance.
The proposed GWR model (   Y r s with nonlinear x, probability consideration, and mode-specific network distance) substantially improves goodness-of-fit compared to traditional OLS, proposed nonlinear OLS (   Y r s with nonlinear x and probability consideration), and traditional GWR (Figure 7). MAE values (shown as red dots) decrease from 2.31 in traditional OLS to 1.96 in the proposed GWR, indicating a 15% improvement. Adjusted R2 (shown as blue squares) increases from 0.54 in traditional OLS to 0.64 in the proposed GWR, reflecting an 18.5% enhancement.

Effect of Influential Factors on Bus Boarding

Table 4 presents the results of all the developed OLS and GWR bus boarding models, incorporating both linear and nonlinear functional forms. Among these, the proposed GWR model, i.e., non-Euclidean GWR—nonlinear x and consideration with   Y r s —shows superior goodness-of-fit compared to the other models in this study. The coefficients of influential variables derived from this model are analyzed to examine their impact on bus boarding per run per stage along bus routes in Chennai, India. Figure 8, Figure 9, Figure 10 and Figure 11 represent key influencing variables’ normalized coefficients (β), which indicate the strength of each variable’s effect, scaled from 0 to 1 based on their minimum and maximum values with a 90% confidence level (|t-values| ≥ 1.64). These β values are shown in different colors on spatial maps using QGIS: yellow for weak effects (0–0.25), green for moderate effects (0.26–0.50), sky blue for strong effects (0.51–0.75), and royal blue for very strong effects (0.76–1). Linear or binary influential variables exhibit positive or negative effects, seen in the green and red striped areas in Figure 9 and Figure 11. This behavior is typically observed in traditional GWR models. However, nonlinear influential variables have not been explored in past GWR studies. In this paper, Figure 8 and Figure 10 illustrate that the effects of these variables on bus boarding are not straightforwardly positive or negative. Instead, they have optimal values in a particular range, beyond which their effects change. These optimal values can vary by location. Even if the optimal values are the same across locations, their impact on boarding can vary across different places; in some places, the impact is high, while in others, it is not as significant.
Among stage characteristics, the starting stage positively impacts boarding, while the ending stage negatively influences, as expected. The cubic polynomial form of the stage index indicates higher boarding at stages near transit hubs or major trip production points and lower boarding at stages with fewer production points, which is intuitive. Thus, the ‘optimal’ stage index can guide strategic planning for station amenities to maximize boarding levels.
Route characteristics, such as the deviation between bus and driving route distances affect boarding, but not always negatively as the linear model shows. High deviation can enhance boarding when routes traverse dense land use or major attractions. A quadratic function better explains these scenarios. Figure 8a shows that nonlinearity in scheduled headway reveals excessively frequent bus arrivals lead to sparse passenger boarding per run per stage, while overly infrequent arrivals result in prolonged waiting times. Polynomial functions identify an optimal headway that balances these extremes. Bus stages farther from the CBD, with higher β values (royal blue), are more affected by bus frequency due to reliance on buses in the absence of rail services. Figure 8(a.1) shows boarding peaks at a headway of 6 to 7 min at some stages (Kandanchavadi, SRP Tools, Taramani, and Velachery) along routes 519, M1, D70, and M70, respectively. However, suburban stages like Kandanchavadi and SRP Tools experience a higher boarding increase with headway compared to stages closer to the CBD.
Run characteristics, such as high occupancy at the previous stage and trip influence boarding positively in a linear model, may not always hold. People may choose different transport modes or wait for the next bus depending on their schedules, flexibility, and level of comfort. The polynomial function captures these practical interpretations better than the linear ones. Figure 8b shows stronger effects of occupancy at the previous stage on bus boarding in peri-urban and suburban areas (royal and sky blue). In Figure 8(b.1), SRP Tools on route M1 near the CBD shows peak boarding with previous stage occupancy of around 40 passengers. However, stages farther from the CBD, like Thoraipakkam, Kumaran Nagar, and Kelambakkam on route 519, show similar patterns with lower optimal values of 20 to 30 passengers, indicating peri-urban and suburban areas are more sensitive to overcrowding due to longer commutes to the CBD.
Cost sensitivity among bus passengers is evident as the cheapest ordinary bus service experiences the highest boarding levels, while the most expensive deluxe service has the least boarding. The express bus service boardings fall between these extremes. Figure 9a shows that ordinary bus services positively impact (green-striped areas) boarding everywhere, especially in CBD areas (royal and sky blue). Conversely, Figure 9b indicates that deluxe bus services have a stronger negative effect (red-striped) than ordinary services, particularly in suburban and urban areas. This trend suggests that passengers in these regions are more cost-sensitive and less inclined to choose premium services. Figure 9c shows that trips toward the CBD strongly increase boarding in peri-urban and suburban areas, as the CBD is a major destination with a high concentration of employment, commercial activities, and amenities.
Figure 10a shows a strong local nonlinear effect of bus stations within 500 m on boarding in suburban and peri-urban areas that especially rely on buses. The plots in Figure 10(a.1) indicate that having too many nearby bus stops may not significantly increase boarding per run per stage, as passengers distribute themselves to avoid overcrowding, with boarding maximizing at up to four bus stations. Figure 10(b.1) shows that too many nearby train stations may reduce bus boardings due to competition, but up to one train station within 500 m can increase boardings due to integrated transport networks.
Regarding land use, sparse areas negatively affect boarding, whereas denser areas have a positive effect (Figure 11a,b). These effects are more pronounced in peri-urban areas, where land use is relatively sparser than the CBD.
Consideration probability consistently shows a strong positive effect on bus boarding across most locations (Figure 11c). This indicates that stages with higher consideration probabilities experience more bus boarding, likely due to increased public awareness and preference for these stages. This highlights the importance of taking into account both bus users’ and non-users’ behavior in the bus boarding prediction.
Among these selected routes, Wednesdays show the most significant negative impact on boarding compared to Fridays, possibly due to increased trips anticipated before the weekend. The peak hours from 7–9 a.m. and 4–8 p.m. have the highest positive effect on boarding.

6. Practical Applications

As discussed in Section 5.4, the proposed GWNR model can prioritize intervention locations with higher coefficients and identify the quantum of improvements needed to enhance bus ridership using polynomial functions. It finds facility requirements in prioritized locations with suboptimal facilities, which provides targeted policy guidance. In contrast, the proposed nonlinear OLS can determine the necessary improvements without prioritizing locations, while traditional GWR can prioritize locations but cannot identify the needed improvements; traditional OLS can do neither. This section compares the ridership gains from interventions aimed at improving bus scheduled headway using the nonlinear OLS and GWNR models to those achieved with traditional models.
Here, ridership refers to the total number of passengers boarding across all stages of a bus run. The policy intervention process is simulated by improving the scheduled headway of sample bus stages using the following mathematical calculations.
(a) Traditional OLS Model: In this model, the linear functional form of scheduled headway can find an extreme solution (maximum or minimum) within budget constraints based on the coefficient’s sign. However, it cannot determine the exact optimal headway value or prioritize improvement locations to maximize ridership. Moreover, regardless of location, it consistently shows a negative coefficient, which suggests increased bus boarding per run per stage with continuous reductions in scheduled headway, whereas, in practice, continuous reductions in headway can lead to frequent empty bus runs, causing significant losses for transit operators. Therefore, this policy analysis sets the headway reduction threshold at the 85th percentile of scheduled headways (i.e., 10 min per hour) in the study dataset. This 10-min headway may not provide the maximum benefit, but since the model does not provide any optimal value, this value is chosen within the range of feasible options. Fourteen out of 55 bus stages along sample bus routes have headways exceeding this threshold (shown in green dots in Figure 12a), and headways at these stages are subsequently improved by a 2-min reduction to meet the threshold limit.
(b) Traditional GWR Model: This model uses a linear functional form for scheduled headway and generates different coefficient values across locations. It also cannot find an optimal value but can prioritize improvement locations based on higher coefficient values, which indicates a stronger influence of scheduled headway on bus boardings at those locations. Similar to traditional OLS, in this model as well the headway reduction threshold is set at the 85th percentile of scheduled headways (i.e., 10 min per hour) in the study dataset. Thirteen bus stages along sample bus routes have headways exceeding this threshold (highlighted as green dots in Figure 12b), and headways at these stages are subsequently decreased by 2 min in priority order to meet the threshold limit.
(c) Proposed Nonlinear OLS Model: This model employs a cubic polynomial functional form of scheduled headway to find an optimal value providing maximum benefit. However, since it generates consistent coefficients across all locations, it cannot prioritize improvement locations to maximize ridership. Regardless of location, it identifies an optimal scheduled headway of 7 min per hour. If headway is reduced below this threshold, it leads to frequent empty bus runs, resulting in significant losses for transit operators. Conversely, increasing headway above 7 min creates prolonged passenger waiting times and can lead to mode shifts to alternative travel modes, also causing losses for bus transit operators. Therefore, the headway reduction threshold is set at 7 min per hour. Forty-two out of 55 bus stages along sample bus routes have headways exceeding this threshold, and headways at these stages are subsequently improved by reducing them by 2, 3, and 5 min (shown as green, blue, and red dots, respectively, in Figure 12c) to meet the threshold limit.
(d) Proposed GWNR Model: This model uses a cubic polynomial functional form of scheduled headway to find an optimal value and generates different coefficient values across locations, allowing it to prioritize improvement locations based on higher coefficients. Therefore, the headway reduction threshold is set at this optimal value. Locations with a very strong influence on scheduled headway (normalized β > 0.75) are prioritized for improvement. Fifteen priority bus stages along sample bus routes have headways exceeding their location-specific threshold, and headways at these stages are improved by reducing them by 2, 3, and 5 min (shown as green, blue, and red dots, respectively, in Figure 12d). This model provides a location-specific quantum of improvements, such as reducing the scheduled headway at ‘Taramani’ by 2 min for route M1 but by 5 min for route M70.
Bus boardings per run per stage are calculated using all four models, i.e., traditional OLS, traditional GWR, proposed OLS, and proposed GWR, with their respective improved headway values and original coefficients. Then, ridership per run is calculated, followed by the average ridership per hour per day. Similarly, the frequency of bus service is calculated from the scheduled headway before and after improvement for each model to determine how many buses are needed per hour per day. The bus frequency improvement ratio is calculated as follows:
B u s   F r e q u e n c y   I m p r o v e m e n t   R a t i o = T o t a l   i n c r e a s e   i n   b u s   r u n s   p e r   d a y   a f t e r   i m p r o v e m e n t T o t a l   b u s   r u n s   p e r   d a y   b e f o r e   i m p r o v e m e n t
The bus ridership improvement ratio is calculated as:
B u s   R i d e r s h i p   I m p r o v e m e n t   R a t i o = T o t a l   i n c r e a s e   i n   r i d e r s h i p   p e r   d a y   a f t e r   i m p r o v e m e n t T o t a l   r i d e r s h i p   p e r   d a y   b e f o r e   i m p r o v e m e n t
Finally, the ridership gain index is calculated as:
R i d e r s h i p   G a i n   I n d e x = B u s   R i d e r s h i p   I m p r o v e m e n t   R a t i o B u s   F r e q u e n c y   I m p r o v e m e n t   R a t i o
Table 5 demonstrates the superior performance of the proposed GWNR model compared to both traditional models and the proposed OLS model. When traditional OLS and GWR models produce low ridership gain indices (0.147 and 0.206, respectively), the proposed models show substantial enhancements. The proposed OLS model achieves a notable ridership gain index of 3.552, while the proposed GWNR model achieves almost twice that, at 6.110. Hence, targeted interventions obtained from the proposed GWNR become twice as effective compared to those from traditional regression for maximizing bus ridership gains. In this study, the effectiveness of service planning policies is measured using the Ridership Gain Index, which evaluates how efficiently service enhancements translate into increased ridership. Transit agencies can utilize GWNR predictions to adjust bus schedules, improving service efficiency and promoting greater public transit usage, ultimately contributing to sustainable urban mobility.
Nonetheless, implementing these recommendations in real-world transit operations is not a standalone action. Adjusting scheduled headways for specific routes requires a holistic approach that accounts for multiple interdependent factors. For instance, operational constraints such as depot capacities, driver shift regulations, and fleet availability can limit how much a schedule can be altered without disrupting the overall system’s service reliability. Additionally, interactions between different routes play a crucial role because some routes may act as complements (facilitating seamless transfers) or competitors (serving overlapping demand), which may lead to passenger demand fluctuations. Given these complexities, transit agencies must assess the broader network-wide impacts before implementing service changes. Future research can explore scenario-based feedback mechanisms or simulation-driven approaches to understand the comprehensive effects of adjustments across the entire transit network in real-world applications.

7. Conclusions

This paper evaluates the role of nonlinearity and spatial heterogeneity in bus boarding in terms of prediction accuracy, behavioral relationships, and policy implications. A novel geographically weighted nonlinear regression (GWNR) model is developed to predict bus boardings at the run-stage level. This proposed model integrates aggregate ETM data from four MTC bus routes with disaggregate household survey data in Chennai to effectively capture both the bus users’ and non-bus users’ behavior. The proposed GWNR model improves the adjusted R-square by 18.5% and the mean absolute error by 15% compared to global linear regression.
This paper addresses all the research questions raised in the introduction section. First, it explores how appropriate functional forms of influencing factors can explain their relationship with bus boarding. A square root transformation ensures normality in the boarding data distribution. Contrary to many studies, this paper applies diverse nonlinear functions (e.g., power, polynomial, log, logistic, spline), among which the polynomial function provides the best fit and realistically depicts the relationship between most of the key influencing factors and bus boarding per run per stage. Linear models unrealistically suggest passenger boarding keeps increasing with more buses, whereas the proposed nonlinear model uses a cubic polynomial function to find an optimal 7-min headway for maximizing boarding along these sample routes. This creates a realistic balance, as more buses distribute passengers evenly to avoid overcrowding, while too few buses increase waiting times and cause people to switch to other travel modes. Such realistic behavioral insights are drawn from the nonlinear functional forms, and the optimal values help evaluate the quantum of improvements needed to enhance bus demand.
To address self-selection bias, the model includes bus consideration probability, predicted from a logit model with disaggregated household survey data, as an instrumental variable. This variable captures non-bus users’ behavior and examines how an individual’s consideration propensity affects the relationship between other factors and bus boarding. Incorporating this probability alters the relationship between land-use factors and boarding, presenting a more realistic pattern: boarding increases with more bus stations initially but declines due to passenger dispersion, while one train station enhances boarding, with additional stations creating competition.
Unlike prior research on GWR, this paper utilizes a mode-specific network distance metric to better represent neighborhood effects, providing area-specific insights into how station, route, run, land-use characteristics, and consideration propensity affect bus boarding. For instance, headways affect stages farther from the CBD due to reliance on bus service in the absence of rail. Previous stage occupancy influences boarding in peri-urban and suburban areas, where commutes to the CBD are longer. The number of nearby bus and train stations affects boarding, especially in suburban or peri-urban areas. Consideration probability consistently enhances boarding everywhere, highlighting the importance of perceived service reliability and convenience.
Finally, this paper compares ridership gains from policy interventions using the proposed GWNR model to those from traditional OLS and GWR, as well as nonlinear OLS models. The GWNR model prioritizes high-coefficient locations for interventions and optimizes the quantum of improvements using polynomial functions in suboptimal areas, providing targeted policy guidance to maximize ridership. Traditional OLS and GWR models achieve low ridership gain indices of 0.147 and 0.206, respectively. In contrast, the proposed nonlinear OLS model reaches a gain index of 3.552, and the GWNR model achieves an impressive 6.110, making interventions nearly twice as effective compared to traditional and global nonlinear methods. This enhanced effectiveness supports sustainability by maximizing ridership through targeted interventions, which can promote more efficient and resilient public transport systems.
Moreover, the proposed GWNR model is transferable to other cities according to its variable specifications, which account for transit infrastructure, land-use characteristics, service supply measures, and sociodemographic factors. The number of bus and train stations within a 500 m buffer of each stage represents transit infrastructure or transit stop density. The number of educational institutes and government offices within the same buffer captures land-use characteristics. Additionally, binary variables indicating whether the density of points of interest within a 500 m buffer is sparse or dense help differentiate low-density and high-density areas. Furthermore, scheduled headway for a specific route and stage-level bus inter-arrival times reflects service supply measures. The average bus consideration probability within a 500 m buffer of the current stage accounts for sociodemographic variations such as age, gender, income, and trip distance, as well as user perceptions of public transportation, including reliability, crowding, security, and usability of pedestrian infrastructure. Thus, given this structure, the GWNR model can be transferred to other cities by adjusting these variable values based on local conditions. However, due to spatial heterogeneity, some localized adjustments in bandwidth selection and calibration may be necessary to account for city-specific variations. Future research can further explore the model’s adaptability by applying it to develop a multi-city demand model with distinct transit characteristics. This would allow for validation of its generalizability using more direct measures, such as population density, employment accessibility, and other relevant factors across diverse urban contexts.
Furthermore, the proposed GWNR model can offer extensive applications across various fields by providing a valuable decision-making tool for realistic spatial analysis of demand, incorporating both aggregate and disaggregate characteristics. Beyond transportation, this model can significantly contribute to urban planning and retail. In urban planning, it can guide infrastructure development by identifying regions with high population growth and demand for amenities and providing targeted investment guidance in housing, schools, and public facilities. In retail, it can help in selecting store locations and designing marketing strategies by analyzing consumer behavior and identifying high-demand zones. Hence, the versatility of this model supports targeted and data-driven interventions across diverse sectors.

Author Contributions

Conceptualization, P.R. and K.K.S.; methodology, P.R. and K.K.S.; software, P.R.; validation, P.R. and K.K.S.; formal analysis, P.R. and K.K.S.; investigation, P.R. and K.K.S.; data curation, P.R. and K.K.S.; writing—original draft preparation, P.R.; writing—review and editing, P.R. and K.K.S.; visualization, P.R. and K.K.S.; funding acquisition, K.K.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partly supported by the Department of Science & Technology, Government of India [grant number IMP/2018/001850], and the Ministry of Education, Government of India.

Institutional Review Board Statement

The Institutional Review Board (IRB) approval is not applicable. The study was conducted in accordance with the Declaration of Helsinki by giving due importance to informed consent, confidentiality, minimal risk, language accessibility, and data anonymization. Additionally, the survey did not collect any sensitive information that could impact respondents medically, socially, or financially.

Informed Consent Statement

Informed consent for participation was obtained from all subjects involved in the study.

Data Availability Statement

Restrictions apply to the availability of these data. Data were obtained from a third party (Metropolitan Transport Corporation, Chennai Ltd.) and are available from the authors with the permission of the third party concerned (Metropolitan Transport Corporation, Chennai Ltd.).

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Table A1. Results from bus boarding models.
Table A1. Results from bus boarding models.
VariablesFunctional FormLinear x with Original yLinear x with √yNonlinear x with √yNonlinear x and Consideration with √yEuclidean GWR—Linear x with Original yNon-Euclidean GWR—Non-Linear x and Consideration with √y
CoefStd ErrCoefStd ErrCoefStd ErrCoefStd ErrMeanStd DevMeanStd Dev
Intercept 3.090.231.500.06−24.193.80−32.203.762.133.59−15.8434.22
Stage Characteristics
The starting stage of a routeBinary11.290.132.300.032.160.052.000.0512.061.881.200.71
The ending stage of a route−0.910.12−0.400.030.320.050.270.05−2.002.40−0.581.13
The intermediate stage of a route (base)
Stage Index = Position Number of Current Stage /Total Number of StagesCubic−2.850.10−1.080.03−5.570.40−6.510.40−2.741.52−12.083.80
11.550.8313.010.82 23.269.51
−8.110.51−8.750.51 −14.676.26
Route Characteristics
Deviation between bus and driving route distance (in km)Quadratic−0.110.01−0.020.004−0.050.03−0.040.03−0.290.30−0.200.37
0.010.0020.0050.00 0.020.01
Scheduled headway with respect to the current route direction at a stage for the current time window (in minutes)Cubic−0.040.01−0.020.0039.841.4612.821.440.090.366.4312.51
−1.140.17−1.510.17 −0.751.45
0.040.010.060.01 0.030.05
Stage-level bus inter-arrival times for the current time window (in minutes)Linear0.010.0030.0040.0010.0020.0010.0020.0010.010.0040.0040.002
Deviation between scheduled and observed headway with respect to the current route direction at a stage (in minutes)Cubic0.010.010.010.0020.00020.010.0020.010.030.0030.040.004
0.010.0040.000.004 0.010.001
−0.0010.0001−0.0010.0001 −0.0010.0001
Run Characteristics
Previous stage occupancy = (Cumulative boarding − cumulative alighting) up to the previous stage per runCubic0.010.0030.010.0010.010.0040.010.0040.020.010.030.01
−0.000050.001−0.00010.001 −0.0010.001
−0.00000040.000−0.0000010.000 0.0000030.000
Previous trip occupancy = (Cumulative boarding − cumulative alighting) at the current stage per run during the previous tripCubic0.040.0020.010.0010.020.0030.020.0030.030.010.010.004
−0.00030.0001−0.00040.0001 0.00040.0001
0.0000030.00000.0000030.0000 −0.0000040.0000
Missing values for previous trip occupancyBinary0.570.170.110.040.200.050.230.051.290.400.130.28
Service Type
OrdinaryBinary1.250.140.310.040.130.050.130.041.200.610.160.04
Deluxe−1.600.07−0.350.02−0.380.02−0.370.02−1.610.56−0.320.09
Express (Base)
Trip Direction (Away from central urban area: Base)
Towards central urban areaBinary−0.010.050.010.010.080.020.080.020.261.150.340.37
Land-use Characteristics (within 500 m buffer)
Number of bus stationsQuadratic−0.060.01−0.010.002−0.040.010.030.01−0.130.100.030.16
0.0010.001−0.0040.001 −0.0030.01
Number of train stationsQuadratic0.770.060.240.020.240.050.510.051.310.840.726.69
0.000.02−0.180.03 −0.566.85
Number of educational institutesSquare root0.090.0040.010.0010.150.010.130.010.150.120.240.12
Number of government officesCubic−0.070.01−0.020.002−0.130.01−0.090.01−0.380.61−0.210.48
0.010.000.010.001 0.200.44
−0.00040.00−0.00040.00004 −0.010.03
Sparse land-use areaBinary0.260.12−0.230.03−0.200.03−0.180.030.351.81−0.120.69
Neither sparse nor dense land-use area0.730.090.090.020.100.020.090.021.191.460.380.48
Dense land-use area (base)
Land-use variable is missing−0.250.16−0.340.04−0.260.050.070.06−0.291.740.570.93
Day of Week
Monday (0/1)Binary−0.090.070.000.02−0.010.02−0.010.02−0.330.08−0.060.01
Tuesday (0/1)0.120.070.040.020.030.020.030.020.310.050.080.01
Wednesday (0/1)−0.090.07−0.010.02−0.030.02−0.030.02−0.240.04−0.070.01
Thursday (0/1)−0.020.070.020.020.020.020.020.02−0.050.230.080.01
Friday (Base)
Time Window
6 a.m.–7 a.m. (Base)
7 a.m.–8 a.m.Binary0.740.120.170.030.180.030.150.030.830.460.210.11
8 a.m.–9 a.m.1.380.140.270.040.300.040.270.041.270.610.300.11
9 a.m.–10 a.m.0.960.140.150.040.180.040.150.041.080.410.210.10
10 a.m.–11 a.m.0.490.130.070.030.080.030.060.030.700.290.140.06
11 a.m.–12 p.m.0.260.130.050.030.030.030.020.030.520.230.110.07
12 p.m.–1 p.m.0.370.130.080.030.080.030.070.031.230.460.270.11
1 p.m.–2 p.m.0.200.130.030.030.040.030.030.030.680.120.150.02
2 p.m.–3 p.m.0.040.130.010.030.0020.03−0.0010.030.170.430.100.00
3 p.m.–4 p.m.0.170.130.050.030.060.030.050.030.840.310.180.04
4 p.m.–5 p.m.1.000.130.270.030.290.030.270.031.080.350.320.08
5 p.m.–6 p.m.1.450.150.330.040.360.040.330.041.380.440.380.09
6 p.m.–7 p.m.1.470.150.320.040.350.040.320.041.390.440.350.09
7 p.m.–8 p.m.0.760.160.110.040.150.040.120.040.890.330.230.07
8 p.m.–9 p.m.0.440.17−0.010.050.040.050.020.040.800.250.190.11
9 p.m.–10 p.m.0.570.180.000.050.060.050.060.051.271.170.120.30
Consideration Probability (within 500 m buffer)
Average of aggregated bus consideration probability for people who considered busesLinear 0.700.03 0.401.41
Missing valuesBinary 0.730.03 0.620.94
(Italics and small font) Insignificant variables at 95% two-tail confidence level
Table A2. Comparative analysis of nonlinear transformations selection for model performance evaluation.
Table A2. Comparative analysis of nonlinear transformations selection for model performance evaluation.
VariableLogLogisticSquare RootCube RootQuadraticCubicSplineFinal Transformation *Reason
Adj R2MAEAdj R2MAEAdj R2MAEAdj R2MAEAdj R2MAEAdj R2MAEAdj R2MAE
Stage index0.5662.2250.5662.2230.5652.2260.5652.2280.5672.2150.5712.2010.5712.201CubicBest fit, simple, easy interpretation, optimal value
Route distance
deviation
(bus - drive)
0.5662.2210.5672.2200.5662.2220.5662.2210.5672.2210.5682.2190.5672.219QuadraticHighest Adj R², lowest MAE
Scheduled
headway
0.5632.2270.5682.2170.5632.2260.5682.2180.5682.2180.5692.2160.5692.216CubicBest fit, simple, easy interpretation, optimal value
(Schd-Obs.)
headway
0.5652.2240.5652.2240.5632.2260.5652.2250.5662.2230.5662.2220.5652.225CubicHighest Adj R², lowest MAE
Last stage
occupancy
0.5652.2260.5672.2290.5632.2270.5652.2250.5652.2250.5652.2250.5652.225CubicNo improvement over others, but consistent
Last trip
occupancy
0.5622.2270.5552.2460.5642.2250.5652.2250.5662.2230.5662.2220.5652.224CubicHighest Adj R², lowest MAE
Bus station0.5652.2240.5642.2250.5652.2240.5652.2240.5652.225Insignificant0.5662.220QuadraticHighest Adj R², lowest MAE
Train station0.5652.2250.5652.2250.5652.2250.5652.2260.5652.225Insignificant0.5652.225QuadraticHighest Adj R², lowest MAE
School0.5592.2310.5522.2410.5632.2260.5602.229InsignificantInsignificantInsignificantSquare rootHighest Adj R², lowest MAE
Govt.
office
0.5662.2200.5692.2150.5662.2220.5672.2200.5642.2230.5702.2140.5652.223CubicHighest Adj R², lowest MAE
* The bold red digits indicate the highest Adj R² and lowest MAE for each variable transformation.
Table A3. Results obtained from the bus consideration model.
Table A3. Results obtained from the bus consideration model.
VariablesGlobal Binary Logit Model Coef. (Std. Error)
Intercept0.13 (0.54)
Accessibility
Usability of footpath: Medium or high (1/0)0.53 (0.17)
Usability of footpath: Missing (1/0)0.54 (0.35)
Walking and crossing roads for buses are difficult and unsafe (1/0)0.09 (0.19)
Small road density near home (in %)−0.01 (0.01)
Network Integration
Direct bus is available to work (1/0)1.08 (0.22)
Total walking time for transfers (in minutes)0.02 (0.05)
Total walking time for transfers (in minutes) missing−0.96 (0.35)
Access time to the nearest train station (in minutes)0.01 (0.01)
Accurate information about bus routes is good (1/0)0.39 (0.22)
Accurate information about bus timing is good (1/0)0.53 (0.18)
Transit line haul service characteristics
Low security for women in bus (1/0)−0.36 (0.21)
Low security for women in bus is missing (1/0)0.50 (0.54)
Reliability of reaching on time by bus: Medium or high (1/0)0.14 (0.17)
Crowd in bus is high or medium for bus line-haul distance ≤ 5 km (1/0)−0.49 (0.23)
Crowd in bus is missing (1/0)−0.63 (0.55)
Personal and work characteristics
Young Male (1/0)0.32 (0.22)
Monthly household income before taxes is more than INR 20,000 (1/0)−0.49 (0.17)
Come home for lunch (1/0)−0.69 (0.26)
Travel at odd times (1/0)0.17 (0.18)
Home to work Trip distance (in km) more than 15 km to 20 km (1/0)0.19 (0.24)
Home to work Trip distance (in km) beyond 20 km (1/0)0.70 (0.26)

References

  1. Mead, L. Still Only One Earth: Lessons from 50 Years of UN Sustainable Development Policy. In The Road to Sustainable Transport Key Messages and Recommendations; International Institute for Sustainable Development: Winnipeg, MB, Canada, 2021. [Google Scholar]
  2. Erhardt, G.; Mucci, R.; Cooper, D.; Sana, B.; Chen, M.; Castiglione, J. Do Transportation Network Companies Increase or Decrease Transit Ridership? Empirical Evidence from San Francisco. Transportation 2021, 49, 313–342. [Google Scholar] [CrossRef]
  3. Bakar, M.; Norhisham, S.; Katman, H.; Fai, C.; Azlan, N.; Samsudin, N. Service Quality of Bus Performance in Asia: A Systematic Literature Review and Conceptual Framework. Sustainability 2022, 14, 7998. [Google Scholar] [CrossRef]
  4. Alam, B.M.; Nixon, H.; Zhang, Q. Factors Affecting Travel Demand by Bus: An Empirical Analysis at U.S. Metropolitan Statistical Area Level. Transp. Res. Rec. 2018, 2672, 817–826. [Google Scholar] [CrossRef]
  5. Diab, E.; DeWeese, J.; Chaloux, N.; El-Geneidy, A. Adjusting the Service? Understanding the Factors Affecting Bus Ridership Over Time at the Route Level in Montréal, Canada. Transportation 2021, 48, 2765–2786. [Google Scholar] [CrossRef]
  6. Ambi Ramakrishnan, G.; Srinivasan, K.K.; Pynda, S.P. Joint Models for Consideration of Public Transit and Mode Choice for Work Commute. Transp. Dev. Econ. 2020, 6, 12. [Google Scholar] [CrossRef]
  7. Roy, P.; Srinivasan, K.K.; Ramakrishnan, G.A. Why Don’t More Commuters Consider Buses for Their Work Trip?—A Geographically Weighted Segmented Logistic Regression Modelling Approach. Appl. Spat. Anal. 2024, 17, 1043–1079. [Google Scholar] [CrossRef]
  8. Rahman, M.; Yasmin, S.; Faghih-Imani, A.; Eluru, N. Examining the Bus Ridership Demand: Application of Spatio-Temporal Panel Models. J. Adv. Transp. 2021, 2021, 1–10. [Google Scholar] [CrossRef]
  9. Saputro, D.R.S.; Muhsinin, R.Y.; Widyaningsih, P.; Sulistyaningsih. Spatial Autoregressive with a Spatial Autoregressive Error Term Model and Its Parameter Estimation with Two-Stage Generalized Spatial Least Square Procedure. J. Phys. Conf. Ser. 2019, 1217, 012104. [Google Scholar] [CrossRef]
  10. Guo, J.Y.; Bhat, C.R. Operationalizing the Concept of Neighborhood: Application to Residential Location Choice Analysis. J. Transp. Geogr. 2007, 15, 31–45. [Google Scholar] [CrossRef]
  11. Schirmer, P.M.; van Eggermond, M.A.B.; Axhausen, K.W. The Role of Location in Residential Location Choice Models: A Review of Literature. J. Transp. Land Use 2014, 7, 3–21. [Google Scholar] [CrossRef]
  12. Cui, B.; DeWeese, J.; Wu, H.; King, D.A.; Levinson, D.; El-Geneidy, A. All Ridership Is Local: Accessibility, Competition, and Stop-Level Determinants of Daily Bus Boardings in Portland, Oregon. J. Transp. Geogr. 2022, 99, 103294. [Google Scholar] [CrossRef]
  13. Dong, X. Investigating Changes in Longitudinal Associations between Declining Bus Ridership, Bus Service, and Neighborhood Characteristics. J. Public Transp. 2022, 24, 100011. [Google Scholar] [CrossRef]
  14. Deepa, L.; Rawoof Pinjari, A.; Krishna Nirmale, S.; Srinivasan, K.K.; Rambha, T. A Direct Demand Model for Bus Transit Ridership in Bengaluru, India. Transp. Res. Part A Policy Pract. 2022, 163, 126–147. [Google Scholar] [CrossRef]
  15. Chakour, V.; Eluru, N. Examining the Influence of Stop Level Infrastructure and Built Environment on Bus Ridership in Montreal. J. Transp. Geogr. 2016, 51, 205–217. [Google Scholar] [CrossRef]
  16. Outwater, M.; Sana, B.; Ferdous, N.; Woodford, B.; Lobb, J. Characteristics of Premium Transit Services That Affect Choice of Mode; The National Academies Press: Washington, DC, USA, 2014. [Google Scholar]
  17. Mamede, F.P.; da Silva, R.F.; de Brito Junior, I.; Yoshizaki, H.T.Y.; Hino, C.M.; Cugnasca, C.E. Deep Learning and Statistical Models for Forecasting Transportation Demand: A Case Study of Multiple Distribution Centers. Logistics 2023, 7, 86. [Google Scholar] [CrossRef]
  18. Seyedan, M.; Mafakheri, F.; Wang, C. Cluster-Based Demand Forecasting Using Bayesian Model Averaging: An Ensemble Learning Approach. Decis. Anal. J. 2022, 3, 100033. [Google Scholar] [CrossRef]
  19. Wu, H.; Levinson, D. The Ensemble Approach to Forecasting: A Review and Synthesis. Transp. Res. Part C Emerg. Technol. 2021, 132, 103357. [Google Scholar] [CrossRef]
  20. Shakespeare, R.M.; Srinivasan, S. Demographic and Built Environment Predictors of Public Transportation Retention and Work-from-Home Changes in Small- to Medium-Sized Massachusetts Cities, 2011–2021. Sustainability 2024, 16, 8620. [Google Scholar] [CrossRef]
  21. Ma, X.; Zhang, J.; Ding, C.; Wang, Y. A Geographically and Temporally Weighted Regression Model to Explore the Spatiotemporal Influence of Built Environment on Transit Ridership. Comput. Environ. Urban Syst. 2018, 70, 113–124. [Google Scholar] [CrossRef]
  22. Wang, J.; Zhang, N.; Peng, H.; Huang, Y.; Zhang, Y. Spatiotemporal Heterogeneity Analysis of Influence Factor on Urban Rail Transit Station Ridership. J. Transp. Eng. Part A Syst. 2022, 148, 04021115. [Google Scholar] [CrossRef]
  23. Chen, E.; Ye, Z.; Wang, C.; Zhang, W. Discovering the Spatio-Temporal Impacts of Built Environment on Metro Ridership Using Smart Card Data. Cities 2019, 95, 102359. [Google Scholar] [CrossRef]
  24. Tu, W.; Cao, R.; Yue, Y.; Zhou, B.; Li, Q.; Li, Q. Spatial Variations in Urban Public Ridership Derived from GPS Trajectories and Smart Card Data. J. Transp. Geogr. 2018, 69, 45–57. [Google Scholar] [CrossRef]
  25. Government of Tamil Nadu; Department of Economics and Statistics. Statistical Hand Book 2019; Department of Economics and Statistics (DoES): Teynampet, India, 2019. [Google Scholar]
  26. Suresh, M. Metropolitan Transport Corporation (Chennai) Limited. 2024. Available online: https://mtcbus.tn.gov.in/Home/performance (accessed on 1 February 2025).
  27. Devaraj, A.; Ambi Ramakrishnan, G.; Nair, G.S.; Srinivasan, K.K.; Bhat, C.R.; Pinjari, A.R.; Ramadurai, G.; Pendyala, R.M. Joint Model of Application-Based Ride Hailing Adoption, Intensity of Use, and Intermediate Public Transport Consideration among Workers in Chennai City. Transp. Res. Rec. 2020, 2674, 152–164. [Google Scholar] [CrossRef]
  28. Selvakumar, M.; Abishek Reddy, M.; Sathish, V.; Venkatesh, R. Potential Influence of Metro on Bus: A Case Study. J. Inst. Eng. (India) Ser. A 2018, 99, 379–384. [Google Scholar] [CrossRef]
  29. Fotheringham, A.S.; Brunsdon, C.; Charlton, M.; O’Sullivan, D. Geographically Weighted Regression: The Analysis of Spatially Varying Relationships. Geogr. Anal. 2003, 35, 272–275. [Google Scholar]
  30. Gao, F.; Yang, L.; Han, C.; Tang, J.; Li, Z. A Network-Distance-Based Geographically Weighted Regression Model to Examine Spatiotemporal Effects of Station-Level Built Environments on Metro Ridership. J. Transp. Geogr. 2022, 105, 103472. [Google Scholar] [CrossRef]
  31. Rayaprolu, H.; Levinson, D. Transit Modal Complementarity: Measuring the Access Provided by Transfers. Transportation 2024, 1–20. [Google Scholar] [CrossRef]
  32. Pei, Y.; Wang, W. Analysis of Multi-Modal Public Transit Competing Relationships and Evolutionary Mechanisms in Cities in Cold Regions. Appl. Sci. 2024, 14, 2702. [Google Scholar] [CrossRef]
Figure 1. (a) Location of the study area, Chennai, in India; (b) Chennai city map; (c) Spatial variation in average daily bus boarding and number of bus stations in the study dataset.
Figure 1. (a) Location of the study area, Chennai, in India; (b) Chennai city map; (c) Spatial variation in average daily bus boarding and number of bus stations in the study dataset.
Sustainability 17 02485 g001
Figure 2. Methodology of the GWNR model.
Figure 2. Methodology of the GWNR model.
Sustainability 17 02485 g002
Figure 3. Bandwidth search and model runtime in GWNR for different dataset sizes.
Figure 3. Bandwidth search and model runtime in GWNR for different dataset sizes.
Sustainability 17 02485 g003
Figure 4. QQ plot of residuals1 for OLS and square root transformed models. 1 The red dashed line represents the theoretical normal distribution, and the yellow line/points show the observed residuals. The blue density curve illustrates the residual distribution, and the blue bars represent the histogram of standardized residuals.
Figure 4. QQ plot of residuals1 for OLS and square root transformed models. 1 The red dashed line represents the theoretical normal distribution, and the yellow line/points show the observed residuals. The blue density curve illustrates the residual distribution, and the blue bars represent the histogram of standardized residuals.
Sustainability 17 02485 g004
Figure 5. Effect of (a) scheduled headway and (b) headway deviation on bus boarding.
Figure 5. Effect of (a) scheduled headway and (b) headway deviation on bus boarding.
Sustainability 17 02485 g005
Figure 6. Effect of (a) bus and (b) train station count within a 500 m buffer on bus boarding.
Figure 6. Effect of (a) bus and (b) train station count within a 500 m buffer on bus boarding.
Sustainability 17 02485 g006
Figure 7. Performance measures of OLS models versus GWR models.
Figure 7. Performance measures of OLS models versus GWR models.
Sustainability 17 02485 g007
Figure 8. Effect of route and run characteristics on bus boarding.
Figure 8. Effect of route and run characteristics on bus boarding.
Sustainability 17 02485 g008
Figure 9. Effect of service type and trip direction on bus boarding.
Figure 9. Effect of service type and trip direction on bus boarding.
Sustainability 17 02485 g009
Figure 10. Effect of the number of bus and train stations on bus boarding.
Figure 10. Effect of the number of bus and train stations on bus boarding.
Sustainability 17 02485 g010
Figure 11. Effect of land use and bus consideration probability on bus boarding.
Figure 11. Effect of land use and bus consideration probability on bus boarding.
Sustainability 17 02485 g011
Figure 12. Location and quantum of headway improvements based on OLS and GWR.
Figure 12. Location and quantum of headway improvements based on OLS and GWR.
Sustainability 17 02485 g012
Table 1. Summary of research gaps in the earlier literature.
Table 1. Summary of research gaps in the earlier literature.
AuthorCase StudyTransitTransit
Demand
Dimension
Resolution (Level) 2Transformation in Variable 3Disaggregate
Effect 4
Spatial Variation
DependentIndependent
Chakour and Eluru (2016) [15]CanadaBusBoarding, AlightingHourly stop× 1×××
Alam et al. (2018) [4]U.S.BusPassenger miles/MSA populationAnnually MSALogLog××
Ma et al. (2018) [21]ChinaTransitBoardingHourly TAZ×××Yes
Tu et al. (2018) [24]ChinaMetroBoarding, AlightingHourly station×××Yes
Chen et al. (2019) [23]ChinaBus, MetroRidershipHourly station×××Yes
Diab et al. (2020) [5]CanadaBusBoardingAnnually routeLogLog××
Cui et al. (2022) [12]OregonBusBoardingDaily stopLogLog××
Deepa et al. (2022) [14]IndiaBusBoardingDaily fare-stage clusterLogLog, Piecewise linear××
Dong (2022) [13]U.S.BusBoardingDaily stop-route×Log××
Wang et al. (2022) [22]ChinaRailInflow, OutflowHourly station×××Yes
Shakespeare and Srinivasan (2024) [20]U.S.TransitRidershipAnnually census block×××Partially Yes
1 The symbol ‘×’ denotes that the aspect was not considered in the literature. 2 Existing studies did not include run-level demand as a resolution level. 3 Log transformation was overused, with limited exploration of other transformations. 4 No disaggregate effect was captured in prior studies.
Table 2. Descriptive statistics of the aggregate data.
Table 2. Descriptive statistics of the aggregate data.
Variable DefinitionContinuous
Variable
Categorical
Variable
MeanStd. DevRelative
Frequency
Dependent Variable: Boarding per run
(i.e., per stage per route per direction)
original scale3.355.95
square root scale1.231.36
Independent Variables
Stage (SC)The starting stage of a route 6.27%
The ending stage of a route 6.29%
The intermediate stage of a route (base) 87.44%
Stage Index = Position Number of Current Stage /Total Number of Stages0.530.29
Route (RTC)Deviation between bus and driving route distance for each O-D pair of routes (in km)5.482.33
In the current time window (in minutes)(Scheduled headway) current route direction8.612.6
Stage-level bus inter-arrival times10.2512.11
(Deviation between scheduled and observed headway) current route direction2.373.26
Run (RNC)Previous stage occupancy = (cumulative boarding −
cumulative alighting) up to the previous stage per run
19.8814.11
Previous trip occupancy = (Cumulative boarding −
cumulative alighting) at the current stage per run during the previous trip
20.5514.85
Missing values for previous trip occupancy 2.52%
Service type (ST)Ordinary 5.34%
Deluxe 71.15%
Express (Base) 23.51%
Trip Direction (TD)Towards central urban area 51.31%
Away from the central urban area (Base) 48.69%
Land use (LC)Number of bus stations within 500 m buffer5.173.61
Number of train and subway stations within 500 m buffer0.280.55
Number of educational institutes within 500 m buffer13.5812.74
Number of government offices within 500 m buffer3.54.84
Sparse land-use area 33.68%
Neither sparse nor dense land-use area 29.23%
Dense land-use area 37.09%
Land-use variable is missing 9.82%
Table 3. Distribution of the disaggregate data.
Table 3. Distribution of the disaggregate data.
VariableCategoryRelative Frequency
AccessibilityFootpath conditionDifficult to use55%
Somewhat easy to use27%
Easy to use12%
Missing6%
Road intersections near bus stopsEasy and safe to cross32%
Difficult and unsafe to cross68%
Local and collector street density near home (in %)≤30%6%
31–50%53%
51–70%41%
Network Integration and InformationDirect bus available from home to workplaceYes60%
No40%
Total walking time for transferring buses from home to work (in minutes)26%
3.57%
8.55%
108%
Missing72%
Train access time from home (in minutes)≤1067%
11–3028%
31–603%
60–1051%
105–1201%
Information provision about bus routesAccurate49%
Inaccurate51%
Information provision about bus timingAccurate42%
Inaccurate58%
Transit Service CharacteristicsSecurity for women inside busLow22%
Medium32%
High14%
Missing32%
Reliability of reaching on time by busLow31%
Medium22%
High15%
Missing32%
Crowd inside bus for bus line-haul distance ≤ 5 kmComfortable space for seating and movement3%
Some crowding, but manageable16%
Limited space, crowded conditions45%
Missing37%
Personal and Work CharacteristicsAge17–25 years21%
26–40 years54%
41–60 years23%
61–85 years2%
GenderMale82%
Female18%
Monthly household income before taxes≤INR 20,00058%
INR 20,001–INR 40,00025%
INR 40,001–INR 60,0007%
INR 60,001–INR 80,0005%
>INR 80,0005%
Come home from the workplace to have lunchYes11%
No89%
Commute at unusual times (very early or late)Yes28%
No72%
Home to work trip distance (in km)≤5 km26%
5–10 km28%
10–15 km20%
15–20 km13%
>20 km13%
Table 4. Summary of key global model results and effects of transformations.
Table 4. Summary of key global model results and effects of transformations.
VariablesFunctional FormLinear x with Original yLinear x with √yNonlinear x with √yNonlinear x and Consideration with √y
Coeft ValueCoeft ValueCoeft ValueCoeft Value
Route Characteristics
Scheduled headway with respect to the current route direction at a stage for the current time window (in minutes) 1Cubic−0.04−4.14−0.02−6.259.846.7612.828.91
−1.14−6.65−1.51−8.90
0.046.510.068.83
Deviation between scheduled and observed headway with respect to the current route direction at a stage (in minutes) 2Cubic0.011.020.012.950.00020.010.0020.14
0.011.440.001.32
−0.001−2.23−0.001−2.12
Land-use Characteristics (within 500 m buffer)
Number of bus stations 3Quadratic−0.06−6.26−0.01−5.92−0.04−3.730.033.41
0.0012.05−0.004−7.09
Number of train stations 3Quadratic0.7713.840.2416.050.245.090.5110.78
0.00−0.19−0.18−7.19
Consideration Probability (within 500 m buffer)
Average of aggregated bus consideration probability for the people who considered bus 4Linear 0.7022.59
Missing values for bus consideration probabilityBinary 0.7326.71
1 Coefficients show nonlinearity. 2 Coefficient has a significant t value after applying the √y transformation. 3 Coefficients become intuitive after incorporating ‘Consideration Probability’. 4 Coefficient shows a positive effect of bus consideration probability on bus boarding.
Table 5. Ridership gain from traditional and proposed models.
Table 5. Ridership gain from traditional and proposed models.
ModelBus Ridership
Improvement Ratio
Bus Frequency
Improvement Ratio
Ridership Gain Index
Traditional ModelsOLS0.0050.0340.147
GWR0.0070.0340.206
Proposed Nonlinear ModelsOLS1.0220.2953.464
GWR0.4460.0736.110
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Roy, P.; Srinivasan, K.K. Geographically Weighted Nonlinear Regression for Cost-Effective Policies to Enhance Bus Ridership. Sustainability 2025, 17, 2485. https://doi.org/10.3390/su17062485

AMA Style

Roy P, Srinivasan KK. Geographically Weighted Nonlinear Regression for Cost-Effective Policies to Enhance Bus Ridership. Sustainability. 2025; 17(6):2485. https://doi.org/10.3390/su17062485

Chicago/Turabian Style

Roy, Payel, and Karthik K. Srinivasan. 2025. "Geographically Weighted Nonlinear Regression for Cost-Effective Policies to Enhance Bus Ridership" Sustainability 17, no. 6: 2485. https://doi.org/10.3390/su17062485

APA Style

Roy, P., & Srinivasan, K. K. (2025). Geographically Weighted Nonlinear Regression for Cost-Effective Policies to Enhance Bus Ridership. Sustainability, 17(6), 2485. https://doi.org/10.3390/su17062485

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop