1. Introduction
Transit inequity in large metropolitan areas typically manifests as a gradient: dense inner districts enjoy frequent, all-day service; specific suburban corridors see partial coverage; and peripheral communities are left with sparse or time-limited routes [
1,
2,
3,
4]. Capturing this inequality requires understanding not just where stops are located but how frequently vehicles arrive, which agencies operate them, and whether service remains viable outside the weekday commuter peak.
Following Geurs and van Wee [
5] and Hansen [
6], place-based service availability and opportunity-based destination access are related but distinct concepts [
7,
8,
9,
10]. This paper focuses on the former: specifically, a schedule-aware, origin-side measure of transit supply—what we term
schedule-aware service intensity—that captures temporal intensity without claiming to measure destination access or generalized travel cost [
11]. The indicator belongs to the cumulative-opportunity supply tradition: it is conceptually aligned with the Service Frequency Index used by Currie [
12], with the local index of transit availability of Mavoa et al. [
13], and with the departures-per-hour family of supply measures discussed by Boisjoly and El-Geneidy [
14]. We use the term “schedule-aware” throughout to emphasize the timetable basis of the measure; the more precise descriptor is “schedule-aware service intensity,” since no routing or journey-time calculation is performed.
These challenges are particularly pronounced in the Greater Toronto Area (GTA). As Canada’s largest metropolitan economy, the GTA functions as an integrated regional landscape, yet remains institutionally fragmented [
15,
16,
17]. The TTC anchors the region, GO Transit provides the regional rail backbone, and municipal agencies—MiWay, York Region Transit/Viva, Brampton Transit, and Durham Region Transit—serve expanding suburban centres. Although residents routinely cross municipal boundaries, transit provision remains disjointed in both spatial density and temporal availability.
Prior research suggests that transit accessibility is a fundamental determinant of urban development: frequent service is associated with denser housing, car-light lifestyles, and broader labour-market participation, whereas poor service is linked to constrained opportunity and social exclusion [
18,
19]. Understanding how supply is distributed is therefore central to evaluating whether peripheral expansion is outrunning the infrastructure required to sustain it.
The GTA is a compelling case because social vulnerability is no longer confined to the inner city. Lower-income households, renters, recent immigrants, and racialized communities are now broadly distributed across Toronto and its surrounding suburbs [
1,
20]. This spatial diffusion raises a critical question: has high-quality transit evolved to track the region’s social geography, or does it remain concentrated in the historic core? Prior GTA research has documented substantial variation in transit accessibility by neighbourhood and time of day, with service broadly aligned to social need within Toronto but unevenly distributed across the suburban belt [
1,
15,
21,
22]. At the national scale, Deboosere and El-Geneidy [
23] showed that public-transport accessibility in Canadian metropolitan regions is highly uneven, and methodological reviews have underscored that accessibility results depend strongly on how supply is operationalized and spatial units are defined [
14,
24,
25,
26].
Relative to the existing Toronto-region literature, this paper adds three specific elements. First, whereas El-Geneidy et al. [
15] and Farber and Grandez [
21] examined accessibility with temporal variation and socioeconomic linkages, both relied on partial-agency inventories or administrative spatial units; this paper integrates official GTFS feeds from all six major GTA providers into a single schedule-aware inventory, building on timetable-based network analysis [
27] and producing a more complete picture of multi-agency service. Second, the H3 hexagonal grid provides a uniform spatial baseline that avoids the shape irregularity and boundary effects of census tracts or municipal polygons [
28,
29,
30], enabling direct cross-boundary comparison at a consistent spatial grain. Third, the distance-ring zoning centred on Union Station—an operationalization, to our knowledge, not previously applied in GTA transit research—separates the urban–suburban gradient from municipal boundary artifacts and surfaces the Inner-to-Outer Suburban Ring break that municipal-level analyses can obscure.
The paper therefore asks: what spatial pattern of schedule-aware transit service intensity—an origin-side, supply-focused component of accessibility—emerges when the GTA is represented as a uniform H3 grid? How does that pattern vary across distance rings extending from Union Station to the suburban and exurban fringe? And how do socioeconomic gradients in service intensity change when the analysis is performed at census-tract and municipal scales?
Empirically, the paper shows that the GTA exhibits a corridor-and-node structure in which strong service extends into selected inner-suburban centres while large portions of the outer ring and fringe remain weakly served. Methodologically, it demonstrates how multi-agency GTFS, H3 indexing, distance-ring zoning, and CT-level socioeconomic linkage can be combined in a reproducible regional workflow [
31], providing a more policy-relevant representation of accessibility than stop counts or proximity-weighted scores alone.
3. Methods
The workflow comprises three phases (
Figure 2): data acquisition (multi-agency GTFS, census profiles, spatial boundaries), computational harmonization (H3 grid generation, stop-to-grid mapping via 800 m catchments, schedule-aware accessibility quantification), and equity synthesis (distance-ring segmentation, service-band classification, socioeconomic gradient analysis).
3.1. H3 Grid Construction
The GTA study area was tessellated using the H3 hierarchical hexagonal indexing system at resolution 8. The GTA union polygon was passed to the H3 polyfill routine and every resulting cell converted to a polygon and centroid representation, yielding 9635 hexagons. The principal methodological advantage is that H3 provides a uniform spatial baseline that abstracts accessibility measurement from irregular municipal borders while reducing the directional bias inherent to square rasters [
28,
29,
30]. Each hexagon represents an origin from which local transit service can be reached on foot; the measure therefore captures local transit supply around origins rather than network travel time to destinations.
3.2. Schedule-Aware Service-Intensity Measure
The central indicator in this paper is a schedule-aware count of departures per hour reachable from each H3 cell—an origin-side service-intensity measure rather than a destination-based accessibility measure. For each stop in each GTFS feed, departures were counted for three windows: weekday morning peak, weekday midday, and Saturday midday. These counts were converted to departures per hour. Each hexagon’s service intensity was then calculated by summing all departures per hour from stops lying within a specified catchment radius of the hexagon centroid.
The primary indicator is defined as
where
is service intensity for hexagon
i in time window
w and radius
r,
is the set of stops within radius
r of hexagon
i, and
is the scheduled departures per hour observed at stop
s in time window
w. The primary radius is 800 m, which approximates a broadly accepted walk-access threshold for higher-order and frequent transit [
15,
34]. Additional 400 m and 1000 m measures were computed for robustness testing.
Summing departures across all routes at each stop and across all stops in the catchment produces a service-intensity construct that intentionally weighs every scheduled departure as an additional boarding opportunity from the rider’s perspective. Whether two adjacent stops both serve a passenger’s intended trip, or whether multiple routes share a single stop, each contributes to local supply: the cumulative count captures spatial redundancy and time-of-day choice in a way that a single-stop or single-route metric cannot. This formulation aligns with the cumulative-opportunity tradition of Currie [
12] and with the boarding-opportunity framing of Conway et al. [
27]. The validity of the metric does not require that each stop represent an independent route, but that each stop represent an actual boarding event the passenger can use [
35,
36].
The known cost of this choice is stop-adjacency inflation in dense corridors. Where a single bus route has stops every 200–300 m, a hexagon centroid near such a corridor can have four or five stops from the same route within its 800 m catchment, multiplying that route’s frequency accordingly. In downtown Toronto, where multiple parallel streets carry overlapping routes and busy intersections host several directional stops, the cumulative departure count can overstate the number of distinct services by a factor of two to five. The accessibility values reported here should therefore be interpreted as cumulative boarding-opportunity density within walking distance—how much scheduled service surrounds an origin—rather than the number of distinct lines available. The inflation is strongest in the Urban Core where stop spacing is densest, and weakest in the outer rings where stops are sparse, so removing the differential scaling (i.e., applying route-direction deduplication) compresses the absolute Inner-to-Outer break somewhat—but does not eliminate it, because outer-ring values are anchored at zero in both specifications. The qualitative gradient is therefore robust to the choice of construct (cumulative boarding-opportunity density vs. deduplicated route–direction count), with deduplication providing a more conservative absolute estimate of the Core–Periphery gap.
To make the magnitude of this concern transparent,
Table 2 reports the ring-level median accessibility under the primary specification and under a route-direction-deduplicated variant in which each unique (route, direction) pair within an 800 m catchment is counted once, retaining its maximum scheduled frequency across the catchment’s stops. Absolute values in the Urban Core fall by roughly 70% (from 234.25 to 65.50 dep/h), but the spatial hierarchy and ring ordering are preserved; the rank correlation between the original and deduplicated hexagon-level series is
(
Supplementary Note S5).
All scheduled departures are weighted equally regardless of transit mode. This deliberate choice keeps the measure focused on service intensity rather than passenger utility: mode-specific weighting would require assumptions about speed, capacity, and destination coverage that go beyond a supply-side frequency measure [
10,
12]. The unweighted departure count provides the most transparent baseline for characterizing regional service intensity.
Spatial nearest-neighbor search on a haversine metric was used to identify all stops within each centroid’s catchment. For each H3 centroid, the analysis records the nearest stop distance, the number of reachable stops and agencies, and total departures per hour in three time windows; the 800 m weekday peak field serves as the primary indicator [
27]. Because the measure sums departures rather than counting stops, a hexagon near several high-frequency routes scores much higher than one with the same number of thinly scheduled stops—making it more policy-relevant than a static stop count [
35,
36], while remaining a supply-side rather than destination-access measure [
24,
26].
3.3. Service Bands and Temporal Comparison
To make the accessibility values interpretable in planning terms, each hexagon was assigned to one of five service bands based on accessible departures per hour within the 800 m catchment:
None: 0 departures per hour;
Low: more than 0 but fewer than 4 departures per hour;
Moderate: 4 to fewer than 12 departures per hour;
High: 12 to fewer than 30 departures per hour; and
Very High: 30 or more departures per hour.
These thresholds translate frequency sums into an interpretable hierarchy (
Supplementary Note S1) and are anchored in established frequent-service planning conventions: the 4 dep/h threshold aligns with Metrolinx’s frequent-transit definition of 15-min headways or better [
17], the 12 dep/h threshold with the TTC’s competitive surface-corridor standard equivalent to 5-min headways [
37], and the 30 dep/h marker with TCRP Report 165 “very frequent” tier and the rapid-transit-corridor definition discussed by Walker [
38] and Transportation Research Board [
39]. Because
aggregates departures across multiple stops and routes, the thresholds reflect total boarding-opportunity density rather than the headway of any single route; sensitivity to the exact cutoff is examined in
Table S2.
Accessibility was computed separately for the weekday morning peak, weekday midday, and Saturday midday. A weekend retention ratio (Saturday midday/weekday midday) was calculated for hexagons with non-zero weekday midday service. Because cells with no weekday service are excluded, the ratio understates the full extent of weekend inequality.
3.4. Distance-Ring Zoning
Rather than aggregating results by municipal boundary, the paper uses concentric rings centered on Union Station, Toronto’s primary regional transit hub. Haversine distance from each hexagon centroid to Union Station was used to classify cells into four zones:
Urban Core: less than 15 km;
Inner Suburban Ring: 15–30 km;
Outer Suburban Ring: 30–45 km; and
Fringe Ring: more than 45 km.
These breaks reflect the GTA’s nested regional structure (
Supplementary Note S2): 15 km approximates the City of Toronto and TTC primary coverage; 30 km captures inner suburban municipalities where GO corridors and municipal BRT/Viva operate; and 45 km marks the practical extent of regular GO commuter rail. Ring zoning captures the continuous outward gradient more directly than municipal boundaries and separates compact inner-ring suburbs from spatially extensive fringe territory.
3.5. Socioeconomic Joins
Socioeconomic attributes were attached to H3 cells via centroid-based spatial joins: where a centroid fell within a census tract, the tract profile was assigned; otherwise, census subdivision context was retained. This CT-first approach captures intra-municipal variation that municipal averages flatten, making it possible to assess whether service is aligned with denser, lower-income, more renter-dominated, or more diverse local contexts.
3.6. Correlation Robustness Framework
Associations between service intensity and socioeconomic context were evaluated using Spearman’s rank correlation coefficient (
Table S2), chosen because the service-intensity distribution is highly skewed with a large zero mass. The robustness framework has four components: (1) main correlations for CT-linked hexagons using weekday peak intensity at 800 m; (2) sensitivity tests at 400 m, 1000 m, and Saturday midday; (3) tract-level (one-tract-one-observation) Spearman correlations on the 1144 census tracts that intersect the GTA, computed by aggregating hex-level accessibility to tract means before correlating with the original CT socioeconomic variables; and (4) municipal aggregates summarizing both intensity and socioeconomic variables at the municipal level. The tract-level component directly addresses pseudo-replication: each CT contributes exactly one observation rather than being repeated across its constituent hexagons.
Pseudo-Replication, Spatial Dependence, and Effective Sample Size
Hex-level CT-linked correlations carry two well-known statistical limitations. CT attributes are repeated across the hexagons within each tract, inflating the nominal
n from approximately 1144 unique tract values to 8379 hex observations; and spatial dependence in the accessibility surface further reduces effective degrees of freedom. We therefore compute Global Moran’s
I on the H3 hexagonal grid using queen-contiguity weights derived from the immediate H3 neighbourhood (999 random permutations;
Supplementary Note S4), and we apply a Clifford–Richardson correction for the effective sample size when assessing the significance of CT-linked Spearman correlations [
40]. Given the observed Moran’s
, the Clifford–Richardson approximation yields an effective sample size of
. We report adjusted
p-values computed against this effective
n as the primary inferential basis for socioeconomic claims; under this correction, all six principal CT-level associations remain statistically significant (
), but the magnitude of significance is materially smaller than at the nominal
n. Spearman partial correlations controlling for population density were computed using rank regression on OLS residuals [
41]. The municipal-level analysis (
after exclusions) and the tract-level analysis (
) serve as additional robustness checks against both pseudo-replication and residual spatial autocorrelation.
Population-weighted measures were also computed by apportioning CT (or CSD) population equally across constituent hexagons, permitting direct comparison between the share of the GTA’s land area and the share of its population without transit service. A population-weighted Gini coefficient and Lorenz curve (
Supplementary Figure S1) quantify distributional inequality [
42].
All data processing, spatial operations, statistical analyses, and visualization were carried out in Python 3.13.5 (Python Software Foundation, Wilmington, DE, USA) using GeoPandas 1.1, Shapely 2.1, H3-Py 4.4 (Uber Technologies, San Francisco, CA, USA), pandas 2.3, NumPy 2.3, SciPy 1.15, statsmodels 0.14, and matplotlib 3.10.
5. Discussion
The multi-agency GTFS inventory reveals a GTA transit landscape that is both more structured and more unequal than a Toronto-centric view suggests. The schedule-aware metric separates nominal, thin coverage from genuinely frequent urban service, exposing where regional policy interventions are most needed. Census-tract data confirm that transit-dependent populations are distributed widely across the suburban municipalities, not concentrated in the Toronto core—yet the infrastructure to serve them remains strikingly uneven.
The distinction between area-weighted and population-weighted perspectives is critical for interpreting the headline inequality numbers. The area-weighted 70.6% zero-service rate includes extensive greenbelt, conservation lands, and agricultural fringe where few people live; the population-weighted 16.4% figure better captures the share of residents without service. Neither perspective is correct in isolation: area-weighted metrics are relevant for land-use planning, greenfield development, and future growth corridors, while population-weighted metrics speak to current equity and service-delivery evaluation.
Table 3 presents the full comparison by distance ring.
The suburban transit picture warrants closer attention. Several suburban municipalities sustain meaningful scheduled transit networks, refuting the view that the commuter belt is a blank canvas. But suburban service is highly uneven and geographically selective—organized around specific corridors, nodes, and built-up centres. The planning problem is therefore one of incomplete regionalization: not whether suburban transit exists, but where it is strong enough to function as a reliable everyday option.
The GTA’s 70.6% zero-service rate reflects its hybrid character—a dense, well-served core embedded in a large, auto-oriented suburban field where six independent agencies operate with limited coordination—placing it between more compact Canadian regions and sprawling US sunbelt metros [
12,
23].
The schedule-aware approach produces a more realistic hierarchy than stop proximity alone: the gap between 29.4% with any service and 22.6% reaching 12 dep/h identifies places where transit is nominally present but insufficiently frequent for regular urban use, a distinction that sharpens further on Saturday when marginal service recedes.
Weekend retention ratios (∼0.98 core, ∼0.86 inner ring, ∼0.75 outer rings) indicate that peri-urban inequality is temporal as well as spatial: areas that appear reasonably connected during weekdays often become much less usable on weekends, echoing earlier findings on time-sensitivity in the Toronto region [
15].
These findings are broadly consistent with earlier GTA research. Foth et al. [
1] found service within Toronto aligned with social need over 1996–2006, and the Urban Core remains well served here; El-Geneidy et al. [
15] documented time-of-day shifts, confirmed by the pronounced weekend contraction. The persistence of these gradients over more than a decade suggests that the region’s fundamental accessibility structure has remained stable despite ongoing investment, and that suburban equity remains unresolved.
At the hex-level, population density is the strongest single correlate of service intensity, followed by visible minority share and unemployment—plausible in a region where the densest areas combine higher transit provision with substantial renter, immigrant, and racialized populations. Under one-tract-one-observation aggregation (
), the magnitude ordering shifts: median income (
) becomes essentially co-equal in magnitude with population density (
), suggesting that the income gradient operates partly between tracts at a coarser spatial grain than the hex-level analysis can resolve. The partial correlations confirm that density mediates much of the bivariate hex-level association: once its effect is removed, all socioeconomic associations weaken, though income, renter share, unemployment, and visible minority share retain significant independent associations. The observed co-location of high service intensity with socioeconomic disadvantage may partly reflect residential sorting [
44] and supply-driven allocation (agencies serve where ridership potential is highest) rather than equitable planning intent alone. These are area-level ecological associations and do not characterize individual-level outcomes.
The population-weighted Gini of 0.60 places the GTA’s schedule-aware service-intensity inequality in the upper range of values reported for comparable metropolitan regions (where the literature uses broader transit-accessibility constructs). Delbosc and Currie [
42] reported Ginis of 0.52–0.68 across Melbourne’s transit modes, and Deboosere and El-Geneidy [
23] found substantial inter-metropolitan variation across Canadian cities, with larger, more polycentric regions exhibiting higher inequality. The GTA value is consistent with a region where a well-served core coexists with extensive low-density periphery.
The municipal robustness check confirms that these patterns survive scale reduction: once collapsed to 19 municipalities (excluding Halton and the two smallest fringe CSDs), density, visible minority share, and unemployment remain strongly positive, income remains negative, and low-income share weakens but does not reverse.
The municipal findings suggest that suburban transit investment has created islands and corridors of strong service—but not a uniformly connected metropolitan fabric. Corridor-based investment can substantially improve equity when combined with feeder services and fare integration [
45].
These findings align with broader debates on infrastructure concentration: transit provision does not necessarily follow population suburbanization; major infrastructure accumulates where density and existing investment are already strongest [
2,
16]. The transport-justice [
46] and social-exclusion [
47] literature suggest that such patterns carry distributional consequences even when they arise from operational efficiency. Because the indicator examined here is origin-side scheduled supply rather than destination-based access, employment accessibility, ridership, fare integration, or actual travel behavior, the planning implications below should be read as directions for further investigation rather than as direct policy prescriptions. The sharp break between the Inner and Outer Suburban Rings suggests that the 30 km transition zone may warrant further regional planning analysis, particularly destination-based and demand-side studies that could assess whether coordinated investment in this band would improve practical access. Weekend thinning, which disproportionately affects peri-urban populations working non-standard schedules [
4], indicates that off-peak frequency may warrant further investigation as a planning consideration. Inter-agency fare integration and coordinated scheduling are areas where additional analysis of perceived accessibility—particularly where multiple services already exist but do not operate as a seamless system—could be valuable.
Several limitations apply. (1) The analysis measures schedule-based supply near origins, not travel times to destinations, and does not account for transfer penalties, fares, reliability, or crowding [
24,
26,
32]; in a multi-agency region, inter-system transfers may reduce practical accessibility below what the departures-per-hour metric implies. (2) Walk catchments are Euclidean rather than network-based. The 800 m Euclidean catchment overstates effective walking access most strongly in cul-de-sac suburban environments where stormwater ponds, highways, and discontinuous sidewalks raise network walking distances to 1.4–1.8× the straight-line distance [
34,
43]. Two complementary sensitivity analyses (
Section 4.6,
Note S6) quantify this effect: a literature-factor calculation suggests that 70.5–86.5% of currently-classified Outer-Ring “High” hexagons would lose at least one band under network distances, and an empirical shortest-path computation on the Pickering walkable street network—presented as a representative sub-analysis for one outer-suburban municipality rather than a GTA-wide network-distance robustness test—reduces mean weekday peak intensity from 7.91 to 2.92 dep/h with 12.8% of hexagons dropping to a lower band. The bias acts in the direction of understating the outer-ring deficit, so network distances would, if anything, strengthen the paper’s equity thesis. (3) No distance-decay function is applied; the deduplication sensitivity in
Table 2 and
Note S5 shows that route-level deduplication compresses absolute Urban-Core values but preserves the spatial pattern [
6,
25]. (4) Temporal and spatial coverage is incomplete: only three time windows are examined (excluding evening, late-night, and Sunday service [
14]), and CT coverage thins in the exurban fringe. The Fall 2023 GTFS feeds represent a post-pandemic recovery period in which several agencies had not fully restored pre-pandemic service levels; the TTC had restored approximately 90% of pre-2020 service by Fall 2023, while suburban agencies had recovery trajectories that varied by route. To the extent that the TTC’s recovery lagged its suburban counterparts, the analysis may somewhat understate the TTC’s structural dominance relative to a fully recovered scenario. (5) The H3 resolution-8 grain introduces a modifiable areal unit problem [
24]; this was examined directly by re-tessellating Mississauga at H3 resolution 7 and resolution 9 (
Note S7), and the qualitative pattern was preserved at both alternatives, although the absolute share of cells reaching 12 dep/h shifts modestly. A GTA-wide cross-resolution analysis would be a useful extension. None of these limitations invalidates the spatial and temporal patterns identified here, but each defines a direction for further work.
The next step is to build on this place-based baseline. A destination-based extension could evaluate access to jobs, health care, and services using full timetable routing [
35,
48,
49]; a reliability extension could compare scheduled and observed service; and a policy extension could test whether municipalities with weak weekend retention concentrate the most service-dependent populations. The Lorenz-curve and Gini-based equity measures follow Delbosc and Currie [
42] and Talen and Anselin [
50] and could be extended to destination-based accessibility.
6. Conclusions
This study used an H3 hexagonal grid, multi-agency GTFS feeds, and Statistics Canada census boundaries to map schedule-aware transit service intensity across the Greater Toronto Area. The indicator is an origin-side, supply-focused component of accessibility and does not by itself characterize destination access, travel time, transfers, fares, reliability, or crowding. Within those bounds, the GTA is served by a much larger and more differentiated transit network than a Toronto-only stop inventory suggests, but the distribution of meaningful scheduled service remains highly uneven: during the weekday morning peak, 70.6% of hexagons have no service within 800 m and only 22.6% reach at least 12 departures per hour. The distribution is structured most clearly as a distance-ring hierarchy, with strong service fields in the Urban Core and Inner Suburban Ring giving way to predominantly zero-service territory in the outer rings; this hierarchy is robust to alternative ring thresholds, exclusion of Halton municipalities, the choice of H3 resolution, and a network-distance walking-catchment specification (
Section 4.6). Weekend service weakens most sharply outside the core, indicating that peri-urban inequality has a temporal as well as spatial component. Higher service intensity co-locates with population density, rental tenure, ethnocultural diversity, and lower household income at the tract level; the income and density gradients strengthen under one-tract-one-observation aggregation, and all six principal CT-level associations remain statistically significant after Clifford–Richardson correction for spatial autocorrelation. These are area-level ecological associations and do not characterize individual outcomes. The strongest single relationship is with population density, indicating that built form and social geography jointly shape where scheduled transit supply accumulates.
A schedule-aware, multi-agency H3 framework offers a substantially clearer account of regional service-intensity inequality than stop counts or proximity-weighted indices, while remaining one input among several for a complete equity assessment. The highest-priority extension is a destination-based accessibility analysis using full timetable routing; real-time reliability comparisons, fare-integration analysis, and network-distance catchments at full GTA scale would further sharpen policy relevance. The population-weighted Gini coefficient of 0.60 provides a baseline against which future distributional progress can be tracked. For the GTA, the descriptive results suggest that further investigation is warranted into where service is frequent, temporally durable, and regionally coherent enough to support everyday life beyond the core, particularly in the Inner-to-Outer Suburban transition zone, where the gap between residential growth and frequent scheduled transit appears largest.