1. Introduction
Transit accessibility and travel time have attracted particular attention from both policymakers and planners. Due to the spatial allocations of transit stations and temporal route schedules, the transit accessibility and travel time have spatiotemporal dynamics. Therefore, visualizing the transit performance in a real-time manner becomes critical to transit operation and management. However, transit operational data (e.g., stop and boarding activities with reference to time and location) are typically collected by a range of information and communication technologies (ICTs), including Automated Vehicle Location (AVL), Automated Passenger Counting (APC), and Automated Fare Collection (AFC) systems [
1,
2]. Many transportation agencies transform those transit data into General Transit Feed Specification (GTFS) according to the Transit Feed Specification uniformed by the Google company. GTFS been widely adopted by transit agencies since the early 2010s to share transit schedules with the public via the internet [
3]. With historical GTFS feeds archived on platforms, GTFS has emerged as a invaluable research resource for transit analysis [
4,
5]. This historical data enable researchers to compare past versions with current data feeds, providing insights into how transit agencies have evolved their services over time.
Each GTFS feed is represented using multiple tables as a dataset, which encapsulates the complete transit service of an agency for a specific date range. The feed consists of several mandatory and optional tables, structured similarly to a typical SQL database with primary and foreign keys.
Figure 1 illustrates the relationships between these tables through an Entity Relationship Diagram (ERD). This diagram only encompasses the required tables and their relationships related to accessibility analysis (e.g., “agency”, “trips”, “stops”, and “routes”). The other GTFS tables are not shown (i.e., “shapes”, “frequencies”, “transfers”, “fare rules” and “attributes”). Each table represents a distinct aspect of the transit system, with primary and foreign keys establishing connections between different tables. The “trips” table is the center stage of the dataset. Traditionally, a trip typically refers to section of a vehicle traveling from the first/origin stop to the last/terminal stop. Thus, any trip has a unique “route id”. Similarly, each trip belongs to a specific “service id”. The detailed information of each bus stop is further recorded in the “stop times” table. Collectively, these interconnected tables construct a comprehensive representation of the entire transit plans.
Understanding the GTFS data structure facilitates the development of various applications, including route planning (e.g., Open Route Service [
6] and Mapnificent [
7]), public service information provision (e.g., Google Maps [
8] and Open Trip Planner [
9]), system visualization and analysis, etc. Nearly all these applications rely on the construction of a spatiotemporal network. Furthermore, there are also some commercial tools for transit planning (e.g., Remix [
10] and Conveyal [
11]). To further understand the differences between these existing tools,
Table 1 summarizes the performance results of the aforementioned services. All the aforementioned tools lack some flexibilities. Some do not allow uploading a special version of GTFS, and some services do not fully disclose the code and algorithms. Many services are mostly tailored for user interactions through graphic interfaces. Most importantly, in general, there is a lack of the ability to let users to download the spatiotemporal network to perform customized analysis using their own code. Thus, to address this fundamental need for a free, open-source product converting GTFS to its spatiotemporal network, we propose
GTFS2STN, a standardized tool designed to generate spatiotemporal transit networks as the foundation for comprehensive transit analysis.
The remainder of this paper is structured as follows:
Section 2 reviews the previous GTFS-based studies, focusing on three key aspects: transit accessibility, transit data visualization, and travel time variability.
Section 3 details the process of constructing spatiotemporal networks and outlines the algorithms used to generate outcomes.
Section 4 presents the case studies and demonstrates the basic functionalities of the
GTFS2STN application.
Section 5 concludes the paper by summarizing the new findings, limitations, potentials, etc.
Author Contributions
Conceptualization, L.D.H., C.B. and D.L.; methodology, D.L.; software, D.L.; validation, C.B., Y.G. and C.B.; formal analysis, D.L.; investigation, J.G.; resources, C.B.; data curation, D.L. and J.G.; writing—original draft preparation, D.L. and C.B.; writing—review and editing, D.L., M.K.; visualization, D.L.; supervision, L.D.H. and C.B.; project administration, C.B. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Data Availability Statement
Conflicts of Interest
The authors declare no conflicts of interest.
Abbreviations
The following abbreviations are used in this manuscript:
OD | origin–destination pair |
GTFS | General Transit Feed Specification |
GTFS2STN | The proposed method to convert GTFS data specification to a spatiotemporal network |
ERD | Entity Relationship Diagram |
DAG | Directed Acyclic Graph |
TCQSM | Transit Capacity and Quality of Service Manual |
BNA | Nashville International Airport |
References
- Prommaharaj, P.; Phithakkitnukoon, S.; Demissie, M.G.; Kattan, L.; Ratti, C. Visualizing public transit system operation with GTFS data: A case study of Calgary, Canada. Heliyon 2020, 6, e03729. [Google Scholar] [CrossRef]
- Ma, X.; Wang, Y. Development of a data-driven platform for transit performance measures using smart card and GPS data. J. Transp. Eng. 2014, 140, 04014063. [Google Scholar] [CrossRef]
- Google. GTFS. 2024. Available online: https://www.gtfs.org/ (accessed on 16 November 2024).
- TransitLand. 2024. Available online: https://www.transit.land/ (accessed on 16 November 2024).
- Mobility Database. 2024. Available online: https://mobilitydatabase.org/ (accessed on 16 November 2024).
- Open Route Service. 2024. Available online: https://openrouteservice.org/ (accessed on 3 May 2024).
- Mapnificent. 2024. Available online: https://www.mapnificent.net/ (accessed on 3 May 2024).
- Google. Google Maps API. 2024. Available online: https://developers.google.com/maps/documentation/routes (accessed on 16 November 2024).
- Open Trip Planner. 2024. Available online: https://www.opentripplanner.org/ (accessed on 16 November 2024).
- Remix Transit Planning. 2024. Available online: https://ridewithvia.com/solutions/remix/transit (accessed on 16 November 2024).
- Conveyal. 2024. Available online: https://conveyal.com/ (accessed on 16 November 2024).
- Mahajan, V.; Kuehnel, N.; Intzevidou, A.; Cantelmo, G.; Moeckel, R.; Antoniou, C. Data to the people: A review of public and proprietary data for transport models. Transp. Rev. 2022, 42, 415–440. [Google Scholar] [CrossRef]
- Sobral, T.; Galvão, T.; Borges, J. Visualization of urban mobility data from intelligent transportation systems. Sensors 2019, 19, 332. [Google Scholar] [CrossRef]
- Guo, J.; Brakewood, C. Analysis of spatiotemporal transit accessibility and transit inequity of essential services in low-density cities, a case study of Nashville, TN. Transp. Res. Part Policy Pract. 2024, 179, 103931. [Google Scholar] [CrossRef]
- Guo, J.; Mishra, S.; Brakewood, C. Analyzing gender and age differences in travel patterns and accessibility for demand response transit in small urban areas: A case study of Tennessee. J. Transp. Land Use 2024, 17, 675–706. [Google Scholar] [CrossRef]
- Lee, E.H.; Lee, H.; Kho, S.Y.; Kim, D.K. Evaluation of transfer efficiency between bus and subway based on data envelopment analysis using smart card data. Ksce J. Civ. Eng. 2019, 23, 788–799. [Google Scholar] [CrossRef]
- Lee, H.; Park, H.C.; Kho, S.Y.; Kim, D.K. Assessing transit competitiveness in Seoul considering actual transit travel times based on smart card data. J. Transp. Geogr. 2019, 80, 102546. [Google Scholar] [CrossRef]
- Yun, H.; Lee, E.H.; Kim, D.K.; Cho, S.H. Development of estimating methodology for transit accessibility using smart card data. Transp. Res. Rec. 2021, 2675, 159–171. [Google Scholar] [CrossRef]
- Antrim, A.; Barbeau, S.J. The Many Uses of GTFS Data–Opening the Door to Transit and Multimodal Applications; Location-Aware Information Systems Laboratory at the University of South Florida: Tampa, FL, USA, 2013; Volume 4. [Google Scholar]
- Para, S.; Wirotsasithon, T.; Jundee, T.; Demissie, M.G.; Sekimoto, Y.; Biljecki, F.; Phithakkitnukoon, S. G2Viz: An online tool for visualizing and analyzing a public transit system from GTFS data. Public Transp. 2024, 16, 893–928. [Google Scholar] [CrossRef]
- Lock, O.; Bednarz, T.; Pettit, C. The visual analytics of big, open public transport data–a framework and pipeline for monitoring system performance in Greater Sydney. Big Earth Data 2021, 5, 134–159. [Google Scholar] [CrossRef]
- Caros, N.S. Leveraging Spatial Relationships and Visualization to Improve Public Transit Performance Analysis. Ph.D. Thesis, Massachusetts Institute of Technology, Cambridge, MA, USA, 2021. [Google Scholar]
- Liu, L.; Porr, A.; Miller, H.J. Realizable accessibility: Evaluating the reliability of public transit accessibility using high-resolution real-time data. J. Geogr. Syst. 2023, 25, 429–451. [Google Scholar] [CrossRef] [PubMed]
- Klar, B.; Lee, J.; Long, J.A.; Diab, E. The impacts of accessibility measure choice on public transit project evaluation: A comparative study of cumulative, gravity-based, and hybrid approaches. J. Transp. Geogr. 2023, 106, 103508. [Google Scholar] [CrossRef]
- Miller, P.; de Barros, A.G.; Kattan, L.; Wirasinghe, S. Analyzing the sustainability performance of public transit. Transp. Res. Part Transp. Environ. 2016, 44, 177–198. [Google Scholar] [CrossRef]
- Aemmer, Z.; Ranjbari, A.; MacKenzie, D. Measurement and classification of transit delays using GTFS-RT data. Public Transp. 2022, 14, 263–285. [Google Scholar] [CrossRef]
- Luo, X.; Dong, L.; Dou, Y.; Zhang, N.; Ren, J.; Li, Y.; Sun, L.; Yao, S. Analysis on spatial-temporal features of taxis’ emissions from big data informed travel patterns: A case of Shanghai, China. J. Clean. Prod. 2017, 142, 926–935. [Google Scholar] [CrossRef]
- Park, Y.; Mount, J.; Liu, L.; Xiao, N.; Miller, H.J. Assessing public transit performance using real-time data: Spatiotemporal patterns of bus operation delays in Columbus, Ohio, USA. Int. J. Geogr. Inf. Sci.e 2020, 34, 367–392. [Google Scholar] [CrossRef]
- Farber, S.; Morang, M.Z.; Widener, M.J. Temporal variability in transit-based accessibility to supermarkets. Appl. Geogr. 2014, 53, 149–159. [Google Scholar] [CrossRef]
- Liu, L.; Porr, A.; Miller, H.J. Measuring the impacts of disruptions on public transit accessibility and reliability. J. Transp. Geogr. 2024, 114, 103769. [Google Scholar] [CrossRef]
- Martinazzo, L.; Falavigna, C. Public transport accessibility to hospitals in the city of Córdoba: A comparative analysis in times of a pandemic (2019–2021). Rev. Produção e Desenvolv. 2022, 8, e589. [Google Scholar] [CrossRef]
- Wessel, N.; Widener, M.J. Discovering the space–time dimensions of schedule padding and delay from GTFS and real-time transit data. J. Geogr. Syst. 2017, 19, 93–107. [Google Scholar] [CrossRef]
- Wessel, N.; Farber, S. On the accuracy of schedule-based GTFS for measuring accessibility. J. Transp. Land Use 2019, 12, 475–500. [Google Scholar] [CrossRef]
- Goliszek, S.; Połom, M. The use of general transit feed specification (GTFS) application to identify deviations in the operation of public transport at morning peak hours on the example of Szczecin. Eur. XXI 2016, 31, 51–60. [Google Scholar] [CrossRef]
- Fayyaz, S.K.; Liu, X.C.; Porter, R.J. Dynamic transit accessibility and transit gap causality analysis. J. Transp. Geogr. 2017, 59, 27–39. [Google Scholar] [CrossRef]
- Polzin, S.E.; Pendyala, R.M.; Navari, S. Development of time-of-day–based transit accessibility analysis tool. Transp. Res. Rec. 2002, 1799, 35–41. [Google Scholar] [CrossRef]
- Kukuliač, P.; Horák, J.; Fojtík, D.; Ivan, I.; Kolodziej, O.; Orlíková, L.; Marešová, P. Post COVID-19 public transport accessibility changes: Case study of Ostrava and Hradec Králové regions. Geogr. Cassoviensis 2023, 17. [Google Scholar] [CrossRef]
- Singh, S.S.; Javanmard, R.; Lee, J.; Kim, J.; Diab, E. Evaluating the accessibility benefits of the new BRT system during the COVID-19 pandemic in Winnipeg, Canada. J. Urban Mobil. 2022, 2, 100016. [Google Scholar] [CrossRef]
- Kar, A.; Carrel, A.L.; Miller, H.J.; Le, H.T. Public transit cuts during COVID-19 compound social vulnerability in 22 US cities. Transp. Res. Part D Transp. Environ. 2022, 110, 103435. [Google Scholar] [CrossRef]
- Sharma, I.; Mishra, S.; Golias, M.M.; Welch, T.F.; Cherry, C.R. Equity of transit connectivity in Tennessee cities. J. Transp. Geogr. 2020, 86, 102750. [Google Scholar] [CrossRef]
- Kim, H.; Song, Y. An integrated measure of accessibility and reliability of mass transit systems. Transportation 2018, 45, 1075–1100. [Google Scholar] [CrossRef]
- Wong, J. Leveraging the general transit feed specification for efficient transit analysis. Transp. Res. Rec. 2013, 2338, 11–19. [Google Scholar] [CrossRef]
- Bast, H.; Brosi, P.; Storandt, S. Real-time movement visualization of public transit data. In Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, Fort Worth, TX, USA, 4–7 November 2014; pp. 331–340. [Google Scholar]
- Wilbur, M.; Ayman, A.; Sivagnanam, A.; Ouyang, A.; Poon, V.; Kabir, R.; Vadali, A.; Pugliese, P.; Freudberg, D.; Laszka, A.; et al. Impact of COVID-19 on public transit accessibility and ridership. Transp. Res. Rec. 2023, 2677, 531–546. [Google Scholar] [CrossRef] [PubMed]
Figure 1.
The relationship of the GTFS tables to the spatiotemporal network generation.
Figure 2.
A simple demonstration of converting a transit network to a spatiotemporal transit network.
Figure 3.
A simplified example of generating a spatiotemporal network for three consecutive stops of a transit route.
Figure 4.
A spatiotemporal network generated in downtown Nashville, TN (a small segment of the network).
Figure 5.
A shortest paths starting from a bus stop in Nashville, TN.
Figure 6.
The 5 major steps of using the GTFS2STN application.
Figure 7.
The isochrone map to access any of the three Walmart markets in Nashville, Tennessee.
Figure 8.
Accessibility from/to the Nashville International Airport (BNA) using WeGo transit services in Nashville, Tennessee (all isochrone legends are same as the one in
Figure 7).
Figure 9.
Analyzing journey time changes of a network by comparing between 2019 and 2024 over 10 different bus stops across the network.
Figure 10.
Analyzing journey time changes of a networks. (a) A journey time scatter plot comparing between 14 November 2019 and 14 May 2020; (b) a journey time scatter plot comparing between 14 November 2019 and 13 November 2020.
Figure 11.
A comparison of the isochrone plot between
GTFS2STN and Mapnificent using similar query conditions (isochrone legends on the left subplot are same as the one in
Figure 7).
Table 1.
A comparison between the different tools/services analyzing GTFS for transit planning.
Service Name | Price | Upload GTFS | Source Code | Massive Analysis | Download Spatiotemporal Network |
---|
Google Maps API [8] | Low | No | No | Yes | No |
Open Route Service [6] | Free | No | Yes | No | No |
Open Trip Planner [9] | Free | No | Yes | No | No |
Mapnificent [7] | Free | No | Yes | No | No |
Open Trip Planner [9] | Free | No | Yes | No | No |
Remix [10] | High | No | No | Yes | No |
Conveyal [11] | High | Yes | No | Yes | No |
GTFS2STN | Free | Yes | Yes | Yes | Yes |
Table 2.
Basic elements of spatiotemporal network.
Name | Description |
---|
stop/transit node | traffic nodes of network (i.e., a bus stop at a give time) |
destination node | for each stop, there is a destination node to denote the arrival |
stop/waiting link | vertical links connecting the same stop over time |
transit link | links connecting different bus stops traversed by buses |
walking link | links connecting different bus stops traversed by walking |
arrival link | connecting from stop node to the destination node |
Table 3.
Evaluating travel time results between GTFS2STN model and Google Maps APIs (ground truth).
City Name | Transit Agency | MAE (Min) | RMSE (Min) | MAPE |
---|
New York, NY | MTA | 5.5 | 7.1 | 10.4% |
San Francisco, CA | BART | 1.8 | 5.1 | 2.6% |
Washington, DC | WMATA | 6.9 | 8.6 | 12.6% |
Austin, TX | CapMetro | 12.0 | 17.4 | 10.5% |
Table 4.
Evaluating journey time changes between 2019 and 2024 over different departure times. The null hypothesis journey time in November 2019 is equal or smaller than that of May 2020. Alternative hypothesis: Journey time in November 2019 is greater than that of November 2024.
Departure Time | Paired t-Test | Wilcoxon Signed-Ranked Test |
---|
| Statistics | p-Value | Statistics | p-Value |
---|
Overall | 11.41 | <0.0001 | 112,830 | <0.0001 |
8 AM | 5.73 | <0.0001 | 3376 | <0.0001 |
10 AM | 4.48 | <0.0001 | 3045 | <0.0001 |
12 PM | 4.19 | <0.0001 | 3030 | <0.0001 |
14 PM | 4.29 | <0.0001 | 3034 | <0.0001 |
16 PM | 3.90 | <0.0001 | 3194 | <0.0001 |
18 PM | 7.20 | <0.0001 | 3575 | <0.0001 |
Table 5.
Evaluating journey time changes between 2019 and 2024 over different origin sites. Null hypothesis Journey time in November 2019 is equal to or smaller than that of May 2020. Alternative hypothesis: Journey time in November 2019 is greater than that of November 2024.
| | Paired t-Test | Wilcoxon Signed-Ranked Test |
---|
Site | Coordinates | Statistics | p-Value | Statistics | p-Value |
---|
Site 1 | (36.143989, −86.799031) | 2.99 | 0.0020 | 1105 | 0.0009 |
Site 2 | (36.104841, −86.814243) | 2.23 | 0.0149 | 1027 | 0.0071 |
Site 3 | (36.130815, −86.666128) | 3.64 | 0.0003 | 1105 | 0.0009 |
Site 4 | (36.089632, −86.733373) | 4.37 | <0.0001 | 1222 | <0.0001 |
Site 5 | (36.167145, −86.826597) | −2.10 | 0.9801 | 520 | 0.9723 |
Site 6 | (36.228731, −86.725305) | 9.87 | <0.0001 | 1484 | <0.0001 |
Site 7 | (36.221122, −86.805488) | 14.18 | <0.0001 | 1475 | <0.0001 |
Site 8 | (36.100465, −86.871159) | −5.22 | 0.9999 | 230 | 0.9999 |
Site 9 | (36.051442, −86.714537) | 12.04 | <0.0001 | 1476 | <0.0001 |
Site 10 | (36.167234, −86.660707) | 8.47 | <0.0001 | 1432 | <0.0001 |
Table 6.
Evaluating journey time changes between 2019 and 2024 over different destination sites. Null hypothesis Journey time in November 2019 is equal or smaller than that of May 2020. Alternative hypothesis: Journey time in November 2019 is greater than that of November 2024.
| | Paired t-Test | Wilcoxon Signed-Ranked Test |
---|
Site | Coordinates | Statistics | p-Value | Statistics | p-Value |
---|
Site 1 | (36.143989, −86.799031) | 2.77 | 0.0037 | 1034 | 0.0060 |
Site 2 | (36.104841, −86.814243) | 5.49 | <0.0001 | 1264 | <0.0001 |
Site 3 | (36.130815, −86.666128) | 0.52 | 0.3002 | 883 | 0.1131 |
Site 4 | (36.089632, −86.733373) | 2.87 | 0.0029 | 1051 | 0.0039 |
Site 5 | (36.167145, −86.826597) | 4.15 | <0.0001 | 1173 | 0.0001 |
Site 6 | (36.228731, −86.725305) | 5.27 | <0.0001 | 1248 | <0.0001 |
Site 7 | (36.221122, −86.805488) | 5.86 | <0.0001 | 1324 | <0.0001 |
Site 8 | (36.100465, −86.871159) | 2.82 | 0.0033 | 1067 | 0.0026 |
Site 9 | (36.051442, −86.714537) | 5.97 | <0.0001 | 1261 | <0.0001 |
Site 10 | (36.167234, −86.660707) | 2.73 | 0.0042 | 1203 | <0.0001 |
Table 7.
Evaluating journey time changes between November 2019 and May 2020 over different departure times. Null hypothesis Journey time in November 2019 is equal or greater than that of May 2020. Alternative hypothesis: Journey time in November 2019 is smaller than that of May 2020.
Departure Time | Paired t-Test | Wilcoxon Signed-Ranked Test |
---|
| Statistics | p-Value | Statistics | p-Value |
---|
Overall | −11.53 | <0.0001 | 30,300 | <0.0001 |
8 AM | −8.91 | <0.0001 | 230 | <0.0001 |
10 AM | −4.58 | <0.0001 | 973 | <0.0001 |
12 PM | −3.74 | <0.0001 | 1016 | <0.0001 |
14 PM | −2.38 | <0.0001 | 1334 | 0.00206 |
16 PM | −6.33 | <0.0001 | 723 | <0.0001 |
18 PM | −3.25 | <0.0001 | 1181 | <0.0001 |
Table 8.
Evaluating journey time changes between November 2019 and November 2020 over different departure time. Null hypothesis Journey time in November 2019 is equal to that of November 2020. Alternative hypothesis: Journey time in November 2019 is not equal to that of November 2020.
Departure Time | Paired t-Test | Wilcoxon Signed-Ranked Test |
---|
| Statistics | p-Value | Statistics | p-Value |
---|
Overall | −2.17 | 0.0307 | 60,997 | 0.0307 |
8 AM | −3.05 | 0.0030 | 3376 | 0.0007 |
10 AM | 0.94 | 0.3495 | 3045 | 0.7983 |
12 PM | −1.78 | 0.0772 | 3030 | 0.0259 |
14 PM | 0.44 | 0.6642 | 3034 | 0.7460 |
16 PM | −1.17 | 0.2448 | 3194 | 0.1277 |
18 PM | −1.25 | 0.2133 | 3575 | 0.8523 |
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).