You are currently viewing a new version of our website. To view the old version click .
Sensors
  • Article
  • Open Access

21 December 2021

Data-Driven Performance Evaluation Framework for Multi-Modal Public Transport Systems

,
,
and
Group Biometry, Biosignals, Security, and Smart Mobility, Departamento de Matemática Aplicada a las Tecnologías de la Información y las Comunicaciones, Escuela Técnica Superior de Ingenieros de Telecomunicación, Universidad Politécnica de Madrid, Avenida Complutense 30, 28040 Madrid, Spain
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
This article belongs to the Special Issue Emerging Sensors Techniques and Technologies for Intelligent Environments

Abstract

Transport agencies require accurate and updated information about public transport systems for the optimal decision-making processes regarding design and operation. In addition to assessing topology and service components, users’ behaviors must be considered. To this end, a data-driven performance evaluation based on passengers’ actual routes is key. Automatic fare collection platforms provide meaningful smart card data (SCD), but these are incomplete when gathered by entry-only systems. To obtain origin–destination (OD) matrices, we must manage complete journeys. In this paper, we use an adapted trip chaining method to reconstruct incomplete multi-modal journeys by finding spatial similarities between the outbound and inbound routes of the same user. From this dataset, we develop a performance evaluation framework that provides novel metrics and visualization utilities. First, we generate a space-time characterization of the overall operation of transport networks. Second, we supply enhanced OD matrices showing mobility patterns between zones and average traversed distances, travel times, and operation speeds, which model the real efficacy of the public transport system. We applied this framework to the Comunidad de Madrid (Spain), using 4 months’ worth of real SCD, showing its potential to generate meaningful information about the performance of multi-modal public transport systems.

1. Introduction

The design and operation of a public transport system involves high levels of complexity as it must convey a wide range of perspectives involving, among others, economic investments, service provision, and land use. In their decision-making processes, transport agencies require precise information about the system they manage. Thus, a formal and systematic evaluation is the key to producing optimal outcomes and provide high-quality services [1].
A common way of approaching the evaluation of public transport systems is assessing their topologies and service. These methods focus on detecting the accessibility and vulnerability of the network itself. The former measures whether the users’ are able to reach their destinations on public transport [2]; the latter represents the ability of the transport system to recover from incidents or disturbances [3]. These metrics study the static (topology) or semi-static (operating frequencies) components of the transport system and provide meaningful information about its performance. However, a complete knowledge still requires an analysis of the specific routes traveled by users and the subsequent performance metrics.
The traditional approaches to cover this issue are fundamentally based on simulations and surveys. Simulations are inherently deterministic, which means that, although they are capable of reproducing well-known transport dynamics, they fail to reproduce spurious effects that often carry the most meaningful information about the travelers’ behaviors and the systems’ reactions to them. On the other hand, surveys are incomplete by nature as they must be limited to a predetermined number of people and biased to specific demographic groups (for example, elderly people are hardly represented as they tend to show low response rates). In addition to this, transport behaviors continuously evolve over time; however, given the economic costs, time and effort associated with surveys, they are rarely conducted, which results in huge gaps with no updated knowledge about transport systems.
Although this set of challenges are evidently demanding, the disruption of automatic fare collection (AFC) systems has opened new and innovative approaches to tackle them. Smart cards were originally thought as a simple, agile, efficient, and accurate method of payment. In addition, they provided a higher flexibility with regard to tariffs, improved user experience by reducing waiting times, and decreased the workload on public transport staff. Despite these useful features, their actual potential lies in exploiting these data to provide public transit agencies with valuable information, intensive in space and time, and particularized for individual passengers.
In order to characterize the performance of transport systems, we need to extract the origin and destination of the users’ journeys. These data are directly recorded by some AFC systems such as San Francisco and London, which require exit tapping as tariffs based on the actual traveled distance. However, the majority of AFC systems only register users entering the transport mode (entry-only systems) [4]. This is the reason why the research on AFC systems and origin–destination (OD) matrices estimation has been particularly active over the last two decades. The classical approach to this problem consists of linking journeys following the trip chaining model [5], a simple and effective way of creating an OD dataset that represents travelers’ behaviors in real public transport systems.
We will adopt this same approach, including an extension to it that incorporates specific features of multi-modal mobility, to exploit the spatial similarity observed in outbound and inbound journeys of the same passenger. Using the resulting individual routes, we are capable of constructing a data-driven framework for the performance analysis of public transport systems from a multi-modal perspective.
The remainder of the paper is structured as follows: Section 2 includes a review of the scientific works that study the performance analysis of transport systems from real OD matrices; Section 3 defines a multi-modal journey as the basic information to construct the performance evaluation framework and presents the methodology we followed to create the dataset of individual OD matrices; Section 4 presents the performance evaluation framework and the set of performance metrics it encompasses; and Section 5 discusses relevant issues associated with the calculations and indicates the further research to be carried out in this field.

3. The Adapted Trip Chaining Method

3.1. Preamble

As input to the performance evaluation framework, we need to collect data about the individual routes of users. However, a simple journey is a routine that encompasses a higher complexity than expected. Let us use an explanatory example to detail this complexity. Figure 1 shows a schematic of a complete journey, where the numbered vertical lines represent its relevant items such that the n-th item occurs in time t n and in spatial coordinates given by vector p n .
Figure 1. Schematic drawing of a complete journey.
For simplicity, let us imagine a female traveler who is completing a morning journey from home to her office building. She leaves home in t 1 and walks to a bus stop, reaching it in t 2 . She waits there until the bus arrives, boards and validates her travel card in t 3 . After a certain time, she alights in t 4 and walks again to a commuter train station; she validates again on the turnstile in t 5 and reaches the platform in t 6 . There, she waits until the train arrives in t 7 , boards, and travels until she gets off in t 8 . Then, she exits the commuter train network in t 9 , transits through some corridors, and enters the subway network where she validates her card again in t 10 . She reaches the platform in t 11 , boards the train in t 12 , and alights in t 13 . Still in the subway network, she then changes to a new line reaching its platform in t 14 . There, she waits until the train arrives in t 15 , boards, and travels until she alights in t 16 . She transits through corridors until she exits the subway network in t 17 . She walks some distance and finally reaches her work place in t 18 .
This journey produces 3 validations in total (items 3, 5, and 10) marked in filled magenta points () in Figure 1. This is the only available information about the journey provided by an entry-only AFC system, given that it does not record validations on the exits (items 4, 9, and 17 represented by empty boxes with magenta outlines () in Figure 1). Consequently, although the journey comprises 18 relevant items, we can only observe 3 of them In the SCD, being unable to receive direct information about the remaining 83.3 % .
From the point of view of urban mobility, this traveler completes the journey represented by the continuous orange line drawn on the lower part of Figure 1, from ORIGIN to DESTINATION: she leaves in t 1 , invests a total time given by t 18 t 1 , and travels a distance, following a straight line, given by d p 1 , p 18 .
The question is: Can we reproduce this journey from the SCD? Is it possible to characterize the exact journey, obtaining accurate values for every p n , t n pair? Unfortunately, the answer is “no”. Firstly, everything that happens outside the transport system is undetectable for the AFC technology. The walking stretches at the beginning and end of the journey are unknown, and there is no way to determine the values of p 1 , t 1 and p 18 , t 18 . In fact, although we have supposed that the traveler completed these stretches by foot, they could have also been covered by car. In addition, given that the first transport mode is the bus in this specific example, we could not know the first waiting time, at the bus stop, as we cannot obtain the value of p 2 , t 2 . In summary, there will always be a lack of information at the beginning and end of the journey. These stretches that fall out of the transport network cannot be characterized by an AFC system. On the plus side, we can describe the rest of the journey through estimation.
As we can observe in Figure 1, the time that a traveler invests inside the transport network can be:
  • Walking time: Transits made by foot through indoor corridors and streets traveled to (i) reach the boarding platform (which does not apply to the case of buses); (ii) change from one line to another on the same transport mode; or (iii) change from one mode to another.
  • Waiting time: Since the moment the traveler reaches the boarding point (stop, platform, etc.) until she gets on the vehicle (bus, train, etc.).
  • In-vehicle traveling time: The time the rider travels on the vehicles, including the times at each intermediate stop.
In the case of buses, walking and waiting times always occur on streets, which makes their estimation even more difficult and impossible at the beginning of the journey.
The time e1lapsed from the end of a traveling period and the start of the next one is called the transfer time (see Figure 1). Transfer times always involve some walking (outdoor or indoor) and waiting times. The length of these intervals is highly diverse, depending on the structure of the network, the size and design of stations, and the regularity of the services in each mode, plus some random component related to the time of arrival to the transport system.
Chaining trips imply two main tasks: first, deciding which validations belong to the same journey; second, estimating its unknown destination. A particular research work is then identified by the approach it uses to face these two tasks. Among the methods that are widely employed, we can mention the use of (i) time thresholds to aggregate validations [31]; (ii) spatial thresholds to estimate the destination [24]; (iii) statistical averages to characterize transfer times [28]; and (iv) specific algorithms, such as the schedule-based shortest path algorithm, to determine the most probable path inside a transport mode [5]. In this paper, we will adapt the trip chaining method to estimate the pair p 17 , t 17 in Figure 1, being able to “complete” the journey drawn as a magenta line between origin and destination, inferring its a priori unknown part (discontinuous magenta line). In this case, we focus on multi-modal mobility, and thus we will pair incomplete journeys based on the spatial similarity of routes traveled by the same rider in opposite directions.

3.2. Methodology

Within this work, we define a journey as a movement in a specific direction (outbound or return), from an activity to the next, with a particular purpose. A journey is composed by a set of segments that we name trips, defined by transfers that imply a validation within the same transport mode (from one bus to another) or among two different modes (from subway to bus, for example). The SCD recorded by an entry-only AFC system during a journey allow us to know the origin and intermediate stations, but not the destination (in which there is no validation).
Let V k , d be the time ordered set of the N S validations made with a smart card k on day d. Each validation is associated with a spatial location p n (including the latitude and longitude of the station where the validation was recorded) and a time stamp t n :
V k , d = p n , t n n = 1 N S , t 1 < t 2 < < t N S .
We group the validations in sets considering the time interval between each pair of consecutive validations. Specifically, fixing a time threshold δ T , if t n + 1 t n δ T , the two corresponding validations will belong to the same subset, and if t n + 1 t n > δ T , they will be separated in two different subsets. We will use the following numbering for validations within each subset:
1 , _ _ _ , N 1 , N 1 + 1 , _ _ _ , N 2 N S 1 + 1 , _ _ _ , N S .
Consequently, we have a partition of the set V k , d defined as:
V k , d = p n , t n n = N s 1 + 1 N s s = 1 S
satisfying:
( i ) t N s 1 + i + 1 t N s 1 + i δ T : s [ 1 , S ] , i [ 1 , N s N s 1 1 ] ( ii ) t N s + 1 t N s > δ T : s [ 1 , S 1 ]
with N 0 = 0 and S the number of created subsets.
Each element in the partition,
T k , d s = p n , t n n = N s 1 + 1 N s ,
is a subset of validations that we call stretch. Two consecutive validations in this subset delimit a trip; the time difference between both of them is the travel time of the trip, which accounts for the sum of the walking, waiting, and traveling times. A stretch represents a non-ended journey because its last station (where the traveler left the public transport network) is always unknown given the lack of validations at this exit. Note that there are N s N s 1 validations and N s N s 1 1 trips in a stretch.
From this point on, we will relax the notation for the sake of clarity. Let T k , d i and T k , d j be two stretches with the same number of validations, i.e.,
N i N i 1 = N j N j 1 = T .
Let us consider the sequence of spatial locations that define each stretch, plus one last unknown location (the destination of the journey). For the first stretch, we have:
a 1 a 2 a T ?
and for the second:
b 1 b 2 b T ?
The adapted trip chaining method couples, in inverse order, two incomplete journeys whose intermediate stations match, satisfying some spatial restriction, ignoring their origin and destination, i.e.,
a 1 a 2 a T ?
? b T b 2 b 1
where:
d a i , b j δ D , ( i , j ) | 2 i T , j = T ( i 2 ) ,
where δ D is a distance threshold.
In this case, we can pair stretches i and j, making a twofold assignment of spatial locations and times:
  • On one hand, we assign the location of the origin of each journey to the location of the destination of the other.
    a 1     a 2     a T     b 1 b 1     b 2     b T     a 1
  • On the other hand, we assign the travel time of the first trip of each journey to the travel time of the last trip of the other.
    t t a T b 1 = t t b 1 b 2 t t b T a 1 = t t a 1 a 2
Therefore we obtain two complete journeys, both with a known destination.
Note that, in order to obtain journeys from stretches formed by the validations in V k , d , V k , d must contain at least four validations. Consequently, the journeys this method derives are always multi-modal, thus including a change from one transport mode to another or from one line to another in the case of buses. In addition, validations in V k , d allow us to obtain S journeys at most, being S always an even number as journeys are determined using the corresponding reverse pairs.
In order to proceed to perform the pairing of incomplete journeys, our method uses two thresholds in time ( δ T ) and space ( δ D ). The search for journeys within the S stretches in V k , d will require a minimum of S / 2 matches and a maximum of C S , 2 . The algorithm presents a linear complexity with the number of validations. A detailed discussion on the selection process of these parameters can be found in Section 5.

4. Performance Evaluation Framework for Multi-Modal Public Transport Systems

The adapted trip chaining method generates the input data we need to construct a performance evaluation framework for multi-modal transport systems, which include a set of metrics that characterize the overall network and quantify its performance.

4.1. Dataset

In this study, we used data provided by the Consorcio Regional de Transportes de Madrid (CRTM), a public agency that coordinates and manages the public transport network in the Community of Madrid (8000 km 2 and 6.6 million people). It is a wide network (11,000 km ) that connects all 179 municipalities, including Madrid, Spain’s capital city. The dataset extends from 1 November 2018 to 28 February 2019 (4 months) and includes approximately 500 million transactions, widely exceeding other previous datasets such as those used in [5] (2 weeks and 95 million transactions) and [33] (1 month and 210 million transactions).
In 2017, the traditional magnetic ticket was substituted by a new contactless smart card. Nowadays, this is the only way of payment allowed in the public transport network. Under certain circumstances, it is permitted to use cash as the payment of single-trip tickets in urban buses and commuter trains. This form of payment accounts for a negligible percentage of trips, which is ultimately irrelevant for this study as it does not reflect multi-modal journeys. The smart cards can integrate up to three various modalities and can be validated on more than 33,000 stations. The card is only validated at the entry point of each transport mode.
Each validation includes: card ID (anonymized), time stamp, pay point (the boarding station, coded as a combination of the operator, the line, and the stop), the type of title (pass, multi-ticket, single ticket, tourist ticket… plus young ticket, senior ticket, children’s ticket…), and the type of discount (for large families, disabled persons…). The data were recorded with a temporal definition of 2 s. With the exception of buses, the SCD do not register the line and direction of the trip, just the station of validation.
The public transport network integrates six different modes: Subway (13 lines and 244 stations); Commuter Train (9 lines and 91 stations); Light Rail and Tram (4 lines and 58 stations); Urban Bus in Madrid Capital (216 lines and 10,563 stops); Intercity Bus (328 lines and 18,115 stops); and Urban Bus in the other Municipalities (52 lines and 4684 stops). Every line includes two directions. The bus fleet is equipped with an AVL technology, which allows us to obtain the line and stop directly from the SCD, with no additional processing.
This overall public transport network delivers high volumes of trips, registering validations that can exceed 5,000,000 a day. Among these, the multi-modal set of cards with four validations or more account for 59 % .
Apart from the SCD, the CRTM provided us with a set of metadata that specifies the spatial location of every station in the network.
We applied the adapted trip chaining method to the multi-modal journeys in this extensive database, creating the dataset we used as the input for the performance evaluation framework we propose. The potential of this framework resides in, first, its data-driven nature and, second, its ability to provide detailed information for each specific day of the year. Next, we present the metrics that form the performance evaluation framework, particularized for 23 November 2018, as an illustrative example of the results.

4.2. Statistical Characterization

As the first basic metric to evaluate a public transport system, Figure 2 shows the statistical characterization of the reconstructed trips. On the top right corner of the histogram of each magnitude (distance, duration, and speed) we indicate its minimum, maximum, mean, and standard deviation.
Figure 2. Statistical characterization for journeys (23 November 2018): (a) distance. (b) duration. (c) speed.
The information presented in Figure 2 allows a direct interpretation of the public transport network in the Comunidad de Madrid: The average traveler traverses approximately 14 km in a straight line to reach a place of work, study, or leisure, investing 50 min . In both outbound and inbound journeys. Thus, this rider travels at an average speed of almost 17 km / h .

4.3. Signature of a Public Transport System

In Figure 3a, each blue point represents a journey with a specific duration (X axis) and traversed distance (Y axis). The red circle indicates the average journey and the slope of the discontinuous red line coincides with its velocity. This spatio-temporal representation allows us to characterize the performance of any public transport system. Every point underneath the line represents a journey that was slower than the average. In consequence, these points are able to highlight connections between geographical areas for which the network provides a poor transport solution. Thus, an optimal public transport system should aim for a high average velocity (represented by a steep slope) and gathering every journey around it so that all areas are optimally connected.
Figure 3. Space-time representation of journeys (23 November 2018): (a) points of individual journeys; (b) contour lines.
The high number of points shown in Figure 3a hides part of the information. Consequently, we present Figure 3b, which shows the contour lines for the bivariate histogram of journeys in bins of 1 km × 1 min . ; note the zoom applied to the axis. The area with a higher number of journeys beneath the discontinuous red line corresponds to the grid given by 37– 38 min . and 7– 8 km , showing a velocity of 12 km / h (obviously lower than the speed of the average journey). We name this representation the signature of a transport system and provides a direct visualization of the spatio-temporal distribution of journeys. The signature of a transport system will reflect any future modifications made on the network (increasing the number of stations, restructuring the lines, extending the inter-modal transportation hubs, changing timetables, etc.), thus providing a tool to compare the results of these actions. Successful modifications will contribute to slender the shape of the contour lines, i.e., making them more elliptical and aligned to the line.

4.4. Journeys Departure Time

Figure 4 shows the number of journeys versus the time of departure with an aggregation period of 5 min . We can observe that most outbound journeys start in the morning, during a relatively narrow time interval (the great majority between 07:00 and 09:00). Return journeys mainly occur in the afternoon and present a much higher variability (from 13:00 to 22:00, with a significant peak between 14:00 and 16:00). This reflects the typical commuter behavior as reported in works such as [29].
Figure 4. Number of journeys depending on the time of departure (23 November 2018): (a) outbound journeys; (b) return journeys.

4.5. Operating Speed

The CRTM uses the operating speed as a parameter to characterize each transport mode: 23 km / h for subway, 47 km / h for commuter train, etc.
The average velocity of all journeys reconstructed by the proposed method can be interpreted as the operating speed of the transport system as a whole. This way it represents the average velocity at which passengers travel in the transport network (including the travel, waiting, transfer times, and walking times). We can observe in Figure 5 the regularity of this variable during a complete week; note that it keeps this regularity even on weekends. Consequently, the operating speed can be considered an intrinsic parameter of an inter-modal public transport network, thus characterizing it. As such, it allows a simple and straightforward comparison of the efficiency of public transport networks within different cities and regions.
Figure 5. Boxplot of speed: week of 19 November 2018 (Mon) to 25 November 2018 (Sun).

4.6. Enhanced OD Matrices

From the information about individual journeys, we can easily obtain the corresponding OD matrices. The concise mobility survey carried out by CRTM in 2014 divided the territory in 84 zones. We have used this same spatial aggregation to construct the resulting OD matrices in this study. Figure 6 shows a screenshot of the software tool we have developed to obtain reconstructed journeys and OD matrices. The tool was developed in Matlab and aims at providing meaningful information for the assessment of multi-modal public transport systems. It is fed with data corresponding to paired journeys, which result from the adapted trip chaining method. The tool provides control buttons to select a specific date and time interval; in addition, we can choose the analysis to be performed on the outbound journeys, the return journeys, or both; finally, we can define each zone as the origin or destination of all the journeys. Thus, each setting of the control parameters provides a row or a column of the complete OD matrix resulting from the journeys selected by the applied filters. The information is presented in different modes: as colors on a map, values in a table, and bar charts; each of them can be reordered depending on a set of magnitudes: percentage of journeys (i.e., the values in the OD matrix), distances, travel times, and speeds.
Figure 6. Screenshot of the software tool.
As a consequence, this tool generates enhanced OD matrices, extending the basic information about the distribution of journeys throughout the network, with the distance, travel time, and speed corresponding to each OD pair. As an example, Figure 6 shows that zone “(09) Moncloa–Aravaca” is the destination of 13.3 % of the outbound journeys originating in zone “(01) Centro”, elementary OD matrix information. On top of it, we also know that these journeys imply an average traveled distance of 6.2 km and an average travel time of 00:33:48, resulting in an average speed of 11.7 km / h . Furthermore, we can observe that the zone with the best connectivity is “(50) Móstoles”, which ranks first in terms of operating speed ( 23.3 km / h ) and forth regarding the number of journeys.

5. Discussion, Conclusions, and Further Research

5.1. Comparative Study

Characterizing mobility often encompasses an inherent problem for validation: the lack of a solid ground truth [34]. For this reason, we must rely on comparative approaches that may highlight an underlying common structure.
In this respect, let us first perform a comparative analysis of the results we obtained and those reflected in the latest Household Mobility Survey performed in the Comunidad de Madrid. This survey was carried out in 2018 and included 85,064 transport users throughout the complete region. In order to compare our results to this survey, we segmented the overall OD matrices considering the transport modes employed. Table 1 shows the percentages of trips traveled in any combination of two transport modes in the network. This set of trips is the largest, accounting for more than 90 % of the total mobility. Columns HMS18 and PEF include the corresponding percentages of each category resulting from the Household Mobility Survey and the performance evaluation framework, respectively. We can observe how both approaches reach very similar results in each category (deviations below 7.5 % ), which indicates the capability of the proposed framework to represent the detailed mobility in the Comunidad de Madrid.
Table 1. Household Mobility Survey and Reverse Pairing Method.
In addition, let us visit again the histograms presented in Figure 2 to observe that they show an outstanding regularity and resemble the usual patterns observed in public transport systems, such as those reported in [35], which were exactly reconstructed from exit data captured in Shanghai’s (China) subway. Apart from their regularity, these histograms are also consistent, i.e., the definite shape of the distribution is shown with approximately the first 5000 completed journeys. This fact indicates that our performance evaluation framework precisely represents the multi-modal mobility in public transport systems.

5.2. Parameter Selection

The adapted trip chaining method includes the selection of two thresholds in time, δ T , and distance, δ D . These parameters are specific to the public transport system under study; thus, we have to determine them accordingly. In this subsection, we will discuss the impact of the selection of these parameters.
Let us remember that δ T fixes the maximum time interval accepted between two consecutive validations to be considered part of the same journey. If we choose a low value of δ T , the method will not include journeys that comprise long trips. The duration of these trips depends on the specific transport network. As an example, intercity buses in the Comunidad de Madrid connect peripheral areas with Madrid’s city center, taking 50 to 55 min to be completed. Thus δ T must be adapted to this duration. On the other hand, if we take an excessive value, the method will group together trips belonging to different journeys, which will not be paired correctly. This will slightly reduce the number of detected journeys. Considering these two limitations, we carried out a sensitivity analysis that resulted in selecting δ T = 1 h (see Table 2).
Table 2. Sensitivity Analysis of the Time Threshold δ T .
Similarly, the distance threshold δ D is determined by the network’s topology. It represents the typical distance between stations of different modes. In this respect, we must consider distances in transport hubs (including intercity buses, subway, and commuter trains), stations that comprise more than one transport mode (mainly subway and commuter train), and streets. Note that in this case the number of journeys will grow with increasing values of δ D at the expense of reducing their spatial definition. Consequently, we must choose a δ D that results in a relatively stable number of journeys. Following this rule, we selected δ D = 200 m for the public transport network in the Comunidad de Madrid (see Table 3).
Table 3. Sensitivity Analysis of the Distance Threshold δ D .

5.3. Conclusions and Further Research

Multi-modal mobility represents a considerable percentage of the overall mobility in a city or region. Thus, it is the key to developing tools explicitly tailored to assess how public transport networks respond to it. To this end, we have created a framework for the evaluation of multi-modal public transport systems that uses a data-driven approach based on journeys reconstructed by means of an adapted trip chaining method.
The performance evaluation framework we have proposed opens up a wide range of possibilities for further research. In order to completely characterize mobility, a new set of aspects must be taken into account. First, unimodal and sporadic journeys in public transport networks must be analyzed. Second, mobility on private vehicles, including cars, motorbikes, bikes, scooters, etc., must be incorporated to the model. Finally, pedestrian flows and the emerging shared mobility have to be addressed.
The next step of our investigation will incorporate the share of unimodal recurrent mobility, i.e., journeys using a single mode, to the input dataset to our performance evaluation framework. The required assignment of destinations is a priori feasible, using SCD from a set of dates. The problem is determining the travel times for the last trip. In this respect, we will consider specific data of the service and averages derived from other users’ journeys. In any case, this is a complex process that will require further research and the development of specific algorithms.
The accurate assessment of multi-modal mobility is important for the future redesigning of public transport systems. This includes, for example, the construction of direct links that avoid transfers and improve the overall urban mobility. Using a data-driven approach enables the evaluation of the transport system 365 days a year, which allows the extraction of meaningful knowledge about passenger behavior, its evolution, and their response to new changes and updates in the network and services.

Author Contributions

Conceptualization, A.B.R.G. and J.J.V.D.; Software, M.R.W.; Formal Analysis, M.R.W. and R.F.P.; Investigation, J.J.V.D. and A.B.R.G.; Writing—Original Draft Preparation, M.R.W.; Writing—Review and Editing, R.F.P.; Validation, R.F.P.; Funding Acquisition, J.J.V.D. and A.B.R.G. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partially supported by Comunidad de Madrid, through the scheme “Convenio Plurianual con la Universidad Politécnica de Madrid en la línea de actuación Programa de Excelencia para el Profesorado Universitario”.

Data Availability Statement

The data presented in this study are available on request from Consorcio Regional de Transportes de Madrid. The data are not publicly available due to its proprietary nature.

Acknowledgments

The authors would like to thank the support of Consorcio Regional de Transportes de Madrid (CRTM), and specifically, Juan Elices Torrente, Domingo Martín Duque, and Luis Criado Fernández.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Weisbrod, G.; Stein, N. Public transport evaluation. In The Routledge Handbook of Public Transport; Mulley, C., Nelson, J.D., Ison, S., Eds.; Taylor and Francis Group: London, UK, 2021; Chapter 11; pp. 136–151. [Google Scholar]
  2. Albacete, X.; Olaru, D.; Paül, V.; Biermann, S. Measuring the Accessibility of Public Transport: A Critical Comparison between Methods in Helsinki. Appl. Spat. Anal. Policy 2017, 10, 161–188. [Google Scholar] [CrossRef] [Green Version]
  3. Berdica, K. An introduction to road vulnerability: What has been done, is done and should be done. Transp. Policy 2002, 9, 117–127. [Google Scholar] [CrossRef]
  4. Kumar, P.; Khani, A.; He, Q. A robust method for estimating transit passenger trajectories using automated data. Transp. Res. C Emerg. Technol. 2018, 95, 731–747. [Google Scholar] [CrossRef]
  5. Barry, J.J.; Newhouser, R.; Rahbee, A.; Sayeda, S. Origin and destination estimation in New York City with Automated Fare System data. Transp. Res. Rec. J. Transp. Res. Board 2002, 1817, 183–187. [Google Scholar] [CrossRef]
  6. Scheurer, J.; Silva, C. Refining Accessibility Tools: SAL and SNAMUTS for Porto. In Proceedings of the 24th AESOP Congress, Helsinki, Finland, 7–10 July 2010. [Google Scholar]
  7. Curtis, C.; Scheurer, J. Performance measures for public transport accessibility: Learning from international practice. J. Transp. Land Use 2017, 10, 93–118. [Google Scholar] [CrossRef]
  8. Tiznado-Aitken, I.; Muñoz, J.C.; Hurtubia, R. Public transport accessibility accounting for level of service and competition for urban opportunities: An equity analysis for education in Santiago de Chile. J. Transp. Geogr. 2021, 90, 102919. [Google Scholar] [CrossRef]
  9. Cats, O.; Jenelius, E. Dynamic Vulnerability Analysis of Public Transport Networks: Mitigation Effects of Real-Time Information. Netw. Spat. Econ. 2014, 14, 435–463. [Google Scholar] [CrossRef]
  10. Cats, O.; Yap, M.; van Oort, N. Exposing the orle of exposure: Public transport network risk analysis. Transp. Res. Part A 2016, 88, 1–14. [Google Scholar]
  11. Rodríguez-Núñea, E.; García-Palomares, J.C. Measuring the vulnerability of public transport networks. J. Transp. Geogr. 2014, 35, 50–63. [Google Scholar] [CrossRef] [Green Version]
  12. Saaty, R. The analytic hierarchy process—What is it and how it is used. Math. Model. 1987, 9, 161–176. [Google Scholar] [CrossRef] [Green Version]
  13. Sezhian, M.V.; Muralidharan, C.; Nambirajan, T.; Deshmukh, S. Developing a performance importance matrix for a public sector bus transport company: A case study. Theor. Empir. Res. Urban Manag. 2011, 6, 5–14. [Google Scholar]
  14. Barabino, B.; Cabras, N.A.; Conversano, C. An Integrated Approach to Select Key Quality Indicators in Transit Services. Soc. Indic. Res. 2020, 149, 1045–1080. [Google Scholar] [CrossRef]
  15. Dey, S.; Winter, S.; Tomko, M. Origin-Destination flow information from link count data. Sensors 2020, 20, 5226. [Google Scholar] [CrossRef] [PubMed]
  16. Zhang, M.; Guo, Y.; Ma, Y. A Probability Model of Transit OD Distribution Based on the Allure of Bus Station. J. Trans. Inf. Saf. 2014, 32, 57–61. [Google Scholar]
  17. Jung, J.; Sohn, K. Deep-learning architecture to forecast destinations of bus passengers from entryonly smart-card data. IET Intell. Transp. Syst. 2017, 11, 334–339. [Google Scholar] [CrossRef]
  18. Barry, J.J.; Freimer, R.; Slavin, H. Use of entry-only Automatic Fare Collection data to estimate linked transit trips in New York City. Transp. Res. Rec. J. Transp. Res. Board 2009, 2112, 53–61. [Google Scholar] [CrossRef]
  19. Zhao, J. The Planning and Analysis Implications of Automated Data Collection Systems: Rail Transit OD Matrix Inference and Path Choice Modeling Examples; Massachusetts Institute of Technology: Cambridge, MA, USA, 2004. [Google Scholar]
  20. Pelletier, M.P.; Trépanier, M.; Morency, C. Smart card data use in public transit: A literature review. Transp. Res. C Emerg. Technol. 2011, 19, 557–568. [Google Scholar] [CrossRef]
  21. Li, T.; Sun, D.; Jing, P.; Yang, K. Smart card data mining of public transport destination: A literature review. Information 2018, 9, 18. [Google Scholar] [CrossRef] [Green Version]
  22. Cui, A. Bus Passenger Origin-Destination Matrix Estimation Using Automated Data Collection Systems. Master’s Thesis, Massachusetts Institute of Technology, Cambridge, MA, USA, 2006. [Google Scholar]
  23. Zhao, J.; Rahbee, A.; Wilson, N.H.M. Estimating a rail passenger trip origin-destination matrix using automatic data collection systems. Comput. Aided Civil Infras. Eng. 2007, 22, 376–387. [Google Scholar] [CrossRef]
  24. Trépanier, M.; Chapleau, R. Destination estimation from public transport smartcard data. In Proceedings of the 12th IFAC Symposium on Information Control Problems in Manufacturing, St. Etienne, France, 17–19 May 2006; pp. 393–398. [Google Scholar]
  25. Nassir, N.; Khani, A.; Lee, S.G.; Noh, H.; Hickman, M. Transit stop-level origin–destination estimation through use of transit schedule and automated data collection system. Transp. Res. Rec. J. Transp. Res. Board 2011, 2263, 140–150. [Google Scholar] [CrossRef] [Green Version]
  26. Li, D.; Lin, Y.; Zhao, X.; Song, H.; Zou, N. Estimating a transit passenger trip origin-destination matrix using Automatic Fare Collection system. In Proceedings of the 16th International Conference on Database Systems for Advanced Applications (DASFAA), Hong Kong, China, 22–25 April 2011; pp. 502–513. [Google Scholar]
  27. Wang, W.; Attanucci, J.P.; Wilson, N.H.M. Bus passenger origin-destination estimation and related analyses using automated data collection systems. J. Public Transp. 2011, 14, 131–150. [Google Scholar] [CrossRef] [Green Version]
  28. Munizaga, M.A.; Palma, C. Estimation of a disaggregate multimodal public transport origin-destination matrix from passive smartcard data from Santiago, Chile. Transp. Res. C Emerg. Technol. 2012, 24, 9–18. [Google Scholar] [CrossRef]
  29. Briand, A.S.; Côme, E.; Mahrsi, M.K.E.; Oukhellou, L. A mixture model clustering approach for temporal passenger pattern characterization in public transport. In Proceedings of the IEEE International Conference on Data Science and Advanced Analytics (DSAA), Paris, France, 19–21 October 2015; pp. 1–10. [Google Scholar]
  30. Nunes, A.A.; Dias, T.G.; e Cunha, J.F. Passenger journey destination estimation from automated Fare Collection System data using spatial validation. IEEE Trans. Intell. Transp. Syst. 2016, 17, 133–142. [Google Scholar] [CrossRef]
  31. Seaborn, C.; Attanucci, J.; Wilson, N.H.M. Analyzing multimodal public transport journeys in London with smart card fare payment data. Transp. Res. Rec. J. Transp. Res. Board 2009, 2121, 55–62. [Google Scholar] [CrossRef]
  32. Alsger, A.; Assemi, B.; Mesbah, M.; Ferreira, L. Validating and improving public transport origin–destination estimation algorithm using smart card fare data. Transp. Res. C Emerg. Technol. 2016, 68, 490–506. [Google Scholar] [CrossRef]
  33. Zhao, J.; Qu, Q.; Zhang, F.; Xu, C.; Liu, S. Spatio-temporal analysis of passenger travel patterns in massive smart card data. IEEE Trans. Intell. Transp. Syst. 2017, 18, 3135–3146. [Google Scholar] [CrossRef]
  34. Scherrer, L.; Tomko, M.; Ranacher, P.; Weibel, R. Travelers or locals? Identifying meaningful sub-populations from human movement data in the absence of ground truth. EPJ Data Sci. 2018, 7, 1–21. [Google Scholar] [CrossRef]
  35. Sun, Y.; Shi, J.; Schonfeld, P.M. Identifying passenger flow characteristics and evaluating travel time reliability by visualizing AFC data: A case study of Shanghai Metro. Public Transp. 2016, 8, 341–363. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.