Evaluation of Traffic Assignment Models through Simulation

Conflicts of Interest: The authors declare no conflicts of interest. The funders participated in the collection of data, and agree with the decision to publish the results. Abstract Assignment methodologies attempt to determine the traffic flow over each network arc based on its characteristics and the total flow over the entire area. There are several methodologies, some faster and others that are more complex and require more time to complete the calculation. In this study, we evaluated different assignment methodologies using a computer simulation and tested the results in a specific case study. The results show that the "all-or-nothing" methods and the incremental assignment methods generally yield results with an unacceptable level of error unless the traffic is divided into four or more equal parts. The method of successive averages (MSA) is valid starting from a relatively low number of iterations, while the user equilibrium methodologies (approximated using the Frank & Wolfe algorithm) are valid starting from an even lower number of iterations. These results may be useful to researchers in the field of computer simulation and planners who apply these methodologies in similar cases.

Funding: This research was carried out using funds provided under the Research Contract entitled -Simulación y Análisis de Tráfico dentro del Estudio Informativo de la Autovía Orbital B-40‖ (Simulation and Traffic Analysis within the Study Report of the B-40 Orbital Highway) between GPYO Innova, S.L. and the University of Burgos (Spain) with reference number W24T06.

Introduction
Traffic assignment techniques can be used to assess traffic intensities on a track or road system based on its physical and functional characteristics and the potential traffic that it can use.
An assignment process has two stages.First, it is necessary to determine the possible traffic that can be used by the network, which is usually expressed through current and future Origin/Destination matrices.Based on this data, in the second stage of the process, this potential traffic is assigned to each of the sections of a given road network.There are a multitude of procedures for this purpose, most of which are empirical and based on statistical observations of user behavior, certain track conditions, and the traffic itself.Procedures with a higher level of precision are, of course, more complicated.However, we cannot forget that we are starting from an estimate of future potential traffic; therefore, each result must be studied by a decision-maker in order to develop appropriate contrasts and produce final results that are consistent with the analyzed situation.
There are several assignment methodologies, some of which have already been compared in certain case studies, although usually not all at once, and only sometimes using simulation techniques [1,2].The assignment stage has been analyzed at times as an intermediate problem between optimization and equilibrium issues [3].We can also attempt to analyze the zonal distribution and assignment stages together, as was performed by Tan et al. [4], in a study in which the use of Intelligent Transport Systems (ITS) was incorporated.The number of possible applications of assignment methodologies is enormous, among which we can include as especially novel their adaptation to multimodal networks [5], their use to minimize the environmental impact of traffic [6] and the damage caused to infrastructure by traffic [7], or to optimize the process of searching for a place to park a vehicle [8].In addition, if we add to this the use of simulation with computer tools [9,10], the number of possibilities is greatly increased.However, the assignment problem is not only associated with road traffic, but also with other means of transport.For example, within road networks, assignment methods are applied to private vehicles, including taxis and autonomous vehicles, and public transportation (PT).For example, Long et al. [11] introduce the concept of expected rate of return of taxi drivers and perform a dynamic taxi traffic assignment problem supposing real-time traffic information provision; Bischoffa & Maciejewski [12] analyze the possible effect of a city-wide replacement of private cars with autonomous taxi fleets; Heilig et al. [13] use a microscopic travel demand model to simulate the mode choice www.videleaf.combehaviour in the assumption of the existence of a large autonomous mobility on demand service instead private cars; and Poulhès & Berrada [14,15] analyze the effect of dial-a-ride services on the assignment problem.Within the PT assignment problem, Liu & Ceder [16] and Eltved et al. [17] consider different schemes for that; Cats & Hartl [18] focus on the discomfort that causes congestion on PT users and how it affects to their choices; and Nuzzolo & Comi [19] include big data issues to this modelling.Within railway transport, there are also interesting studies that employ these techniques to estimate the passenger load in each route.Thus, Lin et al. [20] investigate the railway passenger pricing problem supposing that operators could modify ticket prices to optimize the system's performance; Xu et al. [21] propose a dynamic assignment problem for urban rail transit networks considering queuing process, capacity constraints and congestion effects; and Gao & Wu [22] propose a method to calculate the proportion of passengers on each path based on the entry and exit time records of users.
In the maritime domain, assignment methods are also applicable and there are several studies that have analyzed the assignment problem by considering traffic assignment close to seaports.Venturini et al. [23] and Iris et al. [24] study the integrated berth allocation in seaport container terminals; Li & Jia [25] model the traffic scheduling problem as a mixed integer linear program, to minimize the berthing and departure delays of vessels; Han et al. [26] study a storage yard management problem if the loading and unloading activities are both heavy and concentrated; and Iris et al. [27] formulate a mathematical model for the containership loading problem where the terminal has the right to decide which specific container to load for each slot.
In air transportation, we can find some examples of the application of assignment methods.For instance, Ganić et al. [28] and Ho-Huu et al. [29] develop mathematical models of air traffic assignment in order to minimize the noise effects on population; and Starita et al. [30] consider future capacity www.videleaf.comprovision in terms of the available man-hours of air traffic controllers.
Finally, several studies have considered two or more means of transport simultaneously in a multimodal scenario.Thus, Yu & Guo [31] develop a tri-level combined-mode traffic assignment model; Pi et al. [32] include heterogeneous traffic on roads, parking availability and travel modes (such as solo-driving, carpooling, ride-hailing, public transit, and park-and-ride); Macedo et al. [33] propose an efficient traffic assignment, where users are not only concerned about travel times, but also about global and local pollutant emissions; Jiang et al. [34] include the car-truck interaction paradox in assignment problems; and Dimitrov et al. [35] model the interaction between buses, passengers and cars on a bus route.Some researchers have applied these methodologies in a specific case study, often with the support of computer simulation [36][37][38][39], while others have attempted to assess different techniques or even propose novel ones [40][41][42][43][44][45].Among the first ones, Zhang et al. [36] integrate an activity-based travel demand model with a dynamic traffic assignment model for the Baltimore Metropolitan Council; Shafiei et al. [37] develop a simulationbased dynamic traffic assignment model of Melbourne, Australia; Zhu et al. [38] use dynamic traffic assignment for a case study in Maryland; and Kucharski & Gentile [39] apply the Information Comply Model on different situations, including a corridor in the north of Kraków, Poland, and the Sioux-Falls network.Within the latter ones, Zhang et al. [40] analyze the calibration of dynamic traffic assignment models applying the extended Kalman filter; Prakash et al. [41] present a dimensionality reduction of the assignment models calibration problem, based on principal components; Du et al. [42] focus on the dynamic traffic assignment problem on large-scale expressway networks especially under the condition of traffic events (such as severe weather, large traffic accidents etc.) and propose an approximate solution algorithm; Lin & Chen [43] develop a simulation-based multiclass, multimodal traffic assignment model for evaluating traffic control plans of planned special events; Batista & Leclercq [44] study a regional dynamic www.videleaf.comtraffic assignment framework for macroscopic fundamental diagram considering stochasticity on both the trip lengths and the regional mean speed; and Bagdasar et al. [45] examine discrete and continuous optimization and equilibrium-type problems and make a comparison of them with the Beckmann cost function.
Finally, we can cite some interesting studies that consider multiclass, multi-modal and elastic demand traffic for assignment models.Cascetta [46] covers all the issues on traffic assignment; Cantarella [47] provided a highlight on traffic assignment with elastic demand; and Cantarella et al. [48][49][50] propose an overview on solution algorithms for traffic assignment, presenting a general fixed-point formulation for multi-user stochastic equilibrium assignment with variable and elastic demand and comparing it with other approaches.
Following this line of research, the main objectives of this study are: (1) to develop an assignment model for a specific case study (the new B-40 highway in Barcelona, Spain) and implement it using traffic macro-simulation tools; and (2) to evaluate the influence of the use of different assignment models on the results and the numbers of iterations in them.This paper is structured as follows.After a brief introduction, Section 2 outlines the main techniques used in the modelling work performed in this study.Section 3 contains a description of the case study and the principles adopted for delimitation, zonification, and obtaining the base Origin/Destination matrix.The principles adopted for applying the general methodologies to the specific case study that is being analyzed are then set out in Section 4. Section 5 contains an evaluation of the best assignment methodology for the specific case study to be analyzed.Finally, in Section 6, we discuss our results and the main conclusions drawn from the full study.

Methods for Traffic Assignment
Potential users of a transportation system in which there are different alternative itineraries generally have incomplete information about the conditions of each section; however, they www.videleaf.commust make their decision with these inaccurate data.These data include the length of each segment, the quality of the track (in terms of layout, pavement, and safety conditions), and, finally, the degree of congestion at all times, which decisively affects the vehicle's speed and, therefore, the time spent on the route.The possibility of knowing the above data ranges from high to low.In addition, with regard to congestion, the knowledge that each driver has is very subjective and is generally based on his/her previous experiences, since in very few cases does he/she have information about the actual situation.
Typically, when choosing from several possible itineraries, we consider different types of factors (e.g., distance, travel time and/or cost, comfort, and toll if any) that are usually grouped into a single variable called generalized cost.In practice, the process that we need to follow in order to assign traffic to the existing road network in a given area of study has the following steps:  Zone division.First, the entire study area to be analyzed must be divided into zones of more or less homogeneous characteristics. Origin/Destination matrix construction.Once the study area has been divided into zones, we need to construct a matrix that lists the movements of each origin/destination pair.We should also distinguish between light and heavy vehicles, since the conditions for the assignment of each type of vehicle are different. Definition of road networks.Each defined zone is represented by its centroid, which is the fictional place that generates or attracts all trips in the area.All zones must be joined to one another through the existing transportation system and, if applicable, through the future one (constituting two separate transport networks: a current one and a future one). Calculation of travel costs and/or times.For each of the sections of the considered network (current and future), the generalized cost of travelling along it under certain traffic conditions must be determined.It is desirable to calculate the costs for various traffic levels in order to take www.videleaf.cominto account the capacity of each segment, or even do it dynamically at each iteration of the process, depending on the traffic supported by each part of the network. Assignment.Using the origin and destination data, it is advisable to calculate the traffic intensities for the existing network in the current situation, and then compare the results obtained with the traffic accounts that have been made.Once this check has been carried out, each of the movements in the Origin/Destination matrix is assigned to the current and future networks.A separate assignment should be performed for light and heavy vehicles and for various driving conditions.The application of computer tools to traffic assignment models has resulted in the creation of new methods that introduce a greater number of theoretical complications but are perfectly applicable nowadays.
Among the wide variety of methods for traffic assignment (or methods for calculating the traffic load on the network), we used the following, which are those that are used in practice in most cases [51]:

All-or-Nothing Assignment
This is the simplest of all assignment methods and results from the load of all journeys to the minimum cost path between the nodes.Costs are initially fixed for all segments in the network; i.e., the speed/flow ratio function is not considered.
While it is simple, it has significant drawbacks.It is an unstable method because small changes in the travel time of a segment can drastically change which routes are chosen to form the minimum cost path.This method ignores limits on the capacity of each segment and the number of times that they have been used in the assignment, so the result may result in little relation to the real flow in the network.In addition, it does not allow for variations between users.For these reasons, if in a study we include a new alternative route and use this methodology, we usually find that all trips head for the new path (the so-called "vacuuming effect"), when in reality there will be a distribution of travel through the segments of the network.
A number of modifications have been made to this "base" methodology, such as the one by Lee & Oduor [52], which includes parameters from a GIS network that improve the model's results.

Stochastic Assignment
Stochastic assignment distributes the trips of each O/D pair between different paths of the multiple alternatives that connect them.We can sort these methodologies into two kinds: those based on simulation and those based on proportions.
In simulation-based methodologies, which often use the Monte Carlo simulation [53] to represent the variability among users in terms of their perception of network costs, the following assumptions are often made:  For each segment, it is necessary to differentiate the objective costs, which are measured or estimated by an observer (modeler), from the subjective costs, which are perceived by each user.It is accepted that the target cost roughly matches the average of the subjective costs, which are randomly distributed according to a given probability function.By analyzing the literature on this topic, we can find different assumptions regarding the distribution of these subjective costs.For example, Burrel [53] adopts a www.videleaf.comuniform distribution, while other authors adopt a normal distribution. The distributions of costs perceived by users are independent. Users choose the route that minimizes their perceived travel cost; i.e., the sum of the costs of each used section.
Although these methods are simple and relatively quick to apply, they have certain drawbacks.For instance, the assumption that the perceived costs of each section are independent may not be true when users have a certain predisposition to the use of certain types of infrastructure (high-capacity highways, as an example).
In addition, the effect of congestion is not explicitly taken into account.However, these methods do allow us to distribute the trips in such a way that the "optimal" paths for the modeler turn out to be the ones with the highest amount of traffic; however, they do not convey the total number of trips.Also, it is not necessary to know the flow-speed functions of each section, which makes these methods easier to apply, although sometimes this can be a limitation.
In proportion-based methods, as their name suggests, the fraction of trips assigned to a particular path is equal to the probability of choosing it, which is calculated using a logit-type route choice model.The lower the generalized cost of one path, as compared with the others, the greater its chance of being chosen.However, with this methodology, the only alternative paths that have traffic are those that are considered to be "reasonable", that is, those that separate the user from the origin point and bring them to the destination point.The travel time for each segment in a stochastic assignment is a fixed set of input data and is not dependent on the volume that it supports.Therefore, these methods are not equilibrium methods, nor are they valid in situations of congestion.
According to the Single Path method [54], the trips between an i-j origin/destination pair along the r-path, T ijr , would result in: where T ij is the total number of trips from i to j, C ijk is the generalized cost from i to j using the k-path, Ω is a parameter (usually 1), and k is the number of reasonable paths from i to j. Ω can be used to control the distribution of trips between routes; thus, depending on their (always positive) value, cheaper routes tend to concentrate trips, or, on the contrary, distribute them more uniformly among the possible reasonable paths.

Assignment with Congestion
Regarding congestion-side assignment techniques, the methodologies that researchers most commonly use when requiring an adequate level of detail are equilibrium methods.These methods take into account the dependence between the flow of a section and its generalized travel cost, and calculate both them simultaneously so that they are consistent.Equilibrium algorithms require the iteration of flow assignments and the recalculation of travel times.Despite the additional computational load, equilibrium methods are almost always preferable to other assignment methods.
The main behavioral assumptions are that each traveler has all of the information about the attributes of network alternatives, all travelers choose routes that minimize their travel time or cost, and all travelers-of the same category-assign the same value to all of the network attributes.
According to the first equilibrium principle, which was proposed by Wardrop [55] and is called "user equilibrium" (UE), no individual traveler can reduce his/her travel time unilaterally by changing his/her path [56].One consequence of the UE principle is that all of the roads that an O/D pair uses have the same minimum cost, while unused roads have equal or higher costs.Unfortunately, this is not an absolutely realistic description of congested traffic networks, although it can be considered to be sufficiently approximate.The mathematical notation for these expressions is as follows (bearing in mind that, in addition, the conditions of continuity, conservation, and non-negativity of flows must be met): www.videleaf.com where c p is the unit cost in the equilibrium state, P w is the set of paths that connect the pair w, and h p is the flow of path p.
In addition, Wardrop [55] proposed a second alternative to assigning traffic to the network, which takes the name of -system equilibrium‖.According to this alternative, under balanced conditions, traffic should be distributed so that the total cost of operation of the network is the minimum possible cost.
Computationally, it is similar to the first principle, as expressed in Equation ( 2), but it replaces unit costs with marginal costs: where f a and h p are the flows of section a and path p, respectively, CM a and CM p are their respective marginal costs, and c a and c p are their respective unit costs.
In either case (the first principle is most commonly used, since it responds more to the actual "self-interested" behavior of users), we obtain a system of equations the solution of which would be the flows in sections or on paths.
Another type of model is that of speed adaptation.The simplest of these models, called direct adaptation, was based on all-ornothing methodologies.However, it differs from them in that, once it assigns traffic, it recalculates the travel times or costs of each section, and then reapplies the all-or-nothing method with the new values.This process can be iterated indefinitely.However, this approach has a major drawback, as the selected paths usually change at each iteration, and the method does not converge to a single solution.www.videleaf.comTherefore, in order to try to reduce these oscillations, a methodology was proposed that uses the average of the speeds of two or more all-or-nothing assignments in each iteration.This methodology, called speed-weighted adaptation, is not a real improvement, as it continues to assign all traffic to a single minimum path, which is also usually different in each iteration.
The incremental assignment methodologies are, again, based on the all-or-nothing concept.In this case, however, they introduce a significant improvement by making the results more realistic.
In the first step, the demand matrix is split into parts (not necessarily equal ones).The first portion is assigned using the all-or-nothing method and, once the flows are entered, the travel times of each section are recalculated.Then, with the new travel times, the second fraction of the demand matrix is allocated using the all-or-nothing method, and the travel times of each section are recalculated again.This continues until all of the fractions have been assigned.Commonly used values for the fractions are [51]: 0.4, 0.3, 0.2, and 0.1.The end result provides us with flows that are distributed between routes and not assigned in their total to the initially shorter one, which more closely approximates reality.
Theoretically, the larger the number of fractions considered, the more likely it is that trips will be distributed among a greater number of roads.However, even if the demand matrix is divided into very small fractions, this method may never converge to the solution when using the Wardrop balance, as the flow that is already assigned to a section cannot be suppressed or modified.
In any case, it is clear that its simplicity and ease of use and the possibility of interpreting its results as the accumulation of congestion during peak periods make it an interesting method.
Finally, the method of successive averages (MSA) is another variant of the all-or-nothing method.In each iteration, the flow of a section is calculated as a linear combination of the flows assigned in the previous iterations and the auxiliary flow resulting from an all-or-nothing assignment in that iteration.Thus, the resulting flow in each iteration is: www.videleaf.com where V a n is the flow of section a in iteration n, F a is the auxiliary flow in section a from an all-or-nothing assignment, and ϕ is a value between 0 and 1.The different variations of this algorithm lie in the value of the parameter ϕ.One approach is to assign it a fixed value of 0.5.However, one of the bestperforming techniques [57] assigns a value of ϕ = 1/n.In this situation, each auxiliary flow F a always has the same weight, and that is why it is called the Method of Successive Averages (MSA).In fact, it has been demonstrated [56] that, with this value, it is possible to obtain a convergent solution with Wardrop's equilibrium principles.
Frank & Wolfe's algorithm [58] aims to determine the optimal values of ϕ in order to ensure a rapid convergence of the MSA methods with the user equilibrium principle.The problem of searching for "the lowest point of the valley on a day of thick fog" is often used as an allegory to explain it [51]:  Since we do not know the optimal downward direction in global terms, we start by descending to where it looks optimal at that point. We continue descending until the slope starts to rise again. We stop at that point, look again for a new downward direction, and take it.We continue and repeat the previous step. We continue like this until there are no downward directions.By that time, we will have reached the bottom of the valley.
In our case, in each iteration of the MSA method, a feasible solution (equivalent to a position in the valley) is obtained.The all-or-nothing assignment is the one that marks the direction of descent.The Frank-Wolfe algorithm tends to converge quickly in the first few iterations; however, it will converge more slowly as it approaches the optimal value [59].

Case Study
In the Study Report of the Barcelona Orbital Highway B-40 (from Terrassa to Granollers (Link with AP-7/C60 roads)) it was necessary to carry out a traffic study in Barcelona, Spain to analyze the different considered alternatives.This traffic study was divided into two phases, each with a different scope:  Phase A. This phase included a comprehensive background study and the collection of numerous previous traffic, mobility, urban planning, and socio-economic data. Phase B. This phase contained the traffic study itself, including the complete modeling of the network, the network's simulation via a computer, and the final results and conclusions.
In November 2017, the Phase A study was conducted.In September 2018, Phase B was completely finished.We used the results obtained in this traffic study as the basis for a further assessment of the influence of the used type of model on the level of observed error.
In total terms, the road network of Catalonia (Spain) consists of 12,076 km of tracks, of which 1648 km are high capacity.The Regional Government is responsible for 50.4% of these tracks, the Local Government is responsible for 34.8% of them, and the Country's Government is responsible for 14.9% of them [60].
For the analysis of traffic in the area, all of the roads parallel and perpendicular to the new network have been considered, regardless of their ownership.We have collected data from the Ministry of Development of Spain [61] and the Government of Catalonia [62].In particular, we have collected information from 16 metering stations belonging to the Ministry of Development (three of them permanent ones) and 42 stations of the Government of Catalonia.

Zonification
Seventeen (17) municipalities are directly affected by the layout of the different considered corridors.All 17 belong to the province of Barcelona (Spain).However, as the study area obviously cannot be limited to these municipalities, we extended it to a much larger territorial area that includes the entire province of Barcelona, the neighboring provinces, and the rest of Spain.
Therefore, we define two distinct areas:

Determining the Basic Origin/Destination Matrix
The main source of data used in the project was anonymized mobile phone data from a network operator with an important market share of total users in Spain.To do so, information on the geolocated positions of mobile devices was gathered and recorded during March 2018.The study population consisted of residents of Spain over 16 years old.The resulting data from this process were: four Origin/Destination matrices for light vehicles (with different travel reasons) and one Origin/Destination matrix for heavy vehicles on an average working day.
After adding the different travel reasons, a global Origin/Destination matrix was obtained for an average working day.Finally, based on data collected from the Movilia survey [63], we note that the proportion of private vehicle travel on a working day is 40.6%, and that the average occupancy of vehicles is 1.22 people/vehicle on a working day.Taking into account all of these corrective elements, the final base matrices of light and heavy vehicles were obtained for the 38 considered zones.These tables were found to yield values similar to those obtained in the Movilia survey for the province of Barcelona, which gives us some confidence in the reliability of the process.
However, when applying these matrices to the network model, the traffic values recorded at the actual metering stations were not accurately reproduced.Therefore, we needed to make one last adjustment after developing the offer model (or network model).www.videleaf.com

Methodological Application
This section describes the particularities of the methodology that we used for the modeling in the case study.We differentiate between an offer model (a model of the road network on which the future highway is framed) and an assignment traffic model (consisting of a choice model with multiple paths according to the generalized cost).

Network Model
The offer model consists of a representative structure of the characteristics of the road network on which the travel matrices assignment was to be carried out.We included, as prescribed by Spanish regulations [64], all of the sections of roads belonging to corridors from which the new road could take a significant amount of traffic, all of the sections of roads that connected the new road with alternative corridors, all of the roads that crossed the new road, and all of the connections to the traffic generation/attraction centers in the immediate area.
In other words, all of the main types of road infrastructure included in the new road's area of influence were modeled.The scenario was complemented by the inclusion of other roads of the province of Barcelona that could potentially be in competition with the new infrastructure or may influence the results of the model.Also, sections where movement was free were distinguished from those where a toll was applied.This incorporated an additional cost into the latter sections depending on the travelled distance.
Figure 2 shows the explanatory graph of the modeled network and the connections of the centroids on it.www.videleaf.comIn total, the modeled network consists of 1232 sections and 519 intersections and has a total length of 1593 km (3251 km if we divide the roads by lane).Considering that the total length of the road network of the province of Barcelona was 3929 km during the same period, these data reflect the high degree of detail achieved in this study.

Traffic Assignment Model
The network model was implemented using the AIMSUN software.In this case, we used macro-simulation tools, as they have been adapted for studies in which the size of the study area is large.Each section of the network was delineated in this software from the basic information contained in the OpenStreetMap platform [65], updating if necessary the new or modified infrastructures with respect to it.The generalized cost of each section was determined using a function that considers the economic cost, the cost of spent time, and other variables, such as toll rates, if needed.The general expression of the unit generalized cost is: www.videleaf.com(6) where SVT is the subjective value of time.The average value of time was extracted from the Spanish regulations [66], which set a value of 23.03 euros/hour for the year 2013, which we updated in accordance with the official Euribor [67].
The model assigns both types of trips-light vehicle trips and heavy vehicle trips-together to the network by applying an equivalence factor, which in our study was initially equal to 3.0 as this value corresponds to undulating terrain in the Highway Capacity Manual [68].The network was simulated over various assignment models, and the results obtained with each of them were compared with the actual data from the existing metering stations.

Results
In this section, we compare the assignment models described in Section 2 in order to initiate a brief discussion on the quality of the different alternatives.Specifically, we performed experiments with five types of static traffic assignment (all-ornothing, stochastic assignment with a simulation-based method, the method of successive averages (MSA), incremental assignment, and user equilibrium applying the Frank & Wolfe algorithm).We compare the results obtained with each of these methodologies with the traffic data that were actually measured by the detectors in the network.We also qualitatively assess the ability of each methodology to estimate the traffic flow in the modeled network.To this end, we follow the guidelines established by Spanish legislation [64], which indicate that a valid model satisfies two conditions:  Regression analysis.A scatter plot should be developed containing the pairs of traffic volume values obtained in each section by the model (vertical axis) and by the actual observations in metering stations (horizontal axis).Above it, a regression line must be adjusted, where the slope value should be close to 1, the intercept value on the vertical axis should be close to 0 (compared with the analyzed traffic www.videleaf.comvolume ranges), and the coefficient of determination R 2 should be greater than 0.7.
 RMSE (root-mean-square error) index.The total set of observations should be divided into two groups: a ‗contrast' sample containing at least 10% of the values and a sample containing the rest of the values.For each group, the following indicator should be calculated, and the values in all cases should be less than 30%.
 where E i is the estimated flow by the model, O i is the observed flow in the metering station, and N is the number of observations.In our case study, the data were divided into a sample containing 29 observations (representing 28% of the total data) and a sample containing the remaining 73 observations.These proportions were used in all considered models.

All-or-Nothing Assignment
By comparing the data recorded by the metering stations with those estimated by the model, we obtained Figure 3.A regression analysis was performed between the observed and the estimated values.In Figure 3, the estimated values are shown on the vertical axis and the observed ones are shown on the horizontal axis.We can find an R 2 coefficient of 80.1% for light vehicles and an R 2 coefficient of 69.1% for heavy vehicles.Since the value for heavy vehicles is less than 0.7 [64], this methodology can be directly discarded in the current case.
In addition, the RMSE index was calculated, resulting in a value of 27.6% for the sample of 29 observations, and a value of 43.3% for the sample of 73 observations.These results are also inadmissible [59], so we again arrive at the above conclusion.www.videleaf.com

Stochastic Assignment with a Simulation-Based Method
For the application of this methodology, we tested seven different utility functions.All of these functions are dependent on the generalized cost of travel and were calculated according to Equation ( 6) with a 95% of level of confidence.These functions are shown in Table 1.where U ijr is the utility within path r for trips from i to j and c ij is the generalized cost of the trip from i to j.
After comparing the data recorded in the metering station with the estimated values from the model, the results shown in Table 2 were obtained by a regression analysis.Table 2 shows that, in general terms, this methodology does not allow us to obtain a www.videleaf.com Similarly to the previous cases, after comparing the data recorded by the metering stations with those estimated using the model, the results shown in Table 4 were obtained by a regression analysis.Table 4 shows that all of the considered schemes yield admissible R 2 coefficients [64].On the other hand, Table 5 shows the results of the calculation of the RMSE indices in each analyzed case.This time, we can see that only homogeneous partitions with four or more divisions yield values less than 30%, so they are the only permissible ones [64].Therefore, this methodology can be accepted, but only in the case of four or more homogeneous partitions.www.videleaf.com

Method of Successive Averages (MSA)
For this methodology, we tested five different configurations, with a maximum number of 5, 10, 20, 50, and 100 iterations, to determine the value needed to achieve the required level of detail [64].
Table 6 shows the results of the regression analysis.The results show that all of the considered schemes yield admissible R 2 coefficients [64].Table 7 shows the evolution of the RMSE indices based on the number of iterations.From Table 7, we can see that all configurations with 10 or more iterations yield permissible values [64].
Finally, Figure 4 shows a summary of the evolution of these indices based on the number of iterations.We can see there that the level of accuracy increases rapidly over the first several iterations, and then remains practically stagnant even though the number of iterations significantly increases.

User Equilibrium Using the Frank & Wolfe Algorithm
As in the previous case, the main issue to consider is the maximum number of iterations with which to limit the process.Again, we tested five different configurations with a maximum number of 5, 10, 20, 50, and 100 iterations.
After comparing the data recorded by the metering stations with those estimated using the model, the results shown in Table 8 were obtained by a regression analysis.Table 9 shows the evolution of the RMSE indices based on the number of iterations.Tables 8 and 9 show that all of the considered schemes yield admissible R 2 coefficients and that all configurations, even the one that only has five iterations, yield adequate RMSE values [64].
Finally, Figure 5 shows that the level of accuracy increases even more rapidly over the first several iterations than in the MSA algorithm, and then remains practically stagnant even though the number of iterations significantly increases.This finding is consistent with those of Arezki & Van Vliet [59].www.videleaf.com

Discussion and Conclusions
From our analysis, we can draw the following conclusions:  The all-or-nothing and stochastic assignment methods are inadmissible in this particular case study.In our study, we did not obtain the same outcome as Chen & Pan [69], who concluded that stochastic algorithms can produce similar results to UE and the MSA.This may be due to the size of the simulated network; the examples they used were small in size, while we analyzed a large area.Further research is needed to verify this point. The incremental assignment algorithms are valid only when they have four or more demand partitions of equal size; thus, no individual partition can exceed 25% of the total. The method of successive averages (MSA) algorithm is valid only when working with more than 10 iterations.This result is consistent with those of Ameli et al. [1], who found that this method is good for small-and mediumsized networks, but is not the fastest one when estimating the traffic flow in a large network. Finally, the user equilibrium methods (approximated by the Frank & Wolfe algorithm) were found to be valid in all of the considered cases (five or more iterations).Table 10 shows a summary of the results obtained using those methodologies that were found to be valid for the case study.From the results, we can conclude that the MSA and UE yield practically the same results when we have a sufficiently high number of iterations [59].However, for lower numbers of iterations, the results of the UE algorithm are clearly better.The incremental allocation method provides a significantly lower level of detail.
Therefore, we recommend the use of user equilibrium methodologies.To speed up the process, approximation with the Frank & Wolfe algorithm could be performed.This recommendation is in line with the results of Denoyelle et al. [70], who determined, by applying numerical simulation techniques, that this method is versatile and superior to other methodologies.
However, as Bliemer et al. [71] note, planners have to evaluate a methodology's applicability in terms of both its adjustment to real-world data and its computational flexibility and speed.In our case study, the Frank & Wolfe algorithm was found to satisfy all of these criteria.We can infer that a network's size is a www.videleaf.comcrucial variable to take into account when choosing a method [40,41,44,69,71].
To finish, we want to make a note about the limitations of this study and its scientific and practical implications.In this study, we used five different types of methods to estimate the traffic flow in a large network, and compared the results with realworld data.These five methods were the ones included in the employed software.It would be interesting to use other methodologies, or even a higher number of different configurations within the considered ones.In addition, we determined whether the models complied with the Spanish regulations [64], since their verification, through the R 2 and the RMSE indices, is a prerequisite of their use in our geographical area.However, it would have been interesting to use other goodness-of-fit indicators, or to even develop new ones, to enhance our knowledge of the adjustment of each methodology to real-world data.These are possible lines of future research in this field.
Regarding scientific and practical implications, we want to note that the results obtained in this study can be considered to be valid for both the case study and other areas with similar characteristics.However, although they cannot be directly extrapolated to different cases, it may be possible to develop a similar methodology that allows, under the same conditions, for the selection of the optimal assignment method in each case.The results of this study and the employed procedure may be useful to researchers in the field of computer simulation and planners who apply these methodologies in similar cases.

Figure 1 :
Figure 1: Geographical distribution of zones.(a) The whole of Spain; (b) Catalonia in detail.

Figure 2 :
Figure 2: The base road network.

Figure 4 :
Figure 4: MSA assignment: comparison by number of iterations.

Figure 5 :
Figure 5: Equilibrium assignment: comparison by number of iterations.

Table 10 :
Comparison of results obtained with valid methodologies.