Neural Networks Applied to Microsimulation: A Prediction Model for Pedestrian Crossing Time

: Walking is the original form of transportation, and pedestrians have always made up a signiﬁcant share of transportation system users. In contrast to motorized tra ﬃ c, which has to move on precisely deﬁned lanes and follow strict rules, pedestrian tra ﬃ c is not heavily regulated. Moreover, pedestrians have speciﬁc characteristics—in terms of size and protection—which make them much more vulnerable than drivers. In addition, the di ﬀ erence in speed between pedestrians and motorized vehicles increases their vulnerability. All these characteristics, together with the large number of pedestrians on the road, lead to many safety problems that professionals have to deal with. One way to tackle them is to model pedestrian behavior using microsimulation tools. Of course, modeling also raises questions of reliability, and this is also the focus of this paper. The aim of the present research is to contribute to improving the reliability of microsimulation models for pedestrians by testing the possibility of applying neural networks in the model calibration process. Pedestrian behavior is culturally conditioned and the adaptation of the model to local speciﬁcs in the calibration process is a prerequisite for realistic modeling results. A neural network is formulated, trained and validated in order to link not-directly measurable model parameters to pedestrian crossing time, which is given as output by the microsimulation tool. The crossing time of pedestrians passing the road on a roundabout entry leg has been both simulated and calculated by the network, and the results were compared. A correlation of 94% was achieved after both training and validation steps. Finally, tests were performed to identify the main parameters that inﬂuence the estimated crossing time.


Introduction
In a world where urbanization is increasing-statistics predict a 58% growth in the world's urban population by 2025 [1] and where all communities are tending towards more sustainable and environmentally friendly transport policies, it is understandable that pedestrians play a central role in the urban environment. Many transportation problems could be minimized and in some cases completely solved by proper planning of the urban environment: The development of walkable spaces could reduce the problems of congestion in city centers and pollution could decrease if people prefer to walk rather than using other means of transport. Of course, walking should be encouraged, and this is closely linked to the level of safety guaranteed by the available infrastructures.
A well-known and sophisticated tool for planning and design of transportation infrastructures is microsimulation. This technique is already widely used in transportation engineering and has many advantages; in particular it provides the possibility of studying various infrastructural options without actually realizing them, in order to better meet the needs of the considered road users, may they be Levels Of Service (LOS), emergency needs or safety-related issues. Certainly, to be useful, it is necessary to verify the accuracy and reliability of the developed models and the obtained results have to be checked: calibration and validation steps are necessary to be run, processes that request a long elaboration period and several efforts [2,3].
Many methods and techniques have been applied to calibrate traffic microsimulation models: trial-and-error procedures [4,5], statistical methods [6][7][8] and evolutionary algorithms [9][10][11]. One of the most commonly used methods for the calibration of traffic microsimulation models is the Genetic Algorithm (GA), a stochastic based search method, which is based on the concept of biological evolution [11][12][13].
This paper focuses on improving the reliability of a microsimulation tool for modeling pedestrian behavior by applying Neural Networks (NN).
Previous research developed by the research team on the investigation and modeling of motorized traffic [14][15][16] has shown that neural networks provide a good response, when applied to vehicular traffic issues. These studies [14][15][16] demonstrated that the formulated neural network, despite giving a good prediction of travel time, did not have similarly successful responses for vehicle queue parameters. Once calibrated via neural network applications, the microsimulation model provided vehicle queue parameter values that differed by less than 5% from the measured ones.
The aim of this paper is to investigate the possibility of applying neural networks in the calibration of a microsimulation model of pedestrian movements. Pedestrian behavior has its own specifics and is influenced by the cultural milieu perhaps more than vehicular movement [1,[17][18][19]. Also, the pedestrian population includes all vulnerable groups, from children at the beginning of their independent movement, to elderly pedestrians with mobility difficulties as well as pedestrians with various disabilities (motor disabilities, low vision, etc.) [18]. Given the greater dispersion of reaction times and velocities of the pedestrian population in comparison to the driving population, the question of the possibility of predicting the crossing time in the conflict zone becomes critical for model calibration [19][20][21]. Also, although walking is part of the transport system, having its own particular behavioral rules and characteristics, it cannot be simulated by the behavioral model normally used for cars. The aim of the present work is to obtain a reliable model to predict one aspect of the simulated pedestrian behavior. The developed procedure is the first contribution to the development of a calibration methodology, whose final goal is to fine-tune non-directly measurable modeling parameters on the basis of a model output that is characterized by its easy measurability. For this purpose-based on previous experience-neural networks were selected as a possible applicative tool. The selected neural network will be trained, validated and evaluated in order to create a predictive model of pedestrian simulated behavior at a specific location type. Crossing times of pedestrians walking on a zebra crossing at a roundabout entry leg will be both simulated and calculated by the network and results compared. Tests are also developed to underline which input parameters have the greatest influence on the selected output. Herein, a limitation of the research presented in this paper should be pointed out: since it deals with the formulation of the model to predict the micro-simulated behavior of pedestrians, representing the first important step towards the development of the whole calibration methodology, the calibration of the initial model parameters cannot be worked out. Of course, a check of the validity of the developed microsimulation model was carried out, in order to understand if it reproduced a realistic behavior. Nevertheless, a precise calibration of the model needs its parameters to be fixed. To train the neural network, it is necessary to create a huge number of combinations of the model's parameters-which are each time different, in order to make the neural network learn the relationship between these parameters and the selected output. Only at the end of the whole calibration process is it possible to apply the selected, trained neural network to obtain the combination of input parameters which, put in the microsimulation model, gives the best output, i.e., the crossing time which is the most similar to the real, measured crossing time.
The aim of the paper is to highlight the effective possibility of applying neural networks in the process of calibration, by showing that the selected, trained network is able to reproduce the model's outcomes. This step is central, because if the neural network can reproduce the microsimulation output, Sustainability 2020, 12, 5355 3 of 22 then it can be used to calibrate the model. The calibrated model will be the outcome of the whole, further research.
The paper is structured as follows: the next section explains the 5-step-methodology used in the research: the location and period for video footage recording are described; the model set up, the peculiarities of both the zone and the behavior of traffic participants and the way they were tackled in the model are presented; after this, an overview of the equation and parameters governing the micro-simulation model is given and the choice and preparation of parameters is illustrated. Finally, the formulation of the neural network architecture is characterized. Section 4 presents the results provided by the neural network, describing the measures of performance used for the evaluation, the results of the training step and the ones achieved after the validation of the prediction model. In the end, Section 5 deals with conclusions and future research.

Related Works
In this section, a literature review about the main applications of neural networks to pedestrian questions will be provided. In addition, a brief overview of the main tools applied to microsimulation model calibration is worked out.

Use of Neural Networks in Pedestrian Research
In the pedestrian study area, neural network practice is mainly limited to the progression of detection approaches.
In [22], authors deal with pedestrian detection and classification. They use various methods for feature extraction and classification, among which there are also neural networks. NN are used in both steps: firstly, as one of the methods to be compared, in the second one to develop the training of the recalled techniques. As feature extraction method, they use a feed-forward neural network with local receptive fields. This seemed to achieve the best results among all the techniques involved.
The authors of [23] develop a pedestrian counting technique, starting from the data obtained by a single camera. The peculiarities of the system are the use of feature histograms and the feature normalization. It is interesting to notice that the authors try two kinds of supervised training: one using linear fitting and the other utilizing neural networks and show that neural networks have better performance compared to linear fitting.
In [24], the studiers develop a pedestrian detection method based on convolutional neural networks, seeming to outperform traditional detection techniques. Specifically, they applied two state-of-the-art neural networks-AlexNet and GoogLeNet-to the problem of pedestrian detection and evaluate their results following the Caltech Pedestrian Detection evaluation protocol.
The authors of [25] developed a Deep Learning Algorithm to be applied to Advanced Driver Assistance Systems for the observation and detection of pedestrians, by fusing SqueezeNet and the YOLO detection network. Their findings show both high accuracy and robustness, and they propose the possibility of implementing the algorithm both on mobile phones and on embedded systems. Also, the authors suggest reducing the number of parameters of the neural network in order to obtain a faster algorithm with a still acceptable level of accuracy. Also, [26] deals with pedestrian detection. In this research, the authors address a Fused Deep Neural Network (F-DNN) to achieve faster and more accurate detection. By testing the proposed algorithm on different scenarios, the authors highlights its better performance in terms of both robustness and speed in comparison to the other state-of-the-art models. The work [27] formulates a Support Vector Machine (SVM) model to use the micro Doppler features of radar signals to distinguish objects from people around vehicles and, in this way, to detect near crash situations involving pedestrians. The aim of the article is to improve-by applying SVM-both the classification performance and speed, and it reached a 0.12 m/s velocity resolution. Among all the evaluated approaches, the performance was obtained by a polynomial kernel approach, which obtained a 99.5% accuracy rate and a processing time of 0.073 s. The authors of [28] deal with Deformable Part Models (in the following DPMs) and convolutional neural networks (CNN).
In particular, the authors developed a new model, describing DPMs as a composition of 2 CNNs. The model they achieved is called Deep Pyramid DPM and it is constituted by two neural networks. As the input, it takes an image pyramid, as the output it performs a pyramid of object detection scores.
Also, face detection has been worked out thanks to neural networks. Specifically, in [29], a hybrid neural network method for real-time detection of faces made up of local image sampling, SOM neural network and a CNN is developed and applied to a database of 400 images of 40 individuals characterized by different poses, facial details and expressions. In [30], a system to detect not only walking pedestrians, but also standing people, is carried out thanks to the use of neural networks. In [31], the authors introduce a deep neural network with a star topology. The goal of the network is to predict pedestrian trajectories. To reach it, a hub network is formulated to produce a comprehensive representation of the crowd, while host networks predict the future trajectories of each pedestrian. The proposed architecture was evaluated on two datasets where different interactions happen: specifically, walking side by side, collision avoidance and direction changing are among the considered behaviors. Results were compared in terms of Average Displacements Error and Final Displacement Error with three state-of-the-art models: Social LSTM, Social GAN and Social Attention. Regarding the accuracy rate, the StarNet architecture outperformed all the other models, while in terms of computational time cost, it was second only to the LSTM model.
Finally, in [31] a back propagation-Ward net-is formulated to predict children' reaction time at signalized intersections. The structure of the network is characterized by five layers: one input layer, three hidden layers and one output slab. In the input layer there are seven inputs-age, gender, children with special needs, movement in group, adult supervision, mobile phone texting activities and mobile phone listening-while each hidden slab has five neurons, and the output layer contains one output neuron.
The results show a 94.56% of correlation and a mean absolute error of the prediction equal to 0.348 s.

Algorithms Applied to Calibration of Pedestrian Microsimulation Models
In the field of pedestrian microsimulation, an important point is model assessment. To reach it, various calibration methodologies were developed and can be found in the literature. Mainly, three main tools are addressed: maximum likelihood estimation (MLE), genetic algorithm (GA) and artificial neural networks (ANN).
In [32], a MLE methodology was applied to the calibration of the microsimulation tool NOMAD for evacuation conditions. Using as inputs trajectory data divided for three age classes-children, adults and the elderly-the authors focus on the tuning of four elements, which are the acceleration time, free speed, an interaction term identified as acceleration and the interaction distance.
The authors of [12] implement a genetic algorithm to estimate the values of parameters A and B, i.e., the strength and range of interaction between agent-agent and agent-obstacle, of a social force inspired model, applied to pedestrian flow at a metro entrance. Though many results are reported in this research, direct metrics-like the RMSE-are not summarized, meaning an assessment in relation to other works is difficult.
An interesting comparison among different calibration methodologies is worked out in [33]. The researchers apply the MLE, Nelder-Mead optimization algorithm and Genetic Algorithm on common simulated data to calibrate parameters A and B-strength and range of interaction among pedestrians-to be able to test and compare them. As result, they determined that the Genetic Algorithm has to be preferred over the other techniques, while MLE is not recommended.
Artificial Neural Networks (ANN) are introduced in [34] for pedestrian microscopic model calibration. In this paper, ANNs are compared to multiple linear regression (MLR) for the prediction of pedestrian gap acceptance. The comparison made considering RMSE, MAPE, correlation coefficient and R 2 shows that ANN performs much better than MLR for RMSE and MAPE, and is comparable to regarding the correlation coefficient and R 2 . Briefly enlarging the review to transportation research, the same three main methods are highlighted [6, 15,35]. Specifically, regarding neural networks, [14][15][16] found that neural networks are able to reliably predict time variables, such as travel time and crossing time, and a procedure was developed to calibrate traffic microsimulation models. Also in [35], where a review of the applications of neural networks in transportation research is worked out, it is underlined that neural networks are more effective in estimating travel times than other methods. In [36], when comparing and contrasting statistics and neural networks, authors emphasize various advantages of applying neural networks: they underline the efficiency of this tool in terms of accuracy and development time, their flexibility as well as the opportunity of applying few restrictions in comparison to statistics, where many hypotheses and assumptions are to be taken. All these observations, as well as the applications of neural networks to other parameter estimation issues, put the basis for the use of this tool in the present research. Moreover, the effectiveness of neural networks in predicting time variables, which emerged from previous authors' research [14][15][16], and the convenience of use when input parameters that need to be tuned are not directly measurable [14][15][16], justify their choice in a noteworthy way.

Method
The general method used to formulate and evaluate the neural network consists of 5 main steps ( Figure 1).
First, real data were obtained to adapt the model both geometrically and in terms of flow to the real chosen situation and to compare-in further evaluations-the simulated outputs with the real values of the chosen parameter. With this information, the microsimulation model was created and visually tested to avoid unrealistic behavior.
The second step involved the selection of suitable parameters, which mainly influence pedestrian behavior, but also consider the interaction with motorized road users and are to be used as inputs for the neural network. Then the most suitable microsimulation output was selected, which should also be used as an output of the neural network. After these selections, a database with random input combinations was generated. Each of these combinations was linked by simulation to a corresponding output value. The third step consisted in the formulation of the neural network and its training. The results of this step were evaluated in relation to the criterion of learning outcomes.
Finally, a new data set was used to evaluate the network, taking into account the criterion of generalization. In the next paragraphs, the steps are described in detail.
Sustainability 2020, 12, x FOR PEER REVIEW 5 of 23 able to reliably predict time variables, such as travel time and crossing time, and a procedure was developed to calibrate traffic microsimulation models. Also in [35], where a review of the applications of neural networks in transportation research is worked out, it is underlined that neural networks are more effective in estimating travel times than other methods. In [36], when comparing and contrasting statistics and neural networks, authors emphasize various advantages of applying neural networks: they underline the efficiency of this tool in terms of accuracy and development time, their flexibility as well as the opportunity of applying few restrictions in comparison to statistics, where many hypotheses and assumptions are to be taken. All these observations, as well as the applications of neural networks to other parameter estimation issues, put the basis for the use of this tool in the present research. Moreover, the effectiveness of neural networks in predicting time variables, which emerged from previous authors' research [14][15][16], and the convenience of use when input parameters that need to be tuned are not directly measurable [14][15][16], justify their choice in a noteworthy way.

Method
The general method used to formulate and evaluate the neural network consists of 5 main steps ( Figure 1).
First, real data were obtained to adapt the model both geometrically and in terms of flow to the real chosen situation and to compare-in further evaluations-the simulated outputs with the real values of the chosen parameter. With this information, the microsimulation model was created and visually tested to avoid unrealistic behavior.
The second step involved the selection of suitable parameters, which mainly influence pedestrian behavior, but also consider the interaction with motorized road users and are to be used as inputs for the neural network. Then the most suitable microsimulation output was selected, which should also be used as an output of the neural network. After these selections, a database with random input combinations was generated. Each of these combinations was linked by simulation to a corresponding output value. The third step consisted in the formulation of the neural network and its training. The results of this step were evaluated in relation to the criterion of learning outcomes. Finally, a new data set was used to evaluate the network, taking into account the criterion of generalization. In the next paragraphs, the steps are described in detail.

Data Gathering
As a starting point for the model setting up and the development of the neural network, a real situation had to be selected.
It was decided to record an unsignalized pedestrian crossing, which is located on the entry leg of a roundabout: the considered infrastructure is located in Monfalcone (GO), a city in the Italian region Friuli-Venezia-Giulia. The roundabout is located in the urban area of the municipality and connects the city center with the main neighboring towns. The crossing passes two unidirectional lanes of the same roundabout entry leg and connects pubs and cafés with offices, schools and shops. Both pedestrian movements and conflicting vehicle traffic were recorded for a whole week, during the two peak hours of the city, i.e., from 7.30 to 9.30 a.m., obtaining more than 1000 recorded pedestrians.
The recordings were analyzed using a semi-automatic software that allows selected detections to be automatically saved and the users involved to be tracked manually. This step made it possible to obtain a huge amount of data on both motorized and non-motorized users. Due to the research objective, the selected real-world data, which were then used in the model, were pedestrian speed distribution, pedestrian flow and its split rate, vehicular flow and speed (Table 1). In Table 1, the numbers reported in the second column, which refer to pedestrian flows, represent the start and end zones of the pedestrian trip, the ones referring to vehicular flows indicate the link on which vehicles travel (see Figure 2).
Geometrical data have also been collected and they have been gathered directly from official map information, provided by the municipality.

Micro-simulation Model Set Up
The model, on which neural networks were trained, was developed with the commercial microsimulation software Vissim. The simulated geometry and the allowed manoeuvers are exactly the same as the real ones ( Figure 2; Table 2).
In order to obtain a realistic simulation, vehicular traffic was regulated according to the Wiedemann 74 behavior, which is the suggested behavior for urban traffic modeling [37], while pedestrian traffic was regulated according to the social force model, which was implemented thanks to the Viswalk add-on.
Four main problems were encountered in the reproduction of the site: The first concerned the network geometry and in particular the need to model the entire roundabout or just the recalled entry leg. The decision was made to model the entire roundabout and its oncoming and outgoing legs, as the facility is very crowded and each incoming and outgoing flow is affected by the others. The second question was related to the path of modeling pedestrian movements. In fact, simulated pedestrians can move on a "pedestrian area" or on a "link used as pedestrian area". After heuristically implementing both possibilities and critically observing the reproduced behavior, it was decided Sustainability 2020, 12, 5355 7 of 22 to use the "link used as pedestrian area", since it visually provided the response best fitting real observations. Finally, the third question concerned vehicular behavior: Italian drivers do not usually stop at pedestrian crossings unless they consider it necessary. This led to the need to realistically reproduce this type of yielding behavior. To solve the problem, priority rules and speed reduction areas were jointly used. The first ones let the pedestrian be yielded by the oncoming vehicles, while there were 4 "speed reduction areas" placed, 2 in proximity of the crossing and 2 just before the stop line to enter the roundabout circle, to allow vehicles to reduce their speed, thus well simulating the real observed outcomes.  In order to obtain a realistic simulation, vehicular traffic was regulated according to the Wiedemann 74 behavior, which is the suggested behavior for urban traffic modeling [37], while pedestrian traffic was regulated according to the social force model, which was implemented thanks to the Viswalk add-on.
Four main problems were encountered in the reproduction of the site: The first concerned the network geometry and in particular the need to model the entire roundabout or just the recalled entry leg. The decision was made to model the entire roundabout and its oncoming and outgoing legs, as the facility is very crowded and each incoming and outgoing flow is affected by the others. The second question was related to the path of modeling pedestrian movements. In fact, simulated pedestrians can move on a "pedestrian area" or on a "link used as pedestrian area". After heuristically implementing both possibilities and critically observing the reproduced behavior, it was decided to use the "link used as pedestrian area", since it visually provided the response best fitting real observations. Finally, the third question concerned vehicular behavior: Italian drivers do not usually  A fourth complication was set by pedestrian generation: in the selected scenario, pedestrians can arrive from 5 different directions-3 on the left and 2 on the right side-and they can reach 5 different destinations. To solve this problem, a pedestrian area and the corresponding pedestrian input for each of the listed directions were created.
Summing up, the network geometry consists of 49 links: 16 vehicular links defining the oncoming and outgoing roads and the roundabout circle lanes, 12 links used as pedestrian areas and 5 pedestrian areas and 12 connectors.

Parameter Choice and Preparation
To successfully formulate and train a neural network, the parameter choice, on which the selected network will work, is of primary importance. Since the aim of the research is to assess pedestrian crossing behavior in an environment where the interaction with vehicular flow is not uninfluential, it was decided to work both on pedestrian and vehicular behavioral parameters. Specifically, a total of 8 parameters were selected, 5 of which relate to pedestrians and 3 of them linked to vehicular traffic. Pedestrian movement was modeled thanks to the Social Force Model (SFM) [38]. As stated in [39], on the one hand, this model is easy to understand, but on the other hand, it is difficult to interpret and measure the parameters from which it is composed, which strongly influence diverse behavioral aspects.
The basic concept of the SFM is that each pedestrian, defined by its desired speed and target time, moves towards its destination ruled by the so-called social forces [38]. These effects are: the attractive forces that lead each pedestrian to its destination F 0 α ; the repulsive forces among pedestrians and among the individual and obstacles and the attractive forces due to other pedestrians/objects [38]. In Vissim/Viswalk, the Social Force Model expression (Equation (1)) is adapted and some parameters governing the equations, emerge: In detail, these are the following: ω(λ) is a parameter depending on lambda calculated for each physical force opposing the main force F [37].
d is a parameter dependent on a further formulation, which indicates the distance between two pedestrians [37].
n is the vector pointing the influenced pedestrian to the influencing one [37]. Tau (τ): relaxation time as expressed in Helbing's original model. It relates the difference between the desired speed and direction to the current speed and direction. Though it is not directly present in Equation (1), it is used to express the force leading the pedestrians to its destination [37].
Lambda_mean (λ): amount of anisotropy. It regulates the effect of phenomena that take place in the back of the considered pedestrian [37].
A_soc_isotropic and B_soc_isotropic: non-measurable parameters that control the two forces among pedestrians [37].
A_soc_mean and B_soc_mean: respectively define the strength and typical range of the social force between pedestrians [37].
Also, other parameters influence pedestrian movement in the Vissim/Viswalk simulation tool, and should be considered. These are: Noise: introduces the random forces, which are systematically added to the calculated forces [37]. React_to_n: the number of pedestrians considered for the calculation of the forces [37]. Side_preference: defines whether opposing pedestrians prefer using the right or left hand side when passing each other [37].
Queue_order and Queue_straightness: specify the properties of the queue [37].
In the calibration attempt developed in [40], the authors give an insight into which modifications are brought about by the change of some of the recalled parameters (Table 3). In Table 3, additional parameters to the ones reported in Equation (1) are also considered. They are: the radius of pedestrians, which indicates the size of the bidimensional ellipse containing the shape of a pedestrian [40]; B physical, a border that is a theoretical parameter controlling the force between a pedestrian and the border of a location [40]; friction force, a component of the main force F, together with attractive and repulsive forces; VD is a theoretical parameter of Vissim namely "velocity dependence", which is expressed in terms of seconds and which involves various other parameters, nevertheless it is not exactly defined Sustainability 2020, 12, 5355 9 of 22 in [37]; velocity is the speed of pedestrians; longitudinal scale is a dimensional parameter of the model; maximum number of pedestrians represents the maximum number of interacting agents considered in the simulation. Table 3. Vissim (i.e., the applied micro-simulation software) parameters and their modifications as proposed by Kretz [41].
Since in the considered situation the interaction with vehicular flow is strict, vehicular parameters also have to be considered. In particular, the model governing vehicular behavior is Wiedemann 74, and the selected parameters to be used are the three that most affect the car-following model, i.e., average standstill distance, defining the mean desired distance between two cars, the additive part of safety distance and multiplicative part of safety distance, which are values used for the calculation of the desired safety distance [37].
Geometric characteristics and traffic load are also very important parameters, but they are entered as inputs to the model for each location separately. In Table 4, vehicular parameters and their ranges are also summarized. The selection of a feasible output is also important. For this study, pedestrian crossing time has been chosen as assessment output for the neural network, because of its ease to be measured both from footage (semi-automatically as well as manually) and microsimulation results.
After parameter choices, these have to be stored in a suitable way to be read as input values by the neural network. The requirements needed in order to obtain a good training of the network are the amount of data, their randomness and a good quantity of repetitions of each combination.
For this study, a database of 100 random combinations of input-output values was worked out in Excel. Each combination is made up of a random selected value for each one of the 8 input parameters with a step of 0,1 and the resulting crossing time obtained by implementing the chosen input combination in the Vissim model. Also, parameters were selected in such a way that their ranges are comprised in the same magnitude unit, in order to avoid great range differences. To have a sufficient number of repetitions, ten simulations have been run for each random combination with an initial seed random value of 42 and an increment of 10 and a simulation of 1 timestep/second. In this way, we ensured that the same ten possible traffic scenarios were analyzed for each combination of input parameters. The number of simulations has been chosen as a compromise to the pseudo-stochastic working method of the model: as a matter of fact, it simulates a phenomenon which has a stochastic nature-traffic flow-but at the same time the experiment has to be repeatable, which means that the same "random seed" parameter value (random number generator) gives the same output.
Here the mean output value of ten simulations was used as the final output of the simulation for each random combination of selected input parameters.
Following the most used protocol [42] of the created dataset, 80% has been used for the training of the network, while the remaining 20% has been utilized as a test bed for the first evaluations (test and validation).
An additional database corresponding to the 20% of the initial one was generated after network training and testing. This completely independent database, which did not participate in either the training set or the test set in the neural network learning process, served to independently confirm the ability to generalize the selected neural network. Indeed, the goal of training a neural network is to achieve a good generalization, being able to be applied to much larger databases than the one on which it was trained. In the specific case, the neural network trained on a database of 100 combinations, will be applied to 3-4 times larger databases, which compare crossing times measured in real traffic conditions and crossing times as obtained by microsimulations using the neural network prediction function implemented in the calibration process for different values of input parameters.

Comparison between Measured and Simulated Pedestrian Crossing Time
By using default parameters in the microsimulation model, an average crossing time of 12.11 s has been obtained. This value appears to be higher than the crossing time measured in situ, whose mean equals 8.27 s. Table 5 summarizes the descriptive statistics for the crossing time. The simulated crossing time is 3.84 s higher (46.4%) than the measured one, justifying the need for a calibration process to be worked out.
In the following, the formulation of a neural network is presented, which-through its prediction function-could be used in a calibration program procedure. The aim of this paper is to show the applicability of the neural network to the prediction of the microsimulation output, giving the opportunity to effectively use it in the calibration process under development. The goal of the neural network, therefore, is not to outperform the Vissim/Viswalk microsimulation software; rather, it is to learn the relationship between the selected microsimulation input parameters and the simulated output, the crossing time.

Neural Network Formulation
Previous works [14,15] about the calibration of vehicular traffic models have considered the application of various kinds of neural networks and their performance, examining 176 neural network configurations. On the basis of these considerations and of previous experiences, for the present issue, a ward network was implemented in the Neuroshell2 program.
Ward nets, better known as feedforward networks, are neural networks structured in such a way that the neurons of a layer have the outputs of the neurons in the precedent slab as input signals [43]. A feedforward network, in which each neuron of one layer is linked to the ones of the adjacent layer, is called fully connected [43]. An example of a fully connected feedforward network can be seen in Figure 3.

Confidence level (95,0%)
1.1 0.26 The simulated crossing time is 3.84s higher (46,4%) than the measured one, justifying the need for a calibration process to be worked out.
In the following, the formulation of a neural network is presented, which-through its prediction function-could be used in a calibration program procedure. The aim of this paper is to show the applicability of the neural network to the prediction of the microsimulation output, giving the opportunity to effectively use it in the calibration process under development. The goal of the neural network, therefore, is not to outperform the Vissim/Viswalk microsimulation software; rather, it is to learn the relationship between the selected microsimulation input parameters and the simulated output, the crossing time.

Neural Network Formulation
Previous works [14,15] about the calibration of vehicular traffic models have considered the application of various kinds of neural networks and their performance, examining 176 neural network configurations. On the basis of these considerations and of previous experiences, for the present issue, a ward network was implemented in the Neuroshell2 program.
Ward nets, better known as feedforward networks, are neural networks structured in such a way that the neurons of a layer have the outputs of the neurons in the precedent slab as input signals [43]. A feedforward network, in which each neuron of one layer is linked to the ones of the adjacent layer, is called fully connected [43]. An example of a fully connected feedforward network can be seen in Figure 3. The adopted neural network is made up of 5 layers: the input layer has 8 neurons, corresponding to the 8 selected input parameters, the output slab has one neuron, i.e., the selected output parameter-pedestrian crossing time, while layers 2,3 and 4 are hidden layers and each one is composed of 4 hidden neurons.
The architecture of the neural network has been chosen following some of the rules found in literature [43][44][45] and the knowledge acquired by previous experiences [16]. The number of input and The adopted neural network is made up of 5 layers: the input layer has 8 neurons, corresponding to the 8 selected input parameters, the output slab has one neuron, i.e., the selected output parameter-pedestrian crossing time, while layers 2, 3 and 4 are hidden layers and each one is composed of 4 hidden neurons.
The architecture of the neural network has been chosen following some of the rules found in literature [43][44][45] and the knowledge acquired by previous experiences [16]. The number of input and output neurons is related to the number of initial parameters to be used and desired outputs to be evaluated. The selection of the number of hidden neurons is more complicated and has important consequences for the results of the network: indeed, as reported in [46], too many hidden neurons can cause over-fitting issues and degrade the generalization ability of the neural network. Much research exists on methods to select the best number of hidden neurons [44,46]: following [47]'s indications, a correct number of hidden neurons can be suggested by applying the formula (Equation (2)) n in + n out − 1 2 where n in represents the number of input neurons and n out is the number of output neurons. Also, a general rule of thumb is to select a number of hidden neurons belonging to the interval between the number of inputs and outputs.
Considering these two rules, 4 neurons have been chosen for the hidden layers of the network. Regarding the number of hidden layers, initially it was selected according to the literature [45][46][47] and empirically, on the basis of previous experiences [15,16], then 10 different configurations were tested and the one giving the best results was chosen. As with any optimization procedure, the question arises whether the selected network architecture is a local or global optimum. Given the learning outcomes and their practical application, this question was not focus of our further research.
The main feature of the selected network stands in the activation functions. As a matter of fact, the input layer is connected to the hidden ones by the same linear function, but each hidden layer is linked to the output slab through a different activation function in Table 6. In Figure 4, the neural network architecture is outlined.
Sustainability 2020, 12, x FOR PEER REVIEW 12 of 23 output neurons is related to the number of initial parameters to be used and desired outputs to be evaluated. The selection of the number of hidden neurons is more complicated and has important consequences for the results of the network: indeed, as reported in [46], too many hidden neurons can cause over-fitting issues and degrade the generalization ability of the neural network. Much research exists on methods to select the best number of hidden neurons [44,46]: following [47]'s indications, a correct number of hidden neurons can be suggested by applying the formula (Eq.2) n + n − 1 2 where represents the number of input neurons and is the number of output neurons. Also, a general rule of thumb is to select a number of hidden neurons belonging to the interval between the number of inputs and outputs.
Considering these two rules, 4 neurons have been chosen for the hidden layers of the network. Regarding the number of hidden layers, initially it was selected according to the literature [45][46][47] and empirically, on the basis of previous experiences [15,16], then 10 different configurations were tested and the one giving the best results was chosen. As with any optimization procedure, the question arises whether the selected network architecture is a local or global optimum. Given the learning outcomes and their practical application, this question was not focus of our further research.
The main feature of the selected network stands in the activation functions. As a matter of fact, the input layer is connected to the hidden ones by the same linear function, but each hidden layer is linked to the output slab through a different activation function in Table 6.

Commented [m3]: Please change and c
We wonder the "Learning rate 0,1" does "0.1", if it is "0.1", please change "0,1" need to do with below two "Momentum Weights: 0,3" As the commas are not allowed at here,

Results and Discussion
In this section, the results of the formulated neural network are presented and evaluated. The first paragraph reports the ward network's results, the related performance measures and the analysis of input parameters; the second paragraph sums up the results of the validation of the neural network.

Neural Network Results and Analysis of Input Parameters
In the following, the best result provided on a test data set is reported, on which the ability to generalize the network was initially evaluated. The selected neural network showed a very good correlation with the initial data; as can be inferred from the statistics reported in Table 7, a correlation coefficient of 97% was obtained. Also, 38% of the data have a Mean Absolute Error (MAE) lower than 5% and 21% have a MAE between 5% and 10%, for a total amount of 59% of data with a MAE lower than 10%, which is an acceptable threshold for the first training, test and validation of the network. Also, the comparison between the microsimulation outcomes and the neural network calculation show an overall agreement. Figure 5 outlines the consistency between the two kinds of data on the complete database of 100 combinations. In Figure 5, the outcomes of neural network prediction are divided into "acceptable to call-prediction error less than 10%(WARD NET)" and "unacceptable to call-predicted error greater than 10%(WARD NET)", identifying in this way the data which give a reliable approximation of simulated crossing time and the data with an error larger than 10% from the simulated outcomes.

Neural Network Results and Analysis of Input Parameters
In the following, the best result provided on a test data set is reported, on which the ability to generalize the network was initially evaluated. The selected neural network showed a very good correlation with the initial data; as can be inferred from the statistics reported in Table 7, a correlation coefficient of 97% was obtained. Also, 38% of the data have a Mean Absolute Error (MAE) lower than 5% and 21% have a MAE between 5% and 10%, for a total amount of 59% of data with a MAE lower than 10%, which is an acceptable threshold for the first training, test and validation of the network. Percentage over 30% 7.000 Also, the comparison between the microsimulation outcomes and the neural network calculation show an overall agreement. Figure 5 outlines the consistency between the two kinds of data on the complete database of 100 combinations. In Figure 5, the outcomes of neural network prediction are divided into "acceptable to call -prediction error less than 10%(WARD NET)" and "unacceptable to call -predicted error greater than 10%(WARD NET)", identifying in this way the data which give a reliable approximation of simulated crossing time and the data with an error larger than 10% from the simulated outcomes.  From the results of the neural network ( Figure 6), a first screen of the influential parameters was obtained. In the present case, results showed that input 1, i.e., τ, is the one most affecting result, followed respectively by inputs 8 (multiplicative part of safety distance) and 5 (side preference).
To have a better insight in which factors effectively contribute to pedestrian crossing time, some statistical tests were worked out.
The first developed test is the Pearson correlation. The outcomes, reported in Table 8, confirm that parameter τ is the most influential one. Moreover, they show that inputs 1, 2, 3, 4 and 8 (respectively τ, λ, Asoc_iso, Bsoc_iso and the multiplicative part of safety distance) have a negative relationship with the output result, while inputs 5, 6 and 7 (side preference, average stand still distance and additive part of safety distance) show a positive correlation with the crossing time. From the results of the neural network ( Figure 6), a first screen of the influential parameters was obtained. In the present case, results showed that input 1, i.e., τ, is the one most affecting result, followed respectively by inputs 8 (multiplicative part of safety distance) and 5 (side preference). To have a better insight in which factors effectively contribute to pedestrian crossing time, some statistical tests were worked out.
The first developed test is the Pearson correlation. The outcomes, reported in Table 8, confirm that parameter τ is the most influential one. Moreover, they show that inputs 1,2,3,4 and 8 (respectively τ, λ, Asoc_iso, Bsoc_iso and the multiplicative part of safety distance) have a negative relationship with the output result, while inputs 5, 6 and 7 (side preference, average stand still distance and additive part of safety distance) show a positive correlation with the crossing time. Bsoc_isotropic −0.262

Input 6
Average Standstill Distance 0.229

Input 7
Additional Part of Safety Distance 0.007 Input 8 Multiplicative Part of Safety Distance −0.034 The p-value test has been drawn to discover the significance for the target alternative hypothesis. The results resumed in Table 9 have been obtained for a significance level α=0,1, which is feasible for a preliminary analysis. Table 9. p-values for the input parameters of the formulated neural network.   The p-value test has been drawn to discover the significance for the target alternative hypothesis. The results resumed in Table 9 have been obtained for a significance level α = 0.1, which is feasible for a preliminary analysis. Data confirm the importance of parameter τ, but they also highlight three other factors which affect the model, i.e., λ, Bsoc_iso regarding pedestrian movement and average standstill distance referring to vehicular traffic. Finally, in Figure 7, a chart is provided with those parameters, which show the greatest impact on the chosen output. These charts represent the relationship between crossing time and the mean values of each input parameter.

Input no. Input parameter p-value
The recognition of these 4 parameters as being the most influential ones for the reproduction of crossing time also agrees with the theoretical definition of the same. Indeed, parameter τ regulates the acceleration of each pedestrian and Bsoc_iso controls the interaction between pedestrians. Thus, the lower τ is, the lower the pedestrian crossing time is, while the higher the Bsoc_iso, the stronger the effect of other interacting walkers, which pushes the simulated pedestrian to its destination more quickly. Also, an average standstill distance of oncoming vehicles relates the crossing action of pedestrians to vehicular arrivals, highlighting the effect of gap acceptance in the phase of crossing: a larger average standstill distance implies more time for a pedestrian to cross the road, which consequently turns into a higher crossing time. Furthermore, parameter λ, representing the events happening behind the considered pedestrian, influences the crossing time. This also agrees both with the theoretical background of the model and the real crossing behavior, which is dependent on all the happenings on the crosswalk, and not only on the ones occurring in front of the pedestrian. This short interpretation of the most influential input parameters shows their relationship with the observable pedestrian behavior, allowing us to assess if they are relevant-not only from a statistical perspective, but also from a more concrete point of view. Multiplicative Part of Safety Distance 0.734 Data confirm the importance of parameter τ, but they also highlight three other factors which affect the model, i.e., λ, Bsoc_iso regarding pedestrian movement and average standstill distance referring to vehicular traffic. Finally, in Figure 7, a chart is provided with those parameters, which show The recognition of these 4 parameters as being the most influential ones for the reproduction of crossing time also agrees with the theoretical definition of the same. Indeed, parameter τ regulates the acceleration of each pedestrian and Bsoc_iso controls the interaction between pedestrians. Thus, the lower τ is, the lower the pedestrian crossing time is, while the higher the Bsoc_iso, the stronger the effect of other interacting walkers, which pushes the simulated pedestrian to its destination more quickly. Also, an average standstill distance of oncoming vehicles relates the crossing action of pedestrians to vehicular arrivals, highlighting the effect of gap acceptance in the phase of crossing: a larger average standstill distance implies more time for a pedestrian to cross the road, which consequently turns into a higher crossing time. Furthermore, parameter λ, representing the events happening behind the considered pedestrian, influences the crossing time. This also agrees both with the theoretical background of the model and the real crossing behavior, which is dependent on all the happenings on the crosswalk, and not only on the ones occurring in front of the pedestrian. This short interpretation of the most influential input parameters shows their relationship with the observable pedestrian behavior, allowing us to assess if they are relevant-not only from a statistical perspective, but also from a more concrete point of view.
Moreover, considering Eq.1, the direct relationship among parameters λ and Bsoc_iso and the main force F ruling pedestrian behavior can be observed in the micro-simulation model. Regarding Moreover, considering Equation (1), the direct relationship among parameters λ and Bsoc_iso and the main force F ruling pedestrian behavior can be observed in the micro-simulation model. Regarding τ, it is fundamental in the calculation of the acceleration term, appearing in the model's equation (Equation (3)) as: where a is the acceleration, v 0 represents the desired speed and direction, while v represents the current speed and direction. Also these considerations, specifically related to the micro-simulation model, confirm the relevance of the identified parameters, underling their core role in the simulation. In practice, it is important to identify which parameters most influence the simulated pedestrian crossing time, since it will allow us to act directly on them to achieve better simulation results. Also, by further studying different crossing typologies, attention can be paid to these highlighted parameters, identifying different specific values for each crossing type. An extended record of the most popular crosswalk typologies and of the micro-simulation parameter values related to each of them, could, indeed, be a very useful input for technicians, which have not obtained enough sources to run a whole calibration step.
Finally, a first rough comparison between real-world measurements and the outputs of the neural network shows that the majority of the predictions stands in the range of [4,12] s, which very well approximates the crossing time measured in situ, belonging to the range [4,14] s. Also, the mean value, though it is the result of the training step and therefore is not the final desired outcome, shows a difference with the mean measurement of 1.95 s. In Table 10, basic descriptive statistics are reported.

Validation
One of the major issues when dealing with neural network is overfitting. This question is related to the architectural choices of the network, and the number of neurons greatly affects it: a too large number of hidden neurons, indeed, can generate overfitting, affecting the generalization ability of the network. Besides a correct choice of the network architecture, the creation of a large enough database can partially prevent the effect of network overfitting and provide a better chance to achieve a more satisfactory generalization. Also, one of the causes of overtraining may be the ratio of the number of input parameters and the dataset size used for network training. If there is a large number of input parameters, and a small number of examples in the learning set, the network will memorize rather than learn and generalize on the training dataset. However, it should be pointed out that the ratio of the number of input parameters and the size of the dataset for network learning are not the only causes of poor network generalization.
Within the Neuroshell2 software, an initial assessment of the generalization capability on the test dataset was made. However, the best estimate of network generalization ability is given by its application to a completely independent data set, in this case corresponding to the 20% of the initial training database.
This section deals with the evaluation of the generalization capability of the formulated neural network on a new dataset, and the results can be graphically seen in Figure 8.
As for the previous analysis, correlation metrics, as well as error measures were used as criteria to define the performance of the network.
On the testing dataset, a correlation coefficient of 94%, was obtained, and respective r-squared and r-squared coefficients of 88.6% and 88% were calculated. As can be noticed, their values are slightly lower than the ones of the initial evaluation. On the other hand, MAE equals 1.76 s, with a minimum error of 0.34 s and a maximum of 2.92 s. In this case, the mean absolute error is higher than the one calculated on the training results, but the maximum values of the absolute error on the validation and training results are comparable. The fact that the neural network was also able to reproduce the simulation results with a 94% correlation on a much more constrained set of the total amount of data demonstrates that it is able to generalize the relationships learned during the training step.
Despite the 3.5%-lower value of correlation in comparison to the after-training results, the level is still optimal and demonstrates the opportunity to use neural networks for further studies and assessments of pedestrian microsimulation model outcomes. Indeed, validating it on a new set of data, the 94% correlation shows a very good generalization capacity of the network and ensures that the neural network in future application will also apply the notions learnt during the training step on totally new and larger sets of data.
On the testing dataset, a correlation coefficient of 94%, was obtained, and respective r-squared and r-squared coefficients of 88.6% and 88% were calculated. As can be noticed, their values are slightly lower than the ones of the initial evaluation. On the other hand, MAE equals 1.76 s, with a minimum error of 0.34 s and a maximum of 2.92 s. In this case, the mean absolute error is higher than the one calculated on the training results, but the maximum values of the absolute error on the validation and training results are comparable. The fact that the neural network was also able to Monfalcone -1 st data set Monfalcone -2 nd data set

Comparison of Model and Prediction Results with Real-World Data
To further evaluate the results of the formulated neural network, a comparison with two additional data sets of real-world measurements was worked out. The first data set consists of new data measured at the same location on a different day of the week. As can be seen from Table 11, the achieved data have the same distribution of the original data set, with a mean value of 7.91 s for the crossing time. Table 11. Descriptive statistics of the two data sets of the real-world measurements recorded in Italy. The second data set consists of new measurements taken at a completely new location. This second location is characterized by the same type of road infrastructure and similar geometrical features, i.e., a pedestrian crossing on a double lane entry leg of a roundabout. However, it is located in the urban area of Maribor in Slovenia. The choice of this location was made because it belongs to the same type of infrastructure as the first location, but there are still some differences in the behavior of road users. In this way, a broader application of the neural network could be assessed and its ability to generalize was ensured. As summarized in Table 12, lower crossing times were found for this intersection compared to the Italian one: the mean crossing time was 5.92 s, with a more homogeneous behavior of pedestrians overall. Comparing the real data of the two locations to the results of the microsimulation model and of the formulated Ward network (Figure 9), various observations can be stated. Figure 9 shows the mean values achieved by the three data sets: real measurements, micro-simulation results and predicted outputs. Firstly, an important difference can be noticed between the results of the microsimulation model run with default parameters and all the other measurements, with an error equal to 3.84 s, 4.20 s and 6.19 s respectively considering the first data set recorded in Monfalcone, the second set of data collected at the same location and the results of the measurements done in Maribor. Comparing the mean crossing time predicted by the neural network to the measurements in situ-listed following the previous order-the errors equal 1.95 s, 1.59 s and 0.40 s, respectively. Also, looking at the ranges defined by the maximum and minimum values of each data set ( Figure 10), it can be noticed that the microsimulation model has the most sparse range of values with a minimum of 4 s and a maximum of 35 s, while Maribor's case study has the most homogeneous data, ranging from 3 s to 9 s. The neural network predictions show a more constrained range of values [3.47; 12.51] s, which effectively approximates both the real cases, though it neglects some boundary values. More in detail, a MAE was obtained between the simulated and measured maxima of 21 s and 26 s, respectively, when considering Monfalcone and Maribor. The neural network maximums differed by 1.49 s and 3.51 s, respectively. Regarding the minima, well-fitting values were found of the simulated and measured data, with no difference for the Italian location and 1 s difference for the Slovenian one. The same error calculated between predicted and measured data is 0.53 s and 1.02 s. Also, looking at the ranges defined by the maximum and minimum values of each data set ( Figure 10), it can be noticed that the microsimulation model has the most sparse range of values with a minimum of 4 s and a maximum of 35 s, while Maribor's case study has the most homogeneous data, ranging from 3 s to 9 s. The neural network predictions show a more constrained range of values [3.47; 12.51] s, which effectively approximates both the real cases, though it neglects some boundary values. More in detail, a MAE was obtained between the simulated and measured maxima of 21 s and 26 s, respectively, when considering Monfalcone and Maribor. The neural network maximums differed by 1.49 s and 3.51 s, respectively. Regarding the minima, well-fitting values were found of the simulated and measured data, with no difference for the Italian location and 1 s difference for the Slovenian one. The same error calculated between predicted and measured data is 0.53 s and 1.02 s. [3.47; 12.51] s, which effectively approximates both the real cases, though it neglects some boundary values. More in detail, a MAE was obtained between the simulated and measured maxima of 21 s and 26 s, respectively, when considering Monfalcone and Maribor. The neural network maximums differed by 1.49 s and 3.51 s, respectively. Regarding the minima, well-fitting values were found of the simulated and measured data, with no difference for the Italian location and 1 s difference for the Slovenian one. The same error calculated between predicted and measured data is 0.53 s and 1.02 s. From the previous comparison, an overall better matching of the results of the neural network to the real measurements is noticeable, and a better performance of the same than the microsimulation software run with default parameters is achieved. These first rough considerations show the effective improvement that could be introduced by using the formulated neural network: a better fitting already at this step confirms, indeed, the opportunity of applying the neural network to fine-tune the model's parameters in order to achieve a more reliable output.

Conclusions and Future Research
With the increasing of pedestrian flows on road infrastructures, pedestrian safety is becoming a central issue in transportation engineering. A promising way to tackle the problem of walking behavior and safety is microsimulation. Nowadays, lots of software is available to simulate walkers, but their reliability has not been extensively studied.
The present paper focuses on this issue, introducing a contribution to enhance the reliability of a microsimulation tool to model pedestrian behavior: the application of neural networks in the calibration process of a microsimulation model intended for pedestrian movement is a key step of the innovative calibration process.
The entire research aims at developing a methodology to calibrate a pedestrian microsimulation model, and this paper demonstrates the possibility of applying neural networks in the process of calibration and of making the pedestrian model more reliable, by predicting selected micro-simulated magnitudes to model pedestrian behavior.
In the paper, a micro-simulation model for pedestrian simulation at roundabout crossing has been developed through Vissim/Viswalk and a 5-layer neural network has been formulated to predict the pedestrian simulated crossing time.
A total of 8 model parameters (Table 4), which are not easy to measure in real-world field campaigns, have been chosen as input parameters for the neural network, and one simulation output (crossing time) has been selected as a prediction outcome.
An initial database of 100 random combinations of the input and output parameters has been drawn: 80% of it has been used for the training of the network, while the remaining 20% has been utilized to validate the same according to the criterium of learning outcomes.
Later, a new set of data with a size equal to the 20% of the initial one has been employed to further validate the network under the criterium of generalization capability.
Basic statistical tests have been carried out in order to evaluate the influence of input parameters on the dependent variable (pedestrian crossing time) and to understand which are the most influential input parameters for the considered output.
After the implementation of the first database as well as the training and first validation of the neural network, outputs found a very high level of correlation, equal to 97.5%.
Pearson correlation and p-value tests confirmed the findings and gave a first insight about the most significant input parameters: τ, λ, Bsoc_iso and the average standstill distance.
On the second set of data, a validation step has been developed, providing a 94% correlation, which shows a very good generalization capability of the network. The following conclusions were obtained:

•
It is possible to reproduce the simulated pedestrian behavior for a selected output by the application of neural networks; • Significant prediction parameters have been identified, providing indications about their influence on the desired computed output; • Overall, the high level of correlation and the identification of significant prediction parameters show a good performance of the selected neural network for the considered issue and set the basis for the development of a calibration method founded on the use of such AI tool; • The first comparisons among the neural network outputs, the micro-simulation model ones and the mean crossing times recorded at the two locations show that already at this step, the formulated neural network provides a range of results which highly agree with the real-world measurements and could improve the performance of the micro-simulation.
In further research, the results of this paper will be extended by enlarging the current database, majorly stressing the most influential parameters which have been pointed out in this paper. The neural network which has been formulated and tested in this study will be applied to the general calibration procedure of the microsimulation model in order to identify the optimal combination of random input parameters, which will provide the most reliable output result. Also, validation of the whole procedure will be carried out at other locations, characterized in the first instance by the same kind of crossing, then at places with different configurations, to ensure a large applicability of the methodology.