Bayesian inference on dynamic linear models of day-to-day origin-destination flows in transportation networks

Estimation of origin-destination (OD) demand plays a key role in successful transportation studies. In this paper, we consider the estimation of time-varying day-to-day OD flows given data on traffic volumes in a transportation network for a sequence of days. We propose a dynamic linear model (DLM) in order to represent the stochastic evolution of OD flows over time. DLM's are Bayesian state-space models which can capture non-stationarity. We take into account the hierarchical relationships between the distribution of OD flows among routes and the assignment of traffic volumes on links. Route choice probabilities are obtained through a utility model based on past route costs. We propose a Markov chain Monte Carlo algorithm, which integrates Gibbs sampling and a forward filtering backward sampling technique, in order to approximate the joint posterior distribution of mean OD flows and parameters of the route choice model. Our approach can be applied to congested networks and in the case when data are available on only a subset of links. We illustrate the application of our approach through simulated experiments on a test network from the literature.


Introduction
Given a geographic region, one of the main problems in planning and operating transportation systems is the estimation of origin-destination (OD) flows, i.e., the amount of trips made by people or freight between points in the region over a defined time interval. This is also referred to in the literature as the OD matrix estimation problem, since many initial models represented the vector of OD flows as a matrix.
OD flows are traditionally estimated through surveys, in which households or drivers are inquired about their daily journeys [24]. However, direct surveys are expensive and due to this reason they are in general carried out every decade. [8]. This low frequency implies that planners may remain many years with no data on the evolution of OD flows over time.
In the last years, governments in urban regions have built traffic control systems in order to manage traffic congestion in transportation networks. Those systems automatically gather large amounts of data on traffic volumes at low cost on a daily or even hourly basis. This allowed in theory the indirect estimation of OD flows by means of mathematical models. The idea is that we can extract information on OD flows from data on traffic volumes if we have a suitable mathematical model which describes their relationships. These models are of an optimization or statistical nature.
The early models assumed that the transportation network from which one observes traffic volumes is not congested. We can trace back the first attempts to the work of [26], who applied linear regression to estimate OD flows from traffic volumes. [23] proposed a model based on Beckman's optimization model for traffic assignment [2], whose solution is an estimate of the OD flows. [31] proposed a non-linear programming model based on entropy maximization in which the constraints are the linear conservation flow equations, whose solution corresponds to a maximum entropy OD flow configuration. [7] proposed a generalized least squares (GLS) model which can cope with errors in observed traffic volumes by minimizing the Mahalanobis distance between predicted volumes and observed volumes. [10] and [4] proposed general frameworks which generalize previous models.
[ 13,14] proposed models which take into account traffic congestion, by seeking solutions which correspond to an equilibrium state as defined for example by Wardrop's first principle [33,29]. [37] proposed a framework based on bilevel optimization, in which the first level corresponds to an objective function (e.g., maxent or GLS) subject to constraints which are obtained as a solution of a traffic assignment model in the second level. Due to the non-convex nature of the model, the authors proposed a heuristic procedure to solve it [36]. [11] formulated the OD matrix estimation problem as a compound fixed point problem, in which the solution of an inner fixed point problem corresponds to an equilibrium traffic assignment of an OD matrix obtained as a solution of an outer fixed point problem. In order to solve the problem, they propose a fixed point iteration based on the method of successive averages [28].
A common caveat in the abovementioned models is that they do not take variability of OD flows into account. Assuming that OD flows follow independent Poisson probability distributions, [32] considered the estimation of mean OD flows given a sample of observed traffic volumes vectors. Since the mean and variance of Poisson random variables is equal, variance of traffic volumes carry information on mean OD flows. He also assumed the existence of a single route for each OD pair, and proposed maximum likelihood and moment-based estimators for the mean OD flows.
Following Vardi's work, [30] proposed Bayesian estimators for the case when a sample of size one is available for the traffic volumes vector. [18,20,19] relaxed the assumption of a single route between OD pairs and proposed multivariate normal approximations to the likelihood function assuming that the variances of OD flows are functions of their means.
More recent approaches consider the development of dynamic models for the estimation of OD flows. As traffic volumes are observed over time, we cannot ignore their temporal nature. Here we must make a distinction between within-day and day-to-day dynamics. When considering within-day dynamics, one is concerned with the estimation of OD flows over the course of a single day. Here we can cite the noteworthy works by [12], [9] and [1]. In contrast, in day-to-day dynamics we consider the estimation of time-varying OD flows for a reference time period, e.g. the morning peak, given a time series of observed traffic volumes over many consecutive days. It is assumed that this reference time period is long enough so that most trips in the study region start and finish within the observation period.
[21] developed one of the first models for the estimation of day-to-day dynamic OD flows. The author assumes that mean OD flows are functions of parameters that do not change over time. Particular cases include constant demand, linear trend and weekday-weekend models. [25] proposed a general Bayesian framework for the estimation of parameters of day-to-day dynamic traffic models. They assumed that OD flows vary according to a Markovian transition kernel, and proposed a Markov chain Monte Carlo (MCMC) algorithm for the estimation of parameters. [22] proposed statistical methods to compare alternative day-to-day dynamic models.
A central aspect in the development of realistic day-to-day dynamic models is the modeling of users' behavior through route choice models. In practice, users choose routes based on the travel times they have experienced over past days. Thus, a suitable route choice model should take into account the influence of past route costs on the decision of users in a given time. [34,5] discuss models for the analysis of dynamic users' behavior.
In this paper, we represent stochastic day-to-day OD flows as a dynamic linear model (DLM), which allows us to capture the time dependencies of OD flows as well as non-stationarity. To the extent of our knowledge, this approach has not been applied before to the estimation of time-varying OD flows. In the formulation of the DLM, we take into account the variability originating from OD flows, user's route choices and measurement of traffic volumes on links. We model route choices through a utility model based on past route costs. Our model can be applied to congested networks and when there are available data only on partial links.
We also propose an MCMC algorithm, based on Gibbs sampling and forward filtering backward sampling, in order to approximate the joint posterior distributions of mean OD flows and parameters of the route choice model. We illustrate its application through numerical studies in a network from the literature. This paper is organized as follows: in Section 2 we describe the proposed dynamic linear model; in Section 3 we describe a route choice model and an MCMC algorithm for the approximation of the joint posterior of mean OD flows and route choice parameters; in Section 4 we present numerical studies and discuss the results; finally, in Section 5 we draw some conclusions and propose further developments.

A dynamic linear model for day-to-day OD flows
In Section 2.1 we define the proposed dynamic linear model and in Section 2.2 we describe the procedure for estimation of mean OD flows.

Model definition
Dynamic linear models are defined by a state transition equation, which describes how a system evolves in time, and an observational equation, which describes how observed quantities relate to the system's states. They have a Markovian structure and assume multivariate normal probability distributions for the random variables. System's states are regarded as unobserved parameters whose estimation is done through Bayesian updating. We refer to [35] for an excellent introduction to the corresponding theory.
Let (N , L) be a transportation network in which N is a set of nodes, L a set of directed links and J ⊆ N × N a set of OD pairs. For a sequence of subsequent time periods t = 1, 2, . . . , T , we define θ t = (θ 1t , θ 2t , . . . , θ nt ) T as the mean OD flow vector, in which θ jt is the mean OD flow for OD pair j ∈ J at time t and n = |J |. We define z t = (z 1t , z 2t , . . . , z mt ) T as the vector of observed traffic volumes in a subset of links in a network at time t, in which z it is the observed volume on link i ∈ I ⊆ L and m = |I|.
We consider the vector of mean OD flows θ t as the unobserved system state vector, which depends only on its previous state θ t−1 at time t − 1 plus a stochastic error. The vector of observed link volumes z t at time t depends only on the current unobserved vector of states, such that: We refer to equation (1) as the dynamic model, while equation (2) is the observational model.
G t is the system matrix and ω t ∼ N(0, W t ) is a stochastic error, where the letter "N" stands for the multivariate normal density with the appropriate dimension and W t is a covariance matrix also referred to as an evolution matrix. It governs the stochastic evolution of the system through time. We notice that θ t |θ t−1 ∼ N(G t θ t−1 , W t ). F t is an assignment matrix which relates mean OD flows to observed traffic volumes, while ν t ∼ N(0, V t ) is a zero-mean error term which corresponds to the deviation of observed volumes relative to the expected value In order to fully determine the model given by equations (1) and (2), we must specify the corresponding matrices. Regarding the system matrix, we assume that in the short term mean OD flows are locally constant. This implies that at time t mean OD flows should be approximately equal to mean OD flows at time t − 1, so that θ t = θ t−1 + ω t and G t = I, the identity matrix with the appropriate dimension. The evolution matrix W t is in general supplied by the modeler.
Regarding matrices F t and V t of the observational model, they must represent correctly the relationship between mean OD flows and observed traffic volumes. We define x t as realized OD flows around mean OD flows at time t, which distributes among flows on routes y t , which aggregate into observed traffic volumes z t . Figure 1 summarizes the hierarchical relationship between variables in our proposed DLM.
. , x nt ) T be the vector of realized OD flows at time t. We assume where Σ x t is a covariance matrix, which can account for correlations between OD flows or simply be a diagonal matrix in case OD flows are independent. Given a realized vector x t , for each OD pair j there is a vector of route flows y jt = (y 1jt , y 2jt , . . . , y n(j)jt ), in which n(j) = |K j | is the size of the route set K j of OD pair j, and let p kjt be the probability of route k ∈ K j being selected at time t. We assume y jt follows a multivariate normal distribution with a multinomial-like covariance structure, i.e.: and Cov(.) denote the expected value and the covariance, respectively. Thus . . , p n(j)jt ) T is the vector of route choice probabilities of OD pair j and the covariance matrix is given by: Where diag(p jt ) denotes a matrix whose diagonal elements correspond to the vector p jt and all other elements are zero. Notice also that as k∈K j p kjt = 1, the covariance matrix Σ y jt will be singular and the corresponding multivariate normal distribution will be degenerate. In order to avoid this, we assume there is a positive probability π of none of the routes in a route set being chosen, so that k∈K j p kjt < 1 assuring that Σ y jt is non-singular. This assumption will be realistic if the route sets include many routes, so that flows on the excluded routes are negligible.
Let y t = (y 1t , y 2t , . . . , y nt ) T be the joint vector of route flows across all OD pairs. We also define P t = blockdiag j∈J {p jt } as a block-diagonal matrix composed of route choice probability vectors and Σ y t = blockdiag j∈J {Σ y jt } as a block-diagonal matrix composed of covariance matrices of route flows, i.e.: Since we defined multivariate normal distributions for all subvectors y jt , and assuming that route flow subvectors y jt are conditionally independent given realized OD flow vector x t we have that y t |x t , P t ∼ N(P t x t , Σ y t ). (For convenience, we omit the explicit dependence on P t in the following developments). Now we are able to obtain the conditional p(y t |θ t ) by marginalizing x t . Since both the densities p(x t |θ t ) and p(y t |x t ) are multivariate normal, and ignoring the dependence of Σ y t on x t (we will take the dependence of Σ y t on x t into account in the inference of θ t in next section), from the properties of the multivariate normal we have [27]: In order to complete the formulation of our model, we must obtain the conditional density p(z t |θ t ) of observed traffic volumes given mean OD flows. First we assume that z t |y t ∼ N(∆y t , Σ z t ), where Σ z t is the covariance matrix of measurement errors when observing volumes on links. ∆ is the link-path incidence matrix for observed links, whose entries a ik = 1 if link i make part of route k and a ik = 0 otherwise. It is a deterministic parameter which is a function of the topology of the network and of the route choice sets.
Since both p(z t |y t ) and p(y t |θ t ) are multivariate normal, by marginalizing y t we have: From (6), we can finally specify F t and V t by defining: as an assignment matrix and the covariance matrix by: It is noteworthy in equation (8) that the variability in OD flows, in route choices and in volume measurement are all represented in the covariance matrix. In next section we describe the estimation equations for mean OD flows.

Estimation of mean OD flows
Estimation of mean OD flows θ t from observed link volumes can be made by Bayesian updating. We assume that the covariance matrices Σ x t , Σ z t and W t , the link-path incidence matrix ∆, the route choice matrices P t and the probability π of none of the routes in a route set being chosen are known parameters and make part of prior knowledge.
At time t, let p(θ t−1 |z 0:t−1 ) = N(m t−1 , C t−1 ) be the posterior distribution in previous time step t − 1, where z 0:t−1 = {z 0 , z 1 , . . . , z t−1 } is the set of observed volume vectors in past time periods. We have a prior distribution p(θ t |z 0:t−1 ) = N(m t ,C t ) before the observation of z t . Since in our local constant model given by equation (1) the system matrix G t = I, then: Notice that uncertainty on mean OD flows increases by addition of the evolution matrix W t to the covariance matrix C t−1 . Next, we define p(z t |z 0:t−1 ) = N(f t , Q t ) as the one-step forecast distribution of the vector of observed link volumes, where: It is worth noting that in computing V t by equation (8), the covariance matrix of route flows Σ y t is computed based on the predicted mean OD flowsm t = (m 1t ,m 2t , . . . ,m nt ) T , i.e.,Σ y t = blockdiag j∈J {Σ y jt } in which from equation (3): Finally, the parameters of the posterior distribution p(θ t |z 0:t ) = N(m t , C t ) are given by the Bayesian updating of the parameters, which in this case coincide with Kalman filter updating equations [35]: Where A t is an adjustment matrix, which controls how the parameters from the posterior distribution are modified according to the new observation z t . In particular, the adjustment matrix is a function of the prior covariance matrixC t and of the inverse Q −1 t of the covariance matrix of the one-step forecast distribution of link volumes, so that the adjustment matrix gives more or less weight to observed link volumes according to their uncertainty relative to the uncertainty in OD flows.
At a time t,θ t = m t is an estimator of the mean OD flows and C t is a measure of uncertainty of the estimate. At t = 0, we define p(θ 0 |z 0 ) = N(m 0 , C 0 ) where z 0 are not actually observed link volumes, but symbolically represents the modeler's prior knowledge on OD flows. The estimation procedure can be summarized as follows: 1. Starting from t = 0, set p(θ 0 |z 0 ) = N(m 0 , C 0 ); 2. For t = 1 to T , do: (a) Determine prior distribution parametersm t andC t by means of equations (9) and (10), respectively; (b) Compute the assignment matrix F t by means of equation (7) (c) Compute the covariance matrix V t by means of equations (8) and (13); (d) Determine the parameters of the one-step forecast distribution f t and Q t by using equations (11) and (12); (e) Determine posterior distribution parameters m t and C t by means of equations (14), (15) and (16).
In next section, we model route choice probabilities based on a utility model in order to estimate jointly mean OD flows and the route choice matrix.

Bayesian inference on route choice parameters
In Section 3.1, we treat route choice probabilities as unobserved quantities and represent them through a utility model. In Section 3.2 we propose an MCMC algorithm to approximate the joint posterior distribution of route choice parameters and mean OD flows.

Utility model and route choice probabilities
In congested networks, route choice probabilities greatly depend on past route costs. Users take into account their previous experiences in order to evaluate the utility associated with each route. Utilities will vary over time depending on user's memory, their sensitivity to route costs, fluctuations in mean OD flows, among other possibly influencing factors. As route choice probabilities are functions of utilities, these will also dynamically change.
We assume that the utility u kjt of a route k in an OD pair j at a time t as perceived by users is a linear function of its past route costs: in which φ s are parameters, c kj(t−s) are observed past route costs and r is the length of users' memory. The minus sign in equation (17) is used since costs are disutilities, in order that routes with lower costs are preferable. We may interpret the parameters φ = (φ 1 , φ 2 , . . . , φ r ) T as users' sensitivities to past route costs. It is reasonable to assume that they have non-negative values, otherwise higher costs will contribute to higher utilities. We also expect users to be more sensitive to recent route costs than to older costs, which implies that φ 1 ≥ φ 2 ≥, . . . , ≥ φ r . We adopt a multinomial logit model for route choice probabilities [3] in order that the probability p kjt of a route k ∈ K j in OD pair j at time t being chosen by a user is given by: l∈K j e u ljt (18) Where π is the probability of a user not following a route in a route choice set and e denotes Euler's number.

An MCMC algorithm
In this section, we develop an MCMC algorithm in order to sample from the joint posterior distribution of mean OD flows and the parameters of the route choice utility model given in (17).
Sampling from p(θ 1:T |φ, z 1 and We notice that step 1 (forward filtering) of the FFBS algorithm is possible since given φ, observed link volumes z t and prior knowledge D 0 , we have the route choice matrix P t and all other parameters determined for t = 1, . . . , T , so that we can apply recurrence equations (14) and (15). Then, step 2 generates a sample of mean OD flows θ 1:T .
In next section we illustrate the application of the proposed MCMC algorithm through some numerical studies.

Numerical studies
In the following subsections, we describe the generation of simulated data and present the results from the application of our approach to a test network.

Generation of simulated data
In order to illustrate the application of our proposed DLM and MCMC algorithm, we generated data for a transportation network from [22], given in Figure 2, which has 8 nodes and 10 links. Nodes 1 and 2 are the origins and nodes 7 and 8 are the destinations, so that we have 4 OD pairs: (1,7), (1,8), (2,7) and (2,8). In order to adapt the network to our experiment, we did not use the original data on demand and link parameters. We assumed that travel times on links are given by BPR functions according to equation (25). (25) Where τ (z) is the travel time on a link with traffic volume z, τ 0 denotes the travel time in "free flow" and z max is the capacity of the link. α and β are parameters of the function, for which we adopt the typical values from the literature 0.15 and 4, respectively. Note that the capacity of the link is not treated as a hard constraint, i.e., simulated volumes z are allowed to be greater than z max . In the test network, we set τ 0 = 1 and z max = 130 on all links.
The procedure used to generate the simulated data was the following: 1. At t = 0, set values for ∆, θ 0 , W t , Σ x t , Σ z t , φ, π for t = 1 to T and past route costs c 1−r = c 1−r+1 = · · · = c 0 = c freeflow , where r is the size of users' memory and c freeflow denotes route costs computed assuming free flow in the network.
2. For t = 1 to T do: (a) Sample mean OD flows θ t ∼ N(θ t−1 , W t ); (b) Sample OD flows x t |θ t ∼ N(θ t , Σ x t ); (c) Compute u kt and p kt for all routes k ∈ K j for all OD pairs j ∈ J from equations (17) and (18), respectively, and compute the route choice matrix P t = blockdiag j∈J {p jt }; (d) Sample route flows y t |x t ∼ N(P t x t , Σ y t ), where Σ y t is calculated from equation (3); (e) Calculate route costs c t = ∆ T g(∆y t ), where g(.) is a vector function which returns a vector of costs on links based on BPR functions given by equation (25); (f) Sample observed traffic volumes z t |y t ∼ N(∆y t , Σ z t ). The OD pairs are ordered lexicographically in vector θ t , so that mean OD flows in OD pair (1,7) corresponds to θ 1t , (1,8) corresponds to θ 2t and so on. The evolution covariance matrix is constant and given by W t = 10I, where I is the appropriate identity matrix. We assume that variability around mean OD flows is negligible, so as to Σ x t = I and θ t ≈ x t for all t, and we also assume that Σ z t = I. These assumptions imply that variability in observed link volumes will be mostly due to variability in route choices. We simulate OD flows for T = 100 time periods.
In addition, we set a users' memory length of r = 2 for the utility model, so that utility of a route k of OD pari j at time t is given by (17)) and φ = (φ 1 , φ 2 ) T = (0.5, 0.3) T is the vector of route choice parameters. We enumerated exhaustively all 12 routes between origins and destinations, resulting in a link-path incidence matrix ∆ with 10 rows and 12 columns. Since we enumerated all routes, we assumed a small probability π = 0.01 of a trip not following one of the routes. Table 1 shows the mean congestion level (CL) for a link i, given by the ratio between mean simulated traffic volumes and the link capacities: As can be seen in Table 1, links 2, 5, 6 and 9, which are in the central area of the network, have high congestion levels. The simulations, the proposed DLM and MCMC algorithm were implemented in the Python programming language version 2.7 by using Numerical Python module version 1.10.

Application of the MCMC algorithm
When applying the MCMC algorithm, we assume that we observe link volumes z 1:T on all links and route costs c 1:T . In practice, these data on traffic volumes are routinely collected through modern ITS systems and route costs can be estimated from data on travel times on links or can be collected from a sample of cars following selected routes. Except for the mean OD flows θ t and the parameters φ, we assume all other parameters are known. We use an uninformative multivariate normal prior for the initial mean OD flows vector θ 0 , with m 0 = 100 1 where 1 is the unit vector with the appropriate dimension and C 0 = 1000 I, and an uninformative improper prior for φ.
The MCMC algorithm was ran through 10000 iterations with a multivariate normal proposal distribution for candidate φ vectors with covariance matrix equals to 0.04 I, which resulted in an acceptance percentage of about 22%, and we discarded the initial 2000 samples as burn-in. The resulting Markov chain of sampled values is given in Figure 3. Figure 4 shows kernel density estimation of the marginal posteriors of φ 1 and φ 2 obtained from the sampled values. It can be seen that the sampled marginal posteriors have high densities around the true simulated values for φ 1 and φ 2 . The sample mean values for φ 1 and φ 2 were 0.5250 and 0.3651, respectively, and highest posterior density regions with 95% probability were [0.3546, 0.7067] and [0.1979, 0.5437], respectively. Figure 5 shows the simulated and estimated mean OD flows, with mean squared error (MSE) between simulated and estimated mean OD flows of 15.83. From these results, we can conclude that the proposed MCMC algorithm was able to estimate with good accuracy both mean OD flows and route choice parameters.

Unknown evolution matrix
Although in theory the evolution matrix W t in DLMs may be jointly estimated along with other parameters, it is often treated as a known parameter provided by the analyst. A convenient way of specifying the evolution matrix is through discount factors [35]. According to equation (9), the prior covariance matrix of the OD flows at time t is given byC t = C t−1 + W t , i.e., it is the posterior covariance matrix at the previous time period amplified by the evolution covariance matrix, which corresponds to the increase in uncertainty due to time. Then, we can write the prior covariance matrix asC t = 1/δC t−1 , where 0 < δ ≤ 1, so as to W t = (1 − δ)/δC t−1 . The term δ is a discount factor. Notice that, when δ = 1.0, we do not have an increase in uncertainty from time t − 1 to t, thus corresponding to a static model. Lower δ values correspond to higher increase in uncertainty from time t − 1 to time t.
We ran our proposed MCMC algorithm with varying discount factors δ, starting from covariance matrix C 0 = 1000 I, in order to assess its impact on the quality of the estimation of  the route choice parameters and mean OD flows. In Table 2 we show the corresponding sample meansφ 1 andφ 2 , 95% highest posterior density regions for φ 1 and φ 2 , and MSE between simulated and estimated mean OD flows. Figure 6 illustrates the effect of using δ = 0.9 on the estimation of mean OD flows. As can be seen from the results, there was little difference in estimation quality between the cases with known and unknown evolution matrix regarding the parameters φ. The difference is more   pronounced with relation to estimation quality of mean OD flows, which exhibited higher MSE with the evolution matrix specified by means of a discount factor.

Observation of traffic volumes on partial links
In some real transportation networks, there are data on traffic volumes only on a few links. We consider the application of our MCMC algorithm in these settings. In these set of experiments, all parameters are equal to the simulated ones, except for the mean OD θ 1:T flows and the parameters φ of the utility model, which are estimated by the MCMC algorithm. We vary only the number of links and in which of them we have traffic volume data. We test three cases: observations on 1 link, on 2 links, and on 3 links in the network in Figure 2. Links were selected so that we have a representative subset of the network. In Table  3 we show the corresponding sample meansφ 1 andφ 2 , 95% highest posterior density regions for φ 1 and φ 2 , and MSE between simulated and estimated mean OD flows.
In Table 3 Figure 7 shows simulated and estimated mean OD flows for this case with data on links 1, 7 and 9. Estimation quality of route choice parameters φ also increases with the observation of more links, as the HPD regions get tighter around the simulated values φ = (0.5, 0.3) T . Nevertheless, this effect is not so pronounced for OD flows. The best result among the tested cases was obtained when observing links 2, 5 and 9, which are links located in the central and more congested region of the network. It is also worth noting that we could estimate route choice parameters to a good accuracy by observing only link 9.
In addition, we notice that the estimation results for the case when we have traffic volumes data only on link 1 are the worst. This may be possibly due to the fact that there is no route in OD pairs (2,7) and (2,8) which includes link 1. Thus, traffic volumes on link 1 give no information on OD flows for these OD pairs. In Figure 8 we see that estimated mean OD flows

Conclusion
In this paper, we proposed a dynamic linear model for day-to-day OD flows in transportation networks. In our model, we took into account variability in OD flows, route choices and traffic volume measurements. We also modeled route choices through a utility model based on past route costs. We proposed a Markov chain Monte Carlo algorithm in order to sample from the joint posterior distribution of mean OD flows and route choice parameters by using data on traffic volumes on links and past route costs. Our model can be applied to congested networks and when data on only a subset of links are available.
We illustrated the application of the DLM and MCMC algorithm on a test network from the literature using simulated data. In our experiments, we were able to estimate with good accuracy both mean OD flows and route choice parameters using uninformative prior distributions and data on a subset of links in the test network. These are very promising results and indicate that dynamic linear modeling of day-to-day OD flows may provide a valuable tool in analyzing and estimating OD demand in transportation networks.
As further extensions to our work we suggest the consideration of trend and seasonality in OD flows. This can be implemented through the definition of additional parameters such as trend and seasonal factors, whose relationship with mean OD flows can be easily established by means of the system matrix G t . Another promising research direction is the use of our proposed DLM to forecast future traffic volumes, after setting up the model with estimated route choice parameters.