Using New Mode Choice Model Nesting Structures to Address Emerging Policy Questions : A Case Study of the Pittsburgh Central Business District

As transportation activities affect a region’s environmental quality, knowing why individuals prefer certain modes can help a region make judicious transportation investments. Using a nested logit model, this paper studies the behavior of commuters to downtown Pittsburgh who use auto, bus, light rail, walking, and biking. Although statistical measures influence the selection of a nesting structure, another criterion for model selection is the policy questions such models inform. Hence this paper demonstrates how an alternative model structure allows planners to consider new policy questions. For example, how might a change in parking fee affect greenhouse gas emission (GHGs)? The proposed model showed that a 5%, 10% and 15% increase in parking cost reduces GHGs by 7.3%, 9% and 13.2%, respectively, through increasing carpoolers’ mode share. Because the proposed model forecasts mode choices of certain groups of travelers with higher accuracy (compared to an older model that did not consider the model selection criteria presented here), the proposed model strengthens policymakers’ ability to consider environmental impacts of interest to the region (in this case, GHGs). The paper does not suggest that one nesting structure is always preferable; rather the nesting structure must be chosen with the policy considerations in mind.


Introduction
Although one may view a city as a geographical unit, a city itself is a "functional region" defined by a series of networks-trade, communications, and transportation [1].Not only is the environmental health of the region influenced by actions taken [2], but the very definition of what constitutes a region rests in part on the connectedness of the transportation network.Recognizing that transportation impacts both the natural world (e.g., air, noise, and water quality) and the human environment (e.g., community cohesion, economic development, and property values) [3], it is appropriate to evaluate these impacts when considering potential changes to the transportation system.For example, in the U.S. alone, $4 trillion were forecast to be spent to accommodate new commercial and residential development over a quarter century-but more than a $400 billion reduction in these costs could be achieved with more compact development, which certain modes of transportation support [4].Failure to consider indirect impacts of transport investments, such as additional land consumption [1], increased barriers for social interaction [3] or vehicle emissions means that a large contributor to a community's health is ignored.Because these effects may not be easily apparent prior to investment decisions, transportation "models" may be used to help forecast demand over medium to longer periods of time (e.g., 5 to 20 years) and through considering external forces (such as changes in a region's employment), shorter term factors (e.g., fuel costs), and longer-term initiatives (e.g., increasing transit capacity) [5].Although a model may take many forms, it is at its essence "a series of mathematical equations that are used to represent how people travel" [6]; in a model, the impacts of a variety of decisions (do nothing at location x, build a new freight hub at location y, and expand the hours of service for transit line z) can be examined before such decisions are implemented.
A model's scope is not limited to actions dictated from a central authority; in fact, the fragmented decision-making structure of U.S.-based metropolitan planning [2] renders it naïve to assume such an authority can implement policies with no other institutional support.Gossling [7] and Feng [8] differentiate between such investments that entail a directive (also described as "command and control"; examples are law enforcement moving crowds after a sporting event or speed limits) and two other actions that do not require a centralized entity.These are: (1) investments that provide information only (e.g., changeable message signs indicating travel times on parallel routes thereby enabling travelers to reduce delay if they so choose); and (2) market-based approaches that are a combination of the former two categories (e.g., taxes on certain routes and subsidies for certain modes of travel).The role of the model is thus to provide information to decision-makers regarding the costs and benefits of potential investments, and this role can be served in a variety of planning contexts.For example, the model developed herein shows how work trips (relative to other trip purposes) tend to increase bus use more than they increase light rail use.While such a finding could, in limited situations, inform a command-and-control measure (such as the provision of bus-only lanes during peak periods), its relevance is more apt for "soft" policies [7] such as apps allowing travelers to compare this mode to the cost of other modes or market-based policies (e.g., a change in fares during peak periods).

The Niche for Discrete Mode Choice Models
Metropolitan residents have the option to choose from different travel modes multiple times throughout the day.Since the cumulative effect of these decisions affects the environment, understanding how travelers choose among these alternatives matters for policymakers.Some factors are specific to a given mode (e.g., travel time, cost, comfort, and reliability); others are specific to the traveler (income, awareness of alternatives, and the reason for the trip).As mode choice is a complex decision making process for each person, a large sample of individual choices are used to extract insights despite seemingly random variation [9].For example, slower modes might be tolerated to a greater degree by commuters with less disposable income than by wealthier travelers.
Because an individual's socioeconomic characteristics are a critical part of the decision process, and because some of these data can only be obtained through relatively expensive survey methods, transportation agencies have a vested interest in being able to mine historical survey-based data sets, such as travel diaries.However, new policy questions are emerging with the passage of time that may not have been emphasized just a few years ago when diaries were collected.Hence, there is a need to better understand how alternative model structures can be applied to answer such questions.This paper addresses that need by demonstrating how an alternative discrete choice model structure can answer one such question: how do changes in age and parking cost influence willingness to carpool?While carpooling itself is an old mode, the advent of a new supplier-the transportation network company (TNC)-has increased opportunities to share a vehicle and thus generated new interest in the factors that influence ridesharing.While some of this benefit results from improved model accuracy, the need to develop a model to address a specific question-rather than presuming one model can answer all questions-is germane.The paper thus has three objectives:

•
To determine if altering the nesting structure affects the ability of discrete choice models to address questions of interest.

•
To demonstrate the impact of such a revised structure on forecast accuracy.

•
To quantify how better models, influence estimation of environmental impacts (e.g., how increased vehicle sharing may affect emissions).
The first bullet is significant because new nesting structures can incorporate new variables, and there is an increased understanding that the individual socioeconomic characteristics influence mode choice as much as transportation characteristics [10][11][12].The second bullet is of interest because while much literature [13][14][15][16] has focused on modeling decisions, less frequently have models been validated for transferability to similar regions [2,17,18].(This paper assesses the model's accuracy using a different data set than that used to build the model, thereby helping to provide guidance on transferability elsewhere.)The third bullet is important because impacts on greenhouse gas emissions (GHGs) are increasingly considered in transport investment decisions [19,20].Thus, assessment of how changes in certain mode shares, such as carpooling, affects GHGs is an integral component of future policy decisions.While the paper uses a data set for an urbanized section of Pittsburgh, Pennsylvania as a case study, the demonstration of the value of new modeling structures for answering new questions is relevant to other municipal locations seeking to maximize the benefit of existing discrete choice data sets.
An extensive amount of literature related to mode choice decisions provides a starting point for considering model refinements [10].Less attention has been paid to forecast accuracy and using revised model structures to address policy questions of interest.

•
Following the advocacy of econometric techniques for discrete choice modeling, some studies employed explicit modeling of latent psychological explanatory variables, heterogeneity, and latent [9,21].Others have integrated two decisions via a combined mode and departure time choice model [22,23].Two other studies [24,25] used multinomial logit models to study intercity travel behavior in Libya and choice behavior of concert participants at Taipei Arena, respectively.Wang et al. [26] developed binary and nested logit models to check the extent to which visitors' individual attributes (e.g., income, travel frequency distance, and home delivery) influence shopping trip mode choices.Subbarao [27] developed a typology of trip chains based on the structure and activity of trips in a metropolitan region of India, and later proposed a nested logit model.Another study [28] used integrated choice and a latent variable model to examine the factors influencing school teenagers' travel decisions.Gao et al. [29] reviewed how altering public transit networks can improve the performance of mode choice forecasting.Two recent studies [30,31] considered commuters' willingness to carpool using cross nested model structures.Discrete choice models are not limited to mode selection but include highway safety [32], express delivery service [33], travelers' willingness to pay for better quality information [34], and economic impacts of disruptions to entire industries [35].

•
Forecast accuracy, especially the desire to make models transferable, has been a motivation for discrete choice models in particular [14]; Rossi and Bhat [36] summarize related efforts.Tellingly, however, the authors state that "There is no basis in the research for defining situations in which model parameters are clearly and definitively spatially transferable [36]."

•
For the study's third objective, some authors have developed models for policy questions of interest to non-modelers.Schlaich [37] showed that logit modeling can inform investments made in traveler information; at one location, variable message signs led to a maximum diversion rate of 30%.Kalaee et al. [13] showed that students not in neighborhood schools, students from families with high income, high school students, and female students are less likely to walk or bike in comparison to other students.Veras and Wang [38] concluded that time factors instead of cost factors are more influential in choosing an electronic toll collection system.Islam et al. [15] showed how mode choice behavior of park and ride users was affected by transit vehicles and transfer time at stations using logit models.A hypothetical hurricane scenario presented to Miami residents-showed that although special evacuation buses were the most likely mode choice, other modes could be favored such as taxi (wealthy evacuees) or regular bus (evacuees destined for a hotel) [14].Incentives for carpooling (e.g., [39][40][41]) have also received attention; this paper however, contributes to these existing studies by seeking to explicitly consider the carpooling and vanpooling modes as discrete alternatives and comparing the proposed model to an existing model used in practice for forecast accuracy and validation in order to provide guidance on transferability to other regions.

Motivation for Considering an Alternative Nesting Structure
The Southwestern Pennsylvania Commission is the regional planning agency serving the Pittsburgh 10-county area [42].Figure 1 shows the travel demand model developed for the region in 1989, and the structure informed certain policy questions that faced decision makers at that time.For example, the model differentiates between walking to transit and driving to transit.This enables one to evaluate how improvements to pedestrian facilities might influence transit use.One might compare the cost of such improvements to, say, the provision of better parking at such facilities, and thereby determine which initiative can increase transit patronage in the most cost-effective manner.The 2010 survey conducted by the Pittsburgh Downtown Partnership [43] was later used by the Southwestern Pennsylvania Commission to update some features of their model, however the model was not recalibrated and the structure was also left intact.this paper however, contributes to these existing studies by seeking to explicitly consider the carpooling and vanpooling modes as discrete alternatives and comparing the proposed model to an existing model used in practice for forecast accuracy and validation in order to provide guidance on transferability to other regions.

Motivation for Considering an Alternative Nesting Structure
The Southwestern Pennsylvania Commission is the regional planning agency serving the Pittsburgh 10-county area [42].Figure 1 shows the travel demand model developed for the region in 1989, and the structure informed certain policy questions that faced decision makers at that time.For example, the model differentiates between walking to transit and driving to transit.This enables one to evaluate how improvements to pedestrian facilities might influence transit use.One might compare the cost of such improvements to, say, the provision of better parking at such facilities, and thereby determine which initiative can increase transit patronage in the most cost-effective manner.The 2010 survey conducted by the Pittsburgh Downtown Partnership [43] was later used by the Southwestern Pennsylvania Commission to update some features of their model, however the model was not recalibrated and the structure was also left intact.With the passage of time since the development of the original model structure, new information has become available that suggests it may be productive to revise the model structure: A primary reason is that new policy questions are emerging.One question concerns transportation network companies (TNCs), also known as "technology-enabled mobility service companies," which provide on-demand transport services or ride hailing [44].As the availability of TNCs increases, a better understanding of the factors that cause individuals to carpool generally may shed insight into the potential market for TNCs.A related set of questions is whether commuter vans are similar to carpools, are they a competitor to transit services (e.g., [45]), or are they complementary to such services (e.g., [46]).Another question is to what extent may carpooling be affected, if any, by an increase in driverless vehicles.In fact, TNCs have started testing such vehicles in the case study area and elsewhere [47,48].For these types of questions, a focus on two modes "carpool" and "vanpool" is of interest.By contrast, because it focused on whether certain managed lanes should allow only vehicles with at least two, three or four people in them, it was logical for the original model to differentiate among the carpool modes (e.g., two, three, or more than four occupants)-and to With the passage of time since the development of the original model structure, new information has become available that suggests it may be productive to revise the model structure: A primary reason is that new policy questions are emerging.One question concerns transportation network companies (TNCs), also known as "technology-enabled mobility service companies," which provide on-demand transport services or ride hailing [44].As the availability of TNCs increases, a better understanding of the factors that cause individuals to carpool generally may shed insight into the potential market for TNCs.A related set of questions is whether commuter vans are similar to carpools, are they a competitor to transit services (e.g., [45]), or are they complementary to such services (e.g., [46]).Another question is to what extent may carpooling be affected, if any, by an increase in driverless vehicles.In fact, TNCs have started testing such vehicles in the case study area and elsewhere [47,48].For these types of questions, a focus on two modes "carpool" and "vanpool" is of interest.By contrast, because it focused on whether certain managed lanes should allow only vehicles with at least two, three or four people in them, it was logical for the original model to differentiate among the carpool modes (e.g., two, three, or more than four occupants)-and to avoid a separate vanpool mode entirely.A related policy item is that what's the role of environmental assessments in transportation investments; while this paper considers how GHGs may be affected by use of shared vehicles, decisions such as increased parking rates (to alter mode selection) or rezoning (to alter land development) also affect emissions [49].
A second reason for revising the structure is to increase model transferability.To some degree, demographic changes affect forecasts.For example, the number of persons age 65 and over is forecasted to increase by 78% in the U.S. between 2014 and 2040 [50].Accordingly, the se socioeconomic variables (such as age, gender, and income) capture individual preference.(Adeel et al. [12] found that females are more likely than males to use personal auto due to comfort, Easton and Ferrari [11] found that boys more than girls prefer to walk or cycle to school, Kalaee et al. [13] found that preference for car travel is affected by income.)Neglecting these factors hampers forecast accuracy, and hence transferability.Of particular interest to this paper, a separate validation data set [18] is fundamental to model development.

Data Description
This study used a stated preference survey designed by the Pittsburgh Downtown Partnership [43] in 2009 and conducted in 2010.The emphasis of the survey was the transportation needs of Pittsburgh regional residents who worked in downtown.Data from the survey concerning factors that influence travel decisions were used to update some features of the original model developed in 1989 [51].The survey was released to each employer's human resources manager and was made available to individual workers in that company.The survey was distributed to residents using emails and hard copies.The workers and residents were provided with a response time of one month in order to receive most responses.In total, nine districts near the downtown area were selected to participate in the survey, as shown in Figure 2.
Sustainability 2017, 9, 2120 5 of 16 avoid a separate vanpool mode entirely.A related policy item is that what's the role of environmental assessments in transportation investments; while this paper considers how GHGs may be affected by use of shared vehicles, decisions such as increased parking rates (to alter mode selection) or rezoning (to alter land development) also affect emissions [49].
A second reason for revising the structure is to increase model transferability.To some degree, demographic changes affect forecasts.For example, the number of persons age 65 and over is forecasted to increase by 78% in the U.S. between 2014 and 2040 [50].Accordingly, these socioeconomic variables (such as age, gender, and income) capture individual preference.(Adeel et al. [12] found that females are more likely than males to use personal auto due to comfort, Easton and Ferrari [11] found that boys more than girls prefer to walk or cycle to school, Kalaee et al. [13] found that preference for car travel is affected by income.)Neglecting these factors hampers forecast accuracy, and hence transferability.Of particular interest to this paper, a separate validation data set [18] is fundamental to model development.

Data Description
This study used a stated preference survey designed by the Pittsburgh Downtown Partnership [43] in 2009 and conducted in 2010.The emphasis of the survey was the transportation needs of Pittsburgh regional residents who worked in downtown.Data from the survey concerning factors that influence travel decisions were used to update some features of the original model developed in 1989 [51].The survey was released to each employer's human resources manager and was made available to individual workers in that company.The survey was distributed to residents using emails and hard copies.The workers and residents were provided with a response time of one month in order to receive most responses.In total, nine districts near the downtown area were selected to participate in the survey, as shown in Figure 2.  A total of 6513 surveys were completed during the survey period.The survey data included information on demographic and socioeconomic characteristics, travel purpose, commute distance, parking fee, and many other factors that might influence mode choice such as travel cost, travel time, number of workers in the household, and vehicle ownership.Respondents were asked to select between seven modes of transport (bus, light rail transit, car, walk, biking, carpool, and vanpool) based on various factors affecting their travel decisions.The descriptive statistics for the explanatory variables used to generate the model are provided in Table 1.

Methodology
The modeling of this discrete choice problem for travel modes involves the seven discrete outcomes shown in Figure 3 A total of 6513 surveys were completed during the survey period.The survey data included information on demographic and socioeconomic characteristics, travel purpose, commute distance, parking fee, and many other factors that might influence mode choice such as travel cost, travel time, number of workers in the household, and vehicle ownership.Respondents were asked to select between seven modes of transport (bus, light rail transit, car, walk, biking, carpool, and vanpool) based on various factors affecting their travel decisions.The descriptive statistics for the explanatory variables used to generate the model are provided in Table 1.

Methodology
The modeling of this discrete choice problem for travel modes involves the seven discrete outcomes shown in Figure 3. (Readers interested in modeling details should refer to Appendix A.)  A nested logit model consisting of three final mode choice groups were public transportation modes (including modes available for general public), private transportation modes (including modes that are private property of travelers) and commuter pool modes (consisting of ridesharing options).The data for the 6513 travelers were then split into two groups: 70% of these data points (e.g., 4559) were used for calibration-e.g., developing the model.The remaining 30% (e.g., 1954 data points) were used for validation and thus never used for calibration.
Multiple nesting structures were considered based on three criteria.First, the authors assessed the extent to which alternative structures demonstrated a competitive relationship among alternatives (within the nest) and hence were representative of true behavior.(For example, a nest that includes walking and biking may be appealing because of the common element of active transport; however, because the lengths of such trips differ substantially, another plausible nest is one that includes both bus and biking because of the common elements of not requiring auto ownership and comparable trip lengths).Second, because many plausible nesting structures exist, the likelihood ratio test (Appendix A) was used to test the significance of selected nest over the rejected one.A third criterion was whether the inclusive value parameters are reasonable.

Best Model
Table 2 shows the best model based on calibration from 70% of the data.For example, after evaluating multiple models, the best model includes a parameter of 4.87 for using the drive-alone car mode (if married), and the t-statistic for this parameter, since it is larger than 1.96, suggests this variable is significant.The values of the McFadden pseudo R-squared and adjusted R-squared for summary statistics are greater than 0.1, suggesting that the model provides good explanatory power.Almost all of the defined variables are statistically significant.All three inclusive value parameters (θ m in Appendix A) are below 1.0, justifying that the three nests are appropriate and indicating that the modes possess some degree of unobserved shared effects.The lowest value of 0.23 for commuter pool, for example, suggests that carpool and vanpool are substitutable for each other and the nesting structure has taken care of the unobserved shared effects that existed between the two modes.(The literature supports at least two different ways of viewing vanpools: as similar to "larger capacity transportation modes" such as buses [53] or as similar to carpools (e.g., [54]) analyzed carpools and vanpools together, with the common feature being their exemption from tolls).Placement of vanpool in a public transportation nest along with bus and light rail resulted in illogical inclusive parameter values, in contrast to the chosen model of Figure 3.For this particular data set, therefore, it appears appropriate to treat vanpooling as more similar to carpooling.

Variables with Intuitive Signs
Most variables have the expected signs.Household income shows a negative coefficient for bus, carpool, and vanpool, suggesting that higher incomes lead to a greater likelihood of driving alone.Travel time is also negative for walking and bicycling in the private nest: as travel time increases, travelers are more likely to substitute a private car for these non-motorized modes.Race also shows a positive coefficient for bus; however, the practical significance of this variable is lower for present-day work since Pittsburgh has a low population of Hispanics (around 2%).Distance from downtown shows a positive coefficient for bus and light rail, suggesting that travelers living farther from the downtown are more likely to choose these modes.Large parking costs, however, reduce the likelihood of using the car, as shown by this negative coefficient.
The signs in the model also can inform policy initiatives.Presently, the lack of street and garage parking during peak hours, for example, coupled with the negative coefficient for parking costs in the model, points to the use of park and ride services originating outside the Central Business District (CBD).Shared driverless vehicles could also be considered as a viable alternative in the future to eliminate the need for parking in the CBD-and thus knowing whether parking cost substantially affects mode split may help forecast the attractiveness of this alternative.(The impact of how individuals might react to an increase in parking cost is the focus of Section 7.)

Variables Requiring Additional Interpretation
For a few variables, however, the signs were different than that provided in some literature-and the reasons for differentiation require some additional interpretation.For example, these data showed that women were more likely to use public transportation than the private auto, which directly contradicts [55].However, given that Belwal and Belwal [55] found that female travelers preferred private cars due to a "gender disadvantage in developing countries", it appears that such a condition does not apply to Pittsburgh.The results also showed that married travelers are more likely to use a personal car than other choices and, while Table 2 alone does not provide a definitive reason, the fact that 'family obligations' was the second highest reason for individuals indicating they wanted a private car (Figure 4) suggests that these travelers may have needed the vehicle to pick up children from daycare or school.The results also suggest that people over the age of 45 are less likely to use active modes of transportation.The finding however, matches that of Schoner and Lindsey [56] who found that travelers age 40-49 were about two-thirds as likely to walk or bike as persons age 20-29.

Travel Mode Market Share
Figure 5 shows that that the highest market share among the three upper level nests (e.g., ,  , and ) belongs to public transport (44.6%), while personal cars represent the single highest portion of trips (42.5%) among the seven lower level choices ( , .. ).Carpool represents 6.5% of trips followed by walking (4%), bicycling (1.6%), and vanpool (0.8%).The forecasted and observed values revealed similar mode shares; for example, the observed mode shares for carpool was 7.5% (compared to 9% which was forecasted by the model).The mode splits shown in Figure 5 match those elsewhere that offer substantial public transportation options, although American Association of State Highway and Transportation Officials (AASHTO) [57] notes regional factors influence these shares.

Travel Mode Market Share
Figure 5 shows that that the highest market share among the three upper level nests (e.g., m 1 , m 2 , and m 3 ) belongs to public transport (44.6%), while personal cars represent the single highest portion of trips (42.5%) among the seven lower level choices (i 1 , i 2 ...i 7 ).Carpool represents 6.5% of trips followed by walking (4%), bicycling (1.6%), and vanpool (0.8%).The forecasted and observed values revealed similar mode shares; for example, the observed mode shares for carpool was 7.5% (compared to 9% which was forecasted by the model).The mode splits shown in Figure 5 match those elsewhere that offer substantial public transportation options, although American Association of State Highway and Transportation Officials (AASHTO) [57] notes regional factors influence these shares.

Travel Mode Market Share
Figure 5 shows that that the highest market share among the three upper level nests (e.g., , , and ) belongs to public transport (44.6%), while personal cars represent the single highest portion of trips (42.5%) among the seven lower level choices ( , .. ).Carpool represents 6.5% of trips followed by walking (4%), bicycling (1.6%), and vanpool (0.8%).The forecasted and observed values revealed similar mode shares; for example, the observed mode shares for carpool was 7.5% (compared to 9% which was forecasted by the model).The mode splits shown in Figure 5 match those elsewhere that offer substantial public transportation options, although American Association of State Highway and Transportation Officials (AASHTO) [57] notes regional factors influence these shares.

Public Transport Attractiveness
Collectively, respondents indicated that the most important factors contributing to their decision to use the personal car was convenience (48.5%) and family obligations (14.3%).Only 9.7% said they needed their car while downtown and only 4.6% named privacy as a factor.Substantially different results were obtained by considering respondents who chose public transport: saving money was the most common reason (48.5%), with other reasons cited less: convenience (14.3%), no car (10.6%), reduced stress (9.7%), and less pollution (4.2%).These observations have three implications for policies: first, for auto use, convenience (which may include control over departure and arrival times) is critical; second, cost has a much bigger impact for those choosing transit than those choosing the auto, and third, socioeconomic factors may play a sizeable role (given the 14.3% of persons choosing the private car cited family obligations).

Alternative Nesting to Consider Policy Questions of Interest
A question facing decision makers is what factors may affect the decision to carpool rather than drive alone-and what are the environmental consequences of intervention to influence these factors?This question matters as the "supply" of opportunities to share a vehicle may increase in the future.Although the focus herein is parking cost and persons age 45 and over as factors that may affect the decisions to carpool, the approach may be extended to other attributes that also affect willingness to carpool: road tolling, fuel pricing, education levels, job status, household income, and car ownership [39,58].Parking is a good example both because of its influence on urban density and because it was a primary issue of interest to decision makers in the case study location.Persons age 45 and over was used because the original model omitted this variable and, with the passage of time, it has become clearer that persons age 45 and over is projected to differ substantially by 2040.Table 3 shows how an alternative model structure can help planners consider such emergent policy questions.The original structure supported a different policy question: should the occupancy requirement for managed lanes be 2, 3, or 4? That model suggested a 10% increase in parking cost would increase carpoolers' mode share by about 1.5%.The new structure, however, suggests the same parking fee increase would raise the carpool mode share by 6.3%.Further, a 10% increase in travelers age 45 and over (a variable not included in the original model) increases the mode share of carpoolers by 3.7%.Because the proportion of persons age 45 and over is forecast to increase from 41% to 51% by 2040 [50], such socioeconomic variables may merit attention as new policy questions arise.The revised model structure can thus answer questions regarding the environmental impact of encouraged vehicle sharing through an increase in parking cost: how would a shift from a drive-alone vehicle to shared vehicles reduce GHGs?For this demonstration, the study further focused on the example of change in parking cost only.Given an average trip length of 11.2 miles to downtown Pittsburgh, average vehicle emissions of 8887 grams (g) of carbon dioxide per gallon, a ratio of carbon dioxide to total GHGs of 0.989, and an average of 21.6 miles/gallon [59], Table 4 shows amount of GHGs based on the 4559 total trips to the downtown area.After scaling this result to reflect annual emissions for downtown travelers (not just those sampled), decision-makers can compare this impact to potential emissions reductions from other policies such as recycling programs and better insulation for low-income housing [19].a (Trips)/2; b Shifts to other modes (bicycle, walk, and bus) are not presumed to generate emissions (e.g., while some additional persons use the bus, the increase is not large enough to require additional bus service); c Greenhouse Gas Emissions, measured as carbon dioxide equivalent grams (g CO 2 eq).
Table 4 shows how models can aid in policy decisions regarding parking costs in urban areas based on the expected future consequences.Table 4 suggests that by increasing parking costs, travelers prefer shifting to shared carpools, thus reducing net GHGs.A 5% increase in parking cost suggests a reduction of 7.3%, a 10% increase in parking cost suggests 9% reduction and a 15% increase in parking cost suggests a 13.2% reduction in GHGs.These numbers provide the basis for selection of future policies on parking costs to maintain a sustainable environment in dense urban settings.

Model Transferability
The strength of discrete choice models is that they explain variation in behavior based on a wealth of individual characteristics [60] (income, gender, urban environment), making it feasible, at least in theory, to transfer such models to other locations [2,17,18] (although some "updating" [36] of the model parameters using local data will be advantageous) [32].By contrast a model linked to a particular set of transportation analysis zones from whence the data were calibrated will likely not be transferable as no two locations will have the same zonal configuration [2].McFadden [60] noted discrete choice models, because they do not aggregate data by zone, preserve the "detailed associations between individual circumstances and travel choices".Thus, Rossi and Bhat [36] indicate that the likelihood of transferability is higher for person-level models than for zone-based models.
While a robust model can explain some variation in a given data set, the model's practical value depends on whether it can be used for forecasting scenarios of interest.That is, changes in a particular mode share will ultimately translate into a set of behaviors that have short-term consequences (e.g., congestion on bicycle network) and longer-term effects (e.g., consumption of land for new development and the resultant water and sewer costs).Thus, for any model, one must ask how does it perform with a different data set not used to calibrate the model?Hence, the authors determined the accuracy of the revised model, defined as the difference between model forecasts and observations, by using the validation data set (which had never been used for calibration); this approach provides the magnitude of error [18].For example, consider the 1,954 travelers not used to develop the revised model.Individual 1 chose to carpool.A perfect model would have forecasted the probability of carpooling as 1.0 for this individual, whereas the model forecasted carpooling with a probability of 0.749.The sum of the error terms (such as 0.251 for individual 1), divided by 1954, yielded an error of approximately 23.2% for carpooling for the revised model.
Table 5 shows the error for seven modes for the original model and the revised nesting structure, where lower values indicate more accurate forecasts.Table 5 suggests the revised nesting structure has higher forecast accuracy for six modes common to both models.For instance, the error for carpoolers reduces from 29.6% (from the original model) to 23.2% (for the revised model).For other regions, this exercise suggests that a revised nesting structure can be of value as the modes of interest change: for instance, if one wants to differentiate between requiring two versus requiring three persons per vehicle, the original nesting structure is preferable.However, if one wants to differentiate between two non-motorized forms of transport (bike and walk) then the revised model is preferable.Table 5 is also encouraging in that for all six modes shown, the revised nesting structure reduced the forecast error.Furthermore, regions interested in transferring the model can also use their local samples to test the forecast accuracy and then update model parameters as suggested elsewhere [2,36].a (Travelers using rail and buses f or at least a section o f their trip); b (sum o f 2 occupant auto, 3 occupant auto, and 4 occupant auto in the original model).
Additionally, the error from Table 5 can be combined with the mode shares of Table 3 to provide a forecast range instead of a point value-thereby conveying a more realistic understanding of any model's limitations.Table 3 shows that a 10% increase in persons age 45 and over increases the propensity of carpoolers by 3.7%.With an error of 23.2% for this mode (Table 5) and a 95% confidence level, the forecast increase is a range of 3.2% to 4.2%.

Conclusions and Application to Planning Practice
Using the over 6000 samples from a case study conducted in Pittsburgh, the paper reports on the development of a proposed model that incorporates, at its inception, both statistical criteria and policy questions.
The proposed model targets a specific question of interest: what factors lead to the decision to carpool rather than drive alone.The model shows that a 10% rise in parking cost increases carpoolers' mode share by 6.3%-an increase that is four times the amount forecasted by the original nesting structure.Demographics are a critical component of the revised model: a 10% increase in persons age 45 and over, adds almost 4 percentage points to carpoolers' mode share.Returning to the paper's second objective, the proposed model appears to increase transferability, as it reduces aggregate forecast error by six percentage points.Finally, the model shows the value of what-if analyses: a 5%, 10% and 15% increase in parking costs reduces net GHGs by 7.3%, 9% and 13.2% through shifting travelers to shared carpools.This type of scenario visioning can facilitate the incorporation of environmental impacts into investment choices.
The paper's demonstration of how changing the nesting structure affects both (1) the accuracy of the model and (2) the ability to answer certain policy questions of interest contributes to the existing literature.The authors therefore, recommend that new nesting structures should be considered along with statistical criterion during model calibration in order to tailor the model to a wide range of questions that may arise.Such joint consideration of both policy questions and statistical criteria at the same time may take some coordination between modelers and analysts.(For instance, the paper showed that the inclusion of socioeconomic variables in the model matters as much as transportation characteristics since the age distribution in the U.S. is projected in 2040 to differ substantially from what is seen today.) The paper, the refore, does not suggest that one nesting structure is always preferable; rather the nesting structure should be chosen based on the policy questions at hand.The findings do show some benefits to recalibrating relatively recent data sets for that purpose.Further, because of the importance of age, income, gender, and marital status in the Pittsburgh model, there may be a benefit to collecting additional demographic data, especially with respect to trip chaining, as resources become available.θ m are between 0 and 1-a fact that proved critical for later evaluating the many potential nesting structures for this data set.
The actual implementation of Equations (A1)-(A5) was performed with the econometric package N-Logit 4.0.The survey results were converted into the proper format for the N-logit package, including the creation of dummy variables and skipping of missing observations to clean the data properly.Because highly correlated variables are problematic [32,61], variables with correlation above 0.85 were removed from the analysis.(In this data set, travel time and cost had a high correlation coefficient of 0.96.)Then, a multinomial logit model was tested, but due to correlation effects of certain alternatives, such as vanpool and carpool, the results were not satisfactory.As an alternative approach, multiple nested logit models were created and tested to develop the best possible model having lower Akaike Information Criteria (AIC) values.The likelihood ratio test was also used to check whether the selected model was superior to the rejected models.The results for each model were tested for their statistical significance, where t-statistics larger than 1.96 (corresponding to a p-value of 0.05 for a two-tailed test) were viewed as significant.Note that in Table 2, the log-likelihood at zero may appear relatively high (e.g., closer to zero) than expected because some individuals did not have all seven modal choices available to them.Note also that because the bus system is much larger than the light rail system (that runs in a very small portion of the city) [51], approximately four times as many travelers use bus than light rail.

Figure 1 .
Figure 1.Original Mode Choice Model Developed for Pittsburgh.

Figure 1 .
Figure 1.Original Mode Choice Model Developed for Pittsburgh.

Figure 2 .
Figure 2. Map of Nine Districts included in the Downtown Pittsburgh Survey (Source: Pittsburgh Downtown Partnership Survey, base map © Google Earth [52]).

Figure 2 .
Figure 2. Map of Nine Districts included in the Downtown Pittsburgh Survey (Source: Pittsburgh Downtown Partnership Survey, base map © Google Earth [52]).

Figure 3 .
Figure 3. Two-Level Nested Logit Model for Pittsburgh Traveler Mode Choice.

Figure 3 .
Figure 3. Two-Level Nested Logit Model for Pittsburgh Traveler Mode Choice.

Figure 4 .
Figure 4. Reasons for Choosing Private Car (n = 2997).The category "Other" includes reasons not cited above, such as work late, start early, and distance to work.

Figure 5 .
Figure 5. Market Share for the Seven Modes Based on the Revised Model.

Figure 4 .
Figure 4. Reasons for Choosing Private Car (n = 2997).The category "Other" includes reasons not cited above, such as work late, start early, and distance to work.

Figure 5 .
Figure 5. Market Share for the Seven Modes Based on the Revised Model.

Figure 5 .
Figure 5. Market Share for the Seven Modes Based on the Revised Model.

Table 1 .
Descriptive statistics of explanatory Variables.

Table 1 .
Descriptive statistics of explanatory Variables.

Table 2 .
Results for the Best Nested Logit Model.

Percent Use Mode of Travel PREFERENCE OF VARIOUS MODES OF TRAVEL
Reasons for Choosing Private Car (n = 2997).The category "Other" includes reasons not cited above, such as work late, start early, and distance to work.

Table 3 .
Considering Alternative Nesting to Answer Policy Questions of Interest.

Table 4 .
Assessment of How an Increase in Parking Prices affects Greenhouse Gas Emissions (GHG).

Table 5 .
Forecast Error for Seven Modes of Interest.Observed Outcome − Predicted Outcome; ** n = Number o f Observations;