Life-Cycle Asset Management in Residential Developments Building on Transport System Critical Attributes via a Data-Mining Algorithm

Public transport can discourage individual car usage as a life-cycle asset management strategy towards carbon neutrality. An effective public transport system contributes greatly to the wider goal of a sustainable built environment, provided the critical transit system attributes are measured and addressed to (continue to) improve commuter uptake of public systems by residents living and working in local communities. Travel data from intra-city travellers can advise discrete policy recommendations based on a residential area or development’s public transport demand. Commuter segments related to travelling frequency, satisfaction from service level, and its value for money are evaluated to extract econometric models/association rules. A data mining algorithm with minimum confidence, support, interest, syntactic constraints and meaningfulness measure as inputs is designed to exploit a large set of 31 variables collected for 1,520 respondents, generating 72 models. This methodology presents an alternative to multivariate analyses to find correlations in bigger databases of categorical variables. Results here augment literature by highlighting traveller perceptions related to frequency of buses, journey time, and capacity, as a net positive effect of frequent buses operating on rapid transit routes. Policymakers can address public transport uptake through service frequency variation during peak-hours with resultant reduced car dependence apt to reduce induced life-cycle environmental burdens of buildings by altering residents’ mode choices, and a potential design change of buildings towards a public transit-based, compact, and shared space urban built environment.


Introduction
Municipal residential areas and new developments are often marked by economic growth and high population density, where respective higher environmental emissions affect the air quality [1,2].Indeed, the high private automobile traffic in the established residential areas affects the structural integrity of buildings in such developments [3] with knock-on negative impact on the residents [4].Similarly, sustainable development and the creation and maintenance of a built environment necessarily requires measures of construction material transportation [5,6].For example, studies have shown that for a building element such as roofing, a sizable proportion of a built asset's life-cycle assessment (LCA) relates explicitly to transportation which has been measured to contribute 10% to a residency's new-build/building-renovation's overall carbon footprint within a LCA system boundary covering the entire life-cycle of building-product usage [7,8].Built environment development whole-costs are similarly affected by community transportation links; locations affect life-cycle costs with congestion apt to delay resources deliveries and often result in vehicle damage across respective congested transportation routes [9].
Regional constraints notwithstanding, mass-transit system plans are generally developed by municipal and transportation agencies to reduce ever-increasing traffic congestion on road networks by affecting mode choices of building residents [10].These plans are simultaneously targeted as being environmentally conservative for the existing municipal residential areas or any upcoming residential developments in the region.The implication of adequate public transport accessibility and reduced reliance on private automobile usage towards sustainable residential areas; e.g., compact neighbourhoods, walking habits of residents, urbanisation and shared-space designs, construction of shops and other facilities in buildings and neighbourhoods is abundant in the literature [11,12].Jabareen [13] further include sustainable transport as one of the key design concepts of sustainable building and urban design plans.Other researchers, such as Zimring, Joseph [14] and Cervero [15] propose that sustainable residential built environments, transport systems and urban forms should be designed so as to promote sustainable modes of transport, e.g., public transport, and devise policies to discourage individual car usage, among building residents.
On the other hand, transportation researchers such as de Luca [16] and Leyden, Slevin [17] argue that planning and provision of any (public) transport system is largely "flawed" as alternatives and policies are prioritised subjectively by decision-makers alone, with very little public engagement from local building occupants or consultation at the initial stages.Often public feedback generated is neglected or only marginally used to improve the existing system, which is argued by above literature to significantly increase the risk of implementing a public transport service and an overall transportation system incoherent with public expectations.This may lead to higher private automobile dependence among the building residents of any municipal area, causing heavy traffic congestions on road networks adjacent to these residential buildings [18].In addition to disrupting the supply-chain by delaying construction material transport for new buildings or haulage of disposal material from demolished buildings in the area, traffic congestions can result in increased cost and environmental burdens depending upon the traffic load and transport system typology.For example, in case of high traffic volume roads, around 95% emissions are traffic-related [19], while Stephan, Crawford [20] showed that the residents' daily commute energy load for Belgian passive (i.e., energy-efficient) houses was approximately 27% of the entire life-cycle energy burdens of these passive houses.Stephan, Crawford [20] also proposed that the transport energy of these residents may be reduced by approximately 31%, provided a shift in mode use occurs from private vehicle transport in favour of public transport, corresponding to a reduction of 8.4% in the houses' life-cycle energy consumption.Since environmental impact assessment rating tools for buildings, such as the United States Green Building's Leadership in Energy and Environmental Design (LEED) system, also include the residents' daily commute energy and environmental load as an integral part of the total life-cycle burden of the buildings due to the building-induced impacts (more in Section 2.2), therefore, in order to reduce the overall life-cycle impact of buildings, the mode choice redistribution in favour of public transport should be researched [10,18].
In mode use redistribution or diversion research, Diana [21] and De Vos, Mokhtarian [22] suggest that traveller expectations influence commuter satisfaction, which ultimately influences the variation in their ridership preference over time (public vs. private transit), hereby referred to as modal variability.Establishing the satisfaction of an individual traveller may be difficult, aligning satisfaction with travel patterns of daily commuters is important for municipal policymakers aiming to influence mode choice and travel survey datasets are used for this purpose by building [23] and transport [24] researchers alike.However, the presence of large quantities of variables in the travel dataset complicates the pattern discovery process of deducing the potential for commuter mode choice diversion in favour of public transport [25] and optimising the consumer mode choice -satisfaction dynamics.Golob and Hensher [26] and Diana and Pronello [27] maintain that although the patterns in large categorical variable sets may be discovered to some extent through multiple correspondence analysis, scatter plots and cross-tabulation techniques, such methods are fairly limited when many variables are to be jointly considered.Limitations of frameworks only applying these statistical analyses to study travel survey datasets can thus be avoided by data mining [28,29] through its potential to handle a large number of interrelated variables [30].
This study is part of a larger project that aims to deliver a multi-criteria decision-making framework towards the objective of sustainable cities and future green buildings served by a connected and sustainable road transportation system that meets commute demands of building residents.The proposed framework consists of: social aspect of government and user stakeholders' demands [31], life-cycle cost and environmental in/outflows [32].In Hasan, Whyte [33], a multipartite model was developed by the authors to establish commuter satisfaction from the level of service (measured on network coverage and frequency) as the antecedent of building residents' mode choice.Moreover, market segmentation based on mode choice, satisfaction from service level and service perception as value for commuter money was also performed in Hasan, Whyte [33] to analyse the function of underlying exogenous factors in the sample residential area.The explicit objectives of the research presented in this paper are described below: 1.
Examine the travel patterns in a representative intra-city (Abu Dhabi) dataset for existing residential areas or upcoming residential developments.

2.
Present a systematic way of assessing critical factors eliciting mode choice in commuter market segments towards optimising the social, i.e., stakeholder demands, aspect of the overall LCA of transportation systems and in the process, reduce the user-transport life-cycle energy and environmental load of residential buildings.

3.
Determine bus service desiderata for policymakers to develop an ameliorated bus service in future, which may divert more building residents to the improved bus service.This can potentially reduce the life-cycle costs and environmental (greenhouse gas emissions, smog, resource and energy use) burdens besides tacking the social parameters (stakeholders' perspectives) towards optimising overall life-cycle burdens of residential buildings as well improving future building designs, i.e., lesser parking area requirements, more shared walkable spaces, better accessibility etc.
In order to meet the study objectives, a custom travel data processing algorithm unifying association rules mining, and statistical analysis techniques is developed to identify two classes of variable combinations; statistically-significant and validated association-only.The two different groups aim to provide policymakers with the maximum information about the commute habits of local building residents in the city.Whilst statistical analysis is informative and interesting, it may fail to uncover the underlying modal variability patterns and commuter behaviour which may be visualised through association rules [34][35][36].For this purpose, validation of the filtered-out association rules against an internal validation set was conducted to further generate a set of association rules [37,38] followed by data reduction (similar to [36] and [39]) to remove redundant association rules.

Multiscale Effect of Road Network Transport System on Residential Buildings
Researchers not only propose sustainable waste disposal from building constructions sites but also advocate use of recycled construction and demolition waste for new residential developments to reduce the overall environmental impact.The presence of an adequate transportation system at regional scale is critical for facilitating the disposal and reuse, for example, Building Research Establishment (BRE) assigns a weighted contribution of 8% to building construction material transportation in the overall life-cycle of buildings [40,41].Other national and international guidelines on LCA of built assets that are used for environmental impact assessment of buildings such as EN 15978 [42], CEN/TC350 [43] and ISO/TS 12720 [44] also recommend estimating the material transport in both construction and disposal stages for accurate calculations.At the same time, a number of government regulations, subsidies and incentives exist for promoting recycled material usage in residential buildings and housing development sectors.Decree 205/2010 in Italy mandates at least 15% recycled material use in building construction [45] while Environmental Product Declaration on construction is now practiced in many European countries [19].The United Kingdom Aggregates Levy for extracting quarry materials of £2 per tonne [46] has boosted recycled aggregates usage in building constructions by approximately 25% [47].However, material supply-chain and costs due to far-off recycling sites, for e.g., accounting for ~70% costs for a 350 tonne/h facility in a Portugal-based study [48] and increased travel time of material transporting trucks due to on-route traffic congestion often discourage building contractors from recycled material usage.Similarly, any excessive time spent on road networks due to congestion may reverse some of the positive environmental benefits from using recycled materials due to fuel consumption.For example, using a standard 15-tonne lorry to move 1 tonne of aggregate material over 1km requires 0.014 kg of diesel, depending upon the local traffic conditions [40].Thus, the question of traffic congestion mitigation should also be tackled by the policymakers aiming to promote recycled material usage in buildings (see Section 2.2).
Besides reducing traffic congestion on road networks and relieving residents' transport energy of buildings [18], public transit services also affect property prices of buildings and development projects depending upon the relative proximity to mass-transit services.Mulley,Ma [49] studied residential property market in Brisbane, Australia and found that bus rapid transit (BRT) affected housing prices at the rate of 0.14% per 100m closer to the transit service routes while train service inferred a negative effect of 0.15% reduction per 100m closer to the train service line.These effects were also strongly dependent upon urban form and spatial distributional densities of buildings.Their study also highlighted the importance of studying local mode use travel patterns, commuter perceptions and the key position of underlying service attributes of service frequency, on-board crowding and the journey time in determining the relative impact of BRT service on the residential property development sector.

Interrelated Urban Form, Transport System and Buildings LCA: Public Transport Accessibility and Residents' Mode Choices
The interrelation between urban form, transportation systems (private and public transit modes), and the overall life-cycle impacts of buildings is a complex issue which has been explored by many researchers focusing on the synergy between developing low-energy buildings and sustainable transport [10,[50][51][52].The lower density urban form undoubtedly puts extra load on private vehicle transport as the preferred mode choice of municipal residents, for example, accounting for ~70% in European passenger transport with predominantly semi-detached houses, whereas limited parking in buildings may promote public transit usage based upon proximity to service line and accessibility [51].One of the earliest studies of the 21st century by Steemers [51] on developing a less polluted urban built environment noted the link between the overall building energy and transport mode choice of building residents.Similarly, the "Le Plan Bâtiment Durable" (French: "The Sustainable Building Plan") launched by the Government of France in 2009 aimed at reducing the existing buildings' energy consumption by 38%, also enlists the mode choice of building residents and developing strategies for promoting public transportation usage among the criteria for assessing the overall performance of buildings.Furthermore, other European Union states also highlighted the importance of understanding the interrelation between urban design, transportation system, building design, residents' mode choice pattern, "public transport connected buildings", compacted shared spaces and accessibility as environmental hotspots for future sustainable building designs [10].Similar points on compact and shared urban form were also highlighted by Norman, MacLean [53] in their economic input-output LCA on buildings and by Cuéllar-Franca and Azapagic [54] who showed terraced house to carry 309 t CO 2 eq.compared to the 455 t CO 2 eq. of a semi-detached house in the UK.
Stephan, Crawford [20] observed that the majority of so-called low-energy passive houses in Belgium are designed as suburban detached single-family dwellings compared to the public transport-connected city buildings.They further commented that any energy savings by these low-energy houses are offset by increased mode use of private vehicles compared to public transport.Earlier research by the same authors [55,56] on life-cycle energy requirements of low-energy houses has shown that over the life-cycle (50-100 years) of these low-energy residential buildings, residents' transport and embodied energy are >50% of the overall life-cycle energy load.They proposed that in order to reduce the overall energy demands of residential buildings, the transport mode choices of building residents must be tackled towards public transport uptake in addition to using recycled building components and sustainable energy measures as the mode use alone represented 27% of the built assets' life-cycle energy [20].
Two studies on buildings and environmental analysis (i.e., greenhouse gas emissions and energy use) of buildings were conducted by Anderson, et al. [23,57].They proposed transportation systems and residents' mobility patterns (i.e., mode choice distribution) as key parameters for interlinking residential buildings and the urban environment.They emphasised the strong tie between transportation and buildings and argued that the induced environmental impacts in terms of transport habits due to the design, locality and interaction of buildings with the urban form need to be captured.They used streamlined LCA to capture this effect on urban Munich residents in Germany using detailed mode use and travel behaviour surveys [23].They found that the overall life-cycle impact of buildings and the induced transportation environmental impact due to mobility pattern of residents is dependent upon the daily commute distance, mode choice and the location of the buildings.For example, in their study the lowest impacts were found for Munich central business district (CBD) residents due to buildings typology and short commute distance of residents, where the largest share was claimed by overall transportation impacts of CBD residents as 1160 kg CO 2 e/capita/annum while CBD residential building emissions were 1088 kg CO 2 e/capita/annum.Both studies [23,57] concluded that although the design and material usage in buildings needs to be upgraded to be more sustainable, the real criticality still lies with improving the mode use behaviour of building residents.Similar findings were noticed by Dodd, Donatello [10] in their review on LCA of European buildings with the residents' transport energy demands claiming between a 19% share to even completely overwhelming the life-cycle energy load of some buildings.
The critical role of mobility and the mode choice of building residents induced by the interrelation between urban form, locality, transport service provision and the building design was also acknowledged by the LEED system weighting methodology by assigning a total of ~15% credit weightage to transport in terms of reduced parking, bicycling and better quality public transit accessibility near buildings.Promoting sustainable mode choice among residents, proximity or connection to public transport services and bicycling and shared vehicle facilities were also included in the property/real-estate based Investment Property Databank of the triple alliance between Sustainable Building, Bureau Veritas and Barclays (~16% weightage); the Invesco, Allianz, AXA and AEW founded Green Rating Alliance; and, the Global Real Estate Sustainability Benchmark [10].The extensive focus on the mobility pattern of building residents in municipal residential zones/areas proposed by these works also recommended understanding the underlying factors behind mode choice to postulate policies towards uptake of more sustainable public transport usage.

Factors Affecting Travel Perception and Mode Choice
Researchers studying factors affecting mode choice behaviour, such as Dell'Olio, Ibeas [58] and de Oña, de Oña [59] proposed that generally any bias possessed by commuters, as for or against a particular travel mode (referred to as travel bias, henceforth) depending upon their unique characteristics, influences their perception of the quality attributes offered by a particular mode and may translate to a subsequent mode choice.The factors behind mode choices and variation of travel behaviour were also focused in a study conducted by Schmid, Schmutz [60].They performed a hypothetical study of a low carbon post-private car world and found public transport to be the most popular mode amongst survey respondents, followed by bikes and car-sharing.Stradling [61] found that the modal variability of residents in a case-study Scottish housing area is temporally constrained, with 20% of private automobile users actually acting as multimodal, i.e., using several transport modes, when the analysis period is stretched over a week.Building upon this, the present study gathered commuter behavioural responses and travel pattern data over a month to investigate the sensitivity of modal shifts in long-term mobility patterns.
Built environment around the public transit service, accessibility near commuter accommodations and offices may also influence their mode choice.An empirical study on the association between travel behaviour of urban residents in New York City and the density of the urban neighbourhoods was conducted by Chen, Gong [62].They noted that the density and built environment affected mode choice of the survey respondents.They also proposed that government authority planners may attract more users towards public transit modes by providing better accessibility near workplaces.Similarly, Cervero [15] proposed that residential developments with lesser shared-spaces for cyclists and pedestrians may affect the mode choice, particularly an inclination towards individual car usage, more so as the building designs may also carry abundant nearby parking facilities instead of using the space for mixed-use walkable areas.
Friman and Fellesson [63] and Diana [21] found modal variability and level of commuter satisfaction from public transport as highly dependent upon the local built environment and regional characteristics of the study area.They noted that the travellers near the city centres tended to sway towards higher public transport usage compared to their urban/suburban counterparts, making the results highly sensitive to the zonal distribution (urban vs suburban) as well as number of zones covered in the study area.The current study addresses this issue by surveying commuters on the intra-city travel routes across all zones of a metropolitan city (Abu Dhabi) to capture the maximum variability of responses.The City of Abu Dhabi has witnessed an increase in population accompanied by extensive residential developments and an increasing commuter dependence on private vehicles [64], resulting in traffic congestion and as such, is a suitable case study area for the purpose of this research.Furthermore, the excessive private vehicle ridership is responsible for high greenhouse gas emissions, for example in United Arab Emirates alone, it annually exceeds 11735.6Gg CO 2 equivalent [65] from road transport.The local government agencies in UAE have aimed to reduce its environmental load [66].The research presented in this paper and its accompanying works [19,32] may aid in reducing the environmental burden from road and building residents' daily transit dimensions towards an overall sustainable future city development.

Data Mining for Analysing Travel Datasets
Provided the competitive benefit of outlining commuter expectations, perceptions and rankings of existing service level to create pro-public transport policies, a large database of underlying variables (e.g., journey time, ride quality, transit fare, accessibility, nodes, socio-demographics of commuters etc.) are generally collected from representative population samples.A recent study by Liu, Xu [67] on improving the public mass-transit service quality in China acknowledges that an efficient data mining framework is capable of extracting useful information from transit data and representing it into clear and succinct policy-related recommendations.Liu, Xu [67] developed a data mining algorithm with data cleaning and filtration options.However, their proposed algorithm focused only on the time parameters and the pre-journey (accessibility, location, train/bus-station) and on-board (crowding, seating arrangements, fare and ride quality) parameters were not involved in the algorithm design.
Similarly, a Weka classifying algorithm-based data mining study on the occurrence of faults on tram lines, local atmospheric conditions and safety was performed by Gürbüz and Turna [36].The study noted the ability of data mining algorithms to filter through large mobility datasets and association rules to visualise the interrelation between various variables in the collected dataset.Building upon these studies, the work presented in this current paper contends that many of the variables collected through travel surveys or response diaries are interdependent or associated to varying degrees.Studying them in isolation may be inadequate if environmental impact analysts aim to study the complex relationship of categorical and nonmetric variables to recommend apposite policies to promote public transport, specifically as emission-controlling measure for residential buildings, in addition to the Intergovernmental Panel on Climate Change (IPCC) [68] recommendations of using recycled materials as green construction practices [69][70][71].Current literature has no such framework that can help policymakers to segregate the most desired transit service attributes from the several underlying variables affecting commuter psychometrics, thereby quantifying the stakeholder demands as part of the social aspect of LCA on any transportation system.This aspect is of particular importance for deducing marketing tactics aimed at attracting users that have somewhat of a neutral perception of either travel mode (bus or car) as well as retainment of the loyal public transport users.
Inspired by Ponte, Melo [72], who used the Gini coefficient primarily used in economics, for measuring the public bus-transit travel time heterogeneity in municipal areas and Diana [35] who used association rules for studying travel pattern, this study explores the applicability of data mining techniques used in economics and marketing research.Apriori association rules data mining is used here for achieving the objective of identifying the motivations behind commuter mode choices.Introduced by Agrawal, Imieli ński [73] and Agrawal and Srikant [74] for mining frequent item-sets in transaction data without using any underlying relative and distributional assumptions, it has now developed into a robust market analysis technique.Association rules identify the frequency of different variables appearing together in an observation (data) set.An association analysis reads every single row of variables on the travel datasets to produce association rules, which are of the implication form given in Equation ( 1).
x i , x j , . . . ,x n ⇒ y where the left-hand side is the antecedent (a set of predictor variables) and the right-hand side is the consequent (and represents the response variable).Periodically used for frequent pattern mining in fields besides management sciences, e.g., health [75] and weather forecasting [76], its application in transportation research is relatively new.Nonetheless, researchers [35,77] have explored the efficacy of associations rules for handing large and complex travel and mobility datasets.A primary advantage of using Apriori over traditional multivariate statistical analyses is its ability to rapidly find an association between large sets of metric and non-metric variables.Once an association is established between the dataset variables, traffic flow patterns and the individual public transport service attributes behind commuters' observable modal variability can be extrapolated for policy recommendations in favour of low carbon and energy conserving transit among the local building residents and travellers.Conversely, one of the main drawbacks of the Apriori algorithm is its tendency to produce a large number of association rules, rendering the technique to be an ineffective process if no other controls are provided.Pruning the rules is the next step [78] to filter out meaningless and inutile rules that may be misleading for policymakers.Association rules are constrained by the number of times a certain rule is supported by the dataset and the strength of rule (i.e., confidence) which is the fraction of dataset rows that contain both consequent and antecedent of the rule.However, confidence is an asymmetric measure which may provide erroneous results if the consequent has a large probability [79].Moreover, if the antecedent and consequent variables are independent of each other, the generated rules may be unsuitable, irrespective of high confidence [80], for establishing inference relations for policy development.Wu, Zhang [81] have introduced an additional measure called "interestingness" to discard unsuitable association rules (Equation ( 2)).

Survey Design and Analysis
A questionnaire was designed to gather data on the factors that affect user perception of existing bus network and modal variability among the local building residents and travellers [33].A total of 11 questions were developed for the survey, aimed at capturing commuter responses on the different variables according to Likert-type scales, dichotomous and multiple-choice options.Previous researchers [21,82] have noted the adequacy of weekly mode choice data in modelling the mode use of surveyed commuters, the sensitivity of commuter responses to the local built environment and zonal distributions as well as the importance of zonal characteristics in defining the observed transport pattern of building residents and the calculated transport parameter in buildings' overall environmental impacts [23,83].This study targets the commuters in all zonal distributions (urban and suburban) of Abu Dhabi city.The survey questionnaire used for this study and the different zonal distributions are shown in Table A1 and Figure A1, respectively.The eight arterial and medium to high traffic routes serving the daily commute demands of the residents of various types of buildings in the city were selected for data collection (Table 1) to capture an optimally representative data of the city's usual commuters and weekly data over six weeks was collected to improve the number of observations.The completed questionnaires collected by Department of Transport survey teams [84] were then analysed.Logic checking of data consistency was performed by the authors to address data sparseness, outliers and missing data based upon the guidelines by Osborne [85].This resulted in a useful sample of 1520 completed questionnaires (inclusive of weekdays and weekends surveys) which identified (detailed under separate cover as Hasan, Whyte [33]) ten commuter segments, namely: 1.
Modal variability: distribution of generated trips for each mode (i.e., bus and car travel).

2.
Perception of bus service as value for money (VfM): trade-off between quality of ride and level of fare.Three segments of good value for money, borderline value for money and bad value for money.

3.
Commuter satisfaction from level of service (LoS): ranked based on perception of the current level of network coverage and frequency of buses.Three segments of good level of service, borderline level of service and bad level of service.
The statistical distribution of the collected commuter responses is shown in Table A2.The study objective is finding service desiderata critical for policymakers to deter private automobile use and increase uptake of bus usage among local building residents and travellers to promote environmental conservation by developing an improved public transit service.A data mining algorithm is thus proposed below which is used to analyse the above case study commuter segments to generate variable associations for explaining the daily commute behaviour of the local building residents and travellers.

Conceptual Framework-Overall Proposed Modelling Approach
Current research suggests a market-based analysis that unifies "statistical measurement" and "associate data mining".The combination has been used to some extent in marketing literature (e.g., Shaharanee, Hadzic [38]) for filtering out redundant rules.Work here contributes by building a multi-tier modelling approach for travel survey dataset that filters "interesting rules" to split them in two distinct categories: associated rules; and, statistically-significant models.Initially, the generated rules are filtered using an "interestingness measure" from Eq. ( 2).The filtered rules are then used to build ordinal regression models, followed by (only) selecting meaningful rules at the filtration stage 2. "Meaningful rules" are defined as those that provide association between the variables that have not been already construed by the other rules passing the minimum thresholds, thereby rejecting the repetitive or redundant sets of rules.The problem is further explained in Figure 1.
Buildings 2018, 8, x FOR PEER REVIEW 9 of 37 distinct categories: associated rules; and, statistically-significant models.Initially, the generated rules are filtered using an "interestingness measure" from Eq. ( 2).The filtered rules are then used to build ordinal regression models, followed by (only) selecting meaningful rules at the filtration stage 2. "Meaningful rules" are defined as those that provide association between the variables that have not been already construed by the other rules passing the minimum thresholds, thereby rejecting the repetitive or redundant sets of rules.The problem is further explained in Figure 1.This twofold technique proposed in this study filters out redundant rules and provides final sets of statistically-significant rules.The rejected rules are then analysed in the second phase by a validation technique using a validation set.It is argued here that this technique will remove any misleading rules while still retaining a significant number of rules for policymaking without compromising the analysis accuracy.The proposed conceptual framework is presented in Figure 2. It should be noted that there are several off-the-shelf packages available for performing the Apriori association mining [35].However, as this paper proposes a unified framework, a tailored java script was used for data mining due to its compatibility with SPSS 23.0 regression functions, which can then be used to perform ordinal regression on the observed rules.Initial travel dataset "ζ" was cleaned to ensure a high quality of dataset be fed into the analysis algorithm.Data cleaning is an important step in the knowledge discovery process from analysis of predictor and response variables as it removes missing or anomalous values [85].Numerical values of household car-ownership and time to reach bus stations were also transformed into nominal variables using equally distributed classes to maintain uniformity in the data variable classes and ease the analysis procedure.The resulting dataset was then randomly split into a training dataset (ζts, containing ≈ 70% data) and, a validation dataset (ζvs).The training dataset was then used for generating an initial set of association rules: frequency of bus travel (FBT); frequency of car/taxi travel (FCT); satisfaction with network coverage (NetCov); quality of ride (QoR); level of fare (LvlFare); and, frequency of buses (FreqBus) which were used interchangeably as response variables based upon the desired output.This twofold technique proposed in this study filters out redundant rules and provides final sets of statistically-significant rules.The rejected rules are then analysed in the second phase by a validation technique using a validation set.It is argued here that this technique will remove any misleading rules while still retaining a significant number of rules for policymaking without compromising the analysis accuracy.The proposed conceptual framework is presented in Figure 2. It should be noted that there are several off-the-shelf packages available for performing the Apriori association mining [35].However, as this paper proposes a unified framework, a tailored java script was used for data mining due to its compatibility with SPSS 23.0 regression functions, which can then be used to perform ordinal regression on the observed rules.Initial travel dataset "ζ" was cleaned to ensure a high quality of dataset be fed into the analysis algorithm.Data cleaning is an important step in the knowledge discovery process from analysis of predictor and response variables as it removes missing or anomalous values [85].Numerical values of household car-ownership and time to reach bus stations were also transformed into nominal variables using equally distributed classes to maintain uniformity in the data variable classes and ease the analysis procedure.The resulting dataset was then randomly split into a training dataset (ζ ts , containing ≈ 70% data) and, a validation dataset (ζ vs ).The training dataset was then used for generating an initial set of association rules: frequency of bus travel (FBT); frequency of car/taxi travel (FCT); satisfaction with network coverage (NetCov); quality of ride (QoR); level of fare (LvlFare); and, frequency of buses (FreqBus) which were used interchangeably as response variables based upon the desired output.In order for the algorithm to generate association rules, there should be at least one tuple in the dataset for each observation.Given our target of finding transit policy-related hotspots to promote sustainable transport usage among local building residents and travellers covered within the travel dataset, syntactic constraints were put towards the consequent-side based upon the market segments.This constrained analysis is interesting for policymakers [35] as it may help to define service attributes that may be improved upon in future services that meet the public needs regarding the local bus service.According to the principles of Apriori data mining, only one response variable can be set as a consequent.Therefore, response variables were interchangeably used as a consequent for obtaining the association rules for the respective category and obtain sets of rules specific to each segment, e.g., frequent car travellers, occasional multimode travellers and commuters ranking bus service as good value for money, and so on.
Past literature has suggested that higher support and confidence relate to a better category of rules but that lower values of support and confidence are still recommended.Therefore, to capture the most "interesting" rules, minimum support and confidence were respectively set here at 0.1 and 65% [34,35].After appropriate data transformation and setting syntactic, support and confidence constraints, a number of frequent item rules were obtained for each category.The "interestingness measure" was used in the first filtration stage by setting a minimum interest value "Γ min ".The association rules mining through Apriori algorithm permits the analysts to set the minimum threshold for selecting useful models from a large set of created rules.Large numbers of association rules are initially generated which are then pruned by the analysts to reject falsely produced rules from the rule set that fail to qualify the established minimum threshold.
Evaluating the minimum threshold in the analysis may be guided through past experience or research literature.Therefore, in the present analysis, a minimum interest value of 0.05 was selected [16,86] with any future application of the proposed algorithm able to use a higher or lower value depending upon the study objectives.The retained rules were then used for ordinal regression analysis.The ordinal regression analysis calculated log likelihood and p-values for the models (the models were based on the association rules).Models which showed considerable change in log likelihood compared to null model and p-values < 0.05 were retained.The repetitive rules were then rejected using the theory defined in Figure 1 and the retained rules were used to represent the statistically-significant rules.The remaining rules were then used in phase II of the analysis to validate against the validation set while controlling for the minimum support, interest, confidence and non-repetitive criteria as discussed earlier.The algorithm proposed in this study is illustrated in Figure 3.
Input: Travel survey dataset ζ consisting of 31 travel attributes, minimum confidence, minimum support, minimum interest and number of rules across 1,520 questionnaires.

Performance of the Proposed Algorithm
The initial application of the proposed algorithm produced a large set (~1559) of association rules between the collected variables in the travel dataset that passed the minimum support (σ min = 0.1) and minimum confidence (α min = 65%) thresholds set in Step 3a of the algorithm proposed in this study (Figure 3).Filtration stage 1 (Step 3b) was then applied to remove uninteresting rules based on Eq. ( 2) with the minimum interest threshold (Γ min = 0.05).This way only 351 or 22.5% of the association rules were retained in the so-called intermediate superset as interesting rules.These 22.5% remaining association rules were then subjected to the algorithm stages in Step 3c and Step 3d and ordinal regression models were developed for each rule.All the association rules with insignificant p-values for coefficients were filtered out as a separate superset.Approximately 231 or ~65.8% of the remaining association rules passed the ordinal regression performance tests and were collected in a superset of ordinal rules.Although all of these 231 association rules may be useful for developing some pro-public transit policies for local building occupants and travellers, the large number of rules may be impractical for policy analysis and may also have some redundant or reoccurring relations as also suggested in the literature (see Gürbüz and Turna [36]).The algorithm steps of feature selection and Filtration Stage 2 were thus used for data-reduction to filter out redundant rules using the superset of the 231 ordinal regression rules, as explained below.
First, an intermediate superset was created as an exact copy of ordinal rules superset and using variable selection, any smaller association rule subset (e.g., {FBT 5 or more times a week, Weekday ⇒ Very satisfied with level of fare}) implying an association or interrelation already present in a larger subset association rule (e.g., {FBT 5 or more times a week, Employed full-time, Weekday ⇒ Very satisfied with level of fare}) already present in the intermediate superset was removed.The steps were automatically repeated for all the remaining association rules subsets as rules describing an existing or reoccurring association were redundant since they were already contained in the larger subset in the intermediate superset.This way only 25 or ~11% of the 231 ordinal regression rules were retained in the so-called intermediate superset as statistically-significant rules in Step 3f, while the filtered-out rules (total of 326 rules) from both Step 3d and Step 3e were collected in a separate superset.Using the validation travel dataset, 213 rules or approximately 65% of these discarded rules were found to be validated in Phase II, where again filtration approach based on interestingness measure (Step 3b) and meaningfulness (Step 3e) resulted in retainment of 22% or 47 rules in the superset of association-only rules.Finally, both supersets were grouped to give three association rules sets for (1) mode use, (2) level of service, and (3) value for money.Figure 4 presents the results of applying various filtration techniques in the developed algorithm where Phase III results present the grouping of phase I and II association rules.The overall the algorithm application managed to filter out a total of 95% (~1487 rules) of initially obtained association rules (total rules: ~1559) as redundant or insignificant to the total travel dataset, generating a total of ~5% or 72 rules for policymaking purposes.The following sections present results of the proposed data mining algorithm on the selected case-study of intra-city commuters in Abu Dhabi.The results from each phase are presented separately to elucidate the respective findings.An importance index is also introduced to summarise the policy-relevant findings highlighted by public consultation regarding existing services.Policy recommendations are also given on how the transit service can be improved to promote an environmentally, resource and energy conservative transportation system by improved public (bus) transport ridership among the local residents and travellers, specifically the predominant (~55%) apartment building residents.

Phase I results -public responses from statistically-significant rules
The input and output variables were primarily categorical in nature with upper and lower bounds of each attribute explained in the previous sections.Ordinal regression showed that a majority of the commuters reported bus travel to be easy and that 36% were willing to pay more for seat.Full-time workers were more flexible towards bus use than the younger respondents.Furthermore, as the collected data was skewed towards male commuters, some of the association rules were found to be affected by the gender of the respondents.On the other hand, only a few of the quality attributes of the bus service were found to be statistically-significant.In addition, the statistically-significant models also failed to produce inference between the purpose of the commuters' journey and their mode choice, thereby requiring further analysis in phase II.The ordinal regression performance results of the meaningful association rules, retained after the second filtration stage, are presented in Table 2. Interestingly, the residential building typology (e.g., apartment building, villa-style house, etc.) had no statistically-significant influence on the mode choices of the The following sections present results of the proposed data mining algorithm on the selected case-study of intra-city commuters in Abu Dhabi.The results from each phase are presented separately to elucidate the respective findings.An importance index is also introduced to summarise the policy-relevant findings highlighted by public consultation regarding existing services.Policy recommendations are also given on how the transit service can be improved to promote an environmentally, resource and energy conservative transportation system by improved public (bus) transport ridership among the local residents and travellers, specifically the predominant (~55%) apartment building residents.

Phase I Results-Public Responses From Statistically-Significant Rules
The input and output variables were primarily categorical in nature with upper and lower bounds of each attribute explained in the previous sections.Ordinal regression showed that a majority of the commuters reported bus travel to be easy and that 36% were willing to pay more for seat.Full-time workers were more flexible towards bus use than the younger respondents.Furthermore, as the collected data was skewed towards male commuters, some of the association rules were found to be affected by the gender of the respondents.On the other hand, only a few of the quality attributes of the bus service were found to be statistically-significant.In addition, the statistically-significant models also failed to produce inference between the purpose of the commuters' journey and their mode choice, thereby requiring further analysis in phase II.The ordinal regression performance results of the meaningful association rules, retained after the second filtration stage, are presented in Table 2. Interestingly, the residential building typology (e.g., apartment building, villa-style house, etc.) had no statistically-significant influence on the mode choices of the residents, requiring further investigation to understand the association between these two factors as proposed by literature (e.g., [23,57]).

Affordability and Constrained Users
In the first block of mode use results, the odds of a full-time employed commuter to choose public transport was 6-7 times that of their unemployed counterparts (PS rule 1: χ 2 = 72.68,p < 0.001) confirming market segmentation observations made previously by this project and identified under separate cover [33].The commuters admitting affordability of private automobiles were 1.3 (p < 0.05, χ 2 = 140.07:OMD rule 1) times more likely to be multimodal and 0.85 (p < 0.05, χ 2 = 39.96:EI rule 1) as likely to be first time travelling by bus compared to others.While as expected, the commuters with zero household car-ownership were 1.86 times (p < 0.05, χ 2 = 67.18:PS rule 2) as likely to use a bus service more than twice a week.Further exploring the affordability of the bus service exhibited that the commuters with a "very satisfied" perception of level of fare, also had 6.72 times likelihood of travelling 5 or more times/week by bus transport (Good VfM rule 2).The commuter satisfaction of public transport service level exhibited a slight decline relative to passenger travel time to the nearest bus-stop (see rule 2 of borderline level of service and rule 1 of bad value for money).Similarly, modal variability was affected when analysed against the same attribute as both rule 2 of pro-sustainability and rule 2 of environmentally insensitive market segments.Moreover, the commuters reporting buses to be crowded, were also twice as likely to belong to the environmentally insensitive group, indicating its significance in defining this market segment.

Crowded Buses
Rules pertaining to crowded buses were found in both less frequent bus users (EI rule 2) and bad service level (Bad LoS rule 1) groups.The results are consistent with the previous findings by Batarce, Muñoz [24] where the disutility of public transportation in Chilean commuters was observed to be correlated with in-vehicle crowding.Comparison results of value for money groups further suggested a number of priority areas in this regard.The commuter market segment willing to pay more for a seat in the bus, were also 1.16 times more likely to hold a neutral perception of the level of fare (Borderline VfM rule 3: p < 0.05, χ 2 = 62.22) and 0.71 times more likely to be very dissatisfied (p < 0.01, χ 2 = 632.49:Bad VfM rule 2).This implies that merely increasing the number of seats may at first persuade borderline users, but may not affect dissatisfied travellers to the same extent.Interestingly, travellers labelling buses as crowded were 1.62 times as likely to be "dissatisfied" by the frequency of buses (Bad LoS rule 2).Even though a direct relationship between the variables may have been unperceived, a deeper correlation may be present as less frequent buses on the travel routes serving the commute demands of these local building residents may impose a higher load on the existing buses.This finding indicates the presence of a potential market (of borderline and dissatisfied commuters) for policymakers by improving the public transit service based upon the public need of frequent service.The improved public transit ridership due to a corresponding reduction in private vehicle usage may then reduce the overall life-cycle environmental and energy load of the residential buildings in these studied Abu Dhabi City zones due to lesser contribution from the daily transport energy demands of their residents.
Proposing investment in size or number of units of supply (buses) to meet the local building occupants' daily commute needs can then be countered by slightly higher fares and it may optimise the financial elasticity of the publicly-owned transport agencies.A follow-up life-cycle study on the cost of running an improved public bus transport service, i.e., more frequent (shorter headways) shall also be conducted.It is reiterated by the observance of travellers "satisfied" with journey time, as 13.22 times likely to be also satisfied with the network coverage (Good LoS rule 5: p < 0.001, χ 2 = 214.14).It should also be noted that the two indicators of level of service, namely network coverage and frequency of buses were found to be highly correlated as the travellers who were "very satisfied" with the coverage were 2.88 times more likely to have the same perception of the frequency (Good LoS rule 2: p < 0.05, χ 2 = 352.27).Commuters were also around 7 times more likely (Borderline LoS rule 1: p < 0.001, χ 2 = 348.09) to hold a neutral perception of network coverage, causing a neutral opinion of the bus frequency, and 10.45 times more likely to be dissatisfied with both; thereby exhibiting a strong correlation between the two attributes.

Dynamics of Bus Fare, Quality and Frequency
Exploring the reciprocity between fare and quality added to this recommendation of a more frequent bus service supported by higher fares.The commuters unequivocally associated "levels of fare" with "quality of ride" (rule 3 of good VfM; OR =2.86, p < 0.001, χ 2 = 97.02),so the transit by bus was regarded as a value worthy of riders' money.Statistically, quality of ride was also found to be constrained by the frequency of buses as commuters satisfied with frequency of buses were 9.65 times more likely to be also satisfied by the quality of ride (rule 1 of good VfM; p < 0.001, χ 2 = 164.79)and similar trends of interdependency were observed for the neutral perception of these three variables.This current study implies that unlike findings of previous studies (see Bachman and Katzev [87], Savage [88]), and fortifying the observations by Tirachini [89] and Tirachini, Hensher [90], decreasing levels of fare may not be solely responsible for influencing the perceptions of commuters regarding existing public transport services.
A slight increase in fares may be justified by more frequent buses or a higher ride quality; even though more frequent bus travellers exhibited an unwillingness to pay more for seats, a sizeable increase in frequent bus commuters may be achieved by optimising the "fare-frequency-quality" dynamic due to modal diversion, i.e., occasional multimode (both bus and car) users and frequent car travellers shifting towards higher (public) bus service ridership.Transportation economists and strategists traditionally propose a balance between frequency of buses, journey time and the level of fare [88], but fall somewhat short of indicating how the balance affects commuter behaviour and perception.As the fares were recently increased by the Department of Transport-Abu Dhabi, the observations of the current study may be of significance to the policymakers to understand the straightforward implications of such decisions in the studied traveller market.A benefit of the proposed algorithm is that the endogenous variables determining the perception of these variables among the commuter market segments are analytically filtered across a broad range of travel attributes.

Phase II Results-Policy Insights from Validated Association Rules
Mode choice dependency trends and bus ridership characteristics were further studied by analysing the association rules filtered out from phase I. Table 3 presents the validated association rules, grouped analogous to market segmentation analysis [33].Some interesting econometric models about the journey purpose and backgrounds of the commuters were found; contrary to past research, this study found that statistical analyses alone may be deficient for capturing detailed characteristics of the traveller market.Inner correspondence of association rules was also partially detected by rule 1 of PS where first time car travellers at the time of the survey were found to be more frequent bus users.

Targeting Work Commutes and Full-Time Workers
Studies on building residents working full-time have found that a majority of them use private automobiles for work commutes [18,23,91], yet research targeting work-related commutes to encourage public transport use among these building residents is usually neglected.Policymakers may need to further investigate the quality improvements for attracting these consumers as this study found full-time workers commuting to/from work as predominant in both occasional multimodal (OMD rule 3) and frequent car travellers (FrCT rule 1).These findings are also consistent with the previous findings by Horner and Mefford [92] where the potential of public transit for work-related commutes was recognised.Furthermore, the need to live close to place of work and/or friends and family was one of the primary elicited reasons across all segments (e.g., PS rule 4, Good LOS rule 7, Borderline LOS rule1 and Good VfM rule 5).

Impact of Residential Building Typology, Locality and Nodal (Bus-Stop) Characteristics
The type of residential building, local urban form and location of the buildings were found to affect the daily mode choice of the residents [23,93,94].However, no association was found between origin-destination areas and mode choice for the commuter segments surveyed here.Yet some influence of the commuters' residence building type (e.g., apartment, villa house, labour camps etc.) were observed (rule 3 of PS).These observations may indicate the generalisability of the conclusions drawn from the results of the current study, specifically the dominant (> 80%, as per Abu Dhabi Government [95]) expatriate population of Abu Dhabi also being the dominant group among the survey respondents as apartment building dwellers [96].The residual effect of commuter satisfaction with the node, i.e., bus-station waiting area, on the overall quality of ride was also exposed by the results; namely that the commuters who were dissatisfied with the waiting area were also likely to be dissatisfied with the quality of ride (rule 2 of bad VfM).This represents a probable tendency of commuters to carry forward a negative perception that was developed before embarking on the public transit journey.

Budgetary Constraints, Bus Service Frequency and Network Coverage
This work found that the environmentally insensitive commuters, i.e., those who primarily use private automobiles for daily travel (rule 1 of EI), were largely satisfied with the public transport fare level.It can be inferred that the budgetary constraints and pricing of the service may be of little or no concern to such users (high social or financial background, employment status etc.) and there can be several underlying reasons that may motivate their mode choice shift to public buses, if at all.Frequent car travellers indicated a willingness to pay further after observing a need for seat as a major concern (rule 5 of FrCT) and attributed no delays due to traffic congestion during their daily commute.Phase II results also showed that some commuters may have been compelled to use public transport due to necessity, rules 1 and 4 of PS (no driving license or car-ownership).Investigation of the rules on neutral and dissatisfied consumers of both LoS and VfM also revealed that financially (or otherwise) obliged travellers might have settled for public transport against a more idealised mode.
Consumer satisfaction from the level of fare was also found to be dependent upon the quality of ride, thereby confirming initial perceptions that commuters perceive fare as a function of ride quality, as presented in rule 2 of VfM.This is also largely consistent with the findings of Hensher, Stopher [97] and fare resilience addressed in the literature review by Redman, Friman [98] where fare was largely associated with service quality.It is likely that the solution to motivating transit mode choice shift from private vehicles and/or improving the service perception may not be a reduction in fares but delivering consumer-expected quality for the charged price.One interesting finding of phase I and II was the tendency of a majority of commuters to purchase cash tickets instead of monthly passes regardless of the transit service perception.Marketing policies for monthly fare collection may prove to be significant in this regard.Additionally, follow up analysis attributed to rules 1 and 2 (good LoS) and rules 1, 4, 5, 6 and 7 (good VfM) exemplifies that contrary to past research (see Redman, Friman [98]), economic restraints only partly diminish the commuters' positive cognitive assessment of the public transport.Moreover, perception of the service quality attributes, such as journey time was also found to influence the satisfaction from level of bus service and should also be carefully considered by the policymakers.

Journey Time and Ride Quality
Rule 3 of bad VfM suggests that the commuters mentioning a neutral opinion of the journey time were very dissatisfied about the quality of the ride.It implies the role of long commute time in developing stress among travellers, as past studies [99] also found commuter sensitivity to journey time.Longer journey time is also significant due to: high vehicle operating costs to local building residents [100,101]; higher fuel consumption [102,103] and high environmental burden to the buildings' overall embodied life-cycle [10].The local government policymakers should, therefore, be mindful of the journey time detriments caused by any variations in the service route or frequency.In addition to confirming previously noticed significance of journey time on commuter satisfaction [97] and mode choice [104], phase II of the proposed algorithm was also able to pinpoint the contingent service level factors (namely frequency of buses and crowded buses).Policymakers can then predict the relative influence of changing any of the antecedent variables on commuter satisfaction from remaining variables and the subsequent mode choice.Another implication was the relative nature of quality of ride and level of service attributes of network coverage (Rule 4 of good LoS), also noticed in ordinal regression performance.Public demand-based policy implications for the bus service to improve bus ridership among local building residents and travellers towards reduction in the residents' transport energy and environmental component of the local residential buildings are distributed across the generated "meaningful rules".In order to encapsulate the findings of the algorithm proposed in this study on the case study dataset, this work also sought to solve an importance index (I in ) to estimate the policy recommendations consistent with the public demands.For a response variable ψ in the phase III results, with categorical values ranging from 1 to N, the importance index can be calculated for a predictor variable x i as shown in Equation (3).It should be worth noting, as the predictor variables are also nominal in nature, only one value η is used at a time to determine the index (I x ) for the specific predictor variable.The results of applying Equations ( 3) and ( 4) on the grouped association rules of phase III are illustrated in Figure 5 and have been summarised to some degree to eliminate similar findings from any particular commuter market segment.For example, the variable category Weekday was kept for only frequency of car travel despite its occurrence in the results for frequency of bus travel models: Findings here suggest that full-time workers in the 25-34 age group formed the major portion of the target market, and frequently commuted on the case study routes during weekdays.These study observations were distinct from some of the past studies that analysed only segregated student and young traveller markets [105], whereas a majority of the young users studied in this paper travelled by private automobiles.Further findings are detailed in the text below.

Residential Apartment Building Designs May Need to Be Upgraded
Residential apartments scored higher than any other dwelling types in the existing residential zones of the surveyed Abu Dhabi city (I in = 16.6%) as majority of the commuters responded as living in such buildings.This may reflect a pattern of denser land-use with an abundance of multi-storeyed buildings either on streets traversing or facing main arterial roads serviced by adequate public bus transport routes, especially since most commuters travelled short distance to reach the bus-stop (further details in Section 4.4.2),similar to the observations made by Thøgersen [106] and Hrelja [107].Improving public transport service in these residential areas may impact the parking requirements in the residential apartment buildings [108].Further work on building design implications may be developed from this study for upcoming residential development projects with adequate by provided local or building-connected public transport, as fewer parking floors may be needed in such buildings in the studied Abu Dhabi City area.The survey respondents generally stated travelling a short distance to reach the nodes (bus-stop) from their residential and office buildings across the studied travel routes and thus the time to reach bus-station from these buildings only had nominal effect on the modal variability of their occupants.For example, I in = 45.57% for the frequency of bus travel and I in = 28.57%for satisfaction from network coverage and ride quality.This study, therefore proposes that establishing more bus-stations or further updating the connection between these buildings and a public transit service to serve the local building residents on the studied Abu Dhabi City routes may not necessarily encourage them to shift towards bus transport.This is a policy-sensitive observation and different from some of the past studies (e.g., Chien and Qin [109]).However, as Heinen and Chatterjee [110] found in their review on the status of research in transportation literature, few of the studies extensively explored modal variability over long-term and a majority only utilised a limited set of categorical variables to study university students or employees living in near-work communities, which may have also restricted their findings in terms of application to real-world situations.

Optimise Journey Time to Impact Commuter Satisfaction
Intrinsically, importance indices from this study for network coverage and quality of ride show that the commuters assume upper and lower bounds of journey time regardless of mode choice, in line with past works [111].Any deviation in either direction causes a positive/negative ripple in commuter predilection of the service depending upon the journey time, confirming findings from previous research (see Hensher,Stopher [97]).In general, the commuters satisfied with the journey time on public bus transit were also satisfied from its network coverage (I in = 28.57%) and ride quality (I in = 42.86%).This supports the findings of previous studies [89,112] where the travellers were found to associate public transport as a slow and inflexible travel mode and the importance of journey time in promoting modal diversion among local residents towards a more sustainable mode (e.g.buses) was identified.On the other hand, unlike some of the past works [113,114], results gathered by this work found that the commuters admitting affordability of private automobiles were more likely to be multimodal in their transit behavior.

Bus Service Frequency, Crowded Buses and Level of Fare
Results here argue that the public ranking of the ride quality was influenced more by the frequency of bus service (I in = 28.57%) and less by the level of fare.The commuters interviewed were also found to be mainly concerned with the low capacity (crowded) buses, as both frequent bus and car travellers labelled the existing buses to be crowded.Other similar interesting correlations were also found, as the commuters who commented on over-crowded buses were also dissatisfied with the frequency of buses (I in = 14.29%) and level of fare.It builds upon the deductions from the econometric ordinal models that even though the commuters satisfied with the fare level travelled the most by buses (OR = 6.72); neutral (OR = 1.16) and dissatisfied respondents were more willing to pay extra for seats.Validated association rules further explained these observations by indicating that the level of fare was of little to no concern to non-users of bus services (i.e., the environmentally insensitive commuter segment), unlike past works [114,115].Commuters ranking level of fare as satisfactory also indicated willingness to pay more for a seat (I in = 28.57%).
Frequent car travellers were also willing to pay more for the bus service if they get a seat (I in = 14.29%) and face no delays due to traffic congestion (I in = 14.29%).In addition, the level of fare was mostly perceived by surveyed commuters in terms of ride quality and it is argued here that strong correlations exist between the two variables.These findings support a change in the current perspective of policymakers where agencies generally advocate a reduction in fare level [116] to encourage local building residents in the cities to choose public buses instead of private automobiles for the daily commute.This study proposes that instead of implementing a blanket reduction in the fare level, policymakers may be able to achieve better results in terms of user predilection towards bus service and modal use by increasing the frequency of buses, economically supported by a commensurate reasonable increase in the current level of fare.Furthermore, as journey time was of significant concern to the commuting public, travel should also be optimised by route improvements such as establishing a bus rapid transit (BRT) service to facilitate the daily commutes of the current and potential service users in Abu Dhabi.The cost and environmental implication of any improved transit system alternatives for future modifications based on public demand and the resulting journey time savings can then be compared through a detailed LCA study.

Discussion
There is a lack of research in the behavioural econometric analysis of the social aspect of commuter decision-making regarding transport mode choice, factors and attributes affecting these decisions such as biases, service quality, and frequency etc., which, according to IPCC (Intergovernmental Panel on Climate Change) [68], considerably hinders the selection of a suitable climate-conserving transport alternative for a sustainable municipal built environment.Apart from influencing daily commute pattern of local building residents in cities, inadequate transportation systems also impact the construction of sustainable buildings due to their direct impact on the material supply-chain and disposal options due to time delays caused by congestions on local road (Section 2.1).Therefore, by extension transportation systems are also argued to hinder sustainability performance of residential buildings.Stakeholders then seek to limit multiple-handling and transportation of feed-stock from building construction and demolition activities, as well as properly factor the impact of haulage and disposal distances and traffic congestion on the material choice of building developers.Furthermore, as explained in Section 2.1, the provision of an effective transportation system with less congestion and a better public transportation system may also influence the residency choices, sale and property development aspects of buildings.
In order to address these issues, the study presented here analyses a survey questionnaire of 1,520 responses gathered to capture the commuter perceptions on mode choice and predilection of an existing intra-city setting bus service towards improving the overall transportation system.Passenger responses can be used to influence their mode choice and reduce overall traffic congestion of the transport system after implementing the prospective changes in the transport service, based upon the public demand.Literature review covered in Section 2.1 explained that this reduction in traffic congestion via more uptake of public-transport contributes to an efficient distribution of building materials.While Section 2.2 explained that the reduction of embodied operational and life-cycle energy and environmental load of residential buildings is significantly influenced by the daily commute mode choices of its residents and may be reduced by relying on public transport instead of private vehicle transport.
To identify the motivations behind the public and private transport as preferred mode of building residents, this study proposed a data-mining algorithm unifying a modified version of the Apriori algorithm and ordinal regression to highlight policymaking proposals pertinent to the studied Abu Dhabi and Middle Eastern residents.Training portion of the travel dataset was subjected to the modified Apriori algorithm in order to generate the initial set of association rules that were partitioned according to the individual market segment candidates in the consequent of the rules.After initial interestingness-based filtration of rules, the results were exported to SPSS for externally executing ordinal regression analyses.Ordinal regression performance results of the obtained association rules, after passing a meaningfulness measure, were used to analyse the market segments.Throughout the ordinal regression experiments, a majority of the respondents stated travelling a short distance to reach the bus-stop from their residential and/or office buildings, and time to bus-station had nominal effect on the modal variability of commuters.This may indicate that the current connection between these buildings and the supplied public transit service is adequate.
Mode use pattern was found to be only partly related to the socio-demographic characteristics of the surveyed local Abu Dhabi City building residents.The gender of these respondents had no statistically-significant influence on modal variability but was found to influence their perception of the service as the males generally had a more positive outlook.As such, a majority of similar variables (e.g., building typology, location, age etc.) failed to pass the ordinal regression tests and the next stage (validation of association rules) was used to capture the influence of these variables.Towards travel constraints limiting mode choice of commuters, this study found that a significant share of frequent bus travellers may have been compelled to uptake of public transport due to socio-demographics (lack of access to cars and lack of driving license).Nonetheless, the economic restraints only partly prejudiced their negative perception of the bus service and a majority of the so-called "compelled" travellers viewed public buses as good value for money and deemed the current service level to be satisfactory.
It is noted here that occasionally multimodal and frequent car travellers exhibit a substantial willingness to travel by public buses provided the capacity is increased or the buses become less crowded, even at the cost of higher fare.The results show that commuter satisfaction from the quality of ride and level of fare were highly interrelated.Frequency of the public bus service, as a measure of the passenger satisfaction from the level of service, is found to influence the mode choices of local building occupants.Given the competitive nature of private automobiles and the tendency of commuters in the studied dataset to use public transport for work-related commute dissimilar to past findings, policymakers may need to optimise quality-frequency-fare dynamics especially targeting peak work hours to reduce residents' transport-related environmental load of local buildings.One of the common strategies that is often implemented in many countries around the world for this purpose is investing in mass-transit systems.Technical and regulatory aspects of implementing the mass-transit system may include smoother traffic flow due to lesser private vehicles on the road network, reduction in cost and environmental burdens, fewer energy and fuel requirements on the power and fuel supply grids in the municipal regions, reduced overall operational and user-transport energy of residential buildings and availability of more recycled material haulage options (from far-off locations) in some areas for building construction projects.The mode use changes as a result of any improved public transit service may also affect the urban form and building designs: accessibility and shared spaces, lesser parking needs routine maintenance due to lower traffic and noise vibration damages.

Conclusions
The life-cycle energy and environmental load inventory of residential buildings includes the impact of material usage choices due to haulage requirements from virgin material suppliers vs. recycling plants, traffic congestion impact on per tonne kilometre (tkm) energy load during the actual material transport, disposal feedback loops and the embodied building residents' daily transportation requirements.Public transport services are generally proposed for alleviating excessive private commuter traffic load on road networks and reduce the overall transport system energy and environmental burdens in the local municipal regions.Yet, the interrelations of mode choice to underlying social (i.e., stakeholder-related) attributes of journey time, location of residential and office buildings comparative to the public transit service node (bus-stop), on-board crowding and ride quality are somewhat unexplored.
To that end, the methodology proposed in this study aids the policymakers in finding the critical attributes for improving performance of any built asset by engaging the stakeholders in the life-cycle asset management process through a social component.The findings of the current study demonstrate its application to a real-world case study of Abu Dhabi City building residents and travellers.Based on results, it recommends that a bus rapid transit (BRT) service, more frequent at peak office hours, may entice significant car users towards the public bus transport service.Whilst the overall whole-cost and explicit environmental impact of meeting public demands by constructing and managing a proposed practicable BRT system can be explored through subsequent detailed LCA, the work presented here does find that addressing public transportation systems preferences can contribute to congestion reduction and impact positively to sustainable development.
The proposed framework used data-mining to filter through the large set of public response survey data for a transport pattern behavioural case study among local building occupants.However, future works may apply the same procedure for data-mining the stakeholder perspectives of other aspects in the residential building development: need for a recreational building facility, upgrading/modifying an existing building, façade renovations, and interior works for buildings, etc.

Figure 1 .
Figure 1.Definition of "meaningfulness measure" to filter repetitive set of rules.

Figure 1 .
Figure 1.Definition of "meaningfulness measure" to filter repetitive set of rules.

Figure 2 .
Figure 2. Conceptual framework of the proposed algorithm to mine a travel dataset.Figure 2. Conceptual framework of the proposed algorithm to mine a travel dataset.

Figure 2 .
Figure 2. Conceptual framework of the proposed algorithm to mine a travel dataset.Figure 2. Conceptual framework of the proposed algorithm to mine a travel dataset.

ψ 1 ) 5 .Figure 3 .
Figure 3. Proposed data mining algorithm for a travel dataset analysis of 1,520 questionnaires.Figure 3. Proposed data mining algorithm for a travel dataset analysis of 1,520 questionnaires.

Figure 3 .
Figure 3. Proposed data mining algorithm for a travel dataset analysis of 1,520 questionnaires.Figure 3. Proposed data mining algorithm for a travel dataset analysis of 1,520 questionnaires.

Figure 4 .
Figure 4. Filtration results of different phases in a collected travel dataset based on the developed algorithm.

Figure 4 .
Figure 4. Filtration results of different phases in a collected travel dataset based on the developed algorithm.

Figure 5 .
Figure 5. Importance index values for the phase III association rules of satisfaction and frequency criteria.

Figure 5 .
Figure 5. Importance index values for the phase III association rules of satisfaction and frequency criteria.

Table 1 .
Bus routes selected for the purpose of study.
Γ ; reject rule ψ R as uninteresting Return remaining interesting rules as subsets in superset ts R ψ ζ .c. Develop ordinal regression models ( ) ln k ψ for each response variable ψ in ts R ψ ζ .Compute; log likelihoods LL, coefficients i β of predictor variables i x and p-values.d.Filter rules in superset ts R ψ ζ

Table 2 .
Predictors of modal variability and service perception from association rules passing ordinal regression performance test.

Table 3 .
Validated association rules for interdependency of studied transit-related variables † † .

Table A1 .
Cont.Strongly agree † Both English and Arabic versions were used along with multilingual teams to capture data across all demographics.

Table A2 .
Statistical descriptive distribution of the collected variables.

Table A2 .
Statistical descriptive distribution of the collected variables.