Next Article in Journal
Forecasting Industrial Carbon Peaking and Exploring Emission Reduction Pathways at the Metropolitan Scale: A Multi-Scenario STIRPAT Analysis of the Hangzhou Metropolitan Area
Previous Article in Journal
A Method for Rapid Computation of Transport-Related Emission for Urban Network
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Modeling Mode Choice Preferences of E-Scooter Users Using Machine Learning Methods—Case of Istanbul

Faculty of Engineering and Natural Sciences, Istanbul Okan University, Istanbul 34959, Turkey
*
Author to whom correspondence should be addressed.
Sustainability 2025, 17(24), 11088; https://doi.org/10.3390/su172411088
Submission received: 10 November 2025 / Revised: 28 November 2025 / Accepted: 4 December 2025 / Published: 11 December 2025
(This article belongs to the Section Sustainable Transportation)

Abstract

Delays caused by motor vehicle traffic, accidents, and environmental pollution present considerable challenges to sustainable urban mobility. To address these issues, transportation system users are encouraged to adopt active transportation methods, micromobility options, and public transit. Electric scooters have become a notably popular micromobility choice, especially following the emergence of vehicle-sharing companies in 2018, a trend that gained further momentum during the COVID-19 pandemic. This study explored the demographic characteristics, attitudes, and behaviors of e-scooter users in Istanbul through an online survey conducted from 1 September 2023 to 1 May 2024. A total of 462 e-scooter users participated, providing valuable insights into their preferred modes of transportation across 24 different scenarios specifically designed for this research. The responses were analyzed using various machine learning techniques, including Artificial Neural Networks, Decision Trees, Random Forest, and Gradient Boosting methods. Among the models developed, the Decision Tree model exhibited the highest overall performance, demonstrating strong accuracy and predictive capabilities across all classifications. Notably, all models significantly surpassed the accuracy of discrete choice models reported in existing literature, underscoring the effectiveness of machine learning approaches in modeling transportation mode choices. The models created in this study can serve various purposes for researchers, central and local authorities, as well as e-scooter service providers, supporting their strategic and operational decision-making processes. Future research could explore different machine learning methodologies to create a model that more accurately reflects individual preferences across diverse urban environments. These models can assist in developing sustainable mobility policies and reducing the environmental footprint of urban transportation systems.

1. Introduction

One of the most pressing challenges facing modern society is highway traffic, along with its numerous associated issues, including delays, traffic accidents, and environmental pollution. Each additional minute spent in traffic represents an irreversible loss of time in a person’s life, significantly diminishing overall quality of life. The impact of traffic congestion is particularly severe in large urban areas. In Türkiye, Istanbul stands out as the most populous city, boasting an official population of 15,655,924 [1]. This number does not account for irregular migrants or individuals traveling for daily commutes or tourism, which underscores the severity of traffic-related problems in the city. Despite numerous investments in rail systems, the traffic issues in Istanbul continue to worsen. According to data released by INRIX, the average delay experienced by drivers in Istanbul rose by 15.38% in 2024 compared to the previous year, amounting to an astounding 105 h [2]. This statistic positions Istanbul as the global leader in traffic delays. Addressing these challenges is essential for achieving sustainable urban development and aligns with Sustainable Development Goal 11 (Sustainable Cities and Communities).
E-scooters are not only a mobility innovation but also a potential enabler of sustainable cities through energy-efficient and low-emission transport modes. Promoting micromobility systems such as e-scooters plays an important role in advancing sustainable transportation by reducing emissions, noise, and congestion. In response to these challenges, individuals facing delays in road traffic are actively seeking innovative solutions. Various micromobility systems, such as bicycles, e-bikes, and e-scooters, have gained traction in this context. E-scooters, in particular, have recently emerged as a popular transportation option in Türkiye. With several rental companies entering the market, the number of users has surged, making e-scooters a practical alternative for various transportation modes, especially over short distances.
However, the regulations governing e-scooter use in urban traffic remain underdeveloped. Notably, the “Electric Scooter Regulation” was published in the Official Gazette [3], and the Istanbul Metropolitan Municipality (İstanbul Büyükşehir Belediyesi—IBB) has formulated an e-scooter directive for Istanbul [4]. Similar initiatives are also underway in other cities across Türkiye.
In recent years, e-scooters have become a favored last-mile transportation option in Türkiye, particularly among young people. Nevertheless, regulations concerning the operating conditions and rules for e-scooters on highways have only recently begun to take shape. Consequently, many e-scooter users and other stakeholders in highway traffic find themselves operating vehicles without full awareness of these regulations. The published guidelines are often informed by examining comparable rules in other countries and are sometimes adapted to suit the Turkish context. However, since traffic conditions in each country are primarily influenced by driver behavior, these conditions can vary significantly. Therefore, it may be beneficial to establish tailored operating conditions for e-scooters specific to each country or city.
Central and local authorities should seek to understand public opinion regarding e-scooters, including user desires and usage patterns. This information is essential for crafting effective regulations. Additionally, insights into user attitudes, behaviors, and preferences are crucial for e-scooter sharing companies, as this understanding can inform their strategic and operational policies, enhance service quality, and increase market share. Well-regulated e-scooter services with improved quality may lead to increased demand, subsequently alleviating problems associated with motor vehicle traffic.
This study examined various preferences of e-scooter users concerning their usage and predicted mode choices under different conditions using machine learning techniques. An online survey was conducted with 462 individuals, either current or potential e-scooter users, residing in Istanbul. The survey results were analyzed utilizing Artificial Neural Networks (ANNs), Decision Trees (DTs), Random Forest (RF), and Gradient Boosting (GB) methods. The notable success rates of these models, particularly when compared to traditional discrete choice models, highlight their considerable potential as decision-support tools for government authorities and urban planners. This enhanced predictive capability enables more informed policy-making and resource allocation, ultimately leading to improved urban mobility solutions. Additionally, the findings offer valuable insights for shared e-scooter service providers, equipping them with practical strategies to refine their operations and enhance user engagement. By gaining a deeper understanding of user preferences and behaviors, these providers can better tailor their services to meet market demands, increase user satisfaction, and boost overall operational efficiency.
Numerous studies have examined e-scooter mode choice behavior using discrete choice models, such as multinomial logit, mixed logit, and nested logit [5,6,7,8,9,10,11,12,13,14,15]. However, significant gaps still exist that this study aims to address. Firstly, with a few exceptions [8], previous research has predominantly relied on traditional discrete choice frameworks, which typically achieve prediction accuracies below 60% when applied to micromobility mode choice. Secondly, most existing datasets consist either of revealed-preference data that offers limited attitudinal insights or stated-preference surveys featuring a small number of hypothetical scenarios (usually ranging from 4 to 9 per respondent), thus constraining the models’ ability to capture heterogeneity in preferences across a broader array of travel time and cost combinations. Thirdly, while machine learning approaches have occasionally been employed for general mode choice prediction, there has yet to be a systematic comparison of multiple ML algorithms (including ANN, DT, RF, and GB) specifically for e-scooter-inclusive mode choice in the literature. Fourthly, no prior research has integrated a large-scale stated-preference experiment (with 462 respondents across 24 orthogonal scenarios, resulting in 11,088 observations) within a megacity context, alongside detailed demographic, attitudinal, and infrastructural variables, to train and compare ML models. Lastly, the application of model outputs within a policy-oriented decision-support interface for municipal authorities and shared-micromobility operators—explicitly showcased as a practical outcome in Section 6—has not been explored in the existing e-scooter research.
The novelty of the present work therefore lies in four key aspects:
  • The development of a comprehensive, scenario-rich dataset specifically tailored to the traffic and cost conditions of Istanbul.
  • The first systematic application and performance evaluation of four widely adopted machine learning algorithms for predicting e-scooter-inclusive mode choice.
  • The attainment of significantly higher predictive accuracy (92.40% with DT) compared to previously reported discrete choice models.
  • The clear translation of the resulting high-performance model into actionable policy and operational recommendations, which includes potential integration into traffic simulation platforms and the creation of a user-friendly interface for planners and micromobility providers.
The second chapter of the study reviews literature on models utilized in mode choice and their applications in micromobility. The third chapter details the survey and data collection process, while the fourth chapter focuses on the machine learning methods employed. The fifth chapter presents the model development process and compares the performance of the various models. Finally, the sixth chapter summarizes and discusses the findings of the study.

2. Literature Review

In transportation planning, discrete choice models (DCMs) are widely employed to analyze mode selection. These models have been utilized across various contexts, including urban and intercity travel [16], freight transport [17], airport access preferences [18,19], traffic safety research [20], route selection decisions [21,22], and the evaluation of new transportation modes [23]. Among the various models, Binary Logit (BL), Multinomial Logit (MNL), and Nested Logit (NL) models are the most commonly used [16], while Mixed Logit (ML) models have increasingly gained traction since the early 2000s [23,24].
Discrete choice models (DCMs) are based on the assumption that individuals seek to maximize their utility [16]. Multinomial logistic (MNL) models assume that coefficients are homogeneous across individuals, while mixed logit (ML) models account for variations that reflect behavioral differences among users [25]. Although these methods are well-established and easy to interpret, they often face challenges in capturing the complex interactions between user characteristics, contextual conditions, and emerging micromobility behaviors, such as the use of e-scooters.
Recent studies have examined e-scooter mode choice across various contexts. Research by Reck et al. investigated user preferences for e-scooters and e-bikes in Zurich through MNL and ML frameworks, revealing a strong inclination toward micromobility solutions, particularly with parking available during peak periods—an approach that could effectively reduce car trips [5]. Zuniga-Garcia et al. highlighted the importance of infrastructure-focused modeling for e-scooters [26]. Additional research [6,7,27,28] indicates that usage patterns differ based on demographics and accessibility, with e-scooters primarily replacing short trips of under 15 min [6]. While these studies offer valuable insights, traditional discrete choice models often struggle to adequately capture individual-level variability or travel decisions that depend on specific scenarios [8].
Demographic analyses indicate that young, educated men constitute the primary users of e-scooters [9,29,30,31,32]. Furthermore, the types of transportation modes replaced by e-scooters differ by region, with journeys shorter than 8 km accounting for approximately half of these substituted trips [10]. Additional research underscores that e-scooters are often favored over walking or public transport for certain types of trips [24,25,28,29,30], and the availability of parking further enhances their appeal [5,13,14,15].
Recent studies extend beyond mode choice to explore environmental and ergonomic impacts. For instance, Karpenko et al. investigated the vertical vibrations experienced by e-scooter riders across different pavement types, revealing potential long-term musculoskeletal risks [33]. Similarly, Danilevičius et al. highlighted that traditional traffic noise models tend to underestimate peak noise exposure, emphasizing the necessity for dynamic mapping and infrastructure enhancements [34,35]. These studies illustrate both the strengths (such as experimental rigor and scenario-specific insights) and limitations (including restricted generalizability and a narrow focus on specific variables) of current research.
Table 1 provides a comprehensive summary of key studies on e-scooter mode choice, highlighting their methodologies, contexts, advantages, and limitations. This overview underscores both the contributions and shortcomings in the existing literature, thus informing the methodology adopted in the current study. By integrating demographic, behavioral, and scenario-based data, our approach effectively addresses these gaps and establishes a more adaptable and precise framework for predicting transportation preferences in urban environments.
There is a notable scarcity of studies that explicitly explore e-scooter mode choice from a sustainability perspective, revealing a significant area for further investigation. The present study seeks to fill this gap by combining scenario-based travel choices, demographic traits, and behavioral data to develop predictive machine learning models suitable for integration into traffic simulation and policy evaluation frameworks.

3. Methodology

An online survey was conducted using Google Forms as part of the study, which received approval from the Ethics Committee of Istanbul Okan University during meeting number 158 on 21 September 2022. The survey was administered between 1 September 2023, and 1 May 2024, and included 512 participants. However, in accordance with the requirements of the TÜBİTAK 1001 project (project number 123M063), responses from individuals outside of Istanbul were excluded, resulting in a final sample size of 462 participants.
This number exceeds the minimum required sample size of 384 for a population of over 500,000 in a heterogeneous context [36,37]. While definitive data on the total number of e-scooter users in Istanbul is lacking, one of the leading e-scooter sharing companies, Martı, reported through its founder, Oğuz Alper Öktem, that its user base surpassed 5 million as of 2021 [38]. Considering potential users who have yet to try e-scooters, it is estimated that the total population exceeds 500,000. Therefore, the sample size of 462 is deemed sufficient for this study.

3.1. Survey Design

The first section of the survey gathered demographic information about e-scooter users, including details such as gender, age, income level, education level, and car ownership.
The second section focused on questions related to e-scooter usage, covering the following aspects:
  • Whether there is a bike lane within 250 m of their residence.
  • Their primary mode of transportation for daily travel.
  • The primary purpose of their trips (e.g., work, school, health, shopping).
  • Their preferred type of infrastructure for riding e-scooters (e.g., roads, sidewalks, bike lanes).
  • Whether they currently use or would consider using e-scooters on sidewalks.
  • The transportation modes they currently replace or would consider replacing with e-scooters.
  • The traffic conditions under which they prefer to use e-scooters (e.g., light, moderate, congested).
  • The maximum duration, in minutes, they prefer for a one-way e-scooter trip.
  • Their perception of the safety of using e-scooters.
  • Their opinions on the use of e-scooters on sidewalks.
  • Whether they currently use or would use e-scooters on roads with one-way or two-way traffic.
  • Their views on parking locations for e-scooters.
  • Whether they currently park or would park e-scooters in designated parking spots.
  • Their level of compliance with traffic rules.
  • Whether cost or safety is the primary factor influencing their preference for e-scooters.
  • The maximum hourly cost (in Turkish Lira–TL) they find acceptable for using e-scooters.
In the third and final section of the survey, respondents were presented with scenarios that included varying travel times and costs for private cars, public transportation, taxis, and e-scooters. For each scenario, participants were asked to indicate their preferred mode of transportation. These scenarios were based on a 4 km distance along Bağdat Avenue, specifically between the neighborhoods of Bostancı and Selamiçeşme in Istanbul.
Because participants were not familiar with traffic engineering terms, traffic conditions were described using three verbal categories: light, moderate, and congested. Travel times for these conditions along the 4 km route were obtained from Google Maps. Figure 1 provides a screenshot of the measurements, and Table 2 presents the travel times assigned to the four transportation modes.
Two levels have been established for travel costs across four modes of transportation: the current costs and the increased costs. The increase has been set at 65%, reflecting the inflation rate for 2023. The calculation of fees for a 4 km journey has been conducted as follows:
For Private Car:
F privatecar = BF ( TL / km ) × BS ( L / km ) × 100 / 4 + P
In Equation (1), Fprivatecar represents the travel cost for using a private car. The variables involved are as follows: BF is the price of one liter of gasoline in Turkish Lira (TL), BS is the average fuel consumption in liters per kilometer, and P is the average parking fee for a round trip per hour. In 2023, BF was set at 34.48 TL [40], BS was 8 L, and P was 52 TL [41].
For Public Transport:
In Istanbul, the price of a full ticket fare is 15 TL [42].
For Taxi:
F taxi =   A + M × x   ( km ) + Z × t ( h )
In Equation (2), Ftaxi denotes the taxi fare, where A represents the starting fee, M is the charge per kilometer, Z is the hourly rate, x is the distance traveled in kilometers, and t is the travel time in hours. In 2023, the rates in Istanbul were set as follows: A was 19.17 TL, M was 13.75 TL, and Z was 152.73 TL [43].
For E-scooter:
F escooter =   B + C × d ( min )
In Equation (3), Fescooter represents the fare for e-scooter rides, where B is the starting fee, C is the per-minute usage charge, and d is the duration of usage in minutes.
The average fare for a 4 km journey was determined by calculating the prices from three different companies. Based on this, a journey at the legal speed limit of 15 km/h would take approximately 16 min. In 2023, Binbin and Hop priced this distance at 75 TL, whereas Martı charged 82 TL. The mean fare of 77 TL was rounded down to 75 TL to make survey responses easier. The fares presented in Table 3 were derived from Equations (1)–(3).
When assessing the number of scenarios, it is crucial to recognize that accounting for the combinations of travel time levels across various modes of transportation proves challenging. This is because cars, taxis, and public transit modes will be similarly influenced at each traffic speed level. In essence, while one mode may be traveling swiftly, the others cannot simultaneously be moving at a significantly slower pace. The e-scooter’s speed is held constant at 15 km/h across all congestion levels, consistent with the national legal speed limit for such vehicles. As illustrated in Table 2, the travel time remains consistent across different traffic conditions, calculated at 16 min.
However, when examining travel cost levels, combinations among the modes are feasible. Utilizing a full factorial experimental design for each travel time [14], the total number of scenarios would be calculated as 24 × 1 = 24. Consequently, the overall total becomes 24 × 3 = 72. Nevertheless, this substantial number could overwhelm respondents, prompting the use of an alternative experimental design to streamline the total.
In this case, a fractional factorial design was employed, resulting in a total of 24 scenarios for respondents to address. The fractional factorial design is typically employed when a full factorial design generates an unmanageable number of scenarios. It retains the orthogonality characteristic of the full factorial design while significantly reducing the total combinations. In this case, a ½ fraction was used, indicating that only half of the possible variable combinations from the full factorial design were included. The scenarios shown to survey participants are listed in Table 4.
While the survey did not directly measure environmental attitudes, questions related to micromobility usage indirectly reflect participants’ sustainable mobility preferences.

3.2. Survey Analysis

Table 5 presents the percentage distribution of survey participants categorized by gender, driving license possession, income level, preferred infrastructure for e-scooter usage, and their perceptions of e-scooter safety. The gender distribution among participants is nearly balanced. A significant majority possess a driving license, whereas only about 30% of respondents own a car. In terms of income, most participants report having no income or earning at a minimum wage level, primarily because 60% of the survey respondents are students. When asked about their preferred infrastructure for e-scooter use, a substantial majority (68.3%) favor bike lanes.
However, the survey responses reveal that e-scooters are not regarded as very safe, with only approximately 8% of participants considering them safe or very safe. This perception is likely influenced by the absence of dedicated bike lanes in Istanbul, as e-scooters primarily operate on main roads and sidewalks, which raises safety concerns.

4. Machine Learning Models

This section outlines the theoretical foundations and mathematical formulations of four machine learning methods used in this study: Artificial Neural Networks, Decision Trees, Random Forest, and Gradient Boosting. Each method is explained with its operational mechanism and mathematical principles.

4.1. Artificial Neural Networks

ANNs mimic the structure of biological neural networks and are used to model intricate, non-linear data relationships in both classification and regression applications. They are built from linked neurons structured into an input layer, one or more hidden layers, and a final output layer.
The input layer receives feature vectors x = [x1, x2, …, xn]. In each hidden layer, neurons take the weighted input values, add a bias, and process them through an activation function to enable non-linearity, while the output layer provides the ultimate prediction or classification. Training of ANNs is carried out using backpropagation and gradient descent, aimed at minimizing a specified loss function.
For a neuron in layer l, the weighted sum is computed as:
z i ( l ) = j w i j ( l ) a j ( l 1 ) + b i ( l )
where w i j ( l ) are weights, a j ( l 1 ) are activations from the previous layer, and b i ( l ) is the bias. The activation is:
a i ( l ) = g z i ( l )
where g(·) is an activation function, such as Rectified Linear Unit (ReLU) (g(z) = max(0,z)) or sigmoid g z = 1 1 + e z . The loss function for regression typically is mean squared error (MSE):
L = 1 N i = 1 N y i y ^ i 2
For classification, cross-entropy loss is used:
L = 1 N i = 1 N y i log y ^ i + 1 y i log 1 y ^ i
Weights are updated via gradient descent:
w i j w i j η L w i j
where η is the learning rate [44].

4.2. Decision Trees

DTs are models that segment the input space into distinct regions based on feature values. They apply to both regression and classification tasks. The internal nodes of the tree represent splits determined by different features, while the leaf nodes indicate the output classes or values. The root node encompasses the entire dataset. Classification trees rely on metrics such as Gini impurity and information gain to create splits, whereas regression trees use variance reduction Additionally, pruning can be employed to simplify the model.
For classification, Gini impurity is calculated as:
G i n i = 1 i = 1 C p i 2
where pi is the proportion of class i, and C is the number of classes. Information gain, based on entropy, is:
E n t r o p y = i = 1 C p i log 2 p i
I n f o r m a t i o n   G a i n = E n t r o p y ( p a r e n t ) j N j N E n t r o p y ( c h i l d j )
where Nj is the number of samples in child node j, and N is the total number of samples. For regression, variance reduction is used:
V a r i a n c e = 1 N i = 1 N p i y i y ¯ 2
Splits are chosen to minimize the weighted variance of child nodes [45].

4.3. Random Forest

Random Forest is an ensemble technique that aggregates the outputs of multiple decision trees to enhance prediction performance. It employs bootstrap aggregating (bagging) and introduces feature randomness into the process.
Multiple bootstrap samples are created from the training data, with each decision tree being trained on a different sample. At each split in the tree, a random subset of features is considered (typically √p, where p is the total number of features). For classification tasks, the final predictions from the trees are determined through majority voting, while for regression tasks, the predictions are averaged.
Given B bootstrap datasets D1, D2, …, DB, each tree Tb (for b = 1, 2,…, B) outputs a prediction y b ^ ( x ) . The final prediction is:
  • Classification:
    y ^ ( x ) = m o d e y ^ 1 x , y ^ 2 x , , y ^ B x
  • Regression:
    y ^ ( x ) = 1 B b = 1 B y ^ b x
Out-of-bag (OOB) error is estimated using samples excluded from each tree’s bootstrap sample [46].

4.4. Gradient Boosting

GB is an ensemble method that builds decision trees in a sequential manner, where each tree aims to rectify the errors made by its predecessors by minimizing a loss function through gradient descent.
The algorithm begins with a constant prediction and then iteratively adds trees that predict the negative gradient of the loss function. The final output is generated by combining the predictions from all trees, each weighted by a learning rate [47].
For regression, the loss function is typically MSE:
L ( y , y ^ ) = 1 2 i = 1 N y i y ^ i 2
For classification, log-loss is used:
L y , y ^ = i = 1 N y i log p i + ( 1 y i ) log ( 1 y i )
where p i = 1 1 + e y ^ i . The model is initialized with F0(x), such as the mean for regression. For each tree m = 1, 2, …, M:
  • Compute pseudo-residuals:
    r i m = L y i , F m 1 x i F m 1 x i
  • Fit a decision tree hm(x) to predict rim.
  • Update the model:
    F m x = F m 1 x + η h m x
    where η is the learning rate. The final prediction is:
    y ^ x = F M x = F 0 x + η m = 1 M h m x

5. Model Development and Comparison of Model Performances

In this study, the input data for the developed models were derived from two sections of the survey: the “Demographic Characteristics” section and the “Opinions and Preferences Regarding E-Scooters” section. Additionally, travel times and costs defined in the scenarios (see Table 3) were utilized as inputs for the models.
Among the demographic characteristics, age was treated as a continuous variable. It was normalized to a range between 0 and 1 by dividing each participant’s age by 100, and a single input neuron represented this. Similarly, the number of household members and the number of cars in the household were normalized by dividing by 10, resulting in values between 0 and 1, with each represented by a single input.
Questions regarding ownership of a driving license, private car, and e-scooter were binary (yes/no) and represented by a single input value. A dummy variable was used, where 1 indicated “yes” and 0 indicated “no.”
For questions that include more than two response options—such as education level, income level, and perceptions of safety—one-hot encoding was utilized. Each category was assigned a separate input, with the selected category marked as 1 and all others as 0. For instance, in the case of gender, if a participant identified as male, the first input for gender would be set to 1, while the other two inputs would be set to 0. Conversely, if a participant identified as female, the second input would be 1, with the remaining inputs set to 0. If a participant chose not to disclose their gender, the third input would be set to 1, and the others would be 0. Similarly, for ordinal variables like income level and safety perceptions, one-hot encoding was applied, allowing the model to process all response options without imposing a numerical order.
Due to the numerous alternatives available, variables such as the participant’s district of residence and profession detail were not included in the models.
In the “Opinions and Preferences Regarding E-Scooters” section, six questions with binary responses (yes/no) were represented by a single input each. For the remaining 17 questions, the number of inputs corresponded to the number of selectable answers for each question.
The study defined eight scenarios, each representing different traffic conditions (light, moderate, or congested), as individual data points. Consequently, for each participant, the combination of three traffic conditions and eight scenarios resulted in a total of 24 different scenarios. Participants were asked to choose their preferred mode of transportation—options included private car, taxi, e-scooter, or public transportation—under each scenario. The costs and travel times for each transportation mode were also specified for the different conditions, with the last eight input values reflecting these variables. These values were normalized to a range of 0 to 1 by dividing the cost values by 200 and the time values by 60.
As a result, each model utilized 126 inputs. All 126 input variables used in the models, along with their type and encoding method, are summarized in Table 6. The model’s output indicates the preferred mode of transportation for a user, given the current conditions. Since there are four possible modes, a separate output was defined for each. The output corresponding to the chosen category receives a value of 1, while all other categories are assigned a value of 0.
The models were therefore designed to predict four outputs based on 126 inputs. With 462 participants evaluating 24 scenarios each, a total of 11,088 data points were generated. For all models, the data was split into three parts: 70% was used for training, 15% for testing, and the remaining 15% for validation.
The study employed ANN, DT, RF, and GB algorithms to forecast participants’ transportation preferences across various scenarios and conditions. In the revised analysis, the dataset was segmented by participants, ensuring that all 24 responses from any given individual were assigned exclusively to either the training, validation, or test set. The data was divided, allocating 70% for training, 15% for testing, and 15% for validation.
As detailed in Table 7, models were developed for five distinct datasets, with the model exhibiting the highest performance selected as the base model. The predictive capabilities of these models were evaluated using all 11,088 data points while preserving the integrity of each participant’s responses by confining them to a single data split. This modification facilitates a more accurate assessment of model performance, thereby better reflecting true generalization to unseen participants.
The ANN model is composed of 126 input neurons and four output neurons. The number of hidden layers, along with the number of neurons in each layer, was optimized using a grid search method. Various configurations were evaluated, including one or two hidden layers, with each layer containing between 1 and 50 neurons. The architecture of the developed ANN is depicted in Figure 2. The green color represents the neurons corresponding to the demographic characteristics in the input layer; the yellow color represents the opinions and preferences regarding e-scooters; the light blue color represents the scenarios; the pink color represents the neurons in the hidden layer; and the dark blue color represents the neurons in the output layer. The numbers indicate the index of each neuron within its respective layer.
A three-way data split method was utilized to mitigate overfitting and to determine the optimal network architecture. Figure 3 illustrates the flowchart of this three-way data split approach. Initially, the data was randomly shuffled and divided into three distinct sets: training (70%), testing (15%), and validation (15%). The assessment of the most effective network architecture was performed based on the dataset combinations detailed in Table 7. The optimal architecture for each ANN model was chosen based on the MSE performance metric.
The most effective architecture for the ANN model comprised 126 neurons in the input layer, two hidden layers each containing 50 neurons, and four output neurons.
To assess the performance of the developed models, we first calculated the True Positive (TP), True Negative (TN), False Positive (FP), and False Negative (FN) values for each class. These terms can be defined as follows:
  • TP (True Positive) → Correctly predicting the positive class
  • TN (True Negative) → Correctly predicting the negative class
  • FP (False Positive) → Incorrectly predicting the negative class as positive
  • FN (False Negative) → Incorrectly predicting the positive class as negative
Using these values, we computed the support, weight, as well as the macro and weighted accuracy, precision, recall, and F1-score metrics for each class. The formulas employed in these calculations can be found in Equations (20)–(29).
s u p p o r t i = T P i + F N i
w i = T P i + F N i N
A c c u r a c y m a c r o = i = 1 K T P i N
A c c u r a c y w e i g h t e d = i = 1 K w i T P i N
P r e c i s i o n m a c r o = 1 K i = 1 K T P i T P i + F P i
P r e c i s i o n w e i g h t e d = i = 1 K w i T P i T P i + F P i
R e c a l l m a c r o = 1 K i = 1 K T P i T P i + F N i
R e c a l l w e i g h t e d = i = 1 K w i T P i T P i + F N i
F 1 m a c r o = 1 K i = 1 K 2 P r e c i s i o n i × R e c a l l i P r e c i s i o n i + R e c a l l i
F 1 w e i g h t e d = i = 1 K w i 2 P r e c i s i o n i × R e c a l l i P r e c i s i o n i + R e c a l l i
Table 8 and Table 9 present the confusion matrices and performance metrics of the four developed models—ANN, DT, RF, and CB—used to predict users’ preferred travel mode: Private Car, Public Transport, Taxi, and E-scooter.
Table 8 and Table 9 illustrate the imbalance within the dataset: the Taxi class has the fewest observations, with a support of 640 and a weight of 0.058. In contrast, the Public Transport class (5186 observations, weight = 0.467) and E-scooter class (3663 observations, weight = 0.330) represent the majority. Meanwhile, the Private Car class is a minority, comprising 1599 observations and a weight of 0.144.
The confusion matrices reveal distinct patterns of misclassification across the models. For instance, the ANN model shows substantial confusion between the majority classes (Public Transport and E-scooter) and the minority classes (Taxi and Private Car). Notably, many trips classified as Private Car are misidentified as E-scooter (184 instances) or Taxi (640 instances), resulting in a relatively low macro recall of 0.509 and a macro F1-score of 0.505. In contrast, the weighted metrics are higher—reflecting an accuracy of 0.872 and a weighted recall of 0.704—due to the model’s better performance on the classes with higher support.
DT model demonstrates the best overall performance as shown with bold in Table 9, among the evaluated methods, achieving the highest macro accuracy of 0.924 and a macro F1-score of 0.829. Its confusion matrix reflects strong discrimination across all travel modes, particularly for the low-support taxi class, which is classified more consistently than in the cases of ANN, RF, and CB. The weighted metrics, including an accuracy of 0.897 and a recall of 0.890, indicate robust performance across both majority and minority classes.
RF exhibits moderate performance, with a macro recall of 0.553 and a macro F1-score of 0.575, which highlights persistent misclassifications among private car, e-scooter, and taxi trips. While its weighted metrics show better results—accuracy of 0.827 and recall of 0.723—this is largely due to the correct identification of majority classes, revealing ongoing challenges with the minority taxi class.
CB model presents mixed results. Its macro precision of 0.706 and macro recall of 0.725 suggest competent performance for balanced evaluations. However, the low weighted recall of 0.524 signifies difficulties with the minority taxi class and some misclassification of private car trips. The confusion matrix indicates that taxi trips are especially vulnerable to being misclassified as other modes, aligning with its low support.
In summary, the DT model emerges as the most balanced and reliable performer across all evaluation metrics. All four methods significantly surpassed the 54% success rate reported by Günay et al. [21], who utilized the same dataset, but the DT model stands out as the most effective approach among those assessed in this study. Both ANN and RF achieved reasonable weighted scores yet struggled with minority classes, while CB performed well in macro-level metrics but remained sensitive to class imbalances in weighted evaluations.

6. Discussions and Conclusions

An online survey was conducted as part of the study, garnering responses from 512 participants, of whom 462 residents of Istanbul were included in the analysis. The survey comprised three sections: the first section collected demographic information about the participants, the second focused on e-scooter usage, and the third presented various scenarios along with questions regarding the transportation modes the participants would choose in those situations.
To develop predictive models for transportation mode choices based on demographic characteristics, e-scooter usage, and varying scenario conditions, we utilized methodologies such as ANN, DT, RF, and CB. All models significantly surpassed the 54% accuracy rate reported for discrete choice models by Günay et al. [21], which utilized the same dataset. Among the models assessed, the Decision Tree (DT) exhibited the highest accuracy and overall performance, demonstrating its effectiveness in capturing travel mode preferences across diverse user types and scenarios.
The models developed are crucial in traffic simulation, particularly for analyzing the impacts of e-scooters on urban traffic patterns. They have the capability to generate synthetic populations and simulate distributions, thereby predicting transportation preferences under various conditions. The resulting outputs can be seamlessly integrated into traffic simulation software, enhancing our understanding and facilitating the evaluation of alternative future scenarios.
By adding a user-friendly interface, this model can become accessible to both central and local governments. It will help decision-makers predict how the public might react to strategic transportation or pricing policies. For example, it can show how transportation choices might change if public transit prices rise. Additionally, the model can evaluate the potential effects of initiatives aimed at increasing micromobility usage, especially for e-scooters, which can help reduce the negative environmental impacts of transportation. This makes the model a valuable tool for policymakers.
The intuitive interface will help companies that offer shared e-scooter services predict potential returns on investment. It can also help shape policies to improve customer satisfaction. This study used data from e-scooter users in Istanbul, but its methods and findings may be relevant to other urban areas. The model development process includes demographic information, specific travel times and costs for different modes, infrastructure conditions, and user perceptions. This approach offers a way to predict transportation mode preferences in cities with similar micromobility shares or those that are growing.
The impressive accuracy of machine learning methods, particularly DT, in forecasting mode choices indicates that these models can serve as dependable tools for public authorities and private micromobility providers. By applying the same survey design and model structure in different cities, the models can capture local user profiles, travel behaviors, and environmental factors. This strategy would enable cities to anticipate user responses to changes in pricing, infrastructure investments, or regulations.
Moreover, by incorporating additional variables such as population density, topography, and climate, future iterations of the model could be tailored for various urban settings. The outcomes from these models could be seamlessly integrated into traffic simulation platforms to evaluate policy scenarios and devise effective mobility strategies. Furthermore, developing a web-based, user-friendly interface would empower city officials and operators to explore the implications of various transportation decisions in real-time interactively.
Furthermore, future efforts will emphasize enhancing the interpretability of machine learning models. While this study has prioritized prediction accuracy, grasping the key factors influencing mode choice is still crucial. We intend to assess the impact of demographic, attitudinal, and scenario variables through feature importance analyses and other innovative or advanced machine learning techniques. This approach aims to deliver actionable insights while preserving robust predictive performance. Ultimately, this will enable decision-makers and micromobility providers to gain a clearer understanding of user behavior, informing both policy and operational strategies.
In conclusion, although this study is anchored in Istanbul, its methods and insights are broadly applicable and adaptable. Future research involving multi-city data collection and cross-validation will enhance the transition of this work into a robust, general-purpose micromobility planning tool. In future work, incorporating sustainability indicators—such as emission reduction, energy efficiency, and shared mobility potential—could further strengthen the link between user preferences and environmental outcomes.

Author Contributions

Conceptualization, S.D. and S.A.; methodology, S.D.; software, S.A.; validation, S.A.; formal analysis, S.D. and S.A.; investigation, S.A.; resources, S.D.; data curation, S.D.; writing—original draft preparation, S.D.; writing—review and editing, S.A.; visualization, S.D.; supervision, S.A.; project administration, S.D.; funding acquisition, S.D. All authors have read and agreed to the published version of the manuscript.

Funding

This study was carried out within the scope of the project numbered 123M063, supported by TÜBİTAK under the Scientific and Technological Research Projects Support Program. The researchers express their gratitude to TÜBİTAK for its support.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Institutional Review Board of Ethics Committee of Istanbul Okan University (158, 21 September 2022).

Informed Consent Statement

Informed consent was obtained from the subjects involved in the study.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

During the preparation of this manuscript, the authors used Grammarly Pro for the purposes of improving the language of this text. The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
ANNArtificial Neural Network
BLBinary Logit
DCMDiscrete Choice Model
DTDecision Trees
GBGradient Boosting
IBBİstanbul Büyükşehir Belediyesi (Istanbul Metropolitan Municipality)
MLMixed Logit
MNLMultinomial Logit
MSEMean Squared Error
NLNested Logit
OOBOut-of-bag
RFRandom Forest
TLTurkish Lira
UKUnited Kingdom
USUnited States

References

  1. Türkiye İstatistik Kurumu (TÜİK). Adrese Dayalı Nüfus Kayıt Sistemi Sonuçları, 2023. Available online: https://data.tuik.gov.tr/Bulten/Index?p=Adrese-Dayali-Nufus-Kayit-Sistemi-Sonuclari-2023-49684 (accessed on 21 January 2025).
  2. Pishue, B. INRIX Global Traffic Scorecard Reveals Return to Office Drives Congestion and Trips to Downtown. 2025. Available online: https://inrix.com/blog/inrix-global-traffic-scorecard-reveals-return-to-office-drives-congestion-and-trips-to-downtown/ (accessed on 21 January 2025).
  3. Resmi Gazete, T.C. Elektrikli Skuter Yönetmeliği; 14 April 2021, No. 31454; T.C. Cumhurbaşkanlığı İdari İşler Başkanlığı Hukuk ve Mevzuat Genel Müdürlüğü: Ankara, Türkiye. Available online: https://www.resmigazete.gov.tr/eskiler/2021/04/20210414-3.htm (accessed on 21 January 2025).
  4. İstanbul Büyükşehir Belediyesi (İBB); Ulaşım Yönetim Merkezi (UYM). Ukome’den Scootere Çekidüzen. 2021. Available online: https://uym.ibb.gov.tr/kurumsal/haberler-ve-duyurular/ukome-den-scootere-çekidüzen (accessed on 21 January 2025).
  5. Reck, D.J.; Haitao, H.; Guidon, S.; Axhausen, K.W. Explaining shared micromobility usage, competition and mode choice by modelling empirical data from Zurich, Switzerland. Transp. Res. Part C Emerg. Technol. 2021, 124, 102947. [Google Scholar] [CrossRef]
  6. Psarrou Kalakoni, A.M.; Christoforou, Z.; Gioldasis, C. The ambivalent relationship of e-scooters and public transport: Evidence from France. Transp. Policy 2024, 159, 215–229. [Google Scholar] [CrossRef]
  7. Esztergár-Kiss, D.; Tordai, D.; Lopez Lizarraga, J.C. Assessment of travel behavior related to e-scooters using a stated preference experiment. Transp. Res. Part A Policy Pract. 2022, 166, 389–405. [Google Scholar] [CrossRef]
  8. Günay, G.; Dündar, S.; Dilekçi, S. Effects of automobile ownership on e-scooter choices: The Istanbul case. Cities 2025, 165, 106161. [Google Scholar] [CrossRef]
  9. Laa, B.; Leth, U. Survey of E-scooter users in Vienna: Who they are and how they ride. J. Transp. Geogr. 2020, 89, 102874. [Google Scholar] [CrossRef]
  10. Şengül, B.; Mostofi, H. Impacts of e-micromobility on the sustainability of urban transportation—A systematic review. Appl. Sci. 2021, 11, 5851. [Google Scholar] [CrossRef]
  11. Kopplin, C.S.; Brand, B.M.; Reichenberger, Y. Consumer acceptance of shared e-scooters for urban and short-distance mobility. Transp. Res. Part D Transp. Environ. 2021, 91, 102680. [Google Scholar] [CrossRef]
  12. Luo, H.; Zhang, Z.; Gkritza, K.; Cai, H. Are shared electric scooters competing with buses? A case study in Indianapolis. Transp. Res. Part D Transp. Environ. 2021, 97, 102877. [Google Scholar] [CrossRef]
  13. Leger, S.J.; Dean, J.L.; Edge, S.; Casello, J.M. If I had a regular bicycle, I wouldn’t be out riding anymore: Perspectives on the potential of e-bikes to support active living and independent mobility among older adults in Waterloo, Canada. Transp. Res. Part A Policy Pract. 2019, 123, 240–254. [Google Scholar] [CrossRef]
  14. James, O.; Swiderski, J.I.; Hicks, J.; Teoman, D.; Buehler, R. Pedestrians and e-scooters: An initial look at e-scooter parking and perceptions by riders and non-riders. Sustainability 2019, 11, 5591. [Google Scholar] [CrossRef]
  15. Lee, M.; Chow, J.Y.J.; Yoon, G.; He, B.Y. Forecasting e-scooter substitution of direct and access trips by mode and distance. Transp. Res. Part D Transp. Environ. 2021, 96, 102892. [Google Scholar] [CrossRef]
  16. Ben-Akiva, M.; Lerman, S. Discrete Choice Analysis: Theory and Application to Travel Demand; MIT Press: Cambridge, MA, USA, 1985. [Google Scholar] [CrossRef]
  17. Günay, G.; Ergün, G.; Gökaşar, I. Conditional Freight Trip Generation Modelling. J. Transp. Geogr. 2016, 54, 102–111. [Google Scholar] [CrossRef]
  18. Gökaşar, I.; Günay, G. Mode choice behavior modeling of ground access to airports: A case study in Istanbul, Turkey. J. Air Transp. Manag. 2017, 59, 1–7. [Google Scholar] [CrossRef]
  19. Günay, G.; Gökaşar, I. Market segmentation analysis for airport access mode choice modeling with mixed logit. J. Air Transp. Manag. 2021, 91, 102001. [Google Scholar] [CrossRef]
  20. Mo, B.; Wang, Q.Y.; Moody, J.; Shen, Y.; Zhao, J. Impacts of subjective evaluations and inertia from existing travel modes on adoption of autonomous mobility on-demand. Transp. Res. Part C Emerg. Technol. 2021, 130, 103281. [Google Scholar] [CrossRef]
  21. Yang, H.; Kitamura, R.; Jovanis, P.P.; Vaughn, K.M.; Abdel-Aty, M.A. Exploration of route choice behavior with advanced traveler information using neural network concepts. Transportation 1993, 20, 199–223. [Google Scholar] [CrossRef]
  22. Prato, C.G. Route choice modeling: Past, present and future research directions. J. Choice Model. 2009, 2, 65–100. [Google Scholar] [CrossRef]
  23. Ilahi, A.; Belgiawan, P.F.; Balac, M.; Axhausen, K.W. Understanding travel and mode choice with emerging modes: A pooled SP and RP model in Greater Jakarta, Indonesia. Transp. Res. Part A Policy Pract. 2021, 150, 398–422. [Google Scholar] [CrossRef]
  24. McFadden, D.; Train, K. Mixed MNL Models for Discrete Response. J. Appl. Econom. 2000, 15, 447–470. [Google Scholar] [CrossRef]
  25. Hensher, D.A.; Greene, W.H. The mixed logit model: The state of practice. Transportation 2003, 30, 133–176. [Google Scholar] [CrossRef]
  26. Zuniga-Garcia, N.; Ruiz Juri, N.; Perrine, K.A.; Machemehl, R.B. E-scooters in urban infrastructure: Understanding sidewalk, bike lane, and roadway usage from trajectory data. Case Stud. Transp. Policy 2021, 9, 983–994. [Google Scholar] [CrossRef]
  27. Zuniga-Garcia, N.; Tec, M.; Scott, J.G.; Machemehl, R.B. Evaluation of e-scooters as transit last-mile solution. Transp. Res. Part C Emerg. Technol. 2022, 139, 103660. [Google Scholar] [CrossRef]
  28. Cao, Z.; Zhang, X.; Chua, K.; Yu, H.; Zhao, J. E-scooter sharing to serve short-distance transit trips: A Singapore case. Transp. Res. Part A Policy Pract. 2021, 147, 177–196. [Google Scholar] [CrossRef]
  29. Abouelela, M.; Al Haddad, C.; Antoniou, C. Are young users willing to shift from carsharing to scooter–sharing? Transp. Res. Part D Transp. Environ. 2021, 95, 102821. [Google Scholar] [CrossRef]
  30. Christoforou, Z.; Gioldasis, C.; de Bortoli, A.; Seidowsky, R. Who is using e-scooters and how? Evidence from Paris. Transp. Res. Part D Transp. Environ. 2021, 92, 102708. [Google Scholar] [CrossRef]
  31. Nikiforiadis, A.; Paschalidis, E.; Stamatiadis, N.; Raptopoulou, A.; Kostareli, A.; Basbas, S. Analysis of attitudes and engagement of shared e-scooter users. Transp. Res. Part D Transp. Environ. 2021, 94, 102790. [Google Scholar] [CrossRef]
  32. Esztergár-Kiss, D.; Lopez Lizarraga, J.C. Exploring user requirements and service features of e-micromobility in five European cities. Case Stud. Transp. Policy 2021, 9, 1531–1541. [Google Scholar] [CrossRef]
  33. Karpenko, K.; Prentkovskis, O.; Skačkauskas, P. Analysing the impact of electric kick-scooters on drivers: Vibration and frequency transmission during the ride on different types of urban pavements. Eksploat. I Niezawodn.—Maint. Reliab. 2025, 27, 199893. [Google Scholar] [CrossRef]
  34. Danilevičius, A.; Danilevičienė, I.; Karpenko, M.; Stosiak, M.; Skačkauskas, P. Determination of the Instantaneous Noise Level Using a Discrete Road Traffic Flow Method. Promet—TrafficTransportation 2025, 37, 71–85. [Google Scholar] [CrossRef]
  35. Danilevičius, A.; Karpenko, M.; Křivánek, V. Research on the Noise Pollution from Different Vehicle Categories in the Urban Area. Transport 2023, 38, 1–11. [Google Scholar] [CrossRef]
  36. Fox, N.; Hunn, A.; Mathers, N. Sampling and Sample Size Calculation; The NIHR Research Design Service for the East Midlands, MHS. 2007. Available online: https://www.researchgate.net/profile/Mansour-Abdullah-Alshehri/post/How_to_determine_population_and_sample_size_for_a_survey_research/attachment/5aa18dfc4cde266d58902255/AS%3A601990316974080%401520537084240/download/Sampling+Sample+Size+Calculation.pdf (accessed on 21 January 2025).
  37. Meyer, J.T. Fundamental Research Statistics for the Behavioural Sciences; Holt, Rinehart & Winston: Saint Louis, MO, USA, 1979. [Google Scholar]
  38. Muradoğlu, C. Elektrikli Scooter Paylaşım Girişimi Marti’nin Kullanıcı Sayısı 5 Milyonu Geçti. 2021. Available online: https://webrazzi.com/2021/01/14/elektrikli-scooter-paylasim-girisimi-marti-nin-kullanici-sayisi-5-milyonu-gecti/ (accessed on 12 December 2024).
  39. Google. Google Maps Directions for Driving from Bostancı to Selamiçeşme. 2025. Available online: https://www.google.com.tr/maps (accessed on 21 January 2025).
  40. OPET. OPET Benzin Fiyatları Arşivi. 2023. Available online: https://www.opet.com.tr/akaryakit-fiyatlari-arsivi (accessed on 21 January 2025).
  41. NTV. İSPARK Ücretleri Ne Kadar Oldu? 2023. Available online: https://www.ntv.com.tr/turkiye/ispark-ucretleri-ne-kadar-oldu-iste-ekim-2023-guncel-ispark-tarifesi,aukiggueC0aes4uOzs8WNg (accessed on 21 January 2025).
  42. İstanbul Büyükşehir Belediyesi (İBB). İstanbul Büyükşehir Belediyesi Ücret Tarifesi. 2023. Available online: https://tuhim.ibb.gov.tr/media/2571/uecret-yayını.pdf (accessed on 21 January 2025).
  43. İstanbul Büyükşehir Belediyesi (İBB). İstanbul Büyükşehir Belediyesi Taksi Ücret Tarifesi. 2023. Available online: https://tuhim.ibb.gov.tr/media/2573/taksi-yayınlama.pdf (accessed on 21 January 2025).
  44. Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; MIT Press: Cambridge, MA, USA, 2016. [Google Scholar]
  45. Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning; Springer: New York, NY, USA, 2009. [Google Scholar]
  46. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  47. Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
  48. Murat, Y.Ş. Comparison of fuzzy logic and artificial neural networks approaches in vehicle delay modeling. Transp. Res. Part C Emerg. Technol. 2006, 14, 316–334. [Google Scholar] [CrossRef]
Figure 1. Screenshot of Travel Time Measurements for the 4 km Route Between Bostancı and Selamiçeşme, 20 minutes by automobile at the moment of the screenshot [39].
Figure 1. Screenshot of Travel Time Measurements for the 4 km Route Between Bostancı and Selamiçeşme, 20 minutes by automobile at the moment of the screenshot [39].
Sustainability 17 11088 g001
Figure 2. The architecture of the developed ANN.
Figure 2. The architecture of the developed ANN.
Sustainability 17 11088 g002
Figure 3. A flowchart illustrating the three-way data split validation method [48].
Figure 3. A flowchart illustrating the three-way data split validation method [48].
Sustainability 17 11088 g003
Table 1. Summary of key studies on e-scooter mode choice.
Table 1. Summary of key studies on e-scooter mode choice.
StudyMethodsContextAdvantagesLimitations
Reck et al. [5]MNL, MLE-scooter vs e-bikeIdentified parking preference, peak period patternsLimited to one city, short-term trips only
Zuniga-Garcia et al. [26]DCMInfrastructure planningHighlights need for infrastructure-focused modelsDoes not model user behavior heterogeneity
Psarrou Kalakoni et al. [6]Survey analysisFranceIdentifies short trips replaced by e-scootersLimited scenario variability
Esztergár-Kiss et al. [7]Survey & DCMMultiple citiesCompares cost, efficiency, travel time preferencesTraditional DCM limitations for complex patterns
Günay et al. [8]DCMIstanbulProvides baseline accuracy for mode choice prediction, establishes reference for micromobility behaviorLimited to discrete choice modeling, does not explore machine learning approaches or scenario-based analysis
Karpenko et al. [33]Field
experiments
Pavement
vibration
Detailed ergonomic assessmentLimited focus on mode choice
Danilevičius et al. [34,35]Noise
modeling
Urban trafficCaptures instantaneous peaksNot integrated with mode choice models
Table 2. Travel Times of Modes (minutes).
Table 2. Travel Times of Modes (minutes).
Light TrafficModerate TrafficCongested Traffic
Private Car92030
Public Transport152945
Taxi92030
E-scooter161616
Table 3. Travel costs of the modes (TL).
Table 3. Travel costs of the modes (TL).
Base FareIncreased Fare
Private Car65105
Public Transport1525
Taxi75125
E-scooter75125
Table 4. The prepared scenarios.
Table 4. The prepared scenarios.
Private CarTaxiE-ScooterPublic Transport
Fare (TL)Time (Min)Fare (TL)Time (Min)Fare (TL)Time (Min)Fare (TL)Time (Min)
11-Light10591259125162515
21-Moderate1052012520125162529
31-Congested1053012530125162545
42-Light1059125975161515
52-Moderate105201252075161529
62-Congested105301253075161545
73-Light1059759125161515
83-Moderate105207520125161529
93-Congested105307530125161545
104-Light105975975162515
114-Moderate10520752075162529
124-Congested10530753075162545
135-Light6591259125161515
145-Moderate652012520125161529
155-Congested653012530125161545
166-Light659125975162515
176-Moderate65201252075162529
186-Congested65301253075162545
197-Light659759125162515
207-Moderate65207520125162529
217-Congested65307530125162545
228-Light65975975161515
238-Moderate6520752075161529
248-Congested6530753075161545
Table 5. Demographic Data.
Table 5. Demographic Data.
AgeMin 16
Max 66
Mean 25.74
Standard Deviation 7.84
NumberPercentage
GenderFemale22849.35
Male23350.43
Does not wish to specify10.21
Automobile License OwnershipYes35276.19
No11023.81
Automobile OwnershipYes13729.65
No32570.35
Income Level0 TL13829.87
1–Minimum wage10121.42
Minimum wage–20,000 TL408.65
20,001–30,000 TL5411.69
30,001–40,000 TL5111.04
40,001–50,000 TL316.71
More than 50,000 4910.61
Which infrastructure would you use the e-scooter on?Bicycle Path31768.61
Sidewalk5111.04
Road9420.35
How safe do you find the e-scooter?Very Safe30.65
Safe337.14
Moderate18640.26
Slightly Safe13829.87
Not Safe at All10222.08
Table 6. Summary of input variables used in the predictive models.
Table 6. Summary of input variables used in the predictive models.
CategoryVariable/QuestionTypeDescriptionEncoding MethodNumber of Neurons
DemographicAgeNumericAge of participant in yearsNone1
GenderCategoricalMale/Female/Prefer not to sayOne-hot3
Education LevelCategoricalMiddle School/High School/Undergraduate /PostgraduateOne-hot4
OccupationCategoricalNot employed/Self-employed/Public sector/Private sector/StudentOne-hot5
Automobile License OwnershipCategoricalYes/NoOne-hot1
Automobile OwnershipCategoricalYes/NoOne-hot1
Household SizeNumericNumber of people in householdNone1
Number of Cars NumericNumber of cars in householdNone1
Income LevelCategorical7 ordinal classesOne-hot7
Household Income LevelCategorical6 ordinal classesOne-hot6
E-scooter OwnershipCategoricalYes/NoOne-hot1
Opinions and Preferences Regarding E-ScootersIs there a bicycle lane within 250 m of your residence?CategoricalYes/NoOne-hot1
Do you use e-scooters on the sidewalk?CategoricalYes/NoOne-hot1
Do you use e-scooters on one-way streets?CategoricalYes/NoOne-hot1
Do you use e-scooters on two-way streets?CategoricalYes/NoOne-hot1
Do you park e-scooters in designated parking areas built specifically for bicycles and e-scooters?CategoricalYes/NoOne-hot1
Which mode of transportation do you prefer the most for your daily trips?CategoricalE-Scooter/Taxi/Public Transport/Walking/Private Vehicle One-hot5
For what purpose do you travel most frequently?CategoricalWork/School/Health/Entertainment/Shopping/OtherOne-hot6
On which type of infrastructure do you prefer to use e-scooters the most?CategoricalBicycle lane/Sidewalk/StreetOne-hot3
If you could not use an e-scooter for your trips, which mode of transportation would you use?CategoricalBicycle/Taxi/Public Transport/Walking/Private Vehicle One-hot5
At which levels of traffic congestion do you prefer to use e-scooters?CategoricalLight/Moderate/HeavyOne-hot3
For trips of up to how many minutes do you prefer to use e-scooters?Categorical5/10/10+One-hot3
How safe do you consider e-scooters to be?Categorical1–5 Likert scaleOne-hot5
What is your opinion on allowing e-scooter use on sidewalks under certain rules?CategoricalAcceptable/Should be banned/UndecidedOne-hot3
Where do you most frequently park e-scooters?CategoricalIn bicycle and e-scooter parking areas/At the edge of the rightmost lane/On the sidewalk/At a suitable place in my own vehicle, home, or officeOne-hot4
Would you prefer these parking areas if benefits such as discounted fees or free usage time were offered?CategoricalYes/NoOne-hot1
To what extent do you comply with traffic rules when using an e-scooter?Categorical1–5 Likert scaleOne-hot5
What is the most important factor influencing your decision to use an e-scooter for your trips?CategoricalAccessibility/Safety/Speed/Duration/Traffic congestion/CostOne-hot7
What is the most important factor influencing your decision not to use an e-scooter for your trips?CategoricalAccessibility/Safety/Speed/Duration/Traffic congestion/CostOne-hot7
How much do you consider appropriate to pay for 10 min of e-scooter usage?Categorical5 ordinal classesOne-hot5
How many e-scooter trips do you make per week?Categorical5 ordinal classesOne-hot5
On average, how many minutes do you travel during a single e-scooter trip?Categorical5 ordinal classesOne-hot5
On average, how many kilometers do you travel per day using an e-scooter?Categorical5 ordinal classesOne-hot5
Under which weather conditions do you use e-scooters?CategoricalSunny/Cloudy/Windy/RainyOne-hot4
Scenario AttributePrivate Vehicle FareNumericCost of the travel in TLNone1
Private Vehicle TimeNumericTravel duration in minutesNone1
Taxi FareNumericCost of the travel in TLNone1
Taxi TimeNumericTravel duration in minutesNone1
E-Scooter FareNumericCost of the travel in TLNone1
E-Scooter TimeNumericTravel duration in minutesNone1
Public Transport FareNumericCost of the travel in TLNone1
Public Transport TimeNumericTravel duration in minutesNone1
Table 7. Dataset combinations used in determining the network architecture.
Table 7. Dataset combinations used in determining the network architecture.
Partition ITrainingValidationTest
Partition IITrainingValidationTestTraining
Partition IIITrainingValidationTestTraining
Partition IVTrainingValidationTestTraining
Partition VValidationTestTraining
Table 8. The confusion matrices of the developed models, along with the support and weight values of the classes.
Table 8. The confusion matrices of the developed models, along with the support and weight values of the classes.
ModePrivate CarPublic TransportTaxiE-Scooter
ANNPrivate Car2206223104184
Public Transport11184804347648
Taxi01310
E-scooter339158640767
DTPrivate Car327213179118
Public Transport216491593121
Taxi474739370
E-scooter12893751290
RFPrivate Car2912529184455
Public Transport4974448184583
Taxi602315352
E-scooter194186119509
CBPrivate Car2952504199497
Public Transport5604559221660
Taxi36712029
E-scooter115116100413
Support366351866401599
Weight0.3300.4670.0580.144
Table 9. Performance metrics of the developed models.
Table 9. Performance metrics of the developed models.
MacroWeighted
ModelAccuracy Precision Recall F1Accuracy Precision Recall F1
ANN0.8440.5710.5090.5050.8720.7170.7040.690
DT0.9240.8350.8150.8290.8970.8280.8900.854
RF0.8590.6640.5530.5750.8270.7010.7230.701
CB0.8330.7060.7250.6980.8630.6630.5240.553
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Dündar, S.; Alp, S. Modeling Mode Choice Preferences of E-Scooter Users Using Machine Learning Methods—Case of Istanbul. Sustainability 2025, 17, 11088. https://doi.org/10.3390/su172411088

AMA Style

Dündar S, Alp S. Modeling Mode Choice Preferences of E-Scooter Users Using Machine Learning Methods—Case of Istanbul. Sustainability. 2025; 17(24):11088. https://doi.org/10.3390/su172411088

Chicago/Turabian Style

Dündar, Selim, and Sina Alp. 2025. "Modeling Mode Choice Preferences of E-Scooter Users Using Machine Learning Methods—Case of Istanbul" Sustainability 17, no. 24: 11088. https://doi.org/10.3390/su172411088

APA Style

Dündar, S., & Alp, S. (2025). Modeling Mode Choice Preferences of E-Scooter Users Using Machine Learning Methods—Case of Istanbul. Sustainability, 17(24), 11088. https://doi.org/10.3390/su172411088

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop