Next Article in Journal
Theoretical and Numerical Research on High-Speed Small Refrigeration Twin-Screw Compressor
Next Article in Special Issue
A Scoping Review on On-Road Risks Associated with Driving with Dementia: Implications for Driving Technology Advancements
Previous Article in Journal
Consumer Perceptions and Attitudes Towards Ultra-Processed Foods
Previous Article in Special Issue
Ontology-Based Customisation Management System for Driver-Vehicle Interfaces: A Preventive Approach to Incident Reduction and Legal Accountability in Highly Automated Vehicles
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Operationalizing Dyadic Urban Traffic Interaction Studies: From Theory to Practice

1
Department of Industrial Design, Eindhoven University of Technology, 5612 AP Eindhoven, The Netherlands
2
Department of Information Science, Cornell Tech, New York, NY 10044, USA
3
SCANIA CV AB, 417 56 Göteborg, Sweden
*
Author to whom correspondence should be addressed.
Appl. Sci. 2025, 15(7), 3738; https://doi.org/10.3390/app15073738
Submission received: 8 February 2025 / Revised: 19 March 2025 / Accepted: 21 March 2025 / Published: 28 March 2025
(This article belongs to the Special Issue Human–Vehicle Interactions)

Abstract

:
Realistically modeling interactions between road users—like those between drivers or between drivers and pedestrians—within experimental settings come with pragmatic challenges. Due to practical constraints, research typically focuses on a limited subset of potential scenarios, raising questions about the scalability and generalizability of findings about interactions to untested scenarios. Here, we aim to tackle this by laying the methodological groundwork for defining representative scenarios for dyadic (two-actor) interactions that can be analyzed individually. This paper introduces a conceptual guide for operationalizing controlled dyadic traffic interaction studies, developed through extensive interdisciplinary brainstorming to bridge theoretical models and practical experimental design. It elucidates critical trade-offs in scenario selection, interaction approaches, measurement strategies, and timing coordination, thereby enhancing reproducibility and clarity for future traffic interaction research and streamlining the design process. The methodologies and insights we provide aim to enhance the accessibility and quality of traffic interaction research, offering a guide that aids researchers in setting up studies and ensures clarity and reproducibility in reporting, bridging the gap between theoretical traffic interaction models and practical applications in controlled experiments, thereby contributing to advancements in human factors research on traffic management and safety.

1. Introduction

The complexities of automated vehicle (AV) interactions with human road users are now recognized as a significant obstacle to their large-scale adoption [1,2,3,4,5]. Inadequate interaction between AVs and humans could result in accidents if the AVs’ actions are not anticipated by human drivers, or traffic congestion and human dissatisfaction if AVs are overly cautious [6]. Hence, there is a pressing need for advancements in understanding traffic interaction, since the substantial societal and economic gains anticipated from the deployment of AVs depend on these vehicles’ ability to anticipate and respond appropriately to the behaviors of human driver, pedestrians and other human road users [7,8,9]. This urgency highlights existing research questions and introduces new ones regarding how traditional human interactions unfold in traffic—the ability to rigorously examine these interactions, discern nuanced differences, and analyze both the breakdowns and recoveries, is crucial.
To drive the research on human behavior in traffic interactions, Markkula et al. [10] established a comprehensive, high-level conceptual framework for articulating and examining road traffic interactions from an interdisciplinary perspective. This conceptual framework introduced a precise definition of a “traffic interaction” and outlined a taxonomy of behaviors that road users display during such interactions. It offers multiple benefits: besides enhancing communication among researchers through clear definitions, it improves observational methodologies with precise interaction classifications, informs the setting of technical standards for AVs based on a detailed understanding of traffic interactions, and serves as a foundation for developing detailed interaction models.
Markkula et al.’s work surfaces the need for controlled experiments of road traffic interactions, but the practical operationalization of this framework into human participant studies involves numerous non-explicit challenges and decision points that are not captured in their framework but worth detailing for researchers embarking on the critical challenge of gathering qualitative and quantitative data about traffic interactions. Furthermore, Deshmukh et al. [11] presented a systematic review as a valuable survey of challenging scenarios for automated vehicles (AVs) and vulnerable road users (VRUs), with its focus primarily centering on the identification and classification of critical scenarios from a theoretical and observational standpoint. While these prior works provided a theoretical overview of the diverse possible interaction scenarios, they do not extend to conducting controlled studies with these scenarios. In our work, we build upon prior research, particularly, Markkula et al.’s theoretical framework [10] to operationalize experimental studies that enable us to gather qualitative and quantitative data about traffic interaction behavior. We aim to answer a critical question as experimentalists: how should we proceed in operationalizing these interactions for controlled empirical study? A key aspect to focus on is dyadic traffic interactions—interactions that occur between two road users, such as a human-operated vehicle and an autonomous vehicle (see Figure 1). Studying these two-party interactions is essential for understanding the dynamic exchanges and decision-making processes that take place in real-world traffic, where vehicles must constantly respond to the behaviors and actions of others. Operationalizing these dyadic interactions in a controlled setting will allow for more detailed investigation of how autonomous vehicles can safely and efficiently navigate complex situations involving human road users.
Our work extends Markkula et al.’s work by addressing the practical challenges encountered in setting up study design and conducting real-world or virtual interaction studies. We elucidate the logistical, technical, and methodological issues that researchers face when attempting to set up and implement these scenarios in empirical settings, with a particular focus on dyadic scenarios. It is not practical to run every conceivable interaction scenario, due to constraints such as time, budget, participant availability, and the technical limitations of simulation environments. Hence, experimentalists must make strategic choices that balance depth of inquiry with feasibility, for which we provide initial guidance in the framework presented in this paper. More specifically, this paper is a synthesis of the outcomes of five brainstorming sessions on the various pragmatic decisions necessary for studying various interaction scenarios, each designed to provoke different behaviors, and highlight the trade-offs involved in scoping such studies. Our aim is to offer an initial framework that will guide researchers in making informed decisions about study design that balance the depth of individual scenario analysis with the breadth required to derive meaningful insights applicable to more general contexts.

Contribution Statement

In this work, we offer an evaluation of the complexities in traffic interactions for controlled studies, providing a conceptual guide for downstream research in this field that we hope others will adopt. This framework aims to enhance study reproducibility, enable comparability across research, and support the design of interaction studies. By defining these complexities, we help researchers implement and investigate traffic interactions more effectively, advancing both theoretical and practical research in traffic management and vehicle automation. This discussion on the pragmatics of operationalizing dyadic traffic interaction studies—while not comprehensive—lays the groundwork of a structured approach to simulator-based studies by serving two primary purposes: (1) to educate both researchers and practitioners about the realities of conducting traffic interaction studies, particularly for those who are unfamiliar with the intricacies involved in such research; and (2) to establish a common language that facilitates clear communication between researchers and reviewers.

2. Background

2.1. Dyadic Interactions

Traffic systems are intricate and dynamic, influenced by multiple parameters such as vehicle speed, traffic density, road conditions, and environmental influences, all of which complicate the isolation and study of specific traffic interactions. Historically, the exploration of traffic interaction dynamics primarily relied upon observational methodologies. Notably, Juhlin utilized ethnographic strategies to study interactions at a Swedish driving school by observing and interviewing participants, recording sessions, and subsequently transcribing and thematically analyzing instances of cooperation among road users [12]. This approach has been paralleled in research addressing various facets of social agent navigation within urban traffic [13], interactions between drivers and buses [14], pedestrian-vehicle dynamics [15,16,17], and behaviors at petrol stations [18]. Additionally, Vinkhuyzen and Cefkin employed ethnographic methods to probe into how AVs are anticipated to interact with pedestrians, bicyclists, and other vehicles in a socially congruent manner, highlighting the methodological challenges in distinguishing such interactions observationally [19].
This is particularly relevant when comparing traffic interaction behaviors across locations with distinct cultures. Scholars studying driving interactions focus on the dynamics betweendrivers and other road users as key elements defining regional driving styles. Sociologist Dale Dannefer, for instance, discusses informal norms that influence driving behaviors, including following distance, merging practices, and the adherence to right-of-way rules, as well as the display of attention or inattention while driving [20]. Extending this viewpoint, Factor et al. propose that certain vehicular accidents should be understood not merely as results of individual risky behaviors but as “social accidents”, which occur due to misunderstandings and misinterpretations among individuals from different social backgrounds during their interactions [21]. This further contextualizes the need for studying and characterizing traffic interactions, for which the seminal work by Markkula et al. [10] provides a solid theoretical foundation.
Observational studies and naturalistic data analysis have historically played a crucial role in elucidating insights into traffic interactions. Prominent examples include large-scale driving studies utilizing instrumented vehicles to expand the scope of data by increasing the number of participants, miles driven, and duration of data collection. Notable projects include Virginia Tech’s 100 Car Naturalistic Driving Study [22], the SHRP2 Naturalistic Driving Study [23], the University of Minnesota’s Field Operational Test of the Teen Driver Support System [24], and the ongoing MIT Autonomous Vehicle Technology Study [25]. These studies collect diverse data forms such as images, videos, and telematic data from instrumented vehicles, creating versatile research datasets used for purposes like accident prediction [26] and analyzing the impact of weather on driving performance [27]. A notable challenge with these multi-use datasets is that they are often compiled without a clear vision of how future researchers might utilize them [28]. Because of this, the analysis of interaction models using such data, as explored by Domeyer et al. using MIT-AVT data [29], often demands substantial processing to identify and classify interactions due to the complexity and volume of the data [30]. The challenges of isolating findings amidst the sea of confounding factors has lead to growing interest in augmenting naturalistic datasets with more tightly curated simulation datasets collecting interactions under more controlled environments.
Simulations are particularly beneficial in the study of AVs, as they allow for control of interactive agents, facilitating analysis of how variations in AV behavior impact human drivers [31]. Recent studies employing virtual simulations have explored pedestrian interactions with AVs, investigating how these interactions are affected by factors such as vehicle appearance and behavior [32,33], the effectiveness of external displays in signaling vehicle intent [34,35,36], and the influence of vehicle motion on pedestrian decision-making regarding crossing behaviors [37].
To understand the dynamic between actors, such as those the social scientists study, driving simulators that can enable interaction between multiple participants are needed. The concept of conducting interaction studies among multiple drivers within a shared virtual environment was pioneered by Hancock and De Ridder in 2003, who utilized adjacent full-vehicle simulators to study collision avoidance behaviors [38]. Following this, Muhlbacher et al. developed a platform in 2011 for studying interactions among four drivers in a platooning scenario [39]. Furthermore, researchers at the Institute for Transportation Studies at the German Aerospace Center (DLR) have developed the Modular and Scalable Application Platform for ITS Components (MoSAIC), which has been instrumental in examining interactions among V2V connected vehicles and non-equipped vehicles [40], as well as cooperative lane change maneuvers [41] and traffic-light assistance systems [42]. Recent advancements in virtual-reality based multi-driver and driver-pedestrian simulations demonstrate the growing feasibility of conducting comprehensive multi-agent traffic interaction studies [43,44,45,46,47,48,49].
Despite the presence of these systems, there are as yet only a limited number of papers elaborating interactions based on these multi-participant scenarios [44,45,46,48]. In three of these four papers, only one multi-agent scenario is explored; none attempt to cover the range of interactions outlined by Markkula et al. [10]. We perceive this to stem from the need to translate Markkula’s work into operationalizable study designs.

2.2. Related Frameworks on Experiment Design

While there are other papers addressing design of studies on traffic interaction (e.g., [50,51,52]), they have a tendency to focus on particular road users or certain measurements, leaving a gap when it comes to a holistic methodological approach in the design of controlled studies for traffic interactions. For instance, Donmez et al. [52] present in their framework only a set of guidelines for analysis of road user behavior data, particularly focusing on non-driving related task. Similarly, Kircher et al. [50] focuses on analysis of data from semi-controlled experiments from driver perspective. Kaß et al. [51] present a methodology for conducting usability study with focus on situations involving automated vehicles and pedestrians or bicyclists. These works are limited to usability and lack a holistic approach to critical decisions and trade-offs that experimenters need to make. The systematic review by Deshmukh et al. [11] provides a comprehensive taxonomy of challenging scenarios involving automated vehicles (AVs) and vulnerable road users (VRUs), with a strong emphasis on identifying critical interaction points that AVs must navigate to ensure safety and effectiveness. Though this work offers a valuable foundation for understanding the scope and diversity of traffic interactions, its focus is largely theoretical, with limited consideration of the practical challenges involved in translating these scenarios into experimental settings. To address this gap, we contribute with a methodological framework that explicitly addresses the operationalization of traffic interaction studies, providing a structured approach to selecting, designing, and running scenarios under real-world constraints. We emphasize the logistics of running factorial experiments, including scenario selection criteria, decision-making processes, and strategies to mitigate participant burden.

3. Method

Our process consisted of a series of brainstorming sessions between the authors—each having extensive experience in designing and executing both controlled and naturalistic studies on traffic interactions in academia and automotive industry.
The brainstorming sessions conducted for this study spanned three meetings, during which the authors collaboratively examined the complexities of operationalizing dyadic traffic interaction studies. The discussions were iterative and comprehensive, revisiting multiple aspects of the study design across the three meetings. The goal was to methodically address pragmatic issues while developing the framework. No significant disagreements emerged during these sessions; instead, the team focused on systematically unearthing and resolving issues related to study design pragmatics. The first author was mainly responsible for preparing and organizing the sessions as well as for documenting the results. All authors participated in the brainstorming, structuring and revision of the framework.
This research was part of a broader NSF-funded project on vehicle-pedestrian interaction. As part of this effort, the research team consulted an external advisory board composed of eight members representing diverse sectors, including academia, industry and policy. The advisory board brought a wealth of expertise to the discussions from experience in the domains of human factors, traffic psychology, and autonomous vehicle systems. This ensured a well-rounded exploration of the challenges and solutions. provided valuable feedback on the research and its development, ensuring that diverse perspectives informed our work. The ideas generated during the brainstorming sessions were presented to this advisory board, whose insights were instrumental in refining the work and enhancing its applicability.
Each session was designed to progressively deconstruct and address the complexities of traffic interaction studies. During these sessions, we identified key elements of interaction as defined in theoretical models and discussed how these elements could be translated into practical scenarios that could be implemented in a simulated study design in the following steps:

3.1. Identifying Elements of Interaction

Initial sessions focused on dissecting the theoretical aspects of interactions, such as the spatial and temporal dynamics between multiple road users, and understanding how these could influence each other’s behavior within a shared space. The outcomes from this step underpin the theoretical ground for our framework and are as such reflected in the rest of our decisions.

3.2. Translation to Practical Implementation

Subsequently, we delved into the operationalization of these interactions within a simulation. This included discussions on the technological and logistical feasibility of simulating complex multi-agent interactions, considering the physical constraints of the simulation environment, resource limitations, safety and ethics, and participant requirements. The overall structure of our framework and the decisions that the experimenters need to make for a successful controlled interaction study constitute the key results from this step. More specifically, we identified a set of high-level trade-offs that experimenters need to consider when designing such a study (Depth vs. Breadth, Quality vs. Quantity, Cost vs. Coverage, Complexity vs. Manageability), as well as a set of critical decision-making aspects (Choice of scenarios, Interaction approach, Measures, and Timing and coordination of interactions). While there might be other important aspects, these four wide dimensions were selected by the team as the most critical for a comprehensive study design.

3.3. Scenario Development

Based on the results from the previous two steps, we systematically developed an exhaustive set of interaction scenarios between a pair of traffic participants (driver-driver or driver-pedestrian) that encompassed a broad range of situations. The results from this step are included in the framework as a comprehensive pool of scenarios from which the final set of scenarios in the study can be selected based on the research questions and study criteria.

4. Results: Aspects of Traffic Interaction Study Design Warranting Trade-Offs

In setting up experimental interaction studies, experimenters encounter a variety of challenging constraints and considerations that significantly influenced our study design. Here, we share insights into the decision-making processes involved in designing interaction studies. The purpose is not to prescribe a universal method for conducting such studies but rather to illuminate the pragmatic limitations and logistical challenges inherent to this type of research.
Although traffic interactions may involve multiple participants, in this paper, for analytical simplicity, we decompose these multi-agent interactions into dyadic components, i.e., we only focus on 2-actor scenarios. This reduction is based on the assumption that at a fundamental level, traffic interactions occur between pairs and are resolved according to the most immediate threats or safety concerns, which allows us to dissect the broader challenges into more manageable segments.
As experimenters, we must confront the inherent trade-off between the depth of study in specific scenarios and the generalizability of findings. Not all behaviors or phenomena observed in particular settings will necessarily apply to all scenarios, highlighting the limitations of each study’s applicability. This requires us to keep in mind the following decision factors:
  • Depth vs. Breadth: Concentrating on specific scenarios provides depth but may sacrifice breadth and general applicability.
  • Quality vs. Quantity: Gathering high-quality data in a few scenarios might mean fewer overall observations, affecting statistical power and broader implications.
  • Cost vs. Coverage: Financial constraints may limit the number of scenarios or participants, impacting the study’s scope and comprehensiveness.
  • Complexity vs. Manageability: Studying more complex interactions offers richer insights but can complicate the analysis and interpretation of data, and also create complications with ethical approval from institutional IRBs if the complexity and realism of the setup can potentially create safety concerns for participants.
By highlighting these specific decisions and trade-offs necessary in the design and execution of interaction studies, we aim to clarify the requirements for researchers planning to run interaction studies. Besides these broad decision factors, operationalizing interaction studies necessitates a balance of trade-offs in scoping and managing interactions effectively. It is important to determine reasonable ways to break down the comprehensive problem space into manageable, researchable chunks for a focused and effective study. Our brainstorming sessions identified critical decision-making aspects across four dimensions that influence the design of these studies:
  • Choice of Scenarios: Choosing which scenarios to include involves weighing the comprehensiveness against cost and effort. It is impractical to cover every possible scenario, so selecting a subset that offers the most significant insights while being feasible is crucial.
  • Interaction approach—Vignettes vs. Drives: Deciding between using short, specific scenarios (vignettes) or longer, continuous driving simulations (drives) affects the depth of data and the naturalism of interactions.
  • Measures: Deciding what measurements to take (behavioral, physiological, subjective) and how these metrics can best capture the interactions of interest. The choice of the measures and how they are collected has a direct impact on the determination of whether the interaction approach of the study.
  • Timing and coordination of Interactions: Setups that attempt to control the timing to provoke interactions can detract from the natural flow of operating in traffic, presenting a trade-off against naturalism.
While addressing these aspects requires making decisions that may not always be optimal but are necessary within the study’s constraints, understanding these practical considerations ensures an effective and transparent design of interaction studies. This transparency not only aids in replicating studies but also in understanding and evaluating the limitations and strengths of research findings within the field. These trade-offs underscore the need for efficiently designed studies that maximize data collection within the available duration, ensuring participant well-being and adherence to budgetary constraints. These constraints necessitate a more selective approach to studying traffic interactions, focusing on key scenarios that offer the most significant insights relevant to the research questions at hand.
For each of these four dimensions, there are further technical considerations and research question needs that must play a role in determining the final study setup, which we elaborate in the subsequent sections of this paper.

4.1. Choice of Scenarios

Markkula’s detailed enumeration of interaction scenarios presents us with a decision point in conducting interaction studies: should we attempt to cover all enumerated scenarios or should we focus in-depth on a select few? Each approach offers distinct benefits and aligns with different research objectives. Broadly studying every possible scenario could provide a comprehensive understanding of traffic behavior across various contexts, emphasizing the universality of space-sharing conflicts. This approach would ideally capture the full spectrum of interaction dynamics, offering a macroscopic view of traffic behavior. However, the feasibility of such an exhaustive study is often limited by factors such as resources, time, funding, simulator sickness, and the sheer complexity of traffic environments.
On the other hand, focusing on a few selected scenarios in depth allows for a more detailed exploration of specific interactions. While this approach sacrifices some breadth, it enhances depth and may lead to more detailed, context-specific understandings that can inform targeted interventions or policies. Given the impracticality of studying “every situation”, we suggest in this paper a “balanced” approach that involve selecting a representative range of diverse scenarios. This strategy enables researchers to explore how findings from one scenario may transfer to or differ from others, thereby uncovering the relationships between different types of encounters. Understanding these relationships can help to generalize findings and apply them more broadly, even to untested scenarios, thereby maximizing both the reach and the relevance of the research. However, at a more pragmatic level, the right approach is dictated by the technical constraints of the setup for the study, including:
  • Space Constraint: Physical or virtual environments where the studies are conducted might have limitations on the nature of the scenarios that can be realistically set up and analyzed due to space restrictions.
  • Time Constraint: Each scenario requires setup, execution, and analysis time. Covering every possible scenario in an interaction study can thus be exceedingly time-consuming, making it difficult to complete the study within a affordable timeframe. Besides, extensive time commitments can be taxing for participants, potentially affecting their performance and willingness to participate. Furthermore, if participants are compensated by the hour, budget limitations can restrict the total available hours for conducting the study. This financial constraint often necessitates a concise and focused experimental design to stay within budget. Furthermore, in simulated environments, prolonged exposure can lead to various physiological limitations among participants such as simulator sickness (a condition similar to motion sickness), fatigue and cognitive overload, which limits the duration participants can comfortably engage with the simulation.
  • Number of Participants: Recruiting a sufficient number of participants to cover all scenarios comprehensively can be challenging and costly, especially for more complex interactions that may require a larger and more diverse participant pool.
  • Type of Participants: Attention need to be paid to the specific demographic of participant, incorporating parameters such as age group and expertise level with a domain and technology (e.g., ADAS system usage, heavy vehicle operation experience, or level of exposure to autonomous driving), aligning with the agents required as actors in the study.
  • Naturalism: Conducting studies in in the real world ensures that the data and findings approach high ecological validity (e.g., drivers are more likely to exhibit natural driving behaviors). However, this inherently carries risks to the participant and others, and can be resource and cost intensive, and directly dictate the scenarios that are accessible to test in the circumstances.

4.1.1. Categorizing Scenarios

The location of the scenarios on road range from intersections, parking lots, shared roads (where vehicles and vulnerable road users occupy the same space with the latter having the right of way at all times), mid-block sites on roads, and merge situations. Within each of these scenarios, there can be various combinations of how a dyadic (two-agent) interaction can unfold, which is expounded in the appendices, while the final selection of the scenarios in the study from within that pool depend on the constraints explained earlier and shown in Figure 2. Based on our brainstorming sessions, we argue that it is crucial to systematically determine the grouping of traffic interactions based on the specific research question being addressed. More specifically, in a study focusing on dyadic interactions, different kinds of groupings can be derived from the research question. For instance, one potential grouping could involve scenarios where one agent possesses the right of way while the other does not, necessitating a comprehensive examination of all such scenarios. Another grouping might focus on situations where one agent proceeds straight and the other needs to turn. Additionally, a further grouping could involve both actors executing a change in direction, requiring the specification of all scenarios that fit this criterion. The choice of grouping(s) stems directly from the research question, emphasizing the need to align with the underlying inquiry, rather than advocating for any particular grouping. Our proposed strategy ensures that the interactions studied are relevant to the specific dynamics and outcomes we aim to understand, allowing for more targeted and meaningful insights into traffic behavior and safety.
Choosing the right subset of scenarios for a traffic interaction study is a critical decision hinges critically on aligning with the research question at hand so that it balances thoroughness with practical constraints such as cost and effort. Running only one scenario might not be very informative, but attempting to cover all possible scenarios might neither be feasible nor efficient. The challenge, then, is to select a subset of scenarios that is both manageable and representative of the broader issues being investigated. Among the subset of scenarios that is chosen, it is important to articulate clearly what remains consistent across scenarios and what varies. Such clarity supports the study’s internal validity by illustrating how different variables or settings influence the outcomes.
Each scenario’s inclusion in the study must be justified, explaining why these particular scenarios were chosen over others. This justification should focus on their potential to provide insights into the types of interactions being studied. Additionally, establishing a common thread or theme that runs through all scenarios helps in synthesizing findings and drawing broader conclusions. Whether it is a specific type of interaction or a shared environmental condition, this commonality ties the scenarios together and enhances the overall coherence of the research. By thoughtfully selecting and justifying these scenarios, researchers ensure that their studies are both manageable and meaningful. This structured approach not only aids in conducting focused and effective research but also strengthens the study’s contribution to the broader field of traffic interaction studies, providing insightful and impactful results.

4.1.2. Example of Scenario Selection

Appendix A.1, Appendix A.2, Appendix A.3, Appendix A.4, Appendix B.1, Appendix B.2, Appendix B.3 and Appendix B.4 showcase various use cases of driver:driver and driver:pedestrian interactions. Here, we outline a “balanced” approach to scenario selection and provide examples that illustrate how to streamline scenario selection based on these guidelines, which involves identifying a subset of representative scenarios that align closely with the research objectives while remaining feasible within the study’s constraints. Factors such as scenario relevance to the research question, potential for generating impactful insights, and logistical feasibility (e.g., time required, participant burden, and budgetary limits) are critical criteria for prioritization. By focusing on scenarios that address the most pressing aspects of the research question or that are most likely to yield transferable insights, researchers can make informed decisions about where to allocate limited resources effectively.
The scenarios highlighted in Figure A1 and Figure A2 refer to interactions between two drivers when they approach one another in an orthogonal (perpendicular) or contra-directional (head-on) manner respectively. In fleshing out the scenarios, the grouping (“common thread”) applied was the direction of travel of the other car (left, right, or straight) in relation to the ego car. Once these scenarios are thoroughly elaborated, we realize that the 27 total interaction scenarios (18 orthogonal and 9 contra-directional) is a large number of scenarios that may be pragmatically impossible to investigate exhaustively in an interaction study. It therefore becomes necessary to eliminate some scenarios to scope the study to the ones that are the core focus of the research question. For instance, if the research question deals specifically with space sharing conflicts, then there may be an argument to eliminate scenarios in which the cars do not actively cross one another’s path even though they encounter one another. Thus, in Figure A1, scenarios DD-O2, O-5, O7, O8, O10, O11, O12, and O14 can be omitted. Similarly, in Figure A2, scenarios DD-C5, C6, C8, and C9 may be ignored. This allows for a more streamlined consideration of the problem space for a focused interaction study.
Similarly, Figure A5 and Figure A6 showcase the interactions between a driver and a pedestrian when they approach one another in an orthogonal (perpendicular) or contra-directional (parallel/head-on) manner respectively. Among the 24 total interaction scenarios (12 orthogonal and 12 contra-directional), the research question may help us winnow down the scenarios of interest. For example, if the focus is on how the pedestrian crosses the road when the car approaches from their side regardless of the behavior of the car predicated on its intention (to turn or go straight) at the intersection, the scenarios DP-O1, O2, and O3 essentially reduce to one scenario—for each of them the car approaches the pedestrian from the side. Any one of these three scenarios can be chosen. Consider another study design driven by road geometry: if the road the pedestrian crosses is a single-lane road as opposed to a multi-lane one, scenarios O4, O5, and O6 essentially mirror scenarios O1, O2, and O3 respectively. In each of them, the pedestrian essentially crosses a one-lane road with a car approaching them from the side. This type of rationalizing can help scope the interaction setup into component scenarios that are of interest for the research question.
Besides the figures mentioned above, which are full-factorial designs of dyadic interactions between two drivers, or one driver and one pedestrian at an intersection, we also included several other interaction scenarios in the appendix. Figure A3 and Figure A4 highlight non-exhaustive combinations of interaction scenarios in a parking lot and other non-intersection on-road situations respectively for two drivers, and Figure A7 and Figure A8 do the same for driver-pedestrian encounters. We hope that these scenarios can function as a reference point for the research community to compare, benchmark, rationalize, and choose appropriate scenarios to scope downstream interaction studies.

4.1.3. Sample Size and Participant Diversity

Determining an appropriate sample size depends heavily on the specific research question and the effect sizes of the dependent variables under investigation. While larger sample sizes enhance statistical power, smaller, more focused samples may provide greater depth and richer qualitative insights, particularly in exploratory or user experience-focused studies. We emphasize the importance of balancing breadth with depth, depending on the study’s objectives. For instance, when investigating nuanced behaviors or capturing outlier interactions, qualitative methods such as in-depth interviews, longitudinal observations, and reflective exercises may yield more actionable insights than statistical analyses alone. These approaches are particularly valuable for understanding the subjective experiences and motivations that underpin traffic interactions, which may not be fully captured through objective behavioral measures.
Regarding participant diversity, we highlight the importance of ensuring representative samples that reflect the demographic and cultural variability of road users. This includes accounting for not only differences in age, gender, driving experience, and familiarity with autonomous vehicle systems, but also cultural norms that influence interaction behaviors. Studies aimed at exploring cross-cultural differences or generalizing findings across regions should prioritize participant diversity to ensure ecological validity and inclusivity. In some cases, smaller sample sizes (e.g., 5–10 participants) may be appropriate for generating deep, qualitative insights, particularly in the early stages of research or when piloting new frameworks. Conversely, larger sample sizes (e.g., 30+ participants) may be warranted for studies requiring robust statistical comparisons or generalizability. Ultimately, the choice between breadth and depth should align with the research question and the study’s goals. Incorporating iterative study designs that combine both approaches—e.g., qualitative insights from smaller, exploratory studies that inform the design of larger-scale quantitative studies—can ensure that both depth and representativeness are achieved over the course of investigating a research question.

4.2. Interaction Approach: Vignettes vs. Drives

Centering on the methodological decisions involved when setting up a study that incorporates multiple scenarios, each designed to provoke different interactions, we outline two primary approaches:
  • Studying Each Interaction Piecemeal (Vignettes of scenarios): This approach involves examining each interaction in isolation, which allows for detailed analysis but may limit the generalizability of the findings to broader contexts.
  • Considering Interactions as Part of a Larger Whole (long drives which can contain multiple interactions that flow into one another): This method views individual interactions as components of a more extensive system, aiming to understand the cumulative and emergent effects of multiple interactions.
The choice to connect scenarios or treat them as standalone depends largely on the research questions researchers aim to address. While connected scenarios can offer insights into sequential interactions and context continuity, standalone scenarios focus more on isolated behaviors under controlled conditions. Deciding whether to use controlled vignettes or more naturalistic drive scenarios depends on the study goals. Vignettes allow for controlled manipulation of specific variables and easier replication of scenarios, making them suitable for isolating specific interaction effects. In contrast, naturalistic drives provide richer, more holistic data that reflect real-world driving conditions, which is beneficial for studying more complex, emergent behaviors. If research objectives require precise control over environmental variables and interaction parameters, controlled vignettes of interactions may be preferable over more naturalistic, longer drive scenarios. Vignettes allow experimenters to manipulate interactions more deliberately to isolate and study specific encounters and simplify replicating scenarios, ensuring consistency across multiple trials.
However, implementing vignettes of scenarios involves significant trade-offs, particularly in balancing the controlled environment with the degree of naturalism. Coordinating interactions meticulously to ensure precise timing can make the scenarios feel less natural, potentially affecting the behavior of participants. This artificial setup might be necessary to achieve the study’s goals but does come at the cost of reduced realism. Moreover, issues such as motion sickness, especially triggered by the start-stop nature of vignettes, need to be managed. This factor can influence participant comfort and the overall quality of data collection. The duration of each trial and the overall study also require careful consideration. The length of interactions and the number of trials per session must be balanced to maximize efficiency without overwhelming participants. This balancing act affects how many scenarios we can realistically fit into each session and the total duration of the study.
These considerations highlight the complexity of designing and executing traffic interaction studies. A conclusion from our brainstorming is that each choice, from scenario type to session length, involves strategic decisions that impact the study’s effectiveness and the integrity of its conclusions, and are dictated by several factors, which we classify broadly under:
  • Disruption vs. Data Quality: Techniques that interrupt an experience to query a participant significantly disrupt the scenario, potentially altering participant behavior and responses due to the unnatural pause and interrogation. If this may fundamentally influence and impact the data pertaining to the research question being studied, vignettes are not an option and the interaction approach must constitute a long drive.
  • Memory vs. Immersion: Delaying questions to post-interaction can reduce the immediate disruptive impact, but at the risk of diminished recall accuracy. Participants may forget crucial details or modify their memories based on post-scenario reflection. However, if it is crucial to get as accurate and detailed a feedback on an interaction immediately or soon after it has taken place, the vignette approach is ideal.
  • Interaction Time vs. Interrogation Depth: The more time spent interrogating participants, the less time is available for actual interaction scenarios. This trade-off is particularly acute in studies with limited time resources. Extensive questioning can exhaust the time that could otherwise be used to run additional scenarios or explore different interactions. However, if it is important that participants experience an uninterrupted and immersive experience in the world to spend time tackling different kinds of interaction, a long drive setup is ideal.
  • Comprehensiveness vs. Participant Burden: Extensive (and repetitive) questioning can provide more comprehensive insights but may also place a greater cognitive or emotional burden on participants, potentially affecting their performance and the quality of the data collected. In lengthy traffic interaction studies, participants might experience boredom, simulator sickness, or wish to quit early. Particularly when recruitment costs are high, vignettes offer a solution by allowing researchers to salvage partial data from interactions already completed if a participant withdraws. This approach maintains data integrity and usability, which is not feasible with continuous long drives. Therefore, choosing between vignettes and long drives should consider the risk of participant withdrawal and the importance of preserving partial data.
These factors are captured in Figure 3, and the flowchart in Figure 4 guides the determination of whether a Vignette or Long Drive setup if ideal for a study. This also has direct influence on the measures used for the study, as we elaborate in the next section.

4.3. Measures

When designing traffic interaction studies, one of the key elements is selecting the appropriate behavioral measures. These measures are crucial for capturing the dynamics of participant interactions and understanding the underlying mechanisms of their behavior, especially in terms of their awareness and responses in different traffic scenarios. Behavioral measures in traffic interaction studies typically include objective measures such as Reaction Time (how quickly a participant reacts to changes in the traffic environment, such as the sudden appearance of a pedestrian or a car braking abruptly), Eye Tracking (to determine where participants are looking during an interaction, indicating what they are paying attention to and their level of situational awareness), physiological indicators (heart rate, galvanic skin response, etc., characterizing internal state and stress), and Decision Making (assessments of the choices participants make in response to specific scenarios, which can include lane changes, speed adjustments, or navigational decisions). Experimenter-driven subjective data resulting from passive observation & coding, or real-time psychological monitoring can also prove useful in providing insights on the participant state.
However, each of these traditional measures from controlled engineering-psychology studies, while providing valuable insights, also comes with inherent limitations. Reaction Time can be difficult to accurately assess in studies where the environment is highly dynamic and participants have significant freedom in how they respond. The lack of control over the independent variable and the presence of many confounding factors can make it challenging to isolate the specific impact of a stimulus on reaction time. Secondly, while eye tracking is a powerful tool for understanding where participants direct their attention, it does not always correlate directly with situational awareness (SA). The phenomenon of “looked but failed to see” [53] illustrates this limitation—participants may physically look at an object or scenario but may not process it meaningfully, leading to discrepancies between where their eyes are pointed and what they actually perceive or are aware of. Analyzing decisions made by participants can reveal how they respond to specific scenarios, but does not provide insights into the underlying reasons for those decisions. And lastly, methods such as passive observation & coding or real-time psychological monitoring might be affected by experimenter bias, and might not reflect or elucidate the underlying reasons behind specific participant behavior. Without understanding the motivations or cognitive processes that drive decision making, it is difficult to use these data to infer deeper behavioral patterns or to predict future behaviors under different conditions. Moreover, the analysis must be designed to handle the complexity of these data, acknowledging that simple causal inferences are often insufficient. In response to these challenges, it is crucial for studies to employ a suitable combination of measures and to integrate qualitative insights where possible. For instance, incorporating follow-up interviews or questionnaires can help elucidate the reasoning behind participants’ decisions, providing context that quantitative measures alone might miss.
To address this, it becomes necessary to ask participants directly. This interrogation can be conducted in various forms, such as after or during the interaction, each with its own implications. Post-interaction interviews, i.e., asking participants to recall their experiences and decisions after the fact can lead to issues with memory decay or recall bias. Participants might forget details or reconstruct their memories inaccurately. On the other hand, real-time interrogation, i.e., interrupting the interaction to ask participants about their awareness can provide more immediate and potentially accurate responses. However, this method can disrupt the flow of the interaction and take people out of the scenarios both experientially and cognitively, potentially altering the naturalness of their behavior and affecting the results.
For example, one of the critical challenges is accurately measuring participants’ workload, emotional state, situational awareness, and attitudes without compromising the naturalistic integrity of the scenarios. Tools like the NASA-TLX [54] (workload and cognitive effort), SAM [55] (emotion and valence), and SAGAT [56] (situation awareness) provide structured methods to query participants’ awareness at specific moments by pausing the scenario and asking targeted questions. However, this approach has significant trade-offs in terms of naturalism, as it can disrupt the flow of interaction and remove participants from the immersive experience of the scenario. An alternative approach is to use an in-situ tool that allows querying the desired metric without disrupting the immersion of the experience and preserving naturalism. Examples for such tools for the case of situation awareness include the Daze methodology proposed by Sirkin et al. [57] or SPAM (Situation Present Assessment Method) [58], which uses a more seamless questioning technique in a continuous manner. Employing other efficient questioning techniques such as auto- and allo-confrontation during review of video replay of interactions [59] can help balance the depth of interrogation with the need to keep the scenarios running smoothly—participants can reflect on their or others’ actions and attitudes by watching their performance, which can mitigate some of the issues with memory-based distortions. However, these subjective measures are also not without their own biases and can compromise data integrity.
Ultimately, designing an interaction study involves carefully considering these trade-offs to balance the need for rich, accurate data with the goal of maintaining a realistic and engaging participant experience. Deciding whether to prioritize depth of understanding in a few scenarios over breadth in many depends largely on the specific research questions and the resources available. Each choice impacts not just the quality of the data collected but also the overall validity and reliability of the study’s findings.

4.4. Timing and Coordination of Interactions

Regardless of which scenario is chosen for the study, and whether they are presented as vignettes or a sequence of encounters within a long drive, timing is a critical challenge in traffic interaction studies. This is particularly important when attempting to orchestrate interactions between participants who have varying degrees of freedom and autonomy, such as the speed at which they choose to drive or walk. Ensuring that these interactions occur as planned without stripping participants of their autonomy requires a nuanced approach to study design. The risk is that if the interaction occurs too early or too late, the intended encounter either does not happen or does not yield useful data. Forcing these interactions effectively, without making the control measures overly apparent to the participants, is indeed a subtle yet critical aspect of study design.
To manage the timing of interactions effectively, experimenters may resort to strategies that delay or control the start of an action. This can be achieved through instructions, i.e., providing participants with specific directions on when to start their movement. This might include explicit verbal instructions, navigation cues, or auditory signals like a beep to prompt the initiation of walking or driving. Alternatively, environmental artifacts or elements within the interaction space (such as strategically placed bus stops or buildings) can serve to naturally delay or obscure a participant’s view, thereby aligning their timing with the study’s needs. These artifacts help create realistic scenarios where participants encounter each other as they naturally would, but within a controlled timing framework.
Another approach to address this challenge is the implementation of what might be termed “time morphing”. This technique involves manipulating the timing of environmental cues that influence participant behavior. For instance, a traffic light could be programmed to change at specific times to orchestrate the desired interactions at intersections. As an example, in a simulated urban traffic study, imagine two drivers approaching an intersection from perpendicular directions. Rather than setting a fixed moment for their encounter, the simulation can continuously monitor each driver’s speed and trajectory. If one driver is slightly behind schedule, the system can subtly delay a traffic light change or adjust a pedestrian crossing cue by a fraction of a second. This slight, real-time modification—i.e., time morphing—ensures both drivers arrive at the intersection simultaneously, thus creating a natural yet controlled interaction scenario without overtly disrupting the simulation’s flow. Essentially, this method does not fix the traffic light in a specific spatial/ physical position in the environment, but rather its spawning location in a virtual world can be adjusted based on the timing necessary to ensure that interactions occur as planned. This kind of manipulation allows for a flexible yet controlled setting where interactions can be orchestrated more naturally without participants feeling overtly controlled or guided. By adjusting environmental factors such as traffic light timings in response to real-time participant movements, researchers can subtly guide participants into the desired interactions, enhancing the study’s naturalism and the reliability of its results.
In our experience, however, these interventions come with trade-offs. The most significant is the impact on naturalism. In timing or forcing interactions, naturalism can only be aspired to, but never reached. When participants are aware of being controlled or feel restricted by the setup, their behavior might not accurately reflect their responses in real-world conditions. This awareness can lead to altered driving or walking behaviors, potentially skewing the study’s results. Thus, while these methods help achieve the desired synchronization of interactions, they must be employed judiciously to maintain a balance between control and natural behavior. When employing vignettes (see Section 4.2), it might be more optimal to use instruction and event-based triggers to actuate the start of an interaction between participants. On the other hand, in long drives, small differences in driving or walking speed magnify over time, and instruction or event-based triggers become increasingly complex and untenable, necessitating the time-morphing approach. Ensuring that the study conditions remain as realistic as possible while achieving the necessary experimental control is a delicate balance that requires careful planning and consideration.

5. Discussion

The methodological framework we present in this paper is designed to help researchers operationalize traffic interaction studies, formalizing key design decisions that might be made and trade-offs that should be considered in selecting scenarios to test.
By laying out these considerations clearly, we aspire to motivate the community to use existing multi-participant simulators [43,44,45,46,47,48,49] to better understand the rich array of dyadic interactions that Markkula et al. outlined [10]. The theoretical contributions of our work are situated within the framework of a clarified understanding and foundation of the practicalities on conducting experimental studies on traffic interactions. This is vital as it offers both quantitative and qualitative structures that are instrumental in advancing the state of the art of research on vehicle automation.
For this discussion, we would like to highlight points about traffic interaction studies that we have come to understand over the course of designing studies in this domain. We offer these as subjective impressions, open to discussion from others in the community, but we note them in part because we are discovering that these types of studies form a major departure in the experimental paradigms many of us have expertise in.

5.1. Interaction Studies vs. Experimental Psychology Studies

Interaction studies in the context of traffic dynamics present a distinct challenge compared to more traditional experimental psychology studies. While experimental psychology typically involves controlled manipulations with a focus on observing and measuring clear, direct responses, the goal in interaction studies in traffic environments should be to elicit data about the behavioral complexities of interactions, as well as their consequences.

5.1.1. Degrees of Freedom

The “openness” in the design of interaction studies is needed to capture natural interaction, and leads to a significantly higher level of complexity due to the numerous variables and the dynamic nature of interactions among participants. Interaction studies inherently include multiple degrees of freedom (see e.g., [50,60]. This creates a rich, complex dataset but also introduces substantial variability that can obscure straightforward cause-and-effect relationships.

5.1.2. Statistical and Analytical Challenges

Given the complexity and interconnected nature of the data collected in traffic interaction studies, simple analytical techniques like ANOVA (Analysis of Variance) commonly used in psychology-style studies may not be sufficient to describe the key insights from an interaction study. These traditional methods often assume independence among variables, an assumption which is frequently violated in interaction studies where multiple factors are inherently interdependent.

5.1.3. Need for Advanced Techniques

To adequately analyze data from interaction studies, more sophisticated statistical techniques or machine learning models are often required. These models can handle multiple inputs and account for interdependencies among variables, making it possible to untangle the complex web of factors that influence behavior. Techniques such as multivariate regression, structural equation modeling, or even deep learning algorithms (see e.g., [61,62,63,64]) could be suitable to address the complexity of interaction studies.

5.1.4. Heuristic vs. Computational Outcomes

The key outcomes of traffic interaction studies might not be the “useful design heuristics” that we have come to expect from psychology-style human factors studies. Rather than a clean predictive explanation of the effect that independent variables will have on dependent variables in similar contexts, we expect more complicated dependencies of behavior; for example, there are many instances of people employing various strategies to avoid interacting, sometimes speeding quickly to the intersection to try to race across before the other car will arrive and then slowing down dramatically on the approach to the intersection when they realize they will not make it across the intersection before the other car will arrive. Computational models offer a more robust framework for exploring these complex, dynamic and interdependent behaviors, and do a better job of providing predictive information that accounts for the myriad interactions occurring within the study. The computational models are as useful and valid as an outcome to an experimental study as a heuristic finding. Employing advanced machine learning techniques—such as multivariate regression, structural equation modeling, or deep learning approaches to better account for the non-linear and high-dimensional aspects of traffic interactions can have potential benefits. While our goal is not to prescribe a specific solution (as the selection of the most appropriate approach should depend on the specific context of each study), it is important highlight that augmenting conventional methods with these advanced techniques could offer deeper insights.

5.2. Regional and Cultural Differences

One objection that might be raised about traffic interaction studies is that the outcomes are likely not to be “generalizable”. It would be surprising if the outcomes of a traffic interaction study run in Dallas, Texas would yield the same outcomes as the same study run in Mumbai, India. However, we point out that this is not a weakness of traffic interaction studies, but rather a strength; traffic interactions in Mumbai are highly different from those in Dallas, Texas, and it is important that AV designers know about and can plan for the difference (see e.g., [65,66,67]. The methodological framework we proposed is designed to focus on the behavioral dynamics inherent to specific “classes” of encounters rather than the contextual specifics of road geometry or cultural norms. For example, while the layout of an intersection may vary between urban settings in North America, Europe, and Asia, the fundamental dynamics of yielding behavior, right-of-way negotiation, and space-sharing conflicts remain consistent within the same class of interaction scenarios [68,69,70,71]. By prioritizing these behavioral dynamics over contextual details, the framework ensures that the novelty effects observed are attributable to the interaction itself rather than to variations in context. Furthermore, the scenarios presented in the framework are intentionally abstracted to be recognizable across diverse cultural and regional settings. This abstraction facilitates cross-contextual applicability while allowing researchers to adapt specific elements—such as road signage, vehicle types, or traffic density—to better reflect the local conditions being studied. Future studies could validate the framework by comparing its outcomes across multiple cultural contexts, exploring how localized adaptations influence interaction behaviors. By emphasizing behavioral universality within distinct encounter classes and supporting regional customization, the framework provides a flexible yet robust tool for studying traffic interactions globally. This approach ensures its relevance and applicability in diverse cultural and geographic settings.

5.3. How Should We Look at Interaction Studies?

The complex ways that people interact in traffic interaction studies might be as interesting to ethnographers or behavioral scientists as AV designers; they show, better than heuristic findings do, some of the richness and variety social scientists are often trying to draw attention to in everyday road life [1,19,72]. These dyadic traffic interaction studies can help us to more adequately discover how often different patterns that social scientists note happen give the contextual priors of scenario.
One limitation of traffic interaction studies is that, while they help elicit how people will respond to different scenarios, they do not address how often any of these scenarios occur. In addition, they are designed to collect a range of “normal” driving interactions and behaviors. However, this work does not necessarily address the question of corner case scenarios or how interactions will unfold in critical incidents [73]. It is necessary to consider traffic interaction studies in conjunction with other research methods, such as critical incident analysis (see e.g., [74], to understand the conditions and outcomes of failures or near-misses, which can provide crucial insights for improving safety and design.

5.4. Balancing Naturalism and Control in VR Environments

While highly controlled environments allow for precise manipulation of variables and reproducibility, they risk diminishing ecological validity if participants perceive the scenarios as overly artificial. To mitigate this, several strategies may be used to simulate naturalistic behaviors effectively within VR environments. First, incorporating realistic environmental elements such as traffic flow, pedestrian density, and dynamic weather conditions can enhance the immersion of participants. Second, using adaptive algorithms to simulate the behavior of non-player agents (e.g., vehicles or pedestrians) ensures that their responses are contextually appropriate and resemble real-world interactions. These strategies help create a believable setting that encourages participants to engage naturally. Additionally, the design of interaction prompts can be subtle to maintain the flow of naturalistic behavior, as elaborated in Section 4.2 and Section 4.4. For example, instead of explicit instructions or scripted cues, participants can encounter interactions organically through well-placed environmental triggers, such as the timing of traffic lights or the positioning of obstacles. This approach minimizes the intrusion of experimental control while preserving the natural dynamics of traffic interactions. The extent of naturalism required depends on the specific research question. Studies focusing on high-level behavioral patterns may tolerate slightly reduced naturalism in exchange for control, while those examining fine-grained decision-making processes may prioritize ecological validity.

5.5. Addressing the Conundrum of Operationalizing Interaction Studies

The choice among these methodologies and trade-offs reflects a fundamental conundrum in traffic behavior research: Should the focus be on rare but critical events, or should it be on the everyday non-critical interactions that constitute the majority of experiences? There is no simple heuristic or one-size-fits-all answer to this question. Instead, the decision on which analytical approach to use should be driven by the specific aims and contexts of the research question. Researchers must consider what they hope to achieve with their analysis. The answer to these questions will guide the choice of setup and analytical methods. By highlighting these options and their trade-offs and offering this framework, we aim to equip researchers with the knowledge to choose the most appropriate method for their specific needs, fostering a deeper understanding of traffic dynamics and contributing to more effective interventions and designs.
Given that it is not possible to address all traffic interaction in one study, the methodologies and insights presented in this paper are intended to enhance the transparency, replicability and quality of traffic interaction research. By simplifying the setup and execution of interaction studies, we make it feasible for more researchers to engage in this field. The framework presented in this paper can aid both authors and reviewers in operationalizing and describing interactions in traffic research. By emphasizing the critical trade-offs and decisions that need to be made for a successful study design, we also urge the authors to report these aspects in their publications. Considering the many trade-offs while designing a traffic interaction study, one concrete prioritization strategy we propose involves adopting a multi-criteria decision analysis (MCDA) framework [75,76]. In this approach, candidate scenarios are evaluated against several criteria that capture both experimental objectives and resource constraints, such as relevance to the research question, feasibility, cost and resource demand, and data richness. By assigning weights to these criteria based on the specific priorities of the study (e.g., if resources are limited, feasibility and cost might be weighted more heavily), each scenario can be scored and ranked. This quantitative scoring system would facilitate transparent and systematic decision-making, ensuring that the selected scenarios maximize scientific impact while remaining within practical limits. Although this proposal is conceptual at this stage, we believe that its empirical application in pilot studies would further refine its utility and effectiveness.
While prior work (e.g., [11,50,51,52]) has provided valuable guidelines for analyzing road user behavior or methodologies focusing on specific scenarios, these approaches often lack a holistic framework for designing controlled studies that address critical experimental decisions and trade-offs. In contrast, our framework bridges this gap by offering a structured approach to operationalizing traffic interaction studies, emphasizing scenario selection, decision-making processes, and strategies for managing experimental constraints.
Our primary contribution of this paper is theoretical and methodological [77], providing a conceptual lens to guide traffic interaction studies. While the framework we present is not intended to serve as an empirical artifact; rather, it aims to act as a tool for organizing and understanding the complexities of dyadic traffic interactions. It outlines the key trade-offs and decisions in designing controlled traffic interaction studies providing a roadmap by articulating key trade-offs and design considerations for designing traffic interaction studies. However, we refrain from than prescribing specific, one-size-fits-all solutions, as the implementation of these trade-offs is inherently dependent on the unique research questions and contextual constraints of each study. Future research could build on our insights by exploring tailored integrations within simulation environments and AV testing protocols, thereby operationalizing these considerations in specific, context-dependent ways.
The examples provided in the appendices outline plausible use cases for the framework, offering initial guidance for its implementation in controlled settings. These examples serve to bridge the gap between theory and practice and lay the groundwork for subsequent empirical investigations. By clarifying the practical steps and considerations involved in setting up interaction studies, we hope to help researchers design and implement their experiments more efficiently and effectively. Overall, our work contributes to the broader field of traffic research by providing tools and concepts that help bridge the gap between theoretical models and real-world applications. This not only aids in understanding current traffic dynamics but also assists in the development of future interventions and improvements in traffic management and safety systems. However, we note that this work is derived from expert brainstorming sessions and does not include an empirical validation at this stage. Future work should involve applying this conceptual guide in real-world experimental settings, thereby scaffolding this guide and providing empirical validation based on actual implementation outcomes. Implementing controlled pilot experiments in both simulated and real-world traffic environments to assess the feasibility of our guide, refine its components, and evaluate its predictive validity can turn this conceptual guide into a robust framework or tool that addresses practical constraints and research objectives, ultimately supporting its integration into broader traffic interaction research.

6. Conclusions

This paper synthesizes interdisciplinary expert insights on operationalizing dyadic traffic interaction studies, highlighting critical trade-offs in scenario selection, interaction approaches, measurement strategies, and timing coordination. Through a conceptual lens, our analysis offers strategic guidance for designing controlled studies that balance depth and generalizability, ultimately enhancing clarity, reproducibility, and informed decision-making in traffic interaction research. The real-world challenges and trade-offs of implementing interaction design studies discussed in this paper provide a reference for the research field, acting as a tool for researchers to help them anticipate potential issues in their study designs and consider how different choices might impact both the feasibility and the scientific validity of their research outcomes. Understanding these dynamics is crucial for anyone looking to conduct rigorous and impactful traffic interaction studies. This can foster a more informed and prepared community of researchers who can anticipate and manage these complexities in their own work. Ultimately, we hope to facilitate smoother, more effective research practices in the field of traffic interaction studies, enhancing both the quality and the efficiency of future investigations.

Author Contributions

Conceptualization: D.D. and W.J.; Methodology: D.D., A.H. and W.J.; Writing—Original Draft: D.D.; Writing—Review & Editing: D.D., A.H. and W.J.; Funding Acquisition: W.J. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Science Foundation under award #2212431—NSF-BSF: HCC: Cultural Differences in Pedestrian-Autonomous Vehicle Interaction.

Institutional Review Board Statement

Not Applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data is contained within the article.

Conflicts of Interest

Azra Habibovic is affiliated to Scania CV AB, although this has not affected the work and no conflict of interest arises from this. The other authors declare no conflicts of interest. The funders had no role in the interpretation of data or the writing of the manuscript.

Appendix A. Scenarios for 2-Actor Driver:Driver (DD) Interaction

Here, we outline different possible interaction use cases for driver:driver and driver:pedestrian interaction.

Appendix A.1. Orthogonal Approach at an Intersection: DD-O Scenarios

Figure A1. Full-factorial design of interactions between two drivers at an intersection where the ego driver (red car) is traveling from the bottom of the images and the other driver (blue car) is approaching orthogonally (i.e., perpendicularly, at a right angle) either from the left or the right.
Figure A1. Full-factorial design of interactions between two drivers at an intersection where the ego driver (red car) is traveling from the bottom of the images and the other driver (blue car) is approaching orthogonally (i.e., perpendicularly, at a right angle) either from the left or the right.
Applsci 15 03738 g0a1

Appendix A.2. Contra-Directional Approach at an Intersection: DD-C Scenarios

Figure A2. Full-factorial design of interactions between two drivers at an intersection where the ego driver (red car) is traveling from the bottom of the images and the other driver (blue car) is approaching contra-directionally (i.e., as oncoming traffic from the opposite road).
Figure A2. Full-factorial design of interactions between two drivers at an intersection where the ego driver (red car) is traveling from the bottom of the images and the other driver (blue car) is approaching contra-directionally (i.e., as oncoming traffic from the opposite road).
Applsci 15 03738 g0a2

Appendix A.3. Parking Lot: DD-P Scenarios

Figure A3. Possible (non-exhaustive) interaction scenarios between two drivers at a parking lot. These scenarios assume that the entrance/exit of the parking lot is at the bottom of the images, and explore some potential maneuvers, some of which lead to space-sharing conflicts. If another entrance/exit is considered from the top of the images, this will lead to a multiplication of the possible interaction scenarios.
Figure A3. Possible (non-exhaustive) interaction scenarios between two drivers at a parking lot. These scenarios assume that the entrance/exit of the parking lot is at the bottom of the images, and explore some potential maneuvers, some of which lead to space-sharing conflicts. If another entrance/exit is considered from the top of the images, this will lead to a multiplication of the possible interaction scenarios.
Applsci 15 03738 g0a3

Appendix A.4. Other Non-Intersection Interactions: DD-X Scenarios

Figure A4. Possible (non-exhaustive) on-road interaction scenarios between two drivers outside of intersections.
Figure A4. Possible (non-exhaustive) on-road interaction scenarios between two drivers outside of intersections.
Applsci 15 03738 g0a4

Appendix B. Scenarios for 2-Actor Driver:Pedestrian (DP) Interactions

Appendix B.1. Orthogonal Approach at an Intersection: DP-O Scenarios

Figure A5. Full-factorial design of interactions between a driver and a pedestrian at an intersection where the ego driver (red car) is traveling from the right of the images and the pedestrian is approaching orthogonally (i.e., perpendicularly, at a right angle) either from the top or the bottom, and either from the side of the intersection closer to or farther from the driver.
Figure A5. Full-factorial design of interactions between a driver and a pedestrian at an intersection where the ego driver (red car) is traveling from the right of the images and the pedestrian is approaching orthogonally (i.e., perpendicularly, at a right angle) either from the top or the bottom, and either from the side of the intersection closer to or farther from the driver.
Applsci 15 03738 g0a5

Appendix B.2. Contra-Directional/Parallel Approach at an Intersection: DP-C Scenarios

Figure A6. Full-factorial design of interactions between a driver and a pedestrian at an intersection where the ego driver (red car) is traveling from the right of the images and the pedestrian is walking along the curb with an intention to cross the road either parallel to or contra-directionally from the driver (i.e., along or opposite to the driver).
Figure A6. Full-factorial design of interactions between a driver and a pedestrian at an intersection where the ego driver (red car) is traveling from the right of the images and the pedestrian is walking along the curb with an intention to cross the road either parallel to or contra-directionally from the driver (i.e., along or opposite to the driver).
Applsci 15 03738 g0a6

Appendix B.3. Parking Lot: DP-P Scenarios

Figure A7. Possible (non-exhaustive) interaction scenarios between a driver and a pedestrian at a parking lot. These scenarios, like the ones between two drivers, assume that the entrance/exit of the parking lot is at the bottom of the images, and explore some potential maneuvers, some of which lead to space-sharing conflicts. If another entrance/exit is considered from the top of the images, this will lead to a multiplication of the possible interaction scenarios.
Figure A7. Possible (non-exhaustive) interaction scenarios between a driver and a pedestrian at a parking lot. These scenarios, like the ones between two drivers, assume that the entrance/exit of the parking lot is at the bottom of the images, and explore some potential maneuvers, some of which lead to space-sharing conflicts. If another entrance/exit is considered from the top of the images, this will lead to a multiplication of the possible interaction scenarios.
Applsci 15 03738 g0a7

Appendix B.4. Other Non-Intersection Interactions: DP-X Scenarios

Figure A8. Possible (non-exhaustive) on-road interaction scenarios between a driver and a pedestrian outside of intersections.
Figure A8. Possible (non-exhaustive) on-road interaction scenarios between a driver and a pedestrian outside of intersections.
Applsci 15 03738 g0a8

References

  1. Brown, B.; Laurier, E. The trouble with autopilots: Assisted and autonomous driving on the social road. In Proceedings of the CHI Conference on Human Factors in Computing Systems, Denver, CO, USA, 6–11 May 2017; Volume 2017, pp. 416–429. [Google Scholar] [CrossRef]
  2. Schieben, A.; Wilbrink, M.; Kettwich, C.; Madigan, R.; Louw, T.; Merat, N. Designing the interaction of automated vehicles with other traffic participants: Design considerations based on human needs and expectations. Cogn. Technol. Work 2019, 21, 69–85. [Google Scholar] [CrossRef]
  3. Millard-Ball, A. Pedestrians, Autonomous Vehicles, and Cities. J. Plan. Educ. Res. 2018, 38, 6–12. [Google Scholar] [CrossRef]
  4. Brooks, R. The Big Problem With Self-Driving Cars Is People. IEEE Spectrum 2017, 27. Available online: https://spectrum.ieee.org/the-big-problem-with-selfdriving-cars-is-people (accessed on 20 March 2025).
  5. Rasouli, A.; Tsotsos, J.K. Autonomous vehicles that interact with pedestrians: A survey of theory and practice. IEEE Trans. Intell. Transp. Syst. 2020, 21, 900–918. [Google Scholar] [CrossRef]
  6. Alambeigi, H.; McDonald, A.D.; Tankasala, S.R. Crash Themes in Automated Vehicles: A Topic Modeling Analysis of the California Department of Motor Vehicles Automated Vehicle Crash Database. In Proceedings of the Transportation Research Board, 99th Annual Meeting, Washington, DC, USA, 12–16 January 2020. [Google Scholar]
  7. Fagnant, D.J.; Kockelman, K. Preparing a nation for autonomous vehicles: Opportunities, barriers and policy recommendations. Transp. Res. Part A Policy Pract. 2015, 77, 167–181. [Google Scholar] [CrossRef]
  8. Piao, J.; McDonald, M.; Hounsell, N.; Graindorge, M.; Graindorge, T.; Malhene, N. Public Views towards Implementation of Automated Vehicles in Urban Areas. Transp. Res. Procedia 2016, 14, 2168–2177. [Google Scholar] [CrossRef]
  9. Dia, H.; Javanshour, F. Autonomous Shared Mobility-On-Demand: Melbourne Pilot Simulation Study. Transp. Res. Procedia 2017, 22, 285–296. [Google Scholar] [CrossRef]
  10. Markkula, G.; Madigan, R.; Nathanael, D.; Portouli, E.; Lee, Y.M.; Dietrich, A.; Billington, J.; Schieben, A.; Merat, N. Defining interactions: A conceptual framework for understanding interactive behaviour in human and automated road traffic. Theor. Issues Ergon. Sci. 2020, 21, 728–752. [Google Scholar] [CrossRef]
  11. Deshmukh, A.; Wang, Z.; Guo, H.; Ammar, D.; Sherony, R.; Feng, F.; Lin, B.; Bao, S.; Zhou, F. A Systematic Review of Challenging Scenarios Involving Automated Vehicles and Vulnerable Road Users. In Proceedings of the Human Factors and Ergonomics Society Annual Meeting, Washington, DC, USA, 23–27 October 2023; Volume 67, pp. 678–685. [Google Scholar] [CrossRef]
  12. Juhlin, O. Traffic behaviour as social interaction-implications for the design of artificial drivers. In Proceedings of the 6th World Congress on Intelligent Transport Systems (ITS), Toronto, ON, Canada, 8–12 November 1999. [Google Scholar]
  13. Toiskallio, K. The impersonal flâneur: Navigation styles of social agents in urban traffic. Space Cult. 2002, 5, 169–184. [Google Scholar]
  14. Normark, D. Enacting Mobility. Studies into the Nature of Road-Related Social Interaction. Ph.D. Thesis, Götesborgs Universitet, Gothenburg, Sweden, 2006. [Google Scholar]
  15. Šucha, M. Road users’ strategies and communication: Driver-pedestrian interaction. Transp. Res. Arena TRA 2014. Available online: https://www.ictct.net/wp-content/uploads/26-Maribor-2013/26-Sucha-Full-paper-2.pdf (accessed on 20 March 2025).
  16. Dey, D.; Terken, J. Pedestrian interaction with vehicles: Roles of explicit and implicit communication. In Proceedings of the 9th International Conference on Automotive User Interfaces and Interactive Vehicular Applications, Oldenburg, Germany, 24–27 September 2017; ACM: New York, NY, USA, 2017; pp. 109–113. [Google Scholar]
  17. Brown, B.; Laurier, E.; Vinkhuyzen, E. Designing Motion: Lessons for Self-driving and Robotic Motion from Human Traffic Interaction. Proc. ACM Hum.-Comput. Interact. 2022, 7, 5. [Google Scholar] [CrossRef]
  18. Normark, D. Tending to mobility: Intensities of staying at the petrol station. Environ. Plan. A 2006, 38, 241–252. [Google Scholar]
  19. Vinkhuyzen, E.; Cefkin, M. Developing socially acceptable autonomous vehicles. In Proceedings of the Ethnographic Praxis in Industry Conference Proceedings; Wiley Online Library: Hoboken, NJ, USA, 2016; Volume 2016, pp. 522–534. [Google Scholar]
  20. Dannefer, W.D. Driving and Symbolic Interaction 1. Sociol. Inq. 1977, 47, 33–38. [Google Scholar]
  21. Factor, R.; Mahalel, D.; Yair, G. The social accident: A theoretical model and a research agenda for studying the influence of social and cultural characteristics on motor vehicle accidents. Accid. Anal. Prev. 2007, 39, 914–921. [Google Scholar]
  22. Dingus, T.A.; Klauer, S.G.; Neale, V.L.; Petersen, A.; Lee, S.E.; Sudweeks, J.; Perez, M.A.; Hankey, J.; Ramsey, D.; Gupta, S. The 100-Car Naturalistic Driving Study, Phase II-Results of the 100-Car Field Experiment; Technical Report; United States Department of Transportation—National Highway Traffic Safety: Washington, DC, USA, 2006.
  23. Hankey, J.M.; Perez, M.A.; McClafferty, J.A. Description of the SHRP 2 Naturalistic Database and the Crash, Near-Crash, and Baseline Data Sets; Technical Report; Virginia Tech Transportation Institute: Blacksburg, VA, USA, 2016. [Google Scholar]
  24. Creaser, J.; Morris, N.; Edwards, C.; Manser, M.; Cooper, J.; Swanson, B.; Donath, M. Teen Driver Support System (TDSS) Field Operational Test; Technical Report; Center for Transportation Studies, University of Minnesota: Minneapolis, MN, USA, 2015. [Google Scholar]
  25. Fridman, L.; Brown, D.E.; Glazer, M.; Angell, W.; Dodd, S.; Jenik, B.; Terwilliger, J.; Kindelsberger, J.; Ding, L.; Seaman, S.; et al. MIT Autonomous Vehicle Technology Study: Large-Scale Deep Learning Based Analysis of Driver Behavior and Interaction with Automation. arXiv 2017, arXiv:1711.06976. [Google Scholar]
  26. Papazikou, E.; Quddus, M.; Thomas, P.; Kidd, D. What came before the crash? An investigation through SHRP2 NDS data. Saf. Sci. 2019, 119, 150–161. [Google Scholar]
  27. Ahmed, M.M.; Ghasemzadeh, A. The impacts of heavy rain on speed and headway behaviors: An investigation using the SHRP2 naturalistic driving study data. Transp. Res. Part C Emerg. Technol. 2018, 91, 371–384. [Google Scholar]
  28. Paullada, A.; Raji, I.D.; Bender, E.M.; Denton, E.; Hanna, A. Data and its (dis) contents: A survey of dataset development and use in machine learning research. Patterns 2021, 2, 100336. [Google Scholar]
  29. Fridman, L.; Brown, D.E.; Glazer, M.; Angell, W.; Dodd, S.; Jenik, B.; Terwilliger, J.; Patsekin, A.; Kindelsberger, J.; Ding, L.; et al. MIT advanced vehicle technology study: Large-scale naturalistic driving study of driver behavior and interaction with automation. IEEE Access 2019, 7, 102021–102038. [Google Scholar]
  30. Domeyer, J.E.; Lee, J.D.; Toyoda, H.; Mehler, B.; Reimer, B. Interdependence in Vehicle-Pedestrian Encounters and Its Implications for Vehicle Automation. IEEE Trans. Intell. Transp. Syst. 2020, 23, 4122–4134. [Google Scholar]
  31. Schöner, H.P. Simulation in development and testing of autonomous vehicles. In 18. Internationales Stuttgarter Symposium; Bargende, M., Reuss, H.C., Wiedemann, J., Eds.; Springer Fachmedien Wiesbaden: Wiesbaden, Germany, 2018; pp. 1083–1095. [Google Scholar]
  32. Dey, A.K. Understanding and using context. Pers. Ubiquitous Comput. 2001, 5, 4–7. [Google Scholar]
  33. Burns, C.G.; Oliveira, L.; Hung, V.; Thomas, P.; Birrell, S. Pedestrian Attitudes to Shared-Space Interactions with Autonomous Vehicles—A Virtual Reality Study. In Applied Human Factors and Ergonomics; Springer: Cham, Switzerland, 2019; pp. 307–316. [Google Scholar]
  34. Holländer, K.; Colley, A.; Mai, C.; Häkkilä, J.; Alt, F.; Pfleging, B. Investigating the Influence of External Car Displays on Pedestrians’ Crossing Behavior in Virtual Reality. In Proceedings of the MobileHCI ’19: Proceedings of the 21st International Conference on Human-Computer Interaction with Mobile Devices and Services, Taipei, Taiwan, 1–4 October 2019. [Google Scholar]
  35. Li, Y.; Dikmen, M.; Hussein, T.G.; Wang, Y.; Burns, C. To cross or not to cross: Urgency-based external warning displays on autonomous vehicles to improve pedestrian crossing safety. In Proceedings of the 10th International Conference on Automotive User Interfaces and Interactive Vehicular Applications, Toronto, ON, Canada, 23–25 September 2018; ACM: New York, NY, USA, 2018; pp. 188–197. [Google Scholar]
  36. Mahadevan, K.; Somanath, S.; Sharlin, E. Communicating awareness and intent in autonomous vehicle-pedestrian interaction. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, Montreal, QC, Canada, 21–26 April 2018; ACM: New York, NY, USA, 2018; p. 429. [Google Scholar]
  37. Moore, D.; Currano, R.; Strack, G.E.; Sirkin, D. The Case for Implicit External Human-Machine Interfaces for Autonomous Vehicles. In Proceedings of the 11th International Conference on Automotive User Interfaces and Interactive Vehicular Applications, Utrecht, The Netherlands, 21–25 September 2019; ACM: New York, NY, USA, 2019; pp. 295–307. [Google Scholar]
  38. Hancock, P.; De Ridder, S. Behavioural accident avoidance science: Understanding response in collision incipient conditions. Ergonomics 2003, 46, 1111–1135. [Google Scholar] [CrossRef]
  39. Mühlbacher, D.; Zimmer, J.; Fischer, F.; Krüger, H.P. The multi-driver simulator—A new concept of driving simulation for the analysis of interactions between several drivers. In Proceedings of the Driving Simulation Conference 2012, Paris, France, 6–7 September 2012; pp. 147–158. [Google Scholar]
  40. Oeltze, K.; Schießl, C. Benefits and challenges of multi-driver simulator studies. IET Intell. Transp. Syst. 2015, 9, 618–625. [Google Scholar] [CrossRef]
  41. Heesen, M.; Baumann, M.; Kelsch, J.; Nause, D.; Friedrich, M. Investigation of cooperative driving behaviour during lane change in a multi-driver simulation environment. In Proceedings of the Human Factors and Ergonomics Society (HFES) Europe Chapter Conference Touluse, Toulouse, France, 10 October 2012; pp. 305–318. [Google Scholar]
  42. Rittger, L.; Muehlbacher, D.; Maag, C.; Kiesel, A. Anger and bother experience when driving with a traffic light assistant: A multi-driver simulator study. In Proceedings of the Human Factors and Ergonomics Society Europe, Groningen, The Netherlands, 14–16 October 2015; pp. 41–51. [Google Scholar]
  43. Goedicke, D.; Zolkov, C.; Friedman, N.; Wise, T.; Parush, A.; Ju, W. Strangers in a Strange Land: New Experimental System for Understanding Driving Culture using VR. IEEE Trans. Veh. Technol. 2022, 71, 3399–3413. [Google Scholar] [CrossRef]
  44. Bazilinskyy, P.; Kooijman, L.; Dodou, D.; De Winter, J.C.F. Coupled simulator for research on the interaction between pedestrians and (automated) vehicles. In Proceedings of the 19th Driving Simulation Conference (DSC), Antibes, France, 9–11 September 2020; Available online: https://repository.tudelft.nl/record/uuid:e14ae256-318d-4889-adba-b0ba1efcca71 (accessed on 20 March 2025).
  45. Bazilinskyy, P.; Kooijman, L.; Dodou, D.; Mallant, K.P.; Roosens, V.E.; Middelweerd, M.D.; Overbeek, L.D.; De Winter, J.C. Get Out of The Way! Examining eHMIs in Critical Driver-Pedestrian Encounters in a Coupled Simulator. In Proceedings of the Main Proceedings—14th International ACM Conference on Automotive User Interfaces and Interactive Vehicular Applications, Seoul, Republic of Korea, 17–20 September 2022; AutomotiveUI 2022. Association for Computing Machinery, Inc.: New York, NY, USA, 2022; pp. 360–371. [Google Scholar] [CrossRef]
  46. Mok, C.S.; Bazilinskyy, P.; de Winter, J. Stopping by looking: A driver-pedestrian interaction study in a coupled simulator using head-mounted displays with eye-tracking. Appl. Ergon. 2022, 105, 103825. [Google Scholar] [CrossRef] [PubMed]
  47. Hübner, M.; Feierle, A.; Rettenmaier, M.; Bengler, K. External communication of automated vehicles in mixed traffic: Addressing the right human interaction partner in multi-agent simulation. Transp. Res. Part F Traffic Psychol. Behav. 2022, 87, 365–378. [Google Scholar] [CrossRef]
  48. Kalantari, A.H.; Yang, Y.; Garcia de Pedro, J.; Lee, Y.M.; Horrobin, A.; Solernou, A.; Holmes, C.; Merat, N.; Markkula, G. Who goes first? A distributed simulator study of vehicle–pedestrian interaction. Accid. Anal. Prev. 2023, 186, 107050. [Google Scholar] [CrossRef]
  49. Kearney, J.K.; Noyce, D.A.; Santiago-Chaparro, K.R.; Chakraborty, S.; Jiang, Y. Multi-Modal Distributed Simulation Combining Cars, Bicyclists, and Pedestrians; Technical Report; Safety Research Using Simulation (SAFER-SIM) University Transportation Center, 2018. [Google Scholar] [CrossRef]
  50. Kircher, K.; Eriksson, O.; Forsman, Å.; Vadeby, A.; Ahlstrom, C. Design and analysis of semi-controlled studies. Transp. Res. Part F Traffic Psychol. Behav. 2017, 46, 404–412. [Google Scholar] [CrossRef]
  51. Kaß, C.; Schoch, S.; Naujoks, F.; Hergeth, S.; Keinath, A.; Neukum, A. Standardized Test Procedure for External Human–Machine Interfaces of Automated Vehicles. Information 2020, 11, 173. [Google Scholar] [CrossRef]
  52. Donmez, B.; McDonald, A.D.; Lee, J.D.; Boyle, L.N. Road user behavior: Describing, inferring, predicting and beyond. Transp. Res. Interdiscip. Perspect. 2023, 22, 100932. [Google Scholar] [CrossRef]
  53. Herslund, M.B.; Jørgensen, N.O. Looked-but-failed-to-see-errors in traffic. Accid. Anal. Prev. 2003, 35, 885–891. [Google Scholar] [CrossRef]
  54. Hart, S.G.; Staveland, L.E. Development of NASA-TLX (Task Load Index): Results of Empirical and Theoretical Research. Adv. Psychol. 1988, 52, 139–183. [Google Scholar] [CrossRef]
  55. Bradley, M.; Lang, P.J. Measuring Emotion: The Self-Assessment Semantic Differential Manikin and the. J. Behav. Ther. Exp. Psychiatry 1994, 25, 49–59. [Google Scholar] [CrossRef] [PubMed]
  56. Endsley, M.R. Direct Measurement of Situation Awareness: Validity and Use of SAGAT. In Situation Awareness Analysis and Measurement; Lawrence Erlbaum Associates: Mahwah, NJ, USA, 2000. [Google Scholar] [CrossRef]
  57. Sirkin, D.; Martelaro, N.; Johns, M.; Ju, W. Toward Measurement of Situation Awareness in Autonomous Vehicles. In Proceedings of the Conference on Human Factors in Computing Systems, Denver, CO, USA, 6–11 May 2017; Association for Computing Machinery: New York, NY, USA, 2017; pp. 405–415. [Google Scholar] [CrossRef]
  58. Durso, F.T.; Hackworth, C.A.; Truitt, T.R.; Crutchfield, J.; Nikolic, D.; Manning, C.A. Situation Awareness as a Predictor of Performance for En Route Air Traffic Controllers. Air Traffic Control Q. 1998, 6, 1–20. [Google Scholar] [CrossRef]
  59. Mollo, V.; Falzon, P. Auto- and allo-confrontation as tools for reflective activities. Appl. Ergon. 2004, 35, 531–540. [Google Scholar] [CrossRef]
  60. Eby, D.W. Chapter 5—Naturalistic Observational Field Techniques for Traffic Psychology Research. In Handbook of Traffic Psychology; Porter, B.E., Ed.; Academic Press: San Diego, CA, USA, 2011; pp. 61–72. [Google Scholar] [CrossRef]
  61. Zhang, C.; Berger, C. Analyzing Factors Influencing Pedestrian Behavior in Urban Traffic Scenarios Using Deep Learning. Transp. Res. Procedia 2023, 72, 1653–1660. [Google Scholar] [CrossRef]
  62. Bichicchi, A.; Belaroussi, R.; Simone, A.; Vignali, V.; Lantieri, C.; Li, X. Analysis of Road-User Interaction by Extraction of Driver Behavior Features Using Deep Learning. IEEE Access 2020, 8, 19638–19645. [Google Scholar] [CrossRef]
  63. De Ceunynck, T.; Pelssers, B.; Bjørnskau, T.; Aasvik, O.; Fyhri, A.; Laureshyn, A.; Johnsson, C.; Hagenzieker, M.; Martensen, H. Interact or counteract? Behavioural observation of interactions between vulnerable road users and autonomous shuttles in Oslo, Norway. Traffic Saf. Res. 2022, 2, 000008. [Google Scholar] [CrossRef]
  64. Lanzaro, G.; Sayed, T.; Fu, C. A comparison of pedestrian behavior in interactions with autonomous and human-driven vehicles: An extreme value theory approach. Transp. Res. Part F Traffic Psychol. Behav. 2023, 99, 1–18. [Google Scholar] [CrossRef]
  65. Sun, X.; Jiang, Y.; Burnett, G.; Bai, J.; Bai, R. A cross-cultural analysis of driving styles for future autonomous vehicles. Adv. Des. Res. 2023, 1, 71–77. [Google Scholar] [CrossRef]
  66. Li, J.; Currano, R.; Sirkin, D.; Goedicke, D.; Tennent, H.; Levine, A.; Evers, V.; Ju, W. On-Road and Online Studies to Investigate Beliefs and Behaviors of Netherlands, US and Mexico Pedestrians Encountering Hidden-Driver Vehicles. In Proceedings of the HRI ’20: ACM/IEEE International Conference on Human-Robot Interaction, Cambridge, UK, 23–26 March 2020; pp. 141–149. [Google Scholar] [CrossRef]
  67. Ranasinghe, C.; Holländer, K.; Currano, R.; Sirkin, D.; Moore, D.; Schneegass, S.; Ju, W. Autonomous Vehicle-Pedestrian Interaction Across Cultures: Towards Designing Better External Human Machine Interfaces (eHMIs). In Proceedings of the Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems, Honolulu, HI, USA, 25–30 April 2020; pp. 1–8. [Google Scholar] [CrossRef]
  68. Colley, M.; Kornmüller, D.; Dey, D.; Ju, W.; Rukzio, E. Longitudinal Effects of External Communication of Automated Vehicles in the USA and Germany: A Comparative Study in Virtual Reality and Via a Browser. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 2024, 8, 1–33. [Google Scholar] [CrossRef]
  69. Klein, N.A.; Sandhaus, H.; Goedicke, D.; Parush, A.; Ju, W. Modeling Social Situation Awareness in Driving Interactions. In Proceedings of the 16th International Conference on Automotive User Interfaces and Interactive Vehicular Applications, Stanford, CA, USA, 22–25 September 2024; pp. 31–41. [Google Scholar] [CrossRef]
  70. Ehrhardt, S.; Merat, N.; Daly, M.; Solernou Crusat, A.; Deml, B. Comparing implicit communication via longitudinal driving dynamics: A cross-cultural study in Germany and the UK. Transp. Res. Part F Traffic Psychol. Behav. 2024, 102, 278–293. [Google Scholar] [CrossRef]
  71. Wang, W.; Cheng, Q.; Li, C.; André, D.; Jiang, X. A cross-cultural analysis of driving behavior under critical situations: A driving simulator study. Transp. Res. Part F Traffic Psychol. Behav. 2019, 62, 483–493. [Google Scholar] [CrossRef]
  72. Cefkin, M.; Zhang, J.; Stayton, E.; Vinkhuyzen, E. Multi-methods Research to Examine External HMI for Highly Automated Vehicles. In Proceedings of the Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), HCI in Mobility, Transport, and Automotive Systems: First International Conference, MobiTAS 2019, Held as Part of the 21st HCI International Conference, HCII 2019, Orlando, FL, USA, 26–31 July 2019; Springer: Berlin, Germany, 2019; Volume 11596, pp. 46–64. [Google Scholar] [CrossRef]
  73. Viergever, R.F. The Critical Incident Technique: Method or Methodology? Qual. Health Res. 2019, 29, 1065–1079. [Google Scholar] [CrossRef]
  74. Habibovic, A.; Tivesten, E.; Uchida, N.; Bärgman, J.; Ljung Aust, M. Driver behavior in car-to-pedestrian incidents: An application of the Driving Reliability and Error Analysis Method (DREAM). Accid. Anal. Prev. 2013, 50, 554–565. [Google Scholar] [CrossRef] [PubMed]
  75. Dean, M. Multi-Criteria Analysis. In Advances in Transport Policy and Planning; Academic Press: Cambridge, MA, USA, 2020; Volume 6, pp. 165–224. [Google Scholar] [CrossRef]
  76. Taherdoost, H.; Madanchian, M. Multi-Criteria Decision Making (MCDM) Methods and Concepts. Encyclopedia 2023, 3, 77–87. [Google Scholar] [CrossRef]
  77. Wobbrock, J.O.; Kientz, J.A. Research contributions in human-computer interaction. Interactions 2016, 23, 38–44. [Google Scholar] [CrossRef]
Figure 1. Virtual Reality (VR)-based multi-agent simulators allow conducting Driver:Driver and Driver:Pedestrian interaction studies where participants can behave with a high degree of freedom and agency in the simulated environment that parallel real world behavior. However, interaction studies need careful scoping of scenarios to systematically distill useful behavioral insights.
Figure 1. Virtual Reality (VR)-based multi-agent simulators allow conducting Driver:Driver and Driver:Pedestrian interaction studies where participants can behave with a high degree of freedom and agency in the simulated environment that parallel real world behavior. However, interaction studies need careful scoping of scenarios to systematically distill useful behavioral insights.
Applsci 15 03738 g001
Figure 2. Within the broader research question of studying traffic interactions, the choice of scenarios, i.e., which interaction setups to include in a study is directly dependent upon technical constraints, such as space, time, number and type of participants, as well as the degree of naturalism sought.
Figure 2. Within the broader research question of studying traffic interactions, the choice of scenarios, i.e., which interaction setups to include in a study is directly dependent upon technical constraints, such as space, time, number and type of participants, as well as the degree of naturalism sought.
Applsci 15 03738 g002
Figure 3. The choice of interaction approach—whether to present the study as vignettes of interaction or a long drive constituting multiple interactions is dependent upon several factors which trade off of each other. These factors can be the influence of disruption that can alter participant behavior as opposed to quality and integrity of data, relying on the memory of the participant to report an experience accurately post hoc as opposed to breaking immersion to get an immediate response, time spent experiencing the interaction as opposed to time spent elaborating and reflecting on the experience, and the comprehensiveness of the data as opposed to the burden (cognitive, physical, time, etc.) placed on the participant.
Figure 3. The choice of interaction approach—whether to present the study as vignettes of interaction or a long drive constituting multiple interactions is dependent upon several factors which trade off of each other. These factors can be the influence of disruption that can alter participant behavior as opposed to quality and integrity of data, relying on the memory of the participant to report an experience accurately post hoc as opposed to breaking immersion to get an immediate response, time spent experiencing the interaction as opposed to time spent elaborating and reflecting on the experience, and the comprehensiveness of the data as opposed to the burden (cognitive, physical, time, etc.) placed on the participant.
Applsci 15 03738 g003
Figure 4. Detailed flowchart guiding the decision-making process for selecting an interaction study design. The flowchart prompts researchers to evaluate whether experimental interruptions (e.g., participant questioning) might disrupt the natural flow of interactions, weighing the trade-offs between immediate interrogation (which can yield more precise recall but risk disrupting immersion) and delayed post-interaction feedback (which preserves naturalism but may suffer from memory decay). Based on considerations such as disruption versus data quality, memory versus immersion, and the balance between interaction time and interrogation depth, the flowchart directs the experimenter toward either adopting isolated vignette scenarios or integrating continuous, naturalistic long drive scenarios. The most important question to begin this decision-making process is by asking whether interrupting the interaction would adversely affect data collection, and if so, a long drive is the only plausible option. If not, it is critical to consider whether an immediate feedback on an interaction is necessary, in which case a vignette-based design is crucial. Otherwise, it is important to assess whether an extended, immersive interaction experience is critical for the research question, in which case a long drive is the desired approach. If not, the researchers must ask themselves as as a contingency whether participant dropouts can lead to a waste of the entire experiment—in that case, a vignette design is the best approach to make use of partial data and optimize resources.
Figure 4. Detailed flowchart guiding the decision-making process for selecting an interaction study design. The flowchart prompts researchers to evaluate whether experimental interruptions (e.g., participant questioning) might disrupt the natural flow of interactions, weighing the trade-offs between immediate interrogation (which can yield more precise recall but risk disrupting immersion) and delayed post-interaction feedback (which preserves naturalism but may suffer from memory decay). Based on considerations such as disruption versus data quality, memory versus immersion, and the balance between interaction time and interrogation depth, the flowchart directs the experimenter toward either adopting isolated vignette scenarios or integrating continuous, naturalistic long drive scenarios. The most important question to begin this decision-making process is by asking whether interrupting the interaction would adversely affect data collection, and if so, a long drive is the only plausible option. If not, it is critical to consider whether an immediate feedback on an interaction is necessary, in which case a vignette-based design is crucial. Otherwise, it is important to assess whether an extended, immersive interaction experience is critical for the research question, in which case a long drive is the desired approach. If not, the researchers must ask themselves as as a contingency whether participant dropouts can lead to a waste of the entire experiment—in that case, a vignette design is the best approach to make use of partial data and optimize resources.
Applsci 15 03738 g004
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Dey, D.; Habibovic, A.; Ju, W. Operationalizing Dyadic Urban Traffic Interaction Studies: From Theory to Practice. Appl. Sci. 2025, 15, 3738. https://doi.org/10.3390/app15073738

AMA Style

Dey D, Habibovic A, Ju W. Operationalizing Dyadic Urban Traffic Interaction Studies: From Theory to Practice. Applied Sciences. 2025; 15(7):3738. https://doi.org/10.3390/app15073738

Chicago/Turabian Style

Dey, Debargha, Azra Habibovic, and Wendy Ju. 2025. "Operationalizing Dyadic Urban Traffic Interaction Studies: From Theory to Practice" Applied Sciences 15, no. 7: 3738. https://doi.org/10.3390/app15073738

APA Style

Dey, D., Habibovic, A., & Ju, W. (2025). Operationalizing Dyadic Urban Traffic Interaction Studies: From Theory to Practice. Applied Sciences, 15(7), 3738. https://doi.org/10.3390/app15073738

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop