Article

How Can Autonomous Vehicles Convey Emotions to Pedestrians? A Review of Emotionally Expressive Non-Humanoid Robots

Design Laboratory, Sydney School of Architecture, Design and Planning, The University of Sydney, Sydney 2006, Australia
* Author to whom correspondence should be addressed.
Multimodal Technol. Interact. 2021, 5(12), 84; https://doi.org/10.3390/mti5120084
Submission received: 29 October 2021 / Revised: 16 December 2021 / Accepted: 16 December 2021 / Published: 20 December 2021
(This article belongs to the Special Issue Feature Papers of MTI in 2021)

Abstract

In recent years, researchers and manufacturers have started to investigate ways of enabling autonomous vehicles (AVs) to interact with nearby pedestrians to compensate for the absence of a human driver. The majority of these efforts focus on external human–machine interfaces (eHMIs), using different modalities, such as light patterns or on-road projections, to communicate the AV’s intent and awareness. In this paper, we investigate the potential role of affective interfaces that convey emotions via eHMIs. To date, little is known about the role that affective interfaces can play in supporting AV–pedestrian interaction. However, emotions have been employed in many smaller social robots, from domestic companions to outdoor aerial robots in the form of drones. To develop a foundation for affective AV–pedestrian interfaces, we reviewed the emotional expressions of non-humanoid robots in 25 articles published between 2011 and 2021. Based on findings from the review, we present a set of considerations for designing affective AV–pedestrian interfaces and highlight avenues for investigating these opportunities in future studies.

1. Introduction

The introduction of autonomy in vehicles promises to increase the level of convenience and comfort for riders [1]. However, the absence of human drivers in fully autonomous systems creates an interaction void where traditional communication strategies between drivers and other road users, such as eye contact and body gestures, used to operate [2]. The responsibility for communicating internal states and intentions in typically short, dynamic traffic scenarios is thus, to a large extent, if not entirely, delegated from drivers to the self-moving vehicles themselves.
Researchers and manufacturers are dedicated to developing additional channels that assist autonomous vehicles (AVs) in conveying their intent and awareness to surrounding road users, especially to pedestrians, who are among the most vulnerable yet most frequently encountered interaction partners [2]. Existing means of supporting safe and intuitive AV–pedestrian interaction are varied, ranging from display technologies, such as LED lighting patterns [3,4] and on-road projections [5,6], to anthropomorphic features, such as moving eyes that follow the pedestrian’s position [7,8] or a smiling expression displayed on the front of the AV [6,8].
While existing solutions have been validated to compensate for the lack of driver–pedestrian communication in different ways, emotions, a vital dimension of human–human interaction [9], have thus far been largely disregarded in this growing area of research. Unlike pragmatic channels, emotions play a unique role in shaping perception, empathy, decision making, and social interactions [10]. In fact, imbuing social robots with emotions is not rare in human–robot interaction (HRI). Prior work describes the ability to convey emotions as one of the indicators of socially interactive robots [11]. Previously designed robots capable of expressing emotions vary in functionality, appearance, and how they articulate emotions in interaction contexts. We reviewed a sub-domain of these robots—emotionally expressive robots with a non-humanoid form—aiming to translate the design paradigms for their emotional expressions into cues that AVs may employ in the affective dimension, as AVs are in essence self-moving, non-humanoid social robots.
The contribution of this paper is twofold. First, we systematically review 25 articles focusing on designing and evaluating emotional expressions for non-humanoid robots in the past ten years. We summarize emotion models, output modalities, and evaluation measures, as well as how users perceived the emotional expressions in these studies. Second, based on the findings from the review, we propose a set of considerations for designing affective AV–pedestrian interfaces by adding emotions as an additional communication dimension. Our findings contribute to enhancing AV–pedestrian interaction and increasing social acceptance for the deployment of AVs in urban environments.

2. Background

2.1. Emotion in Social Robotics

Social robots are utilized in many domains, including healthcare, education, domestic environments, and public spaces [12]. To bring the interactions with humans onto a more natural and engaging level [13,14,15], many of these robots are integrated with the capability of displaying emotions. Eyssel et al. [16] showed that a robot expressing emotional states could make people feel closer to the robot, perceive it as having anthropomorphic traits and intentionality, and experience a more pleasant HRI process. Besides facilitating empathic inferences about internal states and intentions [17], emotions expressed by robots can subsequently elicit affective responses from humans [18,19,20] and impact the immediate environment where the interaction is taking place [21,22]. With the increased perceived sociability [19], emotionally expressive robots can effectively establish closer bonds with humans and, more importantly, increase public trust and acceptance [9,20,22,23] in their deployment in our daily lives.

2.2. Humanoid Robots vs. Non-Humanoid Robots

Many humanoid robots, such as NAO [24] and Pepper [25], can employ readily available anthropomorphic features, such as body movement, facial expression, or speech, to exhibit emotions. While such physical embodiment may increase the recognition level of displayed emotions [17,21], a large class of social robots are tuned to be less anthropomorphic or non-anthropomorphic in order to match the functional requirements of their designated tasks [14,21,26,27]. For example, rescue robots are usually small and tank-like [28], and domestic vacuum robots, such as Roomba, are likely to be puck-shaped so that they can fit under couches [21]. Beyond this utilitarian aspect, implementing emotions using the anthropomorphic features of humanoid robots can not only be expensive and technically complex [14,27,29], but also needs to fulfill users’ expectations of their level of anthropomorphism [17,30] while minimizing the feeling of creepiness known as the “uncanny valley” effect [19,30]. Consequently, an increasing body of literature has focused on designing emotional expressions that allow non-humanoid robots to convey affect. On the one hand, non-humanoid robots are anatomically unable to express emotions the way humans do; on the other, this reduces stereotypes of how emotions should be displayed and thus broadens the range of modalities across visual, auditory, and haptic channels that these robots may employ. In the context of this paper, we are interested in understanding what modalities are used to encode emotions in non-humanoid robots and how well the resulting emotional expressions are identified and perceived by users.

2.3. Current AV–Pedestrian Interaction

With human drivers being replaced by autonomous control systems, one important challenge for the social acceptance of AVs is to communicate their intent and awareness to nearby pedestrians and other vulnerable road users (VRUs) [2,6]. A major direction in exploring effective communication strategies is the development of external human–machine interfaces (eHMIs) [2,6,31]. Current solutions for AV–pedestrian eHMIs are manifold. For example, vehicle-mounted LED lighting patterns are utilized to indicate vehicle modes, the awareness of a nearby pedestrian, or the intention to yield or move [2,3,4]. Studies have also investigated the use of eHMIs to convey messages by displaying pictograms [32] and text [33,34]. On-road projections have been investigated as a way to leverage traffic metaphors, such as crosswalks or stop signs [5,8,35]. In one implementation, the road infrastructure was updated to collect data from AVs and convey the information to pedestrians via a smart road [36]. Some researchers have also experimented with anthropomorphic features to restore current driver–pedestrian interaction patterns. One example is the implementation of moving eyes that follow the position of pedestrians at crosswalks [7,8]. Other studies used a printed hand that waved to indicate yielding [4] or a displayed smile to inform pedestrians that it is safe to cross [6,8].
These studies focused on communicating intent and awareness through operational cues, much like traditional street signage, which is designed to evoke immediate, and sometimes emotional, responses from users and to coordinate actions among road users. However, enabling AVs to express emotions as a communication strategy has not yet been addressed as a primary focus of research. This gap was also identified in a recently published design space for the external communication of AVs [37]. The authors pointed out the need for “affective messages” (i.e., messages related to emotions), since such messages are highly important in interpersonal communication, even if they do not necessarily carry meaning [37].

2.4. Why Ascribe Emotions to AVs

The mass deployment of AVs in daily traffic environments could be hindered by limited social acceptance, which relates not only to technical aspects but to societal factors as well [6]. Using eHMIs to communicate intent and awareness reduces public skepticism by improving pedestrians’ understanding of the machine’s decisions and maneuvers, thereby fostering safe interaction. Yet, overcoming people’s psychological barriers toward the deployment of AVs is not easy, and various forms of discrimination against AVs continue to be reported. For example, local residents harassed and attacked (e.g., threw rocks at) Waymo’s self-driving cars during a public trial because they felt uncomfortable or scared around them [38]. Volvo’s driverless cars were reported to be “easy prey” on the road, bullied by other drivers who slammed on their brakes or drove aggressively to force them into submission [39]. Despite being among the most vulnerable road users, pedestrians were also found to take advantage of AVs’ rule-abiding nature by crossing the road with impunity once they discovered that the cars were self-driving [40,41]. Although they are intelligent agents, AVs are often considered mindless machines following programmed rules or, more generally, a piece of “creepy” technology that breaks the status quo that people are comfortable with [42].
Following findings from studies of social robots in other domains, a promising avenue to address some of these issues is to equip AVs with social traits, such as the ability to express emotions. This has the potential to shift people’s perception of AVs as purely algorithmically driven agents toward intelligent social actors. Indeed, concepts for increasing AVs’ sociability have surfaced in recent years. In 2014, Google announced a driverless car prototype that was intentionally designed like adorable “Marshmallow Bumper Bots” with headlights like wide eyes and a front camera like a button nose, aiming to resemble a living being or a friend [42,43]. Taking an even greater leap, in August 2021, Honda released a lively AV bot serving as both transportation and smart companion, conceptualized for the year 2040 [44]. It has a large frontal face with animated emotional facial expressions and fenders with covered wheels like pet animal legs, radiating “the cute character of a playful puppy” [44]. These efforts are striving to make AVs likable social agents and improve their acceptance by evoking people’s empathy. In line with these concepts, increasing AVs’ emotional expressiveness is likely to enrich their social characteristics [11,19,45] and improve their acceptability [9,19].
Apart from influencing perception and empathic understanding, emotion in HRI is known to regulate and guide human decision making and behavior, biasing the interaction process away from negative or harmful outcomes [10]. For example, a majority of in-car affective human–machine interfaces (HMIs) use expressive cues, such as emotional music, ambient light, or empathic speech, to regulate drivers into an emotionally balanced state and thus promote safe driving behavior [46]. Similarly, AVs expressing emotions through external affective cues may help regulate the traffic climate [47], especially when other road users disagree with the AVs’ decisions (e.g., road rage toward an AV’s maneuver [39]). Close to the strategy of using emotions or affects, some existing AV–pedestrian eHMIs have attempted to convey courtesy through textual messages such as “Thank You”, “You’re Welcome” [33], and “Please” [48], facilitating cooperation between AVs and pedestrians. Hence, along with adjusting people’s preconceptions, AVs’ expressiveness should help regulate and guide the interaction process and eventually contribute to their functioning. This approach follows Picard’s definition of affective computing, which suggests that it is not about making machines look “more emotional” but about making them more effective [10].
The rich literature in affective robotics indicates that emotion encoding in AV–pedestrian interaction is possible. Indeed, humans have an innate tendency to attribute liveliness, emotions, intelligence, and other social characteristics to moving objects [13,21,22,23,29,49]. A close analogue to AVs are drones, also referred to as unmanned aerial vehicles (UAVs): small plane-like flying robots that exhibit rich kinematics and functionalities [9]. As personal drones become more popular and ubiquitous in our daily lives [9,19], emotion encoding in human–drone interaction (HDI) has also been studied in recent years [9,19,50], especially for the purpose of increasing their acceptability [9,19]. Through a systematic review of how drones and other non-humanoid social robots have displayed emotions to the people around them, we aim to identify plausible emotions or affects as well as possible output modalities that could be considered for designing affective AV–pedestrian interfaces.

3. Method

To gain an understanding of current emotionally expressive non-humanoid robots, we reviewed relevant articles from 2011 to 2021 using a systematic search strategy.

3.1. Search Strategy

3.1.1. Database Selection

To identify the most relevant publishers, we first queried Google Scholar, as it covers a broad range of publication sources. We used the Publish or Perish software (version 7) [51] to query Google Scholar and export the search results into a CSV file, as Google Scholar does not provide the functionality to download results in bulk. We searched for “emotion” AND “robot” from 2011 to 2021 and extracted the most relevant 1000 results, the maximum number available for retrieval. We then counted the number of results per publisher, yielding four top publishers (IEEE = 291, Springer = 176, Elsevier = 72, and ACM = 69), followed by Google Patents (31), MDPI (25), and SAGE (22), while all others were below 20.
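As a rough illustration of this publisher tally, the sketch below counts results per publisher from the exported CSV; the file name and the "Publisher" column are assumptions about the Publish or Perish export format rather than details reported in the article.

```python
# Minimal sketch of the publisher tally described above, assuming the
# Publish or Perish CSV export contains a "Publisher" column (the column
# name and file path are illustrative, not taken from the article).
import csv
from collections import Counter

def count_publishers(csv_path: str, top_n: int = 7) -> list[tuple[str, int]]:
    """Count how often each publisher appears among the exported results."""
    counts = Counter()
    with open(csv_path, newline="", encoding="utf-8") as f:
        for row in csv.DictReader(f):
            publisher = (row.get("Publisher") or "").strip()
            if publisher:
                counts[publisher] += 1
    return counts.most_common(top_n)

if __name__ == "__main__":
    for publisher, n in count_publishers("pop_google_scholar_export.csv"):
        print(f"{publisher}: {n}")
```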
We then searched within the databases corresponding to the top four publishers, i.e., IEEE Xplore Digital Library, SpringerLink, ScienceDirect (for Elsevier), and ACM Digital Library, using the same query string and time frame as in the first step. We checked the number of results and excluded SpringerLink, as it yielded over 10,000 results under the “Article” and “Chapter and Conference Paper” content types, which would have made a detailed review infeasible. This left us with three databases for the detailed article search and analysis (ACM = 1427, IEEE = 1103, and ScienceDirect = 3329).

3.1.2. Keyword Search Procedure

To identify potential article candidates for the review, we conducted a keyword search within each of the selected databases. Three main keywords were used: “emotion”, “robot”, and “non-humanoid”. We also included synonyms that are commonly used to describe a non-humanoid appearance for robots: “non-anthropomorphic”, “appearance-constrained”, and “appearance constrained”. We combined these keywords using AND/OR operators and utilized the advanced search feature in each database. We selected a time frame of the last ten years because (1) 87% of the total results fell within the last ten years, and (2) we were interested in understanding recent trends in this growing discipline. The time frame for the final search was from 1 January 2011 to 16 July 2021. The search yielded a total of 225 results, including research articles, posters, books, and other kinds of publications across the three databases (ACM = 77, IEEE = 77, and ScienceDirect = 71).
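For illustration, the boolean combination described above could be assembled as follows; the exact advanced-search syntax differs between ACM Digital Library, IEEE Xplore, and ScienceDirect, so this is a generic sketch rather than the authors' literal query strings.

```python
# Illustrative construction of the boolean keyword query described above;
# treat this as a generic sketch, not the databases' exact advanced-search syntax.
EMOTION_TERM = '"emotion"'
ROBOT_TERM = '"robot"'
FORM_SYNONYMS = [
    '"non-humanoid"',
    '"non-anthropomorphic"',
    '"appearance-constrained"',
    '"appearance constrained"',
]

query = f'{EMOTION_TERM} AND {ROBOT_TERM} AND ({" OR ".join(FORM_SYNONYMS)})'
print(query)
# "emotion" AND "robot" AND ("non-humanoid" OR "non-anthropomorphic"
#  OR "appearance-constrained" OR "appearance constrained")
```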

3.1.3. Article Selection

We chose articles published in conference proceedings and journals that (1) were written in English, (2) used a non-humanoid robot, (3) designed emotional expressions for the robot, and (4) evaluated the emotional expressions with empirical user studies and presented the evaluation results. In a first step, the lead author screened the articles by reading each article’s title, abstract, and full text to determine whether it met the selection criteria. In this process, we removed duplicate articles, as some articles were indexed by more than one publisher. We further excluded articles that proposed approaches for robots to sense or recognize users’ emotions rather than express emotions, as well as articles that included a robot capable of displaying emotions but offered little information on how the emotional expressions were designed. The screening process resulted in the final collection of 25 articles included in our review. As a second step, two of the authors reviewed and discussed the results, mapping them out on a digital whiteboard. No further changes were made to the article selection in that step.

3.2. Research Questions

The review presented in this paper is based on the following research questions.
  • What emotions are commonly expressed by non-humanoid robots?
  • How are the emotions displayed?
  • What measures are used to evaluate the emotional expressions?
  • What are the user perceptions of the emotional expressions?

4. Review of Emotionally Expressive Non-Humanoid Robots

This section presents the systematic review of the 25 articles in response to the research questions in Section 3.2.

4.1. Overview

An overview of the reviewed articles is provided in Table 1. The non-humanoid robots used in these articles varied in morphology and functionality (Figure 1). Eleven (44%) articles adopted readily available robots, either developed in previous work [13,14,15,22,52] or commercially available [9,18,20,28,50,53], while another thirteen (52%) articles designed and prototyped robots for the specific purpose of investigating emotional expressions [17,19,21,23,27,29,45,49,54,55,56,57,58]. The remaining article re-implemented the design of another robot using Lego robot parts [26]. Two articles developed animations instead of physical embodiments to simulate robots (a car seat in [23] and a drone in [19]). Regardless of whether the robots had a designated functionality per se, four robots had a perceivable utilitarian function during the evaluation of emotions, including two voice assistants [20,52], one drone [9], and one car seat [23]. Another two robots served as companions during the studies reported in the articles [18,45].
The robots were ascribed a set of distinct expressive behaviors corresponding to specific emotions. These expressive behaviors were encoded through a variety of output modalities, supporting inferences about the robots’ internal states and broadening the range of feasible cues applicable to designing affective interfaces. Twenty-three (92%) articles investigated the encoding of multiple emotions and evaluated the effectiveness of one or more modalities in displaying the emotions, while the remaining two articles tested the perception and impact of a single emotion [53,56]. To support a more interactive evaluation process, robots in eight (32%) articles were able to express emotions in response to the behaviors of participants [13,15,18,20,21,45,52,56].
The purpose of robotic emotion design in fifteen (60%) articles was to explore the use of a modality or multiple modalities. Eight of these articles aimed to address the feasibility or effectiveness of the chosen modality/modalities in developing the emotional expressions [17,19,21,27,28,50,54,55], while the other seven of the fifteen articles focused on providing design strategies for using the modality/modalities to encode emotions [9,14,23,26,49,57,58]. The remaining ten (40%) articles had a main purpose other than evaluating modalities. Six of them examined the influence of emotional expressions on user cognition [52,53] or behavior [18,29,45,56]. Two articles evaluated the efficacy of their proposed approach for the robot expressing emotions dynamically [13,15]. As for the remaining two articles, one [20] aimed to understand user preference for the robot’s personality traits, and the other [22] explored various contextual factors influencing user perception of the emotional expressions.

4.2. Emotion Models

To understand what emotions are commonly expressed by non-humanoid robots, this section presents the emotion models that guided the selection of emotions in the reviewed articles. We identified three emotion models, which are used to structure this section. Two streams of models are based on previous literature, namely categorical models and dimensional models [13,22,26,49]. We further added emotional personas as a third model since we found that some articles developed emotional personas for the robots and selected emotions in accordance with the personas to articulate corresponding personalities.

4.2.1. Categorical Models

Nineteen (76%) articles encoded categorical emotions (e.g., happiness and sadness). The most commonly employed categorical emotion model was Ekman’s six basic emotions [59], comprising anger, disgust, fear, happiness, sadness, and surprise. These emotions are regarded as essential for human–human communication, easy to understand, and widely recognizable across cultural backgrounds [13,17,19,22]. Besides using a validated psychological model, two studies referred to cultural conventions. Hieida et al. derived anger, joy, pleasure, and sadness from a popular Japanese idiom, ki-do-ai-raku [50], while Cauchard et al. [9] chose emotional states (brave, dopey, grumpy, happy, sad, scared, shy, and sleepy) from personalities found in Walt Disney’s Seven Dwarfs and Peyo’s Smurfs.

4.2.2. Dimensional Models

Some studies argued that discrete emotions cannot cover the full space of emotions, since human emotions comprise not only basic emotions but also subtle variations within each category [26,58]. For instance, joyous, content, and jubilant describe different levels of happiness [13]. Consequently, dimensional approaches were utilized in nine (36%) articles, including three articles [13,28,49] that conducted multiple studies using both categorical and dimensional emotion models. The two most commonly adopted dimensional models were Russell’s circumplex model [60] and Mehrabian’s model [61]. Russell’s circumplex model positions emotions in a two-dimensional space of valence and arousal, where valence refers to the positive or negative connotation of the emotion [22] and arousal refers to its intensity. Mehrabian’s model distributes emotions in a three-dimensional space: pleasure–arousal–dominance (PAD), also known as valence–arousal–dominance, in which the dominance dimension measures the controlling or submissive nature of the emotion [19]. For example, although both anger and fear are negative emotions, the former is perceived as dominant while the latter is submissive.
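To make the dimensional representation concrete, the following minimal sketch places a few emotions at illustrative PAD coordinates (the values are assumptions for demonstration, not figures taken from the reviewed studies) and shows how a point in the space maps back to its nearest labeled emotion.

```python
# A minimal sketch of a dimensional (PAD) representation, with illustrative
# coordinates in [-1, 1] per axis (not values taken from the reviewed studies).
# It shows how two negative emotions, anger and fear, differ on dominance.
import math

PAD = {
    #          pleasure, arousal, dominance
    "happiness": ( 0.8,  0.5,  0.4),
    "sadness":   (-0.6, -0.4, -0.3),
    "anger":     (-0.5,  0.7,  0.6),   # negative valence but dominant
    "fear":      (-0.6,  0.6, -0.7),   # negative valence but submissive
}

def closest_emotion(pleasure: float, arousal: float, dominance: float) -> str:
    """Return the labeled emotion nearest to a point in PAD space."""
    return min(
        PAD,
        key=lambda e: math.dist(PAD[e], (pleasure, arousal, dominance)),
    )

print(closest_emotion(-0.5, 0.6, 0.5))  # -> "anger"
```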

4.2.3. Emotional Personas

Two (8%) articles used emotions to portray personalities for different personas [9,20]. Instead of being presented with emotions directly, participants in these two articles were faced with stereotypical emotional personas [20] through which typical emotions were expressed and discerned. Cauchard et al. [9] integrated selected emotional traits into three representative personas (adventurer, anti-social, and exhausted) for drones. For example, the behaviors of an adventurer drone showed a combination of happiness and bravery. In the work of Whittaker et al. [20], a voice-assisted home robot was assigned three distinct personas (buddy, butler, and sidekick), which were derived from the well-known “Big Five” personality traits [62] and differed in the emotions perceived via speech, intonation, color, and movement when responding to people’s commands.

4.3. Output Modalities

To answer how the emotions are displayed, this section describes the variety of output modalities across visual, auditory, and haptic channels that the reviewed articles used to create affective interfaces through which non-humanoid robots manifest emotions. We classified the modalities into sensory categories according to how they were sensed by users during evaluation.

4.3.1. Visual Modalities

Twenty-three (92%) articles utilized visual modalities including movement, color, and facial expression. Many of the robots presented in the articles used a combination of visual modalities.
The most employed modality was movement, which was found in twenty-two articles. Six articles used movement to encode emotions based on suggestions in prior work [18,20,22,27,49,55]. For instance, for a robot placed in a natural play scenario with young children, Boccanfuso et al. [18] used movements that had previously been reported to be indicative of emotions in children. Tan et al. [49] designed shape-changing movements by reviewing biological motion studies that had demonstrated relations between emotions and shape-changing parameters, such as velocity and orientation. In addition to referencing previous findings directly, four articles created emotional movements based on validated knowledge systems, including Laban’s movement framework [63], adopted in three articles [23,26,50], and the interaction vocabulary [64], used in one article [9]. Furthermore, metaphorical mappings were utilized in four articles [14,17,28,52]. For example, Löffler et al. [17] used conceptual metaphors, such as “joy is up and active” and “anger is hot fluid in a container”, to develop movement patterns. Shi et al. [52] designed emotional movements for text boxes on a smartphone-based voice assistant using affective human body expressions. Besides prescribing movement–emotion mappings in advance, four articles explored how users themselves specified the relationships between emotions and movements [21,28,54,57]. Despite the non-humanoid form of the robots, six articles designed emotional body expressions [13,14,26,45,50,56]. Except for the body expressions for drones in [50], which remained in the realm of mechanical movements, the other five articles followed anthropomorphic or zoomorphic behaviors, such as stretching the “neck” of a lamp-like robot to show curiosity [45] or borrowing emotional behavioral patterns from dogs [14].
Colored lights were used in seven articles. Three articles [17,27,28] mapped one emotion to one color, e.g., anger to red, sadness to blue, and calm to white. Among them, the two articles by Song and Yamada [27,28] generated the encoding based on prior research, while the work of Löffler et al. [17] referred to conceptual metaphors, such as “joy is light and warm” and “fear is darkness”. Additionally, four articles [18,20,22,56] combined multiple visual properties of colored lights to reflect single emotions. For example, Hoggenmueller et al. [22] developed color patterns with animation effects, e.g., blurring and fading green and yellow colors to show disgust, and Boccanfuso et al. [18] used bright and colorful lights with high intensity to convey happiness. To generate the mappings between multiple color properties and emotions, Whittaker et al. conducted an online user study to decide color patterns for different emotional personas, while the other three articles referenced prior work.
Four articles reported the design of facial expressions for conveying emotions [19,29,52,56]. Herdel et al. [19] used the facial action coding system (FACS) to establish associations between basic emotions and animated facial expressions on drones. Shi et al. [52] developed facial expressions for their smartphone-based voice assistant by first collecting well-received cartoon designs and then selecting the final expressions through an online survey. Instead of using multiple facial features, Peng et al. [29] and Frederiksen and Stoy [53] designed only eye patterns for displaying emotions. Peng et al. [29] created eyes with various shapes and colors for their small theater robots and identified the mappings to emotions using online questionnaires, while the animated eyes on a phone mounted on the robot in [53] simply signaled attentiveness and conveyed an apologetic state in a scolding scenario.

4.3.2. Auditory Modalities

Eight (32%) articles employed auditory modalities including non-linguistic utterances (NLUs), music, and vocalizations.
NLUs are usually defined as “robot-like” mechanical sounds (e.g., chirps, beeps, and whirrs) [65] and are often used as affective cues in HCI [27,65]. Five articles included NLUs for cuing emotions [14,17,27,53,56]. For instance, mechanical sounds such as chirps and beeps, varied in tone and rhythm, were associated with different emotions in [14,17,27]. In both articles by Frederiksen and Stoy [53,56], NLUs were used to show single emotions (e.g., alerting audio signals to express fear [53] and augmented naturally occurring sounds of the robot to convey remorse [56]).
Two articles utilized music that had previously been validated to evoke emotions [15,18]. Ritschel et al. [15] used previously proposed melodies to show emotions and intentions and personalized the timbre dynamically according to user preferences. Boccanfuso et al. [18] produced a set of synthesized music to enhance the robot’s emotional expressions in the play environment with young children (e.g., a happy state was conveyed with a piece of music featuring frequent and smooth changes in a moderate to high pitch).
Vocalizations were found in two articles [18,20]. Whittaker et al. [20] implemented human-like speech and intonation to articulate three distinct personas in the voice-assisted home robot, while Boccanfuso et al. [18] added non-linguistic child vocalizations, such as giggling and crying, simply to augment the main sound cue (i.e., music).

4.3.3. Haptic Modalities

Haptic modalities, including haptic movements and textures, were used in four (16%) articles. In one of these articles, the robot was covered in naturalistic fur to mimic furry animals and invite user touch, but the haptic element served only to assist in the evaluation of the primary modality (i.e., breathing behaviors) [54]. The other three articles used haptics as the main cues [55,57,58]. Sato et al. [58] investigated how users mapped combinations of haptic movements (e.g., tapping rapidly or slowly) and textures (e.g., aluminum and clay) to a list of discrete emotions. Chase and Follmer [55] combined haptic movements with visual movements to test the perceived pleasure–arousal–dominance (PAD) of properties such as stiffness and jitter. Kim and Follmer [57] assessed perceived PAD in a swarm of small haptic devices by varying parameters including the number of robots, force types, frequency, and amplitude.

4.4. Evaluation Measures

All of the reviewed articles included user evaluations to assess the quality and impact of the emotional expressions. This section discusses the evaluation measures in terms of use scenarios, experimental tasks, and evaluated aspects.

4.4.1. Use Scenarios

Eleven (44%) articles created use scenarios for the robots displaying emotions in the evaluation. Three of the articles embedded the emotional expressions into the robots’ tasks [9,20,52]. For instance, the drone in [9] displayed emotional profiles during different flying tasks. The two voice assistants in [20,52] conveyed emotional states during tasks activated by users’ spoken commands, e.g., setting a reminder [52] and playing music [20]. Six articles created scenarios specifically for the emotional expressions [23,26,29,45,53,56]. The car seat in [23] showed expressive movements when greeting its driver. Peng et al. designed plots for the robot actors in the robot theater to show contextual emotions [29]. The conversation companion robot in [45] responded to humans’ vocalics during a conflict conversation between couples. The robot in [56] was placed in a scolding scenario in order to convey remorse. Two articles intentionally designed triggers for the emotions to help with users’ understanding, such as a scenario where the robot showed positive emotions after finishing its task successfully [26] and a high-volume explosion sound to evoke the robot’s fearful reactions [53]. Using a more natural context, Boccanfuso et al. placed the robot in an unstructured play environment with young children to elicit their affective responses [18]. Though the robot in [13] expressed emotions without context in the first two studies, it responded to users’ speech in the third one. In the remaining articles, the robots’ emotional expressions were presented without any use case, presumably with the purpose of evaluating the design without bias [21,22].

4.4.2. Experimental Tasks

In seven (28%) articles, participants viewed images or watched pre-recorded or simulated videos in which the robot showcased emotional expressions [13,14,19,23,26,29,50]. Specifically, the videos in [19,23] were simulated animations. Nineteen (76%) articles presented participants with physically embodied robots, including the article by Bretan et al. [13] in which participants were faced with either the physical robot or images/videos. Six of those articles asked participants to only watch the robots displaying emotions [9,17,22,27,28,53], whereas thirteen let participants interact with the robots [13,15,18,20,21,45,49,52,54,55,56,57,58]. The content of the interactions varied across articles. For example, participants were instructed to touch or play with the robots in order to experience the emotional expressions [18,21,49,54,55,57,58]. In the two articles with voice-assisted robots [20,52], participants received emotions from the robot while completing specified tasks using the robot. Furthermore, some robots were able to adjust their emotional responses according to participants’ behaviors through either pre-programmed mechanisms [13,15,18,45,56] or a Wizard-of-Oz procedure [21].

4.4.3. Evaluated Aspects

Twelve (48%) articles gauged the recognition level of the emotional expressions, that is, the rate at which an emotion was correctly recognized from its expression. In nine articles, participants matched presented expressions with a list of emotion names [9,14,17,19,23,26,27,49,58], whereas the other three articles asked participants to rate on scales how well they thought the presented expressions matched the prescribed emotions [13,15,22].
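As a generic illustration of this measure, the sketch below computes per-emotion recognition rates from forced-choice matching trials; the trial data and structure are invented for demonstration and do not reproduce any specific study's protocol.

```python
# Generic sketch of the recognition-rate measure described above: the share of
# trials in which the participant's chosen label matches the intended emotion.
# The trial data below are invented purely for illustration.
from collections import defaultdict

trials = [  # (intended emotion, emotion chosen by participant)
    ("happiness", "happiness"),
    ("happiness", "surprise"),
    ("sadness", "sadness"),
    ("fear", "sadness"),
]

def recognition_rates(pairs):
    """Per-emotion proportion of correctly recognized expressions."""
    shown, correct = defaultdict(int), defaultdict(int)
    for intended, chosen in pairs:
        shown[intended] += 1
        correct[intended] += intended == chosen
    return {emotion: correct[emotion] / shown[emotion] for emotion in shown}

print(recognition_rates(trials))
# {'happiness': 0.5, 'sadness': 1.0, 'fear': 0.0}
```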
Other characteristics of the emotional expressions were also measured. Five articles used the self-assessment mannequin (SAM) scale [66] to measure the perceived valence, arousal, and dominance of the expressions [19,28,49,55,57]. Several articles evaluated valence [50,53,54], arousal (or intensity) [9,19,50,53], and dominance [53] independently. Two articles [21,50] asked participants to rate each emotional expression on a scale of pairs of opposite adjectives, such as “tired vs. energetic” and “hasty vs. leisurely”. Other traits such as emotional and cognitive engagement [52], perceived urgency [57], and the impact of emotions on surroundings [19,56] were also measured. Additionally, some articles assessed the level of anthropomorphism of their robot. For instance, two articles employed HRI metrics for gauging the perceived anthropomorphism, likability and safety of the robots [55,57]. Similarly, social human character traits, such as friendliness, cooperativeness, sociability, etc., as well as participants’ comfort level with the robot were assessed in one of the articles [45].
Three articles analyzed participants’ behaviors during their interaction with the robots using video coding [18,45] or direct observation [21]. Through the analysis of video coding data, Boccanfuso et al. [18] characterized different play patterns and affective responses of young children, and Hoffman et al. [45] collected verbal references to the robot during simulated couple conflicts. In the direct observation procedure in [21], a think-aloud protocol allowed participants to express their feelings and thoughts during their interaction with the moving robot. Moreover, nine articles presented and discussed participants’ comments on their experience with the emotionally expressive robot, drawn either from interviews or from open-ended questions in questionnaires [9,15,19,20,21,22,29,45,54].

4.5. User Perceptions

To discuss user perceptions of the emotional expressions in the reviewed articles, we summarize common findings, including the recognition level of emotions and other important aspects associated with user perceptions.

4.5.1. Recognition of Emotional Expressions

Basic emotions that are relatively obvious and universal [9,49] were best recognized by participants, including happiness [9,14,19,22,26,29], sadness [13,15,19,22], and anger [22,29]. However, sadness was also found to be the least recognizable emotion in three articles [14,26,29]. Two other negatively valenced emotions, fear [13,19] and disgust [19,49], were likewise among the most difficult to identify. This is in line with research in psychology showing that some emotions are more easily recognized than others [19] and that negative emotions tend to be recognized more slowly [19] and interpreted correctly less consistently [23] than positive emotions. Furthermore, emotions such as surprise, coolness, and affirmation, which are believed to be more abstract and to require more context to interpret [22,57], were also rated low in terms of recognition [15,22,23].
When combining emotions with modalities, we found that the interpretation of emotional expressions mostly relied on an intuitive comprehension of the emotion–expression relationship. For example, in most studies where movement was used, participants were likely to associate high speed or high frequency with high-arousal emotions, such as excitement or anger [9,21,22,26,27,49,57], while low speed or avoidance behavior was usually interpreted as conveying less intense emotions such as sadness or fear [22,26,28]. Similarly, participants were able to understand intuitive, conventional mappings for other modalities. For instance, a falling sound or slow tempo was usually interpreted as conveying sadness [15,17,27], and bright, fast-changing colors were commonly associated with joy [17,20,22].

4.5.2. Sociability

Many articles reported that participants attributed liveliness, internal states, and sociability to the emotionally expressive robots. For example, participants in one article [54] referred to the emotional furry robot as having “a rich inner life” and reminding them of pets. In the storytelling section in another article [29], children interpreted motivations, intentions and emotions from the performance of the robots in the robot theater. In [45], the companion robot was perceived as friendly, warm, and capable of forming social bonds and attachments. The drone with facial expressions in [19] was described as an agent with autonomy, consciousness, and cognitive and behavioral abilities.
Nevertheless, disengagement was observed when robots displayed certain emotional expressions. Harris and Sharlin [21] reported that nearly half of their participants showed boredom when presented with slow and repetitive movements. For the voice-assisted home robot in [20], the “sidekick” persona, which had a low-amplitude voice and slow movements, also failed to engage some participants emotionally. Boccanfuso et al. [18] suggested that negative emotions eliciting frustration and annoyance might cause disengagement in children. In their investigation of emotional and cognitive engagement with the emotionally expressive voice assistant, Shi et al. [52] found that emotions with positive valence and high arousal might help robots establish emotional connections with humans.

4.5.3. Contexts

Context was regarded as an important factor influencing users’ interpretation of emotional expressions. In [26], the recognition rate of emotions improved dramatically when emotions were displayed within an appropriate context compared to being displayed alone. Tan et al. [49] also suggested that adding a use scenario could help users identify and disambiguate emotions. To interpret relatively more abstract emotions such as fear and surprise, participants in [57] tended to combine the haptic expressions with other factors, such as motion paths and the contact locations of the haptic stimuli, to obtain more contextual information. Hoggenmueller et al. [22] discussed a range of contextual aspects that impacted users’ comprehension, including spatiotemporal context, interactional context, and contexts related to users’ backgrounds.
Additionally, participants in several articles were found to create narratives for the emotional expressions. For instance, participants in [19] tended to develop stories to make sense of the emotions, such as speculating about the cause of fear and surprise. Children who watched the theater play performed by multiple robots in [29] generated conjectures about the robots’ social relationships based on the displayed emotions. After analyzing participants’ verbal descriptions of the expressive behaviors of the robot, Bucci et al. [54] concluded that narratives about the motivation and situation of the robot could heavily influence the perception of emotional expressions.

5. Considerations

The reviewed studies show rich evidence for designing affective interfaces for non-humanoid robots to communicate emotions. This serves as a foundation and offers guidance for adding an emotional dimension to AV–pedestrian interfaces. In this regard, we aim to provide preliminary considerations around core elements for designing affective AV–pedestrian interfaces. First, we draw on findings from the review to provide a set of considerations for designing emotional expressions for AVs as social robots. Then, based on both the review and current AV–pedestrian communication strategies, we present a set of considerations that take a broader range of factors into account for building affective AV–pedestrian interfaces.

5.1. Considerations for Designing Emotional Expressions

Based on findings from the review of emotional expressions of non-humanoid robots, we propose five considerations for imbuing AVs with emotions, covering both what emotions to communicate and how to communicate them.
Include Basic Emotions: More than half of the reviewed articles employed basic emotions, mostly derived from Ekman’s six emotions [59]. In particular, the user evaluations in these articles showed that happiness, sadness, and anger were the most easily recognized [9,13,14,15,19,22,26,29]. In general, such emotions are regarded as easy to understand, recognizable across different cultural backgrounds, and important for intuitive human–robot interaction [13,17,19,22]. Hence, we suggest that basic emotions or basic emotion models should be considered when deciding what emotions to attribute to AVs.
Use Negative Emotions for a Reason: In articles where negative emotions were triggered for a reason, these emotions made important contributions to shaping user behaviors, e.g., defusing conflict situations when showing fear [45] or remorse [53], and evoking sympathetic behaviors when displaying sadness [18]. Indeed, robot emotions can cause humans to mirror the emotional state of the robot [19,22,53] or to reflect on their own behaviors [19,45,56]. However, when negative emotions were displayed without reasons, users speculated about the cause of the emotions [19], involuntarily interpreted them as positively valenced [22], or even became concerned about their own safety when faced with aggressive emotions [21,23,55]. Therefore, providing reasons for the display of negative emotions can be essential for the intended user interpretation and the subsequent influence on user behaviors.
Provide Contexts for Abstract Emotions: Some reviewed articles showed that emotions such as surprise, disgust, coolness, and affirmation, though some are included in Ekman’s six basic emotions, are more abstract in connotation than universally recognizable emotions (e.g., happiness and sadness) and thus need more context for users to interpret them as intended [15,22,23,57]. When displayed without context, these emotions can appear ambiguous [49], and their interpretation is more strongly biased by the user’s cultural background and previous experiences [22]. Hence, the expected user perception of abstract emotions in AVs can hardly be isolated from an appropriate context, one relating to, but not limited to, the task the AV is performing [26], the immediate surroundings [22], and cultural norms [30].
Combine Multiple Modalities: Several studies compared the recognition rate of emotions between single and multiple modalities [17,27,28]. The results showed that people recognized the multimodal expressions more easily and were more confident in their judgment. Indeed, when multiple modalities are combined to convey a certain emotion, they tend to amplify each other [28] and reassure people of their interpretation [17,27]. Even in some studies where only a single modality was tested, some participants still drew on cues other than the primary modality to support their inferences [19,57]. Hence, using multiple modalities can help clarify the emotional state of the AV and increase users’ confidence in making fast and safe decisions accordingly.
Employ Intuitive Encoding: The review showed that emotions were best interpreted when the encoding followed a conventional and intuitive assignment of expression–emotion relationships, such as using colorful lights or uplifting music for happiness [15,17,18,20,22] and slow movements or dark colors for sadness [17,22,27,28]. Nonetheless, such “intuitive” mappings can be culture-specific [17,28,29,56]. For instance, some studies point to cultural differences in mapping color to emotion [29]. There are also conceptual models or universal associations that depend very little on culture or that can be found across many languages [17,29]. Overall, employing encoding that is intuitive to users is important in AV–pedestrian interaction, as such interaction requires immediate decisions in dynamic traffic situations. Similar concerns can be found in existing AV–pedestrian eHMIs. For example, a participant in [4] commented that during the crossing scenario, they frequently had to look at a sheet specifying the mapping between multiple LED colors and vehicle states, and they pointed out that it would be unrealistic to carry such a sheet in real life. Thus, encoding rules that require a learning process should be used with caution.
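To illustrate how the considerations on multimodality and intuitive encoding might be operationalized, the following hypothetical sketch bundles an emotion with redundant visual, auditory, and motion cues following the conventional associations reported in the review; the specific cue values are illustrative assumptions, not validated eHMI designs.

```python
# Hypothetical sketch of a multimodal emotion-to-cue mapping for an AV eHMI,
# following the conventional associations reported in the review (e.g., bright,
# fast-changing light and an uplifting sound for happiness; slow movement and a
# dark color for sadness). These cue values are illustrative, not validated designs.
from dataclasses import dataclass

@dataclass
class ExpressionCue:
    light_color: str        # LED strip color
    light_animation: str    # e.g., "pulse-fast", "fade-slow"
    sound: str              # non-linguistic utterance or melody
    motion_hint: str        # adjustment layered on the driving behavior

EMOTION_CUES = {
    "happiness": ExpressionCue("warm yellow", "pulse-fast", "rising chirp", "smooth, early deceleration"),
    "sadness":   ExpressionCue("dim blue",    "fade-slow",  "falling tone", "slow, hesitant creep"),
    "fear":      ExpressionCue("dark purple", "flicker",    "low tremolo",  "keep extra distance"),
}

def render(emotion: str) -> ExpressionCue:
    """Look up the redundant multimodal cue bundle for an emotion the AV conveys."""
    return EMOTION_CUES[emotion]

print(render("happiness"))
```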

5.2. Considerations for Building Affective AV–Pedestrian Interfaces

Drawing on the review and taking into account current AV–pedestrian communication strategies, this section presents a set of considerations around a broader range of factors that may contribute to designing affective AV–pedestrian interfaces.
Align with AV’s Primary Functionality: AVs’ primary function (i.e., transportation) is likely to influence people’s interpretation of their social traits during the interaction. In the review, only four robots had a perceivable primary function during user evaluation [9,20,23,52]. However, as shown in [21,22], when interacting with a robot, people are likely to speculate about the functionality or “purpose” of the robot and interpret its emotions accordingly. This suggests that people expect the emotional expressions of a robot to be coherent with its functionality. In the example of a voice-assisted home robot [20], people preferred the smart helper to show conscientiousness and agreeableness through its expressive cues. Therefore, the affective interface should account for its interplay with the AV’s major utilitarian purpose, i.e., serving as a secondary function [22] that facilitates the operation of the primary one.
Understand Pedestrian’s Expectations: Pedestrians’ expectations of AVs’ emotions can differ from those of passengers sitting inside the vehicle or of other road users. AV–pedestrian interaction takes place in varied contexts, involving different interaction content, road types, and other vehicles or road users [67]. If AVs are to use emotions to support the interaction with pedestrians, it is important to consider the legitimacy of the emotions in those interaction contexts from the perspective of pedestrians, that is, to understand what emotions pedestrians expect to see. Similar considerations were reported in human–drone interaction (HDI). For instance, in the design of emotional drones, Cauchard et al. [9] left aside emotions that did not seem applicable to HDI, such as disgust. Herdel et al. [19] also raised the question of whether users would expect certain emotions (e.g., fear) to appear in drones. Emotional patterns in existing driver–pedestrian interactions may provide a vital reference for conjecturing pedestrians’ expectations of AVs’ emotions, as it is essentially the driver’s role that the autonomous system takes over when interacting with pedestrians socially.
Refer to Existing eHMIs: Although there is no empirical evidence yet for which modality is effective for an affective AV–pedestrian interface, efforts made in current eHMIs for communicating AVs’ intent and awareness offer various solutions, such as vehicle-mounted displays [3,4,7,32,34], on-road projections [5,8,35], smart road interfaces [36], and wearable devices [31]. These interfaces provide a range of feasible modalities across visual, auditory, and haptic channels to support the potential communication of emotions. Nevertheless, many previous studies show that regardless of the presence of eHMIs, pedestrians still rely heavily on changes in the movement of AVs (e.g., speed) [2,34,68,69,70]. Thus, an affective interface may prioritize movement cues to encode emotions (e.g., movement-related “gestures” [31]), or use emotions to amplify the intention of movements (e.g., displaying a happy face to show friendliness when the vehicle is yielding). Overall, designers should refer to existing AV–pedestrian eHMIs and their corresponding findings to understand the usability of different types of interfaces when designing for the affective dimension.
Make It a Reciprocal Process: Current eHMIs for conveying intent and awareness to pedestrians are mostly designed in a proactive manner. For example, in a crossing scenario, most visual displays and on-road projections provide information about the deceleration progress of the vehicle [6,34,35] or inform pedestrians when it is safe to cross [5,32]. However, it is also important for AVs to be responsive to pedestrians’ intentions [68]. A recent study by Epke et al. [68] used human gestures and eHMIs to form bi-directional communication between pedestrians and AVs. The study found that participants preferred the condition in which an approaching AV acted in accordance with pedestrians’ hand gestures (i.e., displaying “I SEE YOU” or yielding). More importantly, the communication of emotions is itself a reciprocal process in which emotional responses can be evoked in both parties [19]. Hence, AVs should have the capability to express emotions not only proactively, but also in response to the behaviors of pedestrians or contingencies in their surroundings, as illustrated in the sketch below.
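As a loose illustration of this reciprocal principle, the sketch maps a detected pedestrian behavior to an emotional cue and a driving response; the gesture labels and responses are hypothetical, and the perception side (detecting the gesture) is assumed to exist and is out of scope here.

```python
# Hypothetical sketch of the reciprocal principle: the AV expresses an emotion
# in response to a detected pedestrian behavior rather than only proactively.
# Gesture labels and responses are assumptions for illustration; gesture
# detection itself (the perception stack) is out of scope.
RESPONSES = {
    "hand_raised_to_cross": ("happiness", "yield and display acknowledgement"),
    "hesitating_at_curb":   ("calm",      "slow down and signal awareness"),
    "waving_av_through":    ("gratitude", "proceed and display thanks"),
}

def respond_to_pedestrian(detected_gesture: str) -> tuple[str, str]:
    """Map a detected pedestrian behavior to an emotional cue and an action."""
    return RESPONSES.get(detected_gesture, ("neutral", "maintain current behavior"))

emotion, action = respond_to_pedestrian("hand_raised_to_cross")
print(emotion, "->", action)  # happiness -> yield and display acknowledgement
```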
Account for Contextual Factors: As suggested in previous sections, the display of emotion should be appropriate and plausible within the specific context. However, a number of contextual factors can condition pedestrians’ interpretation of the emotional expression, such as pedestrians’ own background, the environment in which the AV is situated, and the way the AV interacts with pedestrians [22,26,70,71]. For instance, cultural norms can influence how users perceive the emotional expression [71] as well as the appropriateness of the situation in which the emotion is expressed [30]. In the deployment of an emotional self-moving robot on a university campus [22], Hoggenmueller et al. found that participants generally perceived the robot as being in a happy state regardless of what emotion it was displaying, due to the peaceful environment and the robot’s high luminosity. Moreover, weather and road conditions, such as rain and poor lighting, can impact the effectiveness of the AV’s interface [2,70]. Hence, the design of affective AV–pedestrian interfaces should account for various contextual factors, including cultural, environmental, and interactional aspects.
Consider Evaluation Environments: Most AV–pedestrian interaction solutions are evaluated in lab-based simulated environments, since building fully functional prototypes on actual AVs can be expensive, complicated, and sometimes dependent on traffic regulations [67]. On the one hand, simulation-based prototypes, such as virtual reality (VR) simulations, are more feasible and flexible than physical ones deployed in the wild; on the other, they are weaker in terms of ecological validity [72]. Furthermore, VR-based evaluations can be divided into immersive VR (i.e., using wearable equipment to immerse users in the virtual world) and non-immersive VR (i.e., users experiencing the virtual world via a computer screen) [72]; the former provides a stronger sense of the AV’s presence, while the latter is less constrained by hardware and allows for remote user testing. Nevertheless, studies in affective robotics show that the proximity and presence of the robot can impact people’s perceptions of its social intentions [13,23]. Therefore, evaluation environments should be carefully considered and compared, particularly in the combined context of AV–pedestrian interaction and affective robotic design.

6. Conclusions

In this paper, we investigated the potential role of affective interfaces for AVs to communicate emotions to pedestrians. To understand possible solutions to this problem, we systematically reviewed 25 articles on the emotional expressions of non-humanoid robots published in the past ten years. The review summarized core aspects, including emotion models, output modalities, evaluation measures, and user perceptions. Building on findings from the review and taking into account current AV–pedestrian communication strategies, we proposed five considerations for designing emotional expressions for AVs and six considerations regarding a broader range of factors contributing to building affective AV–pedestrian interfaces. Our findings provide a foundation for incorporating an affective dimension into AV–pedestrian eHMIs to increase the social acceptance of AVs, and they highlight avenues for investigating these opportunities in future research.

Author Contributions

Conceptualization, Y.W., L.H. and M.T.; methodology, Y.W., L.H. and M.T.; formal analysis, Y.W.; writing—original draft preparation, Y.W.; writing—review and editing, Y.W., L.H. and M.T.; visualization, Y.W.; supervision, L.H. and M.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Australian Research Council through grant number DP200102604 (Trust and Safety in Autonomous Mobility Systems: A Human-Centered Approach).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Litman, T. Autonomous Vehicle Implementation Predictions; Victoria Transport Policy Institute: Victoria, BC, Canada, 2017.
2. Rasouli, A.; Tsotsos, J.K. Autonomous vehicles that interact with pedestrians: A survey of theory and practice. IEEE Trans. Intell. Transp. Syst. 2019, 21, 900–918.
3. Lagström, T.; Malmsten Lundgren, V. AVIP-Autonomous Vehicles’ Interaction with Pedestrians-An Investigation of Pedestrian-Driver Communication and Development of a Vehicle External Interface. Master’s Thesis, Chalmers University of Technology, Gothenburg, Sweden, 2016.
4. Mahadevan, K.; Somanath, S.; Sharlin, E. Communicating Awareness and Intent in Autonomous Vehicle-Pedestrian Interaction. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems; Association for Computing Machinery: New York, NY, USA, 2018; pp. 1–12.
5. Nguyen, T.T.; Holländer, K.; Hoggenmueller, M.; Parker, C.; Tomitsch, M. Designing for Projection-Based Communication between Autonomous Vehicles and Pedestrians. In Proceedings of the 11th International Conference on Automotive User Interfaces and Interactive Vehicular Applications, Online, 21–25 September 2019; Association for Computing Machinery: New York, NY, USA, 2019; AutomotiveUI ’19; pp. 284–294.
6. Pratticò, F.G.; Lamberti, F.; Cannavò, A.; Morra, L.; Montuschi, P. Comparing State-of-the-Art and Emerging Augmented Reality Interfaces for Autonomous Vehicle-to-Pedestrian Communication. IEEE Trans. Veh. Technol. 2021, 70, 1157–1168.
7. Chang, C.M.; Toda, K.; Sakamoto, D.; Igarashi, T. Eyes on a Car: An Interface Design for Communication between an Autonomous Car and a Pedestrian. In Proceedings of the 9th International Conference on Automotive User Interfaces and Interactive Vehicular Applications; Association for Computing Machinery: New York, NY, USA, 2017; AutomotiveUI ’17; pp. 65–73.
8. Löcken, A.; Golling, C.; Riener, A. How Should Automated Vehicles Interact with Pedestrians? A Comparative Analysis of Interaction Concepts in Virtual Reality. In Proceedings of the 11th International Conference on Automotive User Interfaces and Interactive Vehicular Applications; Association for Computing Machinery: New York, NY, USA, 2019; AutomotiveUI ’19; pp. 262–274.
9. Cauchard, J.R.; Zhai, K.Y.; Spadafora, M.; Landay, J.A. Emotion encoding in human-drone interaction. In Proceedings of the 2016 11th ACM/IEEE International Conference on Human-Robot Interaction (HRI), Christchurch, New Zealand, 7–10 March 2016; pp. 263–270.
10. Picard, R.W. Affective computing: Challenges. Int. J. Hum.-Comput. Stud. 2003, 59, 55–64.
11. Fong, T.; Nourbakhsh, I.; Dautenhahn, K. A survey of socially interactive robots. Robot. Auton. Syst. 2003, 42, 143–166.
12. Leite, I.; Martinho, C.; Paiva, A. Social robots for long-term interaction: A survey. Int. J. Soc. Robot. 2013, 5, 291–308.
13. Bretan, M.; Hoffman, G.; Weinberg, G. Emotionally expressive dynamic physical behaviors in robots. Int. J. Hum.-Comput. Stud. 2015, 78, 1–16.
14. Gácsi, M.; Kis, A.; Faragó, T.; Janiak, M.; Muszyński, R.; Miklósi, Á. Humans attribute emotions to a robot that shows simple behavioural patterns borrowed from dog behaviour. Comput. Hum. Behav. 2016, 59, 411–419.
15. Ritschel, H.; Aslan, I.; Mertes, S.; Seiderer, A.; André, E. Personalized synthesis of intentional and emotional non-verbal sounds for social robots. In Proceedings of the 2019 8th International Conference on Affective Computing and Intelligent Interaction (ACII), Cambridge, UK, 3–6 September 2019; pp. 1–7.
16. Eyssel, F.; Hegel, F.; Horstmann, G.; Wagner, C. Anthropomorphic inferences from emotional nonverbal cues: A case study. In Proceedings of the 19th International Symposium in Robot and Human Interactive Communication, Viareggio, Italy, 13–15 September 2010; pp. 646–651.
17. Löffler, D.; Schmidt, N.; Tscharn, R. Multimodal Expression of Artificial Emotion in Social Robots Using Color, Motion and Sound. In Proceedings of the 2018 ACM/IEEE International Conference on Human-Robot Interaction; Association for Computing Machinery: New York, NY, USA, 2018; HRI ’18; pp. 334–343.
18. Boccanfuso, L.; Kim, E.S.; Snider, J.C.; Wang, Q.; Wall, C.A.; DiNicola, L.; Greco, G.; Flink, L.; Lansiquot, S.; Ventola, P.; et al. Autonomously detecting interaction with an affective robot to explore connection to developmental ability. In Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction (ACII), Xi’an, China, 21–24 September 2015; pp. 1–7.
19. Herdel, V.; Kuzminykh, A.; Hildebrandt, A.; Cauchard, J.R. Drone in Love: Emotional Perception of Facial Expressions on Flying Robots. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems; Association for Computing Machinery: New York, NY, USA, 2021.
20. Whittaker, S.; Rogers, Y.; Petrovskaya, E.; Zhuang, H. Designing Personas for Expressive Robots: Personality in the New Breed of Moving, Speaking, and Colorful Social Home Robots. J. Hum.-Robot Interact. 2021, 10.
21. Harris, J.; Sharlin, E. Exploring the affect of abstract motion in social human-robot interaction. In Proceedings of the 2011 RO-MAN, Atlanta, GA, USA, 31 July–3 August 2011; pp. 441–448.
22. Hoggenmueller, M.; Chen, J.; Hespanhol, L. Emotional Expressions of Non-Humanoid Urban Robots: The Role of Contextual Aspects on Interpretations; Association for Computing Machinery: New York, NY, USA, 2020; PerDis ’20; pp. 87–95.
23. Tennent, H.; Moore, D.; Ju, W. Character Actor: Design and Evaluation of Expressive Robot Car Seat Motion. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 2018, 1, 1–23.
24. Monceaux, J.; Becker, J.; Boudier, C.; Mazel, A. Demonstration: First Steps in Emotional Expression of the Humanoid Robot Nao. In Proceedings of the 2009 International Conference on Multimodal Interfaces; Association for Computing Machinery: New York, NY, USA, 2009; ICMI-MLMI ’09; pp. 235–236.
25. Pandey, A.K.; Gelin, R. A mass-produced sociable humanoid robot: Pepper: The first machine of its kind. IEEE Robot. Autom. Mag. 2018, 25, 40–48.
26. Novikova, J.; Watts, L. A Design Model of Emotional Body Expressions in Non-Humanoid Robots. In Proceedings of the Second International Conference on Human-Agent Interaction; Association for Computing Machinery: New York, NY, USA, 2014; HAI ’14; pp. 353–360.
27. Song, S.; Yamada, S. Expressing Emotions through Color, Sound, and Vibration with an Appearance-Constrained Social Robot. In Proceedings of the 2017 ACM/IEEE International Conference on Human-Robot Interaction; Association for Computing Machinery: New York, NY, USA, 2017; HRI ’17; pp. 2–11.
28. Song, S.; Yamada, S. Designing Expressive Lights and In-Situ Motions for Robots to Express Emotions. In Proceedings of the 6th International Conference on Human-Agent Interaction; Association for Computing Machinery: New York, NY, USA, 2018; HAI ’18; pp. 222–228.
29. Peng, Y.; Feng, Y.L.; Wang, N.; Mi, H. How children interpret robots’ contextual behaviors in live theatre: Gaining insights for multi-robot theatre design. In Proceedings of the 2020 29th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN), Naples, Italy, 31 August–4 September 2020; pp. 327–334.
30. Park, S.; Healey, P.G.T.; Kaniadakis, A. Should Robots Blush? In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems; Association for Computing Machinery: New York, NY, USA, 2021.
31. Dey, D.; Habibovic, A.; Löcken, A.; Wintersberger, P.; Pfleging, B.; Riener, A.; Martens, M.; Terken, J. Taming the eHMI jungle: A classification taxonomy to guide, compare, and assess the design principles of automated vehicles’ external human-machine interfaces. Transp. Res. Interdiscip. Perspect. 2020, 7, 100174.
32. Urmson, C.P.; Mahon, I.J.; Dolgov, D.A.; Zhu, J. Pedestrian Notifications. U.S. Patent 9 196 164 B1, 24 November 2015.
33. Colley, M.; Belz, J.H.; Rukzio, E. Investigating the Effects of Feedback Communication of Autonomous Vehicles. In 13th International Conference on Automotive User Interfaces and Interactive Vehicular Applications; Association for Computing Machinery: New York, NY, USA, 2021; AutomotiveUI ’21; pp. 263–273.
34. Clamann, M.; Aubert, M.; Cummings, M.L. Evaluation of vehicle-to-pedestrian communication displays for autonomous vehicles. In Proceedings of the 96th Annual Transportation Research Board, Washington, DC, USA, 8–12 January 2017.
35. Hesenius, M.; Börsting, I.; Meyer, O.; Gruhn, V. Don’t Panic! Guiding Pedestrians in Autonomous Traffic with Augmented Reality. In Proceedings of the 20th International Conference on Human-Computer Interaction with Mobile Devices and Services Adjunct; Association for Computing Machinery: New York, NY, USA, 2018; MobileHCI ’18; pp. 261–268.
36. Mairs, J. Umbrellium Develops Interactive Road Crossing that Only Appears when Needed. 2017. Available online: https://www.dezeen.com/2017/10/12/umbrellium-develops-interactive-road-crossing-that-only-appears-when-needed-technology/ (accessed on 17 December 2021).
37. Colley, M.; Rukzio, E. A Design Space for External Communication of Autonomous Vehicles. In 12th International Conference on Automotive User Interfaces and Interactive Vehicular Applications; Association for Computing Machinery: New York, NY, USA, 2020; AutomotiveUI ’20; pp. 212–222.
38. Newcomb, A. Humans Harass and Attack Self-Driving Waymo Cars. NBC News. 22 December 2018. Available online: https://www.nbcnews.com/tech/innovation/humans-harass-attack-self-driving-waymo-cars-n950971 (accessed on 11 October 2021).
39. Connor, S. First Self-Driving Cars Will Be Unmarked So That Other Drivers Don’t Try to Bully Them. The Guardian. 30 October 2016. Available online: https://www.theguardian.com/technology/2016/oct/30/volvo-self-driving-car-autonomous (accessed on 11 October 2021).
40. Bazilinskyy, P.; Sakuma, T.; de Winter, J. What driving style makes pedestrians think a passing vehicle is driving automatically? Appl. Ergon. 2021, 95, 103428.
41. Jayaraman, S.K.; Creech, C.; Tilbury, D.M.; Yang, X.J.; Pradhan, A.K.; Tsui, K.M.; Robert, L.P., Jr. Pedestrian trust in automated vehicles: Role of traffic signal and AV driving behavior. Front. Robot. AI 2019, 6, 117.
42. Garber, M. The Revolution Will Be Adorable: Why Google’s Cars Are So Cute. The Atlantic. 29 May 2014. Available online: https://www.theatlantic.com/technology/archive/2014/05/the-revolution-will-be-adorable-why-googles-driverless-cars-are-so-cute/371699/ (accessed on 11 October 2021).
43. D’Onfro, J. Why Google Made Its Self-Driving Car Look So Cute. Business Insider Australia. 24 December 2014. Available online: https://www.businessinsider.com.au/google-self-driving-car-why-its-so-cute-2014-12?r=US&IR=T (accessed on 11 October 2021).
44. Sood, G. Honda 2040 NIKO Comes with A Tiny AI Assistant, Taking the Car from A Vehicle to Your Friend! Yanko Design. 21 August 2021. Available online: https://www.yankodesign.com/2021/08/21/honda-2040-niko-comes-with-a-tiny-ai-assistant-taking-the-car-from-a-vehicle-to-your-friend/ (accessed on 11 October 2021).
45. Hoffman, G.; Zuckerman, O.; Hirschberger, G.; Luria, M.; Shani Sherman, T. Design and Evaluation of a Peripheral Robotic Conversation Companion. In Proceedings of the Tenth Annual ACM/IEEE International Conference on Human-Robot Interaction; Association for Computing Machinery: New York, NY, USA, 2015; HRI ’15; pp. 3–10.
46. Braun, M.; Weber, F.; Alt, F. Affective Automotive User Interfaces–Reviewing the State of Driver Affect Research and Emotion Regulation in the Car. ACM Comput. Surv. 2021, 54.
47. Sadeghian, S.; Hassenzahl, M.; Eckoldt, K. An Exploration of Prosocial Aspects of Communication Cues between Automated Vehicles and Pedestrians. In 12th International Conference on Automotive User Interfaces and Interactive Vehicular Applications; Association for Computing Machinery: New York, NY, USA, 2020; AutomotiveUI ’20; pp. 205–211.
48. Lanzer, M.; Babel, F.; Yan, F.; Zhang, B.; You, F.; Wang, J.; Baumann, M. Designing Communication Strategies of Autonomous Vehicles with Pedestrians: An Intercultural Study. In 12th International Conference on Automotive User Interfaces and Interactive Vehicular Applications; Association for Computing Machinery: New York, NY, USA, 2020; AutomotiveUI ’20; pp. 122–131.
49. Tan, H.; Tiab, J.; Šabanović, S.; Hornbæk, K. Happy Moves, Sad Grooves: Using Theories of Biological Motion and Affect to Design Shape-Changing Interfaces. In Proceedings of the 2016 ACM Conference on Designing Interactive Systems; Association for Computing Machinery: New York, NY, USA, 2016; DIS ’16; pp. 1282–1293.
50. Hieida, C.; Matsuda, H.; Kudoh, S.; Suehiro, T. Action elements of emotional body expressions for flying robots. In Proceedings of the 2016 11th ACM/IEEE International Conference on Human-Robot Interaction (HRI), Christchurch, New Zealand, 7–10 March 2016; pp. 439–440.
51. Harzing, A. Publish or Perish. 2007. Available online: https://harzing.com/resources/publish-or-perish (accessed on 17 December 2021).
52. Shi, Y.; Yan, X.; Ma, X.; Lou, Y.; Cao, N. Designing Emotional Expressions of Conversational States for Voice Assistants: Modality and Engagement. In Extended Abstracts of the 2018 CHI Conference on Human Factors in Computing Systems; Association for Computing Machinery: New York, NY, USA, 2018; CHI EA ’18; pp. 1–6.
53. Frederiksen, M.R.; Stoy, K. On the causality between affective impact and coordinated human-robot reactions. In Proceedings of the 2020 29th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN), Naples, Italy, 31 August–4 September 2020; pp. 488–494.
54. Bucci, P.; Zhang, L.; Cang, X.L.; MacLean, K.E. Is It Happy? Behavioural and Narrative Frame Complexity Impact Perceptions of a Simple Furry Robot’s Emotions. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems; Association for Computing Machinery: New York, NY, USA, 2018; pp. 1–11.
55. Chase, E.D.Z.; Follmer, S. Differences in Haptic and Visual Perception of Expressive 1DoF Motion. In ACM Symposium on Applied Perception 2019; Association for Computing Machinery: New York, NY, USA, 2019; SAP ’19.
56. Frederiksen, M.R.; Stoy, K. Robots can defuse high-intensity conflict situations. In Proceedings of the 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, NV, USA, 25–29 October 2020; pp. 11376–11382.
57. Kim, L.H.; Follmer, S. SwarmHaptics: Haptic Display with Swarm Robots. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems; Association for Computing Machinery: New York, NY, USA, 2019; pp. 1–13.
58. Sato, D.; Sasagawa, M.; Niijima, A. Affective Touch Robots with Changing Textures and Movements. In Proceedings of the 2020 29th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN), Naples, Italy, 31 August–4 September 2020; pp. 1–6.
59. Ekman, P.; Friesen, W.V. Constants across cultures in the face and emotion. J. Personal. Soc. Psychol. 1971, 17, 124.
60. Russell, J.A. A circumplex model of affect. J. Personal. Soc. Psychol. 1980, 39, 1161.
61. Mehrabian, A. Pleasure-arousal-dominance: A general framework for describing and measuring individual differences in temperament. Curr. Psychol. 1996, 14, 261–292.
62. Goldberg, L.R. The development of markers for the Big-Five factor structure. Psychol. Assess. 1992, 4, 26.
63. Sharma, M.; Hildebrandt, D.; Newman, G.; Young, J.E.; Eskicioglu, R. Communicating affect via flight path: Exploring use of the Laban effort system for designing affective locomotion paths. In Proceedings of the 2013 8th ACM/IEEE International Conference on Human-Robot Interaction (HRI), Tokyo, Japan, 3–6 March 2013; pp. 293–300.
64. Lenz, E.; Diefenbach, S.; Hassenzahl, M. Exploring Relationships between Interaction Attributes and Experience. In Proceedings of the 6th International Conference on Designing Pleasurable Products and Interfaces; Association for Computing Machinery: New York, NY, USA, 2013; DPPI ’13; pp. 126–135.
65. Read, R.; Belpaeme, T. People interpret robotic non-linguistic utterances categorically. Int. J. Soc. Robot. 2016, 8, 31–50.
66. Bradley, M.M.; Lang, P.J. Measuring emotion: The self-assessment manikin and the semantic differential. J. Behav. Ther. Exp. Psychiatry 1994, 25, 49–59.
67. Tran, T.T.M.; Parker, C.; Tomitsch, M. A Review of Virtual Reality Studies on Autonomous Vehicle–Pedestrian Interaction. IEEE Trans. Hum.-Mach. Syst. 2021, 51, 641–652.
68. Epke, M.R.; Kooijman, L.; De Winter, J.C. I See Your Gesture: A VR-Based Study of Bidirectional Communication between Pedestrians and Automated Vehicles. J. Adv. Transp. Available online: https://www.hindawi.com/journals/jat/2021/5573560/ (accessed on 27 April 2021).
69. Lee, Y.M.; Madigan, R.; Giles, O.; Garach-Morcillo, L.; Markkula, G.; Fox, C.; Camara, F.; Rothmueller, M.; Vendelbo-Larsen, S.A.; Rasmussen, P.H.; et al. Road users rarely use explicit communication when interacting in today’s traffic: Implications for automated vehicles. Cogn. Technol. Work 2021, 23, 367–380.
70. Pillai, A. Virtual Reality Based Study to Analyse Pedestrian Attitude towards Autonomous Vehicles. Master’s Thesis, KTH Royal Institute of Technology, Stockholm, Sweden, 2017.
71. Fischer, K.; Jung, M.; Jensen, L.C.; aus der Wieschen, M.V. Emotion expression in HRI–when and why. In Proceedings of the 2019 14th ACM/IEEE International Conference on Human-Robot Interaction (HRI), Daegu, Korea, 11–14 March 2019; pp. 29–38.
72. Albastaki, A.; Hoggenmüller, M.; Robinson, F.A.; Hespanhol, L. Augmenting Remote Interviews through Virtual Experience Prototypes; Association for Computing Machinery: New York, NY, USA, 2020; OzCHI ’20; pp. 78–86.
Figure 1. Examples of non-humanoid robots from the reviewed articles [13,19,20,22,23,26,27,29,49,54,55,57].
Table 1. Articles in the review of emotionally expressive non-humanoid robots.

Year | Authors | Robot Prototype | Emotion Models | Output Modalities
2011 | Harris and Sharlin [21] | The Stem | angry, happy, sad, etc. | movement
2014 | Novikova and Watts [26] | a Lego robot based on a Phobot robot’s design | Mehrabian’s model | movement
2015 | Boccanfuso et al. [18] | Sphero | angry, fearful, happy, sad | color, movement, sound
2015 | Bretan et al. [13] | Shimi | Ekman’s basic emotions, Russell’s circumplex model | movement
2015 | Hoffman et al. [45] | Kip1 | calm, curious, scared | movement
2016 | Cauchard et al. [9] | AR.Drone 2.0 by Parrot | personalities from Walt Disney’s Seven Dwarfs and Peyo’s Smurfs | movement
2016 | Gácsi et al. [14] | PeopleBot | Ekman’s basic emotions | movement, sound
2016 | Hieida et al. [50] | “Rolling Spider” drone by Parrot | anger, joy, pleasure, and sadness from a Japanese idiom | movement
2016 | Tan et al. [49] | a shape-changing interface | Ekman’s basic emotions, Mehrabian’s model | movement
2017 | Song and Yamada [27] | Maru | Russell’s circumplex model | color, movement, sound
2018 | Bucci et al. [54] | FlexiBit with a fur cover | emotional valence | haptics, movement
2018 | Löffler et al. [17] | a simple, wheeled robot probe | Ekman’s basic emotions | color, movement, sound
2018 | Shi et al. [52] | a smartphone-based voice assistant | Russell’s circumplex model | facial expression, movement
2018 | Song and Yamada [28] | Roomba | Ekman’s basic emotions, Russell’s circumplex model | color, movement
2018 | Tennent et al. [23] | 3D animation for a robot car seat | aggressive, confident, cool, excited, quirky | movement
2019 | Chase and Follmer [55] | a device with a visible and graspable handle | Mehrabian’s model | haptics, movement
2019 | Kim and Follmer [57] | SwarmHaptics | Ekman’s basic emotions | haptics, movement
2019 | Ritschel et al. [15] | BärBot | happiness, sadness, etc. | sound
2020 | Frederiksen and Stoy [53] | three Thymio II robots | fear | movement, sound
2020 | Frederiksen and Stoy [56] | Affecta | remorse | color, facial expression, movement, sound
2020 | Hoggenmueller et al. [22] | Woodie | Ekman’s basic emotions | color, movement
2020 | Peng et al. [29] | three small robot characters for a robot theater | Ekman’s basic emotions | facial expression, movement
2020 | Sato et al. [58] | tabletop, wheeled haptic robots | Russell’s circumplex model | haptics
2021 | Herdel et al. [19] | animated drone with DJI Phantom 3 body | Ekman’s basic emotions | facial expression
2021 | Whittaker et al. [20] | Olly | “Big Five” personality theory | color, movement, sound
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
