Open Access This article is
- freely available
Electronics 2019, 8(7), 794; https://doi.org/10.3390/electronics8070794
Expressing Personalities of Conversational Agents through Visual and Verbal Feedback
Department of Communication, Seoul National University, Seoul 08826, Korea
Author to whom correspondence should be addressed.
Received: 3 June 2019 / Accepted: 6 July 2019 / Published: 16 July 2019
As the uses of conversational agents increase, the affective and social abilities of agents become important with their functional abilities. Agents that lack affective abilities could frustrate users during interaction. This study applied personality to implement the natural feedback of conversational agents referring to the concept of affective computing. Two types of feedback were used to express conversational agents’ personality: (1) visual feedback and (2) verbal cues. For visual feedback, participants (N = 45) watched visual feedback with different colors and motions. For verbal cues, participants (N = 60) heard different conditions of agents’ voices with different scripts. The results indicated that the motions of visual feedback were more significant than colors. Fast motions could express distinct and positive personalities. Different verbal cues were perceived as different personalities. The perceptions of personalities differed according to the vocal gender. This study provided design implications for personality expressions applicable to diverse interfaces.
Keywords:personality expression; conversational agent; visual feedback; verbal cues; artificial intelligent speakers
Future artificial intelligence (AI) assistants must be more than just question-and-answer machines. Computers need to express and recognize affect and emotions because affective computers can help reduce user frustration during interactions and enable smooth communication between users and computers . Computers with unnatural expressions and reactions could deter human–computer interaction (HCI) and frustrate users. For instance, when users expect their conversational agents to deliver factual information without intense emotions, but conversational agents speak with intense emotions, users will become frustrated. Therefore, the feedback that matches with users’ perceptions of natural human-computer interaction needs to be designed.
Most studies in the field of affective computing have applied emotion to convey an agent’s internal states naturally. However, this research applied the concept of personality to implement more natural feedback for a conversational agent for the following reasons: (1) The concept of personality can allow individualized and complex features to be conveyed simultaneously; (2) it is possible to predict users’ perceptions by giving a personality to conversational agents; and (3) with personality, agents can implement consistent patterns of reactions rather than simple and immediate reactions.
Personalities are not innate for computer systems or interfaces; thus, humans must program them. In addition, they cannot be defined in a single sentence; instead, they are combinations of abilities, beliefs, preferences, dispositions, behaviors, and temperamental features with diverse behavioral and emotional attributes [2,3]. Therefore, the study posits the following general research question: RQ. How can conversational agents effectively express personalities?
Personalities can be expressed with two factors: (1) behavioral features (gestures, movements) and (2) verbal traits (voice, speech style). People with different personalities demonstrate different amplitudes and speeds of gestures and movements. Extroverts demonstrate faster, wider, and broader movements than introverts . In addition, extroverts tend to demonstrate more reactive and faster movements and body gestures than introverts . Personality and verbal traits are also highly interrelated. Extroverts tend to express more emotionality with positive emotions and use fewer formal expressions with agreement and gratitude . Extroverts also demonstrate shorter silences and use more positive words and informal expressions than introverts. Introverts use more abstract words and formal words than extroverts [2,7].
Gestures are difficult to implement with current conversational agents because most of them are designed in the form of AI speakers. Existing conversational agents in the current market, including Amazon Alexa and Google Home, are not able to implement gesture movements. Instead of using gestures, AI speakers deliver simple visual feedback through a smart display in response to voice commands. Considering the current form of conversational agents, visual feedback was chosen as the personality expression element. As people with different personalities demonstrate different gesture speeds, the lighting speed could also be perceived as different personalities. Quick lighting is perceived as more active than slow lighting; therefore, the study posits the following research question: RQ 1. Can different visual feedback be perceived as different personalities?
Most current AI speakers use a consistent voice with the same speech style. As extroverts and introverts demonstrate different verbal traits, different verbal traits are perceived as different personalities. Therefore, the study posits the second research question: RQ 2. Can different verbal cues be perceived as different personalities?
Unlike previous studies, the current study applied a wider range of personalities rather than focusing on expressing only two contrasting personalities, such as extroversion and introversion. In addition, personality was expressed with diverse factors, including visual feedback with different colors and motions and five verbal cues, rather than focusing on a single element.
2. Related Work
2.1. Expressing Internal Machine States in HCI
People treat computers as social actors . Considering the uncertainty of systems, it is difficult to set absolute standards for expressing a machine’s psychological and emotional states. In the field of HCI, machines’ internal states can be represented by diverse elements.
Verbal cues and text are the most direct ways of representing machines’ and computers’ internal states. Voice-related factors, such as volume, frequency range, and speech rate, are associated with the internal states of machines and computers [9,10]. Non-verbal behaviors, such as facial expressions, interpersonal closeness, gestures , and gaze [11,12], of embodied conversational agents were adjusted to demonstrate their social competencies in conversations. In particular, the gaze of the conversational agents were used to deliver back-channel feedback in turn-taking situations . Diverse colors and lights were also used to express the social robots’ emotions [14,15,16], and a simple blinking light was used as a subtle expression to smooth human–robot speech interactions .
A large number of studies have focused on emotion . This study argues that, for long-term HCI, the interface’s personality plays a more crucial role than simple emotional reactions. The current approach for personality expression was adapted from previous affective computing  and expressive output modality studies [15,16]. For instance, similar to prior studies, this study adopted cues to express conversational agents’ internal states, such as personality. Affective computers can both perform better when assisting humans and enhance their decision-making skills.
2.2. Personality Expressions of Computers and Interfaces
Personality can be defined as consistent patterns of feeling, thinking, and behavior . In this study, personality was chosen to express the internal states of conversational agents for the following reasons: First, personality can easily express subtle and complex dimensions in individual differences. Second, personality can provide a wide range of predictions concerning how people may respond to various personalities . In other words, people unconsciously use personality as a tool for assessing social partners . In addition, much literature supports that individuals assess computers according to personality [8,21,22]. Lastly, personality is essential [23,24] in that it can help agents build natural relationships with humans.
Personality can be expressed and perceived in diverse ways because diverse viewpoints exist regarding personality. Personality could be seen as only measurable consistencies in behavior, or it could be seen as perceived consistencies regardless of its actual measurable consistencies. Among many diverse perspectives, the current study argues that personality is expressed and perceived through the interactions between the expectations of observers and the behaviors of the observed .
Most prior studies on a machine’s expressions of personality in particular have focused specifically on expressing two contrasting personalities, such as extroversion/introversion or dominance/submissiveness. Comparatively, few studies of personality expression have applied diverse types of personalities, such as openness and conscientiousness. Among the diverse categorizations of personalities, the Big Five personality traits were applied because they have demonstrated high reliability and validity in psychology and communication studies [25,26,27].
2.3. Combination of Diverse Cues
The two representative environmental factors in affective computing are (1) visual and behavioral elements (including color, light, video, and animations) and (2) audio stimuli (including noise, sound, and music). The current study uses both visual and verbal feedback.
Previous studies effectively examined personalities using diverse interfaces and contexts. Behavioral and linguistic factors and their correlations with personality were observed in virtual environments . A social robot using different postures, gestures, and eye gazes depending on its personality and that of its user was also studied [9,29]. Linguistic cues and their correlations with personality  were also studied. LED lights  were used for the internal expressions of interfaces.
Personality and verbal traits, such as voice pitch, speech speed, wordiness, questioning, and vocal emotionality are highly related. Pitch and speech speed are two key factors of vocal communication  for delivering individual features. Vocal emotionality, which was expressed as the emotional reactions and feedback of the conversational agents in this study, was selected because the agent generating the desired emotional state and personality is important in users’ perceptions of the agent . The questioning element was chosen because the usages of questionnaires determine the perception of voices . Wordiness was chosen with reference to the natural language generator named PERSONAGE , which uses verbosity as the element to express personality dimensions through language. Vocal gender could be an element of a verbal cue because it is meaningful for personal attributes .
Based on previous studies, the current study argues that the personality of interfaces must be expressed using combinations of diverse cues. Therefore, both visual feedback and paralinguistic and verbal cues were adopted to express the conversational agent’s personality.
2.4. Big Five Personality
A number of researchers have examined and established human personality measurements. The Big Five Personality traits are a taxonomy for personality traits . The Big Five personality model classified human personalities into five dimensions. Researchers agree that there are five robust factors of personality that can serve as a meaningful taxonomy for classifying personality attributes . The first dimension is extraversion/introversion, which is associated with being sociable, talkative, and active. The second dimension is related to emotional stability and neuroticism. It is associated with being anxious, depressed, angry, and insecure. Agreeableness and likeability are the third dimension, which is associated with being trusting, good-natured, cooperative, and forgiving. The fourth dimension is conscientiousness, which is related to being careful, thorough, hardworking, and organized. The last dimension is openness to experience, which is related to being imaginative, cultured, intelligent, and artistically sensitive.
Diverse previous studies have applied the Big Five personality taxonomies. Researchers mapped human verbal traits to corresponding nonverbal and verbal behaviors of robots based on the extraversion–introversion personality dimension, which is one element of the Big Five personalities . Another study investigated the role of a robot’s personality in the hands-off therapy process focusing only on the extraversion and introversion personality dimension . A previous study examined the role of task context in perceived social robot personality applying the Big Five personality traits [10,35,36]. They found that extraverted people preferred robots with a similar personality when the robot was a tour guide . Most previous studies that applied the Big Five personality traits focused on the extraversion/introversion dimension. In contrast to previous studies, this study adopted more diverse dimensions of the Big Five personality model for the conversational agent interface.
3.1. Overall Experimental Procedure
A total of 105 participants were recruited for the entire study (Table 1). All participants were recruited from a local community group located in Seoul, South Korea. They were all college students with a similar educational background, and they all had an intermediate level of English. In addition, they had similar residences and similar economic backgrounds.
Participants entered the room and received oral instructions about the study. Divided into groups, they were exposed to different experimental materials (visual feedback or verbal cues) because this research aims to express conversational agents’ personalities both with visual feedback and verbal cues.
A total of 45 participants (20 male, 25 females, mean age = 25, standard deviation of age = 3.6) were recruited for visual feedback. Divided into eight groups, five or six participants watched one visual feedback among eight visual feedback (see Section 3.2). A total of 60 participants (34 male, 26 females, mean age = 23.5, standard deviation of age = 2.5) heard 24 different verbal cues. Divided into five groups, 12 participants heard one condition of verbal cues among five conditions. (see Section 3.3).
After being exposed to experimental materials, they answered survey questions asking which personalities best matched the shown visual feedback and verbal cue voices. The survey questions were the short version of the Big Five personality questionnaires  (Table A1). The Big Five personality traits were applied because they have demonstrated high reliability and validity in psychology and communication studies [24,25,26,27]. In total, 10 questions were asked of participants in the survey.
3.2. Experimental Materials: Conversational Agent
A telepresence robot was used during the entire study because it is suitable for demonstrating both visual feedback and verbal cues (Figure 1). The visual feedback was demonstrated to participants through the telepresence robot’s screen. The verbal cues were played through the speaker of the robot. The static height of the telepresence robot is between 119.36 cm and 149.86 cm. The weight is 6.35 kg including the iPad with 7 kg. The height of the telepresence robot was fixed as 125 cm.
3.3. Experimental Materials: Visual Feedback
The main object of visual feedback is abstract moving circles (Figure 2). The background of the visual feedback was white blank space. The design of the visual feedback was motivated by diverse traits of the Big Five personalities. Considering that the Big Five personality traits are measured with bipolar adjectives , colors with bipolar traits were chosen based on previous studies about colors. Visual feedback was designed with combinations of bipolar or consistent colors. When using multiple colors, colors were only partially blended and overlapped, not entirely mixed. The partial overlapping of colors was motivated by the concept of continuity since each personality is defined as continuous traits between bipolar adjectives.
Among eight visual feedbacks, four visual feedbacks were designed with a single color (Visual feedbacks 2 and 5–7) and the other four visual feedbacks (1, 3, 4, and 8) were designed with combinations of two or three colors (Table 2). Both red and yellow reflect exciting, stimulating, and enlivening properties [38,39]. Considering this, visual feedback 1 was designed with combinations of red and yellow expressing an active personality. Visual feedback 6 used only red to observe the single impact. In contrast to visual feedback 1, visual feedback 2 was designed with a single purple color, which is related to vigorous and unhappy properties [38,40]. Visual feedback 3 combined red, yellow, and dark blue, which are colors with contrasting traits. Dark blue is related to soothing and tender properties , but red and yellow reflect active properties [38,39]. Visual feedback 4 was designed with green and blue, the opposite of visual feedback 3. In contrast with active colors, both green and blue are related to calm, peaceful, and comfortable properties . Visual feedback 5 was designed with a combination of black and white. Black is related to melancholy and unhappy properties , while white is relaxing .
Motion was chosen as the design element of visual feedback because the five personalities have distinctly different levels of expressivity, proactivity, and energy . The radius of circles’ movements was about 4 cm, and circles move clockwise and counterclockwise. All visual feedback moved with the same sequence (Figure 3). The duration of an object’s movement was 30 seconds. The motion size was also adjusted in parallel with the motion speed. The motion speed was adjusted using the Adobe Premiere Pro and After Effects programs, and the movement speed was adjusted between fast (0.35), moderate (0.25), and low (0.15) with the program. Visual feedback with fast motion demonstrated wider movements. Motions were selected with the opposite color traits.
3.4. Experimental Materials: Verbal Cues
Each script was written based on small talk from the chatbot Mitsuku  and current AI speakers such as Google Home and Amazon Alexa. Content that is commonly provided by current AI speakers and chatbots as the default setting was applied to observe the impacts of verbal cues, avoiding a heavy influence from content. Two Oddcast TTS programs  were used for vocal manipulations. Different voices were designed using the PC internal recorder with different scripts. The manipulation of each verbal cue occurred as follows (Table 3).
The pitch level was adjusted using the Audacity program and Oddcast TTS. Using the Amazon Polly female and male voices as the default voices, the vocal pitch level was adjusted. The effects of vocal pitch on the evaluation of a social robot were examined . Exuberant and calm voices were manipulated with different pitch levels. In the case of the male voice, the high pitch was 125.5 Hz, the moderate pitch was 110 Hz, and the low pitch was 98 Hz. In the case of the female voice, the high pitch was 226 Hz, the moderate pitch was 213 Hz, and the low pitch was 200 Hz. The pitch level was adjusted based on previous studies about pitch and personality [31,44]. Script 1 asked a social robot to introduce itself.
Wordiness was manipulated with two levels: high and low. The wordiness was adjusted with the length of the conversational agent’s answers. High wordiness was manipulated as more than 20 words, and the low wordiness condition was manipulated as less than 20 words. Script 2 asked for the name of the robot.
The speed was manipulated with three levels using the Audacity program. For female voices, the high speed was 18.5 times and the low speed was −11 times. For male voices, the high speed was 20 times and the low speed was −11 times. Voices with a moderate speed were not adjusted by the program. Script 3 asked the robot about the weather conditions.
Vocal emotionality was manipulated with two levels. For voices with low emotionality, a default voice was used. For voices with high emotionality, emotion functions were applied through the Oddcast program. Emotional expressions such as "Aha" and "Wow" were included. The Oddcast program without emotion was applied for voices without emotionality. Script 4 gave directions. The participant asked the agent for directions from their home to the subway station.
The questionnaire condition was manipulated with two levels: high and low. The high-level questionnaire was expressed in the format of interrogative sentences. The low-level questionnaire was expressed in the format of declarative sentences.
4.1. Visual Feedback
One-way ANOVA and Tukey post-hoc tests were conducted. The following are the detailed statistical results of the one-way ANOVA (Table 4). For agreeableness, visual feedback 4 (F (4,30) = 14.37, p < 0.001) demonstrated the highest mean value (M = 6.16). For conscientiousness, visual feedback 2 (F (4,30) = 6.12, p < 0.01) demonstrated the highest mean value (M = 5.83). For extraversion, visual feedback 8 demonstrated the highest mean value (Mean = 6.6) (F (4, 30) = 11.01, p < 0.001). For neuroticism, visual feedback 2 demonstrated the highest value (M = 6.167) (F (4, 30) = 6.167, p < 0.0001). For openness, visual feedback 4 demonstrated the highest mean value (M = 3.66) (F (4, 30) = 3.24, p < 0.05).
Based on the results of visual feedback (Table 4), we analyzed personality perceptions depending on the motions and colors of visual feedback. Our initial research purpose was to find personality perceptions depending on visual feedback, not the separate elements of visual feedback (colors, motions). Therefore, in the experiment, we did not separately test these two elements as individual factors. However, in the process of analysis, we discovered that levels of motions influenced the personality perceptions.
Depending on the levels of motions, eight visual feedbacks could be categorized into three groups. Visual feedbacks 3 and 8 were designed with fast motions. Visual feedbacks 1, 4, and 6 were designed with moderate motions. Visual feedbacks 2 and 5 were designed with slow motions (Table 2). We calculated the average mean values of conversational agents’ perceived personalities for each personality depending on the levels of motions (Table 5).
Visual feedback with faster motions was more likely to be perceived as active and likeable personalities, such as extraversion and agreeableness. In the case of extraversion, fast motion showed the highest value (M = 6.25) compared with moderate (M = 3.31) and slow motions (M = 3.25). Agreeableness demonstrated a similar tendency in that faster motions showed higher values. However, slow motions were more likely to be perceived as negative and anxious personalities, including conscientiousness and neuroticism (Figure 4).
4.2. Verbal Cues
To analyze the results of study 2, two-way ANOVA and Tukey post-hoc test were conducted. Female voice with slow speed (M = 5.87, p < 0.05) and voice with low emotionality (M = 5.20, p < 0.005) were likely to be perceived as conscientious compared with other conditions (Table A2). Male voice that questions a lot (M = 4.62, p < 0.05), voice with high emotionality (M = 6.29, p < 0.001), and voice that speaking fast (M = 4.13, p < 0.01) were most perceived as extraversion (Table A3). Female voice with high emotionality (M = 4.95, p < 0.05) were most perceived as openness (Table A4). Male voice speaking fast (M = 3.50, p < 0.01) and low wordiness (M = 3.33, p < 0.001) were most perceived as neuroticism (Table A5). Agreeableness was excluded in the table because verbal cues condition did not show a significant value (Table A6).
Considering the visual feedback results, different motions of visual feedback were highly influential on the perceptions of personalities. Fast, moderate, and slow motions could express different personalities, which answers the first research question of whether visual feedback with different motions could express different personalities. Regardless of colors, visual feedback with fast and moderate-speed motions were perceived as agreeable and kind (agreeableness). Visual feedback with slow motions was perceived as deliberate and careful (conscientiousness). Visual feedback with fast and moderate motions was considered to have a creative and imaginative personality (openness). Visual feedback with fast motions was considered active and sociable (extraversion). Visual feedback with slow motions was considered depressed and anxious (neuroticism). Different motions of visual feedback were perceived as different personalities.
The study suggested that fast motions were appropriate for expressing positive personalities such as agreeableness and extraversion. The traits of extroversion were related to ambition and sociability. Agreeableness can be interpreted as likability, cooperation, social conformity, and love , and these traits are more highly related to positive personalities than the other Big Five personalities. The results supported the previous finding that users’ perceptions of extraversion increased as the motion level increased . Considering that previous studies about motions were conducted with virtual characters’ behavioral movements, this study provides design implications for more diverse formats of appearance constraint agents. The study argues that using slow motions is suitable for personalities that are usually perceived as negative concerning the results of personality perceptions associated with slow motions. The results reveal that slow motions were perceived as neurotic and conscientious. According to the Big Five personality trait study , traits associated with neuroticism were highly related to emotional instability, anxiety, and insecurity. In addition, traits associated with conscientiousness were related to thoroughness and planning, which could be negatively perceived in relationships. Slow motions of visual feedback are required to design conversational agents with negative personalities, such as neuroticism and conscientiousness. This is similar to human–human conversations because actively reactive conversational partners are perceived as having more active and positive personalities than passive and sullen conversational partners .
The study demonstrated that color was not a valid factor for expressing different personality traits. Even though the experimental materials were designed with the viewpoint that colors have different traits, colors do not show any consistent or significant patterns depending on different personality traits. Even though the two studies used experimental materials with diverse colors, such as red, purple, green, and green with blue, consistent patterns of personality perceptions were only demonstrated depending on the motion speed, regardless of color.
The visual feedback results highlight the issue of color subjectivity and objectivity. The subjectivity and objectivity of colors are a highly controversial issue among color scientists. Based on the argument that color perceptions can be organized with objective standards and systems, the relationship between colors and personalities has been widely observed and studied, particularly in human psychology. For instance, personality testing systems, such as the Color Pyramid Test , exist to evaluate personalities depending on different colors. In addition, according to the theory of color , red, yellow, and orange are related to exciting and enlivening features, blue and purple are related to anxious and yearning features, yellow is related to anger, and black is related to depression. Bold colors are more suitable for expressing dominant personalities than submissive personalities .
The current study argues that colors are not suitable for expressing diverse personalities because personal preference is decisive in the perception of colors. The finding of this study that colors are not influential factors is supported by diverse color perception studies. Color eliminativism, which is the notion that physical objects are not colored at all and colors are perceived psychologically rather than objectively, supports the current study’s results . In addition, the color researcher Jean-Philippe Lenclos’s study of geographical color demonstrated that colors could be explained and defined based on geographical, cultural, and geographical conditions , which is highly related to the ecological perspective on color . The correlations between specific colors and psychological factors as neither objective nor complicated ; therefore, more sophisticated color schemes that reflect other design elements should be designed for future studies.
The results could suggest the possibility that the number of colors could influence personality perceptions rather than their hue. In particular, both visual feedbacks 3 and 8 demonstrated significantly high mean values compared with other visual feedback instances in extraversion. Considering that only visual feedbacks 3 and 8 used three colors, while the others used only one or two colors, we suggest that using diverse colors rather than a single color could be suitable for expressing extraversion. However, since this study cannot ignore the impacts of hues, this study’s results do not fully support this argument.
Perceptions of the conversational agent’s personalities differed according to vocal gender. Fast speaking speed was perceived differently depending on the vocal gender. When the male voice spoke fast, it was perceived as emotionally unstable and nervous, but the female voice speaking fast was perceived as sociable and outgoing. Sociable and outgoing personalities are considered more positive than unstable and nervous personalities . Even though the same verbal cue was used, the personality perception differed depending on vocal gender. In particular, people tended to perceive female voices as more positive than male voices. This result is highly related to the previous study findings that people prefer female extraverted voices for assistive social robots for elders [43,51].
The limitation of this study was that the design standard for choosing the colors of the visual feedback was based on previous studies, which could be subjective. Perceptions of colors could be influenced by personal preferences and cultural backgrounds. The findings of this study could be restricted to subjective and personal perceptions. For future studies, participants’ characteristics, such as personalities and gender, could be added as variables. Another limitation is that we did not separately measure the personality perceptions depending on motions and colors because visual feedback was designed with the combination of these two elements. Based on these findings, with more detailed conditions, the impacts of motions could be measured separately for a future study.
This study has explored how a conversational agent’s personalities can be expressed. The current study provided a novel approach to implement the natural feedback of conversational agents with the concept of personality rather than emotion. In addition, rather than focusing on two contrasting personalities, the current study applied wider personality categories. The study sought to implement more natural feedback by applying more diverse personality expression cues.
Visual feedback was used as an experimental material to answer the first research question: "Can different visual feedback be perceived as different personalities?" The results demonstrated that different motions of visual feedback were highly influential on the perception of personality regardless of color. Visual feedback with different motions could express different personalities. Visual feedback with slow motion was perceived as depressed and anxious, while fast motion was perceived as active and sociable. Moderate motion was perceived as having a creative, agreeable, and imaginative personality. Slow motion was perceived as conscientious and neurotic.
Verbal cues were used as experimental materials to answer the second research question: "Can different verbal cues be perceived as different personalities?" The results demonstrated that different verbal cues were perceived as different personalities. Regarding conscientiousness, the male voice with low emotionality and the female voice with a slow and moderate speed were statistically significant. The conversational agent with a voice with high emotionality, the female voice with fast speech, and the male voice asking many questions were perceived as extraverts. The female voice with high emotionality was likely perceived as having an open personality. The male voice speaking fast with low wordiness was perceived as neurotic.
The overall results of the conversational agents could be applied to diverse interfaces designed with smart displays, such as AI speakers, social robots, cars, and Internet of Things environments. In addition, this study applied a wider range of personalities rather than focusing on two contrasting personalities. Furthermore, more diverse elements were applied in personality expressions rather than focusing on simple factors.
Methodology, S.-y.L., G.L. and S.K.; Supervision, J.L.; Writing—original draft, S.-y.L.; Writing—review & editing, S.-y.L., G.L., S.K., J.L.
This research was supported by the National Research Foundation of Korea (NRF) Grant funded by the Korean National Police Agency and the Ministry of Science and ICT for Police field customized research and development project. (NRF-2018M3E2A1081492).
Early in the research, Soohyun Shin helped to develop the initial ideation of the visual feedback design. Also, we worked closely with Seung Hyuk Choe when designing the visual feedback for the study.
Conflicts of Interest
The authors declare no conflict of interest.
Table A1. Survey questionnaires of Big Five Personality used in the experiment.
|How well do the following statements describe the personality of the experimental material? (1 = Disagree strongly, 7 = Agree strongly)|
|2||Is generally trusting.|
|3||Tends to be lazy.|
|4||Is relaxed, handles stress well.|
|5||Has few artistic interests.|
|6||Is outgoing, sociable.|
|7||Tends to find fault with others.|
|8||Does a thorough job.|
|9||Gets nervous easily.|
|10||Has an active imagination.|
Note. Scoring of the Big Five Inventories 10 scales are as follows (R= items are reversed-scored): Extraversion 1R, 6; Agreeableness: 2, 7R; Conscientiousness: 3R, 8; Neuroticism: 4R, 9; Openness: 5R, 10.
Table A2. Results of two-way ANOVA for verbal cue conditions: Conscientiousness.
|Verbal cues||Mean (SD)||df||F-Value||P-Value|
|Voice Gender||Female||4.62 (1.30)||1||0.12||0.72|
|Emotionality||High||4.17 (1.22)||1||8.69||0.00 ***|
|Gender * Emotionality||Female * high||4.29 (1.25)||1||1.16||0.29|
|Male * high||4.04 (1.23)|
|Female * low||5.45 (0.98)|
|Male * low||4.95 (1.32)|
|Gender * Question||Female * High||4.87 (1.22)||1||0.37||0.54|
|Male * High||5.38 (1.07)|
|Female * Low||4.75 (0.94)|
|Male * Low||4.87 (1.05)|
|Gender * Speed||Female * High||5.21 (1.50)||2||4.81||0.01 *|
|Male * High||4.79 (1.66)|
|Female * Moderate||5.54 (0.62)|
|Male * Moderate||5.04 (1.39)|
|Female * Low||5.88 (0.86)|
|Male * Low||3.37 (1.54)|
|Gender* pitch||Female * High||5.71 (0.96)||2||0.79||0.46|
|Male * High||4.63 (1.20)|
|Female * Moderate||5.38 (1.15)|
|Male * Moderate||5.13 (1.28)|
|Female * Low||5.67 (0.86)|
|Male * Low||4.83 (1.48)|
|Gender* Wordiness||Female * High||5.11 (0.4)||1||3.1||0.78|
|Male * High||4.32 (0.23)|
|Female * Low||4.53 (0.15)|
|Male * Low||5.23 (0.42)|
Note: * p < 0.05, ** p < 0.01, *** p < 0.001.
Table A3. Results of two-way ANOVA for verbal cue conditions: Extraversion.
|Verbal cues||Mean (SD)||df||F-Value||P-Value|
|Voice Gender||Female||4.72 (1.67)||1||0.13||0.72|
|Emotionality||High||6.29 (0.69)||1||8.96||0.00 ***|
|Gender * Emotionality||Female * high||6.33 (0.61)||1||1.68||0.2|
|Male * high||6.20 (0.78)|
|Female * low||3.95 (1.03)|
|Male * low||3.25 (0.65)|
|Gender * Question||Female * High||3.08 (1.54)||1||4.62||0.03 *|
|Male * High||4.62 (1.22)|
|Female * Low||3.12 (1.26)|
|Male * Low||3.08 (0.66)|
|Gender * Speed||Female * High||4.29 (0.81)||2||0.15||0.86|
|Male * High||3.95 (0.81)|
|Female * Moderate||3.42 (1.62)|
|Male * Moderate||2.92 (1.31)|
|Female * Low||3.54 (1.21)|
|Male * Low||2.83 (1.19)|
|Gender* pitch||Female * High||4.08 (0.76)||2||0.1||0.89|
|Male * High||4.17 (1.67)|
|Female * Moderate||3.54 (1.39)|
|Male * Moderate||3.67 (1.17)|
|Female * Low||3.38 (1.35)|
|Male * Low||3.17 (1.54)|
|Gender * Wordiness||Female * High||2.83 (1.23)||1||1.49||0.76|
|Male * High||2.45 (0.89)|
|Female * Low||3.08 (0.54)|
|Male * Low||3.21 (0.12)|
Note: * p < 0.05, ** p < 0.01, *** p < 0.001.
Table A4. Results of two-way ANOVA for verbal cue condition: Openness.
|Verbal cues||Mean (SD)||df||F-Value||P-Value|
|Voice Gender||Female||4.22 (1.76)||1||0.01||0.93|
|Gender * Emotionality||Female * high||4.95 (1.43)||1||0.19||0.01*|
|Male * high||4.70 (1.32)|
|Female * low||3.50 (1.81)|
|Male * low||3.66 (1.83)|
|Gender * Question||Female * High||4.79 (1.27)||1||0.03||0.87|
|Male * High||4.04 (1.49)|
|Female * Low||4.63 (1.11)|
|Male * Low||4.79 (1.27)|
|Gender * Speed||Female * High||4.13 (1.28)||2||0.18||0.83|
|Male * High||4.04 (1.64)|
|Female * Moderate||4.29 (1.18)|
|Male * Moderate||3.79 (1.32)|
|Female * Low||4.25 (1.35)|
|Male * Low||3.71 (1.57)|
|Gender* pitch||Female * High||4.04 (1.46)||2||0.77||0.47|
|Male * High||4.83 (1.45)|
|Female * Moderate||4.29 (1.18)|
|Male * Moderate||4.29 (1.32)|
|Female * Low||4.04 (1.09)|
|Male * Low||3.95 (1.51)|
|Gender* Wordiness||Female * High||3.87 (1.84)||1||0.25||0.62|
|Male * High||4.00 (1.52)|
|Female * Low||3.76 (1.23)|
|Male * Low||4.12 (1.76)|
Note: * p < 0.05, ** p < 0.01, *** p < 0.001.
Table A5. Results of two-way ANOVA for verbal cue condition: Neuroticism.
|Verbal cues||Mean (SD)||df||F-Value||P-Value|
|Voice Gender||Female||2.93 (1.14)||1||0.22||0.64|
|Gender * Emotionality||Female * high||2.87 (0.97)||1||1.33||0.26|
|Male * high||3.08 (1.10)|
|Female * low||3.29 (0.94)|
|Male * low||2.79 (1.21)|
|Gender * Question||Female * High||3.58 (1.56)||1||0.48||0.49|
|Male * High||2.79 (1.12)|
|Female * Low||3.29 (1.09)|
|Male * Low||3.00 (0.90)|
|Gender * Speed||Female * High||3.50 (1.31)||2||0.08||0.00 **|
|Male * High||3.54 (1.72)|
|Female * Moderate||2.67 (0.98)|
|Male * Moderate||2.79 (1.01)|
|Female * Low||2.76 (1.03)|
|Male * Low||3.00 (1.55)|
|Gender* pitch||Female * High||2.96 (0.86)||2||0.07||0.92|
|Male * High||3.08 (1.38)|
|Female * Moderate||2.42 (1.26)|
|Male * Moderate||2.71 (1.27)|
|Female * Low||2.75 (1.25)|
|Male * Low||3.17 (1.53)|
|Wordiness||High||2.67 (1.07)||1||3.2||0.00 ***|
|Gender * Wordiness||Female * High||5.33 (1.48)||1||2.2||0.78|
|Male * High||5.24 (0.78)|
|Female * Low||4.16 (0.65)|
|Male * Low||4.77 (0.78)|
Note: * p < 0.05, ** p < 0.01, *** p < 0.001.
Table A6. Results of two-way ANOVA for verbal cue condition: Agreeableness.
|Verbal cues||Mean (SD)||df||F-Value||P-Value|
|Voice Gender||Female||5.04 (1.39)||1||0.12||0.73|
|Gender * Emotionality||Female * high||4.96 (1.51)||1||0.66||0.42|
|Male * high||4.79 (0.86)|
|Female * low||5.12 (1.33)|
|Male * low||5.54 (1.17)|
|Gender * Question||Female * High||5.13 (1.13)||1||0.48||0.49|
|Male * High||5.17 (1.46)|
|Female * Low||5.38 (0.86)|
|Male * Low||5.20 (0.92)|
|Gender * Speed||Female * High||5.50 (1.52)||2||0.19||0.83|
|Male * High||4.79 (1.79)|
|Female * Moderate||5.71 (0.99)|
|Male * Moderate||5.25 (1.37)|
|Female * Low||5.67 (1.13)|
|Male * Low||4.71 (1.51)|
|Gender* pitch||Female * High||5.42 (1.39)||2||1.49||0.23|
|Male * High||5.50 (1.04)|
|Female * Moderate||5.21 (0.96)|
|Male * Moderate||5.71 (1.19)|
|Female * Low||5.79 (0.84)|
|Male * Low||5.13 (1.52)|
|Gender * Wordiness||Female * High||3.88 (1.84)||1||2.3||0.78|
|Male * High||3.75 (0.98)|
|Female * Low||4.00 (1.52)|
|Male * Low||4.15 (0.23)|
Note: * p < 0.05, ** p < 0.01, *** p < 0.001.
- Picard, R.W. Picard Affective computing for HCI. HCI 1999, 1, 829–833. [Google Scholar]
- Mairesse, F.; Walker, M.A. Can Conversational Agents Express Big Five Personality Traits through Language?: Evaluating a Psychologically-Informed Language Generator. Camb. Sheffield. 2009, 1–59. [Google Scholar]
- Pervin, L.A.; John, O.P. Handbook of Personality; Guilford Press: New York, NY, USA, 1999. [Google Scholar]
- Neff, M.; Wang, Y.; Abbott, R.; Walker, M. Evaluating the Effect of Gesture and Language on Personality Perception in Conversational Agents In International Conference on Intelligent Virtual Agents. In Proceedings of the 10th international conference on Intelligent virtual agents, Philadelphia, PA, 20–22 September 2010; pp. 222–235. [Google Scholar]
- Brebner, J. Personality Theory and Movement; Springer: Dordrecht, The Netherlands, 1985; pp. 27–41. [Google Scholar]
- Dewaele, J.M.; Furnham, A. Extraversion: The Unloved Variable in Applied Linguistic Research. Lang. Learn. 2010, 49, 509–544. [Google Scholar] [CrossRef]
- Dewaele, J.M.; Heylighen, F. Variation in the Contextuality of Language: An Empirical Measure. Found. Sci. 2002, 7, 293–340. [Google Scholar]
- Nass, C. Machines and Mindlessness: Social Responses to Computers. J. Soc. Issues 2000, 56, 81–103. [Google Scholar] [CrossRef]
- Aly, A.; Tapus, A. A Model for Synthesizing a Combined Verbal and Nonverbal Behavior Based on Personality Traits in Human-Robot Interaction. In Proceedings of the 8th ACM/IEEE international conference on Human-robot interaction, Tokyo, Japan, 3–6 March 2013; pp. 325–332. [Google Scholar]
- Lee, S.; Kim, S.; Lee, G.; Lee, J. Robots in Diverse Contexts: Effects of Robots Tasks on Expected Personality. In Proceedings of the 2018 ACM/IEEE International Conference on Human-Robot Interaction, Chicago, IL, USA, 5–8 March 2018; pp. 169–170. [Google Scholar]
- Bickmore, T.; Cassell, J. Relational Agents: A Model and Implementation of Building User Trust. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, Seattle, Washington, USA, 1 March 2001; pp. 396–403. [Google Scholar]
- Oliveira, R.; Arriaga, P.; Alves-Oliveira, P.; Correia, F.; Petisca, S.; Paiva, A. Friends or Foes?: Socioemotional Support and Gaze Behaviors in Mixed Groups of Humans and Robots. In Proceedings of the 2018 ACM/IEEE International Conference on Human-Robot Interaction, Chicago, IL, USA, 5–8 March 2018; pp. 279–288. [Google Scholar]
- Torres, O.E.; Cassell, J.; Prevost, S. Modeling Gaze Behavior as a Function of Discourse Structure. In Proceedings of the First International Workshop on Human-Computer Conversation, Bellagio, Italy, 14–16 July 1997. [Google Scholar]
- Löffler, D.; Schmidt, N.; Tscharn, R. Multimodal Expression of Artificial Emotion in Social Robots Using Color, Motion and Sound. In Proceedings of the 2018 ACM/IEEE International Conference on Human-Robot Interaction, Chicago, IL, USA, 5–8 March 2018; pp. 334–343. [Google Scholar]
- Song, S.; Yamada, S. Expressing Emotions through Color, Sound, and Vibration with an Appearance-Constrained Social Robot. In Proceedings of the 2017 ACM/IEEE International Conference on Human-Robot Interaction, Vienna, Austria, 6–9 March 2017; pp. 2–11. [Google Scholar]
- Terada, K.; Yamauchi, A.; Ito, A. Artificial emotion expression for a robot by dynamic color change. In Proceedings of the 21st IEEE International Symposium on Robot and Human Interactive Communication, Paris, France, 9–13 September 2012; pp. 314–321. [Google Scholar]
- Funakoshi, K.; Kobayashi, K.; Nakano, M.; Yamada, S.; Kitamura, Y.; Tsujino, H. Smoothing human-robot speech interactions by using a blinking-light as subtle expression. In Proceedings of the 10th international conference on Multimodal interfaces, Chania, Crete, Greece, 20–22 October 2008; pp. 293–296. [Google Scholar]
- Seo, Y.S.; Huh, J.H. Automatic Emotion-Based Music Classification for Supporting Intelligent IoT Applications. Electronics 2019, 8, 164. [Google Scholar] [CrossRef]
- Picard, R.W. Affective Computing; The MIT Press: London, UK, 2000. [Google Scholar]
- Dryer, D.C. Getting personal with computers: How to design personalities for agents. Appl. Artif. Intell. 1999, 13, 273–295. [Google Scholar] [CrossRef]
- Isbister, K.; Nass, C. Consistency of personality in interactive characters: Verbal cues, non-verbal cues, and user characteristics. Int. J. Hum. Comput. Stud. 2000, 53, 251–267. [Google Scholar] [CrossRef]
- Nass, C.; Youngme, M.; Fogg, B.J.; Reeves, B.; Dryer, D.C. Can computers personalities be human personalities? Int. J. Hum. Comput. Stud. 1995, 43, 223–239. [Google Scholar] [CrossRef]
- Breazeal, C. Social interactions in HRI: The robot view. IEEE Trans. Syst. Man Cybern. Part. C Appl. Rev. 2004, 34, 181–186. [Google Scholar] [CrossRef]
- Lee, K.M.; Peng, W.; Jin, S.A.; Yan, C. Can robots manifest personality?: An empirical test of personality recognition, social responses, and social presence in human-robot interaction. J. Commun. 2006, 56, 754–772. [Google Scholar] [CrossRef]
- Moon, Y.; Nass, C. How “real” are computer personalities? Psychological responses to personality types in human-computer interaction. Commun. Res. 1996, 23, 651–674. [Google Scholar] [CrossRef]
- Wang, R.; Harari, G.; Hao, P.; Zhou, X.; Campbell, A.T. SmartGPA: How Smartphones Can Assess and Predict Academic Performance of College Students. In Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing, Osaka, Japan, 7–11 September 2015; pp. 295–306. [Google Scholar]
- Hancock, J.T.; Dunham, P.J. Impression Formation in Computer - Mediated Communication Revisited. Communic. Res. 2001, 28, 325–347. [Google Scholar] [CrossRef]
- Bailenson, J.N.; Yee, N.; Brave, S.; Merget, D.; Koslow, D. Virtual Interpersonal Touch: Expressing and Recognizing Emotions Through Haptic Devices. Hum. Comput. Interact. 2007, 22, 325–353. [Google Scholar]
- Andrist, S.; Mutlu, B.; Tapus, A. Look Like Me: Matching Robot Personality via Gaze to Increase Motivation. ACM Conf. Hum. Factors Comput. Syst. 2015, 2, 3603–3612. [Google Scholar]
- Sokolova, M.; Fernández-Caballero, A. A Review on the Role of Color and Light in Affective Computing. Appl. Sci. 2015, 5, 275–293. [Google Scholar] [CrossRef]
- Apple, W.; Streeter, L.A.; Krauss, R.M. Effects of pitch and speech rate on personal attributions. J. Pers. Soc. Psychol. 1979, 37, 715–727. [Google Scholar] [CrossRef]
- Barrick, M.R.; Mount, M.K. The big five personality dimensions and job performance: A meta-analysis. Pers. Psychol. 1991, 44, 1–26. [Google Scholar] [CrossRef]
- Digman, J.M. Personality Structure: Emergence of the Five-Factor Model. Annu. Rev. Psychol. 1990, 41, 417–440. [Google Scholar] [CrossRef]
- Tapus, A.; Matarić, M.J. User personality matching with a hands-off robot for post-stroke rehabilitation therapy. Springer Tracts Adv. Robot. 2008, 39, 165–175. [Google Scholar]
- Lee, S. Expressing the personalitis of the conversational agents with visual feedback and verbal cues. Master’s Thesis, Seoul National University, Seoul, South Korea, 2019; pp. 1–104. [Google Scholar]
- Joosse, M.; Lohse, M.; Perez, J.G.; Evers, V. What you do is who you are: The role of task context in perceived social robot personality. In Proceedings of the IEEE International Conference on Robotics and Automation, Karlsruhe, Germany, 6–10 May 2013; pp. 2134–2139. [Google Scholar]
- Rammstedt, B.; John, O.P. Measuring personality in one minute or less: A 10-item short version of the Big Five Inventory in English and German. J. Res. Pers. 2007, 41, 203–212. [Google Scholar] [CrossRef]
- Murray, D.C.; Deabler, H.L. Colors and mood-tones. J. Appl. Psychol. 1957, 41, 279–283. [Google Scholar] [CrossRef]
- Warner, S. On the relation of color and personality. J. Proj. Tech. Pers. Assess. 1966, 30, 512–524. [Google Scholar]
- Karwoski, T.F.; Odbert, H.S. Color music. Psychol. Monogr. 1938, 50, i-60. [Google Scholar] [CrossRef]
- Naz, K.; Helen, H. Color-emotion associations: Past experinece and personal preference. AIC Color. Paint. Interim Meet. Int. Color. Assoc. 2004, 5, 31. [Google Scholar]
- Bricks, M. Mental hygiene value of children’s art work. Am. J. orthopsychiat 1994, 14, 136–146. [Google Scholar] [CrossRef]
- Costa, P.T.; McCrae, R.R. The Revised NEO Personality Inventory (NEO-PI-R); The SAGE Handbook of Personality Theory and Assessment: Thousand Oaks, CA, USA, 2008; pp. 179–198. [Google Scholar]
- Niculescu, A.; Van Dijk, B.; Nijholt, A.; See, S.L. The influence of voice pitch on the evaluation of a social robot receptionist. In Proceedings of the International Conference on User Science and Engineering (i-USEr ), Shah Alam, Selangor, Malaysia, 29 November–1 December. 2011; pp. 18–23. [Google Scholar]
- Hyde, J.; Carter, E.J.; Kiesler, S.; Hodgins, J.K. Using an Interactive Animated Avatar’s Facial Expressiveness to Increase Persuasiveness and Socialness. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, Seoul, Republic of Korea, 18–23 April 2015; pp. 1719–1728. [Google Scholar]
- Schaie, W. The color pyramid test: A nonverbal technique for personality assessment. Psychol. Bull. 1963, 60, 530–547. [Google Scholar] [CrossRef]
- Byrne, A.; Hilbert, D.R. Color realism and color science. Behav. Brain Sci. 2003, 26, 3–21. [Google Scholar] [CrossRef]
- Lenclos, J.-P.; Lenclos, D. Colors of the World: The Geography of Color; WW Norton & Company: New York, NY, USA, 2004. [Google Scholar]
- Brave, S.; Nass, C. Emotion in Human—Computer Interaction; The Human-Computer Interaction Handbook: Hillsdale, NJ, USA, 2003; pp. 81–96. [Google Scholar]
- McCrae, R.R.; Costa, P.T. Personality Trait Structure as a Human Universal. Am. Psychol. 1997, 52, 509–516. [Google Scholar] [CrossRef]
- Chang, R.C.S.; Lu, H.P.; Yang, P. Stereotypes or golden rules? Exploring likable voice traits of social robots as active aging companions for tech-savvy baby boomers in Taiwan. Comput. Human Behav. 2018, 84, 194–210. [Google Scholar] [CrossRef]
Figure 1. Telepresence robot used in the entire study.
Figure 2. Visual feedback 1–8 used in the study.
Figure 3. Partially captured sequence of motions of visual feedback. All visual feedback moved with the same sequence.
Figure 4. Average mean values of conversational agent’s perceived personalities depending on different levels of motions of visual feedback.
Table 1. Number of participants depending on conditions.
|Conditions||Number of Participants||Total|
Table 2. Details of experimental materials for visual feedback. Five visual feedback were designed with different motions and colors.
|Visual Feedback.||Motions||Motion Speed||Colors||Number of Colors|
|1||Moderate||0.25||Red + Yellow||2|
|3||Fast||0.35||Dark Blue + Yellow + Red||3|
|4||Moderate||0.25||Green + Blue||2|
|8||Fast||0.35||Dark Blue + Yellow + Purple||3|
Table 3. Experimental materials of verbal cues.
|1: Introduction||3 Pitch (high/moderate/low) x 2 Gender|
|2: Weather conditions||2 Wordiness (high/low) x 2 Gender|
|3: Give directions||3 Speed (high/moderate/low) x 2 Gender|
|4: Ask a name||2 Emotionality (high/low) x 2 Gender|
|5: Scheduling||2 Questioning styles (high/low) x 2 Gender|
Table 4. Results of visual feedback. Mean values of conversational agent’s perceived personalities depending on different visual feedback.
|Mean||SD||df||F Value||P Value|
Note: * p < 0.05, ** p < 0.01, *** p < 0.001.
Table 5. Average mean values of conversational agent’s perceived personalities depending on different levels of motions of visual feedback.
|Personality Traits||Motion speed (Visual Feedback)|
|Fast (3, 8)||Moderate (1, 4, 6)||Slow (2, 5)|
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).