Gaze Aversion in Conversational Settings: An Investigation Based on Mock Job Interview

Acarturk, Cengiz; Indurkya, Bipin; Nawrocki, Piotr; Sniezynski, Bartlomiej; Jarosz, Mateusz; Usal, Kerem Alp

doi:10.16910/jemr.14.1.1

Open AccessArticle

Gaze Aversion in Conversational Settings: An Investigation Based on Mock Job Interview

by

Cengiz Acarturk

¹,

Bipin Indurkya

²,

Piotr Nawrocki

³,

Bartlomiej Sniezynski

³,

Mateusz Jarosz

³ and

Kerem Alp Usal

¹

Department of Cognitive Science, Middle East Technical University, Turkey

²

Department of Cognitive Science, Jagiellonian University, Poland

³

Institute of Computer Science, AGH University of Science and Technology,, Poland

J. Eye Mov. Res. 2021, 14(1), 1-13; https://doi.org/10.16910/jemr.14.1.1

Submission received: 1 December 2020 / Published: 19 May 2021

Download

Browse Figures

Versions Notes

Abstract

:

We report the results of an empirical study on gaze aversion during dyadic human-to-human conversation in an interview setting. To address various methodological challenges in assessing gaze-to-face contact, we followed an approach where the experiment was conducted twice, each time with a different set of interviewees. In one of them the interviewer’s gaze was tracked with an eye tracker, and in the other the interviewee’s gaze was tracked. The gaze sequences obtained in both experiments were analyzed and modeled as Discrete-Time Markov Chains. The results show that the interviewer made more frequent and longer gaze contacts compared to the interviewee. Also, the interviewer made mostly diagonal gaze aversions, whereas the interviewee made sideways aversions (left or right). We discuss the relevance of this research for Human-Robot Interaction, and discuss some future research problems.

Keywords:

Gaze in conversation; gaze aversion; face contact; Human Human Interaction

Introduction

Non-verbal aspects of face-to-face communication have been researched intensively over the past several decades (e.g., Mehrabian, 1967; Archer, 1977; Kendon, 2004; Levinson, 2014). In contrast to verbal communication, non-verbal communication covers a broad spectrum of modalities, including speech intonation, facial expressions, eye movements, and gestures.

Eye-gaze contact is a primary, non-verbal modality that has multiple functions, including initiating a conversation, regulating turn-taking, as well as signaling a topic change, among other adjustments of the roles of the interlocutors in a conversation (Schegloff, 1968; Duncan, 1972; Sacks, 1974; Goodwin, 1981; Grosz, 1986; Quek, 2000, see Hessels, 2020, for a review). The role of gaze is of particular importance in that it has its own "language" (Baron-Cohen, 1997) as a means for establishing joint attention (Kleinke, 1986; Emery, 2000) by capturing the interlocutor's attentional states through gaze directions. The partner’s eye-gaze direction results in a tendency for shifting attention, a phenomenon that has been considered reflexive and modulated by a set of social factors related to the observer (Doherty-Sneddon, 2005, see Bonaccio, 2016; Dalmaso, Castelli, & Galfano, 2020, for reviews).

Gaze aversion plays a complementary role in eye-gaze contact by regulating conversation. In terms of its specific functions in conversation, in contrast to gaze contact, gaze aversion indicates a tendency to avoid face-to-face interaction (e.g., Mason, 2005; Hietanen, 2008).

Investigations of gaze patterns in conversational settings have been conducted over the last decade by recording eye movements from a single participant or dual eye tracking, mostly in laboratory settings (Broz, 2012; Pfeiffer, 2013; Ho, 2015; Rogers, 2018, Macdonald, 2018). One factor that disrupts the dual eye tracking methodology's ecological validity is the presence of head-mounted, wearable eye trackers during the conversational interaction between the interlocutors. Recent studies on the influence of the presence of glasses reveal a positive effect, in terms of attraction and attributing intelligence, on the perception of the interlocutor (see Gueguen, 2017 for a review); however, there have been divergent findings in the past (e.g., Hasart, 1993; Lundberg, 1994). Moreover, the presence of a wearable eye tracker is an artifact that may influence visual attention and the behavior of the interlocutor (Risko & Kingstone, 2011). At the moment, there seems to be no technical workaround to resolve this issue. In the present study, we tried to minimize this problem by conducting two experiments that differed in terms of which interlocutor wore the eye tracker.

An important aspect of the research on gaze aversion is the influence of eye contact on attention. Some studies (e.g., Dalmaso, Alessi, Castelli, & Galfano, 2020) have examined whether making eye contact with a person leads to a more substantial gaze-mediated orienting effect. In other studies (e.g., Dalmaso, Castelli, & Galfano, 2017; Dalmaso, Castelli, Scatturin, & Galfano, 2017), attentional boosts due to eye contact episodes (in the context of various saccadic parameters) have been investigated. Research is also being carried out on the influence of direct gaze on orienting to faces. For example, Gobel, Kim, and Richardson (2015) emphasized that the gaze can signal and perceive and that this dual function needs to be considered in face perception. Mares, Smith, Johnson, and Senju (2016) showed that detecting a direct gaze facilitates quick orienting to faces.

Another challenge in investigating gaze patterns in face-to-face settings is the potential influence of the interlocutors' social roles. The previous research reveals a tendency to gaze more frequently at higher-ranking interlocutors. Moreover, there is an influence of personality traits on gaze patterns in conversational settings (Cheng, 2013; Tang, 2015; Maran, 2019). In the two experiments we report in this article, we aimed to control these factors by fixing the conversational setting to be a job interview. Here, an interviewer asks a set of predefined questions to an interviewee and evaluates the response of each question using a tablet, or paper and a pencil.

This approach has been employed in many past studies to investigate conversation characteristics in human-human and human-robot settings (Villani, 2012; Andrist, 2013; Andrist, 2014; Admoni, 2017). More generally, the investigation of design parameters for gaze has gained importance in Human-Robot Interaction and Social Robotics over the last decade. Recent studies show that a robots' gaze behavior significantly impacts the interaction quality, as specified in terms of a set of dimensions, such as engagement (Pitsch, et al. 2009) and persuasiveness (Fischer, et al., 2020). In the context of our research, the issues related to HRI during social interaction are important, including the use of humanoid robots (which ensures reasonable experimental control) for targeting the mechanisms of joint attention (Chevalier, Kompatsiari, Ciardo, & Wykowska, 2020). We believe that the findings of the present study and the implemented methodology can contribute to the domain of HRI and Social Robotics in multiple ways, including designing gaze aversion sequences and their timings during interactions (Jensen, et al., 2017).

In summary, the goal of the present study is to investigate the gaze patterns of the interlocutors in a specific dyadic conversation setting. In the next section, we present the study's methodology in state of the art and the challenges addressed in the present study. We then report two experiments where the participants conducted a conversation in a job interview setting. The discussion of the results and the conclusions are presented in the final section.

Methodology

For investigating gaze patterns of interviewers and interviewees during a job-interview setting, a one-on-one interviewer-interviewee setting was established. In the first experiment (Experiment 1), we focused on the interviewers' gaze patterns while an interviewee answered a set of job interview questions. In the second experiment (Experiment 2), using the same setting as Experiment 1 but working with a different group of participants, we focused on the interviewees' gaze patterns. In both the experiments, the interviewer read the interview questions from an A4 size paper and did not say anything else. Therefore, no gaze contact was observed during the interviewer's periods of speech. The eye-tracking data was analyzed for gaze contact and aversion only for the periods in which the interviewee spoke.

The interview questions were translated from Villani (2012) into the native language of the participants. We utilized a Tobii Glasses 2 wearable, binocular eye tracking device with a sampling frequency of 100 Hz and a camera with a field of view of 82 degrees in the horizontal direction and 52 degrees in the vertical direction. The reported accuracy of the device is 0.62 degrees, and it has 0.05 degrees of precision for gaze angles less than 15 degrees, and 3.05 degrees of accuracy, and 0.62 degrees of precision for gaze angles larger than 15 degrees, as published by Tobii Pro technical specifications (Tobii, 2018). The device was used first for tracking the interviewers' gaze on the interviewees (Experiment 1) and then for tracking the interviewees' gaze on the interviewer (Experiment 2). The interview was conducted in a controlled environment with artificial lighting (160 lux illumination). The interlocutors sat about 1 to 1.5 meters from one another. The participants were asked to complete a demographic questionnaire and a TIPI test (Gosling, 2003) for their subjective personality ratings.

The eye-tracking data were analyzed by the vendor software (Tobii Pro Lab) using the I-VT filter for automatic mapping the participants' gaze fixations with the following parameters: A window size of moving median of 3 gaze samples, 20 ms window length, 100 degrees/s threshold value, adjacent fixations not merged, fixations below 100 ms discarded.

After extracting the fixation data, four annotators annotated video recordings with gaze overlay to mark each fixation state. The fixation state was either 0, showing gaze aversion by the interlocutor or 1 for gaze contact. For this, we mapped fixations to predefined Areas of Interest (AOIs) in the environment. The specification of an AOI is relatively straightforward in a stationary setting. However, automated analysis of eye-movement data in a dynamic setting is a well-known challenge in eye-tracking methodology (Holmqvist, 2011). Solutions such as object recognition techniques for image processing have been proposed (Munn, 2008; DeBeugher, 2014; Stuart, 2017). Though the accuracy of these solutions has been increasing with the advancement of novel algorithms, each solution has its limitations. Therefore, we chose to conduct manual annotation of gaze locations to avoid errors of automated analysis. The annotators discussed and decided whether a fixation was a gaze contact or gaze aversion to provide high-quality annotations for each fixation. We identified a 100-pixel threshold for annotating the aversion fixations: a fixation was identified as an aversion fixation if it had a saccade of 100 or more pixels away from the previous fixation on an AOI.

For the annotations, the target interlocutor's entire face was labeled as "gaze contact," and the rest of the area as "gaze aversion." We should emphasize that the term "gaze contact" is a misnomer because the annotators labeled a fixation as "gaze contact" when the gaze-tracked interlocutor was gazing at the face of the other interlocutor but not precisely at the eyes. In particular, we annotated one side of the interaction. It means that the "gaze contact" in the present study included both proper "eye contact" (where people are looking at each other) and periods where the eye-tracked person is looking at the other person while they are looking away.

Given that the eye-tracking output of a specific gaze point in space is a byproduct of visualization rather than being a veridical gaze location with high precision and accuracy, and given the low functional precision of wearable eye trackers in today's technology, it is not possible to make clear discrimination between gaze contact and face contact at a distance of 1.5 meters between the interlocutors. Nevertheless, "face contact" is not a frequently used term. Therefore, we will use the term "gaze contact" or simply "contact" to mean "gaze-to-face contact" when one of the interlocutors is gazing at the other's face. Future development of high-precision wearable eye trackers may allow the measurement of gaze-to-gaze contact. However, this limitation does not apply to the content of "gaze aversion" in a similar way because the gaze-aversion region is much larger than the region occupied by the interlocutor's face. Accordingly, the use of the term "gaze aversion" is mostly correct in the eye-tracking literature.

Following the manual annotation of the fixation data, we calculated gaze durations (the sum of fixation durations) and average fixation coordinates of gaze aversions and contacts. These calculations let us know the duration of contact or aversions, frequently observed contact-aversion patterns, and the relative frequency of gaze shifts between contact and aversions. We also utilized all participants' TIPI results over five personality measures (Extroversion, Agreeableness, Conscientiousness, Emotional Stability, Openness) in our analysis.

The next step in the data analysis was evaluating gaze durations and gaze locations utilizing direct measures, such as determining the mean gaze duration on a predefined AOI and derived measures, such as computing various dispersion measures in gaze-location patterns. In particular, Hidden Markov Models (HMMs) have been employed for scanpath modeling and classification since the last decade (Coutrot, 2018). For instance, the EyePatterns software (West, 2006) conducts a pattern search to identify fixation sequences represented by strings. In this approach, a pattern is defined as a subsequence of an entire sequence of fixations. It may be expanded (with all fixations) or collapsed (repetitions of the same areas of interest are replaced by a single instance). The sequences are then analyzed to detect similar patterns using the Levenshtein distance or the Needleman-Wunsch algorithm. They can also be visualized as hierarchical trees of clusters of similar sequences.

Another gaze-sequence analysis methodology was proposed by (Steichen, 2014), which employed differential sequence mining. The analysis was used to characterize gaze behaviors specific for individual users and particular tasks and create user-adaptive information visualizations. They also employed the EyePatterns software to find the number of occurrences of patterns. Their experimental results revealed that the frequencies of selected patterns might be used to distinguish users with low or high perceptual abilities and complicated or straightforward task types.

Hassani (2015, 2019) proposed a BFSPMiner pattern searching algorithm for searching patterns in a data stream without splitting it into batches. The algorithm was tested on gaze data that was recorded together with keystrokes and mouse movements. In the present study, we use the SPAM algorithm for searching gaze location sequences, which does not need splitting data into batches. In Burch (2017), data mining was used for eye movement visualization by generating association rules from the data. Two types of rules were considered: set-based rules, in which time was not taken into account, and sequence-based rules, in which premises and consequences were time-ordered sequences. The sequence-based rules were hierarchically ordered in a prefix hierarchy. In the present study, we apply a methodology similar to both Hassani (2015, 2019) and (Burch, 2017). However, we use sequences instead of rules. In the following sections, we report the experiments and their results.

Experiment 1

Participants

Fourteen interviewees (six females and eight males) and two interviewers (one male and one female) participated in this experiment. They were given a small monetary compensation (approximately 5 EUR) for their participation. All the participants (both the interviewers and the interviewees) were either undergraduate or graduate students. The interviewees' ages varied between 19 and 39 (M = 26.3, SD = 4.78). The female interviewer was 24 years old, and the male interviewer was 34 years old. One interviewer was assigned to each of the fourteen participants in a gender-balanced way so that there were four male and three female participants per interviewer.

Procedure

A laboratory assistant welcomed the interviewees and seated them in an empty laboratory room where they filled out the consent form, the demographic data form, and a ten-item questionnaire of personality inventory (TIPI, Gosling, 2003). The interviewer's eye tracker calibration was completed in the interview room. Then the interviewer welcomed the interviewee and began the interview by asking a series of questions. The interviewer waited for each answer to be finished before asking the next question. The order of the interview questions was counterbalanced in blocks between the interviewees to minimize the effect of the independent variables of exposure (e.g., a full-time window of contact between the interlocutors) on the interviewers. The eye tracker recorded the interviewer’s gaze direction.

Analysis of Fixations

The fixations were annotated with the Tobii Pro Lab analysis software. The resulting data was then analyzed by focusing on three dependent variables: The percentage of aversion durations in total duration of fixations, the percentage of gaze aversion variables in total fixation variables, and the gaze direction of the aversions. The data from three participants (three males) was removed from the analyses because their gaze contact duration and gaze contact percentage values were further than two standard derivations when z-scores were calculated.

Independent t-tests were conducted to analyze the potential influence of interviewee gender and TIPI measures on the percentage of aversion durations in all the fixation durations. This analysis revealed non-significant differences, showing that the two interviewee groups (one interviewed with the male interviewer and the other interviewed with the female interviewer) were similar in their TIPI scores (Extroversion: t(9) = -1.374, p = .203, r = .42, Agreeableness: t(9) = .064, p = .951, r = .02, Conscientiousness: t(9) = .029, p = .977, r = .01, Emotional Stability: t(9) = .484, p = .64, r = .16, Openness: t(9) = -.888, p = .398, r = .28). The gender of the participant did not have a significant effect on the percentage of aversion durations (t(9) = -0.781, p = .455, r = .25), and the percentage of gaze aversion fixations in all the fixations (t(9) = -0.655, p = .529, r = .21).

The majority of all the fixations (approximately 96%) were contact fixations, whereas the remaining 4% were aversion fixations. Generally, the interviewers conducted sustained face contacts (M = 3890 ms, SD = 4935 ms) when consecutive fixations on the face were counted as a single contact fixation. The average duration of single fixations during these contacts was M = 213 ms (SD = 212 ms). On the other hand, the interviewers' aversions were shorter than their contact durations (M = 286 ms, SD = 300 ms) when consecutive fixations off the face were counted as a single aversion fixation. The average duration of single fixations during aversions was M = 181 ms (SD = 114 ms). Paired samples t-test revealed that there was a significant difference between the contact and the aversion durations if consecutive fixations were accumulated, t(10) = 4.696, p = .001, r = .83), but not if the fixations were considered independently, t(10) = 1.317, p = .217, r = .39).

We compared the two interviewers in further analysis, as they had different TIPI scores: seven for the male interviewer and 3.5 for the female interviewer (out of 7). For this, we used weighted averages of fixation parameters in percentages to normalize for differences caused by the interview lengths. These weighted averages of aversion fixations were calculated as the ratios of the total number of aversion fixations relative to the total number of all fixations (henceforth, aversion fixation counts). Similarly, the weighted averages of aversion fixation durations were calculated as the ratios of the sum of aversion gaze durations relative to the sum of all gaze durations (henceforth, aversion gaze durations). A comparison between the two interviewers revealed significant differences both in their aversion fixation counts, t(9) = 2.287, p = .048, r = .61), and aversion gaze durations, t(9) = 2.708, p = .024, r = .67. The male interviewer made fewer aversions (M = 2.3%, SD = 1.7) during the interview compared to the female interviewer (M = 5.1%, SD = 2.2). The aversion gaze durations were in line with the aversion fixation counts. The female interviewer had a higher aversion duration ratio (M = 4.8%, SD = 1.7) in comparison to the male, extrovert interviewer (M = 2%, SD = 1.7). In the next section, we report a gaze sequence analysis aimed at detecting frequently observed contact-aversion patterns.

Analysis of Gaze Sequences

We categorized the interviewers' gaze directions using labels that put consecutive fixations in the same direction as members of a single category: C defines a sequence of gaze contact fixations, whereas Down defines a sequence of downward aversion fixations. Similarly, other aversion fixations are labeled as follows: Up - upward, Right - rightward, Left - leftward, and Diag - upper or lower diagonal.

Thus, every sequence S_j = (S₁, S₂, …, S_nj) where S_i ∈ Cat = {C, Down, Up, Right, Left, Diag} represents a set of consequent gaze fixations in one of the labeled directions, which occur in duration t_i.

To find frequently occurring patterns, we applied the SPAM algorithm. In particular, we used the Sequential PAttern Mining (SPAM) algorithm (Ayres et al., 2002) for searching gaze location sequences, which does not need splitting data into batches. The implementation of the algorithm can be found in the SPMF data mining library (The SPMF data mining library - http://www.philippe-fournierviger.com/spmf/index.php.). The results showed that the top five frequently observed patterns in this experiment are as follows:

C Down C (85.7%)
C Diag C (78.6%)
C Diag C Down C (64.3%)
C Down C Down C(57.1%)
C Down C Diag C (57.1%)

The numbers in the parentheses show the support value, which is defined as the percentage of gaze sequences (number of interviews) in which the pattern is found. We consider only closed patterns, which means that the sub-patterns included in more extended patterns with the same or higher support are omitted.

These findings show that the interviewers conducted downward aversions and diagonal aversions between two contact gazes. However, downward aversions are expected in the analysis, as the interviewers read questions from the paper. Although we removed question-reading segments at the beginning of each trial from our data analysis, it is still likely that the paper's presence might have resulted in a tendency towards downward fixations. Therefore, this result may be an artifact of the experimental setting. However, diagonal patterns were not observed in the interviewee aversions, as presented in the next section.

Stochastic models can be used to analyze gaze sequences. Discrete-Time Markov chain (DTMC, Papoulis, 1984) describes a sequence of gaze directions assuming that each direction's probability depends only on the previous direction (Markov property assumption). This chain may be defined as a sequence of random variables X1, X2, X3, … such that

Pr(X_t+1 = x_t+1 | X₁ = x₁, X₂ = x₂, …, X_t = x_t) = Pr(X_t+1 = x_t+1 | X_t=x_t)

The set of outcomes xi of the random variables is called the state space of the chain. In our case, the state space is finite and represents possible gaze directions. We also assume that the Markov chain is time-homogeneous, which means that the probabilities are independent of t. Therefore, they can be represented by a Markov chain transition matrix. For n directions, the matrix is n x n. The value p_ij is the probability of changing direction from x_i to x_j. The values in each row of the matrix should add up to 1.

The transition matrix for aversions in Experiment 1, with the horizontal and the vertical directions grouped, is shown in Table 1. Table 2 shows this matrix distinguishing all aversion directions. A graphical representation of these models is shown in Figure 1 and Figure 2. These figures reveal a 52% chance of downward aversion after contact, a 25% chance of diagonal aversion, an 11% chance of upward, and a 5% of horizontal aversions. Accordingly, an aversion was always followed by a contact. That is an interesting finding since it is not usually expected to have no direct transitions between two aversions (e.g., from left to right). We will discuss this finding in the Discussion section.

Experiment 2

This experiment's setting was the same as in Experiment 1, except that we recorded the interviewees' eye movements instead of the interviewers. The male interviewer of Experiment 1 was recruited to conduct all the interviews. The stimuli (the interview questions), demographic and consent forms, and TIPI (Gosling, 2003) questionnaire were the same as in Experiment 1.

Participants and Procedure

Sixteen interviewees (seven females; age range 23 to 31, M = 26.4, SD = 2.85) participated in the experiment. They were offered monetary compensation (approximately 5 EUR) for their participation. All the participants were graduate or undergraduate students.

The interviewee wore the eye tracker, and its calibration was conducted before the interview in another room with similar light conditions. After the calibration, the interviewee was brought into the interview room, where the interviewer was waiting to begin the interview. The analysis procedure was also the same as for Experiment 1.

Analysis of Fixations

The data from two male and two female participants were removed from the analyses by applying the same outlier criterion as in Experiment 1.

The majority of all the fixations (73%) were contact fixations, though to a lesser extent than the interviewer's fixations in Experiment 1 (96%). Accordingly, the remaining 27% were considered as aversion fixations. The average duration of face contacts of the interviewees (assuming consecutive fixations on the face as a single contact fixation), the average duration of single fixations (during the contacts), and the interviewees' mean aversion durations are presented in Table 3.

Paired samples t-tests revealed a significant difference between contact and aversion durations in single fixations, t(11) = 4.5, p = .001, r = .81, and in values where consecutive fixations of the same type are added over, t(11) = 3.805, p = .003, r = .75.

For assessing the influence of personality (TIPI scores) and gender on gaze parameters in Experiment 2, the data from the interviewees was divided into two groups based on their extroversion scores. Eight participants were extroverts (TIPI score in extraversion > 4) and four participants were introverts (TIPI score < 4). This distribution was similar to that obtained in Experiment 1. The TIPI measures of the interviewees did not differ between the genders (Extroversion: F(1, 11) = .457, p = .514; Agreeableness: F(1, 11) = .455, p = .515; Conscientiousness: F(1, 11) = .657, p = .454; Emotional Stability: F(1, 11) = 0, p = .99; Openness: F(1, 11) = 1.534, p = .244). The other TIPI measures were similar between the two extroversion groups (Agreeableness: F(1, 11) = 3.936, p = .075; Conscientiousness: F(1, 11) = 1.008, p = .339; Emotional Stability: F(1, 11) = .154, p = .703; Openness: F(1, 11) = .105, p = .753).

Following the same analysis procedure as Experiment 1, weighted aversion fixation counts and aversion gaze durations were compared. However, this analysis did not yield any difference between the two extroversion groups (Counts: t(10) = -0.763, p = .463, r = .23; Durations: t(10) = .902, p = .388, r = .27). A comparison of male and female interviewees did not give any statistically significant results either (Counts: t(10) = -1.273, p = .232, r = .37; Durations: t(10) = -.454, p = .66, r = .14). Pearson Correlation analysis was conducted to assess the relation between participants’ extroversion scores and gaze behavior, which did not show a significant correlation between extroversion scores and percentage of aversion counts (r = -.07, p = .84), and between extroversion scores and percentage of aversion durations (r = -.37, p = .23).

Analysis of Gaze Sequences

The same methodology as Experiment 1 was employed for the analysis of gaze sequences in Experiment 2. The top five frequently observed patterns in Experiment 2 are listed below. The numbers in the parentheses show the support value:

C Right C (61.5%)
C Right Left (53.8%)
Right C Right (53.8%)
C Right C Right (38.5%)
C Diag Right (38.5%)

The transition matrices for Experiment 2 are presented in Table 4 and Table 5. Figure 3 and Figure 4 show a graphical representation of the model. As expected, the interviewees did not look down so often as the interviewers because they did not have to read the questions. From the latter figure, we can see a 39% chance of right aversion after a contact, an 18-20% chance of downward, diagonal and left aversions, and only a 3% chance of upward aversion. In contrast to Experiment 1, an aversion may be followed by another aversion. After a downward aversion, left and right aversions are more probable than a contact. After a diagonal aversion, down, left, and right aversions are more probable than a contact. Another observation is that the probability distributions of the interviewee's gaze aversions (Experiment 2) are more uniformly distributed than the interviewer's gaze aversions (Experiment 1).

Discussion

When both experiments are considered, the interviewer had a lower percentage than the interviewee for both the number and the duration of aversion fixations in all the fixations during the interview. This difference was independent of the gender or extroversion score of the interviewee, which suggests that it arises from the social context of the interaction. Both the experiments were conducted in a job interview setting. In this context, the interviewer was in a socially dominant position as she or he evaluated the interviewee. It was the social dominance, but the speech duration also influenced interlocutors' gaze patterns. So, it is expected that there would be more frequent and more prolonged gaze contact fixations from the interviewer than from the interviewee.

We also found that the interviewer generally made diagonal gaze aversions, whereas the interviewees made left or right aversions with shorter durations. These findings are compatible with the previous research, which shows that gaze aversion is a robust measure in various social situations, including anxiety (Wieser, 2009) and social phobia (Moukheiber, 2010). Nevertheless, the differences in the direction of aversions between the interviewers and the interviewees are influenced by the experimental setting and the socially asymmetric situation due to the interviewer's dominance. Therefore, more studies are needed to explore these issues before generalizing these results.

A major application of this research is to design robots that show a natural gaze-aversion behavior when interacting with a human. This topic is gaining importance in Human Robot Interaction (HRI) research (see Chevalier, Kompatsiari, Ciardo, & Wykowska, 2020, for a review). The methodology of designing a robot's gaze movement based on human-to-human gaze contact studies has been followed in the past (Andrist, 2013, 2014; Lehmann, 2017; Zhang, 2017). We have followed the methodology of tracking the gaze of one interlocutor at a time. We plan to evaluate our model by incorporating the interviewer's gaze contact pattern into a humanoid robot and repeat the experiment by putting the robot in the interviewer's role.

In recent years, researchers have been experimenting with deploying robots to conduct interviews (Tengai, 2020) as they do not have unconscious biases. This approach has its problems, for a robot may show data-dependent biases. However, in this scenario, if the interviewer robot shows a natural gaze-aversion behavior based on models such as the one developed in this paper, the interviewee may feel more comfortable.

From a broader perspective, we have explored another methodology for studying gaze-contact behavior in a specific setting. As noted by Andrist (2013), gaze aversion in a conversation can serve several diverse functions, such as taking turns, indicating cognitive effort, regulating intimacy. Even during the same interview, different instances of gaze aversion may play different roles. A more comprehensive model needs to take all these factors into account. Thus, it may be helpful to collect data using different methodologies, including the one proposed here.

Needless to say, our study suffers from some limitations due to the complexity of data collection in dynamic situations and our assumptions regarding data annotation and analyses. For instance, one-sided recording and data annotation constitute a limitation as the moments of "contact" in our study lumped together proper gaze contact, true face contact, and the moments where the eye-tracked interlocutor gazes the partner's face while the partner was gazing at another location. This limitation can be partially resolved by employing dual eye tracking. This solution is not perfect as dual eye tracking leads to other behavioral changes in the partners. Future research should address this problem by employing less intrusive and higher resolution eye tracking. Secondly, the results of Experiment 1 revealed a lack of direct transitions between two gaze aversions, always showing an intervening look on the interviewee's face. This finding may result from how the interviewers interacted with the interviewees, or it may be due to the low resolution of the eye tracking equipment. A higher resolution eye tracker would provide more detailed information to address this problem. Another limitation was the limited number (two) of interviewers. Though we chose one male and one female interviewer to counterbalance gender, this is insufficient to make claims about the influence of the interviewer's gender and personality on the interaction. Future research should address likely influences of such personal traits.

The annotation of fixations (as aversion fixations and contact fixations) requires operational assumptions about their definitions. In the present study, we used a 100-pixel threshold, taking into account the low resolution of the eye tracking equipment. We assumed that a fixation is an aversion fixation if it is 100 pixels away or further from the previous fixation (cf. saccadic amplitude). Fixations below the 100-pixel threshold comprised approximately a quarter of all the fixations in our data (25.5% in Experiment 1, 29.8% in Experiment 2). Further research is needed to validate this characterization of aversion fixations.

Ethics and Conflict of Interest

The author(s) declare(s) that the contents of the article are in agreement with the ethics described in http://biblio.unibe.ch/portale/elibrary/BOP/jemr/ethics.html and that there is no conflict of interest regarding the publication of this paper.

Acknowledgments

This research was supported in part by a grant by TUBITAK (The Scientific and Technological Research Council of Turkey) Grant No 117E021 and NCBR (The National Center for Research and Development of Poland) Grant No POLTUR2/5/2018. The authors thank Efecan Yılmaz, Gizem Özen, Elif Esmer, Gamze Eşdur, Bengisu Çağıltay and Faruk Büyüktekin for their technical support.

References

Admoni, H., and B. Scassellati. 2017. Social eye gaze in human-robot interaction: A review. Journal of Human Robot Interaction 6, 1: 25–63. [Google Scholar] [CrossRef]
Andrist, S., B. Mutlu, and M. Gleicher. 2013. Edited by R. Aylett, B. Krenn, C. Pelachaud and H. Shimodaira. Conversational gaze aversion for virtual agents. In Intelligent Virtual Agents. Springer, Berlin Heidelberg: pp. 249–262. [Google Scholar] [CrossRef]
Andrist, S., X. Z. Tan, M. Gleicher, and B. Mutlu. 2014. Conversational gaze aversion for humanlike robots. In Proceedings of the 9^th ACM/IEEE International Conference on Human-Robot Interaction. pp. 25–32. [Google Scholar] [CrossRef]
Archer, D., and R. M. Akert. 1977. Words and everything else: Verbal and nonverbal cues in social interpretation. Journal of Personality and Social Psychology 35: 443–449. [Google Scholar] [CrossRef]
Ayres, J., J. Flannick, J. Gehrke, and T. Yiu. 2002. Sequential pattern mining using a bitmap representation. In Proceedings of the Eighth ACMSIGKDD International Conference on Knowledge Discovery and DataMining. ACM, New York, NY, USA: pp. 429–435. [Google Scholar] [CrossRef]
Baron-Cohen, S., S. Wheelwright, and T. Jolliffe. 1997. Is there a "language of the eyes"? evidence from normal adults, and adults with autism or asperger syndrome. Visual Cognition 4, 3: 311–331. [Google Scholar] [CrossRef]
Bonaccio, S., J. OReilly, S. O'Sullivan, and F. Chiocchio. 2016. Non-verbal behavior and communication in the workplace: A review and an agenda for research. Journal of Management 42: 1044–1074. [Google Scholar] [CrossRef]
Broz, F., H. Lehmann, C. L. Nehaniv, and K. Dautenhahn. 2012. Mutual gaze, personality, and familiarity: Dual eye-tracking during conversation. In Proceedings of IEEE RO-MAN, the 21^st IEEE International Symposium on Robot and Human Interactive Communication. IEEE: pp. 858–864. [Google Scholar] [CrossRef]
Burch, M. 2017. Mining and visualizing eye movement data. In Proceedings of SIGGRAPH Asia Symposium on Visualization. New York, NY, USA. ACM: pp. 1–8. [Google Scholar] [CrossRef]
Cheng, J., J. Tracy, T. Foulsham, A. Kingstone, and J. Henrich. 2013. Two ways to the top: Evidence that dominance and prestige are distinct yet viable avenues to social rank and influence. Journal of personality and social psychology 104, 1: 103–125. [Google Scholar] [CrossRef]
Chevalier, P., K. Kompatsiari, F. Ciardo, and A. Wykowska. 2020. Examining joint attention with the use of humanoid robots-A new approach to study fundamental mechanisms of social cognition. Psychonomic Bulletin & Review 27: 217–236. [Google Scholar] [CrossRef]
Coutrot, A., J. H. wen Hsiao, and A. B. Chan. 2018. Scanpath modeling and classification with Hidden Markov Models. Behavior Research Methods 50: 362–379. [Google Scholar] [CrossRef]
Dalmaso, M., G. Alessi, L. Castelli, and G. Galfano. 2020. Eye contact boosts the reflexive component of overt gaze following. Scientific Reports 10: 4777. [Google Scholar] [CrossRef]
Dalmaso, M., L. Castelli, and G. Galfano. 2017. Attention holding elicited by direct-gaze faces is reflected in saccadic peak velocity. Experimental Brain Research 235: 3319–3332. [Google Scholar] [CrossRef]
Dalmaso, M., L. Castelli, and G. Galfano. 2020. Social modulators of gaze-mediated orienting of attention: A review. Psychonomic Bulletin & Review 27: 833–855. [Google Scholar] [CrossRef]
Dalmaso, M., L. Castelli, P. Scatturin, and G. Galfano. 2017. Trajectories of social vision: Eye contact increases saccadic curvature. Visual Cognition 25: 358–365. [Google Scholar] [CrossRef]
De Beugher, S., G. Brône, and T. Goedemé. 2014. Automatic analysis of in-the-wild mobile eye-tracking experiments using object, face and person detection. In Proceedings of International Conference on Computer Vision Theory and Applications (VISAPP). Vol. 1, pp. 625–633. [Google Scholar] [CrossRef]
Doherty-Sneddon, G., and F. Phelps. 2005. Gaze aversion: a response to cognitive or social difficulty? Memory & Cognition 33, 4: 727–733. [Google Scholar] [CrossRef]
Duncan, S. 1972. Some signals and rules for taking speaking turns in conversations. Journal of Personality and Social Psychology 23, 2: 283–292. [Google Scholar] [CrossRef]
Emery, N. 2000. The eyes have it: the neuroethology, function and evolution of social gaze. Neuroscience & Biobehavioral Reviews 24, 6: 581–604. [Google Scholar] [CrossRef]
Fischer, K., R. M. Langedijk, L. D. Nissen, E. R. Ramirez, and O. Palinko. 2020. Gaze-Speech Coordination Influences the Persuasiveness of Human-Robot Dialog in the Wild. In Proceedings of the International Conference on Social Robotics. Springer, Cham: pp. 157–169. [Google Scholar] [CrossRef]
Gobel, M. S., H. S. Kim, and D. C. Richardson. 2015. The Dual Function of Social Gaze. Cognition 136: 359–364. [Google Scholar] [CrossRef] [PubMed]
Goodwin, C. 1981. Conversational Organization: Interaction between Speakers and Hearers. Academic Press, London. [Google Scholar] [CrossRef]
Gosling, S. D., P. J. Rentfrow, and W. B. Swann. 2003. A very brief measure of the big-five personality domains. Journal of Research in Personality 37, 6: 504–528. [Google Scholar] [CrossRef]
Grosz, B. J., and C. L. Sidner. 1986. Attention, intentions, and the structure of discourse. Computational Linguistics 12, 3: 175–204. [Google Scholar]
Guéguen, N., and A. Martin. 2017. Effect of interviewer's eyeglasses on compliance with a face-to-face survey request and perception of the interviewer. Field Methods 29, 3: 194–204. [Google Scholar] [CrossRef]
Hasart, J. K., and K. L. Hutchinson. 1993. The effects of eyeglasses on perceptions of interpersonal attraction. Journal of Social Behavior and Personality 8, 3: 521–528. [Google Scholar]
Hassani, M., C. Beecks, D. Töws, T. Serbina, M. Haberstroh, P. Niemietz, S. Jeschke, S. Neumann, and T. Seidl. 2015. Edited by T. Seidl, N. Ritter, H. Schöning, K.-U. Sattler, T. Härder, S. Friedrich and W. Wingerath. Sequential pattern mining of multimodal streams in the humanities. In Datenbanksysteme für Business, Technologie und Web. Bonn. Gesellschaft für Informatik e.V: pp. 683–686. [Google Scholar]
Hassani, M., D. Töws, A. Cuzzocrea, and T. Seidl. 2019. Bfspminer: an effective and efficient batch-free algorithm for mining sequential patterns over data streams. International Journal of Data Science and Analytics 8, 3: 223–239. [Google Scholar] [CrossRef]
Hessels, R. S. 2020. How does gaze to faces support face-to-face interaction? A review and perspective. Psychonomic Bulletin & Review 27: 856–881. [Google Scholar] [CrossRef]
Hietanen, J. K., J. M. Leppänen, M. J. Peltola, K. Linna-Aho, and H. J. Ruuhiala. 2008. Seeing direct and averted gaze activates the approach-avoidance motivational brain systems. Neuropsychologia 46, 9: 2423–2430. [Google Scholar] [CrossRef] [PubMed]
Ho, S., T. Foulsham, and A. Kingstone. 2015. Speaking and listening with the eyes: Gaze signaling during dyadic interactions. PloS one 10, 8: 1–18. [Google Scholar] [CrossRef]
Holmqvist, K., M. Nyström, R. Andersson, R. Dewhurst, H. Jarodzka, and J. van de Weijer. 2011. Eye Tracking: A comprehensive guide to methods and measures. OUP, Oxford. [Google Scholar]
Jensen, L. C., K. Fischer, S. D. Suvei, and L. Bodenhagen. 2017. Timing of multimodal robot behaviors during human-robot collaboration. In Proceedings of the 26^th IEEE International Symposium on Robot and Human Interactive Communication, RO-MAN. IEEE: pp. 1061–1066. [Google Scholar] [CrossRef]
Kendon, A. 2004. Gesture: Visible Action as Utterance. Gesture: Visible Action as Utterance. Cambridge University Press. [Google Scholar] [CrossRef]
Kleinke, C. 1986. Gaze and eye contact: A research review. Psychological Bulletin 100, 1: 78–100. [Google Scholar] [CrossRef] [PubMed]
Lehmann, H., I. Keller, R. Ahmadzadeh, and F. Broz. 2017. Edited by A. Kheddar, E. Yoshida, S. S. Ge, K. Suzuki, J.-J. Cabibihan, F. Eyssel and H. He. Naturalistic conversational gaze control for humanoid robots-a first step. In Social Robotics. Cham. Springer: pp. 526–535. [Google Scholar] [CrossRef]
Levinson, S., and J. Holler. 2014. The origin of human multimodal communication. Philosophical Transactions of the Royal Society of London Series B, Biological Sciences 369, 1651: 20130302. [Google Scholar] [CrossRef]
Lundberg, J. K., and E. P. Sheehan. 1994. The effects of glasses and weight on perceptions of attractiveness and intelligence. Journal of Social Behavior and Personality 9, 4: 753–760. [Google Scholar]
Macdonald, R., and B. Tatler. 2018. Gaze in a real-world social interaction: A dual eye-tracking study. Quarterly Journal of Experimental Psychology 71: 2162–2173. [Google Scholar] [CrossRef]
Maran, T., M. Furtner, S. Liegl, S. Kraus, and P. Sachse. 2019. In the eye of a leader: Eye-directed gazing shapes perceptions of leaders' charisma. The Leadership Quarterly 30: 1–13. [Google Scholar] [CrossRef]
Mares, I., M. L. Smith, M. H. Johnson, and A. Senju. 2016. Direct gaze facilitates rapid orienting to faces: Evidence from express saccades and saccadic potentials. Biological Psychology 121: 84–90. [Google Scholar] [CrossRef]
Mason, M. F., E. P. Tatkow, and C. N. Macrae. 2005. The look of love: gaze shifts and person perception. Psychological Science 16, 3: 236–239. [Google Scholar] [CrossRef]
Mehrabian, A., and M. Wiener. 1967. Decoding of inconsistent communications. Journal of Personality and Social Psychology 6, 1: 109–114. [Google Scholar] [CrossRef] [PubMed]
Moukheiber, A., G. Rautureau, F. Perez-Diaz, R. Soussignan, S. Dubal, R. Jouvent, and A. Pelissolo. 2010. Gaze avoidance in social phobia: Objective measure and correlates. Behaviour Research and Therapy 48, 2: 147–151. [Google Scholar] [CrossRef] [PubMed]
Munn, S. M., L. Stefano, and J. B. Pelz. 2008. Fixation-identification in dynamic scenes: Comparing an automated algorithm to manual coding. In Proceedings of the 5^th Symposium on Applied Perception in Graphics and Visualization. ACM, New York, NY, USA: pp. 33–42. [Google Scholar] [CrossRef]
Papoulis, A. 1984. Edited by A. Papoulis and S. U. Pillai. Brownian movement and Markoff processes. In Probability, random variables, and stochastic processes. Tata McGraw-Hill Education: pp. 515–553. [Google Scholar] [CrossRef]
Pfeiffer, U. J., K. Vogeley, and L. Schilbach. 2013. From gaze cueing to dual eye-tracking: Novel approaches to investigate the neural correlates of gaze in social interaction. Neuroscience and Biobehavioral Reviews 37, 2: 2516–2528. [Google Scholar] [CrossRef]
Pitsch, K., H. Kuzuoka, Y. Suzuki, L. Sussenbach, P. Luff, and C. Heath. 2009. "The first five seconds": Contingent stepwise entry into an interaction as a means to secure sustained engagement in HRI. In Proceedings of the 18^th IEEE International Symposium on Robot and Human Interactive Communication, RO-MAN. IEEE: pp. 985–991. [Google Scholar] [CrossRef]
Quek, F., D. McNeill, R. Bryll, C. Kirbas, H. Arslan, K. E. McCullough, N. Furuyama, and R. Ansari. 2000. Gesture, speech, and gaze cues for discourse segmentation. In Proceedings IEEE Conference on Computer Vision and Pattern Recognition. Vol. 2, pp. 247–254. [Google Scholar] [CrossRef]
Rogers, S., C. Speelman, O. Guidetti, and M. Longmuir. 2018. Using dual eye tracking to uncover personal gaze patterns during social interaction. Scientific Reports 8, 1: 1–9. [Google Scholar] [CrossRef]
Risko, E.F., and A. Kingstone. 2011. Eyes wide shut: implied social presence, eye tracking and attention. Attention, Perception, & Psychophysics 73, 2: 291–296. [Google Scholar] [CrossRef]
Sacks, H., E. A. Schegloff, and G. Jefferson. 1974. A simplest systematics for the organization of turn-taking for conversation. Language 50, 4: 696–735. [Google Scholar] [CrossRef]
Schegloff, E. A. 1968. Sequencing in conversational openings. American Anthropologist 70, 6: 1075–1095. [Google Scholar] [CrossRef]
Steichen, B., M. M. A. Wu, D. Toker, C. Conati, and G. Carenini. 2014. Edited by V. Dimitrova, T. Kuflik, D. Chin, F. Ricci, P. Dolog and G.-J. Houben. Te, Te, Hi, Hi: Eye gaze sequence analysis for informing user-adaptive information visualizations. In User Modeling, Adaptation, and Personalization. Cham. Springer: pp. 183–194. [Google Scholar] [CrossRef]
Stuart, S., D. Hunt, J. Nell, A. Godfrey, J. M. Hausdorff, L. Rochester, and L. Alcock. 2017. Do you see what I see? Mobile eye-tracker contextual analysis and inter-rater reliability. Medical & Biological Engineering & Computing 56, 2: 289–296. [Google Scholar] [CrossRef]
Tang, D., and B. J. Schmeichel. 2015. Look me in the eye: Manipulated eye gaze affects dominance mindsets. Journal of Nonverbal Behavior 39, 2: 181–194. [Google Scholar] [CrossRef]
Tengai. 2020. Groundbreaking validation study: Tengai measures personality traits without human interference. Technical Report. Available online: https://www.tengaiunbiased.com/validation-study-robot-interview (accessed on 28 November 2020).
Tobii. 2018. Eye tracker data quality report: Accuracy, precision and detected gaze under optimal conditions—controlled environment, Technical Report. Tobii Pro Spectrum.
Villani, D., C. Repetto, P. Cipresso, and G. Riva. 2012. May I experience more presence in doing the same thing in virtual reality than in reality? An answer from a simulated job interview. Interacting with Computers 24, 4: 265–272. [Google Scholar] [CrossRef]
West, J. M., A. R. Haake, E. P. Rozanski, and K. S. Karn. 2006. Eyepatterns: Software for identifying patterns and similarities across fixation sequences. In Proceedings of the 2006 Symposium on Eye Tracking Research & Applications, ETRA '06. ACM, New York, NY, USA: pp. 149–154. [Google Scholar] [CrossRef]
Wieser, M. J., P. Pauli, G. W. Alpers, and A. Mühlberger. 2009. Is eye to eye contact really threatening and avoided in social anxiety? An eye-tracking and psychophysiology study. Journal of Anxiety Disorders 23, 1: 93–103. [Google Scholar] [CrossRef] [PubMed]
Zhang, Y., J. Beskow, and H. Kjellström. 2017. Edited by A. Kheddar, E. Yoshida, S. S. Ge, K. Suzuki, J.-J. Cabibihan, F. Eyssel and H. He. Look but don't stare: Mutual gaze interaction in social robots. In Social Robotics. Cham. Springer: pp. 556–566. [Google Scholar] [CrossRef]

Figure 1. Markov model for Experiment 1 with all aversion directions distinguished.

Figure 2. Markov model for Experiment 1 with horizontal and vertical directions aggregated.

Figure 3. Markov model for Experiment 2 with all aversion directions distinguished.

Figure 4. Markov model for Experiment 2 with horizontal and vertical directions aggregated.

Table 1. Chain transition matrix for Experiment 1 with horizontal and vertical directions grouped.

Table 2. Chain transition matrix for Experiment 1 with all aversion directions distinguished.

Table 3. Eye movement parameters in Experiment 1 and Experiment 2.

Table 4. Chain transition matrix for Experiment 2 with horizontal and vertical directions grouped.

Table 5. Chain transition matrix for Experiment 2 with all aversion directions distinguished.

Share and Cite

MDPI and ACS Style

Acarturk, C.; Indurkya, B.; Nawrocki, P.; Sniezynski, B.; Jarosz, M.; Usal, K.A. Gaze Aversion in Conversational Settings: An Investigation Based on Mock Job Interview. J. Eye Mov. Res. 2021, 14, 1-13. https://doi.org/10.16910/jemr.14.1.1

AMA Style

Acarturk C, Indurkya B, Nawrocki P, Sniezynski B, Jarosz M, Usal KA. Gaze Aversion in Conversational Settings: An Investigation Based on Mock Job Interview. Journal of Eye Movement Research. 2021; 14(1):1-13. https://doi.org/10.16910/jemr.14.1.1

Chicago/Turabian Style

Acarturk, Cengiz, Bipin Indurkya, Piotr Nawrocki, Bartlomiej Sniezynski, Mateusz Jarosz, and Kerem Alp Usal. 2021. "Gaze Aversion in Conversational Settings: An Investigation Based on Mock Job Interview" Journal of Eye Movement Research 14, no. 1: 1-13. https://doi.org/10.16910/jemr.14.1.1

APA Style

Acarturk, C., Indurkya, B., Nawrocki, P., Sniezynski, B., Jarosz, M., & Usal, K. A. (2021). Gaze Aversion in Conversational Settings: An Investigation Based on Mock Job Interview. Journal of Eye Movement Research, 14(1), 1-13. https://doi.org/10.16910/jemr.14.1.1

Article Menu

Gaze Aversion in Conversational Settings: An Investigation Based on Mock Job Interview

Abstract

Introduction

Methodology

Experiment 1

Participants

Procedure

Analysis of Fixations

Analysis of Gaze Sequences

Experiment 2

Participants and Procedure

Analysis of Fixations

Analysis of Gaze Sequences

Discussion

Ethics and Conflict of Interest

Acknowledgments

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI