Methods
Participants
Twelve observers (8 males, one of whom was left-handed; age range 23-36 years) participated in this study.
Equipment and Software
An EyeLink 1000 System (SR Research, Canada) was used for eye tracking. The participants’ left eyes were tracked at 500 Hz. We used the manufacturer’s software for calibration, validation, drift-correction, and determining saccade and fixation parameters. Participants had their viewing position stabilized by a head and chin rest.
Stimuli were presented on a 41 by 31 cm CRT monitor (LaCie, Paris, France). Experiments were conducted in a room that was dark, except for the illumination provided by the screen.
Stimuli
Figure 1.
Example textures used in the experiment. Both computer-generated and natural textures were used, and the set included both colored and grayscale pictures.
Texture images had a size of 1280 by 1024 pixels. A texture growth algorithm (Ashikhmin, 2001) was applied to textures that were originally smaller than this; this algorithm does not significantly affect feature values (Jacobs, Haak et al., 2010). When presented, textures filled the entire screen, subtending a visual angle of 39 by 29 degrees. A total of 292 stimuli were presented. We aimed to use a diverse set of texture stimuli: the set consisted of textures taken from a standard collection (Brodatz, 1966), supplemented with textures gathered from diverse internet sources (the set is available on request). Both colored and grayscale pictures were included.
Figure 1 shows thumbnails of some of the textures used in the experiments.
Procedure
After signing an informed consent form, participants completed four blocks of trials in which they judged visual textures. A block typically lasted 15-20 minutes, and blocks were separated by substantial pauses. No more than two blocks were run on a single day. Textures were judged for beauty (B) and for roughness (R) in separate blocks of trials; the order of blocks was either R-B-B-R or B-R-R-B. A single block consisted of 146 trials.
Before starting a block of trials, the participant was instructed to judge the visual textures either on beauty or on roughness. A few practice trials were performed before the first block of trials. Following calibration of the eye tracker, the experiment was started. The participant self-initiated a trial by pressing the spacebar on the computer's keyboard. A trial started with the presentation of a fixation dot, which was used to drift-correct the eye-tracker calibration. Next, the fixation dot disappeared and a visual texture was presented for 3500 ms. After the texture disappeared, the fixation dot reappeared, and participants indicated their judgment by pressing one of the keys on the numerical part of the keyboard: key 1 indicated "least beautiful" or "least rough", and key 9 indicated "most beautiful" or "most rough". There was no time limit for making a judgment. The spacebar could be pressed to indicate the absence of a judgment. To indicate that the response had been registered, the fixation dot increased in size. Following this, the participant initiated the next trial.
Analysis
Saccades were detected using the standard settings of the EyeLink software: a saccade was defined by a velocity of at least 30°/s and an acceleration of at least 8000°/s², each lasting at least 4 ms. Fixations and saccades starting before the onset, or ending after the offset, of the texture stimulus were excluded from the analysis.
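For readers wishing to apply comparable criteria to raw gaze samples offline, a minimal Python sketch is given below. The thresholds come from the text above; the function name, the sample-wise implementation, and the 500 Hz sampling assumption are ours, and the EyeLink parser's internal algorithm may differ in detail.

```python
import numpy as np

def detect_saccades(x, y, fs=500.0, v_thresh=30.0, a_thresh=8000.0,
                    min_dur=0.004):
    """Flag samples as saccadic when velocity and acceleration both exceed
    the thresholds for at least min_dur seconds.

    x, y : gaze position in degrees, sampled at fs Hz (500 Hz here).
    Returns a list of (start_index, end_index) sample pairs.
    """
    dt = 1.0 / fs
    vx, vy = np.gradient(x, dt), np.gradient(y, dt)
    speed = np.hypot(vx, vy)                 # deg/s
    accel = np.abs(np.gradient(speed, dt))   # deg/s^2
    candidate = (speed >= v_thresh) & (accel >= a_thresh)

    saccades, start = [], None
    for i, flag in enumerate(candidate):
        if flag and start is None:
            start = i
        elif not flag and start is not None:
            if (i - start) * dt >= min_dur:
                saccades.append((start, i))
            start = None
    if start is not None and (len(candidate) - start) * dt >= min_dur:
        saccades.append((start, len(candidate)))
    return saccades
```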
Per participant and judgment condition, for each texture, the number of fixations, blinks, and saccades was counted, and the average fixation duration, cumulative saccade distance (over saccades within a trial), average saccade velocity, average saccade duration, and average pupil size (during the time that stimuli were presented) were computed. In addition, the total fixation duration over all trials in a condition was determined. Next, differences in these parameters between the two conditions were expressed as a contrast, according to the formula:

P = (V(beauty) − V(roughness)) / (V(beauty) + V(roughness))    (1)

where V(condition) represents the value of the parameter under study. P can in principle range from −1 to +1. For each participant, the resulting values were averaged over all stimuli. Kolmogorov-Smirnov tests were performed to check for deviations from normality of the distributions of these parameters. Deviations from 0 were tested statistically over participants, using one-sample, two-tailed t-tests in SPSS, for all parameters. No correction for multiple testing was applied, as the parameters are interrelated and our conclusions are based on the differences as a group rather than on the individual parameters.
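As an illustration, the contrast and group-level test can be reproduced as in the following sketch. The numbers below are invented for illustration only; in the actual analysis, contrasts were computed per stimulus and then averaged per participant.

```python
import numpy as np
from scipy import stats

def contrast(v_beauty, v_roughness):
    """Formula (1): contrast between the two judgment conditions;
    ranges from -1 to +1 when both values are positive."""
    return (v_beauty - v_roughness) / (v_beauty + v_roughness)

# Hypothetical per-participant values (e.g., mean fixation duration in ms,
# already averaged over stimuli), one entry per participant.
fix_dur_beauty = np.array([310, 295, 342, 288, 305, 330,
                           312, 299, 321, 308, 315, 290.0])
fix_dur_rough = np.array([335, 310, 360, 305, 318, 352,
                          330, 311, 340, 322, 333, 301.0])

p = contrast(fix_dur_beauty, fix_dur_rough)   # one contrast per participant
t, pval = stats.ttest_1samp(p, 0.0)           # two-tailed test against 0
print(f"mean contrast = {p.mean():.3f}, t(11) = {t:.2f}, p = {pval:.3f}")
```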
Fixation contrast maps (Wooding, 2002) were computed as follows. First, for each individual participant and within each judgment condition, we computed for each stimulus the total amount of time spent fixating each screen location. These values were spatially smoothed with a Gaussian kernel with a standard deviation of 30 pixels. Next, per stimulus, a fixation contrast map was computed according to Formula (1), where V(condition) represents the fixation map for a particular judgment condition. The obtained contrast maps were then averaged over stimuli and, finally, over participants, and a familywise-error-corrected nonparametric test (Nichols & Holmes, 2002) was applied to test for differences in maximum dwell time over the screen.
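A minimal sketch of the map construction follows, assuming per-stimulus lists of (x, y, duration) fixations; the helper names and the epsilon guard against division by zero in never-fixated regions are our own additions.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

H, W = 1024, 1280  # screen resolution in pixels (rows, columns)

def dwell_map(fixations, sigma=30.0):
    """Accumulate fixation durations on a pixel grid and smooth with a
    Gaussian kernel (sigma in pixels).

    fixations : iterable of (x, y, duration) tuples for one stimulus.
    """
    m = np.zeros((H, W))
    for x, y, dur in fixations:
        iy = min(max(int(round(y)), 0), H - 1)
        ix = min(max(int(round(x)), 0), W - 1)
        m[iy, ix] += dur
    return gaussian_filter(m, sigma=sigma)

def contrast_map(map_beauty, map_rough, eps=1e-12):
    """Formula (1) applied pixelwise; eps keeps never-fixated regions at
    (close to) zero instead of dividing by zero."""
    return (map_beauty - map_rough) / (map_beauty + map_rough + eps)
```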
Correlations between beauty and roughness ratings on the one hand, and pupil size on the other, were computed. This was done both for the pupil size during the judgment itself (explicit effects of beauty and roughness on pupil size) and for the pupil size during the other judgment (to assess implicit effects of beauty and roughness on pupil size). These correlations were computed per participant. Then, after checking for normality using Kolmogorov-Smirnov tests, one-sample t-tests were conducted to ascertain whether these correlations differed significantly from 0 over participants. We also report some correlations between feature values, ratings, and pupil sizes. Given the number of features we computed, correction for multiple comparisons would leave none of these relations significant; hence, we report them without statistical testing, for confirmation in future experiments.
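A sketch of this two-level analysis (per-participant correlation, then a one-sample t-test of the correlations over participants); the function name and the list-of-arrays input layout are our own:

```python
import numpy as np
from scipy import stats

def group_level_correlation(pupil_by_pp, rating_by_pp):
    """Per-participant Pearson correlation between pupil size and rating,
    followed by a one-sample t-test of those correlations against 0.

    pupil_by_pp, rating_by_pp : lists of per-participant arrays, one
    value per stimulus.
    """
    rs = [stats.pearsonr(p, r)[0]
          for p, r in zip(pupil_by_pp, rating_by_pp)]
    t, pval = stats.ttest_1samp(rs, 0.0)
    return np.mean(rs), t, pval
```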
Feature Computation
The texture features most strongly associated with beauty and roughness decisions were determined as follows. First, we computed the correlations between a set of 188 computationally derived features on the one hand, and the beauty and roughness ratings on the other. The computed features are based on Gray-Level Co-occurrence Matrices (Haralick, Shanmugam, & Dinstein, 1973), a set of features related to psychological judgments (Tamura, Mori, & Yamawaki, 1978), Neighborhood Gray-Tone Difference Matrices (Amadasun & King, 1989), the Fourier spectrum (Tuceryan & Jain, 1998), Gabor energy features (Kim, Park, & Koo, 2005), and features expressing the presence of colors, brightness, and saturation (Datta, Joshi, Li, & Wang, 2006).
The Tamura features are based on psychological evaluations, and comprise coarseness, contrast, directionality, line-likeness, regularity, and roughness. The Gray-Level Co-occurrence Matrices indicate how often particular gray levels co-occur at a certain distance; for our purposes, we computed them for distances of 1, 2, 4, and 8 pixels. These matrices are used to compute statistical properties such as entropy, energy, and homogeneity. A Neighborhood Gray-Tone Difference Matrix is a vector containing, for each gray level, a sum of the differences in gray tone with all the surrounding pixels, for each pixel with that gray tone. The size of the neighborhood is variable, and we computed matrices for neighborhoods of 3 by 3 and 5 by 5 pixels. Based on these matrices, the features coarseness, contrast, busyness, complexity, and strength are computed.
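A sketch of the co-occurrence computation using scikit-image; the choice of angles and the entropy definition below are our assumptions, as the original feature set may have used different settings:

```python
import numpy as np
from skimage.feature import graycomatrix, graycoprops

def glcm_features(gray_img, distances=(1, 2, 4, 8)):
    """Gray-Level Co-occurrence Matrix features at several distances.

    gray_img : 2-D uint8 array (gray levels 0-255).
    """
    glcm = graycomatrix(gray_img, distances=list(distances),
                        angles=[0, np.pi / 2], levels=256,
                        symmetric=True, normed=True)
    feats = {prop: graycoprops(glcm, prop).mean()
             for prop in ("contrast", "homogeneity", "energy")}
    # graycoprops does not provide entropy; sum -p*log2(p) over all
    # distance/angle matrices here.
    p = glcm[glcm > 0]
    feats["entropy"] = -np.sum(p * np.log2(p))
    return feats
```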
Fourier features are based on the spatial frequencies in the brightness variations. The extent to which a certain spatial frequency is present is expressed as its energy or power. First, a two-dimensional image is transformed into the frequency domain using the fast Fourier transform to obtain the Fourier spectrum. Each component of the spectrum is represented by a complex number that describes a frequency in the two-dimensional image by means of amplitude and phase. The coordinates of a component in the spectrum determine that frequency's wavelength and direction. The lowest spatial frequency (the uniform signal, i.e., the average brightness) is represented at the center of the spectrum, while high frequencies are found toward the edges. The average energy of circular bands around the center is computed for different radii. Also, the energy of wedges with their apex at the center is computed, yielding a measure of the orientation of the image. In this way, 12 circular-band energy features and 24 wedge energy features were computed, each reflecting the presence of information at a different spatial frequency (circular rings) or orientation (wedges). In addition, a number of features summarizing their distribution were computed.
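A sketch of the ring and wedge energies; the band edges, the use of power rather than amplitude, and the equal-width partitioning are our assumptions:

```python
import numpy as np

def fourier_ring_wedge_energy(gray_img, n_rings=12, n_wedges=24):
    """Average Fourier power in concentric rings (spatial frequency bands)
    and in wedges (orientation bands) around the DC component."""
    spec = np.fft.fftshift(np.fft.fft2(gray_img))
    power = np.abs(spec) ** 2
    h, w = power.shape
    y, x = np.mgrid[0:h, 0:w]
    radius = np.hypot(y - h // 2, x - w // 2)
    angle = np.arctan2(y - h // 2, x - w // 2) % np.pi  # orientation, 0..pi

    ring_idx = np.minimum((radius / radius.max() * n_rings).astype(int),
                          n_rings - 1)
    wedge_idx = np.minimum((angle / np.pi * n_wedges).astype(int),
                           n_wedges - 1)
    rings = np.array([power[ring_idx == i].mean() for i in range(n_rings)])
    wedges = np.array([power[wedge_idx == i].mean() for i in range(n_wedges)])
    return rings, wedges
```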
Like Fourier features, Gabor features capture the spatial frequencies in pictures, but they preserve some spatial information. The human visual system is known to contain cells that work as Gabor filters. Gabor 'energy' over the entire texture was computed for four spatial frequencies and six orientations. Average saturation and intensity were based on HSV color space. The presence of the colors red, green, yellow, cyan, blue, and magenta was computed by partitioning HSV color space into six sectors and counting the relative frequency of pixels within each sector. The sector frequency was normalized to the average image value and saturation. As we have extensively described the relations between visual texture features and judgments elsewhere (Thumfart et al., 2008; Jacobs, Haak et al., 2010), we here restrict ourselves to reporting the features correlating most strongly with beauty and roughness judgments.
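As an illustration of the color-sector features, the following sketch counts the relative frequency of pixels per 60-degree hue sector; the normalization to average value and saturation described above is omitted for brevity, and the sector boundaries are our assumption:

```python
import numpy as np
from skimage.color import rgb2hsv

SECTORS = ("red", "yellow", "green", "cyan", "blue", "magenta")

def color_sector_frequencies(rgb_img):
    """Relative frequency of pixels whose hue falls in each of six
    60-degree sectors of HSV hue space (red centered on hue 0)."""
    hue = rgb2hsv(rgb_img)[..., 0]            # hue in [0, 1)
    # Shift by half a sector so the red sector straddles hue = 0.
    sector = np.floor(((hue + 1 / 12) % 1.0) * 6).astype(int)
    counts = np.bincount(sector.ravel(), minlength=6)
    return dict(zip(SECTORS, counts / sector.size))
```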
Features around fixations
To support our idea that differences in eye guidance between the two judgments reflect differences in feature-based attention, we extracted patches around fixations from the textures and computed the 188 feature values for each of these patches. We then computed average feature values per stimulus, over all fixations. Next, we computed for each stimulus a difference in feature values according to Formula (1), but with absolute values in the denominator, to deal with negative values. Then we computed the means of the 188 feature differences over subjects. We sorted the resulting averages (by absolute value) and compared them to permuted data. Our 1000 permutations consisted of switching the feature values between roughness and beauty fixations for randomly selected textures. We then followed the same procedure as with the real data, yielding 1000 examples of ordered features. We then determined up to which rank the real data stayed in the top 5% of the permuted data. Those features were considered significantly different between judgments, and the direction of each difference was determined.
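The following sketch gives our reading of the permutation procedure; the array layout, the random condition-swapping per texture, and the assumption of nonzero feature values are ours:

```python
import numpy as np

rng = np.random.default_rng(0)

def sorted_mean_abs_diffs(feat_beauty, feat_rough):
    """feat_* : arrays of shape (n_subjects, n_textures, n_features) with
    per-stimulus average feature values around fixations. Returns the
    feature differences, sorted descending by absolute mean over
    subjects."""
    diff = (feat_beauty - feat_rough) / (np.abs(feat_beauty)
                                         + np.abs(feat_rough))
    per_subject = diff.mean(axis=1)             # (n_subjects, n_features)
    return np.sort(np.abs(per_subject.mean(axis=0)))[::-1]

def permutation_thresholds(feat_beauty, feat_rough, n_perm=1000):
    """95th percentile, per rank, of the sorted differences obtained by
    swapping the two conditions for randomly chosen textures."""
    n_tex = feat_beauty.shape[1]
    perms = []
    for _ in range(n_perm):
        swap = rng.random(n_tex) < 0.5          # textures to swap
        b, r = feat_beauty.copy(), feat_rough.copy()
        b[:, swap], r[:, swap] = feat_rough[:, swap], feat_beauty[:, swap]
        perms.append(sorted_mean_abs_diffs(b, r))
    return np.percentile(perms, 95, axis=0)
```

The real sorted differences are then compared rank by rank against these thresholds; features count as significant up to the first rank at which the real data drop out of the top 5% of the permuted data.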
Results
Observations
Participants rarely failed to respond, skipping the roughness judgment on 1.2% of stimuli and the beauty judgment on 1.6% of stimuli.
Observers actively explored the textures: on average, 8.0 fixations (and, as fixations alternate with saccades, a similar number of saccades) were made in the 3500 ms period that textures were presented. Over all participants, there were 26 trials in which no fixation (and hence no saccade, or at most one) fell completely within the stimulus presentation. Such trials did not contribute to average durations computed over all trials, or to derived measures, but they did contribute to the counts of saccades and fixations.
Frequency plots of fixation durations during beauty and roughness judgments are displayed in Figure 2. There are more short fixations (< 400 ms) during beauty judgments; for longer fixation durations, the numbers are similar for both judgment types. The distributions of fixation durations per observer were skewed to the right (peak shifted to the left). The most frequent fixation duration was between 200 and 300 ms, although one observer had most fixation durations at 700-800 ms (not shown). As others have pointed out for different experiments (Velichkovsky, Dornhoefer, Pannasch, & Unema, 2000; Pelz, Canosa, Lipps, Babcock, & Rao, 2003), fixation durations under 200 ms were not uncommon.
Figure 2.
Distribution of fixation durations for the beauty and roughness judgments, integrated over all participants.
Spatial distribution of gaze
Figure 3.
Dwell time contrast map. Red indicates locations on the screen where participants dwelled longer during beauty judgments, blue locations where participants dwelled longer during roughness judgments. Effects are non-significant (p = 0.12 for the maximum, and p = 0.89 for the minimum, FWE-corrected).
Figure 3 shows a map indicating the relative amount of time spent at each location for the two judgment conditions. During beauty judgments, participants on average spent about 8% more of their time (cumulative fixation duration) just above the center of the screen, while during roughness judgments they spent on average about 3% more of their time below the screen center. Although suggestive, these differences were not statistically significant (p = 0.12 for the maximum, and p = 0.89 for the minimum, FWE-corrected). There were few fixations in the periphery, resulting in 0% difference there between the judgments.
Eye movement parameters
Kolmogorov-Smirnov tests on the eye movement parameters, as computed using Formula (1), did not reveal significant deviations from normality (all p > .76).
Figure 4 shows the changes in eye-movement parameters. Average fixation duration was higher during the roughness judgments than during the beauty judgments (t(11) = -4.27, p = 0.001). Both the number of saccades (t(11) = 2.49, p = 0.03) and the distance covered by the saccades (cumulative saccade amplitude; p = 0.02) were significantly higher during the beauty judgments. This suggests that observers scanned more coarsely and more globally during beauty than during roughness judgments. There was no difference in average saccade duration (t(11) = .997, p = 0.35), and average saccade velocity was higher during beauty judgments (t(11) = 2.30, p = 0.04).
Figure 4.
Eye movement parameters. Increases during beauty compared to roughness judgments, expressed as a percentage of their average. Cumul. = cumulative, avg. = average.
Pupil size
Kolmogorov-Smirnov tests on the correlations between pupil size and ratings did not reveal any significant deviations from normality (all p > .8).
No differences were found between the judgment conditions in average pupil size (t(11) = 0.03, p = 0.98) or in the number of blinks (t(11) = 1.07, p = 0.31).
There was no correlation between pupil size and beauty rating, either during the explicit rating of beauty (r = 0.02, t(11) = 0, p = 1) or during the implicit rating of beauty (i.e., between pupil size during the roughness judgment and the beauty rating; r = -0.01, t(11) = .192, p = .851).
There was a correlation between pupil size and rated roughness, both during the (explicit) rating of roughness (r = 0.13, t(11) = 8.07, p < .001) and during the (implicit) rating of beauty (r = 0.11, t(11) = 5.83, p < .001).
Feature correlations
The features correlating most strongly with mean beauty ratings were average saturation (r(10) = 0.47), the yellowness of the texture (r(10) = 0.37), and a coarseness measure based on the Neighborhood Gray-Tone Difference Matrices (r(10) = 0.33).
The features correlating most strongly with roughness ratings were entropy measures (for a range of distances) based on the Gray-Level Co-occurrence Matrices (all correlations in the range 0.6 to 0.7). We also note a positive correlation between average pupil size and average saturation, during both beauty (r(10) = 0.24) and roughness judgments (r(10) = 0.22).
Fixations on different features
We found that the colors blue, magenta, red, and cyan had higher values around fixations during the beauty judgments than during the roughness judgments. More striking than the differences in average feature values between judgments were the standard deviations over subjects (Figure 5, top): the color features showed much higher standard deviations than the other features. Looking at the individual participants' averages (Figure 5, bottom), it appears that some participants looked more at all colors during the beauty judgment, while others looked at some colors at the expense of others. One participant, DG, looked less at some colors, possibly indicating that he was in fact judging for ugliness rather than beauty.
Figure 5.
Feature differences, computed according to Formula (1), between fixations during beauty and fixations during roughness. The top shows the standard deviation over subjects for all features. The color features are numbered 150-158 (highlighted in yellow). The bottom graph shows the feature differences per subject for the color features.
Discussion
We examined differences in eye movement parameters between beauty and roughness judgments of visual textures, because previous findings of differential engagement of the amygdala in these judgments suggested such a possibility.
We found that several eye-movement parameters differed between roughness and beauty judgments, even though identical stimuli were shown. As this is a task effect, it is a top-down effect. Although this classification does little more than rephrase the finding, it raises the possibility that other forms of top-down effects on eye movements exist. For example, mood might also influence the guidance of eye movements, and indeed such influences have already been reported (Wadlinger & Isaacowitz, 2006).
Although we demonstrated the presence of task effects on eye movements with our texture stimuli, it remains to be shown along which dimensions these tasks actually differ. We chose beauty and roughness judgments because we found that these loaded strongly on orthogonal dimensions in a judgment space derived from a range of judgments that people made about visual textures (Jacobs, Haak et al., 2010). We interpreted these dimensions as an evaluative dimension on the one hand, on which judgments such as beauty, elegance, warmth, and colorfulness loaded strongly, and a descriptive dimension on the other, with high loadings of roughness, age, and complexity. We chose judgments from these orthogonal dimensions to maximize the possibility of finding effects. Now that we have indeed found effects, these may arise from this distinction, but other differences between the tasks may also account for the findings. To confirm our idea that the evaluative-descriptive distinction is the relevant one, replications with other judgments, such as complexity (descriptive) and elegance, warmth, interestingness, or colorfulness (evaluative), would be in order. One can think of other differences between the tasks, such as difficulty of feature extraction, a possibility that we entertain below. Another possibility would be the implicit tactile nature of a roughness judgment, which likely requires a visuo-tactile transformation of the information. But even if such differences exist, they may generalize to all judgments differing along the evaluative-descriptive dimension.
As the differences in eye guidance between the two tasks were not related to differences in the spatial location of the relevant information, they provide strong evidence for non-spatial, possibly feature-based, differences in attention. In particular, the longer average fixation durations during the roughness judgments can readily be interpreted as reflecting differences in feature-based attention: a longer fixation during roughness judgments likely reflects the additional time needed to extract the relevant information. Longer fixation durations have previously been shown to be related to more difficult discrimination of the information at the fixated location (Hooge & Erkelens, 1996; Cornelissen, Bruin, & Kooijman, 2005), to more elements around a fixation location (Salthouse & Ellis, 1980), to searching for detailed information as compared to free viewing (Buswell, 1935), to time spent searching (Over, Hooge, Vlaskamp, & Erkelens, 2007), and to non-expertise (Antes, Chang, Lenzen, & Mullis, 1985). All these findings corroborate the notion that more difficult feature extraction leads to longer fixation times. Also, the nature of the information upon which the judgments are based suggests that simpler features are used for beauty (e.g., color information, a first-order feature) than for roughness (e.g., entropy information, a third-order feature) assessments. Shorter fixations and larger saccades have been associated with higher spatial frequencies (M. T. Groner, Groner, & von Mühlenen, 2008), suggesting that attention may have been directed at different spatial frequencies in our stimuli under the different task instructions. Closely related to our current findings, fixation durations are longer when attending to location than when attending to color (Hayhoe, Bensinger, & Ballard, 1998), again suggesting that color is a relatively easy feature to extract.
In the previous paragraph, we argue that our findings should be interpreted in terms of feature-based attention differing between two different judgments. We should point out, however, that it is also possible that beauty and roughness judgments are based on the same features, and that differences occur only in the processing subsequent to the extraction of the features. Longer fixations could then be the result of higher processing load during the judgment of roughness. However, pupil size, an index for processing load, did not differ between the judgments. Hence, it is unlikely that processing load differs between the judgments, and differences in the extraction of features remain as an alternative. Also, one may question to what extent it is possible to separate feature extraction from processing further downstream. Importantly, we showed that feature values for some color features are, overall, higher around fixations during beauty judgments than during roughness judgments, although there is individual variability in the colors that are attended. Behavioral results here, and in other data from our group (Jacobs, Haak et al., 2010), indicate that color information is important for determining beauty. These results support the idea that people attend to different features, depending on what is relevant for the task at hand. The results also suggest that (most) people attend predominantly to the beautiful, colored, parts of a stimulus, when judging for beauty. It would be interesting to see in future experiments if this changes when people judge for ugliness.
A parsimonious explanation for the longer fixation durations during roughness judgments may be the following: sensory evidence for the presence of certain features at a fixated location needs to build up and exceed some recognition threshold. If this build-up is slower, or the threshold higher, for the roughness features than for the beauty features, this would lead to longer fixation durations during roughness judgments. This procedure repeats, gathering information from different parts of the stimulus, until a decision has been reached.
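This account can be made concrete with a toy accumulation-to-threshold simulation; the drift rates, threshold, and noise level below are arbitrary illustrative values, not fitted parameters:

```python
import numpy as np

rng = np.random.default_rng(1)

def simulated_fixation(drift, threshold=1.0, dt=0.001, noise=0.1):
    """Accumulate noisy evidence at rate `drift` until it crosses
    `threshold`; return the crossing time in ms."""
    evidence, t = 0.0, 0.0
    while evidence < threshold:
        evidence += drift * dt + noise * np.sqrt(dt) * rng.standard_normal()
        t += dt
    return 1000 * t

# A slower build-up (roughness) yields longer fixations, on average,
# than a faster build-up (beauty).
beauty = np.mean([simulated_fixation(drift=3.0) for _ in range(200)])
rough = np.mean([simulated_fixation(drift=2.0) for _ in range(200)])
print(f"mean simulated fixation: beauty {beauty:.0f} ms, "
      f"roughness {rough:.0f} ms")
```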
It is not clear how models of eye guidance based on salience maps can explain our findings. In these models, longer fixations translate into higher salience for the fixated region; hence, longer fixations during roughness judgments would have to be translated into higher salience for the fixated regions. But as the information in our textures is highly repetitive, the computed salience would increase all over the stimulus. If fixation durations are based on the salience of a fixated region relative to the salience of its surroundings, the resulting durations would not be higher at all, and the modeling would fail. In addition, the models that incorporate task effects only deal with search tasks with predefined targets (Navalpakkam & Itti, 2005; Torralba et al., 2006). In the case of beauty judgments, there is no predefined search target, although it would be possible to translate such a task into a search for relevant features. Our findings seem to relate better to a window of attention, which is larger during beauty judgments. The window of attention tends to narrow for more difficult tasks (Ahissar & Hochstein, 2000), suggesting that the roughness judgment, or the extraction of the relevant features for this judgment, is more difficult. In terms of ambient versus focal modes of information processing (Unema et al., 2005), our findings mean that beauty judgments are associated with a more ambient (short fixations, long saccades) and roughness judgments with a more focal (long fixations, short saccades) mode of processing. We note, however, that the saccade durations were not significantly different between our judgments, although numerically in the right direction. We do not endorse the notion of ambient processing relying on the dorsal visual pathway and focal processing on the ventral visual pathway, as is often claimed in connection with the ambient-focal distinction (Unema et al., 2005). Models need to go beyond the computation of salience maps and incorporate eye movement parameters separately. Recently, reinforcement learning has been applied to model the deployment of human attention and eye gaze (Ballard & Hayhoe, 2009). As this approach includes not only where but also when people look, it seems to us a much better framework for modeling data of the type we provided.
The finding of higher amygdala activation during beauty judgments than during roughness judgments (Jacobs, Renken, Aleman, & Cornelissen, 2010) that inspired our current investigation may underlie the enforcement of the different scanning modes found here. The experiments were nearly identical, except that in one case brain activity was measured, while in the current experiment eye movements were monitored. Hence, the different findings related to the different judgments are likely to be associated, certainly when one considers reports of amygdalar involvement in attention (Anderson & Phelps, 2001; Carlson et al., 2009; Jacobs, Renken et al., 2010) and eye movements (Bancaud et al., 1966; Adolphs et al., 2005; Ohrmann et al., 2007; van Reekum et al., 2007).
One separate issue that deserves discussion is that we did not find effects of stimulus beauty on pupil size, despite reports in the literature pointing to such effects in the presence of emotional stimuli. For example, sounds such as babies' crying and laughter (Partala & Surakka, 2003), and visual stimuli that are selectively interesting to the different sexes, such as opposite-sex semi-nudes and pictures of mothers and babies (Hess & Polt, 1960), increase pupil size. Those effects may reflect arousal elicited by the stimuli. As our stimuli were clearly not very arousing, this may account for the absence of an effect of beauty on pupil size. Moreover, beauty itself may not be a very arousing aspect to judge, compared to evidently emotional judgments. In line with our results, the valence of written words does not seem to influence pupil size (Silk et al., 2009). The original Hess and Polt finding has been interpreted as reflecting a relationship between positive or pleasurable emotions and pupil size, even in textbooks (Mather, 2006), but this interpretation appears to be unwarranted.
There were no influences of judgment task on pupil size. Rated roughness, however, was related to pupil size, with rougher-looking textures eliciting pupil dilation. A relationship with rated roughness is thus established here, independent of whether roughness was explicitly rated. This relationship may be based on (a combination of) features.
Given the many features we computed, some of which are interrelated and none of which we manipulated systematically, we cannot draw firm conclusions about the relations of the dependent variables to those features. The correlations between features and some dependent variables were reported for confirmation in future experiments, although we believe that the correlation of several color features (saturation, yellowness) with beauty judgments is no coincidence, and the correlation between entropy measures and roughness judgments also makes sense to us.
Conclusions
We found task-driven differences in eye movement parameters between beauty and roughness judgments of identical texture stimuli. As the spatial distribution of dwell times did not differ between these judgments, the differences in the eye movement patterns must result from differences in non-spatial, hence feature-based, attention. Average fixation duration, the number of fixations and saccades, and the average saccade velocity differed between the two judgments. This points to differences in how people scan their environment, depending on their current goals. These differences cannot be explained by differences in the placement of the relevant information, as would be possible had paintings or photographs been used as stimuli. Rather, we believe they reflect a higher difficulty in extracting the relevant information during the roughness judgment.
So far, models of attention and eye guidance have focused on the guidance of the eyes to salient information, sometimes taking the observer's task into account. Our findings indicate that models of eye guidance need to go beyond spatial salience maps and incorporate top-down effects on eye movement parameters other than fixation location, for example by modeling effects of feature complexity on fixation duration.
People's fixation locations are a good index of what they are currently thinking about. We have shown that more subtle indices of eye movements may provide additional, valuable information about stimulus processing, such as the difficulty of extracting the features needed to assess higher-order qualities like the beauty and roughness of textures. The eyes are a window to the soul, as the English proverb goes. As we have shown, this may be true in more ways than hitherto acknowledged.