1. Introduction
Public acceptance of landscape changes due to energy infrastructure developments is a crucial issue in the implementation of the ongoing energy transition towards alternative energy sources [
1,
2]. With increasing energy production from decentralized renewable energy infrastructures, it is essential to expand also the transmission network to keep the security of supply at a high level [
3,
4]. In many European countries, however, public acceptance of new power lines is generally low, causing social resistance and in turn leading to project delays and increasing costs for the necessary grid expansion [
5]. Thereby, the landscape context, and how people perceive the impact on the view of the landscape, are significant factors [
4,
6]. However, the acceptance of power lines is higher when people see the grid expansion as a necessary measure because of decentralized energy production [
7]. Yet, there is lack of in-depth understanding how people perceive and accept power lines in certain landscape types and in combination with further infrastructure such as wind turbines. In the present paper, we demonstrate how innovative 3D point cloud visualizations can be used for an eye tracking study to systematically investigate these questions.
Laboratory experiments investigating the perception of the visual landscape aspect require highly realistic landscape representations [
8]. LiDAR (Light Detection and Ranging) data enables to capture surfaces with high angular and range resolution. The resulting point clouds allow to create 3D landscape simulations with high detail of the foreground as well as of built structures [
9]. Furthermore, the 3D point cloud models can be combined with other 3D objects such as power lines and wind turbines, so that the landscape model can easily be altered systematically. Such simulations were already implemented in a laboratory experiment focusing on the perception of different landscape types with scenarios of a mix of renewable energy systems, namely wind turbines and photovoltaic systems [
10]. In this experiment, also the participant’s physiological responses were measured by recording their electrodermal activity (EDA).
Physiological responses offer access to the strong emotional, subjective component of landscape values, which the purely “objective” measures, such as explicit preferences collected with questionnaires, do not reflect sufficiently [
11,
12]. Including both, cognitive and physiological responses into landscape perception studies can foster the understanding of people’s preferences for certain landscape changes [
11,
12,
13,
14]. However, employing EDA measurements requires a rather big visual difference (pleasant or unpleasant content) between the stimuli concerning the investigated aspect to identify meaningful differences in the skin conductance activity [
15]. This visual difference may not be sufficient by adding a power line to the landscape scene. An alternative measure of physiological responses on visual stimuli, which is capable to investigate the effect of subtle changes in images, is eye tracking. Yet, to the best of the authors’ knowledge these 3D point cloud simulations have not been used in studies employing eye tracking.
Eye tracking is a well-established method and has been used for the recording and analysis of eye movements [
16]. Eye tracking has been applied in a wide range of fields, including psychology [
17], reading research [
18], human-computer interaction and usability [
19,
20], cartography [
21], and spatial cognition [
22]. The movement of the eyes is measured, mapped to positions on a visual stimulus such as a large display, and aggregated spatiotemporally to fixations [
23]. Most eye tracking studies build upon the assumption that a fixation implies perception, and that the perceived content is processed during the fixation (eye–mind assumption) [
18]. Aggregated fixation times for one type of content, such as power lines, may therefore be used as an indicator for the amount of cognitive processing about that type of content. Orquin and Loose [
24] found that the elements focused on when viewing an image influence personal judgment most. Therefore, the duration of fixation can be a suitable measure to investigate the disturbance of landscape elements such as power lines. It is important to note, however, that it is difficult to find definite explanations for eye movements since they are determined by both, top-down (task-dependent) processes [
25] and bottom-up (image-based) processes [
26].
The goal of this paper is to demonstrate an example of a perception study using 3D point clouds for eye tracking. The investigation of the acceptance of power lines in different landscapes serves as case. Selected findings from literature provided the basis for the hypotheses tested in the study. Accordingly, landscape protection is one of the public’s main arguments against the construction of new power lines [
4,
6]. However, in landscapes perceived as less beautiful, the power lines are judged to be less disturbing [
6]. Further, power lines are perceived more negatively in rural and near-natural landscapes than in more urban and industrial areas [
6,
27]. The perception of landscapes differs also with the landscape structure. Homogeneous landscapes characterized by an open structure are explored less intensively because they exhibit less variation, while heterogeneous landscapes are more diverse and encourage greater exploration [
28,
29]. Moreover, acceptance of power lines has been found higher when people regard them as necessary for the promotion of renewable energy systems in order to implement the energy transition [
6]. These findings led to the following hypotheses:
More attention to power lines leads to more perceived disturbance and lower acceptance.
In near-natural landscapes power lines are perceived as more disturbing and are less accepted than in human-influenced landscapes.
Power lines in combination with many wind turbines are perceived as less disturbing and are better accepted than without or few wind turbines.
The results are discussed concerning the effectiveness of employing 3D point clouds for investigating the perception of visual landscape impacts of power lines with eye tracking. Finally, recommendations for further research directions are provided.
2. Materials and Methods
3D point cloud simulations served as stimuli in the experiment. In a preliminary study, we tested whether the simulations should be animated or not when using eye tracking for measuring the perception of the power lines’ impact in the landscape contexts and tested the overall procedure. The main study was conducted with animated simulations, and the eye tracking data as well as the participants’ answers on a questionnaire were collected. Finally, this data was statistically analyzed. In the following, further details on the participants, the stimuli generation, the setup of the laboratory, the preliminary study, the questionnaire, the overall procedure of the main study, and the analysis of the results of the experiment are presented.
2.1. Participants
The study involved 23 women and 26 men, aged between 21 and 35 years (M = 25.5 years, SD = 3.74 years), and with a background in geography, cartography, and spatial and landscape planning. Of the participants, 11 held a master’s degree, 22 a bachelor’s degree, and 16 a high school degree as their highest education. Women and men differed significantly in their attitude towards the acceptance of power lines, whereby male study participants were more positive about power lines (F(1,47) = 4.416, p = 0.037, N = 49). There were no significant differences in the attitude towards wind turbines. The participants were not compensated for their participation and took part voluntarily.
2.2. Visual Stimuli Generation
The 3D landscape simulations of four different Swiss landscape types, Jura, Pre-Alps, urbanized Plateau and urbanized Alps, were generated following and adapting the approach of Spielhofer et al. (2017). For specific locations in these landscape types in Switzerland, LiDAR data from airborne laser scanning (ALS) were collected from Swiss cantonal data bases and from terrestrial laser scanning (TLS) in the field. For the latter a RIEGL VZ-1000 laser scanner with up to 1.4 km range and a calibrated Nikon D700 camera mounted on top was used. The TLS data was obtained during June 2017 to have the summer state of the vegetation in all landscape scenes. The airborne LiDAR data was colorized with orthophotos of the respective landscape areas employing the point cloud processing software LIS Pro 3D [
30]. The point clouds from terrestrial laser scanning were further processed with the RiSCAN Pro [
31] software to register multiple scans recorded for a landscape area and colorize them with the photos taken with each scan. In order to homogenize the color scheme between the four landscape scenes, the colors of the photos were adjusted using Adobe Photoshop CC [
32].
The entire landscape scene of each of the landscape types was then set up using the software Cinema 4D [
33]. First, a digital terrain model (DTM) of the area to visualize was imported and textured with the respective orthophoto of that area. Then, the post-processed point clouds were imported to these basic scenes with the plug-in LAZPoint 2 [
34]. Further, a skybox was created and slightly moving clouds were added for a higher level of realism. In the next step, the scenes were systematically altered with energy infrastructure objects. In all scenes a power line in form of 3D objects (in the format *.fbx) of four pylons and cables was added in the middle ground of the scenes at approximately the same distance from the observer. Finally, for two scenarios 3D objects of either three or ten wind turbines were added, whereby the starting angle of the different rotors was slightly varied for a more realistic representation [
35]. The rotors were animated to turn with approximately 10 rounds per minute, which equals a wind speed of about 6 meters per second.
For a ten-second video at 25 frames per second, 250 frames were rendered for each scenario of the four landscape scenes from a pedestrian perspective at a view height of 1.6 m (
Figure 1). Using Adobe Photoshop [
32] filters adding haze were applied to the frames for designing the image depth of the renderings as realistically as possible. After the editing, the frames were exported using the render video menu, resulting in 12 video clips in *.mp4.
2.3. Setup of the Laboratory
The animated 3D visualizations were presented on a 65-inch monitor at 3840 × 2160 pixel. Participants were standing at a viewing distance of 167 cm, which resulted in a covered visual angle of roughly 46° horizontally and 26° vertically (
Figure 2). The eye tracker was placed between the participants and monitor at a distance of 60 cm to track the point of regard (
Figure 2). We used a Tobii TX 300 eye tracker, which we set to a sample rate of 300 Hz. In order to collect gaze data and the interaction with the stimuli we used the Software OGAMA 5.0 [
36]. Fixation detection is based on the dispersion-threshold-identification (I-DT) method described by Salvucci and Goldberg [
37]. We set the time threshold to 100 ms and the minimum dispersion to 20 pixels. At a viewing distance of 167cm this resulted in 1.37° of visual angle which lies well within the range of 1°–2° suggested by Holmqvist and colleagues [
16] for fixation detection.
2.4. Preliminary Study
Before designing the overall procedure of the experiment, it was tested whether the animated stimuli with turning rotors and moving clouds were suitable for the intended eye tracking study. Therefore, the differences in the perception of power lines were investigated in still and in animated simulations during a preliminary study. The attention attracted by power lines and by the other elements, i.e., clouds and 10 wind turbine rotors, was analyzed based on the duration of fixation on the areas of interest (AOI) of these elements. The latter are rectangular polygons defined by the experimenter, covering the respective individual element. The fixation durations of the six pre-test participants were summarized for each AOI (i.e., the power lines, the clouds, and the wind turbines) and weighted with the total duration of all fixations to receive the mean duration of the single fixations. The results show that in animated visualizations the fixation duration on wind turbines (1255.67 ms, SD = 888.46 ms) tends to be longer than in static visualizations (759.44 ms, SD = 562.32). However, there were rather no differences in the fixation duration on power lines between animated (1660.63 ms, SD = 870.75 ms) and static visualizations (1382.08 ms, SD = 1067.79 ms). The moving clouds were barely fixated and participants stated that they did not consciously perceive the cloud movement. Based on these results it was found the participants perceive types of power lines in both visualizations with a sufficient duration in order to form an opinion on their visual landscape impact. Furthermore, the wind turbine rotors would turn also in the real landscape and, hence, a higher level of attention to the wind turbines is a rather realistic setting.
In addition, it was tested whether the recording of eye tracking data makes sense while participants are answering questions on their perception and acceptance of the power lines and wind turbines in the presented landscape scenes. As the captured eye tracking data was still quite accurate and the results reasonable, this setup was found feasible.
2.5. Questionnaire
During the main experiment the participants answered different kinds of questions (
Table 1). The first set consisted of questions about the liking of the landscape and about disturbing elements. These questions were asked orally during the guided perception part of the eye tracking survey, which means that the participants were given a task directing their gazing behavior [
25]. The second set focused on how much the participants felt disturbed by the power lines and wind turbines and how much they accept these infrastructures. The liking of the landscape as well as the disturbance and acceptance of power lines and wind turbines were rated using a five-point Likert scale. In the last part, the participants answered questions on their attitudes and sociodemographic questions. As eye tracking data may be influenced by how a person is feeling, the participants were also asked whether they felt distracted as well as whether they were energetic or tired during the experiment. The latter two questions help in explaining data that is not sufficiently accurate due to strong variation or deviation from calibration. The second and third part of the questionnaire was prepared as web page with the tool Select Survey [
38] and participants filled it in using a tablet.
2.6. Procedure of the Main Study
In advance of the experiment, the participants received an information sheet on the course of the study and gave their consent to participate. They also had the opportunity to ask questions about the experiment. Then, each participant performed a five-point calibration to adjust the eye tracker to individual differences. Furthermore, after each task we provided a test image to the participant in order to detect fixation drifts and to ensure a consistent tracking quality among participants. In addition, each instruction screen displayed a red crosshair that the participant had to fixate before the stimulus was presented. Therefore, each participant started exploring the different landscapes from the same position.
In order to capture the free and the guided perception of the landscapes with the eye tracking, the participants were shown the landscapes in several rounds (
Figure 3). In a first run, the participants were asked to look at the four landscapes without a task. This bottom-up perception enabled an unbiased exploration of the landscape without directing the participants’ attention to specific elements [
39]. The eye movements while answering the questions posed by the experimenter, i.e. the guided perception, were recorded in the second run. In a third run without eye tracking, the participants answered the online questionnaire after perceiving the scenes, whereby the still image at the end of each video was still visible while they filled in the online-questionnaire. Finally, they answered the remaining questions.
2.7. Design and Analyses
The present experiment represents a 4 (landscapes: Jura, Pre-Alps, urbanized Plateau, urbanized Alps) by 3 (number of wind turbines: ZERO, LOW, HIGH) mixed factorial design. The combination of the factors landscape (factor A) and number of wind turbines (factor B) resulted in 12 different landscape scenes (
Figure 1). The participants saw a visualization for each of the four landscapes examined (factor A). In order to enable a comparison between the landscapes, all four landscapes were rated once by each participant (within-subject design). Further, to prevent effects caused by a different number of wind turbines between the different landscapes, this factor (factor B) was compared between three groups in a between-subject design. The participants in the groups were each shown the four landscapes either without (group ZERO), with three wind turbines (group LOW) or with ten wind turbines (group HIGH). This prevented the study participants from seeing landscapes with different numbers of wind turbines and from focusing on the elements of the landscape with bias. A balanced ratio of the number of participants in the three groups without, with three and with ten wind turbines was achieved by the alternating allocation of participants to the groups.
Based on the fixations, attention heat maps were created with the Software OGAMA 5.0 [
36]. The maps represent an attention distribution for each point with X and Y coordinates. The Gaussian distributions of all fixations were calculated, their values added and the amplitude normalized using a two-dimensional kernel to smooth the distribution [
36]. Further, for the individual areas of interest (AOI) the number of fixations as well as the total and the mean fixation time were calculated. In order to obtain comparable values between the landscapes, the size of the AOI was kept constant for the power lines and the wind turbines. However, for the wind turbines this was only possible in the scenarios with 3 wind turbines. In the scenarios with 10 wind turbines, AOIs overlapped due to their close grouping, which meant that the total area of the AOIs differed between the landscapes. Therefore, the share of the AOIs in the overall image were weighted by dividing the fixation times by the percentages of their areas in the overall image. In addition, the fixation durations were weighted with the total duration of all fixations to exclude effects due to missing data. The power lines were of the same size in all landscapes and occupied the same area in all landscapes. Therefore, the duration of fixation on power lines was weighted only with the total duration of all fixations. For the results see
Table A1 in the
Appendix.
The statistical analysis of the data was carried out with SPSS Version 25 [
40]. Eye tracking and survey data were analyzed for significant differences between landscape type and energy infrastructure with multivariate analysis of variance (MANOVA). Independent variables were the landscape types, the general landscape characteristics (near-natural or urban), and the number of existing wind turbines. The dependent variables were the assessment of disturbance and acceptance of power lines as well as the liking and coherence of the landscape scenes. Further, correlations between the eye movements and the participants’ answers on the questionnaire were investigated applying Spearman’s rank-order correlation.
4. Discussion and Conclusions
The goal of the study was to demonstrate an example of a landscape perception study using 3D point cloud simulations with a high level of visual realism as stimuli for gathering gaze behavior recorded by eye tracking. For showing the suitability of these simulations for such studies, we investigated the perception and acceptance of power lines in different landscapes and in combination with different amounts of wind turbines.
The results indicate that the 3D point cloud simulations were appropriate to produce results, which are in line with findings of other research groups using photographs. For example, we found that the gaze patterns differ between urban and near-natural landscapes during free perception. The more intensive exploration of the urbanized landscape scenes can be explained by their higher heterogeneity compared to the rather homogeneous near-natural landscapes (Dupont et al. 2014). However, we could not observe that during free perception more attention is given to power lines although they were regarded as disturbing elements. Hence, the participants’ opinions are not reflected in the eye tracking data. It seems that the participants did not need a long fixation time on the elements they perceived as disturbing to make their opinion. Furthermore, the fixation time on the power lines during free perception does show dependencies with the variation of the amount of wind turbines. However, concerning the disturbance and acceptance rating of the power lines depending on the number of wind turbines these gaze patterns do not provide significant correlations. Therefore, the free image-based gazing could not provide reasonable eye movement patterns explaining the level of disturbance and acceptance of power lines in the different landscape scenes.
In contrast, the gaze patterns during guided perception show correlations with the rated level of disturbance of the power lines and attitudes toward protecting the view of the landscape. However, only in the urban landscape types the fixation time on the power lines correlates with the rated level of disturbance. In the urban landscapes, the power lines were rated generally as less disturbing and are more accepted than in near-natural landscapes, which matches the results of Lienert and colleagues [
7].
Overall, differences in acceptance and disturbance between landscapes were identified but the patterns can hardly be investigated with eye tracking only. Whereas the data of the free perception is difficult to interpret, the guided perception is more promising to record clearer differences in eye movements. Further, eye tracking focuses only on the gaze behavior in response to the presented landscape. However, there are further physiological measures, which allow capturing unconscious affective responses such as facial electromyography (EMG), which is regarded as reliable measure for pleasant or unpleasant images [
15]. Combining eye tracking with such measures of affective responses might help getting a better understanding of the landscape perceptions and valuations.
For further evaluating the quality of the point cloud simulations as valid stimuli for landscape perception studies, the gaze patterns recorded in the laboratory should be compared with the ones that can be observed at the actual location in the real world [
41]. Augmented reality glasses including eye tracking [
42,
43] could be suitable devices that allow for such studies. This validation is crucial because the 3D point cloud simulations enable a very precise representation of the real-world situation and of intended landscape changes accordingly. This makes them a powerful tool for better informed landscape planning and design, which can actually be informed by the cultural values landscapes give to people.