Next Article in Journal
Multimodal Feature-Assisted Continuous Driver Behavior Analysis and Solving for Edge-Enabled Internet of Connected Vehicles Using Deep Learning
Next Article in Special Issue
Time-Domain Sound Field Reproduction with Pressure and Particle Velocity Jointly Controlled
Previous Article in Journal
The Future Impact of Carbon Tax on Electricity Flow between Great Britain and Its Neighbors until 2030
Previous Article in Special Issue
Multi-Zone Active Noise Control Strategy for the Scattered Sound Control of an Infinite Rigid Cylinder
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Quantifying Auditory Presence Using Electroencephalography

Key Laboratory of Modern Acoustics, Institute of Acoustics, Nanjing University, Nanjing 210093, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2021, 11(21), 10461; https://doi.org/10.3390/app112110461
Submission received: 30 September 2021 / Revised: 29 October 2021 / Accepted: 2 November 2021 / Published: 7 November 2021
(This article belongs to the Special Issue Sound Field Control)

Abstract

:
Presence is used to assess the subjective experience of being in one place when physically situated in another. Recently, the research on presence has gained increasing attention due to the wide use of immersive audio technologies. Currently, the most widely-used measurement of presence is based on post-experiment self-report questionnaires. It is reliable but imperfect due to the psychological changes caused by the act of answering the questionnaire when immersed in the virtual environment. Therefore, the present work aims to find an objective way to measure presence, and electroencephalography (EEG) was investigated as a possible tool for this objective measurement. In this study, two listening tests were conducted, where eight loudspeakers were used to reproduce urban soundscapes to stimulate auditory presence. Presence was measured by both questionnaires and EEG. Results showed a significant correlation between T/B (Theta/Beta Ratio) extracted from EEG and subjective presence levels assessed by questionnaires, suggesting the possible use of EEG to measure presence objectively. This study could bring some insight for the research of presence, and related technologies, such as VR, video games and immersive audio production.

1. Introduction

1.1. Presence

Presence is used to assess the participants’ sense of “being there” in the virtual environment [1]. It can be defined explicitly as the subjective experience of being in one place or environment, even when physically situated in another [2]. In other words, it reflects the extent to which the participants’ cognitive and perceptual systems are tricked into believing they are somewhere other than their physical location [3]. Generally, the term presence refers to physical presence in many applications, though it can also be classified as social presence or co-presence [4,5].
Many researchers have assessed presence earlier, but most of them focused on the visual aspect of the stimuli, for example, the size and resolution of the screen [6], or the visual fidelity of the animation [7]. Less attention was paid to the auditory aspect of the stimuli. Regarding research focusing on the auditory cues of presence [8,9,10,11,12,13], most experiments still used visual display to accompany auditory stimuli. Besides, the audio stimuli these researchers used were mostly stereophonic or binaural recordings, whereas a loudspeaker array is used in the present work.

1.2. Subjective Measurement

Typically, presence is measured via questionnaires. There are many types of questionnaires available for this purpose, and Schwind summarized 15 that had been used in earlier studies [14]. Among them, the most widely used questionnaires were Presence Questionnaire (PQ) by Witmer and Singer [2,15] and SUS Questionnaire by Slater, Usoh and Steed [1,14,16,17,18].
Subjective measurement based on questionnaires is still the most widely-used and reliable way to evaluate presence, because it is the direct response reported by participants. However, it is sometimes imperfect due to its inherent characters. For example, the act of answering questionnaires would cause the participants to leave the virtual environment and cause a break in presence (BIP) [18,19,20]. Let alone the questionnaires that require papers and pens, Schwind et al. reported that even answering the questionnaires within VR would still affect the consistency of data [14]. Besides, participants have to average their response during the entire process when answering the questionnaire, which means the measurement is not real-time. Slater noted that researchers should rely less on questionnaires in the studies of presence [19]. For this reason, we seek to explore the objective measurement of presence to possibly improve on that.

1.3. Objective Measurement

Physiological signals generated from the body can be directly collected to measure presence (e.g., peripheral physiological signals, such as heart rate and skin conductance [21,22] and neurological signals [23]) without the influence of subjective opinions of the participants. This kind of objective measurement is straightforward and can be real-time and reliable if conducted with strictly controlled procedures or algorithms. Electroencephalography (EEG) has gained its advantage by the freedom it gives the participants when wearing the EEG headsets, especially compared to other devices such as fMRI [6]. EEG is the recorded electrical activity generated by the brain via electrodes placed on the scalp. It is the superposition of many simpler electric signals generated by millions of neurons, reflecting brain activity. EEG has already been used in many types of research regarding presence [6,7,22,24,25,26,27], and other fields [28,29,30]. This paper explored the possibility of measuring presence using EEG data.
The power spectrum of the EEG signal is an informative indicator to reveal neurological activities. Several frequency bands are commonly used, including delta (0.5–4 Hz), theta (4–8 Hz), alpha (8–13 Hz), beta (13–30 Hz) and gamma (30 Hz and above). Different frequency bands reflect different mental states. Considering that the deep sleep state indicated by delta waves and clinical diseases indicated by gamma waves [31] are not related to the purpose of this study, the three frequency bands of interest in this study are theta, alpha and beta.
The power ratio between two specified frequency bands is also commonly used in clinical and cognitive neuroscience [32,33,34,35,36]. Using the three frequency bands of interest in this study (theta, alpha and beta) and following the pattern of calculating the ratio as “slow-wave/fast-wave”, the power ratios available in the present study could be T/A (theta/alpha), T/AB (theta/(alpha + beta)), T/B (theta/beta) and A/B (alpha/beta).

1.4. Summary

This present study seeks to find the answers to the following questions. Besides subjective measurement by questionnaires, is it possible to objectively measure presence with EEG? If yes, which EEG index should we use specifically to reflect the level of presence, absolute band power or specific power ratio? Is there any correlation between these two measurement methods?
To answer these questions, two experiments were conducted where eight loudspeakers were used to reproduce urban soundscapes to stimulate auditory presence. During the experiment, both subjective measurement (based on questionnaires) and objective measurement (based on EEG signal) of presence were carried on. Then, we analyzed the subjective presence levels and EEG indices via different statistical tests, and further evaluated the correlation between these two measurement methods.
The rest of the paper is organized as follows: Section 2 explains the methodology that we used in the study. In Section 3, we present the results followed by a discussion in Section 4. Then, in Section 5, we draw the conclusion and give a few suggestions for future work.

2. Materials and Methods

2.1. Program Selection and Listening Environment

In this study, urban soundscape was used to stimulate presence. Urban soundscape is the combination of the natural acoustic environment (e.g., the sound from trees and water) and environmental sounds created by humans. It can be considered as the sonic landscape of urban scenes, and can be used for the research of city designing, noise control, etc. There are three advantages of using it in this study. First, it eliminates the influence of familiarity on the participants’ performance in the listening test, because almost everyone has the experience of being in the urban scenes (e.g., on the street or in the railway station). Second, it widens the choice of participants, because for this study, participants were only required to assess the feeling of being in a virtual space. It made the task easier and thus lowered the threshold for selecting participants. Third, compared with using music as programs, the participants’ performance would be less affected by preference or other personal affections.
This study used ESMA-3D Immersive Soundscape Recordings by Dr. Hyunkook Lee’s team [37] as urban soundscape. The ESMA-3D recording technique uses eight microphones for 360° audio capture. Four microphones in the main layer capture sound sources, and the other four in the upper layer capture ambience and elevated sources. By its nature, the reproduction of ESMA-3D could be achieved by the 8-channel double-layer quad-speakers array (i.e., cube array) [38]. As for programs, six out of 13 soundscapes of the original sound file were chosen to shorten the duration of the experiment, including four outdoor scenes (Union Square, Central Park, W 34th St Penn Station, and Adam St under the Manhattan Bridge) and two indoor scenes (Grand Central Terminal, and factory at W 34th & 10th Ave). In the authors’ opinion, these soundscapes have the best potential to stimulate presence because they can show moving objects (e.g., a bus arriving and leaving the bus station) or unique acoustic character of chambers (e.g., broadcast in the railway station).
The experiments were carried out in an air-conditioned room with no other external noise, as shown in Figure 1. The room was acoustically treated with carpet on the floor and curtains 15 cm ahead of each wall.
There were 32 loudspeakers (GENELEC 8020D) in the room. Only eight were selected to form a cube array for the sound reproduction, with four loudspeakers in the main layer and the other four in the upper layer. The distance from each loudspeaker to the listening spot was carefully adjusted to be 3 m. The acoustic center of the four loudspeakers in the main layer was set to be 1.2 m from the ground, which is approximately ear height when seated. The acoustic center of the four loudspeakers in the upper layer was set to be at the height of 2 m, elevated at about 45° from the listening point. Thus, the 8-channel signal could be distributed to eight corresponding loudspeakers without further processing.

2.2. Questionnaires

The questions related to auditory stimuli within the Presence Questionnaire and the SUS Questionnaire were selected and combined into a new questionnaire. The selected four questions were as follows:
  • Please rate your sense of being in the virtual environment on a scale of 0 to 10, where 10 represents your normal experience of being in a place;
  • During your experience, did you often think to yourself that you were actually in the virtual environment? Please rate it on a scale of 0 to 10, where 10 represents you almost feel you were actually in the virtual environment;
  • How well could you identify sounds? Please rate it on a scale of 0 to 10, where 10 represents you could clearly identify different kinds of sounds;
  • How well could you localize sounds? Please rate it on a scale of 0 to 10, where 10 represents you could easily detect the location of each sound.
The first two questions were chosen from the SUS Questionnaire. The first question focuses on rating the experience in the virtual environment compared to the real world, and the second question focuses on the subjective feeling of placing oneself in the virtual environment. The last two questions were chosen from the Presence Questionnaire to evaluate the realism of the sound field of the virtual environment. This questionnaire design was similar to Hendrix and Barfield’s study [9], where they assessed auditory presence in their experiment with five questions, three to evaluate presence and the other two to evaluate the spatialized sound. In this study, detailed explanations of these four questions were given to the participants during the experiment.
The final presence score was the sum of the answers to these four questions. As the questions suggest, the higher the score, the more the presence. In this way, the subjective assessment of presence was acquired.

2.3. EEG Apparatus

Conventional measurement of EEG is time-consuming and constrains the movement of participants. Therefore, a multi-channel wireless portable EEG device (Emotiv Epoc X [39]) was used. Its ability to collect reliable EEG data has been validated [40,41]. It guaranteed that participants could immerse themselves in the experiment without feeling the EEG measurement equipment being there.
The placement of electrodes of the EEG device is in agreement with the international 10–20 system, but restricted in the number. Data were collected through 14 channels, using 14 active electrodes (placed on AF3, F7, F3, FC5, T7, P7, O1, O2, P8, T8, FC6, F4, F8, AF4) and four reference electrodes (placed on TP9, P3, P4, and TP10). The layout of the electrodes on the scalp is shown in Figure 2. These 14 electrodes could cover the whole brain area and provide sufficient data for processing [42]. The EEG sampling rate was 128 Hz. The EmotivPro software was used to acquire raw EEG data from the headset.

2.4. Experiment Design

2.4.1. Experiment 1

In Experiment 1, two types of programs were utilized: (1) the original version (Program A): six scenes included, each lasting 30 s, with the total time of three minutes; (2) the mono version (Program B): with the same content as the original version, but signals from eight channels were added together and then copied to each channel. That is, the signal of every channel was the same, and participants would hear the same sound from eight loudspeakers. Given that the eight loudspeakers were placed at an equal distance from the listening spot, we were expecting the effect of listening to Program B to be similar to listening to mono sound and less immersive than listening to Program A.
Experiment 1 was conducted with 18 experienced listeners (16 males, 2 females, ages ranging between 23–27 years old). They were selected from the lab, and none of them reported hearing damages. These 18 participants were randomly divided into two groups. Participants in Group I (nine participants) were presented with Program A followed by Program B, with 15 s of silence in between. Participants in Group II (nine participants) were presented with Program B ahead of Program A instead, still with 15 s of silence in between. This design aims to rule out the influence of the order of presentation.
We divided participants into two groups but did not require the same participant to participate in the experiment twice, because this study is distinct from traditional listening tests, where participants were required to assess spectral irregularities or distortion. Those tasks require specific listening skills that could be repeated many times. However, assessing presence depends more on the first-time experience. Being too familiar with the programs would cause participants to lose concentration on the task and give unreliable results. In similar studies [6,21,27,30] participants were also presented with two or three conditions for comparison within a single test.
The procedure was as follows. (1) Participants were first welcomed and given the introduction of the experiment. (2) Program A was played to make the participants familiar with the program and the immersive sound. (3) Participants were fitted with the EEG headset, and each electrode’s conductivity and EEG quality were checked. (4) Participants were presented with the programs, and EEG was measured in the meantime. To help participants immerse themselves when listening, they were recommended to close their eyes. If not, there were still no visual clues of which loudspeaker was playing sound. When listening, the task for the participants was only to assess presence by their experience, without going into details of the evaluation of spaciousness such as ASW (Apparent Source Width) or LEV (Listener Envelopment) [43] (5) After measurement, participants were required to fill in the questionnaire. The duration of the whole experiment was approximately 20 min.

2.4.2. Experiment 2

In Experiment 2, two types of programs were utilized: (1) the original version (Program A); (2) the 2D version (Program C): the four height channels (Left Front Height, Right Front Height, Left Rear Height, Right Rear Height) of the original version were muted. Thus, the ESMA-3D degenerated into ESMA without height channels. We called it the 2D version for this reason. The purpose of making Program C was to create a program that is subtly different from Program A, with the difference smaller than that between Programs A and B. This serves the purpose of testing the consistency between subjective and objective measurements under a more stringent condition than Experiment 1.
Experiment 2 was conducted with 25 naïve listeners (8 males, 17 females, ages ranging between 21–28 years old). They were volunteers selected in different majors across the campus, and none of them reported hearing damages. Similar to Experiment 1, the participants for Experiment 2 were also divided into two groups: 12 in Group I and 13 in Group II. Participants in Group I were presented with Program A followed by Program C, with 15 s of silence in between. Participants in Group II were presented with Program C ahead of Program A, still with 15 s of silence in between. The procedure of Experiment 2 was the same as Experiment 1.

2.5. Signal Processing

Raw EEG data were processed using a brain signal processing toolbox in MATLAB called EEGLAB [43]. EEG data were first applied with a bandpass filter of 1–50 Hz to screen the unwanted noise and keep the frequency range of interest. The filtered data were further cleaned via a built-in plugin of EEGLAB by the name of Clean Rawdata. Bad portions of data caused by non-physiologic artifacts (e.g., insufficient contact of electrodes with the head surface or sudden large movements) were removed based on their standard deviation during that process. To further remove physiologic artifacts, such as heart beat (ECG), eye movements (EOG), muscle pulsations (EMG), Independent Component Analysis (ICA) was run. Next, another EEGLAB plugin, MARA (“Multiple Artifact Rejection Algorithm”) was utilized for automatic classification of artifactual ICA components [44,45]. The components rejected by MARA were then removed from the original data.
A MATLAB script was run with the pre-processed data to calculate the power of each frequency range and the power ratios. The script was organized as follows. The power spectral density (PSD) was first calculated using Welch’s method with a Hanning window of the length of one second. Then, different frequency bands were defined as described in Section 1.3, and the power of each frequency band was summed. Lastly, different power ratios of interest were calculated.

3. Results

3.1. Experiment 1

In Experiment 1, 16 sets of valid data were collected. Data from two participants were ruled out due to their poor condition during the experiment. One participant in Group I reported that he was sleepy and could not concentrate on the experiment, and one participant in Group II reported that he got distracted by his phone.
The presence scores calculated from the questionnaires are displayed in Figure 3, in which Participant 1–8 belonged to Group I, and Participant 9–16 belonged to Group II. It demonstrates a dramatic difference in the presence score between Programs A and B. Presence score rated for Program A was higher than B for almost all participants except Participant 10.
The presence score was normally distributed according to the Kolmogorov–Smirnov method (for Program A, p = 0.104; for Program B, p = 0.466). Given that each participant rated the presence score for Programs A and B successively, the presence score for Programs A and B should be treated as pairs when evaluating the difference between them. The paired t-test showed that the difference in presence score was significant at a significance level of 0.05 (p = 1.28 × 10 4 ). Besides, paired t-tests were also carried out on the difference in the ratings of each question between the two programs. The results showed that there was significant difference between the two programs for all four questions. (For Question 1, p = 2.85 × 10 4 ; for Question 2, p = 1.25 × 10 4 ; for Question 3, p = 3.195 × 10 3 ; for Question 4, p = 8.2 × 10 5 .)
Calculated EEG indices in Experiment 1 are shown in Table 1. The absolute powers of theta, alpha and beta and a series of power ratios are presented. The power ratios used were T/A (theta/alpha), T/AB (theta/(alpha+beta)), T/B (theta/beta) and A/B (alpha/beta). It can be seen that the distribution of these EEG indices fluctuated greatly. For different participants, the fluctuation was large, and even for the same participant in different stages, the indices still varied vastly. The values of all EEG indices for both Programs A and B (each column of Table 1) did not meet the normal distribution (p < 0.05) according to the Kolmogorov–Smirnov method, indicating the paired t-test was no longer suitable. Thus, the Wilcoxon signed rank test [46] was used instead, which does not require the data to be normally distributed.
Results of the Wilcoxon signed rank test for each EEG index pair (for example, theta power for Programs A and B) are shown in Table 2. It can be seen that only the difference in beta power between Programs A and B was significant at a significance level of 0.05. The differences in alpha power, T/AB and T/B were close to significant.
Because of the significant difference in both presence score and specific EEG indices between Programs A and B, it is worth analyzing the correlation between them. To this end, we used the chi-squared test, a non-parametric statistic. Data were first transformed into the dichotomous format for further processing, as shown in Table 3. If the presence score for Program A is higher than B, it would be marked as 1, and 0 if otherwise. Meanwhile, if the EEG index, for example, T/B for Program A is higher than B, it would be marked as 1 as well.
In the row direction of Table 3, the dichotomous data of presence score and EEG indices came from the same participant, indicating they should be treated as pairs in the correlation analysis. For this reason, we utilized the McNemar test within the range of chi-squared tests to be more specific [47,48].
Table 4 presents the results of the McNemar test. We could see that there was a significant difference (p < 0.05) between presence score and the absolute powers (theta, alpha and beta), suggesting that they were not statistically correlated. Instead, there was no significant difference (p > 0.05) between presence score and the four power ratios, indicating a significant correlation between them. In addition, the Pearson correlation coefficient was also obtained with the McNemar test. We could see that presence score only showed a significant (p < 0.05) correlation with T/B among all seven indices, with the correlation coefficient being 0.683. It suggested that if the presence score for program A was higher than B, it would be highly possible that the T/B for program A was also higher than that for B. On the contrary, the correlation was insignificant (p > 0.05) for the other six indices.
To summarize, we first found significant differences in the presence score and beta power when presenting different programs, proving that participants were able to detect the difference between the two programs. Then, we investigated the correlation between presence score by subjective questionnaires and EEG indices by objective measurements. Considering both the results of the McNemar test and the Pearson correlation coefficient, we found that among all EEG indices used in this study, only T/B showed a significant correlation with the presence score.

3.2. Experiment 2

In Experiment 2, data from one participant in Group II were ruled out considering that she could not tell the difference between Programs A and C. Thus, 24 sets of valid data were collected as shown in Figure 3, in which Participant 1–12 belonged to Group I, and Participant 13–24 belonged to Group II.
As shown in Figure 4, there was a trend that the presence score for Program A was higher than that for C. Though not as overwhelming as in Experiment 1, this trend still applied to most participants (19 of 24). The paired t-test showed that p = 0.02 < 0.05, indicating a significant difference in the presence score at a significance level of 0.05.
By the results of Experiment 1, the presence score only showed a significant correlation between T/B within all the indices. Therefore, in Experiment 2, we put emphasis on checking the reliability of this correlation. As shown in Table 5, these two sets of values did not meet normal distribution according to the Kolmogorov–Smirnov method (for Program A, p = 2.6 × 10 5 ; for Program C, p = 1.58 × 10 4 ). Thus, the Wilcoxon signed rank test was applied, and the results (p = 0.04) showed that the difference in T/B between the two programs was significant.
Then, the McNemar test was performed to evaluate the correlation between presence score and T/B. The results (p = 0.625) showed no significant difference between presence and T/B, indicating the existence of correlation. Besides, the Pearson correlation coefficient was 0.574 (p = 0.003), confirming the correlation from a statistical point of view.
To summarize, we first found significant differences in both presence score and T/B when presenting different programs. Then, the correlation between the measure of presence score and the measure of T/B was evaluated, showing no significant difference between these two measurement methods. The correlation coefficient was significantly different from 0, as well.

4. Discussion

Up to now, we could answer the questions in Section 1.4. In this study, we first found that presence score was significantly different between programs. In addition, T/B (Theta/Beta Ratio) extracted from EEG signal showed significant differences between programs, and also demonstrated a significant correlation with the presence score after the data transformation. This indicates that the results of EEG measurement, together with subjective measurement both showed significant differences between different loudspeaker setups, and they were correlated to a great extent. This implies the possibility of EEG to be a potential tool for the objective measurement of presence.

4.1. Post-Test Interview

Besides questionnaires, we also interviewed the participants about their experience within the virtual environment. We obtained similar answers from the participants who rated higher presence levels for Program A. They reported that the sound came from all directions, and they felt enveloped by the sound. On the contrary, answers from the participants who gave higher presence scores for the counterpart of Program A (Program B or C) were more interesting. In Experiment 1, the one who rated a higher presence level for Program B reported that he could concentrate more on one or two items in the scenes during Program B. In Experiment 2, the five participants who rated higher presence levels for Program C reported that they felt the ambience from the four height channels in Program A has masked the sound from nearby (such as conversation on the left or the bus leaving in front) and degraded presence. The interview results indicate that not all participants prefer the 8-channel setup, and suggest that it is not always the case that “the more loudspeakers, the better the experience”.

4.2. T/B in Neuroscience

In Section 3, we found a significant correlation between presence and T/B. It was still unclear what this stood for from the perspective of neuroscience. Witmer and Singer [2] stated that presence in a virtual environment depends on one’s attention shifting from the physical environment to the virtual environment. In the meantime, T/B (Theta/Beta Ratio) has been found to be related to attention control [49,50]. van Son et al. [51] found that the frontal T/B was significantly higher during mind-wandering episodes than during the time on-task, which indicated a state of attention control over thoughts. Besides, T/B has been used as a diagnostic biomarker of attention deficit hyperactivity disorder (ADHD) in clinical diagnose [52]. This could give a little clue about the connection between presence and T/B, but more detailed research is still needed in future work.

4.3. Correlation between Questionnaires and EEG Indices

In Section 3, the McNemar test was used to evaluate the correlation between presence score and EEG indices, and revealed no significant difference between these two metrics in distinguishing which program stimulated a higher presence level. Another statistic metric, Cohen’s kappa could also be utilized. It is commonly used to measure the agreement between different observers on the same judgement. Kappa is standardized to a −1 to 1 scale. A kappa of 1 indicates perfect agreement; a kappa of 0 indicates agreement equivalent of chance; negative values indicate agreement less than chance [53]. We could take kappa as a tool to evaluate the agreement between questionnaires and EEG measurements.
Kappa values between the presence score and each EEG index (all indices in Experiment 1 and T/B in Experiment 2) were calculated as shown in Table 6. It can be seen that only the kappa values between presence score and T/B were significantly different from 0, proving the consistency between them to be significant. Kappa was 0.636 in Experiment 1 and 0.56 in Experiment 2, indicating substantial agreement between presence score and T/B. In addition, kappa in Experiment 2 was a little lower than that in Experiment 1. This is consistent with the fact that the difference between Programs A and C is smaller than that between Programs A and B.

5. Conclusions

This paper presented an analysis of auditory presence using eight loudspeakers to reproduce urban soundscapes as programs. Two experiments were conducted where presence was measured both subjectively with questionnaires and objectively with EEG. For subjective evaluation, the presence score showed significant differences between loudspeaker setups. For objective measurement, beta power and T/B extracted from EEG signals showed significant differences in the meantime. Correlation between presence score and EEG indices was analyzed, and T/B demonstrated a significant correlation with presence score.
The results of the present work suggest that the results of EEG measurement and subjective measurement both showed significant difference between different loudspeaker setups, and they were correlated to a great extent. This indicates the possibility of EEG to be a potential tool for the objective measurement of presence. Among all indices extracted from EEG signals, T/B has the largest potential to be the objective marker for the measurement of presence. Future work could focus on building a model for predicting presence by measuring the T/B only once, which may further make it easier to measure presence.

Author Contributions

Conceptualization, S.Z.; methodology, S.Z. and X.F.; software, S.Z.; formal analysis, S.Z.; investigation, S.Z. and X.F.; resources, X.F. and Y.S.; data curation, S.Z.; writing—original draft preparation, S.Z.; writing—review and editing, X.F. and Y.S.; visualization, S.Z.; supervision, Y.S.; project administration, Y.S.; funding acquisition, Y.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by National Key R&D Program of China 2018YFB1403800.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to thank Ziyun Liu for the suggestions for this study, and all listeners who participated in the listening test.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
EEGelectroencephalography
T/Athe power ratio of theta/alpha
T/Bthe power ratio of theta/beta
T/ABthe power ratio of theta/(alpha+beta)
A/Bthe power ratio of alpha/beta

References

  1. Slater, M.; Usoh, M.; Steed, A. Depth of Presence in Virtual Environments. Presence Teleoperators Virtual Environ. 1994, 3, 130–144. [Google Scholar] [CrossRef]
  2. Witmer, B.G.; Singer, M.J. Measuring presence in virtual environments: A presence questionnaire. Presence Teleoperators Virtual Environ. 1998, 7, 225–240. [Google Scholar] [CrossRef]
  3. Patrick, E.; Cosgrove, D.; Slavkovic, A.; Rode, J.A.; Verratti, T.; Chiselko, G. Using a large projection screen as an alternative to head-mounted displays for virtual environments. In Proceedings of the SIGCHI conference on Human Factors in Computing Systems, The Hague, The Netherlands, 1–6 April 2000; pp. 478–485. [Google Scholar]
  4. Riva, G.; Davide, F.; IJsselsteijn, W.A. Being there: The experience of presence in mediated environments. In Being There: Concepts, Effects and Measurement of User Presence in Synthetic Environments; Ios Press: Amsterdam, The Netherlands, 2003; Volume 5. [Google Scholar]
  5. Agrawal, S.; Simon, A.; Bech, S.; BÆrentsen, K.; Forchhammer, S. Defining immersion: Literature review and implications for research on audiovisual experiences. AES J. Audio Eng. Soc. 2020, 68, 404–417. [Google Scholar] [CrossRef]
  6. Clemente, M.; Rodríguez, A.; Rey, B.; Alcañiz, M. Assessment of the influence of navigation control and screen size on the sense of presence in virtual reality using EEG. Expert Syst. Appl. 2014, 41, 1584–1592. [Google Scholar] [CrossRef]
  7. Dey, A.; Phoon, J.; Saha, S.; Dobbins, C.; Billinghurst, M. A Neurophysiological Approach for Measuring Presence in Immersive Virtual Environments. In Proceedings of the 2020 IEEE International Symposium on Mixed and Augmented Reality, ISMAR 2020, Porto de Galinhas, Brazil, 9–13 November 2020; pp. 474–485. [Google Scholar] [CrossRef]
  8. Hendrix, C.; Barfield, W. Presence in virtual environments as a function of visual and auditory cues. In Proceedings of the Virtual Reality Annual International Symposium ’95, Research Triangle Park, NC, USA, 11–15 March 1995; pp. 74–82. [Google Scholar]
  9. Hendrix, C.; Barfield, W. The sense of presence within auditory virtual environments. Presence Teleoperators Virtual Environ. 1996, 5, 290–301. [Google Scholar] [CrossRef]
  10. Rumsey, F.; Berg, J. Verification and correlation of attributes used for describing the spatial quality of reproduced sound. In Proceedings of the Audio Engineering Society Conference: 19th International Conference: Surround Sound-Techniques, Technology, and Perception, Schloss Elmau, Germany, 21–24 June 2001. [Google Scholar]
  11. Ozawa, K.; Chujo, Y.; Suzuki, Y.; Sone, T. Contents which yield high auditory-presence in sound reproduction. Kansei Eng. Int. 2002, 3, 25–30. [Google Scholar]
  12. Ozawa, K.; Ohtake, S.; Suzuki, Y.; Sone, T. Effects of visual information on auditory presence. Acoust. Sci. Technol. 2003, 24, 97–99. [Google Scholar] [CrossRef]
  13. Nicol, R.; Dufor, O.; Gros, L.; Rueff, P.; Farrugia, N. EEG measurement of binaural sound immersion. In Proceedings of the EAA Spatial Audio Signal Processing Symposium, Paris, France, 6–7 September 2019; pp. 73–78. [Google Scholar]
  14. Schwind, V.; Knierim, P.; Haas, N.; Henze, N. Using presence questionnaires in virtual reality. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, Glasgow, Scotland, UK, 4–9 May 2019; pp. 1–12. [Google Scholar]
  15. Witmer, B.G.; Jerome, C.J.; Singer, M.J. The factor structure of the Presence Questionnaire. Presence Teleoperators Virtual Environ. 2005, 14, 298–312. [Google Scholar] [CrossRef]
  16. Slater, M.; McCarthy, J.; Maringelli, F. The influence of body movement on subjective presence in virtual environments. Hum. Factors 1998, 40, 469–477. [Google Scholar] [CrossRef] [Green Version]
  17. Usoh, M.; Arthur, K.; Whitton, M.C.; Bastos, R.; Steed, A.; Slater, M.; Brooks, F.P., Jr. Walking > walking-in-place > flying, in virtual environments. In Proceedings of the 26th Annual Conference on Computer Graphics and Interactive Techniques, Los Angeles, CA, USA, 8–13 August 1999; pp. 359–364. [Google Scholar]
  18. Slater, M.; Steed, A. A virtual presence counter. Presence Teleoperators Virtual Environ. 2000, 9, 413–434. [Google Scholar] [CrossRef] [Green Version]
  19. Slater, M. How colorful was your day? Why questionnaires cannot assess presence in virtual environments. Presence 2004, 13, 484–493. [Google Scholar] [CrossRef]
  20. Sanchez-Vives, M.V.; Slater, M. From presence to consciousness through virtual reality. Nat. Rev. Neurosci. 2005, 6, 332–339. [Google Scholar] [CrossRef]
  21. Meehan, M.; Insko, B.; Whitton, M.; Brooks, F.P., Jr. Physiological measures of presence in stressful virtual environments. ACM Trans. Graph. (TOG) 2002, 21, 645–652. [Google Scholar] [CrossRef] [Green Version]
  22. Terkildsen, T.; Makransky, G. Measuring presence in video games: An investigation of the potential use of physiological measures as indicators of presence. Int. J. Hum. Comput. Stud. 2019, 126, 64–80. [Google Scholar] [CrossRef]
  23. Baumgartner, T.; Speck, D.; Wettstein, D.; Masnari, O.; Beeli, G.; Jäncke, L. Feeling present in arousing virtual reality worlds: Prefrontal brain regions differentially orchestrate presence experience in adults and children. Front. Hum. Neurosci. 2008, 2, 8. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Kober, S.E.; Kurzmann, J.; Neuper, C. Cortical correlate of spatial presence in 2D and 3D interactive virtual reality: An EEG study. Int. J. Psychophysiol. 2012, 83, 365–374. [Google Scholar] [CrossRef] [PubMed]
  25. Kober, S.E.; Neuper, C. Using auditory event-related EEG potentials to assess presence in virtual reality. Int. J. Hum. Comput. Stud. 2012, 70, 577–587. [Google Scholar] [CrossRef]
  26. Clemente, M.; Rodriguez, A.; Rey, B.; Alcañiz, M. Measuring presence during the navigation in a virtual environment using EEG. Stud. Health Technol. Inform. 2013, 191, 136–140. [Google Scholar]
  27. Abromavičius, V.; Gedminas, A.; Serackis, A. Detecting sense of presence changes in EEG spectrum during perception of immersive audiovisual content. In Proceedings of the 2017 Open Conference of Electrical, Electronic and Information Sciences (eStream), Vilnius, Lithuania, 27 April 2017; pp. 1–4. [Google Scholar]
  28. Lim, S.; Yeo, M.; Yoon, G. Comparison between concentration and immersion based on EEG analysis. Sensors 2019, 19, 1669. [Google Scholar] [CrossRef] [Green Version]
  29. Jap, B.T.; Lal, S.; Fischer, P.; Bekiaris, E. Using EEG spectral components to assess algorithms for detecting fatigue. Expert Syst. Appl. 2009, 36, 2352–2359. [Google Scholar] [CrossRef]
  30. Wen, T.Y.; Aris, S.A.M. Electroencephalogram (EEG) stress analysis on alpha/beta ratio and theta/beta ratio. Indones. J. Electr. Eng. Comput. Sci. 2020, 17, 175–182. [Google Scholar] [CrossRef] [Green Version]
  31. Sanei, S.; Chambers, J.A. EEG Signal Processing; John Wiley & Sons: Hoboken, NJ, USA, 2013. [Google Scholar]
  32. Donoghue, T.; Dominguez, J.; Voytek, B. Electrophysiological Frequency Band Ratio Measures Conflate Periodic and Aperiodic Neural Activity. bioRxiv 2020, 7, 1–14. [Google Scholar] [CrossRef] [PubMed]
  33. Chen, C.C.; Hsu, C.Y.; Chiu, H.W.; Hu, C.J.; Lee, T.C. Frequency power and coherence of electroencephalography are correlated with the severity of Alzheimer’s disease: A multicenter analysis in Taiwan. J. Formos. Med. Assoc. 2015, 114, 729–735. [Google Scholar] [CrossRef] [Green Version]
  34. Trammell, J.P.; MacRae, P.G.; Davis, G.; Bergstedt, D.; Anderson, A.E. The relationship of cognitive performance and the Theta-Alpha power ratio is age-dependent: An EEG study of short term memory and reasoning during task and resting-state in healthy young and old adults. Front. Aging Neurosci. 2017, 9, 364. [Google Scholar] [CrossRef] [Green Version]
  35. Pope, A.T.; Bogart, E.H.; Bartolome, D.S. Biocybernetic system evaluates indices of operator engagement in automated task. Biol. Psychol. 1995, 40, 187–195. [Google Scholar] [CrossRef]
  36. Freeman, F.G.; Mikulka, P.J.; Prinzel, L.J.; Scerbo, M.W. Evaluation of an adaptive automation system using three EEG indices with a visual tracking task. Biol. Psychol. 1999, 50, 61–76. [Google Scholar] [CrossRef]
  37. ESMA-3D Immersive Soundscape Recordings. Available online: https://zenodo.org/record/3549050#.YVVn9UZBxGO (accessed on 30 September 2021).
  38. Hyunkook, L. Capturing 360 audio using an equal segment microphone array (ESMA). AES J. Audio Eng. Soc. 2019, 67, 13–26. [Google Scholar] [CrossRef]
  39. Emotiv Epoc, X. Available online: https://www.emotiv.com/epoc-x/ (accessed on 30 September 2021).
  40. Kuber, R.; Wright, F.P. Augmenting the Instant Messaging Experience Through the Use of Brain-Computer Interface and Gestural Technologies. Int. J. Hum.-Comput. Interact. 2013, 29, 178–191. [Google Scholar] [CrossRef]
  41. Badcock, N.A.; Mousikou, P.; Mahajan, Y.; De Lissa, P.; Thie, J.; McArthur, G. Validation of the Emotiv EPOC® EEG gaming systemfor measuring research quality auditory ERPs. PeerJ 2013, 2013, 1–17. [Google Scholar] [CrossRef] [Green Version]
  42. Ramirez, R.; Vamvakousis, Z. Detecting emotion from EEG signals using the emotive epoc device. In Proceedings of the International Conference on Brain Informatics, Macau, China, 4–7 December 2012; Springer: Heidelberg, Germany; pp. 175–184. [Google Scholar]
  43. Delorme, A.; Makeig, S. EEGLAB: An open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J. Neurosci. Methods 2004, 134, 9–21. [Google Scholar] [CrossRef] [Green Version]
  44. Winkler, I.; Haufe, S.; Tangermann, M. Automatic classification of artifactual ICA-components for artifact removal in EEG signals. Behav. Brain Funct. 2011, 7, 1–15. [Google Scholar] [CrossRef] [Green Version]
  45. Winkler, I.; Brandl, S.; Horn, F.; Waldburger, E.; Allefeld, C.; Tangermann, M. Robust artifactual independent component classification for BCI practitioners. J. Neural Eng. 2014, 11, 35013. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  46. Wilcoxon, F. Individual comparisons by ranking methods. Biometrics 1945, 1, 80–83. [Google Scholar] [CrossRef]
  47. McNemar, Q. Note on the sampling error of the difference between correlated proportions or percentages. Psychometrika 1947, 12, 153–157. [Google Scholar] [CrossRef]
  48. Adedokun, O.A.; Burgess, W.D. Analysis of paired dichotomous data: A gentle introduction to the McNemar test in SPSS. J. Multidiscip. Eval. 2012, 8, 125–131. [Google Scholar]
  49. Putman, P.; Verkuil, B.; Arias-Garcia, E.; Pantazi, I.; van Schie, C. EEG theta/beta ratio as a potential biomarker for attentional control and resilience against deleterious effects of stress on attention. Cogn. Affect. Behav. Neurosci. 2014, 14, 782–791. [Google Scholar] [CrossRef]
  50. Van Son, D.; van der Does, W.; Band, G.P.; Putman, P. EEG Theta/Beta Ratio Neurofeedback Training in Healthy Females. Appl. Psychophysiol. Biofeedback 2020, 45, 195–210. [Google Scholar] [CrossRef]
  51. Van Son, D.; De Blasio, F.M.; Fogarty, J.S.; Angelidis, A.; Barry, R.J.; Putman, P. Frontal EEG theta/beta ratio during mind wandering episodes. Biol. Psychol. 2019, 140, 19–27. [Google Scholar] [CrossRef] [Green Version]
  52. Bussalb, A.; Collin, S.; Barthélemy, Q.; Ojeda, D.; Bioulac, S.; Blasco-Fontecilla, H.; Brandeis, D.; Purper Ouakil, D.; Ros, T.; Mayaud, L. Is there a cluster of high theta-beta ratio patients in attention deficit hyperactivity disorder? Clin. Neurophysiol. 2019, 130, 1387–1396. [Google Scholar] [CrossRef] [PubMed]
  53. Viera, A.J.; Garrett, J.M. Understanding interobserver agreement: The kappa statistic. Fam. Med. 2005, 37, 360–363. [Google Scholar] [PubMed]
Figure 1. Listening room setup.
Figure 1. Listening room setup.
Applsci 11 10461 g001
Figure 2. The layout of the electrodes on the scalp.
Figure 2. The layout of the electrodes on the scalp.
Applsci 11 10461 g002
Figure 3. Presence score for 16 participants in Experiment 1.
Figure 3. Presence score for 16 participants in Experiment 1.
Applsci 11 10461 g003
Figure 4. Presence score for Programs A and C in Experiment 2.
Figure 4. Presence score for Programs A and C in Experiment 2.
Applsci 11 10461 g004
Table 1. EEG indices in Experiment 1.
Table 1. EEG indices in Experiment 1.
  ParticipantTheta ( μ V 2 /Hz)Alpha ( μ V 2 /Hz)Beta ( μ V 2 /Hz)T/AT/ABT/BA/B
ABABABABABABAB
16.4455.354141.57095.0741.7081.6670.0460.0560.0450.0553.7733.21182.87257.030
20.2380.3360.6290.2680.3710.5460.3781.2510.2380.4120.6410.6141.6950.491
30.40810.3810.17363.6850.08342.0232.3590.1631.5950.0984.9270.2472.0891.515
40.9531.2921.5842.1410.6261.2180.6020.6030.4310.3851.5211.0612.5281.758
58.2527.54322.74526.6930.9791.0310.3630.2830.3480.2728.4317.31323.23925.880
63.5110.64594.8491.0853.4571.0680.0370.5940.0360.2991.0150.60427.4361.016
73.3557.0162.1604.8911.9355.0731.5531.4350.8190.7041.7341.3831.1160.964
85.50310.38118.35563.68512.25642.0230.3000.1630.1800.0980.4490.2471.4981.515
90.645201.7500.548135.4900.281127.1001.1761.4890.7780.7682.2971.5871.9541.066
103.08312.2171.66220.0520.7351.3761.8540.6091.2860.5704.1978.8772.26314.571
117.6412.98041.11614.1627.1617.2170.1860.2100.1580.1391.0670.4135.7421.962
1232.59357.44299.725547.3204.0454.4800.3270.1050.3140.1048.05812.82224.655122.175
134.2563.441106.630123.7601.7692.0790.0400.0280.0390.0272.4061.65560.28459.534
1418.19015.4021032.8001559.4008.91010.6220.0180.0100.0170.0102.0411.450115.912146.809
152.2050.9191.3020.4831.2900.5481.6941.9030.8510.8911.7101.6771.0090.881
1647.7733.8991485.216055.31785.87260.0320.0210.0320.0218.9835.772279.288273.303
Table 2. Results of the Wilcoxon signed rank test in Experiment 1.
Table 2. Results of the Wilcoxon signed rank test in Experiment 1.
EEG IndexThetaAlphaBetaT/AT/ABT/BA/B
p0.5690.0790.02 *0.6420.0980.0560.438
* Significance level of 0.05.
Table 3. Data displayed in the dichotomous format.
Table 3. Data displayed in the dichotomous format.
ParticipantPresence ScoreThetaAlphaBetaT/AT/ABT/BA/B
111110011
210100011
310001111
410000111
511001110
611110011
710001111
810001110
910000111
1000001100
1111100111
1210001100
1311001111
1411001110
1511110011
1611001111
Table 4. Results of the McNemar test in Experiment 1.
Table 4. Results of the McNemar test in Experiment 1.
EEG Indexp for McNemarPearson Correlation Coefficientp for Pearson
Theta0.016 *0.2580.334
Alpha0.002 *0.1740.519
Beta0.000 *0.1240.647
T/A0.070−0.2280.396
T/AB0.375−0.1490.582
T/B1.0000.6830.004 *
A/B0.1250.3830.143
* Significance level of 0.05.
Table 5. T/B for 24 participants in Experiment 2.
Table 5. T/B for 24 participants in Experiment 2.
ParticipantProgram AProgram C
13.2333.685
22.4531.325
30.9990.876
43.0592.800
58.6983.571
60.6000.537
73.0512.643
85.3044.993
94.8431.940
1016.05111.392
112.8232.372
121.2033.779
131.4752.322
143.3383.577
154.6611.104
163.5082.612
1715.3034.546
183.7592.141
191.4220.561
204.8014.058
214.1574.856
222.9992.465
230.1270.865
240.8631.647
Table 6. Kappa between presence score and EEG indices.
Table 6. Kappa between presence score and EEG indices.
EEG IndexCohen’s Kappap for Kappa
Theta0.1250.302
Alpha0.0590.486
Beta0.030.62
T/A−0.1230.362
T/AB−0.1110.551
T/B (Experiment 1)0.6360.006 *
A/B0.2560.126
T/B (Experiment 2)0.560.005 *
* Significance level of 0.05.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Zhang, S.; Feng, X.; Shen, Y. Quantifying Auditory Presence Using Electroencephalography. Appl. Sci. 2021, 11, 10461. https://doi.org/10.3390/app112110461

AMA Style

Zhang S, Feng X, Shen Y. Quantifying Auditory Presence Using Electroencephalography. Applied Sciences. 2021; 11(21):10461. https://doi.org/10.3390/app112110461

Chicago/Turabian Style

Zhang, Shufeng, Xuelei Feng, and Yong Shen. 2021. "Quantifying Auditory Presence Using Electroencephalography" Applied Sciences 11, no. 21: 10461. https://doi.org/10.3390/app112110461

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop