Subjective Salience of Birdsong and Insect Song with Equal Sound Pressure Level and Loudness

Yoshiharu Soeta; Ayaka Ariki

doi:10.3390/ijerph17238858

and

Biomedical Research Institute, National Institute of Advanced Industrial Science and Technology (AIST), Osaka 563-8577, Japan

^*

Author to whom correspondence should be addressed.

Int. J. Environ. Res. Public Health2020, 17(23), 8858;https://doi.org/10.3390/ijerph17238858

This article belongs to the Special Issue Evaluations of Sound and Vibration in the Built Environments

Version Notes

Order Reprints

Abstract

Birdsong is used to communicate the position of stairwells to visually impaired people in train stations in Japan. However, more than 40% of visually impaired people reported that such sounds were difficult to identify. Train companies seek to present the sounds at a sound pressure level that is loud enough to be detected, but not so loud as to be annoying. Therefore, salient birdsongs with relatively low sound pressure levels are required. In the current study, we examined the salience of different types of birdsong and insect song, and determined the dominant physical parameters related to salience. We considered insect songs because both birdsongs and insect songs have been found to have positive effects on soundscapes. We evaluated subjective saliences of birdsongs and insect songs using paired comparison methods, and examined the relationships between subjective salience and physical parameters. In total, 62 participants evaluated 18 types of bird songs and 16 types of insect sounds. The results indicated that the following features significantly influenced subjective salience: the maximum peak amplitude of the autocorrelation function, which signifies pitch strength; the interaural cross-correlation coefficient, which signifies apparent source width; the amplitude fluctuation component; and spectral content, such as flux and skewness.

Keywords:

soundscape; salience; birdsong; insect song

1. Introduction

Birdsong and insect song are near universal experiences in the outdoor environment. Although some birdsongs and insect songs communicate seasonal changes and are considered pleasant by Japanese listeners, not all instances of birdsong elicit pleasant feelings [1]. Further, while not all birdsongs and insect songs are considered by humans to be beneficial components of an environment, some have been found to contribute to perceived attention restoration and stress recovery [2,3].

From the perspective of soundscapes, natural sounds (e.g., water, birdsongs, and wind in trees) can play a key role in acoustic comfort. Water sounds are often used to mask other sounds and as noise barriers to enhance urban soundscapes [4,5,6]. The introduction of birdsong has been found to increase the subjective pleasantness of soundscapes in public spaces [7]. Among various natural sounds, birdsong was judged as the most effective and beneficial type of sound for improving sound environments [8,9,10].

In Japanese public spaces, sound signals are often used to guide visually impaired people to specific destinations, such as a ticket gate or staircase. For instance, birdsong is often used to signal the presence of a staircase in train stations. However, more than 40% of visually impaired people reported that a birdsong stimulus was difficult to localize in a train station setting [11]. Although guidelines exist regarding the use of birdsongs as information signals [12], these are not always strictly followed by train company staff, who may prefer to use lower-than-recommended sound pressure levels (SPLs) to reduce discomfort in customers, staff, and surrounding residents.

The physical factors that affect the sound signals used to guide visually impaired people have been investigated from the viewpoint of sound localization. Based on the percentages of correct localization responses, researchers have found that the signal to noise ratio, initial delay time, reverberation energy, distance, elevation angle, and the temporal pattern of the signal all affect sound localization [13,14]. Additionally, researchers have proposed that sounds with specific temporal patterns, such as those with particular early component and silent interval lengths, might be more easily detectable by visually impaired people, and further, that these factors might be uncovered by examining human brain responses [15].

The aim of this study was to evaluate the salience of a number of birdsongs and insect songs, and to determine the physical factors that modulated the observed salience. Here, we used the term salience to refer to whether or not a sound stood out from background noise. While previous studies have indicated that loudness is a significant predictor of salience [16,17], it is preferable that sound signals used for guidance purposes be salient even at a low SPL, as this reduces unnecessary discomfort elicited by loud sounds in the environment. Although the abovementioned guidelines include an appropriate volume in SPL, they do not consider variations among specific sound sources [12]. To address this in the present study, we investigated the subjective salience of birdsong and insect song presented with equal SPLs and loudness to clarify the effects of physical factors in physically and subjectively equal sound intensity conditions.

2. Materials and Methods

2.1. Subjective Salience Test

As stimuli, we used 18 types of birdsongs and 16 types of insect songs that had been used in previous experiments [1]. The abovementioned guidelines include some recommendations regarding acoustic specifications [12]. For example, pure tones are not acceptable, sounds with broader frequency bands, frequency fluctuations, amplitude fluctuations, and a duration of less than 5 s are preferable. We selected birdsongs and insect songs that most closely met these specifications. The stimuli were birdsongs produced by Halcyon coromanda (HC), Latham (L), Cuculus canorus (CC), Cuculus saturatus (CS), Strix uralensis (SU), Otus scops (OS), Caprimulgus indicus (CI), Streptopelia orientalis (SO), Terpsiphone atrocaudata (TA), Garrulus glandarius (GG), Porzana fusca (PF), Parus minor (PM), Horornis diphone (HD), Zosterops japonicus (ZJ), Turdus sibiricus (TS), Prunella rubida (PR), Eophona personata (EP), and Emberiza cioides (EC), and insect songs produced by Cryptotympana facialis (CF), Meimuna opalifera (MO), Graptopsaltria nigrofuscata (GN), Oncotympana maculaticollis (OM), Tanna japonensis (TJ), Velarifictorus micado (VM), Loxoblemmus doenitzi (LD), Oecanthus longicauda (OL), Gryllotalpa orientalis (GO), Teleogryllus emma (TE), Meloimorpha japonica (MJ), Xenogryllus marmoratus (XM), Hexacentrus hareyamai (HH), Mecopoda nipponensis (MN), Tettigonia orientalis (TO), and the Japanese katydid (JK).

The SPLs of the stimuli were analyzed with a temporal window of 5 ms and an interval of 2.5 ms. The stimulus onset and offset were defined as positions with an SPL that was 10 dB higher than the noise floor. We tested subjective salience by presenting birdsong and insect song stimuli with durations between 0.4 and 2.0 s, as the duration is not expected to significantly modulate loudness within that duration range [18]. The temporal waveforms of the birdsongs and insect songs are shown in Figure 1 and Figure 2, and the spectrograms of the sounds are shown in Figure 3 and Figure 4. The stimuli were presented to participants using a headphone amplifier (HDVD800, Sennheiser, Wedemark, Germany) and headphones (HD800, Sennheiser). The participants listened to the stimuli while sitting in a soundproof room with an ambient temperature of 22–25 degrees.

Figure 1. Temporal waveforms of the 18 birdsongs.

Figure 2. Temporal waveforms of the 16 insect songs.

Figure 3. Spectrograms of the 18 birdsongs.

Figure 4. Spectrograms of the 16 insect songs.

In the equal L_Aeq condition, the birdsongs and insect songs were presented at a continuous A-weighted SPL, i.e., L_Aeq, measured over the duration of each sound of 70 dBA. In the equal loudness condition, the birdsongs and insect songs were presented at 3 sone, which was considered to reflect long-term loudness [19]. The stimuli in the L_Aeq and long-term loudness conditions were verified using a dummy head microphone (KU100, Neumann, Berlin, Germany) and a sound calibrator (Type 4231, Brüel & Kjær, Naerum, Denmark).

In the equal L_Aeq condition, we presented the birdsong and insect song stimuli to 15 participants (11 men) aged between 20 and 41 years (median age of 21.0 years) and 16 participants (11 men) aged between 20 and 41 years (median age of 22.0 years), respectively. We presented both the birdsong and insect songs to 10 participants. In the equal loudness condition, we presented the birdsong and insect song stimuli to 16 participants (seven men) aged between 20 and 54 years (median age of 23.5 years) and 15 participants (seven men) aged between 21 and 54 years (median age of 24.0 years), respectively. We presented both the birdsong and insect songs to 15 participants. We presented the birdsong and insect songs in both the equal L_Aeq and loudness condition to one participant. All participants had normal hearing and no history of neurological disease. According to previous psychoacoustic experiments and our previous studies, we considered the involvement of at least 10 participants to be necessary to ensure sufficient statistical power and the generality of the results [1,20]. Informed consent was obtained from each participant after explaining the nature of the study. The study was approved by the ethics committee of the National Institute of Advanced Industrial Science and Technology (AIST) of Japan (2020–0227).

We used the modified Scheffe’s paired comparison method [21,22,23] to measure subjective salience. In our protocol, we performed all pairwise comparisons for each iteration for each participant. All combinations of pairs (i.e., 153 pairs (N(N − 1)/2, N = 18) for birdsongs and 120 pairs (N(N − 1)/2, N = 16) for insect songs) were presented in random order, and the presentation order within each pair was also randomized. The silent interval between the stimuli was 1.0 s long. Following the presentation of each pair, the participants were asked to judge which stimulus from each pair was more salient using a seven-point scale. Judgments regarding each ordered pair (i, j) were made using one of the following seven statements: I perceived i as strongly more salient than j (3 points); I perceived i as moderately more salient than j (2 points); I perceived i as slightly more salient than j (1 point); I perceived the salience of the two sounds to be equal (0 points); I perceived j as slightly more salient than i (−1 point); I perceived j as moderately more salient than i (−2 points); and, I perceived j as strongly more salient than i (−3 points). The averaged salience values were calculated based on the modified Scheffe’s method and were defined as scale values (SVs) of salience. An analysis of variance (ANOVA) was conducted on the results of the paired comparison experiments [21,22,23].

2.2. Physical Parameters

To quantify the acoustic characteristics of the birdsongs and insect songs, we analyzed specific physical parameters. First, we analyzed three parameters using autocorrelation function (ACF) analysis. The first parameter is the delay time of the first maximum peak, τ₁, which is related to the perceived pitch. The second parameter is the amplitude of the first maximum peak, ϕ₁, which is related to the strength of the perceived pitch. The third parameter is the effective duration of the envelope of the normalized ACF, τ_e, which is defined by the ten-percentile delay and represents a repetitive component in the sound source itself. The fourth parameter is the width of the first decay, W_ϕ(0), which is the counterpart to the spectral centroid [24,25].

Based on the results of the interaural cross-correlation function (IACF) analysis, we evaluated three parameters. The first factor was the interaural cross-correlation coefficient (IACC), which was defined by the maximum value of the IACF. The IACC is related to spatial impression, such as subjective diffuseness and apparent source width [26]. The second parameter was the interaural time delay, τ_IACC, which was defined by the delay time at the IACC. The third parameter was the width of the IACF, W_IACC, which was defined by the interval of the delay time at a value of 10% below the IACC. The W_IACC mainly depends on the frequency component of the signal and is equivalent to the apparent source width [27].

We then analyzed four psychoacoustic parameters: loudness, sharpness, roughness, and fluctuation strength [28]. Loudness is the psychological strength of a sound. Sharpness relates to the balance of the high and low frequency components of a sound. Roughness and fluctuation strength quantify the subjective perception of the rapid (15−300 Hz) and slow (at frequencies up to 20 Hz) amplitude modulation of a sound.

We also analyzed other audio features for sound description, such as the entropy of energy, spectral entropy, spectral flux, and spectral skewness [29]. The entropy of energy is a measure of abrupt changes in the energy level of a sound. Spectral entropy is a measure that is similar to the entropy of energy, but is computed in the frequency domain. Spectral flux measures the speed of spectral change. Spectrum skewness describes the degree of asymmetry in the frequency distribution of a spectrogram of a sound [30].

We analyzed the L_Aeq, ACF, and IACF parameters. The integration interval was 50 ms and the running step was 1 ms. We also calculated the loudness, sharpness, roughness, fluctuation strength, entropy of energy, spectral entropy, spectral flux, and spectral skewness. The temporal window used for the analysis was 50 ms. The analyses were conducted using a Matlab-based analysis program (Mathworks, Natick, MA, USA).

2.3. Multiple Regression Analysis

The normality of each physical parameter was tested using the Shapiro–Wilk test [31]. None of the physical parameters were normally distributed. Because the subjective impression of a sound is influenced not only by average activity, but also by variable components [20,32,33,34,35], we used the median and interquartile range (QR) as predictors of subjective salience. Because sharpness was highly correlated with W_ϕ(0) (|r| > 0.79, p < 0.01), we excluded it from the multiple regression analysis.

To identify and quantify the physical factors that affect salience, we carried out multiple regression analyses with stepwise selection using a linear combination of the physical factors. The stepping criteria used for entry and removal were based on the significance level of the F-value and were set at 0.05 and 0.10, respectively. Factors with a variance inflation factor of 3.5 or more were excluded to avoid multicollinearity. The analyses were carried out using SPSS statistical analysis software (SPSS version 22.0, IBM Corp., Armonk, NY, USA).

3. Results and Discussion

The ANOVA for the SV of salience in the equal L_Aeq condition revealed that the main effect was statistically significant (F(17, 4184) = 194.3, p < 0.001 for birdsongs; F(15, 3479) = 325.6, p < 0.001 for insect songs). The ANOVA for the SV of salience in the equal loudness condition revealed that the main effect was statistically significant (F(17, 4472) = 187.1, p < 0.001 for birdsongs; F(15, 3255) = 120.9, p < 0.001 for insect songs).

Figure 5 shows the SVs of salience for birdsongs in the equal L_Aeq and loudness conditions. The most salient birdsong was that of Garrulus glandarius in the equal L_Aeq condition, although the Garrulus glandarius song was not as salient in the loudness condition. This is probably because the L_Aeq of the Garrulus glandarius song was lower than that of the other birdsongs in the equal loudness condition. The relatively salient birdsongs in both the equal L_Aeq and loudness conditions were that of Horornis diphone, Cuculus canorus, Latham, Porzana fusca, and Terpsiphone atrocaudata. Horornis diphone is one of three major species of passeriforme in Japan, which are known for their beautiful vocalizations. Its birdsong was the most preferred stimulus in a previous study [1]. Since ancient times, Cuculus canorus has appeared in various documents in Japan, and its song is often compared to onomatopoeia. Compared with other stimuli, it was found to elicit the largest N1m responses, and these responses were most strongly correlated to the sound envelope in the human brain [15]. The less salient birdsongs in both the equal L_Aeq and loudness conditions were those of Zosterops japonicus, Emberiza cioides, Strix uralensis, and Cuculus saturates.

Figure 5. Scale values of saliences for birdsongs in the equal L_Aeq and loudness conditions. The symbols indicate the mean values and the error bars indicate standard deviations.

Figure 6 shows the SV of salience for insect songs in the equal L_Aeq and loudness conditions. The most salient insect songs in both the equal L_Aeq and loudness conditions were those produced by Meimuna opalifera, Oncotympana maculaticollis, Mecopoda nipponensis, and Tanna japonensis, which are all cicadas except for Mecopoda nipponensis. Cicadas are famous in Japan as noisy insects in the summertime. The less salient insect songs in both the equal L_Aeq and loudness conditions were those produced by the Japanese katydid, Meloimorpha japonica, Gryllotalpa orientalis, and Tettigonia orientalis. The Japanese katydid and Meloimorpha japonica are well known in Japan and produce sounds during autumn. Regarding the songs of Xenogryllus marmoratus and Hexacentrus hareyamai, the judgment of salience varied among the participants. The songs of Xenogryllus marmoratus and Hexacentrus hareyamai have mainly higher frequency components and shorter durations. This might have caused the large differences between participants.

Figure 6. Scale values of saliences for insect songs in the equal L_Aeq and loudness conditions. The symbols indicate the mean values and the error bars indicate standard deviations.

We conducted a multiple linear regression analysis with the SVs of salience for birdsongs in both the equal L_Aeq and loudness conditions as the outcome variable. The final model showed that IACC, entropy, spectral flux, and spectral skewness were significant parameters in the equal L_Aeq condition, while τ₁, QR of τ_e, QR of W_ϕ(0), IACC, loudness_QR, and roughness were significant parameters in the equal loudness condition:

SV_{birdsong in equal SPL} ≈ b₀ + b₁ × IACC+ b₂ × entropy + b₃ × spectral flux + b₄ × spectral skewness,

(1)

SV_{birdsong in equal loudness} ≈ c₀ + c₁ × τ₁ + c₂ × τ_e_QR + c₃ × W_ϕ(0)_QR + c₄ × IACC + c₅ × loudness_QR + c₆ × roughness.

(2)

The correlation coefficients between all of the explanatory variables in the equal L_Aeq and loudness conditions are shown in Table 1 and Table 2, respectively. The ANOVA indicated the statistical significance of the model (F(5, 264) = 40.34, p < 0.001 for the equal L_Aeq condition, F(6, 281) = 26.26, p < 0.001 for the equal loudness condition). The adjusted coefficient of determination, R², was 0.41 for the equal L_Aeq condition and 0.35 for the equal loudness condition. The standardized partial regression coefficients in Equations (1) and (2) are summarized in Table 3.

Table 1. The correlation coefficients between the explanatory variables used in the multiple regression analysis in the equal L_Aeq condition for the birdsongs.

Table 2. The correlation coefficients between the explanatory variables used in the multiple regression analysis in the equal loudness condition for the birdsongs.

Table 3. Significant predictive parameters and standardized partial regression coefficients revealed by multiple linear regression analyses of birdsong salience.

The IACC, which signifies the apparent source width, was the significant predictive variable for both the equal L_Aeq and loudness conditions. The partial regression coefficients of the IACC were positive, indicating that birdsongs with a narrower sound source width were perceived as more salient. This is consistent with previous studies regarding the accuracy of sound source localization [36,37]. Spectral flux was also a significant predictive variable in the equal L_Aeq condition, with positive partial regression coefficients. This suggests that quick spectral change led to higher perceived salience.

Higher frequency components might play a key role in saliency. The delay times of the maximum peak amplitude of the ACF, τ₁, and the spectral skewness were significant predictive variables in the equal loudness and L_Aeq conditions, respectively. The negative τ₁ regression coefficient indicates that birdsongs with a higher pitch were perceived to be more salient, while the positive regression coefficient of spectral skewness demonstrates that birdsongs with more energy at high frequencies were perceived as more salient.

We also conducted a multiple linear regression analysis for insect songs. The final model showed that the QR of W_ϕ(0), QR of loudness, fluctuation strength, spectral entropy, and spectral skewness were significant parameters in the equal L_Aeq condition, while ϕ₁, τ_e, roughness, fluctuation strength, and the QR of spectral entropy were significant parameters in the equal loudness condition:

SV_{insect song in equal SPL} ≈ i₀ + i₁ × W_ϕ(0)_QR+ i₂ × loudness_QR + i₃ × fluctuation strength + i₄ × spectral entropy + i₅ × spectral skewness,

(3)

SV_{insect song in equal loudness} ≈ j₀+ j₁ × ϕ₁ + j₂ × τ_e + j₃ × roughness + j₄ × fluctuation strength + j₅ × spectral entropy_QR.

(4)

The correlation coefficients between all of the explanatory variables in the equal L_Aeq and loudness conditions are shown in Table 4 and Table 5. The ANOVA indicated the statistical significance of the model (F(5, 250) = 106.00, p < 0.001 for the equal L_Aeq condition; F(5, 234) = 14.58, p < 0.001 for the equal loudness condition). The adjusted coefficient of determination, R², was 0.68 for the equal L_Aeq condition and 0.22 for the equal loudness condition. The standardized partial regression coefficients in Equations (3) and (4) are summarized in Table 6.

Table 4. The correlation coefficients between the explanatory variables used in the multiple regression analysis in the equal L_Aeq condition for insect songs.

Table 5. The correlation coefficients between the explanatory variables used in the multiple regression analysis in the equal loudness condition for insect songs.

Table 6. Significant predictive parameters and standardized partial regression coefficients revealed by multiple linear regression analyses of inset song salience.

Fluctuation strength was a significant predictive variable in both the equal L_Aeq and loudness conditions. This suggests that strong and slow amplitude modulation of insect songs is important for salience perception. Although IACC was a significant predictor of birdsong salience in both the equal L_Aeq and loudness conditions, it was not a significant predictor of insect song salience. This may be because the insect songs used in the experiment were not recorded using a dummy head microphone [1], and so spatial impressions of the sound sources were not accurately reproduced.

Loudness variations can also be important for saliency. The QR of loudness was a significant predictive variable in the equal loudness condition. This is consistent with the results for birdsongs in the equal L_Aeq condition, although the partial regression coefficient was negative for insect songs and positive for birdsongs. This suggests that sound sources with moderate loudness variations are perceived to be more salient. We observed a similar pattern for roughness. Roughness was a significant predictor of both birdsong and insect song salience in the equal loudness condition. Although the partial regression coefficient for birdsong was positive, that for insect songs was negative. This suggests that sound sources with moderately fast amplitude modulation are perceived to be more salient.

Spectral entropy appears to play an important role in saliency. Spectral entropy and the QR of spectral entropy were significant predictors of salience in the equal L_Aeq and loudness conditions, respectively. The positive partial regression coefficient of spectral entropy suggests that abrupt energy changes in the frequency domain of a sound increase the salience. This is partially consistent with our finding regarding the role of spectral flux in the salience of birdsongs. The negative partial regression coefficient of the QR of spectral entropy suggests that stable energy changes in the frequency domain are more important for saliency.

Pitch strength can also be an important modulator of saliency. The maximum peak amplitude of the ACF, ϕ₁, was a significant predictor of insect song salience in the equal loudness condition. The negative partial regression coefficient of ϕ₁ suggests that broader frequency components are necessary for salience. This is inconsistent with previous findings regarding preference [1]. One possible explanation for this discrepancy is the importance of tonal components for preference, specifically, the importance of melody and broader frequency components for salience, as they enable the listener to more deeply understand the characteristics of the sound source.

4. Conclusions

We examined the salience of birdsong and insect song in terms of several physical parameters. The results indicated that Horornis diphone and Cuculus canorus produce the most salient birdsongs, while Meimuna opalifera and Oncotympana maculaticollis produce the most salient insect songs. All of these creatures are well-known in Japan. The variation of loudness, roughness, and spectral skewness were significant predictors of salience for both birdsongs and insect songs. Spatial content related to the interaural cross-correlation coefficient, IACC, and spectral content expressed by spectral flux were significantly associated with birdsong salience. The maximum peak amplitude of the ACF, ϕ₁, was significantly associated with insect song salience. These findings may be useful to designers of sound landmarks regarding physical parameters to consider, such as ϕ₁, IACC, and spectral skewness.

Considering the findings of the current study together with those of a previous study on preference for birdsongs [1], the birdsongs of Horornis diphone and Cuculus canorus appear to be desirable information signals because they are salient and preferred. As for insect songs, the song of Tanna japonensis appears to be a desirable signal because it is salient and preferred. This may be partly because they are ubiquitous in Japan, where they are well-liked.

Subjective salience in the current study was not well correlated with the physical parameters of the sounds. Soundscapes are not only affected by the physical aspects of sounds, but also by the context, which includes relationships between people and activities and position in space and time. Thus, the context with respect to the participants may be a dominant factor influencing subjective salience, and could be an interesting topic for future study. Furthermore, the salience of birdsong and insect song stimuli may differ according to culture. We hope to examine cognitive and cultural factors influencing salience in future work. In addition, the present findings need to be verified in a study with visually impaired participants.

Author Contributions

Conceptualization, Y.S.; methodology, Y.S.; formal analysis, A.A.; investigation, Y.S.; resources, Y.S.; data curation, Y.S. and A.A.; writing—original draft preparation, Y.S.; writing—review and editing, Y.S.; visualization, Y.S.; project administration, Y.S.; funding acquisition, Y.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partly supported by a Grant-in-Aid for Scientific Research (B) (Grant No. 18H03324) from the Japan Society for the Promotion of Science.

Acknowledgments

We thank Sydney Koke, MFA, from Edanz Group (https://en-author-services.edanzgroup.com/ac) for editing a draft of this manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

Soeta, Y.; Kagawa, H. Subjective preferences for birdsong and insect song in equal sound pressure level. Appl. Sci. 2020, 10, 849. [Google Scholar] [CrossRef]
Ratcliffe, E.; Gatersleben, B.; Sowden, P.T. Bird sounds and their contributions to perceived attention restoration and stress recovery. J. Environ. Psychol. 2013, 38, 221–228. [Google Scholar] [CrossRef]
Ratcliffe, E.; Gatersleben, B.; Sowden, P.T. Associations with bird sounds: How do they relate to perceived restorative potential? J. Environ. Psychol. 2016, 47, 136–144. [Google Scholar] [CrossRef]
Axelsson, Ö.; Nilsson, M.E.; Hellström, B.; Lundén, P. A field experiment on the impact of sounds from a jet-and-basin fountain on soundscape quality in an urban park. Landsc. Urban Plan. 2014, 123, 49–60. [Google Scholar] [CrossRef]
Galbrun, L.; Calarco, F. Audio-visual interaction and perceptual assessment of water features used over road traffic noise. J. Acoust. Soc. Am. 2014, 136, 2609–2620. [Google Scholar] [CrossRef]
Kang, J.; Aletta, F.; Gjestland, T.T.; Brown, L.A.; Botteldooren, D.; Schulte-Fortkamp, B.; Lercher, P.; van Kamp, I.; Genuit, K.; Fiebig, A.; et al. Ten questions on the soundscapes of the built environment. Build. Environ. 2016, 108, 284–294. [Google Scholar] [CrossRef]
De Coensel, B.; Vanwetswinkel, S.; Botteldooren, D. Effects of natural sounds on the perception of road traffic noise. J. Acoust. Soc. Am. 2011, 129, EL148–EL153. [Google Scholar] [CrossRef]
Jeon, J.Y.; Lee, P.J.; You, J.; Kang, J. Perceptual assessment of quality of urban soundscapes with combined noise sources and water sounds. J. Acoust. Soc. Am. 2010, 127, 1357–1366. [Google Scholar] [CrossRef]
Hong, J.Y.; Jeon, J.Y. Designing sound and visual components for enhancement of urban soundscapes. J. Acoust. Soc. Am. 2013, 134, 2026–2036. [Google Scholar] [CrossRef]
Ou, D.; Mak, C.M.; Pan, S. A method for assessing soundscape in urban parks based on the service quality measurement models. Appl. Acoust. 2017, 127, 184–193. [Google Scholar] [CrossRef]
Foundation for Promoting Personal Mobility and Ecological Transportation. Basic Case Study Report on Movement Support for Traffic Hub; Foundation for Promoting Personal Mobility and Ecological Transportation: Tokyo, Japan, 2009. (In Japanese) [Google Scholar]
Ministry of Land, Infrastructure, Transport and Tourism. Guidelines for Improving the Facilitation of Transportation for Passenger Facilities of Public Transportation; Ministry of Land, Infrastructure, Transport and Tourism: Tokyo, Japan, 2020. (In Japanese)
Sato, H.; Morimoto, M.; Sato, H. Effect of noise and reverberation on sound localization of acoustic guide signal for visually impaired persons in public spaces. Noise Control Eng. J. 2014, 62, 1–9. [Google Scholar] [CrossRef]
Sato, H.; Morimoto, M.; Sato, H. Perception of azimuth angle of sound source located at high elevation angle: Effective distance of auditory guide signal. Appl. Acoust. 2020, 159, 107084. [Google Scholar] [CrossRef]
Soeta, Y.; Nakagawa, S. Prediction of optimal auditory signals using auditory evoked magnetic responses. Build. Environ. 2015, 94, 924–929. [Google Scholar] [CrossRef]
Huang, N.; Elhilali, M. Auditory salience using natural soundscapes. J. Acoust. Soc. Am. 2017, 141, 2163–2176. [Google Scholar] [CrossRef] [PubMed]
Kim, K.; Lin, K.; Walther, D.B.; Hasegawa-Johnson, M.A.; Huang, T.S. Automatic detection of auditory salience with optimized linear filters derived from human annotation. Pattern Recognit. Lett. 2014, 38, 78–85. [Google Scholar] [CrossRef]
Takeshima, H.; Suzuki, Y.; Suzuki, Y.; Sone, T. Growth of the loudness of a tone burst with a duration up to 10 seconds. J. Acoust. Soc. Jpn. 1988, 9, 295–300. [Google Scholar] [CrossRef][Green Version]
Glasberg, B.R.; Moore, B.C.J. A model of loudness applicable to time-varying sounds. J. Audio Eng. Soc. 2002, 50, 331–342. [Google Scholar]
Soeta, Y.; Kagawa, H. Three dimensional psychological evaluation of aircraft noise and prediction by physical parameters. Build. Environ. 2020, 16, 106445. [Google Scholar] [CrossRef]
Scheffé, H. An analysis of variance for paired comparisons. J. Am. Stat. Assoc. 1952, 147, 381–400. [Google Scholar]
Sato, S. Statistical Method of Sensory Test; JUSE Press: Tokyo, Japan, 1985. (In Japanese) [Google Scholar]
Nagasawa, S. Improvement of the Scheffe’s method for paired comparisons. Kansei Eng. J. 2002, 3, 47–56. [Google Scholar] [CrossRef]
Ando, Y.; Cariani, P. Auditory and Visual Sensations; Springer: New York, NY, USA, 2009. [Google Scholar]
Soeta, Y.; Ando, Y. Neurally Based Measurement and Evaluation of Environmental Noise; Springer: Tokyo, Japan, 2015. [Google Scholar]
Ando, Y.; Kurihara, Y. Nonlinear response in evaluating the subjective diffuseness of sound field. J. Acoust. Soc. Am. 1986, 80, 833–836. [Google Scholar] [CrossRef]
Sato, S.; Ando, Y. Apparent source width (ASW) of complex noises in relation to the interaural cross-correlation function. J. Temporal Des. Archit. Environ. 2002, 2, 29–32. [Google Scholar]
Zwicker, E.; Fastl, H. Psychoacoustics: Facts and Models; Springer: Berlin, Germany, 1999. [Google Scholar]
Giannakopoulos, T.; Pikrakis, A. Introduction to Audio Analysis: A MATLAB® Approach; Academic Press: Oxford, UK, 2008. [Google Scholar]
Peeters, G. A large set of audio features for sound description (similarity and classification) in the CUIDADO. CUIDADO IST Proj. Rep. 2004, 54, 1–25. [Google Scholar]
Shapiro, S.S.; Wilk, M.B. An analysis of variance test for normality (complete samples). Biometrika 1965, 52, 591–611. [Google Scholar] [CrossRef]
Sato, S.; Kitamura, T.; Ando, Y. Annoyance of noise stimuli in relation to the spatial factors extracted from the interaural cross-correlation function. J. Sound Vib. 2004, 277, 511–521. [Google Scholar] [CrossRef]
Sato, S.; You, J.; Jeon, J.Y. Sound quality characteristics of refrigerator noise in real living environments with relation to psychoacoustical and autocorrelation function parameters. J. Acoust. Soc. Am. 2007, 122, 314–325. [Google Scholar] [CrossRef]
Jeon, J.Y.; Sato, S. Annoyance caused by heavyweight floor impact sounds in relation to the autocorrelation function and sound quality metrics. J. Sound Vib. 2008, 311, 767–785. [Google Scholar] [CrossRef]
Soeta, Y.; Shimokura, R. Sound quality evaluation of air-conditioner noise based on factors of the autocorrelation function. Appl. Acoust. 2017, 124, 11–19. [Google Scholar] [CrossRef]
McEvoy, L.K.; Picton, T.W.; Champagne, S.C. The timing of the processes underlying lateralization: Psychophysical and evoked potential measures. Ear Hear. 1991, 12, 389–398. [Google Scholar] [CrossRef]
Zimmer, U.; Macaluso, E. High binaural coherence determines successful sound localization and increased activity in posterior auditory areas. Neuron 2005, 47, 893–905. [Google Scholar] [CrossRef]