Enhancing Alarm Localization in Multi-Window Map Interfaces with Spatialized Auditory Cues: An Eye-Tracking Study
Abstract
1. Introduction
2. Related Work
2.1. Auditory Spatial Cueing and Cross-Modal Attention
2.2. Display Dynamics
2.3. Interface Complexity
2.4. Eye-Tracking Evidence in Multimodal Attention
3. Materials and Methods
3.1. Participants
3.2. Apparatus
3.3. Experimental Design
3.3.1. Auditory Spatial Cueing
3.3.2. Display Dynamics
3.3.3. Interface Complexity
3.4. Procedure
4. Results
4.1. Data Collection and Analysis
4.2. Behavioral Data
4.2.1. Auditory Spatial Cueing
4.2.2. Display Dynamics
4.2.3. Interface Complexity
4.2.4. Interaction Effects
- (1)
- Auditory Spatial Cueing × Display Dynamics
- (2)
- Auditory Spatial Cueing × Interface Complexity
- (3)
- Display Dynamics × Interface Complexity
4.3. Eye-Tracking Data
4.3.1. Auditory Spatial Cueing
4.3.2. Display Dynamics
4.3.3. Interface Complexity
4.3.4. Interaction Effects
- (1)
- Auditory Spatial Cueing × Display Dynamics
- (2)
- Auditory Spatial Cueing × Interface Complexity
- (3)
- Display Dynamics × Interface Complexity
5. Discussion
5.1. Effects of Auditory Spatial Cueing
5.2. Effects of Display Dynamics
5.3. Effects of Interface Complexity
5.4. Interaction Effects Among Variables
5.5. Limitations and Future Research Directions
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Wickens, C.D. Multiple resources and mental workload. Hum. Factors 2008, 50, 449–455. [Google Scholar] [CrossRef]
- Dehais, F.; Karwowski, W.; Ayaz, H. Brain at Work and in Everyday Life as the Next Frontier: Grand Field Challenges for Neuroergonomics. Front. Neuroergonomics 2020, 1, 583733. [Google Scholar] [CrossRef] [PubMed]
- Mandal, A.; Liesefeld, A.M.; Liesefeld, H.R. Tracking the Misallocation and Reallocation of Spatial Attention toward Auditory Stimuli. J. Neurosci. 2024, 44, e2196232024. [Google Scholar] [CrossRef] [PubMed]
- Cai, M.; Bao, Y. Spatial attention modulates auditory dominance in audiovisual order judgment. Psych. J. 2023, 12, 537–539. [Google Scholar] [CrossRef] [PubMed]
- Cinetto, S.; Blini, E.; Zangrossi, A.; Corbetta, M.; Zorzi, M. Spatial regularities in a closed-loop audiovisual search task bias subsequent free-viewing behavior. Psychon. Bull. Rev. 2025, 32, 2977–2989. [Google Scholar] [CrossRef]
- Kern, L.; Niedeggen, M. Are auditory cues special? Evidence from cross-modal distractor-induced blindness. Atten. Percept. Psychophys. 2023, 85, 889–904. [Google Scholar] [CrossRef]
- He, Y.; Guo, Z.; Wang, X.; Sun, K.; Lin, X.; Wang, X.; Li, F.; Guo, Y.; Feng, T.; Zhang, J.; et al. Effects of Audiovisual Interactions on Working Memory Task Performance—Interference or Facilitation. Brain Sci. 2022, 12, 886. [Google Scholar] [CrossRef]
- Yuan, P.; Hu, R.; Zhang, X.; Wang, Y.; Jiang, Y. Cortical entrainment to hierarchical contextual rhythms recomposes dynamic attending in visual perception. eLife 2021, 10, e65118. [Google Scholar] [CrossRef]
- Aguado-López, B.; Palenciano, A.F.; Peñalver, J.M.G.; Díaz-Gutiérrez, P.; López-García, D.; Avancini, C.; Ciria, L.F.; Ruz, M. Proactive selective attention across competition contexts. Cortex 2024, 176, 113–128. [Google Scholar] [CrossRef]
- Xu, S.; Xu, M.; Kang, Q.; Yuan, X. Mobile Reading Attention of College Students in Different Reading Environments: An Eye-Tracking Study. Behav. Sci. 2025, 15, 953. [Google Scholar] [CrossRef]
- Das, A.; Wu, Z.; Skrjanec, I.; Feit, A.M. Shifting focus with hceye: Exploring the dynamics of visual highlighting and cognitive load on user attention and saliency prediction. Proc. ACM Hum.-Comput. Interact. 2024, 8, 236. [Google Scholar] [CrossRef]
- Zhang, C.; Wei, D.P.; Ji, Y.; Chen, D.; Li, X.Y.; Gong, X.D. The influence of interface attributes and interaction elements on user performance and cognitive load in task interruption scenarios. Int. J. Ind. Ergon. 2025, 108, 103761. [Google Scholar] [CrossRef]
- Zimmer, U.; Wendt, M.; Pacharra, M. Enhancing allocation of visual attention with emotional cues presented in two sensory modalities. Behav. Brain Funct. 2022, 18, 10. [Google Scholar] [CrossRef]
- Hu, J.; Badde, S.; Vetter, P. Auditory guidance of eye movements toward threat-related images in the absence of visual awareness. Front. Hum. Neurosci. 2024, 18, 1441915. [Google Scholar] [CrossRef] [PubMed]
- Saccani, M.S.; Contemori, G.; Del Popolo Cristaldi, F.; Bonato, M. Attentional load impacts multisensory integration, without leading to spatial processing asymmetries. Sci. Rep. 2025, 15, 16240. [Google Scholar] [CrossRef] [PubMed]
- Gao, Z.; Hui-Wang, Q.; Feng, G.; Lv, H. Exploring Sonification Mapping Strategies for Spatial Auditory Guidance in Immersive Virtual Environments. ACM Trans. Appl. Percept. 2022, 19, 9. [Google Scholar] [CrossRef]
- Fu, J.; Guo, X.; Tang, X.; Wang, A.; Zhang, M.; Gao, Y.; Seno, T. The Effects of Bilateral and Ipsilateral Auditory Stimuli on the Subcomponents of Visual Attention. i-Perception 2021, 12, 20416695211058222. [Google Scholar] [CrossRef]
- Tang, X.; Gu, J.; Lu, S.; Sun, J.; Du, Y. From Sound to Sight: The Cross-Modal Spread of Location-Based Inhibition of Return. Psychophysiology 2025, 62, e70123. [Google Scholar] [CrossRef]
- O’Dowd, A.; Hirst, R.J.; Seveso, M.A.; McKenna, E.M.; Newell, F.N. Generalisation to novel exemplars of learned shape categories based on visual and auditory spatial cues does not benefit from multisensory information. Psychon. Bull. Rev. 2025, 32, 417–429. [Google Scholar] [CrossRef]
- Jing, B.; Wu, C.; Pi, Z.; Zhou, Y.; Ma, H. Examining the Voice-Image Matching for Pedagogical Agents Presented in Instructional Videos. J. Exp. Educ. 2025, 1–21. [Google Scholar] [CrossRef]
- Groznik, V.; De Gobbis, A.; Georgiev, D.; Semeja, A.; Sadikov, A. Machine Learning-Based Detection of Cognitive Impairment from Eye-Tracking in Smooth Pursuit Tasks. Appl. Sci. 2025, 15, 7785. [Google Scholar] [CrossRef]
- Gaspelin, N.; Luck, S.J. The Role of Inhibition in Avoiding Distraction by Salient Stimuli. Trends Cogn. Sci. 2018, 22, 79–92. [Google Scholar] [CrossRef]
- Forschack, N.; Gundlach, C.; Hillyard, S.; Müller, M.M. Dynamics of attentional allocation to targets and distractors during visual search. NeuroImage 2022, 264, 119759. [Google Scholar] [CrossRef]
- Stolte, M.; Ansorge, U. Automatic capture of attention by flicker. Atten. Percept. Psychophys. 2021, 83, 1407–1415. [Google Scholar] [CrossRef] [PubMed]
- Wu, Z.; Feit, A.M. Understanding and Predicting Temporal Visual Attention Influenced by Dynamic Highlights in Monitoring Task. IEEE Trans. Hum. -Mach. Syst. 2025, 55, 1053–1063. [Google Scholar] [CrossRef]
- Kwon, J.; Schmidt, A.; Luo, C.; Jun, E.; Martinez, K. Visualizing Spatial Cognition for Wayfinding Design: Examining Gaze Behaviors Using Mobile Eye Tracking in Counseling Service Settings. ISPRS Int. J. Geo-Inf. 2025, 14, 406. [Google Scholar] [CrossRef]
- Hemmerich, K.; Luna, F.G.; Martín-Arévalo, E.; Lupiáñez, J. Understanding vigilance and its decrement: Theoretical, contextual, and neural insights. Front. Cogn. 2025, 4, 1617561. [Google Scholar] [CrossRef]
- Ko, S.; Kutchek, K.; Zhang, Y.; Jeon, M. Effects of Non-Speech Auditory Cues on Control Transition Behaviors in Semi-Automated Vehicles: Empirical Study, Modeling, and Validation. Int. J. Hum.-Comput. Interact. 2021, 38, 185–200. [Google Scholar] [CrossRef]
- El Iskandarani, M.; Sara, L.R.; Bolton, M. Examining dual-task interference effects of visual and auditory perceptual load in virtual reality. Int. J. Hum.-Comput. Stud. 2025, 205, 103619. [Google Scholar] [CrossRef]
- Dunifon, C.M.; Rivera, S.; Robinson, C.W. Auditory stimuli automatically grab attention: Evidence from eye tracking and attentional manipulations. J. Exp. Psychol. Hum. Percept. Perform. 2016, 42, 1947–1958. [Google Scholar] [CrossRef]
- Rummukainen, O.; Mendonça, C. Task-Relevant Spatialized Auditory Cues Enhance Attention Orientation and Peripheral Target Detection in Natural Scenes. J. Eye Mov. Res. 2016, 9, 1–10. [Google Scholar] [CrossRef]
- Du, P.; Li, D.; Liu, T.; Zhang, L.; Yang, X.; Li, Y. Crisis Map Design Considering Map Cognition. ISPRS Int. J. Geo-Inf. 2021, 10, 692. [Google Scholar] [CrossRef]
- Yu, J.; Zhou, M.; Wang, X.; Pu, G.; Cheng, C.; Chen, B. A Dynamic and Static Context-Aware Attention Network for Trajectory Prediction. ISPRS Int. J. Geo-Inf. 2021, 10, 336. [Google Scholar] [CrossRef]
- Wu, C.-F.; Gao, C.; Lin, K.-C.; Chang, Y.-H. Evaluating Impacts of Bus Route Map Design and Dynamic Real-Time Information Presentation on Bus Route Map Search Efficiency and Cognitive Load. ISPRS Int. J. Geo-Inf. 2022, 11, 338. [Google Scholar] [CrossRef]
- Ehinger, K.A.; Wolfe, J.M. When is it time to move to the next map? Optimal foraging in guided visual search. Atten. Percept. Psychophys. 2016, 78, 2135–2151. [Google Scholar] [CrossRef]
- Rymarkiewicz, W.; Cybulski, P.; Horbiński, T. Measuring Efficiency and Accuracy in Locating Symbols on Mobile Maps Using Eye Tracking. ISPRS Int. J. Geo-Inf. 2024, 13, 42. [Google Scholar] [CrossRef]
- Siepmann, N.; Edler, D.; Keil, J.; Kuchinke, L.; Dickmann, F. The position of sound in audiovisual maps: An experimental study of performance in spatial memory. Cartogr. Int. J. Geogr. Inf. Geovisualization 2020, 55, 136–150. [Google Scholar] [CrossRef]
- Medyńska-Gulij, B.; Gulij, J.; Cybulski, P.; Zagata, K.; Zawadzki, J.; Horbiński, T. Map design and usability of a simplified topographic 2D map on the smartphone in landscape and portrait orientations. ISPRS Int. J. Geo-Inf. 2022, 11, 577. [Google Scholar] [CrossRef]
- Huang, C.; Mees, O.; Zeng, A.; Burgard, W. Audio Visual Language Maps for Robot Navigation. In Experimental Robotics; Ang, M.H., Jr., Khatib, O., Eds.; Springer Proceedings in Advanced Robotics; Springer: Cham, Switzerland, 2024; Volume 30, p. 10. [Google Scholar]
- Besançon, L.; Ynnerman, A.; Keefe, D.F.; Yu, L.; Isenberg, T. The state of the art of spatial interfaces for 3D visualization. Comput. Graph. Forum 2021, 40, 293–326. [Google Scholar] [CrossRef]
- Kashevnik, A.; Lashkov, I.; Axyonov, A.; Ivanko, D.; Ryumin, D.; Kolchin, A.; Karpov, A. Multimodal Corpus Design for Audio-Visual Speech Recognition in Vehicle Cabin. IEEE Access 2021, 9, 34986–35003. [Google Scholar] [CrossRef]
- Pramanik, A.; Sarkar, S.; Maiti, J. A real-time video surveillance system for traffic pre-events detection. Accid. Anal. Prev. 2021, 154, 106019. [Google Scholar] [CrossRef]
- Coscia, A.; Suh, A.; Chang, R.; Endert, A. Preliminary Guidelines for Combining Data Integration and Visual Data Analysis. IEEE Trans. Vis. Comput. Graph. 2024, 30, 6678–6690. [Google Scholar] [CrossRef]
- Broeckelmann, E.M.; Martin, T.; Glazebrook, C.M. Auditory Cues and Feedback in the Serial Reaction Time Task: Evidence for Sequence Acquisition and Sensory Transfer. J. Mot. Behav. 2025, 57, 182–197. [Google Scholar] [CrossRef]
- Hirway, A.; Qiao, Y.; Murray, N. Spatial Audio in 360° Videos: Does it influence Visual Attention? In Proceedings of the 13th ACM Multimedia Systems Conference; Association for Computing Machinery: New York, NY, USA, 2022; pp. 39–51. [Google Scholar]
- Midtbø, T.; Larsen, E. Map Animations Versus Static Maps—When Is One of Them Better? In Proceedings of the Joint ICA Commissions Seminar on Internet-based Cartographic Teaching and Learning, Madrid, Spain, 6–8 July 2005. [Google Scholar]
- Medyńska-Gulij, B.; Wielebski, Ł.; Halik, Ł.; Smaczyński, M. Complexity Level of People Gathering Presentation on an Animated Map—Objective Effectiveness Versus Expert Opinion. ISPRS Int. J. Geo-Inf. 2020, 9, 117. [Google Scholar] [CrossRef]
- Shao, J.; Wu, J.; Tang, W.; Xue, C. How dynamic information layout in GIS interface affects users’ search performance: Integrating visual motion cognition into map information design. Behav. Inf. Technol. 2023, 42, 1686–1703. [Google Scholar] [CrossRef]
- Barvir, R.; Vozenilek, V. Developing Versatile Graphic Map Load Metrics. ISPRS Int. J. Geo-Inf. 2020, 9, 705. [Google Scholar] [CrossRef]
- Lin, X.; Pan, P. The Impact of Information Layout and Auxiliary Instruction Display Mode on the Usability of Virtual Fitting Interaction Interfaces. Information 2025, 16, 862. [Google Scholar] [CrossRef]
- Zhang, J.; Zhang, N.; Zhang, Y.; Xu, C. Research on Evaluation Method for Multimodal Information Interface Complexity. In Human-Computer Interaction—HCII 2025; Kurosu, M., Hashizume, A., Eds.; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2025; Volume 15768, p. 31. [Google Scholar]
- Wang, Z.; Liu, F.; Lu, Z.; Jia, F.; Wang, J. EHMI: A Complexity Assessment Method for Automotive Intelligent Cockpit Human-Computer Interaction Interfaces: An Example from the Instrument Cluster. In Design, User Experience, and Usability—HCII 2025; Schrepp, M., Ed.; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2025; Volume 15797, p. 21. [Google Scholar]
- Hsieh, M.C.; Chiu, M.C.; Hwang, S.L. An optimal range of information quantity on computer-based procedure interface design in the advanced main control room. J. Nucl. Sci. Technol. 2015, 52, 687–694. [Google Scholar] [CrossRef]
- Zhang, N.; Zhang, J.; Jiang, S.; Ge, W. The Effects of Layout Order on Interface Complexity: An Eye-Tracking Study for Dashboard Design. Sensors 2024, 24, 5966. [Google Scholar] [CrossRef] [PubMed]
- Roehrbein, F.; Coen-Cagli, R.; Schwartz, O. Dynamic scenes vs. static images: Differences in basic gazing behaviors for natural stimulus sets. J. Vis. 2011, 11, 486. [Google Scholar] [CrossRef]
- Afonso-Jaco, A.; Katz, B.F.G. Spatial knowledge via auditory information for blind individuals: Spatial cognition studies and the use of audio-VR. Sensors 2022, 22, 4794. [Google Scholar] [CrossRef]
- Boyer, E.O.; Portron, A.; Bevilacqua, F.; Lorenceau, J. Continuous Auditory Feedback of Eye Movements: An Exploratory Study toward Improving Oculomotor Control. Front. Neurosci. 2017, 11, 197. [Google Scholar] [CrossRef]
- Fleming, J.T.; Noyce, A.L.; Shinn-Cunningham, B.G. Audio-visual spatial alignment improves integration in the presence of a competing audio-visual stimulus. Neuropsychologia 2020, 146, 107530. [Google Scholar] [CrossRef]
- Eimontaite, I.; Gwilt, I.; Cameron, D.; Aitken, J.M.; Rolph, J.; Mokaram, S.; Law, J. Dynamic Graphical Signage Improves Response Time and Decreases Negative Attitudes Towards Robots in Human-Robot Co-working. In Human Friendly Robotics; Ficuciello, F., Ruggiero, F., Finzi, A., Eds.; Springer Proceedings in Advanced Robotics; Springer: Cham, Switzerland, 2019; Volume 7, p. 11. [Google Scholar]
- Yu, R.F.; Chan, A.H.S. Display movement velocity and dynamic visual search performance. Hum. Factors Ergon. Manuf. Serv. Ind. 2015, 25, 269–278. [Google Scholar] [CrossRef]
- Mahadevan, V.; Vasconcelos, N. Spatiotemporal Saliency in Dynamic Scenes. IEEE Trans. Pattern Anal. Mach. Intell. 2010, 32, 171–177. [Google Scholar] [CrossRef]
- Parise, C.V.; Ernst, M.O. Multisensory integration operates on correlated input from unimodal transient channels. eLife 2025, 12, RP90841. [Google Scholar] [CrossRef] [PubMed]
- Frischkorn, G.T.; Wilhelm, O.; Oberauer, K. Process-oriented intelligence research: A review from the cognitive perspective. Intelligence 2022, 94, 101681. [Google Scholar] [CrossRef]
- Li, W.C.; Yu, C.S.; Greaves, M.; Braithwaite, G. How cockpit design impacts pilots’ attention distribution and perceived workload during aiming a stationary target. Procedia Manuf. 2015, 3, 5663–5669. [Google Scholar] [CrossRef]
- Yiu, C.Y.; Ng, K.K.H.; Li, Q.; Yuan, X. Gaze behaviours, situation awareness and cognitive workload of air traffic controllers in radar screen monitoring tasks with varying task complexity. Int. J. Occup. Saf. Ergon. 2025, 31, 504–515. [Google Scholar] [CrossRef]
- Xue, H.; Wang, T.; Zhang, X. Visual search in vibration environments: Effects of spatial ability, stimulus size and stimulus density. Int. J. Ind. Ergon. 2020, 79, 102988. [Google Scholar] [CrossRef]
- Yamamoto, S.; Miyazaki, M.; Iwano, T.; Kitazawa, S. Bayesian calibration of simultaneity in audiovisual temporal order judgments. PLoS ONE 2012, 7, e40379. [Google Scholar] [CrossRef] [PubMed]
- Rohe, T.; Ehlis, A.C.; Noppeney, U. The neural dynamics of hierarchical Bayesian causal inference in multisensory perception. Nat. Commun. 2019, 10, 1907. [Google Scholar] [CrossRef] [PubMed]
- Ferrari, A.; Noppeney, U. Attention controls multisensory perception via two distinct mechanisms at different levels of the cortical hierarchy. PLoS Biol. 2021, 19, e3001465. [Google Scholar] [CrossRef] [PubMed]




























| No Sound | Binaural Sound | |
|---|---|---|
| Typical participant scanning path | ![]() | ![]() |
| The overlapped saccade tracks | ![]() | ![]() |
| Low | Medium | High | |
|---|---|---|---|
| Typical participant scanning path | ![]() | ![]() | ![]() |
| The overlapped saccade tracks | ![]() | ![]() | ![]() |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Published by MDPI on behalf of the International Society for Photogrammetry and Remote Sensing. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Zhang, J.; Zhu, X.; Tang, W.; Ge, W.; Zhang, Y.; Li, J. Enhancing Alarm Localization in Multi-Window Map Interfaces with Spatialized Auditory Cues: An Eye-Tracking Study. ISPRS Int. J. Geo-Inf. 2026, 15, 69. https://doi.org/10.3390/ijgi15020069
Zhang J, Zhu X, Tang W, Ge W, Zhang Y, Li J. Enhancing Alarm Localization in Multi-Window Map Interfaces with Spatialized Auditory Cues: An Eye-Tracking Study. ISPRS International Journal of Geo-Information. 2026; 15(2):69. https://doi.org/10.3390/ijgi15020069
Chicago/Turabian StyleZhang, Jing, Xiaoyu Zhu, Wenzhe Tang, Weijia Ge, Yong Zhang, and Jing Li. 2026. "Enhancing Alarm Localization in Multi-Window Map Interfaces with Spatialized Auditory Cues: An Eye-Tracking Study" ISPRS International Journal of Geo-Information 15, no. 2: 69. https://doi.org/10.3390/ijgi15020069
APA StyleZhang, J., Zhu, X., Tang, W., Ge, W., Zhang, Y., & Li, J. (2026). Enhancing Alarm Localization in Multi-Window Map Interfaces with Spatialized Auditory Cues: An Eye-Tracking Study. ISPRS International Journal of Geo-Information, 15(2), 69. https://doi.org/10.3390/ijgi15020069











