Next Article in Journal
Visual Learning and Innovative Teaching in Primary Schools: A Mixed-Methods Study of Foreign Pupils
Previous Article in Journal
The Role of Visual Education in Training Processes: A Systematic Review of the Use of Visual Tools to Enhance Learning and Promote the Development of Soft Skills
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Proceeding Paper

Psychological Dimensions Involved in Image Communication: A Multidisciplinary Research Proposal for Analyzing Cognitive and Perceptual Processes in Visual Education †

by
Giusi Antonia Toto
1,* and
Pierpaolo Limone
2
1
Department of Humanities, University of Foggia, Via Arpi 176, 71121 Foggia, Italy
2
Department of Human Sciences, Pegaso Online University, Isola F2, 80143 Napoli, Italy
*
Author to whom correspondence should be addressed.
Presented at the Learning and Teaching Strategies Mediated by Visual Education: Horizons of Research and Action (ASTERA 2025), Bari, Italy, 2 October 2025.
Proceedings 2026, 139(1), 7; https://doi.org/10.3390/proceedings2026139007
Published: 17 April 2026

Abstract

Image communication represents a fundamental domain of human experience that intersects cognitive neuroscience, educational psychology, and visual communication theory. The increasing digitalization of contemporary society has amplified the importance of visual literacy, defined as the ability to interpret, use, and create visual media. While neuroscientific research highlights the brain’s proficiency in processing visual information, significant gaps remain in understanding the underlying psychological mechanisms and their practical applications in educational contexts. This study proposes a multidisciplinary research design to systematically analyze these psychological dimensions. The research will integrate cognitive, perceptual, and pedagogical perspectives to understand how visual representations influence learning. The methodological design includes a multi-method approach combining experimental analysis, ethnographic observation, and psychometric evaluation on a stratified sample of 240 participants (aged 16–25) divided into three groups: high school students (n = 80), university students (n = 80), and young professionals (n = 80). The proposed methodology will utilize eye-tracking to analyze visual perception patterns, integrated with semantic differential methods to evaluate cognitive and affective associations with visual imagery. The expected results should clarify how the effectiveness of image communication depends on the coherence between technical and semantic aspects of visual imagery. The research aims to contribute to the theoretical framework of educational neuroscience, offering empirical evidence for optimizing teaching strategies based on multimodal visual communication.

1. Introduction and Theoretical Rationale

1.1. Context and Relevance

Communication through images is one of the most ancient and universal modalities of human knowledge transmission, from Paleolithic cave paintings to modern interactive digital interfaces [1]. In the contemporary era, understanding the psychological mechanisms underlying image perception is crucial for optimizing educational processes [2]. While neuroscientific research has revealed that the human visual system actively constructs meaningful representations [3], the translation of this knowledge into educational principles remains fragmentary. Visual education emerges as an interdisciplinary paradigm to address this gap, but research in the field often lacks methodological approaches that capture the complexity of the processes involved. This proposal aims to fill this gap through an innovative design that integrates multiple theoretical perspectives and advanced investigation techniques.

1.2. Gaps in Existing Literature

A systematic analysis of the literature reveals several critical gaps. First, many studies focus on isolated aspects of visual communication (e.g., color perception) without considering the dynamic interaction between different cognitive processes, which prevents the development of integrated theoretical models [4]. Second, research often relies on traditional methodologies (e.g., questionnaires) that do not capture cognitive processes in real time. The use of advanced technologies like eye-tracking remains limited [5]. Third, existing research often neglects the importance of individual and cultural differences in visual processing, and there is no systematic understanding of how these variations influence visual communication effectiveness in diverse educational contexts [6].

1.3. Theoretical Framework

The proposed research is based on a multidisciplinary theoretical framework. The central pillar is Paivio’s dual-coding theory, which posits two distinct information processing systems for verbal and non-verbal (imaginative) information [7]. This proposal extends the theory by incorporating recent neuroscientific findings on specialized subsystems for different types of visual and spatial information. The second pillar is embodied cognition theory, which emphasizes the role of sensorimotor experience in knowledge construction [8]. This is relevant for understanding how visual representations can facilitate learning by activating sensorimotor schemas. The third element is cognitive load theory, which provides principles for optimizing the design of educational materials by distinguishing between intrinsic, extraneous, and germane load [9].

1.4. Research Objectives and Hypotheses

The research has four main objectives:
  • Characterize the neurobiological and cognitive processes underlying the perception of visual information in educational contexts.
    Hypothesis 1a: It is hypothesized that the effectiveness of visual communication will be positively correlated with the coordinated activation of specific neural networks. We expect to observe activation in the ventral visual stream, including the fusiform gyrus for shape and object recognition, and area V4 for color processing. The co-activation of these areas is predicted to correlate positively with communication effectiveness, measured by higher accuracy in comprehension tasks and shorter response times.
  • Identify cognitive dimensions that influence the effectiveness of image communication, particularly visual literacy and spatial visualization abilities.
    Hypothesis 2a: It is hypothesized that visual literacy competencies will significantly mediate the relationship between visual material design and learning outcomes.
  • Examine individual and cultural variables that modulate the understanding of visual content.
    Hypothesis 3a: It is hypothesized that systematic, statistically significant differences related to cultural background will be observed in the interpretation of symbolic visual elements.
  • Develop and validate an integrated theoretical model of visual learning.
    Hypothesis 4a: It is hypothesized that a hierarchical model composed of distinct but interconnected stages (from feature detection to mental representation construction) will explain a significant portion of the variance in visual learning performance.

2. Proposed Methodology

2.1. Research Design

The study will adopt a multi-methodological design integrating quantitative and qualitative approaches, framed within a critical realism paradigm [10]. This approach is motivated by the complexity of the phenomenon, which involves processes operating on different temporal and psychological scales. The design will be articulated in four phases: (1) an exploratory phase for sample characterization; (2) a controlled experimental phase using eye-tracking; (3) an evaluation phase to test learning and retention; and (4) a qualitative analysis phase to investigate interpretive strategies. This multi-phase structure allows for the triangulation of results, increasing the study’s validity [11].

2.2. Participants and Selection Criteria

The sample will consist of 240 participants, recruited through stratified sampling and divided into three groups of 80 individuals each: high school students (aged 16–18), university students (aged 19–22), and young professionals (aged 23–25). The inclusion criteria are (1) age 16–25; (2) normal or corrected-to-normal vision; (3) no diagnosed neurological or psychiatric disorders; (4) proficiency in the language of instruction; and (5) signed informed consent. The exclusion criteria include diagnosed learning disorders or specific professional experience in graphic design. The sample size was determined via power analysis (G*Power 3.1), targeting a medium effect size (f = 0.25) with a statistical power of 0.80 at an alpha level of 0.05 [12]. Participants will be recruited through university channels and public advertisements, and will receive monetary compensation for their time.

2.3. Instruments and Materials

Visual literacy assessment: A battery of standardized tests will be used, including the Visual Literacy Assessment Test (VLAT) and the Spatial Visualization Test (SVT) [13,14]. Additionally, a new instrument, the Educational Visual Literacy Questionnaire (EVLQ), will be developed to assess metacognitive strategies and preferences. The development will follow a structured process, including a comprehensive literature review to identify key constructs, followed by focus groups with educational experts to ensure the content validity of the items. A pilot study will then be conducted to validate the EVLQ. The validation will involve statistical analyses such as exploratory factor analysis (EFA) to identify the underlying factor structure, confirmatory factor analysis (CFA) to test the hypothesized model, and reliability analysis (e.g., Cronbach’s alpha) to assess internal consistency.
Eye-tracking system: A high-resolution eye-tracking system (e.g., Tobii Pro Spectrum, 1200 Hz) will be used to record eye movements. The system will be calibrated for each participant using a standardized nine-point procedure. Metrics will include fixation duration, saccade patterns, and time to first fixation on predefined Areas of Interest (AOIs) [15].
Experimental visual materials: A corpus of 120 images will be developed, divided into four categories: scientific diagrams, infographics, educational photographs, and artistic compositions. The materials will be systematically designed and controlled for variables such as visual complexity (measured via computational algorithms), information density, and colorimetry [16]. A pilot study will be conducted to validate the stimulus set.

2.4. Experimental Procedures

The procedure will involve two separate sessions to mitigate fatigue.
  • Session 1: Characterization (90 min): Participants will complete a demographic questionnaire, the visual literacy tests (VLAT, SVT), and selected subtests from the Wechsler Adult Intelligence Scale (WAIS-IV) to measure baseline cognitive abilities [17].
  • Session 2: Eye-tracking experiment (120 min): Following calibration, participants will observe the 120 visual stimuli, presented in a randomized order. Each trial will consist of a 30 s image presentation, followed by comprehension questions. Eye movements and physiological data (pupil dilation, blink rate) will be recorded continuously. Subjective ratings of difficulty and interest will be collected after each block of stimuli.

2.5. Data Analysis Plan

Data quality control: Raw eye-tracking data will undergo a rigorous preprocessing pipeline. Trials with poor calibration accuracy (>1° of visual angle) or excessive data loss (>25% of samples) will be excluded from the analysis. Missing data points in questionnaires will be handled using multiple imputation methods [18].
Statistical analyses: Data will be analyzed using a hierarchical approach. Repeated-measures ANOVAs will be used to analyze between-group differences, with appropriate corrections for sphericity violations (e.g., Greenhouse–Geisser) [19]. Multiple regression models will be used to identify predictors of visual comprehension, controlling for potential confounding variables. To manage the risk of Type I errors from multiple comparisons, Bonferroni corrections will be applied to post hoc tests.
Qualitative analyses: Data from semi-structured interviews will be analyzed using thematic analysis [20]. To ensure validity, triangulation will be employed, including inter-rater reliability checks (Cohen’s Kappa) and member checking.

2.6. Ethical Considerations

The research will be conducted in full compliance with the ethical guidelines of the Declaration of Helsinki and will be submitted for approval to the Institutional Review Board (IRB) of the University of Foggia. All participants will provide written informed consent after receiving a complete description of the study, including potential risks (e.g., minor visual fatigue) and benefits. The consent form will explicitly detail the nature of the data being collected, particularly eye-tracking and physiological measures. Participants will be informed of their right to withdraw at any time without penalty. All data will be anonymized to protect confidentiality and stored on secure, encrypted servers. Following the experiment, participants will be fully debriefed about the research goals. In the event of incidental findings (e.g., discovery of previously undiagnosed visual impairments), a protocol is in place to inform the participant in a sensitive manner and recommend consultation with a specialist.

3. Expected Results

It is expected that visual literacy competencies will improve with age and educational experience, with young professionals outperforming university students, who, in turn, will outperform high school students. Eye-tracking data are expected to reveal more efficient visual exploration strategies (e.g., fewer fixations of longer duration) in participants with higher visual literacy. Analysis of Areas of Interest (AOIs) should show that experts focus more on semantically relevant information. Physiological measures, such as pupil dilation, are expected to correlate with cognitive load, with more complex stimuli eliciting greater pupillary responses. Finally, multiple regression models are expected to identify visual literacy and the clarity of the visual hierarchy as significant predictors of comprehension. The study will also explore alternative outcomes, such as non-significant findings or unexpected interaction effects, which would themselves provide valuable insights into the boundary conditions of visual learning theories.

4. Implications and Dissemination

4.1. Theoretical and Practical Implications

The research is expected to provide significant contributions to visual communication theory by extending dual-coding theory and providing a more nuanced model of visual processing. Practically, the results will yield evidence-based guidelines for the design of effective educational materials (e.g., infographics, diagrams). These guidelines could help reduce learning disparities by showing how well-designed materials can support learners with lower individual competencies.

4.2. Dissemination Plan

The results will be disseminated through multiple channels:
  • Academic publications: The submission of at least three articles to high-impact, peer-reviewed journals in educational psychology and cognitive neuroscience.
  • Conferences: the presentation of findings at major international conferences (e.g., AERA, CogSci).
  • Workshops: The organization of workshops for educators and instructional designers to translate research findings into practical applications.
  • Open science: Anonymized datasets and analysis scripts will be made available on a public repository (e.g., OSF) to promote transparency and replication.

5. Project Management

5.1. Limitations of the Study

The study has several limitations. The cross-sectional design does not allow for causal inferences about development; a longitudinal study would be needed for this. The sample, while stratified, is limited to a specific age range (16–25) and educational context, which may limit the generalizability of the findings to other populations (e.g., older adults or children). The use of laboratory-based stimuli may not fully capture the complexity of real-world learning environments.

5.2. Timeline and Implementation Schedule

The research project is planned for a total duration of 36 months, structured into four main phases. The initial phase, which will cover the first six months, will be dedicated to reviewing the existing scientific literature, obtaining the necessary ethics approvals, developing the research instruments, and their subsequent piloting phase. Subsequently, the second and most extensive phase will extend from the seventh to the twenty-fourth month, focusing on participant recruitment and data collection through two distinct sessions. The third phase, scheduled between the twenty-fifth and thirtieth months, will be entirely dedicated to the analysis of the collected data, both quantitative and qualitative. Finally, the last phase, spanning from the thirty-first to the thirty-sixth month, will focus on the dissemination of results through academic publications, conference presentations, and the preparation of practical guidelines for educators.
A total budget of €450,000 has been requested for the project, distributed over a three-year period. The main expense item is personnel costs, which include a post-doctoral researcher and a research assistant, for a total of €240,000. An amount of €35,000 has been allocated for equipment, including eye-tracker maintenance and servers. A sum of €30,000 is designated for the compensation of the 240 research participants. For dissemination activities, which include publications and conference participation, a budget of €30,000 has been allocated. Overheads, representing the university’s indirect costs at 20% of the budget, amount to €67,000. For consumables, such as software and office supplies, €18,000 has been budgeted. Lastly, a 5% contingency fund is included to cover unforeseen expenses, such as additional participant recruitment costs, technical issues with equipment, or the need for supplementary software licenses.

Author Contributions

Conceptualization, G.A.T. and P.L.; methodology, G.A.T. and P.L.; software, G.A.T. and P.L.; validation, G.A.T. and P.L.; formal analysis, G.A.T. and P.L.; investigation, G.A.T. and P.L.; resources, G.A.T. and P.L.; data curation, G.A.T. and P.L.; writing—original draft preparation, G.A.T. and P.L.; writing—review and editing, G.A.T. and P.L.; visualization, G.A.T. and P.L.; supervision, G.A.T. and P.L.; project administration, G.A.T. and P.L.; funding acquisition, G.A.T. and P.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Elkins, J. The Domain of Images; Cornell University Press: Ithaca, NY, USA, 1999. [Google Scholar]
  2. Kress, G.; Van Leeuwen, T. Reading Images: The Grammar of Visual Design, 2nd ed.; Routledge: Oxfordshire, UK, 2006. [Google Scholar]
  3. Rensink, R.A. The dynamic representation of scenes. Vis. Cogn. 2000, 7, 17–42. [Google Scholar] [CrossRef]
  4. Hegarty, M. The cognitive science of visual-spatial displays: Implications for design. Top. Cogn. Sci. 2011, 3, 446–474. [Google Scholar] [CrossRef] [PubMed]
  5. Jarodzka, H.; Scheiter, K.; Gerjets, P.; van Gog, T. In the eyes of the beholder: How experts and novices process visual information. Learn. Instr. 2010, 20, 146–154. [Google Scholar] [CrossRef]
  6. Nisbett, R.E.; Miyamoto, Y. The influence of culture: Holistic versus analytic perception. Trends Cogn. Sci. 2005, 9, 467–473. [Google Scholar] [CrossRef] [PubMed]
  7. Paivio, A. Mental Representations: A Dual Coding Approach; Oxford University Press: Oxford, UK, 1986. [Google Scholar]
  8. Barsalou, L.W. Grounded cognition. Annu. Rev. Psychol. 2008, 59, 617–645. [Google Scholar] [CrossRef] [PubMed]
  9. Sweller, J.; van Merriënboer, J.J.; Paas, F. Cognitive architecture and instructional design: 20 years later. Educ. Psychol. Rev. 2019, 31, 261–292. [Google Scholar] [CrossRef]
  10. Bhaskar, R. A Realist Theory of Science; Routledge: Oxfordshire, UK, 2008. [Google Scholar]
  11. Creswell, J.W.; Plano Clark, V.L. Designing and Conducting Mixed Methods Research, 3rd ed.; Sage Publications: Thousand Oaks, CA, USA, 2017. [Google Scholar]
  12. Faul, F.; Erdfelder, E.; Lang, A.G.; Buchner, A. G*Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behav. Res. Methods 2007, 39, 175–191. [Google Scholar] [CrossRef] [PubMed]
  13. Lee, S.; Kim, S.H.; Kwon, B.C. VLAT: Development of a Visualization Literacy Assessment Test. IEEE Trans. Vis. Comput. Graph. 2017, 23, 551–560. [Google Scholar] [CrossRef] [PubMed]
  14. Bodner, G.M.; Guay, R.B. The Purdue Visualization of Rotations Test. Chem. Educ. 1997, 2, 1–17. [Google Scholar] [CrossRef]
  15. Salvucci, D.D.; Goldberg, J.H. Identifying fixations and saccades in eye-tracking protocols. In Proceedings of the 2000 Symposium on Eye Tracking Research & Applications, New York, NY, USA, 6–8 November 2000; pp. 71–78. [Google Scholar]
  16. Oliva, A.; Torralba, A. Modeling the shape of the scene: A holistic representation of the spatial envelope. Int. J. Comput. Vis. 2001, 42, 145–175. [Google Scholar] [CrossRef]
  17. Wechsler, D. Wechsler Adult Intelligence Scale–Fourth Edition (WAIS–IV); NCS Pearson: Bloomington, MI, USA, 2008. [Google Scholar]
  18. Van Buuren, S. Flexible Imputation of Missing Data, 2nd ed.; CRC Press: Boca Raton, FL, USA, 2018. [Google Scholar]
  19. Field, A. Discovering Statistics Using IBM SPSS Statistics, 5th ed.; Sage: Thousand Oaks, CA, USA, 2018. [Google Scholar]
  20. Braun, V.; Clarke, V. Using thematic analysis in psychology. Qual. Res. Psychol. 2006, 3, 77–101. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Toto, G.A.; Limone, P. Psychological Dimensions Involved in Image Communication: A Multidisciplinary Research Proposal for Analyzing Cognitive and Perceptual Processes in Visual Education. Proceedings 2026, 139, 7. https://doi.org/10.3390/proceedings2026139007

AMA Style

Toto GA, Limone P. Psychological Dimensions Involved in Image Communication: A Multidisciplinary Research Proposal for Analyzing Cognitive and Perceptual Processes in Visual Education. Proceedings. 2026; 139(1):7. https://doi.org/10.3390/proceedings2026139007

Chicago/Turabian Style

Toto, Giusi Antonia, and Pierpaolo Limone. 2026. "Psychological Dimensions Involved in Image Communication: A Multidisciplinary Research Proposal for Analyzing Cognitive and Perceptual Processes in Visual Education" Proceedings 139, no. 1: 7. https://doi.org/10.3390/proceedings2026139007

APA Style

Toto, G. A., & Limone, P. (2026). Psychological Dimensions Involved in Image Communication: A Multidisciplinary Research Proposal for Analyzing Cognitive and Perceptual Processes in Visual Education. Proceedings, 139(1), 7. https://doi.org/10.3390/proceedings2026139007

Article Metrics

Back to TopTop