4.1. First Stage: Adaptation of the Auditory-Visual Stroop Test
The AV-Stroop training and test tracks were developed using different sound stimulus types with different spectral characteristics. Tinnitus mechanisms can involve tonotopic modifications at the central level [
34]. Physical properties of the stimulus can activate different auditory fibers, given auditory pathway tonotopy.
We hypothesized that spectral characteristics of the stimulus could interfere with the AV-Stroop test’s performance. In addition, stimuli with spectral characteristics compatible with tinnitus auditory perception can be more effective in evaluating attentional issues and inhibitory control in tinnitus participants. These stimuli can evoke even more specific neuronal activity related to tinnitus.
Stroop’s tasks can involve activations in areas such as the lateral prefrontal cortex, posterior parietal cortex, occipitotemporal cortex, medial prefrontal cortex, anterior cingulate cortex, and interactions with the cerebellum [
35]. The anterior insula, anterior and posterior cingulate cortex, precuneus, medial and dorsolateral prefrontal cortex, and inferior parietal lobule are areas related with suffering, embodiment, functional, and social impact [
36].
Some studies aimed to increase the sensitivity of the Stroop test for application in tinnitus participants [
10,
11,
12,
16,
18,
19]; for a review, see [
9]. AV-Stroop test adaptation resembles another study. The auditory stimuli were presented on the right and left sides. Additionally, the association with the presentation side establishes a conflict between the stimulus dimensions [
16].
Stimuli presented only in congruent conditions (training stage) familiarized and habituated the participant to the most expected form of presentation—auditory and visual stimuli on the same side. Logical association was determined by the presentation way of the stimuli in sensory modality: “The side I hear is the side I see”. Hence, the interference effect can be more effective, instinctive, and basal in test tracks.
The adequacy of the training phase was confirmed by the performance data collected during the protocol observational validation (n = 5), where participants achieved an accuracy rate of 100% and demonstrated stable total task times by the conclusion of the 21-trial sequence. This objective evidence supports the conclusion that the training track was sufficient for procedural mastery and full comprehension of the task demands, even without reaching the threshold of behavioral automation.
We manipulated the distribution of congruent and incongruent trials for the test tracks. Incongruent trials accounted for 20% of presentations (rare stimulus effect), requiring attentional resources to inhibit the automatic response. Another study manipulated and reduced the proportional presentation of emotionally relevant words to increase the interference effect in the emotional Stroop [
12]. Other methodology presented more balanced distributions of congruent and incongruent trials [
16] or a higher percentage of incongruent stimuli [
17].
We used nonverbal auditory stimuli, unlike previous studies that applied Stroop in the auditory modality [
16,
21,
22,
23]. We sought to establish a simplified process for the dominant association and for the conflict. There was no bias towards the concept of laterality or aspects of semantics or language. Even the visual stimuli we used were basic, graphic, familiar geometric shapes that were easily differentiated.
We determined the total task time as a quantitative variable, as previously done [
10,
11]. Some studies established reaction time [
8,
12,
13,
14,
15,
16,
17,
18,
19,
20], with specific results for congruent and incongruent presentation trials, and a more detailed analysis of the interference effect.
4.2. Second Stage: Comparison of the Auditory-Visual Stroop Test, the Conventional Stroop Test, and the Montreal Cognitive Assessment Results
During the Second Stage, the number of registered errors was analyzed—studies’ second most analyzed variable [
8,
15,
20]. Some studies used accuracy as an outcome measure [
16,
17]. Two studies mentioned recorded errors [
8] or correct answers [
16], whereas other studies did not report this information explicitly [
15,
17,
20]. Desired functionalities created for training and test tracks—instructions, on-screen guidance, response identification, and path to presentation trials—had the expected effect.
The AV-Stroop test results were compared with the reference conventional test version in the same sample (
Table 6 and
Table 7). Stroop tasks in auditory and visual modalities may be conceptually equivalent, considering the correlation between participants’ individual scores in the two tasks [
37]. The moderate correlation observed between the two instruments supports their convergent validity in measuring attentional and inhibitory control (
Table 6 and
Table 7). However, the AV-Stroop captures a distinct dimension of executive function, as it extends to the auditory domain. Our proposal consisted of a conceptual adaptation, like previous studies [
8,
13,
14,
15,
16,
17,
19,
20].
C-Stroop test’s modifications could have a discrete effect on its magnitude but not on its quality. Our results reinforce a generalization effect of the desired interference phenomenon [
38]. The number of errors in the AV-Stroop test was positively correlated with the Stroop CN task only for the NB, showing greater efficiency of this stimulus for assessment (
Table 7).
The MOCA score did not interfere with the AV-Stroop test (
Table 6 and
Table 7), reinforcing that our proposal involved a simple, basal, easy-to-apply task. MOCA scores exhibited a wide distribution, indicating varied global cognitive profiles despite the absence of formal neurological diagnoses. The null association between MOCA scores and AV-Stroop performance suggests that the attentional and inhibitory control required for the AV Stroop tasks may be a distinct executive domain, independent of general cognitive screening metrics.
4.3. Third Stage: Application of the Auditory-Visual Stroop Test in Participants with Tinnitus and Controls
Tinnitus can have a generalized effect on cognitive performance. The tinnitus group required more time to complete all Stroop measures, even in congruent conditions (training track) (
Table 8). The generalized difference involved tasks that required controlled processing and those that required automatic processing. Other studies demonstrated longer reaction or total task times in the tinnitus group, even in trials with neutral stimuli, reinforcing the hypothesis of a generalized effect of tinnitus on cognitive performance [
8,
11,
13,
15]. One possible explanation for the generalized depletion of cognitive resources in participants with tinnitus is that they are constantly performing dual tasks, as attention to tinnitus consumes attentional resources [
13,
15].
Some authors have found that reaction time was longer in the tinnitus group only for the incongruent Stroop trials, and therefore, attention deficits in this population would not be attributed to a generalized depletion of cognitive resources [
14]. The dorsal anterior cingulate cortex, activated in Stroop tasks, was recruited in more generalized processing functions, with an influence on brain function as a whole [
20]. The auditory modality of the spatial Stroop demonstrated that participants with tinnitus were slower than controls in all stages of the test, regardless of the interference [
16]. Our results are consistent with findings of previous studies [
8,
10,
11,
13,
15,
16,
20], showing a more widespread effect of tinnitus on Stroop task performance.
There were a few errors in absolute values for the three types of stimuli presented (
Table 9), as observed in other studies [
8,
15,
20]. A higher concentration of participants with tinnitus was observed among those who made three or more errors for the three test ranges applied (
Table 10).
Previous studies demonstrated no difference in the number of errors between groups with and without tinnitus [
8,
15]. Our dissonant findings could be explained by our methodology (nonverbal stimuli, free of semantic associations, and did not require more elaborate processing). The Stroop interference conflict can be established in a more basal processing pathway, possibly related to tinnitus.
The WN and NB conditions showed the between-group differences in the distribution of participants regarding error counts (
Table 10). In the Study group, there were slightly more participants with noise-like tinnitus than tonal tinnitus (
Table 5). It is possible that a larger, more balanced sample, encompassing different tinnitus types, would reveal differences across all stimulus types, as observed for total task time. However, we note this is a speculative observation. This potential relationship warrants dedicated investigation in future trials using larger, subtype-stratified cohorts to determine if stimulus-tinnitus congruency dictates the magnitude of interference.
Another possible explanation for this finding is that most participants in the Study group experienced tinnitus with pitch measured in the 2–8 kHz frequency range (
Table 5). Broadband noise stimuli recruit a greater range of auditory nerve fibers, potentially activating regions closer to or within the tinnitus frequency range. In contrast to tonal stimuli, broadband noise encompasses a wider spectrum of frequencies. Sound stimulation using different masking sounds can interfere with the pattern of spontaneous neural firing along the auditory pathway and can temporarily suppress or eliminate tinnitus [
39,
40]. Sound stimuli compatible with tinnitus in a frequency range may contribute to the magnitude of this phenomenon [
40].
In the present study, we applied nonverbal auditory stimuli with distinct physical characteristics. We also included the additional test range (Tinnitus Pitch) to enhance test sensitivity. The analysis, considering the additional test range (Tinnitus Pitch) used with the Study Group, revealed no effect on total task time (
Table 11). However, there were more errors in this test track (
Table 11 and
Table 12). This finding demonstrated that the Tinnitus Pitch stimulus was more sensitive to cognitive interference in the AV-Stroop test. The increased interference in top-down executive control reinforces the idea that the Tinnitus Pitch was more efficient for evaluating participants with tinnitus.
Auditory sensory input disturbance is a condition for tinnitus onset. A deafferented sensory area can result in neuroplastic modifications along and beyond the central auditory pathway [
41,
42,
43,
44,
45]. The deafferented sensory area presents a specific frequency distribution. The frequencies of tinnitus behavioral expression can be associated with the region of auditory nerve fibers affected. In participants with tinnitus and high-frequency hearing loss, the tinnitus spectrum encompassed the frequencies affected by hearing loss [
46]. In the central auditory system, the distribution of nerve fibers follows a defined tonotopic pattern, with modifications in the frequency-specific neural firing pattern at various levels of this pathway [
40]. Therefore, the spectral characteristics of the sound stimulus can mobilize specific auditory nerve fibers throughout the auditory pathway. Even in individuals without alterations in conventional audiometry, deafferentation is a condition for the onset of tinnitus, and the spectral characteristics of tinnitus may be related to the deafferented area [
40,
46].
Our results reinforced this relationship and demonstrated the importance of psychoacoustic measurements for a more in-depth investigation of the symptom, and additionally, as a basis for establishing therapeutic strategies within audiological rehabilitation.
Another possible explanation for the fact that the Tinnitus Pitch test track was more sensitive to demonstrate the interference effect in the Study Group is that such a stimulus would present negative emotional valence, an effect already sought in previous studies that sought to sensitize the test from stimuli related to tinnitus and that involved emotional processing [
10,
12,
18,
19]. To investigate this further, future studies should incorporate subjective annoyance ratings for each stimuli track and utilize an emotional Stroop control paradigm.
Furthermore, the Tinnitus Pitch stimulus could activate an important pathway for attentional focus. Tinnitus, as a behaviorally relevant signal, has processing priority at the central nervous system level compared to other competing stimuli [
47].
In our study, we found that participants with tinnitus (the Study Group) made more errors in response to WN and NB stimuli compared to the control group. Additionally, a higher percentage of Study group participants made errors on all three stimuli (WN, NB, and TP). While both groups committed errors, only participants in the Study Group exceeded the threshold of three errors per test track (
Table 9). Notably, the Tinnitus Pitch test track highlighted the greatest difficulty in attentional and inhibitory control among the Study Group. These findings are shown in
Table 11 and
Table 12.
Some authors observed the greatest difference in the reaction time rather than in accuracy, stating that tinnitus could affect cognitive efficiency more than cognitive performance [
8]. Our findings indicated that tinnitus interference can affect cognitive performance and cognitive efficiency. This suggests that cognitive interference in tinnitus is not necessarily an all-or-nothing failure of accuracy, but rather a persistent drain on processing resources that manifests as slower cognitive throughput.
The cognitive interference observed in our results likely reflects a shared neural substrate between the executive control network and the tinnitus distress network. Regions such as the anterior cingulate cortex and the dorsolateral prefrontal cortex are known to be activated during the resolution of Stroop interference; however, neuroimaging studies consistently show these same areas are chronically recruited in individuals with bothersome tinnitus [
35,
36]. Our finding of increased errors during pitch-matched tracks suggests a ‘neural bottleneck’ where the shared resources of these cortical areas are overwhelmed by the simultaneous demands of suppressing the internal tinnitus percept and resolving external interference conflict.
The present study highlights a comprehensive and generalized effect of tinnitus on cognitive executive control. Therefore, the audiological approach to tinnitus needs to incorporate measures that assess and help improve attentional control and inhibitory control in individuals with tinnitus. Individuals with tinnitus need to improve their cognitive efficiency and their cognitive performance. This may help to achieve the therapeutic goals of audiological rehabilitation. In addition to the numerous sound-based intervention possibilities, audiological rehabilitation of tinnitus also relies on neurocognitive auditory training.
Unlike another study [
17], our findings showed that tinnitus can interfere with cognitive performance, with cognitive failures occurring through the auditory pathway. Our results reinforce alterations in auditory processing in tinnitus [
16]. Stroop paradigms involving the auditory modality can be more effective in demonstrating cognitive interference, as previously stated [
16].
It is important to note that the error rates observed across all AV-Stroop test tracks remained consistently low. As highlighted in previous literature [
8,
15], low error frequencies in the Stroop paradigm should be interpreted with caution, as they may suggest that accuracy is not the most sensitive measure for evaluating interference in cognitively intact populations. In our study, the stability of these rates suggests that participants prioritized accuracy, effectively compensating for the conflicting auditory stimuli. Rather than serving as a primary measure of interference, these low error rates confirm that the significant differences found in total task times represent a genuine increase in cognitive processing demand rather than a sacrifice in performance quality.
Additionally, the low frequency of errors observed across all tracks can suggest a ceiling effect in the AV-Stroop task. While the simplicity of the task ensures the test’s clinical accessibility for participants with significant tinnitus distress, it limits the granularity of the error analysis. However, the fact that significant differences were still observed—despite the narrow range of error scores—suggests that the acoustic stimuli compatible with the tinnitus perception (Tinnitus Pitch track) provide a robust enough interference to break through the ceiling effect and reveal differences in inhibitory control.
4.4. Limitations and Future Directions
Despite the strengths of this study, certain limitations must be acknowledged. This study does not report test-retest or inter-rater reliability for the AV-Stroop instrument. While the current results establish a foundation for the test’s clinical utility and convergent validity, the stability of the measure over time and across different examiners remains to be quantified. The absence of these metrics is a primary limitation that necessitates future investigation to ensure the instrument’s robustness for long-term clinical monitoring and multi-center research applications.
The presentation software adopted for the AV-Stroop test adaptation did not establish reaction time analysis for the congruent and incongruent trials’ presentations. The study’s objective was not to assess the influence of factors, such as hearing level, education level, depression, anxiety, tinnitus severity, tinnitus characteristics, and symptom onset time, on participants’ performance. While these variables were documented for sample characterization, their specific influence on inhibitory control represents a distinct research question reserved for future multivariate analyses.
We were unable to quantify the ‘rare stimulus effect’ elicited by the 80/20 congruent-to-incongruent ratio. Although this ratio was implemented to increase inhibitory demand—consistent with established oddball paradigms—total task time as the primary outcome measure precludes a separate analysis of trial-specific interference.
The use of a fixed administration order may introduce potential practice or order effects. Although task brevity was designed to minimize cognitive load, the lack of counterbalancing is a limitation to be addressed in future research.
While the MOCA identified a broad range of cognitive performance, it did not correlate with AV-Stroop total task times. This suggests the AV-Stroop may target specific executive resources not captured by global screening tools, though further research is needed to determine the clinical significance of this independence in the tinnitus population.
Tinnitus pitch matching is inherently subjective; therefore, we minimized potential variability by requiring confirmation of the matched frequency. Despite this internal verification, the established limitations of pitch-matching stability in the literature must be considered when interpreting the results.
The observed results yielded significant associations with moderate effect sizes. However, the relatively small sample size and low error variance should be considered when generalizing these findings. Future studies with larger cohorts could further delineate the nuances of AV-Stroop test interference in diverse clinical populations.
Given the exploratory nature of this research, formal corrections for multiple comparisons were not performed. This preserves the power to detect associations in a new assessment paradigm but increases the risk of Type I errors. Accordingly, borderline significant findings (p-value approximately 0.05) can be interpreted as preliminary trends that require confirmation in future trials.
The use of a post hoc error threshold (≥3) represents a limitation of the current study. While this cut-off was necessitated by the overall low error rates and the need to characterize specific performance clusters, future studies should utilize larger samples to establish pre-specified, validated clinical benchmarks for AV-Stroop test interference.
We believe that future applications of the AV-Stroop test in tinnitus participants could involve cognitive function assessment and therapeutic monitoring follow-up before and after an intervention. It would be important to investigate functional imaging of the neuronal connectivity network elicited by the AV-Stroop test. It can help clarify the effect of using stimuli compatible with the tinnitus pitch in assessing executive attentional control.
Future studies may clarify these aspects and help guide personalized therapeutic strategies. We agree that enhancing attentional and inhibitory control may be associated with improvements in tinnitus, as suggested by previous studies [
14,
48].
The audiological rehabilitation process should also consider tinnitus-directed attention. Additionally, the attentional state can be dysfunctional, contributing to the maintenance and chronicity of tinnitus. Neurocognitive auditory training using auditory Stroop tasks [
21,
22,
23] can be a complementary tool in the audiological rehabilitation of tinnitus.