Next Article in Journal
Sex-Specific Association of Body Mass Index with Hippocampal Subfield Volume and Cognitive Function in Non-Demented Chinese Older Adults
Previous Article in Journal
Transcranial Direct Current Stimulation Decreases P3 Amplitude and Inherent Delta Activity during a Waiting Impulsivity Paradigm: Crossover Study
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Probing the Bottleneck of Awareness Formed by Foveal Crowding: A Neurophysiological Study

School of Optometry and Vision Sciences, Bar-Ilan University, Ramat Gan 5290002, Israel
*
Author to whom correspondence should be addressed.
Brain Sci. 2024, 14(2), 169; https://doi.org/10.3390/brainsci14020169
Submission received: 5 January 2024 / Revised: 28 January 2024 / Accepted: 5 February 2024 / Published: 7 February 2024
(This article belongs to the Special Issue From Visual Perception to Consciousness)

Abstract

:
Crowding occurs when an easily identified isolated stimulus is surrounded by stimuli with similar properties, making it very difficult to identify. Crowding is suggested as a mechanism that creates a bottleneck in object recognition and awareness. Recently, we showed that brief presentation times at the fovea resulted in a significant crowding effect on target identification, impaired the target’s color awareness, and resulted in a slower reaction time. However, when tagging the target with a red letter, the crowding effect is abolished. Crowding is widely considered a grouping; hence, it is pre-attentive. An event-related potential (ERP) study that investigated the spatial–temporal properties of crowding suggested the involvement of higher-level visual processing. Here, we investigated whether ERP’s components may be affected by crowding and tagging, and whether the temporal advantage of ERP can be utilized to gain further information about the crowding mechanism. The participants reported target identification using our standard foveal crowing paradigm. It is assumed that crowding occurs due to a suppressive effect; thus, it can be probed by changes in perceptual (N1, ~160 ms) and attentive (P3 ~300–400 ms) components. We found a suppression effect (less negative ERP magnitude) in N1 under foveal crowding, which was recovered under tagging conditions. ERP’s amplitude components (N1 and P3) and the behavioral proportion correct are highly correlated. These findings suggest that crowding is an early grouping mechanism that may be combined with later processing involving the segmentation mechanism.

1. Introduction

The phenomenon in which the ability to recognize an object is reduced when it is surrounded by similar objects, known as crowding, has been widely studied. Crowding has recently been interpreted as optimal integration given the limited precision of peripheral processing [1]. Crowding is considered a form of inhibitory interaction between the surrounding nearby contours and the target and produces “a fundamental limit on conscious perception and object recognition” [2]. Recent data demonstrate that visual crowding is correlated with awareness [3]. One of the crowding theories is called Gestalt grouping [4,5,6]. To perceive the whole object, the local features (grouping) of an object must be combined. Nador and Reeves [7] found that peripheral crowding is eliminated when both spatial frequency and the orientation of the Gabor flankers differ from those of the Gabor target, but changing just one of them has almost no effect. If the local and basic features comply with the Gestalt principles [8,9], they can be combined. However, previous studies have shown that in the periphery, crowding is reduced when the target and flankers are dissimilar in shape, size, orientation, polarity, spatial frequency, depth, color, and motion (tagging and pop-out effects) [10,11,12,13,14,15]. Similarly, the fovea of strabismic amblyopes, which is considered as periphery, typically has extensive crowding, but tagging the target in foveal crowding experiments revealed that crowding is largely reduced [2,16]. Thus, when some features differ from others, the grouping process is disrupted. This hypothesis can explain how crowding is reduced in the periphery and fovea by tagging. These results strengthen the hypothesis that Gestalt grouping [4,5,6] is one factor among others for crowding. The classic term ‘pop-out’ may not accurately describe the tagging effect, so we prefer to use tagging as a more appropriate term.
There are two main hypotheses regarding information processing strategies in crowding: one suggests that crowding is bottom-up and pre-attentive based on the concept that crowding occurs at a lower visual processing level [17]. Dakin et al. (2009) suggested that crowding is not influenced by attentional limit, and that crowding and attention do not share the same neural mechanisms [9]. Specifically, crowding occurs even when people are completely unaware of the flankers; this suggests that awareness and attention are not early conditions for crowding [10,18,19].
Another perspective suggests that crowding is a top-down process and that an attentional process is based on the idea that crowding occurs at a higher processing level. Therefore, the attentional resolution in peripheral vision limits the access of crowded targets to the consciousness [20,21]. Yeshurun and Carrasco showed that attention improves performance at peripheral locations by enhancing spatial resolution [22]. Furthermore, attention reduces the critical target–flanker distance at which the flankers no longer interfere with target identification [23]. The accumulated data lead to the conclusion that crowding occurs at multiple stages in the visual hierarchy [10].
The crowding phenomenon has been explored using human behavior (psychophysical, PSY) [2,24,25] and in event-related potential (ERP) by Chicherov and Herzog (2014) and others [26,27,28,29,30], mainly in peripheral crowding studies. Some studies revealed the foveal crowding effect using adaptive optics or under brief presentation times [31,32,33]. To date, there have been many psychophysical studies on crowding (for a review, see Levi 2007 [2]), but only a few have used ERP to study foveal crowded conditions [34]. These studies showed that the main effect of peripheral or foveal crowding is suppression, displayed by the N1 ERP component [28,34]. Therefore, the N1 amplitude increased as the crowding effect decreased [34]. In contrast, the earlier component, P1, is assumed to be correlated with a physical energy stimulus and is not affected by behavior; since it appears very early, it should not be affected by crowding [34]. The main conclusion of Chicherov and Herzog’s (2014) study is that crowding occurs during higher-level visual processing. Another interpretation is related to the assumption that the N1 amplitude represents facilitatory or inhibitory lateral interactions [35]; therefore, the source of N1 is considered as early or intermediate visual processing [35,36]. A recent EEG study measured the N1 amplitude under peripheral crowding as well as uncrowding under attended and unattended conditions [30].
Most of the studies on pop-out were performed in the periphery of the visual field, assuming that this phenomenon directs attention to an area of interest where the participant is not looking directly [37]. Many studies have shown a high correlation between the P3 component, attention, and awareness. Its amplitude was found to increase with increasing attention and awareness resources [38,39,40,41,42,43]. In addition, one of the characteristics of P3 is that it is affected by attention and decision making [44]. Another recent study suggested that crowding occurs simultaneously across multiple levels of visual processing [45]. In addition, it was suggested that crowding occurs automatically and that it can be modulated by attention.
In our previous study [31], using brief presentation times (40, 80, 120, and 240 ms), participants with normal monocular and binocular vision were asked to indicate the direction of the E target under isolated and crowded conditions. The stimulus was presented at the fixation point in the center of vision (fovea) to reveal the crowding effect. We found that crowding significantly reduces sensitivity under monocular and binocular conditions (p < 0.0001). The sensitivity under crowding conditions increases as the presentation time increases. We also used a red letter relative to the background (black matrix) to isolate the target [12,31]. There was a significant improvement in the sensitivity at all presentation times.
We found that the foveal crowding effect can be reduced (to almost the level of the uncrowded condition) by tagging the target letter with a different color. Very interestingly, in further research [46], we showed that under a very brief presentation time, the participant’s perception of the target’s color (black or red) was impaired; in other words, in many trials, the participants were not aware about the target color, but crowding was still negated. Moreover, the reaction time under crowded conditions was markedly slower; however, the RT rebound that occurred even when the tag color was invisible showed that attentional and decision factors are not involved. It was suggested that the representation of an object consists of representations of many of the relevant aspects of it, and these are likely to be distributed in different cortical locations [47]. Thus, we wanted to explore the electrophysiological markers of early and late processing (attention and decision) using an ERP experiment that may shed light on the open question of whether there is a common mechanism for crowding and tagging (pop-out). Hence, our main hypothesis was that, since tagging decreases the foveal crowding effect, this effect would consequently increase the N1 amplitude, improve attention and decision making, and therefore also increase the P3 amplitude.

2. Methods

2.1. Participants

Ten adults with normal or corrected-to-normal vision and with no known neurological disorders participated in this study. Using the Ishihara test, we ensured that the participants did not have color deficiencies. The EEG data of one participant were removed from the analyses due to a technical failure during the experiment. The ages of the participants were between 20 and 38 (mean = 27.22, SD = 6.34). Visual functions were tested by a certified optometrist before participation in the study. All participants had a visual acuity of at least 20/20 and all were right-handed.
The participants signed a consent form that was approved by the Internal Review Board (IRB) of Bar-Ilan University, and all methods were performed according to the relevant guidelines and regulations; each participant was included only after ‘informed’ consent was obtained. All study protocols were approved by the Ethics Committee of Bar-Ilan University.

2.2. Stimuli and Procedures

The sitting distance was fixed at 185 cm. We measured the luminance intensity of the stimulus (E letter-isolated/crowding) and the background (a white screen) using a luminance meter (LS-100 KONICA MINOLTA, Tokyo, Japan). The luminance of both the black and red of the target letter and matrix was 9.5 cd/m2; it was presented on 65 cd/m2 of a white screen. The Weber contrasts were 85%. All stimuli were displayed binocularly. There was a matrix of E letters around the E target letter (black or red), which was arranged randomly. The size of the matrix was 5 × 5 letters. The size of each matrix letter corresponded to the size of the target letter. The target letter was an E letter presented at the center of the screen (presented on the central fovea), and it was marked by a fixation point. The letter E was displayed at a size of 2.7 mm, corresponding to a visual acuity of 20/20 (i.e., the stroke width was 1 arcminute, and the overall size of the letter was 5 arcminutes). The target letter was shown either in black or red. The task of the participants was to indicate the direction of the E target by clicking on the mouse key by using two different fingers (right click for E or left click for ꓱ); counterbalancing across participants was not performed because, in our opinion, this is the intuitive way. The side of the button corresponded to the side to which the signal was directed. In our further research, we did not find a significant difference between a black matrix with a red target letter and a red matrix with a black target letter, so we chose to present only the first condition [46]. There were four conditions (in random order [mixed by trials]): two for uncrowded (1-letter spacing between letters and a total size of 35 arcminutes) conditions (black, red) and two for crowded (0.4-letter spacing between letters and a total size of 23 arcminutes) conditions (black, red; in the middle of a black matrix) (see Figure 1). Note that the 0.4-letter spacing represents 0.4 times the letter size when the spacing is measured from edge to edge. Our ‘uncrowded’ definition relies on a previous article [48], and the spacing was carried out according to the methodology of visual acuity testing using the ETDRS chart where there is a space of one letter between letters to avoid crowding. Unlike in our previous article [31], here, we presented a spaced matrix (1-letter spacing) instead of a single letter to maintain a similar energy level (for ERP waves) for a smaller matrix (0.4-letter spacing).
For each condition, the stimulus was presented at one presentation time (30, 40, 50, 60, or 80 ms) that was adapted to each participant according to their detection threshold (60–80% for the black crowded condition). All experiments were administered in a dark room and were performed on the same day. Each run lasted about 4 min continuously without a break, but participants were allowed to take a break without any time limit between the runs, which were tested on 5 repetitions for each participant. There were 150 trials/conditions for each participant (i.e., 1500 trials/conditions for ten participants). We used a jitter of 0.4-letter spacing, i.e., all the stimuli were presented at the center of the screen with four possible identifications of letter rotations (up, down, right, and left) at random. The participants were instructed to respond quickly, unless the response speed led to errors. Although the reaction time data were not the main component of our study, they were used as references for the other results. The reaction times were measured for each trial from the moment the stimulus appeared until the subject clicked on the mouse key; trials in which no press was made were not included in the calculation of the reaction times.

2.3. Apparatus

Stimuli were presented on a 23.5″ (53.3 × 30 cm) LCD monitor (ASUS VG248QE) with a 100 Hz refresh rate and at a 1920 × 1080 pixel resolution using an NVIDIA GeForce GT 730 graphic card. The visual angle of the LCD monitor was 16.4° × 9.3°. The monitor had high temporal accuracy and was found to be suitable for visual psychophysics. The stimuli were presented using an in-house-developed platform for psychophysical experiments (PSY) developed by Yoram Bonneh and run on a Windows PC [49]. Note that the platform knows how to identify time missed and can filter those instances.

EEG Recording and Processing

EEGs were recorded from 64 passive tin electrodes (the Quik-Cap uses gel for application) arranged in standard locations across the scalp. (We performed our analyses on the main electrodes relevant to our study: PO3, POZ, PO4, O1, OZ, O2, F1, FZ, and F2, i.e., all analyses of P1 and N1 refer to the average of the occipital area electrodes (PO3, POZ, PO4, O1, OZ, and O2), and all analyses of P3 refer to the average of the frontal area electrodes (F1, FZ, and F2)). Since early EEG amplitudes, such as the P1 component, are sensitive to luminance and the stimulus size [34,50,51], we examined this component in the occipital region, represented by occipital (‘O’) electrodes. Also, since the P3 component is affected by attention and decision making, we examined this component in the frontal region, represented by frontal (‘F’) electrodes. The ERP results (Figure 2, the upper graphs) were obtained according to two main electrodes; OZ was considered as processing in the occipital area, and FZ was considered as processing in the frontal area. The EEGs were continuously sampled at 1 KHz and amplified with a SynAmps2/RT amplifier (Neuroscan, Victoria, Australia) with a bandpass of DC to 100 Hz. The data were low pass filtered at 30 Hz. For every condition, the average ERPs were computed over a 650 ms period per trial, with 150 trials per participant and 1350 trials per condition (one participant was not considered due to a technical fault in the recording). For each trial, the mean of 100 ms before the onset of the stimulus was taken as the baseline for the trial. The waveforms of the evoked responses (Figure 2), the correlations (Figure 3), and the global field power (Figure 4) were analyzed separately for each participant. Peak amplitudes were measured for a positive component, labeled as P1 (70–190 ms) and P3 (250–800 ms), and for a negative component, it was labeled as N1 (100–240 ms). Note that we found it appropriate to present only the graph of the correct answers because incorrect answers can result from a lack of attention, blinking, and so forth, which may add noise to the graph. Also, this method is compatible with the RT only presenting the results of the correct answers.

2.4. Data Analysis

After verifying that all of the data (proportion correct and log (RT)) were normally distributed and that the variances were homogenous (by Levene’s test), we used the two-factor ANOVA with replication test to compare the main groups (density and color). Also, we used a t-test, namely the Paired Two Sample for Means tool, to compare two specific parameters; the effect size (ES) is presented using Cohen’s d method. This method is used for comparing conditions. All statistical results were computed after a multiple comparison test was completed.
The calculated sample size in this study was based on the ‘Sample Size Justification’ [52] article and was calculated using G*Power 3.1.9.4 software, i.e., the sample size was calculated (post hoc) using the mean and SD (standard deviation) parameters (comparison between BC and RC was 1.57 for the proportion correct, 1.52 for the reaction time, and 1.89 for the N1 amplitude). All values showed that the required sample size was up to 10 participants. In addition, this sample size is consistent with previous studies from our lab [Sterkin et al., 2012 [35] (n = 8), Siman-Tov et al., 2021 [31] (n = 10)].
All correlation graphs refer to Pearson’s correlation, where each dot in the graphs represents the average runs of one participant.
Extreme value artifacts were rejected by denoting them as ‘bad blocks’: time regions around amplitudes exceeding a ±150 μVolt threshold were measured at the Mean Global Field Potential (MGFP) and +60 μVolts at the frontal channels (F7, Fz, and F8); samples at a time range of −200 to 800 ms were excluded. The following are the numbers of trials that were rejected for each condition in the occipital electrodes: BUC (191), BC (180), RUC (212), and RC (210) (i.e., around 15% of all of the trials). The following are the numbers of trials that were rejected in the frontal electrodes: BUC (161), BC (150), RUC (223), and RC (158) (i.e., around 13% of all of the trials).

3. Results

3.1. Psychophysics

The results (Figure 5) show that foveal crowding relative to an uncrowded condition reduces sensitivity (Figure 5(A2); t(9) = 7.73, p < 0.001, ES = 1.95) and increases the reaction time (RT) (Figure 5(B2); t(9) = 5.59, p < 0.001, ES = 1.41), whereas tagging reduces the crowding effect, i.e., improves sensitivity (Figure 5(A3); t(9) = 4.80, p < 0.001, ES = 1.62), and shortens the RT (Figure 5(B3); t(9) = 5.49, p < 0.001, ES = 1.56). In addition, the tagging improves sensitivity (Figure 5(A1); t(9) = 5.13, p < 0.001, ES = 1.04) and shortens the RT (Figure 5(B1); t(9) = 6.36, p < 0.001, ES = 1.39) even under uncrowding conditions. Table 1 presents an ANOVA test performed on the two main groups characterizing the stimuli, density, and color. We found a significantly different result under density (dense vs. spacious) and color (black vs. red) for the proportion correct and the reaction time, with no significant interaction between them (density and color). The ability to indicate the E direction was significantly reduced, and the reaction time was significantly increased under crowded (dense) compared to uncrowded (spacious) conditions. In addition, tagging (red) conditions improved the ability and the reaction time compared to black E. The fact that no interaction was found between density and color indicates that neither are required to reduce crowding.
To understand whether a relationship exists between crowding and the tagging mechanism, we correlated the percentage of crowding reduction with the percentage of tagging improvement (Figure 6). Importantly, we found a significant correlation between them (r = 0.87, p < 0.001).

3.2. ERP

Note that the total energy levels of all stimuli were equal (according to luminance and contrast), but the matrix size differed; it was larger for the 1-letter spacing (uncrowded) and smaller for the 0.4-letter spacing (crowded). Table 2 presents an ERP ANOVA test performed on the two main groups characterizing their stimuli, density, and color. The ERP results (Figure 2) of P1 followed the physical properties, revealing a significant difference (Table 2A) between the smaller matrix (0.4-letter spacing, the black crowded (BC), the red crowded (RC)) versus the larger (1-letter spacing, BUC, RUC) matrix. However, no significant difference was observed due to tagging within the same matrix density (Table 2A). Also note that the P1 results, at a time window of 50–200 ms, do not correlate with the psychophysics results (Figure 3A). Thus, P1 does not follow the psychophysical results, which is consistent with the assumption that P1 represents the basic stimulus properties [34]. Therefore, apparently, the difference between the two matrix densities (dense and spacious) is due to the visual retinal density of the matrix [50], i.e., there is a higher amplitude for a dense matrix (see the Section 2).
However, we found a significant correlation between the N1 amplitude, measured at the occipital electrodes at a time window of 150–250 ms, and the psychophysical results (Figure 3B). This result is consistent with the assumption that N1 represents lateral interactions [35] and crowding [34].
It was interesting to discover that there is a significant correlation between the P3 amplitude, measured at the frontal electrodes at a time window of 250–650 ms, and the psychophysical results (Figure 3C).
A significant N1 amplitude difference was observed between BC and BUC (Figure 4C) and between BC and RC (Figure 4D). Note that this significant result for N1 was found in the occipital area only (Figure 4A,B showing no difference between BC and BUC and between BC and RC in the global area electrodes). However, no significant difference was observed between BUC and RC (Figure 4E). The above results of the N1 receive further statistical support in Table 2.

4. Discussion

Our previous studies reinforced our results, indicating that crowding also occurs in the fovea [31], and the previous results are consistent with the current results; foveal crowding (under a brief presentation time) relative to a single letter (uncrowded) reduces the sensitivity and increases the reaction time (RT), whereas tagging reduces the crowding effect, improves sensitivity, and shortens the RT (as well as improves binocular summation). We found a significant difference between the uncrowded (one letter spacing) and the tagging conditions (Figure 5(A1)), which slightly differs from our previous study [31]. The difference stems from the fact that, in the previous article, we presented a single letter without any distractors, whereas in this study, the uncrowded condition consisted of a matrix with one-letter spacing (see the Methods Section).
However, it is important to note that this difference between BUC (black uncrowded) and RUC (red uncrowded) is significant (t(9) = 5.13, p < 0.001, ES = 1.04, Figure 5(A1)) and that it is still smaller compared with the differences obtained under tagging conditions (t(9) = 4.80, p < 0.001, ES = 1.62 Figure 5(A3)). These results are consistent with the viewpoint that crowding is affected by the grouping effect; therefore, tagging breaks up the grouping and reduces the crowding effect [11,53,54].
The issues of the underlying mechanisms of crowding, the locus, and the relationship between crowding and other visual phenomena, such as pop-out and masking, have been widely studied [2]. The assumption that the crowding effect is processed early in the occipital area [9,17,18] was demonstrated by the result that the threshold elevation aftereffect was substantially reduced during crowding [55] and because crowding occurs when the target is presented to one eye and the flankers are presented to the other eye [56]. However, other studies assumed that crowding results from an attention deficit [20,21,22,23] based on the finding that crowding, like other attentional effects, is stronger in the upper visual field than in the lower visual field [20] and the fact that the polarity effect in crowding has a temporal resolution similar to that for attention [57]. Thus, these studies apparently assume that crowding occurs beyond V1. An ERP foveal crowding study that analyzed all electrodes (i.e., for all brain regions) [34] found that the suppression of the N1 component occurred under crowded conditions. The authors concluded that crowding occurs during higher-level visual processing.
Here, we chose to use EEG, aiming to better understand the mechanism underlying crowding and tagging and to determine whether they are related. We also examined the ERP results (Figure 2, the upper graphs) according to two main electrodes: OZ was considered as processing in the occipital area, and FZ was considered as processing in the frontal area [58]. The other analyses, ANOVA test (Table 2) correlations (Figure 3), and field power tests (Figure 4) were examined according to the occipital electrodes (PO3, POZ, PO4, O1, OZ, and O2) and the frontal electrodes (F1, FZ, and F2).
In the occipital area, we found that the P1 component represents the basic physical stimulus properties, i.e., the amplitude value density of the matrix is, respectively, is high for dense amplitudes and low for spacious amplitudes (see the Methods Section, stimuli, and procedure) (Figure 2). In addition, we did not find a significant correlation between the P1 amplitude and the proportion correct (Figure 3A). These P1 results are also consistent with those of Peng et al.’s 2018 study [30]. Note that the N1 characteristics can be affected by the P1 characteristics; however, despite the similarity between the P1 amplitudes under dense matrix conditions (BC and RC) and under spacious matrix conditions (BUC and RUC), we found that N1 displays different characteristics between these conditions (Figure 2A upper, comparing the two continuous black and red lines).I In addition, there is a significant and correlative difference for the N1 component between the stimuli according to the psychophysical results (Figure 3B). These results are consistent with the findings of Polat and colleagues in 2012 [35] and with those of Herzog and colleagues in 2014 [34], i.e., that N1 represents facilitatory lateral interactions and crowding. Therefore, in crowded conditions, when facilitation and identification are reduced, the N1 amplitude is also reduced. We also did not find a significant difference between BUC and RC (Figure 4E). This finding corresponds to and reinforces what we found in our previous study [31]. We hypothesized that, because our stimulus was presented in the central fovea, the first effect of tagging results in the matrix grouping being broken up by a different color (red). This hypothesis is consistent with the conclusions of Peng et al.’s study [30]. In addition, differences were found in the N1 value only under different conditions in the occipital electrode (Figure 4), which supports the hypothesis that crowding occurs at a lower visual processing level.
The second effect Is raising the level of attention, even though the stimulus is presented at the location where the participant is looking all the time (it corresponds to the common assumption that attention exists in the gaze area). This result is consistent with the conclusions of Poletti et al. [59]. We found a relationship not only between N1 and crowding (Figure 4C), but also between N1 and tagging (Figure 4D), i.e., the N1 amplitude is higher under uncrowded and tagging conditions compared with crowded conditions. To test this relationship, we examined the frontal area on the P3 component, which represents attention [38,39,40,41] and decision making [44] (at a time window of 250–650 ms) and therefore represents tagging. We found a high correlation between P3 and the proportion correct (Figure 3C), i.e., the P3 amplitude increases as the crowding effect decreases. These findings show that there is a mutual relationship between crowding and tagging. Thus, our ERP data may be consistent with the hypothesis that the awareness of crowding effect (or uncrowded) consists of representations distributed in different cortical locations [47].

5. Conclusions

Our study strengthens the assumption that N1 is reduced under crowded conditions; in addition, we examined N1 in the occipital area and for all electrodes and found that this reduction is only processed in the occipital region. This method strengthens the assumption that crowding probably occurs at a lower visual processing level. We also found a high correlation between P3 and the proportion correct, which led us to assume that there is also a connection between crowding and tagging. The finding that the P3 amplitude is reduced under crowding conditions and increases under tagging conditions strengthens the assumption that crowding serves as a bottleneck for awareness.

Author Contributions

Z.S.-T. conducted the experiment and wrote the main manuscript M.L. and U.P. designed the manuscript and reviewed the data. All authors have read and agreed to the published version of the manuscript.

Funding

Funding was provided by the Israel Science Foundation (Grant No. 2494/21, UP).

Institutional Review Board Statement

This study was conducted in accordance with the Declaration of Helsinki and approved by the Institutional Review Board of Bar Ilan University (ISU202107001). The ethics approval date is 11 October 2021.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The datasets used and/or analyzed during the current study are available from the corresponding author. The data are not publicly available due to the special encryption of the raw EEG data.

Acknowledgments

We would like to thank Oren Kadosh from Bar Ilan University for his assistance in processing and analyzing the EEG data.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Cicchini, G.M.; D’errico, G.; Burr, D.C. Crowding results from optimal integration of visual targets with contextual information. Nat. Commun. 2022, 13, 5741. [Google Scholar] [CrossRef]
  2. Levi, D.M. Crowding—An essential bottleneck for object recognition: A mini-review. Vis. Res. 2008, 48, 635–654. [Google Scholar] [CrossRef]
  3. Wallis, T.S.; Bex, P.J. Visual Crowding Is Correlated with Awareness. Curr. Biol. 2011, 21, 254–258. [Google Scholar] [CrossRef]
  4. Malania, M.; Herzog, M.H.; Westheimer, G. Grouping of contextual elements that affect vernier thresholds. J. Vis. 2007, 7, 1. [Google Scholar] [CrossRef]
  5. Saarela, T.P.; Sayim, B.; Westheimer, G.; Herzog, M.H. Global stimulus configuration modulates crowding. J. Vis. 2009, 9, 5. [Google Scholar] [CrossRef] [PubMed]
  6. Sayim, B.; Westheimer, G.; Herzog, M. Gestalt Factors Modulate Basic Spatial Vision. Psychol. Sci. 2010, 21, 641–644. [Google Scholar] [CrossRef] [PubMed]
  7. Nador, J.; Reeves, A. Crowding expands and is less sensitive to target-flanker differences during a shift of visual attention. Vis. Res. 2023, 212, 108305. [Google Scholar] [CrossRef] [PubMed]
  8. Banks, W.P.; Larson, D.W.; Prinzmetal, W. Asymmetry of visual interference. Percept. Psychophys. 1979, 25, 447–456. [Google Scholar] [CrossRef] [PubMed]
  9. Estes, W.K.; Allmeyer, D.H.; Reder, S.M. Serial position functions for letter identification at brief and extended exposure durations. Percept. Psychophys. 1976, 19, 1–15. [Google Scholar] [CrossRef]
  10. Whitney, D.; Levi, D.M. Visual crowding: A fundamental limit on conscious perception and object recognition. Trends Cogn. Sci. 2011, 15, 160–168. [Google Scholar] [CrossRef] [PubMed]
  11. Levi, D.M.; Toet, A.; Tripathy, S.P.; Kooi, F.L. The effect of similarity and duration on spatial interaction in peripheral vision. Spat. Vis. 1994, 8, 255–279. [Google Scholar] [CrossRef]
  12. Põder, E. Effect of colour pop-out on the recognition of letters in crowding conditions. Psychol. Res. 2007, 71, 641–645. [Google Scholar] [CrossRef]
  13. Reichenthal, A.; Ben-Tov, M.; Ben-Shahar, O.; Segev, R. What pops out for you pops out for fish: Four common visual features. J. Vis. 2019, 19, 1. [Google Scholar] [CrossRef]
  14. Treisman, A.M.; Gelade, G. A feature-integration theory of attention. Cogn. Psychol. 1980, 12, 97–136. [Google Scholar] [CrossRef]
  15. Treisman, A.; Souther, J. Search asymmetry: A diagnostic for preattentive processing of separable features. J. Exp. Psychol. Gen. 1985, 114, 285–310. [Google Scholar] [CrossRef]
  16. Polat, U.; Bonneh, Y.; Sagi, D. Lateral interactions and crowding in amblyopia. J. Vis. 2003, 3, 342. [Google Scholar] [CrossRef]
  17. Parkes, L.; Lund, J.; Angelucci, A.; Solomon, J.A.; Morgan, M. Compulsory averaging of crowded orientation signals in human vision. Nat. Neurosci. 2001, 4, 739–744. [Google Scholar] [CrossRef]
  18. Ho, C.; Cheung, S.-H. Crowding by Invisible Flankers. PLoS ONE 2011, 6, e28814. [Google Scholar] [CrossRef] [PubMed]
  19. Pelli, D.G.; A Tillman, K. The uncrowded window of object recognition. Nat. Neurosci. 2008, 11, 1129–1135. [Google Scholar] [CrossRef] [PubMed]
  20. He, S.; Cavanagh, P.; Intriligator, J. Attentional resolution and the locus of visual awareness. Nature 1996, 383, 334–337. [Google Scholar] [CrossRef] [PubMed]
  21. He, S.; Cavanagh, P.; Intriligator, J. Attentional resolution. Trends Cogn. Sci. 1997, 1, 115–121. [Google Scholar] [CrossRef] [PubMed]
  22. Yeshurun, Y.; Carrasco, M. Attention improves or impairs visual performance by enhancing spatial resolution. Nature 1998, 396, 72–75. [Google Scholar] [CrossRef] [PubMed]
  23. Yeshurun, Y.; Rashal, E. Precueing attention to the target location diminishes crowding and reduces the critical distance. J. Vis. 2010, 10, 16. [Google Scholar] [CrossRef] [PubMed]
  24. Strasburger, H.; Harvey, L.O.; Rentschler, I. Contrast thresholds for identification of numeric characters in direct and eccentric view. Percept. Psychophys. 1991, 49, 495–508. [Google Scholar] [CrossRef] [PubMed]
  25. Spillmann, L. Receptive Fields of Visual Neurons: The Early Years. Perception 2014, 43, 1145–1176. [Google Scholar] [CrossRef]
  26. Winsler, K.; Grainger, J.; Holcomb, P.J. On letter-specific crowding and reading: Evidence from ERPs. Neuropsychologia 2022, 176, 108396. [Google Scholar] [CrossRef] [PubMed]
  27. Chen, J.; He, Y.; Zhu, Z.; Zhou, T.; Peng, Y.; Zhang, X.; Fang, F. Attention-Dependent Early Cortical Suppression Contributes to Crowding. J. Neurosci. 2014, 34, 10465–10474. [Google Scholar] [CrossRef]
  28. Ronconi, L.; Bertoni, S.; Marotti, R.B. The neural origins of visual crowding as revealed by event-related potentials and oscillatory dynamics. Cortex 2016, 79, 87–98. [Google Scholar] [CrossRef]
  29. Chicherov, V.; Herzog, M.H. Targets but not flankers are suppressed in crowding as revealed by EEG frequency tagging. NeuroImage 2015, 119, 325–331. [Google Scholar] [CrossRef]
  30. Peng, C.; Hu, C.; Chen, Y. The Temporal Dynamic Relationship Between Attention and Crowding: Electrophysiological Evidence from an Event-Related Potential Study. Front. Neurosci. 2018, 12, 844. [Google Scholar] [CrossRef]
  31. Siman-Tov, Z.; Lev, M.; Polat, U. Binocular summation is affected by crowding and tagging. Sci. Rep. 2021, 11, 4843. [Google Scholar] [CrossRef]
  32. Lev, M.; Yehezkel, O.; Polat, U. Uncovering foveal crowding? Sci. Rep. 2015, 4, 4067. [Google Scholar] [CrossRef]
  33. Lev, M.; Polat, U. Space and time in masking and crowding. J. Vis. 2015, 15, 10. [Google Scholar] [CrossRef]
  34. Chicherov, V.; Plomp, G.; Herzog, M.H. Neural correlates of visual crowding. NeuroImage 2014, 93, 23–31. [Google Scholar] [CrossRef] [PubMed]
  35. Sterkin, A.; Yehezkel, O.; Polat, U. Learning to be fast: Gain accuracy with speed. Vis. Res. 2012, 61, 115–124. [Google Scholar] [CrossRef] [PubMed]
  36. Sterkin, A.; Yehezkel, O.; Bonneh, Y.S.; Norcia, A.; Polat, U. Backward masking suppresses collinear facilitation in the visual cortex. Vis. Res. 2009, 49, 1784–1794. [Google Scholar] [CrossRef] [PubMed]
  37. Karni, A.; Sagi, D. Where practice makes perfect in texture discrimination: Evidence for primary visual cortex plasticity. Proc. Natl. Acad. Sci. USA 1991, 88, 4966–4970. [Google Scholar] [CrossRef] [PubMed]
  38. Isreal, J.B.; Chesney, G.L.; Wickens, C.D.; Donchin, E. P300 and Tracking Difficulty: Evidence for Multiple Resources in Dual-Task Performance. Psychophysiology 1980, 17, 259–273. [Google Scholar] [CrossRef] [PubMed]
  39. Ritter, W.; Simson, R.; Vaughan, H.G. Event-Related Potential Correlates of Two Stages of Information Processing in Physical and Semantic Discrimination Tasks. Psychophysiology 1983, 20, 168–179. [Google Scholar] [CrossRef] [PubMed]
  40. Schubö, A.; Schröger, E.; Meinecke, C. Texture segmentation and visual search for pop-out targets: An ERP study. Cogn. Brain Res. 2004, 21, 317–334. [Google Scholar] [CrossRef]
  41. Wickens, C.D.; Isreal, J.; Donchin, E. The Event Related Cortical Potential as an Index of Task Workload. Proc. Hum. Factors Soc. Annu. Meet. 1977, 21, 282–286. [Google Scholar] [CrossRef]
  42. Ye, M.; Lyu, Y.; Sclodnick, B.; Sun, H.-J. The P3 Reflects Awareness and Can Be Modulated by Confidence. Front. Neurosci. 2019, 13, 510. [Google Scholar] [CrossRef]
  43. Koivisto, M.; Revonsuo , A. Event-related brain potential correlates of visual awareness. Neurosci. Biobehav. Rev. 2010, 34, 922–934. [Google Scholar]
  44. Nieuwenhuis, S.; Aston-Jones, G.; Cohen, J.D. Decision making, the P3, and the locus coeruleus-norepinephrine system. Psychol. Bull. 2005, 131, 510–532. [Google Scholar] [CrossRef] [PubMed]
  45. Jimenez, M.; Kimchi, R.; Yashar, A. Mixture-modeling approach reveals global and local processes in visual crowding. Sci. Rep. 2022, 12, 6726. [Google Scholar] [CrossRef]
  46. Siman-Tov, Z.; Lev, M.; Polat, U. Foveal crowding modifies a target’s properties under a brief presentation time. J. Vis. 2024. manuscript under Review. [Google Scholar]
  47. Crick, F.; Koch, C. Consciousness and neuroscience. Cereb. Cortex 1998, 8, 97–107. [Google Scholar] [CrossRef]
  48. Yehezkel, O.; Sterkin, A.; Lev, M.; Polat, U. Training on spatiotemporal masking improves crowded and uncrowded visual acuity. J. Vis. 2015, 15, 12. [Google Scholar] [CrossRef] [PubMed]
  49. Bonneh, Y.S.; Adini, Y.; Polat, U. Contrast sensitivity revealed by microsaccades. J. Vis. 2015, 15, 11. [Google Scholar] [CrossRef] [PubMed]
  50. Johannes, S.; Münte, T.; Heinze, H.; Mangun, G. Luminance and spatial attention effects on early visual processing. Cogn. Brain Res. 1995, 2, 189–205. [Google Scholar] [CrossRef] [PubMed]
  51. Luck, S.J. An Introduction to the Event-Related Potential Technique; MIT Press: Cambridge, MA, USA, 2005. [Google Scholar]
  52. Lakens, D. Sample Size Justification. Collabra Psychol. 2022, 8, 1–28. [Google Scholar] [CrossRef]
  53. Sayim, B.; Westheimer, G.; Herzog, M.H. Contrast polarity, chromaticity, and stereoscopic depth modulate contextual interactions in vernier acuity. J. Vis. 2008, 8, 12. [Google Scholar] [CrossRef] [PubMed]
  54. Baylis, G.C.; Driver, J. Visual parsing and response competition: The effect of grouping factors. Percept. Psychophys. 1992, 51, 145–162. [Google Scholar] [CrossRef] [PubMed]
  55. Blake, R.; Tadin, D.; Sobel, K.V.; Raissian, T.A.; Chong, S.C. Strength of early visual adaptation depends on visual awareness. Proc. Natl. Acad. Sci. USA 2006, 103, 4783–4788. [Google Scholar] [CrossRef] [PubMed]
  56. Flom, M.C.; Heath, G.G.; Takahashi, E. Contour Interaction and Visual Resolution: Contralateral Effects. Science 1963, 142, 979–980. [Google Scholar] [CrossRef] [PubMed]
  57. Chakravarthi, R.; Cavanagh, P. Temporal properties of the polarity advantage effect in crowding. J. Vis. 2007, 7, 11. [Google Scholar] [CrossRef]
  58. Doo, W.K.; Kweon, S.L. The Ten Twenty Electrode System: International Federation of Societies for Electroencephalography and Clinical Neurophysiology. Am. J. EEG Technol. 1961, 1, 13–19. [Google Scholar]
  59. Poletti, M.; Rucci, M.; Carrasco, M. Selective attention within the foveola. Nat. Neurosci. 2017, 20, 1413–1417. [Google Scholar] [CrossRef]
Figure 1. Stimuli. (A) 1. A black target letter in a black matrix with crowding of one-letter spacing (black uncrowded, BUC). 2. A red tagged letter in a black matrix with crowding of one-letter spacing (red uncrowded tagging, RUC). (B) 1. A black target letter in a black matrix with crowding of 0.4-letter spacing (black crowded, BC). 2. A red tagged letter in a black matrix with crowding of 0.4-letter spacing (red crowded tagging, RC). Note that the target letter is the letter in the center of the matrix.
Figure 1. Stimuli. (A) 1. A black target letter in a black matrix with crowding of one-letter spacing (black uncrowded, BUC). 2. A red tagged letter in a black matrix with crowding of one-letter spacing (red uncrowded tagging, RUC). (B) 1. A black target letter in a black matrix with crowding of 0.4-letter spacing (black crowded, BC). 2. A red tagged letter in a black matrix with crowding of 0.4-letter spacing (red crowded tagging, RC). Note that the target letter is the letter in the center of the matrix.
Brainsci 14 00169 g001
Figure 2. Each graph represents a different brain area denoted by a green electrode in the EEG inserted montage. Graph (A) represents the occipital electrodes (PO3, POZ, PO4, O1, OZ, and O2), and graph (B) represents the frontal electrodes (F1, FZ, and F2). The upper graphs represent the average (n = 9) ERP data for the correct response results. Each color and line type of the four lines represent a different matrix (on the upper right side of the graphs). The solid black line denotes BC, and the solid red line denotes RC. The dotted black line denotes BUC, and the dotted red line denotes RUC. The bottom graphs correspond to all conditions and represent the distribution of the count (the number of peaks) for the three components, namely P1 (blue), N1 (orange), and P3 (green), to ensure that the upper ERP graph faithfully represents the main components. The time windows are 50–200 ms for P1, 150–250 ms for N1, and 250–650 ms for P3. The y-axis represents the baseline corrected to the mean value of −200 ms, which precedes the stimulus onset data points.
Figure 2. Each graph represents a different brain area denoted by a green electrode in the EEG inserted montage. Graph (A) represents the occipital electrodes (PO3, POZ, PO4, O1, OZ, and O2), and graph (B) represents the frontal electrodes (F1, FZ, and F2). The upper graphs represent the average (n = 9) ERP data for the correct response results. Each color and line type of the four lines represent a different matrix (on the upper right side of the graphs). The solid black line denotes BC, and the solid red line denotes RC. The dotted black line denotes BUC, and the dotted red line denotes RUC. The bottom graphs correspond to all conditions and represent the distribution of the count (the number of peaks) for the three components, namely P1 (blue), N1 (orange), and P3 (green), to ensure that the upper ERP graph faithfully represents the main components. The time windows are 50–200 ms for P1, 150–250 ms for N1, and 250–650 ms for P3. The y-axis represents the baseline corrected to the mean value of −200 ms, which precedes the stimulus onset data points.
Brainsci 14 00169 g002
Figure 3. Pearson’s correlation between behavior and the ERP components. No significant correlation was found between the proportion correct and the P1 amplitude (A). A significant correlation was found between the proportion correct, the N1 amplitude (B), and the P3 amplitude (C). The N1 and P1 components of the occipital electrodes (the PO3, POZ, PO4, O1, OZ, and O2 electrodes) and the P3 component of the frontal electrodes (the F1, FZ, and F2 electrodes). The blue dots in the graph represent the participant’s average data (n = 9) from the 4 conditions (n = 36). The correlation only applies to the blue dots. The red dots represent the outlier data according to Pearson’s correlation formula.
Figure 3. Pearson’s correlation between behavior and the ERP components. No significant correlation was found between the proportion correct and the P1 amplitude (A). A significant correlation was found between the proportion correct, the N1 amplitude (B), and the P3 amplitude (C). The N1 and P1 components of the occipital electrodes (the PO3, POZ, PO4, O1, OZ, and O2 electrodes) and the P3 component of the frontal electrodes (the F1, FZ, and F2 electrodes). The blue dots in the graph represent the participant’s average data (n = 9) from the 4 conditions (n = 36). The correlation only applies to the blue dots. The red dots represent the outlier data according to Pearson’s correlation formula.
Brainsci 14 00169 g003
Figure 4. Global (refers to the average of all electrodes (64 electrodes)) field power (N1) between BC and BUC (A) and between BC and RC (B). Occipital field power (N1) between BC and BUC (C), between BC and RC (D), and between BUC and RC (E). The dots in the graph represent the participant’s average data (n = 9). Each dot in the graphs represents the average of the runs of each participant under two conditions indicated in each graph and separated by a diagonal line. The p value represents a statistical paired t-test.
Figure 4. Global (refers to the average of all electrodes (64 electrodes)) field power (N1) between BC and BUC (A) and between BC and RC (B). Occipital field power (N1) between BC and BUC (C), between BC and RC (D), and between BUC and RC (E). The dots in the graph represent the participant’s average data (n = 9). Each dot in the graphs represents the average of the runs of each participant under two conditions indicated in each graph and separated by a diagonal line. The p value represents a statistical paired t-test.
Brainsci 14 00169 g004
Figure 5. Psychophysics results: the proportion correct (A) and the reaction time (B) for the uncrowded (1), crowded (2), and tagging (3) effects. Comparisons between two conditions for each of the graphs (the proportion correct and the reaction time) are significant. The bars represent the mean across participants with SE (standard error) bars. Three asterisks represent p < 0.001.
Figure 5. Psychophysics results: the proportion correct (A) and the reaction time (B) for the uncrowded (1), crowded (2), and tagging (3) effects. Comparisons between two conditions for each of the graphs (the proportion correct and the reaction time) are significant. The bars represent the mean across participants with SE (standard error) bars. Three asterisks represent p < 0.001.
Brainsci 14 00169 g005
Figure 6. A high Pearson’s correlation for the psychophysical results: the crowding (reduction) [it represents the difference between BUC and BC conditions] and the tagging (improvement) [it represents the difference between RC and BC conditions] effects. The axis values are shown in proportion to the correct units. Each dot in the graph represents the difference in the average runs of each participant.
Figure 6. A high Pearson’s correlation for the psychophysical results: the crowding (reduction) [it represents the difference between BUC and BC conditions] and the tagging (improvement) [it represents the difference between RC and BC conditions] effects. The axis values are shown in proportion to the correct units. Each dot in the graph represents the difference in the average runs of each participant.
Brainsci 14 00169 g006
Table 1. Statistics of psychophysics ANOVA: two factors with replication. The proportion correct (A) and the reaction time (Log) (B) for all four conditions.
Table 1. Statistics of psychophysics ANOVA: two factors with replication. The proportion correct (A) and the reaction time (Log) (B) for all four conditions.
A
Proportion correct
Source of VariationdfFp-valueEffect Size
Sample (dense vs. spacious)130.01930.00000.45 (large)
Columns (black vs. red)117.86220.00020.33 (large)
Interaction (density and color)11.66630.2050
Within36
Total39
B
Reaction time (Log)
Source of VariationdfFp-valueEffect Size
Sample (dense vs. spacious)115.13170.00040.3 (large)
Columns (black vs. red)119.66700.00010.35 (large)
Interaction (density and color)10.21710.6441
Within36
Total39
Table 2. Statistics of ERP ANOVA: two factors with replication. The P1 (occipital) (A), the N1 (occipital) (B), and the N1 (global field) (C) for all four conditions.
Table 2. Statistics of ERP ANOVA: two factors with replication. The P1 (occipital) (A), the N1 (occipital) (B), and the N1 (global field) (C) for all four conditions.
A
Pl (Occipital)
Source of VariationdfFp-valueEffect Size
Sample (dense vs. spacious)16.29780.01730.16 (large)
Columns (black vs. red)10.05140.8220
Interaction (density and color)10.47300.4965
Within32
Total35
B
Nl (Occipital)
Source of VariationdfFp-valueEffect Size
Sample (dense vs. spacious)110.87590.00240.25 (large)
Columns (black vs. red)114.48430.00060.31 (large)
Interaction (density and color)12.14200.1531
Within32
Total35
C
Nl (Global field)
Source of VariationdfFp-value
Sample (dense vs. spacious)10.89670.3508
Columns (black vs. red)10.60660.4418
Interaction (density and color)10.51730.4772
Within32
Total35
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Siman-Tov, Z.; Lev, M.; Polat, U. Probing the Bottleneck of Awareness Formed by Foveal Crowding: A Neurophysiological Study. Brain Sci. 2024, 14, 169. https://doi.org/10.3390/brainsci14020169

AMA Style

Siman-Tov Z, Lev M, Polat U. Probing the Bottleneck of Awareness Formed by Foveal Crowding: A Neurophysiological Study. Brain Sciences. 2024; 14(2):169. https://doi.org/10.3390/brainsci14020169

Chicago/Turabian Style

Siman-Tov, Ziv, Maria Lev, and Uri Polat. 2024. "Probing the Bottleneck of Awareness Formed by Foveal Crowding: A Neurophysiological Study" Brain Sciences 14, no. 2: 169. https://doi.org/10.3390/brainsci14020169

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop