Next Article in Journal
Eye Movement Indicator Difference Based on Binocular Color Fusion and Rivalry
Previous Article in Journal
Eye-Based Recognition of User Traits and States—A Systematic State-of-the-Art Review
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Numerosity Perception and Perceptual Load: Exploring Sex Differences Through Eye-Tracking

Faculty of Social Sciences, Business and Economics, and Law, Åbo Akademi University, 20500 Turku, Finland
*
Author to whom correspondence should be addressed.
J. Eye Mov. Res. 2025, 18(2), 9; https://doi.org/10.3390/jemr18020009
Submission received: 16 October 2024 / Revised: 28 March 2025 / Accepted: 31 March 2025 / Published: 3 April 2025

Abstract

:
This study investigates sex differences in numerosity perception and visuospatial abilities in adults using eye-tracking methodology. We report the results of a controlled dual-task experiment that assessed the participants’ visuospatial and numerosity estimation abilities. We did not observe sex differences in reaction times and accuracy. However, we found that females consistently underestimated numerosity. This underestimation correlated with higher perceptual load in females, as evidenced by shorter fixation durations and increased fixation rates. These findings suggest that perceptual load, rather than visual or spatial abilities, significantly influences numerosity estimation. Our study contributes novel insights into sex differences in both numerosity estimation and visuospatial abilities. These results provide a foundation for future research on numerosity perception across various populations and contexts, with implications for educational strategies and cognitive training programs.

1. Introduction

What causes the sex difference in numerosity perception is one of the unanswered questions in numerosity research. Numerosity perception, the spontaneous extraction and mental representation of set size, is a fundamental ability that significantly influences human behavior [1]. We use numerosity perception to judge quantity or probability, often relying on numbers rather than other available types of information. Understanding the principles of numerosity perception is important: Numerosity perception is not only a life skill with survival value but is also vital in today’s world because it is closely related to mathematical abilities [2].
Numerosity perception, as a fundamental cognitive ability, is universal and emerges regardless of cultural experience or development [3]. That we all have the skill, as do many other complex animal species, does not mean that we are all equally good at numerosity perception. Krueger [4] demonstrated that several systematic differences exist in the accuracy of numerosity estimation. One of the more intriguing differences was that, compared to males, females tend to underestimate numerosity. Krueger speculated that this could be due to potential differences in visual acuity, visuospatial processing, and mathematical skills. Although Krueger’s [4] observation of sex differences in numerosity perception provided an early benchmark, subsequent replication of this specific finding has been limited, underscoring the need for further investigation into its underlying mechanisms.
In this paper, we seek to identify what causes the previously observed sex difference in numerosity perception. To test for potential sex differences in both visuospatial abilities and numerosity perception in a single experiment, we designed a dual-task experiment with two distinct but interrelated tasks: an attention-demanding multitarget visual search and a numerosity estimation task.

1.1. Previous Explanations for Sex Differences in Numerosity Perception

Krueger [4] observed that individuals tend to estimate numerosity accurately or overestimate it until the number of elements reaches a threshold of 10 to 20 items, beyond which they generally begin to underestimate. He also found a pronounced tendency among females to underestimate numerosity compared to males, with women reporting lower numerosity estimates at nearly all stimulus levels.
Krueger proposed several potential explanations for these sex differences in numerosity perception [4]: Firstly, he pointed to differences in visual acuity, which tends to be better in males, especially under well-lit conditions, allowing them to see objects more clearly and estimate numerosity more accurately. Secondly, he considered differences in arithmetic reasoning and quantitative evaluations, suggesting that differences in how the sexes approach magnitude estimation and production could play a role in numerosity estimation. Thirdly, Krueger discussed visual–spatial abilities. Historically, sex differences in visual-spatial ability have been considered among the most persistent individual differences in multifactor psychological tests. Males have decidedly better spatial skill than females [5]. Even if more recent studies report smaller differences in spatial ability, the differences remain non-trivial [6,7,8].

1.2. Selective Attention as a Cause of Sex Difference in Numerosity Perception

One possible explanation for the observed sex difference in numerosity perception is the difference in the ability to inhibit distracting information. Inhibition of responses is one of the few cognitive functions where a consistent sex difference is observed, as supported by research indicating that women exhibit better behavioral inhibitory control than men, demonstrated by faster detection and earlier resolution of response conflicts [9]. Numerosity perception involves searching for and identifying objects that belong to a group. In natural environments this means, among other things, distinguishing candidate targets from distractors. Selective attention preferentially processes relevant sensory information and filters out distracting information [10]. Selective attention depends on an active inhibition mechanism, which activates following the initial parallel processing of both target and distractor information [11]. Inhibition suppresses cognitive processing that is irrelevant to the task [12]. Neurophysiological observations suggest that most of the brain’s electrophysiological activity is specifically related to inhibition [13].
Although recent findings suggest that some of the differences in sustained attentional control, such as omission and commission errors, are related to sociocultural factors, the sex difference in reaction time appears to persist even after controlling for sociocultural influence [14]. The difference in inhibitory control is not limited to selective attention: Men are significantly more prone than women to impulse control disorders [15], for example, impulsive behavior and addictions [16]. The central importance of inhibitory control in producing selective attention, along with the strong sex difference observed in inhibitory control, suggests that differences in numerosity perception may also be influenced by sex-related differences in inhibition. Specifically, when exposed to a moderate number of candidate targets (including both task-relevant targets and distractors), women are expected to be more effective at ignoring distractors compared to men. This greater efficiency in distractor suppression may lead to lower estimates of distractor group sizes. Furthermore, more effective inhibition should result in a different distribution of mental load between men and women.
According to the load theory of selective attention [17], distractor processing depends on the interaction between perceptual and working memory load. High perceptual load in task-relevant stimuli exhausts perceptual capacity, leaving no spare capacity for distractor processing, thus enabling early selection. Conversely, low perceptual load leaves spare capacity that spills over to distractor processing, leading to late selection. Additionally, higher working memory load in tasks requiring active cognitive control increases distractor interference by limiting processing priorities.

1.3. Current Study

In this publication, we use previously unanalyzed data, which are part of a larger material. We have published results based on other subsets of the material in our previous publications [18,19]. To assess both visuospatial abilities and numerosity perception within a single experiment, we designed a dual-task setup. This included an attention-intensive visual search task paired with a numerosity estimation task. Our goal was to examine sex differences in numerosity estimation in a more natural-like environment. As König et al. [20] highlighted, it is important to study eye movements not only in controlled settings but also in more natural, unrestricted environments. To address this, we incorporated a novel approach by using three-dimensional moving objects rather than the traditionally employed two-dimensional ones. Through this dual-task experiment, we sought to explore whether the sex differences observed in numerosity estimation may stem from functional differences in selective attention, possibly due to variations in the inhibitory mechanism.

2. Methods

2.1. Participants

Twenty-one participants, twelve females (mean age = 29.2, range = 20–50), partook in the experiment. Each had normal or corrected-to-normal vision. Participation was voluntary, with written consent obtained beforehand. Participant privacy was maintained in accordance with the Declaration of Helsinki. Participants were unaware of the experimental conditions prior to the study.

2.2. Materials

In our experiment the participants were shown a simulated 3D street view tour, which included variable numbers of oncoming vehicles, cyclists, and pedestrians. We utilized 3D moving objects to create realistic 3D video simulations, based on data from the KITTI dataset [21]. The KITTI dataset was recorded using a moving vehicle equipped with camera systems, laser scans, GPS/IMU systems, and accelerometers while driving in and around Karlsruhe, Germany. Originally designed to advance computer vision and robotics for autonomous driving, this dataset provided a rich basis for generating stimuli.
The experimental stimuli were designed to balance experimental control with real-world relevance. These videos were processed using XVIZ v2.0.0 and visualized with streetscape.gl v1.0.0 (both developed by Uber Advanced Technologies Group, San Francisco, CA, USA) within the Mapbox environment (Mapbox GL JS v2.0.0; Mapbox, Inc., Washington, DC, USA). To allow for precise manipulation of target and non-target objects, we replaced the original vehicles, pedestrians, and cyclists with simplified 3D bounding boxes of different colors. For example, pedestrians were consistently represented by orange boxes, while bicycles were represented by pink boxes. Importantly, the speed, movement patterns, and direction of movement of all original objects were preserved, ensuring that the resulting stimuli maintained a naturalistic representation of a driving environment. The final stimuli consisted of 22 short video files, each approximately one minute in length (see Figure 1 for an example). The experiment was programmed with SR Research ExperimentBuilder v2.3.38 (SR Research Ltd., Ottawa, ON, Canada) and controlled using the EyeLink Portable Duo v6.12 system (SR Research Ltd., Ottawa, ON, Canada). Eye movements were recorded with the EyeLink Portable 1000 (SR Research Ltd., Ottawa, ON, Canada), using a head-fixed mount and chest rest to ensure consistent head positioning, viewing distance, and minimized data loss across participants.
To investigate the effects of perceptual and working memory load on participant performance, we systematically manipulated both factors within our task. Perceptual load (PL) was modulated by varying the number of distractor objects (oncoming vehicles) present in the driving scene. Increasing the number of distractors elevated the visual complexity of the display, requiring participants to process a greater amount of visual information. This manipulation primarily affected early perceptual processing. Working memory load (WML) was manipulated by the presence and number of targets (oncoming pedestrians or cyclists). While the presence and number of targets directly influenced the level of working memory load (WML), participants were instructed to remember the features of potential targets throughout the experiment. This requirement to maintain target features in working memory resulted in a baseline level of WML that was present even in trials without visible targets. The presence of targets further increased WML, as participants were required to encode and maintain information about each individual target. Furthermore, participants needed to track which targets they had already responded to, adding a further demand on working memory, particularly as the number of targets increased. Each video contained between zero and four targets, directly impacting the WML. Figure 1 provides an illustration of the four load conditions resulting from this 2 × 2 factorial manipulation (low PL—low WML, low PL—high WML, high PL—low WML, and high PL—high WML). The high PL—high WML condition presented participants with both a high number of distractors and the cognitive challenge of retaining information about multiple targets. Conversely, the low PL—low WML condition minimized perceptual demands (few distractors) while maintaining a baseline level of working memory load (WML) due to the ongoing need to remember target features. To preserve the naturalistic properties of the KITTI dataset, object counts within each load condition varied slightly within predefined thresholds (e.g., 1–9 distractors in low PL—low WML; 1–9 objects, including targets, in low PL—high WML). This variability ensured ecological validity while maintaining experimental control over load categories. The number of trials for each condition was as follows: (1) low PL—low WML: 6 trials; (2) low PL—high WML: 4 trials; (3) high PL—low WML: 2 trials; (4) high PL—high WML: 10 trials.

2.3. Procedure

The task consisted of two blocks. The first block included two instructional examples, while the second block had twenty-two trials with approximately one-minute video files. All participants viewed the same videos in the second block, featuring varying numbers of targets and irrelevant objects. The number of targets presented at the same time on the screen varied from one to four. Targets appeared in 14 of the 22 videos, while the remaining 8 videos contained only distractors. The number of objects (targets and distractors) in a video ranged from 1 to 17. The procedure of one session in the experiment is shown in Figure 2.
Participants performed two tasks during the experiment: 1. Object Identification Task. Participants were required to press predefined buttons when specific targets (pedestrians or bicycles) appeared in the videos; 2. Numerosity Estimation Task. At the end of the experiment, participants were asked to estimate the number of objects presented in the final trial. Participants received photo instructions on identifying targets before the experiment, without information on the number of objects or visual conditions. They were unaware of the numerosity task to prevent active counting. The entire experiment lasted about 25 min. In designing the numerosity estimation task, we adopted an approach similar to Krueger’s [4] method, which focused on singular judgments to mitigate bias. Krueger argued that avoiding comparative and sequential measurements could yield a more precise and authentic measure of perceived numerosity.

2.4. Measurements

Various signals have been identified as valid indicators of an individual’s mental load, such as cardiovascular signals, brain activities, and eye movements [22,23,24,25]. Notably, eye movements are particularly effective at capturing changes in mental load corresponding to the visual demands of a task [24].
We use fixation-related parameters, rather than pupil size, to measure mental load. A fixation is commonly defined as the period between the end of one saccade and the start of the next, during which the eyes remain relatively still, and it is a commonly used measure in eye-tracking research [26]. During active scene perception, our eyes move via saccadic movements, fixating on objects and elements for varying times. Variability in fixation duration is influenced by attentional, perceptual, and cognitive processes involved in scene analysis and comprehension [27]. Fixation duration is a reliable predictor of task performance, correlating with high load [28,29]. While pupil size has shown a positive correlation with mental load across various scenarios [30,31,32], it cannot differentiate between perceptual and working memory load.
Fixation rate—the number of fixations per second or minute—is negatively correlated with task difficulty [33]. Although it varies with task performance, it can serve as a measure of mental load [28]. However, while fixation-related parameters are valuable, there is still some ongoing discussion regarding their effectiveness for assessing mental load [22,30,31,32]. Studies have shown conflicting results, with some finding increased fixation durations associated with higher load [31,32], while others found decreased durations [30]. Tao et al. [24] hypothesize that these inconsistencies arise from differences in the types of mental load assessed. Each task imposes different levels of perceptual load (early processing filter) or working memory load (late processing filter), affecting what and how many items can be processed. Load theory of selective attention [17] posits that the allocation of attentional resources depends on the level of perceptual and working memory load. High perceptual load depletes attentional capacity, preventing distractor processing and facilitating early selection. Conversely, low perceptual load leaves spare capacity, allowing for late selection and greater distractor interference.
Liu et al. [22] propose that studies showing decreased fixation durations with increased load may have involved high perceptual load, where fewer items are processed per fixation, resulting in shorter durations and higher fixation rates. Conversely, high working memory load involves processing all relevant items in a single fixation, leading to longer durations and fewer fixations. Therefore, perceptual and working memory load have opposite effects on fixation duration and rate. Liu et al. [22] propose a method to analyze both fixation duration and fixation rate to assess mental load. Their findings indicate that different types of mental load have opposite effects on fixation-related parameters. Specifically, they observed shorter fixation durations and an increased number of fixations under high perceptual load, whereas longer fixation durations and fewer fixations occurred under high working memory load. Thus, perceptual and working memory load exert opposite influences on both fixation duration and fixation rate. To gain deeper insights into sex differences in perception, we combined fixation duration and fixation rate to assess mental load.
For the eye-tracking analysis, saccades were identified using the EyeLink system’s standard parameters and velocity-based saccade detection algorithm. This algorithm calculated instantaneous velocity and acceleration for each data sample and compared these values to preset thresholds. A saccade was detected if either velocity or acceleration surpassed the thresholds, and a fixation was identified once both fell below the thresholds. The system ensured the saccade signal persisted for a critical duration before confirming the initiation or termination of a saccade. The default thresholds for the head-stabilized (chinrest) mode were set at a velocity of 30°/s and an acceleration of 8000°/s2. Fixations were recorded whenever the pupil was visible and no saccade was occurring. The data from all participants who completed the experimental protocol were included in the subsequent analyses. There were no exclusions based on data loss, technical issues, or any other criteria.

3. Results

3.1. Mental Load

Descriptive statistics for fixation rate and average fixation duration are summarized in Table 1 and Table 2 and Figure 3. The Mann–Whitney U test was used to test the null hypothesis that there are no statistically significant differences in fixation measurements between sex groups. This test was chosen because the data were non-normally distributed, and prior studies recommended using the median for real-world data. Eye movement characteristics were similar to those typically observed in scene-viewing studies.

3.2. Average Fixation Duration

A Shapiro–Wilk normality test revealed that the data were non-normally distributed, justifying the use of the Mann–Whitney U test. This test was conducted to investigate the difference in average fixation duration between females (N trials = 264, Mdn = 342.72, IQR = 75.70) and males (N trials = 198, Mdn = 360.42, IQR = 86.64). The results were statistically significant (U = 22,874.00, p = 0.022), indicating a difference in average fixation duration between the two groups. Thus, males had significantly higher average fixation durations than females.

3.3. Fixation Rate

Using the Mann–Whitney U test, we analyzed the difference in fixation rate between females (Mdn = 157.06, IQR = 32.98) and males (Mdn = 152.8, IQR = 33.65). The data were non-normally distributed. The test results were statistically significant (U = 22,959.50, p = 0.025), indicating that females had a significantly higher fixation rate than males. These findings highlight significant differences in fixation measurements between sexes, with females exhibiting shorter fixation durations and higher fixation rates.

3.4. Accuracy, Reaction Time, and Numerosity Estimation

The results showed similar accuracy rates for females (56.61%) and males (57.34%) and negligible differences in reaction times (females: 1950.03 ms; males: 1954.8 ms). However, males’ reaction times decreased with task complexity (2300 ms to 1540 ms), suggesting improved efficiency, while females’ reaction times increased (1950 ms to 2200 ms), indicating greater mental effort [18].
The analysis of potential sex differences in numerosity estimation revealed statistically significant differences between males and females (U = 25.000, W = 103.000, p = 0.041). On average, females underestimated numerosity more than males (M females = 11.833, SD females = 4.407; M males = 16.333, SD males = 5.292). When comparing estimated numerosity to the actual value (17), males’ estimates (M = −0.667, SD = 5.292) did not significantly differ from the actual numerosity, while females’ estimates (M = −5.167, SD = 4.407) were significantly lower. These findings suggest a potential sex-based bias in numerosity perception, with females tending to underestimate numerosity compared to males [19].
To examine sex differences in eye-tracking measures, we averaged fixation duration and fixation rate across trials for each participant per video, yielding one value per participant per condition. A two-way MANOVA was conducted to examine the effects of sex (male, female) and video (22 levels) on average fixation duration and fixation rate. The results revealed a significant main effect of sex, F (2, 417) = 4.458, p = 0.012, indicating overall differences in fixation patterns between males and females. There was also a significant main effect of video, F (42, 836) = 2.261, p < 0.001, suggesting that stimulus-specific properties (e.g., object density, target presence) influenced both fixation duration and fixation rate. This video effect was explored to provide context for interpreting sex differences, as variations in scene complexity could modulate eye movement patterns. We did not collapse data by load type due to variability in object counts within categories, and preliminary analyses showed introduced noise.
Follow-up univariate analyses confirmed that sex had a significant effect on average fixation duration, F (1, 418) = 6.663, p = 0.010, with females exhibiting shorter fixation durations than males. The effect of sex on fixation rate was marginally significant, F (1, 418) = 3.295, p = 0.070, with females tending to make more frequent fixations. The main effect of video remained significant for both average fixation duration, F (21, 418) = 2.204, p = 0.002, and fixation rate, F (21, 418) = 3.029, p < 0.001, reinforcing the role of stimulus properties in guiding eye movements. The sex × video interaction remained non-significant for both dependent variables (p > 0.78).
Females showed shorter fixation durations (Mdn = 342.72 ms) and higher fixation rates (Mdn = 157.06 fixations/min) than males (Mdn = 360.42 ms and 152.80 fixations/min), patterns that align with the load theory of selective attention [17]. These suggest higher perceptual load in females and higher working memory load in males, influencing distractor processing differently across sexes.

4. Discussion

Our study builds on Krueger’s pioneering work [4], offering a more nuanced understanding of the factors influencing sex differences in numerosity perception. Krueger suggested that the underestimation of numerosity in females could be due to potential differences in visual acuity, visuospatial processing, and mathematical skills. However, our findings, showing equal performance of females and males in dynamic visuospatial tasks, challenge the notion that these abilities underlie the underestimation in females. Instead, our results indicate that differences in inhibition of distractors between sexes play a significant role. Our proposed explanation rests on the idea that sex differences in numerosity perception become most pronounced in situations requiring selective attention and the inhibition of irrelevant stimuli. Where “distractors” are present, the efficiency of distractor inhibition determines how many items ultimately reach working memory for counting. From our data, it appears that women’s stronger inhibition of distractors causes them to count fewer items, thus systematically underestimating numerosity. Baseline underestimation appears in simpler enumeration tasks (as in Krueger [4]), and additional distractors can accentuate that difference. Rather than contradicting Krueger’s findings, our work offers a mechanism—distractor inhibition—to explain how a pre-existing female bias toward underestimation might be magnified under higher-load conditions.
Our findings are further supported by evidence that numerosity perception is influenced by inhibitory control, as demonstrated across developmental stages. Gray and Reeve [34] noted that general cognitive functions, particularly working memory and response inhibition, support the early development of numerosity estimation (dot enumeration), with research linking these abilities in older children and adults to variations in inhibitory control [35,36,37,38]. Specifically, inhibitory control enhances estimation efficiency by preventing revisits to already processed items [38], a mechanism mirrored in our task where females’ shorter fixation durations and higher fixation rates suggest effective distractor suppression, leading to lower numerosity. Males’ longer fixations and lower rates indicate broader processing, consistent with less efficient inhibition. This aligns with the load theory of selective attention [17], where high perceptual load (PL) in females reduces distractor interference early (p. 342), while males’ higher working memory load (WML) increases it (p. 351).
The observed sex difference in the distribution of mental load suggests that men had to work harder to inhibit the distractors (higher working memory load), but on the other hand, they used fewer resources to inspect the objects (lower perceptual load). High working memory load, due to the inability to suppress irrelevant items, increases the number of items processed within a fixation, thereby lengthening fixation duration and allowing participants to obtain the whole image with fewer fixations [22]. Women, on the other hand, were able to filter out some distractors more effectively, as reflected in their reporting of lower numbers, but they dedicated more attention to inspecting the objects, indicating a higher perceptual load.
Consequently, one potential explanation for sex differences in numerosity perception could be disparities in inhibitory control, which our data suggest modulate load distribution. Research consistently shows sex differences in inhibition, with women demonstrating superior behavioral inhibitory control, evidenced by faster detection and resolution of response conflicts [9]. Although some differences in sustained attentional control, such as omission and commission errors, are tied to sociocultural factors, sex differences in reaction time persist even after accounting for these influences [14]. Given inhibitory control’s critical role in selective attention [17], it likely contributes to our observed sex differences, where females’ efficient distractor rejection aligns with lower estimates and higher PL, while males’ broader processing reflects greater WML. This supports our hypothesis that women’s stronger inhibition leads to a different mental workload distribution compared to men, amplifying underestimation in distractor-rich contexts like our 3D video task.
Displays that create a high degree of competition among objects necessitate a strong bias to select the target for further processing. Several studies indicate that fixation duration tends to increase with increased competition among objects. In tasks where fixation duration increased, it was associated with a rise in the number of inspected items [39]. In our study, objects competed intensely with candidate targets in the video, resulting in high perceptual load. Conversely, minimal competition between task-relevant stimuli requires little top-down control to resolve the conflict, leading to low perceptual load [40]. Torralbo and Beck [41] also found that stimuli inducing greater competition in the visual cortex reduced distractor interference, similar to high perceptual load. Therefore, differences in performance between females and males on visuospatial tasks across various studies may be attributed to differing experimental conditions and the perceptual load experienced by participants. For instance, studies often report mixed results. One study found no significant differences between sexes in the efficiency and accuracy of spatial orientation [6]. In contrast, another study showed that males outperformed females in both large-scale and small-scale spatial abilities, with a significantly larger effect size in large-scale spatial ability [7]. Additionally, a male advantage in spatial processing ability was reported, indicating that males outperformed female students in mental rotation tasks [8].
Interestingly, the findings of this study, which suggest that sex differences in numerosity perception may stem from differences in distractor inhibition and perceptual load distribution, open avenues for future research into cognitive training and its potential to mitigate these differences. Evidence suggests that perceptual capacity limits are not fixed but can be improved through targeted training. Prior studies have demonstrated that both children and adults who are experienced action video game players exhibit significantly greater resistance to distractors under high perceptual load conditions [40,42,43,44]. Longitudinal research further supports this notion, showing that perceptual training effects persist over time, leading to more efficient attentional control and reduced neural activity during similar tasks [45,46]. However, caution is warranted when interpreting these findings, as some studies argue that the causal relationship between action video game training and cognitive improvements is not as robust as often claimed, with pre-existing individual differences potentially driving observed effects [47,48]. Furthermore, while meta-analyses confirm positive impacts of action video games on attention and perception, concerns about methodological limitations, publication bias, and the need for larger, well-controlled studies highlight the necessity for more rigorous experimental designs [49]. Incorporating eye-tracking measures into training studies could help address these concerns by providing more precise insights into how perceptual and working memory loads evolve with practice, ultimately refining adaptive learning strategies.

5. Limitations

It is essential to point out that there is no single measure universally valid for determining mental load across varied scenarios. Physiological responses to mental load are highly scenario-dependent and influenced by numerous task characteristics and individual differences. Consequently, different physiological measures perform differently in various study contexts [24]. In their literature review, Tao et al. reported that fixation duration showed a statistically significant difference in discriminating tasks with varied mental load levels in 73% of cases. Although studies suggest that fixation-related information can reflect the effect of mental load, opposite effects have been found. For example, as mental load increased, fixation duration was either found to increase [31,32] or decrease [22,30]. These apparently contradictory findings may result from imprecise definitions of perceptual load and the assessment of different types of mental load. Specifically, fixation duration decreased as task demand increased in simulated flight tasks [50] and simulated driving tasks [51]. Fixation rate was positively correlated with mental load in pilot mission tasks [52] and hypermedia interaction tasks [53]. An important limitation of this study is its relatively small sample size. While our findings suggest a consistent pattern of results, caution should be taken when generalizing them to larger populations. Future research with a more substantial participant pool would help confirm the robustness of these effects and enable more comprehensive statistical modeling. Furthermore, the uneven distribution of trial types, designed to mirror the real-world variability of the KITTI dataset, might have subtly shaped the observed load effects across conditions, enriching our understanding of their dynamic interplay.

6. Conclusions

In our experiment measuring both visuospatial abilities and numerosity estimation, we found that females and males had comparable reaction times and accuracy. However, females consistently underestimated numerosity. This underestimation was linked to lower working memory load and higher perceptual load in females, as evidenced by shorter fixation durations and higher fixation rates. Taken together, this suggest that women are better at inhibiting distractors and, consequently, report lower and less accurate numerosity estimates compared to men, who are less efficient at inhibiting the distractors.
The similarity in reaction times and accuracy suggests that basic visuospatial abilities are comparable between sexes. However, the observed difference in the distribution of mental load between working memory load and perceptual load implies that there is a difference in strategies men and women employ in visual perception tasks. This does not necessarily point to differences in underlying visuospatial abilities but rather indicates that a difference in how the inhibitory mechanisms function affects how these abilities are utilized.
Our study addresses a previously underexplored aspect of numerosity estimation by examining sex differences in both numerosity estimation and visuospatial processing. These findings contribute to a deeper understanding of numerosity perception across different contexts and populations and suggest promising directions for future research.
Understanding how perceptual load influences numerosity estimation can inform educational strategies and cognitive training programs designed to mitigate these differences. Future studies should continue to explore these factors to enhance our comprehension of numerosity perception, ultimately contributing to more effective interventions and support systems for diverse populations.

Author Contributions

Conceptualization, J.B. and A.Ö.; methodology, J.B.; software, J.B.; validation, J.B.; formal analysis, J.B.; investigation, J.B.; resources, J.B.; data curation, J.B.; writing—original draft preparation, J.B.; writing—review and editing, J.B. and A.Ö; visualization, J.B.; supervision, J.B. and A.Ö.; project administration, J.B. and A.Ö.; funding acquisition, J.B. and A.Ö. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

In accordance with the guidelines of the Board for Research Ethics at Åbo Akademi University (FEN), formal ethical approval is required only for research that meets certain criteria, such as involving interventions that may harm participants or working with vulnerable populations (e.g., children under 15 without parental consent). As this study did not fall into any of these categories, ethical approval was not required.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Data are contained within this article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. He, L.; Zhang, J.; Zhou, T.; Chen, L. Connectedness affects dot numerosity judgment: Implications for configural processing. Psychon. Bull. Rev. 2009, 3, 509–517. [Google Scholar] [CrossRef]
  2. Cicchini, G.M.; Anobile, G.; Burr, D. Spontaneous perception of numerosity in humans. Nat. Commun. 2016, 7. [Google Scholar] [CrossRef]
  3. Mazzocco, M.M.; Feigenson, L.; Halberda, J. Preschoolers’ precision of the approximate number system predicts later school mathematics performance. PLoS ONE 2011, 6, e23749. [Google Scholar] [CrossRef]
  4. Krueger, L.E. Single judgments of numerosity. Percept. Psychophys. 1982, 31, 175–182. [Google Scholar] [CrossRef] [PubMed]
  5. Gaulin, S.J.C.; Fitzgerald, R.W. Sex-differences in spatial ability—An evolutionary hypothesis and test. Am. Nat. 1986, 127, 74–88. [Google Scholar]
  6. Dong, W.; Zhan, Z.; Liao, H.; Meng, L.; Liu, J. Assessing Similarities and Differences between Males and Females in Visual Behaviors in Spatial Orientation Tasks. ISPRS Int. J. Geo-Inf. 2020, 9, 115. [Google Scholar] [CrossRef]
  7. Yuan, L.; Kong, F.; Luo, Y.; Zeng, S.; Lan, J.; You, X. Gender Differences in Large-Scale and Small-Scale Spatial Ability: A Systematic Review Based on Behavioral and Neuroimaging Research. Front. Behav. Neurosci. 2019, 13, 128. [Google Scholar] [CrossRef]
  8. Casey, M.B.; Nuttall, R.; Pezaris, E.; Benbow, C.P. The Influence of Spatial Ability on Gender Differences in Mathe-matics College Entrance Test Scores across Diverse Samples. Dev. Psychol. 1995, 31, 697–705. [Google Scholar] [CrossRef]
  9. Yuan, J.; He, Y.; Qinglin, Z.; Chen, A.; Li, H. Gender differences in behavioral inhibitory control: ERP evidence from a two-choice oddball task. Psychophysiology 2008, 45, 986–993. [Google Scholar] [CrossRef]
  10. Pagnotta, M.F.; Pascucci, D.; Plomp, G. Selective attention involves a feature-specific sequential release from inhibitory gating. Neuroimage 2022, 246, 118782–118795. [Google Scholar] [CrossRef]
  11. Neumann, E.; DeSchepper, B.G. Costs and benefits of target activation and distractor inhibition in selective attention. J. Exp. Psychol. Learn. Mem. Cogn. 1991, 17, 1136–1145. [Google Scholar] [CrossRef]
  12. Van Moorselaar, D.; Slagter, H.A. Inhibition in selective attention. Ann. N. Y. Acad. Sci. 2020, 1464, 204–221. [Google Scholar] [CrossRef] [PubMed]
  13. Jensen, O.; Mazaheri, A. Shaping functional architecture by oscillatory alpha activity: Gating by inhibition. Front. Hum. Neurosci. 2010, 4, 186. [Google Scholar] [CrossRef]
  14. Riley, E.; Okabe, H.; Germine, L.; Wilmer, J.; Esterman, M.; DeGutis, J. Gender Differences in Sustained Attentional Control Relate to Gender Inequality across Countries. PLoS ONE 2016, 11, e0165100. [Google Scholar] [CrossRef]
  15. Li, C.S.; Zhang, S.; Duann, J.R.; Yan, P.; Sinha, R.; Mazure, C.M. Gender Differences in Cognitive Control: An Extend-ed Investigation of the Stop Signal Task. Brain Imaging Behav. 2009, 3, 262–276. [Google Scholar] [CrossRef] [PubMed]
  16. Mansouri, F.A.; Fehring, D.J.; Gaillard, A.; Jaberzadeh, S.; Parkington, H. Sex dependency of inhibitory control functions. Biol. Sex Differ. 2016, 7, 11. [Google Scholar] [CrossRef]
  17. Lavie, N.; Hirst, A.; de Fockert, J.W.; Viding, E. Load Theory of Selective Attention and Cognitive Control. J. Exp. Psychol. Gen. 2004, 133, 339–354. [Google Scholar] [CrossRef]
  18. Bend, J.; Öörni, A. Effects of Augmented Reality on Visuospatial Abilities of Males and Females. In Computing, Internet of Things and Data Analytics. ICCIDA 2023; García Márquez, F.P., Jamil, A., Ramirez, I.S., Eken, S., Hameed, A.A., Eds.; Studies in Computational Intelligence; Springer: Cham, Switzerland, 2024; Volume 1145. [Google Scholar] [CrossRef]
  19. Bend, J.; Öörni, A. Exploring Inter-Sex Differnces in numerosity perception and cognitive load. IJEP Int. J. Eng. Padagog. 2024, 14, 125. [Google Scholar] [CrossRef]
  20. König, P.; Wilming, N.; Kietzmann, T.C.; Ossandón, J.P.; Onat, S.; Ehinger, B.V.; Gameiro, R.R.; Kaspar, K. Eye movements as a window to cognitive processes. J. Eye Mov. Res. 2016, 9. [Google Scholar] [CrossRef]
  21. Geiger, A.; Lenz, P.; Stiller, C.; Urtasun, R. Vision meets robotics: The kitti dataset. Int. J. Robot. Res. 2013, 32, 1231–1237. [Google Scholar] [CrossRef]
  22. Liu, J.C.; Li, K.A.; Yeh, S.L.; Chien, S.Y. Assessing Perceptual Load and Cognitive Load by Fixation-Related Information of Eye Movements. Sensors 2022, 22, 1187. [Google Scholar] [CrossRef] [PubMed]
  23. Charles, R.L.; Nixon, J. Measuring mental load using physiological measures: A systematic review. Appl. Ergon. 2019, 74, 221–232. [Google Scholar] [CrossRef] [PubMed]
  24. Tao, D.; Tan, H.; Wang, H.; Zhang, X.; Qu, X.; Zhang, T. A Systematic Review of Physiological Measures of Mental Load. Int. J. Environ. Res. Public Health 2019, 16, 2716. [Google Scholar] [CrossRef]
  25. Andrzejewska, M.; Stolińska, A. Comparing the Difficulty of Tasks Using Eye Tracking Combined with Subjective and Behavioural Criteria. J. Eye Mov. Res. 2016, 9. [Google Scholar] [CrossRef]
  26. Holmquist, K.; Nyström, M.; Andersson, R.; Dewhurst, R.; Jarodzka, H.; van de Weijer, J. Eye Tracking: A Comprehensive Guide to Methods and Measures; Oxford University Press: Oxford, UK, 2011. [Google Scholar]
  27. Henderson, J.M.; Choi, W. Neural Correlates of Fixation Duration during Real-world Scene Viewing: Evidence from Fixation-related (FIRE) fMRI. J. Cogn. Neurosci. 2015, 27, 1137–1145. [Google Scholar] [CrossRef]
  28. Van Orden, K.F.; Lambert, W.; Mäkelig, S.; Jung, T.P. Eye activity correlates of load during visuospatial memory task. Hum. Factors 2001, 43, 111–121. [Google Scholar] [CrossRef]
  29. Tang, H.; Day, E.; Kendhammer, L.; Moore, J.N.; Brown, S.A.; Pienta, N.J. Eye Movement Patterns in Solving Science Ordering Problems. J. Eye Mov. Res. 2016, 9. [Google Scholar] [CrossRef]
  30. Mallick, R.; Slayback, D.; Touryan, J.; Ries, A.J.; Lance, B.J. The use of eye metrics to index cognitive load in video games. In Proceedings of the 2016 IEEE Second Workshop on Eye Tracking and Visualization (ETVIS); 2016; pp. 60–64. [Google Scholar] [CrossRef]
  31. Di Stasi, L.L.; Antolí, A.; Cañas, J.J. Evaluating mental load while interacting with computer-generated artificial environments. Entertain. Comput. 2013, 4, 63–69. [Google Scholar] [CrossRef]
  32. Recarte, M.A.; Nunes, L.M. Effects of verbal and spatial-imagery tasks on eye fixations while driving. J. Exp. Psychol. Appl. 2000, 6, 31–43. [Google Scholar] [CrossRef]
  33. Nakayama, M.; Takahashi, K.; Shimizu, Y. The act of task difficulty and eye-movement frequency for the ‘Oculomotor indices’. In Proceedings of the 2002 Symposium on Eye-Tracking Research & Applications, New Orleans, LA, USA, 25–27 March 2002; pp. 37–42. [Google Scholar] [CrossRef]
  34. Gray, S.A.; Reeve, R.A. Preschoolers’ Dot Enumeration Abilities Are Markers of Their Arithmetic Competence. PLoS ONE 2014, 9, e94428. [Google Scholar] [CrossRef]
  35. Melcher, D.; Piazza, M. The role of attentional priority and saliency in determining capacity limits in enumeration and visual working memory. PLoS ONE 2011, 6, e29296. [Google Scholar] [CrossRef] [PubMed]
  36. Piazza, M.; Fumarola, A.; Chinello, A.; Melcher, D. Subitizing reflects visuo-spatial object individuation capacity. Cognition 2011, 121, 147–153. [Google Scholar] [CrossRef]
  37. Wutz, A.; Melcher, D. Temporal buffering and visual capacity: The time course of object formation underlies capacity limits in visual cognition. Atten. Percept. Psychophys. 2013, 75, 921–933. [Google Scholar] [CrossRef]
  38. Trick, L.M.; Audet, D.; Dales, L. Age differences in enumerating things that move: Implications for the development of multiple-object tracking. Mem. Cogn. 2003, 31, 1229–1237. [Google Scholar] [CrossRef] [PubMed]
  39. Wienrich, C.; Heße, U.; Müller-Plath, G. Eye movements and attention in visual feature search with graded target-distractor-similarity. J. Eye Mov. Res. 2009, 3. [Google Scholar] [CrossRef]
  40. Murthy, G.; Groeger, J.A.; Greene, C.M. Twenty years of load theory—Where are we now, and where should we go next? Psychon. Bull. Rev. 2016, 23, 1316–1340. [Google Scholar] [CrossRef]
  41. Torralbo, A.; Beck, D.M. Perceptual load-induced selection as a result of local competitive interactions in visual cortex. Psychol. Sci. 2008, 19, 1045–1050. [Google Scholar] [CrossRef]
  42. Dye MW, G.; Green, C.S.; Bavelier, D. The development of attention skills in action video game players. Neuropsychologia 2009, 47, 1780–1789. [Google Scholar] [CrossRef]
  43. Green, C.S.; Bavelier, D. Action video games modify visual selective attention. Nature 2003, 423, 534–537. [Google Scholar] [CrossRef]
  44. Green, C.S.; Bavelier, D. Effects of action video game playing on the spatial distribution of visual selective attention. J. Exp. Psychol. Hum. Percept. Perform. 2006, 32, 1465–1478. [Google Scholar] [CrossRef]
  45. Feng, J.; Spence, I.; Pratt, J. Playing an action videogame reduces gender differences in spatial cognition. Psychol. Sci. 2007, 18, 850–855. [Google Scholar] [CrossRef] [PubMed]
  46. Li, R.; Polat, U.; Makous, W.; Bavelier, D. Enhancing the contrast sensitivity function through action video game training. Nat. Neurosci. 2009, 12, 549–551. [Google Scholar] [CrossRef] [PubMed]
  47. Unsworth, N.; Redick, T.S.; McMillan, B.D.; Hambrick, D.Z.; Kane, M.J.; Engle, R.W. Is Playing Video Games Related to Cognitive Abilities? Psychol. Sci. 2015, 26, 759–774. [Google Scholar] [CrossRef] [PubMed]
  48. Kristjánsson, Á. The case for causal influences of action videogame play upon vision and attention. Atten. Percept. Psychophys. 2013, 75, 667–672. [Google Scholar] [CrossRef]
  49. Bediou, B.; Adams, D.M.; Mayer, R.E.; Tipton, E.; Green, C.S.; Bavelier, D. Meta-analysis of action video game impact on perceptual, attentional, and cognitive skills. Psychol. Bull. 2018, 144, 77–110. [Google Scholar] [CrossRef]
  50. De Rivecourt, M.; Kuperus, M.N.; Post, W.J.; Mulder, L.J. Cardiovascular and eye activity measures as indices for momentary changes in mental effort during simulated flight. Ergonomics 2008, 51, 1295–1319. [Google Scholar] [CrossRef]
  51. Foy, H.J.; Chapman, P. Mental load is reflected in driver behaviour, physiology, eye movements and prefrontal cortex activation. Appl. Ergon. 2018, 73, 90–99. [Google Scholar] [CrossRef]
  52. Svensson, E.A.I.; Wilson, G.F. Psychological and psychophysiological models of pilot performance for systems development and mission evaluation. Int. J. Aviat. Psychol. 2009, 12, 95–110. [Google Scholar] [CrossRef]
  53. Di Stasi, L.L.; Antolí, A.; Gea, M.; Cañas, J.J. A neuroergonomic approach to evaluating mental load in hypermedia interactions. Int. J. Ind. Ergon. 2011, 41, 298–304. [Google Scholar] [CrossRef]
Figure 1. An example of different mental load conditions adopted in the experiment (perceptual load (PL) and working memory load (WML)). The central blue rectangle represents a car from the KITTI dataset, retained for naturalistic context but not counted in the numerosity task. Violet boxes represent bicycle targets, orange boxes represent pedestrian targets, and light blue boxes represent distractors. Participants were instructed to focus on violet and orange targets for identification and ignore the central blue car. To preserve the naturalistic properties of the KITTI dataset, object counts within each load condition varied slightly within predefined thresholds (e.g., 1–9 distractors in low PL—low WML; 1–9 objects, including targets, in low PL—high WML). This variability ensured ecological validity while maintaining experimental control over load categories.
Figure 1. An example of different mental load conditions adopted in the experiment (perceptual load (PL) and working memory load (WML)). The central blue rectangle represents a car from the KITTI dataset, retained for naturalistic context but not counted in the numerosity task. Violet boxes represent bicycle targets, orange boxes represent pedestrian targets, and light blue boxes represent distractors. Participants were instructed to focus on violet and orange targets for identification and ignore the central blue car. To preserve the naturalistic properties of the KITTI dataset, object counts within each load condition varied slightly within predefined thresholds (e.g., 1–9 distractors in low PL—low WML; 1–9 objects, including targets, in low PL—high WML). This variability ensured ecological validity while maintaining experimental control over load categories.
Jemr 18 00009 g001
Figure 2. The procedure of one session in the experiment.
Figure 2. The procedure of one session in the experiment.
Jemr 18 00009 g002
Figure 3. Graphs depicting (A) average fixation duration (in ms) and (B) fixation rate (fixations per minute) across four conditions of perceptual load (PL) and working memory load (WML): Low–Low (LL: low perceptual load, low working memory load), Low–High (LH: low perceptual load, high working memory load), High–Low: (HL: high perceptual load, low working memory load), and High–High (HH: high perceptual load, high working memory load).
Figure 3. Graphs depicting (A) average fixation duration (in ms) and (B) fixation rate (fixations per minute) across four conditions of perceptual load (PL) and working memory load (WML): Low–Low (LL: low perceptual load, low working memory load), Low–High (LH: low perceptual load, high working memory load), High–Low: (HL: high perceptual load, low working memory load), and High–High (HH: high perceptual load, high working memory load).
Jemr 18 00009 g003
Table 1. Mann–Whitney U test results.
Table 1. Mann–Whitney U test results.
VariableMann–Whitney U Test by Variable Sex. Marked Tests are Significant at p < 0.05
Rank Sum
Females
Rank Sum
Males
UZp-ValueValid N
Females
Valid N
Males
Average fixation duration57,854.049,099.022,874.0−2.30.022264198
Fixation rate (per minute)64,292.542,660.522,959.5−2.20.025264198
Table 2. Aggregate results’ descriptive statistics.
Table 2. Aggregate results’ descriptive statistics.
VariableAggregate Results’ Descriptive Statistics
SexN trialsMeanMedianMinimumMaximumVarianceStd.Dev.Coef.Var.
Average fixation durationFemales264357.36342.72226.23788.115391.1973.4220.55
Fixation rate (per minute)Females264156.59157.0674.83235.02637.9925.2616.13
Average fixation durationMales198377.69360.42190.67804.379895.5399.4826.34
Fixation rate (per minute)Males198151.88152.8070.43272.731065.2232.6421.49
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Bend, J.; Öörni, A. Numerosity Perception and Perceptual Load: Exploring Sex Differences Through Eye-Tracking. J. Eye Mov. Res. 2025, 18, 9. https://doi.org/10.3390/jemr18020009

AMA Style

Bend J, Öörni A. Numerosity Perception and Perceptual Load: Exploring Sex Differences Through Eye-Tracking. Journal of Eye Movement Research. 2025; 18(2):9. https://doi.org/10.3390/jemr18020009

Chicago/Turabian Style

Bend, Julia, and Anssi Öörni. 2025. "Numerosity Perception and Perceptual Load: Exploring Sex Differences Through Eye-Tracking" Journal of Eye Movement Research 18, no. 2: 9. https://doi.org/10.3390/jemr18020009

APA Style

Bend, J., & Öörni, A. (2025). Numerosity Perception and Perceptual Load: Exploring Sex Differences Through Eye-Tracking. Journal of Eye Movement Research, 18(2), 9. https://doi.org/10.3390/jemr18020009

Article Metrics

Back to TopTop