1. Introduction
Eye tracking (ET) technology is transforming many areas of society, from research approaches (including data analysis, sports science, cognitive psychology, and reading studies) to clinical practice [
1]. It enables objective, real-time assessment of visual responses with a level of accuracy unattainable by the human eye [
2].
In clinical practice, the use of digital devices to assess visual function and diagnose impairments is becoming more and more common due to the lack of adequate tests for certain patient groups. This occurs especially in young children who are not able to understand or collaborate in order to obtain an accurate result with traditional diagnostic tools.
In this study, we focus on ET technologies that use infrared to record reflections from the cornea, pupil, or both [
3]. In this way, they can determine the patient’s eye position, movements, and fixations to capture gaze information. This is especially useful when dealing with uncooperative patients, preverbal children, or children with disabilities who are unable to provide verbal or motor responses [
4,
5].
To overcome the limitations of conventional tests, several ET systems have been developed, such as Irisbond or Tobii, which are integrated into health solutions such as Gazelab, Bitbrain, or DIVE, whose development is based on ET technology and artificial intelligence.
To obtain reliable metrics from ET assessments, accuracy must be ensured through proper calibration for each participant. There are several reasons why accuracy may drop: the quality of the ET camera, miscalculation of the position of the eyes, participants’ characteristics (such as eye colour), or visual aids, among others [
2,
6]. The use of glasses with high prescriptions or with different types of coatings (or even with dirty surfaces) can interfere with this perception, leading to lower calibration quality [
7] due to the reflections that occur on the lens’ surface [
8].
In addition, any intermediate optical surface may introduce variability in the apparent position of the cornea. This unpredictability complicates the calibration process and, consequently, reduces the precision of measurements. Moreover, many ophthalmic lenses incorporate infrared light filters that are likely to interfere with the functioning of eye trackers. For this reason, the use of progressive lenses or certain anti-glare coatings that alter corneal reflections is commonly considered an exclusion criterion in ET studies, given the difficulty of reliably detecting the cornea [
9]. However, to the best of our knowledge, no systematic study has evaluated and compared the effect of different lens types and treatments on ET.
The aim of this study is to determine whether the use of glasses with different prescriptions and optical filters affects the eye tracker’s ability to detect ocular position and, consequently, the accuracy of its metrics. To this end, participants will be evaluated both with and without different types of lenses using an ET device in order to assess how various lens types and filters may influence the system’s performance.
3. Results
A total of 14 patients were included in the study. The mean age was 26.86 years (the youngest was 18 years old and the oldest 40 years old) and the mean refractive error was 0.39 D in the spherical component and 0.27 D in the cylindrical component. None of them had visual pathologies, and all achieved a decimal visual acuity of 1.0 for both distance and near vision.
Regarding angle error,
Figure 5A shows the angle error results for different powers and the control group, while
Figure 5B shows the results for treatments. A significant increase in median angle error was observed in conditions with high dioptric power lenses, both positive and negative. L3 (+6 D) and L4 (−6 D) lenses showed the largest errors (
p-value < 0.05 and <0.001, respectively); the median value for the control lens was 1.94 Deg (IQR 0.94), while L3 presented 2.60 Deg (IQR 1.09) and L4 2.39 Deg (IQR 0.83). Lenses with cylindrical components (L5 and L6) also showed increases (median values of 2.17 and 2.32 Deg, respectively), although the increases were more moderate. As for ophthalmic lenses with filters (F1–F6), minor variations were detected, and the differences found were not significant.
The fixation stability increased in most of the powered lenses (
Figure 6A), although the only one that showed a significant difference (
p-value < 0.05) with respect to the control lens (median of −0.33 logDeg
2 and IQR 0.38) was the L4 (median of −4.83 × 10
−3 logDeg
2 and IQR 0.27). This effect suggests greater fixation instability with that lens. In contrast, ophthalmic lenses with filters (
Figure 6B) maintained fixation areas similar to the control condition, indicating less gaze dispersion.
With respect to the accuracy of saccadic movements, no significant differences were found for any of the lenses studied (
Figure 7A,B).
The greatest impact of the filters was observed in the number of gaps without samples received by the ET (
Figure 8B). In particular, the F4 showed the highest number of gaps (median value of 1538 gaps and IQR 2446) compared to the control lens (
p-value < 0.001), which showed practically no gaps in the recording. This filter prevents the wavelengths corresponding to the infrared from passing through, and thus, the infrared light from the ET was subject to interference in the transmission. F2 also showed an increased number of gaps. Its median value was 110.5 gaps, and IQR 1600, with a
p-value of <0.05 in comparison with the control lens.
On the other hand, the number of gaps increased markedly in the presence of the high-powered negative lens, as shown in
Figure 8A, the L4 lens (median value of 310 gaps and IQR 1376) compared to the control lens (median of 35.5 gaps and IQR 23), as the
p-value was <0.001. However, in the high-powered positive lens, no significant differences were observed compared to the control lens.
Finally, with regard to the proportion of valid frames, it can be said that the only powered lens that showed significant differences compared with the control lens was the L4, with a median value of 0.951 and IQR of 0.154 (
Figure 9A). As for the lenses with filters (
Figure 9B), both the F2 and the F4 lenses showed a lower proportion of valid frames (median of 0.950 and IQR of 0.158 for the F2 and median of 0.731 and IQR of 0.232 for the F4) compared to the control situation (median of 0.975 and IQR of 0.02), as the
p-value was <0.05 for F2 and <0.001 for F4. This indicates that there was a malfunction of the ET with the indicated lenses (L4, F2, and F4). It can be seen that all these gaps are distributed throughout the entire scan time; they do not occur at a specific moment. This clearly demonstrates that the increase in gaps and reduction in valid frames are caused by the lenses.
The sample size included in this study provides adequate statistical precision for the two primary quantitative variables analysed. For gaps, using a standard deviation of 47 units and assuming a two-sided α error of 0.05 and a β error of 0.20, the maximum estimation error was 35.2 gaps. This error margin is markedly smaller than the mean differences observed between the control group and the two lenses influencing the performance of the eye tracking the most (623 gaps with L4 and 1683 gaps with F4), indicating that the study is sufficiently powered to detect differences in clinically relevant magnitude. Similarly, for valid frames, the standard deviation was 0.022, and the calculated precision under the same α and β parameters was 0.0165. This level of error is negligible when compared with the observed between-group differences (0.07 between control lens and L4 and 0.21 between control lens and F4), again supporting that the sample size is more than adequate to ensure robust and reliable estimations. Taken together, these calculations confirm that the sample size is sufficient to sustain the validity of the statistical comparisons made.
4. Discussion
In this article, we demonstrate and quantify for the first time the effect of optical lenses and different filters on the functioning of the ET.
Different ophthalmic lenses—six with varying dioptric powers and six containing optical filters—were assessed for their impact on the performance of an ET device. Using the oculomotor control test in DIVE, we compared the metrics obtained with each lens to those from the same patient without lenses, under identical conditions.
The results show that the optical characteristics of the lenses significantly influence both the quality and accuracy of ET measurements. The high-powered spherical negative L4 lens (−6.00 D) was associated with poorer performance in variables such as angle error and number of gaps, fixation stability, and valid frames, while the high-powered spherical positive L3 lens (+6.00 D) showed worse results just for angle error.
We observed that the negative lens offers poorer quality in interaction with the ET compared to the positive lens, which could be associated with the difference in subjective pupil size caused by the power of the lens. In the case of the negative lens, the subjective pupil size is much smaller, which could make it difficult for the ET to collect valid samples due to the limited spatial resolution available to detect very small pupil images through the lens.
On the other hand, although the distance between the subject and the device remained constant at 65 cm, the placement of positive and negative lenses may affect the subjective distance recorded by the ET. The smaller pupil image produced by negative lenses could behave similarly to being physically farther from the device, and it is known that ET performance deteriorates at distances greater than the optimal working range [
14].
The degradation in performance with high-power lenses can be attributed to several reasons. Firstly, it should be noted that any lens with power induces optical aberrations that produce distortion in the image. The higher the power of the lens, the greater these aberrations will be. Geometrical aberrations and thermal nonlinearities in high-power laser beams passing through lenses can impact beam quality [
15].
Images seen through eyeglasses at different gaze directions are mainly influenced by three optical phenomena: image displacement, blur, and magnification [
16]. Image displacement, or prismatic error in optometry, arises from changes in the line of sight, while blur occurs when the image of a viewing point is not punctual. Magnification, defined as the ratio between object and image height [
16], becomes more pronounced in high-power lenses, where increased lens thickness enlarges the eye-to-lens separation. These factors affect the optical eye–lens–ET system and ultimately influence the quality of the ET samples collected from eye movements.
Ophthalmic lenses also influence pupil size, a factor that directly impacts the accuracy of ET. Variations in pupil size can lead to shifts in reported gaze position exceeding 2 degrees in camera-based eye trackers [
17]. Moreover, the three aforementioned optical phenomena likewise affect both pupil size and the apparent target eccentricity, effects that become especially relevant with higher-power lenses and larger visual field eccentricities [
18].
Although not analysed in this study, Concepción-Grande et al. [
19] reported that the type of multifocal lenses may also influence the results. They evaluated the influence of different progressive lenses on visual quality by measuring visual acuity and using an ET-based system to analyse fixations, since ET provides objective and quantitative data that complements traditional VA assessment.
It can be concluded that, on the one hand, lenses with high dioptric power, both positive and negative (L3: +6.00 D and L4: −6.00 D), generate interference in the transmission of information from the ET to the eye and back. On the other hand, when it comes to filtered lenses, it should be noted that the Natural IR filter (F4) and the SV Org. 1.5 AR (F2) had the greatest impact on the quality of the samples collected by the ET. These filters demonstrated poorer performance in the number of gaps and valid frames. When it comes to the Natural IR filter, this effect can be attributed to direct interference between the filter applied in this lens and the principle of operation of ET systems based on near-infrared light. However, with reference to the SV Org. 1.5 lens, the ET’s performance could be related to the quality and spectral range of its multilayer AR coating. Simpler, low-cost AR coatings usually consist of only a few layers optimised for the visible spectrum, whereas premium AR coatings include multiple layers designed to extend their low-reflectance performance into the near-infrared range and to reduce angular dependence. As a result, the basic AR may increase reflectance or generate parasitic reflections in the infrared band, thereby compromising the eye tracker’s ability to reliably detect the pupil and corneal reflections [
20].
This explanation is consistent with the data obtained, as the F2 condition showed a moderate but noticeable increase in gaps compared to both the control and other AR-coated lenses. Given that all experimental parameters remained constant, these differences can be reasonably attributed to the coating’s spectral inefficiency in the near-infrared range, rather than to procedural or setup factors.
The ET used for the study was a Tobii 5L model that uses emitters and cameras operating in the 850 nm wavelength to detect specific pupil and corneal reflections. The IR filter incorporated in these lenses is designed to attenuate or modify the transmission of this spectral range for eye protection or visual comfort purposes. This filtering affects both the light incident on the eye and the light returning to the system’s sensors, and it may compromise the detection of pupil or corneal reflections necessary for correct gaze point estimation.
Interestingly, although the infrared spectral transmission curves of INDO’s Natural IR and Energy Blue IR filters are practically identical, only the Energy Blue IR model allows the ET system to function correctly. This observation indicates that the problem is not solely related to the percentage of infrared transmission but rather to more complex optical factors that are not captured by standard spectral measurements. For instance, subtle differences in the coatings applied to the lens surface, such as the effectiveness of anti-reflective treatments, the presence of residual reflections, or the uniformity of coating thickness, may alter how infrared light is reflected or refracted as it enters and exits the lens. Additionally, the angular dependence of transmission, which describes how optical behaviour changes with the incidence angle of light, could significantly affect the infrared signals detected by the ET system, especially since eye trackers rely on precise corneal reflections. These nuances highlight that two lenses with nearly identical spectral transmission curves can still interact very differently with infrared-based technologies, emphasising the need for more detailed optical characterisation beyond standard transmittance data.
The multilayer coatings of this type of lens can induce internal reflection phenomena or optical interference, which further hinders eye segmentation by the ET software. Together, these factors might explain the observed deterioration in data quality when using this IR-filtered lens, highlighting the need to consider the spectral properties of optical treatments when designing studies based on ET technologies.
An exhaustive review of the scientific literature has been carried out in order to identify studies that evaluate the performance of ET systems in the presence of different types of ophthalmic lenses, both in terms of their dioptric powers and the optical treatments applied, including filters and coatings. However, no studies have been found that specifically and systematically address the technical limitations of these devices when used in combination with spectacles. This lack of studies highlights the relevance of the present work, which constitutes a first rigorous approach to the analysis of the impact that different types of lenses can have on the reliability and accuracy of the records obtained by eye trackers.
The value of this study lies not only in identifying which lenses are most likely to induce errors or loss of information in the capture of data with ET but also in demonstrating that the quantitative results provided by these devices when assessing certain visual functions may not be entirely accurate in specific optical conditions.
Data loss may make it more difficult to obtain reliable estimates of fixation stability or saccadic accuracy for each stimulus presented during the test. Because the final outcome for each metric is calculated as the median across stimuli, a reduced number of valid samples could increase the variance of the results. However, data loss by itself would not artificially increase fixation stability, as having fewer samples typically results in a smaller measured fixation area rather than a larger one. In our study, no significant differences were found in saccadic accuracy for any of the lenses examined (
Figure 7).
This opens up a line of research of great interest and potential, allowing progress towards a more precise and critical interpretation of the data collected with ET technologies, especially in clinical and applied research contexts where visual precision is essential.
The main limitations of this study are as follows. First, the study had a relatively small sample size, which was a direct consequence of applying strict exclusion criteria to minimise potential biases, as well as the inherent difficulty of an exploratory protocol requiring each participant to complete 13 different lens and filter conditions. Second, each lens type was evaluated in isolation rather than in a more natural situation, where patients wear their own spectacles to correct refractive errors at the standard testing distance, allowing adaptation effects to be properly considered. Third, multifocal lenses were not included in the analysis, despite their widespread clinical use and potential impact on ET performance. Fourth, there was a lack of transmission profiles for the two filters analysed, which would be essential to gain an in-depth understanding of their differential behaviour, as both are designed to filter IR light. This aspect represents a limitation of the study and will be addressed in future research. Finally, lenses that combine both refractive power and optical filters, as typically found in real-world conditions, were not examined together but separately, limiting the generalisability of the findings. Future studies could also explore validation stimuli specifically adapted to each visual task (fixation, saccades, or smooth pursuit) to further reduce potential variability related to stimulus geometry and optimise ET performance assessment.
Despite these constraints, the results emphasise the importance of accounting for both the optical properties of lenses and their treatments when interpreting ET data. Such considerations are crucial to ensure the validity of results and to avoid misinterpretations that could compromise scientific evidence or clinical decision-making.