This study aimed to explore the effectiveness of ET technology for assessing the usability affordances of jar opening ADs. Interestingly, we have identified TDoF%, NoF%, and TDoV% as key metrics for assessing usability affordances in ADs. We also found that the variation in evaluation times across affordances suggests that some affordances are less effectively communicated through the models’ pictures and would benefit from clearer signifiers. Specifically, lid slippery and effort level are less effectively communicated, whereas robustness and easiness to grip are conveyed more effectively. In this sense, we have analysed the potential impact of adding rubber to the grip area as a signifier. Non-parametric multi-factor testing revealed that some models were rated more favourably than others depending on the affordance, confirming that the selected models effectively represent different levels of each affordance. Additionally, the presence of rubber consistently enhanced affordance clarity across all models. The lack of significant interaction effects indicates that rubber has a uniform positive impact, enhancing the evaluation of the affected affordance regardless of the model.
4.1. About the Use of ET Metrics for Evaluating Usability Affordances
The study sought to identify ET metrics that effectively gauge the usability of jar opening ADs. In this sense, significant differences in metrics aligned with differences in on-screen assessment ratings, along with significant correlations between these metrics and affordance ratings. Specifically, fixation and visit metrics, such as TDoF%, NoF%, TDoV%, and NoV, demonstrated significant correlations with on-screen assessment affordance ratings, suggesting their relevance for assessing usability affordances. Although the correlation coefficients were not high—a common feature of Spearman’s coefficients—they were significant and consistent with the hypothesis that higher affordance ratings correspond to more frequent and prolonged visual attention. In particular, TDoF%, NoF%, and TDoV% showed the highest correlations within the AoI Model. This result aligns with prior research on product selection suggesting that products receiving higher attention are more likely to be selected [
39,
40]. In contrast, metrics related to the first fixation—such as TtFF% and DoFF%—provided limited information, with low and non-significant correlations with on-screen assessments. This limitation may be due to other factors unrelated to affordance perception. ET records revealed that participants’ initial fixations often centred on the screen due to the gaze restriction blue dot used before affordance assessment, potentially distorting these metrics for centrally located models. Future studies should ensure that models are not placed in the central region of visual stimuli.
The differences observed in fixation and visit metrics across AoIs align with the findings of Burlamaqui and Dong [
29], who argued that affordances can be perceived through visual inspection of critical product elements. For affordances such as
comfort,
easiness to grip,
effort level, and
easiness to use, the AoIs Grip and Placement on Lid were the most observed, as these regions convey essential information for the evaluation of these specific affordances. The results also corroborate findings by Federico and Brandimonte [
31], who emphasised that fixation times are prolonged in areas associated with product manipulation, a pattern they observed in usability affordances of tools, where the manipulation zones attracted significantly more attention compared to other parts of the tools.
In summary, this study has identified TDoF%, NoF%, and TDoV% as key metrics for assessing usability affordances in ADs, while first fixation metrics offer limited utility due to contextual bias in this study. The addition of rubber to the grip area significantly improved on-screen assessment affordance ratings, as reflected in higher fixation and visit metrics. These findings underscore the importance of targeted AoI analysis in uncovering the visual strategies underlying affordance perception. Future research should refine experimental designs to minimise biases and further explore the interplay between ET metrics, product characteristics, and perceived affordances.
4.2. About Signifiers of AD Usability
A detailed examination of rubber’s role as a signifier for each affordance (
Figure 3) reveals several positive effects. Models with rubber generally received higher ratings from on-screen assessment, particularly where rubber’s physical properties were most advantageous. For instance, rubber enhanced
robustness of the grip area, especially in model 6, by providing a sense of increased thickness.
Comfort improved in the rubber versions (significantly in models 1, 5, and 6), likely due to the perceived softness, reducing pressure points and providing greater comfort. Rubber also clarified the intended grip area, as in models 1, 2, and 6, resulting in perceiving an improved ease of grip. In model 1, the rubber version was rated higher for
lid slippery, as the enhanced grip is perceived to reduce slippage and facilitate tighter sealing. Additionally, the rubbered model 1 was also rated more positively for
easiness to use, suggesting that the defined grip area clarified how to operate the model. Finally, no significant differences were observed between rubber and non-rubber versions for the affordance of
effort level, which was one of the least effectively conveyed.
To conduct a more detailed study of other design elements that function as signifiers, a qualitative analysis by affordance has been carried out, supported by participants’ visual attention from heatmaps (
Figure 4). Comments follow:
Robustness. Participants focused primarily on the placement area on the lid, highlighting its importance in evaluating this affordance. Material and thickness notably influenced perceptions: in models 2 and 3, the steel material served as a positive signifier, while in model 4, the thick plastic conveyed solidity. Additionally, the solid and somewhat bulky shape of model 4 may have further reinforced the perception of robustness.
Comfort. The grip area received the most attention, particularly in models 1 and 6, which are elongated, plastic, and feature large grip areas, characteristics that likely act as comfort signifiers, especially with rubber reinforcement. In contrast, model 2, rated lowest for comfort, showed fewer fixations in its rubber version, possibly due to its slim design and steel material, which may have been perceived as uncomfortable when pressing the hand.
Easiness to grip. Attention across AoIs varied by model: in model 1, the grip area received most attention; in models 5 and 6, focus centred on the placement area on the lid, while in model 3, both areas were equally observed. Models 2 and 4 showed similar pattern, with attention distributed between both areas. The handle design in model 3, with its familiar shape and material, likely acted as a signifier of easiness to grip. In contrast, model 2, perceived as the hardest to grasp, received less attention, possibly due to its wire-like shape and limited support surface, leading participants to dismiss it early.
Lid slippery. Attention primarily focused on the placement area on the lid, with minimal focus on the grip area. Higher-rated models displayed features enhancing adhesion perception: models 2 and 3, with metal contact areas and protruding tabs, were likely perceived as less slippery due to these elements (
Figure 6). Similarly, models 4 and 6, featuring soft, rubber-like plastic in the contact area, acted as a positive signifier of adhesion. In contrast, model 1 was perceived as more slippery due to its smooth hard plastic and model 5 due to its toothless metal surface (
Figure 6).
Effort level. Model 2, one of the best rated, attracted more attention to the grip area, while the lower-rated models were more observed in the placement area on the lid. Regarding signifiers, model 2’s metallic and lever-shaped design suggests reduced effort. However, model 1, despite a similar lever shape, was rated as requiring more effort, possibly due to its plastic material, perceived as a negative signifier. For model 3, the rating may reflect the short handle length, limiting leverage and acting as a negative signifier.
Easiness to use. Attention patterns revealed that in model 1, focus was directed towards the grip area, while in models 4 and 6, greater attention was given to the placement area on the lid. Higher-rated models like model 1, with its longer grip and larger palm area, drew more attention to the grip. In model 4, the lid area could also serve as a grip, and model 6 had multiple placement zones, explaining the attention on these areas. In contrast, lower-rated models 2 and 3, perceived as harder to use, had complex mechanisms that acted as signifiers difficult to interpret. Overall, the placement area on the lid was observed more than the grip area, indicating that easiness to use is better understood by recognising how to place the opener on the lid rather than how to hold it.
Table 9 summarises the key areas in evaluating affordances and the signifiers that influence perception of ADs.
The results from the usability evaluation with real products (UA2) largely aligned with the on-screen evaluation performed during the ET recordings (UA1), reinforcing the reliability of on-screen presentation of products and, consequently, of ET for assessing usability affordances. Consistent findings were observed for comfort, easiness to grip, and easiness to use, indicating that these affordances were adequately assessed through ET. However, discrepancies emerged for robustness, where model 1, initially perceived as fragile in ET due to its plastic appearance, was rated higher upon physical interaction. Similarly, for lid slippery, models 2 and 3, considered non-slippery in ET, were found to be slippery during real use due to material properties, such as the lack of adhesion from metal tabs. For effort level, although model 2 was highly rated in on-screen assessment, its ranking during real use was lower, likely due to a less functional lever design that complicates gripping far from the jar. In contrast, model 1, which received a negative rating from on-screen assessment, performed best during real use, as its soft material reduced perceived effort compared to the hard plastic observed visually. Some of these differences highlight the limitations of visual-based methods in capturing material properties and physical interactions. Further research should be conducted on how to use images whose assessments align with real-use evaluations.