Search Results (66)

Search Parameters:
Keywords = robotic gaze

26 pages, 6831 KB  
Article
Human–Robot Interaction and Tracking System Based on Mixed Reality Disassembly Tasks
by Raúl Calderón-Sesmero, Adrián Lozano-Hernández, Fernando Frontela-Encinas, Guillermo Cabezas-López and Mireya De-Diego-Moro
Robotics 2025, 14(8), 106; https://doi.org/10.3390/robotics14080106 - 30 Jul 2025
Viewed by 2002
Abstract
Disassembly is a crucial process in industrial operations, especially in tasks requiring high precision and strict safety standards when handling components with collaborative robots. However, traditional methods often rely on rigid and sequential task planning, which makes it difficult to adapt to unforeseen changes or dynamic environments. This rigidity not only limits flexibility but also leads to prolonged execution times, as operators must follow predefined steps that do not allow for real-time adjustments. Although techniques like teleoperation have attempted to address these limitations, they often hinder direct human–robot collaboration within the same workspace, reducing effectiveness in dynamic environments. In response to these challenges, this research introduces an advanced human–robot interaction (HRI) system leveraging a mixed-reality (MR) interface embedded in a head-mounted device (HMD). The system enables operators to issue real-time control commands using multimodal inputs, including voice, gestures, and gaze tracking. These inputs are synchronized and processed via the Robot Operating System (ROS2), enabling dynamic and flexible task execution. Additionally, the integration of deep learning algorithms ensures precise detection and validation of disassembly components, enhancing accuracy. Experimental evaluations demonstrate significant improvements, including reduced task completion times, enhanced operator experience, and strict adherence to safety standards. This scalable solution offers broad applicability for general-purpose disassembly tasks, making it well-suited for complex industrial scenarios. Full article
(This article belongs to the Special Issue Robot Teleoperation Integrating with Augmented Reality)
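The abstract describes synchronizing voice, gesture, and gaze commands through ROS 2 but gives no implementation details. Below is a minimal rclpy sketch of how such streams could be merged; the topic names, the use of std_msgs/String, and the fusion rule are assumptions, not the authors' interface.

```python
# Hypothetical sketch: merging voice, gesture, and gaze commands in a ROS 2 node.
# Topic names and message types are assumptions, not the paper's actual interface.
import rclpy
from rclpy.node import Node
from std_msgs.msg import String


class MultimodalCommandNode(Node):
    def __init__(self):
        super().__init__('multimodal_command_node')
        self.latest = {'voice': None, 'gesture': None, 'gaze': None}
        # One subscription per input modality.
        for modality in self.latest:
            self.create_subscription(
                String, f'/hri/{modality}',
                lambda msg, m=modality: self.on_input(m, msg), 10)
        self.cmd_pub = self.create_publisher(String, '/robot/task_command', 10)
        # Periodically fuse whatever inputs have arrived.
        self.create_timer(0.1, self.fuse_and_publish)

    def on_input(self, modality, msg):
        self.latest[modality] = msg.data

    def fuse_and_publish(self):
        # Trivial fusion rule for illustration: voice selects the task,
        # gaze selects the target component.
        if self.latest['voice'] and self.latest['gaze']:
            cmd = String()
            cmd.data = f"{self.latest['voice']}:{self.latest['gaze']}"
            self.cmd_pub.publish(cmd)


def main():
    rclpy.init()
    rclpy.spin(MultimodalCommandNode())


if __name__ == '__main__':
    main()
```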

28 pages, 3441 KB  
Article
Which AI Sees Like Us? Investigating the Cognitive Plausibility of Language and Vision Models via Eye-Tracking in Human-Robot Interaction
by Khashayar Ghamati, Maryam Banitalebi Dehkordi and Abolfazl Zaraki
Sensors 2025, 25(15), 4687; https://doi.org/10.3390/s25154687 - 29 Jul 2025
Viewed by 1109
Abstract
As large language models (LLMs) and vision–language models (VLMs) become increasingly used in robotics, a crucial question arises: to what extent do these models replicate human-like cognitive processes, particularly within socially interactive contexts? Whilst these models demonstrate impressive multimodal reasoning and perception capabilities, their cognitive plausibility remains underexplored. In this study, we address this gap by using human visual attention as a behavioural proxy for cognition in a naturalistic human-robot interaction (HRI) scenario. Eye-tracking data were previously collected from participants engaging in social human-human interactions, providing frame-level gaze fixations as a human attentional ground truth. We then prompted a state-of-the-art VLM (LLaVA) to generate scene descriptions, which were processed by four LLMs (DeepSeek-R1-Distill-Qwen-7B, Qwen1.5-7B-Chat, LLaMA-3.1-8b-instruct, and Gemma-7b-it) to infer saliency points. Critically, we evaluated each model in both stateless and memory-augmented (short-term memory, STM) modes to assess the influence of temporal context on saliency prediction. Our results show that whilst stateless LLaVA most closely replicates human gaze patterns, STM confers measurable benefits only for DeepSeek, whose lexical anchoring mirrors human rehearsal mechanisms. Other models exhibited degraded performance with memory due to prompt interference or limited contextual integration. This work introduces a novel, empirically grounded framework for assessing cognitive plausibility in generative models and underscores the role of short-term memory in shaping human-like visual attention in robotic systems. Full article
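The study scores model-inferred saliency points against frame-level human fixations. As an illustration of one way such agreement could be quantified (not the paper's actual metric), the NumPy sketch below reports a mean pixel error and a hit rate within an assumed tolerance radius.

```python
# Illustrative scoring of predicted saliency points against human gaze fixations.
# The tolerance radius and the metric itself are assumptions, not the paper's protocol.
import numpy as np


def gaze_agreement(predicted, human, radius_px=50.0):
    """predicted, human: (N, 2) arrays of per-frame (x, y) points in pixels."""
    predicted = np.asarray(predicted, dtype=float)
    human = np.asarray(human, dtype=float)
    dists = np.linalg.norm(predicted - human, axis=1)
    return {
        'mean_error_px': float(dists.mean()),
        'hit_rate': float((dists <= radius_px).mean()),  # fraction of frames "close enough"
    }


# Toy usage with three frames.
print(gaze_agreement([[100, 100], [220, 180], [400, 300]],
                     [[110, 95], [300, 200], [405, 310]]))
```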

21 pages, 2624 KB  
Article
GMM-HMM-Based Eye Movement Classification for Efficient and Intuitive Dynamic Human–Computer Interaction Systems
by Jiacheng Xie, Rongfeng Chen, Ziming Liu, Jiahao Zhou, Juan Hou and Zengxiang Zhou
J. Eye Mov. Res. 2025, 18(4), 28; https://doi.org/10.3390/jemr18040028 - 9 Jul 2025
Cited by 1 | Viewed by 798
Abstract
Human–computer interaction (HCI) plays a crucial role across various fields, with eye-tracking technology emerging as a key enabler for intuitive and dynamic control in assistive systems like Assistive Robotic Arms (ARAs). By precisely tracking eye movements, this technology allows for more natural user interaction. However, current systems primarily rely on a single gaze-dependent interaction method, which leads to the “Midas Touch” problem. This highlights the need for real-time eye movement classification in dynamic interactions to ensure accurate and efficient control. This paper proposes a novel Gaussian Mixture Model–Hidden Markov Model (GMM-HMM) classification algorithm aimed at overcoming the limitations of traditional methods in dynamic human–robot interactions. By incorporating sum of squared error (SSE)-based feature extraction and hierarchical training, the proposed algorithm achieves a classification accuracy of 94.39%, significantly outperforming existing approaches. Furthermore, it is integrated with a robotic arm system, enabling gaze trajectory-based dynamic path planning, which reduces the average path planning time to 2.97 milliseconds. The experimental results demonstrate the effectiveness of this approach, offering an efficient and intuitive solution for human–robot interaction in dynamic environments. This work provides a robust framework for future assistive robotic systems, improving interaction intuitiveness and efficiency in complex real-world scenarios. Full article
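The abstract names the GMM-HMM classifier but not its configuration. The sketch below is a generic GMM-HMM fit with hmmlearn on synthetic velocity/dispersion features; the feature choice, state counts, and training scheme are assumptions and do not reproduce the paper's SSE-based hierarchical training.

```python
# Generic GMM-HMM sketch for labelling eye-movement samples (e.g., fixation,
# saccade, smooth pursuit). Feature choice and model sizes are assumptions.
import numpy as np
from hmmlearn.hmm import GMMHMM

rng = np.random.default_rng(0)
# Toy features per gaze sample: [velocity, dispersion]; a real system would use
# calibrated eye-tracker signals.
X = np.vstack([
    rng.normal([1.0, 0.2], 0.1, size=(200, 2)),   # fixation-like samples
    rng.normal([20.0, 0.5], 2.0, size=(80, 2)),   # saccade-like samples
    rng.normal([5.0, 1.5], 0.5, size=(120, 2)),   # pursuit-like samples
])
lengths = [400]  # one continuous recording

model = GMMHMM(n_components=3, n_mix=2, covariance_type='diag',
               n_iter=50, random_state=0)
model.fit(X, lengths)
states = model.predict(X)          # hidden state (movement class) per sample
print(np.bincount(states))
```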

18 pages, 4185 KB  
Article
An Empirical Study on Pointing Gestures Used in Communication in Household Settings
by Tymon Kukier, Alicja Wróbel, Barbara Sienkiewicz, Julia Klimecka, Antonio Galiza Cerdeira Gonzalez, Paweł Gajewski and Bipin Indurkhya
Electronics 2025, 14(12), 2346; https://doi.org/10.3390/electronics14122346 - 8 Jun 2025
Viewed by 1101
Abstract
Gestures play an integral role in human communication. Our research aims to develop a gesture understanding system that allows for better interpretation of human instructions in household robotics settings. We conducted an experiment with 34 participants who used pointing gestures to teach concepts to an assistant. Gesture data were analyzed using manual annotations (MAXQDA) and the computational methods of pose estimation and k-means clustering. The study revealed that participants tend to maintain consistent pointing styles, with one-handed pointing and index finger gestures being the most common. Gaze and pointing often co-occur, as do leaning forward and pointing. Using our gesture categorization algorithm, we analyzed gesture information values. As the experiment progressed, the information value of gestures remained stable, although the trends varied between participants and were associated with factors such as age and gender. These findings underscore the need for gesture recognition systems to balance generalization with personalization for more effective human–robot interaction. Full article
(This article belongs to the Special Issue Applications of Computer Vision, 3rd Edition)
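The gesture analysis combines pose estimation with k-means clustering. A small scikit-learn sketch of that clustering step is given below; the keypoint layout, feature scaling, and number of clusters are illustrative assumptions.

```python
# Illustrative clustering of pointing-gesture frames from pose keypoints.
# The keypoint layout and k=3 clusters are assumptions for this sketch.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(1)
# Each row: flattened (x, y) coordinates of a few upper-body keypoints
# (e.g., shoulder, elbow, wrist, index fingertip) for one video frame.
frames = rng.normal(size=(500, 8))

features = StandardScaler().fit_transform(frames)
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0).fit(features)
print(np.bincount(kmeans.labels_))   # frames per gesture cluster
```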

17 pages, 18945 KB  
Article
Collaborative Robot Control Based on Human Gaze Tracking
by Francesco Di Stefano, Alice Giambertone, Laura Salamina, Matteo Melchiorre and Stefano Mauro
Sensors 2025, 25(10), 3103; https://doi.org/10.3390/s25103103 - 14 May 2025
Cited by 1 | Viewed by 1278
Abstract
Gaze tracking is gaining relevance in collaborative robotics as a means to enhance human–machine interaction by enabling intuitive and non-verbal communication. This study explores the integration of human gaze into collaborative robotics by demonstrating the possibility of controlling a robotic manipulator with a practical and non-intrusive setup made up of a vision system and gaze-tracking software. After presenting a comparison between the major available systems on the market, OpenFace 2.0 was selected as the primary gaze-tracking software and integrated with a UR5 collaborative robot through a MATLAB-based control framework. Validation was conducted through real-world experiments, analyzing the effects of raw and filtered gaze data on system accuracy and responsiveness. The results indicate that gaze tracking can effectively guide robot motion, though signal processing significantly impacts responsiveness and control precision. This work establishes a foundation for future research on gaze-assisted robotic control, highlighting its potential benefits and challenges in enhancing human–robot collaboration. Full article
(This article belongs to the Special Issue Advanced Robotic Manipulators and Control Applications)
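The abstract stresses that raw versus filtered gaze data affects responsiveness and control precision. The sketch below shows one plausible way to low-pass filter a gaze estimate and map it to a Cartesian velocity command; the gain, smoothing factor, and command format are assumptions (the paper's own pipeline ran through MATLAB with a UR5).

```python
# Sketch of turning a gaze direction estimate (e.g., from OpenFace 2.0) into a
# planar velocity command for a manipulator. Gains and filter constants are
# illustrative assumptions only.
import numpy as np


class GazeToVelocity:
    def __init__(self, gain=0.05, alpha=0.2):
        self.gain = gain        # m/s per radian of gaze angle (illustrative)
        self.alpha = alpha      # exponential-smoothing factor for noisy gaze
        self.filtered = np.zeros(2)

    def step(self, gaze_angles):
        """gaze_angles: (yaw, pitch) in radians from the gaze tracker."""
        gaze = np.asarray(gaze_angles, dtype=float)
        # Low-pass filter the raw gaze signal before commanding the robot.
        self.filtered = self.alpha * gaze + (1 - self.alpha) * self.filtered
        vx, vy = self.gain * self.filtered
        return np.array([vx, vy, 0.0])  # Cartesian velocity (z held fixed)


ctrl = GazeToVelocity()
for raw in [(0.10, -0.02), (0.12, -0.01), (0.50, 0.30)]:  # last sample is an outlier
    print(ctrl.step(raw))
```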

20 pages, 1942 KB  
Article
Operator Expertise in Bilateral Teleoperation: Performance, Manipulation, and Gaze Metrics
by Harun Tugal, Ihsan Tugal, Fumiaki Abe, Masaki Sakamoto, Shu Shirai, Ipek Caliskanelli and Robert Skilton
Electronics 2025, 14(10), 1923; https://doi.org/10.3390/electronics14101923 - 9 May 2025
Cited by 2 | Viewed by 1632
Abstract
This paper presents a comprehensive user study aimed at assessing and differentiating operator expertise within bilateral teleoperation systems. The primary objective is to identify key performance metrics that effectively distinguish novice from expert users. Unlike prior approaches that focus primarily on psychological evaluations, this study emphasizes direct performance analysis across a range of telerobotic tasks. Ten participants (six novices and four experts) were assessed based on task completion time and difficulty, error rates, manipulator motion characteristics, gaze behaviour, and subjective feedback via questionnaires. The results show that experienced operators outperformed novices by completing tasks faster, making fewer errors, and demonstrating smoother manipulator control, as reflected by reduced jerks and higher spatial precision. Also, experts maintained consistent performance even as task complexity increased, whereas novices experienced a sharp decline, particularly at higher difficulty levels. Questionnaire responses further revealed that novices experienced higher mental and physical demands, especially in unfamiliar tasks, while experts demonstrated higher concentration and arousal levels. Additionally, the study introduces gaze transition entropy (GTE) and stationary gaze entropy (SGE) metrics to quantify visual attention strategies, with experts exhibiting more focused, goal-oriented gaze patterns, while novices showed more erratic and inefficient behaviour. These findings highlight both quantitative and qualitative measures as critical for evaluating operator performance and informing future teleoperation training programs. Full article
(This article belongs to the Special Issue Haptic Systems and the Tactile Internet: Design and Applications)
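The GTE and SGE metrics follow standard entropy definitions over areas of interest (AOIs). The sketch below computes both from a fixation sequence using the usual Shannon and conditional-entropy formulas; the paper's exact preprocessing is not reproduced.

```python
# Illustrative computation of stationary gaze entropy (SGE) and gaze transition
# entropy (GTE) from a sequence of fixated areas of interest (AOIs).
import numpy as np
from collections import Counter


def gaze_entropies(aoi_sequence):
    aois = sorted(set(aoi_sequence))
    idx = {a: i for i, a in enumerate(aois)}
    n = len(aois)

    # Stationary distribution over AOIs and its Shannon entropy (SGE).
    counts = Counter(aoi_sequence)
    p = np.array([counts[a] for a in aois], dtype=float)
    p /= p.sum()
    sge = -np.sum(p[p > 0] * np.log2(p[p > 0]))

    # First-order transition matrix between consecutive fixations.
    T = np.zeros((n, n))
    for a, b in zip(aoi_sequence[:-1], aoi_sequence[1:]):
        T[idx[a], idx[b]] += 1
    row_sums = T.sum(axis=1, keepdims=True)
    P = np.divide(T, row_sums, out=np.zeros_like(T), where=row_sums > 0)

    # GTE: entropy of transitions weighted by how often each AOI is the source.
    with np.errstate(divide='ignore', invalid='ignore'):
        logP = np.where(P > 0, np.log2(P), 0.0)
    gte = -np.sum(p[:, None] * P * logP)
    return sge, gte


print(gaze_entropies(['camera', 'gripper', 'camera', 'target', 'gripper', 'camera']))
```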

29 pages, 5776 KB  
Article
Intention Reasoning for User Action Sequences via Fusion of Object Task and Object Action Affordances Based on Dempster–Shafer Theory
by Yaxin Liu, Can Wang, Yan Liu, Wenlong Tong and Ming Zhong
Sensors 2025, 25(7), 1992; https://doi.org/10.3390/s25071992 - 22 Mar 2025
Viewed by 870
Abstract
To reduce the burden on individuals with disabilities when operating a Wheelchair Mounted Robotic Arm (WMRA), many researchers have focused on inferring users’ subsequent task intentions based on their “gazing” or “selecting” of scene objects. In this paper, we propose an innovative intention reasoning method for users’ action sequences by fusing object task and object action affordances based on Dempster–Shafer Theory (D-S theory). This method combines the advantages of probabilistic reasoning and visual affordance detection to establish an affordance model for objects and potential tasks or actions based on users’ habits and object attributes. This facilitates encoding object task (OT) affordance and object action (OA) affordance using D-S theory to perform action sequence reasoning. Specifically, the method includes three main aspects: (1) inferring task intentions from the object of user focus based on object task affordances encoded with Causal Probabilistic Logic (CP-Logic); (2) inferring action intentions based on object action affordances; and (3) integrating OT and OA affordances through D-S theory. Experimental results demonstrate that the proposed method reduces the number of interactions by an average of 14.085% compared to independent task intention inference and by an average of 52.713% compared to independent action intention inference. This demonstrates that the proposed method can capture the user’s real intention more accurately and significantly reduce unnecessary human–computer interaction. Full article
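Dempster's rule of combination, which the method uses to fuse object-task and object-action affordance evidence, can be written compactly. The sketch below implements the standard rule; the hypothesis sets and mass values are invented for illustration.

```python
# Minimal implementation of Dempster's rule of combination. The frames of
# discernment and mass values below are made up for illustration.
from itertools import product


def dempster_combine(m1, m2):
    """m1, m2: dicts mapping frozenset hypotheses to mass values."""
    combined, conflict = {}, 0.0
    for (a, ma), (b, mb) in product(m1.items(), m2.items()):
        inter = a & b
        if inter:
            combined[inter] = combined.get(inter, 0.0) + ma * mb
        else:
            conflict += ma * mb
    if conflict >= 1.0:
        raise ValueError("total conflict: masses cannot be combined")
    return {h: m / (1.0 - conflict) for h, m in combined.items()}


# Toy example: evidence about the next intended task from two affordance sources.
m_task = {frozenset({'pour'}): 0.6, frozenset({'pour', 'drink'}): 0.4}
m_action = {frozenset({'pour'}): 0.5, frozenset({'drink'}): 0.3,
            frozenset({'pour', 'drink'}): 0.2}
print(dempster_combine(m_task, m_action))
```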

24 pages, 21314 KB  
Article
RELAY: Robotic EyeLink AnalYsis of the EyeLink 1000 Using an Artificial Eye
by Anna-Maria Felßberg and Dominykas Strazdas
Vision 2025, 9(1), 18; https://doi.org/10.3390/vision9010018 - 1 Mar 2025
Cited by 1 | Viewed by 1718
Abstract
The impact of ambient brightness surroundings on the peak velocities of visually guided saccades remains a topic of debate in the field of eye-tracking research. While some studies suggest that saccades in darkness are slower than in light, others question this finding, citing inconsistencies influenced by factors such as pupil deformation during saccades, gaze position, or the measurement technique itself. To investigate these, we developed RELAY (Robotic EyeLink AnalYsis), a low-cost, stepper motor-driven artificial eye capable of simulating human saccades with controlled pupil, gaze directions, and brightness. Using the EyeLink 1000, a widely employed eye tracker, we assessed accuracy and precision across three illumination settings. Our results confirm the reliability of the EyeLink 1000, demonstrating no artifacts in pupil-based eye tracking related to brightness variations. This suggests that previously observed changes in peak velocities with varying brightness are likely due to human factors, warranting further investigation. However, we observed systematic deviations in measured pupil size depending on gaze direction. These findings emphasize the importance of reporting illumination conditions and gaze parameters in eye-tracking experiments to ensure data consistency and comparability. Our novel artificial eye provides a robust and reproducible platform for evaluating eye tracking systems and deepening our understanding of the human visual system. Full article
(This article belongs to the Section Visual Neuroscience)
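Accuracy and precision of the tracker are assessed while the artificial eye holds known gaze directions. The sketch below uses common eye-tracking definitions (mean spatial offset for accuracy, RMS sample-to-sample noise for precision), which may differ from the paper's exact formulas.

```python
# Illustrative accuracy/precision computation for gaze samples recorded while
# the artificial eye holds a known target direction.
import numpy as np


def accuracy_precision(samples_deg, target_deg):
    """samples_deg: (N, 2) gaze samples; target_deg: (2,) true direction, in degrees."""
    s = np.asarray(samples_deg, dtype=float)
    t = np.asarray(target_deg, dtype=float)
    accuracy = np.linalg.norm(s.mean(axis=0) - t)                        # mean spatial offset
    rms_s2s = np.sqrt(np.mean(np.sum(np.diff(s, axis=0) ** 2, axis=1)))  # precision (RMS-S2S)
    return accuracy, rms_s2s


samples = np.array([[10.1, 0.2], [10.0, 0.1], [9.9, 0.15], [10.05, 0.18]])
print(accuracy_precision(samples, target_deg=[10.0, 0.0]))
```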

25 pages, 9755 KB  
Article
Marker-Based Safety Functionality for Human–Robot Collaboration Tasks by Means of Eye-Tracking Glasses
by Enrico Masi, Nhu Toan Nguyen, Eugenio Monari, Marcello Valori and Rocco Vertechy
Machines 2025, 13(2), 122; https://doi.org/10.3390/machines13020122 - 6 Feb 2025
Viewed by 2152
Abstract
Human–robot collaboration (HRC) remains a growing trend in the robotics research field. Despite the widespread usage of collaborative robots on the market, several safety issues still need to be addressed to develop industry-ready applications exploiting the full potential of the technology. This paper focuses on hand-guiding applications, proposing an approach based on a wearable device to reduce the risk related to operator fatigue or distraction. The methodology aims at ensuring the operator’s attention during the hand guidance of a robot end effector in order to avoid injuries. This goal is achieved by detecting a region of interest (ROI) and checking that the gaze of the operator is kept within this area by means of a pair of eye-tracking glasses (Pupil Labs Neon, Berlin, Germany). The detection of the ROI is obtained primarily by the tracking camera of the glasses, which acquires the positions of predefined ArUco markers and thus the corresponding contour area. If one or more markers are not detected, their positions are estimated through the optical flow methodology. The performance of the proposed system is initially assessed with a motorized test bench simulating the rotation of the operator’s head in a repeatable way and then in an HRC scenario used as a case study. The tests show that the system can effectively identify a planar ROI in the context of an HRC application in real time. Full article
(This article belongs to the Section Automation and Control Systems)
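The safety check reduces to building an ROI contour from the detected marker positions and testing whether the gaze point lies inside it. The OpenCV sketch below assumes marker centres are already available from the scene camera (or the optical-flow fallback); coordinates are invented for illustration.

```python
# Sketch of the gaze-in-ROI safety check: build a region of interest from
# detected ArUco marker centres and test whether the current gaze point lies
# inside it. Marker detection itself is omitted; values are illustrative.
import numpy as np
import cv2


def gaze_inside_roi(marker_centres_px, gaze_px):
    """marker_centres_px: list of (x, y) marker centres; gaze_px: (x, y) gaze point."""
    pts = np.asarray(marker_centres_px, dtype=np.float32)
    roi = cv2.convexHull(pts)                       # contour enclosing the markers
    # >= 0 means inside or on the boundary of the ROI polygon.
    return cv2.pointPolygonTest(roi, tuple(map(float, gaze_px)), False) >= 0


markers = [(100, 100), (540, 110), (530, 380), (110, 390)]
print(gaze_inside_roi(markers, (320, 240)))   # True  -> operator attentive
print(gaze_inside_roi(markers, (620, 50)))    # False -> trigger a safety response
```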

18 pages, 2412 KB  
Article
Infants Display Anticipatory Gaze During a Motor Contingency Paradigm
by Marcelo R. Rosales, José Carlos Pulido, Carolee Winstein, Nina S. Bradley, Maja Matarić and Beth A. Smith
Sensors 2025, 25(3), 844; https://doi.org/10.3390/s25030844 - 30 Jan 2025
Viewed by 1553
Abstract
Background: Examining visual behavior during a motor learning paradigm can enhance our understanding of how infants learn motor skills. The aim of this study was to determine if infants who learned a contingency visually anticipated the outcomes of their behavior. Methods: 15 infants (6–9 months of age) participated in a contingency learning paradigm. When an infant produced a right leg movement, a robot provided reinforcement by clapping. Three types of visual gaze events were identified: predictive, reactive, and not looking. An exploratory analysis examined the trends in visual-motor behavior that can be used to inform future questions and practices in contingency learning studies. Results: All classically defined learners visually anticipated robot activation at greater than random chance (W = 21; p = 0.028). Specifically, all but one learner displayed a distribution of gaze timing identified as predictive (skewness: 0.56–2.42), with the median timing preceding robot activation by 0.31 s (range: −0.40–0.18 s). Conclusions: Findings suggest that most learners displayed visual anticipation within the first minutes of performing the paradigm. Further, the classical definition of learning a contingency paradigm in infants can be sharpened to improve the design of contingency learning studies and advance understanding of the processes infants use to learn motor skills. Full article
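Labelling gaze events as predictive, reactive, or not looking amounts to comparing gaze onset with robot activation time. The sketch below shows one such rule; the reactive window is an assumption, not the study's coding scheme.

```python
# Illustrative labelling of gaze events relative to robot activation: a look at
# the robot before it claps is "predictive", shortly after is "reactive",
# otherwise "not looking". The time window is an assumed value.
def classify_gaze_event(gaze_onset_s, robot_onset_s, reactive_window_s=1.0):
    if gaze_onset_s is None:
        return 'not looking'
    dt = gaze_onset_s - robot_onset_s
    if dt < 0:
        return 'predictive'                    # gaze preceded robot activation
    if dt <= reactive_window_s:
        return 'reactive'                      # gaze followed activation closely
    return 'not looking'


for onset in (-0.31, 0.4, None, 2.5):
    print(onset, '->', classify_gaze_event(onset, robot_onset_s=0.0))
```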

18 pages, 8219 KB  
Article
Evolution of the “4-D Approach” to Dynamic Vision for Vehicles
by Ernst Dieter Dickmanns
Electronics 2024, 13(20), 4133; https://doi.org/10.3390/electronics13204133 - 21 Oct 2024
Viewed by 1625
Abstract
Spatiotemporal models for the 3-D shape and motion of objects allowed large progress in the 1980s in visual perception of moving objects observed from a moving platform. Despite the successes demonstrated with several vehicles, the “4-D approach” has not been accepted generally. Its advantage is that only the last image of the sequence needs to be analyzed in detail to allow the full state vectors of moving objects, including their velocity components, to be reconstructed by the feedback of prediction errors. The vehicle carrying the cameras can, thus, together with conventional measurements, directly create a visualization of the situation encountered. In 1994, at the final demonstration of the project PROMETHEUS, two sedan vehicles using this approach were the only ones worldwide capable of driving autonomously in standard heavy traffic on three-lane Autoroutes near Paris at speeds up to 130 km/h (convoy driving, lane changes, passing). Up to ten vehicles nearby could be perceived. In this paper, the three-layer architecture of the perception system is reviewed. At the end of the 1990s, the system evolved from mere recognition of objects in motion, to understanding complex dynamic scenes by developing behavioral capabilities, like fast saccadic changes in the gaze direction for flexible concentration on objects of interest. By analyzing motion of objects over time, the situation for decision making was assessed. In the third-generation system “EMS-vision” behavioral capabilities of agents were represented on an abstract level for characterizing their potential behaviors. These maneuvers form an additional knowledge base. The system has proven capable of driving in networks of minor roads, including off-road sections, with avoidance of negative obstacles (ditches). Results are shown for road vehicle guidance. Potential transitions to a robot mind and to the now-favored CNN are touched on. Full article
(This article belongs to the Special Issue Advancement on Smart Vehicles and Smart Travel)
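The core of the 4-D approach, processing only the newest measurement and letting prediction errors update the full state vector including unmeasured velocities, is the recursive estimation pattern sketched below as a minimal constant-velocity Kalman filter with arbitrary noise settings.

```python
# Minimal constant-velocity Kalman filter: only the newest measurement is
# processed, and the prediction error (innovation) updates the full state,
# including the unmeasured velocity. Noise values are illustrative.
import numpy as np

dt = 0.04                                  # 25 Hz video cycle
F = np.array([[1.0, dt], [0.0, 1.0]])      # state transition: [position, velocity]
H = np.array([[1.0, 0.0]])                 # only position is measured in the image
Q = np.diag([1e-4, 1e-2])                  # process noise
R = np.array([[0.05]])                     # measurement noise

x = np.array([[0.0], [0.0]])               # initial state estimate
P = np.eye(2)

true_velocity = 2.0
rng = np.random.default_rng(0)
for k in range(1, 51):
    z = np.array([[true_velocity * k * dt + rng.normal(0, 0.2)]])  # noisy measurement
    # Predict, then correct with the prediction error.
    x, P = F @ x, F @ P @ F.T + Q
    innovation = z - H @ x
    S = H @ P @ H.T + R
    K = P @ H.T @ np.linalg.inv(S)
    x = x + K @ innovation
    P = (np.eye(2) - K @ H) @ P

print("estimated velocity:", float(x[1]))   # converges towards true_velocity
```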

18 pages, 5107 KB  
Article
Perceptive Recommendation Robot: Enhancing Receptivity of Product Suggestions Based on Customers’ Nonverbal Cues
by Masaya Iwasaki, Akiko Yamazaki, Keiichi Yamazaki, Yuji Miyazaki, Tatsuyuki Kawamura and Hideyuki Nakanishi
Biomimetics 2024, 9(7), 404; https://doi.org/10.3390/biomimetics9070404 - 2 Jul 2024
Cited by 3 | Viewed by 1606
Abstract
Service robots that coexist with humans in everyday life have become more common, and they have provided customer service in physical shops around the world in recent years. However, their potential in effective sales strategies has not been fully realized due to their low social presence. This study aims to clarify what kind of robot behavior enhances the social presence of service robots and how it affects human–robot interaction and purchasing behavior. We conducted two experiments with a sales robot, Pepper, at a retail shop in Kyoto. In Experiment 1, we showed that the robot’s social presence increased and that customers looked at the robot longer when the robot understood human gaze information and was capable of shared attention. In Experiment 2, we showed that the probability of customers picking up products increased when the robot suggested products based on the humans’ degree of attention from gaze and posture information. These results indicate that the robot’s ability to understand and make utterances about a customer’s orientation and attention effectively enhances human–robot communication and purchasing motivation. Full article
(This article belongs to the Special Issue Intelligent Human-Robot Interaction: 2nd Edition)

20 pages, 8912 KB  
Article
Implementation of Engagement Detection for Human–Robot Interaction in Complex Environments
by Sin-Ru Lu, Jia-Hsun Lo, Yi-Tian Hong and Han-Pang Huang
Sensors 2024, 24(11), 3311; https://doi.org/10.3390/s24113311 - 22 May 2024
Cited by 6 | Viewed by 2635
Abstract
This study develops a comprehensive robotic system, termed the robot cognitive system, for complex environments, integrating three models: the engagement model, the intention model, and the human–robot interaction (HRI) model. The system aims to enhance the naturalness and comfort of HRI by enabling robots to detect human behaviors, intentions, and emotions accurately. A novel dual-arm-hand mobile robot, Mobi, was designed to demonstrate the system’s efficacy. The engagement model utilizes eye gaze, head pose, and action recognition to determine the suitable moment for interaction initiation, addressing potential eye contact anxiety. The intention model employs sentiment analysis and emotion classification to infer the interactor’s intentions. The HRI model, integrated with Google Dialogflow, facilitates appropriate robot responses based on user feedback. The system’s performance was validated in a retail environment scenario, demonstrating its potential to improve the user experience in HRIs. Full article
(This article belongs to the Special Issue Emotion Recognition Technologies in Human-Machine Interaction)
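The engagement model combines eye gaze, head pose, and action recognition to pick a suitable moment to initiate interaction. The toy rule below illustrates that kind of decision; the thresholds and action labels are invented and do not correspond to the system's trained models.

```python
# Toy rule-based engagement check combining gaze, head pose, and a recognised
# action. Thresholds and labels are invented for illustration only.
def should_initiate_interaction(gaze_on_robot, head_yaw_deg, action_label,
                                yaw_limit_deg=30.0):
    facing_robot = abs(head_yaw_deg) <= yaw_limit_deg
    receptive = action_label in {'approaching', 'waving', 'standing_idle'}
    # Gaze is treated as optional so that eye-contact-averse users still
    # trigger engagement when body orientation and action agree.
    return (facing_robot and receptive) or (gaze_on_robot and receptive)


print(should_initiate_interaction(False, 12.0, 'approaching'))  # True
print(should_initiate_interaction(True, 80.0, 'walking_past'))  # False
```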

16 pages, 3532 KB  
Article
Robotics Perception: Intention Recognition to Determine the Handball Occurrence during a Football or Soccer Match
by Mohammad Mehedi Hassan, Stephen Karungaru and Kenji Terada
AI 2024, 5(2), 602-617; https://doi.org/10.3390/ai5020032 - 8 May 2024
Cited by 3 | Viewed by 3510
Abstract
In football or soccer, a referee controls the game based on the set rules. The decisions made by the referee are final and cannot be appealed. Some decisions, especially after a handball event, such as whether to award a penalty kick or a yellow/red card, can greatly affect the final result of a game. It is therefore necessary that the referee does not make an error. The objective is thus to create a system that can accurately recognize such events and make the correct decision. This study chose handball, an event that occurs in a football game (not to be confused with the sport of handball). We define a handball event using object detection and robotic perception and decide whether it is intentional or not, treating intention recognition as a robotic-perception task akin to emotion recognition. To define handball, we trained a model to detect the hand and the ball, which are the primary objects. We then determined intention using gaze recognition and finally combined the results to recognize a handball event. On our dataset, the results of the hand and ball object detection were 96% and 100%, respectively. With gaze recognition at 100%, when all objects were recognized, the intention and handball event recognition were also 100%. Full article
(This article belongs to the Section AI in Autonomous Systems)
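The final decision step combines the hand and ball detections with gaze-based intention. The sketch below shows one plausible combination rule using bounding-box overlap; the IoU threshold and the mapping from gaze to intent are assumptions, not the paper's procedure.

```python
# Sketch of a handball decision: flag a handball when hand and ball detections
# overlap, and use gaze to decide whether it looks intentional. Thresholds are
# illustrative assumptions.
def iou(box_a, box_b):
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    ix1, iy1 = max(ax1, bx1), max(ay1, by1)
    ix2, iy2 = min(ax2, bx2), min(ay2, by2)
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union else 0.0


def judge_handball(hand_box, ball_box, gaze_on_ball, contact_iou=0.05):
    if iou(hand_box, ball_box) < contact_iou:
        return 'no handball'
    return 'intentional handball' if gaze_on_ball else 'unintentional handball'


print(judge_handball((50, 50, 90, 120), (80, 100, 130, 150), gaze_on_ball=True))
```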

14 pages, 1508 KB  
Article
A Mouth and Tongue Interactive Device to Control Wearable Robotic Limbs in Tasks where Human Limbs Are Occupied
by Hongwei Jing, Tianjiao Zheng, Qinghua Zhang, Benshan Liu, Kerui Sun, Lele Li, Jie Zhao and Yanhe Zhu
Biosensors 2024, 14(5), 213; https://doi.org/10.3390/bios14050213 - 24 Apr 2024
Cited by 2 | Viewed by 3112
Abstract
The Wearable Robotic Limb (WRL) is a type of robotic arm worn on the human body, aiming to enhance the wearer’s operational capabilities. However, proposing additional methods to control and perceive the WRL when human limbs are heavily occupied with primary tasks presents a challenge. Existing interactive methods, such as voice, gaze, and electromyography (EMG), have limitations in control precision and convenience. To address this, we have developed an interactive device that utilizes the mouth and tongue. This device is lightweight and compact, allowing wearers to achieve continuous motion and contact force control of the WRL. By using a tongue controller and a mouth gas pressure sensor, wearers can control the WRL while also receiving sensitive contact feedback through changes in mouth pressure. To facilitate bidirectional interaction between the wearer and the WRL, we have devised an algorithm that divides WRL control into motion and force-position hybrid modes. To evaluate the performance of the device, we conducted an experiment with ten participants tasked with completing a pin-hole assembly task with the assistance of the WRL system. The results show that the device enables continuous control of the position and contact force of the WRL, with users perceiving feedback through mouth airflow resistance. However, the experiment also revealed some shortcomings of the device, including user fatigue and its impact on breathing. Experimental investigation showed that fatigue levels decrease with training, and the remaining limitations appear addressable through structural enhancements. Overall, our mouth and tongue interactive device shows promising potential for controlling the WRL during tasks where human limbs are occupied. Full article
(This article belongs to the Special Issue Devices and Wearable Devices toward Innovative Applications)
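The control algorithm splits WRL operation into a motion mode and a force-position hybrid mode driven by mouth pressure. The sketch below illustrates that mode split; thresholds, gains, and the command format are assumptions rather than the device's actual control law.

```python
# Sketch of splitting wearable-robotic-limb control into a motion mode and a
# force-position hybrid mode driven by mouth pressure. All values are
# illustrative assumptions.
def wrl_command(tongue_xy, mouth_pressure_kpa, contact_threshold_kpa=2.0,
                motion_gain=0.02, force_gain=0.5):
    """tongue_xy: (x, y) tongue-controller deflection in [-1, 1]."""
    if mouth_pressure_kpa < contact_threshold_kpa:
        # Free-space motion mode: tongue deflection sets end-effector velocity.
        return {'mode': 'motion',
                'velocity': (motion_gain * tongue_xy[0], motion_gain * tongue_xy[1])}
    # In contact: force-position hybrid mode, where pressure above the threshold
    # sets the commanded contact force while the tongue adjusts in-plane position.
    return {'mode': 'force-position',
            'force': force_gain * (mouth_pressure_kpa - contact_threshold_kpa),
            'position_offset': (motion_gain * tongue_xy[0], motion_gain * tongue_xy[1])}


print(wrl_command((0.4, -0.1), 1.0))
print(wrl_command((0.1, 0.0), 4.5))
```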
