Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (330)

Search Parameters:
Keywords = cameras on mobile devices

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
13 pages, 2745 KB  
Perspective
Clinical Use of Infrared Thermography: Where Are We and Where Are We Going
by Agnieszka Wnuk-Scardaccione and Jan Bilski
Medicina 2026, 62(6), 1204; https://doi.org/10.3390/medicina62061204 (registering DOI) - 22 Jun 2026
Abstract
Medical infrared thermography, which involves the use of infrared thermal cameras for the non-invasive assessment of skin surface temperature distribution, has gained increasing interest in recent years as a tool supporting diagnosis and treatment monitoring. The aim of this article is to present [...] Read more.
Medical infrared thermography, which involves the use of infrared thermal cameras for the non-invasive assessment of skin surface temperature distribution, has gained increasing interest in recent years as a tool supporting diagnosis and treatment monitoring. The aim of this article is to present the historical background and critically reassess the current role of infrared thermography in medicine, with particular emphasis on standardization as a key determinant of its clinical utility. This Perspective highlights the fundamental impact of methodological variability on diagnostic performance and reproducibility. A structured framework for standardization is proposed, encompassing patient preparation, environmental conditions, device parameters and calibration, image acquisition protocols, region-of-interest definition and analysis, as well as reporting and clinical interpretation. The analysis demonstrates how inconsistencies at each of these levels reduce measurement reliability, limit inter-study comparability, and weaken clinical confidence in infrared thermography. The article also addresses the growing availability of mobile thermal imaging systems and their integration with artificial intelligence, while emphasizing the need for stronger evidence-based support across all methodological domains. The presented analysis suggests that, despite existing limitations, medical infrared thermography holds considerable potential as a supportive clinical tool. However, its broader clinical implementation remains limited by several factors, with the lack of standardized protocols constituting a major and practically addressable translational barrier. Wider adoption will require standardization efforts alongside rigorous validation studies and application-specific interpretative guidelines. Addressing these challenges through technological advances and coordinated international standardization may facilitate meaningful progress over the next decade. Full article
Show Figures

Figure 1

16 pages, 1810 KB  
Article
Gaze Tracking- and Facial Movement-Driven Human–Computer Interaction System
by Yue Liu, Yuxiang Li, Lu Leng and Cheonshik Kim
Appl. Sci. 2026, 16(11), 5653; https://doi.org/10.3390/app16115653 - 4 Jun 2026
Viewed by 225
Abstract
With the development of human–computer interaction technology, non-contact interaction based on gaze tracking and facial movements has become a research hotspot. Traditional mouse-and-keyboard methods pose challenges for people with disabilities or limited hand movements, while existing gaze-tracking systems often rely on expensive hardware [...] Read more.
With the development of human–computer interaction technology, non-contact interaction based on gaze tracking and facial movements has become a research hotspot. Traditional mouse-and-keyboard methods pose challenges for people with disabilities or limited hand movements, while existing gaze-tracking systems often rely on expensive hardware or lack sufficient accuracy. This paper designs and implements a real-time system using ordinary cameras, achieving natural, efficient interaction via multimodal input combination. The system uses an improved MobileNetV2 backbone to construct GazeTrackNet for gaze estimation. It adopts MediaPipe Face Mesh to detect facial landmarks. Meanwhile, it applies geometric feature analysis, including eye aspect ratio and mouth aspect ratio, to identify actions such as blinking and mouth opening. It adopts a hybrid control strategy that combines gaze jumping and head fine-tuning, using mouth state as the main control switch. Key contributions include a lightweight gaze-tracking algorithm that enables stable and efficient gaze detection on consumer-grade hardware, a multimodal interaction strategy based on facial movement that improves system stability and ease of use, and a complete prototype system that achieves real-time performance on standard laptops. Experimental results show an average gaze average angle error of 3.0°, 97% eye state recognition accuracy, and end-to-end latency below 70 ms. The system can satisfy the requirements of daily desktop interaction under normal indoor lighting, and shows potential for future barrier-free interaction applications after further validation with target users. Existing gaze-tracking methods either suffer from low precision on lightweight devices or bring heavy computational overhead. Common facial recognition approaches also face frequent false trigger interference. Compared with them, our scheme achieves balanced accuracy and real-time performance via an attention-enhanced structure, and the designed dual anti-shake mechanism effectively suppresses misjudgment, delivering a more stable hands-free interaction experience. Full article
(This article belongs to the Special Issue Image Processing: Technologies, Methods, Apparatus)
Show Figures

Figure 1

23 pages, 22564 KB  
Article
A Multi-Module Fusion Framework for Restoring Human and Machine Vision Quality in Compressed Video
by Keren He, Kun Xiang, Yufei Gao, Yang Yu and Jinjia Zhou
Sensors 2026, 26(11), 3494; https://doi.org/10.3390/s26113494 - 1 Jun 2026
Viewed by 299
Abstract
With the increasing demand for video processing in both human perception and machine vision applications, enhancing heavily compressed video has become a critical problem in practical multimedia systems. In many real-world scenarios, video data acquired by image sensors are often compressed for efficient [...] Read more.
With the increasing demand for video processing in both human perception and machine vision applications, enhancing heavily compressed video has become a critical problem in practical multimedia systems. In many real-world scenarios, video data acquired by image sensors are often compressed for efficient transmission and storage, which introduces compression artifacts and degrades both visual quality and downstream task performance. This issue is especially significant in sensor-based systems such as surveillance cameras and mobile imaging devices. To address these challenges, we propose a novel joint human–machine video enhancement framework for compressed video enhancement that jointly targets human perceptual quality and machine vision performance. The framework integrates four complementary components: a Spatio-Temporal Fusion Module that leverages inter-frame correlations, a High-Frequency Semantic Fusion module for recovering structurally important details relevant to machine tasks, a Texture-Guided Model that enhances low-level visual features, and a Refined Attention Residual Quality Enhancement Module that adaptively emphasizes salient regions. By progressively combining these modules, the framework effectively restores compressed content while preserving task-relevant semantics. The experimental results demonstrate that our method consistently outperforms existing approaches, achieving higher PSNR and SSIM as well as improved object detection and video object segmentation performance. These results highlight the framework’s practical applicability for compressed video enhancement in sensor-based systems, including intelligent surveillance and autonomous imaging platforms. Full article
(This article belongs to the Special Issue Advances in Learning-Based Sensing-Driven Multimedia Processing)
Show Figures

Figure 1

13 pages, 945 KB  
Article
Application of Smart Sensors in Commodity Management
by Chao-Kong Chung, Meng-Yun Chung and Guo-Ming Sung
Sensors 2026, 26(10), 3096; https://doi.org/10.3390/s26103096 - 14 May 2026
Viewed by 338
Abstract
Integrating sensors with wireless communication capabilities into smart wireless sensing devices allows us to form a wireless sensing network. This network works in conjunction with monitors to display and control parameters at different locations or in the environment. By deploying a wireless sensing [...] Read more.
Integrating sensors with wireless communication capabilities into smart wireless sensing devices allows us to form a wireless sensing network. This network works in conjunction with monitors to display and control parameters at different locations or in the environment. By deploying a wireless sensing network, the system can interact with the user by sending notifications when necessary, based on the environmental conditions and user activities detected by the wireless sensors, and make corresponding adjustments to or control the environment. The advancement and widespread adoption of the internet have enabled the development of this technology. Wireless sensors are widely used in product positioning and environmental monitoring management, making the management of complex products more accurate. The Monitor and Control System (MCS), which combines network cameras and wireless sensors with neural network technology and fuzzy control systems, improves the existing positioning method and enhances positioning accuracy. Product management, which comprises comprehensive digital services and is facing serious staff shortages, has turned to digital payment to reduce labor costs. This experiment was simulated using Network Simulator 2 (NS2). In the sensing system part, the application of a ZigBee network and its status were explored, and interference was analyzed. Information on network interference simulations and their impact on normal services was compiled for network management purposes. Using NS2 network simulation, this study utilizes ZigBee with different neuron nodes and different training times to find the best network model, compares various queuing mechanisms and functions as a network interference intrusion detection system, and explores its node defense capabilities in cases of interference. Node Density: Node density is typically determined by the number of nodes in the simulation area and the size of the scene. Low Density: Sparse node distribution, prone to network partitioning, is suitable for testing latency-tolerant networks (DTNs) or route discovery capabilities. High Density: It entails dense node distribution, severe signal interference, and packet collisions. It is suitable for testing MAC layer collision prevention mechanisms (such as CSMA/CA) and the scalability of outing protocols. Configuration Method: the “set Dest” tool is used in a Tcl script to generate a mobile scene file, defining the number of nodes, range (X, Y), and time to be more significant in product management. Full article
(This article belongs to the Topic AI Sensors and Transducers)
Show Figures

Figure 1

15 pages, 1595 KB  
Article
Vision-Guided Precision Tool Alignment and Target Contact for a Mobile Manipulator Using YOLO Detection and Depth-Based 3D Localization
by Yanyan Dai and KiDong Lee
Electronics 2026, 15(9), 1890; https://doi.org/10.3390/electronics15091890 - 29 Apr 2026
Viewed by 430
Abstract
Precision alignment and target contact are critical tasks for mobile manipulators in industrial inspection and flexible manufacturing. However, achieving high accuracy after navigation remains challenging due to accumulated errors from mobile base localization, perception noise, and calibration uncertainty. This paper proposes a vision-guided [...] Read more.
Precision alignment and target contact are critical tasks for mobile manipulators in industrial inspection and flexible manufacturing. However, achieving high accuracy after navigation remains challenging due to accumulated errors from mobile base localization, perception noise, and calibration uncertainty. This paper proposes a vision-guided precision alignment framework for mobile manipulators using a single front-facing RGB-D camera. The method integrates YOLO-based target detection, AR marker-assisted plane depth estimation, and depth-based 3D localization within a coarse-to-fine alignment strategy. After navigation, the manipulator first moves to a predefined pre-alignment pose, followed by visual localization and iterative refinement to compensate for residual errors before executing precise target contact. The proposed system is implemented and evaluated in a Gazebo-based simulation environment using a mobile manipulator platform model. In a static touch panel experiment with 50 trials, the system achieves a success rate of 98%, with positioning errors maintained within a millimeter-level range. Simulation results demonstrate that the proposed method provides stable alignment performance in the simulation environment without relying on external sensing devices such as force sensors or multi-camera systems. The proposed approach shows promising potential for precision contact tasks in mobile manipulation. Full article
(This article belongs to the Special Issue Nonlinear Analysis and Control of Electronic Systems)
Show Figures

Figure 1

7 pages, 17541 KB  
Proceeding Paper
SCALEeat: Vision-Guided Food Scale for Automated Macronutrient Estimation
by Angelo Pamis Alcontin, Charls Gerald De Gala Correa and Julius Tube Sese
Eng. Proc. 2026, 134(1), 83; https://doi.org/10.3390/engproc2026134083 - 28 Apr 2026
Viewed by 599
Abstract
SCALEeat, a self-contained smart food scale, was developed to offer a convenient solution and replace manual logging with on-device recognition and weighing. The device integrated a Raspberry Pi 5, a camera, and a load cell, identifies foods and computes calories, carbohydrates, protein, and [...] Read more.
SCALEeat, a self-contained smart food scale, was developed to offer a convenient solution and replace manual logging with on-device recognition and weighing. The device integrated a Raspberry Pi 5, a camera, and a load cell, identifies foods and computes calories, carbohydrates, protein, and fat from measured weight through the Philippine Food Composition Tables (PhilFCT). Using transfer learning, a MobileNetV3-Large model trained on 25 commonly consumed items from ENNS, this achieved a 97.33% top-1 accuracy on a 300-image test set. Deployed on the prototype, SCALEeat achieved 93.60% accuracy, demonstrating practical accuracy and a lower-friction path to routine dietary assessment. Full article
Show Figures

Figure 1

27 pages, 17739 KB  
Article
3D Radiometric Thermography Mosaics with Low-Cost Mobile Sensor Stack
by Scott McAvoy, Jonathan Klingspon, Adrian Tong, Eric Lo, Nathan Hui, Maurizio Seracini, Dominique Rissolo, Neal Driscoll and Falko Kuester
Remote Sens. 2026, 18(9), 1335; https://doi.org/10.3390/rs18091335 - 27 Apr 2026
Viewed by 494
Abstract
Infrared thermography provides key information for a wide range of diagnostic applications within built and natural environments. As thermal states are changing with ambient conditions, it is important to deploy thermal imaging systems and operators opportunistically. It is therefore an attractive proposition to [...] Read more.
Infrared thermography provides key information for a wide range of diagnostic applications within built and natural environments. As thermal states are changing with ambient conditions, it is important to deploy thermal imaging systems and operators opportunistically. It is therefore an attractive proposition to make these systems more affordable and accessible. Low-cost thermal sensors generally produce low-resolution outputs. To increase data density across large subjects, diagnosticians may create image mosaics from multiple overlapping thermographs. The registration of individual inputs into large mosaics is aided by the acquisition of additional sensor data (photographs and depthmaps), which can provide critical spatial references. In many cases, the materials inherent to the modern built environment present challenges to traditional data registration workflows between multiple sensor streams. Mobile devices offer an opportunity to innovate in the creation of these mosaics, integrating rapid geospatial mapping functionality with radiometric thermography within a 3D context. In this paper the authors evaluate the FLIR One Pro thermal camera module along with iOS/iPhone specific rapid mapping capabilities, and present a methodology: (1) introducing a workflow for the integration of short-range (within 0.3–5 m capture distance) iPhone mobile sensor data into modeling pipelines; (2) introducing a calibration model enabling effective registration and fusion of multi-modal inputs from the iPhone mobile sensor stack and FLIR One thermographic module; and (3) detailing an alternative open-source methodology for the evaluation and translation of thermographic imagery for multi-sensor fusion. The end product of this pipeline is a 3D radiometric thermographic mosaic: a spatially continuous, textured surface model in which hundreds of individual low-resolution thermographs are fused into a single queryable output retaining full 16-bit temperature values at every point. All datasets have been made openly available and the two case studies used in this paper have been made accessible at full resolution for interactive 3D online viewing. Full article
(This article belongs to the Special Issue Remote Sensing for 2D/3D Mapping)
Show Figures

Figure 1

25 pages, 37592 KB  
Article
Deep-Learning-Based Mobile Application for Real-Time Recognition of Cultural Artifacts in Museum Environments
by Pablo Minango, Marcelo Zambrano, Carmen Inés Huerta Suarez and Juan Minango
Appl. Sci. 2026, 16(9), 4064; https://doi.org/10.3390/app16094064 - 22 Apr 2026
Viewed by 713
Abstract
Dissemination and conservation of cultural heritage have been challenged by continued accessibility in museums, where traditional information delivery systems are at times ineffective in terms if interaction with visitors. The current paper investigates RumiArt IA, a mobile application, to identify cultural objects in [...] Read more.
Dissemination and conservation of cultural heritage have been challenged by continued accessibility in museums, where traditional information delivery systems are at times ineffective in terms if interaction with visitors. The current paper investigates RumiArt IA, a mobile application, to identify cultural objects in real-time, remaining fully in the scope of this line of research without relying on internet connectivity. The system, which is developed based on the Rumiñahui Museum and Cultural Center, Ecuador, uses transfer learning in the MobileNetV2 architecture with INT8 post-training quantization to identify 21 cultural artifacts spread across six thematic rooms. The experiment involved building a dataset of 36,000 images under diverse lighting conditions, viewing angles, and distances; furthermore, artificial transformations were explicitly crafted to simulate real museum conditions such as glass reflections and non-frontal capture angles. Quantization was used to reduce each model to 775 KB as compared with the 2.4 MB, with accuracy loss not reaching more than 0.5 percent (DKL < 0.05). Assessment of 9450 validation images yielded a general accuracy of 92.2%, with an inference time of 63 ms on current devices with a high throughput and 215 ms on mid-range hardware from 2020. Practical validation involving 50 visitors of the museum showed a success rate of 93.7%, with average user satisfaction at 8.5/10 and 87%, indicating they would recommend the application. An in-depth error study of the most difficult room (88.3% accuracy) indicated that 47% of the errors were due to the angles of the camera, which blocked out distinguishing features, and 22% were caused by display case reflections and the shadows of the visitors. These results indicate that end-to-end machine learning can provide consistent cultural heritage recognition in resource-constrained settings but its efficiency is susceptible to physical capture factors that cannot be resolved by data augmentation. Offline mode and low memory footprint (less than 90 MB when loaded on six models) of the system are especially relevant to application in situations where there is no guarantee of cloud connectivity. Full article
(This article belongs to the Special Issue Intelligent Interaction in Cultural Heritage)
Show Figures

Figure 1

35 pages, 8415 KB  
Article
Research on Three-Dimensional Positioning Method for Automatic Strawberry Fruit Picking Based on Vision–IMU Fusion
by Bowen Liu, Chuhan Chen, Junqiu Li, Qinghui Zhang and Yinghao Meng
Agriculture 2026, 16(8), 893; https://doi.org/10.3390/agriculture16080893 - 17 Apr 2026
Viewed by 1173
Abstract
Accurate fruit localization and efficient harvesting are key challenges for agricultural robots, especially in dynamic orchard environments, where platform vibration, fruit occlusion, and computational resource limitations of embedded devices significantly impact system performance. To address these issues, this paper proposes a lightweight “fruit [...] Read more.
Accurate fruit localization and efficient harvesting are key challenges for agricultural robots, especially in dynamic orchard environments, where platform vibration, fruit occlusion, and computational resource limitations of embedded devices significantly impact system performance. To address these issues, this paper proposes a lightweight “fruit detection + harvesting” framework. First, by integrating MobileNetV4 and Triplet Attention mechanisms, an improved YOLOv8n network is designed, with the improved YOLOv8n Precision reaching 98.148% and FPS reaching 30 FPS on Jetson Nano, achieving a good balance between detection accuracy and computational efficiency suitable for edge deployment. Second, a strawberry three-dimensional coordinate reconstruction method based on weighted 3D centroid reconstruction is proposed, utilizing depth bias adjustment coefficients to improve spatial accuracy. Third, to address localization errors caused by vibration and platform motion, a dynamic compensation and temporal fusion strategy based on an Inertial Measurement Unit (IMU) is proposed. The rotation matrix estimated from IMU data is first used to correct camera pose variations. Then, an adaptive sliding window is employed to smooth the coordinate sequence. Finally, an Extended Kalman Filter (EKF) is applied to further refine the fused results by incorporating temporal dynamics, ensuring that the reconstructed three-dimensional coordinates in the robotic arm reference frame achieve higher stability and continuity. Experimental results in orchard scenarios show that compared with traditional methods, the system has higher localization accuracy, stronger robustness to dynamic disturbances, and higher harvesting efficiency. This work provides a practical and deployable solution for advancing intelligent fruit-harvesting robots. Full article
(This article belongs to the Section Artificial Intelligence and Digital Agriculture)
Show Figures

Figure 1

28 pages, 28199 KB  
Article
Augmented Reality as a Tool for 5G Learning: Interactive Visualization of NSA/SA Architectures and Network Components
by Nathaly Orozco Garzón, David Herrera, Angel Gomez, Pablo Plaza, Henry Carvajal Mora, Roberto Sánchez Albán, José Vega-Sánchez and Paola Vinueza-Naranjo
Informatics 2026, 13(4), 58; https://doi.org/10.3390/informatics13040058 - 3 Apr 2026
Viewed by 2001
Abstract
The rapid advancement of digital and mobile technologies has reshaped the educational landscape, fostering the adoption of interactive and learner-centered methodologies. Among these, immersive technologies such as Augmented Reality (AR), when coupled with next-generation wireless communication systems, hold the potential to revolutionize knowledge [...] Read more.
The rapid advancement of digital and mobile technologies has reshaped the educational landscape, fostering the adoption of interactive and learner-centered methodologies. Among these, immersive technologies such as Augmented Reality (AR), when coupled with next-generation wireless communication systems, hold the potential to revolutionize knowledge acquisition and student engagement. In this paper, we present the design and development of an AR-based educational tool specifically oriented to teaching concepts of fifth-generation (5G) mobile networks. The tool provides a real-time interactive visualization of 3D network components on mobile devices, enabling learners to explore 5G NSA/SA architectures in an accessible manner with real-world environments through mobile devices and their integrated cameras. The application was developed using Blender for 3D modeling and Unity as the rendering engine, incorporating the Vuforia SDK for marker-based AR tracking, and it was deployed on the Android operating system. Unlike traditional static approaches, the proposed solution enables learners to explore complex network architectures and key functionalities of 5G in an interactive and accessible manner. To assess its perceived effectiveness, quantitative surveys were conducted with both university and high school students, focusing on usability, engagement, and perceived learning outcomes. Results indicate that the tool is user-friendly, enhances motivation, and supports conceptual understanding as perceived by participants of 5G technologies. These findings highlight the potential of AR, supported by advanced wireless networks, as a pedagogical strategy to improve STEM education and foster technological literacy in the era of digital transformation. Full article
Show Figures

Figure 1

11 pages, 4770 KB  
Data Descriptor
Pasture Plant’s Dataset
by Rafael Curado, Pedro Gonçalves, Maria R. Marques and Mário Antunes
Data 2026, 11(3), 63; https://doi.org/10.3390/data11030063 - 19 Mar 2026
Viewed by 995
Abstract
Identifying the plant species comprising a pasture, among other aspects, is crucial for assessing its nutritional value for grazing animals and facilitating its effective management. Traditionally, it requires labor-intensive visual inspection. Artificial Intelligence (AI) offers a solution for automatic classification, yet robust datasets [...] Read more.
Identifying the plant species comprising a pasture, among other aspects, is crucial for assessing its nutritional value for grazing animals and facilitating its effective management. Traditionally, it requires labor-intensive visual inspection. Artificial Intelligence (AI) offers a solution for automatic classification, yet robust datasets for training such models in natural, uncontrolled environments are scarce. This data descriptor presents a dataset of 741 images collected in pasture lands in the Centre of Portugal using standard cameras at a height of 50 cm. A semi-automated annotation pipeline was employed, utilizing a Faster R-CNN model followed by manual verification and refinement. The dataset contains 1744 annotations across four categories: ‘Shrubs’, ‘Grasses’, ‘Legumes’, and ‘Others’. It includes diverse morphological variations and captures real-world challenges such as occlusion and lighting variability. This dataset serves as a benchmark for training object detection models in agricultural settings, facilitating the development of automated monitoring systems for precision agriculture. Such a mechanism could be incorporated into a mobile application, mounted on a drone, or embedded in an animal-worn device, enabling automated sampling and identification of the plant composition within a pasture. Full article
Show Figures

Figure 1

13 pages, 2877 KB  
Article
Devising a Coaching Method for a Smartphone-Based Slit-Lamp Microscope and Its Learning Effects: A Pilot Study
by Hokuto Ubukata, Haruo Toda, Hiroki Nishimura, Shintaro Nakayama, Mai Nishio, Takahiro Mizukami, Kosei Tomita and Eisuke Shimizu
J. Clin. Med. 2026, 15(5), 1928; https://doi.org/10.3390/jcm15051928 - 3 Mar 2026
Viewed by 542
Abstract
Objectives: To develop an effective learning method for using a smartphone-based slit-lamp microscope (SBSL) and to identify key points to emphasize when coaching individuals with no prior SBSL experience. Methods: This study included 60 orthoptic students: 40 second-year students (control group: [...] Read more.
Objectives: To develop an effective learning method for using a smartphone-based slit-lamp microscope (SBSL) and to identify key points to emphasize when coaching individuals with no prior SBSL experience. Methods: This study included 60 orthoptic students: 40 second-year students (control group: 20, training group 1: 20) and 20 first-year students (training group 2). Subjects were instructed to record the anterior eye segment of a patient-role subject using the Smart Eye Camera. The control group was given paper instruction and was shown the demonstration of the SBSL beforehand. In addition, training groups 1 and 2 watched a tutorial video, practiced using the SBSL for 30 min, and received guidance from an expert. Four ophthalmologists evaluated the recordings based on the eyelid, conjunctiva, cornea, pupil including iris, lens, and anterior chamber depth. Results: ANOVAs showed significant differences among groups for all items. The control group had significantly lower scores than both training groups, while no significant differences were found between training groups 1 and 2. Principal component analysis of training groups 1 and 2 showed that the first principal component accounted for 74.36% of the variance. The second principal component accounted for 10.71%, with a wide range of loadings (anterior chamber depth of 0.7780 to conjunctiva of −0.5585), implying the existence of different favorite focusing depths within subjects. Conclusions: A coaching program consisting of tutorial video learning, a 30 min hands-on trial, and feedback is effective in helping individuals without an ophthalmological background acquire anterior segment imaging skills using SBSL. Comprehensive focusing across the entire anterior segment should also be emphasized. Full article
(This article belongs to the Section Ophthalmology)
Show Figures

Figure 1

20 pages, 3275 KB  
Article
Real-Time Emotion Recognition Performance of Mobile Devices: A Detailed Analysis of Camera and TrueDepth Sensors Using Apple’s ARKit
by Céline Madeleine Aldenhoven, Leon Nissen, Marie Heinemann, Cem Doğdu, Alexander Hanke, Stephan Jonas and Lara Marie Reimer
Sensors 2026, 26(3), 1060; https://doi.org/10.3390/s26031060 - 6 Feb 2026
Cited by 2 | Viewed by 1242
Abstract
Facial features hold information about a person’s emotions, motor function, or genetic defects. Since most current mobile devices are capable of real-time face detection using cameras and depth sensors, real-time facial analysis can be utilized in several mobile use cases. Understanding the real-time [...] Read more.
Facial features hold information about a person’s emotions, motor function, or genetic defects. Since most current mobile devices are capable of real-time face detection using cameras and depth sensors, real-time facial analysis can be utilized in several mobile use cases. Understanding the real-time emotion recognition capabilities of device sensors and frameworks is vital for developing new, valid applications. Therefore, we evaluated on-device emotion recognition using Apple’s ARKit on an iPhone 14 Pro. A native app elicited 36 blend shape-specific movements and 7 discrete emotions from N=31 healthy adults. Per frame, standardized ARKit blend shapes were classified using a prototype-based cosine similarity metric; performance was summarized as accuracy and area under the receiver operating characteristic curves. Cosine similarity achieved an overall accuracy of 68.3%, exceeding the mean of three human raters (58.9%; +9.4 percentage points, ≈16% relative). Per-emotion accuracy was highest for joy, fear, sadness, and surprise, and competitive for anger, disgust, and contempt. AUCs were ≥0.84 for all classes. The method runs in real time on-device using only vector operations, preserving privacy and minimizing compute. These results indicate that a simple, interpretable cosine-similarity classifier over ARKit blend shapes delivers human-comparable, real-time facial emotion recognition on commodity hardware, supporting privacy-preserving mobile applications. Full article
(This article belongs to the Section Optical Sensors)
Show Figures

Figure 1

16 pages, 12168 KB  
Article
Real-Time Segmentation of Tactile Paving and Zebra Crossings for Visually Impaired Assistance Using Embedded Visual Sensors
by Yiqiang Jiang, Shicheng Yan and Jianyang Liu
Sensors 2026, 26(3), 770; https://doi.org/10.3390/s26030770 - 23 Jan 2026
Viewed by 683
Abstract
This study aims to address the safety and mobility challenges faced by visually impaired individuals. To this end, a lightweight, high-precision semantic segmentation network is proposed for scenes containing tactile paving and zebra crossings. The network is successfully deployed on an intelligent guide [...] Read more.
This study aims to address the safety and mobility challenges faced by visually impaired individuals. To this end, a lightweight, high-precision semantic segmentation network is proposed for scenes containing tactile paving and zebra crossings. The network is successfully deployed on an intelligent guide robot equipped with a high-definition camera and a Huawei Atlas 310 embedded computing platform. To enhance both real-time performance and segmentation accuracy on resource-constrained devices, an improved G-GhostNet backbone is designed for feature extraction. Specifically, it is combined with a depthwise separable convolution-based Coordinate Attention module and a redesigned Atrous Spatial Pyramid Pooling (ASPP) module to capture multi-scale contextual features. A dedicated decoder efficiently fuses multi-level features to refine segmentation of tactile paving and zebra crossings. Experimental results demonstrate that the proposed model achieves mPA of 97% and 93%, mIoU of 94% and 86% for tactile paving and zebra crossing segmentation, respectively, with an inference speed of 59.2 fps. These results significantly outperform several mainstream semantic segmentation networks, validating the effectiveness and practical value of the proposed method in embedded systems for visually impaired travel assistance. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

21 pages, 9102 KB  
Article
A Lightweight Edge AI Framework for Adaptive Traffic Signal Control in Mid-Sized Philippine Cities
by Alex L. Maureal, Franch Maverick A. Lorilla and Ginno L. Andres
Sustainability 2026, 18(3), 1147; https://doi.org/10.3390/su18031147 - 23 Jan 2026
Viewed by 1848
Abstract
Mid-sized Philippine cities commonly rely on fixed-time traffic signal plans that cannot respond to short-term, demand-driven surges, resulting in measurable idle time at stop lines, increased delay, and unnecessary emissions, while adaptive signal control has demonstrated performance benefits, many existing solutions depend on [...] Read more.
Mid-sized Philippine cities commonly rely on fixed-time traffic signal plans that cannot respond to short-term, demand-driven surges, resulting in measurable idle time at stop lines, increased delay, and unnecessary emissions, while adaptive signal control has demonstrated performance benefits, many existing solutions depend on centralized infrastructure and high-bandwidth connectivity, limiting their applicability for resource-constrained local government units (LGUs). This study reports a field deployment of TrafficEZ, a lightweight edge AI signal controller that reallocates green splits locally using traffic-density approximations derived from cabinet-mounted cameras. The controller follows a macroscopic, cycle-level control abstraction consistent with Transportation System Models (TSMs) and does not rely on stationary flow–density–speed (fundamental diagram) assumptions. The system estimates queued demand and discharge efficiency on-device and updates green time each cycle without altering cycle length, intergreen intervals, or pedestrian safety timings. A quasi-experimental pre–post evaluation was conducted at three signalized intersections in El Salvador City using an existing 125 s, three-phase fixed-time plan as the baseline. Observed field results show average per-vehicle delay reductions of 18–32%, with reclaimed effective green translating into approximately 50–200 additional vehicles per hour served at the busiest approaches. Box-occupancy durations shortened, indicating reduced spillback risk, while conservative idle-time estimates imply corresponding CO2 savings during peak periods. Because all decisions run locally within the signal cabinet, operation remained robust during backhaul interruptions and supported incremental, intersection-by-intersection deployment; per-cycle actions were logged to support auditability and governance reporting. These findings demonstrate that density-driven edge AI can deliver practical mobility, reliability, and sustainability gains for LGUs while supporting evidence-based governance and performance reporting. Full article
Show Figures

Figure 1

Back to TopTop