Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (640)

Search Parameters:
Keywords = feature point tracks

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
28 pages, 5779 KiB  
Article
Regional Wave Spectra Prediction Method Based on Deep Learning
by Yuning Liu, Rui Li, Wei Hu, Peng Ren and Chao Xu
J. Mar. Sci. Eng. 2025, 13(8), 1461; https://doi.org/10.3390/jmse13081461 - 30 Jul 2025
Viewed by 166
Abstract
The wave spectrum, as a key statistical feature describing wave energy distribution, is crucial for understanding wave propagation mechanisms and supporting ocean engineering applications. This study, based on ERA5 reanalysis spectrum data, proposes a model combining CNN and xLSTM for rapid gridded wave [...] Read more.
The wave spectrum, as a key statistical feature describing wave energy distribution, is crucial for understanding wave propagation mechanisms and supporting ocean engineering applications. This study, based on ERA5 reanalysis spectrum data, proposes a model combining CNN and xLSTM for rapid gridded wave spectrum prediction over the Bohai and Yellow Seas domain. It uses 2D gridded spectrum data rather than a spectrum at specific points as input and analyzes the impact of various input factors at different time lags on wave development. The results show that incorporating water depth and mean sea level pressure significantly reduces errors. The model performs well across seasons with the seasonal spatial average root mean square error (SARMSE) of spectral energy remaining below 0.040 m2·s and RMSEs for significant wave height (SWH) and mean wave period (MWP) of 0.138 m and 1.331 s, respectively. At individual points, the spectral density bias is near zero, correlation coefficients range from 0.95 to 0.98, and the peak frequency RMSE is between 0.03 and 0.04 Hz. During a typical cold wave event, the model accurately reproduces the energy evolution and peak frequency shift. Buoy observations confirm that the model effectively tracks significant wave height trends under varying conditions. Moreover, applying a frequency-weighted loss function enhances the model’s ability to capture high-frequency spectral components, further improving prediction accuracy. Overall, the proposed method shows strong performance in spectrum prediction and provides a valuable approach for regional wave spectrum modeling. Full article
(This article belongs to the Section Physical Oceanography)
Show Figures

Figure 1

21 pages, 4095 KiB  
Article
GNSS-Based Multi-Target RDM Simulation and Detection Performance Analysis
by Jinxing Li, Qi Wang, Meng Wang, Youcheng Wang and Min Zhang
Remote Sens. 2025, 17(15), 2607; https://doi.org/10.3390/rs17152607 - 27 Jul 2025
Viewed by 341
Abstract
This paper proposes a novel Global Navigation Satellite System (GNSS)-based remote sensing method for simulating Radar Doppler Map (RDM) features through joint electromagnetic scattering modeling and signal processing, enabling characteristic parameter extraction for both point and ship targets in multi-satellite scenarios. Simulations demonstrate [...] Read more.
This paper proposes a novel Global Navigation Satellite System (GNSS)-based remote sensing method for simulating Radar Doppler Map (RDM) features through joint electromagnetic scattering modeling and signal processing, enabling characteristic parameter extraction for both point and ship targets in multi-satellite scenarios. Simulations demonstrate that the B3I signal achieves a significantly enhanced range resolution (tens of meters) compared to the B1I signal (hundreds of meters), attributable to its wider bandwidth. Furthermore, we introduce an Unscented Particle Filter (UPF) algorithm for dynamic target tracking and state estimation. Experimental results show that four-satellite configurations outperform three-satellite setups, achieving <10 m position error for uniform motion and <18 m for maneuvering targets, with velocity errors within ±2 m/s using four satellites. The joint detection framework for multi-satellite, multi-target scenarios demonstrates an improved detection accuracy and robust localization performance. Full article
Show Figures

Figure 1

14 pages, 1419 KiB  
Article
GhostBlock-Augmented Lightweight Gaze Tracking via Depthwise Separable Convolution
by Jing-Ming Guo, Yu-Sung Cheng, Yi-Chong Zeng and Zong-Yan Yang
Electronics 2025, 14(15), 2978; https://doi.org/10.3390/electronics14152978 - 25 Jul 2025
Viewed by 164
Abstract
This paper proposes a lightweight gaze-tracking architecture named GhostBlock-Augmented Look to Coordinate Space (L2CS), which integrates GhostNet-based modules and depthwise separable convolution to achieve a better trade-off between model accuracy and computational efficiency. Conventional lightweight gaze-tracking models often suffer from degraded accuracy due [...] Read more.
This paper proposes a lightweight gaze-tracking architecture named GhostBlock-Augmented Look to Coordinate Space (L2CS), which integrates GhostNet-based modules and depthwise separable convolution to achieve a better trade-off between model accuracy and computational efficiency. Conventional lightweight gaze-tracking models often suffer from degraded accuracy due to aggressive parameter reduction. To address this issue, we introduce GhostBlocks, a custom-designed convolutional unit that combines intrinsic feature generation with ghost feature recomposition through depthwise operations. Our method enhances the original L2CS architecture by replacing each ResNet block with GhostBlocks, thereby significantly reducing the number of parameters and floating-point operations. The experimental results on the Gaze360 dataset demonstrate that the proposed model reduces FLOPs from 16.527 × 108 to 8.610 × 108 and parameter count from 2.387 × 105 to 1.224 × 105 while maintaining comparable gaze estimation accuracy, with MAE increasing only slightly from 10.70° to 10.87°. This work highlights the potential of GhostNet-augmented designs for real-time gaze tracking on edge devices, providing a practical solution for deployment in resource-constrained environments. Full article
Show Figures

Figure 1

25 pages, 5055 KiB  
Article
FlickPose: A Hand Tracking-Based Text Input System for Mobile Users Wearing Smart Glasses
by Ryo Yuasa and Katashi Nagao
Appl. Sci. 2025, 15(15), 8122; https://doi.org/10.3390/app15158122 - 22 Jul 2025
Viewed by 341
Abstract
With the growing use of head-mounted displays (HMDs) such as smart glasses, text input remains a challenge, especially in mobile environments. Conventional methods like physical keyboards, voice recognition, and virtual keyboards each have limitations—physical keyboards lack portability, voice input has privacy concerns, and [...] Read more.
With the growing use of head-mounted displays (HMDs) such as smart glasses, text input remains a challenge, especially in mobile environments. Conventional methods like physical keyboards, voice recognition, and virtual keyboards each have limitations—physical keyboards lack portability, voice input has privacy concerns, and virtual keyboards struggle with accuracy due to a lack of tactile feedback. FlickPose is a novel text input system designed for smart glasses and mobile HMD users, integrating flick-based input and hand pose recognition. It features two key selection methods: the touch-panel method, where users tap a floating UI panel to select characters, and the raycast method, where users point a virtual ray from their wrist and confirm input via a pinch motion. FlickPose uses five left-hand poses to select characters. A machine learning model trained for hand pose recognition outperforms Random Forest and LightGBM models in accuracy and consistency. FlickPose was tested against the standard virtual keyboard of Meta Quest 3 in three tasks (hiragana, alphanumeric, and kanji input). Results showed that raycast had the lowest error rate, reducing unintended key presses; touch-panel had more deletions, likely due to misjudgments in key selection; and frequent HMD users preferred raycast, as it maintained input accuracy while allowing users to monitor their text. A key feature of FlickPose is adaptive tracking, which ensures the keyboard follows user movement. While further refinements in hand pose recognition are needed, the system provides an efficient, mobile-friendly alternative for HMD text input. Future research will explore real-world application compatibility and improve usability in dynamic environments. Full article
(This article belongs to the Special Issue Extended Reality (XR) and User Experience (UX) Technologies)
Show Figures

Figure 1

22 pages, 9247 KiB  
Article
Enhancing Restoration in Urban Waterfront Spaces: Environmental Features, Visual Behavior, and Design Implications
by Shiqin Zhou, Chang Lin and Quanle Huang
Buildings 2025, 15(14), 2567; https://doi.org/10.3390/buildings15142567 - 21 Jul 2025
Viewed by 249
Abstract
Urbanization poses mental health risks for urban dwellers, whereas natural environments offer mental health benefits by providing restorative experiences through visual stimuli. While urban waterfront spaces are recognized for their mental restorative potential, the specific environmental features and individual visual behaviors that drive [...] Read more.
Urbanization poses mental health risks for urban dwellers, whereas natural environments offer mental health benefits by providing restorative experiences through visual stimuli. While urban waterfront spaces are recognized for their mental restorative potential, the specific environmental features and individual visual behaviors that drive these benefits remain inadequately understood. Grounded in restorative environments theory, this study investigates how these factors jointly influence restoration. Employing a controlled laboratory experiment, subjects viewed real-life images of nine representative spatial locations from the waterfront space of Guangzhou Long Bund. Data collected during the multimodal experiments included subjective scales data (SRRS), physiological measurement data (SCR; LF/HF), and eye-tracking data. Key findings revealed the following: (1) The element visibility rate and visual characteristics of plant and building elements significantly influence restorative benefits. (2) Spatial configuration attributes (degree of enclosure, spatial hierarchy, and depth perception) regulate restorative benefits. (3) Visual behavior patterns (attributes of fixation points, fixation duration, and moderate dispersion of fixations) are significantly associated with restoration benefits. These findings advance the understanding of the mechanisms linking environmental stimuli, visual behavior, and psychological restorative benefits. They translate into evidence-based design principles for urban waterfront spaces. This study provides a refined perspective and empirical foundation for enhancing the restorative benefits of urban waterfront spaces through design. Full article
(This article belongs to the Section Architectural Design, Urban Science, and Real Estate)
Show Figures

Figure 1

20 pages, 5236 KiB  
Article
Leakage Detection in Subway Tunnels Using 3D Point Cloud Data: Integrating Intensity and Geometric Features with XGBoost Classifier
by Anyin Zhang, Junjun Huang, Zexin Sun, Juju Duan, Yuanai Zhang and Yueqian Shen
Sensors 2025, 25(14), 4475; https://doi.org/10.3390/s25144475 - 18 Jul 2025
Viewed by 344
Abstract
Detecting leakage using a point cloud acquired by mobile laser scanning (MLS) presents significant challenges, particularly from within three-dimensional space. These challenges primarily arise from the prevalence of noise in tunnel point clouds and the difficulty in accurately capturing the three-dimensional morphological characteristics [...] Read more.
Detecting leakage using a point cloud acquired by mobile laser scanning (MLS) presents significant challenges, particularly from within three-dimensional space. These challenges primarily arise from the prevalence of noise in tunnel point clouds and the difficulty in accurately capturing the three-dimensional morphological characteristics of leakage patterns. To address these limitations, this study proposes a classification method based on XGBoost classifier, integrating both intensity and geometric features. The proposed methodology comprises the following steps: First, a RANSAC algorithm is employed to filter out noise from tunnel objects, such as facilities, tracks, and bolt holes, which exhibit intensity values similar to leakage. Next, intensity features are extracted to facilitate the initial separation of leakage regions from the tunnel lining. Subsequently, geometric features derived from the k neighborhood are incorporated to complement the intensity features, enabling more effective segmentation of leakage from the lining structures. The optimal neighborhood scale is determined by selecting the scale that yields the highest F1-score for leakage across various multiple evaluated scales. Finally, the XGBoost classifier is applied to the binary classification to distinguish leakage from tunnel lining. Experimental results demonstrate that the integration of geometric features significantly enhances leakage detection accuracy, achieving an F1-score of 91.18% and 97.84% on two evaluated datasets, respectively. The consistent performance across four heterogeneous datasets indicates the robust generalization capability of the proposed methodology. Comparative analysis further shows that XGBoost outperforms other classifiers, such as Random Forest, AdaBoost, LightGBM, and CatBoost, in terms of balance of accuracy and computational efficiency. Moreover, compared to deep learning models, including PointNet, PointNet++, and DGCNN, the proposed method demonstrates superior performance in both detection accuracy and computational efficiency. Full article
(This article belongs to the Special Issue Application of LiDAR Remote Sensing and Mapping)
Show Figures

Figure 1

17 pages, 3180 KiB  
Article
Ensemble-Based Correction for Anomalous Diffusion Exponent Estimation in Single-Particle Tracking
by Roman Lavrynenko, Lyudmyla Kirichenko, Sergiy Yakovlev, Sophia Lavrynenko and Nataliya Ryabova
Appl. Sci. 2025, 15(14), 8000; https://doi.org/10.3390/app15148000 - 18 Jul 2025
Viewed by 210
Abstract
The analysis of anomalous diffusion characteristics within single-particle tracking data is a key problem in several applied-science domains, including biosignal processing, bioinformatics, and biotechnology. This task becomes particularly challenging in the presence of short trajectories, localization errors, and non-ergodicity, features that are common [...] Read more.
The analysis of anomalous diffusion characteristics within single-particle tracking data is a key problem in several applied-science domains, including biosignal processing, bioinformatics, and biotechnology. This task becomes particularly challenging in the presence of short trajectories, localization errors, and non-ergodicity, features that are common in real experimental data. To address these limitations, this work proposes an approach that improves the robustness and accuracy of estimating the anomalous diffusion exponent α, even for very short trajectories of up to 10 points. The approach includes an ensemble-based variance estimation of the exponent α, along with a bias correction based on time–ensemble averaged mean squared displacement, which reduces the systematic bias. These components integrate well into neural network architectures and are suitable for analyzing experimental trajectories in biotechnology and bioprocess engineering applications. Full article
(This article belongs to the Section Applied Biosciences and Bioengineering)
Show Figures

Figure 1

14 pages, 217 KiB  
Article
Narration as Characterization in First-Person Realist Fiction: Complicating a Universally Acknowledged Truth
by James Phelan
Humanities 2025, 14(7), 151; https://doi.org/10.3390/h14070151 - 16 Jul 2025
Viewed by 269
Abstract
I argue that the universally accepted assumption that in realist fiction a character narrator’s narration contributes to their characterization needs to be complicated. Working with a conception of narrative as rhetoric that highlights readerly interest in the author’s handling of the mimetic, thematic, [...] Read more.
I argue that the universally accepted assumption that in realist fiction a character narrator’s narration contributes to their characterization needs to be complicated. Working with a conception of narrative as rhetoric that highlights readerly interest in the author’s handling of the mimetic, thematic, and synthetic components of narrative, I suggest that the question about narration as characterization is one about the relation between the mimetic (character as possible person) and synthetic (character as invented construct) components. In addition, understanding the mimetic-synthetic relation requires attention to issues at the macro and micro levels of such narratives. At the macro level, I note the importance of (1) the tacit knowledge, shared by both authors and audiences, of the fictionality of character narration, which means authors write and readers read with an interest in its payoffs; and of (2) the recognition that character narration functions simultaneously along two tracks of communication: that between the character narrator and their narratee, and that between the author and their audience. These macro level matters then provide a frame within which authors and readers understand what happens at the micro level. At that level, I identify seven features of a character’s telling that have the potential to be used for characterization—voice, occasion, un/reliability, authority, self-consciousness, narrative control, and aesthetics. I also note that these features have their counterparts in the author’s telling. Finally, I propose that characterization via narration results from the interaction between the salient features of the character’s telling and their counterparts in the author’s telling. I develop these points through the analysis of four diverse case studies: Mark Twain’s Huckleberry Finn, Robert Browning’s “My Last Duchess,” Nadine Gordimer’s “Homage,” and Ernest Hemingway’s A Farewell to Arms. Full article
43 pages, 190510 KiB  
Article
From Viewing to Structure: A Computational Framework for Modeling and Visualizing Visual Exploration
by Kuan-Chen Chen, Chang-Franw Lee, Teng-Wen Chang, Cheng-Gang Wang and Jia-Rong Li
Appl. Sci. 2025, 15(14), 7900; https://doi.org/10.3390/app15147900 - 15 Jul 2025
Viewed by 262
Abstract
This study proposes a computational framework that transforms eye-tracking analysis from statistical description to cognitive structure modeling, aiming to reveal the organizational features embedded in the viewing process. Using the designers’ observation of a traditional Chinese landscape painting as an example, the study [...] Read more.
This study proposes a computational framework that transforms eye-tracking analysis from statistical description to cognitive structure modeling, aiming to reveal the organizational features embedded in the viewing process. Using the designers’ observation of a traditional Chinese landscape painting as an example, the study draws on the goal-oriented nature of design thinking to suggest that such visual exploration may exhibit latent structural tendencies, reflected in patterns of fixation and transition. Rather than focusing on traditional fixation hotspots, our four-dimensional framework (Region, Relation, Weight, Time) treats viewing behavior as structured cognitive networks. To operationalize this framework, we developed a data-driven computational approach that integrates fixation coordinate transformation, K-means clustering, extremum point detection, and linear interpolation. These techniques identify regions of concentrated visual attention and define their spatial boundaries, allowing for the modeling of inter-regional relationships and cognitive organization among visual areas. An adaptive buffer zone method is further employed to quantify the strength of connections between regions and to delineate potential visual nodes and transition pathways. Three design-trained participants were invited to observe the same painting while performing a think-aloud task, with one participant selected for the detailed demonstration of the analytical process. The framework’s applicability across different viewers was validated through consistent structural patterns observed across all three participants, while simultaneously revealing individual differences in their visual exploration strategies. These findings demonstrate that the proposed framework provides a replicable and generalizable method for systematically analyzing viewing behavior across individuals, enabling rapid identification of both common patterns and individual differences in visual exploration. This approach opens new possibilities for discovering structural organization within visual exploration data and analyzing goal-directed viewing behaviors. Although this study focuses on method demonstration, it proposes a preliminary hypothesis that designers’ gaze structures are significantly more clustered and hierarchically organized than those of novices, providing a foundation for future confirmatory testing. Full article
(This article belongs to the Special Issue New Insights into Computer Vision and Graphics)
Show Figures

Figure 1

31 pages, 17130 KiB  
Article
A Space-Time Plume Algorithm to Represent and Compute Dynamic Places
by Brent Dell and May Yuan
Computers 2025, 14(7), 278; https://doi.org/10.3390/computers14070278 - 15 Jul 2025
Viewed by 309
Abstract
Contrary to what is represented in geospatial databases, places are dynamic and shaped by events. Point clustering analysis commonly assumes events occur in an empty space and therefore ignores geospatial features where events take place. This research introduces relational density, a novel concept [...] Read more.
Contrary to what is represented in geospatial databases, places are dynamic and shaped by events. Point clustering analysis commonly assumes events occur in an empty space and therefore ignores geospatial features where events take place. This research introduces relational density, a novel concept redefining density as relative to the spatial structure of geospatial features rather than an absolute measure. Building on this, we developed Space-Time Plume, a new algorithm for detecting and tracking evolving event clusters as smoke plumes in space and time, representing dynamic places. Unlike conventional density-based methods, Space-Time Plume dynamically adapts spatial reachability based on the underlying spatial structure and other zone-based parameters across multiple temporal intervals to capture hierarchical plume dynamics. The algorithm tracks plume progression, identifies spatiotemporal relationships, and reveals the emergence, evolution, and disappearance of event-driven places. A case study of crime events in Dallas, Texas, USA, demonstrates the algorithm’s performance and its capacity to represent and compute criminogenic places. We further enhance metaball rendering with Perlin noise to visualize plume structures and their spatiotemporal evolution. A comparative analysis with ST-DBSCAN shows Space-Time Plume’s competitive computational efficiency and ability to represent dynamic places with richer geographic insights. Full article
Show Figures

Figure 1

15 pages, 3444 KiB  
Article
A LiDAR-Driven Approach for Crop Row Detection and Navigation Line Extraction in Soybean–Maize Intercropping Systems
by Mingxiong Ou, Rui Ye, Yunfei Wang, Yaoyao Gu, Ming Wang, Xiang Dong and Weidong Jia
Appl. Sci. 2025, 15(13), 7439; https://doi.org/10.3390/app15137439 - 2 Jul 2025
Viewed by 221
Abstract
Crop row identification and navigation line extraction are essential components for enabling autonomous operations of agricultural machinery. Aiming at the soybean–maize strip intercropping system, this study proposes a LiDAR-based algorithm for crop row detection and navigation line extraction. The proposed method consists of [...] Read more.
Crop row identification and navigation line extraction are essential components for enabling autonomous operations of agricultural machinery. Aiming at the soybean–maize strip intercropping system, this study proposes a LiDAR-based algorithm for crop row detection and navigation line extraction. The proposed method consists of four primary stages: point cloud preprocessing, crop row region identification, feature point clustering, and navigation line extraction. Specifically, a combination of K-means and Euclidean clustering algorithms is employed to extract feature points representing crop rows. The central lines of the crop rows are then fitted using the least squares method, and a stable navigation path is constructed based on angle bisector principles. Field experiments were conducted under three representative scenarios: broken rows with missing plants, low occlusion, and high occlusion. The results demonstrate that the proposed method exhibits strong adaptability and robustness across various environments, achieving over 80% accuracy in navigation line extraction, with up to 90% in low-occlusion settings. The average navigation angle was controlled within 0.28°, with the minimum reaching 0.17°, and the average processing time remained below 75.62 ms. Moreover, lateral deviation tests confirmed the method’s high precision and consistency in path tracking, validating its feasibility and practicality for application in strip intercropping systems. Full article
(This article belongs to the Section Agricultural Science and Technology)
Show Figures

Figure 1

22 pages, 11841 KiB  
Article
LVID-SLAM: A Lightweight Visual-Inertial SLAM for Dynamic Scenes Based on Semantic Information
by Shuwen Wang, Qiming Hu, Xu Zhang, Wei Li, Ying Wang and Enhui Zheng
Sensors 2025, 25(13), 4117; https://doi.org/10.3390/s25134117 - 1 Jul 2025
Viewed by 492
Abstract
Simultaneous Localization and Mapping (SLAM) remains challenging in dynamic environments. Recent approaches combining deep learning with algorithms for dynamic scenes comprise two types: faster, less accurate object detection-based methods and highly accurate, computationally costly instance segmentation-based methods. In addition, maps lacking semantic information [...] Read more.
Simultaneous Localization and Mapping (SLAM) remains challenging in dynamic environments. Recent approaches combining deep learning with algorithms for dynamic scenes comprise two types: faster, less accurate object detection-based methods and highly accurate, computationally costly instance segmentation-based methods. In addition, maps lacking semantic information hinder robots from understanding their environment and performing complex tasks. This paper presents a lightweight visual-inertial SLAM system. The system is based on the classic ORB-SLAM3 framework, which starts a new thread for object detection and tightly couples the semantic information of object detection with geometric information to remove feature points from dynamic objects. In addition, Inertial Measurement Unit (IMU) data are employed to assist in feature point extraction, thereby compensating for visual pose tracking loss. Finally, a dense octree-based semantic map is constructed by fusing semantic information and visualized using ROS. LVID-SLAM demonstrates excellent pose accuracy and robustness in highly dynamic scenes on the public TUM dataset, with an average ATE reduction of more than 80% compared to ORB-SLAM3. The experimental results demonstrate that LVID-SLAM outperforms other methods in dynamic conditions, offering both real-time capability and robustness. Full article
(This article belongs to the Section Navigation and Positioning)
Show Figures

Figure 1

17 pages, 7199 KiB  
Article
YED-Net: Yoga Exercise Dynamics Monitoring with YOLOv11-ECA-Enhanced Detection and DeepSORT Tracking
by Youyu Zhou, Shu Dong, Hao Sheng and Wei Ke
Appl. Sci. 2025, 15(13), 7354; https://doi.org/10.3390/app15137354 - 30 Jun 2025
Viewed by 364
Abstract
Against the backdrop of the deep integration of national fitness and sports science, this study addresses the lack of standardized movement assessment in yoga training by proposing an intelligent analysis system that integrates an improved YOLOv11-ECA detector with the DeepSORT tracking algorithm. A [...] Read more.
Against the backdrop of the deep integration of national fitness and sports science, this study addresses the lack of standardized movement assessment in yoga training by proposing an intelligent analysis system that integrates an improved YOLOv11-ECA detector with the DeepSORT tracking algorithm. A dynamic adaptive anchor mechanism and an Efficient Channel Attention (ECA) module are introduced, while the depthwise separable convolution in the C3k2 module is optimized with a kernel size of 2. Furthermore, a Parallel Spatial Attention (PSA) mechanism is incorporated to enhance multi-target feature discrimination. These enhancements enable the model to achieve a high detection accuracy of 98.6% mAP@0.5 while maintaining low computational complexity (2.35 M parameters, 3.11 GFLOPs). Evaluated on the SND Sun Salutation Yoga Dataset released in 2024, the improved model achieves a real-time processing speed of 85.79 frames per second (FPS) on an RTX 3060 platform, with an 18% reduction in computational cost compared to the baseline. Notably, it achieves a 0.9% improvement in AP@0.5 for small targets (<20 px). By integrating the Mars-smallCNN feature extraction network with a Kalman filtering-based trajectory prediction module, the system attains 58.3% Multiple Object Tracking Accuracy (MOTA) and 62.1% Identity F1 Score (IDF1) in dense multi-object scenarios, representing an improvement of approximately 9.8 percentage points over the conventional YOLO+DeepSORT method. Ablation studies confirm that the ECA module, implemented via lightweight 1D convolution, enhances channel attention modeling efficiency by 23% compared to the original SE module and reduces the false detection rate by 1.2 times under complex backgrounds. This study presents a complete “detection–tracking–assessment” pipeline for intelligent sports training. Future work aims to integrate 3D pose estimation to develop a closed-loop biomechanical analysis system, thereby advancing sports science toward intelligent decision-making paradigms. Full article
(This article belongs to the Special Issue Advances in Image Recognition and Processing Technologies)
Show Figures

Figure 1

17 pages, 3041 KiB  
Article
Error Prediction and Simulation of Strapdown Inertial Navigation System Based on Deep Neural Network
by Jinlai Liu, Tianran Zhang, Lubin Chang and Pinglan Li
Electronics 2025, 14(13), 2622; https://doi.org/10.3390/electronics14132622 - 28 Jun 2025
Viewed by 303
Abstract
In order to address the problem of error accumulation in long-duration autonomous navigation using Strapdown Inertial Navigation Systems (SINS), this paper proposes an error prediction and correction method based on Deep Neural Networks (DNN). A 12-dimensional feature vector is constructed using angular increments, [...] Read more.
In order to address the problem of error accumulation in long-duration autonomous navigation using Strapdown Inertial Navigation Systems (SINS), this paper proposes an error prediction and correction method based on Deep Neural Networks (DNN). A 12-dimensional feature vector is constructed using angular increments, velocity increments, and real-time attitude and velocity states from the inertial navigation system, while a 9-dimensional response vector is composed of attitude, velocity, and position errors. The proposed DNN adopts a feedforward architecture with two hidden layers containing 10 and 5 neurons, respectively, using ReLU activation functions and trained with the Levenberg–Marquardt algorithm. The model is trained and validated on a comprehensive dataset comprising 5 × 103 seconds of real vehicle motion data collected at 100 Hz sampling frequency, totaling 5 × 105 sample points with a 7:3 train-test split. Experimental results demonstrate that the DNN effectively captures the nonlinear propagation characteristics of inertial errors and significantly outperforms traditional SINS and LSTM-based methods across all dimensions. Compared to pure SINS calculations, the proposed method achieves substantial error reductions: yaw angle errors decrease from 2.42 × 10−2 to 1.10 × 10−4 radians, eastward velocity errors reduce from 455 to 4.71 m/s, northward velocity errors decrease from 26.8 to 4.16 m/s, latitude errors reduce from 3.83 × 10−3 to 7.45 × 10−4 radians, and longitude errors reduce dramatically from 3.82 × 10−2 to 1.5 × 10−4 radians. The method also demonstrates superior performance over LSTM-based approaches, with yaw errors being an order of magnitude smaller and having significantly better trajectory tracking accuracy. The proposed method exhibits strong robustness even in the absence of external signals, showing high potential for engineering applications in complex or GPS-denied environments. Full article
(This article belongs to the Special Issue Wireless Sensor Network: Latest Advances and Prospects)
Show Figures

Figure 1

22 pages, 2027 KiB  
Article
Blockchain-Based Identity Management System Prototype for Enhanced Privacy and Security
by Haifa Mohammed Alanzi and Mohammad Alkhatib
Electronics 2025, 14(13), 2605; https://doi.org/10.3390/electronics14132605 - 27 Jun 2025
Viewed by 409
Abstract
An Identity Management System (IDMS) is responsible for managing and organizing identities and credentials exchanged between users, Identity Providers (IDPs), and Service Providers (SPs). The primary goal of IDMS is to ensure the confidentiality and privacy of users’ personal data. Traditional IDMS relies [...] Read more.
An Identity Management System (IDMS) is responsible for managing and organizing identities and credentials exchanged between users, Identity Providers (IDPs), and Service Providers (SPs). The primary goal of IDMS is to ensure the confidentiality and privacy of users’ personal data. Traditional IDMS relies on a third party to store user information and authenticate the user. However, this approach poses threats to user privacy and increases the risk of single point of failure (SPOF), user tracking, and data unavailability. In contrast, decentralized IDMSs that use blockchain technology offer potential solutions to these issues as they offer powerful features including immutability, transparency, anonymity, and decentralization. Despite its advantages, blockchain technology also suffers from limitations related to performance, third-party control, weak authentication, and data leakages. Furthermore, some blockchain-based IDMSs still exhibit centralization issues, which can compromise user privacy and create SPOF risks. This study proposes a decentralized IDMS that leverages blockchain and smart contract technologies to address the shortcomings of traditional IDMSs. The proposed system also utilizes the Interplanetary file system (IPFS) to enhance the scalability and performance by reducing the on-chain storage load. Additionally, the proposed IDMS employs the Elliptic Curve Integrated Encryption Scheme (ECIES) to provide an extra layer of security to protect users’ sensitive information while improving the performance of the systems’ transactions. Security analysis and experimental results demonstrated that the proposed IDMS offers significant security and performance advantages compared to its counterparts. Full article
Show Figures

Figure 1

Back to TopTop