MDPI - Publisher of Open Access Journals

12 pages, 825 KB

Open AccessArticle

Sequential Add-On Therapy Modifies Mortality Risk Stratification in Group 1.4 Pulmonary Arterial Hypertension: A Real-World, Single-Center Retrospective Cohort Study from Mexico

by Arturo Cortes-Telles, Yuliana Valeria Priego-Escamilla, Diana Lizbeth Ortíz-Farias, Saúl Vázquez-López, Yuri Noemí Pou-Aguilar and Esperanza Figueroa-Hurtado

J. Clin. Med. 2026, 15(13), 4924; https://doi.org/10.3390/jcm15134924 (registering DOI) - 24 Jun 2026

Abstract

Background: Dynamic risk stratification is fundamental to the modern management of pulmonary arterial hypertension (PAH). However, data on the impact of sequential add-on therapy in patients with Group 1.4 PAH—particularly in Latin American populations—remains limited. This study evaluated changes in risk classification using [...] Read more.

Background: Dynamic risk stratification is fundamental to the modern management of pulmonary arterial hypertension (PAH). However, data on the impact of sequential add-on therapy in patients with Group 1.4 PAH—particularly in Latin American populations—remains limited. This study evaluated changes in risk classification using COMPERA 2.0 and REVEAL Lite 2 scores in patients treated with endothelin receptor antagonist (ERA) and phosphodiesterase type 5 inhibitor (PDE5i) combination therapy (macitentan + sildenafil) at a referral center in Mexico. Methods: In this single-center, retrospective cohort study, 25 patients with a confirmed diagnosis of PAH between 1st January 2022 and 31st December 2024 were evaluated at baseline and after 24 weeks of treatment. Clinical, functional, and biochemical parameters were recorded. Within-patient changes were analyzed using the Wilcoxon signed-rank test, and agreement between risk assessment tools was assessed using Spearman’s correlation coefficient. Results: At 24 weeks, patients demonstrated significant improvement in World Health Organization functional class (p = 0.002) and a significant reduction in brain natriuretic peptide levels (p = 0.003). Both COMPERA 2.0 and REVEAL Lite 2 scores showed a consistent shift toward lower-risk categories. A strong concordance between the two tools was observed. Conclusions: Sequential add-on ERA + PDE5i therapy was associated with meaningful improvement in risk stratification among patients with Group 1.4 PAH. These findings support the clinical utility of simplified, noninvasive risk assessment tools in real-world settings, particularly in resource-constrained environments. Full article

(This article belongs to the Special Issue Clinical Research on Pulmonary Hypertension and Its Complications)

23 pages, 1532 KB

Open AccessArticle

A Contactless Edge-AI Prototype for Simulated Apnea-like Respiratory Suppression and Motion Artifact Detection Using 60 GHz FMCW Radar

by Sathit Pairoch, Pattarapong Phasukkit and Nongluck Houngkamhang

Technologies 2026, 14(7), 388; https://doi.org/10.3390/technologies14070388 (registering DOI) - 24 Jun 2026

Abstract

Sleep-related respiratory disturbances are difficult to monitor continuously outside specialized laboratories because conventional polysomnography is resource-intensive and intrusive. This study presents a contactless edge-AI engineering prototype for detecting controlled voluntary respiratory-motion suppression and motion artifacts using a 60 GHz frequency-modulated continuous-wave radar. The [...] Read more.

Sleep-related respiratory disturbances are difficult to monitor continuously outside specialized laboratories because conventional polysomnography is resource-intensive and intrusive. This study presents a contactless edge-AI engineering prototype for detecting controlled voluntary respiratory-motion suppression and motion artifacts using a 60 GHz frequency-modulated continuous-wave radar. The system integrates a 60 GHz radar front end, lightweight local preprocessing, an INT8 one-dimensional convolutional neural network deployed on the Analog Devices MAX78000 CNN accelerator (Analog Devices Thailand, Chon Buri, Thailand), and an event-driven Raspberry Pi Zero 2W gateway for alert transmission. Evaluation was performed using a controlled healthy-volunteer dataset consisting of normal breathing, voluntary breath-holding-induced respiratory suppression, and deliberate motion artifact. The final valid test set contained 270 technically valid 30 s windows balanced across the three classes. The INT8 model achieved an overall accuracy of 92.6% (95% confidence interval: 88.8–95.2%), with a macro-averaged precision, recall, and F1-score of 92.6%, 92.6%, and 92.5%, respectively. Active CNN inference on the MAX78000 consumed 0.152 ± 0.011 mJ and was completed in 5.20 ± 0.11 ms, corresponding to approximately 280-fold lower active inference energy than Python 3.14.6/TensorFlow Lite 2.21.0-based execution on the Raspberry Pi Zero 2W. These results demonstrate the feasibility of privacy-aware, low-power respiratory-pattern classification at the edge. However, the study should be interpreted strictly as an engineering proof-of-concept based on controlled voluntary breathing and movement tasks in healthy volunteers. It is not a clinically validated apnea or obstructive sleep apnea detection system and did not include polysomnography, oxygen saturation measurement, airflow sensing, sleep staging, or diagnosed patient cohorts. Full article

(This article belongs to the Special Issue Wearable Vital Signs and Activities Detection: Systems and Applications)

21 pages, 6451 KB

Open AccessArticle

Mepilex Dressings in Managing Radiation-Induced Moist Desquamation in Head and Neck Cancer

by Shely Kagan, Yulya Kagan, Tharshini Yoganathan, Madette Galapin, Christina Yang, Britney Zhang, Shivani Verma, Henry C. Y. Wong, Amir H. Safavi, Michael C. Tjong, Shirley S. W. Tse, Shing Fung Lee, Sarah Bayrakdarian, Edward Chow and Irene Karam

Radiation 2026, 6(2), 21; https://doi.org/10.3390/radiation6020021 (registering DOI) - 22 Jun 2026

Viewed by 132

Abstract

Background: Radiation dermatitis (ARD), particularly its most severe form, moist desquamation (MD), is a frequent and distressing complication of external beam radiotherapy (RT) in head and neck (H&N) cancer patients. Standard management often provides limited benefit for healing and symptom control. Silicone-based foam [...] Read more.

Background: Radiation dermatitis (ARD), particularly its most severe form, moist desquamation (MD), is a frequent and distressing complication of external beam radiotherapy (RT) in head and neck (H&N) cancer patients. Standard management often provides limited benefit for healing and symptom control. Silicone-based foam dressings, including Mepilex Lite and Mepilex Ag, may offer atraumatic adherence, moisture balance, and pain reduction. This study evaluated their real-world effectiveness for MD after conventional RT. Methods: Ten H&N cancer patients with clinically confirmed MD post-radiotherapy were prospectively followed until healing. Patients received Mepilex Lite or Mepilex Ag based on exudate level and infection risk, with dressings changed every three days. Patient- and healthcare provider-reported measures were collected throughout follow-up. The primary endpoint was time to MD resolution, defined as healing to grade 1 skin status. Secondary endpoints included changes in symptom burden, dressing tolerability and satisfaction, and adverse events. Results: Median age was 69 years (range 44–78). All wounds healed to grade 1, with a mean time of 8.6 days (SD 3.9). No infections or adverse events occurred. Pain, burning, and interference with daily activity decreased, and most patients reported improved comfort. Conclusions: In this small prospective cohort study, use of Mepilex dressings was associated with rapid healing, good tolerability, and improvement in patient-reported symptoms of acute radiation dermatitis. These findings suggest that Mepilex dressings may be a promising management option and warrant evaluation in larger comparative studies. Full article

► Show Figures

Figure 1

18 pages, 4314 KB

Open AccessArticle

Optimizing a Multimodal Large Language Model for Ultrasound-Based Thyroid Nodule Malignancy Classification: A Comparative Study of Few-Shot Learning, Prompt Engineering, and Fine-Tuning

by Yu-Hsuan Li, Yu-Cheng Cheng, Chih-Yun Chang and I-Te Lee

Diagnostics 2026, 16(12), 1931; https://doi.org/10.3390/diagnostics16121931 (registering DOI) - 22 Jun 2026

Viewed by 122

Abstract

Objectives: Multimodal large language models (MLLMs) have shown potential for medical image classification. We evaluated four optimization strategies in two MLLMs—GPT-4o (gpt-4o-2024-08-06) and Gemini 2.5 Flash-Lite—for ultrasound-based thyroid nodule malignancy classification using two public datasets and a clinical cohort of nodules with atypia [...] Read more.

Objectives: Multimodal large language models (MLLMs) have shown potential for medical image classification. We evaluated four optimization strategies in two MLLMs—GPT-4o (gpt-4o-2024-08-06) and Gemini 2.5 Flash-Lite—for ultrasound-based thyroid nodule malignancy classification using two public datasets and a clinical cohort of nodules with atypia of undetermined significance (AUS) cytology. Methods: Text prompting, few-shot learning, fine-tuning, and a hybrid strategy combining fine-tuning with few-shot learning were evaluated for each model. Performance was assessed using the Digital Database of Thyroid Images (DDTI; n = 80), a 1000-image test subset of TN5000, and an institutional AUS cohort with surgical pathology (n = 84). In the AUS cohort, the best-performing strategy was compared with the consensus classification of three endocrinologists and the American Thyroid Association (ATA) ultrasound risk stratification. Results: For GPT-4o, the hybrid strategy achieved the highest area under the receiver operating characteristic curve (AUC) in DDTI (0.866), TN5000 (0.689), and the AUS cohort (0.836). In the AUS cohort, its specificity was higher than that of endocrinologist consensus and ATA risk stratification when only high-suspicion nodules were classified as malignant (95.1% vs. 70.7% and 70.7%; p = 0.002 and p = 0.001, respectively), while sensitivity did not differ significantly (72.1% vs. 74.4% and 79.1%, respectively; both p > 0.05). However, the hybrid model misclassified 12 of 43 malignant nodules, corresponding to a false-negative rate of 27.9%. When high- and intermediate-suspicion ATA categories were classified as malignant, ATA sensitivity increased to 83.7% and specificity decreased to 56.1%; the hybrid model had a higher AUC than ATA risk stratification (0.836 vs. 0.749; p = 0.017). For Gemini 2.5 Flash-Lite, few-shot learning, fine-tuning, and the hybrid strategy did not improve AUC relative to text prompting in any dataset. Conclusions: The hybrid strategy produced the most consistent performance gains for GPT-4o across the three datasets but did not improve Gemini 2.5 Flash-Lite. The optimized GPT-4o model achieved high specificity in the diagnostically challenging AUS cohort, although its false-negative rate limits its use as a stand-alone diagnostic tool. Further validation in larger, prospective multicenter cohorts is required before clinical use. Full article

(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)

► Show Figures

Graphical abstract

24 pages, 9488 KB

Open AccessArticle

GCMembrane-LLM: An Evidence-Grounded Domain-Specific Large Language Model for Structure–Performance Reasoning in Graphene and Carbon Nanotube Separation Membranes

by Youyang Liu, Shuhan Liu, Yao He, Ziyi Yan, Yilu Zhao, Xinyu Zhang, Zhen Li and Ning Wei

Membranes 2026, 16(6), 214; https://doi.org/10.3390/membranes16060214 (registering DOI) - 21 Jun 2026

Viewed by 160

Abstract

Graphene and carbon nanotube (CNT) membranes are promising for filtration, desalination, and water treatment, yet their performance requires the joint interpretation of their architecture, nanoconfined transport, selectivity, fouling, swelling, defects, stability, and operating conditions. Here, GCMembrane-LLM was developed as an evidence-grounded domain-specific large [...] Read more.

Graphene and carbon nanotube (CNT) membranes are promising for filtration, desalination, and water treatment, yet their performance requires the joint interpretation of their architecture, nanoconfined transport, selectivity, fouling, swelling, defects, stability, and operating conditions. Here, GCMembrane-LLM was developed as an evidence-grounded domain-specific large language model. A curated 582-paper corpus generated 12,208 cleaned membrane-specific question–answer pairs for Low-Rank Adaptation (LoRA)-based supervised fine-tuning of Llama-3.1-8B-Instruct, and retrieval-augmented generation provided article-title and page-level traceability. GCMembraneBench included 100 application-oriented questions on graphene oxide (GO) membranes, CNT membranes, GO/CNT hybrids, and cross-material reasoning. Under direct answering without retrieval context, the anonymized and shuffled automatic evaluation showed that GCMembrane-LLM achieved a mean weighted score of 4.237/5.0, exceeding Llama-3.1-8B-Instruct and Doubao-1.5-lite. A stratified 30-question blinded manual assessment showed the same ranking. The application cases further yielded membrane science conclusions: CNT-assisted GO/CNT transport should be evaluated with dispersion, interfacial compatibility, defects, and stability; GO desalination depends on swelling control, interlayer spacing, and defect suppression; and CNT high flux requires joint examination of pore diameter, entrance chemistry, hydration barriers, ion rejection, and operating conditions. GCMembrane-LLM supports source-traceable evidence organization and preliminary hypothesis formulation before experimental validation. Full article

(This article belongs to the Special Issue Nano-Confined Transport in Advanced Membranes for Enhanced Filtration and Desalination: From Mechanisms to Scalable Applications)

► Show Figures

Figure 1

21 pages, 2782 KB

Open AccessArticle

LDST-ChangeNet: Lightweight Remote Sensing Change Detection Model Based on Dual Spatio-Temporal Attention and Multi-Scale Decoding

by Shuang Li, Shoubin Wang, Pengcheng Gao, Guili Peng and Zhen Huang

Remote Sens. 2026, 18(12), 2020; https://doi.org/10.3390/rs18122020 - 17 Jun 2026

Viewed by 187

Abstract

Remote sensing image change detection is widely used in urban expansion analysis, land-use monitoring, and disaster assessment. Nevertheless, it still faces significant challenges due to pseudo-change interference in high-resolution imagery, the large-scale variation in small changed objects, and the need for lightweight models [...] Read more.

Remote sensing image change detection is widely used in urban expansion analysis, land-use monitoring, and disaster assessment. Nevertheless, it still faces significant challenges due to pseudo-change interference in high-resolution imagery, the large-scale variation in small changed objects, and the need for lightweight models in real-world engineering applications. To address these issues, this paper proposes LDST-ChangeNet, a lightweight dual spatiotemporal attention network for change detection. The network adopts a Siamese EfficientNet-B1 as its dual-branch encoder and employs a differential bi-temporal feature fusion strategy (Diff) to explicitly model temporal discrepancies, enabling efficient feature extraction while significantly reducing model complexity. A Position Attention Module (PAM) is introduced at the encoder bottleneck to suppress pseudo changes caused by non-structural factors. Meanwhile, a lightweight Pyramid Pooling Module (PPM-lite) is incorporated at the entrance of the deepest decoder features to enhance multi-scale contextual representation. Furthermore, a Boundary Attention Module (BAM) is applied in the decoder output stage to improve boundary delineation and small-object change detection. Experimental results on the LEVIR-CD and WHU-CD datasets show that LDST-ChangeNet outperforms other state-of-the-art methods, achieving F1-scores of 90.67% and 91.08%, respectively. The model maintains a lightweight design, requiring only 11.72 M parameters and 10.03 GFLOPs on LEVIR-CD, and 11.77 M parameters and 9.12 GFLOPs on WHU-CD. Full article

► Show Figures

Figure 1

23 pages, 3369 KB

Open AccessArticle

Improved MobileNetV2 Architecture with Modified Lite Attention Model for Detection of Plant Leaf Disease

by Shiny Rajendrakumar and Rajashekarappa

AgriEngineering 2026, 8(6), 248; https://doi.org/10.3390/agriengineering8060248 - 17 Jun 2026

Viewed by 227

Abstract

Global agriculture is seriously threatened by plant diseases, which result in large losses in both productivity and quality. Timely and accurate disease detection is essential for effective crop management and food security. This work presents an improved MobileNetV2 architecture with Modified Lite Attention [...] Read more.

Global agriculture is seriously threatened by plant diseases, which result in large losses in both productivity and quality. Timely and accurate disease detection is essential for effective crop management and food security. This work presents an improved MobileNetV2 architecture with Modified Lite Attention (MLA) Model for detecting plant leaf disease. Our methodology incorporates pre-processing, feature extraction through attention model, convolution layers, and classifying into diseased or healthy categories. Further, multiclassification of diseases is performed on a dataset comprising 4432 samples including whitefly, leaf spot, leaf curl, yellowish and healthy leaves. The proposed attention model is compared with existing attention models like CBAM (Convolutional Block Attention Model), SE (Squeeze and Excitation), ECA (Efficient Channel Attention) and SDMnet (Spatially Dilated Multi-Scale Network) to validate our hybrid MLA feature extraction technique. Customizing the categorization with fully connected layers and utilisation of a pre-trained MobileNetV2 model allow the system to achieve excellent results. Findings show encouraging accuracy, surpassing 97% compared to existing techniques for multiclass dataset classification. The integration of MobileNetV2 with custom dense layers enables robust detection even with limited datasets, making it ideal for use in mobile or low-resource agricultural environments. Further, the proposed method is tested on the PlantVillage dataset consisting of 10,836 samples using K-Fold cross-validation for K = 5 and K = 4 to obtain an average accuracy of 98.4% and 98.69%, respectively. Full article

(This article belongs to the Special Issue The Application of Machine Learning and Deep Learning Techniques in Agriculture)

► Show Figures

Figure 1

19 pages, 2057 KB

Open AccessArticle

Research on Human Sitting Posture Recognition Based on an Improved LeNet-5 Optimization Algorithm

by Wei Li, Bowen Yang, Dawen Sun, Shijun Sun, Zhenyang Qin and Qianjin Liu

Processes 2026, 14(12), 1964; https://doi.org/10.3390/pr14121964 - 17 Jun 2026

Viewed by 178

Abstract

Human sitting posture recognition is critical for smart seating, ergonomic monitoring, and healthcare systems. However, existing deep learning approaches typically rely on highly complex network architectures that are computationally expensive, hindering their lightweight deployment on edge devices. Furthermore, current methods frequently struggle with [...] Read more.

Human sitting posture recognition is critical for smart seating, ergonomic monitoring, and healthcare systems. However, existing deep learning approaches typically rely on highly complex network architectures that are computationally expensive, hindering their lightweight deployment on edge devices. Furthermore, current methods frequently struggle with indistinct boundaries among multi-class postures and are highly prone to overfitting when constrained by small-sample pressure sensor datasets. To bridge this gap, this paper proposes a novel, lightweight posture recognition framework specifically tailored for pressure distribution maps. First, sitting pressure data is collected using a thin-film pressure array sensor and uniformly mapped into an [M × N] image representation, establishing an effective sample format for Convolutional Neural Network (CNN) inputs. Second, as our primary architectural contribution, we fundamentally optimize the classic LeNet-5 network to enhance complex feature representation without inflating model complexity. Specifically, the depth of the convolutional layers is increased with a progressively increasing channel configuration. Batch Normalization (BN) is introduced to accelerate convergence and ensure training stability, while a Dropout mechanism is embedded within the fully connected layers to strictly penalize overfitting under small-sample constraints. These architectural improvements are synergistically combined with targeted data augmentation strategies—including random translation, rotation, and intensity perturbation—to further strengthen the model’s generalization capability. Experimental results demonstrate that the proposed method achieves a classification accuracy of 95.5% in a five-class sitting posture recognition task, significantly outperforming baseline models such as the traditional LeNet-5, AlexNet-Lite, and VGG-Small. The findings indicate that this approach achieves an optimal balance among recognition accuracy, training stability, and low model complexity, providing a robust algorithmic baseline and proof-of-concept for smart healthcare perception systems, paving the way for future large-scale subject-independent validation. Full article

(This article belongs to the Special Issue Advances in Computer Vision and Image Processing for Industrial Processes)

► Show Figures

Figure 1

30 pages, 43797 KB

Open AccessArticle

Modular Framework for Responsive and Explainable Robotic Assistance with Intention Prediction Using Human-Centric Digital Twins

by Usman Asad, Azfar Khalid, Waqas Akbar Lughmani, Shummaila Rasheed and Muhammad Mahabat Khan

Sensors 2026, 26(12), 3810; https://doi.org/10.3390/s26123810 - 15 Jun 2026

Viewed by 294

Abstract

Proactive robotic assistance in human–robot collaboration (HRC) requires systems that can perceive evolving task contexts, anticipate user needs, and intervene appropriately without disrupting human workflow. We present the Agentic Unified Robotic Assistance (AURA) Framework, which couples Large Language Model (LLM) reasoning grounded by [...] Read more.

Proactive robotic assistance in human–robot collaboration (HRC) requires systems that can perceive evolving task contexts, anticipate user needs, and intervene appropriately without disrupting human workflow. We present the Agentic Unified Robotic Assistance (AURA) Framework, which couples Large Language Model (LLM) reasoning grounded by Standard Operating Procedures (SOPs) with a modular layer of specialized Intent, Motion, Perception, Sound, Affordance, and Performance Monitors that supply structured context to a central decision-making module, making the framework reconfigurable and auditable without retraining or re-prompting. We introduce a human-in-the-loop teleoperation data collection methodology and an offline evaluation scheme with an Appropriateness Score (A-Score) tailored to proactive intervention timing, and release a benchmark dataset of annotated multimodal HRC episodes containing workspace and robot wrist camera videos, robot joint states, and labeled intervention events. Across three tasks of varying complexity, we observe progressive gains in intent prediction and decision-making as the modules are supplied with richer grounded context (prior-state memory and tracked object locations), with Combined F1 rising by over 20 points between context-poor and context-rich conditions. The structured grounding allows lightweight multimodal backbones such as Gemini 3.1 Flash Lite to perform on par with heavier reasoning-tier models at roughly one-fifth the inference latency. Together, these contributions establish a scalable framework, benchmark, and evaluation methodology for advancing proactive robotic assistance in collaborative environments. Full article

(This article belongs to the Special Issue Advanced Sensors and AI Integration for Human–Robot Teaming)

► Show Figures

Figure 1

22 pages, 7177 KB

Open AccessArticle

Optimization-Oriented Vision-Guided Robotic Grasping for Bolt Handling in Intelligent Manufacturing

by Pengzhan Fu, Zhenlin Zhang, Long Liu, Yingze Xi, Xingwei Zhao and Xuan Wang

Mathematics 2026, 14(12), 2133; https://doi.org/10.3390/math14122133 - 15 Jun 2026

Viewed by 172

Abstract

Accurate detection and reliable grasping of small bolts are essential for intelligent manufacturing and automated assembly. However, this remains a challenge due to the small size, slender geometry, and metallic reflective surfaces of bolts. In this paper, we propose a vision-guided robotic bolt [...] Read more.

Accurate detection and reliable grasping of small bolts are essential for intelligent manufacturing and automated assembly. However, this remains a challenge due to the small size, slender geometry, and metallic reflective surfaces of bolts. In this paper, we propose a vision-guided robotic bolt handling framework that integrates lightweight object detection, optimization-oriented grasp execution, and collision-aware trajectory planning. The lightweight YOLOv8n-BoltLite detector, improved with E-C2f, LCA, SA-PAN, and WD-IoU loss, enhances localization accuracy and feature representation for small and slender bolts. A robotic grasping framework is designed to transform detection results into executable robotic actions through 3D pose estimation, mid-shank grasp point generation, and optimization-oriented execution formulation. Additionally, a five-segment trajectory planning strategy ensures safe and efficient robot motion. Experimental results show that YOLOv8n-BoltLite achieves a five-run average mAP of 99.64 ± 0.05% with 198 FPS, and 3.02 M parameters. On an additional challenging external test set involving illumination variation, clutter, partial occlusion, reflection, and clustered bolts, the proposed detector achieves 94.62 ± 0.18%, outperforming recent lightweight detectors under the same training protocol. Robotic experiments involving 1000 controlled grasping trials and 300 multi-target grasping attempts demonstrate a controlled-condition success rate of 97.0% and improved target-selection reliability in multi-bolt scenes. These results suggest that the proposed framework offers a practical and efficient solution for automated bolt handling in intelligent manufacturing environments. Full article

(This article belongs to the Special Issue Intelligent Systems and Dynamic Scheduling: Optimization and Management, 2nd Edition)

► Show Figures

Figure 1

25 pages, 17895 KB

Open AccessArticle

YOLO-PowerLite V2: An Enhanced Lightweight Detector for Real-Time Tiny Anomaly Identification on Overhead Transmission Lines in Complex Environments

by Shuangfeng Wei, Yuhang Cai, Shaobo Zhong and Zheng Lv

Remote Sens. 2026, 18(12), 1937; https://doi.org/10.3390/rs18121937 - 11 Jun 2026

Viewed by 235

Abstract

Aiming at the core pain point that in existing object detection models, it is difficult to balance detection accuracy and real-time inference efficiency on edge computing devices in UAV-based intelligent inspection of power transmission lines, this paper proposes a lightweight YOLO-PowerLiteV2 model for [...] Read more.

Aiming at the core pain point that in existing object detection models, it is difficult to balance detection accuracy and real-time inference efficiency on edge computing devices in UAV-based intelligent inspection of power transmission lines, this paper proposes a lightweight YOLO-PowerLiteV2 model for anomaly target detection in power transmission lines to address the shortcomings of YOLO-PowerLite. Based on YOLO11n as the baseline, the model achieves compression of model volume while guaranteeing detection performance through four core improvements: the C3k2-UIB lightweight backbone module, the MCA (Multi-scale Cross-Axis) attention mechanism, the MBConv lightweight detection head, and the MFM (Modulation Feature Fusion) module. Experiments were conducted on a dataset constructed from 5563 aerial images of transmission lines containing three types of targets: bird nests, defective insulators, and balloons. The results show that YOLO-PowerLiteV2 achieves a mAP@50 of 95.2%, with only 0.97 M parameters and 2.8 G floating point operations (FLOPs). Compared with the baseline model, the number of parameters is reduced by 62.5%, and FLOPs are decreased by 56.25%. On the NVIDIA Jetson Xavier NX edge platform, the model achieves 59.5 FPS with only 16.8 ms latency, outperforming the baseline by 31% in frame rate. Its comprehensive performance outperforms mainstream lightweight detection models. The model demonstrates excellent adaptability to UAV edge-terminal deployment requirements, thereby providing technical support for real-time intelligent inspection of power transmission lines. Full article

(This article belongs to the Special Issue Advances in Artificial Intelligence (AI) and Deep Learning (DL) in UAV-Based Remote Sensing)

► Show Figures

Figure 1

26 pages, 8878 KB

Open AccessArticle

RCF-Face: Risk-Calibrated Factorization for Lightweight Face Verification Under Composite Disturbances

by Yanhao Shen, Zhibin Zhao and Yalong Meng

Electronics 2026, 15(12), 2497; https://doi.org/10.3390/electronics15122497 - 6 Jun 2026

Viewed by 151

Abstract

Compact face verifiers at access-control and surveillance terminals must operate under specular reflection, off-frontal pose, and mask occlusion, disturbances that frequently co-occur in practice. Existing objectives optimize identity discrimination and score-threshold reliability separately. As a result, disturbance cues leak into the identity embedding [...] Read more.

Compact face verifiers at access-control and surveillance terminals must operate under specular reflection, off-frontal pose, and mask occlusion, disturbances that frequently co-occur in practice. Existing objectives optimize identity discrimination and score-threshold reliability separately. As a result, disturbance cues leak into the identity embedding and inflate the impostor-score tail near the operating false-acceptance rate (FAR)—a structural weakness that backbone scaling alone does not resolve within a fixed parameter budget. We propose RCF-Face, a 1.12 M-parameter verifier for the sub-2 M-parameter, sub-0.2 GFLOP budget of Ampere-class edge GPUs, trained under a unified objective that combines representation-level leakage suppression (HSIC-based identity–disturbance factorization with augmentation-based counterfactual consistency) and decision-level score shaping via a softplus tail-risk penalty anchored to the target FAR. Across four disturbance-focused protocols (CFP-FP, IJB-C, RMFRD, and a reflection-augmented subset), RCF-Face reaches a Composite-Disturbance Score (CDS, the mean verification accuracy over the four protocols) of 89.05 at 0.166 GFLOPs, improving on the capacity-matched AdaFace-Lite baseline by 4.67 percentage points, with both per-protocol gains over this baseline significant under Bonferroni correction. Zero-shot evaluation on two open-source reflection-related datasets (SoF and MeGlass) shows that the robustness gains transfer to real-world images, although these datasets isolate single disturbance axes and do not constitute a composite-disturbance benchmark. On a Jetson Orin NX 16 GB, batch-1 latency is 2.41 ms (FP16) and 1.66 ms (INT8), supporting Ampere-class edge-GPU deployment. Full article

(This article belongs to the Special Issue Recent Advances in Object Detection and Computer Vision)

► Show Figures

Figure 1

30 pages, 4654 KB

Open AccessArticle

Hybrid Knowledge Distillation for Edge-Efficient Video Action Recognition: Improving Lightweight 3D CNNs via Joint Distillation

by Mohammad Rasras and Iuliana Marin

Computers 2026, 15(6), 371; https://doi.org/10.3390/computers15060371 - 5 Jun 2026

Viewed by 449

Abstract

One of the remaining challenges in deploying 3D CNN models in resource-constrained environments is the high computational demand. In this paper, we design three lightweight architectures that have distinct spatiotemporal topologies, namely, Lite-R21D, Lite-MC3, and Lite-LF, to reduce computational cost. However, these compact [...] Read more.

One of the remaining challenges in deploying 3D CNN models in resource-constrained environments is the high computational demand. In this paper, we design three lightweight architectures that have distinct spatiotemporal topologies, namely, Lite-R21D, Lite-MC3, and Lite-LF, to reduce computational cost. However, these compact models have restricted representational capacity, which consequently limits their ability to capture complex spatiotemporal features. To overcome this, we employ Knowledge Distillation (KD) and further investigate hybrid combinations of response-based, spatiotemporal attention, and intermediate feature alignment paradigms. By analyzing knowledge transfer across these diverse architectures, our experiments on UCF101 and HMDB51 demonstrate that combining these distillation configurations consistently outperforms single KD methods, resulting in a substantial increase in accuracy across all Student models. Our optimal hybrid setup achieves 92.07% accuracy on UCF101 and 65.56% on HMDB51, compared to the Teacher’s 94.74% and 69.48%, reducing the accuracy gap to only 2.67% and 3.92%. These gains are achieved alongside significant efficiency improvements. The proposed models operate with up to 87% fewer parameters and an 89% reduction in Floating-Point Operations (FLOPs), achieving 6.7× faster inference. Our findings highlight that hybrid distillation is an effective approach for transferring and utilizing complex spatiotemporal knowledge in lightweight models. Full article

(This article belongs to the Special Issue Multimodal Pattern Recognition of Social Signals in HCI (Third Edition))

► Show Figures

Figure 1

32 pages, 4025 KB

Open AccessArticle

An Efficient Multimodal Framework for Barley Drought Stress Detection on Resource-Constrained Devices

by Rihab Boukouba, Dalenda Ben Aissa, Amira Guidara, Nadia Smaoui and Chantal Ebel

AgriEngineering 2026, 8(6), 230; https://doi.org/10.3390/agriengineering8060230 - 5 Jun 2026

Viewed by 200

Abstract

Drought stress significantly impacts barley (Hordeum vulgare L.) production, necessitating early and accurate detection systems for precision agriculture. Traditional monitoring approaches rely on manual inspection or single-modality sensing, which often fail to capture the complex physiological responses to water deficit. This study [...] Read more.

Drought stress significantly impacts barley (Hordeum vulgare L.) production, necessitating early and accurate detection systems for precision agriculture. Traditional monitoring approaches rely on manual inspection or single-modality sensing, which often fail to capture the complex physiological responses to water deficit. This study presents a novel multimodal deep learning framework that integrates RGB imaging with environmental sensor data (temperature and humidity) for real-time drought stress classification in barley plants. The proposed architecture employs EfficientNetV2-S for visual feature extraction, coupled with a dedicated sensor encoding branch, unified through a cross-modal attention mechanism and gated multimodal fusion strategy. To address the computational constraints of agricultural IoT systems, we implemented comprehensive CPU optimization techniques and model compression via TensorFlow Lite INT8 quantization, achieving a 68.5% reduction in training time and 90% reduction in model size. The system was validated on a custom greenhouse dataset (379 samples, 80/20 split) and the PlantVillage dataset (26,000 images, binary reformulation). A 10-seed evaluation protocol demonstrated that the full multimodal model achieves 98.3 ± 1.5% accuracy, outperforming both an image-only baseline (97.4 ± 1.8%) and a sensor-only MLP (73.8 ± 3.5%). Across seeds, the model also achieved an F1-score of 98.34 ± 1.48% and ROC-AUC of 99.93 ± 0.13%. Ablation analysis with ANOVA (F(4,36) = 4.44, p = 0.005) confirmed that multimodal fusion improves accuracy by 0.92% over image-only models, with the full gated cross-modal attention mechanism outperforming all simplified baselines, including AgriFusionNet (75.22%), Shallow CNN (92.54%), Logistic Regression multimodal (92.11%), and Random Forest multimodal (89.91%). These results further show that relying on environmental data alone is insufficient, reinforcing the benefit of multimodal fusion. External validation on PlantVillage achieved 99.97% accuracy, demonstrating strong generalization capabilities. The optimized model operates efficiently on CPU-only hardware (training time: 9.1 min/epoch), making it suitable for edge deployment in resource-constrained agricultural environments. This work demonstrates that a low-cost, CPU-compatible multimodal deep learning system can reliably detect drought stress in barley under real greenhouse conditions and provides a practical and scalable solution for early stress monitoring in precision agriculture. Full article

(This article belongs to the Special Issue Precision Agriculture: Sensor-Based Systems and IoT-Enabled Machinery)

► Show Figures

Figure 1

15 pages, 3164 KB

Open AccessArticle

Drift-Robust Lightweight Deep Learning on Open Gas Sensor Benchmarks: A Reproducible Architecture Study with CBRN Applicability Mapping

by Soohwan Kim, Myeongsik Shin, Ku Kang, Doo-Hee Lee, David G. Churchill and Yoon Jeong Jang

Molecules 2026, 31(11), 1884; https://doi.org/10.3390/molecules31111884 - 1 Jun 2026

Viewed by 281

Abstract

Resource-constrained edge processors deployed on unmanned aerial vehicles and wearable platforms require compact, drift-robust gas classification models for a range of environmental and security monitoring applications, including CBRN-motivated scenarios. Existing approaches rely on server-grade architectures incompatible with edge-board-scale deployment, or on classifiers that [...] Read more.

Resource-constrained edge processors deployed on unmanned aerial vehicles and wearable platforms require compact, drift-robust gas classification models for a range of environmental and security monitoring applications, including CBRN-motivated scenarios. Existing approaches rely on server-grade architectures incompatible with edge-board-scale deployment, or on classifiers that chemically degrade severely under long-term sensor drift. Each UCI gas class was mapped to a CBRN behavioral category based on physicochemical analogy (molecular functional group, vapor pressure, and metal-oxide semiconductor (MOS) cross-sensitivity pattern), following established precedent. Analyzed were Ammonia (NH₃), Acetaldehyde (CH₃CHO), Acetone ((CH₃)₂CO), Ethylene (C₂H₄), Ethanol (C₂H₅OH), Toluene (C₆H₅CH₃). We propose herein an end-to-end pipeline integrating a novel 1-D convolutional neural network with depth-wise separable convolutions (LiteSensor-Net), INT8 post-training quantization, structured magnitude pruning, and a knowledge-distillation domain-adaptation module (KD–DM) for sensor drift compensation. Using the UCI Gas Sensor Array Drift Dataset (13,910 measurements; 16 metal-oxide sensors; six analyte gases; a 36-month work span). LiteSensor-Net achieved accuracy = 92.63 ± 2.02%, macro-F1 = 0.898, model size = 5.99 kB INT8 pruned, inference latency = 6.3 ms, RAM footprint = 31.7 kB, and energy per inference = 0.04 mJ (all metrics on Raspberry Pi 4B, ARM Cortex-A72). Under chronological forward-chaining evaluation, KD–DM–20 achieved 47.91 ± 18.79% mean accuracy over Batches 2–10, representing a +9.25 pp improvement over uncompensated NC (38.66%). A six-metric benchmark framework—accuracy, macro-F1, model size, inference latency, RAM footprint, and energy per inference—is introduced to standardize edge-AI gas classifier evaluation. The proposed pipeline provides an open-source, deployable foundation for edge-class gas classification systems, with CBRN detection as a motivating application. Full operational validation on certified chemical simulants remains as future work. Full article

(This article belongs to the Special Issue Advanced Fluorescent Probes for Bioimaging and Environmental Sensing)

► Show Figures

Figure 1

Search Results (535)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (535)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI