MDPI - Publisher of Open Access Journals

29 pages, 8017 KB

Open AccessArticle

Quantum-Inspired Variational Inference for Non-Convex Stochastic Optimization: A Unified Mathematical Framework with Convergence Guarantees and Applications to Machine Learning in Communication Networks

by Abrar S. Alhazmi

Mathematics 2026, 14(7), 1236; https://doi.org/10.3390/math14071236 - 7 Apr 2026

Abstract

Non-convex stochastic optimization presents fundamental mathematical challenges across machine learning, wireless networks, data center resource allocation, and optical wireless communication systems, where complex loss landscapes with multiple local minima and saddle points impede classical variational inference methods. This paper introduces the Quantum-Inspired Variational [...] Read more.

Non-convex stochastic optimization presents fundamental mathematical challenges across machine learning, wireless networks, data center resource allocation, and optical wireless communication systems, where complex loss landscapes with multiple local minima and saddle points impede classical variational inference methods. This paper introduces the Quantum-Inspired Variational Inference (QIVI) framework, which systematically integrates quantum mechanical principles (superposition, entanglement, and measurement operators) into classical variational inference through rigorous mathematical formulations grounded in Hilbert space theory and operator algebras. We develop a unified optimization framework that encodes classical parameters as quantum-inspired states within finite-dimensional complex Hilbert spaces, employing unitary evolution operators and adaptive basis selection governed by gradient covariance eigendecomposition. The core mathematical contribution establishes that QIVI achieves a convergence rate of

O ({log}^{2} T / T^{1 / 2})

for

σ

-strongly non-convex functions, provably improving upon the classical

O (T^{- 1 / 4})

rate, yielding a theoretical speedup factor of

1.85

–

1.96 \times

. Comprehensive experiments across synthetic benchmarks, Bayesian neural networks, and real-world applications in network optimization and financial portfolio management demonstrate 23–

47 %

faster convergence, 15–

35 %

superior objective values, and 28–

46 %

improved uncertainty calibration. The principal contributions include: (i) a rigorous Hilbert space-based mathematical framework for quantum-inspired variational inference grounded in operator algebras, (ii) a novel hybrid quantum–classical algorithm (QIVI) with adaptive basis selection via gradient covariance eigendecomposition, (iii) formal convergence proofs establishing provable improvement over classical methods, (iv) comprehensive empirical validation across diverse problem domains relevant to machine learning and network optimization, and (v) demonstration of the framework’s applicability to optimization problems arising in wireless networks, data center resource allocation, and network system design. Statistical validation using the Friedman test (

χ^{2} = 847.3

,

p < 0.001

) and post hoc Wilcoxon signed-rank tests with Holm–Bonferroni correction confirm that QIVI’s improvements over all baseline methods are statistically significant at the

α = 0.05

level across all benchmark categories. The framework discovers

18.1

out of 20 true modes in multimodal distributions versus

9.1

for classical methods, demonstrating the potential of quantum-inspired optimization approaches for challenging stochastic problems arising in machine learning, wireless communication, and network optimization. Full article

31 pages, 6459 KB

Open AccessArticle

Cooperative Hybrid Domain Network for Salient Object Detection in Optical Remote Sensing Images

by Yi Gu, Jianhang Zhou and Lelei Yan

Remote Sens. 2026, 18(7), 1087; https://doi.org/10.3390/rs18071087 - 4 Apr 2026

Viewed by 149

Abstract

Salient Object Detection (SOD) in Optical Remote Sensing Images (ORSIs) aims to localize and segment visually prominent objects amidst complex backgrounds and extreme scale variations. However, we observe that current frequency-aware methods typically rely on a naive feature aggregation paradigm, merging frequency and [...] Read more.

Salient Object Detection (SOD) in Optical Remote Sensing Images (ORSIs) aims to localize and segment visually prominent objects amidst complex backgrounds and extreme scale variations. However, we observe that current frequency-aware methods typically rely on a naive feature aggregation paradigm, merging frequency and spatial features via simple concatenation, addition, or direct combination. This shallow interaction overlooks the inherent semantic misalignment between the two domains, resulting in feature redundancy and poor boundary delineation. To address this limitation, we propose the Cooperative Hybrid Domain Network (CHDNet), a framework designed to facilitate synergistic cooperation between heterogeneous domains. Specifically, we propose the Cross-Domain Multi-Head Self-Attention (CD-MHSA) mechanism as a semantic bridge following the encoder. It employs a dimension expansion strategy to construct a Unified Interaction Manifold and utilizes a Frequency Anchor Interaction mechanism to achieve precise modulation of spatial textures using global spectral cues. Furthermore, to address the dual challenges of lacking explicit interpretation mechanisms for semantic co-occurrence and the susceptibility of topological structures to fracture in complex scenes during the decoding phase, we design a Multi-Branch Cooperative Decoder (MBCD) comprising three parallel paths: edge semantics, global relations, and reverse correction. This module dynamically integrates these heterogeneous clues through a Cooperative Fusion Strategy, combining explicit global dependency modeling with dual-domain reverse mining. Extensive experiments on multiple benchmark datasets demonstrate that the proposed CHDNet achieves performance superior to state-of-the-art (SOTA) methods. Full article

► Show Figures

Figure 1

21 pages, 1291 KB

Open AccessArticle

Development of a Software Model for Classification and Automatic Cataloging of Archive Documents

by Adilbek Dauletov, Bahodir Muminov, Noila Matyakubova, Uldona Abdurahmonova, Khurshida Bakhriyeva and Makhbubakhon Fayzieva

Information 2026, 17(4), 341; https://doi.org/10.3390/info17040341 - 1 Apr 2026

Viewed by 295

Abstract

This study proposes an integrated software model for automatic document classification and metadata generation based on the Dublin Core standard to address the issue of rapid and consistent management of archival documents in a digital environment. This approach combines the stages of receiving [...] Read more.

This study proposes an integrated software model for automatic document classification and metadata generation based on the Dublin Core standard to address the issue of rapid and consistent management of archival documents in a digital environment. This approach combines the stages of receiving incoming documents, converting them to text using optical character recognition (OCR), image preprocessing (binarization, deskew, noise reduction), and text cleaning and vectorization (TF–IDF) into a single pipeline. In the document classification stage, the Bidirectional Encoder Representations from Transformers (BERT) model with a context-sensitive transformer architecture is used, along with classical machine learning models (Logistic Regression, Naive Bayes, Support Vector Machine) and an ensemble approach (LightGBM), to increase the accuracy by modeling the document content at a deep semantic level. Experiments were conducted on the RVL-CDIP dataset, and the OCR efficiency was evaluated using the Character Error Rate (CER) indicator, and the classification results were evaluated using the accuracy, precision, recall and F1-score metrics. The results confirmed the high stability and generalization ability of the BERT (accuracy, 95.1%; F1, 95.0%) and LightGBM (accuracy, 93.2%; F1, 93.2%) models. In the final stage, OCR, NER, and classification outputs are automatically organized into Dublin Core metadata elements (Title, Creator, Date, Description, Subject, Type, Format, Language) and exported in JSON/XML formats. This automation significantly reduces manual cataloging effort and improves indexing and retrieval efficiency in digital archival systems. Full article

(This article belongs to the Topic Advances in Integrative AI, Machine Learning, and Big Data for Transformative Applications)

► Show Figures

Graphical abstract

23 pages, 18538 KB

Open AccessArticle

MSRNet: Mamba-Based Self-Refinement Framework for Remote Sensing Change Detection

by Haoxuan Sun, Xiaogang Yang, Ruitao Lu, Jing Zhang, Bo Li and Tao Zhang

Remote Sens. 2026, 18(7), 1042; https://doi.org/10.3390/rs18071042 - 30 Mar 2026

Viewed by 279

Abstract

Accurate change detection (CD) in very high-resolution (VHR, <1 m) optical remote sensing images remains challenging, as it requires effective modeling of long-range bi-temporal dependencies and robustness against label noise in complex urban environments. Existing deep learning-based CD methods either rely on convolutional [...] Read more.

Accurate change detection (CD) in very high-resolution (VHR, <1 m) optical remote sensing images remains challenging, as it requires effective modeling of long-range bi-temporal dependencies and robustness against label noise in complex urban environments. Existing deep learning-based CD methods either rely on convolutional operations with limited receptive fields or employ global attention mechanisms with high computational cost, making it difficult to simultaneously achieve efficient global context modeling and fine-grained structural sensitivity. To address these challenges, we propose a Mamba-based self-refinement framework for remote sensing change detection (MSRNet). Specifically, we introduce an attention-enhanced oblique state space module (AOSS) to model spatio-temporal dependencies with linear complexity while preserving fine-grained structural information. The four-branch attention fusion module (FBAM) further enhances cross-dimensional feature interaction to improve the discriminative capability of differential representations. In addition, a self-refinement module (SRM) incorporates a momentum encoder to generate high-quality pseudo-labels, mitigating annotation noise and enabling learning from latent changes. Extensive experiments on two benchmark VHR datasets, LEVIR-CD and WHU-CD, demonstrate that MSRNet achieves state-of-the-art performance in both accuracy and computational efficiency. Full article

(This article belongs to the Section AI Remote Sensing)

► Show Figures

Figure 1

24 pages, 1545 KB

Open AccessArticle

PMSDA: Progressive Multi-Strategy Domain Alignment for Cross-Scene Vibration Recognition in Distributed Optical Fiber Sensing

by Yuxiang Ni, Jing Cheng, Di Wu, Qianqian Duan, Linhua Jiang, Xing Hu and Dawei Zhang

Photonics 2026, 13(4), 334; https://doi.org/10.3390/photonics13040334 - 29 Mar 2026

Viewed by 368

Abstract

Distributed optical fiber vibration sensing (DVS) has shown strong potential in perimeter security, pipeline leakage monitoring, transportation safety, and structural health diagnostics owing to its high sensitivity, long-range coverage, and immunity to electromagnetic interference. However, severe cross-scene distribution mismatch is often encountered in [...] Read more.

Distributed optical fiber vibration sensing (DVS) has shown strong potential in perimeter security, pipeline leakage monitoring, transportation safety, and structural health diagnostics owing to its high sensitivity, long-range coverage, and immunity to electromagnetic interference. However, severe cross-scene distribution mismatch is often encountered in real-world deployments: indoor, outdoor, and pipeline environments exhibit markedly different noise patterns and time–frequency characteristics, thereby degrading the generalization ability of models trained in a single scene. To address this challenge, we propose a Progressive Multi-Strategy Domain Alignment (PMSDA) framework for label-disjoint cross-scene vibration recognition. PMSDA uses a compact expansion–compression encoder together with complementary alignment mechanisms—maximum mean discrepancy (MMD), correlation alignment (CORAL), and adversarial domain discrimination—to learn a scene-robust latent space from a labeled indoor source and two unlabeled target domains (outdoor and pipeline) within a single alternating-training model. Because the fine-grained source and target label spaces are disjoint, PMSDA is formulated as a representation-transfer framework rather than a standard label-shared unsupervised domain adaptation method; target-domain recognition is therefore performed through domain-specific prototype clustering in the aligned latent space. On three representative scenes with nine event classes in total, PMSDA achieved 89.5% accuracy, 86.7% macro-F1, and 0.93 AUC for Indoor→Outdoor, and 85.8%, 84.7%, and 0.87, respectively, for Indoor→Pipeline, outperforming traditional feature+SVM/RF pipelines, CNN/ResNet baselines, and representation-transfer baselines adapted from DANN/CDAN/SHOT under the same evaluation protocol. These results indicate that PMSDA is a promising and effective framework for offline cross-scene DVS evaluation under disjoint target event sets. Full article

(This article belongs to the Special Issue Machine Learning and Artificial Intelligence for Optical Networks)

► Show Figures

Figure 1

33 pages, 3891 KB

Open AccessArticle

Correlation and Semantic Prior-Guided Multi-Scale Cross-Modal Interaction Network for SAR-OPT Image Fusion

by Xiaoyang Hou, Lingxi Zhou, Chenguo Feng, Hao Cha, Yang Liu, Liguo Liu and Haibo Liu

Remote Sens. 2026, 18(7), 975; https://doi.org/10.3390/rs18070975 - 24 Mar 2026

Viewed by 298

Abstract

Syntheticaperture radar (SAR) and optical (OPT) image fusion aims to leverage their complementary information to obtain a more comprehensive representation of ground objects. However, significant discrepancies exist between the two modalities in terms of imaging mechanisms and feature distributions. Consequently, existing multi-modal image [...] Read more.

Syntheticaperture radar (SAR) and optical (OPT) image fusion aims to leverage their complementary information to obtain a more comprehensive representation of ground objects. However, significant discrepancies exist between the two modalities in terms of imaging mechanisms and feature distributions. Consequently, existing multi-modal image fusion methods struggle to achieve robust cross-modal feature alignment and deep semantic consistency between the fused results and the source modalities. To address the above challenges, this paper proposes a correlation and semantic prior-guided multi-scale cross-modal interaction network (CSP-MCIN) for effective SAR-OPT image fusion. Specifically, CSP-MCIN first employs two modality-specific encoders based on ResNet-18 to extract low-level details and high-level semantic features from SAR and OPT images, respectively. Subsequently, a multi-scale interactive decoder integrating cross-modal Transformers and gated fusion units is constructed to align and aggregate semantic and detail information from both encoders. Finally, to strengthen source-modality representations, a novel loss function combining a pixel-domain correlation loss and a CLIP-guided semantic consistency loss is designed and optimized under a PCGrad-based multi-objective optimization scheme. Experimental results on public SAR-OPT image datasets demonstrate that the proposed CSP-MCIN achieves superior fusion performance and computational efficiency compared with state-of-the-art approaches. Full article

(This article belongs to the Special Issue Multimodal Data Fusion for Synthetic Aperture Radar (SAR) Image Processing)

► Show Figures

Figure 1

23 pages, 2873 KB

Open AccessArticle

An Online Calibration Method for UAV Electro-Optical Pod Zoom Cameras Based on IMU-Vision Fusion

by Weiming Zhu, Zhangsong Shi, Huihui Xu, Qingping Hu, Wenjian Ying and Fan Gui

Drones 2026, 10(3), 224; https://doi.org/10.3390/drones10030224 - 22 Mar 2026

Viewed by 307

Abstract

To address the calibration challenge caused by the nonlinear variation in intrinsic parameters during continuous camera zooming in UAV electro-optical pods, this paper proposes an online calibration method based on IMU-visual fusion. Traditional offline calibration cannot adapt to dynamic scenarios, while existing self-calibration [...] Read more.

To address the calibration challenge caused by the nonlinear variation in intrinsic parameters during continuous camera zooming in UAV electro-optical pods, this paper proposes an online calibration method based on IMU-visual fusion. Traditional offline calibration cannot adapt to dynamic scenarios, while existing self-calibration methods suffer from slow convergence and insufficient robustness. The proposed method aims to achieve real-time and accurate estimation of camera intrinsic parameters during zooming. Specifically, we first construct a unified state estimation framework that encodes the internal and external parameters of the camera and the 3D positions of scene feature points into a high-dimensional state vector, then establish a camera motion model based on IMU data, construct a visual observation model by combining the pinhole camera and second-order radial distortion model to establish a nonlinear mapping from 3D feature points to 2D pixel coordinates, and adopt an improved ORB algorithm for feature extraction and LK optical flow method to achieve high-precision cross-frame feature matching to enhance the stability of visual observation. Most importantly, we design a tight-coupling fusion strategy based on the Extended Kalman Filter (EKF) prediction-update iteration mechanism, which fuses IMU high-frequency motion constraints and visual geometric constraints in real time to suppress parameter drift induced by focal length changes. Finally, we recursively solve the state vector to complete the online dynamic estimation of intrinsic parameters. Monte Carlo simulation experiments and real UAV flight experiments confirm that the method has both high estimation accuracy and strong environmental adaptability, can meet the high-precision calibration needs of UAVs in dynamic scenarios, and provides reliable technical support for accurate target positioning. Full article

► Show Figures

Figure 1

16 pages, 7174 KB

Open AccessArticle

Aberration-Conditioned Attention-Driven Centroid Localization: From Simulation Mechanism to Double-Spot Experiment

by Zhonghao Zhao, Jia Hou, Yuanting Liu, Anwei Liu and Zhiping He

Photonics 2026, 13(3), 304; https://doi.org/10.3390/photonics13030304 - 20 Mar 2026

Viewed by 228

Abstract

In size, weight, and power (SWaP)-constrained optical systems, such as spaceborne LiDAR, high-precision centroid localization often relies on focal-plane measurements without dedicated wavefront sensors. Under such conditions, the nonlinear coupling between optical aberrations and sensor noise introduces systematic bias that is difficult to [...] Read more.

In size, weight, and power (SWaP)-constrained optical systems, such as spaceborne LiDAR, high-precision centroid localization often relies on focal-plane measurements without dedicated wavefront sensors. Under such conditions, the nonlinear coupling between optical aberrations and sensor noise introduces systematic bias that is difficult to mitigate using conventional centroiding methods. To address this issue, we propose a physics-conditioned feature correction framework based on an aberration-conditioned attention mechanism. A hybrid CNN–Transformer architecture is employed to predict and compensate for systematic centroid bias. Specifically, convolutional layers encode the degraded spot morphology, while a multi-head attention mechanism leverages Seidel aberration coefficients to adaptively modulate spatial features for precise regression. Given the unavailability of absolute ground-truth coordinates in empirical scenarios, a physics-consistent simulation framework based on scalar diffraction theory is constructed to generate synthetic data for supervised learning. Simulation results indicate that the proposed method objectively reduces anisotropic systematic bias, achieving a localization root-mean-square error (RMSE) of 0.011 to 0.021 pixels, and maintains stable sub-pixel accuracy even under a 10% empirical prior perturbation. To evaluate generalization performance and engineering reliability, a wedge-based double-spot platform is developed to verify physical consistency via geometric invariance. Experimental results demonstrate a measured spacing standard deviation (SD) of 0.015 to 0.039 pixels. This validates the framework’s transferability from theoretical simulation to controlled physical measurements, providing an algorithmic foundation for precision optical metrology in hardware-constrained environments. Full article

(This article belongs to the Special Issue Advancements in Optics and Laser Measurement)

► Show Figures

Figure 1

12 pages, 970 KB

Open AccessArticle

Frequency and Hearing Loss Phenotypes of OPA1 Variants in a Cohort of 18,475 Patients with Hearing Impairment

by Masayuki Kawakita, Hideaki Moteki, Shin-ya Nishio, Yumiko Kobayashi, Mika Adachi, Takayuki Okano, Hiroshi Yamazaki, Jun Nakayama, Shinya Ohira, Takashi Ishino, Yutaka Takumi and Shin-ichi Usami

Genes 2026, 17(3), 341; https://doi.org/10.3390/genes17030341 - 19 Mar 2026

Viewed by 370

Abstract

Background/Objectives: The OPA1 gene encodes a dynamin-related GTPase essential for mitochondrial fusion. Variants in OPA1 are a major cause of autosomal dominant optic atrophy (DOA). A subset of DOA patients exhibits hearing loss, often manifesting as auditory neuropathy spectrum disorder (ANSD). In this [...] Read more.

Background/Objectives: The OPA1 gene encodes a dynamin-related GTPase essential for mitochondrial fusion. Variants in OPA1 are a major cause of autosomal dominant optic atrophy (DOA). A subset of DOA patients exhibits hearing loss, often manifesting as auditory neuropathy spectrum disorder (ANSD). In this study, we aimed to describe the frequency of OPA1-related hearing loss in a large cohort of patients with hearing loss and to explore the genotype–phenotype correlations and appropriate interventions. Methods: A total of 18,475 Japanese patients with hearing loss were recruited. Targeted massively parallel sequencing of 158 deafness-related genes was performed, and individuals with OPA1 variants were identified. Clinical data, including age of onset, audiological findings, and systemic features, were retrospectively reviewed. Results: Ten individuals from eight independent families carrying OPA1 variants were identified. Three variants were classified as pathogenic or likely pathogenic, while five were variants of uncertain significance. Hearing loss was typically post-lingual in onset and progressive, with predominantly mild-to-moderate severity. Missense variants tended to be associated with DOA-plus phenotypes and ANSD. Five patients obtained only limited benefit from hearing aids, whereas one patient who received a cochlear implant achieved good speech perception. Conclusions: OPA1 is a rare causative gene for hearing loss and is frequently associated with the ANSD phenotype. Affected individuals exhibited phenotypic heterogeneity, which may reflect incomplete penetrance or the influence of mitochondrial DNA-related factors. Full article

(This article belongs to the Section Human Genomics and Genetic Diseases)

► Show Figures

Figure 1

25 pages, 649 KB

Open AccessArticle

A Multimodal Biomedical Sensing Approach for Muscle Activation Onset Detection

by Qiang Chen, Haofei Li, Zhe Xiang, Moxian Lin, Yinfei Yi, Haoran Tang and Yan Zhan

Sensors 2026, 26(6), 1907; https://doi.org/10.3390/s26061907 - 18 Mar 2026

Viewed by 202

Abstract

Muscle onset detection is a fundamental problem in electromyography signal analysis, human–machine interaction, and rehabilitation assessment. In medical and biomedical applications, slow muscle activation onset processes are widely encountered in scenarios such as rehabilitation training, postural regulation, and fine motor control. Such processes [...] Read more.

Muscle onset detection is a fundamental problem in electromyography signal analysis, human–machine interaction, and rehabilitation assessment. In medical and biomedical applications, slow muscle activation onset processes are widely encountered in scenarios such as rehabilitation training, postural regulation, and fine motor control. Such processes are typically characterized by slowly varying amplitudes, long temporal durations, and high susceptibility to noise interference, which poses significant challenges for accurate identification of onset timing. To address these issues, a lightweight temporal attention method for slow muscle activation onset detection is proposed and systematically validated under multimodal experimental settings. The proposed method takes surface electromyography signals as the primary input, while synchronously acquired optical motion image data are incorporated into the experimental design and result analysis, thereby aligning with the common joint use of optical imaging and physiological signals in medical and biomedical research. From a methodological perspective, the proposed framework is composed of lightweight temporal feature encoding, a slow activation-aware temporal attention mechanism, and noise suppression with stable decision strategies. Under the constraint of low computational complexity, the ability to model progressive activation signals is effectively enhanced. Experiments are conducted on a dataset containing multiple types of slow activation movements, and model performance is evaluated using five-fold cross-validation. The results demonstrate that under regular signal-to-noise ratio conditions, the proposed method significantly outperforms traditional threshold-based approaches, classical machine learning models, and several deep learning baselines in terms of onset detection accuracy, recall, and precision. Specifically, onset detection accuracy reaches approximately

92 %

, recall is around

90 %

, and precision is approximately

93 %

. Meanwhile, the average onset detection error and detection delay are reduced to about

41 ms

and

28 ms

, respectively, with the false positive rate controlled at approximately

2.2 %

. Stable performance is further maintained under different noise levels and cross-subject settings, indicating strong robustness and generalization capability. Full article

(This article belongs to the Special Issue Application of Optical Imaging in Medical and Biomedical Research)

► Show Figures

Figure 1

5 pages, 3160 KB

Open AccessProceeding Paper

Modeling Framework for Solid-Phase Peptide Synthesis on SiO₂

by Nicholas Smoliak, Pedro Parreira, Craig Macdonald and Vihar Georgiev

Eng. Proc. 2026, 127(1), 14; https://doi.org/10.3390/engproc2026127014 - 16 Mar 2026

Viewed by 182

Abstract

Solid-phase peptide synthesis (SPPS) allows for the sequential assembly of diverse peptide sequences. Alongside its scalability and capacity for automation, this makes it the method of choice for the synthesis of peptide-based pharmaceuticals. SPPS reaction pathways, however, suffer from a negative environmental footprint [...] Read more.

Solid-phase peptide synthesis (SPPS) allows for the sequential assembly of diverse peptide sequences. Alongside its scalability and capacity for automation, this makes it the method of choice for the synthesis of peptide-based pharmaceuticals. SPPS reaction pathways, however, suffer from a negative environmental footprint due to the super-stoichiometric quantities of reagents and high solvent use required to ensure reaction completion. In this paper, we propose the use of charge-based measurements as a complement to optical methods for measuring reaction completion. We extend the capabilities of our hybrid modeling framework to a representative four-step SPPS pathway on SiO₂, showing each reaction intermediate, its molecular encoding, and the resulting modeled surface potential (

ψ_{0}

). We show that the simulated

ψ_{0} (pH)

plots are separable for three of the four key reaction steps in the representative pathway, indicating that charge-based measurements may help verify protection/deprotection steps. Full article

(This article belongs to the Proceedings of International Conference on Responsible Electronics and Circular Technologies)

► Show Figures

Figure 1

24 pages, 5693 KB

Open AccessArticle

From Geometric Alignment to Scale Balance: Directional Strip Convolution and Efficient Scale Fusion for Remote Sensing Ship Detection

by Jing Sun, Guoyou Shi, Yaxin Yang and Xiaolian Cheng

Remote Sens. 2026, 18(6), 873; https://doi.org/10.3390/rs18060873 - 12 Mar 2026

Viewed by 312

Abstract

Optical remote sensing ship detection faces significant challenges in realistic maritime scenes due to strong background clutter (e.g., docks, shorelines, wake streaks), extreme scale variation, and the elongated geometry of ships with diverse orientations. These factors frequently lead to geometric misalignment, unstable localization, [...] Read more.

Optical remote sensing ship detection faces significant challenges in realistic maritime scenes due to strong background clutter (e.g., docks, shorelines, wake streaks), extreme scale variation, and the elongated geometry of ships with diverse orientations. These factors frequently lead to geometric misalignment, unstable localization, and false alarms, particularly in congested ports and complex sea states. To enhance robustness under clutter while retaining the set prediction efficiency of DETR, we propose the Directional Efficient Network (DENet), a structure-aware enhancement built upon RT-DETR. DENet introduces two complementary components. First, Directional Strip Convolution (DSConv) replaces the standard

3 \times 3

convolution for spatial mixing. By predicting offsets conditioned on input features, DSConv performs strip aggregation that aligns with slender hull structures, thereby suppressing interference from line-shaped background patterns. Second, Efficient Scale Fusion (ESF) augments the Hybrid Encoder as an additive residual correction. It combines multiple receptive field branches with lightweight differential compensation to balance low-frequency context and high-frequency structural transitions, ensuring stable multi-scale fusion in cluttered scenes. Extensive experiments demonstrate the effectiveness of DENet. On ShipRSImageNet,

{AP}^{val}

improves from 58.8% to 63.2% and

{AP}_{50}^{val}

increases from 68.5% to 73.6%. Consistent gains are also observed on NWPU VHR-10, where

{AP}^{val}

reaches 63.0% and

{AP}_{50}^{val}

reaches 94.6%, alongside improvements on the Infrared Ship Database and VisDrone2019-DET, validating the method’s generalization capabilities. Full article

(This article belongs to the Special Issue Advances in Deep Learning and Machine Learning for Remote Sensing Image Analysis)

► Show Figures

Figure 1

23 pages, 2347 KB

Open AccessArticle

Tolerance Analysis and Experimental Validation of ROMI—A High-Precision Linear Delta Robot for Microsurgery

by Xiaoyu Huang, Jiazhe Tang, Elizabeth Rendon-Morales and Rodrigo Aviles-Espinosa

Designs 2026, 10(2), 31; https://doi.org/10.3390/designs10020031 - 11 Mar 2026

Viewed by 259

Abstract

In this paper we present the design of a tolerance analysis-based closed-loop system and a compensation framework applied to high-precision linear Delta robots. It considers the modelling of static and dynamic errors propagation arising from the structural tolerances and the end-effector’s positioning. This [...] Read more.

In this paper we present the design of a tolerance analysis-based closed-loop system and a compensation framework applied to high-precision linear Delta robots. It considers the modelling of static and dynamic errors propagation arising from the structural tolerances and the end-effector’s positioning. This approach is combined with a closed-loop control system implemented using high-resolution optical encoders. The model is applied to the ROMI robot, a high-precision experimental Delta robot designed for microsurgical applications. Our simulation results reveal a theoretical home position error (the centre of the robot’s platform) of 1.9 mm, which is effectively compensated through kinematic calibration and a tolerance analysis-based closed-loop system. The proposed framework is evaluated experimentally through proof-of-concept experiments mimicking a microsurgical resection task conducted on a human peripheral nerve sample. The results from executing micrometre scale parallelogram and circular trajectories showed error reduction rates of 92.3% and 51.2% respectively, after five trajectory iterations. These findings confirm that manufacturing-induced errors can be consistently compensated using the proposed methodology, thus eliminating the need for ultra-high-precision machined components. This work establishes a practical and scalable pathway for designing more affordable high-precision robotic systems suitable for microsurgical and other high-precision applications. Full article

(This article belongs to the Special Issue Advancements in Robotic Design, Manufacturing, and the Action-Perception Loop)

► Show Figures

Graphical abstract

27 pages, 9034 KB

Open AccessArticle

A Comparison of Optimisation Algorithms for Electronic Polarisation Control in Quantum Key Distribution

by Matt Young, Haofan Duan, Stefano Pirandola and Marco Lucamarini

Appl. Sci. 2026, 16(5), 2568; https://doi.org/10.3390/app16052568 - 7 Mar 2026

Viewed by 317

Abstract

Polarisation encoding is widely used in fibre-based Quantum Key Distribution (QKD), but random birefringence in optical fibres causes the transmitted states to drift, requiring active compensation at the receiver. Electronic Polarisation Controllers (EPCs) are commonly used for this purpose, yet the relationship between [...] Read more.

Polarisation encoding is widely used in fibre-based Quantum Key Distribution (QKD), but random birefringence in optical fibres causes the transmitted states to drift, requiring active compensation at the receiver. Electronic Polarisation Controllers (EPCs) are commonly used for this purpose, yet the relationship between their control voltages and the resulting polarisation transformation is highly nonlinear and difficult to model. While optimisation algorithms are frequently employed to align and stabilise polarisation states, their comparative performance has not been systematically studied in realistic QKD settings. In this work, we benchmark four optimisation algorithms for electronic polarisation control, using both a numerical model and a 50 km fibre-based experimental setup. We evaluate each algorithm in terms of convergence time, failure rate, and stability, under both initial alignment and continuous drift compensation scenarios. Coordinate Descent achieved the fastest average alignment time (2.1 ms in simulation; 34.6 s experimentally), while Simulated Annealing delivered perfect reliability. We further propose a hybrid control strategy that combines fast initial alignment with high-reliability realignment. This approach was validated over a continuous 2 h QKD simulation with real fibre drift, demonstrating robust polarisation control without manual intervention. Our results provide guidance for algorithm selection in practical QKD deployments and suggest a pathway to resilient, autonomous polarisation tracking in long-distance quantum networks. Full article

(This article belongs to the Special Issue Quantum Communication and Quantum Information)

► Show Figures

Figure 1

13 pages, 3486 KB

Open AccessArticle

Dual-Band Infrared Metasurface with High-Efficiency Focusing and Full-Stokes Polarization Analysis

by Lifeng Ma, Yi Huang, Yanhong Xie, Na Xie, Lu Zhang, Huilin Jiang and Jun Chang

Photonics 2026, 13(3), 256; https://doi.org/10.3390/photonics13030256 - 5 Mar 2026

Viewed by 486

Abstract

This study proposes a dual-band, mid-wave infrared (MWIR) and long-wave infrared (LWIR) polarization-multiplexed optical system based on a metasurface. By employing matrix-based phase encoding technology, we pioneered the use of a dual-band polarization multiplexing architecture for parallel processing, achieving full-Stokes polarization detection. This [...] Read more.

This study proposes a dual-band, mid-wave infrared (MWIR) and long-wave infrared (LWIR) polarization-multiplexed optical system based on a metasurface. By employing matrix-based phase encoding technology, we pioneered the use of a dual-band polarization multiplexing architecture for parallel processing, achieving full-Stokes polarization detection. This system realized wavelength and polarization multiplexing across six axial focal planes and the off-axis focal points on each focal plane. The system also achieved a high transmittance of 85%; the average transmittance of this system exceeded 70% in the 3–12 μm range. The focusing efficiency in the MWIR and LWIR is 71.1% and 62.5%, respectively, with polarization crosstalk below −25 dB. We used the inverse design method, shortening the design cycle by 80%. It provides a compact solution for infrared imaging, multispectral analysis, and biological tissue pathological detection. Full article

(This article belongs to the Special Issue Advanced Photonics Metamaterials and Metasurfaces: Science and Applications, 2nd Edition)

► Show Figures

Figure 1

Search Results (638)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (638)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI