Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (638)

Search Parameters:
Keywords = optical encoding

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
29 pages, 8017 KB  
Article
Quantum-Inspired Variational Inference for Non-Convex Stochastic Optimization: A Unified Mathematical Framework with Convergence Guarantees and Applications to Machine Learning in Communication Networks
by Abrar S. Alhazmi
Mathematics 2026, 14(7), 1236; https://doi.org/10.3390/math14071236 - 7 Apr 2026
Abstract
Non-convex stochastic optimization presents fundamental mathematical challenges across machine learning, wireless networks, data center resource allocation, and optical wireless communication systems, where complex loss landscapes with multiple local minima and saddle points impede classical variational inference methods. This paper introduces the Quantum-Inspired Variational [...] Read more.
Non-convex stochastic optimization presents fundamental mathematical challenges across machine learning, wireless networks, data center resource allocation, and optical wireless communication systems, where complex loss landscapes with multiple local minima and saddle points impede classical variational inference methods. This paper introduces the Quantum-Inspired Variational Inference (QIVI) framework, which systematically integrates quantum mechanical principles (superposition, entanglement, and measurement operators) into classical variational inference through rigorous mathematical formulations grounded in Hilbert space theory and operator algebras. We develop a unified optimization framework that encodes classical parameters as quantum-inspired states within finite-dimensional complex Hilbert spaces, employing unitary evolution operators and adaptive basis selection governed by gradient covariance eigendecomposition. The core mathematical contribution establishes that QIVI achieves a convergence rate of O(log2T/T1/2) for σ-strongly non-convex functions, provably improving upon the classical O(T1/4) rate, yielding a theoretical speedup factor of 1.851.96×. Comprehensive experiments across synthetic benchmarks, Bayesian neural networks, and real-world applications in network optimization and financial portfolio management demonstrate 23–47% faster convergence, 15–35% superior objective values, and 28–46% improved uncertainty calibration. The principal contributions include: (i) a rigorous Hilbert space-based mathematical framework for quantum-inspired variational inference grounded in operator algebras, (ii) a novel hybrid quantum–classical algorithm (QIVI) with adaptive basis selection via gradient covariance eigendecomposition, (iii) formal convergence proofs establishing provable improvement over classical methods, (iv) comprehensive empirical validation across diverse problem domains relevant to machine learning and network optimization, and (v) demonstration of the framework’s applicability to optimization problems arising in wireless networks, data center resource allocation, and network system design. Statistical validation using the Friedman test (χ2=847.3, p<0.001) and post hoc Wilcoxon signed-rank tests with Holm–Bonferroni correction confirm that QIVI’s improvements over all baseline methods are statistically significant at the α=0.05 level across all benchmark categories. The framework discovers 18.1 out of 20 true modes in multimodal distributions versus 9.1 for classical methods, demonstrating the potential of quantum-inspired optimization approaches for challenging stochastic problems arising in machine learning, wireless communication, and network optimization. Full article
31 pages, 6459 KB  
Article
Cooperative Hybrid Domain Network for Salient Object Detection in Optical Remote Sensing Images
by Yi Gu, Jianhang Zhou and Lelei Yan
Remote Sens. 2026, 18(7), 1087; https://doi.org/10.3390/rs18071087 - 4 Apr 2026
Viewed by 149
Abstract
Salient Object Detection (SOD) in Optical Remote Sensing Images (ORSIs) aims to localize and segment visually prominent objects amidst complex backgrounds and extreme scale variations. However, we observe that current frequency-aware methods typically rely on a naive feature aggregation paradigm, merging frequency and [...] Read more.
Salient Object Detection (SOD) in Optical Remote Sensing Images (ORSIs) aims to localize and segment visually prominent objects amidst complex backgrounds and extreme scale variations. However, we observe that current frequency-aware methods typically rely on a naive feature aggregation paradigm, merging frequency and spatial features via simple concatenation, addition, or direct combination. This shallow interaction overlooks the inherent semantic misalignment between the two domains, resulting in feature redundancy and poor boundary delineation. To address this limitation, we propose the Cooperative Hybrid Domain Network (CHDNet), a framework designed to facilitate synergistic cooperation between heterogeneous domains. Specifically, we propose the Cross-Domain Multi-Head Self-Attention (CD-MHSA) mechanism as a semantic bridge following the encoder. It employs a dimension expansion strategy to construct a Unified Interaction Manifold and utilizes a Frequency Anchor Interaction mechanism to achieve precise modulation of spatial textures using global spectral cues. Furthermore, to address the dual challenges of lacking explicit interpretation mechanisms for semantic co-occurrence and the susceptibility of topological structures to fracture in complex scenes during the decoding phase, we design a Multi-Branch Cooperative Decoder (MBCD) comprising three parallel paths: edge semantics, global relations, and reverse correction. This module dynamically integrates these heterogeneous clues through a Cooperative Fusion Strategy, combining explicit global dependency modeling with dual-domain reverse mining. Extensive experiments on multiple benchmark datasets demonstrate that the proposed CHDNet achieves performance superior to state-of-the-art (SOTA) methods. Full article
Show Figures

Figure 1

21 pages, 1291 KB  
Article
Development of a Software Model for Classification and Automatic Cataloging of Archive Documents
by Adilbek Dauletov, Bahodir Muminov, Noila Matyakubova, Uldona Abdurahmonova, Khurshida Bakhriyeva and Makhbubakhon Fayzieva
Information 2026, 17(4), 341; https://doi.org/10.3390/info17040341 - 1 Apr 2026
Viewed by 295
Abstract
This study proposes an integrated software model for automatic document classification and metadata generation based on the Dublin Core standard to address the issue of rapid and consistent management of archival documents in a digital environment. This approach combines the stages of receiving [...] Read more.
This study proposes an integrated software model for automatic document classification and metadata generation based on the Dublin Core standard to address the issue of rapid and consistent management of archival documents in a digital environment. This approach combines the stages of receiving incoming documents, converting them to text using optical character recognition (OCR), image preprocessing (binarization, deskew, noise reduction), and text cleaning and vectorization (TF–IDF) into a single pipeline. In the document classification stage, the Bidirectional Encoder Representations from Transformers (BERT) model with a context-sensitive transformer architecture is used, along with classical machine learning models (Logistic Regression, Naive Bayes, Support Vector Machine) and an ensemble approach (LightGBM), to increase the accuracy by modeling the document content at a deep semantic level. Experiments were conducted on the RVL-CDIP dataset, and the OCR efficiency was evaluated using the Character Error Rate (CER) indicator, and the classification results were evaluated using the accuracy, precision, recall and F1-score metrics. The results confirmed the high stability and generalization ability of the BERT (accuracy, 95.1%; F1, 95.0%) and LightGBM (accuracy, 93.2%; F1, 93.2%) models. In the final stage, OCR, NER, and classification outputs are automatically organized into Dublin Core metadata elements (Title, Creator, Date, Description, Subject, Type, Format, Language) and exported in JSON/XML formats. This automation significantly reduces manual cataloging effort and improves indexing and retrieval efficiency in digital archival systems. Full article
Show Figures

Graphical abstract

23 pages, 18538 KB  
Article
MSRNet: Mamba-Based Self-Refinement Framework for Remote Sensing Change Detection
by Haoxuan Sun, Xiaogang Yang, Ruitao Lu, Jing Zhang, Bo Li and Tao Zhang
Remote Sens. 2026, 18(7), 1042; https://doi.org/10.3390/rs18071042 - 30 Mar 2026
Viewed by 279
Abstract
Accurate change detection (CD) in very high-resolution (VHR, <1 m) optical remote sensing images remains challenging, as it requires effective modeling of long-range bi-temporal dependencies and robustness against label noise in complex urban environments. Existing deep learning-based CD methods either rely on convolutional [...] Read more.
Accurate change detection (CD) in very high-resolution (VHR, <1 m) optical remote sensing images remains challenging, as it requires effective modeling of long-range bi-temporal dependencies and robustness against label noise in complex urban environments. Existing deep learning-based CD methods either rely on convolutional operations with limited receptive fields or employ global attention mechanisms with high computational cost, making it difficult to simultaneously achieve efficient global context modeling and fine-grained structural sensitivity. To address these challenges, we propose a Mamba-based self-refinement framework for remote sensing change detection (MSRNet). Specifically, we introduce an attention-enhanced oblique state space module (AOSS) to model spatio-temporal dependencies with linear complexity while preserving fine-grained structural information. The four-branch attention fusion module (FBAM) further enhances cross-dimensional feature interaction to improve the discriminative capability of differential representations. In addition, a self-refinement module (SRM) incorporates a momentum encoder to generate high-quality pseudo-labels, mitigating annotation noise and enabling learning from latent changes. Extensive experiments on two benchmark VHR datasets, LEVIR-CD and WHU-CD, demonstrate that MSRNet achieves state-of-the-art performance in both accuracy and computational efficiency. Full article
(This article belongs to the Section AI Remote Sensing)
Show Figures

Figure 1

24 pages, 1545 KB  
Article
PMSDA: Progressive Multi-Strategy Domain Alignment for Cross-Scene Vibration Recognition in Distributed Optical Fiber Sensing
by Yuxiang Ni, Jing Cheng, Di Wu, Qianqian Duan, Linhua Jiang, Xing Hu and Dawei Zhang
Photonics 2026, 13(4), 334; https://doi.org/10.3390/photonics13040334 - 29 Mar 2026
Viewed by 368
Abstract
Distributed optical fiber vibration sensing (DVS) has shown strong potential in perimeter security, pipeline leakage monitoring, transportation safety, and structural health diagnostics owing to its high sensitivity, long-range coverage, and immunity to electromagnetic interference. However, severe cross-scene distribution mismatch is often encountered in [...] Read more.
Distributed optical fiber vibration sensing (DVS) has shown strong potential in perimeter security, pipeline leakage monitoring, transportation safety, and structural health diagnostics owing to its high sensitivity, long-range coverage, and immunity to electromagnetic interference. However, severe cross-scene distribution mismatch is often encountered in real-world deployments: indoor, outdoor, and pipeline environments exhibit markedly different noise patterns and time–frequency characteristics, thereby degrading the generalization ability of models trained in a single scene. To address this challenge, we propose a Progressive Multi-Strategy Domain Alignment (PMSDA) framework for label-disjoint cross-scene vibration recognition. PMSDA uses a compact expansion–compression encoder together with complementary alignment mechanisms—maximum mean discrepancy (MMD), correlation alignment (CORAL), and adversarial domain discrimination—to learn a scene-robust latent space from a labeled indoor source and two unlabeled target domains (outdoor and pipeline) within a single alternating-training model. Because the fine-grained source and target label spaces are disjoint, PMSDA is formulated as a representation-transfer framework rather than a standard label-shared unsupervised domain adaptation method; target-domain recognition is therefore performed through domain-specific prototype clustering in the aligned latent space. On three representative scenes with nine event classes in total, PMSDA achieved 89.5% accuracy, 86.7% macro-F1, and 0.93 AUC for Indoor→Outdoor, and 85.8%, 84.7%, and 0.87, respectively, for Indoor→Pipeline, outperforming traditional feature+SVM/RF pipelines, CNN/ResNet baselines, and representation-transfer baselines adapted from DANN/CDAN/SHOT under the same evaluation protocol. These results indicate that PMSDA is a promising and effective framework for offline cross-scene DVS evaluation under disjoint target event sets. Full article
(This article belongs to the Special Issue Machine Learning and Artificial Intelligence for Optical Networks)
Show Figures

Figure 1

33 pages, 3891 KB  
Article
Correlation and Semantic Prior-Guided Multi-Scale Cross-Modal Interaction Network for SAR-OPT Image Fusion
by Xiaoyang Hou, Lingxi Zhou, Chenguo Feng, Hao Cha, Yang Liu, Liguo Liu and Haibo Liu
Remote Sens. 2026, 18(7), 975; https://doi.org/10.3390/rs18070975 - 24 Mar 2026
Viewed by 298
Abstract
Syntheticaperture radar (SAR) and optical (OPT) image fusion aims to leverage their complementary information to obtain a more comprehensive representation of ground objects. However, significant discrepancies exist between the two modalities in terms of imaging mechanisms and feature distributions. Consequently, existing multi-modal image [...] Read more.
Syntheticaperture radar (SAR) and optical (OPT) image fusion aims to leverage their complementary information to obtain a more comprehensive representation of ground objects. However, significant discrepancies exist between the two modalities in terms of imaging mechanisms and feature distributions. Consequently, existing multi-modal image fusion methods struggle to achieve robust cross-modal feature alignment and deep semantic consistency between the fused results and the source modalities. To address the above challenges, this paper proposes a correlation and semantic prior-guided multi-scale cross-modal interaction network (CSP-MCIN) for effective SAR-OPT image fusion. Specifically, CSP-MCIN first employs two modality-specific encoders based on ResNet-18 to extract low-level details and high-level semantic features from SAR and OPT images, respectively. Subsequently, a multi-scale interactive decoder integrating cross-modal Transformers and gated fusion units is constructed to align and aggregate semantic and detail information from both encoders. Finally, to strengthen source-modality representations, a novel loss function combining a pixel-domain correlation loss and a CLIP-guided semantic consistency loss is designed and optimized under a PCGrad-based multi-objective optimization scheme. Experimental results on public SAR-OPT image datasets demonstrate that the proposed CSP-MCIN achieves superior fusion performance and computational efficiency compared with state-of-the-art approaches. Full article
Show Figures

Figure 1

23 pages, 2873 KB  
Article
An Online Calibration Method for UAV Electro-Optical Pod Zoom Cameras Based on IMU-Vision Fusion
by Weiming Zhu, Zhangsong Shi, Huihui Xu, Qingping Hu, Wenjian Ying and Fan Gui
Drones 2026, 10(3), 224; https://doi.org/10.3390/drones10030224 - 22 Mar 2026
Viewed by 307
Abstract
To address the calibration challenge caused by the nonlinear variation in intrinsic parameters during continuous camera zooming in UAV electro-optical pods, this paper proposes an online calibration method based on IMU-visual fusion. Traditional offline calibration cannot adapt to dynamic scenarios, while existing self-calibration [...] Read more.
To address the calibration challenge caused by the nonlinear variation in intrinsic parameters during continuous camera zooming in UAV electro-optical pods, this paper proposes an online calibration method based on IMU-visual fusion. Traditional offline calibration cannot adapt to dynamic scenarios, while existing self-calibration methods suffer from slow convergence and insufficient robustness. The proposed method aims to achieve real-time and accurate estimation of camera intrinsic parameters during zooming. Specifically, we first construct a unified state estimation framework that encodes the internal and external parameters of the camera and the 3D positions of scene feature points into a high-dimensional state vector, then establish a camera motion model based on IMU data, construct a visual observation model by combining the pinhole camera and second-order radial distortion model to establish a nonlinear mapping from 3D feature points to 2D pixel coordinates, and adopt an improved ORB algorithm for feature extraction and LK optical flow method to achieve high-precision cross-frame feature matching to enhance the stability of visual observation. Most importantly, we design a tight-coupling fusion strategy based on the Extended Kalman Filter (EKF) prediction-update iteration mechanism, which fuses IMU high-frequency motion constraints and visual geometric constraints in real time to suppress parameter drift induced by focal length changes. Finally, we recursively solve the state vector to complete the online dynamic estimation of intrinsic parameters. Monte Carlo simulation experiments and real UAV flight experiments confirm that the method has both high estimation accuracy and strong environmental adaptability, can meet the high-precision calibration needs of UAVs in dynamic scenarios, and provides reliable technical support for accurate target positioning. Full article
Show Figures

Figure 1

16 pages, 7174 KB  
Article
Aberration-Conditioned Attention-Driven Centroid Localization: From Simulation Mechanism to Double-Spot Experiment
by Zhonghao Zhao, Jia Hou, Yuanting Liu, Anwei Liu and Zhiping He
Photonics 2026, 13(3), 304; https://doi.org/10.3390/photonics13030304 - 20 Mar 2026
Viewed by 228
Abstract
In size, weight, and power (SWaP)-constrained optical systems, such as spaceborne LiDAR, high-precision centroid localization often relies on focal-plane measurements without dedicated wavefront sensors. Under such conditions, the nonlinear coupling between optical aberrations and sensor noise introduces systematic bias that is difficult to [...] Read more.
In size, weight, and power (SWaP)-constrained optical systems, such as spaceborne LiDAR, high-precision centroid localization often relies on focal-plane measurements without dedicated wavefront sensors. Under such conditions, the nonlinear coupling between optical aberrations and sensor noise introduces systematic bias that is difficult to mitigate using conventional centroiding methods. To address this issue, we propose a physics-conditioned feature correction framework based on an aberration-conditioned attention mechanism. A hybrid CNN–Transformer architecture is employed to predict and compensate for systematic centroid bias. Specifically, convolutional layers encode the degraded spot morphology, while a multi-head attention mechanism leverages Seidel aberration coefficients to adaptively modulate spatial features for precise regression. Given the unavailability of absolute ground-truth coordinates in empirical scenarios, a physics-consistent simulation framework based on scalar diffraction theory is constructed to generate synthetic data for supervised learning. Simulation results indicate that the proposed method objectively reduces anisotropic systematic bias, achieving a localization root-mean-square error (RMSE) of 0.011 to 0.021 pixels, and maintains stable sub-pixel accuracy even under a 10% empirical prior perturbation. To evaluate generalization performance and engineering reliability, a wedge-based double-spot platform is developed to verify physical consistency via geometric invariance. Experimental results demonstrate a measured spacing standard deviation (SD) of 0.015 to 0.039 pixels. This validates the framework’s transferability from theoretical simulation to controlled physical measurements, providing an algorithmic foundation for precision optical metrology in hardware-constrained environments. Full article
(This article belongs to the Special Issue Advancements in Optics and Laser Measurement)
Show Figures

Figure 1

12 pages, 970 KB  
Article
Frequency and Hearing Loss Phenotypes of OPA1 Variants in a Cohort of 18,475 Patients with Hearing Impairment
by Masayuki Kawakita, Hideaki Moteki, Shin-ya Nishio, Yumiko Kobayashi, Mika Adachi, Takayuki Okano, Hiroshi Yamazaki, Jun Nakayama, Shinya Ohira, Takashi Ishino, Yutaka Takumi and Shin-ichi Usami
Genes 2026, 17(3), 341; https://doi.org/10.3390/genes17030341 - 19 Mar 2026
Viewed by 370
Abstract
Background/Objectives: The OPA1 gene encodes a dynamin-related GTPase essential for mitochondrial fusion. Variants in OPA1 are a major cause of autosomal dominant optic atrophy (DOA). A subset of DOA patients exhibits hearing loss, often manifesting as auditory neuropathy spectrum disorder (ANSD). In this [...] Read more.
Background/Objectives: The OPA1 gene encodes a dynamin-related GTPase essential for mitochondrial fusion. Variants in OPA1 are a major cause of autosomal dominant optic atrophy (DOA). A subset of DOA patients exhibits hearing loss, often manifesting as auditory neuropathy spectrum disorder (ANSD). In this study, we aimed to describe the frequency of OPA1-related hearing loss in a large cohort of patients with hearing loss and to explore the genotype–phenotype correlations and appropriate interventions. Methods: A total of 18,475 Japanese patients with hearing loss were recruited. Targeted massively parallel sequencing of 158 deafness-related genes was performed, and individuals with OPA1 variants were identified. Clinical data, including age of onset, audiological findings, and systemic features, were retrospectively reviewed. Results: Ten individuals from eight independent families carrying OPA1 variants were identified. Three variants were classified as pathogenic or likely pathogenic, while five were variants of uncertain significance. Hearing loss was typically post-lingual in onset and progressive, with predominantly mild-to-moderate severity. Missense variants tended to be associated with DOA-plus phenotypes and ANSD. Five patients obtained only limited benefit from hearing aids, whereas one patient who received a cochlear implant achieved good speech perception. Conclusions: OPA1 is a rare causative gene for hearing loss and is frequently associated with the ANSD phenotype. Affected individuals exhibited phenotypic heterogeneity, which may reflect incomplete penetrance or the influence of mitochondrial DNA-related factors. Full article
(This article belongs to the Section Human Genomics and Genetic Diseases)
Show Figures

Figure 1

25 pages, 649 KB  
Article
A Multimodal Biomedical Sensing Approach for Muscle Activation Onset Detection
by Qiang Chen, Haofei Li, Zhe Xiang, Moxian Lin, Yinfei Yi, Haoran Tang and Yan Zhan
Sensors 2026, 26(6), 1907; https://doi.org/10.3390/s26061907 - 18 Mar 2026
Viewed by 202
Abstract
Muscle onset detection is a fundamental problem in electromyography signal analysis, human–machine interaction, and rehabilitation assessment. In medical and biomedical applications, slow muscle activation onset processes are widely encountered in scenarios such as rehabilitation training, postural regulation, and fine motor control. Such processes [...] Read more.
Muscle onset detection is a fundamental problem in electromyography signal analysis, human–machine interaction, and rehabilitation assessment. In medical and biomedical applications, slow muscle activation onset processes are widely encountered in scenarios such as rehabilitation training, postural regulation, and fine motor control. Such processes are typically characterized by slowly varying amplitudes, long temporal durations, and high susceptibility to noise interference, which poses significant challenges for accurate identification of onset timing. To address these issues, a lightweight temporal attention method for slow muscle activation onset detection is proposed and systematically validated under multimodal experimental settings. The proposed method takes surface electromyography signals as the primary input, while synchronously acquired optical motion image data are incorporated into the experimental design and result analysis, thereby aligning with the common joint use of optical imaging and physiological signals in medical and biomedical research. From a methodological perspective, the proposed framework is composed of lightweight temporal feature encoding, a slow activation-aware temporal attention mechanism, and noise suppression with stable decision strategies. Under the constraint of low computational complexity, the ability to model progressive activation signals is effectively enhanced. Experiments are conducted on a dataset containing multiple types of slow activation movements, and model performance is evaluated using five-fold cross-validation. The results demonstrate that under regular signal-to-noise ratio conditions, the proposed method significantly outperforms traditional threshold-based approaches, classical machine learning models, and several deep learning baselines in terms of onset detection accuracy, recall, and precision. Specifically, onset detection accuracy reaches approximately 92%, recall is around 90%, and precision is approximately 93%. Meanwhile, the average onset detection error and detection delay are reduced to about 41ms and 28ms, respectively, with the false positive rate controlled at approximately 2.2%. Stable performance is further maintained under different noise levels and cross-subject settings, indicating strong robustness and generalization capability. Full article
(This article belongs to the Special Issue Application of Optical Imaging in Medical and Biomedical Research)
Show Figures

Figure 1

5 pages, 3160 KB  
Proceeding Paper
Modeling Framework for Solid-Phase Peptide Synthesis on SiO2 
by Nicholas Smoliak, Pedro Parreira, Craig Macdonald and Vihar Georgiev
Eng. Proc. 2026, 127(1), 14; https://doi.org/10.3390/engproc2026127014 - 16 Mar 2026
Viewed by 182
Abstract
Solid-phase peptide synthesis (SPPS) allows for the sequential assembly of diverse peptide sequences. Alongside its scalability and capacity for automation, this makes it the method of choice for the synthesis of peptide-based pharmaceuticals. SPPS reaction pathways, however, suffer from a negative environmental footprint [...] Read more.
Solid-phase peptide synthesis (SPPS) allows for the sequential assembly of diverse peptide sequences. Alongside its scalability and capacity for automation, this makes it the method of choice for the synthesis of peptide-based pharmaceuticals. SPPS reaction pathways, however, suffer from a negative environmental footprint due to the super-stoichiometric quantities of reagents and high solvent use required to ensure reaction completion. In this paper, we propose the use of charge-based measurements as a complement to optical methods for measuring reaction completion. We extend the capabilities of our hybrid modeling framework to a representative four-step SPPS pathway on SiO2, showing each reaction intermediate, its molecular encoding, and the resulting modeled surface potential (ψ0). We show that the simulated ψ0(pH) plots are separable for three of the four key reaction steps in the representative pathway, indicating that charge-based measurements may help verify protection/deprotection steps. Full article
Show Figures

Figure 1

24 pages, 5693 KB  
Article
From Geometric Alignment to Scale Balance: Directional Strip Convolution and Efficient Scale Fusion for Remote Sensing Ship Detection
by Jing Sun, Guoyou Shi, Yaxin Yang and Xiaolian Cheng
Remote Sens. 2026, 18(6), 873; https://doi.org/10.3390/rs18060873 - 12 Mar 2026
Viewed by 312
Abstract
Optical remote sensing ship detection faces significant challenges in realistic maritime scenes due to strong background clutter (e.g., docks, shorelines, wake streaks), extreme scale variation, and the elongated geometry of ships with diverse orientations. These factors frequently lead to geometric misalignment, unstable localization, [...] Read more.
Optical remote sensing ship detection faces significant challenges in realistic maritime scenes due to strong background clutter (e.g., docks, shorelines, wake streaks), extreme scale variation, and the elongated geometry of ships with diverse orientations. These factors frequently lead to geometric misalignment, unstable localization, and false alarms, particularly in congested ports and complex sea states. To enhance robustness under clutter while retaining the set prediction efficiency of DETR, we propose the Directional Efficient Network (DENet), a structure-aware enhancement built upon RT-DETR. DENet introduces two complementary components. First, Directional Strip Convolution (DSConv) replaces the standard 3×3 convolution for spatial mixing. By predicting offsets conditioned on input features, DSConv performs strip aggregation that aligns with slender hull structures, thereby suppressing interference from line-shaped background patterns. Second, Efficient Scale Fusion (ESF) augments the Hybrid Encoder as an additive residual correction. It combines multiple receptive field branches with lightweight differential compensation to balance low-frequency context and high-frequency structural transitions, ensuring stable multi-scale fusion in cluttered scenes. Extensive experiments demonstrate the effectiveness of DENet. On ShipRSImageNet, APval improves from 58.8% to 63.2% and AP50val increases from 68.5% to 73.6%. Consistent gains are also observed on NWPU VHR-10, where APval reaches 63.0% and AP50val reaches 94.6%, alongside improvements on the Infrared Ship Database and VisDrone2019-DET, validating the method’s generalization capabilities. Full article
Show Figures

Figure 1

23 pages, 2347 KB  
Article
Tolerance Analysis and Experimental Validation of ROMI—A High-Precision Linear Delta Robot for Microsurgery
by Xiaoyu Huang, Jiazhe Tang, Elizabeth Rendon-Morales and Rodrigo Aviles-Espinosa
Designs 2026, 10(2), 31; https://doi.org/10.3390/designs10020031 - 11 Mar 2026
Viewed by 259
Abstract
In this paper we present the design of a tolerance analysis-based closed-loop system and a compensation framework applied to high-precision linear Delta robots. It considers the modelling of static and dynamic errors propagation arising from the structural tolerances and the end-effector’s positioning. This [...] Read more.
In this paper we present the design of a tolerance analysis-based closed-loop system and a compensation framework applied to high-precision linear Delta robots. It considers the modelling of static and dynamic errors propagation arising from the structural tolerances and the end-effector’s positioning. This approach is combined with a closed-loop control system implemented using high-resolution optical encoders. The model is applied to the ROMI robot, a high-precision experimental Delta robot designed for microsurgical applications. Our simulation results reveal a theoretical home position error (the centre of the robot’s platform) of 1.9 mm, which is effectively compensated through kinematic calibration and a tolerance analysis-based closed-loop system. The proposed framework is evaluated experimentally through proof-of-concept experiments mimicking a microsurgical resection task conducted on a human peripheral nerve sample. The results from executing micrometre scale parallelogram and circular trajectories showed error reduction rates of 92.3% and 51.2% respectively, after five trajectory iterations. These findings confirm that manufacturing-induced errors can be consistently compensated using the proposed methodology, thus eliminating the need for ultra-high-precision machined components. This work establishes a practical and scalable pathway for designing more affordable high-precision robotic systems suitable for microsurgical and other high-precision applications. Full article
Show Figures

Graphical abstract

27 pages, 9034 KB  
Article
A Comparison of Optimisation Algorithms for Electronic Polarisation Control in Quantum Key Distribution
by Matt Young, Haofan Duan, Stefano Pirandola and Marco Lucamarini
Appl. Sci. 2026, 16(5), 2568; https://doi.org/10.3390/app16052568 - 7 Mar 2026
Viewed by 317
Abstract
Polarisation encoding is widely used in fibre-based Quantum Key Distribution (QKD), but random birefringence in optical fibres causes the transmitted states to drift, requiring active compensation at the receiver. Electronic Polarisation Controllers (EPCs) are commonly used for this purpose, yet the relationship between [...] Read more.
Polarisation encoding is widely used in fibre-based Quantum Key Distribution (QKD), but random birefringence in optical fibres causes the transmitted states to drift, requiring active compensation at the receiver. Electronic Polarisation Controllers (EPCs) are commonly used for this purpose, yet the relationship between their control voltages and the resulting polarisation transformation is highly nonlinear and difficult to model. While optimisation algorithms are frequently employed to align and stabilise polarisation states, their comparative performance has not been systematically studied in realistic QKD settings. In this work, we benchmark four optimisation algorithms for electronic polarisation control, using both a numerical model and a 50 km fibre-based experimental setup. We evaluate each algorithm in terms of convergence time, failure rate, and stability, under both initial alignment and continuous drift compensation scenarios. Coordinate Descent achieved the fastest average alignment time (2.1 ms in simulation; 34.6 s experimentally), while Simulated Annealing delivered perfect reliability. We further propose a hybrid control strategy that combines fast initial alignment with high-reliability realignment. This approach was validated over a continuous 2 h QKD simulation with real fibre drift, demonstrating robust polarisation control without manual intervention. Our results provide guidance for algorithm selection in practical QKD deployments and suggest a pathway to resilient, autonomous polarisation tracking in long-distance quantum networks. Full article
(This article belongs to the Special Issue Quantum Communication and Quantum Information)
Show Figures

Figure 1

13 pages, 3486 KB  
Article
Dual-Band Infrared Metasurface with High-Efficiency Focusing and Full-Stokes Polarization Analysis
by Lifeng Ma, Yi Huang, Yanhong Xie, Na Xie, Lu Zhang, Huilin Jiang and Jun Chang
Photonics 2026, 13(3), 256; https://doi.org/10.3390/photonics13030256 - 5 Mar 2026
Viewed by 486
Abstract
This study proposes a dual-band, mid-wave infrared (MWIR) and long-wave infrared (LWIR) polarization-multiplexed optical system based on a metasurface. By employing matrix-based phase encoding technology, we pioneered the use of a dual-band polarization multiplexing architecture for parallel processing, achieving full-Stokes polarization detection. This [...] Read more.
This study proposes a dual-band, mid-wave infrared (MWIR) and long-wave infrared (LWIR) polarization-multiplexed optical system based on a metasurface. By employing matrix-based phase encoding technology, we pioneered the use of a dual-band polarization multiplexing architecture for parallel processing, achieving full-Stokes polarization detection. This system realized wavelength and polarization multiplexing across six axial focal planes and the off-axis focal points on each focal plane. The system also achieved a high transmittance of 85%; the average transmittance of this system exceeded 70% in the 3–12 μm range. The focusing efficiency in the MWIR and LWIR is 71.1% and 62.5%, respectively, with polarization crosstalk below −25 dB. We used the inverse design method, shortening the design cycle by 80%. It provides a compact solution for infrared imaging, multispectral analysis, and biological tissue pathological detection. Full article
Show Figures

Figure 1

Back to TopTop