Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (4,430)

Search Parameters:
Keywords = vision deep learning

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
29 pages, 23987 KB  
Article
YoLeTooth: A Unified Framework for Joint Tooth Segmentation and Periapical Lesion Detection in Panoramic Radiographs
by Gianmarco Scarano, Simone Agostinelli, Irene Amerini and Piero Papi
J. Imaging 2026, 12(6), 272; https://doi.org/10.3390/jimaging12060272 (registering DOI) - 20 Jun 2026
Abstract
Chronic periapical periodontitis is a persistent inflammatory disease characterized by progressive bone destruction around the tooth apex. Manual radiographic detection of these lesions is subjective and time-consuming, highlighting the need for automated diagnostic tools. This paper presents a unified deep learning framework for [...] Read more.
Chronic periapical periodontitis is a persistent inflammatory disease characterized by progressive bone destruction around the tooth apex. Manual radiographic detection of these lesions is subjective and time-consuming, highlighting the need for automated diagnostic tools. This paper presents a unified deep learning framework for joint tooth segmentation and periapical lesion detection in panoramic radiographs. Our approach employs a joint process: first, a deep learning model identifies and segments individual teeth according to standard dental numbering systems, while a second one detects periapical lesions within the tooth regions obtained from the segmentation outputs in the first stage. The framework incorporates an advanced loss function (Powerful IoU v2) to improve bounding-box regression accuracy and a spatial association mechanism to map detected lesions to specific teeth based on geometric overlap analysis. Our proposed tooth segmentation model achieves an mAP@50 of 97.7% and a mean Dice coefficient of 93.5%, while the periapical lesion detector reaches an mAP@50 of 91.9%. Furthermore, our region-of-interest approach yields a 3.49× computational speedup, averaging 0.1589 s per radiograph when compared to full-image processing. Trained exclusively on open-source datasets, this reproducible framework achieves explicit tooth-to-lesion mapping, providing an efficient and practical tool for periapical lesion screening. Full article
51 pages, 5501 KB  
Review
State of the Art in AI-Based Visual Inspection for Industrial Quality Control: Methods, Benchmarks, Challenges, and Autonomous Systems
by Amal Jayawardena, Jung-Hoon Sul, Diluka Moratuwage, Jaliya L. Wijayaraja and Lasitha Piyathilaka
Electronics 2026, 15(12), 2727; https://doi.org/10.3390/electronics15122727 (registering DOI) - 20 Jun 2026
Abstract
Industrial quality control is a critical component of modern manufacturing, as defects can lead to significant economic losses and safety risks. Traditional inspection methods, largely reliant on human operators or rule-based systems, often suffer from inconsistency, limited scalability, and reduced accuracy in complex [...] Read more.
Industrial quality control is a critical component of modern manufacturing, as defects can lead to significant economic losses and safety risks. Traditional inspection methods, largely reliant on human operators or rule-based systems, often suffer from inconsistency, limited scalability, and reduced accuracy in complex environments. Recent advances in artificial intelligence (AI), particularly in deep learning and computer vision, have enabled automated defect detection and classification with unprecedented performance. This paper provides a comprehensive review of AI-based image processing techniques for industrial quality control, covering classification, detection, and segmentation approaches. Key applications across manufacturing sectors are discussed, alongside current challenges such as data scarcity, real-time implementation, and model generalisation. Furthermore, this paper explores emerging trends toward autonomous inspection systems, integrating real-time analytics, edge computing, and intelligent decision making. The insights presented aim to guide future research toward robust, scalable, and fully automated quality control solutions in smart manufacturing environments. Full article
Show Figures

Figure 1

25 pages, 5240 KB  
Article
Monocular Estimation of Grape Berry Size (Caliber) Distributions Using Geometry-Aware Representations and Structured Prediction
by Matias Soto, Pablo Ormeño-Arriagada and Jorge Vasquez
Appl. Sci. 2026, 16(12), 6225; https://doi.org/10.3390/app16126225 (registering DOI) - 20 Jun 2026
Abstract
Grape caliber distributions are critical for packing, grading, yield estimation, and post-harvest logistics. However, estimating reliable caliber histograms from single images remains challenging due to occlusion and dense bunch structure. This work presents a two-stage monocular pipeline that integrates instance segmentation, geometry-aware representations, [...] Read more.
Grape caliber distributions are critical for packing, grading, yield estimation, and post-harvest logistics. However, estimating reliable caliber histograms from single images remains challenging due to occlusion and dense bunch structure. This work presents a two-stage monocular pipeline that integrates instance segmentation, geometry-aware representations, residual quantity correction, and structured histogram prediction. In the first stage, a YOLO-based model detects grape instances and a calibration object, enabling the construction of geometry-aware auxiliary channels and a segmentation-derived counting prior. In the second stage, these representations are used to estimate total grape count and caliber distributions. Results show that RGBDT consistently outperforms RGB, indicating that geometry-aware cues improve both histogram fidelity and counting accuracy. The framework achieves stable performance under realistic conditions while maintaining low runtime, supporting practical deployment in agricultural environments. Full article
(This article belongs to the Section Computing and Artificial Intelligence)
36 pages, 4092 KB  
Article
Functional Profiling in Paralympic Water Polo Using Deep Learning, Stereo Vision, and Phase-Based Kinematic Analysis: A Pilot Study
by Andrea Zanela
Bioengineering 2026, 13(6), 707; https://doi.org/10.3390/bioengineering13060707 (registering DOI) - 19 Jun 2026
Viewed by 82
Abstract
Paralympic water polo requires classification systems that reflect sport-specific functional performance under ecologically valid conditions. This pilot study proposes a task-specific kinematic profiling framework for deriving objective, biomechanically interpretable descriptors of residual motor function. Five male national-level water polo athletes—three with eligible motor [...] Read more.
Paralympic water polo requires classification systems that reflect sport-specific functional performance under ecologically valid conditions. This pilot study proposes a task-specific kinematic profiling framework for deriving objective, biomechanically interpretable descriptors of residual motor function. Five male national-level water polo athletes—three with eligible motor impairments and two able-bodied reference participants—performed standardized sport-specific tasks comprising upright floating, vertical propulsion, unilateral passing, non-contested shooting, and contested shooting under physical opposition. Stereoscopic video, OpenPose-based three-dimensional reconstruction, and phase-based analysis were used to extract features and composite indices of postural control, propulsion capacity, upper-limb residual function, and resistance to perturbation. Automatic ball-release detection matched manual frame-level verification in all 128 analyzed ball-related trials. Within the task-specific indices, where higher scores indicate greater functional burden, core values ranged from 0.05–0.15 for upright floating, 0.29–0.68 for combined arm-and-leg vertical propulsion, and 0.040–0.148 for contested shooting across the available subject–side combinations. The profiles showed task- and side-specific differences in stabilization, propulsion, and post-contact motor reorganization. The framework uses pose estimation as a quantitative measurement tool and treats visibility interruptions as functionally meaningful events rather than noise. It is not intended to replace official classification procedures, but to provide transparent and interpretable candidate descriptors for future evidence-based classification research in Paralympic water polo. Full article
Show Figures

Figure 1

30 pages, 86354 KB  
Article
GeometricPrinciples of Stereo Vision: A Quantitative Evaluation and Physical Validation of the Classical Pipeline
by Angel Fernando Ceballos-Espinoza, David Balderas-Silva, Alfredo Diaz-Lara and Rita Q. Fuentes-Aguilar
Appl. Sci. 2026, 16(12), 6212; https://doi.org/10.3390/app16126212 (registering DOI) - 19 Jun 2026
Viewed by 61
Abstract
Stereo vision is essential for passive three-dimensional perception in resource-constrained applications that require low power consumption, predictable latency, and explainable geometry. Although deep learning architectures dominate recent benchmarks, the classical block-matching pipeline remains a foundational approach. Optimizing this pipeline involves navigating complex trade-offs [...] Read more.
Stereo vision is essential for passive three-dimensional perception in resource-constrained applications that require low power consumption, predictable latency, and explainable geometry. Although deep learning architectures dominate recent benchmarks, the classical block-matching pipeline remains a foundational approach. Optimizing this pipeline involves navigating complex trade-offs among matching robustness, map density, and computational efficiency. This study systematically surveys and physically validates the classical stereo framework. After revisiting geometric first principles, three matching costs (SAD, NCC, ZNCC) are benchmarked alongside Sobel preprocessing and structural refinements, with subsequent validation using a calibrated consumer webcam rig. Middlebury benchmarks (2001–2021) indicate that while SAD fails under complex radiometric distortion, NCC consistently achieves superior quantitative metrics, incurring only a 1.2-fold computational overhead. Extending the disparity search range improves foreground localization, while block size imposes a trade-off between resolving the aperture problem and preserving fine geometric detail. To bridge theoretical analysis and practical deployment, the pipeline is validated using a custom-calibrated consumer stereo rig. The optimized Sobel-NCC architecture is then evaluated for real-time edge deployment on constrained hardware (NVIDIA Jetson Nano) and narrow-baseline sensors (OAK-D SR) in the context of agricultural robotic manipulation. By prioritizing metric precision over dense prediction, the classical pipeline reconstructs target surfaces with approximately 1 cm depth accuracy at 21 frames per second. These results demonstrate that optimized local algorithms offer deterministic and reliable geometric foundations for real-time edge-computed robotics. Although neural networks are essential for dense reconstructions in ill-posed regions, the foundational principles established here remain indispensable for advanced stereo vision system deployment. Full article
(This article belongs to the Section Robotics and Automation)
40 pages, 1911 KB  
Article
Monocular 3D Position Estimation of a Moving Vehicle Based on a Kalman-Goldschmidt Adaptive Filter
by Diana Kalita, Pavel Lyakhov, Valery Andreev and Denis Butusov
J. Sens. Actuator Netw. 2026, 15(3), 48; https://doi.org/10.3390/jsan15030048 (registering DOI) - 18 Jun 2026
Viewed by 63
Abstract
Determining the 3D position of a vehicle from a 2D image plays a key role in video surveillance, autonomous driving, and spatial localization. However, localization accuracy can significantly degrade in conditions of incomplete or synthetic measurement noise and keypoint jitter. In this paper, [...] Read more.
Determining the 3D position of a vehicle from a 2D image plays a key role in video surveillance, autonomous driving, and spatial localization. However, localization accuracy can significantly degrade in conditions of incomplete or synthetic measurement noise and keypoint jitter. In this paper, we propose a new iterative 3D position estimation algorithm (KGA). This algorithm includes geometric correction and calibration steps for converting from 2D to 3D coordinates; trajectory prediction and correction using a Kalman filter; and adaptive tuning of the filter parameters using the Goldschmidt algorithm. Experiments confirm that KGA outperforms the standard (FK) and modified (MFK) Kalman filters in accuracy and convergence speed, demonstrating robustness to various camera angles and noise levels. The novelty of this approach lies in the integration of the Goldschmidt algorithm into the Kalman filter to create an adaptation mechanism that dynamically adjusts the measurement noise covariance based on instantaneous innovation magnitude. Unlike end-to-end deep learning trackers or nonlinear filters (EKF/UKF), KGA is designed as a lightweight post-processing stage that can be seamlessly integrated into existing detection pipelines while maintaining the low computational footprint required for UAV-based edge deployment. The algorithm is of practical value for computer vision systems requiring accurate and robust tracking under varying observational conditions, with current implementation suitable for offline or buffered processing, and clear pathways to real-time deployment through code optimization. The algorithm is of practical value for computer vision systems requiring accurate and robust tracking under varying observational conditions. Full article
(This article belongs to the Section Big Data, Computing and Artificial Intelligence)
38 pages, 37709 KB  
Review
An Overview of the Research Status and Advances in Precision Feeding Technology and Equipment in Aquaculture
by Ke Chen, Sixian Li, Tieli Lyu, Dongfang Li, Zhiqiang Zhou, Jieyu Xian and Maohua Xiao
Animals 2026, 16(12), 1898; https://doi.org/10.3390/ani16121898 - 18 Jun 2026
Viewed by 106
Abstract
Precision feeding is an important foundation for improving production efficiency in aquaculture, reducing feed waste, mitigating water pollution, and promoting the intelligent development of aquaculture. Conventional feeding practices remain heavily dependent on operator experience and are typically executed at predetermined times or fixed [...] Read more.
Precision feeding is an important foundation for improving production efficiency in aquaculture, reducing feed waste, mitigating water pollution, and promoting the intelligent development of aquaculture. Conventional feeding practices remain heavily dependent on operator experience and are typically executed at predetermined times or fixed ration levels. Such approaches frequently result in extensive feeding management, poor adaptability, low feed utilization efficiency, and delayed responses to environmental changes. Advances in machine vision, the Internet of Things, machine learning, deep learning, and automatic control have progressively shifted aquaculture feeding research beyond standalone automatic feeders toward integrated systems encompassing demand perception, intelligent decision-making, precise control, and equipment coordination. This paper reviews the state of the art in precision feeding technologies and equipment in aquaculture. At the technical level, it summarizes advances in feeding demand perception, intelligent feeding decision-making, and precise control and execution. At the equipment level, it reviews the main types, design features, and field application status of precision feeding equipment in intensive aquaculture, pond aquaculture, and offshore aquaculture scenarios. Despite the considerable progress achieved, the practical deployment of precision feeding still faces several limitations. Environmental disturbances, water turbidity, illumination variation, and sensor drift may compromise the reliability of feeding demand perception. Existing decision-making models frequently exhibit limited generalizability across species, growth stages, and aquaculture scenarios. Moreover, insufficient integration of sensing, decision-making, and execution restricts the development of fully closed-loop feeding systems. High initial investment, maintenance costs, and the shortage of skilled personnel further constrain the adoption of precision feeding equipment, particularly in resource-limited regions. On this basis, the main challenges including sensing accuracy, model practicability, closed-loop control, equipment reliability, and standardization, are examined. Future development trends are also discussed, covering multi-source information fusion, synergy between mechanistic models and data-driven methods, system-level closed-loop control, equipment modularization, and industrial application. This review is expected to provide a reference for subsequent research and engineering applications. Full article
24 pages, 882 KB  
Systematic Review
Artificial Intelligence, Deep Learning, and Computer Vision in Hysteroscopy: A Systematic Review
by Rafał Watrowski, Attilio Di Spiezio Sardo, Peter Török, Andrea Rosati, Stoyan Kostov, Ibrahim Alkatout and Salvatore Giovanni Vitale
Diagnostics 2026, 16(12), 1899; https://doi.org/10.3390/diagnostics16121899 - 18 Jun 2026
Viewed by 197
Abstract
Background/Objectives: Hysteroscopy is the gold standard for visualization and treatment of intrauterine pathology. Because hysteroscopic interpretation remains operator-dependent, artificial intelligence (AI) has been evaluated as a tool to improve consistency, lesion recognition, and decision support. We aimed to systematically review AI, machine learning [...] Read more.
Background/Objectives: Hysteroscopy is the gold standard for visualization and treatment of intrauterine pathology. Because hysteroscopic interpretation remains operator-dependent, artificial intelligence (AI) has been evaluated as a tool to improve consistency, lesion recognition, and decision support. We aimed to systematically review AI, machine learning (ML), deep learning (DL), or computer-aided diagnosis (CAD) applications in hysteroscopy. Methods: A systematic search of PubMed/MEDLINE and EBSCOhost was performed from database inception to 8 March 2026, supplemented by targeted searches. Risk of bias was assessed using QUADAS-2 (diagnostic), PROBAST (prognostic), RoB2, and structured technical quality domains. Results: Nineteen primary studies were included, covering five areas: diagnostic classification and object detection (n = 8), real-time lesion detection and localization (n = 4), segmentation and visual-field support (n = 3), operative guidance (n = 1), and prognostic or decision-support applications (n = 3). Performance was highest in narrowly defined binary tasks and in large multicenter systems (e.g., ECCADx: AUC 0.979 internal, 0.975 external) and in prognostic fertility-prediction models after hysteroscopic adhesiolysis (AUC up to 0.992). Broader multiclass classification of heterogeneous lesions showed uneven and lower performance. Most studies were single-center, retrospective, and lacked external validation. Only one randomized study linked AI support to measurable procedural outcomes. Conclusions: The available studies indicate good technical performance in selected hysteroscopic tasks, particularly binary classification, focal lesion detection, and postoperative fertility stratification. Current evidence, however, remains limited by retrospective design, operator-dependent image acquisition, inconsistent validation, and scarce outcome-based clinical testing. In the short term, the most likely role of these systems is to support image interpretation, improve visual quality control, highlight suspicious lesions, and integrate hysteroscopic findings with complementary clinical data. Full article
Show Figures

Figure 1

26 pages, 13171 KB  
Article
A Deep Learning Approach for Pixel-Level Material Classification via Hyperspectral Imaging
by Savvas Sifnaios, George Arvanitakis, Fotios K. Konstantinidis, Georgios Tsimiklis, Angelos Amditis and Panayiotis Frangos
J. Imaging 2026, 12(6), 267; https://doi.org/10.3390/jimaging12060267 - 18 Jun 2026
Viewed by 171
Abstract
Recent advancements in computer vision, particularly in detection, segmentation, and classification, have significantly impacted various domains. However, these advancements are still strongly tied to RGB-based systems, which are insufficient for applications in industries such as waste sorting, pharmaceuticals, and defence, where material characterization [...] Read more.
Recent advancements in computer vision, particularly in detection, segmentation, and classification, have significantly impacted various domains. However, these advancements are still strongly tied to RGB-based systems, which are insufficient for applications in industries such as waste sorting, pharmaceuticals, and defence, where material characterization beyond shape or visible colour is necessary. Hyperspectral (HS) imaging captures spatial and spectral information for each pixel and therefore offers a promising route for material-level classification. This study evaluates the potential of combining HS imaging with deep learning for plastic material classification. The work includes: (i) the design of an experimental setup with a HS line-scan camera, conveyor, and controlled illumination; (ii) the construction of an object-disjoint dataset of HDPE, PET, PP, and PS samples with semi-automated mask generation and Raman spectroscopy-based labelling; and (iii) the development of P1CH, a lightweight pixel-wise 1D convolutional hyperspectral classifier. On object-disjoint test images, P1CH achieved 97.44% all-pixel accuracy. A boundary sensitivity analysis, reported separately because semi-automated labels are uncertain at material/background interfaces, yielded 99.94% accuracy after excluding a pre-defined two-pixel border band. Additional ablation, baseline, and robustness analyses show that the proposed pixel-wise spectral approach is effective for small fragments, visually similar plastics, and overlapping materials, while black or very dark plastics remain challenging under the present camera and illumination configuration. Full article
(This article belongs to the Special Issue Advancement in Hyperspectral Image Processing with Machine Learning)
Show Figures

Figure 1

33 pages, 4450 KB  
Article
Attention-Enhanced Hybrid CNN–ViT Framework for Genus-Level Classification of Selected Macrofungi from Basidiospore Micrographs
by Şuheda Aldemir Terman, Mustafa Emre Akçay, Ebubekir Seyyarer, Faruk Ayata and İsmail Acar
Appl. Sci. 2026, 16(12), 6167; https://doi.org/10.3390/app16126167 - 18 Jun 2026
Viewed by 171
Abstract
The development of rapid and reproducible image analysis approaches that support genus-level pre-classification of macrofungi is important for taxonomic pre-evaluation and controlled microscopic data analysis. In this study, an advanced deep learning-based approach, namely the Attention-Enhanced Hybrid CNN–ViT Framework, was rigorously evaluated for [...] Read more.
The development of rapid and reproducible image analysis approaches that support genus-level pre-classification of macrofungi is important for taxonomic pre-evaluation and controlled microscopic data analysis. In this study, an advanced deep learning-based approach, namely the Attention-Enhanced Hybrid CNN–ViT Framework, was rigorously evaluated for genus-level classification, using basidiospore micrographs of five carefully selected macrofungal genera. The proposed approach integrates the ability of convolutional neural networks to identify local texture and contour patterns with the global context-modelling capability of Vision Transformer structures. The objective is to enhance the extraction of distinctive representations from microscopic spore images through feature fusion and attention mechanisms. A series of experiments was conducted on a curated dataset consisting of light microscopy images of the genera Agaricus, Hebeloma, Inocybe, Amanita, and Russula. The models were compared using a range of evaluation metrics, including accuracy, F1-score, MCC, ROC-AUC, and PR-AUC. The results showed that the InceptionV3 + ViT-B16 + Fusion configuration was the most successful hybrid model, achieving an accuracy of 0.9213 ± 0.0182, an F1-score of 0.9212 ± 0.0179, a Matthews correlation coefficient (MCC) of 0.9040 ± 0.0222, a receiver operating characteristic (ROC)-area under the curve (AUC) of 0.9896 ± 0.0069, and a precision-recall (PR)-AUC of 0.9684 ± 0.0192, respectively. The present findings demonstrate that basidiospore images can carry distinctive visual information for genus-level automated classification under controlled conditions. However, it is important to note that these results should not be interpreted as claims of species-level identification or field generalisability. This is due to the use of a single microscope-camera system, a single preparation protocol, and the absence of an independent external test set. The present study demonstrates that deep learning-based microscopic image analysis can be evaluated as a preliminary classification tool in macrofungal taxonomy. It also shows that such tools can provide a foundation for future work supported by specimen-level validation, external test sets, and different imaging protocols. Full article
(This article belongs to the Section Applied Microbiology)
Show Figures

Figure 1

47 pages, 2452 KB  
Systematic Review
The CMA Agentic Platform: Autonomous Asset Verification and Algorithmic Auditor Governance
by Abdulkarim Hamdan J. Alhazmi, Sardar M. N. Islam and Maria Prokofieva
FinTech 2026, 5(2), 55; https://doi.org/10.3390/fintech5020055 - 17 Jun 2026
Viewed by 89
Abstract
Saudi Arabia’s audit market faces three governance challenges that existing frameworks may not fully address. These challenges concern a potential regulatory gap around autonomous AI accountability, a trust dimension that standard technology-adoption models may not fully capture, and limited mechanisms for independently verified [...] Read more.
Saudi Arabia’s audit market faces three governance challenges that existing frameworks may not fully address. These challenges concern a potential regulatory gap around autonomous AI accountability, a trust dimension that standard technology-adoption models may not fully capture, and limited mechanisms for independently verified ESG assurance under Vision 2030. This study adopts a conceptual design approach within the design science research tradition and proposes the CMA Agentic AI Platform as a practical response to these challenges. The platform comprises two segments. Segment 1 deploys autonomous drone swarms to verify corporate assets across four audit tasks—asset valuation, ESG compliance, anomaly detection and construction progress—using deep learning, thermal imaging and social-media cross-referencing. Segment 2 continuously monitors discretionary accruals and uses objective earnings-management data to inform auditor assignment and rotation decisions. This approach replaces subjective reputational assessments with transparent, quantifiable governance criteria. The platform is governed through the Triadic Agentic Framework, which extends classical agency theory by distributing authority across the Principal, the Human Agent and the AI Agent. The framework also operationalises Trust Expectancy as the primary adoption condition. The evidence base draws on two complementary streams: a PRISMA-guided systematic review and bibliometric analysis of thirty-nine peer-reviewed studies, and a documentary analysis of four national agentic-AI regulatory frameworks (SDAIA, MDDI/IMDA, NIST and ICO). The study contributes the concept of Algorithmic Accountability as a distinct governance domain, the Triadic Agentic Framework as an operational architecture for autonomous regulatory monitoring, and a reframing of the UTAUT trust construct for agentic-AI adoption in mature professional contexts. The platform converts theoretical governance into a regulatory architecture with direct implications for concentrated capital market regulators. Full article
Show Figures

Figure 1

30 pages, 20991 KB  
Review
Machine Learning for CRISPR-Based Diagnostics
by Haniel Siqueira Mortagua Walflor and Lia Carolina Soares Medeiros
Int. J. Mol. Sci. 2026, 27(12), 5485; https://doi.org/10.3390/ijms27125485 - 17 Jun 2026
Viewed by 240
Abstract
CRISPR-based diagnostics now detect viral, bacterial, and cancer-associated nucleic acids with sensitivities approaching quantitative PCR; however, their translation to decentralized care rests on computational design and interpretation that current datasets cannot sustain. Pandemic-era Cas12a assays reached 95% positive predictive agreement against reverse transcription [...] Read more.
CRISPR-based diagnostics now detect viral, bacterial, and cancer-associated nucleic acids with sensitivities approaching quantitative PCR; however, their translation to decentralized care rests on computational design and interpretation that current datasets cannot sustain. Pandemic-era Cas12a assays reached 95% positive predictive agreement against reverse transcription quantitative PCR (RT-qPCR) at 10 copies/μL, and deep neural networks now design Cas13 detection assays spanning 1933 vertebrate-infecting viruses, ranking candidate guides at Spearman correlations of 0.69 to 0.84 across internal and external validation. Generative deep-learning systems improve single-nucleotide discrimination two- to three-fold, computer vision classifies lateral flow outputs at 96.5% accuracy, and multi-biomarker fusion reaches an area under the receiver operating characteristic curve (AUC) of 0.998 in lung cancer detection. These results mask a narrow data foundation. Cas13a guide prediction still draws from a single screening library of 19,209 guide–target pairs, Cas12a has one published diagnostic model, and signal classifiers almost uniformly validate on single-site cohorts. This review synthesizes mechanistic constraints, predictive and generative models, and point-of-care classifiers, and maps the path beyond this data ceiling. Evolutionary pretraining on RNA corpora and lab-in-the-loop agents that convert model failure into targeted data acquisition define the route forward. Full article
(This article belongs to the Special Issue CRISPR/Cas Systems and Genome Editing—3rd Edition)
Show Figures

Figure 1

35 pages, 48685 KB  
Article
Efficient Multitask Onboard Vision Sensing for Open-Pit Mining Advanced Driver Assistance System with Classification-Guided Adaptive Temporal Inference
by Maximiliano Vélez and Claudio Urrea
Sensors 2026, 26(12), 3860; https://doi.org/10.3390/s26123860 - 17 Jun 2026
Viewed by 321
Abstract
Cameras and IMUs on heavy mining trucks supply the visual signal that Advanced Driver Assistance Systems (ADASs) use in open-pit operations. Haul roads in a surface mine are unstructured and unmarked, so a perception model must be both accurate and fast. We address [...] Read more.
Cameras and IMUs on heavy mining trucks supply the visual signal that Advanced Driver Assistance Systems (ADASs) use in open-pit operations. Haul roads in a surface mine are unstructured and unmarked, so a perception model must be both accurate and fast. We address this with a video-based multitask pipeline for a mining Driver Support System (DSS): a single BiSeNetV1 network produces drivable-area segmentation and steering-direction classification in one forward pass. Training used only 100 frames sampled non-sequentially from in-cab recordings of a real open-pit mine; evaluation used two full onboard sequences. To exploit temporal redundancy without annotating video, we propose an Adaptive Clockwork (A-CW) inference scheme: the spatial path runs on every frame, while the context path is refreshed only on keyframes whose cadence is set by the classification output, the same signal shown to the driver as a steering hint. This classification-guided policy increases context updates on curved segments, where the scene changes more rapidly, and reduces them on straight sections, where semantic redundancy is higher. The selected A-CW configuration was evaluated on full temporal test sequences, including one route kept entirely outside the training source. On this unseen route, A-CW achieved 94.70% road-class IoU and 73.68% Top-1 Accuracy. GPU-only throughput increased from about 55 FPS with frame-by-frame inference to 168.01 FPS, and display-excluded end-to-end processing in the simulated ADAS pipeline remained at approximately 37.5 FPS. Full article
(This article belongs to the Section Vehicular Sensing)
Show Figures

Figure 1

27 pages, 8573 KB  
Article
LTM-UNet: Linear Transformer–Mamba with Attention-Based U-Net for Context-Aware Breast Ultrasound Image Segmentation
by Shivpratap Singh Kushwah, Santosh Prakash Chouhan, Narinder Singh Punn and Mahua Bhattacharya
Diagnostics 2026, 16(12), 1888; https://doi.org/10.3390/diagnostics16121888 - 17 Jun 2026
Viewed by 204
Abstract
Background/Objectives: Accurate breast lesion segmentation using deep learning models requires precise understanding of both global contextual relevance and finer lesion structure details, which remains a challenge for existing convolutional and transformer-based approaches. This study aims to address these limitations by proposing a [...] Read more.
Background/Objectives: Accurate breast lesion segmentation using deep learning models requires precise understanding of both global contextual relevance and finer lesion structure details, which remains a challenge for existing convolutional and transformer-based approaches. This study aims to address these limitations by proposing a new segmentation model capable of improving context-aware dense segmentation tasks for ultrasound images. Method: We propose LTM-UNet, a novel segmentation method integrating transformer-based encoding with state-space-driven decoding in a U-Net-style framework. The architecture utilizes an efficient vision transformer encoder to extract multi-scale global representations. These features are refined through an attention-guided skip-fusion mechanism incorporating spatial-channel attention preserving finer spatial details and thereby minimizes the semantic gap between encoder and decoder features. Additionally, a direction-aware decoder based on a state-space model is introduced to efficiently capture long-range dependencies and enhance relevant feature reconstruction. Results: Extensive experiments on benchmark ultrasound medical imaging datasets demonstrate the effectiveness of the proposed method. The model achieves dice-score coefficients of 82.41% on the BUSI dataset and 86.62% on Dataset B (UDIAT), outperforming several existing segmentation approaches in both dice-score coefficient and Intersection-over-Union (IoU) metrics. Conclusions: The integration of efficient transformer-based global feature extraction, attention-enhanced feature fusion, and state-space-driven decoding enables LTM-UNet to effectively capture both structural details and contextual information, resulting in superior segmentation performance compared to existing methods. Full article
Show Figures

Figure 1

21 pages, 107753 KB  
Article
Individual Urban Tree Detection from Multispectral Satellite Imagery via Point-Supervised Deep Learning
by Thomas Martinoli, Luca Morandini and Piero Fraternali
Remote Sens. 2026, 18(12), 2021; https://doi.org/10.3390/rs18122021 - 17 Jun 2026
Viewed by 172
Abstract
Monitoring urban biodiversity is essential for designing resilient and sustainable cities. Urban trees provide a wide range of ecosystem services (ESs), including air pollution reduction, urban heat island mitigation, and psychological benefits for citizens. Accurate and updated tree inventories are therefore essential tools [...] Read more.
Monitoring urban biodiversity is essential for designing resilient and sustainable cities. Urban trees provide a wide range of ecosystem services (ESs), including air pollution reduction, urban heat island mitigation, and psychological benefits for citizens. Accurate and updated tree inventories are therefore essential tools for urban environmental monitoring. However, existing urban tree inventories are often incomplete or outdated, especially in private areas, limiting accurate ES assessment and urban planning. Earth observation satellite missions, particularly very-high-resolution multispectral (VHR-MS) imagery, offer a valuable alternative to field surveys for gathering information on urban environments. This work proposes a deep learning (DL) framework based on VHR-MS satellite imagery for the automatic generation of accurate urban tree inventories. DL models reduce human effort and save operational time by automatically learning complex representations and patterns from satellite imagery. The proposed encoder–decoder architecture extends prior point-based detection approaches by integrating a ResNet-50 backbone and a percentile-based threshold calibration procedure. Given the lack of suitable training data covering heterogeneous and densely vegetated urban environments, a dedicated dataset was constructed from VHR-MS satellite imagery acquired over the Lombardy region (Italy). The dataset encompasses a wide range of land uses and land covers, including residential and industrial zones, public parks, private gardens, and agricultural areas. Through the photointerpretation of more than 2800 images, precise coordinates for more than 50,000 manually annotated trees were obtained. The DL model is trained with point-level annotations, enabling precise localization of individual trees while reducing annotation ambiguity in dense urban contexts. On the Lombardy dataset at 30 cm/px resolution, the proposed framework achieves 86.72% Precision, 66.92% Recall, an F1-score of 75.54%, and a localization error of 1.473 m. Full article
(This article belongs to the Special Issue Remote Sensing Applied in Urban Environment Monitoring)
Show Figures

Figure 1

Back to TopTop