Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (764)

Search Parameters:
Keywords = small-object layer

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
33 pages, 4122 KB  
Article
Empirical Evaluation of UNet for Segmentation of Applicable Surfaces for Seismic Sensor Installation
by Mikhail Uzdiaev, Marina Astapova, Andrey Ronzhin and Aleksandra Figurek
J. Imaging 2026, 12(1), 34; https://doi.org/10.3390/jimaging12010034 - 8 Jan 2026
Viewed by 114
Abstract
The deployment of wireless seismic nodal systems necessitates the efficient identification of optimal locations for sensor installation, considering factors such as ground stability and the absence of interference. Semantic segmentation of satellite imagery has advanced significantly, and its application to this specific task [...] Read more.
The deployment of wireless seismic nodal systems necessitates the efficient identification of optimal locations for sensor installation, considering factors such as ground stability and the absence of interference. Semantic segmentation of satellite imagery has advanced significantly, and its application to this specific task remains unexplored. This work presents a baseline empirical evaluation of the U-Net architecture for the semantic segmentation of surfaces applicable for seismic sensor installation. We utilize a novel dataset of Sentinel-2 multispectral images, specifically labeled for this purpose. The study investigates the impact of pretrained encoders (EfficientNetB2, Cross-Stage Partial Darknet53—CSPDarknet53, and Multi-Axis Vision Transformer—MAxViT), different combinations of Sentinel-2 spectral bands (Red, Green, Blue (RGB), RGB+Near Infrared (NIR), 10-bands with 10 and 20 m/pix spatial resolution, full 13-band), and a technique for improving small object segmentation by modifying the input convolutional layer stride. Experimental results demonstrate that the CSPDarknet53 encoder generally outperforms the others (IoU = 0.534, Precision = 0.716, Recall = 0.635). The combination of RGB and Near-Infrared bands (10 m/pixel resolution) yielded the most robust performance across most configurations. Reducing the input stride from 2 to 1 proved beneficial for segmenting small linear objects like roads. The findings establish a baseline for this novel task and provide practical insights for optimizing deep learning models in the context of automated seismic nodal network installation planning. Full article
(This article belongs to the Special Issue Image Segmentation: Trends and Challenges)
Show Figures

Figure 1

30 pages, 6739 KB  
Article
A Fusion Algorithm for Pedestrian Anomaly Detection and Tracking on Urban Roads Based on Multi-Module Collaboration and Cross-Frame Matching Optimization
by Wei Zhao, Xin Gong, Lanlan Li and Luoyang Zuo
Sensors 2026, 26(2), 400; https://doi.org/10.3390/s26020400 - 8 Jan 2026
Viewed by 102
Abstract
Amid rapid advancements in artificial intelligence, the detection of abnormal human behaviors in complex traffic environments has garnered significant attention. However, detection errors frequently occur due to interference from complex backgrounds, small targets, and other factors. Therefore, this paper proposes a research methodology [...] Read more.
Amid rapid advancements in artificial intelligence, the detection of abnormal human behaviors in complex traffic environments has garnered significant attention. However, detection errors frequently occur due to interference from complex backgrounds, small targets, and other factors. Therefore, this paper proposes a research methodology that integrates the anomaly detection YOLO-SGCF algorithm with the tracking BoT-SORT-ReID algorithm. The detection module uses YOLOv8 as the baseline model, incorporating Swin Transformer to enhance global feature modeling capabilities in complex scenes. CBAM and CA attention are embedded into the Neck and backbone, respectively: CBAM enables dual-dimensional channel-spatial weighting, while CA precisely captures object location features by encoding coordinate information. The Neck layer incorporates GSConv convolutional modules to reduce computational load while expanding feature receptive fields. The loss function is replaced with Focal-EIoU to address sample imbalance issues and precisely optimize bounding box regression. For tracking, to enhance long-term tracking stability, ReID feature distances are incorporated during the BoT-SORT data association phase. This integrates behavioral category information from YOLO-SGCF, enabling the identification and tracking of abnormal pedestrian behaviors in complex environments. Evaluations on our self-built dataset (covering four abnormal behaviors: Climb, Fall, Fight, Phone) show mAP@50%, precision, and recall reaching 92.2%, 90.75%, and 86.57% respectively—improvements of 3.4%, 4.4%, and 6% over the original model—while maintaining an inference speed of 328.49 FPS. Additionally, generalization testing on the UCSD Ped1 dataset (covering six abnormal behaviors: Biker, Skater, Car, Wheelchair, Lawn, Runner) yielded an mAP score of 92.7%, representing a 1.5% improvement over the original model and outperforming existing mainstream models. Furthermore, the tracking algorithm achieved an MOTA of 90.8% and an MOTP of 92.6%, with a 47.6% reduction in IDS, demonstrating superior tracking performance compared to existing mainstream algorithms. Full article
(This article belongs to the Section Intelligent Sensors)
Show Figures

Figure 1

21 pages, 10154 KB  
Article
CRS-Y: A Study and Application of a Target Detection Method for Underwater Blasting Construction Sites
by Xiaowu Huang, Han Gao, Linna Li, Yucheng Zhao and Chen Men
Appl. Sci. 2026, 16(2), 615; https://doi.org/10.3390/app16020615 - 7 Jan 2026
Viewed by 67
Abstract
To strengthen the safety management and control of explosives in underwater blasting construction sites, this study proposes an improved YOLOv11-based network named CRS-Y, designed to enhance the detection accuracy of explosives in complex underwater environments and improve the recognition capability of multi-scale targets. [...] Read more.
To strengthen the safety management and control of explosives in underwater blasting construction sites, this study proposes an improved YOLOv11-based network named CRS-Y, designed to enhance the detection accuracy of explosives in complex underwater environments and improve the recognition capability of multi-scale targets. To address the limitations of traditional object detection methods in handling complex backgrounds and low-resolution targets, a lightweight re-parameterized vision transformer was integrated into the C3K module, forming a novel CSP structure (C3K-RepViT) that enhances feature extraction under small receptive fields. In combination with the Efficient Multi-Scale Attention (EMSA) mechanism, the model’s spatial feature representation is further strengthened, enabling a more effective understanding of objects in complex scenes. Furthermore, to reduce the computational cost of the P2 feature layer, a new convolutional structure named SPD-DSConv (Space-to-Depth Depthwise Separable Convolution) is proposed, which integrates downsampling and channel expansion within depthwise separable convolution. This design achieves a balance between parameter reduction and multidimensional feature learning. Finally, the Inner-IoU loss function is introduced to dynamically adjust auxiliary bounding box scales, accelerating regression convergence for both high-IoU and low-IoU samples, thereby optimizing bounding box shapes and localization accuracy while improving overall detection performance and robustness. Experimental results demonstrate that the proposed CRS-Y model achieved superior performance on the VOC2012, URPC2020, and self-constructed underwater blasting datasets, effectively meeting the real-time detection requirements of underwater blasting construction scenarios while exhibiting strong generalization ability and practical value. Full article
Show Figures

Figure 1

23 pages, 3750 KB  
Article
Lightweight Frame Format for Interoperability in Wireless Sensor Networks of IoT-Based Smart Systems
by Samer Jaloudi
Future Internet 2026, 18(1), 33; https://doi.org/10.3390/fi18010033 - 7 Jan 2026
Viewed by 61
Abstract
Applications of smart cities, smart buildings, smart agriculture systems, smart grids, and other smart systems benefit from Internet of Things (IoT) protocols, networks, and architecture. Wireless Sensor Networks (WSNs) in smart systems that employ IoT use wireless communication technologies between sensors in the [...] Read more.
Applications of smart cities, smart buildings, smart agriculture systems, smart grids, and other smart systems benefit from Internet of Things (IoT) protocols, networks, and architecture. Wireless Sensor Networks (WSNs) in smart systems that employ IoT use wireless communication technologies between sensors in the Things layer and the Fog layer hub. Such wireless protocols and networks include WiFi, Bluetooth, and Zigbee, among others. However, the payload formats of these protocols are heterogeneous, and thus, they lack a unified frame format that ensures interoperability. In this paper, a lightweight, interoperable frame format for low-rate, small-size Wireless Sensor Networks (WSNs) in IoT-based systems is designed, implemented, and tested. The practicality of this system is underscored by the development of a gateway that transfers collected data from sensors that use the unified frame to online servers via message queuing and telemetry transport (MQTT) secured with transport layer security (TLS), ensuring interoperability using the JavaScript Object Notation (JSON) format. The proposed frame is tested using market-available technologies such as Bluetooth and Zigbee, and then applied to smart home applications. The smart home scenario is chosen because it encompasses various smart subsystems, such as healthcare monitoring systems, energy monitoring systems, and entertainment systems, among others. The proposed system offers several advantages, including a low-cost architecture, ease of setup, improved interoperability, high flexibility, and a lightweight frame that can be applied to other wireless-based smart systems and applications. Full article
(This article belongs to the Special Issue Wireless Sensor Networks and Internet of Things)
Show Figures

Figure 1

16 pages, 14723 KB  
Article
FALW-YOLOv8: A Lightweight Model for Detecting Pipeline Defects
by Huazhong Wang, Xuetao Wang and Lihua Sun
Electronics 2026, 15(1), 209; https://doi.org/10.3390/electronics15010209 - 1 Jan 2026
Viewed by 213
Abstract
Pipelines are critical infrastructures in both industrial production and daily life. However, defects frequently arise due to environmental and manufacturing factors, which may lead to severe safety risks. To overcome the limitations of traditional object detection methods, such as inefficient feature extraction and [...] Read more.
Pipelines are critical infrastructures in both industrial production and daily life. However, defects frequently arise due to environmental and manufacturing factors, which may lead to severe safety risks. To overcome the limitations of traditional object detection methods, such as inefficient feature extraction and the loss of critical information, this paper proposes an improved algorithm, termed FALW-YOLOv8, built upon the YOLOv8 architecture. Specifically, the FasterBlock is incorporated into the C2f module to replace standard convolutional layers, effectively reducing computational redundancy while improving feature extraction efficiency. In addition, the ADown module is employed to enhance multi-scale feature preservation, while the LSKA attention mechanism is introduced to improve detection accuracy, particularly for small defects. The Wise-IoU v2 loss function is further adopted to refine bounding box regression for complex samples. Experimental results demonstrate that the proposed FALW-YOLOv8 achieves a 5.8% improvement in mAP50, along with a 34.8% reduction in model parameters and a 30.86% decrease in computational cost. These results indicate that the proposed method achieves a favorable balance between accuracy and efficiency, making it well-suited for real-time industrial pipeline inspection applications. Full article
Show Figures

Figure 1

22 pages, 2359 KB  
Review
Airport Ground-Based Aerial Object Surveillance Technologies for Enhanced Safety: A Systematic Review
by Joel Samu and Chuyang Yang
Drones 2026, 10(1), 22; https://doi.org/10.3390/drones10010022 - 31 Dec 2025
Viewed by 270
Abstract
Airport airspace safety is increasingly threatened by small, unmanned aircraft systems and wildlife that traditional radar cannot detect. While earlier reviews addressed general counter-UAS techniques, individual sensors, or the detection of single objects such as birds or drones, none has systematically reviewed airport-based, [...] Read more.
Airport airspace safety is increasingly threatened by small, unmanned aircraft systems and wildlife that traditional radar cannot detect. While earlier reviews addressed general counter-UAS techniques, individual sensors, or the detection of single objects such as birds or drones, none has systematically reviewed airport-based, multi-sensor surveillance strategies through a safety-theoretical lens. A systematic review, performed according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 2020 statement, synthesized recent research on fixed, ground-based aerial detection capabilities for small aerial hazards, specifically unmanned aircraft systems (sUAS) and avian targets, within operational airport environments. Searches targeted English-language, peer-reviewed articles from 2016 through 2025 in Web of Science and Scopus. Due to methodological heterogeneity across sensor technologies, a narrative synthesis was executed. The review of thirty-six studies, analyzed through Reason’s Swiss Cheese Model and Endsley’s Situational Awareness framework, found that only layered multi-sensor fusion architectures effectively address detection gaps for Low-Slow-Small (LSS) threats. Based on these findings, the review proposes seamless integration with Air Traffic Management (ATM) and UAS Traffic Management (UTM) systems through standardized data-exchange interfaces, complemented by theoretically grounded risk-based deployment strategies aligning surveillance technology tiers with operational risk profiles, from basic Remote ID receivers in low-risk rural environments to comprehensive multi-sensor fusion at high-density hubs, major airports, and urban vertiports. Full article
Show Figures

Figure 1

27 pages, 26736 KB  
Article
A Lightweight Traffic Sign Small Target Detection Network Suitable for Complex Environments
by Zonghong Feng, Liangchang Li, Kai Xu and Yong Wang
Appl. Sci. 2026, 16(1), 326; https://doi.org/10.3390/app16010326 - 28 Dec 2025
Viewed by 221
Abstract
With the increasing frequency of traffic safety issues and the rapid development of autonomous driving technology, traffic sign detection is highly susceptible to adverse weather conditions such as changes in light intensity, fog, rain, snow, and partial occlusion, which places higher demands on [...] Read more.
With the increasing frequency of traffic safety issues and the rapid development of autonomous driving technology, traffic sign detection is highly susceptible to adverse weather conditions such as changes in light intensity, fog, rain, snow, and partial occlusion, which places higher demands on the accurate recognition of traffic signs. This paper proposes an improved DAYOLO model based on YOLOv8n, aiming to balance detection accuracy and model complexity. First, the Bottleneck in the C2f module of the YOLOv8n backbone network is replaced with Bottleneck DAttention. Introducing DAttention allows for more effective feature extraction, thereby improving model performance. Second, an ultra-lightweight and efficient upsampler, Dysample, is introduced into the neck network to further improve performance and reduce computational overhead. Finally, a Task-Aligned Dynamic Detection Head (TADDH) is introduced. TADDH enhances task interaction through a dynamic mechanism and utilizes shared convolutional modules to reduce parameters and improve efficiency. Simultaneously, an additional Layer2 detection head is added to the model to strengthen the extraction and fusion of features at different scales, thereby improving the detection accuracy of small traffic signs. Furthermore, replacing SlideLoss with NWDLoss can better handle prediction results with more complex distributions and more accurately measure the distance between predicted and ground truth boxes in the feature space during object detection. Experimental results show that DAYOLO achieves 97.2% mAP on the SDCCVP dataset, which is 5.3 higher than the baseline model YOLOv8n; the frame rate reaches 120, which is 37.8% higher than YOLOv8; and the number of parameters is reduced by 6.2%, outperforming models such as YOLOv3, YOLOv5, YOLOv6, and YOLOv7. In addition, DAYOLO achieves 80.8 mAP on the TT100K dataset, which is 9.2% higher than the baseline model YOLOv8n. The proposed method achieves a balance between model size and detection accuracy, meets the needs of traffic sign detection, and provides new ideas and methods for future research in the field of traffic sign detection. Full article
Show Figures

Figure 1

19 pages, 3910 KB  
Article
Defect Detection Algorithm of Galvanized Sheet Based on S-C-B-YOLO
by Yicheng Liu, Gaoxia Fan, Hanquan Zhang and Dong Xiao
Mathematics 2026, 14(1), 110; https://doi.org/10.3390/math14010110 - 28 Dec 2025
Viewed by 203
Abstract
Galvanized steel sheets are vital anti-corrosion materials, yet their surface quality is prone to defects that impact performance. Manual inspection is inefficient, while conventional machine vision struggles with complex, small-scale defects in industrial settings. Although deep learning offers promising solutions, standard object detection [...] Read more.
Galvanized steel sheets are vital anti-corrosion materials, yet their surface quality is prone to defects that impact performance. Manual inspection is inefficient, while conventional machine vision struggles with complex, small-scale defects in industrial settings. Although deep learning offers promising solutions, standard object detection models like YOLOv5 (which is short for ‘You Only Look Once’) exhibit limitations in handling the subtle textures, scale variations, and reflective surfaces characteristic of galvanized sheet defects. To address these challenges, this paper proposes S-C-B-YOLO, an enhanced detection model based on YOLOv5. First, a Squeeze-and-Excitation (SE) attention mechanism is integrated into the deep layers of the backbone network to adaptively recalibrate channel-wise features, improving focus on defect-relevant information. Second, a Transformer block is combined with a C3 module to form a C3TR module, enhancing the model’s ability to capture global contextual relationships for irregular defects. Finally, the original path aggregation network (PANet) is replaced with a bidirectional feature pyramid network (Bi-FPN) to facilitate more efficient multi-scale feature fusion, significantly boosting sensitivity to small defects. Extensive experiments on a dedicated galvanized sheet defect dataset show that S-C-B-YOLO achieves a mean average precision (mAP@0.5) of 92.6% and an inference speed of 62 FPS, outperforming several baseline models including YOLOv3, YOLOv7, and Faster R-CNN. The proposed model demonstrates a favorable balance between accuracy and speed, offering a robust and practical solution for automated, real-time defect inspection in galvanized steel production. Full article
(This article belongs to the Special Issue Advance in Neural Networks and Visual Learning)
Show Figures

Figure 1

19 pages, 2216 KB  
Article
Research on Bi-Level Optimal Scheduling Strategy for Agricultural Park Integrated Energy System Considering External Meteorological Environmental Uncertainty
by Zeyi Wang, Yao Wang, Li Xie, Hongyu Sun, Xueshan Ni and Hua Zheng
Processes 2026, 14(1), 95; https://doi.org/10.3390/pr14010095 - 26 Dec 2025
Viewed by 176
Abstract
The Agricultural Park Integrated Energy System (APIES) is a key platform for integrating distributed renewable energy (DRE) with agricultural production. However, its economic operation and the stability of crop growth environments are severely challenged by bidirectional uncertainties from external meteorology. These include the [...] Read more.
The Agricultural Park Integrated Energy System (APIES) is a key platform for integrating distributed renewable energy (DRE) with agricultural production. However, its economic operation and the stability of crop growth environments are severely challenged by bidirectional uncertainties from external meteorology. These include the inherent variability of wind-solar generation and critical agricultural loads, such as supplementary lighting and temperature control, a challenge that existing models with static environmental parameters fail to address. To solve this, a bi-level optimization scheduling model for APIES considering meteorological uncertainty is proposed. The upper layer minimizes operation costs by quantifying uncertainties via triangular fuzzy chance constraints, with core constraints on DRE output, energy storage charging-discharging, and load shifting, solved by YALMIP-Gurobi linear programming. The lower layer maximizes crop growth environment satisfaction using a dynamic weight adaptive mechanism and NSGA-II multi-objective algorithm. The two layers iterate alternately for coordination. Using a small agricultural park in Xinjiang, China, as a case study, the results indicate that the proposed two-layer optimal scheduling model reduces costs by 10.8% compared to the traditional single-layer optimization model, and improves environmental satisfaction by 4.3% compared to the fixed-weight two-layer optimization model. Full article
Show Figures

Figure 1

20 pages, 3302 KB  
Article
Laser Propulsion in Confinement Regime: The Role of Film Thickness in the Impulse Generation Process
by Pietro Battocchio, Meriem Bembli, Nicola Bazzanella, Mattia Biesuz, Marina Scarpa, Gian Domenico Sorarù and Antonio Miotello
Appl. Sci. 2026, 16(1), 224; https://doi.org/10.3390/app16010224 - 25 Dec 2025
Viewed by 140
Abstract
A small amount of mass is generally ejected with high exhaust velocities from the surface of materials irradiated by intense laser pulses, so that a net impulse is generated on the target because of momentum conservation. This phenomenon proved to be a potential [...] Read more.
A small amount of mass is generally ejected with high exhaust velocities from the surface of materials irradiated by intense laser pulses, so that a net impulse is generated on the target because of momentum conservation. This phenomenon proved to be a potential solution to generate thrust on far objects, with promising application in space debris removal and control of nanosatellites. Among the different tested strategies, the deposition on the surface of the target of a layer transparent to laser radiation results in a considerable increase in the generated impulse, due to the confinement of the expansion of the ablation plume. In this work impulse generation was measured, using aluminum as target, and PVC, SiO2, TiO2 and CNCs (cellulose nanocrystals) as confinement layers with thickness 0.35 μm. The results show that the generated impulses increase with the thickness of the ejected confinement layer. Additionally, the kinetic energy of the confinement layer, for a given material, does not depend on its thickness, but it is affected by the energy dissipation paths during the interaction with the laser pulse, where the strength of substrate–film adhesion and the Young’s modulus of the latter are shown to play an important role. Full article
Show Figures

Figure 1

18 pages, 249 KB  
Article
Mental Health Challenges at the Intersection of First-Year, First-Generation College Students and Second-Generation Immigrant Identities: A Qualitative Study
by Cassandre Horne and Precious Chibuike Chukwuere
Healthcare 2026, 14(1), 21; https://doi.org/10.3390/healthcare14010021 - 21 Dec 2025
Viewed by 264
Abstract
Background/Objectives: First-year, first-generation college students who are also second-generation immigrants often face significant mental health challenges as they navigate both higher education and early adulthood. This study explored how mental health challenges are shaped by their intersecting identities and framed their experiences using [...] Read more.
Background/Objectives: First-year, first-generation college students who are also second-generation immigrants often face significant mental health challenges as they navigate both higher education and early adulthood. This study explored how mental health challenges are shaped by their intersecting identities and framed their experiences using Bronfenbrenner’s socio-ecological model. Methods: This study was conducted in the office of first-generation success at a 4-year R1 university, adopting a qualitative research approach and a small stories research design. A purposive sampling technique was implemented to sample first-year, first-generation students and second-generation immigrants. Two focus group discussions were conducted, each with groups comprising 11 participants (n = 22). The participants were between 18 and 19 years old. The data were analyzed using a thematic approach, with trustworthiness ensured through the establishment of credibility, dependability, confirmability, and transferability. Results: Two themes emerged: “Finding self” and “Balancing Competing Demands” within the first-year, first-generation population. Additionally, stress was identified in the second-generation immigrant group under the theme of “Cultural Expectations”. Conclusions: Framing the stories within the socio-ecological model illustrates the multi-layered mental health burden of this population group, particularly within the socio-political climate shaped by heightened immigration policy, restrictive enforcement practices, and public discourse surrounding immigrant communities. Recognizing their mental health as integral to their overall health and academic success highlights the need to broaden scholarly and clinical understanding of individuals and compounding contextual variables that may be related to adverse emotional states. Full article
24 pages, 8304 KB  
Article
STAIR-DETR: A Synergistic Transformer Integrating Statistical Attention and Multi-Scale Dynamics for UAV Small Object Detection
by Linna Hu, Penghao Xue, Bin Guo, Yiwen Chen, Weixian Zha and Jiya Tian
Sensors 2025, 25(24), 7681; https://doi.org/10.3390/s25247681 - 18 Dec 2025
Viewed by 401
Abstract
Detecting small objects in unmanned aerial vehicle (UAV) imagery remains a challenging task due to the limited target scale, cluttered backgrounds, severe occlusion, and motion blur commonly observed in dynamic aerial environments. This study presents STAIR-DETR, a real-time synergistic detection framework derived from [...] Read more.
Detecting small objects in unmanned aerial vehicle (UAV) imagery remains a challenging task due to the limited target scale, cluttered backgrounds, severe occlusion, and motion blur commonly observed in dynamic aerial environments. This study presents STAIR-DETR, a real-time synergistic detection framework derived from RT-DETR, featuring comprehensive enhancements in feature extraction, resolution transformation, and detection head design. A Statistical Feature Attention (SFA) module is incorporated into the neck to replace the original AIFI, enabling token-level statistical modeling that strengthens fine-grained feature representation while effectively suppressing background interference. The backbone is reinforced with a Diverse Semantic Enhancement Block (DSEB), which employs multi-branch pathways and dynamic convolution to enrich semantic expressiveness without sacrificing spatial precision. To mitigate information loss during scale transformation, an Adaptive Scale Transformation Operator (ASTO) is proposed by integrating Context-Guided Downsampling (CGD) and Dynamic Sampling (DySample), achieving context-aware compression and content-adaptive reconstruction across resolutions. In addition, a high-resolution P2 detection head is introduced to leverage shallow-layer features for accurate classification and localization of extremely small targets. Extensive experiments conducted on the VisDrone2019 dataset demonstrate that STAIR-DETR attains 41.7% mAP@50 and 23.4% mAP@50:95, outperforming contemporary state-of-the-art (SOTA) detectors while maintaining real-time inference efficiency. These results confirm the effectiveness and robustness of STAIR-DETR for precise small object detection in complex UAV-based imaging scenarios. Full article
(This article belongs to the Special Issue Dynamics and Control System Design for Robotics)
Show Figures

Figure 1

20 pages, 8786 KB  
Article
Learning to Count Crowds from Low-Altitude Aerial Views via Point-Level Supervision and Feature-Adaptive Fusion
by Junzhe Mao, Lin Nai, Jinqi Bai, Chang Liu and Liangfeng Xu
Appl. Sci. 2025, 15(24), 13211; https://doi.org/10.3390/app152413211 - 17 Dec 2025
Viewed by 243
Abstract
Counting small, densely clustered objects from low-altitude aerial views is challenging due to large scale variations, complex backgrounds, and severe occlusion, which often degrade the performance of fully supervised or density-regression methods. To address these issues, we propose a weakly supervised crowd counting [...] Read more.
Counting small, densely clustered objects from low-altitude aerial views is challenging due to large scale variations, complex backgrounds, and severe occlusion, which often degrade the performance of fully supervised or density-regression methods. To address these issues, we propose a weakly supervised crowd counting framework that leverages point-level supervision and a feature-adaptive fusion strategy to enhance perception under low-altitude aerial views. The network comprises a front-end feature extractor and a back-end fusion module. The front-end adopts the first 13 convolutional layers of VGG16-BN to capture multi-scale semantic features while preserving crucial spatial details. The back-end integrates a Feature-Adaptive Fusion module and a Multi-Scale Feature Aggregation module: the former dynamically adjusts fusion weights across scales to improve robustness to scale variation, and the latter aggregates multi-scale representations to better capture targets in dense, complex scenes. Point-level annotations serve as weak supervision to substantially reduce labeling cost while enabling accurate localization of small individual instances. Experiments on several public datasets, including ShanghaiTech Part A, ShanghaiTech Part B, and UCF_CC_50, demonstrate that our method surpasses existing mainstream approaches, effectively mitigating scale variation, background clutter, and occlusion, and providing an efficient and scalable weakly supervised solution for small-object counting. Full article
Show Figures

Figure 1

18 pages, 52336 KB  
Article
Self-Supervised Representation Learning for Data-Efficient DRIL Classification in OCT Images
by Pavithra Kodiyalbail Chakrapani, Akshat Tulsani, Preetham Kumar, Geetha Maiya, Sulatha Venkataraya Bhandary and Steven Fernandes
Diagnostics 2025, 15(24), 3221; https://doi.org/10.3390/diagnostics15243221 - 16 Dec 2025
Viewed by 314
Abstract
Background/Objectives: Disorganization of the retinal inner layers (DRIL) is an important biomarker of diabetic macular edema (DME) that has a very strong association with visual acuity (VA) in patients. But the unavailability of annotated training data from experts severely limits the adaptability of [...] Read more.
Background/Objectives: Disorganization of the retinal inner layers (DRIL) is an important biomarker of diabetic macular edema (DME) that has a very strong association with visual acuity (VA) in patients. But the unavailability of annotated training data from experts severely limits the adaptability of models pretrained on real-world images owing to significant variations in the domain, posing two primary challenges for the design of efficient computerized DRIL detection methods. Methods: In an attempt to address these challenges, we propose a novel, self-supervision-based learning framework that employs a huge unlabeled optical coherence tomography (OCT) dataset to learn and detect clinically applicable interpretations before fine-tuning with a small proprietary dataset of annotated OCT images. In this research, we introduce a spatial Bootstrap Your Own Latent (BYOL) with a hybrid spatial aware loss function aimed to capture anatomical representations from unlabeled OCT dataset of 108,309 images that cover various retinal abnormalities, and then adapt the learned interpretations for DRIL classification employing 823 annotated OCT images. Results: With an accuracy of 99.39%, the proposed two-stage approach substantially exceeds the direct transfer learning models pretrained on ImageNet. Conclusions: The findings demonstrate the efficacy of domain-specific self-supervised learning for rare retinal pathological detection tasks with limited annotated data. Full article
(This article belongs to the Special Issue Artificial Intelligence in Eye Disease, 4th Edition)
Show Figures

Graphical abstract

15 pages, 3782 KB  
Article
The Effect of Mild Cyclic Loads on the Stress State of Degenerative Knee Joint Cartilages: A Numerical Study Aided by Experimental Data
by Oleg Ardatov, Vaiga Zemaitiene, Eiva Bernotiene and Arturas Kilikevicius
Biomedicines 2025, 13(12), 3097; https://doi.org/10.3390/biomedicines13123097 - 16 Dec 2025
Viewed by 293
Abstract
Background/Objectives: This study investigates the effect of mild cyclic loads on the stress state of degenerative knee joint cartilages using a combination of experimental data and numerical modeling. Methods: A three-dimensional finite element model of the knee joint was developed based [...] Read more.
Background/Objectives: This study investigates the effect of mild cyclic loads on the stress state of degenerative knee joint cartilages using a combination of experimental data and numerical modeling. Methods: A three-dimensional finite element model of the knee joint was developed based on CT scans, incorporating key components such as the femur, tibia, cartilage layers, and meniscus. Special attention was given to the mechanical properties of cartilages, which were determined through high-sensitivity dynamometer tests of cartilage samples. The experimentally obtained force–displacement curves for cartilage samples affected by third-degree gonarthrosis were integrated into the numerical model. This allowed for an in-depth investigation of the interactions between neighboring tissues of the knee joint under cyclic loading and unloading conditions. Results: Experimental data revealed nonlinear mechanical behavior of cartilage under loading and unloading conditions, characterized by an elastic hysteresis loop. Experimental results demonstrated that degenerated cartilage, under small stresses (up to 0.13 MPa), retains an elastic hysteresis behavior. The numerical simulation provided insights into the stress distribution within the knee joint components, revealing that even in cases of cartilage degeneration, as long as its structural integrity is maintained, mild loads do not cause sufficient stress concentrators, while the longitudinal tears in the same conditions cause the increment of stress values up to 20%. Conclusions: Findings contribute to a better understanding of the mechanical response of degenerative cartilage and offer valuable guidance for the development of therapeutic and rehabilitation strategies for patients with degenerative tissue diseases. Full article
(This article belongs to the Special Issue Updates on Tissue Repair and Regeneration Pathways)
Show Figures

Figure 1

Back to TopTop