Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (42)

Search Parameters:
Keywords = cascaded group attention

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
22 pages, 2567 KiB  
Review
Non-Platinum Group Metal Oxygen Reduction Catalysts for a Hydrogen Fuel Cell Cathode: A Mini-Review
by Naomi Helsel and Pabitra Choudhury
Catalysts 2025, 15(6), 588; https://doi.org/10.3390/catal15060588 - 13 Jun 2025
Viewed by 975
Abstract
Although platinum-based catalysts are highly effective for the oxygen reduction reaction (ORR) in proton exchange membrane fuel cells (PEMFCs), their high cost and scarcity limit large-scale commercialization. As a result, platinum group metal-free catalysts—particularly Fe-N-C materials—have received increasing attention as promising alternatives. Despite [...] Read more.
Although platinum-based catalysts are highly effective for the oxygen reduction reaction (ORR) in proton exchange membrane fuel cells (PEMFCs), their high cost and scarcity limit large-scale commercialization. As a result, platinum group metal-free catalysts—particularly Fe-N-C materials—have received increasing attention as promising alternatives. Despite significant progress, no platinum-group metal-free (PGM-free) catalyst has yet matched the performance and durability of commercial Pt/C in acidic media. Recent advances in synthesis strategies, however, have led to notable improvements in the activity, stability, and active site density of Fe-N-C catalysts. This review highlights key synthesis approaches, including pyrolysis, MOF-derived templates, and cascade anchoring, and discusses how these methods contribute to improved nitrogen coordination, electronic structure modulation, and active site engineering. The continued refinement of these strategies, alongside improved catalyst screening techniques, is essential for closing the performance gap and enabling the practical deployment of non-PGM catalysts in PEMFC technologies. Full article
(This article belongs to the Special Issue Feature Review Papers in Electrocatalysis)
Show Figures

Figure 1

18 pages, 4652 KiB  
Article
CGDU-DETR: An End-to-End Detection Model for Ship Detection in Day–Night Transition Environments
by Wei Wu, Xiyu Fan, Zhuhua Hu and Yaochi Zhao
J. Mar. Sci. Eng. 2025, 13(6), 1155; https://doi.org/10.3390/jmse13061155 - 11 Jun 2025
Cited by 1 | Viewed by 442
Abstract
In this study, we propose an end-to-end detection model based on cascaded spatial priors and dynamic upsampling for ship detection tasks in day–night transition environments, named the Cascaded Group and Dynamic Upsample-DEtection TRansformer (CGDU-DETR). To address the limitations of traditional methods in complex [...] Read more.
In this study, we propose an end-to-end detection model based on cascaded spatial priors and dynamic upsampling for ship detection tasks in day–night transition environments, named the Cascaded Group and Dynamic Upsample-DEtection TRansformer (CGDU-DETR). To address the limitations of traditional methods in complex lighting conditions (e.g., strong reflections, low light), we designed a novel CG-Net model based on cascaded group attention and introduced a dynamic feature upsampling algorithm, effectively enhancing the model’s ability to extract multi-scale features and detect targets in complex backgrounds. The experimental results show that the CGDU-DETR achieves an AP of 93.4% on the day–night transition dataset, representing a 2.86% improvement over YOLOv12, and a recall of 95.2%, representing a 24.44% improvement over YOLOv12. Particularly for complex categories such as cargo ships and law enforcement vessels, the CGDU-DETR significantly outperforms YOLOv12, with improvements of 35.9% in AP and 63.7% in recall. Moreover, generalization experiments on the WSODD public dataset further validate the robustness of the model, with the CGDU-DETR achieving an AP of 95.1%, representing an 11.6% improvement over YOLOv12. These results demonstrate that the CGDU-DETR has significant advantages in ship detection tasks under day–night transition environments, effectively handling complex lighting and background interference, and providing reliable technical support for all-weather maritime surveillance. Full article
(This article belongs to the Section Ocean Engineering)
Show Figures

Figure 1

20 pages, 7085 KiB  
Article
A Lightweight Citrus Ripeness Detection Algorithm Based on Visual Saliency Priors and Improved RT-DETR
by Yutong Huang, Xianyao Wang, Xinyao Liu, Liping Cai, Xuefei Feng and Xiaoyan Chen
Agronomy 2025, 15(5), 1173; https://doi.org/10.3390/agronomy15051173 - 12 May 2025
Cited by 2 | Viewed by 807
Abstract
As one of the world’s economically valuable fruit crops, citrus has its quality and productivity closely tied to the degree of fruit ripeness. However, accurately and efficiently detecting citrus ripeness in complex orchard environments for selective robotic harvesting remains a challenge. To address [...] Read more.
As one of the world’s economically valuable fruit crops, citrus has its quality and productivity closely tied to the degree of fruit ripeness. However, accurately and efficiently detecting citrus ripeness in complex orchard environments for selective robotic harvesting remains a challenge. To address this, we constructed a citrus ripeness detection dataset under complex orchard conditions, proposed a lightweight algorithm based on visual saliency priors and the RT-DETR model, and named it LightSal-RTDETR. To reduce computational overhead, we designed the E-CSPPC module, which efficiently combines cross-stage partial networks with gated and partial convolutions, combined with cascaded group attention (CGA) and inverted residual mobile block (iRMB), which minimizes model complexity and computational demand and simultaneously strengthens the model’s capacity for feature representation. Additionally, the Inner-SIoU loss function was employed for bounding box regression, while a weight initialization method based on visual saliency maps was proposed. Experiments on our dataset show that LightSal-RTDETR achieves a mAP@50 of 81%, improving by 1.9% over the original model while reducing parameters by 28.1% and computational cost by 26.5%. Therefore, LightSal-RTDETR effectively solves the citrus ripeness detection problem in orchard scenes with high complexity, offering an efficient solution for smart agriculture applications. Full article
(This article belongs to the Special Issue Advanced Machine Learning in Agriculture—2nd Edition)
Show Figures

Figure 1

23 pages, 12779 KiB  
Article
Crack-MsCGA: A Deep Learning Network with Multi-Scale Attention for Pavement Crack Detection
by Guoxi Liu, Xiaojing Wu, Fei Dai, Guozhi Liu, Lecheng Li and Bi Huang
Sensors 2025, 25(8), 2446; https://doi.org/10.3390/s25082446 - 12 Apr 2025
Cited by 3 | Viewed by 828
Abstract
Pavement crack detection is crucial for ensuring road safety and reducing maintenance costs. Existing methods typically use convolutional neural networks (CNNs) to extract multi-level features from pavement images and employ attention mechanisms to enhance global features. However, the fusion of low-level features introduces [...] Read more.
Pavement crack detection is crucial for ensuring road safety and reducing maintenance costs. Existing methods typically use convolutional neural networks (CNNs) to extract multi-level features from pavement images and employ attention mechanisms to enhance global features. However, the fusion of low-level features introduces substantial interference, leading to low detection accuracy for small-scale cracks with subtle local structures and varying global morphologies. In this paper, we propose a computationally efficient deep learning network with CNNs and multi-scale attention for multi-scale crack detection, named Crack-MsCGA. In this network, we avoid fusing low-level features to reduce noise interference. Then, we propose a multi-scale attention mechanism (MsCGA) to learn local detail features and global features from high-level features, compensating for the reduced detailed information. Specifically, first, MsCGA employs local window attention to learn short-range dependencies, aggregating local features within each window. Second, it applies a cascaded group attention mechanism to learn long-range dependencies, extracting global features across the entire image. Finally, it uses a multi-scale attention fusion strategy based on Mixed Local Channel Attention (MLCA) selectively to fuse local features and global features of pavement cracks. Compared with five existing methods, it improves the AP@50 by 11.3% for small-scale, 8.1% for medium-scale, and 5.9% for large-scale detection over the state-of-the-art methods in the DH807 dataset. Full article
(This article belongs to the Section Fault Diagnosis & Sensors)
Show Figures

Figure 1

18 pages, 16349 KiB  
Article
Research on Economic Operation of Cascade Small Hydropower Stations Within Plants Based on Refined Efficiency Models
by Daohong Wei, Chunpeng Feng and Dong Liu
Energies 2025, 18(4), 964; https://doi.org/10.3390/en18040964 - 17 Feb 2025
Viewed by 608
Abstract
In order to enhance the overall power generation efficiency of cascade hydropower, it is essential to conduct modelling optimization of its in-plant operation. However, existing studies have devoted minimal attention to the detailed modelling of turbine operating performance curves within the in-plant economic [...] Read more.
In order to enhance the overall power generation efficiency of cascade hydropower, it is essential to conduct modelling optimization of its in-plant operation. However, existing studies have devoted minimal attention to the detailed modelling of turbine operating performance curves within the in-plant economic operation model. This represents a significant challenge to the practical application of the optimization results. This study presents a refined model of a hydraulic turbine operating performance curve, which was established by combining a particle swarm optimization (PSO) algorithm and a backpropagation (BP) neural network. The model was developed using a cascade small hydropower group as an illustrative example. On this basis, an in-plant economic operation model of a cascade small hydropower group was established, which is based on the principle of ’setting electricity by water’ and has the goal of maximizing power generation. The model was optimized using a genetic algorithm, which was employed to optimize the output of the units. In order to ascertain the efficacy of the methodology proposed in this study, typical daily operational scenarios of a cascade small hydropower group were selected for comparison. The results demonstrate that, in comparison with the actual operational strategy, the proposed model and method enhance the total output by 3.38%, 2.11%, and 3.56%, respectively, across the three typical scenarios. This method enhances the efficiency of power generation within the cascade small hydropower group and demonstrates substantial engineering application value. Full article
(This article belongs to the Section B: Energy and Environment)
Show Figures

Figure 1

23 pages, 8929 KiB  
Article
Disease Detection Algorithm for Tea Health Protection Based on Improved Real-Time Detection Transformer
by Zhijie Lin, Zilong Zhu, Lingling Guo, Jingjing Chen and Jiyi Wu
Appl. Sci. 2025, 15(4), 2063; https://doi.org/10.3390/app15042063 - 16 Feb 2025
Viewed by 619
Abstract
Traditional disease detection methods typically depend on visual assessments conducted by human experts, which are time-consuming and subjective. Thus, there is an urgent demand for automated and efficient approaches to accurately detect and classify tea diseases. This study presents an enhanced Real-Time Detection [...] Read more.
Traditional disease detection methods typically depend on visual assessments conducted by human experts, which are time-consuming and subjective. Thus, there is an urgent demand for automated and efficient approaches to accurately detect and classify tea diseases. This study presents an enhanced Real-Time Detection Transformer (RT-DETR), tailored for the accurate and efficient identification of tea diseases in natural environments. The proposed method integrates three novel components: Faster-LTNet, CG Attention Module, and RMT Spatial Prior Block, to significantly improve computational efficiency, feature representation, and detection capabilities. Faster-LTNet employs partial convolution and hierarchical design to optimize computational resources, while the CG Attention Module enhances multi-head self-attention by introducing grouped feature inputs and cascading operations to reduce redundancy and increase attention diversity. The RMT Spatial Prior Block integrates a Manhattan distance-based spatial decay matrix and linear decomposition strategy to improve global and local context modeling, reducing attention complexity. The enhanced RT-DETR model achieves a detection precision of 89.20% and a processing speed of 346.40 FPS. While the precision improves, the FPS value also increases by 109, which is superior to the traditional model in terms of precision and real-time processing. Additionally, compared to the baseline model, the FLOPs are reduced by 50%, and the overall model size and parameter size are decreased by approximately 50%. These findings indicate that the proposed algorithm is well-suited for efficient, real-time, and lightweight agricultural disease detection. Full article
(This article belongs to the Special Issue Recent Advances in Precision Farming and Digital Agriculture)
Show Figures

Figure 1

9 pages, 1858 KiB  
Communication
Role of Human DNA Ligases in Mediating Pharmacological Activities of Flavonoids
by Daekyu Sun and Vijay Gokhale
Int. J. Mol. Sci. 2025, 26(4), 1456; https://doi.org/10.3390/ijms26041456 - 10 Feb 2025
Viewed by 683
Abstract
Dietary flavonoids are a group of polyphenol compounds originating from plants that have drawn much attention in the last few decades. Flavonoid-rich foods and dietary supplements are used worldwide due to their health benefits, including antioxidative, anti-inflammatory, immunity-enhancing, anticarcinogenic, estrogenic, and favorable cardiovascular [...] Read more.
Dietary flavonoids are a group of polyphenol compounds originating from plants that have drawn much attention in the last few decades. Flavonoid-rich foods and dietary supplements are used worldwide due to their health benefits, including antioxidative, anti-inflammatory, immunity-enhancing, anticarcinogenic, estrogenic, and favorable cardiovascular effects. The main objective of our study was to explore the molecular targets of flavonoids to gain insight into the mechanism of action behind their biological effects. In this study, a novel class of resorcinol-based flavonoid compounds was identified as a potent inhibitor of human DNA ligase activity. Human DNA ligases are crucial in the maintenance of genetic integrity and cell fate determination. Thus, our results strongly suggest that this activity against human DNA ligases is responsible, at least in part, for the cellular effects of flavonoid compounds. We anticipate that the results from our studies will improve our understanding of how interactions with human DNA ligases cascade into the recognized health benefits of flavonoids, particularly their wide variety of anticancer effects. Full article
(This article belongs to the Special Issue The Role of Natural Products in Drug Discovery)
Show Figures

Figure 1

30 pages, 7287 KiB  
Article
Context-Aware Tomato Leaf Disease Detection Using Deep Learning in an Operational Framework
by Divas Karimanzira
Electronics 2025, 14(4), 661; https://doi.org/10.3390/electronics14040661 - 8 Feb 2025
Cited by 3 | Viewed by 1517
Abstract
Tomato cultivation is a vital agricultural practice worldwide, yet it faces significant challenges due to various diseases that adversely affect crop yield and quality. This paper presents a novel tomato disease detection system within an operational framework that leverages an innovative deep learning-based [...] Read more.
Tomato cultivation is a vital agricultural practice worldwide, yet it faces significant challenges due to various diseases that adversely affect crop yield and quality. This paper presents a novel tomato disease detection system within an operational framework that leverages an innovative deep learning-based classifier, specifically a Vision Transformer (ViT) integrated with cascaded group attention (CGA) and a modified Focaler-CIoU (Complete Intersection over Union) loss function. The proposed method aims to enhance the accuracy and robustness of disease detection by effectively capturing both local and global contextual information while addressing the challenges of sample imbalance in the dataset. To improve interpretability, we integrate Explainable Artificial Intelligence (XAI) techniques, enabling users to understand the rationale behind the model’s classifications. Additionally, we incorporate a large language model (LLM) to generate comprehensive, context-aware explanations and recommendations based on the identified diseases and other relevant factors, thus bridging the gap between technical analysis and user comprehension. Our evaluation against state-of-the-art deep learning methods, including convolutional neural networks (CNNs) and other transformer-based models, demonstrates that the ViT-CGA model significantly outperforms existing techniques, achieving an overall accuracy of 96.5%, an average precision of 93.9%, an average recall of 96.7%, and an average F1-score of 94.2% for tomato leaf disease classification. The integration of CGA and Focaler-CIoU loss not only contributes to improved model interpretability and stability but also empowers farmers and agricultural stakeholders with actionable insights, fostering informed decision making in disease management. This research advances the field of automated disease detection in crops and provides a practical framework for deploying deep learning solutions in agricultural settings, ultimately supporting sustainable farming practices and enhancing food security. Full article
Show Figures

Figure 1

24 pages, 6534 KiB  
Article
Leveraging Deep Spatiotemporal Sequence Prediction Network with Self-Attention for Ground-Based Cloud Dynamics Forecasting
by Sheng Li, Min Wang, Minghang Shi, Jiafeng Wang and Ran Cao
Remote Sens. 2025, 17(1), 18; https://doi.org/10.3390/rs17010018 - 25 Dec 2024
Cited by 1 | Viewed by 908
Abstract
Ground-based cloud image features high-spatiotemporal resolution, presenting detailed local cloud structures and valuable weather information, which are crucial for meteorological forecasting. However, the inherent fuzziness and dynamism of ground-based clouds have hindered the development of effective prediction algorithms, resulting in low accuracy. This [...] Read more.
Ground-based cloud image features high-spatiotemporal resolution, presenting detailed local cloud structures and valuable weather information, which are crucial for meteorological forecasting. However, the inherent fuzziness and dynamism of ground-based clouds have hindered the development of effective prediction algorithms, resulting in low accuracy. This paper presents CloudPredRNN++, a novel method for predicting ground-based cloud dynamics, leveraging a deep spatiotemporal sequence prediction network enhanced with a self-attention mechanism. Initially, a Cascaded Causal LSTM (CCLSTM) with a dual-memory group decoupling structure is designed to enhance the representation of short-term cloud changes. Next, self-attention memory units are incorporated to capture the long-term dependencies and emphasize the non-stationary characteristics of cloud movements. These components are integrated into cloud dynamic feature mining units, which concurrently extract spatiotemporal features to strengthen unified spatiotemporal modeling. Finally, by embedding gradient highway units and adding skip connection, CloudPredRNN++ is constructed into a hierarchical recursive structure, mitigating the gradient vanishing and enhancing the uniform modeling of temporal–spatial features. Experiments on the sequence ground-based cloud dataset demonstrate that CloudPredRNN++ can predict the future cloud state more accurately and quickly. Compared with other spatiotemporal sequence prediction models, CloudPredRNN++ shows significant improvements in evaluation metrics, improving the accuracy of cloud dynamics forecasting and alleviating long-term dependency decay, thus confirming the effectiveness in ground-based cloud prediction tasks. Full article
Show Figures

Figure 1

19 pages, 4786 KiB  
Article
RT-DETR-Tea: A Multi-Species Tea Bud Detection Model for Unstructured Environments
by Yiyong Chen, Yang Guo, Jianlong Li, Bo Zhou, Jiaming Chen, Man Zhang, Yingying Cui and Jinchi Tang
Agriculture 2024, 14(12), 2256; https://doi.org/10.3390/agriculture14122256 - 10 Dec 2024
Cited by 3 | Viewed by 1463
Abstract
Accurate bud detection is a prerequisite for automatic tea picking and yield statistics; however, current research suffers from missed detection due to the variety of singleness and false detection under complex backgrounds. Traditional target detection models are mainly based on CNN, but CNN [...] Read more.
Accurate bud detection is a prerequisite for automatic tea picking and yield statistics; however, current research suffers from missed detection due to the variety of singleness and false detection under complex backgrounds. Traditional target detection models are mainly based on CNN, but CNN can only achieve the extraction of local feature information, which is a lack of advantages for the accurate identification of targets in complex environments, and Transformer can be a good solution to the problem. Therefore, based on a multi-variety tea bud dataset, this study proposes RT-DETR-Tea, an improved object detection model under the real-time detection Transformer (RT-DETR) framework. This model uses cascaded group attention to replace the multi-head self-attention (MHSA) mechanism in the attention-based intra-scale feature interaction (AIFI) module, effectively optimizing deep features and enriching the semantic information of features. The original cross-scale feature-fusion module (CCFM) mechanism is improved to establish the gather-and-distribute-Tea (GD-Tea) mechanism for multi-level feature fusion, which can effectively fuse low-level and high-level semantic information and large and small tea bud features in natural environments. The submodule of DilatedReparamBlock in UniRepLKNet was employed to improve RepC3 to achieve an efficient fusion of tea bud feature information and ensure the accuracy of the detection head. Ablation experiments show that the precision and mean average precision of the proposed RT-DETR-Tea model are 96.1% and 79.7%, respectively, which are increased by 5.2% and 2.4% compared to those of the original model, indicating the model’s effectiveness. The model also shows good detection performance on the newly constructed tea bud dataset. Compared with other detection algorithms, the improved RT-DETR-Tea model demonstrates superior tea bud detection performance, providing effective technical support for smart tea garden management and production. Full article
Show Figures

Figure 1

18 pages, 4450 KiB  
Article
RS-DETR: An Improved Remote Sensing Object Detection Model Based on RT-DETR
by Hao Zhang, Zheng Ma and Xiang Li
Appl. Sci. 2024, 14(22), 10331; https://doi.org/10.3390/app142210331 - 10 Nov 2024
Cited by 5 | Viewed by 4390
Abstract
Object detection is a fundamental task in computer vision. Recently, deep-learning-based object detection has made significant progress. However, due to large variations in target scale, the predominance of small targets, and complex backgrounds in remote sensing imagery, remote sensing object detection still faces [...] Read more.
Object detection is a fundamental task in computer vision. Recently, deep-learning-based object detection has made significant progress. However, due to large variations in target scale, the predominance of small targets, and complex backgrounds in remote sensing imagery, remote sensing object detection still faces challenges, including low detection accuracy, poor real-time performance, high missed detection rates, and high false detection rates in practical applications. To enhance remote sensing target detection performance, this study proposes a new model, the remote sensing detection transformer (RS-DETR). First, we incorporate cascaded group attention (CGA) into the attention-driven feature interaction module. By capturing features at different levels, it enhances the interaction between features through cascading and improves computational efficiency. Additionally, we propose an enhanced bidirectional feature pyramid network (EBiFPN) to facilitate multi-scale feature fusion. By integrating features across multiple scales, it improves object detection accuracy and robustness. Finally, we propose a novel bounding box regression loss function, Focaler-GIoU, which makes the model focus more on difficult samples, improving detection performance for small and overlapping targets. Experimental results on the satellite imagery multi-vehicles dataset (SIMD) and the high-resolution remote sensing object detection (TGRS-HRRSD) dataset show that the improved algorithm achieved mean average precision (mAP) of 78.2% and 91.6% at an intersection over union threshold of 0.5, respectively, which is an improvement of 2.0% and 1.5% over the baseline model. This result demonstrates the effectiveness and robustness of our proposed method for remote sensing image object detection. Full article
Show Figures

Figure 1

13 pages, 2213 KiB  
Article
Monocular Visual Pig Weight Estimation Method Based on the EfficientVit-C Model
by Songtai Wan, Hui Fang and Xiaoshuai Wang
Agriculture 2024, 14(9), 1571; https://doi.org/10.3390/agriculture14091571 - 10 Sep 2024
Cited by 1 | Viewed by 1460
Abstract
The meat industry is closely related to people’s daily lives and health, and with the growing global population and increasing demand for meat, the development of efficient pig farming technology is particularly important. However, China’s pig industry still faces multiple challenges, such as [...] Read more.
The meat industry is closely related to people’s daily lives and health, and with the growing global population and increasing demand for meat, the development of efficient pig farming technology is particularly important. However, China’s pig industry still faces multiple challenges, such as high labor costs, high biosecurity risks, and low production efficiency. Therefore, there is an urgent need to develop a fast, accurate, and non-invasive method to estimate pig body data to increase production efficiency, enhance biosecurity measures, and improve pig health. This study proposes EfficientVit-C model for image segmentation and cascade several models to estimate the weight of pigs. The EfficientVit-C network uses a cascading group attention module and improves computational efficiency through parameter redistribution and structured pruning. This method uses only one camera for weight estimation, reducing equipment costs and maintenance expenses. The results show that the improved EfficientVit-C model can segment pigs accurately and efficiently the mAP50 curve convergence is 98.2%, the recall is 92.6%, and the precision is 96.5%. The accuracy of pig weight estimation is 100 kg +/− 3.11 kg. On the Jetson Orin NX platform, the average time to complete image segmentation for each 640*480 resolution image was 4.1 ms, and the average time required to complete pig weight estimation was 31 ms. The results show that this method can quickly and accurately estimate the weight of pigs and provide guidance for the subsequent weight evaluation procedures of pigs. Full article
(This article belongs to the Section Artificial Intelligence and Digital Agriculture)
Show Figures

Figure 1

20 pages, 27367 KiB  
Article
MCG-RTDETR: Multi-Convolution and Context-Guided Network with Cascaded Group Attention for Object Detection in Unmanned Aerial Vehicle Imagery
by Chushi Yu and Yoan Shin
Remote Sens. 2024, 16(17), 3169; https://doi.org/10.3390/rs16173169 - 27 Aug 2024
Cited by 8 | Viewed by 3028
Abstract
In recent years, object detection in unmanned aerial vehicle (UAV) imagery has been a prominent and crucial task, with advancements in drone and remote sensing technologies. However, detecting targets in UAV images pose challenges such as complex background, severe occlusion, dense small targets, [...] Read more.
In recent years, object detection in unmanned aerial vehicle (UAV) imagery has been a prominent and crucial task, with advancements in drone and remote sensing technologies. However, detecting targets in UAV images pose challenges such as complex background, severe occlusion, dense small targets, and lighting conditions. Despite the notable progress of object detection algorithms based on deep learning, they still struggle with missed detections and false alarms. In this work, we introduce an MCG-RTDETR approach based on the real-time detection transformer (RT-DETR) with dual and deformable convolution modules, a cascaded group attention module, a context-guided feature fusion structure with context-guided downsampling, and a more flexible prediction head for precise object detection in UAV imagery. Experimental outcomes on the VisDrone2019 dataset illustrate that our approach achieves the highest AP of 29.7% and AP50 of 58.2%, surpassing several cutting-edge algorithms. Visual results further validate the model’s robustness and capability in complex environments. Full article
Show Figures

Figure 1

20 pages, 5603 KiB  
Article
Multi-Scale Fusion Uncrewed Aerial Vehicle Detection Based on RT-DETR
by Minling Zhu and En Kong
Electronics 2024, 13(8), 1489; https://doi.org/10.3390/electronics13081489 - 14 Apr 2024
Cited by 20 | Viewed by 5914
Abstract
With the rapid development of science and technology, uncrewed aerial vehicle (UAV) technology has shown a wide range of application prospects in various fields. The accuracy and real-time performance of UAV target detection play a vital role in ensuring safety and improving the [...] Read more.
With the rapid development of science and technology, uncrewed aerial vehicle (UAV) technology has shown a wide range of application prospects in various fields. The accuracy and real-time performance of UAV target detection play a vital role in ensuring safety and improving the work efficiency of UAVs. Aimed at the challenges faced by the current UAV detection field, this paper proposes the Gathering Cascaded Dilated DETR (GCD-DETR) model, which aims to improve the accuracy and efficiency of UAV target detection. The main innovations of this paper are as follows: (1) The Dilated Re-param Block is creatively applied to the dilatation-wise Residual module, which uses the large kernel convolution and the parallel small kernel convolution together and fuses the feature maps generated by multi-scale perception, greatly improving the feature extraction ability, thereby improving the accuracy of UAV detection. (2) The Gather-and-Distribute mechanism is introduced to effectively enhance the ability of multi-scale feature fusion so that the model can make full use of the feature information extracted from the backbone network and further improve the detection performance. (3) The Cascaded Group Attention mechanism is innovatively introduced, which not only saves the computational cost but also improves the diversity of attention by dividing the attention head in different ways, thus enhancing the ability of the model to process complex scenes. In order to verify the effectiveness of the proposed model, this paper conducts experiments on multiple UAV datasets of complex scenes. The experimental results show that the accuracy of the improved RT-DETR model proposed in this paper on the two UAV datasets reaches 0.956 and 0.978, respectively, which is 2% and 1.1% higher than that of the original RT-DETR model. At the same time, the FPS of the model is also improved by 10 frames per second, which achieves an effective balance between accuracy and speed. Full article
(This article belongs to the Special Issue Applications of Computer Vision, 2nd Edition)
Show Figures

Figure 1

29 pages, 1080 KiB  
Review
Cell Type-Specific Extracellular Vesicles and Their Impact on Health and Disease
by Sohil Amin, Hamed Massoumi, Deepshikha Tewari, Arnab Roy, Madhurima Chaudhuri, Cedra Jazayerli, Abhi Krishan, Mannat Singh, Mohammad Soleimani, Emine E. Karaca, Arash Mirzaei, Victor H. Guaiquil, Mark I. Rosenblatt, Ali R. Djalilian and Elmira Jalilian
Int. J. Mol. Sci. 2024, 25(5), 2730; https://doi.org/10.3390/ijms25052730 - 27 Feb 2024
Cited by 21 | Viewed by 6509
Abstract
Extracellular vesicles (EVs), a diverse group of cell-derived exocytosed particles, are pivotal in mediating intercellular communication due to their ability to selectively transfer biomolecules to specific cell types. EVs, composed of proteins, nucleic acids, and lipids, are taken up by cells to affect [...] Read more.
Extracellular vesicles (EVs), a diverse group of cell-derived exocytosed particles, are pivotal in mediating intercellular communication due to their ability to selectively transfer biomolecules to specific cell types. EVs, composed of proteins, nucleic acids, and lipids, are taken up by cells to affect a variety of signaling cascades. Research in the field has primarily focused on stem cell-derived EVs, with a particular focus on mesenchymal stem cells, for their potential therapeutic benefits. Recently, tissue-specific EVs or cell type-specific extracellular vesicles (CTS-EVs), have garnered attention for their unique biogenesis and molecular composition because they enable highly targeted cell-specific communication. Various studies have outlined the roles that CTS-EVs play in the signaling for physiological function and the maintenance of homeostasis, including immune modulation, tissue regeneration, and organ development. These properties are also exploited for disease propagation, such as in cancer, neurological disorders, infectious diseases, autoimmune conditions, and more. The insights gained from analyzing CTS-EVs in different biological roles not only enhance our understanding of intercellular signaling and disease pathogenesis but also open new avenues for innovative diagnostic biomarkers and therapeutic targets for a wide spectrum of medical conditions. This review comprehensively outlines the current understanding of CTS-EV origins, function within normal physiology, and implications in diseased states. Full article
Show Figures

Figure 1

Back to TopTop