Next Article in Journal
Autonomous Drilling and the Idea of Next-Generation Deep Mineral Exploration
Previous Article in Journal
LocSys: A Low-Code Paradigm for the Development of Cyber-Physical Applications
Previous Article in Special Issue
A Metric Learning-Based Improved Oriented R-CNN for Wildfire Detection in Power Transmission Corridors
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Review

A Review of DEtection TRansformer: From Basic Architecture to Advanced Developments and Visual Perception Applications

by
Liang Yu
,
Lin Tang
and
Lisha Mu
*
College of Software Engineering, Sichuan Polytechnic University, Deyang 618000, China
*
Author to whom correspondence should be addressed.
Sensors 2025, 25(13), 3952; https://doi.org/10.3390/s25133952
Submission received: 11 May 2025 / Revised: 19 June 2025 / Accepted: 20 June 2025 / Published: 25 June 2025
(This article belongs to the Special Issue Object Detection and Recognition Based on Deep Learning)

Abstract

DEtection TRansformer (DETR) introduced an end-to-end object detection paradigm using Transformers, eliminating hand-crafted components like anchor boxes and Non-Maximum Suppression (NMS) via set prediction and bipartite matching. Despite its potential, the original DETR suffered from slow convergence, poor small object detection, and low efficiency, prompting extensive research. This paper systematically reviews DETR’s technical evolution from a “problem-driven” perspective, focusing on advancements in attention mechanisms, query design, training strategies, and architectural efficiency. We also outline DETR’s applications in autonomous driving, medical imaging, and remote sensing, and its expansion to fine-grained classification and video understanding. Finally, we summarize current challenges and future directions. This “problem-driven” analysis offers researchers a comprehensive and insightful overview, aiming to fill gaps in the existing literature on DETR’s evolution and logic.
Keywords: object detection; DETR; transformer; attention; end to end; deep learning object detection; DETR; transformer; attention; end to end; deep learning

Share and Cite

MDPI and ACS Style

Yu, L.; Tang, L.; Mu, L. A Review of DEtection TRansformer: From Basic Architecture to Advanced Developments and Visual Perception Applications. Sensors 2025, 25, 3952. https://doi.org/10.3390/s25133952

AMA Style

Yu L, Tang L, Mu L. A Review of DEtection TRansformer: From Basic Architecture to Advanced Developments and Visual Perception Applications. Sensors. 2025; 25(13):3952. https://doi.org/10.3390/s25133952

Chicago/Turabian Style

Yu, Liang, Lin Tang, and Lisha Mu. 2025. "A Review of DEtection TRansformer: From Basic Architecture to Advanced Developments and Visual Perception Applications" Sensors 25, no. 13: 3952. https://doi.org/10.3390/s25133952

APA Style

Yu, L., Tang, L., & Mu, L. (2025). A Review of DEtection TRansformer: From Basic Architecture to Advanced Developments and Visual Perception Applications. Sensors, 25(13), 3952. https://doi.org/10.3390/s25133952

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop