Article

A Visual Tracker Offering More Solutions

1 College of Information and Computer Engineering, Northeast Forestry University, Harbin 150040, China
2 Big Data Institute, East University of Heilongjiang, Harbin 150066, China
3 Forestry Intelligent Equipment Engineering Research Center, Harbin 150040, China
* Author to whom correspondence should be addressed.
Sensors 2020, 20(18), 5374; https://doi.org/10.3390/s20185374
Received: 3 August 2020 / Revised: 9 September 2020 / Accepted: 16 September 2020 / Published: 19 September 2020
(This article belongs to the Special Issue Visual Sensor Networks for Object Detection and Tracking)
Most trackers focus solely on robustness and accuracy. Visual tracking, however, is a long-term task with strict real-time constraints. A tracker that is robust and accurate, that remains reliable over long sequences, and that runs in real time is of high research value and practical significance. In this paper, we consider these requirements together and propose a new tracker that addresses all of them. EfficientNet-B0, a network obtained through neural architecture search, is adopted for the first time as the backbone network for the tracking task. This improves the feature extraction ability of the network and significantly reduces the number of parameters in the tracker backbone. In addition, target estimation is performed by maximizing the Distance Intersection-over-Union, which enhances network stability and increases the convergence rate of offline training. Channel and spatial dual attention mechanisms are employed in the target classification module to improve the discriminative ability of the tracker. Furthermore, a conjugate gradient optimization strategy speeds up online learning in the target classification module. A two-stage search method combined with a screening module is proposed to enable the tracker to cope with sudden target movement and with reappearance of the target after a brief disappearance. The proposed method has a clear speed advantage over pure global searching and achieves top performance on OTB2015, VOT2016, VOT2018-LT, UAV-123 and LaSOT while running at over 50 FPS.
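As an illustration of the Distance Intersection-over-Union measure named in the abstract, the following minimal sketch computes the commonly used definition, DIoU = IoU − ρ²/c², where ρ is the distance between the two box centres and c is the diagonal of the smallest box enclosing both. The function name `diou` and the `(x1, y1, x2, y2)` box layout are assumptions made for this example; it is not the authors' implementation, which estimates and maximizes this quantity inside a network.

```python
def diou(box_a, box_b):
    """Distance-IoU of two axis-aligned boxes given as (x1, y1, x2, y2).

    Illustrative only: the box format and function name are assumptions,
    not taken from the paper.
    """
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Plain IoU: overlap area divided by union area.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    iou = inter / union if union > 0 else 0.0

    # Squared distance between the two box centres (rho^2).
    rho2 = ((ax1 + ax2) / 2 - (bx1 + bx2) / 2) ** 2 \
         + ((ay1 + ay2) / 2 - (by1 + by2) / 2) ** 2

    # Squared diagonal of the smallest enclosing box (c^2).
    c2 = (max(ax2, bx2) - min(ax1, bx1)) ** 2 \
       + (max(ay2, by2) - min(ay1, by1)) ** 2

    return iou - rho2 / c2 if c2 > 0 else iou
```

Unlike plain IoU, this value still varies when the boxes do not overlap: the centre-distance term keeps decreasing as a candidate box moves away from the target, so a search over candidate boxes retains a useful signal.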
Keywords: visual tracking; neural architecture search; dual attention mechanisms; two-stage search
MDPI and ACS Style

Zhao, L.; Ishag Mahmoud, M.A.; Ren, H.; Zhu, M. A Visual Tracker Offering More Solutions. Sensors 2020, 20, 5374. https://doi.org/10.3390/s20185374

AMA Style

Zhao L, Ishag Mahmoud MA, Ren H, Zhu M. A Visual Tracker Offering More Solutions. Sensors. 2020; 20(18):5374. https://doi.org/10.3390/s20185374

Chicago/Turabian Style

Zhao, Long, Mubarak A. Ishag Mahmoud, Honge Ren, and Meng Zhu. 2020. "A Visual Tracker Offering More Solutions" Sensors 20, no. 18: 5374. https://doi.org/10.3390/s20185374

