Skip to Content

347 Results Found

  • Article
  • Open Access
355 Views
19 Pages

21 January 2026

Precise 3D perception is critical for indoor robotics, augmented reality, and autonomous navigation. However, existing multi-frame depth estimation methods often suffer from significant performance degradation in challenging indoor scenarios characte...

  • Article
  • Open Access
16 Citations
6,310 Views
16 Pages

An FPGA-Based Ultra-High-Speed Object Detection Algorithm with Multi-Frame Information Fusion

  • Xianlei Long,
  • Shenhua Hu,
  • Yiming Hu,
  • Qingyi Gu and
  • Idaku Ishii

26 August 2019

An ultra-high-speed algorithm based on Histogram of Oriented Gradient (HOG) and Support Vector Machine (SVM) for hardware implementation at 10,000 frames per second (FPS) under complex backgrounds is proposed for object detection. The algorithm is im...

  • Article
  • Open Access
5 Citations
4,374 Views
23 Pages

23 August 2024

In various practical applications, such as autonomous vehicle and unmanned aerial vehicle navigation, Global Navigation Satellite Systems (GNSSs) are commonly used for positioning. However, traditional GNSS positioning methods are often affected by d...

  • Article
  • Open Access
2 Citations
2,988 Views
17 Pages

4 January 2024

Convolutional neural networks (CNNs) have become instrumental in advancing multi-frame image super-resolution (SR), a technique that merges multiple low-resolution images of the same scene into a high-resolution image. In this paper, a novel deep lea...

  • Article
  • Open Access
9 Citations
5,133 Views
18 Pages

21 December 2022

The detection of drivable areas in off-road scenes is a challenging problem due to the presence of unstructured class boundaries, irregular features, and dust noise. Three-dimensional LiDAR data can effectively describe the terrain features, and a bi...

  • Article
  • Open Access
1 Citations
1,675 Views
24 Pages

26 August 2025

Infrared dim and small target detection aims to accurately localize targets within complex backgrounds or clutter. However, under extremely low signal-to-noise ratio (SNR) conditions, single-frame detection methods often fail to effectively detect su...

  • Article
  • Open Access
2 Citations
2,068 Views
18 Pages

UWB-Assisted Bluetooth Localization Using Regression Models and Multi-Scan Processing

  • Pan Li,
  • Runyu Guan,
  • Bing Chen,
  • Shaojian Xu,
  • Danli Xiao,
  • Luping Xu and
  • Bo Yan

9 October 2024

Bluetooth devices have been widely used for pedestrian positioning and navigation in complex indoor scenes. Bluetooth beacons are scattered throughout the entire indoor walkable area containing stairwells, and pedestrian positioning can be obtained b...

  • Article
  • Open Access
1,224 Views
24 Pages

Milepost-to-Vehicle Monocular Depth Estimation with Boundary Calibration and Geometric Optimization

  • Enhua Zhang,
  • Tao Ma,
  • Handuo Yang,
  • Jiaqi Li,
  • Zhiwei Xie and
  • Zheng Tong

29 August 2025

Milepost-assisted positioning estimates the distance between a vehicle-mounted camera and a milepost as a reference position for autonomous driving. However, the accuracy of monocular metric depth estimation is compromised by camera installation angl...

  • Article
  • Open Access
267 Views
27 Pages

15 February 2026

Road defect detection is essential for traffic safety and infrastructure maintenance. Excising automated methods based on 2D image analysis lack spatial context and cannot provide accurate 3D localization required for maintenance planning. We propose...

  • Article
  • Open Access
2 Citations
1,509 Views
29 Pages

The accurate detection and localization of polyps during endoscopic examinations are critical for early disease diagnosis and cancer prevention. However, the presence of artifacts and noise, along with the high similarity between polyps and surroundi...

  • Article
  • Open Access
13 Citations
4,900 Views
18 Pages

20 April 2022

Head pose and eye gaze are vital clues for analysing a driver’s visual attention. Previous approaches achieve promising results from point clouds in constrained conditions. However, these approaches face challenges in the complex naturalistic d...

  • Article
  • Open Access
281 Views
19 Pages

10 December 2025

H. 265/HEVC still dominates the video encoding application market with its mature industrial ecosystem and excellent hardware support. However, its high computational complexity remains a major barrier to its wider application. To tackle this problem...

  • Article
  • Open Access
10 Citations
5,210 Views
24 Pages

YOLO-FIX: Improved YOLOv11 with Attention and Multi-Scale Feature Fusion for Detecting Glue Line Defects on Mobile Phone Frames

  • Tianrun Ye,
  • Shize Huang,
  • Weiwei Qin,
  • Haiyang Tu,
  • Ping Zhang,
  • Yafei Wang,
  • Chunming Gao and
  • Yanli Gong

26 February 2025

This paper presents YOLO-FIX, an improved intelligent detection model based on YOLOv11, designed to identify glue line defects in mobile phone frames. The model addresses the challenges of complex glue line morphology, background interference, and il...

  • Article
  • Open Access
4 Citations
2,025 Views
15 Pages

3 September 2024

This study aims to address the problem in tracking technology in which targeted cruising ships or submarines sailing near the water surface are tracked at low frame rates or with some frames missing in the video image, so that the tracked targets hav...

  • Technical Note
  • Open Access
4 Citations
3,591 Views
15 Pages

LiDAR Data Enrichment by Fusing Spatial and Temporal Adjacent Frames

  • Hao Fu,
  • Hanzhang Xue,
  • Xiaochang Hu and
  • Bokai Liu

12 September 2021

In autonomous driving scenarios, the point cloud generated by LiDAR is usually considered as an accurate but sparse representation. In order to enrich the LiDAR point cloud, this paper proposes a new technique that combines spatial adjacent frames an...

  • Article
  • Open Access
1,116 Views
22 Pages

27 April 2025

In additive manufacturing processes, the metal melt pool is decisive for processing quality. A single sensor is incapable of fully capturing its physical characteristics and is prone to data inaccuracies. This study proposes a multi-sensor monitoring...

  • Article
  • Open Access
30 Citations
7,143 Views
20 Pages

Multi-View Fusion-Based 3D Object Detection for Robot Indoor Scene Perception

  • Li Wang,
  • Ruifeng Li,
  • Jingwen Sun,
  • Xingxing Liu,
  • Lijun Zhao,
  • Hock Soon Seah,
  • Chee Kwang Quah and
  • Budianto Tandianus

21 September 2019

To autonomously move and operate objects in cluttered indoor environments, a service robot requires the ability of 3D scene perception. Though 3D object detection can provide an object-level environmental description to fill this gap, a robot always...

  • Article
  • Open Access
772 Views
25 Pages

An Intelligent System for Pigeon Egg Management: Integrating a Novel Lightweight YOLO Model and Multi-Frame Fusion for Robust Detection and Positioning

  • Yufan Cheng,
  • Yao Liu,
  • Qianhui Li,
  • Tao Jiang,
  • Chengyue Ji,
  • Longshen Liu,
  • Ya Zhong,
  • Jinling Wu and
  • Guanchi Chen

21 November 2025

To address the issues of high breakage rates and substantial labor costs in pigeon egg farming, this study proposes an intelligent pigeon egg recognition and positioning system based on an improved YOLOv12n object detection algorithm and OpenCV barco...

  • Article
  • Open Access
2 Citations
4,616 Views
11 Pages

6 April 2021

The multi-frame super-resolution techniques have been prosperous over the past two decades. However, little attention has been paid to the combination of deep learning and multi-frame super-resolution. One reason is that most deep learning-based supe...

  • Article
  • Open Access
2 Citations
2,914 Views
23 Pages

DS-Trans: A 3D Object Detection Method Based on a Deformable Spatiotemporal Transformer for Autonomous Vehicles

  • Yuan Zhu,
  • Ruidong Xu,
  • Chongben Tao,
  • Hao An,
  • Huaide Wang,
  • Zhipeng Sun and
  • Ke Lu

30 April 2024

Facing the significant challenge of 3D object detection in complex weather conditions and road environments, existing algorithms based on single-frame point cloud data struggle to achieve desirable results. These methods typically focus on spatial re...

  • Article
  • Open Access
2 Citations
2,477 Views
16 Pages

An Efficient Multi-Scale Attention Feature Fusion Network for 4K Video Frame Interpolation

  • Xin Ning,
  • Yuhang Li,
  • Ziwei Feng,
  • Jinhua Liu and
  • Youdong Ding

Video frame interpolation aims to generate intermediate frames in a video to showcase finer details. However, most methods are only trained and tested on low-resolution datasets, lacking research on 4K video frame interpolation problems. This limitat...

  • Article
  • Open Access
1 Citations
3,818 Views
27 Pages

Fast Coherent Video Style Transfer via Flow Errors Reduction

  • Li Wang,
  • Xiaosong Yang and
  • Jianjun Zhang

21 March 2024

For video style transfer, naively applying still image techniques to process a video frame-by-frame independently often causes flickering artefacts. Some works adopt optical flow into the design of temporal constraint loss to secure temporal consiste...

  • Article
  • Open Access
343 Views
14 Pages

FreeViBe+: An Enhanced Method for Moving Target Separation

  • Jianwei Wu,
  • Keju Zhang,
  • Yuhan Shen and
  • Jiaxiang Lin

1 December 2025

An enhanced method called FreeViBe+ for moving target segmentation is proposed in this paper, addressing limitations in the ViBe algorithm such as ghosting, shadows, and holes. To eliminate ghosts, multi-frame background modeling is introduced. Shado...

  • Article
  • Open Access
11 Citations
4,709 Views
20 Pages

Multi-Dimensional Automatic Detection of Scanning Radar Images of Marine Targets Based on Radar PPInet

  • Xiaolong Chen,
  • Jian Guan,
  • Xiaoqian Mu,
  • Zhigao Wang,
  • Ningbo Liu and
  • Guoqing Wang

26 September 2021

Traditional radar target detection algorithms are mostly based on statistical theory. They have weak generalization capabilities for complex sea clutter environments and diverse target characteristics, and their detection performance would be signifi...

  • Article
  • Open Access
2 Citations
2,385 Views
23 Pages

A Sliding Window-Based CNN-BiGRU Approach for Human Skeletal Pose Estimation Using mmWave Radar

  • Yuquan Luo,
  • Yuqiang He,
  • Yaxin Li,
  • Huaiqiang Liu,
  • Jun Wang and
  • Fei Gao

11 February 2025

In this paper, we present a low-cost, low-power millimeter-wave (mmWave) skeletal joint localization system. High-quality point cloud data are generated using the self-developed BHYY_MMW6044 59–64 GHz mmWave radar device. A sliding window mecha...

  • Article
  • Open Access
9 Citations
3,426 Views
11 Pages

Single-Shot Multi-Frame Imaging of Femtosecond Laser-Induced Plasma Propagation

  • Tianyong Zhang,
  • Baoshan Guo,
  • Lan Jiang,
  • Tong Zhu,
  • Yanhong Hua,
  • Ningwei Zhan and
  • Huan Yao

21 April 2023

Single-shot ultrafast multi-frame imaging technology plays a crucial role in the observation of laser-induced plasma. However, there are many challenges in the application of laser processing, such as technology fusion and imaging stability. To provi...

  • Article
  • Open Access
1 Citations
4,934 Views
16 Pages

27 September 2024

Collaboration among road agents, such as connected autonomous vehicles and roadside units, enhances driving performance by enabling the exchange of valuable information. However, existing collaboration methods predominantly focus on perception tasks...

  • Article
  • Open Access
13 Citations
4,843 Views
16 Pages

Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection

  • Zhenyu Zhai,
  • Qiantong Wang,
  • Zongxu Pan,
  • Zhentong Gao and
  • Wenlong Hu

2 October 2022

Continuous frames of point-cloud-based object detection is a new research direction. Currently, most research studies fuse multi-frame point clouds using concatenation-based methods. The method aligns different frames by using information on GPS, IMU...

  • Article
  • Open Access
326 Views
16 Pages

Video Super-Resolution Combining Dual Motion Compensation and Multi-Scale Structure–Texture Prior

  • Xiaolei Liu,
  • Jiawei Shi,
  • Jiayi Xu,
  • Pengfei Song,
  • Hongxia Gao,
  • Fuhai Wang,
  • Meining Ji,
  • Chen Chen and
  • Xianghao Kong

7 January 2026

Video super-resolution methods based on convolutional kernels or optical flow often face challenges such as limited utilization of multi-frame detail information or strong reliance on accurate optical flow estimation. To address these issues, this pa...

  • Article
  • Open Access
437 Views
30 Pages

8 January 2026

Amid rapid advancements in artificial intelligence, the detection of abnormal human behaviors in complex traffic environments has garnered significant attention. However, detection errors frequently occur due to interference from complex backgrounds,...

  • Article
  • Open Access
540 Views
16 Pages

LIF-VSR: A Lightweight Framework for Video Super-Resolution with Implicit Alignment and Attentional Fusion

  • Songyi Zhang,
  • Hailin Zhang,
  • Xiaolin Wang,
  • Kailei Song,
  • Zhizhuo Han,
  • Zhitao Zhang and
  • Wenchi Cheng

17 January 2026

Video super-resolution (VSR) has advanced rapidly in enhancing video quality and restoring compressed content, yet leading methods often remain too costly for real-world use. We present LIF-VSR, a lightweight, near-real-time framework built with an e...

  • Article
  • Open Access
2 Citations
4,590 Views
19 Pages

Burst-Enhanced Super-Resolution Network (BESR)

  • Jiaao Li,
  • Qunbo Lv,
  • Wenjian Zhang,
  • Yu Zhang and
  • Zheng Tan

23 March 2024

Multi-frame super-resolution (MFSR) leverages complementary information between image sequences of the same scene to increase the resolution of the reconstructed image. As a branch of MFSR, burst super-resolution aims to restore image details by leve...

  • Article
  • Open Access
7 Citations
4,841 Views
20 Pages

6 February 2022

Video satellite imagery has become a hot research topic in Earth observation due to its ability to capture dynamic information. However, its high temporal resolution comes at the expense of spatial resolution. In recent years, deep learning (DL) base...

  • Article
  • Open Access
1 Citations
1,327 Views
22 Pages

18 April 2025

Large-volume parenterals (LVPs), as essential medical products, are widely used in healthcare settings, making their safety inspection crucial. Current methods for detecting foreign particles in LVP solutions through image analysis primarily rely on...

  • Article
  • Open Access
2,721 Views
19 Pages

FIFA3D: Flow-Guided Feature Aggregation for Temporal Three-Dimensional Object Detection

  • Ruiqi Ma,
  • Chunwei Wang,
  • Chi Chen,
  • Yihan Zeng,
  • Bijun Li,
  • Qin Zou,
  • Qingqiu Huang,
  • Xinge Zhu and
  • Hang Xu

23 January 2025

Detecting accurate 3D bounding boxes from LiDAR point clouds is crucial for autonomous driving. Recent studies have shown the superiority of the performance of multi-frame 3D detectors, yet eliminating the misalignment across frames and effectively a...

  • Article
  • Open Access
9 Citations
2,824 Views
19 Pages

This paper proposes a robust multi-frame video super-resolution (SR) scheme to obtain high SR performance under large upscaling factors. Although the reference low-resolution frames can provide complementary information for the high-resolution frame,...

  • Article
  • Open Access
538 Views
20 Pages

10 December 2025

Realizing real-time dense 3D reconstruction on resource-limited mobile platforms remains a significant challenge, particularly in low-texture environments that demand robust multi-frame fusion to resolve matching ambiguities. However, the inherent ti...

  • Article
  • Open Access
1,899 Views
12 Pages

11 August 2024

Multi-frame super-resolution (MFSR) generates a super-resolution (SR) image from a burst consisting of multiple low-resolution images. Burst Super-Resolution Transformer (BSRT) is a state-of-the-art deep learning model for MFSR. However, in this stud...

  • Article
  • Open Access
10 Citations
2,560 Views
27 Pages

29 January 2024

Space infrared dim target recognition is an important applications of space situational awareness (SSA). Due to the weak observability and lack of geometric texture of the target, it may be unreliable to rely only on grayscale features for recognitio...

  • Article
  • Open Access
8 Citations
5,125 Views
15 Pages

17 August 2023

For compressed images and videos, quality enhancement is essential. Though there have been remarkable achievements related to deep learning, deep learning models are too large to apply to real-time tasks. Therefore, a fast multi-frame quality enhance...

  • Article
  • Open Access
4 Citations
3,789 Views
16 Pages

Learning the Relative Dynamic Features for Word-Level Lipreading

  • Hao Li,
  • Nurbiya Yadikar,
  • Yali Zhu,
  • Mutallip Mamut and
  • Kurban Ubul

13 May 2022

Lipreading is a technique for analyzing sequences of lip movements and then recognizing the speech content of a speaker. Limited by the structure of our vocal organs, the number of pronunciations we could make is finite, leading to problems with homo...

  • Article
  • Open Access
5 Citations
3,077 Views
21 Pages

14 April 2023

By virtue of the merits of wide swath, persistent observation, and rapid operational response, geostationary remote sensing satellites (e.g., GF-4) show tremendous potential for sea target system surveillance and situational awareness. However, ships...

  • Article
  • Open Access
395 Views
19 Pages

Dynamic Feature Fusion for Sparse Radar Detection: Motion-Centric BEV Learning with Adaptive Task Balancing

  • Yixun Sang,
  • Junjie Cui,
  • Yaoguang Sun,
  • Fan Zhang,
  • Yanting Li and
  • Guoqiang Shi

2 February 2026

This paper proposes a novel motion-aware framework to address key challenges in 4D millimeter-wave radar detection for autonomous driving. While existing methods struggle with sparse point clouds and dynamic object characterization, our approach intr...

  • Article
  • Open Access
3 Citations
1,905 Views
20 Pages

A multispectral infrared zoom optical system design and a single-frame hierarchical guided filtering image enhancement algorithm are proposed to address the technical problems of low contrast, blurred edges, and weak signal strength of single-spectru...

  • Article
  • Open Access
268 Views
24 Pages

TASONet: A Spatial Enhancement and Temporal Modeling Framework for UAV Small Object Tracking

  • Ruiqi Ma,
  • Changcai Lai,
  • Qinghua Sheng,
  • Zehao Tao and
  • Xiaorun Li

11 February 2026

Multi object tracking (MOT) in UAV imagery is challenged by weak feature representation of small objects due to limited resolution, which leads to frequent missed detections. However, enhancing small object features often amplifies background noise a...

  • Article
  • Open Access
803 Views
24 Pages

Memory-Based Temporal Transformer U-Net for Multi-Frame Infrared Small Target Detection

  • Zicheng Feng,
  • Wenlong Zhang,
  • Donghui Liu,
  • Xingfu Tao,
  • Ang Su and
  • Yixin Yang

23 November 2025

In the field of infrared small target detection (ISTD), single-frame ISTD (SISTD), using only spatial features, cannot deal well with dim targets in cluttered backgrounds. In contrast, multi-frame ISTD (MISTD), utilizing spatio-temporal information f...

  • Article
  • Open Access
445 Views
27 Pages

Shadow Spatiotemporal Track-Before-Detect Approach for Distributed UAV-Borne Video SAR

  • Liwu Wen,
  • Ming Ke,
  • Ming Jiang,
  • Jinshan Ding and
  • Xuejun Huang

20 January 2026

Shadow detection has become a key technology for ground-based moving target indication in video synthetic aperture radar (SAR). However, single-platform video SAR faces the issue of moving-target shadows being occluded. This paper proposes a new dyna...

  • Article
  • Open Access
22 Citations
8,215 Views
23 Pages

LiDAR-Visual-Inertial Odometry Based on Optimized Visual Point-Line Features

  • Xuan He,
  • Wang Gao,
  • Chuanzhen Sheng,
  • Ziteng Zhang,
  • Shuguo Pan,
  • Lijin Duan,
  • Hui Zhang and
  • Xinyu Lu

27 January 2022

This study presents a LiDAR-Visual-Inertial Odometry (LVIO) based on optimized visual point-line features, which can effectively compensate for the limitations of a single sensor in real-time localization and mapping. Firstly, an improved line featur...

  • Article
  • Open Access
802 Views
22 Pages

6 December 2025

Video frame interpolation in ultra-high-definition extreme motion scenes remains highly challenging due to large displacements, nonlinear motion, and occlusions that disrupt spatio-temporal symmetry. To address this issue, this study proposes a frame...

  • Article
  • Open Access
488 Views
24 Pages

26 November 2025

Three-dimensional (3D) reconstruction is increasingly being adopted in construction site management. While most existing studies rely on auxiliary equipment such as LiDAR and depth cameras, monocular depth estimation offers broader applicability unde...

of 7