You are currently on the new version of our website. Access the old version .

253 Results Found

  • Article
  • Open Access
4 Citations
3,062 Views
16 Pages

Unsupervised Learning from Videos for Object Discovery in Single Images

  • Dong Zhao,
  • Baoqing Ding,
  • Yulin Wu,
  • Lei Chen and
  • Hongchao Zhou

29 December 2020

This paper proposes a method for discovering the primary objects in single images by learning from videos in a purely unsupervised manner—the learning process is based on videos, but the generated network is able to discover objects from a sing...

  • Article
  • Open Access
3 Citations
3,706 Views
17 Pages

Learning Domain-Adaptive Landmark Detection-Based Self-Supervised Video Synchronization for Remote Sensing Panorama

  • Ling Mei,
  • Yizhuo He,
  • Farnoosh Javadi Fishani,
  • Yaowen Yu,
  • Lijun Zhang and
  • Helge Rhodin

9 February 2023

The synchronization of videos is an essential pre-processing step for multi-view reconstruction such as the image mosaic by UAV remote sensing; it is often solved with hardware solutions in motion capture studios. However, traditional synchronization...

  • Article
  • Open Access
15 Citations
4,088 Views
41 Pages

Abnormal event detection is one of the most challenging tasks in computer vision. Many existing deep anomaly detection models are based on reconstruction errors, where the training phase is performed using only videos of normal events and the model i...

  • Article
  • Open Access
2,261 Views
21 Pages

Attention-Guided HDR Reconstruction for Enhancing Smart City Applications

  • Yung-Yao Chen,
  • Chih-Hsien Hsia,
  • Sin-Ye Jhong and
  • Chin-Feng Lai

12 November 2023

In the context of smart city development, video surveillance serves as a critical component for maintaining public safety and operational efficiency. However, traditional surveillance systems are often constrained by a limited dynamic range, leading...

  • Article
  • Open Access
664 Views
16 Pages

15 October 2025

Conventional audio and video codecs are designed for human perception, often discarding subtle spectral cues that are essential for machine-based analysis. To overcome this limitation, we propose a machine-oriented compression framework that reinterp...

  • Article
  • Open Access
2 Citations
6,429 Views
27 Pages

HFR Projector Camera Based Visible Light Communication System for Real-Time Video Streaming

  • Atul Sharma,
  • Sushil Raut,
  • Kohei Shimasaki,
  • Taku Senoo and
  • Idaku Ishii

19 September 2020

This study develops a projector–camera-based visible light communication (VLC) system for real-time broadband video streaming, in which a high frame rate (HFR) projector can encode and project a color input video sequence into binary image patt...

  • Article
  • Open Access
4 Citations
3,463 Views
15 Pages

29 March 2024

Video super-resolution (VSR) remains challenging for real-world applications due to complex and unknown degradations. Existing methods lack the flexibility to handle video sequences with different degradation levels, thus failing to reflect real-worl...

  • Article
  • Open Access
4 Citations
3,033 Views
15 Pages

Lightweight Super-Resolution with Self-Calibrated Convolution for Panoramic Videos

  • Fanjie Shang,
  • Hongying Liu,
  • Wanhao Ma,
  • Yuanyuan Liu,
  • Licheng Jiao,
  • Fanhua Shang,
  • Lijun Wang and
  • Zhenyu Zhou

30 December 2022

Panoramic videos are shot by an omnidirectional camera or a collection of cameras, and can display a view in every direction. They can provide viewers with an immersive feeling. The study of super-resolution of panoramic videos has attracted much att...

  • Article
  • Open Access
11 Citations
4,401 Views
13 Pages

1 September 2018

Nowadays, video surveillance has become ubiquitous with the quick development of artificial intelligence. Multi-object detection (MOD) is a key step in video surveillance and has been widely studied for a long time. The majority of existing MOD algor...

  • Article
  • Open Access
4 Citations
2,295 Views
14 Pages

1 April 2024

Dynamic mode decomposition (DMD) is a powerful tool for separating the background and foreground in videos. This algorithm decomposes a video into dynamic modes, called DMD modes, to facilitate the extraction of the near-zero mode, which represents t...

  • Article
  • Open Access
305 Views
21 Pages

10 January 2026

This study presents a comprehensive digital workflow for the archaeological investigation and heritage enhancement of the Coëby megalithic necropolis (Brittany, France). Dating to the Middle Neolithic, between the 4th and 3rd millennia BC, this...

  • Article
  • Open Access
1,732 Views
17 Pages

Use of Spatio-Structural Parameters of the Multiscan Video Signal for Improving Accuracy of Control over Object Geometric Parameters

  • Vladimír Tlach,
  • Michael Yurievich Alies,
  • Ivan Kuric,
  • Milan Sága,
  • Yuriy Konstantinovich Shelkovnikov,
  • Igor Olegovic Arkhipov,
  • Aleksandr Ivanovich Korshunov and
  • Anastasia Alekseevna Meteleva

26 February 2023

In the present paper, we consider the issue of improving the accuracy of measurements and the peculiar features of the measurements of the geometric parameters of objects by optoelectronic systems, based on a television multiscan in the analogue mode...

  • Article
  • Open Access
4 Citations
3,472 Views
14 Pages

Real-Time Hyperspectral Video Acquisition with Coded Slits

  • Guoliang Tang,
  • Zi Wang,
  • Shijie Liu,
  • Chunlai Li and
  • Jianyu Wang

21 January 2022

We propose a real-time hyperspectral video acquisition system that uses coded slits. Conventional imaging spectrometers usually have scanning mechanisms that reduce the temporal resolution or sacrifice the spatial resolution to acquire spectral infor...

  • Article
  • Open Access
11 Citations
5,529 Views
19 Pages

A High-Speed Imaging Method Based on Compressive Sensing for Sound Extraction Using a Low-Speed Camera

  • Ge Zhu,
  • Xu-Ri Yao,
  • Zhi-Bin Sun,
  • Peng Qiu,
  • Chao Wang,
  • Guang-Jie Zhai and
  • Qing Zhao

11 May 2018

This paper reports an efficient method for sound extraction from high-speed light spot videos reconstructed from the coded light spot images captured with a low-speed camera based on compressive sensing, but at the expense of consuming time. The prop...

  • Communication
  • Open Access
2 Citations
3,687 Views
12 Pages

Accurate and Serialized Dense Point Cloud Reconstruction for Aerial Video Sequences

  • Shibiao Xu,
  • Bingbing Pan,
  • Jiguang Zhang and
  • Xiaopeng Zhang

17 March 2023

Traditional multi-view stereo (MVS) is not applicable for the point cloud reconstruction of serialized video frames. Among them, the exhausted feature extraction and matching for all the prepared frames are time-consuming, and the scope of the search...

  • Article
  • Open Access
7 Citations
3,788 Views
12 Pages

Accurate Wide Angle SAR Imaging Based on LS-CS-Residual

  • Zhonghao Wei,
  • Bingchen Zhang and
  • Yirong Wu

25 January 2019

Wide angle synthetic aperture radar (WASAR) receives data from a large angle, which causes the problem of aspect dependent scattering. L 1 regularization is a common compressed sensing (CS) model. The L 1 regularization based WASAR im...

  • Article
  • Open Access
9 Citations
4,178 Views
18 Pages

5 February 2022

Building heritage contributes to the historical context and industrial history of a city. Brick warehouses, which comprise a systematic interface between components, demand an interactive manipulation of inspected parts to interpret their constructio...

  • Article
  • Open Access
2 Citations
2,677 Views
14 Pages

CT-Video Matching for Retrograde Intrarenal Surgery Based on Depth Prediction and Style Transfer

  • Honglin Lei,
  • Yanqi Pan,
  • Tao Yu,
  • Zuoming Fu,
  • Chongan Zhang,
  • Xinsen Zhang,
  • Peng Wang,
  • Jiquan Liu,
  • Xuesong Ye and
  • Huilong Duan

14 October 2021

Retrograde intrarenal surgery (RIRS) is a minimally invasive endoscopic procedure for the treatment of kidney stones. Traditionally, RIRS is usually performed by reconstructing a 3D model of the kidney from preoperative CT images in order to locate t...

  • Article
  • Open Access
1 Citations
1,222 Views
17 Pages

27 September 2024

This paper presents an enhanced method for the transmission of 3D video in the Multi-view Video plus Depth (MVD) format over Two-Way Relay Channels (TWRC). Our approach addresses the unique challenges of MVD-based 3D video by combining Hierarchical Q...

  • Article
  • Open Access
22 Citations
7,347 Views
15 Pages

Vertical Dynamic Deflection Measurement in Concrete Beams with the Microsoft Kinect

  • Xiaojuan Qi,
  • Derek Lichti,
  • Mamdouh El-Badry,
  • Jacky Chow and
  • Kathleen Ang

19 February 2014

The Microsoft Kinect is arguably the most popular RGB-D camera currently on the market, partially due to its low cost. It offers many advantages for the measurement of dynamic phenomena since it can directly measure three-dimensional coordinates of o...

  • Article
  • Open Access
2 Citations
2,526 Views
17 Pages

28 November 2023

High-Efficiency Video Coding (HEVC) is one of the most widely studied coding standards. It still uses the block-based hybrid coding framework of Advanced Video Coding (AVC), and compared to AVC, it can double the compression ratio while maintaining t...

  • Article
  • Open Access
1 Citations
5,546 Views
24 Pages

With an increased interest in applications that require a clean background image, such as video surveillance, object tracking, street view imaging and location-based services on web-based maps, multiple algorithms have been developed to reconstruct a...

  • Article
  • Open Access
119 Views
22 Pages

Accurate assessment of rice resistance to Sogatella furcifera (Horváth) is essential for breeding insect-resistant cultivars. Traditional assessment methods rely on manual scoring of damage severity, which is subjective and inefficient. To ove...

  • Article
  • Open Access
20 Citations
5,279 Views
18 Pages

1 December 2021

The conventional reconstruction method of off-axis digital holographic microscopy (DHM) relies on computational processing that involves spatial filtering of the sample spectrum and tilt compensation between the interfering waves to accurately recons...

  • Article
  • Open Access
17 Citations
3,926 Views
19 Pages

Event Encryption for Neuromorphic Vision Sensors: Framework, Algorithm, and Evaluation

  • Bowen Du,
  • Weiqi Li,
  • Zeju Wang,
  • Manxin Xu,
  • Tianchen Gao,
  • Jiajie Li and
  • Hongkai Wen

24 June 2021

Nowadays, our lives have benefited from various vision-based applications, such as video surveillance, human identification and aided driving. Unauthorized access to the vision-related data greatly threatens users’ privacy, and many encryption scheme...

  • Article
  • Open Access
6 Citations
3,364 Views
13 Pages

A GAN-Based Video Intra Coding

  • Guangyu Zhong,
  • Jun Wang,
  • Jiyuan Hu and
  • Fan Liang

Intra prediction is a vital part of the image/video coding framework, which is designed to remove spatial redundancy within a picture. Based on a set of predefined linear combinations, traditional intra prediction cannot cope with coding blocks with...

  • Article
  • Open Access
6 Citations
3,695 Views
32 Pages

We designed a teaching–learning sequence on relative motion in classical mechanics, based on the model of educational reconstruction and on the fundamental design principle of highlighting those conceptual elements which could be valuable in th...

  • Article
  • Open Access
19 Citations
8,731 Views
20 Pages

A Novel Abandoned Object Detection System Based on Three-Dimensional Image Information

  • Yiliang Zeng,
  • Jinhui Lan,
  • Bin Ran,
  • Jing Gao and
  • Jinlin Zou

23 March 2015

A new idea of an abandoned object detection system for road traffic surveillance systems based on three-dimensional image information is proposed in this paper to prevent traffic accidents. A novel Binocular Information Reconstruction and Recognition...

  • Article
  • Open Access
1 Citations
5,058 Views
20 Pages

Automatic 3D Reconstruction: Mesh Extraction Based on Gaussian Splatting from Romanesque–Mudéjar Churches

  • Nelson Montas-Laracuente,
  • Emilio Delgado Martos,
  • Carlos Pesqueira-Calvo,
  • Giovanni Intra Sidola,
  • Ana Maitín,
  • Alberto Nogales and
  • Álvaro José García-Tejedor

28 July 2025

This research introduces an automated 3D virtual reconstruction system tailored for architectural heritage (AH) applications, contributing to the ongoing paradigm shift from traditional CAD-based workflows to artificial intelligence-driven methodolog...

  • Article
  • Open Access
1,532 Views
15 Pages

18 November 2025

Three-dimensional Human Reconstruction from Monocular Vision is a key technology in Virtual Reality and digital humans. It aims to recover the 3D structure and pose of the human body from 2D images or video. Current methods for dynamic 3D reconstruct...

  • Article
  • Open Access
161 Views
15 Pages

14 January 2026

Remote photoplethysmography (rPPG) enables non-contact acquisition of human physiological parameters using ordinary cameras, and has been widely applied in medical monitoring, human–computer interaction, and health management. However, most exi...

  • Article
  • Open Access
3 Citations
3,034 Views
16 Pages

9 April 2021

In general dynamic scenes, blurring is the result of the motion of multiple objects, camera shaking or scene depth variations. As an inverse process, deblurring extracts a sharp video sequence from the information contained in one single blurry image...

  • Article
  • Open Access
8 Citations
3,377 Views
15 Pages

19 February 2019

The development of high-speed camera systems and image processing techniques has promoted the use of vision-based methods as a practical alternative for the analysis of non-contact structural dynamic responses. In this study, a deviation extraction m...

  • Article
  • Open Access
4 Citations
5,964 Views
25 Pages

Hardware Implementation and Validation of 3D Underwater Shape Reconstruction Algorithm Using a Stereo-Catadioptric System

  • Rihab Hmida,
  • Abdessalem Ben Abdelali,
  • Frédéric Comby,
  • Lionel Lapierre,
  • Abdellatif Mtibaa and
  • René Zapata

31 August 2016

In this paper, we present a new stereo vision-based system and its efficient hardware implementation for real-time underwater environments exploration throughout 3D sparse reconstruction based on a number of feature points. The proposed underwater 3D...

  • Article
  • Open Access
5 Citations
5,442 Views
12 Pages

9 December 2017

Recently, the stereo imaging-based image enhancement approach has attracted increasing attention in the field of video analysis. This paper presents a dual camera-based stereo image defogging algorithm. Optical flow is first estimated from the stereo...

  • Article
  • Open Access
26 Citations
5,037 Views
21 Pages

31 March 2019

Depth-based reconstruction of three-dimensional (3D) shape of objects is one of core problems in computer vision with a lot of commercial applications. However, the 3D scanning for point cloud-based video streaming is expensive and is generally unatt...

  • Article
  • Open Access
1 Citations
1,532 Views
34 Pages

A Hybrid Contrast and Texture Masking Model to Boost High Efficiency Video Coding Perceptual Rate-Distortion Performance

  • Javier Ruiz Atencia,
  • Otoniel López-Granado,
  • Manuel Pérez Malumbres,
  • Miguel Martínez-Rach,
  • Damian Ruiz Coll,
  • Gerardo Fernández Escribano and
  • Glenn Van Wallendael

22 August 2024

As most of the videos are destined for human perception, many techniques have been designed to improve video coding based on how the human visual system perceives video quality. In this paper, we propose the use of two perceptual coding techniques, n...

  • Article
  • Open Access
556 Views
21 Pages

E-Sem3DGS: Monocular Human and Scene Reconstruction via Event-Aided Semantic 3DGS

  • Xiaoting Yin,
  • Hao Shi,
  • Kailun Yang,
  • Jiajun Zhai,
  • Shangwei Guo and
  • Kaiwei Wang

27 December 2025

Reconstructing animatable humans, together with their surrounding static environments, from monocular, motion-blurred videos is still challenging for current neural rendering methods. Existing monocular human reconstruction approaches achieve impress...

  • Article
  • Open Access
5 Citations
7,269 Views
16 Pages

12 January 2023

Recent studies have shown that deep learning achieves excellent performance in reconstructing 3D scenes from multiview images or videos. However, these reconstructions do not provide the identities of objects, and object identification is necessary f...

  • Article
  • Open Access
11 Citations
4,306 Views
19 Pages

13 February 2021

With the help of deep neural networks, video super-resolution (VSR) has made a huge breakthrough. However, these deep learning-based methods are rarely used in specific situations. In addition, training sets may not be suitable because many methods o...

  • Article
  • Open Access
4 Citations
3,268 Views
15 Pages

A General Framework for Reconstructing Full-Sample Continuous Vehicle Trajectories Using Roadside Sensing Data

  • Guimin Su,
  • Zimu Zeng,
  • Andi Song,
  • Cong Zhao,
  • Feng Shen,
  • Liangxiao Yuan and
  • Xinghua Li

28 February 2023

Vehicle trajectory data play an important role in autonomous driving and intelligent traffic control. With the widespread deployment of roadside sensors, such as cameras and millimeter-wave radar, it is possible to obtain full-sample vehicle trajecto...

  • Article
  • Open Access
6 Citations
4,109 Views
22 Pages

Generation of Multiple Frames for High Resolution Video SAR Based on Time Frequency Sub-Aperture Technique

  • Congrui Yang,
  • Zhen Chen,
  • Yunkai Deng,
  • Wei Wang,
  • Pei Wang and
  • Fengjun Zhao

2 January 2023

Video Synthetic Aperture Radar (ViSAR) operating in spotlight mode has received widespread attention in recent years because of its ability to form a sequence of SAR images for a region of interest (ROI). However, due to the heavy computational burde...

  • Article
  • Open Access
6 Citations
3,045 Views
21 Pages

29 June 2022

For accurate and effective automatic vehicle identification, morphological detection and deep convolutional networks were combined to propose a method for locating and identifying vehicle models from unmanned aerial vehicle (UAV) videos. First, the r...

  • Article
  • Open Access
3 Citations
6,079 Views
28 Pages

Model-Based Real-Time Non-Rigid Tracking

  • Sebastián Bronte,
  • Luis M. Bergasa,
  • Daniel Pizarro and
  • Rafael Barea

14 October 2017

This paper presents a sequential non-rigid reconstruction method that recovers the 3D shape and the camera pose of a deforming object from a video sequence and a previous shape model of the object. We take PTAM (Parallel Mapping and Tracking), a stat...

  • Article
  • Open Access
5 Citations
3,007 Views
16 Pages

A Research on Accident Reconstruction of Bus–Two-Wheeled Vehicle Based on Vehicle Damage and Human Head Injury

  • Shang Gao,
  • Mao Li,
  • Qian Wang,
  • Xianlong Jin,
  • Xinyi Hou,
  • Chuang Qin and
  • Shuangzhi Fu

The problem of large calculation models in bus–two-wheeled vehicle traffic accidents (TA) leads to the difficulty of balancing the calculation efficiency and accuracy, as well as difficulties in accident reconstruction. Herein, two typical acci...

  • Article
  • Open Access
4 Citations
2,789 Views
18 Pages

29 August 2022

A human can infer the magnitude of interaction force solely based on visual information because of prior knowledge in human–robot interaction (HRI). A method of reconstructing tactile information through cross-modal signal processing is propose...

  • Article
  • Open Access
29 Citations
5,050 Views
15 Pages

25 October 2020

Leafy vegetables are an essential source of the various nutrients that people need in their daily lives. The quantification of vegetable phenotypes and yield estimation are prerequisites for the selection of genetic varieties and for the improvement...

  • Article
  • Open Access
15 Citations
9,969 Views
18 Pages

8 September 2020

Frame interpolation, which generates an intermediate frame given adjacent ones, finds various applications such as frame rate up-conversion, video compression, and video streaming. Instead of using complex network models and additional data involved...

  • Article
  • Open Access
1,392 Views
20 Pages

Seed 3D Phenotyping Across Multiple Crops Using 3D Gaussian Splatting

  • Jun Gao,
  • Chao Zhu,
  • Junguo Hu,
  • Fei Deng,
  • Zhaoxin Xu and
  • Xiaomin Wang

8 November 2025

This study introduces a versatile seed 3D reconstruction method that is applicable to multiple crops—including maize, wheat, and rice—and designed to overcome the inefficiency and subjectivity of manual measurements and the high costs of...

  • Article
  • Open Access
7 Citations
2,781 Views
18 Pages

23 May 2022

The electromagnetic protection of IT devices includes a number of organizational and technical measures aimed at ensuring control over radiated and conducted revealing emissions. This is of particular importance for ensuring information security in w...

of 6