201 Results Found

  • Article
  • Open Access
70 Citations
15,102 Views
13 Pages

20 August 2020

Taking Chengdu as an example, based on the destination image theory and employing the content analysis methodology, this paper conducts data mining on the online comment texts of TikTok short food videos, and analyzes the impact of short food videos...

  • Article
  • Open Access
4 Citations
7,499 Views
11 Pages

22 February 2024

The birth and subsequent growth of social media platforms has had a wide influence. In addition to beneficial features, it has long been noticed that heavy consumption of social media can have negative effects beyond a simple lack of time for other things...

  • Article
  • Open Access
6 Citations
4,230 Views
27 Pages

Approaches to Identifying Emotions and Affections During the Museum Learning Experience in the Context of the Future Internet

  • Iana Fominska,
  • Stefano Di Tore,
  • Michele Nappi,
  • Gerardo Iovane,
  • Maurizio Sibilio and
  • Angela Gelo

10 November 2024

The Future Internet aims to revolutionize digital interaction by integrating advanced technologies like AI and IoT, enabling a dynamic and resilient network. It envisions emotionally intelligent systems that can interpret and respond to human feeling...

  • Article
  • Open Access
352 Views
25 Pages

High-Frame-Rate Camera-Based Vibration Analysis for Health Monitoring of Industrial Robots Across Multiple Postures

  • Tuniyazi Abudoureheman,
  • Hayato Otsubo,
  • Feiyue Wang,
  • Kohei Shimasaki and
  • Idaku Ishii

2 December 2025

Accurate vibration measurement is crucial for maintaining the performance, reliability, and safety of automated manufacturing environments. Abnormal vibrations caused by faults in gears or bearings can degrade positional accuracy, reduce productivity...

  • Article
  • Open Access
16 Citations
4,535 Views
17 Pages

A Short Video Classification Framework Based on Cross-Modal Fusion

  • Nuo Pang,
  • Songlin Guo,
  • Ming Yan and
  • Chien Aun Chan

12 October 2023

The explosive growth of online short videos has brought great challenges to the efficient management of video content classification, retrieval, and recommendation. Video features for video management can be extracted from video image frames by vario...

  • Feature Paper
  • Article
  • Open Access
1 Citation
4,792 Views
27 Pages

31 August 2024

The automatic transformation of short background videos from real scenarios into other forms with a visually pleasing style, like those used in cartoons, holds application in various domains. These include animated films, video games, advertisements,...

  • Article
  • Open Access
8 Citations
3,115 Views
30 Pages

Express Image and Video Analysis Technology QAVIS: Application in System for Video Monitoring of Peter the Great Bay (Sea of Japan/East Sea)

  • Vitaly K. Fischenko,
  • Anna A. Goncharova,
  • Grigory I. Dolgikh,
  • Petr S. Zimin,
  • Aleksey E. Subote,
  • Nelly A. Klescheva and
  • Andrey V. Golik

1 October 2021

The article describes the technology of express analysis of images and videos, recorded by coastal video monitoring systems, developed by the authors. Its main feature is its ability to measure or evaluate in real time the signals of sea waves, sea l...

  • Article
  • Open Access
94 Citations
16,987 Views
20 Pages

Calibration of Action Cameras for Photogrammetric Purposes

  • Caterina Balletti,
  • Francesco Guerra,
  • Vassilios Tsioukas and
  • Paolo Vernier

18 September 2014

The use of action cameras for photogrammetry purposes is not widespread due to the fact that until recently the images provided by the sensors, using either still or video capture mode, were not big enough to perform and provide the appropriate analy...

  • Article
  • Open Access
9 Citations
2,896 Views
16 Pages

Topic-Oriented Text Features Can Match Visual Deep Models of Video Memorability

  • Ricardo Kleinlein,
  • Cristina Luna-Jiménez,
  • David Arias-Cuadrado,
  • Javier Ferreiros and
  • Fernando Fernández-Martínez

12 August 2021

Not every visual media production is equally retained in memory. Recent studies have shown that the elements of an image, as well as their mutual semantic dependencies, provide a strong clue as to whether a video clip will be recalled on a second vie...

  • Systematic Review
  • Open Access
17 Citations
5,869 Views
46 Pages

Visual Feature Learning on Video Object and Human Action Detection: A Systematic Review

  • Dengshan Li,
  • Rujing Wang,
  • Peng Chen,
  • Chengjun Xie,
  • Qiong Zhou and
  • Xiufang Jia

31 December 2021

Video object and human action detection are applied in many fields, such as video surveillance, face recognition, etc. Video object detection includes object classification and object location within the frame. Human action recognition is the detecti...

  • Article
  • Open Access
174 Citations
17,543 Views
19 Pages

18 July 2019

Fire is an abnormal event which can cause significant damage to lives and property. In this paper, we propose a deep learning-based fire detection method using a video sequence, which imitates the human fire detection process. The proposed method use...

  • Article
  • Open Access
2 Citations
3,281 Views
19 Pages

15 December 2021

Detecting saliency in videos is a fundamental step in many computer vision systems. Saliency is the significant target(s) in the video. The object of interest is further analyzed for high-level applications. The segregation of saliency and the backgr...

  • Article
  • Open Access
17 Citations
10,350 Views
24 Pages

Online reviews have become an important source of information for consumers, significantly influencing their purchasing decisions. However, the abundance and variety of review formats, especially the mix of text, image, and video elements, can lead t...

  • Article
  • Open Access
5 Citations
9,251 Views
22 Pages

Social media influencers strategically design the auditory and visual features of short videos to enhance consumer engagement. Among these, auditory emotional arousal and visual variation play crucial roles, yet their interactive effects remain under...

  • Article
  • Open Access
22 Citations
9,176 Views
18 Pages

Diverse Scene Stitching from a Large-Scale Aerial Video Dataset

  • Tao Yang,
  • Jing Li,
  • Jingyi Yu,
  • Sibing Wang and
  • Yanning Zhang

28 May 2015

Diverse scene stitching is a challenging task in aerial video surveillance. This paper presents a hybrid stitching method based on the observation that aerial videos captured in real surveillance settings are neither totally ordered nor completely un...

  • Article
  • Open Access
1,870 Views
15 Pages

Within the domain of multi-label classification for micro-videos, utilizing terrestrial datasets as a foundation, researchers have embarked on profound endeavors yielding extraordinary accomplishments. The research into multi-label classification bas...

  • Article
  • Open Access
3 Citations
2,843 Views
27 Pages

Enhancing Underwater Video from Consecutive Frames While Preserving Temporal Consistency

  • Kai Hu,
  • Yuancheng Meng,
  • Zichen Liao,
  • Lei Tang and
  • Xiaoling Ye

Current methods for underwater image enhancement primarily focus on single-frame processing. While these approaches achieve impressive results for static images, they often fail to maintain temporal coherence across frames in underwater videos, which...

  • Article
  • Open Access
12 Citations
8,071 Views
29 Pages

5 April 2023

This paper proposes a multi-convolutional neural network (CNN)-based system for the detection, tracking, and recognition of the emotions of dogs in surveillance videos. This system detects dogs in each frame of a video, tracks the dogs in the v...

  • Article
  • Open Access
12 Citations
8,733 Views
19 Pages

2 January 2024

The escalating use of security cameras has resulted in a surge in images requiring analysis, a task hindered by the inefficiency and error-prone nature of manual monitoring. In response, this study delves into the domain of anomaly detection in CCTV...

  • Feature Paper
  • Article
  • Open Access
16 Citations
4,896 Views
15 Pages

Action recognition is an active research field that aims to recognize human actions and intentions from a series of observations of human behavior and the environment. Unlike image-based action recognition mainly using a two-dimensional (2D) convolut...

  • Article
  • Open Access
12 Citations
2,930 Views
31 Pages

25 February 2023

Activity recognition in unmanned aerial vehicle (UAV) surveillance is addressed in various computer vision applications such as image retrieval, pose estimation, object detection, object detection in videos, object detection in still images, object d...

  • Letter
  • Open Access
6 Citations
3,356 Views
11 Pages

Improving Temporal Stability and Accuracy for Endoscopic Video Tissue Classification Using Recurrent Neural Networks

  • Tim Boers,
  • Joost van der Putten,
  • Maarten Struyvenberg,
  • Kiki Fockens,
  • Jelmer Jukema,
  • Erik Schoon,
  • Fons van der Sommen,
  • Jacques Bergman and
  • Peter de With

24 July 2020

Early Barrett's neoplasia are often missed due to subtle visual features and inexperience of the non-expert endoscopist with such lesions. While promising results have been reported on the automated detection of this type of early cancer in sti...

  • Article
  • Open Access
749 Views
23 Pages

Images Versus Videos in Contrast-Enhanced Ultrasound for Computer-Aided Diagnosis

  • Marina Adriana Mercioni,
  • Cătălin Daniel Căleanu and
  • Mihai-Eronim-Octavian Ursan

9 October 2025

The background of the article refers to the diagnosis of focal liver lesions (FLLs) through contrast-enhanced ultrasound (CEUS) based on the integration of spatial and temporal information. Traditional computer-aided diagnosis (CAD) systems predomina...

  • Article
  • Open Access
98 Citations
16,763 Views
9 Pages

Using the Horse Grimace Scale (HGS) to Assess Pain Associated with Acute Laminitis in Horses (Equus caballus)

  • Emanuela Dalla Costa,
  • Diana Stucke,
  • Francesca Dai,
  • Michela Minero,
  • Matthew C. Leach and
  • Dirk Lebelt

3 August 2016

Acute laminitis is a common equine disease characterized by intense foot pain, both acutely and chronically. The Obel grading system is the most widely accepted method for describing the severity of laminitis by equine practitioners, however this met...

  • Article
  • Open Access
29 Citations
5,179 Views
15 Pages

A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval

  • Hanqing Chen,
  • Chunyan Hu,
  • Feifei Lee,
  • Chaowei Lin,
  • Wei Yao,
  • Lu Chen and
  • Qiu Chen

29 April 2021

Recently, with the popularization of camera tools such as mobile phones and the rise of various short video platforms, a lot of videos are being uploaded to the Internet at all times, for which a video retrieval system with fast retrieval speed and h...

  • Article
  • Open Access
5 Citations
3,844 Views
20 Pages

Real-Time Multi-Label Upper Gastrointestinal Anatomy Recognition from Gastroscope Videos

  • Tao Yu,
  • Huiyi Hu,
  • Xinsen Zhang,
  • Honglin Lei,
  • Jiquan Liu,
  • Weiling Hu,
  • Huilong Duan and
  • Jianmin Si

24 March 2022

Esophagogastroduodenoscopy (EGD) is a critical step in the diagnosis of upper gastrointestinal disorders. However, due to inexperience or high workload, there is a wide variation in EGD performance by endoscopists. Variations in performance may resul...

  • Article
  • Open Access
4 Citations
2,369 Views
23 Pages

Regional Time-Series Coding Network and Multi-View Image Generation Network for Short-Time Gait Recognition

  • Wenhao Sun,
  • Guangda Lu,
  • Zhuangzhuang Zhao,
  • Tinghang Guo,
  • Zhuanping Qin and
  • Yu Han

23 May 2023

Gait recognition is one of the important research directions of biometric authentication technology. However, in practical applications, the original gait data is often short, and a long and complete gait video is required for successful recognition....

  • Article
  • Open Access
4 Citations
2,487 Views
24 Pages

Surveying of Nearshore Bathymetry Using UAVs Video Stitching

  • Jinchang Fan,
  • Hailong Pei and
  • Zengjie Lian

In this paper, we extended video stitching to nearshore bathymetry for videos that were captured for the same coastal field simultaneously by two unmanned aerial vehicles (UAVs). In practice, a video captured by a single UAV often shows a limited coa...

  • Article
  • Open Access
97 Citations
11,516 Views
17 Pages

Human Activity Classification Using the 3DCNN Architecture

  • Roberta Vrskova,
  • Robert Hudec,
  • Patrik Kamencay and
  • Peter Sykora

17 January 2022

Interest in utilizing neural networks in a variety of scientific and academic studies and in industrial applications is increasing. In addition to the growing interest in neural networks, there is also a rising interest in video classification. Objec...

  • Article
  • Open Access
1,242 Views
10 Pages

Effectiveness of Watching a Kumagai Method Video for Long-Nipple Bottle-Feeding for Children with Cleft Lip and Palate: A Pilot Experimental Before–After Trial Study

  • Shingo Ueki,
  • Yukari Kumagai,
  • Yumi Hirai,
  • Eri Nagatomo,
  • Shoko Miyauchi,
  • Takuro Inoue,
  • Qi An,
  • Eri Tashiro and
  • Junko Miyata

8 November 2024

Aim: This study aimed to determine whether the Kumagai method could be followed by watching an instructional video and to compare the feeding actions of specialists and the general population. Materials and Methods: Eleven adults from diverse backgro...

  • Article
  • Open Access
7 Citations
2,943 Views
16 Pages

Emotion Classification Based on Pulsatile Images Extracted from Short Facial Videos via Deep Learning

  • Shlomi Talala,
  • Shaul Shvimmer,
  • Rotem Simhon,
  • Michael Gilead and
  • Yitzhak Yitzhaky

19 April 2024

Most human emotion recognition methods largely depend on classifying stereotypical facial expressions that represent emotions. However, such facial expressions do not necessarily correspond to actual emotional states and may correspond to communicati...

  • Article
  • Open Access
1 Citation
4,475 Views
14 Pages

A Fast and Cost-Effective (FACE) Instrument Setting to Construct Focus-Extended Images

  • Gilbert Audira,
  • Ting-Wei Hsu,
  • Kelvin H.-C. Chen,
  • Jong-Chin Huang,
  • Ming-Der Lin,
  • Tzong-Rong Ger and
  • Chung-Der Hsiao

29 November 2022

Image stacking is a crucial method for micro or macro photography. It captures images at different focal planes and then merges them into a single, all-in-focus image with extended focus. This method has been extensively used for digital documentatio...

  • Article
  • Open Access
17 Citations
7,996 Views
27 Pages

Underwater Communications for Video Surveillance Systems at 2.4 GHz

  • Sandra Sendra,
  • Jaime Lloret,
  • Jose Miguel Jimenez and
  • Joel J.P.C. Rodrigues

23 October 2016

Video surveillance is needed to control many activities performed in underwater environments. The use of wired media can be a problem since the material specially designed for underwater environments is very expensive. In order to transmit the images...

  • Article
  • Open Access
66 Citations
5,794 Views
8 Pages

Clinically Feasible and Accurate View Classification of Echocardiographic Images Using Deep Learning

  • Kenya Kusunose,
  • Akihiro Haga,
  • Mizuki Inoue,
  • Daiju Fukuda,
  • Hirotsugu Yamada and
  • Masataka Sata

25 April 2020

A proper echocardiographic study requires several video clips recorded from different acquisition angles for observation of the complex cardiac anatomy. However, these video clips are not necessarily labeled in a database. Identification of the acqui...

  • Article
  • Open Access
12 Citations
3,449 Views
28 Pages

17 October 2023

The inspection of condition of underwater pipelines (UPs) based on autonomous underwater vehicles (AUVs) requires high accuracy of positioning while the AUV is moving along to the object being examined. Currently, acoustic, magnetometric, and visual...

  • Article
  • Open Access
9 Citations
3,366 Views
15 Pages

27 May 2021

The field of research related to video data has difficulty in extracting not only spatial but also temporal features and human action recognition (HAR) is a representative field of research that applies convolutional neural network (CNN) to video dat...

  • Article
  • Open Access
583 Views
25 Pages

Every minute, vast amounts of video and image data are uploaded worldwide to the internet and social media platforms, creating a rich visual archive of human experiences, from weddings and family gatherings to significant historical events such...

  • Article
  • Open Access
11 Citations
2,450 Views
17 Pages

1 September 2022

As a sub-field of video content analysis, action recognition has received extensive attention in recent years, which aims to recognize human actions in videos. Compared with a single image, video has a temporal dimension. Therefore, it is of great si...

  • Article
  • Open Access
4 Citations
3,253 Views
13 Pages

An Approach for Fall Prediction Based on Kinematics of Body Key Points Using LSTM

  • Bahareh Mobasheri,
  • Seyed Reza Kamel Tabbakh and
  • Yahya Forghani

Many studies have used sensors attached to adults in order to collect signals by which one can carry out analyses to predict falls. In addition, there are research studies in which videos and photographs were used to extract and analyze body posture...

  • Article
  • Open Access
7 Citations
2,374 Views
17 Pages

Semantically-Enhanced Feature Extraction with CLIP and Transformer Networks for Driver Fatigue Detection

  • Zhen Gao,
  • Xiaowen Chen,
  • Jingning Xu,
  • Rongjie Yu,
  • Heng Zhang and
  • Jinqiu Yang

12 December 2024

Drowsy driving is a leading cause of commercial vehicle traffic crashes. The trend is to train fatigue detection models using deep neural networks on driver video data, but challenges remain in coarse and incomplete high-level feature extraction and...

  • Article
  • Open Access
5 Citations
2,482 Views
17 Pages

29 February 2024

Surveillance video analytics encounters unprecedented challenges in 5G and IoT environments, including complex intra-class variations, short-term and long-term temporal dynamics, and variable video quality. This study introduces Edge-Enhanced TempoFu...

  • Article
  • Open Access
28 Citations
9,100 Views
14 Pages

A New Approach to Image-Based Estimation of Food Volume

  • Hamid Hassannejad,
  • Guido Matrella,
  • Paolo Ciampolini,
  • Ilaria De Munari,
  • Monica Mordonini and
  • Stefano Cagnoni

10 June 2017

A balanced diet is the key to a healthy lifestyle and is crucial for preventing or dealing with many chronic diseases such as diabetes and obesity. Therefore, monitoring diet can be an effective way of improving people’s health. However, manual repor...

  • Article
  • Open Access
1 Citation
1,191 Views
12 Pages

5 April 2025

With the growing interest in health and fitness, whey protein supplements are becoming increasingly popular among fitness enthusiasts and athletes. The surge in demand for whey protein supplements highlights the need for cost-effective methods to cha...

  • Article
  • Open Access
1,069 Views
25 Pages

Multimodal Fusion Image Stabilization Algorithm for Bio-Inspired Flapping-Wing Aircraft

  • Zhikai Wang,
  • Sen Wang,
  • Yiwen Hu,
  • Yangfan Zhou,
  • Na Li and
  • Xiaofeng Zhang

This paper presents FWStab, a specialized video stabilization dataset tailored for flapping-wing platforms. The dataset encompasses five typical flight scenarios, featuring 48 video clips with intense dynamic jitter. The corresponding Inertial Measur...

  • Article
  • Open Access
2 Citations
3,331 Views
15 Pages

4 December 2024

Video prediction, which is the task of predicting future video frames based on past observations, remains a challenging problem because of the complexity and high dimensionality of spatiotemporal dynamics. To address the problems associated with spat...

  • Article
  • Open Access
6 Citations
2,775 Views
17 Pages

Gaze Tracking Based on Concatenating Spatial-Temporal Features

  • Bor-Jiunn Hwang,
  • Hui-Hui Chen,
  • Chaur-Heh Hsieh and
  • Deng-Yu Huang

11 January 2022

Based on experimental observations, there is a correlation between time and consecutive gaze positions in visual behaviors. Previous studies on gaze point estimation usually use images as the input for model trainings without taking into account the...

  • Review
  • Open Access
56 Citations
21,129 Views
28 Pages

A Review of Image Processing Techniques for Deepfakes

  • Hina Fatima Shahzad,
  • Furqan Rustam,
  • Emmanuel Soriano Flores,
  • Juan Luís Vidal Mazón,
  • Isabel de la Torre Diez and
  • Imran Ashraf

16 June 2022

Deep learning is used to address a wide range of challenging issues including large data analysis, image processing, object detection, and autonomous control. In the same way, deep learning techniques are also used to develop software and techniques...

  • Article
  • Open Access
20 Citations
5,091 Views
17 Pages

16 June 2023

In the billions of faces that are shaped by thousands of different cultures and ethnicities, one thing remains universal: the way emotions are expressed. To take the next step in human–machine interactions, a machine (e.g., a humanoid robot) mu...

  • Article
  • Open Access
12 Citations
3,995 Views
19 Pages

14 April 2022

Person re-identification (Re-ID) technology has been a research hotspot in intelligent video surveillance, which accurately retrieves specific pedestrians from massive video data. Most research focuses on the short-term scenarios of person Re-ID to de...

  • Article
  • Open Access
1 Citation
3,969 Views
20 Pages

Multimedia Data Modelling Using Multidimensional Recurrent Neural Networks

  • Zhen He,
  • Shaobing Gao,
  • Liang Xiao,
  • Daxue Liu and
  • Hangen He

1 September 2018

Modelling the multimedia data such as text, images, or videos usually involves the analysis, prediction, or reconstruction of them. The recurrent neural network (RNN) is a powerful machine learning approach to modelling these data in a recursive way....
