You are currently viewing a new version of our website. To view the old version click .

294 Results Found

  • Article
  • Open Access
77 Citations
15,799 Views
16 Pages

28 November 2019

Aerial human action recognition is an emerging topic in drone applications. Commercial drone platforms capable of detecting basic human actions such as hand gestures have been developed. However, a limited number of aerial video datasets are availabl...

  • Article
  • Open Access
63 Citations
6,814 Views
13 Pages

24 August 2019

This paper presents the simultaneous utilization of video images and inertial signals that are captured at the same time via a video camera and a wearable inertial sensor within a fusion framework in order to achieve a more robust human action recogn...

  • Article
  • Open Access
6 Citations
4,658 Views
19 Pages

A Multi-Scale Video Longformer Network for Action Recognition

  • Congping Chen,
  • Chunsheng Zhang and
  • Xin Dong

26 January 2024

Action recognition has found extensive applications in fields such as video classification and security monitoring. However, existing action recognition methods, such as those based on 3D convolutional neural networks, often struggle to capture compr...

  • Article
  • Open Access
11 Citations
3,373 Views
16 Pages

Spatiotemporal Interaction Residual Networks with Pseudo3D for Video Action Recognition

  • Jianyu Chen,
  • Jun Kong,
  • Hui Sun,
  • Hui Xu,
  • Xiaoli Liu,
  • Yinghua Lu and
  • Caixia Zheng

1 June 2020

Action recognition is a significant and challenging topic in the field of sensor and computer vision. Two-stream convolutional neural networks (CNNs) and 3D CNNs are two mainstream deep learning architectures for video action recognition. To combine...

  • Article
  • Open Access
3 Citations
3,564 Views
18 Pages

15 November 2022

Spatiotemporal and motion feature representations are the key to video action recognition. Typical previous approaches are to utilize 3D CNNs to cope with both spatial and temporal features, but they suffer from huge computations. Other approaches ar...

  • Article
  • Open Access
2 Citations
3,527 Views
17 Pages

Action Recognition in Videos through a Transfer-Learning-Based Technique

  • Elizabeth López-Lozada,
  • Humberto Sossa,
  • Elsa Rubio-Espino and
  • Jesús Yaljá Montiel-Pérez

17 October 2024

In computer vision, human action recognition is a hot topic, popularized by the development of deep learning. Deep learning models typically accept video input without prior processing and train them to achieve recognition. However, conducting prelim...

  • Article
  • Open Access
3 Citations
6,551 Views
19 Pages

A Generalized Pyramid Matching Kernel for Human Action Recognition in Realistic Videos

  • Jun Zhu,
  • Quan Zhou,
  • Weijia Zou,
  • Rui Zhang and
  • Wenjun Zhang

24 October 2013

Human action recognition is an increasingly important research topic in the fields of video sensing, analysis and understanding. Caused by unconstrained sensing conditions, there exist large intra-class variations and inter-class ambiguities in reali...

  • Article
  • Open Access
1 Citations
2,251 Views
15 Pages

4 January 2025

Recognizing the action of plastic bag taking from CCTV video footage represents a highly specialized and niche challenge within the broader domain of action video classification. To address this challenge, our paper introduces a novel benchmark video...

  • Article
  • Open Access
3 Citations
2,284 Views
16 Pages

28 February 2025

Video action recognition aims to achieve the automatic classification of human behaviors by analyzing the actions in videos, with its core lying in accurately capturing the spatial detail features of images and the temporal dynamic features among vid...

  • Article
  • Open Access
4 Citations
3,228 Views
18 Pages

Training a model to recognize human actions in videos is computationally intensive. While modern strategies employ transfer learning methods to make the process more efficient, they still face challenges regarding flexibility and efficiency. Existing...

  • Article
  • Open Access
2 Citations
3,427 Views
18 Pages

24 October 2024

Conventional approaches to video action recognition perform global attention over the entire video patches, which may be ineffective due to the temporal redundancy of video frames. Recent works on masked video modeling adopt a high-ratio tube masking...

  • Article
  • Open Access
10 Citations
3,452 Views
18 Pages

An Efficient Human Instance-Guided Framework for Video Action Recognition

  • Inwoong Lee,
  • Doyoung Kim,
  • Dongyoon Wee and
  • Sanghoon Lee

12 December 2021

In recent years, human action recognition has been studied by many computer vision researchers. Recent studies have attempted to use two-stream networks using appearance and motion features, but most of these approaches focused on clip-level video ac...

  • Article
  • Open Access
2 Citations
3,601 Views
19 Pages

DanceTrend: An Integration Framework of Video-Based Body Action Recognition and Color Space Features for Dance Popularity Prediction

  • Shiying Ding,
  • Xingyu Hou,
  • Yujia Liu,
  • Wenxuan Zhu,
  • Dong Fang,
  • Yusi Fan,
  • Kewei Li,
  • Lan Huang and
  • Fengfeng Zhou

18 November 2023

Background: With the rise of user-generated content (UGC) platforms, we are witnessing an unprecedented surge in data. Among various content types, dance videos have emerged as a potent medium for artistic and emotional expression in the Web 2.0 era....

  • Article
  • Open Access
3 Citations
3,764 Views
18 Pages

27 September 2023

The primary goal of this study is to develop a deep neural network for action recognition that enhances accuracy and minimizes computational costs. In this regard, we propose a modified EMO-MoviNet-A2* architecture that integrates Evolving Normalizat...

  • Review
  • Open Access
11 Citations
11,172 Views
31 Pages

Intelligent Video Analytics for Human Action Recognition: The State of Knowledge

  • Marek Kulbacki,
  • Jakub Segen,
  • Zenon Chaczko,
  • Jerzy W. Rozenblit,
  • Michał Kulbacki,
  • Ryszard Klempous and
  • Konrad Wojciechowski

25 April 2023

The paper presents a comprehensive overview of intelligent video analytics and human action recognition methods. The article provides an overview of the current state of knowledge in the field of human activity recognition, including various techniqu...

  • Article
  • Open Access
1 Citations
3,001 Views
17 Pages

FineTea: A Novel Fine-Grained Action Recognition Video Dataset for Tea Ceremony Actions

  • Changwei Ouyang,
  • Yun Yi,
  • Hanli Wang,
  • Jin Zhou and
  • Tao Tian

31 August 2024

Methods based on deep learning have achieved great success in the field of video action recognition. When these methods are applied to real-world scenarios that require fine-grained analysis of actions, such as being tested on a tea ceremony, limitat...

  • Article
  • Open Access
7 Citations
3,731 Views
19 Pages

WLiT: Windows and Linear Transformer for Video Action Recognition

  • Ruoxi Sun,
  • Tianzhao Zhang,
  • Yong Wan,
  • Fuping Zhang and
  • Jianming Wei

2 February 2023

The emergence of Transformer has led to the rapid development of video understanding, but it also brings the problem of high computational complexity. Previously, there were methods to divide the feature maps into windows along the spatiotemporal dim...

  • Article
  • Open Access
80 Citations
7,839 Views
32 Pages

Histogram of Oriented Gradient-Based Fusion of Features for Human Action Recognition in Action Video Sequences

  • Chirag I. Patel,
  • Dileep Labana,
  • Sharnil Pandya,
  • Kirit Modi,
  • Hemant Ghayvat and
  • Muhammad Awais

18 December 2020

Human Action Recognition (HAR) is the classification of an action performed by a human. The goal of this study was to recognize human actions in action video sequences. We present a novel feature descriptor for HAR that involves multiple features and...

  • Article
  • Open Access
1,318 Views
20 Pages

10 April 2025

In video human action-recognition tasks, motion tempo describes the dynamic patterns and temporal scales of human motion. Different categories of actions are typically composed of sub-actions with varying motion tempos. Effectively capturing sub-acti...

  • Article
  • Open Access
1,819 Views
15 Pages

22 April 2025

Large-scale Visual-Language Models have demonstrated powerful adaptability in video recognition tasks. However, existing methods typically rely on fine-tuning or text prompt tuning. In this paper, we propose a visual-only prompting method that employ...

  • Article
  • Open Access
19 Citations
5,631 Views
21 Pages

15 July 2020

Modeling spatiotemporal representations is one of the most essential yet challenging issues in video action recognition. Existing methods lack the capacity to accurately model either the correlations between spatial and temporal features or the globa...

  • Article
  • Open Access
2 Citations
4,536 Views
22 Pages

9 September 2022

Depth video sequence-based deep models for recognizing human actions are scarce compared to RGB and skeleton video sequences-based models. This scarcity limits the research advancements based on depth data, as training deep models with small-scale da...

  • Article
  • Open Access
34 Citations
6,891 Views
12 Pages

20 May 2020

Existing public domain multi-modal datasets for human action recognition only include actions of interest that have already been segmented from action streams. These datasets cannot be used to study a more realistic action recognition scenario where...

  • Article
  • Open Access
2 Citations
2,596 Views
18 Pages

18 June 2022

Incorporating multi-modality data is an effective way to improve action recognition performance. Based on this idea, we investigate a new data modality in which Whole-Body Keypoint and Skeleton (WKS) labels are used to capture refined body informatio...

  • Article
  • Open Access
5 Citations
5,106 Views
20 Pages

29 September 2023

This study introduces an optimal topology of vision transformers for real-time video action recognition in a cloud-based solution. Although model performance is a key criterion for real-time video analysis use cases, inference latency plays a more cr...

  • Review
  • Open Access
19 Citations
7,607 Views
19 Pages

An Overview of the Vision-Based Human Action Recognition Field

  • Fernando Camarena,
  • Miguel Gonzalez-Mendoza,
  • Leonardo Chang and
  • Ricardo Cuevas-Ascencio

Artificial intelligence’s rapid advancement has enabled various applications, including intelligent video surveillance systems, assisted living, and human–computer interaction. These applications often require one core task: video-based h...

  • Systematic Review
  • Open Access
17 Citations
5,892 Views
46 Pages

Visual Feature Learning on Video Object and Human Action Detection: A Systematic Review

  • Dengshan Li,
  • Rujing Wang,
  • Peng Chen,
  • Chengjun Xie,
  • Qiong Zhou and
  • Xiufang Jia

31 December 2021

Video object and human action detection are applied in many fields, such as video surveillance, face recognition, etc. Video object detection includes object classification and object location within the frame. Human action recognition is the detecti...

  • Article
  • Open Access
9 Citations
4,566 Views
17 Pages

6 January 2024

Behavioral analysis of animals in the wild plays an important role for ecological research and conservation and has been mostly performed by researchers. We introduce an action detection approach that automates this process by detecting animals and p...

  • Feature Paper
  • Article
  • Open Access
11 Citations
7,429 Views
12 Pages

Best Frame Selection to Enhance Training Step Efficiency in Video-Based Human Action Recognition

  • Abdorreza Alavi Gharahbagh,
  • Vahid Hajihashemi,
  • Marta Campos Ferreira,
  • José J. M. Machado and
  • João Manuel R. S. Tavares

10 February 2022

In recent years, with the growth of digital media and modern imaging equipment, the use of video processing algorithms and semantic film and image management has expanded. The usage of different video datasets in training artificial intelligence algo...

  • Article
  • Open Access
1,788 Views
23 Pages

Traffic Accident Rescue Action Recognition Method Based on Real-Time UAV Video

  • Bo Yang,
  • Jianan Lu,
  • Tao Liu,
  • Bixing Zhang,
  • Chen Geng,
  • Yan Tian and
  • Siyu Zhang

24 July 2025

Low-altitude drones, which are unimpeded by traffic congestion or urban terrain, have become a critical asset in emergency rescue missions. To address the current lack of emergency rescue data, UAV aerial videos were collected to create an experiment...

  • Article
  • Open Access
9 Citations
2,607 Views
18 Pages

8 September 2023

Compared with traditional methods, the action recognition model based on 3D convolutional deep neural network captures spatio-temporal features more accurately, resulting in higher accuracy. However, the large number of parameters and computational r...

  • Article
  • Open Access
2 Citations
4,161 Views
23 Pages

26 December 2022

Recently, Transformer-based video recognition models have achieved state-of-the-art results on major video recognition benchmarks. However, their high inference cost significantly limits research speed and practical use. In video compression, methods...

  • Article
  • Open Access
8 Citations
6,866 Views
23 Pages

In this paper, we describe how information obtained from multiple views usinga network of cameras can be effectively combined to yield a reliable and fast humanactivity recognition system. First, we present a score-based fusion technique for combinin...

  • Proceeding Paper
  • Open Access
1 Citations
1,578 Views
7 Pages

15 November 2023

Beyond traditional surveillance applications, sensor-based human action recognition and segmentation responds to a growing demand in the health and safety sector. Recently, skeletal action recognition has largely been dominated by spatio-temporal gra...

  • Article
  • Open Access
1,788 Views
14 Pages

Separable ConvNet Spatiotemporal Mixer for Action Recognition

  • Hsu-Yung Cheng,
  • Chih-Chang Yu and
  • Chenyu Li

Video action recognition is vital in the research area of computer vision. In this paper, we develop a novel model, named Separable ConvNet Spatiotemporal Mixer (SCSM). Our goal is to develop an efficient and lightweight action recognition backbone t...

  • Article
  • Open Access
581 Views
25 Pages

9 November 2025

Different action recognition tasks exhibit significant variations in their reliance on local versus global features. Particularly for long-video understanding, dynamically balancing the contributions of both has become a critical challenge for improv...

  • Article
  • Open Access
3 Citations
1,828 Views
24 Pages

Video Abnormal Behavior Recognition and Trajectory Prediction Based on Lightweight Skeleton Feature Extraction

  • Ling Wang,
  • Cong Ding,
  • Yifan Zhang,
  • Tie Hua Zhou,
  • Wei Ding,
  • Keun Ho Ryu and
  • Kwang Woo Nam

7 June 2024

Video action recognition based on skeleton nodes is a highlighted issue in the computer vision field. In real application scenarios, the large number of skeleton nodes and behavior occlusion problems between individuals seriously affect recognition s...

  • Article
  • Open Access
17 Citations
3,913 Views
14 Pages

2 February 2021

The potential benefits of recognising activities of daily living from video for active and assisted living have yet to be fully untapped. These technologies can be used for behaviour understanding, and lifelogging for caregivers and end users alike....

  • Article
  • Open Access
35 Citations
7,734 Views
18 Pages

Action Recognition by an Attention-Aware Temporal Weighted Convolutional Neural Network

  • Le Wang,
  • Jinliang Zang,
  • Qilin Zhang,
  • Zhenxing Niu,
  • Gang Hua and
  • Nanning Zheng

21 June 2018

Research in human action recognition has accelerated significantly since the introduction of powerful machine learning tools such as Convolutional Neural Networks (CNNs). However, effective and efficient methods for incorporation of temporal informat...

  • Article
  • Open Access
3 Citations
2,040 Views
14 Pages

A Dynamic Position Embedding-Based Model for Student Classroom Complete Meta-Action Recognition

  • Zhaoyu Shou,
  • Xiaohu Yuan,
  • Dongxu Li,
  • Jianwen Mo,
  • Huibing Zhang,
  • Jingwei Zhang and
  • Ziyong Wu

20 August 2024

The precise recognition of entire classroom meta-actions is a crucial challenge for the tailored adaptive interpretation of student behavior, given the intricacy of these actions. This paper proposes a Dynamic Position Embedding-based Model for Stude...

  • Article
  • Open Access
57 Citations
14,179 Views
15 Pages

Video-Based Human Activity Recognition Using Deep Learning Approaches

  • Guilherme Augusto Silva Surek,
  • Laio Oriel Seman,
  • Stefano Frizzo Stefenon,
  • Viviana Cocco Mariani and
  • Leandro dos Santos Coelho

13 July 2023

Due to its capacity to gather vast, high-level data about human activity from wearable or stationary sensors, human activity recognition substantially impacts people’s day-to-day lives. Multiple people and things may be seen acting in the video...

  • Article
  • Open Access
2,915 Views
24 Pages

13 October 2024

Secondary actions in vehicles are activities that drivers engage in while driving that are not directly related to the primary task of operating the vehicle. Secondary Action Recognition (SAR) in drivers is vital for enhancing road safety and minimiz...

  • Article
  • Open Access
98 Citations
11,539 Views
17 Pages

Human Activity Classification Using the 3DCNN Architecture

  • Roberta Vrskova,
  • Robert Hudec,
  • Patrik Kamencay and
  • Peter Sykora

17 January 2022

Interest in utilizing neural networks in a variety of scientific and academic studies and in industrial applications is increasing. In addition to the growing interest in neural networks, there is also a rising interest in video classification. Objec...

  • Article
  • Open Access
3 Citations
1,901 Views
16 Pages

8 December 2023

Human action recognition (HAR) as the most representative human-centred computer vision task is critical in human resource management (HRM), especially in human resource recruitment, performance appraisal, and employee training. Currently, prevailing...

  • Article
  • Open Access
22 Citations
7,737 Views
18 Pages

This paper focuses on image and video content analysis of handball scenes and applying deep learning methods for detecting and tracking the players and recognizing their activities. Handball is a team sport of two teams played indoors with the ball w...

  • Article
  • Open Access
3 Citations
3,574 Views
13 Pages

31 January 2024

Our approach to action recognition is grounded in the intrinsic coexistence of and complementary relationship between audio and visual information in videos. Going beyond the traditional emphasis on visual features, we propose a transformer-based net...

  • Article
  • Open Access
2,088 Views
17 Pages

21 August 2025

Human Action Recognition has seen significant advances through transformer-based architectures, yet achieving a nuanced understanding often requires fusing multiple data modalities. Standard models relying solely on RGB video can struggle with action...

  • Article
  • Open Access
2,971 Views
21 Pages

Variable Temporal Length Training for Action Recognition CNNs

  • Tan-Kun Li,
  • Kwok-Leung Chan and
  • Tardi Tjahjadi

25 May 2024

Most current deep learning models are suboptimal in terms of the flexibility of their input shape. Usually, computer vision models only work on one fixed shape used during training, otherwise their performance degrades significantly. For video-relate...

  • Article
  • Open Access
9 Citations
3,861 Views
28 Pages

Recognizing human actions through video analysis has gained significant attention in applications like surveillance, sports analytics, and human–computer interaction. While deep learning models such as 3D convolutional neural networks (CNNs) an...

  • Article
  • Open Access
25 Citations
5,847 Views
25 Pages

14 April 2021

The increasing demand for surveillance systems has resulted in an unprecedented rise in the volume of video data being generated daily. The volume and frequency of the generation of video streams make it both impractical as well as inefficient to man...

of 6