82 Results Found

  • Article
  • Open Access
2 Citations
2,841 Views
21 Pages

26 September 2024

Recently, Vision Transformers (ViTs) have been actively applied to fine-grained visual recognition (FGVR). ViT can effectively model the interdependencies between patch-divided object regions through an inherent self-attention mechanism. In addition,...

  • Article
  • Open Access
5 Citations
3,478 Views
23 Pages

While visual appearances play a main role in recognizing the concepts captured in images, additional information can provide complementary information for fine-grained image recognition, where concepts with similar visual appearances such as species...

  • Article
  • Open Access
2 Citations
2,082 Views
20 Pages

Attention-Based Spatiotemporal-Aware Network for Fine-Grained Visual Recognition

  • Yili Ren,
  • Ruidong Lu,
  • Guan Yuan,
  • Dashuai Hao and
  • Hongjue Li

2 September 2024

On public benchmarks, current macro facial expression recognition technologies have achieved significant success. However, in real-life scenarios, individuals may attempt to conceal their true emotions. Conventional expression recognition often overl...

  • Article
  • Open Access
122 Views
17 Pages

13 January 2026

Fine-grained action detection remains a challenging and actively studied problem. While previous methods predominantly rely on convolutional neural networks (CNNs), their limited modeling capacity and high computational cost restrict their effectiven...

  • Article
  • Open Access
2 Citations
3,021 Views
12 Pages

16 July 2021

Vegetable and fruit recognition can be considered as a fine-grained visual categorization (FGVC) task, which is challenging due to the large intraclass variances and small interclass variances. A mainstream direction to address the challenge is to ex...

  • Article
  • Open Access
68 Citations
9,163 Views
18 Pages

MultiCAM: Multiple Class Activation Mapping for Aircraft Recognition in Remote Sensing Images

  • Kun Fu,
  • Wei Dai,
  • Yue Zhang,
  • Zhirui Wang,
  • Menglong Yan and
  • Xian Sun

6 March 2019

Aircraft recognition in remote sensing images has long been a meaningful topic. Most related methods treat entire images as a whole and do not concentrate on the features of parts. In fact, a variety of aircraft types have small interclass variance,...

  • Article
  • Open Access
7 Citations
3,868 Views
19 Pages

13 April 2023

Multi-scale feature fusion techniques and covariance pooling have been shown to have positive implications for completing computer vision tasks, including fine-grained image classification. However, existing algorithms that use multi-scale feature fu...

  • Article
  • Open Access
3 Citations
3,241 Views
17 Pages

30 September 2021

Accurate identification of insect pests is the key to improve crop yield and ensure quality and safety. However, under the influence of environmental conditions, the same kind of pests show obvious differences in intraclass representation, while the...

  • Article
  • Open Access
4 Citations
2,259 Views
16 Pages

27 September 2022

In MOOC learning, learners’ emotions have an important impact on the learning effect. In order to solve the problem that learners’ emotions are not obvious in the learning process, we propose a method to identify learner emotion by combin...

  • Article
  • Open Access
2 Citations
1,667 Views
16 Pages

Pairwise Guided Multilayer Cross-Fusion Network for Bird Image Recognition

  • Jingsheng Lei,
  • Yao Jin,
  • Liya Huang,
  • Yuan Ji and
  • Shengying Yang

9 September 2023

Bird identification is the first step in collecting data on bird diversity and abundance, which also helps research on bird distribution and population measurements. Most research has built end-to-end training models for bird detection task via CNNs...

  • Article
  • Open Access
6 Citations
1,782 Views
25 Pages

4 April 2025

Smart fisheries, integrating advanced technologies such as the Internet of Things (IoT), artificial intelligence (AI), and image processing, are pivotal in enhancing aquaculture efficiency, sustainability, and resource management by enabling real-tim...

  • Article
  • Open Access
6 Citations
2,765 Views
14 Pages

Two-Branch Attention Learning for Fine-Grained Class Incremental Learning

  • Jiaqi Guo,
  • Guanqiu Qi,
  • Shuiqing Xie and
  • Xiangyuan Li

1 December 2021

As a long-standing research area, class incremental learning (CIL) aims to effectively learn a unified classifier along with the growth of the number of classes. Due to the small inter-class variances and large intra-class variances, fine-grained vis...

  • Article
  • Open Access
5 Citations
3,925 Views
14 Pages

Artificial intelligence research in natural language processing in the context of poetry struggles with the recognition of holistic content such as poetic symbolism, metaphor, and other fine-grained attributes. Given these challenges, multi-modal ima...

  • Article
  • Open Access
29 Citations
7,376 Views
14 Pages

Temporal and Fine-Grained Pedestrian Action Recognition on Driving Recorder Database

  • Hirokatsu Kataoka,
  • Yutaka Satoh,
  • Yoshimitsu Aoki,
  • Shoko Oikawa and
  • Yasuhiro Matsui

20 February 2018

The paper presents an emerging issue of fine-grained pedestrian action recognition that induces an advanced pre-crash safety to estimate a pedestrian intention in advance. The fine-grained pedestrian actions include visually slight differences (e.g.,...

  • Article
  • Open Access
1,740 Views
21 Pages

Local Diversity-Guided Weakly Supervised Fine-Grained Image Classification Method

  • Yuebo Meng,
  • Xianglong Luo,
  • Hua Zhan,
  • Bo Wang,
  • Shilong Su and
  • Guanghui Liu

25 February 2025

For fine-grained recognition, capturing distinguishable features and effectively utilizing local information play a key role, since the objects of recognition exhibit subtle differences in different subcategories. Finding subtle differences between s...

  • Feature Paper
  • Article
  • Open Access
1,477 Views
23 Pages

Learning High-Order Features for Fine-Grained Visual Categorization with Causal Inference

  • Yuhang Zhang,
  • Yuan Wan,
  • Jiahui Hao,
  • Zaili Yang and
  • Huanhuan Li

19 April 2025

Recently, causal models have gained significant attention in natural language processing (NLP) and computer vision (CV) due to their capability of capturing features with causal relationships. This study addresses Fine-Grained Visual Categorization (...

  • Article
  • Open Access
3 Citations
2,221 Views
20 Pages

15 September 2023

Vehicle make and model recognition (VMMR) is an important aspect of intelligent transportation systems (ITS). In VMMR systems, surveillance cameras capture vehicle images for real-time vehicle detection and recognition. These captured images pose cha...

  • Article
  • Open Access
3 Citations
3,246 Views
22 Pages

A Fine-Grained Car Recognition Method Based on a Lightweight Attention Network and Regularized Fine-Tuning

  • Cheng Zhang,
  • Qiaochu Li,
  • Chang Liu,
  • Yi Zhang,
  • Ding Zhao,
  • Chao Ji and
  • Jin Wang

Car fine recognition is a typical scenario for fine-grained image classification, which has great research and application value in both civilian and military fields. However, current research on fine-grained classification is often limited to improv...

  • Technical Note
  • Open Access
4 Citations
2,069 Views
13 Pages

Remote Sensing Image Harmonization Method for Fine-Grained Ship Classification

  • Jingpu Zhang,
  • Ziyan Zhong,
  • Xingzhuo Wei,
  • Xianyun Wu and
  • Yunsong Li

17 June 2024

Target recognition and fine-grained ship classification in remote sensing face challenges of high inter-class similarity and sample scarcity. A transfer fusion-based ship image harmonization algorithm is proposed to overcome these challenges. This al...

  • Article
  • Open Access
8 Citations
4,147 Views
12 Pages

17 May 2022

Fine-grained image classification is a challenging computer visual task due to the small interclass variations and large intra-class variations. Extracting expressive feature representation is an effective way to improve the accuracy of fine-grained...

  • Article
  • Open Access
501 Views
14 Pages

YOLOv13-SwinTongue: Tongue Coating Diagnosis Using an Enhanced YOLOv13 with Swin Transformer

  • Xiangqiang Yang,
  • Jinchao Hao,
  • Yonggang Wang,
  • Yunfeng Man,
  • Renjie Yang and
  • Qinge Wu

29 December 2025

Tongue coating is a crucial diagnostic indicator in traditional Chinese medicine, intuitively reflecting the body’s physiological and pathological conditions. However, traditional visual inspection methods are highly susceptible to subjective b...

  • Article
  • Open Access
817 Views
22 Pages

12 August 2025

Fine-grained aircraft classification in remote sensing is a critical task within the field of remote sensing image processing, aiming to precisely distinguish between different types of aircraft in aerial images. Due to the high visual similarity amo...

  • Article
  • Open Access
16 Citations
4,752 Views
15 Pages

11 May 2019

The focus of fine-grained image classification tasks is to ignore interference information and grasp local features. This challenge is what the visual attention mechanism excels at. Firstly, we have constructed a two-level attention convolutional net...

  • Article
  • Open Access
1 Citation
3,145 Views
17 Pages

27 June 2023

Fine-grained image classification remains an ongoing challenge in the computer vision field, which is particularly intended to identify objects within sub-categories. It is a difficult task since there is both minimal and substantial intra-class vari...

  • Article
  • Open Access
1,144 Views
19 Pages

A Scene Knowledge Integrating Network for Transmission Line Multi-Fitting Detection

  • Xinhang Chen,
  • Xinsheng Xu,
  • Jing Xu,
  • Wenjie Zheng and
  • Qianming Wang

23 December 2024

Aiming at the severe occlusion problem and the tiny-scale object problem in the multi-fitting detection task, the Scene Knowledge Integrating Network (SKIN), including the scene filter module (SFM) and scene structure information module (SSIM) is pro...

  • Article
  • Open Access
2 Citations
2,440 Views
21 Pages

Recognition of Occluded Goods under Prior Inference Based on Generative Adversarial Network

  • Mingxuan Cao,
  • Kai Xie,
  • Feng Liu,
  • Bohao Li,
  • Chang Wen,
  • Jianbiao He and
  • Wei Zhang

22 March 2023

Aiming at the recognition of intelligent retail dynamic visual container goods, two problems that lead to low recognition accuracy must be addressed; one is the lack of goods features caused by the occlusion of the hand, and the other is the high sim...

  • Article
  • Open Access
7 Citations
2,642 Views
19 Pages

5 February 2021

Egocentric activity recognition in first-person video (FPV) requires fine-grained matching of the camera wearer’s action and the objects being operated. The traditional method used for third-person action recognition does not suffice because of (1) t...

  • Article
  • Open Access
104 Citations
10,948 Views
30 Pages

With the development of advanced information and intelligence technologies, precision agriculture has become an effective solution to monitor and prevent crop pests and diseases. However, pest and disease recognition in precision agriculture applicat...

  • Article
  • Open Access
803 Views
22 Pages

ACE-Net: A Fine-Grained Deepfake Detection Model with Multimodal Emotional Consistency

  • Shaoqian Yu,
  • Xingyu Chen,
  • Yuzhe Sheng,
  • Han Zhang,
  • Xinlong Li and
  • Sijia Yu

13 November 2025

The alarming realism of Deepfake presents a significant challenge to digital authenticity, yet its inherent difficulty in synchronizing the emotional cues between facial expressions and speech offers a critical opportunity for detection. However, mos...

  • Article
  • Open Access
1 Citation
1,877 Views
13 Pages

4 July 2023

The analysis of thin sections for lithology identification is a staple technique in geology. Although recent strides in deep learning have catalyzed the development of models for thin section recognition leveraging varied deep neural networks, there...

  • Article
  • Open Access
474 Views
19 Pages

14 November 2025

Zero-shot action recognition remains challenging due to the visual–semantic gap and the persistent bias toward seen classes, particularly under the generalized setting where both seen and unseen categories appear during inference. To address th...

  • Article
  • Open Access
1 Citation
1,202 Views
17 Pages

16 June 2025

Predicting the motion of handwritten digits in video sequences is challenging due to complex spatiotemporal dependencies, variable writing styles, and the need to preserve fine-grained visual details—all of which are essential for real-time han...

  • Article
  • Open Access
378 Views
29 Pages

VT-MFLV: Vision–Text Multimodal Feature Learning V Network for Medical Image Segmentation

  • Wenju Wang,
  • Jiaqi Li,
  • Zinuo Ye,
  • Yuyang Cai,
  • Zhen Wang and
  • Renwei Zhang

28 November 2025

Currently, existing multimodal segmentation methods face limitations in effectively leveraging medical text to guide visual feature learning. They often suffer from insufficient multimodal fusion and inadequate accuracy in fine-grained lesion segment...

  • Article
  • Open Access
4 Citations
2,917 Views
21 Pages

18 November 2022

The development of effective and comprehensive methods for mapping and monitoring reservoirs is essential for the utilization of water resources and flood control. Remote sensing has the great advantages of broad spatial coverage and regular revisit...

  • Article
  • Open Access
595 Views
17 Pages

Fine-Grained Image Recognition with Bio-Inspired Gradient-Aware Attention

  • Bing Ma,
  • Junyi Li,
  • Zhengbei Jin,
  • Wei Zhang,
  • Xiaohui Song and
  • Beibei Jin

12 December 2025

Fine-grained image recognition is one of the key tasks in the field of computer vision. However, due to subtle inter-class differences and significant intra-class differences, it still faces severe challenges. Conventional approaches often struggle w...

  • Article
  • Open Access
1 Citation
1,396 Views
21 Pages

1 September 2025

Phytoplankton plays a pivotal role in marine ecosystems and global biogeochemical cycles. Accurate identification and monitoring of phytoplankton are essential for understanding environmental dynamics and climate variations. Despite the significant p...

  • Article
  • Open Access
432 Views
21 Pages

18 December 2025

Micro-expression recognition (MER), as an important branch of intelligent visual sensing, enables the analysis of subtle facial movements for applications in emotion understanding, human–computer interaction and security monitoring. However, ex...

  • Article
  • Open Access
2,363 Views
17 Pages

12 August 2025

Person re-identification (Re-ID) has attracted considerable attention in the field of computer vision, primarily due to its critical role in video surveillance and public security applications. However, most existing Re-ID approaches rely on image-le...

  • Article
  • Open Access
1,134 Views
25 Pages

Hierarchical Deep Learning Model for Identifying Similar Targets in UAV Imagery

  • Dmytro Borovyk,
  • Oleksander Barmak,
  • Pavlo Radiuk and
  • Iurii Krak

25 October 2025

Accurate object detection in UAV imagery is critical for situational awareness, yet conventional deep learning models often struggle to distinguish between visually similar targets. To address this challenge, this study introduces a hierarchical deep...

  • Article
  • Open Access
2 Citations
1,915 Views
19 Pages

10 October 2025

Weakly Supervised Video Anomaly Detection (WSVAD) is a critical task in computer vision. It aims to localize and recognize abnormal behaviors using only video-level labels. Without frame-level annotations, it becomes significantly challenging to mode...

  • Review
  • Open Access
40 Citations
22,684 Views
44 Pages

9 September 2024

The dynamic expressions of emotion convey both the emotional and functional states of an individual’s interactions. Recognizing the emotional states helps us understand human feelings and thoughts. Systems and frameworks designed to recognize h...

  • Article
  • Open Access
3 Citations
2,898 Views
19 Pages

Does De-Iconization Affect Visual Recognition of Russian and English Iconic Words?

  • Yulia Lavitskaya,
  • Yulia Sedelkina,
  • Elizaveta Korotaevskaya,
  • Liubov Tkacheva,
  • Maria Flaksman and
  • Andrey Nasledov

Iconic words constitute an integral part of the lexicon of a language, exhibiting form-meaning resemblance. Over the course of time, semantic and phonetic transformations “weaken” the degree of iconicity of a word. This iconicity loss is...

  • Article
  • Open Access
360 Views
24 Pages

NovAc-DL: Novel Activity Recognition Based on Deep Learning in the Real-Time Environment

  • Saksham Singla,
  • Sheral Singla,
  • Karan Singla,
  • Priya Kansal,
  • Sachin Kansal,
  • Alka Bishnoi and
  • Jyotindra Narayan

Real-time fine-grained human activity recognition (HAR) remains a challenging problem due to rapid spatial–temporal variations, subtle motion differences, and dynamic environmental conditions. Addressing this difficulty, we propose NovAc-DL, a...

  • Article
  • Open Access
2,548 Views
24 Pages

In this work, the utility of multimodal vision–language models (VLMs) for visual product understanding in e-commerce is investigated, focusing on two complementary models: ColQwen2 (vidore/colqwen2-v1.0) and ColPali (vidore/colpali-v1.2-hf). Th...

  • Article
  • Open Access
911 Views
24 Pages

Facial expression plays an important role in human–computer interaction and affective computing. However, existing expression recognition methods cannot effectively capture multi-scale structural details contained in facial expressions, leading...

  • Article
  • Open Access
1,123 Views
20 Pages

Fast Normalization for Bilinear Pooling via Eigenvalue Regularization

  • Sixiang Xu,
  • Huihui Dong,
  • Chen Zhang and
  • Chaoxue Wang

10 April 2025

Bilinear pooling, as an aggregation approach that outputs second-order statistics of deep learning features, has demonstrated effectiveness in a wide range of visual recognition tasks. Among major improvements on the bilinear pooling, matrix square r...

  • Article
  • Open Access
264 Views
22 Pages

A Ship Incremental Recognition Framework via Unknown Extraction and Joint Optimization Learning

  • Yugao Li,
  • Guangzhen Bao,
  • Jianming Hu,
  • Xiyang Zhi,
  • Tianyi Hu,
  • Junjie Wang and
  • Wenbo Wu

2 January 2026

With the rapid growth of the marine economy and the increasing demand for maritime security, ship target detection has become critically important in both military and civilian applications. However, in complex remote sensing scenarios, challenges su...

  • Article
  • Open Access
6 Citations
3,283 Views
19 Pages

9 February 2023

Lithology identification is the basis for sweet spot evaluation, prediction, and precise exploratory deployment and has important guiding significance for areas with low exploration degrees. The lithology of the shale strata, which are composed of fi...

  • Article
  • Open Access
331 Views
12 Pages

Analytical Modeling of Hybrid CNN-Transformer Dynamics for Emotion Classification

  • Ergashevich Halimjon Khujamatov,
  • Mirjamol Abdullaev and
  • Sabina Umirzakova

25 December 2025

Facial expression recognition (FER) is crucial for affective computing and human–computer interaction; however, it is still difficult to achieve under various conditions in the real world, such as lighting, occlusion, and pose. This work presen...

  • Article
  • Open Access
1,911 Views
21 Pages

24 September 2025

Timely detection of road surface defects such as cracks and potholes is critical for ensuring traffic safety and reducing infrastructure maintenance costs. While recent advances in image-based deep learning techniques have shown promise for automated...
