Skip Content
You are currently on the new version of our website. Access the old version .

22 Results Found

  • Article
  • Open Access
719 Views
17 Pages

16 October 2025

Recent advances in exoskeleton-assisted rehabilitation have highlighted the significance of lower limb movement intention recognition through deep learning. However, discrete motion phase classification and continuous real-time joint kinematics estim...

  • Article
  • Open Access
4 Citations
2,206 Views
17 Pages

Learnable Anchor Embedding for Asymmetric Face Recognition

  • Jungyun Kim,
  • Tiong-Sik Ng and
  • Andrew Beng Jin Teoh

Face verification and identification traditionally follow a symmetric matching approach, where the same model (e.g., ResNet-50 vs. ResNet-50) generates embeddings for both gallery and query images, ensuring compatibility. However, real-world scenario...

  • Article
  • Open Access
2 Citations
2,099 Views
16 Pages

Query-Based Object Visual Tracking with Parallel Sequence Generation

  • Chang Liu,
  • Bin Zhang,
  • Chunjuan Bo and
  • Dong Wang

24 July 2024

Query decoders have been shown to achieve good performance in object detection. However, they suffer from insufficient object tracking performance. Sequence-to-sequence learning in this context has recently been explored, with the idea of describing...

  • Article
  • Open Access
7 Citations
5,609 Views
16 Pages

30 December 2023

Multiple object tracking (MOT) plays an important role in intelligent video-processing tasks, which aims to detect and track all moving objects in a scene. Joint-detection-and-tracking (JDT) methods are thriving in MOT tasks, because they accomplish...

  • Article
  • Open Access
1,698 Views
18 Pages

Dense Object Detection Based on De-Homogenized Queries

  • Yueming Huang,
  • Chenrui Ma,
  • Hao Zhou,
  • Hao Wu and
  • Guowu Yuan

Dense object detection is widely used in automatic driving, video surveillance, and other fields. This paper focuses on the challenging task of dense object detection. Currently, detection methods based on greedy algorithms, such as non-maximum suppr...

  • Article
  • Open Access
1 Citations
1,404 Views
20 Pages

E-InMeMo: Enhanced Prompting for Visual In-Context Learning

  • Jiahao Zhang,
  • Bowen Wang,
  • Hong Liu,
  • Liangzhi Li,
  • Yuta Nakashima and
  • Hajime Nagahara

Large-scale models trained on extensive datasets have become the standard due to their strong generalizability across diverse tasks. In-context learning (ICL), widely used in natural language processing, leverages these models by providing task-speci...

  • Article
  • Open Access
1,365 Views
12 Pages

30 July 2025

Sensing transparent objects has many applications in human daily life, including robot navigation and grasping. However, this task presents significant challenges due to the unpredictable nature of scenes that extend beyond/behind transparent objects...

  • Article
  • Open Access
2,112 Views
17 Pages

Dichotomy Graph Sketch: Summarizing Graph Streams with High Accuracy Based on Deep Learning

  • Ding Li,
  • Wenzhong Li,
  • Guoqiang Zhang,
  • Yizhou Chen,
  • Xu Zhong,
  • Mingkai Lin and
  • Sanglu Lu

16 December 2023

In many applications, data streams are indispensable to describe the relationships between nodes in networks, such as social networks, computer networks, and hyperlink networks. Fundamentally, a graph stream is a dynamic representation of a graph, wh...

  • Article
  • Open Access
16 Citations
3,568 Views
16 Pages

21 May 2024

Existing point cloud feature learning networks often learn high-semantic point features representing the global context by incorporating sampling, neighborhood grouping, neighborhood-wise feature learning, and feature aggregation. However, this proce...

  • Article
  • Open Access
83 Views
26 Pages

23 January 2026

Accurate spatiotemporal forecasting underpins high-stakes decision making in smart urban systems, from traffic control and energy scheduling to environment monitoring. Yet two persistent gaps limit current models: (i) spatial modules are often biased...

  • Article
  • Open Access
2 Citations
2,706 Views
17 Pages

9 January 2025

Universal image segmentation aims to handle all segmentation tasks within a single model architecture and ideally requires only one training phase. To achieve task-conditioned joint training, a task token needs to be used in the multi-task training t...

  • Article
  • Open Access
6 Citations
5,027 Views
14 Pages

28 February 2024

Object detection is a fundamental task of remote-sensing image processing. Most existing object detection detectors handle regression and classification tasks through learning from a fixed set of learnable anchors or queries. To simplify object candi...

  • Article
  • Open Access
2 Citations
2,943 Views
19 Pages

Graph neural networks have a wide range of applications, such as citation networks, social networks, and knowledge graphs. Among various graph analyses, node classification has garnered much attention. While many of the recent network embedding model...

  • Article
  • Open Access
90 Citations
12,727 Views
21 Pages

Few-Shot Classification of Aerial Scene Images via Meta-Learning

  • Pei Zhang,
  • Yunpeng Bai,
  • Dong Wang,
  • Bendu Bai and
  • Ying Li

31 December 2020

Convolutional neural network (CNN) based methods have dominated the field of aerial scene classification for the past few years. While achieving remarkable success, CNN-based methods suffer from excessive parameters and notoriously rely on large amou...

  • Feature Paper
  • Article
  • Open Access
60 Citations
7,707 Views
20 Pages

A Comparative Analysis of Active Learning for Biomedical Text Mining

  • Usman Naseem,
  • Matloob Khushi,
  • Shah Khalid Khan,
  • Kamran Shaukat and
  • Mohammad Ali Moni

An enormous amount of clinical free-text information, such as pathology reports, progress reports, clinical notes and discharge summaries have been collected at hospitals and medical care clinics. These data provide an opportunity of developing many...

  • Article
  • Open Access
1,607 Views
24 Pages

20 October 2025

Packing approaches enhance training efficiency by filling the padding space in each batch with shorter sequences, thereby reducing the total number of batches per epoch. This approach has proven effective in both pre-training and supervised fine-tuni...

  • Article
  • Open Access
294 Views
22 Pages

Research on Propeller Defect Diagnosis of Rotor UAVs Based on MDI-STFFNet

  • Beining Cui,
  • Dezhi Jiang,
  • Xinyu Wang,
  • Lv Xiao,
  • Peisen Tan,
  • Yanxia Li and
  • Zhaobin Tan

19 December 2025

To address flight safety risks from rotor defects in rotorcraft drones operating in complex low-altitude environments, this study proposes a high-precision diagnostic model based on the Multimodal Data Input and Spatio-Temporal Feature Fusion Network...

  • Article
  • Open Access
7 Citations
3,519 Views
20 Pages

28 October 2022

Natural imagery segmentation has been transferred to land cover classification in remote sensing imagery with excellent performance. However, two key issues have been overlooked in the transfer process: (1) some objects were easily overwhelmed by the...

  • Article
  • Open Access
13 Citations
5,049 Views
20 Pages

Cross-Modal Retrieval and Semantic Refinement for Remote Sensing Image Captioning

  • Zhengxin Li,
  • Wenzhe Zhao,
  • Xuanyi Du,
  • Guangyao Zhou and
  • Songlin Zhang

3 January 2024

Two-stage remote sensing image captioning (RSIC) methods have achieved promising results by incorporating additional pre-trained remote sensing tasks to extract supplementary information and improve caption quality. However, these methods face limita...

  • Article
  • Open Access
1,610 Views
27 Pages

Dynamic Asymmetric Attention for Enhanced Reasoning and Interpretability in LLMs

  • Feng Wen,
  • Xiaoming Lu,
  • Haikun Yu,
  • Chunyang Lu,
  • Huijie Li and
  • Xiayang Shi

12 August 2025

The remarkable success of autoregressive Large Language Models (LLMs) is predicated on the causal attention mechanism, which enforces a static and rigid form of informational asymmetry by permitting each token to attend only to its predecessors. Whil...

  • Article
  • Open Access
22 Citations
5,728 Views
37 Pages

19 June 2022

Virtual Reality (VR) has been adopted as a leading technology for the metaverse, yet most previous VR systems provide one-size-fits-all experiences to users. Context-awareness in VR enables personalized experiences in the metaverse, such as improved...

  • Article
  • Open Access
1 Citations
993 Views
26 Pages

2 November 2025

Reliable fault diagnosis of milling machines is essential for maintaining operational stability and cost-effective maintenance; however, it remains challenging due to limited labeled data and the highly non-stationary nature of acoustic emission (AE)...