You are currently on the new version of our website. Access the old version .

48 Results Found

  • Article
  • Open Access
3 Citations
4,114 Views
19 Pages

2 December 2024

This research presents a retrospective analysis of zero-shot object detectors in automating image labeling for eyeglasses detection. The increasing demand for high-quality annotations in object detection is being met by AI foundation models with open...

  • Article
  • Open Access
5 Citations
2,877 Views
14 Pages

The Kuwaiti dialect is a particular dialect of Arabic spoken in Kuwait; it differs significantly from standard Arabic and the dialects of neighboring countries in the same region. Few research papers with a focus on the Kuwaiti dialect have been publ...

  • Article
  • Open Access
11 Citations
2,905 Views
12 Pages

Zero-Shot Learning (ZSL) is related to training machine learning models capable of classifying or predicting classes (labels) that are not involved in the training set (unseen classes). A well-known problem in Deep Learning (DL) is the requirement fo...

  • Article
  • Open Access
4 Citations
1,759 Views
21 Pages

7 June 2024

Species recognition is a crucial part of understanding the abundance and distribution of various organisms and is important for biodiversity conservation and management. Traditional vision-based deep learning-driven species recognition requires large...

  • Article
  • Open Access
1 Citations
2,427 Views
19 Pages

Embedded Zero-Shot Image Classification Based on Bidirectional Feature Mapping

  • Huadong Sun,
  • Zhibin Zhen,
  • Yinghui Liu,
  • Xu Zhang,
  • Xiaowei Han and
  • Pengyi Zhang

17 June 2024

The zero-shot image classification technique aims to explore the semantic information shared between seen and unseen classes through visual features and auxiliary information and, based on this semantic information, to complete the knowledge migratio...

  • Article
  • Open Access
3 Citations
2,171 Views
27 Pages

5 June 2025

Automated food safety inspection systems rely heavily on the visual detection of contamination, spoilage, and foreign objects in food products. Current approaches typically require extensive labeled training data for each specific hazard type, limiti...

  • Article
  • Open Access
2,486 Views
21 Pages

17 May 2024

Video classification is a challenging task in computer vision that requires analyzing the content of a video to assign it to one or more predefined categories. However, due to the vast amount of visual data contained in videos, the classification pro...

  • Article
  • Open Access
21 Citations
7,397 Views
22 Pages

26 April 2021

With the rapid developments of hyperspectral imaging, the cost of collecting hyperspectral data has been lower, while the demand for reliable and detailed hyperspectral annotations has been much more substantial. However, limited by the difficulties...

  • Article
  • Open Access
10 Citations
3,280 Views
18 Pages

11 September 2022

Remote sensing image scene classification takes image blocks as classification units and predicts their semantic descriptors. Because it is difficult to obtain enough labeled samples for all classes of remote sensing image scenes, zero-shot classific...

  • Article
  • Open Access
719 Views
33 Pages

Hybrid LLM-Assisted Fault Diagnosis Framework for 5G/6G Networks Using Real-World Logs

  • Aymen D. Salman,
  • Akram T. Zeyad,
  • Shereen S. Jumaa,
  • Safanah M. Raafat,
  • Fanan Hikmat Jasim and
  • Amjad J. Humaidi

12 December 2025

This paper presents Hy-LIFT (Hybrid LLM-Integrated Fault Diagnosis Toolkit), a multi-stage framework for interpretable and data-efficient fault diagnosis in 5G/6G networks that integrates a high-precision interpretable rule-based engine (IRBE) for kn...

  • Article
  • Open Access
2 Citations
5,518 Views
20 Pages

This study introduces a novel AI-driven approach to support elderly patients in Thailand with medication management, focusing on accurate drug label interpretation. Two model architectures were explored: a Two-Stage Optical Character Recognition (OCR...

  • Article
  • Open Access
1 Citations
963 Views
26 Pages

Accurate gender identification from written text is critical for author profiling, recommendation systems, and demographic analytics in digital ecosystems. This study introduces a scalable framework for gender classification in Turkish, combining con...

  • Article
  • Open Access
1 Citations
2,422 Views
17 Pages

Zero-Shot Day–Night Domain Adaptation for Face Detection Based on DAl-CLIP-Dino

  • Huadong Sun,
  • Yinghui Liu,
  • Ziyang Chen and
  • Pengyi Zhang

Two challenges in computer vision (CV) related to face detection are the difficulty of acquisition in the target domain and the degradation of image quality. Especially in low-light situations, the poor visibility of images is difficult to label, whi...

  • Feature Paper
  • Article
  • Open Access
26 Citations
7,104 Views
25 Pages

7 March 2020

Deep learning architectures are the most effective methods for analyzing and classifying Ultra-Spectral Images (USI). However, effective training of a Deep Learning (DL) gradient classifier aiming to achieve high classification accuracy, is extremely...

  • Article
  • Open Access
1,140 Views
25 Pages

In computer vision, the performance of detectors trained under optimal lighting conditions is significantly impaired when applied to infrared domains due to the scarcity of labeled infrared target domain data and the inherent degradation in infrared...

  • Article
  • Open Access
6 Citations
3,969 Views
16 Pages

Cross-Domain Fake News Detection Using a Prompt-Based Approach

  • Jawaher Alghamdi,
  • Yuqing Lin and
  • Suhuai Luo

8 August 2024

The proliferation of fake news poses a significant challenge in today’s information landscape, spanning diverse domains and topics and undermining traditional detection methods confined to specific domains. In response, there is a growing inter...

  • Article
  • Open Access
804 Views
15 Pages

8 September 2025

Compositional Zero-Shot Learning (CZSL) seeks to enable machines to recognize objects and attributes (i.e., primitives),learn their associations, and generalize to novel compositions, enabling systems to exhibit a human-like ability to infer and gene...

  • Article
  • Open Access
2 Citations
3,102 Views
40 Pages

With the growing demand for labeled textual data in Natural Language Processing (NLP), traditional data collection and annotation methods face significant challenges, such as high cost, limited scalability, and privacy constraints. This study present...

  • Article
  • Open Access
1 Citations
2,758 Views
32 Pages

28 November 2023

Emotion recognition is a vital task within Natural Language Processing (NLP) that involves automatically identifying emotions from text. As the need for specialized and nuanced emotion recognition models increases, the challenge of fine-grained emoti...

  • Article
  • Open Access
5 Citations
7,561 Views
35 Pages

Multimodal Data Fusion for Tabular and Textual Data: Zero-Shot, Few-Shot, and Fine-Tuning of Generative Pre-Trained Transformer Models

  • Shadi Jaradat,
  • Mohammed Elhenawy,
  • Richi Nayak,
  • Alexander Paz,
  • Huthaifa I. Ashqar and
  • Sebastien Glaser

7 April 2025

In traffic safety analysis, previous research has often focused on tabular data or textual crash narratives in isolation, neglecting the potential benefits of a hybrid multimodal approach. This study introduces the Multimodal Data Fusion (MDF) framew...

  • Article
  • Open Access
2,975 Views
13 Pages

10 February 2020

Visual relationship detection (VRD), a challenging task in the image understanding, suffers from vague connection between relationship patterns and visual appearance. This issue is caused by the high diversity of relationship-independent visual appea...

  • Article
  • Open Access
1 Citations
2,096 Views
16 Pages

An Adaptive Mixup Hard Negative Sampling for Zero-Shot Entity Linking

  • Shisen Cai,
  • Xi Wu,
  • Maihemuti Maimaiti,
  • Yichang Chen,
  • Zhixiang Wang and
  • Jiong Zheng

20 October 2023

Recently, the focus of entity linking research has centered on the zero-shot scenario, where the entity purposed to be labeled at the time of testing was never observed during the training phase, or it may belong to a different domain than the source...

  • Article
  • Open Access
2 Citations
3,310 Views
18 Pages

18 September 2025

Risk identification is a critical concern in the construction industry. In recent years, there has been a growing trend of applying artificial intelligence (AI) tools to detect risks from unstructured data sources such as news articles, social media,...

  • Article
  • Open Access
9 Citations
8,904 Views
13 Pages

7 March 2024

Object detection is a crucial research topic in the fields of computer vision and artificial intelligence, involving the identification and classification of objects within images. Recent advancements in deep learning technologies, such as YOLO (You...

  • Article
  • Open Access
3,366 Views
24 Pages

22 August 2024

The contrastive vision–language pre-trained model CLIP, driven by large-scale open-vocabulary image–text pairs, has recently demonstrated remarkable zero-shot generalization capabilities in diverse downstream image tasks, which has made n...

  • Article
  • Open Access
2 Citations
2,049 Views
25 Pages

29 July 2025

The proliferation of digital public service platforms and the expansion of e-government initiatives have significantly increased the volume and diversity of citizen-generated feedback. This trend emphasizes the need for classification systems that ar...

  • Article
  • Open Access
2,253 Views
21 Pages

Comparative Analysis of BERT and GPT for Classifying Crisis News with Sudan Conflict as an Example

  • Yahya Masri,
  • Zifu Wang,
  • Anusha Srirenganathan Malarvizhi,
  • Samir Ahmed,
  • Tayven Stover,
  • David W. S. Wong,
  • Yongyao Jiang,
  • Yun Li,
  • Qian Liu and
  • Chaowei Yang
  • + 3 authors

8 July 2025

To obtain actionable information for humanitarian and other emergency responses, an accurate classification of news or events is critical. Daily news and social media are hard to classify based on conveyed information, especially when multiple catego...

  • Article
  • Open Access
18 Citations
5,727 Views
21 Pages

19 August 2018

Most supervised classification methods for polarimetric synthetic aperture radar (PolSAR) data rely on abundant labeled samples, and cannot tackle the problem that categorizes or infers unseen land cover classes without training samples. Aiming to ca...

  • Article
  • Open Access
4 Citations
3,358 Views
17 Pages

4 July 2023

The research presented in the paper aims at increasing the capacity to identify security weaknesses in programming languages that are less supported by specialized security analysis tools, based on the knowledge gathered from securing the popular one...

  • Article
  • Open Access
1,491 Views
17 Pages

28 August 2025

Market indices, such as the S&P 500, serve as compressed representations of complex constituent-level dynamics. This study proposes a zero-shot forecasting framework capable of predicting index-level trajectories without direct supervision from i...

  • Article
  • Open Access
1,509 Views
29 Pages

DASeg: A Domain-Adaptive Segmentation Pipeline Using Vision Foundation Models—Earthquake Damage Detection Use Case

  • Huili Huang,
  • Andrew Zhang,
  • Danrong Zhang,
  • Max Mahdi Roozbahani and
  • James David Frost

14 August 2025

Limited labeled imagery and tight response windows hinder the accurate damage quantification for post-disaster assessment. The objective of this study is to develop and evaluate a deep learning-based Domain-Adaptive Segmentation (DASeg) workflow to d...

  • Article
  • Open Access
10 Citations
7,838 Views
19 Pages

18 April 2024

Social media platforms have surpassed cultural and linguistic boundaries, thus enabling online communication worldwide. However, the expanded use of various languages has intensified the challenge of online detection of hate speech content. Despite t...

  • Article
  • Open Access
4 Citations
4,540 Views
16 Pages

12 September 2021

Similarity learning using deep convolutional neural networks has been applied extensively in solving computer vision problems. This attraction is supported by its success in one-shot and zero-shot classification applications. The advances in similari...

  • Article
  • Open Access
1,874 Views
21 Pages

24 September 2025

Timely detection of road surface defects such as cracks and potholes is critical for ensuring traffic safety and reducing infrastructure maintenance costs. While recent advances in image-based deep learning techniques have shown promise for automated...

  • Article
  • Open Access
2 Citations
2,364 Views
27 Pages

Zero-Shot Rolling Bearing Fault Diagnosis Based on Attribute Description

  • Guorong Fan,
  • Lijun Li,
  • Yue Zhao,
  • Hui Shi,
  • Xiaoyi Zhang and
  • Zengshou Dong

Traditional fault diagnosis methods for rolling bearings rely on nemerous labeled samples, which are difficult to obtain in engineering applications. Moreover, when unseen fault categories appear in the test set, these models fail to achieve accurate...

  • Article
  • Open Access
1 Citations
2,435 Views
13 Pages

13 June 2023

Although image recognition technologies are developing rapidly with deep learning, conventional recognition models trained by supervised learning with class labels do not work well when test inputs from untrained classes are given. For example, a rec...

  • Article
  • Open Access
4 Citations
2,461 Views
22 Pages

Generalized Zero-Shot Space Target Recognition Based on Global-Local Visual Feature Embedding Network

  • Yuanpeng Zhang,
  • Jingye Guan,
  • Haobo Wang,
  • Kaiming Li,
  • Ying Luo and
  • Qun Zhang

28 October 2023

Existing deep learning-based space target recognition methods rely on abundantly labeled samples and are not capable of recognizing samples from unseen classes without training. In this article, based on generalized zero-shot learning (GZSL), we prop...

  • Article
  • Open Access
547 Views
20 Pages

An Integrated Framework with SAM and OCR for Pavement Crack Quantification and Geospatial Mapping

  • Nut Sovanneth,
  • Asnake Adraro Angelo,
  • Felix Obonguta and
  • Kiyoyuki Kaito

Pavement condition assessment using computer vision has emerged as an efficient alternative to traditional manual surveys, which are often labor-intensive and time-consuming. Leveraging deep learning, pavement distress such as cracks can be automatic...

  • Article
  • Open Access
340 Views
23 Pages

Semantics- and Physics-Guided Generative Network for Radar HRRP Generalized Zero-Shot Recognition

  • Jiaqi Zhou,
  • Tao Zhang,
  • Siyuan Mu,
  • Yuze Gao,
  • Feiming Wei and
  • Wenxian Yu

19 December 2025

High-resolution range profile (HRRP) target recognition has garnered significant attention in radar automatic target recognition (RATR) research for its rich structural information and low computational costs. With the rapid advancements in deep lear...

  • Article
  • Open Access
10 Citations
4,302 Views
23 Pages

19 April 2023

Human-to-human dialogues constitute an essential research area for linguists, serving as a conduit for knowledge transfer in the study of dialogue systems featuring human-to-machine interaction. Dialogue systems have garnered significant acclaim and...

  • Article
  • Open Access
442 Views
28 Pages

28 November 2025

This paper presents an integrated three-dimensional (3D) quality inspection system for mold manufacturing that addresses critical industrial constraints, including zero-shot generalization without retraining, complete decision traceability for regula...

  • Article
  • Open Access
14 Citations
3,589 Views
19 Pages

18 December 2023

Supervised training has traditionally been the cornerstone of hate speech detection models, but it often falls short when faced with unseen scenarios. Zero and few-shot learning offers an interesting alternative to traditional supervised approaches....

  • Article
  • Open Access
2,759 Views
18 Pages

MLLM-Search: A Zero-Shot Approach to Finding People Using Multimodal Large Language Models

  • Angus Fung,
  • Aaron Hao Tan,
  • Haitong Wang,
  • Bensiyon Benhabib and
  • Goldie Nejat

28 July 2025

Robotic search of people in human-centered environments, including healthcare settings, is challenging, as autonomous robots need to locate people without complete or any prior knowledge of their schedules, plans, or locations. Furthermore, robots ne...

  • Feature Paper
  • Article
  • Open Access
161 Views
24 Pages

8 January 2026

Root-cause analysis (RCA) in large-scale microservice-based payment systems is challenging due to complex failure propagation along service dependencies, limited availability of labeled incident data, and heterogeneous service topologies across deplo...

  • Article
  • Open Access
725 Views
28 Pages

7 November 2025

Medical anomaly detection is challenged by limited labeled data and domain shifts, which reduce the performance and generalization of deep learning (DL) models. Hybrid convolutional neural network–Vision Transformer (CNN–ViT) architecture...

  • Article
  • Open Access
276 Views
15 Pages

Few-Shot Transfer Learning for Diabetes Risk Prediction Across Global Populations

  • Shrinit Babel,
  • Sunit Babel,
  • John Hodgson and
  • Enrico Camporesi

19 December 2025

Background and Objectives: Type 2 diabetes mellitus (T2DM) affects over 537 million adults worldwide and disproportionately burdens low- and middle-income countries, where diagnostic resources are limited. Predictive models trained in one population...

  • Article
  • Open Access
1 Citations
3,604 Views
24 Pages

2 July 2024

Social media serves as a platform for netizens to stay informed and express their opinions through the Internet. Currently, the social media discourse environment faces a significant security threat—offensive comments. A group of users posts co...

  • Article
  • Open Access
21 Citations
7,042 Views
23 Pages

18 February 2023

Drug abuse is a serious problem in the United States, with over 90,000 drug overdose deaths nationally in 2020. A key step in combating drug abuse is detecting, monitoring, and characterizing its trends over time and location, also known as pharmacov...