Skip Content
You are currently on the new version of our website. Access the old version .

51 Results Found

  • Article
  • Open Access
4 Citations
2,660 Views
14 Pages

25 September 2024

Multi-label text classification (MLTC) aims to assign appropriate labels to each document from a given set. Prior research has acknowledged the significance of label information, but its utilization remains insufficient. Existing approaches often foc...

  • Article
  • Open Access
1 Citations
3,781 Views
25 Pages

Threat Intelligence Extraction Framework (TIEF) for TTP Extraction

  • Anooja Joy,
  • Madhav Chandane,
  • Yash Nagare and
  • Faruk Kazi

The increasing complexity and scale of cyber threats demand advanced, automated methodologies for extracting actionable cyber threat intelligence (CTI). The automated extraction of Tactics, Techniques, and Procedures (TTPs) from unstructured threat r...

  • Article
  • Open Access
5 Citations
2,408 Views
22 Pages

5 December 2022

Plagiarism is an act of intellectual high treason that damages the whole scholarly endeavor. Many attempts have been undertaken in recent years to identify text document plagiarism. The effectiveness of researchers’ suggested strategies to iden...

  • Article
  • Open Access
8 Citations
4,073 Views
17 Pages

VC-SLAM—A Handcrafted Data Corpus for the Construction of Semantic Models

  • Andreas Burgdorf,
  • Alexander Paulus,
  • André Pomp and
  • Tobias Meisen

25 January 2022

Ontology-based data management and knowledge graphs have emerged in recent years as efficient approaches for managing and utilizing diverse and large data sets. In this regard, research on algorithms for automatic semantic labeling and modeling as a...

  • Article
  • Open Access
1,396 Views
18 Pages

17 November 2025

This study presents the development of an AI-powered chatbot designed to facilitate accurate and efficient retrieval of information from the FDA drug labeling documents. Leveraging OpenAI’s GPT-3.5-turbo model within a controlled, document-grou...

  • Article
  • Open Access
893 Views
18 Pages

SATrack: Semantic-Aware Alignment Framework for Visual–Language Tracking

  • Yangyang Tian,
  • Liusen Xu,
  • Zhe Li,
  • Liang Jiang,
  • Cen Chen and
  • Huanlong Zhang

4 October 2025

Visual–language tracking often faces challenges like target deformation and confusion caused by similar objects. These issues can disrupt the alignment between visual inputs and their textual descriptions, leading to cross-modal semantic drift...

  • Article
  • Open Access
9 Citations
4,157 Views
15 Pages

16 October 2020

Knowledge graph completion can make knowledge graphs more complete, which is a meaningful research topic. However, the existing methods do not make full use of entity semantic information. Another challenge is that a deep model requires large-scale m...

  • Article
  • Open Access
2 Citations
3,515 Views
20 Pages

22 June 2025

This paper addresses the challenges of sample scarcity and class imbalance in remote sensing image semantic segmentation by proposing a decoupled synthetic sample generation framework based on a latent diffusion model. The method consists of two stag...

  • Article
  • Open Access
2 Citations
1,935 Views
19 Pages

10 October 2025

Weakly Supervised Video Anomaly Detection (WSVAD) is a critical task in computer vision. It aims to localize and recognize abnormal behaviors using only video-level labels. Without frame-level annotations, it becomes significantly challenging to mode...

  • Article
  • Open Access
208 Views
17 Pages

Recent vision-language pre-training models, like CLIP, have been shown to generalize well across a variety of multitask modalities. Nonetheless, their generalization for downstream tasks is limited. As a lightweight adaptation approach, prompt learni...

  • Article
  • Open Access
6 Citations
3,968 Views
22 Pages

Multi-Modal Prototypes for Few-Shot Object Detection in Remote Sensing Images

  • Yanxing Liu,
  • Zongxu Pan,
  • Jianwei Yang,
  • Peiling Zhou and
  • Bingchen Zhang

16 December 2024

Few-shot object detection has attracted extensive attention due to the abomination of time-consuming or even impractical large-scale data labeling. Current studies attempted to employ prototype-matching approaches for object detection, constructing c...

  • Article
  • Open Access
1 Citations
1,239 Views
18 Pages

Earth observation data serve as a fundamental resource in Earth system science. The rapid advancement of remote sensing and in situ measurement technologies has led to the generation of massive volumes of data, accompanied by a growing body of geogra...

  • Article
  • Open Access
1,215 Views
22 Pages

Concatenation Augmentation for Improving Deep Learning Models in Finance NLP with Scarce Data

  • César Vaca,
  • Jesús-Ángel Román-Gallego,
  • Verónica Barroso-García,
  • Fernando Tejerina and
  • Benjamín Sahelices

Nowadays, financial institutions increasingly leverage artificial intelligence to enhance decision-making and optimize investment strategies. A specific application is the automatic analysis of large volumes of unstructured textual data to extract re...

  • Article
  • Open Access
1 Citations
3,152 Views
17 Pages

25 April 2025

Unsupervised sentence embedding, vital for numerous NLP tasks, struggles with the inherent asymmetry of semantic relationships within contrastive learning (CL). This paper proposes Label Smoothing-based Ranking Negative Sampling (LS-RNS), a novel fra...

  • Article
  • Open Access
144 Citations
5,424 Views
16 Pages

Developing Multi-Labelled Corpus of Twitter Short Texts: A Semi-Automatic Method

  • Xuan Liu,
  • Guohui Zhou,
  • Minghui Kong,
  • Zhengtong Yin,
  • Xiaolu Li,
  • Lirong Yin and
  • Wenfeng Zheng

1 August 2023

Facing fast-increasing electronic documents in the Digital Media Age, the need to extract textual features of online texts for better communication is growing. Sentiment classification might be the key method to catch emotions of online communication...

  • Article
  • Open Access
2 Citations
2,131 Views
13 Pages

Implicit Stance Detection with Hashtag Semantic Enrichment

  • Li Dong,
  • Zinao Su,
  • Xianghua Fu,
  • Bowen Zhang and
  • Genan Dai

26 May 2024

Stance detection is a crucial task in natural language processing and social computing, focusing on classifying expressed attitudes towards specific targets based on the input text. Conventional methods predominantly view stance detection as a task o...

  • Article
  • Open Access
1 Citations
2,280 Views
17 Pages

4 March 2023

Raw image classification datasets generally maintain a long-tailed distribution in the real world. Standard classification algorithms face a substantial issue because many labels only relate to a few categories. The model learning processes will tend...

  • Article
  • Open Access
17 Citations
4,491 Views
16 Pages

XLNet-Caps: Personality Classification from Textual Posts

  • Ying Wang,
  • Jiazhuang Zheng,
  • Qing Li,
  • Chenglong Wang,
  • Hanyun Zhang and
  • Jibing Gong

Personality characteristics represent the behavioral characteristics of a class of people. Social networking sites have a multitude of users, and the text messages generated by them convey a person’s feelings, thoughts, and emotions at a particular t...

  • Article
  • Open Access
4 Citations
2,517 Views
18 Pages

25 January 2025

Weakly supervised crack segmentation aims to create pixel-level crack masks with minimal human annotation, which often only differentiate between crack and normal no-crack patches. This task is crucial for assessing structural integrity and safety in...

  • Article
  • Open Access
3 Citations
1,966 Views
21 Pages

5 December 2024

Author Gender Identification (AGI) is an extensively studied subject owing to its significance in several domains, such as security and marketing. Recognizing an author’s gender may assist marketers in segmenting consumers more effectively and...

  • Article
  • Open Access
2 Citations
2,600 Views
23 Pages

Customer complaints play an important role in the adjustment of business operations and improvement of services, particularly in the aviation industry. However, extracting adequate textual features to perform a multi-label classification of complaint...

  • Article
  • Open Access
22 Citations
4,051 Views
17 Pages

14 December 2020

Information privacy is a critical design feature for any exchange system, with privacy-preserving applications requiring, most of the time, the identification and labelling of sensitive information. However, privacy and the concept of “sensitiv...

  • Article
  • Open Access
14 Citations
7,022 Views
34 Pages

Unsupervised Event Graph Representation and Similarity Learning on Biomedical Literature

  • Giacomo Frisoni,
  • Gianluca Moro,
  • Giulio Carlassare and
  • Antonella Carbonaro

21 December 2021

The automatic extraction of biomedical events from the scientific literature has drawn keen interest in the last several years, recognizing complex and semantically rich graphical interactions otherwise buried in texts. However, very few works revolv...

  • Article
  • Open Access
59 Citations
9,496 Views
28 Pages

Depression Detection Based on Hybrid Deep Learning SSCL Framework Using Self-Attention Mechanism: An Application to Social Networking Data

  • Aleena Nadeem,
  • Muhammad Naveed,
  • Muhammad Islam Satti,
  • Hammad Afzal,
  • Tanveer Ahmad and
  • Ki-Il Kim

13 December 2022

In today’s world, mental health diseases have become highly prevalent, and depression is one of the mental health problems that has become widespread. According to WHO reports, depression is the second-leading cause of the global burden of dise...

  • Article
  • Open Access
1 Citations
987 Views
20 Pages

10 September 2025

Scientific creativity is a crucial indicator of adolescents’ potential in science and technology, and its automated evaluation plays a vital role in the early identification of innovative talent. To address challenges such as limited sample siz...

  • Article
  • Open Access
1 Citations
2,455 Views
13 Pages

13 June 2023

Although image recognition technologies are developing rapidly with deep learning, conventional recognition models trained by supervised learning with class labels do not work well when test inputs from untrained classes are given. For example, a rec...

  • Article
  • Open Access
1,032 Views
22 Pages

6 December 2024

Pesticide registration information is an essential part of the pesticide knowledge base. However, the large amount of unstructured text data that it contains pose significant challenges for knowledge storage, retrieval, and utilization. To address th...

  • Article
  • Open Access
518 Views
23 Pages

Comparative Study of Machine Learning Models for Textual Medical Note Classification

  • Yan Zhang,
  • Huynh Trung Nguyen Le,
  • Nathan Lopez and
  • Kira Phan

23 December 2025

The expansion of electronic health records (EHRs) has generated a large amount of unstructured textual data, such as clinical notes and medical reports, which contain diagnostic and prognostic information. Effective classification of these textual me...

  • Article
  • Open Access
13 Citations
5,707 Views
13 Pages

Users of web or chat social networks typically use emojis (e.g., smilies, memes, hearts) to convey in their textual interactions the emotions underlying the context of the communication, aiming for better interpretability, especially for short polyse...

  • Article
  • Open Access
3,197 Views
14 Pages

18 August 2025

Open-vocabulary semantic segmentation aims to label each pixel of an image based on text descriptions provided at inference time. Recent approaches for this task are based on methods which require two stages: the first one uses a mask generator to ge...

  • Article
  • Open Access
2,889 Views
21 Pages

UAV-OVD: Open-Vocabulary Object Detection in UAV Imagery via Multi-Level Text-Guided Decoding

  • Lijie Tao,
  • Guoting Wei,
  • Zhuo Wang,
  • Zhaoshuai Qi,
  • Ying Li and
  • Haokui Zhang

14 July 2025

Object detection in drone-captured imagery has attracted significant attention due to its wide range of real-world applications, including surveillance, disaster response, and environmental monitoring. Although the majority of existing methods are de...

  • Article
  • Open Access
1,534 Views
23 Pages

27 May 2025

Integrating information from heterogeneous data sources poses significant mathematical challenges, particularly in ensuring the reliability and reducing the uncertainty of predictive models. This paper introduces the Geometric Orthogonal Multimodal F...

  • Article
  • Open Access
1 Citations
1,456 Views
28 Pages

13 May 2025

Fine-grained feature extraction and affective semantic mapping remain significant challenges in product form analysis. To address these issues, this study proposes a contrastive learning-based cross-modal fusion approach for product form imagery reco...

  • Article
  • Open Access
4 Citations
2,311 Views
16 Pages

Statement Recognition of Access Control Policies in IoT Networks

  • Li Ma,
  • Zexian Yang,
  • Zhaoxiong Bu,
  • Qidi Lao and
  • Wenyin Yang

16 September 2023

Access Control Policies (ACPs) are essential for ensuring secure and authorized access to resources in IoT networks. Recognizing these policies involves identifying relevant statements within project documents expressed in natural language. While cur...

  • Article
  • Open Access
60 Citations
8,358 Views
20 Pages

Towards Improved Classification Accuracy on Highly Imbalanced Text Dataset Using Deep Neural Language Models

  • Sarang Shaikh,
  • Sher Muhammad Daudpota,
  • Ali Shariq Imran and
  • Zenun Kastrati

19 January 2021

Data imbalance is a frequently occurring problem in classification tasks where the number of samples in one category exceeds the amount in others. Quite often, the minority class data is of great importance representing concepts of interest and is of...

  • Article
  • Open Access
1 Citations
1,514 Views
16 Pages

Few-shot relation extraction constitutes a critical task in natural language processing. Its aim is to train a model using a limited number of labeled samples when labeled data are scarce, thereby enabling the model to rapidly learn and accurately id...

  • Article
  • Open Access
829 Views
14 Pages

Construction and Application of Knowledge Graph for Power Grid New Equipment Start-Up

  • Wei Tang,
  • Yue Zhang,
  • Xun Mao,
  • Hetong Jia,
  • Kai Lv,
  • Lianfei Shan,
  • Yongtian Qiao and
  • Tao Jiang

17 October 2025

To address the lack of effective risk-identification methods during the commissioning of new power grid equipment, we propose a knowledge graph construction approach for both scheme generation and risk identification. First, a gated attention mechani...

  • Article
  • Open Access
1,486 Views
24 Pages

29 August 2025

Precise classification of unsound wheat grains is essential for crop yields and food security, yet most existing approaches rely on vision-only models that demand large labeled datasets, which is often impractical in real-world, data-scarce settings....

  • Article
  • Open Access
31 Citations
6,595 Views
38 Pages

Values (why to conserve) and Attributes (what to conserve) are essential concepts of cultural heritage. Recent studies have been using social media to map values and attributes conveyed by the public to cultural heritage. However, it is rare to conne...

  • Article
  • Open Access
1,856 Views
15 Pages

Zero-Shot Learning for Sustainable Municipal Waste Classification

  • Dishant Mewada,
  • Eoin Martino Grua,
  • Ciaran Eising,
  • Patrick Denny,
  • Pepijn Van de Ven and
  • Anthony Scanlan

Automated waste classification is an essential step toward efficient recycling and waste management. Traditional deep learning models, such as convolutional neural networks, rely on extensive labeled datasets to achieve high accuracy. However, the an...

  • Article
  • Open Access
11 Citations
4,406 Views
22 Pages

A New Sentence-Based Interpretative Topic Modeling and Automatic Topic Labeling

  • Olzhas Kozbagarov,
  • Rustam Mussabayev and
  • Nenad Mladenovic

10 May 2021

This article presents a new conceptual approach for the interpretative topic modeling problem. It uses sentences as basic units of analysis, instead of words or n-grams, which are commonly used in the standard approaches.The proposed approach’s speci...

  • Article
  • Open Access
16 Citations
5,696 Views
26 Pages

Introduction: Due to the lack of labeled data, applying predictive maintenance algorithms for facility management is cumbersome. Most companies are unwilling to share data or do not have time for annotation. In addition, most available facility manag...

  • Article
  • Open Access
10 Citations
3,677 Views
21 Pages

The increasing expansion of biomedical documents has increased the number of natural language textual resources related to the current applications. Meanwhile, there has been a great interest in extracting useful information from meaningful coherent...

  • Article
  • Open Access
13 Citations
3,769 Views
15 Pages

Leverage Boosting and Transformer on Text-Image Matching for Cheap Fakes Detection

  • Tuan-Vinh La,
  • Minh-Son Dao,
  • Duy-Dong Le,
  • Kim-Phung Thai,
  • Quoc-Hung Nguyen and
  • Thuy-Kieu Phan-Thi

10 November 2022

The explosive growth of the social media community has increased many kinds of misinformation and is attracting tremendous attention from the research community. One of the most prevalent ways of misleading news is cheapfakes. Cheapfakes utilize non-...

  • Article
  • Open Access
6 Citations
2,879 Views
16 Pages

22 August 2023

Aspect-based sentiment analysis (ABSA) is a task of fine-grained sentiment analysis that aims to determine the sentiment of a given target. With the increased prevalence of smart devices and social media, diverse data modalities have become more abun...

  • Article
  • Open Access
482 Views
27 Pages

29 December 2025

The lack of explicit negative labels issue is a prevalent challenge in numerous domains, including CV, NLP, and Recommender Systems (RSs). To address this challenge, many negative sample completion methods are proposed, such as optimizing sample dist...

  • Article
  • Open Access
9 Citations
5,642 Views
21 Pages

Multilabel Text Classification with Label-Dependent Representation

  • Rodrigo Alfaro,
  • Héctor Allende-Cid and
  • Héctor Allende

11 March 2023

Assigning predefined classes to natural language texts, based on their content, is a necessary component in many tasks in organizations. This task is carried out by classifying documents within a set of predefined categories using models and computat...

  • Article
  • Open Access
2 Citations
1,622 Views
27 Pages

With the growing adoption of electric vehicles, optimizing the user experience of charging infrastructure has become critical. However, extracting actionable insights from the vast number of user reviews remains a significant challenge, impeding dema...

  • Article
  • Open Access
1 Citations
2,694 Views
19 Pages

SYNCode: Synergistic Human–LLM Collaboration for Enhanced Data Annotation in Stack Overflow

  • Meng Xia,
  • Shradha Maharjan,
  • Tammy Le,
  • Will Taylor and
  • Myoungkyu Song

Large language models (LLMs) have rapidly advanced natural language processing, showcasing remarkable effectiveness as automated annotators across various applications. Despite their potential to significantly reduce annotation costs and expedite wor...

  • Article
  • Open Access
3 Citations
3,730 Views
21 Pages

aRTIC GAN: A Recursive Text-Image-Conditioned GAN

  • Edoardo Alati,
  • Carlo Alberto Caracciolo,
  • Marco Costa,
  • Marta Sanzari ,
  • Paolo Russo and
  • Irene Amerini

Generative Adversarial Networks have recently demonstrated the capability to synthesize photo-realistic real-world images. However, they still struggle to offer high controllability of the output image, even if several constraints are provided as inp...

of 2