Skip to Content
You are currently on the new version of our website. Access the old version .

220 Results Found

  • Article
  • Open Access
1,458 Views
25 Pages

Research articles are valuable resources for Information Retrieval and Natural Language Processing (NLP) tasks, offering opportunities to analyze key components of scholarly content. This study investigates the presence of methodological terminology...

  • Article
  • Open Access
24 Citations
5,027 Views
16 Pages

An Enhanced Neural Word Embedding Model for Transfer Learning

  • Md. Kowsher,
  • Md. Shohanur Islam Sobuj,
  • Md. Fahim Shahriar,
  • Nusrat Jahan Prottasha,
  • Mohammad Shamsul Arefin,
  • Pranab Kumar Dhar and
  • Takeshi Koshiba

10 March 2022

Due to the expansion of data generation, more and more natural language processing (NLP) tasks are needing to be solved. For this, word representation plays a vital role. Computation-based word embedding in various high languages is very useful. Howe...

  • Communication
  • Open Access
297 Views
12 Pages

Linguistic Influence on Multidimensional Word Embeddings: Analysis of Ten Languages

  • Anna V. Aleshina,
  • Andrey L. Bulgakov,
  • Yanliang Xin and
  • Larisa S. Skrebkova

Understanding how linguistic typology shapes multilingual embeddings is important for cross-lingual NLP. We examine static MUSE word embedding for ten diverse languages (English, Russian, Chinese, Arabic, Indonesian, German, Lithuanian, Hindi, Tajik...

  • Article
  • Open Access
52 Citations
11,843 Views
23 Pages

A Phishing-Attack-Detection Model Using Natural Language Processing and Deep Learning

  • Eduardo Benavides-Astudillo,
  • Walter Fuertes,
  • Sandra Sanchez-Gordon,
  • Daniel Nuñez-Agurto and
  • Germán Rodríguez-Galán

23 April 2023

Phishing is a type of cyber-attack that aims to deceive users, usually using fraudulent web pages that appear legitimate. Currently, one of the most-common ways to detect these phishing pages according to their content is by entering words non-sequen...

  • Article
  • Open Access
320 Views
33 Pages

A Multi-Stage NLP Framework for Knowledge Discovery from Crop Disease Research Literature

  • Jantima Polpinij,
  • Manasawee Kaenampornpan,
  • Christopher S. G. Khoo,
  • Wei-Ning Cheng and
  • Bancha Luaphol

14 January 2026

Extracting and organizing knowledge from the agricultural crop disease research literature are challenging tasks because of the heterogeneous terminologies, complicated symptom descriptions, and unstructured nature of scientific documents. In this st...

  • Article
  • Open Access
2 Citations
4,269 Views
15 Pages

Completing WordNets with Sememe Knowledge

  • Shengwen Li,
  • Bing Li,
  • Hong Yao,
  • Shunping Zhou,
  • Junjie Zhu and
  • Zhuang Zeng

WordNets organize words into synonymous word sets, and the connections between words present the semantic relationships between them, which have become an indispensable source for natural language processing (NLP) tasks. With the development and evol...

  • Article
  • Open Access
3 Citations
4,251 Views
18 Pages

Changing the Geometry of Representations: α-Embeddings for NLP Tasks

  • Riccardo Volpi,
  • Uddhipan Thakur and
  • Luigi Malagò

26 February 2021

Word embeddings based on a conditional model are commonly used in Natural Language Processing (NLP) tasks to embed the words of a dictionary in a low dimensional linear space. Their computation is based on the maximization of the likelihood of a cond...

  • Article
  • Open Access
193 Citations
24,314 Views
19 Pages

Transfer Learning for Sentiment Analysis Using BERT Based Supervised Fine-Tuning

  • Nusrat Jahan Prottasha,
  • Abdullah As Sami,
  • Md Kowsher,
  • Saydul Akbar Murad,
  • Anupam Kumar Bairagi,
  • Mehedi Masud and
  • Mohammed Baz

30 May 2022

The growth of the Internet has expanded the amount of data expressed by users across multiple platforms. The availability of these different worldviews and individuals’ emotions empowers sentiment analysis. However, sentiment analysis becomes e...

  • Article
  • Open Access
15 Citations
5,524 Views
20 Pages

25 February 2024

The development of noninvasive and cost-effective methods of detecting Alzheimer’s disease (AD) is essential for its early prevention and mitigation. We optimize the detection of AD using natural language processing (NLP) of spontaneous speech...

  • Article
  • Open Access
1,576 Views
36 Pages

This study aims to identify and evaluate the essential design features, strengths, and limitations of a virtual reality (VR) application that has been developed to train an international sales force effectively for a premium global wine brand. The st...

  • Article
  • Open Access
818 Views
30 Pages

14 November 2025

Architectural discourse is a specialised language whose key terms shift with context, which complicates empirical claims about meaning. This study addresses this problem by testing whether a rigorously audited, reproducible NLP framework can recover...

  • Article
  • Open Access
1,804 Views
28 Pages

Handwritten Keyword Spotting (KWS) remains a challenging task, particularly in segmentation-free scenarios where word images must be retrieved and ranked based on their similarity to a query without relying on prior page-level segmentation. Tradition...

  • Article
  • Open Access
2 Citations
2,267 Views
25 Pages

29 July 2025

The proliferation of digital public service platforms and the expansion of e-government initiatives have significantly increased the volume and diversity of citizen-generated feedback. This trend emphasizes the need for classification systems that ar...

  • Feature Paper
  • Article
  • Open Access
35 Citations
6,639 Views
18 Pages

15 June 2021

In social media, users are spreading misinformation easily and without fact checking. In principle, they do not have a malicious intent, but their sharing leads to a socially dangerous diffusion mechanism. The motivations behind this behavior have be...

  • Article
  • Open Access
2 Citations
4,210 Views
28 Pages

26 March 2025

In recent years, the rapid growth of cryptocurrency markets has highlighted the urgent need for advanced security solutions capable of addressing a spectrum of unique threats, from phishing and wallet hacks to complex blockchain vulnerabilities. This...

  • Article
  • Open Access
1,411 Views
16 Pages

1 September 2025

This paper investigates the effectiveness of Word2Vec-based molecular representation learning on SMILES (Simplified Molecular Input Line Entry System) strings for a downstream prediction task related to the market approvability of chemical compounds....

  • Article
  • Open Access
936 Views
28 Pages

17 November 2025

This study is intended to evaluate and contrast the performance of varying combinations of embedding algorithms and weighting systems in measuring perception-based text similarity using the Cosine Similarity approach. Within a structured experiment d...

  • Article
  • Open Access
23 Citations
5,142 Views
20 Pages

A Novel Deep Learning Approach Using Contextual Embeddings for Toponym Resolution

  • Ana Bárbara Cardoso,
  • Bruno Martins and
  • Jacinto Estima

This article describes a novel approach for toponym resolution with deep neural networks. The proposed approach does not involve matching references in the text against entries in a gazetteer, instead directly predicting geo-spatial coordinates. Mult...

  • Article
  • Open Access
307 Views
22 Pages

Detecting Behavioral and Emotional Themes Through Latent and Explicit Knowledge

  • Oded Mcdossi,
  • Rotem Klein,
  • Ali Shaer,
  • Rotem Dror and
  • Adir Solomon

26 January 2026

Social organizations increasingly rely on Natural Language Processing (NLP) to analyze large-scale textual data for high-stakes decisions, including university admissions, financial aid allocation, and job hiring. Current methods primarily employ top...

  • Article
  • Open Access
246 Views
20 Pages

5 February 2026

Efficient retrieval of mathematical and structural similarities in System Dynamics models remains a significant challenge for traditional lexical systems, which often fail to capture the contextual dependencies of simulation processes. This paper pre...

  • Review
  • Open Access
39 Citations
8,205 Views
32 Pages

Survey of Neural Text Representation Models

  • Karlo Babić,
  • Sanda Martinčić-Ipšić and
  • Ana Meštrović

30 October 2020

In natural language processing, text needs to be transformed into a machine-readable representation before any processing. The quality of further natural language processing tasks greatly depends on the quality of those representations. In this surve...

  • Article
  • Open Access
34 Citations
11,298 Views
16 Pages

Mapping ESG Trends by Distant Supervision of Neural Language Models

  • Natraj Raman,
  • Grace Bang and
  • Armineh Nourbakhsh

The integration of Environmental, Social and Governance (ESG) considerations into business decisions and investment strategies have accelerated over the past few years. It is important to quantify the extent to which ESG-related conversations are car...

  • Article
  • Open Access
1 Citations
2,174 Views
22 Pages

Recommending Words Using a Bayesian Network

  • Pedro Santos,
  • Matilde Pato,
  • Nuno Datia,
  • José Sobral,
  • Noel Leitão,
  • Manuel Ramos Ferreira and
  • Nuno Gomes

Asset management involves the coordinated activities of an organisation to derive value from assets, which may include physical assets. It encompasses activities related to design, construction, installation, operation, maintenance, renewal, and asse...

  • Article
  • Open Access
6 Citations
4,323 Views
31 Pages

20 April 2024

In the evolving field of machine learning, deploying fair and transparent models remains a formidable challenge. This study builds on earlier research, demonstrating that neural architectures exhibit inherent biases by analyzing a broad spectrum of t...

  • Article
  • Open Access
2,370 Views
15 Pages

Using Enhanced Representations to Predict Medical Procedures from Clinician Notes

  • Roberto Móstoles,
  • Oscar Araque and
  • Carlos Á. Iglesias

24 July 2024

Nowadays, most health professionals use electronic health records to keep track of patients. To properly use and share these data, the community has relied on medical classification standards to represent patient information. However, the coding proc...

  • Article
  • Open Access
3 Citations
5,432 Views
49 Pages

Interpretable Topic Extraction and Word Embedding Learning Using Non-Negative Tensor DEDICOM

  • Lars Hillebrand,
  • David Biesner,
  • Christian Bauckhage and
  • Rafet Sifa

Unsupervised topic extraction is a vital step in automatically extracting concise contentual information from large text corpora. Existing topic extraction methods lack the capability of linking relations between these topics which would further help...

  • Article
  • Open Access
9 Citations
2,860 Views
27 Pages

14 September 2023

Improving the quality of healthcare services is of the utmost importance in healthcare systems. Patient experience is a key aspect that should be gauged and monitored continuously. However, the measurement of such a vital indicator typically cannot b...

  • Article
  • Open Access
3 Citations
1,558 Views
33 Pages

Making Images Speak: Human-Inspired Image Description Generation

  • Chifaa Sebbane,
  • Ikram Belhajem and
  • Mohammed Rziza

28 April 2025

Despite significant advances in deep learning-based image captioning, many state-of-the-art models still struggle to balance visual grounding (i.e., accurate object and scene descriptions) with linguistic coherence (i.e., grammatical fluency and appr...

  • Feature Paper
  • Article
  • Open Access
1 Citations
1,559 Views
40 Pages

13 February 2025

In this article, we present a model for analyzing the co-occurrence count data derived from practical fields such as user–item or item–item data from online shopping platforms and co-occurring word–word pairs in sequences of texts....

  • Article
  • Open Access
10 Citations
1,872 Views
15 Pages

Dynamic Multi-Granularity Translation System: DAG-Structured Multi-Granularity Representation and Self-Attention

  • Shenrong Lv,
  • Bo Yang,
  • Ruiyang Wang,
  • Siyu Lu,
  • Jiawei Tian,
  • Wenfeng Zheng,
  • Xiaobing Chen and
  • Lirong Yin

9 October 2024

In neural machine translation (NMT), the sophistication of word embeddings plays a pivotal role in the model’s ability to render accurate and contextually relevant translations. However, conventional models with single granularity of word segme...

  • Article
  • Open Access
1,180 Views
19 Pages

The Specialist’s Paradox: Generalist AI May Better Organize Medical Knowledge

  • Carlo Galli,
  • Maria Teresa Colangelo,
  • Marco Meleti and
  • Elena Calciolari

21 July 2025

This study investigates the ability of six pre-trained sentence transformers to organize medical knowledge by performing unsupervised clustering on 70 high-level Medical Subject Headings (MeSH) terms across seven medical specialties. We evaluated mod...

  • Article
  • Open Access
15 Citations
7,104 Views
23 Pages

20 March 2023

Over the past few years, word embeddings and bidirectional encoder representations from transformers (BERT) models have brought better solutions to learning text representations for natural language processing (NLP) and other tasks. Many NLP applicat...

  • Article
  • Open Access
9 Citations
4,848 Views
23 Pages

Linguists have been focused on a qualitative comparison of the semantics from different languages. Evaluation of the semantic interpretation among disparate language pairs like English and Tamil is an even more formidable task than for Slavic languag...

  • Article
  • Open Access
1,853 Views
14 Pages

15 January 2025

Pre-trained language models such as BERT, GPT-3, and T5 have made significant advancements in natural language processing (NLP). However, their widespread adoption raises concerns about intellectual property (IP) protection, as unauthorized use can u...

  • Article
  • Open Access
39 Citations
7,051 Views
12 Pages

13 December 2018

Recurrent neural network (RNN) has achieved remarkable success in sequence labeling tasks with memory requirement. RNN can remember previous information of a sequence and can thus be used to solve natural language processing (NLP) tasks. Named entity...

  • Article
  • Open Access
3 Citations
1,842 Views
27 Pages

Virtual simulators of embedded systems and analyses of student surveys regarding their use at the early stage of the process of learning embedded systems, are presented in this article. The questionnaires were prepared in the Polish language, and the...

  • Article
  • Open Access
7 Citations
3,529 Views
18 Pages

Modeling Trajectories Obtained from External Sensors for Location Prediction via NLP Approaches

  • Lívia Almada Cruz,
  • Ticiana Linhares Coelho da Silva,
  • Régis Pires Magalhães,
  • Wilken Charles Dantas Melo,
  • Matheus Cordeiro,
  • José Antonio Fernandes de Macedo and
  • Karine Zeitouni

2 October 2022

Representation learning seeks to extract useful and low-dimensional attributes from complex and high-dimensional data. Natural language processing (NLP) was used to investigate the representation learning models to extract words’ feature vector...

  • Article
  • Open Access
38 Citations
8,065 Views
16 Pages

Sentiment analysis on social media platforms (i.e., Twitter or Facebook) has become an important tool to learn about users’ opinions and preferences. However, the accuracy of sentiment analysis is disrupted by the challenges of natural language...

  • Article
  • Open Access
14 Citations
7,915 Views
20 Pages

26 September 2022

Semantic Textual Similarity (STS) is an important task in the area of Natural Language Processing (NLP) that measures the similarity of the underlying semantics of two texts. Although pre-trained contextual embedding models such as Bidirectional Enco...

  • Article
  • Open Access
4 Citations
2,850 Views
22 Pages

28 November 2024

The expanding Arabic user base presents a unique opportunity for researchers to tap into vast online Arabic resources. However, the lack of reliable Arabic word embedding models and the limited availability of Arabic corpora poses significant challen...

  • Systematic Review
  • Open Access
6 Citations
6,276 Views
17 Pages

1 July 2021

This article presents a systematic literature review on quantifying the proximity between independently trained monolingual word embedding spaces. A search was carried out in the broader context of inducing bilingual lexicons from cross-lingual word...

  • Article
  • Open Access
36 Citations
8,484 Views
14 Pages

Sentiment-Aware Word Embedding for Emotion Classification

  • Xingliang Mao,
  • Shuai Chang,
  • Jinjing Shi,
  • Fangfang Li and
  • Ronghua Shi

29 March 2019

Word embeddings are effective intermediate representations for capturing semantic regularities between words in natural language processing (NLP) tasks. We propose sentiment-aware word embedding for emotional classification, which consists of integra...

  • Article
  • Open Access
3 Citations
5,610 Views
19 Pages

Enhancing Word Embeddings for Improved Semantic Alignment

  • Julian Szymański,
  • Maksymilian Operlejn and
  • Paweł Weichbroth

10 December 2024

This study introduces a method for the improvement of word vectors, addressing the limitations of traditional approaches like Word2Vec or GloVe through introducing into embeddings richer semantic properties. Our approach leverages supervised learning...

  • Article
  • Open Access
4 Citations
2,871 Views
23 Pages

Efficient Estimate of Low-Frequency Words’ Embeddings Based on the Dictionary: A Case Study on Chinese

  • Xianwen Liao,
  • Yongzhong Huang,
  • Changfu Wei,
  • Chenhao Zhang,
  • Yongqing Deng and
  • Ke Yi

21 November 2021

Obtaining high-quality embeddings of out-of-vocabularies (OOVs) and low-frequency words is a challenge in natural language processing (NLP). To efficiently estimate the embeddings of OOVs and low-frequency words, we propose a new method that uses the...

  • Article
  • Open Access
95 Citations
22,327 Views
26 Pages

14 September 2022

Introduction: The advances in the digital era have necessitated the adoption of communication as the main channel for modern business. In the past, business negotiations, profiling, seminars, shopping, and agreements were in-person but today everythi...

  • Article
  • Open Access
576 Citations
76,851 Views
29 Pages

Sentiment Analysis Based on Deep Learning: A Comparative Study

  • Nhan Cach Dang,
  • María N. Moreno-García and
  • Fernando De la Prieta

The study of public opinion can provide us with valuable information. The analysis of sentiment on social networks, such as Twitter or Facebook, has become a powerful means of learning about the users’ opinions and has a wide range of applicati...

  • Article
  • Open Access
5 Citations
2,089 Views
21 Pages

Exploring LLM Embedding Potential for Dementia Detection Using Audio Transcripts

  • Brandon Alejandro Llaca-Sánchez,
  • Luis Roberto García-Noguez,
  • Marco Antonio Aceves-Fernández,
  • Andras Takacs and
  • Saúl Tovar-Arriaga

17 July 2025

Dementia is a neurodegenerative disorder characterized by progressive cognitive impairment that significantly affects daily living. Early detection of Alzheimer’s disease—the most common form of dementia—remains essential for prompt...

  • Article
  • Open Access
4 Citations
2,700 Views
20 Pages

Know an Emotion by the Company It Keeps: Word Embeddings from Reddit/Coronavirus

  • Alejandro García-Rudolph,
  • David Sanchez-Pinsach,
  • Dietmar Frey,
  • Eloy Opisso,
  • Katryna Cisek and
  • John D. Kelleher

31 May 2023

Social media is a crucial communication tool (e.g., with 430 million monthly active users in online forums such as Reddit), being an objective of Natural Language Processing (NLP) techniques. One of them (word embeddings) is based on the quotation, &...

  • Article
  • Open Access
11 Citations
9,963 Views
17 Pages

Tacit knowledge, often implicit and deeply embedded within individuals and organizational practices, is critical for fostering innovation and decision-making in knowledge management systems (KMS). Converting tacit knowledge into explicit forms enhanc...

  • Article
  • Open Access
30 Citations
8,083 Views
15 Pages

28 June 2021

Entity-based information extraction is one of the main applications of Natural Language Processing (NLP). Recently, deep transfer-learning utilizing contextualized word embedding from pre-trained language models has shown remarkable results for many...

of 5