You are currently on the new version of our website. Access the old version .

13 Results Found

  • Article
  • Open Access
52 Citations
9,405 Views
16 Pages

Text classification is an essential aspect in many applications, such as spam detection and sentiment analysis. With the growing number of textual documents and datasets generated through social media and news articles, an increasing number of machin...

  • Article
  • Open Access
13 Citations
5,200 Views
22 Pages

22 October 2018

The ability to learn robust, resizable feature representations from unlabeled data has potential applications in a wide variety of machine learning tasks. One way to create such representations is to train deep generative models that can learn to cap...

  • Article
  • Open Access
37 Citations
15,408 Views
22 Pages

10 June 2020

With the growth of online information and sudden expansion in the number of electronic documents provided on websites and in electronic libraries, there is difficulty in categorizing text documents. Therefore, a rule-based approach is a solution to t...

  • Article
  • Open Access
1 Citations
1,031 Views
40 Pages

SemaTopic: A Framework for Semantic-Adaptive Probabilistic Topic Modeling

  • Amani Drissi,
  • Salma Sassi,
  • Richard Chbeir,
  • Anis Tissaoui and
  • Abderrazek Jemai

19 September 2025

Topic modeling is a crucial technique for Natural Language Processing (NLP) which helps to automatically uncover coherent topics from large-scale text corpora. Yet, classic methods tend to suffer from poor semantic depth and topic coherence. In this...

  • Article
  • Open Access
10 Citations
4,129 Views
18 Pages

Guided Semi-Supervised Non-Negative Matrix Factorization

  • Pengyu Li,
  • Christine Tseng,
  • Yaxuan Zheng,
  • Joyce A. Chew,
  • Longxiu Huang,
  • Benjamin Jarman and
  • Deanna Needell

20 April 2022

Classification and topic modeling are popular techniques in machine learning that extract information from large-scale datasets. By incorporating a priori information such as labels or important features, methods have been developed to perform classi...

  • Article
  • Open Access
6 Citations
3,233 Views
15 Pages

Integrating Text Classification into Topic Discovery Using Semantic Embedding Models

  • Ana Laura Lezama-Sánchez,
  • Mireya Tovar Vidal and
  • José A. Reyes-Ortiz

31 August 2023

Topic discovery involves identifying the main ideas within large volumes of textual data. It indicates recurring topics in documents, providing an overview of the text. Current topic discovery models receive the text, with or without pre-processing,...

  • Article
  • Open Access
1,664 Views
14 Pages

Text classification is an important research field in text mining and natural language processing, gaining momentum with the growth of social networks. Despite the accuracy advancements made by deep learning models, existing graph neural network-base...

  • Article
  • Open Access
30 Citations
5,695 Views
15 Pages

17 September 2018

Sentiment analysis of online tourist reviews is playing an increasingly important role in tourism. Accurately capturing the attitudes of tourists regarding different aspects of the scenic sites or the overall polarity of their online reviews is key t...

  • Article
  • Open Access
29 Citations
7,454 Views
13 Pages

Overfitting Reduction of Text Classification Based on AdaBELM

  • Xiaoyue Feng,
  • Yanchun Liang,
  • Xiaohu Shi,
  • Dong Xu,
  • Xu Wang and
  • Renchu Guan

6 July 2017

Overfitting is an important problem in machine learning. Several algorithms, such as the extreme learning machine (ELM), suffer from this issue when facing high-dimensional sparse data, e.g., in text classification. One common issue is that the exten...

  • Article
  • Open Access
50 Citations
5,957 Views
14 Pages

An Efficient and Unique TF/IDF Algorithmic Model-Based Data Analysis for Handling Applications with Big Data Streaming

  • Celestine Iwendi,
  • Suresh Ponnan,
  • Revathi Munirathinam,
  • Kathiravan Srinivasan and
  • Chuan-Yu Chang

11 November 2019

As the field of data science grows, document analytics has become a more challenging task for rough classification, response analysis, and text summarization. These tasks are used for the analysis of text data from various intelligent sensing systems...

  • Article
  • Open Access
1 Citations
1,686 Views
19 Pages

22 January 2025

Long short-term memory (LSTM) networks have shown great promise in sequential data analysis, especially in time-series and natural language processing. However, their potential for multi-view clustering has been largely underexplored. In this paper,...

  • Article
  • Open Access
21 Citations
4,565 Views
18 Pages

4 May 2021

Text document clustering refers to the unsupervised classification of textual documents into clusters based on content similarity and can be applied in applications such as search optimization and extracting hidden information from data generated by...

  • Article
  • Open Access
7 Citations
8,091 Views
15 Pages

19 January 2023

Due to the availability of a vast amount of unstructured data in various forms (e.g., the web, social networks, etc.), the clustering of text documents has become increasingly important. Traditional clustering algorithms have not been able to solve t...