Skip to Content
You are currently on the new version of our website. Access the old version .

Big Data and Cognitive Computing, Volume 9, Issue 2

2025 February - 31 articles

Cover Story: Android malware detection using AI is essential for preventing cyberattacks. This study applies genetic programming symbolic classifiers (GPSCs) to extract symbolic expressions (SEs) that classify malware. To optimize GPSC hyperparameters, a random hyperparameter value search (RHVS) and 5-fold cross-validation (5FCV) were used. The highly imbalanced dataset was balanced using preprocessing and oversampling techniques. Three approaches were tested: all input variables, high-importance features, and PCA. The best SEs formed threshold-based voting ensembles (TBVEs), achieving a peak accuracy of 0.98. View this paper
  • Issues are regarded as officially published after their release is announced to the table of contents alert mailing list .
  • You may sign up for email alerts to receive table of contents of newly released issues.
  • PDF is the official format for papers published in both, html and pdf forms. To view the papers in pdf format, click on the "PDF Full-text" link, and use the free Adobe Reader to open them.

Articles (31)

  • Article
  • Open Access
11 Citations
3,949 Views
23 Pages

The rising prevalence of social media turns them into huge, rich repositories of human emotions. Understanding and categorizing human emotion from social media content is of fundamental importance for many reasons, such as improvement of user experie...

  • Article
  • Open Access
1 Citations
2,157 Views
21 Pages

Reusing ML Models in Dynamic Data Environments: Data Similarity-Based Approach for Efficient MLOps

  • Eduardo Peixoto,
  • Diogo Torres,
  • Davide Carneiro,
  • Bruno Silva and
  • Ruben Marques

The rapid integration of Machine Learning (ML) in organizational practices has driven demand for substantial computational resources, incurring both high economic costs and environmental impact, particularly from energy consumption. This challenge is...

  • Article
  • Open Access
3 Citations
2,534 Views
18 Pages

Previously, it was suggested that the “persona-driven” approach can contribute to producing sufficiently diverse synthetic training data for Large Language Models (LLMs) that are currently about to run out of real natural language texts....

  • Article
  • Open Access
3 Citations
1,697 Views
29 Pages

Impact on Classification Process Generated by Corrupted Features

  • Simona Moldovanu,
  • Dan Munteanu and
  • Carmen Sîrbu

The topic of this study is the testing of the robustness of machine learning (ML) and neural network (NN) models with a new idea based on corrupted data. Typically, ML and NN classifiers are trained on real feature data; however, a portion of the fea...

  • Review
  • Open Access
26 Citations
12,937 Views
32 Pages

The convergence of cloud computing and the Industrial Internet of Things (IIoT) has significantly transformed industrial operations, enabling intelligent, scalable, and efficient systems. This survey provides a comprehensive analysis of the role clou...

  • Article
  • Open Access
1 Citations
1,796 Views
12 Pages

The decline in oral health commonly occurs as a natural consequence of aging or due to various pathological factors. Tooth loss, which diminishes masticatory ability, has been associated with negative impacts on cognitive function. This observational...

  • Article
  • Open Access
1 Citations
3,560 Views
24 Pages

This article addresses the impact of generative artificial intelligence on the creation of composite sketches for police investigations. The automation of this task, traditionally performed through artistic methods or image composition, has become a...

  • Article
  • Open Access
5 Citations
5,056 Views
19 Pages

As Large Language Models (LLMs) continue to advance, their capabilities in code clone detection have garnered significant attention. While much research has assessed LLM performance on human-generated code, the proliferation of LLM-generated code rai...

  • Article
  • Open Access
10 Citations
9,734 Views
23 Pages

Artificial intelligence (AI) affects many aspects of modern life, and most predictions are that the impact of AI on business and society will only increase. In the marketing function of today’s leading businesses, two main types of AI can be di...

  • Article
  • Open Access
7 Citations
3,826 Views
24 Pages

Convolutional Neural Networks (CNNs) have proven to be very effective in image classification due to their status as a powerful feature learning algorithm. Traditional approaches have considered the problem of multiclass classification, where the goa...

  • Article
  • Open Access
2 Citations
3,167 Views
25 Pages

Self-Supervised Foundation Model for Template Matching

  • Anton Hristov,
  • Dimo Dimov and
  • Maria Nisheva-Pavlova

Finding a template location in a query image is a fundamental problem in many computer vision applications, such as localization of known objects, image registration, image matching, and object tracking. Currently available methods fail when insuffic...

  • Article
  • Open Access
5 Citations
1,849 Views
21 Pages

Explainable Deep Learning for COVID-19 Vaccine Sentiment in Arabic Tweets Using Multi-Self-Attention BiLSTM with XLNet

  • Asmaa Hashem Sweidan,
  • Nashwa El-Bendary,
  • Shereen A. Taie,
  • Amira M. Idrees and
  • Esraa Elhariri

The COVID-19 pandemic has generated a vast corpus of online conversations regarding vaccines, predominantly on social media platforms like X (formerly known as Twitter). However, analyzing sentiment in Arabic text is challenging due to the diverse di...

  • Article
  • Open Access
2 Citations
1,501 Views
34 Pages

Integrating deep learning into microbiological and cell analysis from microscopic image samples has gained significant attention in recent years, driven by the rise of novel medical technologies and pressing global health challenges. Numerous methods...

  • Article
  • Open Access
7 Citations
4,781 Views
16 Pages

The importance of enhancing the accuracy of time-series forecasting using artificial intelligence tools is increasingly critical in light of the rapid advancements in modern technologies, particularly deep learning and neural networks. These approach...

  • Article
  • Open Access
1,287 Views
13 Pages

Human cultural complexity did not arise in a vacuum. This study employs agent-based modeling (ABM) and ecological modeling perspectives, combined with reinforcement-learning techniques, to investigate whether conditions that allowed for the lower spo...

  • Article
  • Open Access
2 Citations
2,294 Views
30 Pages

Dependency Reduction Techniques for Performance Improvement of Hyperledger Fabric Blockchain

  • Ju-Won Kim,
  • Jae-Geun Song,
  • In-Hwan Park,
  • Dong-Hwan Jo,
  • Yong-Jin Kim and
  • Ju-Wook Jang

We propose dependency reduction techniques for the performance enhancement of the Hyperledger Fabric blockchain. A dependency hazard may result from the parallelism in Hyperledger Fabric, which executes multiple transactions simultaneously in a singl...

  • Article
  • Open Access
3 Citations
3,509 Views
25 Pages

Technology Innovation and Social and Behavioral Commitment: A Case Study of Digital Transformation in the Moroccan Insurance Industry

  • Soukaina Abdallah-Ou-Moussa,
  • Martin Wynn,
  • Omar Kharbouch,
  • Sara El Aoufi and
  • Zakaria Rouaine

Digital transformation (DT) has become an imperative for companies seeking to evolve in a constantly changing industrial ecosystem, driven by the continual development and application of innovative digital technologies. Nevertheless, the success rate...

  • Article
  • Open Access
2 Citations
4,210 Views
24 Pages

The influence of scientific papers is measured by their citations. Although predicting the papers’ citation impact based on non-content factors has garnered extensive attention, the influence of such factors is rarely compared. In this article,...

  • Article
  • Open Access
15 Citations
3,610 Views
21 Pages

VSA-GCNN: Attention Guided Graph Neural Networks for Brain Tumor Segmentation and Classification

  • Kambham Pratap Joshi,
  • Vishruth Boraiah Gowda,
  • Parameshachari Bidare Divakarachari,
  • Paramesh Siddappa Parameshwarappa and
  • Raj Kumar Patra

For the past few decades, brain tumors have had a substantial influence on human life, and pose severe health risks if not treated and diagnosed in the early stages. Brain tumor problems are highly diverse and vary extensively in terms of size, type,...

  • Article
  • Open Access
2 Citations
2,339 Views
25 Pages

Multi-class object detectors often suffer from the class imbalance issue, where substantial model performance discrepancies exist between classes. Generative adversarial networks (GANs), an emerging deep learning research topic, are able to learn fro...

  • Article
  • Open Access
1,530 Views
49 Pages

Android malware detection using artificial intelligence today is a mandatory tool to prevent cyber attacks. To address this problem in this paper the proposed methodology consists of the application of genetic programming symbolic classifier (GPSC) t...

  • Article
  • Open Access
2 Citations
2,297 Views
41 Pages

Leveraging Open Big Data from R&D Projects with Large Language Models

  • Desireé Ruiz,
  • Yudith Cardinale,
  • Abraham Casas and
  • Vanessa Moscardó

Recent studies have highlighted the potential of Large Language Models (LLMs) to become experts in specific areas of knowledge through the utilization of techniques that enhance their context. Nevertheless, an interesting and underexplored applicatio...

  • Article
  • Open Access
11 Citations
2,890 Views
17 Pages

Pneumonia remains a significant cause of morbidity and mortality among pediatric patients worldwide. Accurate and timely diagnosis is crucial for effective treatment and improved patient outcomes. Traditionally, pneumonia diagnosis has relied on a co...

  • Article
  • Open Access
9 Citations
8,501 Views
31 Pages

BankNet: Real-Time Big Data Analytics for Secure Internet Banking

  • Kaushik Sathupadi,
  • Sandesh Achar,
  • Shinoy Vengaramkode Bhaskaran,
  • Nuruzzaman Faruqui and
  • Jia Uddin

The rapid growth of Internet banking has necessitated advanced systems for secure, real-time decision making. This paper introduces BankNet, a predictive analytics framework integrating big data tools and a BiLSTM neural network to deliver high-accur...

  • Article
  • Open Access
10 Citations
5,618 Views
21 Pages

Labeling Network Intrusion Detection System (NIDS) Rules with MITRE ATT&CK Techniques: Machine Learning vs. Large Language Models

  • Nir Daniel,
  • Florian Klaus Kaiser,
  • Shay Giladi,
  • Sapir Sharabi,
  • Raz Moyal,
  • Shalev Shpolyansky,
  • Andres Murillo,
  • Aviad Elyashar and
  • Rami Puzis

Analysts in Security Operations Centers (SOCs) are often occupied with time-consuming investigations of alerts from Network Intrusion Detection Systems (NIDSs). Many NIDS rules lack clear explanations and associations with attack techniques, complica...

  • Article
  • Open Access
3 Citations
1,816 Views
17 Pages

Evaluating the Effect of Surrogate Data Generation on Healthcare Data Assessment

  • Saeid Sanei,
  • Tracey K. M. Lee,
  • Issam Boukhennoufa,
  • Delaram Jarchi,
  • Xiaojun Zhai and
  • Klaus McDonald-Maier

In healthcare applications, often it is not possible to record sufficient data as required for deep learning or data-driven classification and feature detection systems due to the patient condition, various clinical or experimental limitations, or ti...

  • Article
  • Open Access
6 Citations
4,197 Views
30 Pages

Long-Term Forecasting of Solar Irradiation in Riyadh, Saudi Arabia, Using Machine Learning Techniques

  • Khalil AlSharabi,
  • Yasser Bin Salamah,
  • Majid Aljalal,
  • Akram M. Abdurraqeeb and
  • Fahd A. Alturki

Forecasting of time series data presents some challenges because the data’s nature is complex and therefore difficult to accurately forecast. This study presents the design and development of a novel forecasting system that integrates efficient...

  • Article
  • Open Access
2,131 Views
15 Pages

Fit Talks: Forecasting Fitness Awareness in Saudi Arabia Using Fine-Tuned Transformers

  • Nora Alturayeif,
  • Deemah Alqahtani,
  • Sumayh S. Aljameel,
  • Najla Almajed,
  • Lama Alshehri,
  • Nourah Aldhuwaihi,
  • Madawi Alhadyan and
  • Nouf Aldakheel

Understanding public sentiment on health and fitness is essential for addressing regional health challenges in Saudi Arabia. This research employs sentiment analysis to assess fitness awareness by analyzing content from the X platform (formerly Twitt...

  • Article
  • Open Access
3 Citations
3,506 Views
21 Pages

Low-Cost Embedded System Applications for Smart Cities

  • Victoria Alejandra Salazar Herrera,
  • Hugo Puertas de Araújo,
  • César Giacomini Penteado,
  • Mario Gazziro and
  • João Paulo Carmo

The Internet of Things (IoT) represents a transformative technology that allows interconnected devices to exchange data over the Internet, enabling automation and real-time decision making in a variety of areas. A key aspect of the success of the IoT...

  • Article
  • Open Access
8 Citations
4,748 Views
30 Pages

Synthetic Data Generation (SDG) is a promising solution for healthcare, offering the potential to generate synthetic patient data closely resembling real-world data while preserving privacy. However, data scarcity and heterogeneity, particularly in u...

Get Alerted

Add your email address to receive forthcoming issues of this journal.

XFacebookLinkedIn
Big Data Cogn. Comput. - ISSN 2504-2289