Skip to Content

Machine Learning and Knowledge Extraction, Volume 7, Issue 1

2025 March - 26 articles

Cover Story: “Better than trees” describes opportunities to improve machine learning interpretability by applying semilattices through algebraic machine learning. Unlike trees, semilattices can include connections between elements that are in different hierarchies. This enables semilattices to be better than trees in balancing the accuracy and complexity of models. In this paper, the advantages of semilattices are explained using the practical example of urban food access landscapes, comprising food deserts, food oases, and food swamps. The means by which algebraic semilattices can provide a basis for machine learning models is explained. Thus, rather than proposing improvements to tree-based methods, this paper provides guidance for the formulation of machine learning models based on algebraic semilattices. View this paper
  • Issues are regarded as officially published after their release is announced to the table of contents alert mailing list .
  • You may sign up for email alerts to receive table of contents of newly released issues.
  • PDF is the official format for papers published in both, html and pdf forms. To view the papers in pdf format, click on the "PDF Full-text" link, and use the free Adobe Reader to open them.

Articles (26)

  • Article
  • Open Access
1,956 Views
19 Pages

Equatorial plasma bubbles (EPBs) are regions of depleted electron density that form in the Earth’s ionosphere due to Rayleigh–Taylor instability. These bubbles can cause signal scintillation, leading to signal loss and errors in position...

  • Article
  • Open Access
1,710 Views
18 Pages

Non-negative Matrix Factorization (NMF) has gained popularity due to its effectiveness in clustering and feature selection tasks. It is particularly valuable for managing high-dimensional data by reducing dimensionality and providing meaningful seman...

  • Article
  • Open Access
3 Citations
2,712 Views
34 Pages

The electrical storm optimization (ESO) algorithm, inspired by the dynamic nature of electrical storms, is a novel population-based metaheuristic that employs three dynamically adjusted parameters: field resistance, field intensity, and field conduct...

  • Article
  • Open Access
5 Citations
6,235 Views
29 Pages

Multimodal Deep Learning for Android Malware Classification

  • James Arrowsmith,
  • Teo Susnjak and
  • Julian Jang-Jaccard

This study investigates the integration of diverse data modalities within deep learning ensembles for Android malware classification. Android applications can be represented as binary images and function call graphs, each offering complementary persp...

  • Article
  • Open Access
2 Citations
3,600 Views
27 Pages

We introduce a weakly supervised segmentation approach that leverages class activation maps and the Segment Anything Model to generate high-quality masks using only classification data. A pre-trained classifier produces class activation maps that, on...

  • Article
  • Open Access
10 Citations
3,892 Views
19 Pages

Comparative Analysis of Perturbation Techniques in LIME for Intrusion Detection Enhancement

  • Mantas Bacevicius,
  • Agne Paulauskaite-Taraseviciene,
  • Gintare Zokaityte,
  • Lukas Kersys and
  • Agne Moleikaityte

The growing sophistication of cyber threats necessitates robust and interpretable intrusion detection systems (IDS) to safeguard network security. While machine learning models such as Decision Tree (DT), Random Forest (RF), k-Nearest Neighbors (K-NN...

  • Article
  • Open Access
16 Citations
6,025 Views
21 Pages

This research paper presents novel approaches for detecting credit card risk through the utilization of Long Short-Term Memory (LSTM) networks and XGBoost algorithms. Facing the challenge of securing credit card transactions, this study explores the...

  • Article
  • Open Access
3 Citations
2,875 Views
12 Pages

Explainability is essential for AI models, especially in clinical settings where understanding the model’s decisions is crucial. Despite their impressive performance, black-box AI models are unsuitable for clinical use if their operations canno...

  • Article
  • Open Access
3 Citations
5,843 Views
19 Pages

Weighted Kappa for Interobserver Agreement and Missing Data

  • Matthijs J. Warrens,
  • Alexandra de Raadt,
  • Roel J. Bosker and
  • Henk A. L. Kiers

The weighted kappa coefficient is commonly used for assessing agreement between two raters on an ordinal scale. This study is the first to assess the impact of missing data on the value of weighted kappa. We compared three methods for handling missin...

  • Article
  • Open Access
1 Citations
3,071 Views
20 Pages

Decoding Mental States in Social Cognition: Insights from Explainable Artificial Intelligence on HCP fMRI Data

  • José Diogo Marques dos Santos,
  • Luís Paulo Reis and
  • José Paulo Marques dos Santos

Artificial neural networks (ANNs) have been used for classification tasks involving functional magnetic resonance imaging (fMRI), though typically focusing only on fractions of the brain in the analysis. Recent work combined shallow neural networks (...

  • Article
  • Open Access
1,876 Views
27 Pages

Investigating and Optimizing MINDWALC Node Classification to Extract Interpretable Decision Trees from Knowledge Graphs

  • Maximilian Legnar,
  • Joern-Helge Heinrich Siemoneit,
  • Gilles Vandewiele,
  • Jürgen Hesser,
  • Zoran Popovic,
  • Stefan Porubsky and
  • Cleo-Aron Weis

This work deals with the investigation and optimization of the MINDWALC node classification algorithm with a focus on its ability to learn human-interpretable decision trees from knowledge graph databases. For this, we introduce methods to optimize M...

  • Article
  • Open Access
23 Citations
8,582 Views
18 Pages

Generative large language models (LLMs) have revolutionized the development of knowledge-based systems, enabling new possibilities in applications like ChatGPT, Bing, and Gemini. Two key strategies for domain adaptation in these systems are Domain-Sp...

  • Article
  • Open Access
1,385 Views
24 Pages

Triple Down on Robustness: Understanding the Impact of Adversarial Triplet Compositions on Adversarial Robustness

  • Sander Joos,
  • Tim Van hamme,
  • Willem Verheyen,
  • Davy Preuveneers and
  • Wouter Joosen

Adversarial training, a widely used technique for fortifying the robustness of machine learning models, has seen its effectiveness further bolstered by modifying loss functions or incorporating additional terms into the training objective. While thes...

  • Review
  • Open Access
20 Citations
19,240 Views
42 Pages

The integration of machine learning (ML) with big data has revolutionized industries by enabling the extraction of valuable insights from vast and complex datasets. This convergence has fueled advancements in various fields, leading to the developmen...

  • Article
  • Open Access
51 Citations
14,698 Views
32 Pages

This study introduces the Pixel-Level Interpretability (PLI) model, a novel framework designed to address critical limitations in medical imaging diagnostics by enhancing model transparency and diagnostic accuracy. The primary objective is to evaluat...

  • Article
  • Open Access
1 Citations
3,745 Views
25 Pages

Basketball players are traditionally classified into five positions. This study examines the correlation between player performance, game statistics, and designated positions. It also explores how statistical contributions have evolved over time. Mac...

  • Article
  • Open Access
3 Citations
3,393 Views
22 Pages

Unsupervised Word Sense Disambiguation Using Transformer’s Attention Mechanism

  • Radu Ion,
  • Vasile Păiș,
  • Verginica Barbu Mititelu,
  • Elena Irimia,
  • Maria Mitrofan,
  • Valentin Badea and
  • Dan Tufiș

Transformer models produce advanced text representations that have been used to break through the hard challenge of natural language understanding. Using the Transformer’s attention mechanism, which acts as a language learning memory, trained o...

  • Article
  • Open Access
1 Citations
1,708 Views
13 Pages

This study investigates the relationship between consumer personality traits, specifically openness, and responses to product designs. Consumers are categorized based on their levels of openness, and their affective responses to nine vase designs, va...

  • Article
  • Open Access
2 Citations
8,370 Views
13 Pages

A Comparative Analysis of European Media Coverage of the Israel–Gaza War Using Hesitant Fuzzy Linguistic Term Sets

  • Walaa Abuasaker,
  • Mónica Sánchez,
  • Jennifer Nguyen,
  • Nil Agell,
  • Núria Agell and
  • Francisco J. Ruiz

Representing and interpreting human opinions within an unstructured framework is inherently complex. Hesitant fuzzy linguistic term sets offer a comprehensive context that facilitates a nuanced understanding of diverse perspectives. This study introd...

  • Article
  • Open Access
4 Citations
2,988 Views
18 Pages

Interdisciplinary research (IDR) is essential for addressing complex global challenges that surpass the capabilities of any single discipline. However, measuring interdisciplinarity remains challenging due to conceptual ambiguities and inconsistent m...

  • Review
  • Open Access
11 Citations
12,842 Views
44 Pages

AI Advances in ICU with an Emphasis on Sepsis Prediction: An Overview

  • Charithea Stylianides,
  • Andria Nicolaou,
  • Waqar Aziz Sulaiman,
  • Christina-Athanasia Alexandropoulou,
  • Ilias Panagiotopoulos,
  • Konstantina Karathanasopoulou,
  • George Dimitrakopoulos,
  • Styliani Kleanthous,
  • Eleni Politi and
  • Andreas S. Panayides
  • + 9 authors

Artificial intelligence (AI) is increasingly applied in a wide range of healthcare and Intensive Care Unit (ICU) areas to serve—among others—as a tool for disease detection and prediction, as well as for healthcare resources’ manage...

  • Article
  • Open Access
2,019 Views
20 Pages

Balancing the accuracy and the complexity of models is a well established and ongoing challenge. Models can be misleading if they are not accurate, but models may be incomprehensible if their accuracy depends upon their being complex. In this paper,...

  • Article
  • Open Access
18 Citations
6,621 Views
23 Pages

Urban happiness prediction presents a complex challenge, due to the nonlinear and multifaceted relationships among socio-economic, environmental, and infrastructural factors. This study introduces an advanced hybrid model combining a gradient boostin...

  • Article
  • Open Access
1 Citations
4,047 Views
25 Pages

Benchmarking with a Language Model Initial Selection for Text Classification Tasks

  • Agus Riyadi,
  • Mate Kovacs,
  • Uwe Serdült and
  • Victor Kryssanov

The now-globally recognized concerns of AI’s environmental implications resulted in a growing awareness of the need to reduce AI carbon footprints, as well as to carry out AI processes responsibly and in an environmentally friendly manner. Benc...

  • Article
  • Open Access
1 Citations
1,701 Views
23 Pages

Comparison of Off-the-Shelf Methods and a Hotelling Multidimensional Approximation for Data Drift Detection

  • J. Ramón Navarro-Cerdán,
  • Vicent Ortiz Castelló and
  • David Millán Escrivá

Data drift can significantly impact the outcome of a model. Early detection of data drift is crucial for ensuring user confidence in predictions. It allows the user to check if a particular model needs retraining using updated data to adapt to the ev...

  • Article
  • Open Access
10 Citations
6,476 Views
28 Pages

Deep learning models are widely used for medical image analysis and require large datasets, while sufficient high-quality medical data for training are scarce. Data augmentation has been used to improve the performance of these models. The lack of tr...

XFacebookLinkedIn
Mach. Learn. Knowl. Extr. - ISSN 2504-4990