Knowledge Engineering and Data Mining, 3rd Edition

A special issue of Electronics (ISSN 2079-9292). This special issue belongs to the section "Computer Science & Engineering".

Deadline for manuscript submissions: 31 January 2026 | Viewed by 9419

Special Issue Editors


E-Mail Website
Guest Editor
Faculty of Computer Science and Information Technology, West Pomeranian University of Technology Szczecin, Zolnierska 49, 71-210 Szczecin, Poland
Interests: ontology; knowledge representation; semantic web technologies; OWL; RDF; knowledge engineering; knowledge bases; knowledge management; reasoning; information extraction; ontology learning; sustainability; sustainability assessment; ontology evaluation
Special Issues, Collections and Topics in MDPI journals

E-Mail Website
Guest Editor
Institute of Computer Science, Faculty of Science and Technology, University of Silesia, ul. Będzińska 39, 41-200 Sosnowiec, Poland
Interests: knowledge representation and reasoning; rule-based knowledge bases; outliers mining; expert systems; decision support systems; information retrieval systems
Special Issues, Collections and Topics in MDPI journals

Special Issue Information

Dear Colleagues,

Extracting knowledge from data is a fundamental process in the creation of intelligent information retrieval systems, decision support, and knowledge management. This Special Issue welcomes the submission of research that addresses data mining methods, multidimensional data analysis, supervised and unsupervised learning methods, methods of knowledge base management, language ontologies, ontology learning, and others. We encourage you to present novel algorithms and work on practical solutions, i.e., applications/systems presenting the real-world application of the proposed research achievements.

The Special Issue covers the entire process of knowledge engineering, from data acquisition and data mining to knowledge extraction and exploitation. This Special Issue therefore encourages researchers to contribute to a collective effort that promotes the comprehension of trends and future questions in the field of knowledge engineering and data mining. Topics include, but are not limited to, the following:

  • knowledge acquisition and engineering;
  • data mining methods;
  • big knowledge analytics;
  • data mining, knowledge discovery, and machine learning;
  • knowledge modeling and processing;
  • knowledge acquisition and engineering;
  • query and natural language processing;
  • data and information modeling;
  • data and information semantics;
  • data-intensive applications;
  • knowledge representation and reasoning;
  • decision support systems;
  • decision-making;
  • group decision-making;
  • rules mining;
  • outliers mining;
  • data exploration;
  • data science;
  • semantic web data and linked data;
  • ontologies and controlled vocabularies;
  • data acquisition;
  • multidimensional data analysis;
  • artificial intelligence and knowledge management;
  • knowledge representation in artificial intelligence;
  • supervised and unsupervised learning methods;
  • parallel processing and modeling;
  • languages based on parallel programming and data mining.

Dr. Agnieszka Konys
Prof. Dr. Agnieszka Nowak-Brzezińska
Guest Editors

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Electronics is an international peer-reviewed open access semimonthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2400 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

  • knowledge engineering
  • knowledge representation and reasoning
  • decision support systems
  • knowledge acquisition
  • outliers mining
  • decision making
  • data mining
  • data science
  • data exploration
  • multidimensional data analysis
  • supervised and unsupervised learning methods
  • ontology
  • knowledge-based systems
  • ontology learning
  • artificial intelligence
  • knowledge management
  • methods of knowledge base management
  • parallel processing and modeling
  • languages based on parallel programming and data mining

Benefits of Publishing in a Special Issue

  • Ease of navigation: Grouping papers by topic helps scholars navigate broad scope journals more efficiently.
  • Greater discoverability: Special Issues support the reach and impact of scientific research. Articles in Special Issues are more discoverable and cited more frequently.
  • Expansion of research network: Special Issues facilitate connections among authors, fostering scientific collaborations.
  • External promotion: Articles in Special Issues are often promoted through the journal's social media, increasing their visibility.
  • Reprint: MDPI Books provides the opportunity to republish successful Special Issues in book format, both online and in print.

Further information on MDPI's Special Issue policies can be found here.

Related Special Issues

Published Papers (5 papers)

Order results
Result details
Select all
Export citation of selected articles as:

Research

Jump to: Review

38 pages, 4944 KB  
Article
Integrated Survey Classification and Trend Analysis via LLMs: An Ensemble Approach for Robust Literature Synthesis
by Eleonora Bernasconi, Domenico Redavid and Stefano Ferilli
Electronics 2025, 14(17), 3404; https://doi.org/10.3390/electronics14173404 - 27 Aug 2025
Viewed by 776
Abstract
This study proposes a novel, scalable framework for the automated classification and synthesis of survey literature by integrating state-of-the-art Large Language Models (LLMs) with robust ensemble voting techniques. The framework consolidates predictions from three independent models—GPT-4, LLaMA 3.3, and Claude 3—to generate consensus-based [...] Read more.
This study proposes a novel, scalable framework for the automated classification and synthesis of survey literature by integrating state-of-the-art Large Language Models (LLMs) with robust ensemble voting techniques. The framework consolidates predictions from three independent models—GPT-4, LLaMA 3.3, and Claude 3—to generate consensus-based classifications, thereby enhancing reliability and mitigating individual model biases. We demonstrate the generalizability of our approach through comprehensive evaluation on two distinct domains: Question Answering (QA) systems and Computer Vision (CV) survey literature, using a dataset of 1154 real papers extracted from arXiv. Comprehensive visual evaluation tools, including distribution charts, heatmaps, confusion matrices, and statistical validation metrics, are employed to rigorously assess model performance and inter-model agreement. The framework incorporates advanced statistical measures, including k-fold cross-validation, Fleiss’ kappa for inter-rater reliability, and chi-square tests for independence to validate classification robustness. Extensive experimental evaluations demonstrate that this ensemble approach achieves superior performance compared to individual models, with accuracy improvements of 10.0% over the best single model on QA literature and 10.9% on CV literature. Furthermore, comprehensive cost–benefit analysis reveals that our automated approach reduces manual literature synthesis time by 95% while maintaining high classification accuracy (F1-score: 0.89 for QA, 0.87 for CV), making it a practical solution for large-scale literature analysis. The methodology effectively uncovers emerging research trends and persistent challenges across domains, providing researchers with powerful tools for continuous literature monitoring and informed decision-making in rapidly evolving scientific fields. Full article
(This article belongs to the Special Issue Knowledge Engineering and Data Mining, 3rd Edition)
Show Figures

Figure 1

36 pages, 2903 KB  
Article
Improving Education Predictions Through Reasoning by Analogy and Causal Relationships Applied to Smart Exploitation of Data
by Antonio Lorenzo, José A. Olivas, Francisco P. Romero and Jesus Serrano-Guerrero
Electronics 2025, 14(12), 2339; https://doi.org/10.3390/electronics14122339 - 7 Jun 2025
Viewed by 612
Abstract
To make predictions, one can use machine learning and/or knowledge-based approaches. Knowledge-based approaches focus on developing systems with reasoning capabilities to solve application problems. Traditionally, statistical techniques have been used, while more recently, machine learning techniques have been used to make predictions. Both [...] Read more.
To make predictions, one can use machine learning and/or knowledge-based approaches. Knowledge-based approaches focus on developing systems with reasoning capabilities to solve application problems. Traditionally, statistical techniques have been used, while more recently, machine learning techniques have been used to make predictions. Both types of techniques are based almost exclusively on the analysis of historical data. This paper proposes a model that combines knowledge engineering and intelligent data analysis, leveraging the causal relationship between a past event and its known consequences. By determining the similarity between a current analogous situation and the past event, the model infers what the consequences of the current situation might be. The main contribution is the combination of various knowledge engineering techniques to improve the prediction outcomes for certain events. The present approach not only relies on analysing historical data but also integrates smart data utilization, the identification of the most similar past event, and the prediction or definition of cause–effect rules based on causal inference. One use case is presented: predicting the percentage of students who are promoted to the next grade with all subjects passed over the four years of middle school. Applying statistical regression techniques, a predicted value of 68.67% was obtained. Applying the proposed model, a value of 62.85% was obtained. The actual value published by the Spanish Department of Education for the 2021–2022 school year was 63.95%. The prediction using statistical techniques deviated 7.3% from the actual value. The proposed method deviated only 1.7% from the actual value. The proposed method improved the prediction compared to the value obtained using statistical techniques. Full article
(This article belongs to the Special Issue Knowledge Engineering and Data Mining, 3rd Edition)
Show Figures

Figure 1

18 pages, 447 KB  
Article
A k-Means Algorithm with Automatic Outlier Detection
by Guojun Gan
Electronics 2025, 14(9), 1723; https://doi.org/10.3390/electronics14091723 - 23 Apr 2025
Viewed by 1313
Abstract
Data clustering is a fundamental machine learning task found in many real-world applications. However, real data usually contain noise or outliers. Handling outliers in a clustering algorithm can improve the clustering accuracy. In this paper, we propose a variant of the k-means [...] Read more.
Data clustering is a fundamental machine learning task found in many real-world applications. However, real data usually contain noise or outliers. Handling outliers in a clustering algorithm can improve the clustering accuracy. In this paper, we propose a variant of the k-means algorithm to provide data clustering and outlier detection simultaneously. In the proposed algorithm, outlier detection is integrated with the clustering process and is achieved via a term added to the objective function of the k-means algorithm. The proposed algorithm generates two partition matrices: one provides cluster groups and the other can be used to detect outliers. We use both synthetic data and real data to demonstrate the effectiveness and efficiency of the proposed algorithm and show that the clustering performance of the proposed approach is better than other, similar methods. Full article
(This article belongs to the Special Issue Knowledge Engineering and Data Mining, 3rd Edition)
Show Figures

Figure 1

17 pages, 692 KB  
Article
Modeling Investment Decisions Through Decision Tree Regression—A Behavioral Finance Theory Approach
by Dana Rad, Lavinia Denisia Cuc, Gabriel Croitoru, Bogdan Cosmin Gomoi, Luminița Mazuru, Raluca Simina Bilți, Sergiu Rusu, Maria Sinaci and Florentina Simona Barbu
Electronics 2025, 14(8), 1505; https://doi.org/10.3390/electronics14081505 - 9 Apr 2025
Cited by 1 | Viewed by 2385
Abstract
This study examines the key factors influencing investment decisions through decision tree regression, grounded in behavioral finance theory. By analyzing a comprehensive dataset incorporating behavioral, demographic, and financial variables—including investment attitudes, decision-making behaviors, financial education, age, income, and education—this study identifies significant predictors [...] Read more.
This study examines the key factors influencing investment decisions through decision tree regression, grounded in behavioral finance theory. By analyzing a comprehensive dataset incorporating behavioral, demographic, and financial variables—including investment attitudes, decision-making behaviors, financial education, age, income, and education—this study identifies significant predictors of investment outcomes. While the model shows moderate predictive performance (R2 = 0.185; MAPE = 172.96%), it identifies hierarchical relationships among behavioral, cognitive, and demographic predictors. These results highlight the complexity of investment decisions and the need for integrative, behavioral-driven approaches in predictive modeling. Investment attitudes (25.88%), decision-making behaviors (19.53%), and financial education (16.68%) emerge as the most influential variables, while traditional demographic factors such as income and age have a lower impact. The hierarchical structure of the decision tree highlights critical decision-making patterns, particularly regarding speculative behaviors and investment attitudes. These findings challenge classical models of rationality by emphasizing the dominant role of behavioral factors in investment decision making. This study contributes to bridging computational modeling with financial economics, demonstrating the utility of decision tree regression in uncovering complex investor behavior. Practical implications include enhancing personalized financial advisory services and designing targeted financial literacy programs to improve decision-making efficiency. These insights, while exploratory, can guide future research and decision-support systems in behavioral finance. Full article
(This article belongs to the Special Issue Knowledge Engineering and Data Mining, 3rd Edition)
Show Figures

Figure 1

Review

Jump to: Research

33 pages, 1322 KB  
Review
Outlier Detection in Streaming Data for Telecommunications and Industrial Applications: A Survey
by Roland N. Mfondoum, Antoni Ivanov, Pavlina Koleva, Vladimir Poulkov and Agata Manolova
Electronics 2024, 13(16), 3339; https://doi.org/10.3390/electronics13163339 - 22 Aug 2024
Cited by 3 | Viewed by 3538
Abstract
Streaming data are present all around us. From traditional radio systems streaming audio to today’s connected end-user devices constantly sending information or accessing services, data are flowing constantly between nodes across various networks. The demand for appropriate outlier detection (OD) methods in the [...] Read more.
Streaming data are present all around us. From traditional radio systems streaming audio to today’s connected end-user devices constantly sending information or accessing services, data are flowing constantly between nodes across various networks. The demand for appropriate outlier detection (OD) methods in the fields of fault detection, special events detection, and malicious activities detection and prevention is not only persistent over time but increasing, especially with the recent developments in Telecommunication systems such as Fifth Generation (5G) networks facilitating the expansion of the Internet of Things (IoT). The process of selecting a computationally efficient OD method, adapted for a specific field and accounting for the existence of empirical data, or lack thereof, is non-trivial. This paper presents a thorough survey of OD methods, categorized by the applications they are implemented in, the basic assumptions that they use according to the characteristics of the streaming data, and a summary of the emerging challenges, such as the evolving structure and nature of the data and their dimensionality and temporality. A categorization of commonly used datasets in the context of streaming data is produced to aid data source identification for researchers in this field. Based on this, guidelines for OD method selection are defined, which consider flexibility and sample size requirements and facilitate the design of such algorithms in Telecommunications and other industries. Full article
(This article belongs to the Special Issue Knowledge Engineering and Data Mining, 3rd Edition)
Show Figures

Figure 1

Back to TopTop