Computers

17 pages, 2534 KiB

Open AccessArticle

LeakPred: An Approach for Identifying Components with Resource Leaks in Android Mobile Applications

by Josias Gomes Lima, Rafael Giusti and Arilo Claudio Dias-Neto

Computers 2024, 13(6), 140; https://doi.org/10.3390/computers13060140 (registering DOI) - 3 Jun 2024

Context: Mobile devices contain some resources, for example, the camera, battery, and memory, that are allocated, used, and then deallocated by mobile applications. Whenever a resource is allocated and not correctly released, a defect called a resource leak occurs, which can cause [...] Read more.

Context: Mobile devices contain some resources, for example, the camera, battery, and memory, that are allocated, used, and then deallocated by mobile applications. Whenever a resource is allocated and not correctly released, a defect called a resource leak occurs, which can cause crashes and slowdowns. Objective: In this study, we intended to demonstrate the usefulness of the LeakPred approach in terms of the number of components with resource leak problems identified in applications. Method: We compared the approach’s effectiveness with three state-of-the-art methods in identifying leaks in 15 Android applications. Result: LeakPred obtained the best median (85.37%) of components with identified leaks, the best coverage (96.15%) of the classes of leaks that could be identified in the applications, and an accuracy of 81.25%. The Android Lint method achieved the second best median (76.92%) and the highest accuracy (100%), but only covered 1.92% of the leak classes. Conclusions: LeakPred is effective in identifying leaky components in applications. Full article

(This article belongs to the Topic Artificial Intelligence Models, Tools and Applications)

► Show Figures

Figure 1

26 pages, 8365 KiB

Open AccessArticle

High-Performance Computing Storage Performance and Design Patterns—Btrfs and ZFS Performance for Different Use Cases

by Vedran Dakic, Mario Kovac and Igor Videc

Computers 2024, 13(6), 139; https://doi.org/10.3390/computers13060139 (registering DOI) - 3 Jun 2024

Abstract

Filesystems are essential components in contemporary computer systems that organize and manage data. Their performance is crucial in various applications, from web servers to data storage systems. This paper helps to pick the suitable filesystem by comparing btrfs with ZFS by considering multiple [...] Read more.

Filesystems are essential components in contemporary computer systems that organize and manage data. Their performance is crucial in various applications, from web servers to data storage systems. This paper helps to pick the suitable filesystem by comparing btrfs with ZFS by considering multiple situations and applications, ranging from sequential and random performance in the most common use cases to extreme use cases like high-performance computing (HPC). It showcases each option’s benefits and drawbacks, considering different usage scenarios. The performance of btrfs and ZFS will be evaluated through rigorous testing. They will assess their capabilities in handling huge files, managing numerous small files, and the speed of data read and write across varied usage levels. The analysis indicates no definitive answer; the selection of the optimal filesystem is contingent upon individual data-access requirements. Full article

► Show Figures

Figure 1

16 pages, 902 KiB

Open AccessArticle

Insights into How to Enhance Container Terminal Operations with Digital Twins

by Marvin Kastner, Nicolò Saporiti, Ann-Kathrin Lange and Tommaso Rossi

Computers 2024, 13(6), 138; https://doi.org/10.3390/computers13060138 - 30 May 2024

Abstract

The years 2021 and 2022 showed that maritime logistics are prone to interruptions. Ports especially turned out to be bottlenecks with long queues of waiting vessels. This leads to the question of whether this can be (at least partly) mitigated by means of [...] Read more.

The years 2021 and 2022 showed that maritime logistics are prone to interruptions. Ports especially turned out to be bottlenecks with long queues of waiting vessels. This leads to the question of whether this can be (at least partly) mitigated by means of better and more flexible terminal operations. Digital Twins have been in use in production and logistics to increase flexibility in operations and to support operational decision-making based on real-time information. However, the true potential of Digital Twins to enhance terminal operations still needs to be further investigated. A Delphi study is conducted to explore the operational pain points, the best practices to counter them, and how these best practices can be supported by Digital Twins. A questionnaire with 16 propositions is developed, and a panel of 17 experts is asked for their degrees of confirmation for each. The results indicate that today’s terminal operations are far from ideal, and leave space for optimisation. The experts see great potential in analysing the past working shift data to identify the reasons for poor terminal performance. Moreover, they agree on the proposed best practices and support the use of emulation for detailed ad hoc simulation studies to improve operational decision-making. Full article

(This article belongs to the Special Issue IT in Production and Logistics)

► Show Figures

Figure 1

27 pages, 4827 KiB

Open AccessArticle

Machine Learning-Based Crop Yield Prediction in South India: Performance Analysis of Various Models

by Uppugunduri Vijay Nikhil, Athiya M. Pandiyan, S. P. Raja and Zoran Stamenkovic

Computers 2024, 13(6), 137; https://doi.org/10.3390/computers13060137 - 29 May 2024

Viewed by 176

Abstract

Agriculture is one of the most important activities that produces crop and food that is crucial for the sustenance of a human being. In the present day, agricultural products and crops are not only used for local demand, but globalization has allowed us [...] Read more.

Agriculture is one of the most important activities that produces crop and food that is crucial for the sustenance of a human being. In the present day, agricultural products and crops are not only used for local demand, but globalization has allowed us to export produce to other countries and import from other countries. India is an agricultural nation and depends a lot on its agricultural activities. Prediction of crop production and yield is a necessary activity that allows farmers to estimate storage, optimize resources, increase efficiency and decrease costs. However, farmers usually predict crops based on the region, soil, weather conditions and the crop itself based on experience and estimates which may not be very accurate especially with the constantly changing and unpredictable climactic conditions of the present day. To solve this problem, we aim to predict the production and yield of various crops such as rice, sorghum, cotton, sugarcane and rabi using Machine Learning (ML) models. We train these models with the weather, soil and crop data to predict future crop production and yields of these crops. We have compiled a dataset of attributes that impact crop production and yield from specific states in India and performed a comprehensive study of the performance of various ML Regression Models in predicting crop production and yield. The results indicated that the Extra Trees Regressor achieved the highest performance among the models examined. It attained a R-Squared score of 0.9615 and showed lowest Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE) of 21.06 and 33.99. Following closely behind are the Random Forest Regressor and LGBM Regressor, achieving R-Squared scores of 0.9437 and 0.9398 respectively. Moreover, additional analysis revealed that tree-based models, showing a R-Squared score of 0.9353, demonstrate better performance compared to linear and neighbors-based models, which achieved R-Squared scores of 0.8568 and 0.9002 respectively. Full article

(This article belongs to the Special Issue Harnessing Artificial Intelligence for Social and Semantic Understanding)

► Show Figures

Figure 1

44 pages, 4162 KiB

Open AccessReview

Object Tracking Using Computer Vision: A Review

by Pushkar Kadam, Gu Fang and Ju Jia Zou

Computers 2024, 13(6), 136; https://doi.org/10.3390/computers13060136 - 28 May 2024

Viewed by 281

Abstract

Object tracking is one of the most important problems in computer vision applications such as robotics, autonomous driving, and pedestrian movement. There has been a significant development in camera hardware where researchers are experimenting with the fusion of different sensors and developing image [...] Read more.

Object tracking is one of the most important problems in computer vision applications such as robotics, autonomous driving, and pedestrian movement. There has been a significant development in camera hardware where researchers are experimenting with the fusion of different sensors and developing image processing algorithms to track objects. Image processing and deep learning methods have significantly progressed in the last few decades. Different data association methods accompanied by image processing and deep learning are becoming crucial in object tracking tasks. The data requirement for deep learning methods has led to different public datasets that allow researchers to benchmark their methods. While there has been an improvement in object tracking methods, technology, and the availability of annotated object tracking datasets, there is still scope for improvement. This review contributes by systemically identifying different sensor equipment, datasets, methods, and applications, providing a taxonomy about the literature and the strengths and limitations of different approaches, thereby providing guidelines for selecting equipment, methods, and applications. Research questions and future scope to address the unresolved issues in the object tracking field are also presented with research direction guidelines. Full article

(This article belongs to the Special Issue Advanced Image Processing and Computer Vision)

► Show Figures

Figure 1

22 pages, 3497 KiB

Open AccessArticle

Two-Phase Fuzzy Real-Time Approach for Fuzzy Demand Electric Vehicle Routing Problem with Soft Time Windows

by Mohamed A. Wahby Shalaby and Sally S. Kassem

Computers 2024, 13(6), 135; https://doi.org/10.3390/computers13060135 - 27 May 2024

Viewed by 181

Abstract

Environmental concerns have called for several measures to be taken within the logistics and transportation fields. Among these measures is the adoption of electric vehicles instead of diesel-operated vehicles for personal and commercial-delivery use. The optimized routing of electric vehicles for the commercial [...] Read more.

Environmental concerns have called for several measures to be taken within the logistics and transportation fields. Among these measures is the adoption of electric vehicles instead of diesel-operated vehicles for personal and commercial-delivery use. The optimized routing of electric vehicles for the commercial delivery of products is the focus of this paper. We study the effect of several practical challenges that are faced when routing electric vehicles. Electric vehicle routing faces the additional challenge of the potential need for recharging while en route, leading to more travel time, and hence cost. Therefore, in this work, we address the issue of electric vehicle routing problem, allowing for partial recharging while en route. In addition, the practical mandate of the time windows set by customers is also considered, where electric vehicle routing problems with soft time windows are studied. Real-life experience shows that the delivery of customers’ demands might be uncertain. In addition, real-time traffic conditions are usually uncertain due to congestion. Therefore, in this work, uncertainties in customers’ demands and traffic conditions are modeled and solved using fuzzy methods. The problems of fuzzy real-time, fuzzy demand, and electric vehicle routing problems with soft time windows are addressed. A mixed-integer programming mathematical model to represent the problem is developed. A novel two-phase solution approach is proposed to solve the problem. In phase I, the classical genetic algorithm (GA) is utilized to obtain an optimum/near-optimum solution for the fuzzy demand electric vehicle routing problem with soft time windows (FD-EVRPSTW). In phase II, a novel fuzzy real-time-adaptive optimizer (FRTAO) is developed to overcome the challenges of recharging and real-time traffic conditions facing FD-EVRPSTW. The proposed solution approach is tested on several modified benchmark instances, and the results show the significance of recharging and congestion challenges for routing costs. In addition, the results show the efficiency of the proposed two-phase approach in overcoming the challenges and reducing the total costs. Full article

(This article belongs to the Special Issue Recent Advances in Autonomous Vehicle Solutions)

► Show Figures

Figure 1

16 pages, 23626 KiB

Open AccessArticle

DCTE-LLIE: A Dual Color-and-Texture-Enhancement-Based Method for Low-Light Image Enhancement

by Hua Wang, Jianzhong Cao, Lei Yang and Jijiang Huang

Computers 2024, 13(6), 134; https://doi.org/10.3390/computers13060134 - 27 May 2024

Viewed by 230

Abstract

The enhancement of images captured under low-light conditions plays a vitally important role in the area of image processing and can significantly affect the performance of following operations. In recent years, deep learning techniques have been leveraged in the area of low-light image [...] Read more.

The enhancement of images captured under low-light conditions plays a vitally important role in the area of image processing and can significantly affect the performance of following operations. In recent years, deep learning techniques have been leveraged in the area of low-light image enhancement tasks, and deep-learning-based low-light image enhancement methods have been the mainstream for low-light image enhancement tasks. However, due to the inability of existing methods to effectively maintain the color distribution of the original input image and to effectively handle feature descriptions at different scales, the final enhanced image exhibits color distortion and local blurring phenomena. So, in this paper, a novel dual color-and-texture-enhancement-based low-light image enhancement method is proposed, which can effectively enhance low-light images. Firstly, a novel color enhancement block is leveraged to help maintain color distribution during the enhancement process, which can further eliminate the color distortion effect; after that, an attention-based multiscale texture enhancement block is proposed to help the network focus on multiscale local regions and extract more reliable texture representations automatically, and a fusion strategy is leveraged to fuse the multiscale feature representations automatically and finally generate the enhanced reflection component. The experimental results on public datasets and real-world low-light images established the effectiveness of the proposed method on low-light image enhancement tasks. Full article

► Show Figures

Figure 1

19 pages, 3161 KiB

Open AccessArticle

Modeling and Analysis of Dekker-Based Mutual Exclusion Algorithms

by Libero Nigro, Franco Cicirelli and Francesco Pupo

Computers 2024, 13(6), 133; https://doi.org/10.3390/computers13060133 - 25 May 2024

Viewed by 324

Abstract

Mutual exclusion is a fundamental problem in concurrent/parallel/distributed systems. The first pure-software solution to this problem for two processes, which is not based on hardware instructions like test-and-set, was proposed in 1965 by Th.J. Dekker and communicated by E.W. Dijkstra. The correctness of [...] Read more.

Mutual exclusion is a fundamental problem in concurrent/parallel/distributed systems. The first pure-software solution to this problem for two processes, which is not based on hardware instructions like test-and-set, was proposed in 1965 by Th.J. Dekker and communicated by E.W. Dijkstra. The correctness of this algorithm has generally been studied under the strong memory model, where the read and write operations on a memory cell are atomic or indivisible. In recent years, some variants of the algorithm have been proposed to make it RW-safe when using the weak memory model, which makes it possible, e.g., for multiple read operations to occur simultaneously to a write operation on the same variable, with the read operations returning (flickering) a non-deterministic value. This paper proposes a novel approach to formal modeling and reasoning on a mutual exclusion algorithm using Timed Automata and the Uppaal tool, and it applies this approach through exhaustive model checking to conduct a thorough analysis of the Dekker’s algorithm and some of its variants proposed in the literature. This paper aims to demonstrate that model checking, although necessarily limited in the scalability of the number

N

of the processes due to the state explosions problem, is effective yet powerful for reasoning on concurrency and process action interleaving, and it can provide significant results about the correctness and robustness of the basic version and variants of the Dekker’s algorithm under both the strong and weak memory models. In addition, the properties of these algorithms are also carefully studied in the context of a tournament-based binary tree for

N \geq 2

processes. Full article

(This article belongs to the Special Issue Best Practices, Challenges and Opportunities in Software Engineering)

► Show Figures

Figure 1

26 pages, 2756 KiB

Open AccessArticle

A Blockchain-Based Electronic Health Record (EHR) System for Edge Computing Enhancing Security and Cost Efficiency

by Valerio Mandarino, Giuseppe Pappalardo and Emiliano Tramontana

Computers 2024, 13(6), 132; https://doi.org/10.3390/computers13060132 - 24 May 2024

Viewed by 331

Abstract

Blockchain technology offers unique features, such as transparency, the immutability of data, and the capacity to establish trust without a central authority. Such characteristics can be leveraged to support the collaboration among several different software systems operating within the healthcare ecosystem, while ensuring [...] Read more.

Blockchain technology offers unique features, such as transparency, the immutability of data, and the capacity to establish trust without a central authority. Such characteristics can be leveraged to support the collaboration among several different software systems operating within the healthcare ecosystem, while ensuring data integrity and make electronic health records (EHRs) more easily accessible. To provide a solution based on blockchain technology, this paper has evaluated the main issues that arise when large amounts of data are expected, i.e., mainly cost and performance. A balanced approach that maximizes the benefits and mitigates the constraints of the blockchain has been designed. The proposed decentralized application (dApp) architecture employs a hybrid storage strategy that involves storing medical records locally, on users’ devices, while utilizing blockchain to manage an index of these data. The dApp clients facilitate interactions among participants, leveraging a smart contract to enable patients to set authorization policies, thereby ensuring that only designated healthcare providers and authorized entities have access to specific medical records. The blockchain data-immutability property is used to validate data stored externally. This solution significantly reduces the costs related to the utilization of the blockchain, while retaining its advantages, and improves performance, since the majority of data are available off-chain. Full article

(This article belongs to the Special Issue When Blockchain Meets IoT: Challenges and Potentials)

► Show Figures

Figure 1

31 pages, 2472 KiB

Open AccessArticle

A Step-by-Step Methodology for Obtaining the Reliability of Building Microgrids Using Fault TreeAnalysis

by Gustavo A. Patiño-Álvarez, Johan S. Arias-Pérez and Nicolás Muñoz-Galeano

Computers 2024, 13(6), 131; https://doi.org/10.3390/computers13060131 - 24 May 2024

Viewed by 274

Abstract

This paper introduces an improved methodology designed to address a practical deficit of existing methodologies by incorporating circuit-level analysis in the assessment of building microgrid reliability. The scientific problem at hand involves devising a systematic approach that integrates circuit modeling, Probability Density Function [...] Read more.

This paper introduces an improved methodology designed to address a practical deficit of existing methodologies by incorporating circuit-level analysis in the assessment of building microgrid reliability. The scientific problem at hand involves devising a systematic approach that integrates circuit modeling, Probability Density Function (PDF) selection, formulation of reliability functions, and Fault Tree Analysis (FTA) tailored specifically for the distinctive features of building microgrids. This method entails analyzing inter-component relationships to gain comprehensive insights into system behavior. By harnessing the circuit models and theoretical framework proposed herein, precise estimations of microgrid failure rates can be attained. To complement this approach, we propose a thorough investigation utilizing reliability curves and importance measures, providing valuable insights into individual device failure probabilities over time. Such time-based analysis plays a crucial role in proactively identifying potential failures and facilitating efficient maintenance planning for microgrid devices. We demonstrate the application of this methodology to the University of Antioquia (UdeA) Microgrid, a low-voltage system comprising critical components such as solar panels, microinverters, inverters/chargers, batteries, and charge controllers. Full article

► Show Figures

Figure 1

23 pages, 516 KiB

Open AccessArticle

Exploiting Anytime Algorithms for Collaborative Service Execution in Edge Computing

by Luís Nogueira, Jorge Coelho and David Pereira

Computers 2024, 13(6), 130; https://doi.org/10.3390/computers13060130 - 23 May 2024

Viewed by 232

Abstract

The diversity and scarcity of resources across devices in heterogeneous computing environments can impact their ability to meet users’ quality-of-service (QoS) requirements, especially in open real-time environments where computational loads are unpredictable. Despite this uncertainty, timely responses to events remain essential to ensure [...] Read more.

The diversity and scarcity of resources across devices in heterogeneous computing environments can impact their ability to meet users’ quality-of-service (QoS) requirements, especially in open real-time environments where computational loads are unpredictable. Despite this uncertainty, timely responses to events remain essential to ensure desired performance levels. To address this challenge, this paper introduces collaborative service execution, enabling resource-constrained IoT devices to collaboratively execute services with more powerful neighbors at the edge, thus meeting non-functional requirements that might be unattainable through individual execution. Nodes dynamically form clusters, allocating resources to each service and establishing initial configurations that maximize QoS satisfaction while minimizing global QoS impact. However, the complexity of open real-time environments may hinder the computation of optimal local and global resource allocations within reasonable timeframes. Thus, we reformulate the QoS optimization problem as a heuristic-based anytime optimization problem, capable of interrupting and quickly adapting to environmental changes. Extensive simulations demonstrate that our anytime algorithms rapidly yield satisfactory initial service solutions and effectively optimize the solution quality over iterations, with negligible overhead compared to the benefits gained. Full article

(This article belongs to the Special Issue Intelligent Edge: When AI Meets Edge Computing)

14 pages, 968 KiB

Open AccessArticle

Robust Algorithms for the Analysis of Fast-Field-Cycling Nuclear Magnetic Resonance Dispersion Curves

by Villiam Bortolotti, Pellegrino Conte, Germana Landi, Paolo Lo Meo, Anastasiia Nagmutdinova, Giovanni Vito Spinelli and Fabiana Zama

Computers 2024, 13(6), 129; https://doi.org/10.3390/computers13060129 - 23 May 2024

Viewed by 259

Abstract

Fast-Field-Cycling (FFC) Nuclear Magnetic Resonance (NMR) relaxometry is a powerful, non-destructive magnetic resonance technique that enables, among other things, the investigation of slow molecular dynamics at low magnetic field intensities. FFC-NMR relaxometry measurements provide insight into molecular motion across various timescales within a [...] Read more.

Fast-Field-Cycling (FFC) Nuclear Magnetic Resonance (NMR) relaxometry is a powerful, non-destructive magnetic resonance technique that enables, among other things, the investigation of slow molecular dynamics at low magnetic field intensities. FFC-NMR relaxometry measurements provide insight into molecular motion across various timescales within a single experiment. This study focuses on a model-free approach, representing the NMRD profile

R_{1}

as a linear combination of Lorentzian functions, thereby addressing the challenges of fitting data within an ill-conditioned linear least-squares framework. Tackling this problem, we present a comprehensive review and experimental validation of three regularization approaches to implement the model-free approach to analyzing NMRD profiles. These include (1) MF-UPen, utilizing locally adapted

L_{2}

regularization; (2) MF-L1, based on

L_{1}

penalties; and (3) a hybrid approach combining locally adapted

L_{2}

and global

L_{1}

penalties. Each method’s regularization parameters are determined automatically according to the Balancing and Uniform Penalty principles. Our contributions include the implementation and experimental validation of the MF-UPen and MF-MUPen algorithms, and the development of a “dispersion analysis” technique to assess the existence range of the estimated parameters. The objective of this work is to delineate the variance in fit quality and correlation time distribution yielded by each algorithm, thus broadening the set of software tools for the analysis of sample structures in FFC-NMR studies. The findings underline the efficacy and applicability of these algorithms in the analysis of NMRD profiles from samples representing different potential scenarios. Full article

► Show Figures

Figure 1

27 pages, 2040 KiB

Open AccessArticle

Machine Learning Decision System on the Empirical Analysis of the Actual Usage of Interactive Entertainment: A Perspective of Sustainable Innovative Technology

by Rex Revian A. Guste and Ardvin Kester S. Ong

Computers 2024, 13(6), 128; https://doi.org/10.3390/computers13060128 - 23 May 2024

Viewed by 335

Abstract

This study focused on the impact of Netflix’s interactive entertainment on Filipino consumers, seamlessly combining vantage points from consumer behavior and employing data analytics. This underlines the revolutionary aspect of interactive entertainment in the quickly expanding digital media ecosystem, particularly as Netflix pioneers [...] Read more.

This study focused on the impact of Netflix’s interactive entertainment on Filipino consumers, seamlessly combining vantage points from consumer behavior and employing data analytics. This underlines the revolutionary aspect of interactive entertainment in the quickly expanding digital media ecosystem, particularly as Netflix pioneers fresh content distribution techniques. The main objective of this study was to find the factors impacting the real usage of Netflix’s interactive entertainment among Filipino viewers, filling a critical gap in the existing literature. The major goal of using advanced data analytics techniques in this study was to understand the subtle dynamics affecting customer behavior in this setting. Specifically, the random forest classifier with hard and soft classifiers was assessed. The random forest compared to LightGBM was also employed, alongside the different algorithms of the artificial neural network. Purposive sampling was used to obtain responses from 258 people who had experienced Netflix’s interactive entertainment, resulting in a comprehensive dataset. The findings emphasized the importance of hedonic motivation, underlining the requirement for highly engaging and rewarding interactive material. Customer service and device compatibility, for example, have a significant impact on user uptake. Furthermore, behavioral intention and habit emerged as key drivers, revealing interactive entertainment’s long-term influence on user engagement. Practically, the research recommends strategic platform suggestions that emphasize continuous innovation, user-friendly interfaces, and user-centric methods. This study was able to fill in the gap in the literature on interactive entertainment, which contributes to a better understanding of consumer consumption and lays the groundwork for future research in the dynamic field of digital media. Moreover, this study offers essential insights into the intricate interaction of consumer preferences, technology breakthroughs, and societal influences in the ever-expanding environment of digital entertainment. Lastly, the comparative approach to the use of machine learning algorithms provides insights for future works to adopt and employ among human factors and consumer behavior-related studies. Full article

(This article belongs to the Special Issue Harnessing Artificial Intelligence for Social and Semantic Understanding)

► Show Figures

Figure 1

24 pages, 5079 KiB

Open AccessArticle

Machine Learning for Predicting Key Factors to Identify Misinformation in Football Transfer News

by Ife Runsewe, Majid Latifi, Mominul Ahsan and Julfikar Haider

Computers 2024, 13(6), 127; https://doi.org/10.3390/computers13060127 - 23 May 2024

Viewed by 307

Abstract

The spread of misinformation in football transfer news has become a growing concern. To address this challenge, this study introduces a novel approach by employing ensemble learning techniques to identify key factors for predicting such misinformation. The performance of three ensemble learning models, [...] Read more.

The spread of misinformation in football transfer news has become a growing concern. To address this challenge, this study introduces a novel approach by employing ensemble learning techniques to identify key factors for predicting such misinformation. The performance of three ensemble learning models, namely Random Forest, AdaBoost, and XGBoost, was analyzed on a dataset of transfer rumours. Natural language processing (NLP) techniques were employed to extract structured data from the text, and the veracity of each rumor was verified using factual transfer data. The study also investigated the relationships between specific features and rumor veracity. Key predictive features such as a player’s market value, age, and timing of the transfer window were identified. The Random Forest model outperformed the other two models, achieving a cross-validated accuracy of 95.54%. The top features identified by the model were a player’s market value, time to the start/end of the transfer window, and age. The study revealed weak negative relationships between a player’s age, time to the start/end of the transfer window, and rumor veracity, suggesting that for older players and times further from the transfer window, rumors are slightly less likely to be true. In contrast, a player’s market value did not have a statistically significant relationship with rumor veracity. This study contributes to the existing knowledge of misinformation detection and ensemble learning techniques. Despite some limitations, this study has significant implications for media agencies, football clubs, and fans. By discerning the credibility of transfer news, stakeholders can make informed decisions, reduce the spread of misinformation, and foster a more transparent transfer market. Full article

► Show Figures

Figure 1

17 pages, 3025 KiB

Open AccessArticle

An Improved Ensemble-Based Cardiovascular Disease Detection System with Chi-Square Feature Selection

by Ayad E. Korial, Ivan Isho Gorial and Amjad J. Humaidi

Computers 2024, 13(6), 126; https://doi.org/10.3390/computers13060126 - 22 May 2024

Viewed by 359

Abstract

Cardiovascular disease (CVD) is a leading cause of death globally; therefore, early detection of CVD is crucial. Many intelligent technologies, including deep learning and machine learning (ML), are being integrated into healthcare systems for disease prediction. This paper uses a voting ensemble ML [...] Read more.

Cardiovascular disease (CVD) is a leading cause of death globally; therefore, early detection of CVD is crucial. Many intelligent technologies, including deep learning and machine learning (ML), are being integrated into healthcare systems for disease prediction. This paper uses a voting ensemble ML with chi-square feature selection to detect CVD early. Our approach involved applying multiple ML classifiers, including naïve Bayes, random forest, logistic regression (LR), and k-nearest neighbor. These classifiers were evaluated through metrics including accuracy, specificity, sensitivity, F1-score, confusion matrix, and area under the curve (AUC). We created an ensemble model by combining predictions from the different ML classifiers through a voting mechanism, whose performance was then measured against individual classifiers. Furthermore, we applied chi-square feature selection method to the 303 records across 13 clinical features in the Cleveland cardiac disease dataset to identify the 5 most important features. This approach improved the overall accuracy of our ensemble model and reduced the computational load considerably by more than 50%. Demonstrating superior effectiveness, our voting ensemble model achieved a remarkable accuracy of 92.11%, representing an average improvement of 2.95% over the single highest classifier (LR). These results indicate the ensemble method as a viable and practical approach to improve the accuracy of CVD prediction. Full article

(This article belongs to the Special Issue Machine and Deep Learning in the Health Domain 2024)

► Show Figures

Figure 1

Journal Menu

Journal Browser

Computers, Volume 13, Issue 6 (June 2024) – 15 articles

Further Information

Guidelines

MDPI Initiatives

Follow MDPI