AI | July 2025 - Browse Articles

27 pages, 3019 KB

Open AccessArticle

New Deep Learning-Based Approach for Source Code Generation: Application to Computer Vision Systems

by Wafa Alshehri, Salma Kammoun Jarraya and Arwa Allinjawi

AI 2025, 6(7), 162; https://doi.org/10.3390/ai6070162 - 21 Jul 2025

Cited by 1 | Viewed by 1680

Deep learning has enabled significant progress in source code generation, aiming to reduce the manual, error-prone, and time-consuming aspects of software development. While many existing models rely on recurrent neural networks (RNNs) with sequence-to-sequence architectures, these approaches struggle with the long and complex [...] Read more.

Deep learning has enabled significant progress in source code generation, aiming to reduce the manual, error-prone, and time-consuming aspects of software development. While many existing models rely on recurrent neural networks (RNNs) with sequence-to-sequence architectures, these approaches struggle with the long and complex token sequences typical in source code. To address this, we propose a grammar-based convolutional neural network (CNN) combined with a tree-based representation to enhance accuracy and efficiency. Our model achieves state-of-the-art results on the benchmark HEARTHSTONE dataset, with a BLEU score of 81.4 and an Acc+ of 62.1%. We further evaluate the model on our proposed dataset, AST2CVCode, designed for computer vision applications, achieving 86.2 BLEU and 51.9% EM. Additionally, we introduce BLEU+, an enhanced evaluation metric tailored for functional correctness in code generation, which achieves a BLEU+ score of 92.0% on the AST2CVCode dataset. These results demonstrate the effectiveness of our approach in both model architecture and evaluation methodology. Full article

(This article belongs to the Section AI Systems: Theory and Applications)

► Show Figures

Figure 1

18 pages, 1261 KB

Open AccessArticle

Firmware Attestation in IoT Swarms Using Relational Graph Neural Networks and Static Random Access Memory

by Abdelkabir Rouagubi, Chaymae El Youssofi and Khalid Chougdali

AI 2025, 6(7), 161; https://doi.org/10.3390/ai6070161 - 21 Jul 2025

Viewed by 1225

Abstract

The proliferation of Internet of Things (IoT) swarms—comprising billions of low-end interconnected embedded devices—has transformed industrial automation, smart homes, and agriculture. However, these swarms are highly susceptible to firmware anomalies that can propagate across nodes, posing serious security threats. To address this, we [...] Read more.

The proliferation of Internet of Things (IoT) swarms—comprising billions of low-end interconnected embedded devices—has transformed industrial automation, smart homes, and agriculture. However, these swarms are highly susceptible to firmware anomalies that can propagate across nodes, posing serious security threats. To address this, we propose a novel Remote Attestation (RA) framework for real-time firmware verification, leveraging Relational Graph Neural Networks (RGNNs) to model the graph-like structure of IoT swarms and capture complex inter-node dependencies. Unlike conventional Graph Neural Networks (GNNs), RGNNs incorporate edge types (e.g., Prompt, Sensor Data, Processed Signal), enabling finer-grained detection of propagation dynamics. The proposed method uses runtime Static Random Access Memory (SRAM) data to detect malicious firmware and its effects without requiring access to firmware binaries. Experimental results demonstrate that the framework achieves 99.94% accuracy and a 99.85% anomaly detection rate in a 4-node swarm (Swarm-1), and 100.00% accuracy with complete anomaly detection in a 6-node swarm (Swarm-2). Moreover, the method proves resilient against noise, dropped responses, and trace replay attacks, offering a robust and scalable solution for securing IoT swarms. Full article

► Show Figures

Figure 1

21 pages, 2395 KB

Open AccessArticle

A Robust Stacking-Based Ensemble Model for Predicting Cardiovascular Diseases

by Hayat Bihri, Lalla Amina Charaf, Salma Azzouzi and My El Hassan Charaf

AI 2025, 6(7), 160; https://doi.org/10.3390/ai6070160 - 21 Jul 2025

Cited by 1 | Viewed by 2143

Abstract

Background/Objectives: Cardiovascular diseases (CVDs) remain the primary cause of mortality worldwide, underscoring the critical importance of developing accurate early prediction models. In this study, we propose an advanced stacking ensemble learning framework to improve the predictive performance for CVD diagnosis. Methods: The methodology [...] Read more.

Background/Objectives: Cardiovascular diseases (CVDs) remain the primary cause of mortality worldwide, underscoring the critical importance of developing accurate early prediction models. In this study, we propose an advanced stacking ensemble learning framework to improve the predictive performance for CVD diagnosis. Methods: The methodology encompasses comprehensive data preprocessing, feature selection, cross-validation, and the construction of a stacking architecture integrating Random Forest (RF), Support Vector Machine (SVM), and CatBoost as base learners. Two meta-learning configurations were examined: Logistic Regression (LR) and a Multilayer Perceptron (MLP). Results: Experimental results indicate that the MLP-based stacking model achieves superior performance, with an accuracy of 97.06%, outperforming existing approaches reported in the literature. Furthermore, the model demonstrates high recall (96.08%) and precision (98%), confirming its robustness and generalization capacity. Conclusions: Compared to individual classifiers and traditional ensemble methods, the proposed approach yields significantly enhanced predictive outcomes, highlighting the potential of deep learning-based stacking strategies in cardiovascular risk assessment. Full article

(This article belongs to the Section Medical & Healthcare AI)

► Show Figures

Figure 1

33 pages, 534 KB

Open AccessReview

Local AI Governance: Addressing Model Safety and Policy Challenges Posed by Decentralized AI

by Bahrad A. Sokhansanj

AI 2025, 6(7), 159; https://doi.org/10.3390/ai6070159 - 17 Jul 2025

Cited by 3 | Viewed by 7129

Abstract

Policies and technical safeguards for artificial intelligence (AI) governance have implicitly assumed that AI systems will continue to operate via massive power-hungry data centers operated by large companies like Google and OpenAI. However, the present cloud-based AI paradigm is being challenged by rapidly [...] Read more.

Policies and technical safeguards for artificial intelligence (AI) governance have implicitly assumed that AI systems will continue to operate via massive power-hungry data centers operated by large companies like Google and OpenAI. However, the present cloud-based AI paradigm is being challenged by rapidly advancing software and hardware technologies. Open-source AI models now run on personal computers and devices, invisible to regulators and stripped of safety constraints. The capabilities of local-scale AI models now lag just months behind those of state-of-the-art proprietary models. Wider adoption of local AI promises significant benefits, such as ensuring privacy and autonomy. However, adopting local AI also threatens to undermine the current approach to AI safety. In this paper, we review how technical safeguards fail when users control the code, and regulatory frameworks cannot address decentralized systems as deployment becomes invisible. We further propose ways to harness local AI’s democratizing potential while managing its risks, aimed at guiding responsible technical development and informing community-led policy: (1) adapting technical safeguards for local AI, including content provenance tracking, configurable safe computing environments, and distributed open-source oversight; and (2) shaping AI policy for a decentralized ecosystem, including polycentric governance mechanisms, integrating community participation, and tailored safe harbors for liability. Full article

(This article belongs to the Section AI Systems: Theory and Applications)

► Show Figures

Figure 1

21 pages, 1118 KB

Open AccessReview

Integrating Large Language Models into Robotic Autonomy: A Review of Motion, Voice, and Training Pipelines

by Yutong Liu, Qingquan Sun and Dhruvi Rajeshkumar Kapadia

AI 2025, 6(7), 158; https://doi.org/10.3390/ai6070158 - 15 Jul 2025

Cited by 1 | Viewed by 8366

Abstract

This survey provides a comprehensive review of the integration of large language models (LLMs) into autonomous robotic systems, organized around four key pillars: locomotion, navigation, manipulation, and voice-based interaction. We examine how LLMs enhance robotic autonomy by translating high-level natural language commands into [...] Read more.

This survey provides a comprehensive review of the integration of large language models (LLMs) into autonomous robotic systems, organized around four key pillars: locomotion, navigation, manipulation, and voice-based interaction. We examine how LLMs enhance robotic autonomy by translating high-level natural language commands into low-level control signals, supporting semantic planning and enabling adaptive execution. Systems like SayTap improve gait stability through LLM-generated contact patterns, while TrustNavGPT achieves a 5.7% word error rate (WER) under noisy voice-guided conditions by modeling user uncertainty. Frameworks such as MapGPT, LLM-Planner, and 3D-LOTUS++ integrate multi-modal data—including vision, speech, and proprioception—for robust planning and real-time recovery. We also highlight the use of physics-informed neural networks (PINNs) to model object deformation and support precision in contact-rich manipulation tasks. To bridge the gap between simulation and real-world deployment, we synthesize best practices from benchmark datasets (e.g., RH20T, Open X-Embodiment) and training pipelines designed for one-shot imitation learning and cross-embodiment generalization. Additionally, we analyze deployment trade-offs across cloud, edge, and hybrid architectures, emphasizing latency, scalability, and privacy. The survey concludes with a multi-dimensional taxonomy and cross-domain synthesis, offering design insights and future directions for building intelligent, human-aligned robotic systems powered by LLMs. Full article

► Show Figures

Figure 1

20 pages, 351 KB

Open AccessArticle

Multi-Level Depression Severity Detection with Deep Transformers and Enhanced Machine Learning Techniques

by Nisar Hussain, Amna Qasim, Gull Mehak, Muhammad Zain, Grigori Sidorov, Alexander Gelbukh and Olga Kolesnikova

AI 2025, 6(7), 157; https://doi.org/10.3390/ai6070157 - 15 Jul 2025

Cited by 3 | Viewed by 2487

Abstract

Depression is now one of the most common mental health concerns in the digital era, calling for powerful computational tools for its detection and its level of severity estimation. A multi-level depression severity detection framework in the Reddit social media network is proposed [...] Read more.

Depression is now one of the most common mental health concerns in the digital era, calling for powerful computational tools for its detection and its level of severity estimation. A multi-level depression severity detection framework in the Reddit social media network is proposed in this study, and posts are classified into four levels: minimum, mild, moderate, and severe. We take a dual approach using classical machine learning (ML) algorithms and recent Transformer-based architectures. For the ML track, we build ten classifiers, including Logistic Regression, SVM, Naive Bayes, Random Forest, XGBoost, Gradient Boosting, K-NN, Decision Tree, AdaBoost, and Extra Trees, with two recently proposed embedding methods, Word2Vec and GloVe embeddings, and we fine-tune them for mental health text classification. Of these, XGBoost yields the highest F1-score of 94.01 using GloVe embeddings. For the deep learning track, we fine-tune ten Transformer models, covering BERT, RoBERTa, XLM-RoBERTa, MentalBERT, BioBERT, RoBERTa-large, DistilBERT, DeBERTa, Longformer, and ALBERT. The highest performance was achieved by the MentalBERT model, with an F1-score of 97.31, followed by RoBERTa (96.27) and RoBERTa-large (96.14). Our results demonstrate that, to the best of the authors’ knowledge, domain-transferred Transformers outperform non-Transformer-based ML methods in capturing subtle linguistic cues indicative of different levels of depression, thereby highlighting their potential for fine-grained mental health monitoring in online settings. Full article

(This article belongs to the Special Issue AI in Bio and Healthcare Informatics)

► Show Figures

Figure 1

21 pages, 2793 KB

Open AccessArticle

Link Predictions with Bi-Level Routing Attention

by Yu Wang, Shu Xu, Zenghui Ding, Cong Liu and Xianjun Yang

AI 2025, 6(7), 156; https://doi.org/10.3390/ai6070156 - 14 Jul 2025

Viewed by 1071

Abstract

Background/Objectives: Knowledge Graphs (KGs) are often incomplete, which can significantly impact the performance of downstream applications. Manual completion of KGs is time-consuming and costly, emphasizing the importance of developing automated methods for KGC. Link prediction serves as a fundamental task in this domain. [...] Read more.

Background/Objectives: Knowledge Graphs (KGs) are often incomplete, which can significantly impact the performance of downstream applications. Manual completion of KGs is time-consuming and costly, emphasizing the importance of developing automated methods for KGC. Link prediction serves as a fundamental task in this domain. The semantic correlation among entity features plays a crucial role in determining the effectiveness of link-prediction models. Notably, the human brain can often infer information using a limited set of salient features. Methods: Inspired by this cognitive principle, this paper proposes a lightweight Bi-level routing attention mechanism specifically designed for link-prediction tasks. This proposed module explores a theoretically grounded and lightweight structural design aimed at enhancing the semantic recognition capability of language models without altering their core parameters. The proposed module enhances the model’s ability to attend to feature regions with high semantic relevance. With only a marginal increase of approximately one million parameters, the mechanism effectively captures the most semantically informative features. Result: It replaces the original feature-extraction module within the KGML framework and is evaluated on the publicly available WN18RR and FB15K-237 dataset. Conclusions: Experimental results demonstrate consistent improvements in standard evaluation metrics, including Mean Rank (MR), Mean Reciprocal Rank (MRR), and Hits@10, thereby confirming the effectiveness of the proposed approach. Full article

► Show Figures

Figure 1

35 pages, 1356 KB

Open AccessArticle

Intricate and Multifaceted Socio-Ethical Dilemmas Facing the Development of Drone Technology: A Qualitative Exploration

by Hisham O. Khogali and Samir Mekid

AI 2025, 6(7), 155; https://doi.org/10.3390/ai6070155 - 13 Jul 2025

Viewed by 2793

Abstract

Background: Drones are rapidly establishing themselves as one of the most critical technologies. Robotics, automated machinery, intelligent manufacturing, and other high-impact technological research and applications bring up pressing ethical, social, legal, and political issues. Methods: The present research aims to present the results [...] Read more.

Background: Drones are rapidly establishing themselves as one of the most critical technologies. Robotics, automated machinery, intelligent manufacturing, and other high-impact technological research and applications bring up pressing ethical, social, legal, and political issues. Methods: The present research aims to present the results of a qualitative investigation that looked at perceptions of the growing socio-ethical conundrums surrounding the development of drone applications. Results: According to the obtained results, participants often share similar opinions about whether different drone applications are approved by the public, regardless of their level of experience. Perceptions of drone applications appear consistent across various levels of expertise. The most notable associations are with military objectives (73%), civil protection (61%), and passenger transit and medical purposes (56%). Applications that have received high approval include science (8.70), agriculture (8.78), and disaster management (8.87), most likely due to their obvious social benefits and reduced likelihood of ethical challenges. Conclusions: The study’s findings can help shape the debate on drone acceptability in particular contexts, inform future research on promoting value-sensitive development in society more broadly, and guide researchers and decision-makers on the use of drones, as people’s attitudes, understanding, and usage will undoubtedly impact future advancements in this technology. Full article

(This article belongs to the Special Issue Controllable and Reliable AI)

► Show Figures

Figure 1

31 pages, 2736 KB

Open AccessArticle

Unseen Attack Detection in Software-Defined Networking Using a BERT-Based Large Language Model

by Mohammed N. Swileh and Shengli Zhang

AI 2025, 6(7), 154; https://doi.org/10.3390/ai6070154 - 11 Jul 2025

Cited by 4 | Viewed by 1765

Abstract

Software-defined networking (SDN) represents a transformative shift in network architecture by decoupling the control plane from the data plane, enabling centralized and flexible management of network resources. However, this architectural shift introduces significant security challenges, as SDN’s centralized control becomes an attractive target [...] Read more.

Software-defined networking (SDN) represents a transformative shift in network architecture by decoupling the control plane from the data plane, enabling centralized and flexible management of network resources. However, this architectural shift introduces significant security challenges, as SDN’s centralized control becomes an attractive target for various types of attacks. While the body of current research on attack detection in SDN has yielded important results, several critical gaps remain that require further exploration. Addressing challenges in feature selection, broadening the scope beyond Distributed Denial of Service (DDoS) attacks, strengthening attack decisions based on multi-flow analysis, and building models capable of detecting unseen attacks that they have not been explicitly trained on are essential steps toward advancing security measures in SDN environments. In this paper, we introduce a novel approach that leverages Natural Language Processing (NLP) and the pre-trained Bidirectional Encoder Representations from Transformers (BERT)-base-uncased model to enhance the detection of attacks in SDN environments. Our approach transforms network flow data into a format interpretable by language models, allowing BERT-base-uncased to capture intricate patterns and relationships within network traffic. By utilizing Random Forest for feature selection, we optimize model performance and reduce computational overhead, ensuring efficient and accurate detection. Attack decisions are made based on several flows, providing stronger and more reliable detection of malicious traffic. Furthermore, our proposed method is specifically designed to detect previously unseen attacks, offering a solution for identifying threats that the model was not explicitly trained on. To rigorously evaluate our approach, we conducted experiments in two scenarios: one focused on detecting known attacks, achieving an accuracy, precision, recall, and F1-score of 99.96%, and another on detecting previously unseen attacks, where our model achieved 99.96% in all metrics, demonstrating the robustness and precision of our framework in detecting evolving threats, and reinforcing its potential to improve the security and resilience of SDN networks. Full article

(This article belongs to the Special Issue Artificial Intelligence for Network Management)

► Show Figures

Figure 1

26 pages, 718 KB

Open AccessReview

Advancements in Semi-Supervised Deep Learning for Brain Tumor Segmentation in MRI: A Literature Review

by Chengcheng Jin, Theam Foo Ng and Haidi Ibrahim

AI 2025, 6(7), 153; https://doi.org/10.3390/ai6070153 - 11 Jul 2025

Cited by 1 | Viewed by 2912

Abstract

For automatic tumor segmentation in magnetic resonance imaging (MRI), deep learning offers very powerful technical support with significant results. However, the success of supervised learning is strongly dependent on the quantity and accuracy of labeled training data, which is challenging to acquire in [...] Read more.

For automatic tumor segmentation in magnetic resonance imaging (MRI), deep learning offers very powerful technical support with significant results. However, the success of supervised learning is strongly dependent on the quantity and accuracy of labeled training data, which is challenging to acquire in MRI. Semi-supervised learning approaches have arisen to tackle this difficulty, yielding comparable brain tumor segmentation outcomes with fewer labeled samples. This literature review explores key semi-supervised learning techniques for medical image segmentation, including pseudo-labeling, consistency regularization, generative adversarial networks, contrastive learning, and holistic methods. We specifically examine the application of these approaches in brain tumor MRI segmentation. Our findings suggest that semi-supervised learning can outperform traditional supervised methods by providing more effective guidance, thereby enhancing the potential for clinical computer-aided diagnosis. This literature review serves as a comprehensive introduction to semi-supervised learning in tumor MRI segmentation, including glioma segmentation, offering valuable insights and a comparative analysis of current methods for researchers in the field. Full article

(This article belongs to the Section Medical & Healthcare AI)

► Show Figures

Figure 1

26 pages, 3252 KB

Open AccessArticle

Interactive Mitigation of Biases in Machine Learning Models for Undergraduate Student Admissions

by Kelly Van Busum and Shiaofen Fang

AI 2025, 6(7), 152; https://doi.org/10.3390/ai6070152 - 9 Jul 2025

Cited by 1 | Viewed by 1958

Abstract

Bias and fairness issues in artificial intelligence (AI) algorithms are major concerns, as people do not want to use software they cannot trust. Because these issues are intrinsically subjective and context-dependent, creating trustworthy software requires human input and feedback. (1) Introduction: This work [...] Read more.

Bias and fairness issues in artificial intelligence (AI) algorithms are major concerns, as people do not want to use software they cannot trust. Because these issues are intrinsically subjective and context-dependent, creating trustworthy software requires human input and feedback. (1) Introduction: This work introduces an interactive method for mitigating the bias introduced by machine learning models by allowing the user to adjust bias and fairness metrics iteratively to make the model more fair in the context of undergraduate student admissions. (2) Related Work: The social implications of bias in AI systems used in education are nuanced and can affect university reputation and student retention rates motivating a need for the development of fair AI systems. (3) Methods and Dataset: Admissions data over six years from a large urban research university was used to create AI models to predict admissions decisions. These AI models were analyzed to detect biases they may carry with respect to three variables chosen to represent sensitive populations: gender, race, and first-generation college students. We then describe a method for bias mitigation that uses a combination of machine learning and user interaction. (4) Results and Discussion: We use three scenarios to demonstrate that this interactive bias mitigation approach can successfully decrease the biases towards sensitive populations. (5) Conclusion: Our approach allows the user to examine a model and then iteratively and incrementally adjust bias and fairness metrics to change the training dataset and generate a modified AI model that is more fair, according to the user’s own determination of fairness. Full article

(This article belongs to the Special Issue Exploring the Use of Artificial Intelligence in Education)

► Show Figures

Figure 1

32 pages, 6788 KB

Open AccessArticle

Knee Osteoarthritis Detection and Classification Using Autoencoders and Extreme Learning Machines

by Jarrar Amjad, Muhammad Zaheer Sajid, Ammar Amjad, Muhammad Fareed Hamid, Ayman Youssef and Muhammad Irfan Sharif

AI 2025, 6(7), 151; https://doi.org/10.3390/ai6070151 - 8 Jul 2025

Viewed by 2293

Abstract

Background/Objectives: Knee osteoarthritis (KOA) is a prevalent disorder affecting both older adults and younger individuals, leading to compromised joint function and mobility. Early and accurate detection is critical for effective intervention, as treatment options become increasingly limited as the disease progresses. Traditional diagnostic [...] Read more.

Background/Objectives: Knee osteoarthritis (KOA) is a prevalent disorder affecting both older adults and younger individuals, leading to compromised joint function and mobility. Early and accurate detection is critical for effective intervention, as treatment options become increasingly limited as the disease progresses. Traditional diagnostic methods rely heavily on the expertise of physicians and are susceptible to errors. The demand for utilizing deep learning models in order to automate and improve the accuracy of KOA image classification has been increasing. In this research, a unique deep learning model is presented that employs autoencoders as the primary mechanism for feature extraction, providing a robust solution for KOA classification. Methods: The proposed model differentiates between KOA-positive and KOA-negative images and categorizes the disease into its primary severity levels. Levels of severity range from “healthy knees” (0) to “severe KOA” (4). Symptoms range from typical joint structures to significant joint damage, such as bone spur growth, joint space narrowing, and bone deformation. Two experiments were conducted using different datasets to validate the efficacy of the proposed model. Results: The first experiment used the autoencoder for feature extraction and classification, which reported an accuracy of 96.68%. Another experiment using autoencoders for feature extraction and Extreme Learning Machines for actual classification resulted in an even higher accuracy value of 98.6%. To test the generalizability of the Knee-DNS system, we utilized the Butterfly iQ+ IoT device for image acquisition and Google Colab’s cloud computing services for data processing. Conclusions: This work represents a pioneering application of autoencoder-based deep learning models in the domain of KOA classification, achieving remarkable accuracy and robustness. Full article

(This article belongs to the Special Issue AI in Bio and Healthcare Informatics)

► Show Figures

Figure 1

20 pages, 4752 KB

Open AccessArticle

Designing an AI-Supported Framework for Literary Text Adaptation in Primary Classrooms

by Savvas A. Chatzichristofis, Alexandros Tsopozidis, Avgousta Kyriakidou-Zacharoudiou, Salomi Evripidou and Angelos Amanatiadis

AI 2025, 6(7), 150; https://doi.org/10.3390/ai6070150 - 8 Jul 2025

Cited by 2 | Viewed by 1794

Abstract

Background/Objectives: This paper introduces a pedagogically grounded framework for transforming canonical literary texts in primary education through generative AI. Guided by multiliteracies theory, Vygotskian pedagogy, and epistemic justice, the system aims to enhance interpretive literacy, developmental alignment, and cultural responsiveness among learners aged [...] Read more.

Background/Objectives: This paper introduces a pedagogically grounded framework for transforming canonical literary texts in primary education through generative AI. Guided by multiliteracies theory, Vygotskian pedagogy, and epistemic justice, the system aims to enhance interpretive literacy, developmental alignment, and cultural responsiveness among learners aged 7–12. Methods: The proposed system enables educators to perform age-specific text simplification, visual re-narration, lexical reinvention, and multilingual augmentation through a suite of modular tools. Central to the design is the Ethical–Pedagogical Validation Layer (EPVL), a GPT-powered auditing module that evaluates AI-generated content across four normative dimensions: developmental appropriateness, cultural sensitivity, semantic fidelity, and ethical transparency. Results: The framework was fully implemented and piloted with primary educators (N = 8). The pilot demonstrated high usability, curricular alignment, and perceived value for classroom application. Unlike commercial Large Language Models (LLMs), the system requires no prompt engineering and supports editable, policy-aligned controls for normative localization. Conclusions: By embedding ethical evaluation within the generative loop, the framework fosters calibrated trust in human–AI collaboration and mitigates cultural stereotyping and ideological distortion. It advances a scalable, inclusive model for educator-centered AI integration, offering a new pathway for explainable and developmentally appropriate AI use in literary education. Full article

(This article belongs to the Special Issue AI Bias in the Media and Beyond)

► Show Figures

Figure 1

20 pages, 30273 KB

Open AccessArticle

Integrated Framework of LSTM and Physical-Informed Neural Network for Lithium-Ion Battery Degradation Modeling and Prediction

by Yan Ding, Jinqi Zhu, Yang Liu, Dan Ning and Mingyue Qin

AI 2025, 6(7), 149; https://doi.org/10.3390/ai6070149 - 7 Jul 2025

Cited by 2 | Viewed by 3074

Abstract

Accurate prediction of the State of Health (SOH) of lithium-ion batteries is essential for ensuring their safe and reliable operation. However, traditional deep learning approaches often suffer from challenges such as overfitting, limited generalization capability, and suboptimal prediction accuracy. To address these issues, [...] Read more.

Accurate prediction of the State of Health (SOH) of lithium-ion batteries is essential for ensuring their safe and reliable operation. However, traditional deep learning approaches often suffer from challenges such as overfitting, limited generalization capability, and suboptimal prediction accuracy. To address these issues, this paper proposes a novel framework that combines a Long Short-Term Memory (LSTM) network with a Physics-Informed Neural Network (PINN), referred to as LSTM-PINN, for high-precision SOH estimation. The proposed framework models battery degradation using state-space equations and extracts latent temporal features. These features are further integrated into a Deep Hidden Temporal Physical Module (DeepHTPM), which incorporates physical prior knowledge into the learning process. This integration significantly enhances the model’s ability to accurately capture the complex dynamics of battery degradation. The effectiveness of LSTM-PINN is validated using two publicly available datasets based on graphite cathode materials (NASA and CACLE). Extensive experimental results demonstrate the superior predictive performance of the proposed model, achieving Mean Absolute Errors (MAEs) of just 0.594% and 0.746% and Root Mean Square Errors (RMSEs) of 0.791% and 0.897% on the respective datasets. Our proposed LSTM-PINN framework enables accurate battery aging modeling, advancing lithium-ion battery SOH prediction for industrial applications. Full article

► Show Figures

Figure 1

19 pages, 6421 KB

Open AccessFeature PaperArticle

Automated Deadlift Techniques Assessment and Classification Using Deep Learning

by Wegar Lien Grymyr and Isah A. Lawal

AI 2025, 6(7), 148; https://doi.org/10.3390/ai6070148 - 7 Jul 2025

Viewed by 2109

Abstract

This paper explores the application of deep learning techniques for evaluating and classifying deadlift weightlifting techniques from video input. The increasing popularity of weightlifting, coupled with the injury risks associated with improper form, has heightened interest in this area of research. To address [...] Read more.

This paper explores the application of deep learning techniques for evaluating and classifying deadlift weightlifting techniques from video input. The increasing popularity of weightlifting, coupled with the injury risks associated with improper form, has heightened interest in this area of research. To address these concerns, we developed an application designed to classify three distinct styles of deadlifts: conventional, Romanian, and sumo. In addition to style classification, our application identifies common mistakes such as a rounded back, overextension at the top of the lift, and premature lifting of the hips in relation to the back. To build our model, we created a comprehensive custom dataset comprising lateral-view videos of lifters performing deadlifts, which we meticulously annotated to ensure accuracy. We adapted the MoveNet model to track keypoints on the lifter’s joints, which effectively represented their motion patterns. These keypoints not only served as visualization aids in the training of Convolutional Neural Networks (CNNs) but also acted as the primary features for Long Short-Term Memory (LSTM) models, both of which we employed to classify the various deadlift techniques. Our experimental results showed that both models achieved impressive F1-scores, reaching up to 0.99 for style and 1.00 for execution form classifications on the test dataset. Furthermore, we designed an application that integrates keypoint visualizations with motion pattern classifications. This tool provides users with valuable feedback on their performance and includes a replay feature for self-assessment, helping lifters refine their technique and reduce the risk of injury. Full article

► Show Figures

Figure 1

36 pages, 1084 KB

Open AccessArticle

Quantifying Claim Robustness Through Adversarial Framing: A Conceptual Framework for an AI-Enabled Diagnostic Tool

by Christophe Faugere

AI 2025, 6(7), 147; https://doi.org/10.3390/ai6070147 - 7 Jul 2025

Viewed by 4722

Abstract

Objectives: We introduce the conceptual framework for the Adversarial Claim Robustness Diagnostics (ACRD) protocol, a novel tool for assessing how factual claims withstand ideological distortion. Methods: Based on semantics, adversarial collaboration, and the devil’s advocate approach, we develop a three-phase evaluation process combining [...] Read more.

Objectives: We introduce the conceptual framework for the Adversarial Claim Robustness Diagnostics (ACRD) protocol, a novel tool for assessing how factual claims withstand ideological distortion. Methods: Based on semantics, adversarial collaboration, and the devil’s advocate approach, we develop a three-phase evaluation process combining baseline evaluations, adversarial speaker reframing, and dynamic AI calibration along with quantified robustness scoring. We introduce the Claim Robustness Index that constitutes our final validity scoring measure. Results: We model the evaluation of claims by ideologically opposed groups as a strategic game with a Bayesian-Nash equilibrium to infer the normative behavior of evaluators after the reframing phase. The ACRD addresses shortcomings in traditional fact-checking approaches and employs large language models to simulate counterfactual attributions while mitigating potential biases. Conclusions: The framework’s ability to identify boundary conditions of persuasive validity across polarized groups can be tested across important societal and political debates ranging from climate change issues to trade policy discourses. Full article

(This article belongs to the Special Issue AI Bias in the Media and Beyond)

► Show Figures

Figure 1

25 pages, 5231 KB

Open AccessArticle

Using AI for Optimizing Packing Design and Reducing Cost in E-Commerce

by Hayder Zghair and Rushi Ganesh Konathala

AI 2025, 6(7), 146; https://doi.org/10.3390/ai6070146 - 4 Jul 2025

Cited by 2 | Viewed by 4477

Abstract

This research explores how artificial intelligence (AI) can be leveraged to optimize packaging design, reduce operational costs, and enhance sustainability in e-commerce. As packaging waste and shipping inefficiencies grow alongside global online retail demand, traditional methods for determining box size, material use, and [...] Read more.

This research explores how artificial intelligence (AI) can be leveraged to optimize packaging design, reduce operational costs, and enhance sustainability in e-commerce. As packaging waste and shipping inefficiencies grow alongside global online retail demand, traditional methods for determining box size, material use, and logistics planning have become economically and environmentally inadequate. Using a three-phase framework, this study integrates data-driven diagnostics, AI modeling, and real-world validation. In the first phase, a systematic analysis of current packaging inefficiencies was conducted through secondary data, benchmarking, and cost modeling. Findings revealed significant waste caused by over-packaging, suboptimal box-sizing, and poor alignment between product characteristics and logistics strategy. In the second phase, a random forest (RF) machine learning model was developed to predict optimal packaging configurations using key product features: weight, volume, and fragility. This model was supported by AI simulation tools that enabled virtual testing of material performance, space efficiency, and damage risk. Results demonstrated measurable improvements in packaging optimization, cost reduction, and emission mitigation. The third phase validated the AI framework using practical case studies—ranging from a college textbook to a fragile kitchen dish set and a high-volume children’s bicycle. The model successfully recommended right-sized packaging for each product, resulting in reduced material usage, improved shipping density, and enhanced protection. Simulated cost-saving scenarios further confirmed that smart packaging and AI-generated configurations can drive efficiency. The research concludes that AI-based packaging systems offer substantial strategic value, including cost savings, environmental benefits, and alignment with regulatory and consumer expectations—providing scalable, data-driven solutions for e-commerce enterprises such as Amazon and others. Full article

(This article belongs to the Section AI Systems: Theory and Applications)

► Show Figures

Figure 1

31 pages, 9156 KB

Open AccessArticle

A Comparative Analysis of Deep Learning-Based Segmentation Techniques for Terrain Classification in Aerial Imagery

by Martina Formichini and Carlo Alberto Avizzano

AI 2025, 6(7), 145; https://doi.org/10.3390/ai6070145 - 3 Jul 2025

Cited by 1 | Viewed by 1859

Abstract

Background: Deep convolutional neural networks (CNNs) have become widely popular for many imaging applications, and they have also been applied in various studies for monitoring and mapping areas of land. Nevertheless, most of these networks were designed to perform in different scenarios, such [...] Read more.

Background: Deep convolutional neural networks (CNNs) have become widely popular for many imaging applications, and they have also been applied in various studies for monitoring and mapping areas of land. Nevertheless, most of these networks were designed to perform in different scenarios, such as autonomous driving and medical imaging. Methods: In this work, we focused on the usage of existing semantic networks applied to terrain segmentation. Even though several existing networks have been used to study land segmentation using transfer learning methodologies, a comparative analysis of how the underlying network architectures perform has not yet been conducted. Since this scenario is different from the one in which these networks were developed, featuring irregular shapes and an absence of models, not all of them can be correctly transferred to this domain. Results: Fifteen state-of-the-art neural networks were compared, and we found that, in addition to slight differences in performance, there were relevant differences in the numbers and types of outliers that were worth highlighting. Our results show that the best-performing models achieved a pixel-level class accuracy of 99.06%, with an F1-score of 72.94%, 71.5% Jaccard loss, and 88.43% recall. When investigating the outliers, we found that PSPNet, FCN, and ICNet were the most effective models. Conclusions: While most of this work was performed on an existing terrain dataset collected using aerial imagery, this approach remains valid for investigation of other datasets with more classes or richer geographical extensions. For example, a dataset composed of Copernicus images opens up new opportunities for large-scale terrain analysis. Full article

► Show Figures

Figure 1

15 pages, 2605 KB

Open AccessArticle

Automatic Weight-Bearing Foot Series Measurements Using Deep Learning

by Jordan Tanzilli, Alexandre Parpaleix, Fabien de Oliveira, Mohamed Ali Chaouch, Maxime Tardieu, Malo Huard and Aymeric Guibal

AI 2025, 6(7), 144; https://doi.org/10.3390/ai6070144 - 2 Jul 2025

Viewed by 1377

Abstract

Background: Foot deformities, particularly hallux valgus, significantly impact patients’ quality of life. Conventional radiographs are essential for their assessment, but manual measurements are time-consuming and variable. This study assessed the reliability of a deep learning-based solution (Milvue, France) that automates podiatry angle measurements [...] Read more.

Background: Foot deformities, particularly hallux valgus, significantly impact patients’ quality of life. Conventional radiographs are essential for their assessment, but manual measurements are time-consuming and variable. This study assessed the reliability of a deep learning-based solution (Milvue, France) that automates podiatry angle measurements from radiographs compared to manual measurements made by radiologists. Methods: A retrospective, non-interventional study at Perpignan Hospital analyzed the weight-bearing foot radiographs of 105 adult patients (August 2017–August 2022). The deep learning (DL) model’s measurements were compared to those of two radiologists for various angles (M1-P1, M1-M2, M1-M5, and P1-P2 for Djian–Annonier, calcaneal slope, first metatarsal slope, and Meary–Tomeno angles). Statistical analyses evaluated DL performance and inter-observer variability. Results: Of the 105 patients included (29 men and 76 women; mean age 55), the DL solution showed excellent consistency with manual measurements, except for the P1-P2 angle. The mean absolute error (MAE) for the frontal view was lowest for M1-M2 (0.96°) and highest for P1-P2 (3.16°). Intraclass correlation coefficients (ICCs) indicated excellent agreement for M1-P1, M1-M2, and M1-M5. For the lateral view, the MAE was 0.92° for calcaneal pitch and 2.83° for Meary–Tomeno, with ICCs ≥ 0.93. For hallux valgus detection, accuracy was 94%, sensitivity was 91.1%, and specificity was 97.2%. Manual measurements averaged 203 s per patient, while DL processing was nearly instantaneous. Conclusions: The DL solution reliably automates foot alignment assessments, significantly reducing time without compromising accuracy. It may improve clinical efficiency and consistency in podiatric evaluations. Full article

(This article belongs to the Section Medical & Healthcare AI)

► Show Figures

Figure 1

25 pages, 2109 KB

Open AccessArticle

Designing Artificial Intelligence: Exploring Inclusion, Diversity, Equity, Accessibility, and Safety in Human-Centric Emerging Technologies

by Matteo Zallio, Chiara Bianca Ike and Camelia Chivăran

AI 2025, 6(7), 143; https://doi.org/10.3390/ai6070143 - 2 Jul 2025

Cited by 2 | Viewed by 2676

Abstract

Background: The implementation of artificial intelligence (AI) has become a pivotal interdisciplinary challenge, creating new opportunities for sharing information, driving innovation, and transforming societal interactions with technology. While AI offers numerous benefits, its rapid evolution raises critical concerns about its impact on inclusion, [...] Read more.

Background: The implementation of artificial intelligence (AI) has become a pivotal interdisciplinary challenge, creating new opportunities for sharing information, driving innovation, and transforming societal interactions with technology. While AI offers numerous benefits, its rapid evolution raises critical concerns about its impact on inclusion, diversity, equity, accessibility, and safety (IDEAS). Method: This pilot study aimed to explore these issues and identify ways to embed the IDEAS principles into AI design. A qualitative study was conducted with industrial and academic experts in the field. Semi-structured interviews gathered insights into the opportunities, challenges, and future implications of AI from diverse professional and cultural perspectives. Result: Findings highlight uncertainties in AI’s trajectory and its profound cross-sector influence. Key issues emerged, including bias, data privacy, transparency, and accessibility. Participants stressed the need for greater awareness and structured dialogue to integrate the IDEAS principles throughout the AI lifecycle. Conclusion: This study underscores the urgency of addressing AI’s ethical and societal impacts. Embedding the IDEAS principles into its development can help mitigate risks and foster more inclusive, equitable, and accessible technologies. Full article

► Show Figures

Figure 1

14 pages, 1051 KB

Open AccessArticle

Geo-Statistics and Deep Learning-Based Algorithm Design for Real-Time Bus Geo-Location and Arrival Time Estimation Features with Load Resiliency Capacity

by Smail Tigani

AI 2025, 6(7), 142; https://doi.org/10.3390/ai6070142 - 1 Jul 2025

Viewed by 1059

Abstract

This paper introduces a groundbreaking decentralized approach for real-time bus monitoring and geo-location, leveraging advanced geo-statistical and multivariate statistical methods. The proposed long short-term memory (LSTM) model predicts bus arrival times with confidence intervals and reconstructs missing positioning data, offering cities an accurate, [...] Read more.

This paper introduces a groundbreaking decentralized approach for real-time bus monitoring and geo-location, leveraging advanced geo-statistical and multivariate statistical methods. The proposed long short-term memory (LSTM) model predicts bus arrival times with confidence intervals and reconstructs missing positioning data, offering cities an accurate, resource-efficient tracking solution within typical infrastructure limits. By employing decentralized data processing, our system significantly reduces network traffic and computational load, enabling data sharing and sophisticated analysis. Utilizing the Haversine formula, the system estimates pessimistic and optimistic arrival times, providing real-time updates and enhancing the accuracy of bus tracking. Our innovative approach optimizes real-time bus tracking and arrival time estimation, ensuring robust performance under varying traffic conditions. This research demonstrates the potential of integrating advanced statistical techniques with decentralized computing to revolutionize public transit systems. Full article

► Show Figures

Figure 1

17 pages, 1746 KB

Open AccessArticle

ODEI: Object Detector Efficiency Index

by Wenan Yuan

AI 2025, 6(7), 141; https://doi.org/10.3390/ai6070141 - 1 Jul 2025

Viewed by 1231

Abstract

Object detectors often rely on multiple metrics to reflect their accuracy and speed performances independently. This article introduces object detector efficiency index (ODEI), a hardware-agnostic metric designed to assess object detector efficiency based on speed-normalized accuracy, utilizing established concepts including mean average precision [...] Read more.

Object detectors often rely on multiple metrics to reflect their accuracy and speed performances independently. This article introduces object detector efficiency index (ODEI), a hardware-agnostic metric designed to assess object detector efficiency based on speed-normalized accuracy, utilizing established concepts including mean average precision (mAP) and floating-point operations (FLOPs). By defining seven mandatory parameters that must be specified when ODEI is invoked, the article aims to clarify long-standing confusions within literature regarding evaluation metrics and promote fair and transparent benchmarking research in the object detection space. Usage demonstration of ODEI using state-of-the-art (SOTA) YOLOv12 and RT-DETRv3 studies is also included. Full article

(This article belongs to the Topic State-of-the-Art Object Detection, Tracking, and Recognition Techniques)

► Show Figures

Figure 1

12 pages, 407 KB

Open AccessArticle

A Practice-Oriented Computational Thinking Framework for Teaching Neural Networks to Working Professionals

by Jing Tian

AI 2025, 6(7), 140; https://doi.org/10.3390/ai6070140 - 29 Jun 2025

Cited by 1 | Viewed by 1197

Abstract

Background: Conventional machine learning courses are usually designed for academic learners, instead of working professionals. This study addresses this gap by proposing a new instructional framework that builds practical computational thinking skills for developing neural network models on business data. Methods: This study [...] Read more.

Background: Conventional machine learning courses are usually designed for academic learners, instead of working professionals. This study addresses this gap by proposing a new instructional framework that builds practical computational thinking skills for developing neural network models on business data. Methods: This study proposes a five-component computational thinking framework tailed for working professionals, aligned with the standard data science pipeline and an artificial intelligence instructional taxonomy. The proposed course instructional framework consists of mixed lectures, visualization-driven and coding-driven workshops, case studies, group discussions, and gamified model tuning tasks. Results: Across 28 face-to-face course iterations conducted between 2019 and 2024, participants consistently demonstrated satisfactions in gaining computational-thinking skills. Conclusions: The tailored framework has been implemented to strengthen working professionals’ computational thinking skills for neural-network work on industrial applications. Full article

(This article belongs to the Special Issue Development of Artificial Intelligence and Computational Thinking: Future Directions, Opportunities, and Challenges)

► Show Figures

Figure 1

17 pages, 238 KB

Open AccessArticle

Leveraging ChatGPT in K-12 School Discipline: Potential Applications and Ethical Considerations

by Joseph C. Kush

AI 2025, 6(7), 139; https://doi.org/10.3390/ai6070139 - 27 Jun 2025

Viewed by 2378

Abstract

This paper investigates the utility of an Artificial Intelligence (AI) system, as it examines AI-generated output when prompted with a series of vignettes reflecting typical disciplinary challenges encountered by K-12 students. Specifically, the study focuses on possible racial biases embedded within ChatGPT, a [...] Read more.

This paper investigates the utility of an Artificial Intelligence (AI) system, as it examines AI-generated output when prompted with a series of vignettes reflecting typical disciplinary challenges encountered by K-12 students. Specifically, the study focuses on possible racial biases embedded within ChatGPT, a prominent language-based AI system. An analysis of AI-generated responses to disciplinary vignettes involving students of diverse racial backgrounds uncovered subtle yet prevalent racial biases present in the output. The findings indicate that while ChatGPT generally offered recommendations that were consistent and appropriate across racial lines, instances of pronounced and prejudicial disparities were observed. This study highlights the critical necessity of acknowledging and rectifying racial biases inherent in AI systems, especially in contexts where such technologies are utilized for school discipline. It provides guidance for educators and practitioners on the cautious use of AI-driven tools in disciplinary contexts, and emphasizes the ongoing imperative to mitigate biases in AI systems to ensure fair and equitable outcomes for all students, irrespective of race or ethnicity. Full article

(This article belongs to the Special Issue AI Bias in the Media and Beyond)

20 pages, 5431 KB

Open AccessArticle

Architectural Gaps in Generative AI: Quantifying Cognitive Risks for Safety Applications

by He Wen and Pingfan Hu

AI 2025, 6(7), 138; https://doi.org/10.3390/ai6070138 - 25 Jun 2025

Viewed by 1417

Abstract

Background: The rapid integration of generative AIs, such as ChatGPT, into industrial, process, and construction management introduces both operational advantages and emerging cognitive risks. While these models support task automation and safety analysis, their internal architecture differs fundamentally from human cognition, posing [...] Read more.

Background: The rapid integration of generative AIs, such as ChatGPT, into industrial, process, and construction management introduces both operational advantages and emerging cognitive risks. While these models support task automation and safety analysis, their internal architecture differs fundamentally from human cognition, posing interpretability and trust challenges in high-risk contexts. Methods: This study investigates whether architectural design elements in Transformer-based generative models contribute to a measurable divergence from human reasoning. A methodological framework is developed to examine core AI mechanisms—vectorization, positional encoding, attention scoring, and optimization functions—focusing on how these introduce quantifiable “distances” from human semantic understanding. Results: Through theoretical analysis and a case study involving fall prevention advice in construction, six types of architectural distances are identified and evaluated using cosine similarity and attention mapping. The results reveal misalignments in focus, semantics, and response stability, which may hinder effective human–AI collaboration in safety-critical decisions. Conclusions: These findings suggest that such distances represent not only algorithmic abstraction but also potential safety risks when generative AI is deployed in practice. The study advocates for the development of AI architectures that better reflect human cognitive structures to reduce these risks and improve reliability in safety applications. Full article

(This article belongs to the Special Issue Leveraging Simulation and Deep Learning for Enhanced Health and Safety)

► Show Figures

Figure 1

18 pages, 1987 KB

Open AccessArticle

AI-HOPE-TGFbeta: A Conversational AI Agent for Integrative Clinical and Genomic Analysis of TGF-β Pathway Alterations in Colorectal Cancer to Advance Precision Medicine

by Ei-Wen Yang, Brigette Waldrup and Enrique Velazquez-Villarreal

AI 2025, 6(7), 137; https://doi.org/10.3390/ai6070137 - 24 Jun 2025

Cited by 7 | Viewed by 1541

Abstract

Introduction: Early-onset colorectal cancer (EOCRC) is rising rapidly, particularly among the Hispanic/Latino (H/L) populations, who face disproportionately poor outcomes. The transforming growth factor-beta (TGF-β) signaling pathway plays a critical role in colorectal cancer (CRC) progression by mediating epithelial-to-mesenchymal transition (EMT), immune evasion, and [...] Read more.

Introduction: Early-onset colorectal cancer (EOCRC) is rising rapidly, particularly among the Hispanic/Latino (H/L) populations, who face disproportionately poor outcomes. The transforming growth factor-beta (TGF-β) signaling pathway plays a critical role in colorectal cancer (CRC) progression by mediating epithelial-to-mesenchymal transition (EMT), immune evasion, and metastasis. However, integrative analyses linking TGF-β alterations to clinical features remain limited—particularly for diverse populations—hindering translational research and the development of precision therapies. To address this gap, we developed AI-HOPE-TGFbeta (Artificial Intelligence agent for High-Optimization and Precision Medicine focused on TGF-β), the first conversational artificial intelligence (AI) agent designed to explore TGF-β dysregulation in CRC by integrating harmonized clinical and genomic data via natural language queries. Methods: AI-HOPE-TGFbeta utilizes a large language model (LLM), Large Language Model Meta AI 3 (LLaMA 3), a natural language-to-code interpreter, and a bioinformatics backend to automate statistical workflows. Tailored for TGF-β pathway analysis, the platform enables real-time cohort stratification and hypothesis testing using harmonized datasets from the cBio Cancer Genomics Portal (cBioPortal). It supports mutation frequency comparisons, odds ratio testing, Kaplan–Meier survival analysis, and subgroup evaluations across race/ethnicity, microsatellite instability (MSI) status, tumor stage, treatment exposure, and age. The platform was validated by replicating findings on the SMAD4, TGFBR2, and BMPR1A mutations in EOCRC. Exploratory queries were conducted to examine novel associations with clinical outcomes in H/L populations. Results: AI-HOPE-TGFbeta successfully recapitulated established associations, including worse survival in SMAD4-mutant EOCRC patients treated with FOLFOX (fluorouracil, leucovorin and oxaliplatin) (p = 0.0001) and better outcomes in early-stage TGFBR2-mutated CRC patients (p = 0.00001). It revealed potential population-specific enrichment of BMPR1A mutations in H/L patients (OR = 2.63; p = 0.052) and uncovered MSI-specific survival benefits among SMAD4-mutated patients (p = 0.00001). Exploratory analysis showed better outcomes in SMAD2-mutant primary tumors vs. metastatic cases (p = 0.0010) and confirmed the feasibility of disaggregated ethnicity-based queries for TGFBR1 mutations, despite small sample sizes. These findings underscore the platform’s capacity to detect both known and emerging clinical–genomic patterns in CRC. Conclusions: AI-HOPE-TGFbeta introduces a new paradigm in cancer bioinformatics by enabling natural language-driven, real-time integration of genomic and clinical data specific to TGF-β pathway alterations in CRC. The platform democratizes complex analyses, supports disparity-focused investigation, and reveals clinically actionable insights in underserved populations, such as H/L EOCRC patients. As a first-of-its-kind system studying TGF-β, AI-HOPE-TGFbeta holds strong promise for advancing equitable precision oncology and accelerating translational discovery in the CRC TGF-β pathway. Full article

(This article belongs to the Section Medical & Healthcare AI)

► Show Figures

Figure 1

27 pages, 2634 KB

Open AccessArticle

Enhancing Acoustic Leak Detection with Data Augmentation: Overcoming Background Noise Challenges

by Deniz Quick, Jens Denecke and Jürgen Schmidt

AI 2025, 6(7), 136; https://doi.org/10.3390/ai6070136 - 24 Jun 2025

Viewed by 1571

Abstract

A leak detection method is developed for leaks typically encountered in industrial production. Leaks of 1 mm diameter and less are considered at operating pressures up to 10 bar. The system uses two separate datasets—one for the leak noises and the other for [...] Read more.

A leak detection method is developed for leaks typically encountered in industrial production. Leaks of 1 mm diameter and less are considered at operating pressures up to 10 bar. The system uses two separate datasets—one for the leak noises and the other for the background noises—both are linked using a developed mixup technique and thus simulate leaks trained in background noises. A specific frequency window between 11 and 20 kHz is utilized to generate a quadratic input for image recognition. With this method, detection accuracies of over 95% with a false alarm rate under 2% can be achieved on a test dataset under the background noises of hydraulic machines in laboratory conditions. Full article

(This article belongs to the Section AI Systems: Theory and Applications)

► Show Figures

Figure 1

15 pages, 1949 KB

Open AccessArticle

High-Performance and Lightweight AI Model with Integrated Self-Attention Layers for Soybean Pod Number Estimation

by Qian Huang

AI 2025, 6(7), 135; https://doi.org/10.3390/ai6070135 - 24 Jun 2025

Viewed by 1237

Abstract

Background: Soybean is an important global crop in food security and agricultural economics. Accurate estimation of soybean pod counts is critical for yield prediction, breeding programs, precision farming, etc. Traditional methods, such as manual counting, are slow, labor-intensive, and prone to errors. With [...] Read more.

Background: Soybean is an important global crop in food security and agricultural economics. Accurate estimation of soybean pod counts is critical for yield prediction, breeding programs, precision farming, etc. Traditional methods, such as manual counting, are slow, labor-intensive, and prone to errors. With rapid advancements in artificial intelligence (AI), deep learning has enabled automatic pod number estimation in collaboration with unmanned aerial vehicles (UAVs). However, existing AI models are computationally demanding and require significant processing resources (e.g., memory). These resources are often not available in rural regions and small farms. Methods: To address these challenges, this study presents a set of lightweight, efficient AI models designed to overcome these limitations. By integrating model simplification, weight quantization, and squeeze-and-excitation (SE) self-attention blocks, we develop compact AI models capable of fast and accurate soybean pod count estimation. Results and Conclusions: Experimental results show a comparable estimation accuracy of 84–87%, while the AI model size is significantly reduced by a factor of 9–65, thus making them suitable for deployment in edge devices, such as Raspberry Pi. Compared to existing models such as YOLO POD and SoybeanNet, which rely on over 20 million parameters to achieve approximately 84% accuracy, our proposed lightweight models deliver a comparable or even higher accuracy (84.0–86.76%) while using fewer than 2 million parameters. In future work, we plan to expand the dataset by incorporating diverse soybean images to enhance model generalizability. Additionally, we aim to explore more advanced attention mechanisms—such as CBAM or ECA—to further improve feature extraction and model performance. Finally, we aim to implement the complete system in edge devices and conduct real-world testing in soybean fields. Full article

► Show Figures

Figure 1

12 pages, 639 KB

Open AccessArticle

Identification of Perceptual Phonetic Training Gains in a Second Language Through Deep Learning

by Georgios P. Georgiou

AI 2025, 6(7), 134; https://doi.org/10.3390/ai6070134 - 23 Jun 2025

Cited by 1 | Viewed by 1641

Abstract

Background/Objectives: While machine learning has made substantial strides in pronunciation detection in recent years, there remains a notable gap in the literature regarding research on improvements in the acquisition of speech sounds following a training intervention, especially in the domain of perception. This [...] Read more.

Background/Objectives: While machine learning has made substantial strides in pronunciation detection in recent years, there remains a notable gap in the literature regarding research on improvements in the acquisition of speech sounds following a training intervention, especially in the domain of perception. This study addresses this gap by developing a deep learning algorithm designed to identify perceptual gains resulting from second language (L2) phonetic training. Methods: The participants underwent multiple sessions of high-variability phonetic training, focusing on discriminating challenging L2 vowel contrasts. The deep learning model was trained on perceptual data collected before and after the intervention. Results: The results demonstrated good model performance across a range of metrics, confirming that learners’ gains in phonetic training could be effectively detected by the algorithm. Conclusions: This research underscores the potential of deep learning techniques to track improvements in phonetic training, offering a promising and practical approach for evaluating language learning outcomes and paving the way for more personalized, adaptive language learning solutions. Deep learning enables the automatic extraction of complex patterns in learner behavior that might be missed by traditional methods. This makes it especially valuable in educational contexts where subtle improvements need to be captured and assessed objectively. Full article

► Show Figures

Figure 1

26 pages, 4271 KB

Open AccessArticle

Machine Learning-Based Predictive Maintenance for Photovoltaic Systems

by Ali Al-Humairi, Enmar Khalis, Zuhair A. Al-Hemyari and Peter Jung

AI 2025, 6(7), 133; https://doi.org/10.3390/ai6070133 - 20 Jun 2025

Cited by 3 | Viewed by 5381

Abstract

The performance of photovoltaic systems is highly dependent on environmental conditions, with soiling due to dust accumulation often being referred to as a predominant energy degradation factor, especially in dry and semi-arid environments. This paper introduces an AI-based robotic cleaning system that can [...] Read more.

The performance of photovoltaic systems is highly dependent on environmental conditions, with soiling due to dust accumulation often being referred to as a predominant energy degradation factor, especially in dry and semi-arid environments. This paper introduces an AI-based robotic cleaning system that can independently forecast and schedule cleaning sessions from real-time sensor and environmental data. Methods: The system integrates sources of data like embedded sensors, weather stations, and DustIQ data to create an integrated dataset for predictive modeling. Machine learning models were employed to forecast soiling loss based on significant atmospheric parameters such as relative humidity, air pressure, ambient temperature, and wind speed. Dimensionality reduction through the principal component analysis and correlation-based feature selection enhanced the model performance as well as the interpretability. A comparative study of four conventional machine learning models, including logistic regression, k-nearest neighbors, decision tree, and support vector machine, was conducted to determine the most appropriate approach to classifying cleaning needs. Results: Performance, based on accuracy, precision, recall, and F1-score, demonstrated that logistic regression and SVM provided optimal classification performance with accuracy levels over 92%, and F1-scores over 0.90, demonstrating outstanding balance between recall and precision. The KNN and decision tree models, while slightly poorer in terms of accuracy (around 85–88%), had computational efficiency benefits, making them suitable for utilization in resource-constrained applications. Conclusions: The proposed system employs a dry-cleaning mechanism that requires no water, making it highly suitable for arid regions. It reduces unnecessary cleaning operations by approximately 30%, leading to decreased mechanical wear and lower maintenance costs. Additionally, by minimizing delays in necessary cleaning, the system can improve annual energy yield by 3–5% under high-soiling conditions. Overall, the intelligent cleaning schedule minimizes manual intervention, enhances sustainability, reduces operating costs, and improves system performance in challenging environments. Full article

► Show Figures

Figure 1

Journal Menu

Journal Browser

AI, Volume 6, Issue 7 (July 2025) – 32 articles

Further Information

Guidelines

MDPI Initiatives

Follow MDPI