Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (80)

Search Parameters:
Keywords = CRISP-DM

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
23 pages, 32193 KB  
Article
Object Detection on Road: Vehicle’s Detection Based on Re-Training Models on NVIDIA-Jetson Platform
by Sleiter Ramos-Sanchez, Jinmi Lezama, Ricardo Yauri and Joyce Zevallos
J. Imaging 2026, 12(1), 20; https://doi.org/10.3390/jimaging12010020 - 1 Jan 2026
Viewed by 470
Abstract
The increasing use of artificial intelligence (AI) and deep learning (DL) techniques has driven advances in vehicle classification and detection applications for embedded devices with deployment constraints due to computational cost and response time. In the case of urban environments with high traffic [...] Read more.
The increasing use of artificial intelligence (AI) and deep learning (DL) techniques has driven advances in vehicle classification and detection applications for embedded devices with deployment constraints due to computational cost and response time. In the case of urban environments with high traffic congestion, such as the city of Lima, it is important to determine the trade-off between model accuracy, type of embedded system, and the dataset used. This study was developed using a methodology adapted from the CRISP-DM approach, which included the acquisition of traffic videos in the city of Lima, their segmentation, and manual labeling. Subsequently, three SSD-based detection models (MobileNetV1-SSD, MobileNetV2-SSD-Lite, and VGG16-SSD) were trained on the NVIDIA Jetson Orin NX 16 GB platform. The results show that the VGG16-SSD model achieved the highest average precision (mAP 90.7%), with a longer training time, while the MobileNetV1-SSD (512×512) model achieved comparable performance (mAP 90.4%) with a shorter time. Additionally, data augmentation through contrast adjustment improved the detection of minority classes such as Tuk-tuk and Motorcycle. The results indicate that, among the evaluated models, MobileNetV1-SSD (512×512) achieved the best balance between accuracy and computational load for its implementation in ADAS embedded systems in congested urban environments. Full article
(This article belongs to the Special Issue Advances in Machine Learning for Computer Vision Applications)
Show Figures

Figure 1

30 pages, 1993 KB  
Article
Artificial Intelligence Pipeline for Mammography-Based Breast Cancer Detection: An Integrated Systematic Review and Large-Scale Experimental Validation
by Daniel Añez, Giuseppe Conti, Juan José Uriarte, José-Javier Serrano-Olmedo, Ricardo Martínez-Murillo and Oscar Casanova-Carvajal
Medicina 2025, 61(12), 2237; https://doi.org/10.3390/medicina61122237 - 18 Dec 2025
Viewed by 839
Abstract
Background and Objectives: Breast cancer remains a leading cause of cancer-related morbidity and mortality worldwide, and robust, interpretable artificial intelligence (AI) pipelines are increasingly being explored to support mammography-based detection. This study combines a PRISMA 2020-compliant systematic review with an original experimental validation [...] Read more.
Background and Objectives: Breast cancer remains a leading cause of cancer-related morbidity and mortality worldwide, and robust, interpretable artificial intelligence (AI) pipelines are increasingly being explored to support mammography-based detection. This study combines a PRISMA 2020-compliant systematic review with an original experimental validation to characterize current evidence and address identified gaps in reproducibility and interpretability. Materials and Methods: A PRISMA 2020-guided systematic review and an original experimental study were conducted. The review searched PubMed and Scopus/ScienceDirect for studies using convolutional neural networks (CNNs), support vector machines (SVMs) or eXtreme Gradient Boosting (XGBoost) for breast cancer detection in mammography and related imaging modalities, and identified 45 eligible articles. In parallel, we implemented and evaluated representative CNN (ResNet-50, EfficientNetB0 and MobileNetV3-Small) and classical machine learning (SVM and XGBoost) pipelines on the CBIS-DDSM dataset, following a CRISP-DM-inspired workflow and using Grad-CAM and SHAP to provide image- and feature-level explanations within a reproducible machine-learning-operations (MLOps)-oriented framework. Results: The systematic review revealed substantial heterogeneity in datasets, preprocessing pipelines, and validation strategies, with a predominant reliance on internal validation and limited use of explainable AI methods. In our experimental evaluation, ResNet-50 achieved the best performance (AUC-ROC 0.95; sensitivity 89%), followed by XGBoost (AUC-ROC 0.90; sensitivity 74%) and SVM (AUC-ROC 0.84; sensitivity 66%), while EfficientNetB0 and MobileNetV3-Small showed lower discrimination. Grad-CAM produced qualitatively plausible heatmaps centered on annotated lesions, and SHAP analyses indicated that simple global image-intensity and size descriptors dominated the predictions of the classical models. Conclusions: By integrating systematic evidence and large-scale experiments on CBIS-DDSM, this study highlights both the potential and the limitations of current AI pipelines for mammography-based breast cancer detection, underscoring the need for more standardized preprocessing, rigorous external validation, and routine use of explainable AI before clinical deployment. Full article
Show Figures

Figure 1

45 pages, 12590 KB  
Article
An End-to-End Data and Machine Learning Pipeline for Energy Forecasting: A Systematic Approach Integrating MLOps and Domain Expertise
by Xun Zhao, Zheng Grace Ma and Bo Nørregaard Jørgensen
Information 2025, 16(9), 805; https://doi.org/10.3390/info16090805 - 16 Sep 2025
Viewed by 2417
Abstract
Energy forecasting is critical for modern power systems, enabling proactive grid control and efficient resource optimization. However, energy forecasting projects require systematic approaches that span project inception to model deployment while ensuring technical excellence, domain alignment, regulatory compliance, and reproducibility. Existing methodologies such [...] Read more.
Energy forecasting is critical for modern power systems, enabling proactive grid control and efficient resource optimization. However, energy forecasting projects require systematic approaches that span project inception to model deployment while ensuring technical excellence, domain alignment, regulatory compliance, and reproducibility. Existing methodologies such as CRISP-DM provide a foundation but lack explicit mechanisms for iterative feedback, decision checkpoints, and continuous energy-domain-expert involvement. This paper proposes a modular end-to-end framework for energy forecasting that integrates formal decision gates in each phase, embeds domain-expert validation, and produces fully traceable artifacts. The framework supports controlled iteration, rollback, and automation within an MLOps-compatible structure. A comparative analysis demonstrates its advantages in functional coverage, workflow logic, and governance over existing approaches. A case study on short-term electricity forecasting for a 2560 m2 office building validates the framework, achieving 24-h-ahead predictions with an RNN, reaching an RMSE of 1.04 kWh and an MAE of 0.78 kWh. The results confirm that the framework enhances forecast accuracy, reliability, and regulatory readiness in real-world energy applications. Full article
Show Figures

Figure 1

38 pages, 2993 KB  
Article
CRISP-NET: Integration of the CRISP-DM Model with Network Analysis
by Héctor Alejandro Acuña-Cid, Eduardo Ahumada-Tello, Óscar Omar Ovalle-Osuna, Richard Evans, Julia Elena Hernández-Ríos and Miriam Alondra Zambrano-Soto
Mach. Learn. Knowl. Extr. 2025, 7(3), 101; https://doi.org/10.3390/make7030101 - 16 Sep 2025
Viewed by 2253
Abstract
To carry out data analysis, it is necessary to implement a model that guides the process in an orderly and sequential manner, with the aim of maintaining control over software development and its documentation. One of the most widely used tools in the [...] Read more.
To carry out data analysis, it is necessary to implement a model that guides the process in an orderly and sequential manner, with the aim of maintaining control over software development and its documentation. One of the most widely used tools in the field of data analysis is the Cross-Industry Standard Process for Data Mining (CRISP-DM), which serves as a reference framework for data mining, allowing the identification of patterns and, based on them, supporting informed decision-making. Another tool used for pattern identification and the study of relationships within systems is network analysis (NA), which makes it possible to explore how different components are interconnected. The integration of these tools can be justified and developed under the principles of Situational Method Engineering (SME), which allows for the adaptation and customization of existing methods according to the specific needs of a problem or context. Through SME, it is possible to determine which components of CRISP-DM need to be adjusted to efficiently incorporate NA, ensuring that this integration aligns with the project’s objectives in a structured and effective manner. The proposed methodological process was applied in a real working group, which allowed its functionality to be validated, each phase to be documented, and concrete outputs to be generated, demonstrating its usefulness for the development of analytical projects. Full article
Show Figures

Figure 1

19 pages, 3919 KB  
Article
The Estimation of the Remaining Useful Life of Ceramic Plates Used in Iron Ore Filtration Through a Reliability Model and Machine Learning Methods Applied to Industrial Process Variables of a Pims
by Robert Bento Florentino and Luiz Gustavo Lourenço Moura
Appl. Sci. 2025, 15(14), 8081; https://doi.org/10.3390/app15148081 - 21 Jul 2025
Viewed by 656
Abstract
The intensive use of various sensors in industrial machines has the potential to indicate the real-time health status of critical equipment. This is achieved through the connectivity of their automation systems (PIMS and MES), enabling the optimization of the preventive maintenance interval, a [...] Read more.
The intensive use of various sensors in industrial machines has the potential to indicate the real-time health status of critical equipment. This is achieved through the connectivity of their automation systems (PIMS and MES), enabling the optimization of the preventive maintenance interval, a reduction in corrective maintenance and safety-related failures, an increase in productivity and reliability and a reduction in maintenance costs. Through the use of the CRISP-DM data analysis methodology, the fault logs of ceramic plates applied in an iron ore filtration process are coupled with sensor readings of the process variables over the time of operation to create exponential survival models via two techniques: a multiple linear regression model with averaged data and a random forest regression machine learning model with individual instant data. The instantaneous reliability of ceramic plates is then used in the online prediction of the remaining useful life of the components. The model obtained from the instantaneous reading of 12 sensors led to the estimation of the remaining useful life for ceramic plates with up to 5600 h of use, allowing the adoption of a strategy of replacing these components by condition instead of replacing them by a fixed time, leading to increased process reliability and improved stock planning. The linear regression model for reliability prediction had an R2 of 78.32%, whereas the random forest regression model had an R2 of 63.7%. The final model for predicting the remaining useful life had an R2 of 99.6%. Full article
Show Figures

Figure 1

23 pages, 1590 KB  
Article
A Decision Support System for Classifying Suppliers Based on Machine Learning Techniques: A Case Study in the Aeronautics Industry
by Ana Claudia Andrade Ferreira, Alexandre Ferreira de Pinho, Matheus Brendon Francisco, Laercio Almeida de Siqueira and Guilherme Augusto Vilas Boas Vasconcelos
Computers 2025, 14(7), 271; https://doi.org/10.3390/computers14070271 - 10 Jul 2025
Viewed by 1782
Abstract
This paper presents the application of four machine learning algorithms to segment suppliers in a real case. The algorithms used were K-Means, Hierarchical K-Means, Agglomerative Nesting (AGNES), and Fuzzy Clustering. The analyzed company has suppliers that have been clustered using responses such as [...] Read more.
This paper presents the application of four machine learning algorithms to segment suppliers in a real case. The algorithms used were K-Means, Hierarchical K-Means, Agglomerative Nesting (AGNES), and Fuzzy Clustering. The analyzed company has suppliers that have been clustered using responses such as the number of non-conformities, location, and quantity supplied, among others. The CRISP-DM methodology was used for the work development. The proposed methodology is important for both industry and academia, as it helps managers make decisions about the quality of their suppliers and compares the use of four different algorithms for this purpose, which is an important insight for new studies. The K-Means algorithm obtained the best performance both for the metrics obtained and the simplicity of use. It is important to highlight that no studies to date have been conducted using the four algorithms proposed here applied in an industrial case, and this work shows this application. The use of artificial intelligence in industry is essential in this Industry 4.0 era for companies to make decisions, i.e., to have ways to make better decisions using data-driven concepts. Full article
Show Figures

Figure 1

32 pages, 2007 KB  
Article
Exploring the Relationship Between Project Characteristics and Time–Cost Deviations for Colombian Rural Roads
by Jose Quintero, Alexander Murgas, Adriana Gómez-Cabrera and Omar Sánchez
Infrastructures 2025, 10(7), 178; https://doi.org/10.3390/infrastructures10070178 - 9 Jul 2025
Cited by 1 | Viewed by 2782
Abstract
Rural road programs are essential for enhancing connectivity in remote areas, yet they frequently encounter schedule delays and budget overruns. This study explores the extent to which specific project characteristics influence these deviations in Colombian rural road contracts. A dataset comprising 229 projects [...] Read more.
Rural road programs are essential for enhancing connectivity in remote areas, yet they frequently encounter schedule delays and budget overruns. This study explores the extent to which specific project characteristics influence these deviations in Colombian rural road contracts. A dataset comprising 229 projects was extracted from the national SECOP open-procurement platform and processed using the CRISP-DM protocol. Following the cleaning and coding of 14 project-level variables, statistical analyses were conducted using Spearman correlations, Kruskal–Wallis tests, and post-hoc Wilcoxon comparisons to identify significant bivariate relations I confirm I confirm I confirm hips. A Random Forest model was subsequently applied to determine the most influential multivariate predictors of cost and time deviations. In parallel, a directed content analysis of contract addenda reclassified 22 recorded deviation descriptors into ten internationally recognized categories of causality, enabling an integrated interpretation of both statistical and documentary evidence. The findings indicate that contract value, geographical region, and contractor configuration are significant determinants of cost and time performance. Additionally, project intensity and discrepancies between awarded and bid values emerged as key contributors to cost escalation. Scope changes and adverse weather conditions together accounted for 76% of all documented deviation triggers, underscoring the relevance of robust front-end planning and climate-risk considerations in rural infrastructure delivery. The findings provide information for stakeholders, policymakers, and professionals who aim to manage the risk of schedule and budget deviations in public infrastructure projects. Full article
Show Figures

Figure 1

28 pages, 3098 KB  
Article
Proactive Complaint Management in Public Sector Informatics Using AI: A Semantic Pattern Recognition Framework
by Marco Esperança, Diogo Freitas, Pedro V. Paixão, Tomás A. Marcos, Rafael A. Martins and João C. Ferreira
Appl. Sci. 2025, 15(12), 6673; https://doi.org/10.3390/app15126673 - 13 Jun 2025
Cited by 3 | Viewed by 5528
Abstract
The digital transformation of public services has led to a surge in the volume and complexity of informatics-related complaints, often marked by ambiguous language, inconsistent terminology, and fragmented reporting. Conventional keyword-based approaches are inadequate for detecting semantically similar issues expressed in diverse ways. [...] Read more.
The digital transformation of public services has led to a surge in the volume and complexity of informatics-related complaints, often marked by ambiguous language, inconsistent terminology, and fragmented reporting. Conventional keyword-based approaches are inadequate for detecting semantically similar issues expressed in diverse ways. This study proposes an AI-powered framework that employs BERT-based sentence embeddings, semantic clustering, and classification algorithms, structured under the CRISP-DM methodology, to standardize and automate complaint analysis. Leveraging real-world interaction logs from a public sector agency, the system harmonizes heterogeneous complaint narratives, uncovers latent issue patterns, and enables early detection of technical and usability problems. The approach is deployed through a real-time dashboard, transforming complaint handling from a reactive to a proactive process. Experimental results show a 27% reduction in repeated complaint categories and a 32% increase in classification efficiency. The study also addresses ethical concerns, including data governance, bias mitigation, and model transparency. This work advances citizen-centric service delivery by demonstrating the scalable application of AI in public sector informatics. Full article
Show Figures

Figure 1

24 pages, 1212 KB  
Article
Comparative Evaluation of Automatic Detection and Classification of Daily Living Activities Using Batch Learning and Stream Learning Algorithms
by Paula Sofía Muñoz, Ana Sofía Orozco, Jaime Pabón, Daniel Gómez, Ricardo Salazar-Cabrera, Jesús D. Cerón, Diego M. López and Bernd Blobel
J. Pers. Med. 2025, 15(5), 208; https://doi.org/10.3390/jpm15050208 - 20 May 2025
Viewed by 977
Abstract
Background/Objectives: Activities of Daily Living (ADLs) are crucial for assessing an individual’s autonomy, encompassing tasks such as eating, dressing, and moving around, among others. Predicting these activities is part of health monitoring, elderly care, and intelligent systems, improving quality of life, and facilitating [...] Read more.
Background/Objectives: Activities of Daily Living (ADLs) are crucial for assessing an individual’s autonomy, encompassing tasks such as eating, dressing, and moving around, among others. Predicting these activities is part of health monitoring, elderly care, and intelligent systems, improving quality of life, and facilitating early dependency detection, all of which are relevant components of personalized health and social care. However, the automatic classification of ADLs from sensor data remains challenging due to high variability in human behavior, sensor noise, and discrepancies in data acquisition protocols. These challenges limit the accuracy and applicability of existing solutions. This study details the modeling and evaluation of real-time ADL classification models based on batch learning (BL) and stream learning (SL) algorithms. Methods: The methodology followed is the Cross-Industry Standard Process for Data Mining (CRISP-DM). The models were trained with a comprehensive dataset integrating 23 ADL-centric datasets using accelerometers and gyroscopes data. The data were preprocessed by applying normalization and sampling rate unification techniques, and finally, relevant sensor locations on the body were selected. Results: After cleaning and debugging, a final dataset was generated, containing 238,990 samples, 56 activities, and 52 columns. The study compared models trained with BL and SL algorithms, evaluating their performance under various classification scenarios using accuracy, area under the curve (AUC), and F1-score metrics. Finally, a mobile application was developed to classify ADLs in real time (feeding data from a dataset). Conclusions: The outcome of this study can be used in various data science projects related to ADL and Human activity recognition (HAR), and due to the integration of diverse data sources, it is potentially useful to address bias and improve generalizability in Machine Learning models. The principal advantage of online learning algorithms is dynamically adapting to data changes, representing a significant advance in personal autonomy and health care monitoring. Full article
Show Figures

Figure 1

11 pages, 12478 KB  
Article
Computer Vision-Based Obstacle Detection Mobile System for Visually Impaired Individuals
by Gisel Katerine Bastidas-Guacho, Mario Alejandro Paguay Alvarado, Patricio Xavier Moreno-Vallejo, Patricio Rene Moreno-Costales, Nayely Samanta Ocaña Yanza and Jhon Carlos Troya Cuestas
Multimodal Technol. Interact. 2025, 9(5), 48; https://doi.org/10.3390/mti9050048 - 18 May 2025
Cited by 2 | Viewed by 3644
Abstract
Traditional tools, such as canes, are no longer enough to subsist the mobility and orientation of visually impaired people in complex environments. Therefore, technological solutions based on computer vision tasks are presented as promising alternatives to help detect obstacles. Object detection models are [...] Read more.
Traditional tools, such as canes, are no longer enough to subsist the mobility and orientation of visually impaired people in complex environments. Therefore, technological solutions based on computer vision tasks are presented as promising alternatives to help detect obstacles. Object detection models are easy to couple to mobile systems, do not require a large consumption of resources on mobile phones, and act in real-time to alert users of the presence of obstacles. However, existing object detectors were mostly trained with images from platforms such as Kaggle, and the number of existing objects is still limited. For this reason, this study proposes to implement a mobile system that integrates an object detection model for the identification of obstacles intended for visually impaired people. Additionally, the mobile application integrates multimodal feedback through auditory and haptic interaction, ensuring that users receive real-time obstacle alerts via voice guidance and vibrations, further enhancing accessibility and responsiveness in different navigation contexts. The chosen scenario to develop the obstacle detection application is the Specialized Educational Unit Dr. Luis Benavides for impaired people, which is the source of images for building the dataset for the model and evaluating it with impaired individuals. To determine the best model, the performance of YOLO is evaluated by means of a precision adjustment through the variation of epochs, using a proprietary data set of 7600 diverse images. The YOLO-300 model turned out to be the best, with a mAP of 0.42. Full article
Show Figures

Figure 1

53 pages, 1551 KB  
Article
From Crisis to Algorithm: Credit Delinquency Prediction in Peru Under Critical External Factors Using Machine Learning
by Jomark Noriega, Luis Rivera, Jorge Castañeda and José Herrera
Data 2025, 10(5), 63; https://doi.org/10.3390/data10050063 - 28 Apr 2025
Cited by 1 | Viewed by 2662
Abstract
Robust credit risk prediction in emerging economies increasingly demands the integration of external factors (EFs) beyond borrowers’ control. This study introduces a scenario-based methodology to incorporate EF—namely COVID-19 severity (mortality and confirmed cases), climate anomalies (temperature deviations, weather-induced road blockages), and social unrest—into [...] Read more.
Robust credit risk prediction in emerging economies increasingly demands the integration of external factors (EFs) beyond borrowers’ control. This study introduces a scenario-based methodology to incorporate EF—namely COVID-19 severity (mortality and confirmed cases), climate anomalies (temperature deviations, weather-induced road blockages), and social unrest—into machine learning (ML) models for credit delinquency prediction. The approach is grounded in a CRISP-DM framework, combining stationarity testing (Dickey–Fuller), causality analysis (Granger), and post hoc explainability (SHAP, LIME), along with performance evaluation via AUC, ACC, KS, and F1 metrics. The empirical analysis uses nearly 8.2 million records compiled from multiple sources, including 367,000 credit operations granted to individuals and microbusiness owners by a regulated Peruvian financial institution (FMOD) between January 2020 and September 2023. These data also include time series of delinquency by economic activity, external factor indicators (e.g., mortality, climate disruptions, and protest events), and their dynamic interactions assessed through Granger causality to evaluate both the intensity and propagation of external shocks. The results confirm that EF inclusion significantly enhances model performance and robustness. Time-lagged mortality (COVID MOV) emerges as the most powerful single predictor of delinquency, while compound crises (climate and unrest) further intensify default risk—particularly in portfolios without public support. Among the evaluated models, CNN and XGB consistently demonstrate superior adaptability, defined as their ability to maintain strong predictive performance across diverse stress scenarios—including pandemic, climate, and unrest contexts—and to dynamically adjust to varying input distributions and portfolio conditions. Post hoc analyses reveal that EF effects dynamically interact with borrower income, indebtedness, and behavioral traits. This study provides a scalable, explainable framework for integrating systemic shocks into credit risk modeling. The findings contribute to more informed, adaptive, and transparent lending decisions in volatile economic contexts, relevant to financial institutions, regulators, and risk practitioners in emerging markets. Full article
(This article belongs to the Section Information Systems and Data Management)
Show Figures

Figure 1

25 pages, 1443 KB  
Article
Predicting Urban Traffic Congestion with VANET Data
by Wilson Chango, Pamela Buñay, Juan Erazo, Pedro Aguilar, Jaime Sayago, Angel Flores and Geovanny Silva
Computation 2025, 13(4), 92; https://doi.org/10.3390/computation13040092 - 7 Apr 2025
Cited by 7 | Viewed by 3659
Abstract
The purpose of this study lies in developing a comparison of neural network-based models for vehicular congestion prediction, with the aim of improving urban mobility and mitigating the negative effects associated with traffic, such as accidents and congestion. This research focuses on evaluating [...] Read more.
The purpose of this study lies in developing a comparison of neural network-based models for vehicular congestion prediction, with the aim of improving urban mobility and mitigating the negative effects associated with traffic, such as accidents and congestion. This research focuses on evaluating the effectiveness of different neural network architectures, specifically Transformer and LSTM, in order to achieve accurate and reliable predictions of vehicular congestion. To carry out this research, a rigorous methodology was employed that included a systematic literature review based on the PRISMA methodology, which allowed for the identification and synthesis of the most relevant advances in the field. Likewise, the Design Science Research (DSR) methodology was applied to guide the development and validation of the models, and the CRISP-DM (Cross-Industry Standard Process for Data Mining) methodology was used to structure the process, from understanding the problem to implementing the solutions. The dataset used in this study included key variables related to traffic, such as vehicle speed, vehicular flow, and weather conditions. These variables were processed and normalized to train and evaluate various neural network architectures, highlighting LSTM and Transformer networks. The results obtained demonstrated that the LSTM-based model outperformed the Transformer model in the task of congestion prediction. Specifically, the LSTM model achieved an accuracy of 0.9463, with additional metrics such as a loss of 0.21, an accuracy of 0.93, a precision of 0.29, a recall of 0.71, an F1-score of 0.42, an MSE of 0.07, and an RMSE of 0.26. In conclusion, this study demonstrates that the LSTM-based model is highly effective for predicting vehicular congestion, surpassing other architectures such as Transformer. The integration of this model into a simulation environment showed that real-time traffic information can significantly improve urban mobility management. These findings support the utility of neural network architectures in sustainable urban planning and intelligent traffic management, opening new perspectives for future research in this field. Full article
(This article belongs to the Section Computational Engineering)
Show Figures

Figure 1

17 pages, 2687 KB  
Article
Transcriptomic and Metabolomic Analysis of the Uterine Tissue of Yaoshan Chicken and Its Crossbreeds to Reveal the Molecular Mechanism Influencing Eggshell Quality
by Xiaomeng Miao, Jia Liu, Qian Gong, Fugui Li, Yalan Zhang, Qiyue Liang, Diyan Li and Zhonghua Ning
Genes 2025, 16(4), 383; https://doi.org/10.3390/genes16040383 - 27 Mar 2025
Cited by 1 | Viewed by 1647
Abstract
Background/Objectives: Eggshell quality is a critical factor influencing consumer preference and the economic benefits of poultry enterprises, and the uterus is the key site for eggshell synthesis. Yaoshan chicken (YS), an indigenous chicken breed in China, is renowned for its flavorful meat and [...] Read more.
Background/Objectives: Eggshell quality is a critical factor influencing consumer preference and the economic benefits of poultry enterprises, and the uterus is the key site for eggshell synthesis. Yaoshan chicken (YS), an indigenous chicken breed in China, is renowned for its flavorful meat and high-quality eggs. However, its egg production is lower compared to specialized strains. Therefore, the GYR crossbreed was developed by three-line hybridization for YS chicken, which can produce green-shelled eggs with better eggshell thickness and strength than YS chicken (p < 0.01). To explore the molecular mechanisms underlying the differences in eggshell quality between GYR and YS chickens, we conducted an integrated transcriptomic and metabolomic analysis. Methods: Twelve uterus samples (six from GYR and six from YS chickens) were collected during the period of eggshell calcification at 260 days of age. RNA sequencing (RNA-seq) and liquid chromatography–mass spectrometry (LC-MS/MS) were performed to identify differentially expressed genes (DEGs) and differential metabolites (DMs), respectively. Results: A total of 877 DEGs were identified in the GYR group, including 196 upregulated and 681 downregulated genes (|log2 (fold change)| > 1, p-value < 0.05). Additionally, 79 DMs were detected, comprising 50 upregulated and 29 downregulated metabolites (|log₂ (fold change)| > 1, VIP > 1). Notably, the key DEGs (SLCO1B3, SLCO1B1, PTGR1, LGR6, MELTF, CRISP2, GVINP1, and OVSTL), important DMs (prostaglandin-related DMs and biliverdin) and signaling pathways (calcium signaling, neuroactive ligand–receptor interaction, arachidonic acid metabolism, bile secretion, and primary bile acid biosynthesis) were major regulators of the eggshell quality. Furthermore, an integrated transcriptomic and metabolomic analysis revealed two significant gene–metabolite pairs associated with eggshell quality: PTGDS–prostaglandin E2 and PTGS1–prostaglandin E2. Conclusions: This study provides a theoretical foundation for the improved eggshell quality of Yaoshan chicken. Full article
(This article belongs to the Section Animal Genetics and Genomics)
Show Figures

Figure 1

14 pages, 2091 KB  
Data Descriptor
Historical Hourly Information of Four European Wind Farms for Wind Energy Forecasting and Maintenance
by Javier Sánchez-Soriano, Pedro Jose Paniagua-Falo and Carlos Quiterio Gómez Muñoz
Data 2025, 10(3), 38; https://doi.org/10.3390/data10030038 - 19 Mar 2025
Cited by 1 | Viewed by 1537
Abstract
For an electric company, having an accurate forecast of the expected electrical production and maintenance from its wind farms is crucial. This information is essential for operating in various existing markets, such as the Iberian Energy Market Operator—Spanish Hub (OMIE in its Spanish [...] Read more.
For an electric company, having an accurate forecast of the expected electrical production and maintenance from its wind farms is crucial. This information is essential for operating in various existing markets, such as the Iberian Energy Market Operator—Spanish Hub (OMIE in its Spanish acronym), the Portuguese Hub (OMIP in its Spanish acronym), and the Iberian electricity market between the Kingdom of Spain and the Portuguese Republic (MIBEL in its Spanish acronym), among others. The accuracy of these forecasts is vital for estimating the costs and benefits of handling electricity. This article explains the process of creating the complete dataset, which includes the acquisition of the hourly information of four European wind farms as well as a description of the structure and content of the dataset, which amounts to 2 years of hourly information. The wind farms are in three countries: Auvergne-Rhône-Alpes (France), Aragon (Spain), and the Piemonte region (Italy). The dataset was built and validated following the CRISP-DM methodology, ensuring a structured and replicable approach to data processing and preparation. To confirm its reliability, the dataset was tested using a basic predictive model, demonstrating its suitability for wind energy forecasting and maintenance optimization. The dataset presented is available and accessible for improving the forecasting and management of wind farms, especially for the detection of faults and the elaboration of a preventive maintenance plan. Full article
Show Figures

Figure 1

27 pages, 7182 KB  
Article
Detection of Leaf Diseases in Banana Crops Using Deep Learning Techniques
by Nixon Jiménez, Stefany Orellana, Bertha Mazon-Olivo, Wilmer Rivas-Asanza and Iván Ramírez-Morales
AI 2025, 6(3), 61; https://doi.org/10.3390/ai6030061 - 17 Mar 2025
Cited by 11 | Viewed by 6031
Abstract
Leaf diseases, such as Black Sigatoka and Cordana, represent a growing threat to banana crops in Ecuador. These diseases spread rapidly, impacting both leaf and fruit quality. Early detection is crucial for effective control measures. Recently, deep learning has proven to be a [...] Read more.
Leaf diseases, such as Black Sigatoka and Cordana, represent a growing threat to banana crops in Ecuador. These diseases spread rapidly, impacting both leaf and fruit quality. Early detection is crucial for effective control measures. Recently, deep learning has proven to be a powerful tool in agriculture, enabling more accurate analysis and identification of crop diseases. This study applied the CRISP-DM methodology, consisting of six phases: business understanding, data understanding, data preparation, modeling, evaluation, and deployment. A dataset of 900 banana leaf images was collected—300 of Black Sigatoka, 300 of Cordana, and 300 of healthy leaves. Three pre-trained models (EfficientNetB0, ResNet50, and VGG19) were trained on this dataset. To improve performance, data augmentation techniques were applied using TensorFlow Keras’s ImageDataGenerator class, expanding the dataset to 9000 images. Due to the high computational demands of ResNet50 and VGG19, training was performed with EfficientNetB0. The models—EfficientNetB0, ResNet50, and VGG19—demonstrated the ability to identify leaf diseases in bananas, with accuracies of 88.33%, 88.90%, and 87.22%, respectively. The data augmentation increased the performance of EfficientNetB0 to 87.83%, but did not significantly improve its accuracy. These findings highlight the value of deep learning techniques for early disease detection in banana crops, enhancing diagnostic accuracy and efficiency. Full article
(This article belongs to the Special Issue Artificial Intelligence in Agriculture)
Show Figures

Graphical abstract

Back to TopTop