MDPI - Publisher of Open Access Journals

21 pages, 4519 KB

Open AccessArticle

The FAM111A Gene: Genetic, Epigenetic, and Pharmacological Targets and Mechanistic Insights with Clinical Relevance

by Kyriaki Hatziagapiou, Feneli Karachaliou, Trias Thireou, Eleni Koniari, Dimitrios Chaniotis, Apostolos Beloukas, Galateia Stathori, Panagiota Kafkaloudi, Bettina Krumbholz, George P. Chrousos and Louis Papageorgiou

Pharmaceuticals 2026, 19(3), 375; https://doi.org/10.3390/ph19030375 - 27 Feb 2026

Viewed by 111

Abstract

Background/Objectives: FAM111A is a trypsin-like serine protease that has emerged as a regulator of DNA replication, and is directly related to genome stability, protein homeostasis, antiviral defense and cancer progression. Pathogenic variants in FAM111A are correlated with genetic syndromes such as Kenny–Caffey [...] Read more.

Background/Objectives: FAM111A is a trypsin-like serine protease that has emerged as a regulator of DNA replication, and is directly related to genome stability, protein homeostasis, antiviral defense and cancer progression. Pathogenic variants in FAM111A are correlated with genetic syndromes such as Kenny–Caffey syndrome type 2 (KCS2) and gracile bone dysplasia/osteocraniostenosis (GCLEB/OCS). This study focuses on the evolutionary, genetic, and structural analysis of FAM111A, in order to identify key regions and candidate pharmacological targets that are related to this enzyme’s function. Methods: The methodology of this in silico study includes separate analyses at the sequence, structural and functional levels. Initially, data mining was carried out using NCBI/Protein (2025), and then data filtering was performed in order to identify representative FAM111A sequences for several species. Sequence analysis was then executed through multiple alignments and phylogenetic analyses. Through this, conserved domains and motifs were identified. For structural analysis, human pathogenic mutations and protein structures were identified through searches in biological databases including PDB and ClinVar, and then all data were analyzed in order to identify candidate pharmacological targets related to FAM111A function. Results: Approximately 1850 FAM111A protein sequences were retrieved for several species, and after filtering processes a dataset of 85 representative sequences was generated. Evolutionary analysis indicates that FAM111A originated in early metazoans, with progressive domain specialization leading to mammal-restricted acquisition of regulatory elements, including the PIP-box PCNA (proliferating cell nuclear antigen) interacting peptide and UBL (ubiquitin-like) domains. The ubiquitin-like/DNA binding domain and catalytic serine protease domain (SPD) are the most conserved, containing seven highly conserved motifs. The structural analysis was based on two protein structures and 34 critical mutations that accumulate in two distinct regions. Finally, by combining the results, six pharmacological targets and 100 inhibitors are proposed. Conclusions: Advancing the structural and function characterization of FAM111A, coupled with pharmacological target identification and evolutionary insights, will be critical to validate this underexplored protease as a therapeutic genetic target in genetic disorders, cancer, and antiviral responses. Full article

(This article belongs to the Section Pharmacology)

► Show Figures

Figure 1

34 pages, 10465 KB

Open AccessArticle

Metallogenic Mechanism of Decratonic Gold Deposit: Geochemical Evidence from Dongbaligou Gold Deposit and Its Ore-Forming Intrusions in Southern Jilin

by Jiuda Sun, Zhongyuan Xu, Xiaofei Yu, Kai Chen and Zhuoyi Wang

Minerals 2026, 16(3), 235; https://doi.org/10.3390/min16030235 - 26 Feb 2026

Viewed by 169

Abstract

This text systematically investigates the Laotudingzi monzogranite (a gold-hosting intrusion) and the Dongbaligou gold ore deposit in the Laoling gold ore belt through comprehensive geochronological, whole-rock geochemical (macroelement and microelement), strontium-neodymium-lead-hafnium isotopic and in situ sulfur-lead isotopic analysis of pyrite, combined with hydrogen-oxygen [...] Read more.

This text systematically investigates the Laotudingzi monzogranite (a gold-hosting intrusion) and the Dongbaligou gold ore deposit in the Laoling gold ore belt through comprehensive geochronological, whole-rock geochemical (macroelement and microelement), strontium-neodymium-lead-hafnium isotopic and in situ sulfur-lead isotopic analysis of pyrite, combined with hydrogen-oxygen isotopic studies of hydrothermal quartz. The results demonstrate a significant Early–Middle Jurassic magmatic-mineralization event in southern Jilin Province (Ji’nan). The gold mine is structurally controlled by detachment fractures within the Laoling metamorphic core complex, which developed in an extended environment. The metallogenic materials are primarily derived from adakitic magma, supporting a “decratonic-type” genetic model. By integrating geochronological, geochemical, and isotopic datasets from the ore-related intrusions and gold deposits, as well as fluid inclusion characteristics, we elucidate the metallogenic mechanism linking Jurassic gold mineralization to subduction-related cratonic destruction. The process involved lower crustal thickening induced by Paleo-Pacific Plate subduction, lithospheric destabilization via gravitational foundering and delamination, and syn-extensional magmatism that sourced ore-forming fluids during cratonic lithosphere thinning. This work establishes a genetic framework connecting plate subduction, lithospheric removal, and gold endowment in convergent margin settings. Full article

(This article belongs to the Section Mineral Geochemistry and Geochronology)

► Show Figures

Figure 1

49 pages, 13968 KB

Open AccessArticle

Application of Machine Learning Methods for Predicting the Factor of Safety in Rock Slopes

by Miguel Trinidad and Moe Momayez

Geotechnics 2026, 6(1), 15; https://doi.org/10.3390/geotechnics6010015 - 3 Feb 2026

Viewed by 252

Abstract

Factor of Safety (FOS) is a significant index to measure the stability condition of a rock slope in mining or civil engineering. In this paper, we evaluate and compare four different machine learning models, Gaussian Process Regressor (GPR), Support Vector Regressor (SVR), Random [...] Read more.

Factor of Safety (FOS) is a significant index to measure the stability condition of a rock slope in mining or civil engineering. In this paper, we evaluate and compare four different machine learning models, Gaussian Process Regressor (GPR), Support Vector Regressor (SVR), Random Forest (RF), and a hybrid genetic algorithm–multi-layer perceptron (GA-MLP), using two separate real-world datasets. The two separate datasets used in this study are from a previously conducted study on highway excavation with rock cutting in China, and another one in a mining site in Peru, with five geotechnical properties used as inputs, including slope height, slope angle, unit weight, cohesion, and friction angle. The two separate datasets were separated into training, validation, and testing datasets. The testing dataset of the models is unseen data used to assess model performance in an unbiased manner. The result shows that the SVR had the highest prediction accuracy, followed by GPR for the mining dataset, and GPR had the highest performance among all the models for the highway excavation dataset. From the boxplot, we can see that SVR, while having the highest predictive accuracy, has a larger variance in prediction compared to GPR for the mining dataset. Full article

► Show Figures

Figure 1

14 pages, 2690 KB

Open AccessArticle

Parameter Inversion of Probability Integral Model Based on GA–BFGS Hybrid Algorithm

by Tan Hao, Duan Jinling, Yang Jingyu, Xu Jia and Zhu Mingfei

Appl. Sci. 2026, 16(3), 1291; https://doi.org/10.3390/app16031291 - 27 Jan 2026

Viewed by 164

Abstract

The probability integral method is the primary technique for predicting mining-induced subsidence in China, and its predictive accuracy strongly depends on the precision of the model parameters. To improve the accuracy and stability of parameter inversion and to overcome the convergence randomness of [...] Read more.

The probability integral method is the primary technique for predicting mining-induced subsidence in China, and its predictive accuracy strongly depends on the precision of the model parameters. To improve the accuracy and stability of parameter inversion and to overcome the convergence randomness of the Genetic Algorithm (GA) in global search, as well as the tendency of the BFGS quasi-Newton method (BFGS) to converge to local optima in non-convex optimization problems, a hybrid GA–BFGS optimization algorithm is proposed for inverting the parameters of the probability integral model. This hybrid approach combines the global exploration capability of GA with the fast local refinement of BFGS, resulting in a more efficient and robust parameter optimization process. Simulation results under ideal conditions without model error demonstrate that the proposed GA–BFGS algorithm outperforms pattern search (PS), GA, and BFGS in terms of inversion accuracy, convergence stability, and robustness to noise and outliers. In engineering applications, the inversion accuracy is reduced compared with simulation experiments, which can be attributed to complex geological conditions and inherent model uncertainties. Therefore, further improvements in subsidence prediction accuracy require not only refined inversion algorithms but also the development of more accurate prediction models that explicitly account for site-specific geological and mining conditions. Full article

► Show Figures

Figure 1

19 pages, 5509 KB

Open AccessArticle

Application of Multi-Sensor Data Fusion and Machine Learning for Early Warning of Cambrian Limestone Water Hazards

by Hang Li, Yijia Li, Wantong Lin, Huaixiang Yang and Kefeng Liu

Sensors 2025, 25(22), 6854; https://doi.org/10.3390/s25226854 - 10 Nov 2025

Viewed by 617

Abstract

The issue of water disasters in the mining floor is extremely severe. Despite significant progress in the on-site monitoring and identification of water inrush channels, research on the spatial development characteristics of cracks and the temporal evolution patterns remains insufficient, resulting in the [...] Read more.

The issue of water disasters in the mining floor is extremely severe. Despite significant progress in the on-site monitoring and identification of water inrush channels, research on the spatial development characteristics of cracks and the temporal evolution patterns remains insufficient, resulting in the incomplete development of microseismic-based water disaster early warning theory and practice. Based on this, the present study first derives the expressions for the diameter and length of water inrush channels according to the damage characteristics of microseismic events and the glazed porcelain shape features of the channels. A theoretical model for the correlation between microseismic-water inrush volume is established, and the relationship between microseismic and water level is revealed. Analysis of field monitoring data further indicates that when high-energy microseismic features (such as single high-energy events and higher daily cumulative energy) are detected, the aquifer water level begins to decline, followed by high water inrush events. Therefore, a decrease in water level accompanied by high-energy microseismic features can serve as an important early warning marker for water disasters. Finally, advanced machine learning methods are applied, in which the optimal index combination for floor water inrush prediction is obtained through the genetic algorithm, and the weights of each index are determined by integrating the analytic hierarchy process with the random forest model. Field engineering verification demonstrates that the integrated early warning system performs significantly better than any single monitoring indicator, and all high-water-inrush events are successfully predicted within four days. Full article

(This article belongs to the Section Intelligent Sensors)

► Show Figures

Figure 1

22 pages, 926 KB

Open AccessReview

Regulatory Mechanisms of Total Soluble Solids in Tomato: From QTL Mapping to Gene Editing

by Minghua Xu, Shujing Ji, Shengqun Pang, Yongen Lu, Shouming Li and Wei Xu

Foods 2025, 14(21), 3692; https://doi.org/10.3390/foods14213692 - 29 Oct 2025

Viewed by 1323

Abstract

Total Soluble Solids (TSS) in tomatoes is a core indicator for evaluating fruit quality and processing characteristics. Its composition mainly consists of soluble sugars (such as fructose and glucose) and organic acids (such as citric acid and malic acid). The contents of sugars [...] Read more.

Total Soluble Solids (TSS) in tomatoes is a core indicator for evaluating fruit quality and processing characteristics. Its composition mainly consists of soluble sugars (such as fructose and glucose) and organic acids (such as citric acid and malic acid). The contents of sugars and acids and their ratio directly affect the flavor and nutritional value. Cultivated tomatoes have a TSS of 4–6%, compared with 10–15% in wild varieties. In recent years, with the advancement of molecular biology and genomics technologies, significant progress has been made in the research on the regulatory mechanisms of tomato fruit TSS and major sugars and acids, including the identification of major quantitative trait locus (QTLs) (Lin5, SlALMT9), functional characterization via CRISPR/Cas9 and elucidation of the transporter network. Breaking the negative correlation between TSS and yield remains a major bottleneck in breeding. Analyzing the mechanism by which environmental factors regulate the TSS and optimizing cultivation measures are crucial for increasing the TSS content in tomatoes. The deep integration of cutting-edge technologies (such as Genome-wide association studies (GWAS), metabolome-wide association studies (mGWAS), Genomic selection (GS), genome editing, and crop modeling) with design breeding is expected to accelerate the development of high-TSS tomato varieties. This paper reviews the current research status from the following four aspects: QTL mapping related to tomato TSS and mining of major genes, metabolic and transport mechanisms of major sugars and acids and key genes, the influence of environmental factors on TSS, and application of genetic improvement strategies and technologies. Full article

(This article belongs to the Special Issue Technologies and Strategies to Improve the Quality and Extend the Shelf-Life of Post-Harvest Fruits and Vegetables)

► Show Figures

Figure 1

32 pages, 5250 KB

Open AccessReview

Artificial Intelligence in Edible Mushroom Cultivation, Breeding, and Classification: A Comprehensive Review

by Muharagi Samwel Jacob, Anran Xu, Keqing Qian, Zhengxiang Qi, Xiao Li and Bo Zhang

J. Fungi 2025, 11(11), 758; https://doi.org/10.3390/jof11110758 - 22 Oct 2025

Cited by 1 | Viewed by 3672

Abstract

Edible mushrooms have gained global popularity due to their nutritional value, medicinal properties, bioactive compounds and industrial applications. Despite their long-standing roles in ecology, nutrition, and traditional medicine, their additional functions in cultivation, breeding, and classification processes are still in their infancy due [...] Read more.

Edible mushrooms have gained global popularity due to their nutritional value, medicinal properties, bioactive compounds and industrial applications. Despite their long-standing roles in ecology, nutrition, and traditional medicine, their additional functions in cultivation, breeding, and classification processes are still in their infancy due to technological constraints. The advent of Artificial Intelligence (AI) technologies has transformed the cultivation process of mushrooms, genetic breeding, and classification methods. However, the analysis of the application of AI in the mushroom production cycle is currently scattered and unorganized. This comprehensive review explores the application of AI technologies in mushroom cultivation, breeding, and classification. Four databases (Scopus, IEEE Xplore, Web of Science, and PubMed) and one search engine (Google Scholar) were used to perform a thorough review of the literature on the utility of AI in various aspects of the mushroom production cycle, including intelligent environmental control, disease detection, yield prediction, germplasm characterization, genotype–phenotype integration, genome editing, gene mining, multi-omics, automatic species identification and grading. In order to fully realize the potential of these edge-cutting AI technologies in transforming mushroom breeding, classification, and cultivation, this review addresses challenges and future perspectives while calling for interdisciplinary approaches and multimodal fusion. Full article

(This article belongs to the Special Issue Edible and Medicinal Macrofungi, 4th Edition)

► Show Figures

Figure 1

22 pages, 8042 KB

Open AccessArticle

WSF: A Transformer-Based Framework for Microphenotyping and Genetic Analyzing of Wheat Stomatal Traits

by Honghao Zhou, Haijiang Min, Shaowei Liang, Bingxi Qin, Qi Sun, Zijun Pei, Qiuxiao Pan, Xiao Wang, Jian Cai, Qin Zhou, Yingxin Zhong, Mei Huang, Dong Jiang, Jiawei Chen and Qing Li

Plants 2025, 14(19), 3016; https://doi.org/10.3390/plants14193016 - 29 Sep 2025

Viewed by 824

Abstract

Stomata on the leaves of wheat serve as important gateways for gas exchange with the external environment. Their morphological characteristics, such as size and density, are closely related to physiological processes like photosynthesis and transpiration. However, due to the limitations of existing analysis [...] Read more.

Stomata on the leaves of wheat serve as important gateways for gas exchange with the external environment. Their morphological characteristics, such as size and density, are closely related to physiological processes like photosynthesis and transpiration. However, due to the limitations of existing analysis methods, the efficiency of analyzing and mining stomatal phenotypes and their associated genes still requires improvement. To enhance the accuracy and efficiency of stomatal phenotype traits analysis and to uncover the related key genes, this study selected 210 wheat varieties. A novel semantic segmentation model based on transformer for wheat stomata, called Wheat Stoma Former (WSF), was proposed. This model enables fully automated and highly efficient stomatal mask extraction and accurately analyzes phenotypic traits such as the length, width, area, and number of stomata on both the adaxial (Ad) and abaxial (Ab) surfaces of wheat leaves based on the mask images. The model evaluation results indicate that coefficients of determination (R²) between the predicted values and the actual measurements for stomatal length, width, area, and number were 0.88, 0.86, 0.81, and 0.93, respectively, demonstrating the model’s high precision and effectiveness in stomatal phenotypic trait analysis. The phenotypic data were combined with sequencing data from the wheat 660 K SNP chip and subjected to a genome-wide association study (GWAS) to analyze the genetic basis of stomatal traits, including length, width, and number, on both adaxial and abaxial surfaces. A total of 36 SNP peak loci significantly associated with stomatal traits were identified. Through candidate gene identification and functional analysis, two genes—TraesCS2B02G178000 (on chromosome 2B, related to stomatal number on the abaxial surface) and TraesCS6A02G290600 (on chromosome 6A, related to stomatal length on the adaxial surface)—were found to be associated with stomatal traits involved in regulating stomatal movement and closure, respectively. In conclusion, our WSF model demonstrates valuable advances in accurate and efficient stomatal phenotyping for locating genes related to stomatal traits in wheat and provides breeders with accurate phenotypic data for the selection and breeding of water-efficient wheat varieties. Full article

(This article belongs to the Special Issue Machine Learning for Plant Phenotyping in Crops)

► Show Figures

Figure 1

24 pages, 18107 KB

Open AccessArticle

Time-Course Transcriptome, Metabolome, and Weighted Gene Co-Expression Network Analysis Reveal the Roles of the OsBELH4A Gene in Regulating Leaf Senescence and Grain Yield of Rice

by Ruyi Zheng, Tianyu Chen, Jianjian Li, Chengcheng Hu, Zhiming Yu, Zhanghui Zeng, Zhehao Chen, Lilin Wang, Taihe Xiang and Xiaoping Huang

Plants 2025, 14(19), 2973; https://doi.org/10.3390/plants14192973 - 25 Sep 2025

Viewed by 1162

Abstract

Rice (Oryza sativa L.) is one of the major food crops. Yield and quality are affected by premature leaf senescence, a complex and tightly regulated developmental process. To elucidate the molecular regulatory mechanism controlling rice leaf senescence, the integrative transcriptome, metabolome and [...] Read more.

Rice (Oryza sativa L.) is one of the major food crops. Yield and quality are affected by premature leaf senescence, a complex and tightly regulated developmental process. To elucidate the molecular regulatory mechanism controlling rice leaf senescence, the integrative transcriptome, metabolome and weighted gene co-expression network analysis (WGCNA) of flag leaves in five development stages (FL1–FL5) was performed. In this study, a total of 9412 differential expressed genes (DEGs) were identified. To further mine DEGs related to leaf senescence, a total of five stage-specific modules were characterized by WGCNA. Among them, two modules displayed continuous down-regulated and up-regulated trends from stages FL1 to FL5, which were considered to be highly negatively and positively correlated with the senescence trait, respectively. GO enrichment results showed that the genes clustered in stage-specific modules were significantly enriched in a vast number of senescence-associated biological processes. Furthermore, large numbers of senescence-related genes were identified, mainly participating in transcription regulation, hormone pathways, degradation of chlorophyll, ROS metabolism, senescence-associated genes (SAGs), and others. Most importantly, a total of 40 hub genes associated with leaf senescence were identified. In addition, the metabolome analysis showed that a total of 309 differential metabolites (DMs) were identified by WGCNA. The integrative transcriptome and metabolome analysis identified a key hub gene OsBELH4A based on the correlation analysis conducted between 40 hub genes and 309 DMs. The results of function validation showed that OsBELH4A overexpression lines displayed delayed leaf senescence, and significantly increased grain number per plant and grain number per panicle. By contrast, its knockout lines displayed premature leaf senescence and reduced grain yield. Exogenous hormone treatment showed that OsBELH4A significantly responded to SA and auxin. These findings provide novel insights into leaf senescence, and further contribute to providing genetic resources for the breeding of crops resistant to premature senescence. Full article

(This article belongs to the Special Issue Crop Yield Improvements Through Genetic and Biological Breeding)

► Show Figures

Graphical abstract

18 pages, 2657 KB

Open AccessArticle

GRE: A Framework for Significant SNP Identification Associated with Wheat Yield Leveraging GWAS–Random Forest Joint Feature Selection and Explainable Machine Learning Genomic Selection Algorithm

by Mei Song, Shanghui Zhang, Shijie Qiu, Ran Qin, Chunhua Zhao, Yongzhen Wu, Han Sun, Guangchen Liu and Fa Cui

Genes 2025, 16(10), 1125; https://doi.org/10.3390/genes16101125 - 24 Sep 2025

Cited by 1 | Viewed by 1348

Abstract

Background: Facing global wheat production pressures such as environmental degradation and reduced cultivated land, breeding innovation is urgent to boost yields. Genomic selection (GS) is a useful wheat breeding technology to make the breeding process more efficient, increasing the genetic gain per [...] Read more.

Background: Facing global wheat production pressures such as environmental degradation and reduced cultivated land, breeding innovation is urgent to boost yields. Genomic selection (GS) is a useful wheat breeding technology to make the breeding process more efficient, increasing the genetic gain per unit time and cost. Precise genomic estimated breeding value (GEBV) via genome-wide markers is usually hampered by high-dimensional genomic data. Methods: To address this, we propose GRE, a framework combining genome-wide association study (GWAS)’s biological significance and random forest (RF)’s prediction efficiency for an explainable machine learning GS model. First, GRE identifies significant SNPs affecting wheat yield traits by comparison of the constructed 24 SNP subsets (intersection/union) selected by leveraging GWAS and RF, to analyze the marker scale’s impact. Furthermore, GRE compares six GS algorithms (GBLUP and five machine learning models), evaluating performance via prediction accuracy (Pearson correlation coefficient, PCC) and error. Additionally, GRE leverages Shapley additive explanations (SHAP) explainable techniques to overcome traditional GS models’ “black box” limitation, enabling cross-scale quantitative analysis and revealing how significant SNPs affect yield traits. Results: Results show that XGBoost and ElasticNet perform best in the union (383 SNPs) of GWAS and RF’s TOP 200 SNPs, with high accuracy (PCC > 0.864) and stability (standard deviation, SD < 0.005), and the significant SNPs identified by XGBoost are precisely explained by their main and interaction effects on wheat yield by SHAP. Conclusions: This study provides tool support for intelligent breeding chip design, important trait gene mining, and GS technology field transformation, aiding global agricultural sustainable productivity. Full article

(This article belongs to the Section Plant Genetics and Genomics)

► Show Figures

Figure 1

28 pages, 886 KB

Open AccessReview

Heavy Metals in Bioenergy Crop Production, Biomass Quality, and Biorefinery: Global Impacts and Sustainable Management Strategies

by Amir Sadeghpour, Moein Javid, Sowmya Koduru, Sirwan Babaei and Eric C. Brevik

Bioresour. Bioprod. 2025, 1(1), 2; https://doi.org/10.3390/bioresourbioprod1010002 - 18 Sep 2025

Viewed by 2011

Abstract

Heavy metals (HMs) including cadmium (Cd), lead (Pb), arsenic (As), zinc (Zn), copper (Cu), chromium (Cr), and nickel (Ni) pose significant challenges to bioenergy crop production due to their persistence, toxicity, and bioaccumulation in soils and plants. This study not only summarizes the [...] Read more.

Heavy metals (HMs) including cadmium (Cd), lead (Pb), arsenic (As), zinc (Zn), copper (Cu), chromium (Cr), and nickel (Ni) pose significant challenges to bioenergy crop production due to their persistence, toxicity, and bioaccumulation in soils and plants. This study not only summarizes the mechanisms of HM absorption, translocation, and accumulation in bioenergy crops, but also critically assesses their impact on crop development, biomass quality, and biorefinery processes. Heavy metals disrupt key physiological processes and modify lignocellulosic composition, which is important for biofuel and biogas production. Global soil contamination from sources like industrial emissions, mining, and agricultural activities exacerbates these problems, posing a threat to both energy security and environmental sustainability. Sustainable management strategies, including phytoremediation, microbial bioremediation, soil amendments, and genetic engineering, are explored to mitigate HM effects while enhancing crop resilience. This review emphasizes the importance of integrating techniques to balance bioenergy production with environmental and human health and safety, including the use of HM-tolerant crop varieties, enhanced biorefinery processes, and robust policy frameworks. Future research should focus on developing scalable remediation technologies and interdisciplinary solutions that align with the United Nations’ Sustainable Development Goals and meet global bioenergy needs. Full article

► Show Figures

Figure 1

18 pages, 797 KB

Open AccessArticle

A Two-Level Rule-Mining Approach to Classify Breast Cancer Patterns Using Adaptive Directed Mutation and Genetic Algorithm

by Hui-Ching Wu and Ming-Hseng Tseng

Eng 2025, 6(7), 154; https://doi.org/10.3390/eng6070154 - 7 Jul 2025

Viewed by 743

Abstract

Breast cancer represents a significant public health concern in both Western countries and Asia. Accurate and early detection is critical to improving long-term patient survival. For physicians to understand the classification and decision rules and to evaluate their results, it is preferable to [...] Read more.

Breast cancer represents a significant public health concern in both Western countries and Asia. Accurate and early detection is critical to improving long-term patient survival. For physicians to understand the classification and decision rules and to evaluate their results, it is preferable to use white box approaches to develop prediction models. This paper proposes a novel classification technique for extracting malignant prediction rules from training datasets containing numerical and binary nominal attributes. The classification technique introduced in this study facilitates the discovery of breast cancer patterns by integrating a real-coded genetic algorithm, an adaptive directed mutation operator, and a two-level malignant-rule-mining process. The experimental results, compared with existing rule-based methods from previous studies, demonstrate that the proposed approach generates simple and interpretable decision rules and effectively identifies patterns that lead to accurate breast cancer classification. Full article

(This article belongs to the Special Issue Advanced Artificial Intelligence Techniques for Disease Prediction, Diagnosis and Management)

► Show Figures

Figure 1

11 pages, 930 KB

Open AccessCommunication

GeneHarmony: A Knowledge-Based Tool for Biomarker Discovery in Disease: Sjögren’s Disease vs. Rheumatoid Arthritis and Systemic Lupus Erythematosus

by Micaela F. Beckman, Adam Alexander, Jean-Luc C. Mougeot and Farah Bahrani Mougeot

Int. J. Mol. Sci. 2025, 26(13), 6379; https://doi.org/10.3390/ijms26136379 - 2 Jul 2025

Viewed by 1169

Abstract

Sjögren’s Disease (SjD), Rheumatoid Arthritis (RA), and Systemic Lupus Erythematosus (SLE) are autoimmune diseases with overlapping genetic features, yet the etiologies of these diseases are poorly understood. Using these rheumatic diseases as an example of proof of concept, our aim was to develop [...] Read more.

Sjögren’s Disease (SjD), Rheumatoid Arthritis (RA), and Systemic Lupus Erythematosus (SLE) are autoimmune diseases with overlapping genetic features, yet the etiologies of these diseases are poorly understood. Using these rheumatic diseases as an example of proof of concept, our aim was to develop a tool that simplifies analysis of gene–disease associations applicable to any disease and to perform comparisons. This tool is meant to provide insights into associated gene symbols and gene expression data to identify candidate biomarkers in common among these diseases. The Diseasesv2.0 and GTExv8 databases were utilized for data collection, providing searchable disease names, affiliated gene symbols, confidence scores (ranging from 0 to 5, with 5 being the most confident), and gene expression across the panel of 54 tissue types present in GTExv8. Data infrastructure was established on a Postgres database using Plotlyv5.17.0 and Streamlitv1.27.2 Python packages. The resulting database was used to investigate the genetic associations among SjD, RA, and SLE, including confidence scores from 2.50 to 5.00. STRINGv12 analysis determined significant pathways (FDR < 0.05). Analysis using our tool revealed the following refined gene associations for each disease: SjD based on ‘Sjogren’ search term (n = 12 genes), RA (n = 231 genes), and SLE (n = 137 genes). We found seven genes in common, namely, CD4, CD8A, IL6, IL17A, TNFS13B, TNF, and TRIM21. With the exception of IL17A, these genes were expressed in tissue types known or suggested to be affected by SjD. STRINGv12 determined significant KEGG pathways involving interleukin signaling, cytokine signaling, and the immune system. We developed a tool that simplifies the data mining process, allowing users to search for diseases of interest and view common gene associations and gene expression. Some of the genes identified through our tool may be further explored to better understand SjD pathogenesis and systemic impact. Full article

(This article belongs to the Special Issue Computational Molecular Biology of Metabolic Pathways and Signal Transduction Pathways)

► Show Figures

Figure 1

22 pages, 2689 KB

Open AccessArticle

Functional and Genetic Insights into the Role of the NR4A1 Gene in the Litter Size of the Shaanbei White Cashmere Goat

by Ebadu Areb, Yutian Bi, Yangyang Bai, Qihui Zhu, Lingyuan Ma, Chuanying Pan, Xiaolei Chen and Xianyong Lan

Animals 2025, 15(12), 1729; https://doi.org/10.3390/ani15121729 - 11 Jun 2025

Cited by 1 | Viewed by 1522

Abstract

Nuclear receptor subfamily 4 group A member 1 (NR4A1) plays a crucial role in regulating various physiological processes. As gene mining for reproductive traits is essential, this study aimed to investigate the mRNA expression, genetic variation, and association of the NR4A1 [...] Read more.

Nuclear receptor subfamily 4 group A member 1 (NR4A1) plays a crucial role in regulating various physiological processes. As gene mining for reproductive traits is essential, this study aimed to investigate the mRNA expression, genetic variation, and association of the NR4A1 gene with goat litter size. We examined the mRNA expression levels of the NR4A1 gene in eight different tissues of female Shaanbei White Cashmere (SBWC) goats (n = 6). Then, a novel 11-bp insertion/deletion (InDel) variant was genotyped in 1136 SBWC goats, 87 SNPs were identified through resequencing (n = 120), and selection signal analysis was undertaken. The NR4A1 gene was expressed in all examined tissues, including the ovary and the oviduct, suggesting its role in goat reproduction. Both the 11-bp InDel and 13 SNP variants showed significant association with litter size. Additionally, four potential transcription factor binding sites were predicted within the insertion allele, which may contribute to increased litter size. Selection signal analysis revealed strong pressure on the NR4A1 gene region in the Cashmere goat population. These findings suggest that NR4A1 is a promising candidate gene for improving litter size in goats and could be utilized as a genetic marker in breeding programs. Full article

(This article belongs to the Topic Advances in Molecular Genetics and Breeding of Cattle, Sheep, and Goats)

► Show Figures

Figure 1

16 pages, 680 KB

Open AccessReview

Revolutionizing Utility of Big Data Analytics in Personalized Cardiovascular Healthcare

by Praneel Sharma, Pratyusha Sharma, Kamal Sharma, Vansh Varma, Vansh Patel, Jeel Sarvaiya, Jonsi Tavethia, Shubh Mehta, Anshul Bhadania, Ishan Patel and Komal Shah

Bioengineering 2025, 12(5), 463; https://doi.org/10.3390/bioengineering12050463 - 27 Apr 2025

Cited by 4 | Viewed by 2184

Abstract

The term “big data analytics (BDA)” defines the computational techniques to study complex datasets that are too large for common data processing software, encompassing techniques such as data mining (DM), machine learning (ML), and predictive analytics (PA) to find patterns, correlations, and insights [...] Read more.

The term “big data analytics (BDA)” defines the computational techniques to study complex datasets that are too large for common data processing software, encompassing techniques such as data mining (DM), machine learning (ML), and predictive analytics (PA) to find patterns, correlations, and insights in massive datasets. Cardiovascular diseases (CVDs) are attributed to a combination of various risk factors, including sedentary lifestyle, obesity, diabetes, dyslipidaemia, and hypertension. We searched PubMed and published research using the Google and Cochrane search engines to evaluate existing models of BDA that have been used for CVD prediction models. We critically analyse the pitfalls and advantages of various BDA models using artificial intelligence (AI), machine learning (ML), and artificial neural networks (ANN). BDA with the integration of wide-ranging data sources, such as genomic, proteomic, and lifestyle data, could help understand the complex biological mechanisms behind CVD, including risk stratification in risk-exposed individuals. Predictive modelling is proposed to help in the development of personalized medicines, particularly in pharmacogenomics; understanding genetic variation might help to guide drug selection and dosing, with the consequent improvement in patient outcomes. To summarize, incorporating BDA into cardiovascular research and treatment represents a paradigm shift in our approach to CVD prevention, diagnosis, and management. By leveraging the power of big data, researchers and clinicians can gain deeper insights into disease mechanisms, improve patient care, and ultimately reduce the burden of cardiovascular disease on individuals and healthcare systems. Full article

(This article belongs to the Special Issue Smart Applications and Technology for Cardiovascular Disease Management)

► Show Figures

Figure 1

Search Results (134)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (134)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI