Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (8)

Search Parameters:
Keywords = manual vs. machine scoring

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
17 pages, 6213 KB  
Article
Preoperative Prediction of Axillary Lymph Node Metastasis in Breast Cancer Using Radiomics Features of Voxel-Wise DCE-MRI Time-Intensity-Curve Profile Maps
by Ya Ren, Kexin Chen, Meng Wang, Jie Wen, Sha Feng, Honghong Luo, Cuiju He, Yuan Guo, Dehong Luo, Xin Liu, Dong Liang, Hairong Zheng, Na Zhang and Zhou Liu
Biomedicines 2025, 13(10), 2562; https://doi.org/10.3390/biomedicines13102562 - 21 Oct 2025
Viewed by 923
Abstract
Objective: Axillary lymph node (ALN) status in breast cancer is pivotal for guiding treatment and determining prognosis. The study aimed to explore the feasibility and efficacy of a radiomics model using voxel-wise dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) time-intensity-curve (TIC) profile maps [...] Read more.
Objective: Axillary lymph node (ALN) status in breast cancer is pivotal for guiding treatment and determining prognosis. The study aimed to explore the feasibility and efficacy of a radiomics model using voxel-wise dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) time-intensity-curve (TIC) profile maps to predict ALN metastasis in breast cancer. Methods: A total of 615 breast cancer patients who underwent preoperative DCE-MRI from October 2018 to February 2024 were retrospectively enrolled and randomly allocated into training (n = 430) and testing (n = 185) sets (7:3 ratio). Based on wash-in rate, wash-out enhancement, and wash-out stability, each voxel within manually segmented 3D lesions that were categorized into 1 of 19 TIC subtypes from the DCE-MRI images. Three feature sets were derived: composition ratio (type-19), radiomics features of TIC subtypes (type-19-radiomics), and radiomics features of third-phase DCE-MRI (phase-3-radiomics). Student’s t-test and the least absolute shrinkage and selection operator (LASSO) was used to select features. Four models (type-19, type-19-radiomics, type-19-combined, and phase-3-radiomics) were constructed by a support vector machine (SVM) to predict ALN status. Model performance was assessed using sensitivity, specificity, accuracy, F1 score, and area under the curve (AUC). Results: The type-19-combined model significantly outperformed the phase-3-radiomics model (AUC = 0.779 vs. 0.698, p < 0.001; 0.674 vs. 0.559) and the type-19 model (AUC = 0.779 vs. 0.541, p < 0.001; 0.674 vs. 0.435, p < 0.001) in cross-validation and independent testing sets. The type-19-radiomics showed significantly better performance than the phase-3-radiomics model (AUC = 0.764 vs. 0.698, p = 0.002; 0.657 vs. 0.559, p = 0.037) and type-19 model (AUC = 0. 764 vs. 0.541, p < 0.001; 0.657 vs. 0.435, p < 0.001) in cross-validation and independent testing sets. Among four models, the type-19-combined model achieved the highest AUC (0.779, 0.674) in cross-validation and testing sets. Conclusions: Radiomics analysis of voxel-wise DCE-MRI TIC profile maps, simultaneously quantifying temporal and spatial hemodynamic heterogeneity, provides an effective, noninvasive method for predicting ALN metastasis in breast cancer. Full article
(This article belongs to the Special Issue Breast Cancer Research: Charting Future Directions)
Show Figures

Figure 1

21 pages, 2336 KB  
Article
Machine and Deep Learning on Radiomic Features from Contrast-Enhanced Mammography and Dynamic Contrast-Enhanced Magnetic Resonance Imaging for Breast Cancer Characterization
by Roberta Fusco, Vincenza Granata, Teresa Petrosino, Paolo Vallone, Maria Assunta Daniela Iasevoli, Mauro Mattace Raso, Sergio Venanzio Setola, Davide Pupo, Gerardo Ferrara, Annarita Fanizzi, Raffaella Massafra, Miria Lafranceschina, Daniele La Forgia, Laura Greco, Francesca Romana Ferranti, Valeria De Soccio, Antonello Vidiri, Francesca Botta, Valeria Dominelli, Enrico Cassano, Charlotte Marguerite Lucille Trombadori, Paolo Belli, Giovanna Trecate, Chiara Tenconi, Maria Carmen De Santis, Luca Boldrini and Antonella Petrilloadd Show full author list remove Hide full author list
Bioengineering 2025, 12(9), 952; https://doi.org/10.3390/bioengineering12090952 - 2 Sep 2025
Cited by 1 | Viewed by 1896
Abstract
Objective: The aim of this study was to evaluate the accuracy of machine and deep learning approaches on radiomics features obtained by Dynamic Contrast Enhanced Magnetic Resonance Imaging (DCE-MRI) and contrast enhanced mammography (CEM) in the characterization of breast cancer and in the [...] Read more.
Objective: The aim of this study was to evaluate the accuracy of machine and deep learning approaches on radiomics features obtained by Dynamic Contrast Enhanced Magnetic Resonance Imaging (DCE-MRI) and contrast enhanced mammography (CEM) in the characterization of breast cancer and in the prediction of the tumor molecular profile. Methods: A total of 153 patients with malignant and benign lesions were analyzed and underwent MRI examinations. Considering the histological findings as the ground truth, three different types of findings were used in the analysis: (1) benign versus malignant lesions; (2) G1 + G2 vs. G3 classification; (3) the presence of human epidermal growth factor receptor 2 (HER2+ vs. HER2−). Radiomic features (n = 851) were extracted from manually segmented regions of interest using the PyRadiomics platform, following IBSI-compliant protocols. Highly correlated features were excluded, and the remaining features were standardized using z-score normalization. A feature selection process based on Elastic Net regularization (α = 0.5) was used to reduce dimensionality. Synthetic balancing of the training data was applied using the ROSE method to address class imbalance. Model performance was evaluated using repeated 10-fold cross-validation and AUC-based metrics. Results: Among the 153 patients enrolled in the studies, 113 were malignant lesions. Among the 113 malignant lesions, 32 had high grading (G3) and 66 had the HER2+ receptor. Radiomic features derived from both CEM and DCE-MRI showed strong discriminative performance for malignancy detection, with several features achieving AUCs above 0.80. Gradient Boosting Machine (GBM) achieved the highest accuracy (0.911) and AUC (0.907) in differentiating benign from malignant lesions. For tumor grading, the neural network model attained the best accuracy (0.848), while LASSO yielded the highest sensitivity (0.667) for detecting high-grade tumors. In predicting HER2+ status, the neural network also performed best (AUC = 0.669), with a sensitivity of 0.842. Conclusions: Radiomics-based machine learning models applied to multiparametric CEM and DCE-MRI images offer promising, non-invasive tools for breast cancer characterization. The models effectively distinguished benign from malignant lesions and showed potential in predicting histological grade and HER2 status. These results demonstrate that radiomic features extracted from CEM and DCE-MRI, when analyzed through machine and deep learning models, can support accurate breast cancer characterization. Such models may assist clinicians in early diagnosis, histological grading, and biomarker assessment, potentially enhancing personalized treatment planning and non-invasive decision-making in routine practice. Full article
Show Figures

Figure 1

24 pages, 3720 KB  
Article
A Comparative Study of the Accuracy and Readability of Responses from Four Generative AI Models to COVID-19-Related Questions
by Zongjing Liang, Yun Kuang, Xiaobo Liang, Gongcheng Liang and Zhijie Li
COVID 2025, 5(7), 99; https://doi.org/10.3390/covid5070099 - 30 Jun 2025
Viewed by 1569
Abstract
The purpose of this study is to compare the accuracy and readability of Coronavirus Disease 2019 (COVID-19)-prevention and control knowledge texts generated by four current generative artificial intelligence (AI) models—two international models (ChatGPT and Gemini) and two domestic models (Kimi and Ernie Bot)—and [...] Read more.
The purpose of this study is to compare the accuracy and readability of Coronavirus Disease 2019 (COVID-19)-prevention and control knowledge texts generated by four current generative artificial intelligence (AI) models—two international models (ChatGPT and Gemini) and two domestic models (Kimi and Ernie Bot)—and to evaluate the other performance characteristics of texts generated by domestic and international models. This paper uses the questions and answers in the COVID-19 prevention guidelines issued by the U.S. Centers for Disease Control and Prevention (CDC) as the evaluation criteria. The accuracy, readability, and comprehensibility of the texts generated by each model are scored against the CDC standards. Then the neural network model in the intelligent algorithms is used to identify the factors that affect readability. Then the medical topics of the generated text are analyzed using text analysis technology. Finally, a questionnaire-based manual scoring approach was used to evaluate the AI-generated texts, which was then compared to automated machine scoring. Accuracy: domestic models have higher textual accuracy, while international models have higher reliability. Readability: domestic models produced more fluent and publicly accessible language; international models generated more standardized and formally structured texts with greater consistency. Comprehensibility: domestic models offered superior readability, while international models were more stable in output. Readability factors: the average words per sentence (AWPS) emerged as the most significant factor influencing readability across all models. Topic analysis: ChatGPT emphasized epidemiological knowledge; Gemini focused on general medical and health topics; Kimi provided more multidisciplinary content; and Ernie Bot concentrated on clinical medicine. From the empirical results, it can be found that the manual and machine scoring are highly consistent in the indicators SimHash and FKGL, which proves the effectiveness of the evaluation method proposed in this paper. Conclusion: Texts generated by domestic models are more accessible and better suited for public education, clinical communication, and health consultations. In contrast, the international model has a higher accuracy in generating expertise, especially in epidemiological studies and assessing knowledge literature on disease severity. The inclusion of manual evaluations confirms the reliability of the proposed assessment framework. It is therefore recommended that future AI-generated knowledge systems for infectious disease control balance professional rigor with public comprehensibility, in order to provide reliable and accessible reference materials during major infectious disease outbreaks. Full article
(This article belongs to the Section COVID Public Health and Epidemiology)
Show Figures

Figure 1

15 pages, 1100 KB  
Article
18F-FDG PET/CT Radiomics for Predicting Therapy Response in Primary Mediastinal B-Cell Lymphoma: A Bi-Centric Pilot Study
by Fabiana Esposito, Luigi Manco, Luca Urso, Sara Adamantiadis, Giovanni Scribano, Lucrezia De Marchi, Adriano Venditti, Massimiliano Postorino, Nicoletta Urbano, Roberta Gafà, Antonio Cuneo, Agostino Chiaravalloti, Mirco Bartolomei and Luca Filippi
Cancers 2025, 17(11), 1827; https://doi.org/10.3390/cancers17111827 - 30 May 2025
Cited by 2 | Viewed by 2652
Abstract
Purpose: This bi-centric pilot study investigates the predictive value of pre-treatment [18F]FDG PET/CT radiomics for assessing therapy response in primary mediastinal B-cell lymphoma (PMBCL). Methods: All PMBCL patients underwent PET/CT with [18F]FDG between January 2011 and January 2022 at [...] Read more.
Purpose: This bi-centric pilot study investigates the predictive value of pre-treatment [18F]FDG PET/CT radiomics for assessing therapy response in primary mediastinal B-cell lymphoma (PMBCL). Methods: All PMBCL patients underwent PET/CT with [18F]FDG between January 2011 and January 2022 at Policlinico Tor Vergata University Hospital of Rome (70% training and 30% internal validation cohort) and Sant’Anna University Hospital of Ferrara (external validation cohort). The Deauville score (DS) was used as a predictor of therapy response (DS1-DS3 vs. DS4/DS5). A total of 121 quantitative radiomics features (RFts) were extracted from manually segmented volumes of interest (VOIs) in PET and CT images, according to IBSI. ComBat harmonization was applied to correct the center variability of features, followed by class balancing with SMOTE. Two machine learning (ML) prediction models, the PET model and the CT model, were independently developed using robust RFts. For each ML model, two different algorithms were trained (i.e., Random Forest, RF, and Support Vector Machine, SVM) using 10-fold cross validation, tested on the internal/external validation set. Receiver operating characteristic (ROC) curves, area under the curve (AUC), classification accuracy (CA), precision (Prec), sensitivity (Sen), specificity (Spec), true positive (TP) scores, and true negative (TN) scores were computed. Results: The entire dataset was composed of 29 samples for the Rome cohort (23 from D1–D3 and 6 from D4/D5) and 9 samples for the Ferrara cohort (4 from D1–D3 and 5 from D4/D5). A total of 27 RFts were identified as robust for each imaging modality. Both the CT and PET models effectively predicted the Deauville score. The performance metrics of the best classifier (SVM) for the CT and PET models in external validation were AUC = 0.75/0.80, CA = 0.85/0.77, Prec = 0.97/0.67, Sen = 0.60/0.80, Spec = 0.98/0.75, TP = 75.0%/66.7%, and TN = 77.8%/85.7%, respectively. Conclusions: ML models trained on [18F]FDG PET/CT radiomic features in PMBLC patients could predict the Deauville score. Full article
(This article belongs to the Special Issue Radiomics in Cancer Imaging: Theory and Applications in Solid Tumours)
Show Figures

Figure 1

25 pages, 7331 KB  
Article
A Deep Learning Approach for Brain Tumor Firmness Detection Based on Five Different YOLO Versions: YOLOv3–YOLOv7
by Norah Fahd Alhussainan, Belgacem Ben Youssef and Mohamed Maher Ben Ismail
Computation 2024, 12(3), 44; https://doi.org/10.3390/computation12030044 - 1 Mar 2024
Cited by 27 | Viewed by 7223
Abstract
Brain tumor diagnosis traditionally relies on the manual examination of magnetic resonance images (MRIs), a process that is prone to human error and is also time consuming. Recent advancements leverage machine learning models to categorize tumors, such as distinguishing between “malignant” and “benign” [...] Read more.
Brain tumor diagnosis traditionally relies on the manual examination of magnetic resonance images (MRIs), a process that is prone to human error and is also time consuming. Recent advancements leverage machine learning models to categorize tumors, such as distinguishing between “malignant” and “benign” classes. This study focuses on the supervised machine learning task of classifying “firm” and “soft” meningiomas, critical for determining optimal brain tumor treatment. The research aims to enhance meningioma firmness detection using state-of-the-art deep learning architectures. The study employs a YOLO architecture adapted for meningioma classification (Firm vs. Soft). This YOLO-based model serves as a machine learning component within a proposed CAD system. To improve model generalization and combat overfitting, transfer learning and data augmentation techniques are explored. Intra-model analysis is conducted for each of the five YOLO versions, optimizing parameters such as the optimizer, batch size, and learning rate based on sensitivity and training time. YOLOv3, YOLOv4, and YOLOv7 demonstrate exceptional sensitivity, reaching 100%. Comparative analysis against state-of-the-art models highlights their superiority. YOLOv7, utilizing the SGD optimizer, a batch size of 64, and a learning rate of 0.01, achieves outstanding overall performance with metrics including mean average precision (99.96%), precision (98.50%), specificity (97.95%), balanced accuracy (98.97%), and F1-score (99.24%). This research showcases the effectiveness of YOLO architectures in meningioma firmness detection, with YOLOv7 emerging as the optimal model. The study’s findings underscore the significance of model selection and parameter optimization for achieving high sensitivity and robust overall performance in brain tumor classification. Full article
(This article belongs to the Special Issue Deep Learning Applications in Medical Imaging)
Show Figures

Figure 1

13 pages, 6517 KB  
Article
Machine Learning for Digital Scoring of PRMT6 in Immunohistochemical Labeled Lung Cancer
by Abeer M. Mahmoud, Eileen Brister, Odile David, Klara Valyi-Nagy, Maria Sverdlov, Peter H. Gann and Sage J. Kim
Cancers 2023, 15(18), 4582; https://doi.org/10.3390/cancers15184582 - 15 Sep 2023
Cited by 3 | Viewed by 2572
Abstract
Lung cancer is the leading cause of cancer death in the U.S. Therefore, it is imperative to identify novel biomarkers for the early detection and progression of lung cancer. PRMT6 is associated with poor lung cancer prognosis. However, analyzing PRMT6 expression manually in [...] Read more.
Lung cancer is the leading cause of cancer death in the U.S. Therefore, it is imperative to identify novel biomarkers for the early detection and progression of lung cancer. PRMT6 is associated with poor lung cancer prognosis. However, analyzing PRMT6 expression manually in large samples is time-consuming posing a significant limitation for processing this biomarker. To overcome this issue, we trained and validated an automated method for scoring PRMT6 in lung cancer tissues, which can then be used as the standard method in future larger cohorts to explore population-level associations between PRMT6 expression and sociodemographic/clinicopathologic characteristics. We evaluated the ability of a trained artificial intelligence (AI) algorithm to reproduce the PRMT6 immunoreactive scores obtained by pathologists. Our findings showed that tissue segmentation to cancer vs. non-cancer tissues was the most critical parameter, which required training and adjustment of the algorithm to prevent scoring non-cancer tissues or ignoring relevant cancer cells. The trained algorithm showed a high concordance with pathologists with a correlation coefficient of 0.88. The inter-rater agreement was significant, with an intraclass correlation of 0.95 and a scale reliability coefficient of 0.96. In conclusion, we successfully optimized a machine learning algorithm for scoring PRMT6 expression in lung cancer that matches the degree of accuracy of scoring by pathologists. Full article
(This article belongs to the Special Issue Digital Pathology: Basics, Clinical Applications and Future Trends)
Show Figures

Figure 1

21 pages, 1886 KB  
Article
Phybrata Sensors and Machine Learning for Enhanced Neurophysiological Diagnosis and Treatment
by Alex J. Hope, Utkarsh Vashisth, Matthew J. Parker, Andreas B. Ralston, Joshua M. Roper and John D. Ralston
Sensors 2021, 21(21), 7417; https://doi.org/10.3390/s21217417 - 8 Nov 2021
Cited by 7 | Viewed by 4992
Abstract
Concussion injuries remain a significant public health challenge. A significant unmet clinical need remains for tools that allow related physiological impairments and longer-term health risks to be identified earlier, better quantified, and more easily monitored over time. We address this challenge by combining [...] Read more.
Concussion injuries remain a significant public health challenge. A significant unmet clinical need remains for tools that allow related physiological impairments and longer-term health risks to be identified earlier, better quantified, and more easily monitored over time. We address this challenge by combining a head-mounted wearable inertial motion unit (IMU)-based physiological vibration acceleration (“phybrata”) sensor and several candidate machine learning (ML) models. The performance of this solution is assessed for both binary classification of concussion patients and multiclass predictions of specific concussion-related neurophysiological impairments. Results are compared with previously reported approaches to ML-based concussion diagnostics. Using phybrata data from a previously reported concussion study population, four different machine learning models (Support Vector Machine, Random Forest Classifier, Extreme Gradient Boost, and Convolutional Neural Network) are first investigated for binary classification of the test population as healthy vs. concussion (Use Case 1). Results are compared for two different data preprocessing pipelines, Time-Series Averaging (TSA) and Non-Time-Series Feature Extraction (NTS). Next, the three best-performing NTS models are compared in terms of their multiclass prediction performance for specific concussion-related impairments: vestibular, neurological, both (Use Case 2). For Use Case 1, the NTS model approach outperformed the TSA approach, with the two best algorithms achieving an F1 score of 0.94. For Use Case 2, the NTS Random Forest model achieved the best performance in the testing set, with an F1 score of 0.90, and identified a wider range of relevant phybrata signal features that contributed to impairment classification compared with manual feature inspection and statistical data analysis. The overall classification performance achieved in the present work exceeds previously reported approaches to ML-based concussion diagnostics using other data sources and ML models. This study also demonstrates the first combination of a wearable IMU-based sensor and ML model that enables both binary classification of concussion patients and multiclass predictions of specific concussion-related neurophysiological impairments. Full article
(This article belongs to the Special Issue Artificial Intelligence and Internet of Things in Health Applications)
Show Figures

Figure 1

16 pages, 3612 KB  
Article
Comparison of Multiplexed Immunofluorescence Imaging to Chromogenic Immunohistochemistry of Skin Biomarkers in Response to Monkeypox Virus Infection
by Anup Sood, Yunxia Sui, Elizabeth McDonough, Alberto Santamaría-Pang, Yousef Al-Kofahi, Zhengyu Pang, Peter B. Jahrling, Jens H. Kuhn and Fiona Ginty
Viruses 2020, 12(8), 787; https://doi.org/10.3390/v12080787 - 23 Jul 2020
Cited by 32 | Viewed by 6917
Abstract
Over the last 15 years, advances in immunofluorescence-imaging based cycling methods, antibody conjugation methods, and automated image processing have facilitated the development of a high-resolution, multiplexed tissue immunofluorescence (MxIF) method with single cell-level quantitation termed Cell DIVETM. Originally developed for fixed [...] Read more.
Over the last 15 years, advances in immunofluorescence-imaging based cycling methods, antibody conjugation methods, and automated image processing have facilitated the development of a high-resolution, multiplexed tissue immunofluorescence (MxIF) method with single cell-level quantitation termed Cell DIVETM. Originally developed for fixed oncology samples, here it was evaluated in highly fixed (up to 30 days), archived monkeypox virus-induced inflammatory skin lesions from a retrospective study in 11 rhesus monkeys to determine whether MxIF was comparable to manual H-scoring of chromogenic stains. Six protein markers related to immune and cellular response (CD68, CD3, Hsp70, Hsp90, ERK1/2, ERK1/2 pT202_pY204) were manually quantified (H-scores) by a pathologist from chromogenic IHC double stains on serial sections and compared to MxIF automated single cell quantification of the same markers that were multiplexed on a single tissue section. Overall, there was directional consistency between the H-score and the MxIF results for all markers except phosphorylated ERK1/2 (ERK1/2 pT202_pY204), which showed a decrease in the lesion compared to the adjacent non-lesioned skin by MxIF vs an increase via H-score. Improvements to automated segmentation using machine learning and adding additional cell markers for cell viability are future options for improvement. This method could be useful in infectious disease research as it conserves tissue, provides marker colocalization data on thousands of cells, allowing further cell level data mining as well as a reduction in user bias. Full article
Show Figures

Figure 1

Back to TopTop