Next Article in Journal
Modified Significance Analysis of Microarrays in Heterogeneous Diseases
Next Article in Special Issue
Deep Learning for Integrated Analysis of Insulin Resistance with Multi-Omics Data
Previous Article in Journal
The Impact of Air Pollution Exposure on the MicroRNA Machinery and Lung Cancer Development
Previous Article in Special Issue
Validation of the Italian Version of the Educational Needs Assessment Tool in Rheumatoid Arthritis Patients and Factors Associated with Educational Needs

Breast Cancer Type Classification Using Machine Learning

Department of Genetics, School of Medicine, Louisiana State University Health Sciences Center, 533 Bolivar, New Orleans, LA 70112, USA
Author to whom correspondence should be addressed.
Academic Editor: Anguraj Sadanandam
J. Pers. Med. 2021, 11(2), 61;
Received: 23 December 2020 / Revised: 12 January 2021 / Accepted: 15 January 2021 / Published: 20 January 2021
(This article belongs to the Special Issue Use of Clinical Decision Support Software within Health Care Systems)
Background: Breast cancer is a heterogeneous disease defined by molecular types and subtypes. Advances in genomic research have enabled use of precision medicine in clinical management of breast cancer. A critical unmet medical need is distinguishing triple negative breast cancer, the most aggressive and lethal form of breast cancer, from non-triple negative breast cancer. Here we propose use of a machine learning (ML) approach for classification of triple negative breast cancer and non-triple negative breast cancer patients using gene expression data. Methods: We performed analysis of RNA-Sequence data from 110 triple negative and 992 non-triple negative breast cancer tumor samples from The Cancer Genome Atlas to select the features (genes) used in the development and validation of the classification models. We evaluated four different classification models including Support Vector Machines, K-nearest neighbor, Naïve Bayes and Decision tree using features selected at different threshold levels to train the models for classifying the two types of breast cancer. For performance evaluation and validation, the proposed methods were applied to independent gene expression datasets. Results: Among the four ML algorithms evaluated, the Support Vector Machine algorithm was able to classify breast cancer more accurately into triple negative and non-triple negative breast cancer and had less misclassification errors than the other three algorithms evaluated. Conclusions: The prediction results show that ML algorithms are efficient and can be used for classification of breast cancer into triple negative and non-triple negative breast cancer types. View Full-Text
Keywords: gene expression; breast cancer; classification; machine learning gene expression; breast cancer; classification; machine learning
Show Figures

Figure 1

MDPI and ACS Style

Wu, J.; Hicks, C. Breast Cancer Type Classification Using Machine Learning. J. Pers. Med. 2021, 11, 61.

AMA Style

Wu J, Hicks C. Breast Cancer Type Classification Using Machine Learning. Journal of Personalized Medicine. 2021; 11(2):61.

Chicago/Turabian Style

Wu, Jiande, and Chindo Hicks. 2021. "Breast Cancer Type Classification Using Machine Learning" Journal of Personalized Medicine 11, no. 2: 61.

Find Other Styles
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

Back to TopTop