Article

A Histone Acetylation Modulator Gene Signature for Classification and Prognosis of Breast Cancer

1 Department of Pathology, Peking University Cancer Hospital, Beijing 100142, China
2 Department of Breast Surgery, Peking University People’s Hospital, Beijing 100044, China
* Author to whom correspondence should be addressed.
Curr. Oncol. 2021, 28(1), 928-939; https://doi.org/10.3390/curroncol28010091
Submission received: 19 January 2021 / Revised: 7 February 2021 / Accepted: 12 February 2021 / Published: 17 February 2021

Abstract
Regulators of histone acetylation are promising epigenetic targets for therapy in breast cancer. In this study, we comprehensively analyzed the expression of histone acetylation modulator genes in breast cancer using TCGA data. A gene signature composed of eight histone acetylation modulators (HAMs) was found to be effective for the classification and prognosis of breast cancers, especially in the HER2-enriched and basal-like molecular subtypes. The eight genes consist of two histone acetylation writers (GTF3C4 and CLOCK), two erasers (HDAC2 and SIRT7) and four readers (BRD4, BRD7, SP100, and BRWD3). Both writer and eraser genes were differentially expressed between the two groups defined by the signature, suggesting a close relationship between the overall histone acetylation level and prognosis in HER2-enriched and basal-like breast cancer.

1. Introduction

Breast cancer overtook lung cancer as the most commonly diagnosed cancer globally in 2020 [1]. It is widely accepted as a highly heterogeneous disease. The current approach to classifying breast cancer into clinical subtypes is based on immunohistochemistry (IHC) results for estrogen receptor, progesterone receptor, human epidermal growth factor receptor 2 (HER2), and the proliferation marker Ki67. However, this IHC-based clinical subtyping system is not ideal, and gene expression profiling (intrinsic subtyping) captures the heterogeneity of the disease more fully [2]. In 2000, Perou et al. described the intrinsic subtypes of breast cancer based on gene expression patterns [3], which later formed the basis of the 50-gene PAM50 classifier. Since then, various other tests based on gene expression quantification have been developed to provide molecular stratification of breast cancer [4].
Previous pan-cancer studies have revealed that, besides genomic alterations, epigenetic changes are common and play an essential role in various cancer types, including breast cancer [5,6]. Epigenetic changes generally involve DNA methylation and histone modification, both of which can regulate chromatin structure. DNA methylation has been comprehensively analyzed in breast cancer together with transcriptomic data, and two characteristic DNA methylation features, named Epi-LumB and Epi-Basal, have been identified as indicators of poor prognosis [7]. However, studies of histone modification, particularly histone acetylation, are limited in breast cancer. Histone acetylation is a dynamic and reversible process controlled by histone acetylation modulator (HAM) genes, which comprise writers, erasers and readers. The acetyl group is added to lysine residues of either H3 or H4 by histone acetyltransferases (HATs), the writers, and can be removed by specific histone deacetylases (HDACs), the erasers. Additionally, proteins called histone acetylation readers recognize acetylated histones and recruit transcription machinery [8]. The bromodomain and extra-terminal domain (BET) family is one group of readers that binds specifically to acetylated H3/H4 and recruits downstream effectors to activate transcription [9]. Histone acetylation is closely related to transcription activation, either by directly loosening nucleosome structure so that RNA polymerase can pass through more easily or by acting as a marker to recruit transcription machinery [10].
Previous studies found that the overall H4 acetylation level was reduced in breast cancer tissue compared with paired normal breast epithelium [11] and that a high histone acetylation level was detected in luminal-type breast cancer and associated with a good prognosis [12]. Moreover, both HDAC inhibitors and BET inhibitors have shown promising therapeutic efficacy in breast cancer patients [13,14]. Histone acetylation modulators often work in a network in which one modulator can modify multiple lysine sites and each lysine site can be modified by multiple modulators [15]. The balance between histone acetylation writers and erasers also contributes to the overall acetylation level. However, most current studies focus either on a single acetylation site or on the acetylation status of a specific target gene, and studies involving a whole set of HAM genes in breast cancer are limited. In this study, we comprehensively analyzed the expression of 73 histone acetylation modulator genes in breast cancer using TCGA data. A gene signature composed of eight HAMs was found to be effective for the classification and prognosis prediction of breast cancer.

2. Results

2.1. A Histone Acetylation Modulator Gene Signature Could Classify Breast Cancers into Two Groups

A total of 73 histone acetylation modulator genes were studied, following the previous literature [16]. Of these 73 genes, eight were found to be related to breast cancer prognosis during feature selection. Feature selection was based on the Cox regression model, using the function “FSbyCOX” in the package CancerSubtypes [17]. Survival analyses were then performed for each of the eight genes separately by comparing the overall survival of the high-expression and low-expression groups. The expression of all eight genes was correlated with overall survival, although in two opposite directions. Specifically, high expression of BRD4, SIRT7 and SP100 was correlated with a good prognosis, while high expression of the other five genes indicated a poor prognosis (Figure 1). The eight genes comprise two histone acetylation writers (GTF3C4 and CLOCK), two erasers (HDAC2 and SIRT7) and four readers (BRD4, BRD7, SP100, and BRWD3). A panel composed of the eight genes, referred to as the HAM signature, was used to further classify breast cancer. Transcriptomic data and clinical follow-up of the 1102 breast cancer patients from the TCGA breast cancer cohort were used for the classification. Non-negative matrix factorization (NMF) clustering was used to classify the patients into two groups, named HAM1 and HAM2 (Figure 2). NMF is an efficient unsupervised machine-learning algorithm for identifying distinct molecular patterns and molecular classes in high-throughput data. The classification result for each TCGA sample is listed in Table S1. Cluster validity was represented by the average silhouette width, where a higher average silhouette width indicates higher sample tightness and better cluster separation (Figure 3).
The average silhouette width for the clustering reached 0.67, with silhouette widths for HAM1 and HAM2 of 0.72 and 0.64, respectively. The mRNA expression levels of the eight genes in normal breast tissue and in breast cancers of the four intrinsic subtypes are shown in Figure S1. The eight genes showed various expression patterns, which suggests that the HAM groups could be very different from the PAM50 intrinsic subtypes.
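To make the silhouette width concrete, the following is a minimal sketch in Python, not the authors' R workflow. It assumes Euclidean distances on hypothetical two-dimensional toy points, whereas the study derived silhouette widths from the NMF consensus membership matrix. For each sample, a is the mean distance to the other members of its own cluster, b is the mean distance to the nearest other cluster, and the width is (b − a) / max(a, b). The function name, the toy points and the assumption of at least two clusters are illustrative only.

```python
from math import dist

def silhouette_widths(points, labels):
    """Return the silhouette width (b - a) / max(a, b) for every point.

    Assumes at least two clusters are present in `labels`.
    """
    widths = []
    for i, (p, own) in enumerate(zip(points, labels)):
        # Collect distances from point i to every other point, grouped by cluster.
        by_cluster = {}
        for j, (q, lab) in enumerate(zip(points, labels)):
            if j != i:
                by_cluster.setdefault(lab, []).append(dist(p, q))
        if own not in by_cluster:
            # A cluster with a single member gets width 0 by convention.
            widths.append(0.0)
            continue
        # a: mean distance to the other members of the same cluster.
        a = sum(by_cluster[own]) / len(by_cluster[own])
        # b: mean distance to the closest of the other clusters.
        b = min(sum(d) / len(d) for lab, d in by_cluster.items() if lab != own)
        widths.append((b - a) / max(a, b))
    return widths

# Hypothetical toy data: two tight, well-separated groups.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
          (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
labels = ["HAM1", "HAM1", "HAM1", "HAM2", "HAM2", "HAM2"]

widths = silhouette_widths(points, labels)
average_width = sum(widths) / len(widths)
print(f"average silhouette width: {average_width:.2f}")
```

Widths close to 1 indicate samples that sit well inside their cluster, values near 0 indicate samples on a boundary, and negative values suggest a likely misassignment.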

2.2. Classification Using Histone Acetylation Modulator Genes Distinguished Two Prognosis Groups in HER2-Enriched and Basal-Like Intrinsic Subtypes

To further characterize the HAM1 and HAM2 groups, the distribution of the four intrinsic subtypes in the two groups was calculated (Table 1). Compared with HAM2, the HAM1 group was more enriched in basal-like and HER2-enriched subtypes. To assess the prognostic value of the clustering, survival analysis was conducted comparing overall survival between the HAM1 and HAM2 groups. The HAM1 group had a better prognosis than the HAM2 group, indicating clinical significance and prognostic value for this classification (Figure 4). Further analysis revealed that the HER2-enriched and basal-like subtypes accounted for the survival difference between HAM1 and HAM2, while the Luminal A and B subtypes of the two groups showed no or only minor survival differences (Figure 5). These results indicate that, in the HER2-enriched and basal-like molecular subtypes, differential expression of the HAM signature is associated with a different prognosis. Although it remains unclear whether there is a causal relationship between expression of the HAM signature and survival, the results suggest that the HAM classification can be used as a further stratification of the PAM50 subtypes. Additionally, differentially expressed genes (DEGs) between the two groups were analyzed and are listed in Table S2. The volcano plot also showed that the HAM1 and HAM2 groups have different gene expression patterns (Figure S2).

2.3. The Eight Featured Genes Belonged to Two Basis Components with a Different Expression Pattern

Basis component analysis of the NMF clustering revealed two basis components among the eight featured genes, with CLOCK, GTF3C4 and BRWD3 forming Basis 1 and the other five genes forming Basis 2 (Figure 6 and Figure 7). The expression levels of the genes in Basis 1 (CLOCK, GTF3C4 and BRWD3) were highly positively correlated with each other, while correlations among the genes in Basis 2 were slightly weaker (Figure 6). Notably, both writers were in Basis 1 while both erasers were in Basis 2. Additionally, genes in Basis 1 showed higher expression in HAM2 and genes in Basis 2 showed higher expression in HAM1; these differences were all statistically significant except for HDAC2 and SP100 (Figure 8).

3. Discussion

Histone acetylation modulator genes have been shown to play an essential role in the epigenetic control of breast cancer, although their specific roles remain unclear. In this study, a gene signature composed of eight histone acetylation modulator genes was identified, whose expression values can be used to classify breast cancers into HAM1 and HAM2 groups. Notably, the overall survival of the HER2-enriched and basal-like molecular subtypes differed between these two groups, with the HAM1 group showing a much better prognosis. This suggests clinical significance and prognostic value for the HAM group classification. Specifically, the HAM gene signature can be measured along with PAM50 as a further stratification. In cases classified as HER2-enriched or basal-like by PAM50, further classification into the HAM1 group would suggest a good prognosis, while classification into HAM2 would indicate a poor prognosis. Indeed, numerous studies have shown that each intrinsic subtype identified by PAM50 is still heterogeneous and that further stratification, particularly with prognostic significance, is necessary [18,19,20].
The specific roles of the eight signature genes in breast cancer have been studied before, although to different depths. For the two writers, CLOCK was reported to have a regulatory role in breast cancer tumorigenesis [21,22], while no studies have specifically reported the role of GTF3C4 in breast cancer. For the two erasers, overexpression of HDAC2 had a strong effect on breast cancer prognosis [23,24,25], while the effect of SIRT7 appears to be controversial [26,27,28]. Moreover, the writer genes were more highly expressed in HAM2 than in HAM1, while the erasers showed the opposite trend. However, since the HAM gene signature comprises only 8 of the 73 HAM genes, the overall histone acetylation level cannot be directly inferred from the expression of the signature genes. Instead, the acetylation status in the HAM1 and HAM2 groups should be measured in breast cancer samples. The functions of these signature genes and the correlations among them should be further explored.

4. Materials and Methods

4.1. Data Collection and Processing

Data acquisition and analysis were conducted using R software (https://www.r-project.org/, version 4.0.3) unless otherwise mentioned. RNA-seq and clinical data were downloaded from the TCGA dataset [29] using the TCGAbiolinks R/Bioconductor package (version 2.18.0) [30]. Specifically, we used TCGAbiolinks to download Illumina HiSeq RNASeqV2 data for 1102 breast cancer samples.
Fragments Per Kilobase of transcript per Million fragments mapped (FPKM) is the most commonly used normalization method for RNA transcript reads. Upper-quartile normalized FPKM (FPKM-UQ) uses the upper-quartile gene count rather than the total gene count for normalization and is believed to identify differentially expressed genes more accurately [31]. In this study, FPKM-UQ RNA-seq data were downloaded and prepared using the GDCquery, GDCdownload, and GDCprepare functions.
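The difference between the two normalizations can be illustrated with a small Python sketch. It assumes the commonly stated definitions, FPKM = count × 10^9 / (total count × gene length) and FPKM-UQ = count × 10^9 / (upper-quartile count × gene length); the gene names, counts and lengths are hypothetical, and the quartile is taken over all toy genes, which may differ from the exact gene set used by the data provider.

```python
from statistics import quantiles

def fpkm(count, gene_length_bp, total_count):
    """FPKM: scale a gene's read count by gene length and total library size."""
    return count * 1e9 / (total_count * gene_length_bp)

def fpkm_uq(count, gene_length_bp, upper_quartile_count):
    """FPKM-UQ: same as FPKM, but scaled by the 75th-percentile gene count."""
    return count * 1e9 / (upper_quartile_count * gene_length_bp)

# Hypothetical read counts for one sample; every toy gene is 2000 bp long.
counts = {"geneA": 10, "geneB": 20, "geneC": 30, "geneD": 40, "geneE": 50}
gene_length_bp = 2000

total_count = sum(counts.values())
# Third quartile of the per-gene counts (inclusive method is an assumption).
upper_quartile = quantiles(list(counts.values()), n=4, method="inclusive")[2]

for gene, count in counts.items():
    print(gene,
          round(fpkm(count, gene_length_bp, total_count), 1),
          round(fpkm_uq(count, gene_length_bp, upper_quartile), 1))
```

Because the upper quartile is less sensitive than the total count to a few very highly expressed genes, the FPKM-UQ scale factor tends to be more stable across samples.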

4.2. Identification and Verification of Featured Genes

Of the 73 histone acetylation modulator genes, those significantly associated with prognosis were selected using the function “FSbyCOX” in the package CancerSubtypes (version 1.16.0) [17], which selects featured genes with the Cox regression model. Eight genes were found to have significant prognostic value, while the other 65 genes showed no prognostic value in breast cancer. The selected genes were further verified by analyzing the association between their expression and overall survival. The optimal cutpoint separating low and high expression was determined with the “surv_cutpoint” function in the “survminer” package in R. Survival analyses were performed for each of the eight genes by comparing the overall survival of the high-expression and low-expression groups.

4.3. Nonnegative Matrix Factorization Clustering

NMF is a clustering method widely used for cancer molecular subtyping with gene expression data [32,33]. The standard ‘brunet’ algorithm was run 30 times using the R package ‘NMF’ [34]. In the clustering, the correlation coefficient between every pair of samples was calculated using the expression values of the eight feature genes. All of the correlation values can then be plotted in an 1102 × 1102 matrix, with each row and column representing one sample in the same order. The NMF algorithm was used for the clustering with the number of components set to 2 (κ = 2). Silhouette widths were generated from the NMF consensus membership matrix to represent the cluster validity.
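As an illustration of the idea behind NMF clustering, the sketch below factorizes a small non-negative matrix V (genes × samples) into W (genes × components) and H (components × samples) using multiplicative updates that minimize the Kullback-Leibler divergence, the update rule on which the ‘brunet’ method is based. It is a plain-Python toy under stated assumptions, not the R ‘NMF’ package: the matrix values, the seed, the iteration count and all function names are hypothetical, and there is no consensus step over repeated runs.

```python
import random
from math import log

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def kl_divergence(v, wh, eps=1e-12):
    """Generalized Kullback-Leibler divergence D(V || WH)."""
    total = 0.0
    for v_row, wh_row in zip(v, wh):
        for x, y in zip(v_row, wh_row):
            total += (x * log((x + eps) / (y + eps)) if x > 0 else 0.0) - x + y
    return total

def nmf_kl(v, rank, n_iter=200, seed=0, eps=1e-12):
    """Factorize V ~ W H with KL-divergence multiplicative updates."""
    rng = random.Random(seed)
    n_rows, n_cols = len(v), len(v[0])
    w = [[rng.uniform(0.1, 1.0) for _ in range(rank)] for _ in range(n_rows)]
    h = [[rng.uniform(0.1, 1.0) for _ in range(n_cols)] for _ in range(rank)]
    for _ in range(n_iter):
        # Update H: H <- H * (W^T (V / WH)) / (column sums of W)
        wh = matmul(w, h)
        ratio = [[v[i][j] / (wh[i][j] + eps) for j in range(n_cols)] for i in range(n_rows)]
        h = [[h[k][j] * sum(w[i][k] * ratio[i][j] for i in range(n_rows))
              / (sum(w[i][k] for i in range(n_rows)) + eps)
              for j in range(n_cols)] for k in range(rank)]
        # Update W: W <- W * ((V / WH) H^T) / (row sums of H)
        wh = matmul(w, h)
        ratio = [[v[i][j] / (wh[i][j] + eps) for j in range(n_cols)] for i in range(n_rows)]
        w = [[w[i][k] * sum(ratio[i][j] * h[k][j] for j in range(n_cols))
              / (sum(h[k]) + eps)
              for k in range(rank)] for i in range(n_rows)]
    return w, h

# Hypothetical expression matrix: 4 genes (rows) x 6 samples (columns).
v = [[9, 9, 9, 1, 1, 1],
     [9, 9, 9, 1, 1, 1],
     [1, 1, 1, 9, 9, 9],
     [1, 1, 1, 9, 9, 9]]

w, h = nmf_kl(v, rank=2)
# Assign each sample to the component with the largest coefficient in H.
labels = [max(range(2), key=lambda k: h[k][j]) for j in range(len(v[0]))]
print("cluster labels:", labels)
print("final divergence:", kl_divergence(v, matmul(w, h)))
```

In this scheme the columns of W play the role of the basis components (groups of co-expressed genes) and the columns of H give each sample's weight on those components. In practice the factorization is repeated from several random starts and the assignments are summarized in a consensus matrix.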

4.4. Survival Analysis

Survival analyses were performed using the ‘survival’ (version 2.41) package [35]. The Kaplan–Meier method was used to estimate the survival outcomes of all patients by different categories; groups were compared using the log-rank statistic [36]. p-values were calculated as two-sided, with statistical significance declared for p less than 0.05.
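The two procedures named above can be sketched in a few lines of Python. This is an illustrative implementation on hypothetical follow-up data, not the R ‘survival’ package: `kaplan_meier` multiplies the conditional survival probabilities at each event time, and `logrank_test` compares observed and expected event counts in one group and converts the chi-squared statistic (1 degree of freedom) into a two-sided p-value. Event indicators use 1 for death and 0 for censoring.

```python
from math import erfc, sqrt

def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct event time."""
    survival, curve = 1.0, []
    for t in sorted({t for t, e in zip(times, events) if e}):
        at_risk = sum(1 for u in times if u >= t)
        deaths = sum(1 for u, e in zip(times, events) if u == t and e)
        survival *= 1 - deaths / at_risk
        curve.append((t, survival))
    return curve

def logrank_test(times_a, events_a, times_b, events_b):
    """Two-group log-rank test; returns (chi-squared statistic, p-value)."""
    all_times = times_a + times_b
    all_events = events_a + events_b
    observed_a = expected_a = variance = 0.0
    for t in sorted({t for t, e in zip(all_times, all_events) if e}):
        n_a = sum(1 for u in times_a if u >= t)          # at risk in group A
        n = n_a + sum(1 for u in times_b if u >= t)      # at risk overall
        d_a = sum(1 for u, e in zip(times_a, events_a) if u == t and e)
        d = d_a + sum(1 for u, e in zip(times_b, events_b) if u == t and e)
        observed_a += d_a
        expected_a += d * n_a / n
        if n > 1:
            # Hypergeometric variance of the number of events in group A.
            variance += d * (n_a / n) * (1 - n_a / n) * (n - d) / (n - 1)
    chi2 = (observed_a - expected_a) ** 2 / variance
    p_value = erfc(sqrt(chi2 / 2))  # upper tail of chi-squared with 1 df
    return chi2, p_value

# Hypothetical follow-up times (months) and event indicators.
curve = kaplan_meier([1, 2, 3, 4], [1, 0, 1, 1])
print(curve)

good_times, good_events = [8, 9, 10, 11, 12], [0, 1, 0, 0, 0]
poor_times, poor_events = [1, 2, 3, 4, 5], [1, 1, 1, 1, 1]
chi2, p_value = logrank_test(good_times, good_events, poor_times, poor_events)
print(f"chi2 = {chi2:.2f}, p = {p_value:.4f}")
```

A p-value below 0.05 from the log-rank test corresponds to the significance threshold stated above.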

4.5. Analysis of Differentially Expressed Genes between the HAM1 and HAM2 Groups

The log2-transformed FPKM-UQ data were analyzed using the lmFit, eBayes, and topTable functions of the limma package (version 3.46.0) [37] to identify DEGs between the HAM1 and HAM2 groups of patients. Student’s t-test was used to calculate the p-values of genes. Genes with p < 0.05 were considered DEGs.

Supplementary Materials

The following are available online at https://www.mdpi.com/1718-7729/28/1/91/s1, Figure S1: Expression of the eight HAM signature genes in normal breast tissues and breast cancers with four PAM50 intrinsic molecular subtypes, Figure S2: Volcano plot of differentially expressed genes between HAM1 and HAM2 groups, Table S1: HAM group classification result for each TCGA sample, Table S2: List of differentially expressed genes between HAM1 and HAM2.

Author Contributions

Conceptualization, M.L. and T.H.; methodology, T.H.; software, M.L.; validation, M.L. and T.H.; formal analysis, M.L.; investigation, M.L. and W.H.; resources, T.H. and Y.L.; data curation, M.L.; writing—original draft preparation, M.L.; writing—review and editing, M.L., W.H., Y.L. and T.H.; visualization, T.H.; supervision, M.L.; project administration, T.H.; funding acquisition, M.L., W.H. and T.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (Grant No. 82002979 and 81702839), and the Scientific Research and Development Funds of Peking University People’s Hospital (Grant No. RDY2020-16).

Institutional Review Board Statement

Ethical review and approval were waived for this study because it used open-access data.

Informed Consent Statement

Patient consent was waived because the data in the TCGA database are de-identified.

Data Availability Statement

The data presented in this study are available in the article and supplementary material, and are also available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Sung, H.; Ferlay, J.; Siegel, R.L.; Laversanne, M.; Soerjomataram, I.; Jemal, A.; Bray, F. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA Cancer J. Clin. 2021.
  2. Szymiczek, A.; Lone, A.; Akbari, M.R. Molecular Intrinsic versus Clinical Subtyping in Breast Cancer: A Comprehensive Review. Clin. Genet. 2020.
  3. Perou, C.M.; Sørlie, T.; Eisen, M.B.; Van De Rijn, M.; Jeffrey, S.S.; Rees, C.A.; Pollack, J.R.; Ross, D.T.; Johnsen, H.; Akslen, L.A.; et al. Molecular Portraits of Human Breast Tumours. Nature 2000, 406, 747–752.
  4. Cascianelli, S.; Molineris, I.; Isella, C.; Masseroli, M.; Medico, E. Machine Learning for RNA Sequencing-Based Intrinsic Subtyping of Breast Cancer. Sci. Rep. 2020, 10, 1–13.
  5. Mohammad, H.P.; Barbash, O.; Creasy, C.L. Targeting Epigenetic Modifications in Cancer Therapy: Erasing the Roadmap to Cancer. Nat. Med. 2019, 25, 403–418.
  6. Park, J.W.; Han, J.-W. Targeting Epigenetics for Cancer Therapy. Arch. Pharmacal. Res. 2019, 42, 159–170.
  7. Stefansson, O.A.; Moran, S.; Gomez, A.; Sayols, S.; Arribas-Jorba, C.; Sandoval, J.; Hilmarsdottir, H.; Ólafsdóttir, E.; Tryggvadottir, L.; Jonasson, J.G.; et al. A DNA Methylation-Based Definition of Biologically Distinct Breast Cancer Subtypes. Mol. Oncol. 2014, 9, 555–568.
  8. Yun, M.; Wu, J.; Workman, J.L.; Li, B. Readers of Histone Modifications. Cell Res. 2011, 21, 564–578.
  9. Jain, A.K.; Barton, M.C. Bromodomain Histone Readers and Cancer. J. Mol. Biol. 2017, 429, 2003–2010.
  10. Marmorstein, R.; Zhou, M.-M. Writers and Readers of Histone Acetylation: Structure, Mechanism, and Inhibition. Cold Spring Harb. Perspect. Biol. 2014, 6, a018762.
  11. Suzuki, J.; Chen, Y.-Y.; Scott, G.K.; Devries, S.; Chin, K.; Benz, C.C.; Waldman, F.M.; Hwang, E.S. Protein Acetylation and Histone Deacetylase Expression Associated with Malignant Breast Cancer Progression. Clin. Cancer Res. 2009, 15, 3163–3171.
  12. Elsheikh, S.E.; Green, A.R.; Rakha, E.A.; Powe, D.G.; Ahmed, R.A.; Collins, H.M.; Soria, D.; Garibaldi, J.M.; Paish, C.E.; Ammar, A.A.; et al. Global Histone Modifications in Breast Cancer Correlate with Tumor Phenotypes, Prognostic Factors, and Patient Outcome. Cancer Res. 2009, 69, 3802–3809.
  13. Manzotti, G.; Ciarrocchi, A.; Sancisi, V. Inhibition of BET Proteins and Histone Deacetylase (HDACs): Crossing Roads in Cancer Therapy. Cancers 2019, 11, 304.
  14. Romero, D. HDAC Inhibitors Tested in Phase III Trial. Nat. Rev. Clin. Oncol. 2019, 16, 465.
  15. Wilting, R.H.; Dannenberg, J.-H. Epigenetic Mechanisms in Tumorigenesis, Tumor Cell Heterogeneity and Drug Resistance. Drug Resist. Updat. 2012, 15, 21–38.
  16. Hu, Z.; Zhou, J.; Jiang, J.; Yuan, J.; Zhang, Y.; Wei, X.; Loo, N.; Wang, Y.; Pan, Y.; Zhang, T. Genomic Characterization of Genes Encoding Histone Acetylation Modulator Proteins Identifies Therapeutic Targets for Cancer Treatment. Nat. Commun. 2019, 10, 1–17.
  17. Xu, T.; Le, T.D.; Liu, L.; Su, N.; Wang, R.; Sun, B.-Y.; Colaprico, A.; Bontempi, G.; Li, J. CancerSubtypes: An R/Bioconductor Package for Molecular Cancer Subtype Identification, Validation and Visualization. Bioinformatics 2017, 33, 3131–3133.
  18. Mathews, J.C.; Nadeem, S.; Levine, A.J.; Pouryahya, M.; Deasy, J.O.; Tannenbaum, A. Robust and Interpretable PAM50 Reclassification Exhibits Survival Advantage for Myoepithelial and Immune Phenotypes. npj Breast Cancer 2019, 5, 1–8.
  19. Lehmann, B.D.; Bauer, J.A.; Chen, X.; Sanders, M.E.; Chakravarthy, A.B.; Shyr, Y.; Pietenpol, J.A. Identification of Human Triple-Negative Breast Cancer Subtypes and Preclinical Models for Selection of Targeted Therapies. J. Clin. Investig. 2011, 121, 2750–2767.
  20. Lipton, A.; Goodman, L.; Leitzel, K.; Cook, J.; Sperinde, J.; Haddad, M.; Köstler, W.J.; Huang, W.; Weidler, J.M.; Ali, S.; et al. HER3, p95HER2, and HER2 Protein Expression Levels Define Multiple Subtypes of HER2-Positive Metastatic Breast Cancer. Breast Cancer Res. Treat. 2013, 141, 43–53.
  21. Hoffman, A.E.; Yi, C.-H.; Zheng, T.; Stevens, R.G.; Leaderer, D.; Zhang, Y.; Holford, T.R.; Hansen, J.; Paulson, J.; Zhu, Y. CLOCK in Breast Tumorigenesis: Genetic, Epigenetic, and Transcriptional Profiling Analyses. Cancer Res. 2010, 70, 1459–1468.
  22. Hadadi, E.; Taylor, W.; Li, X.-M.; Aslan, Y.; Villote, M.; Rivière, J.; Duvallet, G.; Auriau, C.; Dulong, S.; Raymond-Letron, I. Chronic Circadian Disruption Modulates Breast Cancer Stemness and Immune Microenvironment to Drive Metastasis in Mice. Nat. Commun. 2020, 11, 1–17.
  23. Bayat, S.; Mansoori Derakhshan, S.; Mansoori Derakhshan, N.; Shekari Khaniani, M.; Alivand, M.R. Down-Regulation of HDAC2 and HDAC3 Via Oleuropein as a Potent Prevention and Therapeutic Agent in MCF-7 Breast Cancer Cells. J. Cell. Biochem. 2019, 120, 9172–9180.
  24. Shan, W.; Jiang, Y.; Yu, H.; Huang, Q.; Liu, L.; Guo, X.; Li, L.; Mi, Q.; Zhang, K.; Yang, Z. HDAC2 Overexpression Correlates with Aggressive Clinicopathological Features and DNA-Damage Response Pathway of Breast Cancer. Am. J. Cancer Res. 2017, 7, 1213.
  25. Müller, B.M.; Jana, L.; Kasajima, A.; Lehmann, A.; Prinzler, J.; Budczies, J.; Winzer, K.-J.; Dietel, M.; Weichert, W.; Denkert, C. Differential Expression of Histone Deacetylases HDAC1, 2 and 3 in Human Breast Cancer: Overexpression of HDAC2 and HDAC3 Is Associated with Clinicopathological Indicators of Disease Progression. BMC Cancer 2013, 13, 215.
  26. Huo, Q.; Li, Z.; Cheng, L.; Yang, F.; Xie, N. SIRT7 Is a Prognostic Biomarker Associated With Immune Infiltration in Luminal Breast Cancer. Front. Oncol. 2020, 10.
  27. Tang, X.; Shi, L.; Xie, N.; Liu, Z.; Qian, M.; Meng, F.; Xu, Q.; Zhou, M.; Cao, X.; Zhu, W.-G.; et al. SIRT7 Antagonizes TGF-β Signaling and Inhibits Breast Cancer Metastasis. Nat. Commun. 2017, 8, 1–14.
  28. Geng, Q.; Peng, H.; Chen, F.; Luo, R.; Li, R. High Expression of Sirt7 Served as a Predictor of Adverse Outcome in Breast Cancer. Int. J. Clin. Exp. Pathol. 2015, 8, 1938–1945.
  29. Cancer Genome Atlas Network. Comprehensive Molecular Portraits of Human Breast Tumours. Nature 2012, 490, 61.
  30. Colaprico, A.; Silva, T.C.; Olsen, C.; Garofano, L.; Cava, C.; Garolini, D.; Sabedot, T.S.; Malta, T.M.; Pagnotta, S.M.; Castiglioni, I.; et al. TCGAbiolinks: An R/Bioconductor Package for Integrative Analysis of TCGA Data. Nucleic Acids Res. 2016, 44, e71.
  31. Jensen, M.A.; Ferretti, V.; Grossman, R.L.; Staudt, L.M. The NCI Genomic Data Commons as an Engine for Precision Medicine. Blood 2017, 130, 453–459.
  32. Gao, Y.; Church, G. Improving Molecular Cancer Class Discovery through Sparse Non-negative Matrix Factorization. Bioinformatics 2005, 21, 3970–3975.
  33. Mirzal, A. Nonparametric Tikhonov Regularized NMF and Its Application in Cancer Clustering. IEEE/ACM Trans. Comput. Biol. Bioinform. 2014, 11, 1208–1217.
  34. Gaujoux, R.; Seoighe, C. A Flexible R Package for Nonnegative Matrix Factorization. BMC Bioinform. 2010, 11, 367.
  35. Therneau, T.M.; Grambsch, P.M. The Cox Model. In Statistics for Biology and Health; Springer Science and Business Media LLC.: Berlin, Germany, 2000; pp. 39–77.
  36. Bland, J.M.; Altman, D.G. The Logrank Test. BMJ 2004, 328, 1073.
  37. Ritchie, M.E.; Phipson, B.; Wu, D.; Hu, Y.; Law, C.W.; Shi, W.; Smyth, G.K. Limma Powers Differential Expression Analyses for RNA-Sequencing and Microarray Studies. Nucleic Acids Res. 2015, 43, e47.
Figure 1. Survival analysis of the eight featured genes. Survival analyses were conducted by Kaplan–Meier method according to the expression level of each specific gene. The optimum cutpoint for distinguishing low and high expression group was determined with the “surv_cutpoint” function.
Figure 2. Heatmap of the correlation coefficients clustered by non-negative matrix factorization (NMF). The final clustering generated two groups, the rectangle in the upper left and the rectangle in the lower right, which were named HAM1 and HAM2, respectively. Each point in the heatmap represents the correlation coefficient between two samples, as displayed by the color scale. Three annotation tracks that contributed to the clustering are displayed above the heatmap: basis components, consensus and silhouette width. The consensus track represents the consistency across the 30 runs of NMF clustering; the ‘basis component’ and ‘silhouette’ tracks are analyzed in more detail in the following figures.
Figure 3. The silhouette width analysis of the clustering using HAM signature. The value of silhouette width represents the cluster validity where a higher average silhouette width indicates higher sample tightness and better cluster separation. Silhouette width for each sample was calculated and displayed by the plot with the average silhouette width for each cluster and total patients presented.
Figure 4. Kaplan–Meier estimate of the overall survival for HAM1 and HAM2 groups of patients.
Figure 5. Kaplan–Meier estimate of the overall survival for four intrinsic subtypes between the HAM1 and HAM2 groups.
Figure 6. Gene expression correlation plot of the HAM signature genes. The number in each square represents the correlation coefficient between the two genes of the corresponding row and column. Squares marked with “×” are gene pairs without significant correlation, defined by a p-value larger than 0.05.
Figure 7. Two major basis components analysis for the NMF clustering. The HAM signature genes can be divided into two basis components with GTF3C4, CLOCK, and BRWD3 being one of the components and the other five genes being the other.
Figure 8. Expression of the eight HAM signature genes in HAM1 and HAM2 groups of patients. All of the eight HAM signature genes, except for HDAC2 and SP100, showed significantly different expression between the HAM1 and HAM2 groups. Genes in the upper row, including BRWD3, CLOCK and GTF3C4, have higher expression in the HAM2 group, while those in the lower row, including SIRT7, BRD4 and BRD7, are expressed more in the HAM1 group.
Table 1. Distribution of four molecular intrinsic subtypes in HAM1 and HAM2 groups.
HAM Groups    Luminal A      Luminal B     Basal-Like    HER2-Enriched
HAM1          68 (35.1%)     47 (24.2%)    55 (28.3%)    24 (12.4%)
HAM2          164 (50.9%)    78 (24.2%)    46 (14.3%)    34 (10.6%)
A Chi-squared test was performed to compare the distribution of four intrinsic subtypes between the two groups with the p-value found to be 0.00023.
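The chi-squared test reported for Table 1 can be reproduced from the counts alone. The Python sketch below computes Pearson's statistic for the 2 × 4 table and its p-value using the closed-form upper tail of the chi-squared distribution with 3 degrees of freedom; the function names are illustrative, and the closed form is valid only for 3 degrees of freedom.

```python
from math import erfc, exp, pi, sqrt

# Observed counts from Table 1 (Luminal A, Luminal B, Basal-Like, HER2-Enriched).
observed = [
    [68, 47, 55, 24],    # HAM1
    [164, 78, 46, 34],   # HAM2
]

def chi_squared_statistic(rows):
    """Pearson chi-squared statistic and degrees of freedom for a contingency table."""
    row_totals = [sum(r) for r in rows]
    col_totals = [sum(c) for c in zip(*rows)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for row, row_total in zip(rows, row_totals):
        for obs, col_total in zip(row, col_totals):
            expected = row_total * col_total / grand_total
            statistic += (obs - expected) ** 2 / expected
    dof = (len(rows) - 1) * (len(col_totals) - 1)
    return statistic, dof

def chi2_upper_tail_3df(x):
    """P(X > x) for a chi-squared variable with exactly 3 degrees of freedom."""
    return erfc(sqrt(x / 2)) + sqrt(2 * x / pi) * exp(-x / 2)

statistic, dof = chi_squared_statistic(observed)
p_value = chi2_upper_tail_3df(statistic)
print(f"chi2 = {statistic:.2f}, dof = {dof}, p = {p_value:.5f}")
```

With these counts the statistic is about 19.38 on 3 degrees of freedom, which gives a p-value of about 0.00023, in agreement with the value stated under Table 1.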
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Long, M.; Hou, W.; Liu, Y.; Hu, T. A Histone Acetylation Modulator Gene Signature for Classification and Prognosis of Breast Cancer. Curr. Oncol. 2021, 28, 928-939. https://doi.org/10.3390/curroncol28010091

Chicago/Turabian Style

Long, Mengping, Wei Hou, Yiqiang Liu, and Taobo Hu. 2021. "A Histone Acetylation Modulator Gene Signature for Classification and Prognosis of Breast Cancer" Current Oncology 28, no. 1: 928-939. https://doi.org/10.3390/curroncol28010091
