Critical Analysis of Genome-Wide Association Studies: Triple Negative Breast Cancer Quae Exempli Causa

Jurj, Maria-Ancuta; Buse, Mihail; Zimta, Alina-Andreea; Paradiso, Angelo; Korban, Schuyler S.; Pop, Laura-Ancuta; Berindan-Neagoe, Ioana

doi:10.3390/ijms21165835

Open Access Review

by Maria-Ancuta Jurj ¹,†, Mihail Buse ²,†, Alina-Andreea Zimta ², Angelo Paradiso ³, Schuyler S. Korban ⁴, Laura-Ancuta Pop ¹,* and Ioana Berindan-Neagoe ¹,⁵

¹ Research Center for Functional Genomics, Biomedicine and Translational Medicine, Iuliu Hatieganu University of Medicine and Pharmacy, 400337 Cluj-Napoca, Cluj, Romania

² MEDFUTURE Research Center for Advanced Medicine, Iuliu Hatieganu University of Medicine and Pharmacy, 400337 Cluj-Napoca, Romania

³ Laboratorio Oncologia Sperimentale Clinica, Istituto Oncologico-Bari, I-70124 Bari, Italy

⁴ Department of Natural Resources and Environmental Sciences, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA

⁵ Department of Functional Genomics and Experimental Pathology, The Oncology Institute “Prof. Dr. I Chiricuta”, 400015 Cluj-Napoca, Cluj, Romania

* Author to whom correspondence should be addressed.

† Authors with equal contribution.

Int. J. Mol. Sci. 2020, 21(16), 5835; https://doi.org/10.3390/ijms21165835

Submission received: 15 July 2020 / Revised: 10 August 2020 / Accepted: 11 August 2020 / Published: 14 August 2020

(This article belongs to the Special Issue Functional Genomics in Health and Disease)

Abstract

Genome-wide association studies (GWAS) are useful in assessing and analyzing either differences or variations in DNA sequences across the human genome to detect genetic risk factors of diseases prevalent within a target population under study. The ultimate goal of GWAS is to predict either disease risk or disease progression by identifying genetic risk factors. These risk factors will define the biological basis of disease susceptibility for the purposes of developing innovative, preventative, and therapeutic strategies. As single nucleotide polymorphisms (SNPs) are often used in GWAS, their relevance for triple negative breast cancer (TNBC) will be assessed in this review. Furthermore, as there are different levels and patterns of linkage disequilibrium (LD) present within different human subpopulations, a plausible strategy to evaluate known SNPs associated with incidence of breast cancer in ethnically different patient cohorts will be presented and discussed. Additionally, a description of GWAS for TNBC will be presented, involving various identified SNPs correlated with miRNA sites to determine their efficacies on either prognosis or progression of TNBC in patients. Although GWAS have identified multiple common breast cancer susceptibility variants that individually would result in minor risks, it is their combined effects that would likely result in major risks. Thus, one approach to quantify synergistic effects of such common variants is to utilize polygenic risk scores. Therefore, studies utilizing predictive risk scores (PRSs) based on known breast cancer susceptibility SNPs will be evaluated. Such PRSs are potentially useful in improving stratification for screening, particularly when combining family history, other risk factors, and risk prediction models. In conclusion, although interpretation of the results from GWAS remains a challenge, the use of SNPs associated with TNBC may elucidate and better contextualize these studies.

Keywords: breast cancer; CNVs; GWAS; linkage disequilibrium; predictive risk scores; SNPs; structural variants; TNBC

1. Introduction

One of the most important discoveries of modern genetics following the release of the complete sequence of the human genome and mapping of all genes is the role of genetic variation in disease incidence. To date, there are 88 million genetic variants that have been characterized. A genetic variant is an umbrella term referring to different types of alterations in a sequence within a certain region of the genome. Although the majority of these variants are unlikely to impact human health, there are some that have been associated with disease. These associations have provided valuable information about human diseases. Such genetic variants include indels (“insertions” or “deletions”), copy number variations (CNVs), translocations, transversions, and single nucleotide polymorphisms (SNPs). For the purpose of simplicity and coherence, we will solely focus on SNPs in the present review.

SNPs are nucleotide variations occurring at a specific location within a genome, and with a frequency greater than 1% in a population. However, it should be noted that these variations do not necessarily cause disease. Variants occurring at lower frequencies are usually referred to as mutations and, in combination with other genetic or epigenetic factors, may be responsible for causing disease. Of the aforementioned 88 million variants characterized in the human genome, 84.7 million are SNPs [1]. As SNPs have two alleles, with some exceptions wherein three alleles have been identified, there are typically two possible bases at a given SNP position within a population [2]. SNPs are frequently used as markers for a genomic region in genetic studies, with the majority of these SNPs having minimal impacts on biological systems. For example, it has been demonstrated that, while an individual SNP or two SNPs do not appear to correlate with specific cancers, a group of 77 SNPs is found to be highly associated with breast cancer development [3]. In addition to genome-wide association studies (GWAS), next-generation sequencing technologies (NGST) have also enabled the use of SNPs for both genetic mapping and evolutionary studies, as SNPs offer unique advantages over other forms of genetic markers. Firstly, when compared with microsatellites and mitochondrial DNAs, SNPs are less affected by homoplasy, as they correspond to single base nucleotide substitutions, and thus their origins can be explained by mutation models. Secondly, in order to determine parent relatedness and population structure, SNPs have been utilized to quantify genetic variations within an individual genome. Thirdly, SNPs are highly desirable as they are present in large numbers. However, it should be noted that, as with all marker systems, there is a certain level of bias within a panel of SNPs under consideration. 
In order to overcome some disadvantages of the use of SNP markers in genetic studies, the use of randomly distributed SNPs throughout a genome is highly recommended for any population under investigation [4]. At a minimum, haplotype blocks within a genome should be randomly chosen [4]. Moreover, it is important to point out that SNPs and genetic mutations are neither the same, nor are they synonymous [2].
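
The 1% frequency convention described above can be made concrete with a short sketch. The function names and genotype counts below are illustrative assumptions and are not data from the cited studies; the sketch also assumes a biallelic site.

```python
def minor_allele_frequency(n_ref_hom, n_het, n_alt_hom):
    """Return the minor allele frequency of a biallelic site from genotype counts."""
    total_alleles = 2 * (n_ref_hom + n_het + n_alt_hom)
    if total_alleles == 0:
        raise ValueError("no genotyped individuals")
    # Each heterozygote carries one alternate allele, each alt homozygote two.
    alt_freq = (2 * n_alt_hom + n_het) / total_alleles
    return min(alt_freq, 1.0 - alt_freq)


def is_common_variant(maf, threshold=0.01):
    """Apply the >1% population-frequency convention used in the text."""
    return maf > threshold


# Hypothetical genotype counts (ref/ref, ref/alt, alt/alt) in 1000 individuals.
maf_common = minor_allele_frequency(810, 180, 10)  # (2*10 + 180) / 2000
maf_rare = minor_allele_frequency(990, 10, 0)      # 10 / 2000
```

Under these assumed counts, the first site would be classed as an SNP and the second as a rare variant.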

GWAS assess and analyze differences or variations in DNA sequences across the human genome to detect genetic risk factors of diseases prevalent in a population under investigation. The ultimate goal of GWAS is to predict either disease risk or disease progression by utilizing genetic risk factors to define the biological basis of disease susceptibility. This also enables the development of innovative, preventative, and therapeutic strategies [2]. There are two fundamental concepts underlying GWAS, namely linkage disequilibrium (LD) and the common disease–common variant (CD–CV) hypothesis. These concepts will be discussed in more detail later on in this review. A basic technical workflow for a typical GWAS is presented in Figure 1.

Therefore, GWAS serve as a tool to identify associations between genetic regions and specific traits of interest. A basic GWAS will compare the genetic profiles of hundreds of patients of a well-defined phenotype with those of hundreds of control subjects. In this review, we will present and discuss the most significant findings obtained thus far in GWAS for triple negative breast cancer (TNBC). We have elected to focus on the TNBC subtype mainly owing to its aggressiveness, young age of diagnosis, and limited treatment options.
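
The case-control comparison described above is carried out one SNP at a time. The following is a minimal sketch of one common choice, an allelic chi-square test on a 2x2 table of allele counts; the review does not prescribe a specific test, and the counts and function name are hypothetical.

```python
import math


def allelic_chi_square(case_a, case_b, ctrl_a, ctrl_b):
    """Pearson chi-square (1 df) on a 2x2 table of allele counts.

    Returns (chi2, p_value, odds_ratio) for allele A versus allele B.
    """
    n = case_a + case_b + ctrl_a + ctrl_b
    row_case, row_ctrl = case_a + case_b, ctrl_a + ctrl_b
    col_a, col_b = case_a + ctrl_a, case_b + ctrl_b
    if 0 in (row_case, row_ctrl, col_a, col_b):
        raise ValueError("degenerate table: a row or column total is zero")
    chi2 = n * (case_a * ctrl_b - case_b * ctrl_a) ** 2 / (
        row_case * row_ctrl * col_a * col_b
    )
    # Survival function of chi-square with 1 df: P(X > x) = erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    cross = case_b * ctrl_a
    odds_ratio = (case_a * ctrl_b) / cross if cross else float("inf")
    return chi2, p_value, odds_ratio


# Hypothetical allele counts: 500 cases and 500 controls (1000 alleles each).
chi2, p_value, odds_ratio = allelic_chi_square(300, 700, 250, 750)
```

In a real study this test would be repeated across all genotyped SNPs, so the resulting p-values require correction for multiple testing.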

2. Fundamental Principles of Genome-Wide Association Studies

As mentioned above, there are two fundamental concepts underlying GWAS. These are linkage disequilibrium (LD) and the common disease–common variant (CD–CV) hypothesis.

LD is defined as a non-random association of alleles at two or more loci [5]. In turn, this provides insights into past genetic conditions and constraints, allowing for the determination of whether an association arose from natural selection, epigenetic mechanisms [6], or other mechanisms that do not occur in isolation, such as genetic drift or gene flow. If detected throughout the genome when investigating populations, LD mirrors population history, breeding system, and geographic subdivision patterns. When investigating genomic regions, LD reveals history of natural selection, gene conversion, mutation, and other forces that either contribute to or cause gene-frequency evolution. Therefore, detecting LD does not guarantee either linkage or lack of equilibrium. Ultimately, it is the local recombination rate that determines how the aforementioned factors affect LD within a certain genomic region or between paired loci [5]. As expected, this is related to the concept of chromosomal linkage, which implies that two markers found on a chromosome will remain physically linked on that chromosome throughout consecutive familial generations. However, recombination events will separate chromosomal segments within a family from one generation to another, and this effect is continuously amplified through several subsequent generations. Inevitably, recombination events will break apart segments of chromosomes carrying linked alleles until all alleles within a population are in linkage equilibrium. Simply stated, linkage disequilibrium describes the coupling of markers at the population level [2]. Furthermore, the rate of LD decay is dependent on the following factors: population size, number of founding chromosomes within a population, and number of generations for which the population has existed. Therefore, it comes as no surprise that there are different levels and patterns of LD when comparing different human subpopulations. 
For example, the most ancestral human population is that of an African-descent population, which, owing to the accumulation of more recombination events, has smaller regions of LD. Meanwhile, on average, European-descent and Asian-descent populations have larger regions of LD than African-descent populations. This is attributed to the fact that European- and Asian-descendant populations have been generated by founding events, whereby they have split from the African population, thus inherently changing the number of founding chromosomes, population size, and generational age of the population [2].
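
As a hedged illustration of how LD between two biallelic loci is usually quantified, the sketch below computes the standard coefficient D, its normalized form |D'|, and r² from haplotype and allele frequencies, together with the textbook decay of D under recombination. The function names and input frequencies are assumptions chosen for illustration; the decay formula ignores drift, selection, and population size, which the text identifies as additional factors.

```python
def ld_statistics(p_ab, p_a, p_b):
    """Return (D, |D'|, r^2) for two biallelic loci.

    p_ab: frequency of the haplotype carrying allele A (locus 1) and B (locus 2).
    p_a, p_b: population frequencies of allele A and allele B.
    """
    if not (0.0 < p_a < 1.0 and 0.0 < p_b < 1.0):
        raise ValueError("both loci must be polymorphic")
    d = p_ab - p_a * p_b  # zero under linkage equilibrium
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max if d_max > 0 else 0.0
    r_squared = d * d / (p_a * (1 - p_a) * p_b * (1 - p_b))
    return d, d_prime, r_squared


def ld_after_generations(d0, recombination_fraction, generations):
    """Expected D after t generations of random mating: D_t = D_0 * (1 - c)^t."""
    return d0 * (1.0 - recombination_fraction) ** generations


# Hypothetical frequencies for a pair of loci.
d, d_prime, r2 = ld_statistics(p_ab=0.40, p_a=0.50, p_b=0.50)
d_later = ld_after_generations(d, recombination_fraction=0.5, generations=2)
```

Because D shrinks by a factor of (1 - c) per generation, older populations with more accumulated recombination are expected to show shorter LD blocks, which is consistent with the pattern described for African-descent populations.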

In general, closely linked polymorphic SNPs have strong LD between them. The International HapMap consortium has demonstrated that the human genome contains haplotype blocks, within which either most or all are high LD SNPs. Thus, there is a fine-scale pattern of LD present in human populations. Subsequently, it has been assumed that high LD levels detected among SNPs are for those alleles that exhibit increased risks of complex inherited diseases [5]. Interestingly, this has been in fact observed for those SNPs significantly associated with breast cancer when GWAS have been conducted and large numbers of SNPs have been surveyed [7]. However, it is important to take into consideration that LD in GWAS can be generated by either undetected or unknown population stratification. Moreover, GWAS have been successful at uncovering associated SNPs, despite the low overall breast cancer risk within a population, and even in identifying new causative alleles [5]. In fact, five new variants have been found to be associated with familial breast cancer, but only 3.6% of familial breast cancer can be attributed to these alleles [7]. However, it is important to point out that it is the relatively high frequency of these causative alleles that allows for their detection by GWAS.

The CD–CV hypothesis [8] has been developed based on the following two principles: common diseases differ from rare disorders in terms of their underlying genetic architecture, and several of the susceptibility variants discovered for common diseases have a high minor allele frequency [2]. In other words, this hypothesis proposes that common diseases are influenced by genetic variations common within a population [8]. Firstly, this suggests that there has to be a high correlation between allele frequency and population occurrence. Secondly, if common genetic variants influence disease, then the effect size or penetrance for any one variant must be small relative to those exhibited by rare diseases [2]. This further implies that, if the same SNP causes a small change in gene expression that alters disease risk by a small proportion, this creates a scenario wherein the frequency of disease incidence and the causal allele are only weakly correlated. Thus, common variants cannot yield high effects. Thirdly, disease susceptibility is influenced by multiple common alleles based on the condition that common alleles have small genetic effects, and that common diseases exhibit heritability. Additionally, if an allele of a single SNP slightly increases disease risk, this implies that such an SNP accounts for a small amount of variance of the total variation caused by genetic factors. Consequently, multiple genetic factors synergistically account for the total genetic risk of a common disease [2]. However, it is important not to jump to the conclusion that the entire genetic component of any disease is attributable only to common alleles.
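
The idea that many common alleles of small effect combine into a sizeable overall risk is what a polygenic risk score summarizes. The sketch below assumes the simplest form, a sum of risk-allele counts weighted by per-allele log odds ratios; this is an additive model on the log-odds scale and does not capture true interactions between variants. The SNP identifiers and odds ratios are placeholders, not published estimates.

```python
import math


def polygenic_risk_score(genotypes, odds_ratios):
    """Sum of risk-allele counts (0, 1, or 2) weighted by per-allele log odds ratios.

    SNPs missing from `genotypes` are treated as carrying zero risk alleles,
    which is a simplifying assumption of this sketch.
    """
    score = 0.0
    for snp_id, per_allele_or in odds_ratios.items():
        dosage = genotypes.get(snp_id, 0)
        if dosage not in (0, 1, 2):
            raise ValueError(f"invalid risk-allele count for {snp_id}: {dosage}")
        score += dosage * math.log(per_allele_or)
    return score


# Hypothetical per-allele odds ratios, each individually modest.
odds_ratios = {"snp_1": 1.10, "snp_2": 1.05, "snp_3": 1.20}
carrier = {"snp_1": 2, "snp_2": 1, "snp_3": 2}
non_carrier = {"snp_1": 0, "snp_2": 0, "snp_3": 0}

# Odds of the carrier relative to an individual with no risk alleles.
relative_odds = math.exp(
    polygenic_risk_score(carrier, odds_ratios)
    - polygenic_risk_score(non_carrier, odds_ratios)
)
```

Even with only three assumed variants, none exceeding an odds ratio of 1.2, the combined relative odds come close to doubling, which illustrates why scores built from many such variants can be informative for risk stratification.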

3. Challenges of Genome-Wide Association Studies

While discussing genetic heterogeneity and the potential role of rare genetic variants in complex human diseases, McClellan and King [9] have pointed out some important and interesting criticisms of GWAS. Despite the fact that some of these criticisms have already been addressed, it is important to go through them to better understand these issues, and to improve the outcomes of GWAS.

One of these noted issues pertains to the fact that some of the genetic variants lack biological functions, and thereby their relative importance is highly diminished. In fact, it has been observed that GWAS are populated by risk variants of no known functions. Thus, the utility and reliability of GWAS have been questioned as most detected SNPs in GWAS are from intergenic regions [9]. Furthermore, GWAS identify approximate locations of loci associated with disease variants rather than attempt to specifically identify functional SNPs. This is attributed to widespread LD between segregating sites within a given human population. In addition, most SNPs in SNP arrays have unknown biological functions, as most SNPs in HapMaps are located in noncoding regions, and SNP arrays usually do not select for SNPs of known functions. Moreover, it is also important to emphasize that GWAS variants are not functional variants that confer risk, also referred to as “risk variants” in the published literature. Thus, 100% of a subpopulation carrying a risk allele does not truly suggest that all subjects of such a population are predisposed to risk. This simply indicates that LD patterns at a target locus are different than those of another subpopulation. Although the majority of thousands of “risk variants” that have been identified from GWAS have no apparent known biological functions, these are explained by using deduction and rationale, as outlined by the CD–CV hypothesis. This suggests that most genotyping platforms select for common variants. Moreover, as evolution has ensured that the most common variants are neutral, it should come as no surprise that most GWAS findings are neutral, originating from factors other than associations with disease risk. On the basis of evolutionary genetics, most alleles are in fact recent, and they are rare [10,11]. 
It is unclear what is exactly required for a common allele to remain in a population, as mechanisms of evolution can both facilitate and hinder heritability, particularly as they do not occur in isolation. For example, an allele can significantly increase in frequency without any need for selection when either a population bottleneck occurs (genetic drift) or when a subpopulation migrates and integrates with another (gene flow).

In another claim, it has been reported that it is population stratification that results in GWAS hits [9]. Although population stratification or substructure inflates test statistics, this can be readily identified, and adjusted for accordingly. In general, populations differ among each other over many loci and not only for one or two SNPs, which is precisely how whole-genome data are used to identify stratification. This is exemplified by the particularly fine-scale sub-populations in Europe that can be readily separated utilizing whole-genome data. Most importantly, as population stratification is one of the fundamental assumptions taken into consideration by the CD–CV hypothesis, the GWAS community has established methods to deal with population stratification that are fairly effective for common variants. For example, EigenStrat is a principal component analysis approach for addressing stratification, and is commonly used as standard practice for case-control GWAS datasets. Additionally, family-based study designs in GWAS have an advantage in protecting against stratification. Lastly, frequency estimates are dependent on sample size, thus conferring additional variations to such results [10,11].
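
Inflation of test statistics can be checked with a simple diagnostic. The sketch below uses genomic control, which is a different and simpler technique than the EigenStrat method named above: it estimates an inflation factor (lambda) as the ratio of the observed median chi-square statistic to the median expected under the null, and rescales the statistics when lambda exceeds 1. The input values are hypothetical.

```python
import statistics

# Median of the chi-square distribution with 1 degree of freedom.
CHI2_1DF_MEDIAN = 0.4549364


def genomic_inflation_factor(chi2_stats):
    """Lambda: observed median test statistic over the expected null median."""
    if not chi2_stats:
        raise ValueError("no test statistics supplied")
    return statistics.median(chi2_stats) / CHI2_1DF_MEDIAN


def genomic_control_adjust(chi2_stats):
    """Divide each statistic by lambda when lambda > 1; otherwise leave unchanged."""
    lam = max(genomic_inflation_factor(chi2_stats), 1.0)
    return [x / lam for x in chi2_stats]


# Hypothetical 1-df chi-square statistics from five SNPs (real scans use many more).
observed = [0.1, 0.3, 0.9098728, 2.0, 5.0]
lam = genomic_inflation_factor(observed)
adjusted = genomic_control_adjust(observed)
```

A lambda close to 1 suggests little inflation, whereas clearly larger values point to stratification or other systematic bias that should be addressed before interpreting associations.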

As with all studies, sample size significantly impacts interpretations of data. Single GWAS analyses are relatively underpowered owing to the fact that they have a limited number of samples, which drastically increases the probability of false-positive findings. Given this, implementing meta-analysis of several GWAS can overcome these small-sample numbers and study-specific limitations, thus providing a more robust statistical analysis and reduced false-positive results. To date, there are many published articles describing the meta-analysis of GWAS [12,13,14]. However, each meta-analysis consists of several stages comprising analysis set-up, investigation of heterogeneity, data storage, and variant selection for any subsequent analysis. There are several parameters and methods employed for meta-analyses, such as p-values, fixed effects, random effects, Bayesian statistics, and multivariate analysis [15]. Using such meta-analysis methods, a new collaboration, iCOGS, has discovered 74 new susceptibility loci for hormone-dependent cancers [16,17,18,19]. In addition, there are other consortia that have used this method for identifying SNPs relevant for other types of disease, such as BCAC [20], ISC [21], and MAGIC [22].
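
Of the meta-analysis methods listed above, the fixed-effects model is the simplest to illustrate. The following is a minimal sketch of inverse-variance weighting for one SNP across studies, with Cochran's Q as a basic heterogeneity check; the per-study effect sizes and standard errors are hypothetical, and the sketch does not reproduce the pipeline of any consortium named in the text.

```python
import math


def fixed_effect_meta(betas, standard_errors):
    """Inverse-variance-weighted pooled effect, its standard error, and two-sided p-value."""
    if len(betas) != len(standard_errors) or not betas:
        raise ValueError("need one standard error per effect estimate")
    weights = [1.0 / se ** 2 for se in standard_errors]
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / sum(weights)
    pooled_se = math.sqrt(1.0 / sum(weights))
    z = pooled_beta / pooled_se
    p_value = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided normal p-value
    return pooled_beta, pooled_se, p_value


def cochran_q(betas, standard_errors):
    """Cochran's Q: weighted squared deviations of study effects from the pooled effect."""
    pooled_beta, _, _ = fixed_effect_meta(betas, standard_errors)
    return sum((b - pooled_beta) ** 2 / se ** 2 for b, se in zip(betas, standard_errors))


# Hypothetical log odds ratios and standard errors for one SNP in two studies.
betas = [0.10, 0.20]
ses = [0.10, 0.10]
pooled_beta, pooled_se, p_value = fixed_effect_meta(betas, ses)
q_stat = cochran_q(betas, ses)
```

Pooling shrinks the standard error below that of either study alone, which is the source of the gain in power; a large Q relative to its degrees of freedom (number of studies minus one) would instead suggest that a random-effects model is more appropriate.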

The use of GWAS for cancer research studies has encountered several challenges, including the following: sample size; high numbers (430) of significant SNPs for cancer; association of several SNPs with multiple cancer localizations; implications of identified genes in several key signaling pathways involved in cancer; modulation of some pathways by lifestyle and environment; and, lastly, the fact that most studies are conducted using European populations, thus limiting extrapolation of these findings to other populations [23].

4. Genome-Wide Association Studies on Triple Negative Breast Cancer

4.1. Triple Negative Breast Cancer

Breast cancer remains a highly serious disease with the highest female incidence and mortality, as determined by age-standardized rate (ASR) per 100,000, of 46.3 and 13.0 worldwide, 54.5 and 15.5 for Eastern Europe [24], and 51.6 and 14.6 for Romania, respectively [25]. Breast cancer is a heterogeneous disease, and it is divided into several subtypes, each with its own risk, as well as pathological and molecular characteristics. Pathologically, breast cancer is subdivided into in situ carcinoma and invasive carcinoma. In situ carcinoma is further subdivided into ductal and lobular, each with its own subtypes and characteristics. Meanwhile, invasive carcinoma is subdivided into tubular, ductal lobular, invasive lobular, mucinous, medullary, infiltrating ductal, and papillary [26,27]. In another classification system currently used by pathologists, several biomarkers frequently observed in breast cancer are used, including estrogen receptor (ER), progesterone receptor (PR), receptor tyrosine-protein kinase erbB-2 (HER2/neu), and p53. Expression of these biomarkers has been mainly determined for the invasive carcinoma subtype; moreover, the presence or absence of these biomarkers is correlated with responses to targeted treatments for particular patients [28]. Finally, the last breast cancer classification relies on the use of molecular profiles that separate breast cancer into the following six subtypes: basal-like [29], luminal A, luminal B, HER2-enriched, normal-like, and claudin-low [30,31,32]. Furthermore, molecular classifications of breast cancer have allowed for the use of enhanced antibody-based protein staining techniques.

The above-mentioned receptor-based classification, undertaken by staining for estrogen, progesterone, and HER2 protein receptors, has led to the identification of the triple negative breast cancer (TNBC) subtype. This subtype is the most aggressive and lethal form of all breast cancer subtypes, owing to its elevated metastatic rate, recurrence, and drug resistance. Furthermore, as the nomenclature suggests, TNBC is characterized by a lack of ER, PR, and HER2/neu expression [33,34,35]. TNBC accounts for ~10–20% of total breast cancer diagnoses, and its molecular profile reveals that it consists almost exclusively of basal-like tumors, although not all basal-like tumors are triple negative. TNBC is pursued in numerous research studies and clinical trials because of its heterogeneity (exhibited both clinically and biologically), unfavorable outcomes, and aggressiveness. However, owing to the limited availability of targeted treatment options, conventional chemotherapy and radiotherapy are still primarily used [36,37,38,39,40]. Although good results have been obtained in many clinical trials using different targeted treatments for TNBC, there are no reported results from Phase III clinical trials, thus suggesting that the predominant treatment approach remains chemotherapy [41]. There are, however, two Phase III clinical trials using poly (ADP-ribose) polymerase (PARP) inhibitors for treatment of breast cancer patients that have yielded better results than chemotherapy. The first clinical trial (OlympiAD) focused on the use of Olaparib, and included patients with pathogenic BRCA1/2 mutations and HER2-negative, locally advanced or metastatic cancer who had been previously treated by chemotherapy [42]. The second clinical trial (EMBRACA) relied on the use of Talazoparib for breast cancer patients positive for BRCA1/2 pathogenic mutations with either locally advanced or metastatic cancers [43]. 
In 2018 and 2019, the FDA and EU, respectively, approved the use of Olaparib for treatment of breast cancer patients with the aforementioned conditions.

4.2. Genome-Wide Association Studies Identifying SNPs for TNBC

Most GWAS related to cancer have evaluated either relationships of SNPs to risk of disease [23,44,45,46] or associations of SNPs to disease-specific prognosis for patients [47,48,49]. As for breast cancer, there are two studies that have associated SNPs with the risk of breast cancer and with the outcome of patients [49,50]. A PubMed search retrieved a total of 41 articles published during the period of 2008 to 2018. These studies have evaluated associations of SNPs with outcomes of patients with TNBC, and SNPs associated with risks of developing TNBC (Table S1).

On the basis of the literature, the following genes are associated with patient prognosis: lymphocyte-specific protein 1 (LSP1), TOX high mobility group box family member 3 (TOX3), transition protein 1 (TNP1), carbohydrate sulfotransferase 9 (CHST9), aquaporin 4 (AQP4), BRCA1 DNA repair associated (BRCA1), and fibroblast growth factor receptor 2 (FGFR2) (Table S1). Moreover, the majority of SNPs associated with risk of breast cancer are present in the following genes: mitogen-activated protein kinase kinase kinase 1 (MAP3K1), RAD51 paralog B (RAD51L1), BRCA1, BRCA2, estrogen receptor 1 (ESR1), MDM5, transforming growth factor beta 1 (TGFB1), telomerase reverse transcriptase (TERT), carbohydrate sulfotransferase 9 (CHST9), or REV1 DNA directed polymerase (REV1). Although there is some overlap with BRCA1/2 genes, for the most part, these associated genes are distinct. Moreover, the majority of SNPs identified in TNBC studies are associated with the risk of developing this disease, while only six SNPs are associated with either survival or prognosis of TNBC.

The most common SNPs associated with TNBC are located within introns of protein-coding genes (Figure 2 and Table S1), thus confirming that intronic variants can significantly influence the final cellular functional units. This is owing to their influence over alternative splicing [51] and generation of modified non-coding RNAs [52]. In the first instance, the SNP effect will be confined to a single protein variant, which, in turn, can either disrupt or over-activate interacting proteins belonging to various signaling pathways. Thus, knowledge of all affected interactions as a consequence of an early SNP event in breast cancer development could offer better predictions of disease prognosis and effective targeting of a central causative SNP.

In the second instance, if an intronic variant generates either long non-coding RNAs or circular RNAs [52], with either gain-of-function or loss-of-function, it will influence the expression of multiple proteins. The weakness of the current approach of focusing on these genetic variants is that our knowledge is limited to the original protein-coding gene, without any information on potential non-coding RNAs generated from these genetic loci. As such, in addition to those four TNBC-associated SNPs located in non-coding genes, there are likely several intronic variants that should also be included.

The third most common TNBC-associated SNPs are located in exons, yielding missense transcripts (Figure 2 and Table S1). This observed change directly affects protein variants, as these variants can incur either a gain-of-function or loss-of-function alteration that can, in turn, either yield an ineffective tumor suppressor or generate an oncogene. The down-stream effects of these variants are dependent on established interactions and cellular processes in which they are involved. TGFB1 is a tumor suppressor that, upon acquisition of oncogenic mutations, begins to stimulate malignant cell invasion and metastasis through epithelial-to-mesenchymal transition [53]. The transcript variant (TGFB1):c.29C > T (p.Pro10Leu) is a risk factor for breast cancer (see Table S1). Caspase-8 (Casp-8) is another interesting case, with specific effects in TNBC. In addition to its involvement in extrinsic apoptosis, Casp-8 stimulates cell cycle progression, while it inhibits cell invasion and migration through modulation of multiple genes; thus, this caspase may play a more profound role in TNBC than in other malignant pathologies [54]. Interestingly, for the rs1045485 SNP located within exon 10 of Casp-8, it is the C allele that offers a protective role against breast cancer development [55].

The 3′UTR variants are the fourth most common SNP-generated alterations in TNBC. These variants are most likely characterized by altered expression of transcripts, resulting from either the creation of new miRNA binding sites or the deletion of existing ones [56]. For instance, in TNBC tumors carrying the KRAS variant rs61764370 (A > C), there is a decrease in let-7 miRNA level [57]. Furthermore, synonymous variants rank fifth in terms of types of SNP-generated variants in TNBC. Although these variants have been known for quite a while without any clinical significance being attributed to them, more recently they have been found to influence protein variations (through changes in splice site preferences) and expression (by creating new miRNA binding sites, yielding less stable mRNAs, and hindering translation through changes in codon choices) [58]. BABAM1 rs8170 (G > A) and ALS2CR12 rs17468277 (C > T) are two synonymous genetic variants associated with an increased risk of breast cancer. The BRCA1 frameshift variant rs80357794 (delC) is strongly correlated with hereditary ovarian cancer and aggressive forms of breast cancer, including TNBC (Table S1).

When analyzing the population frequency of alleles, it is important not to overlook the necessity of co-occurrence in order for a genetic variant to incur either a protective or a detrimental role. For example, LSP1 rs3817198 has more or less the same subpopulation frequency of T and C alleles in both European and African populations; however, the C allele increases the risk of breast cancer in a European population, while it is a protective factor in an African population [59,60]. This finding is most likely owing to associations with other variants in the European population, such as BRCA1 mutations. Homozygosity for the rs13387042 A allele is strongly correlated with breast cancer development and aggressive progression, and it is more frequent in an African population. Meanwhile, the rs1436904 risk allele G is more frequently detected in American, East Asian, and European populations. Furthermore, the SLC4A7 risk allele has a C variant that is far more common in an East Asian population in comparison with all other populations. Surprisingly, it is observed that the highest associations between this SLC4A7 allele and incidence of ER+ breast cancer are detected in Taiwanese, Chinese, and Korean populations. Meanwhile, rs67397200 is associated with ER- breast cancer in women of European heritage. Finally, rs1219648 is the highest risk SNP for the FGFR2 gene, and this allele has a high frequency in both European and African populations (Table S1).

We have used the GWAS catalogue from EMBL-EBI (The European Bioinformatics Institute) to search for SNPs associated with TNBC. A total of five SNPs have been identified, and these are presented in Figure 3.

Using the GWAS catalogue from EMBL-EBI, two SNPs (rs3803662 and rs8170) were selected, based on their identification in several studies, and patterns adjacent to these two SNPs were analyzed. For rs3803662, we observed that two additional SNPs were associated with breast cancer, and a single SNP was associated with both breast cancer and Parkinson’s disease. Meanwhile, for rs8170, we identified a single SNP that was associated with ovarian cancer; three SNPs that were associated with breast cancer; and a single SNP, rs2363956, that was associated with TNBC (Figure 3). It is important to point out that the significance of identifying these few SNPs lies in the fact that they are shared among different diseases.

On the basis of frequencies of different types of functional consequences generated by SNPs exhibited in TNBC, it is revealed that intronic variants are the most common functional consequence described in 42 different studies investigating SNPs in TNBC (data taken from frequencies determined from information presented in Table S1) (Figure 4).

Furthermore, the physical locations of each SNP associated with TNBC, previously described, along with each of the somatic chromosomes are presented in Figure 5. As can be noted, certain chromosomal locations, such as those of the q arm of chromosome 6 or the p arm of chromosome 19, are more susceptible to those SNPs detected in TNBC (Figure 5). Moreover, particular chromosomes, such as chromosome 5, have many more SNPs along various chromosomal locations when compared with other chromosomes, such as chromosomes 21 and 22 (Figure 5).

Interestingly, chromosomal locations of TNBC-related SNPs reveal that those presented in 19p13.11 and in 6q25.1-q25.2 (see Figure 5 and Table S1) are most likely in linkage disequilibrium. In 19p13.11, there are seven SNPs, including rs8170, rs2363956, rs8100241, rs4808611, rs3745185, rs61494113, and rs67397200. Meanwhile, in 6q25.1-q25.2, there are six SNPs, including rs2046210, rs3757318, rs3757322, rs12662670, rs3757318, and rs3020314.
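Whether two neighboring SNPs are in linkage disequilibrium is usually quantified with the statistics D, D′, and r². The following is a minimal sketch of these standard definitions for two biallelic loci; the haplotype and allele frequencies in the example are hypothetical and are not taken from the 19p13.11 or 6q25.1-q25.2 regions.

```python
def ld_stats(p_ab, p_a, p_b):
    """Return (D, D_prime, r2) for two biallelic loci.

    p_ab: frequency of the haplotype carrying allele A (locus 1) and B (locus 2)
    p_a, p_b: frequencies of allele A and allele B
    """
    d = p_ab - p_a * p_b
    # The maximum attainable |D| depends on the sign of D.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max if d_max > 0 else 0.0
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    r2 = d * d / denom if denom > 0 else 0.0
    return d, d_prime, r2

# Hypothetical frequencies, for illustration only.
d, d_prime, r2 = ld_stats(p_ab=0.3, p_a=0.4, p_b=0.5)
print(f"D={d:.3f}, D'={d_prime:.3f}, r2={r2:.3f}")
```

Values of D′ and r² close to 1 indicate that the two alleles are almost always inherited together, which is what would be expected for SNPs clustered in a single association signal.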

Among all SNPs associated with TNBC, only those SNPs with the most significant differences detected in subpopulation frequencies are presented in Figure 6. The divergent bar graphs demonstrate differences in the composition of allele frequencies exhibited within each of the five subpopulations: African, American, East-Asian, European, and South-Asian. This finding emphasizes the importance of knowledge of allele frequencies in subpopulations, especially when undertaking a GWAS.
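As an illustration of how such subpopulation differences are derived, the sketch below computes an allele frequency from genotype counts and reports the spread between populations. The genotype counts are hypothetical and the population labels are used only as dictionary keys; no values are taken from Table S1 or Figure 6.

```python
def allele_frequency(n_ref_hom, n_het, n_alt_hom):
    """Frequency of the alternate allele from diploid genotype counts."""
    n_alleles = 2 * (n_ref_hom + n_het + n_alt_hom)
    return (n_het + 2 * n_alt_hom) / n_alleles

# Hypothetical genotype counts (ref/ref, ref/alt, alt/alt) for a single SNP.
genotype_counts = {
    "African": (250, 200, 50),
    "European": (405, 90, 5),
    "East Asian": (80, 240, 180),
}

freqs = {pop: allele_frequency(*counts) for pop, counts in genotype_counts.items()}
highest = max(freqs, key=freqs.get)
lowest = min(freqs, key=freqs.get)
spread = freqs[highest] - freqs[lowest]
print(highest, lowest, round(spread, 3))
```

A SNP with a large spread would be a candidate for a plot such as Figure 6, because the same risk allele contributes very differently to the genetic background of each subpopulation.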

It has been reported that approximately 14% of all TNBC cases correspond to mutations in high- and moderate-penetrance breast cancer genes [61,62]. The clinical utility of GWAS in determining those high-penetrance breast cancer gene mutations was initially very high back in the 1990s, when BRCA1 and BRCA2 were first cloned. It is now widely accepted that there are no other high-penetrance genes accounting for relevant proportions of familial cases. Moreover, complex diseases are now better understood; that is, individual risk for breast cancer is not determined by a single genetic variant, but rather by interactions among environmental factors, patterns in polymorphisms, and multiple genetic variants [63].

Mutations in high-penetrance breast cancer genes, such as BRCA1 and BRCA2, are routinely tested for in clinical practice in many countries. This yields a positive feedback loop, facilitating risk estimation and implementation of cancer prevention strategies [64]. Therefore, there must be a shift, in both the value and direction of clinical research, towards moderate- and low-penetrance genes, coupled with testing for both of these penetrance types.

New treatments and effective management strategies are urgently needed for TNBC patients carrying germline mutations in genes other than BRCA1 and BRCA2, as these patients account for approximately 4% of all TNBC cases [62]. Large-scale routine clinical testing of a new generation of different susceptibility alleles must be undertaken, but this will most likely come with major technical and economic burdens [64]. Moreover, to accurately determine the overall risk of TNBC in patients, we propose that additional large-scale GWAS should be performed for ancestries other than European. A recent GWAS carried out by Mavaddat et al. [65] is the largest study to date, but its 94,075 cases and 75,017 controls are of European ancestry.

Additionally, with the advent of personalized medicine, screening programs will be adapted for a patient’s individual genetic risk. Examples of these forms of individualized patient care include decreasing age of initiation and increasing investigation intervals of mammography, as well as integration of magnetic resonance imaging and risk-reducing surgery [63]. For the highest clinical utility of these forms of screening, we should focus on an individual’s moderate-penetrance susceptibility genes; however, the utility of such an individualized genome-based approach is yet to be validated. Currently, there appears to be little justification for complete sequencing of moderate-penetrance genes, such as ATM, BRIP1, CHEK2, PALB2, and RAD50, in BRCA1/BRCA2-negative high-risk breast cancer families. This is because, in most populations, mutations in these genes are rare [63].

Therefore, despite enhanced protocols for the management and identification of patients carrying mutations for TNBC susceptibility genes, additional studies are required. Specifically, it is important to investigate and assess the clinical utility of low-penetrance genes/loci SNPs. The necessity of these additional studies has also been suggested by other researchers [65]. A highly useful tool for GWAS for investigating genetic variants is the availability of high-density SNP arrays. These arrays decode relevant variants with increased breast cancer risk in very large groups of both patients and controls. The main purpose of these arrays is to determine multiple polymorphisms that are predicted to synergistically impact an individual patient’s risk. However, it should be noted that most of these low-penetrance variants occur at such high frequencies that they can be detected in control cohorts despite their significant impact on breast cancer risk [63].

4.3. Importance of the Relationship between SNPs and miRNAs in BC

Triple-negative breast cancer is an aggressive form of malignancy with a high prevalence in the young female population. Although this disease has a strong genetic component, it is far less associated with environmental exposure than are double-positive breast cancer, lung cancer, colon cancer, cervical cancer, and others. Moreover, the majority of these mutations are germline mutations [65]. Thus far, many GWAS have generally disregarded associating these mutations with either the generation of new RNA species or the appearance of new regulatory networks as a result of these mutations. Among the wide variety of non-coding RNAs affected by SNPs are microRNAs and long non-coding RNAs (lncRNAs) [52,66,67]. We have elected to focus on microRNAs owing to the availability of a large amount of relevant data, as well as our laboratories’ experience with these RNA species.

MicroRNAs (miRNAs) are relevant to our discussion of SNPs, as they contribute to cancer risk [68]; moreover, SNPs can either create or remove miRNA binding sites [69]. Furthermore, there are limited studies on how SNPs in miRNA genes affect miRNA biogenesis and target selection [70]. This group of non-coding RNAs regulates gene expression by binding to mRNAs, silencing them by promoting their destabilization and degradation. These miRNAs regulate gene expression in many fundamental processes within a cell, such as tumor suppression [71]. Therefore, it comes as no surprise that different SNP variants significantly alter miRNA-mediated functions [70].

Altering miRNA functions has an effect on the regulation of gene expression. Moreover, this form of polymorphism influences gene function and contributes to variability in disease susceptibility and severity in patients [72]. There is increasing evidence that altered sequences at miRNA target sites contribute to cancer [68,72,73]. For example, the G allele of rs2910164 in miR-146a has been correlated with breast cancer risk (odds ratio (OR) = 1.77; 95% confidence interval (CI) = 1.40–2.23), as the predicted binding sites of this miRNA are in the 3′ untranslated regions of BRCA1 and BRCA2 [74]. It is reported that there is an increased risk of ER- breast cancer (OR = 2.09; 95% CI = 1.19–3.67) in patients carrying the A allele of SNP rs743554, which lies in a predicted miRNA binding site of ITGB4. However, it should be noted that the status of HER2 is not included in this study; therefore, associations with TNBC could not be determined [72]. Furthermore, two miRNA-associated SNPs are associated with the risk of ER- breast cancer in women of African ancestry: rs73991220 in miR-4725 (OR = 1.27, 95% CI = 1.09–1.48), which corresponds to an increased risk, and rs146287903 in PAPD4 (OR = 0.49, 95% CI = 0.33–0.72), for which an odds ratio below 1 corresponds to a reduced risk [75]. In addition, this GWAS has also found that rs72631820 in miR-339-3p is associated with increased risk in ER-positive breast cancers, while rs12355840 in miR-202 is associated with stage 1 breast cancer (OR = 0.78, p = 0.005) [75].
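For readers less familiar with the statistics quoted above, the following minimal sketch shows how an odds ratio and its Wald 95% confidence interval are obtained from a 2×2 table, and how an approximate z-score can be recovered from a published OR and CI. The 2×2 counts are hypothetical; the function names are illustrative; only the OR and CI values passed to `z_from_ci` are those quoted in the text, and the back-calculation assumes the interval is symmetric on the log scale.

```python
import math

def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and Wald confidence interval from a 2x2 table.

    a: carrier cases, b: carrier controls,
    c: non-carrier cases, d: non-carrier controls.
    """
    or_ = (a * d) / (b * c)
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)  # standard error of ln(OR)
    lo = math.exp(math.log(or_) - z * se)
    hi = math.exp(math.log(or_) + z * se)
    return or_, lo, hi

def z_from_ci(or_, lo, hi, z=1.96):
    """Approximate z-score implied by a reported OR and its 95% CI."""
    se = (math.log(hi) - math.log(lo)) / (2 * z)
    return math.log(or_) / se

# Hypothetical counts, for illustration only.
print(odds_ratio_ci(30, 20, 70, 80))

# Values quoted in the text: rs2910164 (OR > 1) and the PAPD4 SNP (OR < 1).
print(z_from_ci(1.77, 1.40, 2.23))
print(z_from_ci(0.49, 0.33, 0.72))
```

A positive z-score corresponds to an OR above 1 (higher odds of disease in carriers), whereas a negative z-score corresponds to an OR below 1 (lower odds in carriers).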

Polymorphisms in miRNAs have been linked to increased risks of several cancers, such as colon [76,77], breast [78,79], and bladder [80]. In addition, relationships of SNPs in miRNA networks with treatment responses and outcomes of different cancer patients have also been reported [81,82]. A tool has been developed that can correlate the effects of SNPs on miRNAs’ abilities to regulate gene expression [83]. This tool has been tested on different cancer types, including breast cancer, and it has been found that there are several miRNAs with high numbers of SNPs, such as hsa-mir-221 (13 SNPs), hsa-mir-141 (10 SNPs), and hsa-mir-64a (10 SNPs) [83]. In a case-control study, it is observed that rs72631823 in pre-miRNA-34a is associated with an increased risk of triple negative breast cancer [84]. In earlier studies with three GWAS of ER-negative breast cancer patients, including the TNBC group [85,86], it has been observed that rs4245739 in the 3′UTR of MDM4, creating a novel site for miR-191 [87], is specific to triple negative breast cancer patients [16]. This is further confirmed by yet another study wherein it is reported that another SNP, rs34091, also located in the 3′UTR of MDM4, is also associated with an increased risk of triple negative breast cancer [88]. According to MirSNP [89], a single polymorphism can affect the predicted miRNA site for either one or several miRNAs within a specific gene. All this information is summarized in Table 1.

Using the same GWAS concepts as in the case of general identification of SNPs, we can direct the observed miRNA-associated mutation frequencies towards four molecular consequences.

Firstly, modifications in miRNA promoter/enhancer regions can influence specific miRNA expression. As these forms of mutations have only been recently investigated, data are scarce. For instance, in lung cancer, the G > C SNP in −617 site and the A > G SNP in −604 site in the promoter region of miR-7 decrease the levels of expression of this miRNA, and lead to a worse prognosis of the disease [90].

Secondly, mutations in the primary miRNA processing proteins influence their function, and thus their processed miRNA availability. This was exemplified in a study of stage I and stage II breast cancer patients of African ancestry. Incidence of a higher frequency of the A allele in rs78393591, belonging to the Drosha gene, resulted in a higher risk for developing breast cancer [75].

Thirdly, mutations in miRNA sequences, especially in the miRNA seed region, would affect its binding preferences. For instance, a higher frequency of the C allele in the rs72631820 locus of miR-339-3p was associated with a higher risk of having ER- tumors in patients of African ancestry with stage I or stage II BC [75]. Meanwhile, the rs2910164 C > G in miR-146a shows an increased risk of developing BC in the Australian population [74].

Lastly, mutations in the 3′UTR of one or a few miRNA targets would indirectly influence the miRNA targeting profile of a diseased cell. For instance, women with GA or AA genotypes at the rs743554 locus have higher probabilities of developing ER- breast cancer and worse survival rates. Such a mutation in the 3′UTR of integrin β4 mRNA results in the loss of the ability of a tumor suppressor miRNA, miR-34a, to bind to ITGB4 [72]. It is proposed that this leads to enhanced integrin β4 levels, which promote tumor cell growth, survival, and invasion. This finding is supported by the observed poor survival of carriers of the variant allele. This is further supported by the fact that the miR-34 family, a direct transcriptional target of p53, down-regulates cell cycle progression genes [71]. An example of an SNP within a gene that generates a new miRNA binding site is rs4245739 (C minor allele) in the 3′UTR of the MDM4 gene, which provides a binding site for miR-191 and miR-877-3p. It should be noted that this gain of a miRNA binding site is primarily reported for small cell lung cancer and prostate cancer [69]. Although the examples provided by Moszynska et al. [70] do not include breast cancer, there is indeed an overlap, as the SNP rs4245739 is found to be correlated with increased risk of breast cancer development [69,88,91]. Moreover, MDM4 is an oncoprotein that negatively regulates the p53 tumor suppressor protein, and its overexpression can lead to cancer progression [92].
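The gain or loss of a miRNA binding site described above can be illustrated with a simple seed-match check. This is a minimal sketch that assumes canonical targeting through perfect complementarity to miRNA nucleotides 2–8; dedicated resources apply more elaborate scoring. All sequences below are invented for illustration and do not correspond to miR-34a, miR-191, ITGB4, or MDM4.

```python
# Watson-Crick complement for RNA.
COMPLEMENT = str.maketrans("ACGU", "UGCA")

def seed_site(mirna):
    """Sequence expected in the mRNA (5'->3') opposite miRNA nucleotides 2-8."""
    seed = mirna[1:8]  # 0-based slice covering positions 2..8
    return seed.translate(COMPLEMENT)[::-1]

def classify_snp(ref_utr, alt_utr, mirna):
    """Report whether the variant allele creates, disrupts, or keeps a seed match."""
    site = seed_site(mirna)
    in_ref, in_alt = site in ref_utr, site in alt_utr
    if in_ref and not in_alt:
        return "site disrupted"
    if in_alt and not in_ref:
        return "site created"
    return "unchanged"

# Hypothetical sequences, for illustration only.
mirna = "UACGGAUCCAUGCAUGCAAGCU"
ref_utr = "AAGCUGAUCCGUUUACG"  # reference allele: contains the seed match
alt_utr = "AAGCUGAUCUGUUUACG"  # variant allele: one base changed inside the site

print(classify_snp(ref_utr, alt_utr, mirna))
print(classify_snp(alt_utr, ref_utr, mirna))
```

The first call mirrors a loss-of-site variant of the kind described for rs743554, while the second, with the alleles swapped, mirrors a gain-of-site variant of the kind described for rs4245739.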

In recent years, miRNAs, small non-coding RNAs that can regulate gene expression, have garnered interest owing to their capacity to act as biomarkers for both diagnosis and prognosis of diseases [93]. Interestingly, the effects of SNPs in miRNA target sites and their interactions with disease have been extensively studied. SNPs can cause modifications in amino acids, changes in mRNA transcript stabilization, and shifts in the binding affinities of transcription factors [94]. Subsequently, several resources covering the effects of SNPs on miRNA regulation of different genes have been developed, such as Patrocles [95], dbSMR [96], MirSNP [89], and PolymiRTS [97]. In addition to all of the above-mentioned examples of studies specifically focusing on miRNA binding sites, processing genes, or miRNA mutations, there is also an abundance of sequencing data that most likely contain information about miRNA-related mutations and their associations with various forms of cancer, as well as specifically with receptor-negative breast cancer.

Future studies should be more focused on mutations found in intronic regions, as these might be the origins of non-coding RNA species other than those of major protein-coding mRNA. This would have a major impact on how we interpret findings from future GWAS, particularly when considering that a single miRNA mutation has a major downstream effect owing to the ability of miRNAs to interact with hundreds of mRNAs at a time.

5. Risk Factor Scores for Prediction of Breast Cancer

The association of disease with particular alleles remains unclear. Among the large number of published GWAS identifying disease susceptibility loci, several are specifically related to breast cancer [7,98,99,100,101,102,103,104,105]. Such susceptibility loci indicate a relative increase in disease risk at a given locus, albeit not necessarily the particular locus involved in the expression of the disease itself. This hypothesis has contributed to an interesting research paradox. If we are to assume that the disease locus is necessary for disease expression, there is little to no evidence to support such a finding. Conversely, if we are to detect an association between a disease and an allele, there is little supportive evidence beyond such an association. Unfortunately, the association itself is not sufficient for any clinical value. For these associations to be useful to a clinician or a patient, they must be incorporated into predictive analytical models. One such predictive model is the predictive risk score. In this model, it is proposed that the more factors, such as environment, that are taken into consideration and accounted for by the predictive risk scores, the better the prediction and the higher the clinical value.

The following section will discuss predictive models used in some GWAS related to breast cancer, although not specifically related to TNBC, but still correlated to receptors of specific subtypes.

In the general population, it is only a small proportion of breast cancer patients who actually exhibit rare mutations in certain genes, such as BRCA1 and BRCA2 genes, that confer the highest risks of developing breast cancer. GWAS have discovered multiple common breast cancer susceptibility variants, which individually present small risks, but their combined effects can be substantial. One method to quantify the combined effects of common variants is to use polygenic risk scores, as these genomic profiles permit stratification of women based on their risks of developing breast cancer [106]. Currently, GWAS have identified 170 breast cancer susceptibility loci [107,108]. According to genome-wide heritability estimates, these loci account for only 40% of the heritability exhibited by all common variants on a genome-wide SNP array. This indirectly suggests that discrimination by the predictive risk score (PRS) could be improved by including more variants, and thereby widening the significance threshold. Additionally, these variants must differ based on breast cancer subtypes; for example, estrogen receptor (ER)-positive versus ER-negative. Therefore, these subtype-specific PRSs may allow for improved prediction of the disease, as well as allow for the selection of women for preventative treatments, which would be extremely beneficial for more aggressive breast cancer subtypes, such as TNBC.
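To make the idea of combining many small effects concrete, the sketch below computes a risk score in a commonly used additive form: the sum, over SNPs, of the number of risk alleles carried multiplied by the logarithm of the per-allele odds ratio. The SNP identifiers and odds ratios are hypothetical, and this additive log-OR form is an assumption made for illustration rather than a description of any specific published score.

```python
import math

def polygenic_risk_score(dosages, per_allele_or):
    """Sum of risk-allele dosages (0, 1, or 2) weighted by ln(per-allele OR)."""
    return sum(math.log(per_allele_or[snp]) * dose for snp, dose in dosages.items())

# Hypothetical per-allele odds ratios and one individual's genotypes.
per_allele_or = {"snp1": 1.10, "snp2": 1.25, "snp3": 0.90}
dosages = {"snp1": 2, "snp2": 1, "snp3": 0}

score = polygenic_risk_score(dosages, per_allele_or)
# exp(score) is the combined odds ratio relative to carrying no risk alleles,
# under the assumption that the SNP effects multiply independently.
print(round(score, 4), round(math.exp(score), 4))
```

Each individual SNP in this example changes the odds by 25% or less, yet the combined odds ratio is larger than any single-SNP effect, which is the rationale for scoring variants jointly.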

Mavaddat et al. [3] developed PRSs based on 77 established breast cancer susceptibility SNPs in a study of 33,673 breast cancer cases combined with 33,381 control women of European origin. They found that women in the highest 1% of the PRS had a threefold increased risk of developing breast cancer compared with women in the middle quintile (OR = 3.36, 95% CI = 2.95–3.83). In addition, they reported that ORs for ER-positive and ER-negative disease were 3.73 (95% CI = 3.24–4.30) and 2.80 (95% CI = 2.26–3.46), respectively. Moreover, they found that lifetime risk of breast cancer for women in the lowest and highest quintiles of the PRS was 5.2% and 16.6%, respectively, for women without family history, and 8.6% and 24.4%, respectively, for women with a first-degree family history of breast cancer.
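Comparisons such as "highest versus middle quintile" require each woman to be assigned to a score category defined in a reference sample. A minimal sketch of that assignment follows; the reference scores are synthetic, evenly spaced values chosen only so that the result is easy to check, and the function names are illustrative.

```python
import bisect
import statistics

def quintile_cuts(reference_scores):
    """Four cut points dividing the reference scores into five equal-sized groups."""
    return statistics.quantiles(reference_scores, n=5)

def quintile_index(score, cuts):
    """Return 0 (lowest quintile) to 4 (highest quintile)."""
    return bisect.bisect_right(cuts, score)

# Synthetic reference scores, for illustration only.
reference = [i / 100 for i in range(1, 101)]
cuts = quintile_cuts(reference)
groups = [quintile_index(s, cuts) for s in reference]
print([groups.count(k) for k in range(5)])
```

Odds ratios or lifetime risks can then be estimated within each group and compared against the middle quintile (index 2), as in the study summarized above.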

These findings highlight the potential for combining PRS and other known risk factors for risk stratification. Moreover, the risk strata defined by PRS have allowed for the evaluation of risk reduction strategies. There are various other studies that have used similar approaches by combining PRS with environmental, modifiable, or non-modifiable risk factors. For example, Garcia-Closas et al. [16] have used breast cancer as a representative model to demonstrate that genetic information, when combined with other factors, provides layered levels of risk stratification that could facilitate personal decision-making or population-based prevention programs. The breast cancer model is uniquely advantageous because it already has established modifiable risk factors, such as menopausal hormone therapy, options for chemoprevention (e.g., endocrine therapy prevents estrogen receptor positive breast cancer), and screening strategies for early detection. As mentioned above, methods for detecting low-frequency causative alleles are required. Maas et al. [109] have attempted to address this by evaluating combined risk stratification utility of common low-penetrant (small effect size) SNPs and epidemiological risk factors. A total of 17,171 cases and 19,862 controls sampled from the Breast and Prostate Cancer Cohort Consortium (BPC3), along with 5879 women participating in a 2010 National Health Interview Survey, have been taken into consideration and studied. This model is used to map the distribution of an absolute risk for the population of Caucasian women in the United States after adjusting for competing causes of mortality. The degree of stratification of absolute risk can be attributed to the following: non-modifiable factors, including SNPs, family history, and components of menstrual and reproductive history; along with modifiable factors, including body mass index (BMI), menopausal hormone therapy, alcohol consumption, and smoking. 
This suggests that this model permits the identification of population subsets with increased risks that would benefit from altering modifiable factors using risk-reduction strategies. One interesting finding is that women in the highest decile of risk attributable to non-modifiable factors who have a low BMI, do not smoke or drink, and do not use menopausal hormone therapy (MHT) appear to have risks comparable to those of an average woman in the general population [109].

Rudolph et al. [107] specifically investigated how combining PRS with environmental risk factors would improve risk prediction; however, integrating PRS into risk prediction models entailed the evaluation of their joint associations with known environmental factors. Specifically, joint associations of the 77-SNP PRS described by Mavaddat et al. [3] with reproductive history, alcohol consumption, menopausal hormone therapy (MHT), height, and BMI were evaluated. These analyses were based on datasets from 20 studies consisting of up to 23,104 invasive breast cancer cases along with similar numbers of controls. Both global and tail-based goodness-of-fit tests in logistic regression models were performed with outcomes expressed for overall breast cancer, as well as for ER status. The strongest evidence for non-multiplicative joint associations with the 77-SNP PRS was obtained for alcohol consumption (p-interaction = 0.009), adult height (p-interaction = 0.025), and combined MHT use (p-interaction = 0.038), specific to ER-positive disease [107].
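The notion of a non-multiplicative joint association can be illustrated with a simple ratio: under a multiplicative model, the odds ratio for women with both a high PRS and an exposure equals the product of the two separate odds ratios. The sketch below computes this ratio from stratified case and control counts. The counts are hypothetical, and this descriptive ratio is a simplification for illustration, not the goodness-of-fit testing used in the study described above.

```python
def odds(cases, controls):
    return cases / controls

def interaction_ratio(table):
    """Ratio of the joint OR to the product of the separate ORs.

    table maps (high_prs, exposed) -> (cases, controls), with (0, 0) as reference.
    A value of 1 is expected under a purely multiplicative model.
    """
    ref = odds(*table[(0, 0)])
    or_prs = odds(*table[(1, 0)]) / ref
    or_exposure = odds(*table[(0, 1)]) / ref
    or_joint = odds(*table[(1, 1)]) / ref
    return or_joint / (or_prs * or_exposure)

# Hypothetical counts, for illustration only.
multiplicative = {(0, 0): (100, 200), (1, 0): (150, 150),
                  (0, 1): (120, 160), (1, 1): (180, 120)}
supra_multiplicative = dict(multiplicative)
supra_multiplicative[(1, 1)] = (180, 60)

print(interaction_ratio(multiplicative))
print(interaction_ratio(supra_multiplicative))
```

In practice, such a departure would be tested formally with an interaction term in a logistic regression model rather than read off a single ratio.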

More recently, Mavaddat et al. [65] have extended their earlier investigations by generating PRS from the largest accessible genome-wide association datasets, optimized for predicting ER-specific disease, and empirically validating these PRS in prospective studies. It is reported that PRS, and ER-specific PRS in particular, can improve stratification for screening. Therefore, it could inform treating physicians in targeting the preventive use of endocrine therapies. Clinical translational studies within the framework of present screening protocols are required to assess the risks and benefits of including PRS.

6. Conclusions

Even though GWAS are not disease-specific, they provide important and new knowledge for both research and clinical studies. GWAS catalogues or databases offer valuable SNP libraries that assist in gaining a better understanding of a target human disease in correlation with other diseases, and also facilitate the development of new research strategies that could aid clinicians. Although most GWAS focus on European populations, these methodologies could easily be transferred to studies of other population subtypes.

As for TNBC patients, there are large numbers of identified SNPs that correlate with cancer risk, but very few SNPs that correlate with either survival or prognosis. In addition, developing a better understanding of how these alterations relate to gene and protein expression in TNBC will help in translating these findings into specific tests that could aid clinicians either in screening patients at risk for TNBC or in identifying new drugs that could improve patient outcomes by using personalized medicine.

The use of high-throughput technologies would bring significant advantages to GWAS and databases, as these would correlate SNPs with other biologically relevant data, and subsequently translate this complex information to patient care. Furthermore, GWAS can be applied to TNBC by correlating identified SNPs with miRNA sites in order to determine their influence on either prognosis or progression of TNBC in patients. Additionally, a functional consequence of miRNA-associated polymorphisms is variability in disease susceptibility. Furthermore, PRS can potentially improve stratification for screening, especially when combining family history, other risk factors, and risk prediction models.

In conclusion, interpretations of GWAS findings remain challenging; however, by identifying and using SNPs specific for TNBC, we may be able to elucidate and better contextualize these types of studies. Nevertheless, further studies should be undertaken on various other population subgroups, along with analysis of general or healthy populations, in order to overcome the limitations of our current knowledge.

Supplementary Materials

The following are available online at https://www.mdpi.com/1422-0067/21/16/5835/s1. Table S1: SNPs associated with triple negative breast cancer risk, prognosis, and survival, including subpopulation frequencies.

Author Contributions

M.-A.J. conceived the content of the review and wrote the article, while L.-A.P. was involved in the focus of this article and contributed to part of this review, as well as to reviewing and writing the article. M.B. carefully checked the literature review and contributed to the writing, formatting, and editing of the article, while A.-A.Z. prepared all tables and figures. A.P. and S.S.K. were involved in editing and modifying some of the content. I.B.-N. was responsible for coordination of the topic, analysis of data, and interpretation. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Funding Agency—Ministry of Research and Innovation, through the project PNCDI III 2015–2020 “Increasing the performance of scientific research and technology transfer in translational medicine through the formation of a new generation of young researchers”; ECHITAS, by the Executive Unit for Financing Higher Education, Research, Development, and Innovation (UEFISCDI), Romania, project title: “Addressing the complex exposome profile in hormone-dependent cancers of the breast and prostate and its influence on tumoral genome”; ACHILLE, grant ID: PN-III-P4-ID-PCE-2016-0795; no. 164/2017; and POSCCE 709/2010 grant with the title, “Clinical and economical impact of proteome and transcriptome molecular profiling in neoadjuvant therapy of triple negative breast cancer (BREASTIMPACT)”. In addition, Maria-Ancuta Jurj was awarded a scholarship from the European Social Fund, Human Capital Operational Programme 2014-2020, project no. POCU/380/6/13/125171.

Conflicts of Interest

The authors declare no conflict of interest.

References

Genomes Project Consortium; Auton, A.; Brooks, L.D.; Durbin, R.M.; Garrison, E.P.; Kang, H.M.; Korbel, J.O.; Marchini, J.L.; McCarthy, S.; McVean, G.A.; et al. A global reference for human genetic variation. Nature 2015, 526, 68–74.
Bush, W.S.; Moore, J.H. Chapter 11: Genome-wide association studies. PLoS Comput. Biol. 2012, 8, e1002822.
Mavaddat, N.; Pharoah, P.D.; Michailidou, K.; Tyrer, J.; Brook, M.N.; Bolla, M.K.; Wang, Q.; Dennis, J.; Dunning, A.M.; Shah, M.; et al. Prediction of breast cancer risk based on profiling with common genetic variants. J. Natl. Cancer Inst. 2015, 107.
Kumar, S.; Banks, T.W.; Cloutier, S. SNP discovery through next-generation sequencing and its applications. Int. J. Plant Genom. 2012, 2012, 831460.
Slatkin, M. Linkage disequilibrium—Understanding the evolutionary past and mapping the medical future. Nat. Rev. Genet. 2008, 9, 477–485.
Sachs, T. Epigenetic selection: An alternative mechanism of pattern formation. J. Theor. Biol. 1988, 134, 547–559.
Easton, D.F.; Pooley, K.A.; Dunning, A.M.; Pharoah, P.D.; Thompson, D.; Ballinger, D.G.; Struewing, J.P.; Morrison, J.; Field, H.; Luben, R.; et al. Genome-wide association study identifies novel breast cancer susceptibility loci. Nature 2007, 447, 1087–1093.
Reich, D.E.; Lander, E.S. On the allelic spectrum of human disease. Trends Genet. TIG 2001, 17, 502–510.
McClellan, J.; King, M.C. Genetic heterogeneity in human disease. Cell 2010, 141, 210–217.
Wang, K.; Bucan, M.; Grant, S.F.; Schellenberg, G.; Hakonarson, H. Strategies for genetic studies of complex diseases. Cell 2010, 142, 351–353; author reply 353–355.
Klein, R.J.; Xu, X.; Mukherjee, S.; Willis, J.; Hayes, J. Successes of genome-wide association studies. Cell 2010, 142, 350–351; author reply 353–355.
Zeggini, E.; Ioannidis, J.P. Meta-analysis in genome-wide association studies. Pharmacogenomics 2009, 10, 191–201.
Panagiotou, O.A.; Willer, C.J.; Hirschhorn, J.N.; Ioannidis, J.P. The power of meta-analysis in genome-wide association studies. Annu. Rev. Genom. Hum. Genet. 2013, 14, 441–465.
Dimou, N.L.; Tsirigos, K.D.; Elofsson, A.; Bagos, P.G. GWAR: Robust analysis and meta-analysis of genome-wide association studies. Bioinformatics (Oxford, England) 2017, 33, 1521–1527.
Evangelou, E.; Ioannidis, J.P. Meta-analysis methods for genome-wide association studies and beyond. Nat. Rev. Genet. 2013, 14, 379–389.
Garcia-Closas, M.; Couch, F.J.; Lindstrom, S.; Michailidou, K.; Schmidt, M.K.; Brook, M.N.; Orr, N.; Rhie, S.K.; Riboli, E.; Feigelson, H.S.; et al. Genome-wide association studies identify four ER negative-specific breast cancer risk loci. Nat. Genet. 2013, 45, 392–398, 398e1–398e2.
Eeles, R.A.; Olama, A.A.; Benlloch, S.; Saunders, E.J.; Leongamornlert, D.A.; Tymrakiewicz, M.; Ghoussaini, M.; Luccarini, C.; Dennis, J.; Jugurnauth-Little, S.; et al. Identification of 23 new prostate cancer susceptibility loci using the iCOGS custom genotyping array. Nat. Genet. 2013, 45, 385–391, 391e1–391e2.
Pharoah, P.D.; Tsai, Y.Y.; Ramus, S.J.; Phelan, C.M.; Goode, E.L.; Lawrenson, K.; Buckley, M.; Fridley, B.L.; Tyrer, J.P.; Shen, H.; et al. GWAS meta-analysis and replication identifies three new susceptibility loci for ovarian cancer. Nat. Genet. 2013, 45, 362–370, 370e1–370e2.
Michailidou, K.; Hall, P.; Gonzalez-Neira, A.; Ghoussaini, M.; Dennis, J.; Milne, R.L.; Schmidt, M.K.; Chang-Claude, J.; Bojesen, S.E.; Bolla, M.K.; et al. Large-scale genotyping identifies 41 new loci associated with breast cancer risk. Nat. Genet. 2013, 45, 353–361, 361e1–361e2.
Consortium, T.B.C.A. BCAC. University of Cambridge: Cambridge, UK. Available online: http://bcac.ccge.medschl.cam.ac.uk/ (accessed on 10 July 2020).
Brigham, M.G. ISC. Mass General Brigham: Boston, MA, USA. Available online: https://www.massgeneral.org/ (accessed on 10 July 2020).
MAGIC Consortium. Sanger Institute: Cambridge, UK. Available online: https://www.magicinvestigators.org/ (accessed on 10 July 2020).
Sud, A.; Kinnersley, B.; Houlston, R.S. Genome-wide association studies of cancer: Current insights and future perspectives. Nat. Rev. Cancer 2017, 17, 692–704.
Globocan Breast Worldwide. Available online: https://gco.iarc.fr/ (accessed on 10 July 2020).
Globocan Breast Romania. Available online: https://gco.iarc.fr/ (accessed on 10 July 2020).
Li, C.I.; Uribe, D.J.; Daling, J.R. Clinical characteristics of different histologic types of breast cancer. Br. J. Cancer 2005, 93, 1046–1052.
Connolly, J.K.R.; LiVolsi, V.; Page, D.; Patchefsky, A.; Silverberg, S. Recommendations for the reporting of breast carcinoma. Association of Directors of Anatomic and Surgical Pathology. Am. J. Clin. Pathol. 1995, 104, 614–619.
Lester, S.C.; Bose, S.; Chen, Y.Y.; Connolly, J.L.; de Baca, M.E.; Fitzgibbons, P.L.; Hayes, D.F.; Kleer, C.; O’Malley, F.P.; Page, D.L.; et al. Protocol for the examination of specimens from patients with invasive carcinoma of the breast. Arch. Pathol. Lab. Med. 2009, 133, 1515–1538.
Bustos, M.A.; Salomon, M.P.; Nelson, N.; Hsu, S.C.; DiNome, M.L.; Hoon, D.S.; Marzese, D.M. Genome-wide chromatin accessibility, DNA methylation and gene expression analysis of histone deacetylase inhibition in triple-negative breast cancer. Genom. Data 2017, 12, 14–16.
Sorlie, T.; Perou, C.M.; Tibshirani, R.; Aas, T.; Geisler, S.; Johnsen, H.; Hastie, T.; Eisen, M.B.; van de Rijn, M.; Jeffrey, S.S.; et al. Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications. Proc. Natl. Acad. Sci. USA 2001, 98, 10869–10874.
Sorlie, T.; Tibshirani, R.; Parker, J.; Hastie, T.; Marron, J.S.; Nobel, A.; Deng, S.; Johnsen, H.; Pesich, R.; Geisler, S.; et al. Repeated observation of breast tumor subtypes in independent gene expression data sets. Proc. Natl. Acad. Sci. USA 2003, 100, 8418–8423.
Perou, C.M.; Sorlie, T.; Eisen, M.B.; van de Rijn, M.; Jeffrey, S.S.; Rees, C.A.; Pollack, J.R.; Ross, D.T.; Johnsen, H.; Akslen, L.A.; et al. Molecular portraits of human breast tumours. Nature 2000, 406, 747–752.
Gluz, O.; Liedtke, C.; Gottschalk, N.; Pusztai, L.; Nitz, U.; Harbeck, N. Triple-negative breast cancer—Current status and future directions. Ann. Oncol. Off. J. Eur. Soc. Med. Oncol. 2009, 20, 1913–1927.
Braicu, C.; Chiorean, R.; Irimie, A.; Chira, S.; Tomuleasa, C.; Neagoe, E.; Paradiso, A.; Achimas-Cadariu, P.; Lazar, V.; Berindan-Neagoe, I. Novel insight into triple-negative breast cancers, the emerging role of angiogenesis, and antiangiogenic therapy. Expert Rev. Mol. Med. 2016, 18, e18.
Pop, L.A.; Cojocneanu-Petric, R.M.; Pileczki, V.; Morar-Bolba, G.; Irimie, A.; Lazar, V.; Lombardo, C.; Paradiso, A.; Berindan-Neagoe, I. Genetic alterations in sporadic triple negative breast cancer. Breast 2018, 38, 30–38.
Carey, L.; Winer, E.; Viale, G.; Cameron, D.; Gianni, L. Triple-negative breast cancer: Disease entity or title of convenience? Nat. Rev. Clin. Oncol. 2010, 7, 683–692.
Dent, R.; Trudeau, M.; Pritchard, K.I.; Hanna, W.M.; Kahn, H.K.; Sawka, C.A.; Lickley, L.A.; Rawlinson, E.; Sun, P.; Narod, S.A. Triple-negative breast cancer: Clinical features and patterns of recurrence. Clin. Cancer Res. Off. J. Am. Assoc. Cancer Res. 2007, 13, 4429–4434.
Liedtke, C.; Mazouni, C.; Hess, K.R.; Andre, F.; Tordai, A.; Mejia, J.A.; Symmans, W.F.; Gonzalez-Angulo, A.M.; Hennessy, B.; Green, M.; et al. Response to neoadjuvant therapy and long-term survival in patients with triple-negative breast cancer. J. Clin. Oncol. Off. J. Am. Soc. Clin. Oncol. 2008, 26, 1275–1281.
Foulkes, W.D.; Smith, I.E.; Reis-Filho, J.S. Triple-negative breast cancer. N. Engl. J. Med. 2010, 363, 1938–1948. [Google Scholar] [CrossRef]
Balacescu, O.; Balacescu, L.; Virtic, O.; Visan, S.; Gherman, C.; Drigla, F.; Pop, L.; Bolba-Morar, G.; Lisencu, C.; Fetica, B.; et al. Blood genome-wide transcriptional profiles of HER2 negative breast cancers patients. Mediat. Inflamm. 2016, 2016, 3239167. [Google Scholar] [CrossRef]
Denkert, C.; Liedtke, C.; Tutt, A.; von Minckwitz, G. Molecular alterations in triple-negative breast cancer-the road to new treatment strategies. Lancet (London, England) 2017, 389, 2430–2442. [Google Scholar] [CrossRef]
Robson, M.E.; Tung, N.; Conte, P.; Im, S.A.; Senkus, E.; Xu, B.; Masuda, N.; Delaloge, S.; Li, W.; Armstrong, A.; et al. OlympiAD final overall survival and tolerability results: Olaparib versus chemotherapy treatment of physician’s choice in patients with a germline BRCA mutation and HER2-negative metastatic breast cancer. Ann. Oncol. Off. J. Eur. Soc. Med. Oncol. 2019, 30, 558–566. [Google Scholar] [CrossRef]
Litton, J.K.; Rugo, H.S.; Ettl, J.; Hurvitz, S.A.; Goncalves, A.; Lee, K.H.; Fehrenbacher, L.; Yerushalmi, R.; Mina, L.A.; Martin, M.; et al. Talazoparib in patients with advanced breast cancer and a germline BRCA mutation. N. Engl. J. Med. 2018, 379, 753–763. [Google Scholar] [CrossRef]
Hoffman, J.; Fejerman, L.; Hu, D.; Huntsman, S.; Li, M.; John, E.M.; Torres-Mejia, G.; Kushi, L.; Ding, Y.C.; Weitzel, J.; et al. Identification of novel common breast cancer risk variants at the 6q25 locus among Latinas. Breast Cancer Res. BCR 2019, 21, 3. [Google Scholar] [CrossRef]
Mhatre, S.; Wang, Z.; Nagrani, R.; Badwe, R.; Chiplunkar, S.; Mittal, B.; Yadav, S.; Zhang, H.; Chung, C.C.; Patil, P.; et al. Common genetic variation and risk of gallbladder cancer in India: A case-control genome-wide association study. Lancet Oncol. 2017, 18, 535–544. [Google Scholar] [CrossRef]
Nagrani, R.; Mhatre, S.; Rajaraman, P.; Chatterjee, N.; Akbari, M.R.; Boffetta, P.; Brennan, P.; Badwe, R.; Gupta, S.; Dikshit, R. Association of Genome-Wide Association Study (GWAS) identified SNPs and risk of breast cancer in an indian population. Sci. Rep. 2017, 7, 40963. [Google Scholar] [CrossRef]
Swierniak, M.; Wojcicka, A.; Czetwertynska, M.; Dlugosinska, J.; Stachlewska, E.; Gierlikowski, W.; Kot, A.; Gornicka, B.; Koperski, L.; Bogdanska, M.; et al. Association between GWAS-derived rs966423 genetic variant and overall mortality in patients with differentiated thyroid cancer. Clin. Cancer Res. Off. J. Am. Assoc. Cancer Res. 2016, 22, 1111–1119. [Google Scholar] [CrossRef]
Kang, B.W.; Jeon, H.S.; Chae, Y.S.; Lee, S.J.; Park, J.Y.; Choi, J.E.; Park, J.S.; Choi, G.S.; Kim, J.G. Association between GWAS-identified genetic variations and disease prognosis for patients with colorectal cancer. PLoS ONE 2015, 10, e0119649. [Google Scholar] [CrossRef]
Barrdahl, M.; Canzian, F.; Lindstrom, S.; Shui, I.; Black, A.; Hoover, R.N.; Ziegler, R.G.; Buring, J.E.; Chanock, S.J.; Diver, W.R.; et al. Association of breast cancer risk loci with breast cancer survival. Int. J. Cancer 2015, 137, 2837–2845. [Google Scholar] [CrossRef]
Bayraktar, S.; Thompson, P.A.; Yoo, S.Y.; Do, K.A.; Sahin, A.A.; Arun, B.K.; Bondy, M.L.; Brewster, A.M. The relationship between eight GWAS-identified single-nucleotide polymorphisms and primary breast cancer outcomes. Oncologist 2013, 18, 493–500. [Google Scholar] [CrossRef]
Pagani, F.; Baralle, F.E. Genomic variants in exons and introns: Identifying the splicing spoilers. Nat. Rev. Genet. 2004, 5, 389–396. [Google Scholar] [CrossRef]
Giral, H.; Landmesser, U.; Kratzer, A. Into the wild: GWAS exploration of non-coding RNAs. Front. Cardiovasc. Med. 2018, 5, 181. [Google Scholar] [CrossRef]
Barcellos-Hoff, M.H.; Akhurst, R.J. Transforming growth factor-beta in breast cancer: Too much, too late. Breast Cancer Res. BCR 2009, 11, 202. [Google Scholar] [CrossRef]
De Blasio, A.; Di Fiore, R.; Morreale, M.; Carlisi, D.; Drago-Ferrante, R.; Montalbano, M.; Scerri, C.; Tesoriere, G.; Vento, R. Unusual roles of caspase-8 in triple-negative breast cancer cell line MDA-MB-231. Int. J. Oncol. 2016, 48, 2339–2348. [Google Scholar] [CrossRef]
Cox, A.; Dunning, A.M.; Garcia-Closas, M.; Balasubramanian, S.; Reed, M.W.; Pooley, K.A.; Scollen, S.; Baynes, C.; Ponder, B.A.; Chanock, S.; et al. A common coding variant in CASP8 is associated with breast cancer risk. Nat. Genet. 2007, 39, 352–358. [Google Scholar] [CrossRef]
Skeeles, L.E.; Fleming, J.L.; Mahler, K.L.; Toland, A.E. The impact of 3’UTR variants on differential expression of candidate cancer susceptibility genes. PLoS ONE 2013, 8, e58609. [Google Scholar] [CrossRef]
Paranjape, T.; Heneghan, H.; Lindner, R.; Keane, F.K.; Hoffman, A.; Hollestelle, A.; Dorairaj, J.; Geyda, K.; Pelletier, C.; Nallur, S.; et al. A 3′-untranslated region KRAS variant and triple-negative breast cancer: A case-control and genetic analysis. Lancet Oncol. 2011, 12, 377–386. [Google Scholar] [CrossRef]
Hunt, R.C.; Simhadri, V.L.; Iandoli, M.; Sauna, Z.E.; Kimchi-Sarfaty, C. Exposing synonymous mutations. Trends Genet. TIG 2014, 30, 308–321. [Google Scholar] [CrossRef]
Antoniou, A.C.; Sinilnikova, O.M.; McGuffog, L.; Healey, S.; Nevanlinna, H.; Heikkinen, T.; Simard, J.; Spurdle, A.B.; Beesley, J.; Chen, X.; et al. Common variants in LSP1, 2q35 and 8q24 and breast cancer risk for BRCA1 and BRCA2 mutation carriers. Hum. Mol. Genet. 2009, 18, 4442–4456. [Google Scholar] [CrossRef]
Huo, D.; Zheng, Y.; Ogundiran, T.O.; Adebamowo, C.; Nathanson, K.L.; Domchek, S.M.; Rebbeck, T.R.; Simon, M.S.; John, E.M.; Hennis, A.; et al. Evaluation of 19 susceptibility loci of breast cancer in women of African ancestry. Carcinogenesis 2012, 33, 835–840. [Google Scholar] [CrossRef]
Buys, S.S.; Sandbach, J.F.; Gammon, A.; Patel, G.; Kidd, J.; Brown, K.L.; Sharma, L.; Saam, J.; Lancaster, J.; Daly, M.B. A study of over 35,000 women with breast cancer tested with a 25-gene panel of hereditary cancer genes. Cancer 2017, 123, 1721–1730. [Google Scholar] [CrossRef]
Couch, F.J.; Hart, S.N.; Sharma, P.; Toland, A.E.; Wang, X.; Miron, P.; Olson, J.E.; Godwin, A.K.; Pankratz, V.S.; Olswold, C.; et al. Inherited mutations in 17 breast cancer susceptibility genes among a large triple-negative breast cancer cohort unselected for family history of breast cancer. J. Clin. Oncol. Off. J. Am. Soc. Clin. Oncol. 2015, 33, 304–311. [Google Scholar] [CrossRef]
Ripperger, T.; Gadzicki, D.; Meindl, A.; Schlegelberger, B. Breast cancer susceptibility: Current knowledge and implications for genetic counselling. Eur. J. Hum. Genet. 2009, 17, 722–731. [Google Scholar] [CrossRef]
Stratton, M.R.; Rahman, N. The emerging landscape of breast cancer susceptibility. Nat. Genet. 2008, 40, 17–22. [Google Scholar] [CrossRef]
Ellsworth, D.L.; Turner, C.E.; Ellsworth, R.E. A review of the hereditary component of triple negative breast cancer: High- and moderate-penetrance breast cancer genes, low-penetrance loci, and the role of nontraditional genetic elements. J. Oncol. 2019, 2019, 4382606. [Google Scholar] [CrossRef]
Zhang, F.; Lupski, J.R. Non-coding genetic variants in human disease. Hum. Mol. Genet. 2015, 24, R102–R110. [Google Scholar] [CrossRef] [PubMed]
Nishizaki, S.S.; Boyle, A.P. Mining the unknown: Assigning function to noncoding single nucleotide polymorphisms. Trends Genet. TIG 2017, 33, 34–45. [Google Scholar] [CrossRef] [PubMed]
Calin, G.A.; Croce, C.M. MicroRNA signatures in human cancers. Nat. Rev. Cancer 2006, 6, 857–866. [Google Scholar] [CrossRef]
Moszynska, A.; Gebert, M.; Collawn, J.F.; Bartoszewski, R. SNPs in microRNA target sites and their potential role in human disease. Open Biol. 2017, 7. [Google Scholar] [CrossRef]
Gong, J.; Tong, Y.; Zhang, H.M.; Wang, K.; Hu, T.; Shan, G.; Sun, J.; Guo, A.Y. Genome-wide identification of SNPs in microRNA genes and the SNP effects on microRNA target binding and biogenesis. Hum. Mutat. 2012, 33, 254–263. [Google Scholar] [CrossRef]
He, L.; He, X.; Lim, L.P.; de Stanchina, E.; Xuan, Z.; Liang, Y.; Xue, W.; Zender, L.; Magnus, J.; Ridzon, D.; et al. A microRNA component of the p53 tumour suppressor network. Nature 2007, 447, 1130–1134. [Google Scholar] [CrossRef]
Brendle, A.; Lei, H.; Brandt, A.; Johansson, R.; Enquist, K.; Henriksson, R.; Hemminki, K.; Lenner, P.; Forsti, A. Polymorphisms in predicted microRNA-binding sites in integrin genes and breast cancer: ITGB4 as prognostic marker. Carcinogenesis 2008, 29, 1394–1399. [Google Scholar] [CrossRef]
Esquela-Kerscher, A.; Slack, F.J. Oncomirs—MicroRNAs with a role in cancer. Nat. Rev. Cancer 2006, 6, 259–269. [Google Scholar] [CrossRef]
Upadhyaya, A.; Smith, R.A.; Chacon-Cortes, D.; Revechon, G.; Bellis, C.; Lea, R.A.; Haupt, L.M.; Chambers, S.K.; Youl, P.H.; Griffiths, L.R. Association of the microRNA-Single Nucleotide Polymorphism rs2910164 in miR146a with sporadic breast cancer susceptibility: A case control study. Gene 2016, 576, 256–260. [Google Scholar] [CrossRef]
Qian, F.; Feng, Y.; Zheng, Y.; Ogundiran, T.O.; Ojengbede, O.; Zheng, W.; Blot, W.; Ambrosone, C.B.; John, E.M.; Bernstein, L.; et al. Genetic variants in microRNA and microRNA biogenesis pathway genes and breast cancer risk among women of African ancestry. Hum. Genet. 2016, 135, 1145–1159. [Google Scholar] [CrossRef]
Naccarati, A.; Pardini, B.; Stefano, L.; Landi, D.; Slyskova, J.; Novotny, J.; Levy, M.; Polakova, V.; Lipska, L.; Vodicka, P. Polymorphisms in miRNA-binding sites of nucleotide excision repair genes and colorectal cancer risk. Carcinogenesis 2012, 33, 1346–1351. [Google Scholar] [CrossRef]
Mullany, L.E.; Wolff, R.K.; Herrick, J.S.; Buas, M.F.; Slattery, M.L. SNP Regulation of microRNA Expression and Subsequent Colon Cancer Risk. PLoS ONE 2015, 10, e0143894. [Google Scholar] [CrossRef]
Nicoloso, M.S.; Sun, H.; Spizzo, R.; Kim, H.; Wickramasinghe, P.; Shimizu, M.; Wojcik, S.E.; Ferdin, J.; Kunej, T.; Xiao, L.; et al. Single-nucleotide polymorphisms inside microRNA target sites influence tumor susceptibility. Cancer Res. 2010, 70, 2789–2798. [Google Scholar] [CrossRef] [PubMed]
Khan, S.; Greco, D.; Michailidou, K.; Milne, R.L.; Muranen, T.A.; Heikkinen, T.; Aaltonen, K.; Dennis, J.; Bolla, M.K.; Liu, J.; et al. MicroRNA related polymorphisms and breast cancer risk. PLoS ONE 2014, 9, e109973. [Google Scholar] [CrossRef]
Yang, H.; Dinney, C.P.; Ye, Y.; Zhu, Y.; Grossman, H.B.; Wu, X. Evaluation of genetic variants in microRNA-related genes and risk of bladder cancer. Cancer Res. 2008, 68, 2530–2537. [Google Scholar] [CrossRef] [PubMed]
Chen, K.; Song, F.; Calin, G.A.; Wei, Q.; Hao, X.; Zhang, W. Polymorphisms in microRNA targets: A gold mine for molecular epidemiology. Carcinogenesis 2008, 29, 1306–1311. [Google Scholar] [CrossRef] [PubMed]
Salzman, D.W.; Weidhaas, J.B. SNPing cancer in the bud: microRNA and microRNA-target site polymorphisms as diagnostic and prognostic biomarkers in cancer. Pharmacol. Ther. 2013, 137, 55–63. [Google Scholar] [CrossRef]
Wilk, G.; Braun, R. regQTLs: Single nucleotide polymorphisms that modulate microRNA regulation of gene expression in tumors. PLoS Genet. 2018, 14, e1007837. [Google Scholar] [CrossRef]
Kalapanida, D.; Zagouri, F.; Gazouli, M.; Zografos, E.; Dimitrakakis, C.; Marinopoulos, S.; Giannos, A.; Sergentanis, T.N.; Kastritis, E.; Terpos, E.; et al. Evaluation of pre-mir-34a rs72631823 single nucleotide polymorphism in triple negative breast cancer: A case-control study. Oncotarget 2018, 9, 36906–36913. [Google Scholar] [CrossRef]
Haiman, C.A.; Chen, G.K.; Vachon, C.M.; Canzian, F.; Dunning, A.; Millikan, R.C.; Wang, X.; Ademuyiwa, F.; Ahmed, S.; Ambrosone, C.B.; et al. A common variant at the TERT-CLPTM1L locus is associated with estrogen receptor-negative breast cancer. Nat. Genet. 2011, 43, 1210–1214. [Google Scholar] [CrossRef]
Stevens, K.N.; Fredericksen, Z.; Vachon, C.M.; Wang, X.; Margolin, S.; Lindblom, A.; Nevanlinna, H.; Greco, D.; Aittomaki, K.; Blomqvist, C.; et al. 19p13.1 is a triple-negative-specific breast cancer susceptibility locus. Cancer Res. 2012, 72, 1795–1803. [Google Scholar] [CrossRef] [PubMed]
Wynendaele, J.; Bohnke, A.; Leucci, E.; Nielsen, S.J.; Lambertz, I.; Hammer, S.; Sbrzesny, N.; Kubitza, D.; Wolf, A.; Gradhand, E.; et al. An illegitimate microRNA target site within the 3′ UTR of MDM4 affects ovarian cancer progression and chemosensitivity. Cancer Res. 2010, 70, 9641–9649. [Google Scholar] [CrossRef] [PubMed]
Purrington, K.S.; Slager, S.; Eccles, D.; Yannoukakos, D.; Fasching, P.A.; Miron, P.; Carpenter, J.; Chang-Claude, J.; Martin, N.G.; Montgomery, G.W.; et al. Genome-wide association study identifies 25 known breast cancer susceptibility loci as risk factors for triple-negative breast cancer. Carcinogenesis 2014, 35, 1012–1019. [Google Scholar] [CrossRef] [PubMed]
Liu, C.; Zhang, F.; Li, T.; Lu, M.; Wang, L.; Yue, W.; Zhang, D. MirSNP, a database of polymorphisms altering miRNA target sites, identifies miRNA-related SNPs in GWAS SNPs and eQTLs. BMC Genom. 2012, 13, 661. [Google Scholar] [CrossRef]
Zhao, J.; Wang, K.; Liao, Z.; Li, Y.; Yang, H.; Chen, C.; Zhou, Y.A.; Tao, Y.; Guo, M.; Ren, T.; et al. Promoter mutation of tumor suppressor microRNA-7 is associated with poor prognosis of lung cancer. Mol. Clin. Oncol. 2015, 3, 1329–1336. [Google Scholar] [CrossRef]
Milne, R.L.; Kuchenbaecker, K.B.; Michailidou, K.; Beesley, J.; Kar, S.; Lindstrom, S.; Hui, S.; Lemacon, A.; Soucy, P.; Dennis, J.; et al. Identification of ten variants associated with risk of estrogen-receptor-negative breast cancer. Nat. Genet. 2017, 49, 1767–1778. [Google Scholar] [CrossRef]
Wade, M.; Wang, Y.V.; Wahl, G.M. The p53 orchestra: Mdm2 and Mdmx set the tone. Trends Cell. Biol. 2010, 20, 299–309. [Google Scholar] [CrossRef]
Bartel, D.P. MicroRNAs: Genomics, biogenesis, mechanism, and function. Cell 2004, 116, 281–297. [Google Scholar] [CrossRef]
Griffith, O.L.; Montgomery, S.B.; Bernier, B.; Chu, B.; Kasaian, K.; Aerts, S.; Mahony, S.; Sleumer, M.C.; Bilenky, M.; Haeussler, M.; et al. ORegAnno: An open-access community-driven resource for regulatory annotation. Nucleic Acids Res. 2008, 36, D107–D113. [Google Scholar] [CrossRef]
Hiard, S.; Charlier, C.; Coppieters, W.; Georges, M.; Baurain, D. Patrocles: A database of polymorphic miRNA-mediated gene regulation in vertebrates. Nucleic Acids Res. 2010, 38, D640–D651. [Google Scholar] [CrossRef]
Hariharan, M.; Scaria, V.; Brahmachari, S.K. dbSMR: A novel resource of genome-wide SNPs affecting microRNA mediated regulation. BMC Bioinform. 2009, 10, 108. [Google Scholar] [CrossRef] [PubMed]
Bhattacharya, A.; Ziebarth, J.D.; Cui, Y. PolymiRTS Database 3.0: Linking polymorphisms in microRNAs and their target sites with human diseases and biological pathways. Nucleic Acids Res. 2014, 42, D86–D91. [Google Scholar] [CrossRef] [PubMed]
Barnholtz-Sloan, J.S.; Shetty, P.B.; Guan, X.; Nyante, S.J.; Luo, J.; Brennan, D.J.; Millikan, R.C. FGFR2 and other loci identified in genome-wide association studies are associated with breast cancer in African-American and younger women. Carcinogenesis 2010, 31, 1417–1423. [Google Scholar] [CrossRef]
Fletcher, O.; Johnson, N.; Orr, N.; Hosking, F.J.; Gibson, L.J.; Walker, K.; Zelenika, D.; Gut, I.; Heath, S.; Palles, C.; et al. Novel breast cancer susceptibility locus at 9q31.2: Results of a genome-wide association study. J. Natl. Cancer Inst. 2011, 103, 425–435. [Google Scholar] [CrossRef] [PubMed]
Gold, B.; Kirchhoff, T.; Stefanov, S.; Lautenberger, J.; Viale, A.; Garber, J.; Friedman, E.; Narod, S.; Olshen, A.B.; Gregersen, P.; et al. Genome-wide association study provides evidence for a breast cancer risk locus at 6q22.33. Proc. Natl. Acad. Sci. USA 2008, 105, 4340–4345. [Google Scholar] [CrossRef] [PubMed]
Hein, R.; Maranian, M.; Hopper, J.L.; Kapuscinski, M.K.; Southey, M.C.; Park, D.J.; Schmidt, M.K.; Broeks, A.; Hogervorst, F.B.; Bueno-de-Mesquita, H.B.; et al. Comparison of 6q25 breast cancer hits from Asian and European Genome Wide Association Studies in the Breast Cancer Association Consortium (BCAC). PLoS ONE 2012, 7, e42380. [Google Scholar] [CrossRef]
Kim, H.C.; Lee, J.Y.; Sung, H.; Choi, J.Y.; Park, S.K.; Lee, K.M.; Kim, Y.J.; Go, M.J.; Li, L.; Cho, Y.S.; et al. A genome-wide association study identifies a breast cancer risk variant in ERBB4 at 2q34: Results from the Seoul Breast Cancer Study. Breast Cancer Res. BCR 2012, 14, R56. [Google Scholar] [CrossRef]
Thomas, G.; Jacobs, K.B.; Kraft, P.; Yeager, M.; Wacholder, S.; Cox, D.G.; Hankinson, S.E.; Hutchinson, A.; Wang, Z.; Yu, K.; et al. A multistage genome-wide association study in breast cancer identifies two new risk alleles at 1p11.2 and 14q24.1 (RAD51L1). Nat. Genet. 2009, 41, 579–584. [Google Scholar] [CrossRef]
Turnbull, C.; Ahmed, S.; Morrison, J.; Pernet, D.; Renwick, A.; Maranian, M.; Seal, S.; Ghoussaini, M.; Hines, S.; Healey, C.S.; et al. Genome-wide association study identifies five new breast cancer susceptibility loci. Nat. Genet. 2010, 42, 504–507. [Google Scholar] [CrossRef]
Zheng, W.; Long, J.; Gao, Y.T.; Li, C.; Zheng, Y.; Xiang, Y.B.; Wen, W.; Levy, S.; Deming, S.L.; Haines, J.L.; et al. Genome-wide association study identifies a new breast cancer susceptibility locus at 6q25.1. Nat. Genet. 2009, 41, 324–328. [Google Scholar] [CrossRef]
Mavaddat, N.; Michailidou, K.; Dennis, J.; Lush, M.; Fachal, L.; Lee, A.; Tyrer, J.P.; Chen, T.H.; Wang, Q.; Bolla, M.K.; et al. Polygenic risk scores for prediction of breast cancer and breast cancer subtypes. Am. J. Hum. Genet. 2019, 104, 21–34. [Google Scholar] [CrossRef] [PubMed]
Rudolph, A.; Song, M.; Brook, M.N.; Milne, R.L.; Mavaddat, N.; Michailidou, K.; Bolla, M.K.; Wang, Q.; Dennis, J.; Wilcox, A.N.; et al. Joint associations of a polygenic risk score and environmental risk factors for breast cancer in the Breast Cancer Association Consortium. Int. J. Epidemiol. 2018, 47, 526–536. [Google Scholar] [CrossRef] [PubMed]
Michailidou, K.; Lindstrom, S.; Dennis, J.; Beesley, J.; Hui, S.; Kar, S.; Lemacon, A.; Soucy, P.; Glubb, D.; Rostamianfar, A.; et al. Association analysis identifies 65 new breast cancer risk loci. Nature 2017, 551, 92–94. [Google Scholar] [CrossRef] [PubMed]
Maas, P.; Barrdahl, M.; Joshi, A.D.; Auer, P.L.; Gaudet, M.M.; Milne, R.L.; Schumacher, F.R.; Anderson, W.F.; Check, D.; Chattopadhyay, S.; et al. Breast cancer risk from modifiable and nonmodifiable risk factors among white women in the United States. JAMA Oncol. 2016, 2, 1295–1302. [Google Scholar] [CrossRef] [PubMed]

Figure 1. A technical flow chart for genome-wide association studies (GWAS). Abbreviation: QC, quality control.

Figure 2. Frequencies of each type of functional consequence caused by single nucleotide polymorphisms (SNPs) associated with TNBC. Abbreviations: SNPs, single nucleotide polymorphisms; TNBC, triple negative breast cancer; UTR, untranslated region.

Figure 3. Schematic representation of SNPs associated with triple negative breast cancer obtained from the EMBL-EBI GWAS catalogue. Abbreviations: SNPs, single nucleotide polymorphisms; EMBL-EBI, European Molecular Biology Laboratory's European Bioinformatics Institute.

Figure 4. Correlations of the two most studied SNPs associated with triple negative breast cancer and other nearby SNPs identified following analysis of the GWAS catalogue. Abbreviations: SNPs, single nucleotide polymorphisms; GWAS, genome-wide association studies.

Figure 5. Locations of TNBC-related SNPs distributed across all somatic human chromosomes. SNPs related to TNBC risk are shown in red, and SNPs associated with different outcomes of TNBC are shown in violet (taken from Table S1). Abbreviations: A, adenine; C, cytosine; T, thymine; G, guanine.

Figure 6. Divergent bar graphs illustrating significant differences among subpopulation frequencies noted in TNBC-related SNPs. Abbreviations: A, adenine; C, cytosine; G, guanine; T, thymine.

Table 1. Single nucleotide polymorphisms (SNPs) associated with predicted miRNA target sites.

SNP	miRNA	Target Gene	Effect on Target Site
rs4245739	hsa-miR-191-5p	MDM4	Create
	hsa-miR-3545-3p		Break
	hsa-miR-3669		Create
	hsa-miR-4427		Break
	hsa-miR-887		Create
rs72993667	hsa-let-7a-3p	ESR1	Break
	hsa-let-7b-3p		Break
	hsa-let-7f-1-3p		Break
	hsa-miR-3613-3p		Break
	hsa-miR-548n		Decrease
rs4973768	hsa-miR-302a-5p	SLC4A7	Create
rs4808616	hsa-miR-3121-3p	ABHD8	Break
	hsa-miR-3189-3p		Decrease
	hsa-miR-635		Decrease
rs61764370	hsa-miR-1262	KRAS	Create
	hsa-miR-34b-3p		Create
	hsa-miR-4701-3p		Create
	hsa-miR-4701-3p		Decrease

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).


Jurj, M.-A.; Buse, M.; Zimta, A.-A.; Paradiso, A.; Korban, S.S.; Pop, L.-A.; Berindan-Neagoe, I. Critical Analysis of Genome-Wide Association Studies: Triple Negative Breast Cancer Quae Exempli Causa. Int. J. Mol. Sci. 2020, 21, 5835. https://doi.org/10.3390/ijms21165835
