Identification of Novel Endogenous Controls for qPCR Normalization in SK-BR-3 Breast Cancer Cell Line

Jain, Nityanand; Mitre, Ingrida; Nitisa, Dina; Pirsko, Valdis; Cakstina-Dzerve, Inese

doi:10.3390/genes12101631

Open AccessArticle

Identification of Novel Endogenous Controls for qPCR Normalization in SK-BR-3 Breast Cancer Cell Line

by

Nityanand Jain

^*

,

Ingrida Mitre

,

Dina Nitisa

,

Valdis Pirsko

and

Inese Cakstina-Dzerve

^*

Laboratory of Molecular Genetics, Institute of Oncology, Faculty of Medicine, Riga Stradiņš University, Dzirciema Street 16, LV-1007 Riga, Latvia

^*

Authors to whom correspondence should be addressed.

Genes 2021, 12(10), 1631; https://doi.org/10.3390/genes12101631

Submission received: 10 September 2021 / Revised: 12 October 2021 / Accepted: 13 October 2021 / Published: 17 October 2021

(This article belongs to the Section Human Genomics and Genetic Diseases)

Download

Browse Figures

Versions Notes

Abstract

Normalization of gene expression using internal controls or reference genes (RGs) has been the method of choice for standardizing the technical variations in reverse transcription quantitative polymerase chain reactions (RT-qPCR). Conventionally, ACTB and GAPDH have been used as reference genes despite evidence from literature discouraging their use. Hence, in the present study we identified and investigated novel reference genes in SK-BR-3, an HER2-enriched breast cancer cell line. Transcriptomic data of 82 HER2-E breast cancer samples from TCGA database were analyzed to identify twelve novel genes with stable expression. Additionally, thirteen RGs from the literature were analyzed. The expression variations of the candidate genes were studied over five successive passages (p) in two parallel cultures S1 and S2 and in acute and chronic hypoxia using various algorithms. Finally, the most stable RGs were selected and validated for normalization of the expression of three genes of interest (GOIs) in normoxia and hypoxia. Our results indicate that HSP90AB1, DAD1, PFN1 and PUM1 can be used in any combination of three (triplets) for optimizing intra- and inter-assay gene expression differences in the SK-BR-3 cell line. Additionally, we discourage the use of conventional RGs (ACTB, GAPDH, RPL13A, RNA18S and RNA28S) as internal controls for RT-qPCR in SK-BR-3 cell line.

Keywords:

SK-BR-3; RT-qPCR; reference genes; hypoxia; gene expression; breast cancer cell line; HER2 enriched

1. Introduction

Reverse Transcription Quantitative Polymerase Chain Reaction (RT-qPCR) represents a modified variant of the popular conventional PCR with diverse applications, ranging from functional genomics to molecular medicine, virology, microbiology, and biotechnology [1]. Quantitative PCR-based assays can target both DNA (genome) and RNA (transcriptome), thereby making it an extremely powerful and important technique in molecular diagnostics [2]. While functional genomics deals with understanding the functions and interactions of genes and proteins at a genome-wide level including the role of ligands, receptors, and signaling networks that converge on transcriptional regulation [2], transcriptomic analysis, deals with ascertaining the functional significance to expression signature changes between tissues, disease states, or treatment [2]. Large-scale analysis of expression patterns is performed by RNA-Seq or high-throughput microarray analysis, however, the findings for individual genes usually are validated by RT-qPCR due to its high sensitivity, specificity, reproducibility, and broad dynamic range [2,3,4].

However, this enhanced sensitivity of RT-qPCR imposes special conditions. The protocol necessitates accurate and precise pipetting, high-quality RNA, accurate estimation of RNA concentration, and efficient reverse transcription [3]. Other considerations include standardization of RT-qPCR protocols [5], maintaining consistency of used reagents [6,7] and careful attention towards assay design, template preparation, and statistical analysis [8]. Any deviation from these requirements (by human error etc.) can introduce variations and influence the accuracy and precision of the results [9,10,11,12]. To account for such deviations, several strategies are recommended that can be incorporated in the protocol at different stages [12,13,14] ranging from ensuring that a similar sample size is chosen to using controls such as spike-in foreign RNA and reference genes (previously housekeeping genes).

Normalization using reference genes has been the method of choice by most researchers since variations in the experimental workflow will affect all genes similarly [14,15,16]. The expression of reference genes is expected to be sufficient and stable across different tissues and cell lines under varied experimental conditions [17]. However, variance in the expression of conventional reference genes like ACTB and GAPDH across different cell types has been noted [18,19,20,21,22] causing a continued search for suitable candidate genes. In addition, given the highly specific nature of RT-qPCR, the Minimum Information for Publication of Quantitative RT-PCR Experiments (MIQE) guidelines recommend using more than one reference gene for normalization unless a clear evidence of uniform expression dynamics of a single reference gene is reported for specific experimental conditions [11]. Identification and validation of novel reference genes, hence, becomes imperative for accurate normalization. In the past, reference genes have been identified using various large-scale gene expression profiling methods such as Expressed Sequence Tags (ESTs), Serial Analysis of Gene Expression (SAGE) and Microarray Analysis [23,24,25]. However, with the advent of technology, better techniques using RNA-seq data have been employed to identify stable reference genes. Several studies have previously identified novel reference genes and/or validated conventional reference genes for the study of breast cancer [26,27,28,29,30,31,32,33,34,35,36,37,38].

Breast cancer represents the most common malignant disease worldwide among women, accounting for 24% of new cancer cases and 15% of cancer-related deaths in 2018 [39] with the number predicted to almost double to 46% by 2040 [40]. With the immense burden of the disease, it becomes crucial to develop better protocols, prediction tools, diagnostics, and treatment modalities. The PAM50 (Prediction Analysis for Microarrays) represents a 50 gene classifier containing mostly hormone receptor, proliferation-related, myoepithelial and basal feature-related genes and is widely used to classify breast cancer into molecular subtypes [41,42,43]. The HER2-Enriched (HER2-E) subtype according to PAM50 is defined by higher expression of ERBB2 along with the upregulated expression of tumor proliferation-related genes at the RNA and protein levels in comparison to other cancer types [44,45]. SK-BR-3, established in 1970 from the pleural effusion of a Caucasian female with malignant breast adenocarcinoma, is a human breast cancer cell line overexpressing ERBB2 gene product [46]. There are contrasting views in the literature over the classification of SK-BR-3 with some authors including it in the luminal subtype [47,48], with others classifying it as HER2-E [49,50,51].

Hypoxia is one of the principal drivers of tumor progression and growth in vivo. The presence of hypoxic conditions in the cancer microenvironment is not only a recognized event in cancer development but also is sustained by the cancer cells themselves, secondary to the inflammatory processes [52,53,54]. Such a condition in the local tumor microenvironment is necessary to induce angiogenesis and release of growth factors whilst inducing structural and functional damage to the healthy surrounding tissue [54]. It is estimated that about 1.5% of the whole human genome is transcriptionally responsive to hypoxia, thereby affecting gene expression [55]. Hence, given the critical role of hypoxia in tumorigenesis and its direct impact on gene expression, we decided to investigate the stability of the chosen reference genes in different hypoxic conditions.

In the present study, we identified and validated novel reference genes that could be used to normalize qPCR data in SK-BR-3 breast cancer cell line. Additionally, we compared expression stability of newly identified genes with previously reported reference genes [26,27,28,29,30,31,32,33,34,35,36,37,38] to select suitable candidate reference genes. Our study reports for the first time to our knowledge, a comprehensive analysis, combining previous and novel candidates studied over multiple successive passages (p), in replicate cultures S1 and S2, and validated in various hypoxic conditions for SK-BR-3 cell line.

2. Materials and Methods

2.1. TCGA Transcriptomic Analysis for Selection of Novel Candidate Reference Genes

The transcriptome profiling datasets of 82 HER2-E (PAM50 classifier) breast cancer samples were downloaded from TCGA-BRCA legacy archive via R package TCGAbiolinks [56,57,58]. The scaled_estimate (Platform—Illumina HiSeq; file extension—rsem.genes.results) values, which represent the estimated frequency of gene/transcripts amongst the total number of transcripts that were sequenced, were obtained from the database. TPM (transcripts per million) values were generated by multiplying scaled_estimate values to a factor of 1 million (10⁶). For data analysis and visualization, the TPM values were converted to logarithmic scale using log2(TPM).

2.2. Gene Ontology (GO)

GO Annotation and enrichment analysis was done using a web based open-access tool—The Gene Ontology Resource, powered by Panther Classification (http://geneontology.org/ (accessed on 15 April 2021)) [59,60,61,62]. Gene ranking was based on the fold enrichment against the background frequency of total genes annotated to that term in the designated species (Homo Sapiens; whole genome, GO version Oct 2020, doi: 10.5281/zenodo.408174) [63]. Fisher’s Exact test with False Discovery Rate (FDR) correction was used to estimate significance with the cut-off FDR value of <0.05. The tool was further used to group genes based on functional classification.

2.3. Culture and Seeding Conditions

Samples were collected from the SK-BR-3 cell line (ATCC, HTB-30) that had been used in our laboratory for previous studies [64]. For consecutive passage analysis, two cultures, S1 and S2, were established from different laboratory lineages of SK-BR-3 which were cultured over five consecutive passages (p7–p11). For hypoxic exposure analysis, cultures at the level of 80–90% confluence were transferred to Xvivo System Sx2 (BioSpherix Medical; 37 °C, 5% CO₂, 2% O₂). The length of hypoxic exposure for acute hypoxia samples was 24 h and 72 h. To obtain chronic hypoxia samples, the cultures (n = 3) were fully maintained in the hypoxic environment for four consecutive passages. From each culture, 3 lysates (in triplicates) were collected per passage. Cells were cultured in RPMI-1640 (Lonza, BE12-115F), supplemented with 10% FBS (fetal bovine serum; Sigma Aldrich, F9665) at 37 °C, 5% CO₂ with the growth medium replaced every 2–3 days. Cell passaging was performed using 1x TrypLE solution (Thermo Fisher Scientific, A12177-02). Cells were grown to 80–100% confluence in T-25 cm² flasks (Sarstedt). Cell count and viability were estimated using a cell counting chamber (Improved Neubauer Hemocytometer). For further consecutive passages, cells were seeded at a density of 5000 cells/cm². Three TRIzol lysates (1 × 10⁶ cells) were obtained from each passage for both cultures for RNA isolation.

2.4. RNA Extraction and cDNA Synthesis

Total RNA was extracted using Trizol reagent (Thermo Fisher Scientific, Waltham, MA, USA; 15596026) according to the manufacturer’s protocol. The concentration and quality of RNA were assessed by Nanodrop 2000 with the mean absorption ratios A260/280 and A260/230 checked to ensure RNA purity. RNA integrity was evaluated using 1.8% agarose gel electrophoresis. The RNA was further examined for DNA contamination by PCR for ACTB and GAPDH. The PCR reaction was performed in the presence of both positive and negative controls. No amplified PCR product was found on the agarose gel after PCR and electrophoresis of the RNA samples (except for positive controls). The cDNA synthesis reaction was carried out using the High-Capacity cDNA Reverse transcription kit (Thermo Fisher Scientific, Waltham, MA, USA; 4368814) in accordance with the manufacturer’s protocol and guidelines and was stored at −20 °C until further analysis.

2.5. Selection of Reference Genes and Primer Design

In total, thirteen reference genes were selected by searching for relevant literature related to various breast cancer cell lines [26,27,28,29,30,31,32,33,34,35,36,37,38]. Twelve more genes were selected from the TCGA dataset analysis and were referred to as Novel Candidate Reference genes. All selected genes are summarized in Table 1. Along with these 25 candidate reference genes, three genes of interest (target genes; GOIs) were also normalized to test the selected reference genes (Table 1). Primers for all 12 selected novel candidate reference genes, PPIA and SNAI1 were designed using Primer3Plus in accordance with the MIQE guidelines [11,65] while the rest were taken from literature or from our previous study in MCF-7 cell line [66]. The gene function, primer pairs and respective melting curves of all the selected genes are presented in Supplementary File S1.

Selection of the three genes of interest was based on the data from Expression atlas (https://www.ebi.ac.uk/gxa/home (accessed on 15 April 2021)); European Bioinformatics Institute). The atlas was searched for SK-BR-3 cell line and GOIs were randomly selected based on high expression (AURKA; 241 TPM), medium expression (BUB1; 31 TPM), and low expression (SNAI1; 5 TPM).

2.6. Primer Efficiency

Standard (calibration) curves were analyzed using different concentrations and dilutions as shown in Supplementary File S1. For each reaction, 7 µL was used in a 384 well plate. Each dilution was done in triplicate for each primer pair along with appropriate non-template controls (NTC). Real-time PCR was performed using the ViiA7 RT-PCR thermocycler (Thermo Fisher Scientific). The cycling parameters were 95 °C for 10 min followed by 40 cycles of amplification at 95 °C for 15 s, 58 °C for 30 s and 72 °C for 30 s with signal acquisition. After that melting curve were obtained by signal acquisition from 58 °C to 95 °C in increments of 0.05 °C/s.

2.7. Reverse Transcription Quantitative PCR (RT-qPCR)

Reverse transcription quantitative PCR (RT-qPCR) was performed with 10 ng of cDNA per reaction using ViiA 7 RT-PCR thermocycler (Thermo Fisher Scientific). Triplicate reactions of each sample were done using HOT FIREPol EvaGreen qPCR Supermix (Solis Biodyne, Tartu, Estonia; 08-36-00020) on 384 well plates (Applied Biosystems™, Thermo Fisher Scientific, Waltham, MA, USA; LS4309849). The cycling parameters were 95 °C for 10 min followed by 40 cycles of amplification at 95 °C for 15 s, 58 °C for 30 s and 72 °C for 30 s. This was followed by melting curve acquisition as described above. All assays were performed with non-template controls (NTC).

2.8. Determination of the Least Variable Reference Genes and Validation in Hypoxic Conditions

The Cq values obtained from RT-qPCR were used to determine the stability of the candidate reference genes in the sequential normoxic passage samples using different algorithms (coefficient of variation—CV%, NormFinder [67], geNorm [17], BestKeeper [68], Comparative ∆Ct [69] and RefFinder [70]). The least variable reference genes were then tested in hypoxic samples to verify their stability and expression in hypoxic conditions. The working methodology and cut-off criteria for each of these algorithms is demonstrated in Supplementary File S4. Data management and storage along with descriptive analysis was done using MS Excel (Microsoft Office 365). The various algorithms mentioned above were performed using R v4.0.2 (via R studio v1.3.1056). A complete workflow employed in the present study is shown in Scheme 1.

3. Results

3.1. TCGA Analysis for Selection of Novel Reference Genes

Novel candidate reference genes were selected on the basis of HER-E breast cancer sample transcriptomic data from TCGA legacy dataset by applying the following criteria: (I) medium to high expression levels—mean (log2(TPM) ≥ 5; (II) low expression variance—standard deviation (log2(TPM)) ≤ 1; (III) no exceptional expression—no log2 (TPM) differs from the mean log2 (TPM) by a factor of two or more (criteria based on study by Li Y. et al.) [71]. Once the genes were filtered according to the criteria, CV% (Coefficient of Variation) was calculated. The lower the CV%, the more stable the expression of a candidate reference gene.

A complete list of 3363 ranked genes which fulfilled the selection criteria is available in Supplementary File S5. The candidates for validation (12 genes) were selected, based on further criteria including association with dissimilar cellular functions and that the selected genes are not subunits of the same protein as encoded by traditional reference genes. Upon consideration, from the top 10 genes, six genes were selected (GABARAP, PFN1, UBC, EIF5A, CFL1 and TPT1). Three more genes were selected from ranks 11–20 (NACA, DAD1 and PSMB4). Finally, three genes (TUBA1B, RBX1 and BSG) were randomly selected from the top 400 ranks.

Based on expression levels (measured as log2[TPM]), ACTB, TPT1 and GAPDH showed high expression levels while PUM1, SF3A1 and PPIA showed low expression levels in HER2-E samples (Figure 1a). Based on CV%, however, GABARAP, ACTB and PFN1 were the genes with the least variable expression, while PUM1, SF3A1 and PGK1 were the genes with the most variable expression (Figure 1b). Genes with higher levels of expression generally were associated with lower inter-sample variation.

3.2. Gene Ontology (GO) Over-Representation Analysis

Based on fold enrichment (Table 2), the top ranked biological process was modulation by symbiont of the host process which included GAPDH, UBC and PGK1. However, the most significantly enriched biological process was cellular response to cytokine stimulus (FDR = 3.33 × 10⁻⁰³) which included GAPDH, RBX1, TUBA1B, HSP90AB1, RPL13A, UBC, CFL1, PPIA and PSMB4. A complete ontology for Molecular function and Cellular Component is presented in Supplementary File S2.

3.3. Grouping of Genes Based on Functional Classification (Panther)

Panther was used to group the candidate reference genes based on function. The grouping was done across five different ontologies (Figure 2; Supplementary File S2)—GO biological process, GO molecular function, GO cellular component, protein class and pathway. Most of the genes in GO cellular component analysis were associated with cell or cell part (17 genes each) (Figure 2). In protein class classification six genes were identified as genes encoding cytoskeletal proteins (ACTB, TUBA1B, TPT1, PFN1, CFL1 and GABARAP). Finally, based on pathway, three genes were associated with cytoskeletal regulation by Rho GTPase pathway (ACTB, PFN1 and CFL1) while two genes were associated each with glycolysis (GAPDH and PGK1) and Huntington disease (GAPDH and ACTB).

3.4. Candidate Reference Gene Stability in SK-BR-3 Cell Line

In the present study, we collected three technical replicates from five consecutive passages (p7–p11) in two replicate cultures S1 and S2. The biological replicates were analyzed for all 25 candidate reference genes, thereby producing a dataset with 250 Cq values. A similar dataset was analyzed for the three genes of interest (dataset of 30 Cq values). Different algorithms were then used to analyze the stability of the reference gene expression. NormFinder, geNorm, comparative ∆Ct and RefFinder all ranked HSP90AB1, PGK1, DAD1, PUM1 and RPL13A as the most stable genes in the consecutive passages of SK-BR-3 (Figure 3). BestKeeper, however, ranked RBX1, CFL1 and UBC as the most stable genes. RNA18S, TUBA1B and RNA28S were consistently ranked as the least stable reference gene candidates by all algorithms.

3.5. Candidate Reference Gene Stability in Replicate Cultures

To analyze the expression stability of candidate reference genes in replicate cultures S1 and S2, we applied all algorithms separately, as shown in Supplementary File S3. Descriptive analysis revealed that CCSER2 and GABARAP showed low expression levels (Cq > 28), while RNA28S and RNA18S showed high expression levels (Cq < 15). Coefficient of variation (CV%) analyses revealed that TUBA1B showed high variation (>50%) and hence was not a suitable candidate. Further analysis with BestKeeper revealed ACTB, SF3A1, CFL1, UBC and NACA to have a low/moderate correlation with the BestKeeper Index (BI). After removal of these 10 candidate reference genes, the remaining 15 genes were re-analyzed using various algorithms, and none of the remaining genes violated any criteria. Finally, cumulative rankings from both cultures revealed HSP90AB1, DAD1, PGK1, RPL13A and PUM1 to be the top five most stable genes.

3.6. Selection of Reference Genes for Further Validation

Since geNorm indicated that use of two genes (Supplementary File S3) would be sufficient, we decided to select the top five least variable genes from our analysis and test different triplets (rather than in pairs, as a good practice) of the selected five genes. To select the genes, we analyzed the results obtained so far from both cultures whilst also factoring-in CV% rankings (Supplementary File S3). As a result, HSP90AB1, DAD1, PFN1, RPL13A and PUM1 were selected for further analysis and normalization of the genes of interest.

3.7. Normalization of Genes of Interest (GOIs)

The ∆∆Ct method is the method of choice for normalization of the gene expression of genes of interest [72]. A modification of the ∆∆Ct method was employed in the present study. The modification allowed us to account for differential primer efficiencies and the use of multiple reference genes [73,74]. Subsequently, to account for primer efficiency in the equation, the efficiencies from broad and narrow range dilutions were used (Supplementary File S1). We then normalized the three genes of interest (GOIs)—AURKA, BUB1 and SNAI1—with triplet pairs of the five chosen reference genes (10 possible triplets as shown in Figure 4). According to the guidelines, any arbitrary sample can be considered as an internal calibrator (control) for normalization without any effects on the relative quantification result. Hence, we considered passage p7 as the internal calibrator since it was the initial passage in the experiment.

For evaluation of successful normalization, the NRQs (normalized relative quantities) should be checked for patterns in expression, and the difference should be minimal between each sample after normalization [73,75]. For AURKA, we noticed that the expression in comparison with p7 decreased in p8 followed by increase in p9. The NRQs were found to be close to p7 levels in p10 and p11 (Figure 4a). However, this trend was not observed in the triplets with RPL13A (triplets 2, 5, 6, 7, 9 and 10). The expression of triplets with RPL13A in p9 and p10 was highly elevated in comparison to p7 (Figure 4a). Similar observations were made for BUB1 and SNAI1 (Figure 4b,c). This may be explained by the fact that among the five selected genes RPL13A had the lowest correlation r (r = 0.856) with the BestKeeper Index (Additional Table S10 in Supplementary File S3), which was also shown by NormFinder results. Hence, RPL13A was not an ideal candidate and we removed it from the further analysis.

3.8. Effects of Hypoxia on the Stability of the Selected Reference Genes

To evaluate the effects of hypoxia on reference gene stabilities, we applied all algorithms combining data with hypoxic sample results for the four reference genes (HSP90AB1, PUM1, DAD1 and PFN1). In hypoxia, DAD1 and HSP90AB1 were the two most stable genes as revealed by NormFinder, geNorm, Comparative ∆Ct and RefFinder (Figure 5). BestKeeper, however, ranked PUM1 as the most stable gene, and the DAD1 showed the highest correlation with the BestKeeper index. Comparison of the expression stabilities of the genes in normoxia vs. hypoxia showed that the expression stability of all four genes decreased after the addition of hypoxic samples to the dataset (Figure 5). Only DAD1 and PFN1 displayed improved expression stability in hypoxia according to RefFinder (Figure 5F). Similarly, only these two genes improved their correlation with the BestKeeper index in hypoxia. Nonetheless, no gene violated the respective cutoffs in any of the algorithms, making them suitable reference genes.

3.9. Experimental Validation of Reference Genes in Hypoxic Conditions

We found that after normalization AURKA was slightly upregulated (Figure 6A) after 24 h of hypoxic exposure while after 72 h, different triplets indicated different results. The normalization factor (NF) was within the acceptable limits (NF < 2 to 3-fold relative to the average), thus eliminating potential causative issues, e.g., starting material quantity or quality, or a problem with one of the reference genes (either not stably expressed, or not adequately measured) [75]. As pointed out by the other authors [34,75], the choice of calibrator sample (reference genes) does not influence the relative quantification result.

Although numbers may differ, the actual fold differences between the samples remain identical, therefore the results are fully equivalent and only rescaled. The expression of BUB1 was sequentially increasing, while SNAI1 after an initial (24 h) decrease became upregulated, but still below the control levels (Figure 6). Both AURKA, and BUB1 were downregulated (−0.78 and −1.26 average log2 fold change, respectively) in chronic hypoxia, when normalized by all four reference gene triplets (Figure 6). However, SNAI1 was upregulated in chronic hypoxia (+1.03 average log2 fold change). Based on our analysis, we can conclude that the selected reference gene triplets were able to successfully normalize the genes of interest irrespective of the expression levels of gene of interest in SK-BR-3 cell line (AURKA—high expression; BUB1—medium expression; SNAI1—low expression).

4. Discussion

A pivotal aspect in any gene expression study is the selection of appropriate internal controls that are expressed uniformly irrespective of culturing conditions, experimental treatment, nutrient stress etc. These controls (or references) normalize any variations in starting quantities, calibration issues or poor pipetting, thereby providing accurate results. However, complex gene-gene interactions and environmental effects on gene expression complicate the identification of such controls. Another layer of complexity in this pursuit is added by intrinsic heterogeneity of the cancer cells. Several tools and algorithms (NormFinder, geNorm, BestKeeper, Comparative ∆Ct and RefFinder) have been developed that can aid in sorting, selection, and validation of the reference genes. However, the considerations regarding the applicability of these algorithms and the acceptable cut-off limits are often misinterpreted or misapplied. Finally, the identification and extensive validation of the reference genes for each type of biological object in different conditions may not be feasible in every instance. Such studies are often time and resource (labor, financial) consuming. In the present study, we have identified and validated novel reference genes for normalization of RT-qPCR results in SK-BR-3 breast cancer cell line.

In the past the TCGA database has been commonly used to identify novel and reliable candidate genes in different cancer types [37,38], but it has never been investigated individually for HER2-E subtype of breast cancer. Our analysis of TCGA legacy data revealed that GABARAP, ACTB and PFN1 were the top three genes with the least variation amongst the samples (Figure 1b). However, upon validation in the SK-BR-3 cell line, we found that the least variation amongst the samples was displayed by RBX1, UBC and CFL1 (Supplementary File S3). These differences between TCGA database and RT-qPCR results are normal and expected. Firstly, the database represents a heterogenous mixture of similar subtypes while the SK-BR-3 is a single cell line, and therefore constitutes a more homogenous sample. Secondly, all samples of TCGA repository come from primary untreated tumors collected from different institutes. Thirdly, there is an inherent possibility of bias in the biorepository generation process, stemming from different institutional research interests, operative protocols, or patient populations [76]. Additionally, it must be considered that tissue specimens (TCGA data) in addition to cancer and normal mammary cells contain various types of stromal and immune cells, which are absent from the cell culture samples. This type of heterogeneity also may have effects on the overall transcription levels and stability. The database accommodates for most of the possible gene expression variations in tissues (in vivo) and, since cell lines like SK-BR-3 have been under constant cultivation over long-term, they tend to behave like outliers when compared to the TCGA database. Nonetheless, such extrapolation, bridging and application of in vivo data to in vitro conditions enable us to find common core genes which are uniformly expressed in both scenarios.

Gene Ontology (GO) analysis (Table 2) revealed that the candidate genes selected based on the TCGA samples and literature were most significantly enriched in cellular response to cytokine stimulus (GO:0071345; CFL1, GAPDH, HSP90AB1, PPIA, PSMB4, RBX1, RPL13A, TUBA1B and UBC). Indeed, it is well known that breast cancer cells respond to various cytokines in their microenvironment. The cells have been shown to evade cytokines such as TGF-β in the early stages (due to its anti-proliferative effects), however, in the later stages TGF-β stimulates the progression of the disease by inducing epithelial to mesenchymal transition (EMT type 3) [77,78,79]. IL-1 (adipokine) is known to increase the aromatase activity in SK-BR-3 cells, resulting in the generation of bioactive estrogens and increased cellular proliferation [80]. Other cytokines like IL-6, IL-19, IL-20, TGF-α, TNF-α, and IL-23 are also known to promote cancer progression [77]. Using data available from Mouse Genome Informatics (MGI; http://www.informatics.jax.org/ (accessed on 12 June 2021)), CFL1 was associated with cellular response to IL-1, IL-6 and TNF, GAPDH and RPL13A were associated with response to IFN-γ, while HSP90AB1 and TUBA1B were associated with the response to IL-4. Apart from the cytokine response, the genes were found to be enriched in viral life cycle (GO:0019058; BSG, HSP90AB1, PCBP1, PPIA and UBC) and positive regulation of viral processes (GO:0048524; BSG, PFN1 and PPIA). Evidence of various virus-related DNAs like EBV (Epstein-Barr virus), HPV (Human Papillomavirus), BLV (bovine leukemia virus) and MMTV (Mouse mammary tumor virus) has been found in breast cancers [81,82,83,84]. In fact, Lawson et al., demonstrated the presence of HPV-associated pre-malignant koilocytes in normal and malignant breast tissues [85], indicating possible oncogenic correlation between the viruses and breast cancer. Using data from MGI, we found that BSG was associated with positive regulation of viral entry into the host cell, while HSP90AB1 was associated with virion attachment to the host cell. On the other hand, PFN1 and PPIA were found to be associated with positive regulation of viral transcription and viral genome replication, respectively.

To understand the role of the four validated reference genes in metabolic processes, GO biological processes analysis in association with data from MGI (http://www.informatics.jax.org/ (accessed on 12 June 2021)) were investigated. The genes were found to be associated with negative regulation of apoptosis (DAD1) and nucleobase containing compound metabolic process (Table 2). HSP90AB1 was found to be associated with positive regulation of activities of telomerase, phosphoprotein phosphatase and protein serine/threonine kinase. PFN1 was found to play a role in the positive regulation of DNA metabolic processes and of transcription by RNA polymerase II, while PUM1 was associated with the regulation of mRNA stability and the production of miRNAs (microRNAs). Based on Molecular function, HSP90AB1 and PFN1 shared two common ontologies—RNA binding and Cytoskeletal protein binding (Supplementary File S2), both of which seemed to be downregulated in hypoxic conditions (for PFN1, fold change was 0.68 and 0.45 after 24 and 72 h of acute hypoxia, respectively, and 0.89 in chronic hypoxia). With respect to cellular component analysis the regulation of different genes (even from the same ontology) during hypoxia was divergent/unrelated. Whilst PUM1, HSP90AB1 and PFN1 shared two ontologies, nuclear and cytosol-associated genes, the genes were differently regulated with PUM1 showing upregulation while the other two showed downregulation in gene expression.

Our analysis revealed that the four reference genes (HSP90AB1, DAD1, PUM1 and PFN1), when used in any combination of three, successfully normalized the genes of interest (AURKA, BUB1 and SNAI1) in both hypoxic and normoxic conditions over multiple passages. HSP90AB1, previously known as HSPCB, was first described as a candidate reference gene by Jacob et al. [31] in a variety of 25 different cancer cell lines. However, the only breast cancer line in that study was MCF-7 (Luminal A subtype). The gene has been included among the most stable reference genes in other tissues/organs, e.g., ovary, muscle tissue, adipose tissue etc. [86,87]. Heat Shock Proteins like HSP90AB1 have been previously shown to be downregulated in response to hypoxia in pig adipose-derived stromal/stem cells [88] as well as in human hepatocytes [89]. Such downregulation (fold expression change of 0.71 after 24 h and 0.88 after 72 h in acute hypoxia and 0.52 in chronic hypoxia) appeared to have an impact on the stability of the gene by marginally lowering the expression stability in hypoxic conditions (Figure 5).

DAD1, a novel reference gene identified in the present study, is a small integral membrane protein of the oligosaccharyltransferase (OST) enzyme complex involved in the highly conserved asparagine-linked glycosylation of proteins in all eukaryotic cells [90]. The gene has been shown to be involved in apoptosis. Loss of DAD1 gene led to apoptosis in hamster cell lines and yeast cells [91,92]. In fact, DAD1 was preferentially expressed in hepatocellular carcinoma (HCC) and prostate cancer cells [93,94]. Hence, it has been postulated that high expression of DAD1 in HCC cells can activate OST and block apoptosis, thereby enhancing tumor cell survival [93]. We speculate that a similar role of the gene in the SK-BR-3 breast cancer cell line could explain its consistent and stable expression. In A431 epithelial carcinoma cells, DAD1 was upregulated in hypoxic conditions (by a fold change of 1.2) after exposure of 72 h [95]. In our results, however, DAD1 was downregulated by 0.64-fold change after exposure to acute hypoxia for 24 and 72 h. The expression then increased and approached normoxia levels during chronic hypoxia (fold change 0.95). The differences in our results from those in A431 cells could be explained by the cardinal differences in background transcriptome and epigenome of A4321 and SK-BR-3 cells. Furthermore, there were differences in culturing conditions. While our cells were exposed to 2% O₂ in acute hypoxia, the A431 cells were exposed to <0.1% O₂ [95]. Our results suggest that SK-BR-3 cells after an initial shock phase quickly adapted to chronic hypoxic conditions and continued to express core genes in levels similar to normoxia. This is also supported by the fact that the expression stability of DAD1 improved in hypoxia, i.e., it was expressed even more stably after exposure to hypoxia.

PFN1, another novel reference gene identified in the present study, is often regarded as the founding member of its family constituting four profilin genes (PFN1, PFN2, PFN3 and PFN4). These genes have been associated with almost every aspect of cellular functions including proliferation, survival, motility, endocytosis, membrane trafficking, mRNA splicing as well as gene transcription [96,97]. The overexpression of PFN1 could negatively regulate cancer cell motility in breast cancer cells [98]. Additionally, it has been demonstrated that in triple negative breast cancer cell lines, overexpression of PFN1 suppresses AKT (serine-threonine kinase) activation via upregulation of PTEN (phosphatase and tensin homolog) [99], indicating the tumor-suppressive character of PFN1 gene. Similar observations have also been reported in pancreatic cancer cells [100]. These findings are of interest since the expression of PFN1 in the SK-BR-3 cell line seems to be uniform and stable despite its anti-tumorigenic nature. In fact, depletion of PFN1 in breast cancer cells has enabled hyper-migratory phenotype in vitro and enhanced hematogenous dissemination from primary tumor in vivo [101]. We can hypothesize that some downstream regulation is at play, which leads to decreased PFN1 protein levels in breast cancer cells despite rather uniform expression of the gene.

Finally, the fourth reference gene identified for the SK-BR-3 cell line, PUM1 has been previously described as a candidate reference gene in various cancers including breast cancers [28,33,35]. The gene has been associated with cancer cell growth, migration, and invasion [102]. In fact, amongst the four selected reference genes in our study, only PUM1 showed an increase in expression fold change in hypoxia. The fold change was 0.80 and 1.13 after 24 h and 72 h of acute hypoxia, respectively, and reached 1.56 in chronic hypoxia. These findings suggest that PUM1 may be more involved in the regulation of cellular functions under hypoxia in comparison to the other identified reference genes. Another candidate gene, RPL13A, has been previously described as a stable reference gene in breast cancers [27,33]. The expression of RPL13A was quite stable in our analysis, however, it did not yield successful normalization of the genes of interest.

Various other reference genes in the past have been described to be suitable reference genes in breast cancers including GAPDH, ACTB, PPIA and RNA28S/RNA18S [26,29,32,34]. However, there is also contrary evidence suggesting against the use of these genes as references [18,19,20,21,22]. The use of GAPDH as a reference gene has long been a topic of debate. It is overexpressed in cervical, prostate, pancreatic and lung cancers, and it has been the least stable gene in multiple studies [29,34,103,104]. Similar concerns have been raised concerning use of ACTB as a single internal control [28]. RNA18S and RNA28S have been reported previously to be stable reference gene candidates [34], however, concerns regarding the absence of purified mRNA samples and their relatively high abundance compared to the target mRNAs have been reported [17]. Both RNA28S and RNA18S were highly expressed (mean Cq = 7–8) in the present study and in our study of MCF-7 cell line [66]. Secondly, these two genes are not included in the TCGA database, which makes it difficult to compare their expression between in vivo and in vitro scenarios.

PPIA along with ACTB was found to be the most stable reference gene for basal type breast cancer cell lines in hypoxic and serum deprived conditions [36]. However, our analysis revealed that the expression of PPIA was moderately stable and ranked ninth in the combined analysis (Supplementary File S3). In our study of MCF-7 breast cancer cell line, we identified GAPDH-CCSER2-PCBP1 triplet as the most stable reference gene triplet which could be used to normalize the expression of genes of interest in various nutrient stress conditions [66]. However, the expression of CCSER2 was extremely variable in the present study. The gene did not reach the thresholds set for TCGA analysis, and the expression of the gene was low (mean Cq = 27.4). Hence, we eliminated it from our analysis in early stages. PCBP1 and GAPDH were among the top 15 most stable reference genes in our analysis (Supplementary File S3). Interestingly, GABARAP was identified to have the least variation (CV%) among our novel candidate reference genes, however, the gene was associated with low expression and poor primer efficiency when confirmed by RT-qPCR (Supplementary Files S1 and S3).

Nonetheless, the results of the present study are constrained by some limitations. First, the expression stabilities of the reference genes were validated in vitro only. Second, these reference genes were tested in normoxic and hypoxic conditions only. Their use for normalization of expression in other conditions such as nutrient stress, drug treatment etc. remains to be validated. Finally, given the heterogeneous behavior of cancer cells, there is a need for inter-laboratory validation to further confirm our results. The major question that arises is how many more reference genes can we identify. Although a precise number will be difficult to predict, the estimates in normal human tissues (using ESTs) predict the numbers to be in the range of 3100 to 6900 genes [23], thereby making a plethora of reference genes still waiting to be identified and validated that could be more promising than the ones previously reported. However, we agree with other authors that rather than testing thousands of genes, we need to validate a panel of reference genes whose expression under varying conditions can be proven to be as minimally variable and as robust as possible [35]. Accordingly, based on our analysis, we suggest the use of HSP90AB1, DAD1, PFN1 and PUM1 in any combination of three (triplet), thereby giving other researchers not one but four different combinations to choose from based on their individual experimental designs and needs.

5. Conclusions

Based on the results of the present study, we suggest the use of HSP90AB1, DAD1, PFN1 and PUM1 in any combination of threes (triplet) for normalization of the expression of genes of interest in SK-BR-3 breast cancer cell line. The conventional RT-qPCR reference genes such as ACTB, GAPDH, RPL13A, RNA18S, RNA28S, as well as CCSER2 and GABARAP are not appropriate for use as reference genes in the SK-BR-3 cell line.

Supplementary Materials

The following are available online at www.mdpi.com/10.3390/genes12101631/s1, Supplementary File S1: Primer Description and Efficiency, Supplementary File S2: Gene Ontology, Supplementary File S3: Replicate Culture Analysis, Supplementary File S4: Description of Algorithms, and Supplementary File S5: TCGA filtered ranking for HER2-E samples.

Author Contributions

N.J. and I.C.-D. conceptualized the study while I.C.-D., I.M., D.N. and N.J. were responsible for methodology. Transcriptomic data and statistical analysis were done by N.J. Data curation was done by N.J. and I.M. Validation of the study protocol and results was done by V.P., I.C.-D., D.N., I.M., and N.J. Visualization was done by N.J. and V.P. while project supervision and funding acquisition was done by I.C.-D. Investigations, coding, and validation were done by N.J. Original draft was prepared by N.J. and I.C.-D. while revisions and final review was done by V.P., D.N., I.C.-D., I.M., and N.J. All authors have read and agreed to the published version of the manuscript.

Funding

The present study was funded by Riga Stradiņš University (RSU) Project Nb.5-1/252/2020.

Institutional Review Board Statement

Not Applicable.

Informed Consent Statement

Not Applicable.

Data Availability Statement

The full TCGA dataset (after processing and ranking for HER2-E subtype) is available as Supplementary File S5. The RT-qPCR raw datasets and metadata analyzed are available from corresponding authors upon reasonable request.

Acknowledgments

We are extremely grateful and thankful to our colleagues in the Latvian Biomedical Research and Study Centre (LV-BMC) for their unconditional support in providing the Viia7 platform for performing the RT-qPCR reactions.

Conflicts of Interest

The authors declare no competing interests in the present study. Further, neither funders nor the funding institution had a role in the design of the study; in the collection, analysis, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

Bustin, S.A. Real-time, fluorescence-based quantitative PCR: A snapshot of current procedures and preferences. Expert Rev. Mol. Diagn. 2005, 5, 493–498. [Google Scholar] [CrossRef] [PubMed]
Bustin, S.A.; Nolan, T. Pitfalls of Quantitative Real-Time Reverse-Transcription Polymerase Chain Reaction. J. Biomol. Tech. JBT 2004, 15, 155–166. [Google Scholar] [PubMed]
Remans, T.; Keunen, E.; Bex, G.J.; Smeets, K.; Vangronsveld, J.; Cuypers, A. Reliable Gene Expression Analysis by Reverse Transcription-Quantitative PCR: Reporting and Minimizing the Uncertainty in Data Accuracy. Plant Cell 2014, 26, 3829–3837. [Google Scholar] [CrossRef] [PubMed]
Kubista, M.; Andrade, J.M.; Bengtsson, M.; Forootan, A.; Jonak, J.; Lind, K.; Sindelka, R.; Sjöback, R.; Sjögreen, B.; Strömbom, L.; et al. The real-time polymerase chain reaction. Mol. Asp. Med. 2006, 27, 95–125. [Google Scholar] [CrossRef] [PubMed]
Gabert, J.; Beillard, E.; Van Der Velden, V.H.J.; Bi, W.; Grimwade, D.; Pallisgaard, N.; Barbany, G.; Cazzaniga, G.; Cayuela, J.M.; Cavé, H.; et al. Standardization and quality control studies of ‘real-time’ quantitative reverse transcriptase polymerase chain reaction of fusion gene transcripts for residual disease detection in leukemia—A Europe Against Cancer Program. Leukemia 2003, 17, 2318–2357. [Google Scholar] [CrossRef] [PubMed]
Wolffs, P.; Grage, H.; Hagberg, O.; Rådström, P. Impact of DNA Polymerases and Their Buffer Systems on Quantitative Real-Time PCR. J. Clin. Microbiol. 2004, 42, 408–411. [Google Scholar] [CrossRef] [PubMed][Green Version]
Yeung, A.T.; Holloway, B.P.; Adams, P.S.; Shipley, G.L. Evaluation of dual-labeled fluorescent DNA probe purity versus performance in real-time PCR. BioTechniques 2004, 36, 266–275. [Google Scholar] [CrossRef] [PubMed]
Ginzinger, D.G. Gene quantification using real-time quantitative PCR: An emerging technology hits the mainstream. Exp. Hematol. 2002, 30, 503–512. [Google Scholar] [CrossRef] [PubMed]
Nolan, T.; Hands, R.E.; Bustin, S.A. Quantification of mRNA using real-time RT-PCR. Nat. Protoc. 2006, 1, 1559–1582. [Google Scholar] [CrossRef] [PubMed]
Udvardi, M.K.; Czechowski, T.; Scheible, W.-R. Eleven Golden Rules of Quantitative RT-PCR. Plant Cell 2008, 20, 1736–1737. [Google Scholar] [CrossRef] [PubMed]
Bustin, S.A.; Benes, V.; Garson, J.; Hellemans, J.; Huggett, J.; Kubista, M.; Mueller, R.; Nolan, T.; Pfaffl, M.; Shipley, G.L.; et al. The MIQE Guidelines: Minimum Information for Publication of Quantitative Real-Time PCR Experiments. Clin. Chem. 2009, 55, 611–622. [Google Scholar] [CrossRef] [PubMed]
Baker, M. qPCR: Quicker and easier but don′t be sloppy. Nat. Methods 2011, 8, 207–212. [Google Scholar] [CrossRef]
Taylor, S.; Wakem, M.; Dijkman, G.; Alsarraj, M.; Nguyen, M. A practical approach to RT-qPCR—Publishing data that conform to the MIQE guidelines. Methods 2010, 50, S1–S5. [Google Scholar] [CrossRef] [PubMed]
Huggett, J.F.; Dheda, K.; Bustin, S.; Zumla, P.S.A. Real-time RT-PCR normalisation; strategies and considerations. Genes Immun. 2005, 6, 279–284. [Google Scholar] [CrossRef] [PubMed]
Vandesompele, J.; Kubista, M.; Pfaffl, M.W. Reference gene validation software for improved normalization. In Real-Time PCR: Current Technology and Applications; Logan, J., Edwards, K., Saunders, N., Eds.; Caister Academic Press: Poole, UK, 2009; pp. 47–64. [Google Scholar]
Bustin, S.A.; Benes, V.; Garson, J.; Hellemans, J.; Huggett, J.; Kubista, M.; Mueller, R.; Nolan, T.; Pfaffl, M.; Shipley, G.; et al. The need for transparency and good practices in the qPCR literature. Nat. Methods 2013, 10, 1063–1067. [Google Scholar] [CrossRef] [PubMed]
Vandesompele, J.; De Preter, K.; Pattyn, F.; Poppe, B.; Van Roy, N.; De Paepe, A.; Speleman, F. Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes. Genome Biol. 2002, 3, research0034.1. [Google Scholar] [CrossRef] [PubMed]
Thellin, O.; Zorzi, W.; Lakaye, B.; De Borman, B.; Coumans, B.; Hennen, G.; Grisar, T.; Igout, A.; Heinen, E. Housekeeping genes as internal standards: Use and limits. J. Biotechnol. 1999, 75, 291–295. [Google Scholar] [CrossRef] [PubMed]
Lee, P.D.; Sladek, R.; Greenwood, C.M.; Hudson, T.J. Control Genes and Variability: Absence of Ubiquitous Reference Transcripts in Diverse Mammalian Expression Studies. Genome Res. 2002, 12, 292–297. [Google Scholar] [CrossRef] [PubMed]
Barber, R.D.; Harmer, D.W.; Coleman, R.A.; Clark, B.J. GAPDH as a housekeeping gene: Analysis of GAPDH mRNA expression in a panel of 72 human tissues. Physiol. Genom. 2005, 21, 389–395. [Google Scholar] [CrossRef] [PubMed]
Ghani, M.; Sato, C.; Rogaeva, E. Segmental duplications in genome-wide significant loci and housekeeping genes; warning for GAPDH and ACTB. Neurobiol. Aging 2013, 34, 1710.e1–1710.e4. [Google Scholar] [CrossRef] [PubMed]
Olsvik, P.A.; Søfteland, L.; Lie, K.K. Selection of reference genes for qRT-PCR examination of wild populations of Atlantic cod Gadus morhua. BMC Res. Notes 2008, 1, 47, Erratum in 2011, 4, 456. [Google Scholar] [CrossRef] [PubMed]
Zhu, J.; He, F.; Song, S.; Wang, J.; Yu, J. How many human genes can be defined as housekeeping with current expression data? BMC Genom. 2008, 9, 172. [Google Scholar] [CrossRef] [PubMed]
Velculescu, V.E.; Madden, S.L.; Zhang, L.; Lash, A.E.; Yu, J.; Rago, C.; Lal, A.; Wang, C.J.; Beaudry, G.A.; Ciriello, K.M.; et al. Analysis of human transcriptomes. Nat. Genet. 1999, 23, 387–388. [Google Scholar] [CrossRef] [PubMed]
Eisenberg, E.; Levanon, E.Y. Human housekeeping genes are compact. Trends Genet. 2003, 19, 362–365. [Google Scholar] [CrossRef] [PubMed]
Morse, D.L.; Carroll, D.; Weberg, L.; Borgstrom, M.C.; Ranger-Moore, J.; Gillies, R.J. Determining suitable internal standards for mRNA quantification of increasing cancer progression in human breast cells by real-time reverse transcriptase polymerase chain reaction. Anal. Biochem. 2005, 342, 69–77. [Google Scholar] [CrossRef] [PubMed]
De Jonge, H.J.M.; Fehrmann, R.; De Bont, E.S.J.M.; Hofstra, R.; Gerbens, F.; Kamps, W.A.; de Vries, E.; Van Der Zee, A.G.J.; Meerman, G.J.T.; Ter Elst, A. Evidence Based Selection of Housekeeping Genes. PLoS ONE 2007, 2, e898. [Google Scholar] [CrossRef] [PubMed]
Lyng, M.B.; Laenkholm, A.-V.; Pallisgaard, N.; Ditzel, H.J. Identification of genes for normalization of real-time RT-PCR data in breast carcinomas. BMC Cancer 2008, 8, 20. [Google Scholar] [CrossRef] [PubMed]
Gur-Dedeoglu, B.; Konu, O.; Bozkurt, B.; Ergul, G.; Seckin, S.; Yulug, I.G. Identification of Endogenous Reference Genes for qRT-PCR Analysis in Normal Matched Breast Tumor Tissues. Oncol. Res. Featur. Preclin. Clin. Cancer Ther. 2009, 17, 353–365. [Google Scholar] [CrossRef] [PubMed]
Lemma, S.; Avnet, S.; Salerno, M.; Chano, T.; Baldini, N. Identification and Validation of Housekeeping Genes for Gene Expression Analysis of Cancer Stem Cells. PLoS ONE 2016, 11, e0149481. [Google Scholar] [CrossRef] [PubMed]
Jacob, F.; Guertler, R.; Naim, S.; Nixdorf, S.; Fedier, A.; Hacker, N.F.; Heinzelmann-Schwarz, V. Careful Selection of Reference Genes Is Required for Reliable Performance of RT-qPCR in Human Normal and Cancer Cell Lines. PLoS ONE 2013, 8, e59180. [Google Scholar] [CrossRef] [PubMed]
Maltseva, D.V.; Khaustova, N.A.; Fedotov, N.N.; Matveeva, E.O.; Lebedev, A.E.; Shkurnikov, M.U.; Galatenko, V.V.; Schumacher, U.; Tonevitsky, A.G. High-throughput identification of reference genes for research and clinical RT-qPCR analysis of breast cancer samples. J. Clin. Bioinform. 2013, 3, 13. [Google Scholar] [CrossRef] [PubMed]
Kılıç, Y.; Çelebiler, A.; Sakızlı, M. Selecting housekeeping genes as references for the normalization of quantitative PCR data in breast cancer. Clin. Transl. Oncol. 2013, 16, 184–190. [Google Scholar] [CrossRef] [PubMed]
Liu, L.-L.; Zhao, H.; Ma, T.-F.; Ge, F.; Chen, C.-S.; Zhang, Y.-P. Identification of Valid Reference Genes for the Normalization of RT-qPCR Expression Studies in Human Breast Cancer Cell Lines Treated with and without Transient Transfection. PLoS ONE 2015, 10, e0117058. [Google Scholar] [CrossRef] [PubMed]
Tilli, T.M.; Castro, C.D.S.; Tuszynski, J.A.; Carels, N. A strategy to identify housekeeping genes suitable for analysis in breast cancer diseases. BMC Genom. 2016, 17, 639. [Google Scholar] [CrossRef] [PubMed]
Albuquerque, A.P.; Balmana, M.; Reis, C.A.; Beltrao, E.I. Identification of appropriate housekeeping genes for quantitative RT-PCR analysis in MDA-MB-231 and NCI-H460 human cancer cell lines under hypoxia and serum deprivation. J. Mol. Clin. Med. 2018, 1, 127–134. [Google Scholar] [CrossRef]
Jo, J.; Choi, S.; Oh, J.; Lee, S.-G.; Choi, S.Y.; Kim, K.K.; Park, C. Conventionally used reference genes are not outstanding for normalization of gene expression in human cancer research. BMC Bioinform. 2019, 20 (Suppl. S10), 245. [Google Scholar] [CrossRef] [PubMed]
Krasnov, G.S.; Kudryavtseva, A.V.; Snezhkina, A.V.; Lakunina, V.A.; Beniaminov, A.D.; Melnikova, N.V.; Dmitriev, A.A. Pan-Cancer Analysis of TCGA Data Revealed Promising Reference Genes for qPCR Normalization. Front. Genet. 2019, 10, 97. [Google Scholar] [CrossRef] [PubMed]
Bray, F.; Me, J.F.; Soerjomataram, I.; Siegel, R.L.; Torre, L.A.; Jemal, A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 2018, 68, 394–424, Erratum in 2020, 70, 313. [Google Scholar] [CrossRef] [PubMed]
Heer, E.; Harper, A.; Escandor, N.; Sung, H.; McCormack, V.; Fidler-Benaoudia, M.M. Global burden and trends in premenopausal and postmenopausal breast cancer: A population-based study. Lancet Glob. Health 2020, 8, e1027–e1037. [Google Scholar] [CrossRef] [PubMed]
Parker, J.S.; Mullins, M.; Cheang, M.C.U.; Leung, S.; Voduc, D.; Vickery, T.; Davies, S.; Fauron, C.; He, X.; Hu, Z.; et al. Supervised Risk Predictor of Breast Cancer Based on Intrinsic Subtypes. J. Clin. Oncol. 2009, 27, 1160–1167. [Google Scholar] [CrossRef] [PubMed]
Dowsett, M.; Sestak, I.; Knowles, E.L.; Sidhu, K.; Dunbier, A.; Cowens, J.W.; Ferree, S.; Storhoff, J.; Schaper, C.; Cuzick, J. Comparison of PAM50 Risk of Recurrence Score With Oncotype DX and IHC4 for Predicting Risk of Distant Recurrence After Endocrine Therapy. J. Clin. Oncol. 2013, 31, 2783–2790. [Google Scholar] [CrossRef] [PubMed]
Gnant, M.; Filipits, M.; Greil, R.; Stoeger, H.; Rudas, M.; Bago-Horvath, Z.; Mlineritsch, B.; Kwasny, W.; Knauer, M.; Singer, C.; et al. Predicting distant recurrence in receptor-positive breast cancer patients with limited clinicopathological risk: Using the PAM50 Risk of Recurrence score in 1478 postmenopausal patients of the ABCSG-8 trial treated with adjuvant endocrine therapy alone. Ann. Oncol. 2013, 25, 339–345. [Google Scholar] [CrossRef] [PubMed]
Ferrari, A.; Vincent-Salomon, A.; Pivot, X.; Sertier, A.-S.; Thomas, E.; Tonon, L.; Boyault, S.; Mulugeta, E.; Treilleux, I.; MacGrogan, G.; et al. A whole-genome sequence and transcriptome perspective on HER2-positive breast cancers. Nat. Commun. 2016, 7, 12222. [Google Scholar] [CrossRef] [PubMed]
Cejalvo, J.M.; Pascual, T.; Fernandez-Martinez, A.; Maristany, F.B.; Gomis, R.; Perou, C.; Munoz, M.; Prat, A. Clinical implications of the non-luminal intrinsic subtypes in hormone receptor-positive breast cancer. Cancer Treat. Rev. 2018, 67, 63–70. [Google Scholar] [CrossRef] [PubMed]
SK-BR-3: Human Breast Cancer Cell Line (ATCC HTB-30) Memorial Sloan Kettering Cancer Center. Available online: https://www.mskcc.org/research-advantage/support/technology/tangible-material/human-breast-cell-line-sk-br-3 (accessed on 30 November 2020).
Kenny, P.A.; Lee, G.Y.; Myers, C.A.; Neve, R.M.; Semeiks, J.R.; Spellman, P.T.; Lorenz, K.; Lee, E.H.; Barcellos-Hoff, M.H.; Petersen, O.W.; et al. The morphologies of breast cancer cell lines in three-dimensional assays correlate with their profiles of gene expression. Mol. Oncol. 2007, 1, 84–96. [Google Scholar] [CrossRef] [PubMed]
Kao, J.; Salari, K.; Bocanegra, M.; Choi, Y.-L.; Girard, L.; Gandhi, J.; Kwei, K.A.; Hernandez-Boussard, T.; Wang, P.; Gazdar, A.F.; et al. Molecular Profiling of Breast Cancer Cell Lines Defines Relevant Tumor Models and Provides a Resource for Cancer Gene Discovery. PLoS ONE 2009, 4, e6146. [Google Scholar] [CrossRef] [PubMed]
Marcotte, R.; Sayad, A.; Brown, K.R.; Sanchez-Garcia, F.; Reimand, J.; Haider, M.; Virtanen, C.; Bradner, J.E.; Bader, G.D.; Mills, G.B.; et al. Functional Genomic Landscape of Human Breast Cancer Drivers, Vulnerabilities, and Resistance. Cell 2016, 164, 293–309. [Google Scholar] [CrossRef] [PubMed]
Siddiqui, R.A.; Harvey, K.A.; Walker, C.; Altenburg, J.; Xu, Z.; Terry, C.; Camarillo, I.; Jones-Hall, Y.; Mariash, C. Characterization of synergistic anti-cancer effects of docosahexaenoic acid and curcumin on DMBA-induced mammary tumorigenesis in mice. BMC Cancer 2013, 13, 418. [Google Scholar] [CrossRef] [PubMed]
Maristany, F.B.; Griguolo, G.; Pascual, T.; Pare, L.; Nuciforo, P.; Llombart-Cussac, A.; Bermejo, B.; Oliveira, M.; Morales, S.; Martínez, N.; et al. Phenotypic changes of HER2-positive breast cancer during and after dual HER2 blockade. Nat. Commun. 2020, 11, 385. [Google Scholar] [CrossRef] [PubMed]
Finger, E.C.; Giaccia, A.J. Hypoxia, inflammation, and the tumor microenvironment in metastatic disease. Cancer Metastasis Rev. 2010, 29, 285–293. [Google Scholar] [CrossRef] [PubMed]
Triner, D.; Shah, Y.M. Hypoxia-inducible factors: A central link between inflammation and cancer. J. Clin. Investig. 2016, 126, 3689–3698. [Google Scholar] [CrossRef] [PubMed]
Zhang, Y.; Zhang, H.; Wang, M.; Schmid, T.; Xin, Z.; Kozhuharova, L.; Yu, W.-K.; Huang, Y.; Cai, F.; Biskup, E. Hypoxia in Breast Cancer—Scientific Translation to Therapeutic and Diagnostic Clinical Applications. Front. Oncol. 2021, 11, 652266. [Google Scholar] [CrossRef] [PubMed]
Denko, N.C.; Fontana, L.A.; Hudson, K.M.; Sutphin, P.D.; Raychaudhuri, S.; Altman, R.; Giaccia, A.J. Investigating hypoxic tumor physiology through gene expression patterns. Oncogene 2003, 22, 5907–5914. [Google Scholar] [CrossRef] [PubMed]
Colaprico, A.; Silva, T.C.; Olsen, C.; Garofano, L.; Cava, C.; Garolini, D.; Sabedot, T.S.; Malta, T.; Pagnotta, S.M.; Castiglioni, I.; et al. TCGAbiolinks: An R/Bioconductor package for integrative analysis of TCGA data. Nucleic Acids Res. 2015, 44, e71. [Google Scholar] [CrossRef] [PubMed]
Silva, T.C.; Colaprico, A.; Olsen, C.; D′Angelo, F.; Bontempi, G.; Ceccarelli, M.; Noushmehr, H. TCGA Workflow: Analyze cancer genomics and epigenomics data using Bioconductor packages. F1000Research 2016, 5, 1542. [Google Scholar] [CrossRef] [PubMed]
Mounir, M.; Lucchetta, M.; Silva, T.C.; Olsen, C.; Bontempi, G.; Chen, X.; Noushmehr, H.; Colaprico, A.; Papaleo, E. New functionalities in the TCGAbiolinks package for the study and integration of cancer data from GDC and GTEx. PLoS Comput. Biol. 2019, 15, e1006701. [Google Scholar] [CrossRef] [PubMed]
Ashburner, M.; Ball, C.A.; Blake, J.; Botstein, D.; Butler, H.; Cherry, J.M.; Davis, A.P.; Dolinski, K.; Dwight, S.S.; Eppig, J.T.; et al. Gene Ontology: Tool for the unification of biology. Nat. Genet. 2000, 25, 25–29. [Google Scholar] [CrossRef] [PubMed]
The Gene Ontology Consortium; Carbon, S.; Douglass, E.; Good, B.M.; Unni, D.R.; Harris, N.L.; Mungall, C.J.; Basu, S.; Chisholm, R.L.; Dodson, R.J.; et al. The Gene Ontology resource: Enriching a Gold mine. Nucleic Acids Res. 2020, 49, D325–D334. [Google Scholar] [CrossRef] [PubMed]
Mi, H.; Muruganujan, A.; Ebert, D.; Huang, X.; Thomas, P.D. PANTHER version 14: More genomes, a new PANTHER GO-slim and improvements in enrichment analysis tools. Nucleic Acids Res. 2018, 47, D419–D426. [Google Scholar] [CrossRef] [PubMed]
Mi, H.; Muruganujan, A.; Casagrande, J.T.; Thomas, P. Large-scale gene function analysis with the PANTHER classification system. Nat. Protoc. 2013, 8, 1551–1566. [Google Scholar] [CrossRef] [PubMed]
Dalmer, T.R.A.; Clugston, R.D. Gene ontology enrichment analysis of congenital diaphragmatic hernia-associated genes. Pediatr. Res. 2018, 85, 13–19, Erratum in 2019, 86, 676. [Google Scholar] [CrossRef] [PubMed]
Pirsko, V.; Cakstina, I.; Priedite, M.; Dortane, R.; Feldmane, L.; Nakazawa-Miklasevica, M.; Daneberga, Z.; Gardovskis, J.; Miklasevics, E. An Effect of Culture Media on Epithelial Differentiation Markers in Breast Cancer Cell Lines MCF7, MDA-MB-436 and SkBr3. Medicina 2018, 54, 11. [Google Scholar] [CrossRef] [PubMed]
Untergasser, A.; Nijveen, H.; Rao, X.; Bisseling, T.; Geurts, R.; Leunissen, J.A.M. Primer3Plus, an enhanced web interface to Primer3. Nucleic Acids Res. 2007, 35, W71–W74. [Google Scholar] [CrossRef] [PubMed]
Jain, N.; Nitisa, D.; Pirsko, V.; Cakstina, I. Selecting suitable reference genes for qPCR normalization: A comprehensive analysis in MCF-7 breast cancer cell line. BMC Mol. Cell Biol. 2020, 21, 1–19. [Google Scholar] [CrossRef] [PubMed]
Andersen, C.L.; Jensen, J.L.; Ørntoft, T.F. Normalization of Real-Time Quantitative Reverse Transcription-PCR Data: A Model-Based Variance Estimation Approach to Identify Genes Suited for Normalization, Applied to Bladder and Colon Cancer Data Sets. Cancer Res. 2004, 64, 5245–5250. [Google Scholar] [CrossRef] [PubMed]
Pfaffl, M.W.; Tichopad, A.; Prgomet, C.; Neuvians, T.P. Determination of stable housekeeping genes, differentially regulated target genes and sample integrity: BestKeeper—Excel-based tool using pair-wise correlations. Biotechnol. Lett. 2004, 26, 509–515. [Google Scholar] [CrossRef] [PubMed]
Silver, N.; Best, S.; Jiang, J.; Thein, S.L. Selection of housekeeping genes for gene expression studies in human reticulocytes using real-time PCR. BMC Mol. Biol. 2006, 7, 33. [Google Scholar] [CrossRef] [PubMed]
Xie, F.; Xiao, P.; Chen, D.; Xu, L.; Zhang, B. miRDeepFinder: A miRNA analysis tool for deep sequencing of plant small RNAs. Plant. Mol. Biol. 2012, 80, 75–84. [Google Scholar] [CrossRef] [PubMed]
Li, Y.; Zhang, L.; Li, R.; Zhang, M.; Li, Y.; Wang, H.; Wang, S.; Bao, Z. Systematic identification and validation of the reference genes from 60 RNA-Seq libraries in the scallop Mizuhopecten yessoensis. BMC Genom. 2019, 20, 288. [Google Scholar] [CrossRef] [PubMed]
Livak, K.J.; Schmittgen, T.D. Analysis of Relative Gene Expression Data Using Real-Time Quantitative PCR and the 2⁻ΔΔCT Method. Methods 2001, 25, 402–408. [Google Scholar] [CrossRef] [PubMed]
Pfaffl, M.W. A new mathematical model for relative quantification in real-time RT-PCR. Nucleic Acids Res. 2001, 29, e45. [Google Scholar] [CrossRef] [PubMed]
Hellemans, J.; Mortier, G.; De Paepe, A.; Speleman, F.; Vandesompele, J. qBase relative quantification framework and software for management and automated analysis of real-time quantitative PCR data. Genome Biol. 2007, 8, R19. [Google Scholar] [CrossRef] [PubMed]
D′haene, B.; Hellemans, J. The importance of quality control during qPCR data analyis. Int. Drug Discov. 2010, 18, 24. [Google Scholar]
Cooper, L.A.; Demicco, E.G.; Saltz, J.H.; Powell, R.T.; Rao, A.; Lazar, A.J. PanCancer insights from The Cancer Genome Atlas: The pathologist′s perspective. J. Pathol. 2017, 244, 512–524. [Google Scholar] [CrossRef] [PubMed]
Esquivel-Velázquez, M.; Ostoa-Saloma, P.; Palacios-Arreola, M.I.; Castro, K.E.N.; Castro, J.I.; Morales-Montor, J. The Role of Cytokines in Breast Cancer Development and Progression. J. Interf. Cytokine Res. 2015, 35, 1–16. [Google Scholar] [CrossRef] [PubMed]
Joshi, A. TGF-ß signaling, tumor microenvironment and tumor progression: The butterfly effect. Front. Biosci. 2010, 15, 180–194. [Google Scholar] [CrossRef] [PubMed]
Vincent, T.; Neve, E.P.A.; Johnson, J.R.; Kukalev, A.; Rojo, F.; Albanell, J.; Pietras, K.; Virtanen, I.; Philipson, L.; Leopold, P.L.; et al. A SNAIL1–SMAD3/4 transcriptional repressor complex promotes TGF-β mediated epithelial–mesenchymal transition. Nature 2009, 11, 943–950. [Google Scholar] [CrossRef] [PubMed]
Honma, S.; Shimodaira, K.; Shimizu, Y.; Tsuchiya, N.; Saito, H.; Yanaihara, T.; Okai, T. The Influence of Inflammatory Cytokines on Estrogen Production and Cell Proliferation in Human Breast Cancer Cells. Endocr. J. 2002, 49, 371–377. [Google Scholar] [CrossRef] [PubMed]
Horiuchi, K.; Mishima, K.; Ohsawa, M.; Aozasa, K. Carcinoma of stomach and breast with lymphoid stroma: Localisation of Epstein-Barr virus. J. Clin. Pathol. 1994, 47, 538–540. [Google Scholar] [CrossRef] [PubMed]
Di Lonardo, A.; Venuti, A.; Marcante, M.L. Human papillomavirus in breast cancer. Breast Cancer Res. Treat. 1992, 21, 95–100. [Google Scholar] [CrossRef] [PubMed]
Buehring, G.C.; Shen, H.M.; Jensen, H.M.; Block, G. Bovine leukemia virus infection is significantly associated with risk of breast cancer. Proc. Am. Assoc. Cancer Res. 2007, 48, 1747. [Google Scholar]
Wang, Y.; Holland, J.F.; Bleiweiss, I.J.; Melana, S.; Liu, X.; Pelisson, I.; Cantarella, A.; Stellrecht, K.; Mani, S.; Pogo, B.G. Detection of mammary tumor virus env gene-like sequences in human breast cancer. Cancer Res. 1995, 55, 5173–5179. [Google Scholar] [PubMed]
Lawson, J.S.; Glenn, W.K.; Heng, B.; Ye, Y.; Tran, B.; Lutze-Mann, L.; Whitaker, N.J. Koilocytes indicate a role for human papilloma virus in breast cancer. Br. J. Cancer 2009, 101, 1351–1356. [Google Scholar] [CrossRef] [PubMed]
Fu, J.; Bian, L.; Zhao, L.; Dong, Z.; Gao, X.; Luan, H.; Sun, Y.; Song, H. Identification of genes for normalization of quantitative real-time PCR data in ovarian tissues. Acta Biochim. et Biophys. Sin. (Shanghai) 2010, 42, 568–574. [Google Scholar] [CrossRef] [PubMed]
Perez, L.J.; Rios, L.; Trivedi, P.; D′Souza, K.; Cowie, A.; Nzirorera, C.; Webster, D.; Brunt, K.; Legare, J.-F.; Hassan, A.; et al. Validation of optimal reference genes for quantitative real time PCR in muscle and adipose tissue for obesity and diabetes research. Sci. Rep. 2017, 7, 3612. [Google Scholar] [CrossRef] [PubMed]
Bukowska, J.; Słowińska, M.; Cierniak, P.; Kopcewicz, M.; Walendzik, K.; Frazier, T.; Gawrońska-Kozak, B. The effect of hypoxia on the proteomic signature of pig adipose-derived stromal/stem cells (pASCs). Sci. Rep. 2020, 10, 20035. [Google Scholar] [CrossRef] [PubMed]
Sonna, L.A.; Cullivan, M.L.; Sheldon, H.K.; Pratt, R.E.; Lilly, C.M. Effect of hypoxia on gene expression by human hepatocytes (HepG2). Physiol. Genom. 2003, 12, 195–207. [Google Scholar] [CrossRef] [PubMed]
Sanjay, A.; Fu, J.; Kreibich, G. DAD1 Is Required for the Function and the Structural Integrity of the Oligosaccharyltransferase Complex. J. Biol. Chem. 1998, 273, 26094–26099. [Google Scholar] [CrossRef] [PubMed]
Nakashima, T.; Sekiguchi, T.; Kuraoka, A.; Fukushima, K.; Shibata, Y.; Komiyama, S.; Nishimoto, T. Molecular cloning of a human cDNA encoding a novel protein, DAD1, whose defect causes apoptotic cell death in hamster BHK21 cells. Mol. Cell. Biol. 1993, 13, 6367–6374. [Google Scholar] [CrossRef] [PubMed]
Silberstein, S.; Collins, P.G.; Kelleher, D.J.; Gilmore, R. The essential OST2 gene encodes the 16-kD subunit of the yeast oligosaccharyltransferase, a highly conserved protein expressed in diverse eukaryotic organisms. J. Cell Biol. 1995, 131, 371–383. [Google Scholar] [CrossRef] [PubMed]
Tanaka, K.; Kondoh, N.; Shuda, M.; Matsubara, O.; Imazeki, N.; Ryo, A.; Wakatsuki, T.; Hada, A.; Goseki, N.; Igari, T.; et al. Enhanced expression of mRNAs of antisecretory factor-1, gp96, DAD1 and CDC34 in human hepatocellular carcinomas. Biochim. et Biophys. Acta (BBA)-Mol. Basis Dis. 2001, 1536, 1–12. [Google Scholar] [CrossRef] [PubMed]
Zhang, H.; Hoang, T.; Saeed, B.; Ng, S.C. Induction of apoptosis in prostatic tumor cell line DU145 by staurosporine, a potent inhibitor of protein kinases. Prostate 1996, 29, 69–76. [Google Scholar] [CrossRef] [PubMed]
Ren, Y.; Hao, P.; Dutta, B.; Cheow, E.S.H.; Sim, K.H.; Gan, C.S.; Lim, S.K.; Sze, S.K. Hypoxia Modulates A431 Cellular Pathways Association to Tumor Radioresistance and Enhanced Migration Revealed by Comprehensive Proteomic and Functional Studies. Mol. Cell. Proteom. 2013, 12, 485–498. [Google Scholar] [CrossRef] [PubMed]
Ding, Z.; Bae, Y.H.; Roy, P. Molecular insights on context-specific role of profilin-1 in cell migration. Cell Adhes. Migr. 2012, 6, 442–449. [Google Scholar] [CrossRef] [PubMed]
Witke, W. The role of profilin complexes in cell motility and other cellular processes. Trends Cell Biol. 2004, 14, 461–469. [Google Scholar] [CrossRef] [PubMed]
Bae, Y.H.; Ding, Z.; Zou, L.; Wells, A.; Gertler, F.; Roy, P. Loss of profilin-1 expression enhances breast cancer cell motility by Ena/VASP proteins. J. Cell. Physiol. 2008, 219, 354–364. [Google Scholar] [CrossRef] [PubMed]
Das, T.; Bae, Y.H.; Wells, A.; Roy, P. Profilin-1 overexpression upregulates PTEN and suppresses AKT activation in breast cancer cells. J. Cell. Physiol. 2008, 218, 436–443. [Google Scholar] [CrossRef] [PubMed]
Yao, W.; Ji, S.; Qin, Y.; Yang, J.; Xu, J.; Zhang, B.; Xu, W.; Liu, J.; Shi, S.; Liu, L.; et al. Profilin-1 suppresses tumorigenicity in pancreatic cancer through regulation of the SIRT3-HIF1α axis. Mol. Cancer 2014, 13, 1–12. [Google Scholar] [CrossRef] [PubMed]
Roy, P.; Gau, D.; Bae, Y.; Ohayon, L. Breast cancer cell invasiveness is stimulated by loss of membrane interaction of actinbinding protein profilin1 via altered phosphoinositide metabolism. FASEB J. 2019, 33, 488.13. [Google Scholar] [CrossRef]
Guan, X.; Chen, S.; Liu, Y.; Wang, L.-L.; Zhao, Y.; Zong, Z.-H. PUM1 promotes ovarian cancer proliferation, migration and invasion. Biochem. Biophys. Res. Commun. 2018, 497, 313–318. [Google Scholar] [CrossRef] [PubMed]
Kim, J.W.; Kim, S.J.; Han, S.M.; Paik, S.Y.; Hur, S.Y.; Kim, Y.W.; Lee, J.M.; Namkoong, S.E. Increased Glyceraldehyde-3-Phosphate Dehydrogenase Gene Expression in Human Cervical Cancers. Gynecol. Oncol. 1998, 71, 266–269. [Google Scholar] [CrossRef] [PubMed]
Rondinelli, R.H.; Epner, D.E.; Tricoli, J.V. Increased glyceraldehyde-3-phosphate dehydrogenase gene expression in late pathological stage human prostate cancer. Prostate Cancer Prostatic Dis. 1997, 1, 66–72. [Google Scholar] [CrossRef] [PubMed]

Scheme 1. Brief overview of the workflow of analysis employed in the present study. The boxes indicate the different steps including lab wet work that was performed. The circles indicate the algorithms used for selection and identification of appropriate reference genes in SK-BR-3 breast cancer cell line. Blue boxes indicate the cellular wet lab work whilst the green and pink boxes indicate the two pipelines followed for selection of candidate reference genes. The yellow boxes indicate the major work packages and milestones in the common workflow employed. Grey circles indicate the different algorithms employed for the determination of appropriate reference genes.

Figure 1. Ranking of the novel (pink) and conventional (blue) candidate reference genes. The rankings are based on TCGA database analysis. (a) Gene ranking based on expression levels (log2[TPM]); (b) Scatterplot showing the order of genes based on CV% and mean log2[TPM] values. CCSER2 (grey boxplot) was retrieved as FAM190B and violated the selection criteria due to low expression levels.

Figure 2. Grouping of the candidate reference genes according to Panther Functional Classification, which is further based on five different ontologies.

Figure 3. Ranking of candidate reference genes according to various algorithms (A) NormFinder; (B) geNorm; (C) Comparative ∆Ct; (D) BestKeeper; (E) Correlation with BestKeeper Index (BI) and (F) RefFinder.

Figure 4. Normalization of the three genes of interest (GOIs) in sequential passage experiments of SK-BR-3 cell line (S1 + S2) for (a) AURKA; (b) BUB1 and (c) SNAI1. The corresponding triplet codes shown on the x-axis are interpreted in figure legend. Error bars indicate the standard error (S.E.). Calib (p7) stands for the calibration passage 7, i.e., the initial passage which was used as an internal experimental control to obtain normalized relative quantities (NRQ) by Pfaffl’s method.

Figure 5. Ranking of the candidate reference genes according to various algorithms (A) NormFinder; (B) geNorm; (C) Comparative ∆Ct; (D) BestKeeper; (E) correlation with BestKeeper Index (BI) and (F) RefFinder. Yellow bars indicate the stability values for the genes in normoxia samples, while the green bars indicate the stability values for the genes in all samples (normoxia and hypoxia). The reference genes are ranked according to the stability rankings in all samples (green bars). The stability of the reference genes in the sample pool increases from left to right for all algorithms.

Figure 6. Normalization (Pfaffl’s method) of the three genes of interest (GOIs) in SK-BR-3 cell line in acute hypoxia (A–C) and chronic hypoxia (D–F) for (A,D) AURKA; (B,E) BUB1 and (C,F) SNAI1.

Table 1. Summary table of candidate reference genes and genes of interest (GOI).

Cell Line	Source	Selected Candidate Reference Genes
Breast cancer various cell lines	Literature	ACTB [26,29,32,34], CCSER2 [35], GAPDH [34], HNRNPL [37], HSP90AB1 [31], PCBP1 [37], PGK1 [30], PPIA [36], PUM1 [28,33,35], RNA18S [26,34], RNA28S [66], RPL13A [27,33], SF3A1 [32,38]
HER2-E tissue samples	TCGA (Novel)	BSG, CFL1, DAD1, EIF5A, GABARAP, NACA, PFN1, PSMB4, RBX1, TPT1, TUBA1B, UBC
Genes of Interest (GOI)	Expression Atlas	AURKA, BUB1, SNAI1

Table 2. Gene Ontology (Biological Process) of candidate reference genes ranked by fold enrichment.

GO ID	GO Term	No. of Genes *	Fold Enrichment	Raw p Value	FDR
GO:0044003	Modulation by symbiont of host process	3	46.10	4.24 × 10⁻⁰⁵	3.05 × 10⁻⁰²
GO:0006090	Pyruvate metabolic process	3	38.31	7.20 × 10⁻⁰⁵	3.68 × 10⁻⁰²
GO:0061418	Regulation of transcription from RNA polymerase II promoter in response to hypoxia	3	34.87	9.43 × 10⁻⁰⁵	4.27 × 10⁻⁰²
GO:0048524	Positive regulation of viral process	3	32.67	7.14 × 10⁻⁰⁶	1.03 × 10⁻⁰²
GO:0019058	Viral life cycle	5	20.33	4.26 × 10⁻⁰⁶	7.52 × 10⁻⁰³
GO:0006417	Regulation of translation	5	12.12	4.90 × 10⁻⁰⁵	3.26 × 10⁻⁰²
GO:0071345	Cellular response to cytokine stimulus	9	7.86	8.38 × 10⁻⁰⁷	3.33 × 10⁻⁰³
GO:0043066	Negative regulation of apoptotic process	7	6.96	4.13 × 10⁻⁰⁵	3.12 × 10⁻⁰²
GO:0006139	Nucleobase containing compound metabolic process	11	3.64	6.05 × 10⁻⁰⁵	3.56 × 10⁻⁰²
GO:0010604	Positive regulation of macromolecule metabolic process	12	3.05	1.33 × 10⁻⁰⁴	5.02 × 10⁻⁰²
GO:0043170	Macromolecule metabolic process	17	2.43	2.16 × 10⁻⁰⁵	2.01 × 10⁻⁰²

* No. of genes indicates the number of genes from the input selected candidate reference genes that are represented by the respective GO term.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Jain, N.; Mitre, I.; Nitisa, D.; Pirsko, V.; Cakstina-Dzerve, I. Identification of Novel Endogenous Controls for qPCR Normalization in SK-BR-3 Breast Cancer Cell Line. Genes 2021, 12, 1631. https://doi.org/10.3390/genes12101631

AMA Style

Jain N, Mitre I, Nitisa D, Pirsko V, Cakstina-Dzerve I. Identification of Novel Endogenous Controls for qPCR Normalization in SK-BR-3 Breast Cancer Cell Line. Genes. 2021; 12(10):1631. https://doi.org/10.3390/genes12101631

Chicago/Turabian Style

Jain, Nityanand, Ingrida Mitre, Dina Nitisa, Valdis Pirsko, and Inese Cakstina-Dzerve. 2021. "Identification of Novel Endogenous Controls for qPCR Normalization in SK-BR-3 Breast Cancer Cell Line" Genes 12, no. 10: 1631. https://doi.org/10.3390/genes12101631

APA Style

Jain, N., Mitre, I., Nitisa, D., Pirsko, V., & Cakstina-Dzerve, I. (2021). Identification of Novel Endogenous Controls for qPCR Normalization in SK-BR-3 Breast Cancer Cell Line. Genes, 12(10), 1631. https://doi.org/10.3390/genes12101631

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Identification of Novel Endogenous Controls for qPCR Normalization in SK-BR-3 Breast Cancer Cell Line

Abstract

1. Introduction

2. Materials and Methods

2.1. TCGA Transcriptomic Analysis for Selection of Novel Candidate Reference Genes

2.2. Gene Ontology (GO)

2.3. Culture and Seeding Conditions

2.4. RNA Extraction and cDNA Synthesis

2.5. Selection of Reference Genes and Primer Design

2.6. Primer Efficiency

2.7. Reverse Transcription Quantitative PCR (RT-qPCR)

2.8. Determination of the Least Variable Reference Genes and Validation in Hypoxic Conditions

3. Results

3.1. TCGA Analysis for Selection of Novel Reference Genes

3.2. Gene Ontology (GO) Over-Representation Analysis

3.3. Grouping of Genes Based on Functional Classification (Panther)

3.4. Candidate Reference Gene Stability in SK-BR-3 Cell Line

3.5. Candidate Reference Gene Stability in Replicate Cultures

3.6. Selection of Reference Genes for Further Validation

3.7. Normalization of Genes of Interest (GOIs)

3.8. Effects of Hypoxia on the Stability of the Selected Reference Genes

3.9. Experimental Validation of Reference Genes in Hypoxic Conditions

4. Discussion

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI