- freely available
Microarrays 2014, 3(2), 137-158; doi:10.3390/microarrays3020137
Published: 17 April 2014
Abstract: Molecular prognostic markers are urgently needed in order to improve therapy decisions in prostate cancer. To better understand the requirements for biomarker studies, we re-analyzed prostate cancer tissue microarray immunohistochemistry (IHC) data from 39 prognosis markers in subsets of 50 – >10,000 tumors. We found a strong association between the “prognostic power” of individual markers and the number of tissues that should be minimally included in such studies. The prognostic relevance of more than 90% of the 39 IHC markers could be detected if ≥6400 tissue samples were analyzed. Studying markers of tissue quality, including immunohistochemistry of ets-related gene (ERG) and vimentin, and fluorescence in-situ hybridization analysis of human epidermal growth factor receptor 2 (HER2), we found that 18% of tissues in our tissue microarray (TMA) showed signs of reduced tissue preservation and limited immunoreactivity. Comparing the results of Kaplan-Meier survival analyses or associations to ERG immunohistochemistry in subsets of tumors with and without exclusion of these defective tissues did not reveal statistically relevant differences. In summary, our study demonstrates that TMA-based marker validation studies using biochemical recurrence as an endpoint require at least 6400 individual tissue samples for establishing statistically relevant associations between the expression of molecular markers and patient outcome if weak to moderate prognosticators should also be reliably identified.
Prostate cancer is the most frequent malignancy in men. The clinical behavior ranges from slowly growing indolent tumors to highly aggressive and metastatic cancers. Based on the results of large autopsy studies demonstrating a high prevalence of prostate cancers also in men who never experienced symptoms of the disease during their lifetime, it is assumed that most prostate cancer patients would be manageable without surgery and its associated side effects . Accordingly, distinguishing between the low malignant and indolent form of the disease that does not require immediate therapy and the aggressive cancers that will eventually progress to life-threatening disease is the clinically most relevant discipline of current prostate cancer research.
Only recently, commercial molecular classifiers have become available. These tests are based on mRNA expression profiling of defined gene sets, allow for estimating the biological aggressiveness of a cancer and, therefore, may aid in therapy decisions [2,3,4]. These classifiers underscore the interest of the diagnostic industry in the topic of prostate cancer prognosticators. It can be expected that future generations of such classifiers can be substantially improved if genes are included that on their own already exhibit strong and independent prognostic power.
During the last decades, a multitude of studies announced prognostic biomarkers for prostate cancer. However, although more than hundred different prognostic markers have been suggested (reviewed in ), none of them has entered clinical routine testing as to yet. This disappointing failure to translate research findings into clinical applications is partly due to the fact that data obtained on virtually all of these markers vary largely between different studies. This is even true for the most established prognostic parameters, such as p53 or phosphatase and tensin homolog (PTEN). More than 50 studies analyzed the impact of p53 alterations on prostate cancer phenotype and prognosis. Although most immunohistochemistry studies reported a link between nuclear p53 accumulation and adverse tumor features, such as high grade, advanced stage, and peripheral zone origin , as well as poor prognosis after radical prostatectomy  or external beam radiation  and unfavorable clinical courses in conservatively managed patients , there are also studies that do not confirm these associations [10,11]. Likewise, genomic deletion of PTEN has been unequivocally linked to adverse tumor features in several studies [12,13,14,15,16], other studies again employing immunohistochemistry reported highly variable results on the prognostic value of PTEN expression. For example, an association between loss of PTEN expression and poor patient prognosis was only found in one  out of four studies [13,17,18,19], and a link between loss of PTEN expression and high Gleason grade or advanced tumor stage was only reported in two [20,21] out of five studies on this topic [18,19,20,21,22].
It is quite obvious that most of the discrepant results in the literature are due to (i) technical issues, and (ii) relatively small patient cohorts used for most studies. It is obvious that different antibodies, staining protocols, and scoring criteria that are employed in most studies can cause massive experimental variation. Due to an intense dispute with a reviewer of one of our manuscripts on the issue whether our frequency of p53 immunostaining in prostate cancer was lower than the 50% suggested by another group due to protocol issues (in our opinion) or to missed heterogeneity in a tissue microarray (TMA) setting (the reviewer’s opinion), we were forced to experimentally demonstrate that the range of p53 positive prostate cancers could be brought from 2.5% to 98% solely by protocol modifications . However, the example of HER2 immunohistochemistry analysis of breast cancer demonstrates that a considerable (but not a complete) degree of assay standardization can be achieved . However, even in such a highly standardized analysis including various controls, the quality of the tissues samples will impact the results. This is due to the fact that postsurgical tissue fixation cannot be fully standardized. The most frequently used fixative, i.e., formalin, causes proteins to cross-link and makes them impassible for microbial degradation or autolysis. The efficiency of the fixation process depends on the proper penetration of the formalin into the tissue, but obviously, the success of penetration greatly depends on the size and the composition of a given tissue. In case of too much or too little fixation, the tissue may not be suitable for analysis. This is a serious problem particularly in immunohistochemistry studies, where lack of immunoreactivity cannot be distinguished from a true negative result due to biological absence of the protein of interest.
The tissue microarray (TMA) technology has proven to be excellently suited for rapid and cost efficient analysis of large numbers of tissue samples . While in studies analyzing conventional large sections the study cohort size was typically limited to less than 100 samples due to the time and costs connected with such “classical” analyses, it is the availability of suitable tissues that first of all limits the size of TMA studies. As a consequence, TMA studies including hundreds of tissue samples are often viewed as “large-scale” analyses. Extent and impact of low-quality tissues that are inevitably included in every large-scale tissue analysis are, however, unknown. In the present study, we took advantage of our very large prostate cancer tissue microarray comprising more than 12,000 tissue spots and molecular data from more than one hundred proteins analyzed by means of immunohistochemistry to better understand the impact of the sample size and the tissue quality on the outcome of TMA studies for marker validation purposes. Biochemical (PSA) recurrence was used as an endpoint in this project dealing with patients having undergone prostatectomy. This reason for this is that PSA recurrence is the “easiest” (most frequent) clinical endpoint to analyze in prostatectomy studies and it is strongly associated with other clinical endpoints such as metastasis or cancer-related death.
2. Experimental Section
Patients. Radical prostatectomy specimens were available from 12,427 patients undergoing surgery between 1992 and 2012 at the Department of Urology and the Martini Clinics at the University Medical Center Hamburg-Eppendorf. Follow-up data were available for a total of 12,344 patients with a median follow-up of 36 months (range of 1 to 241 months; Table 1). Prostate specific antigen (PSA) values were measured following surgery and PSA recurrence was defined as a postoperative PSA of 0.2 ng/mL and increasing at first of appearance. All prostate specimens were analyzed according to a standard procedure, including a complete embedding of the entire prostate for histological analysis . The TMA manufacturing process was described earlier in detail . In short, one 0.6 mm core was taken from a representative tissue block from each patient. The tissues were distributed among 27 TMA blocks, each containing 144 to 522 tumor samples. For internal controls, each TMA block also contained various control tissues, including normal prostate tissue.
TMA Database. The molecular database attached to this TMA contained results on more than 100 molecular markers. For example, we analyzed expression of therapy target genes like epidermal growth factor receptor (EGFR)  and human epidermal growth factor receptor 2 (HER2) , putative prognosticators including p53 [7,27], proliferation marker Ki67 , mammalian target of rapamycin (mTOR) , cluster of differentiation (CD) 10 , serine peptidase inhibitor Kazal type 1 (SPINK1) , karyopherin alpha 2 (KPNA2) , cysteine-rich secretory protein 3 (CRISP3) , nibrin (NBS1) , RNA binding motif protein 3 (RMB3) , and lysophosphatidylcholine acyltransferase 1 (LPCAT1) , mitochondrial content , prostate-specific markers like prostate specific antigen (PSA), prostate specific membrane antigen (PSMA) , alpha-methylacyl-CoA racemase (AMACR), and androgen receptor (AR), microvessel density , or immunological target proteins like CD117 , CD147 , and CD151 , and determined gene copy number alterations of important tumor suppressor loci in prostate cancer, including 8p (lipoprotein lipase, LPL) , 3p13 (forkhead box P1, FOXP1) , 5q21 (chromodomain helicase DNA binding protein 1, CHD1) , 6q15 (mitogen-activated protein kinase kinase kinase 7, MAP3K7) , 10q23 (PTEN) , TMPRSS2:ERG fusion  and PTEN breakage . For this study, we selected 39 different protein markers that predicted patient prognosis if analyzed in our current 12,427 samples TMA or in an earlier version of the TMA comprising 11,156 of the 12,427 samples. Based on the results of previous survival analyses using biochemical (PSA) recurrence as a clinical endpoint, our selection included very strong prognostic markers (e.g., the p53 tumor suppressor, Figure 1a ), markers with only marginal prognostic relevance (e.g., CD147, Figure 1b ), and those with an intermediate prognostic impact (e.g., LPCAT1, Figure 1c ).
Immunohistochemistry (IHC). Freshly cut TMA sections were used for all experiments. IHC analysis was performed using an ETS-related gene (ERG)-specific antibody as described before  and an antibody directed against vimentin to identify tissue samples that might have impaired immunoreactivity. For vimentin detection, freshly cut TMA sections were immunostained on one day and in one experiment. Slides were deparaffinized and exposed to heat-induced antigen retrieval for 5 minutes at 100 °C in pH 6 Tris-EDTA-Citrate buffer. Primary antibody specific for vimentin (mouse monoclonal antibody, DAKO, Glostrup, DK; clone V9; dilution 1:18,000) was applied at 37 °C for 60 minutes. Bound antibody was then visualized using the EnVision Kit (Dako, Glostrup, Denmark) according to the manufacturer’s directions. Presence or absence of ERG and vimentin staining was recorded in all tissue spots.
|Table 1. Composition of the prognosis tissue microarray containing 12,427 prostate cancer specimens. The percentage in the column “Study cohort on tissue microarray (TMA)” refers to the fraction of samples across each category. The percentage in column “Biochemical relapse among categories” refers to the fraction of samples with biochemical relapse within each parameter in the different categories. pT, pathological tumor stage; pN, pathological lymph node stage.|
|Parameter||No. of patients (%)|
|Study cohort on TMA||Biochemical relapse among categories|
|(n = 12,427)|
|n||11,665 (93.9%)||2769 (23.7%)|
|≤50||334 (2.7%)||81 (24.3%)|
|51–59||3061 (24.8%)||705 (23%)|
|60–69||7188 (58.2%)||1610 (22.4%)|
|≥70||1761 (14.3%)||370 (21%)|
|Pretreatment prostate specific antigen (PSA) (ng/mL)|
|<4||1585 (12.9%)||242 (15.3%)|
|4–10||7480 (60.9%)||1355 (18.1%)|
|10–20||2412 (19.6%)||737 (30.6%)|
|>20||812 (6.6%)||397 (48.9%)|
|pT category (AJCC 2002)|
|pT2||8187 (66.2%)||1095 (13.4%)|
|pT3a||2660 (21.5%)||817 (30.7%)|
|pT3b||1465 (11.8%)||796 (54.3%)|
|pT4||63 (0.5%)||51 (81%)|
|≤3 + 3||2983 (24.1%)||368 (12.3%)|
|3 + 4||6945 (56.2%)||1289 (18.6%)|
|4 + 3||1848 (15%)||788 (42.6%)|
|≥4 + 4||584 (4.7%)||311 (53.3%)|
|pN0||6970 (91%)||1636 (23.5%)|
|pN+||693 (9%)||393 (56.7%)|
|Negative||9990 (81.9%)||1848 (18.5%)|
|Positive||2211 (18.1%)||853 (38.6%)|
Numbers do not always add up to 12,427 in the different categories because of cases with missing data. Abbreviation: AJCC, American Joint Committee on Cancer.
Fluorescence in situ hybridization (FISH). A 4-μm TMA section was used for two-color FISH. For proteolytic slide pretreatment, a commercial kit was used (Paraffin pretreatment reagent kit, Vysis). A Spectrum-Orange–labeled HER2 probe was used together with a Spectrum-Green–labeled centromere 17 probe (PathVysion; Abbott Molecular). Before hybridization, TMA sections were de-paraffinized, air dried, and dehydrated in 70%, 85%, and 100% ethanol followed by denaturation for 5 minutes at 74 °C in 70% formamide-2× SSC solution. Following overnight hybridization at 37 °C in a humidified chamber, slides were washed and counterstained with 0.2 μmol/L 4',6-diamidino-2-phenylindole, an antifade solution. Presence or absence of red and green FISH signals was recorded in all tissue spots.
Statistics. Statistical calculations were performed with JPM 9 (JMP®, Version 9. SAS Institute Inc., Cary, NC, USA, 1989–2007) Contingency tables and the chi²-test were performed to search for associations between molecular parameters and tumor phenotype. Survival curves were calculated according to Kaplan‑Meier. The Log-Rank test was applied to detect significant differences between groups.
3. Results and Discussion
3.1. Impact of the Tissue Quality
In order to identify tissues with poor immunoreactivity, we performed ERG and vimentin immunohistochemistry analysis of our TMA. These proteins are expressed in virtually every human tissue. ERG is a member of the E26 transformation-specific (ETS) transcription factor family that is expressed in endothelial cells. ERG had been extensively studied in our TMA before since it is strongly expressed in about 50% of prostate cancers , and has been linked to early onset prostate cancer . For the purpose of identifying low quality tissues, we re-analyzed our large 12,427 samples prostate cancer TMA for ERG expression specifically in endothelial cells (Figure 2a). In addition, we stained the TMA for vimentin, a type III intermediate filament that is strongly expressed in mesenchymal cells, which typically accompany prostate cancer cells (Figure 2b). Since blood vessels and mesenchymal cells can be found in virtually every prostate tissue sample, we considered complete absence of vimentin and ERG staining as an indicator of impaired immunoreactivity. In addition, we took advantage of the results from an earlier study, where we demonstrated that low-quality tissues with impaired immunoreactivity also showed a poor performance in fluorescence in-situ hybridization (FISH) analysis of gene copy numbers . In the present study, we performed HER2 FISH analysis on the TMA and considered absence of FISH signals as an additional indicator of poor tissue quality. In summary, all tissue spots that showed simultaneous lack of ERG and vimentin immunostaining and absence of HER2 FISH signals were considered “low-quality”.
A total of 11,223 tissue spots was included in this analysis. The remaining tissue spots were excluded because they were severely damaged or absent in the TMA slides. Simultaneous lack of ERG, vimentin, and HER2 signals were found in 2056 (18.3%) of the analyzed tissue spots. These “low-quality” tissues were randomly distributed across the TMA, and there was no obvious association between tissue reactivity and tumor phenotype or patient outcome (Figure 3). The marginally significant p-values obtained in these analyses do not reflect true associations but are attributable to slight variations between the groups.
To further investigate the performance of these “low-quality” spots in IHC experiments, we next compared them to staining patterns of our 39 IHC markers in subsets of 2000 “low-quality” and 2000 “high-quality” tissues (i.e., samples that were positive for all of ERG, vimentin and HER2). This analysis revealed that, although the “low-quality” tissues can be stained with most of the tested antibodies, there was a average reduction of about 12 percent points in the fraction of positive tissue samples across all of these markers in “low-quality” tissues as compared to “high-quality” tissues (Figure 4). These date demonstrate, that “low-quality” tissues bear a high risk to underestimate the true expression level and may even result in false negative findings.
These findings imply that problems may arise if it comes to comparisons between biomarkers analyzed by immunohistochemistry. In such a scenario, false positive associations can potentially occur if the level of immunostaining of the analyzed markers parallels the quality of the tissue in a relevant fraction of samples. In contrast, inverse associations must always be considered valid. Here, the same tissues that are negative for one marker stain positive for the other, thus excluding the possibility of false associations due to reduced immunoreactivity. To assess the potential impact of “low-quality” tissues on the reliability of associations between ERG and other IHC markers, we used our existing ERG IHC data , which showed a positive result in about 50% of cancers. We studied the associations of all 39 markers to ERG expression, including markers with strong associations to ERG positivity (e.g., Marker #24, Figure 5a), markers with strong associations to ERG negativity (e.g., Marker #34, Figure 5c), markers with weak associations to ERG positivity (e.g., #12 (MTC02), Figure 5b), and markers lacking such associations (e.g., Marker #32, Figure 5d). Particularly for the latter set of markers, it could be possible that “low-quality” tissues drive such weak associations. All analyses were performed in differently sized subsets of our large TMA, and the significance of associations was compared between tissue sets containing both “low” and “high”‑quality tissues and tissue sets after excluding the 2056 “low-quality” tissues from the data. Since statistical associations will become stronger the more samples are analyzed, we performed the analyses in randomly selected subsets of 1600, 3200, 6400, and 10,000 samples. To compensate for incidental findings that might arise from random subset selection, we repeated each analysis five times. The Log-rank chi2 p-value was recorded from each analysis, and the average Log‑rank chi2 p-value was calculated from the five repeated analyses. All results are shown in Table 2. Following the same analysis strategy, we also questioned whether the reduced immunoreactivity in the “low-quality” tissues impacted the outcome of prognosis associations. For this analysis, we selected five of our 39 prognostic markers set and performed Kaplan-Meier survival plots to compare the impact of these markers (using biochemical (PSA) recurrence as clinical endpoint) before and after exclusion of the 2056 “low-quality” tissues. All results are shown in Table 3, and examples of Kaplan-Meier plots are given in Figure 4.
In both sets of calculations, we did not observe changes in the analysis results, regardless if the “low-quality” tissue was excluded from the analysis or not, demonstrating that the ≈20% “low-quality” tissues present in our TMA did not significantly impact the study outcome. Nevertheless, the examples of associations with ERG expression given in Figure 5 confirm that the fraction of entirely negative samples can be slightly overestimated unless the “low-quality” tissues are exclude, as indicated by a difference of five percent points between tumors with a negative result for both Marker #24 and ERG in subsets of cancers before and after exclusion of the “low-quality” tissues. However, the finding that even positive associations resulting from discrete expression differences remained significant after exclusion of the “low-quality” tissues (Figure 5b) clearly demonstrates that associations between different markers can be reliably detected in large-scale TMA studies. Since “low-quality” tissues were randomly distributed across all samples irrespective of the clinical course (Figure 3), it was not surprising that there was no difference in the ability to detect prognostic differences in tissues with or without “low-quality” tissues. Here, the “underestimation” of the true staining intensity resulted in a smooth shift of all survival curves either towards an overall better prognosis (i.e., if strong expression of the marker was linked to poor prognosis), or towards worse prognosis (i.e., if strong expression of the marker was linked to good prognosis), whereas the relative distance between the curves remained largely constant. However, this analysis also suggested that the number of samples included in marker validation analyses might have a much stronger impact on the analysis result than the tissue quality, thus prompting us to analyze the impact of the sample size in more detail below.
|Table 2. Impact of the tissue quality on the association between expression of ERG and other IHC markers. The chi2 p-values are given for survival analyses in subsets of 1600–10,000 tissue spots. “Low quality tissue” indicates whether tissues with impaired immunoreactivity were excluded from analysis or not (included). “Association strength” separates the markers into those with weak, moderate, or strong positive associations (i.e., the marker is more frequently expressed in ERG positive than in ERG negative cancers), those with inverse associations (i.e., the marker is more frequently expressed in ERG negative than in ERG positive cancers), and those that are unrelated to ERG (no association).|
|Marker||Low quality tissue||Number of analyzed tissue spots||Association strength|
|Marker #32||included||0.1039||0.5243||0.1879||0.0952||No association|
|Marker #12 (MTC02)||included||<0.0001||<0.0001||<0.0001||<0.0001||Weak|
|Marker #2 (CD10)||included||<0.0001||<0.0001||<0.0001||<0.0001||Moderate|
|Marker #39 (p53)||included||<0.0001||<0.0001||<0.0001||<0.0001||Moderate|
|Marker #16 (NBS1)||included||<0.0001||<0.0001||<0.0001||<0.0001||Moderate|
|Marker #18 (AR)||included||<0.0001||<0.0001||<0.0001||<0.0001||Moderate|
|Marker #36 (KPNA2)||included||<0.0001||<0.0001||<0.0001||<0.0001||Moderate|
|Marker #6 (FOXP2)||included||<0.0001||<0.0001||<0.0001||<0.0001||Strong|
|Marker #15 (LPCAT)||included||<0.0001||<0.0001||<0.0001||<0.0001||Strong|
|Marker #19 (RBM3)||included||<0.0001||<0.0001||<0.0001||<0.0001||Strong|
|Marker #1 (CD147)||included||<0.0001||<0.0001||<0.0001||<0.0001||Inverse|
|Table 3. Impact of the tissue quality on the outcome of Kaplan-Meier survival analysis. The chi2p-values are given for survival analyses in subsets of 1600–10,000 tissue spots. “Low quality tissue” indicates whether tissues with impaired immunoreactivity were excluded from analysis or not (included).|
|Marker||Low quality tissue||Number of analyzed tissue spots|
3.2. Impact of the Sample Size
In order to estimate the minimal sample size that is required to yield statistically stable results in prostate cancer prognosis marker validation studies, we carried out serial analyses in randomly selected subsets of 50, 100, 200, 400, 800, 1600, 3200, 6400 and all (12,427) samples included in our TMA. We performed Kaplan-Meier survival plots and Log-rank chi2 tests including a total of 39 protein markers with confirmed prognostic relevance from our molecular database. The smallest sample set that revealed a Log-rank p-value of 0.001 or less was considered to be sufficient for reliable marker analysis provided that this significance level held also true in the analysis of all larger sample sets. In addition, in order to rank the “prognostic power” of our 39 markers, we summarized the Log-rank values emerging from all subset analyses of each marker. This strategy was selected because chi2 values can be easily extracted from all tests and thus provide a simple, however objective, index of the power of individual markers. We grouped our markers according to the accumulated chi2 values into markers with “weak” (sum of all chi2 values <100), “moderate” (sum chi2 101–299), and “strong” prognostic power (sum chi2 ≥300). The Log-rank p-values for all markers in each sample subset and the accumulated Log-rank chi2 values per marker are shown in Table 4, and exemplary Kaplan-Meier plots are given in Figure 6.
The results of this analysis first of all demonstrate a close relationship between the “prognostic power” of a marker and the numbers of samples that need to be analyzed in order to reliably evaluate the marker’s prognostic potential (Figure 7 and Table 4). Given that the power of a marker of interest is typically not known before the analysis is performed (particularly in case of novel and uncharacterized candidate markers), and that four markers revealed prognostic relevance only if the entire sample set was analyzed (Table 4), our findings imply that as many samples as possible should be included in such marker validation experiments in order to also reliably detect minor associations between prostate cancer genotype and clinical behavior. However, from a more practical point of view, our data also demonstrates that a cohort size of 6400 prostate cancers is sufficient to reproduce the prognostic value of the vast majority (i.e., 35 out of 39, 90%) of the markers included in our study.
|Table 4. Impact of the sample size on the outcome of Kaplan-Meier survival analyses. The chi2p-values are given for survival analyses in subsets of 50–12,427 tissue spots. “n analyzable” gives the number of interpretable tissue spots if the entire TMA was analyzed. “Marker power” indicates the relative prognostic power as described in the results section. Bold face indicates the minimal sample set that was considered sufficient to evaluate the respective marker. Grey color indicates sample sizes that yielded strong prognostic relevance.|
|Marker||n analy-zable||Number of analyzed tissue spots||Marker Power|
|Marker #1 (CD147)||7605||0.2417||0.4218||0.4550||0.4984||0.5396||0.3539||0.3428||0.0379||0.0019||weak|
|Marker #6 (FOXP2)||8284||0.7963||0.4436||0.0082||0.4058||0.4309||0.0776||0.0308||0.0011||<0.0001||weak|
|Marker #2 (CD10)||8488||0.0722||0.5977||0.6268||0.2465||0.3409||0.5137||0.0001||0.0062||0.0012||weak|
|Marker #12 (MTC02)||8407||0.4313||0.9545||0.7656||0.9315||0.3693||0.2517||0.0013||0.0004||<0.0001||weak|
|Marker #16 (NBS1)||8026||0.6506||0.0525||0.2071||0.7379||0.8382||0.2079||0.0004||0.0026||<0.0001||weak|
|Marker #15 (LPCAT)||8762||0.7141||0.1713||0.5933||0.6978||0.2428||0.0449||0.0020||0.0003||<0.0001||weak|
|Marker #18 (AR)||7856||0.4576||0.0246||0.9451||0.6439||0.7573||0.3058||<0.0001||0.0034||<0.0001||moderate|
|Marker #19 (RBM3)||8303||0.4712||0.1035||0.4893||0.2494||0.0134||0.0658||0.0006||<0.0001||<0.0001||moderate|
|Marker #36 (KPNA2)||7943||0.1490||0.3803||0.0483||0.0044||<0.0001||<0.0001||<0.0001||<0.0001||<0.0001||strong|
|Marker #39 (p53)||10946||0.2686||0.1962||0.0192||<0.0001||0.0055||<0.0001||<0.0001||<0.0001||<0.0001||strong|
Importantly, only two (5%) of the 39 markers in our study, including p53 as a prime example of a very strong prognostic marker , had sufficient prognostic power to allow for conclusive results also in small cohorts including less than 500 prostate cancers. This finding is of particular interest since the majority of prostate cancer marker studies still analyze less than 500 cancer samples . Our findings provide a simple explanation for the highly discrepant results on most potential prognostic biomarkers. We also found significant associations in less than 500 samples that, however, did not hold true in the next larger subsets and must, therefore, be considered incidental. For example, Marker #7 revealed significant p-values in subsets of 50 and 200 samples, but not in subsets of 100, 400, or even 3200 samples, demonstrating that analysis of small subsets can occasionally lead to incidental statistical results.
It is further of note that there is always a considerable fraction of samples that does not yield interpretable results. This is due to typical TMA-related issues, including exhausted tissue cores resulting in empty spots in the TMA section or lack of tumor cells in the tissue spot. In our study, the fraction of non-interpretable tissue cores was about 35%, independent from the size of the subset selected for analysis (Figure 8a). As a consequence, the number of interpretable samples varied between 6494 and 10,946 (average 8592) spots for the different markers in the entire dataset (n = 12,427) and averaged for example 33.4 cancers in the 50 samples subset, 1019.7 cancers in the 1600 samples subset, or 4065 cancers in the 6400 samples subset (Figure 8b). Therefore, a certain dropout rate of TMA spots should always be taken into account if a TMA is built. The fraction of interpretable samples can potentially be increased if multiple samples of the same cancer specimen are included in the TMA. We do not recommend this procedure, however. For example, building a 6000‑samples TMA from 2000 cancers with three spots from each cancer can be expected to result in about 1800 interpretable cancers (which still is too small a number for reliable statistical analysis according to our data), but is connected with the same costs, analysis time, and tissue consumption as compared to a 6000 samples TMA built from one punch per tumor, which will, however, result in about 3900 interpretable cancers. In addition, analysis of multiple cores always introduces a statistical bias into the analysis. This is because not all of the multiple tissue spots per tumor will be analyzable, and tumors with three to four interpretable spots might have a higher likelihood to detect positive staining as compared to tumors with only one to two interpretable tissue spots.
The availability of a very large prostate cancer prognosis TMA with an extensive molecular database, including samples from more than 12,000 individual prostate cancers as well as molecular data from 39 prognostic relevant protein markers enabled us to evaluate the impact of qualitative and quantitative factors for prostate cancer biomarker studies. The results of our analyses suggest that such studies should aim at the analysis of at least 6000 individual prostate cancer samples to obtain reliable statistical findings allowing for a concluding judgment of a potential prognostic value of a marker of interest. Only for particularly strong markers, reliable results can also be obtained from substantially smaller cohorts. However, very strong prognostic markers appear to be rare, and the power of a marker is often not known before the analysis is made. Our data further suggest that almost 20% of the tissues included in a prostate cancer TMA may have limited tissue reactivity, potentially compromising the results of some analyses. While there is no impact of tissue reactivity on the results of prognostic studies, this issue is more relevant if it comes to comparisons between biomarkers analyzed by immunohistochemistry. Our data suggest, however, that even such associations that result from only discrete expression differences can be reliably identified in large-scale TMA analyses.
We thank Janett Lüttgens, Sünje Seekamp, Inge Brandt, Silvia Schnöger and Sascha Eghtessadi for excellent technical assistance.
CB, KG, SS, CW, SM, MCT analyzed all immunostainings. MK analyzed FISH. AM and CB performed statistical analyses. GS, TS, and RS wrote the manuscript.
Conflicts of Interest
The authors declare no conflict of interest.
- Graefen, M.; Ahyai, S.; Heuer, R.; Salomon, G.; Schlomm, T.; Isbarn, H.; Budaus, L.; Heinzer, H.; Huland, H. Active surveillance for prostate cancer. Urologe A 2008, 47, 261–269, doi:10.1007/s00120-008-1638-0.
- Cooperberg, M.R.; Simko, J.P.; Cowan, J.E.; Reid, J.E.; Djalilvand, A.; Bhatnagar, S.; Gutin, A.; Lanchbury, J.S.; Swanson, G.P.; Stone, S.; et al. Validation of a cell-cycle progression gene panel to improve risk stratification in a contemporary prostatectomy cohort. J. Clin. Oncol. 2013, 31, 1428–1434, doi:10.1200/JCO.2012.46.4396.
- Cuzick, J.; Swanson, G.P.; Fisher, G.; Brothman, A.R.; Berney, D.M.; Reid, J.E.; Mesher, D.; Speights, V.O.; Stankiewicz, E.; Foster, C.S.; et al. Prognostic value of an RNA expression signature derived from cell cycle proliferation genes in patients with prostate cancer: A retrospective study. Lancet Oncol. 2011, 12, 245–255, doi:10.1016/S1470-2045(10)70295-3.
- Badani, K.; Thompson, D.J.; Buerki, C.; Davicioni, E.; Garrison, J.; Ghadessi, M.; Mitra, A.P.; Wood, P.J.; Hornberger, J. Impact of a genomic classifier of metastatic risk on postoperative treatment recommendations for prostate cancer patients: A report from the decide study group. Oncotarget 2013, 4, 600–609.
- Schlomm, T.; Erbersdobler, A.; Mirlacher, M.; Sauter, G. Molecular staging of prostate cancer in the year 2007. World J. Urol. 2007, 25, 19–30, doi:10.1007/s00345-007-0153-z.
- Zellweger, T.; Ninck, C.; Bloch, M.; Mirlacher, M.; Koivisto, P.A.; Helin, H.J.; Mihatsch, M.J.; Gasser, T.C.; Bubendorf, L. Expression patterns of potential therapeutic targets in prostate cancer. Int. J. Canc. 2005, 113, 619–628, doi:10.1002/ijc.20615.
- Schlomm, T.; Iwers, L.; Kirstein, P.; Jessen, B.; Kollermann, J.; Minner, S.; Passow-Drolet, A.; Mirlacher, M.; Milde-Langosch, K.; Graefen, M.; et al. Clinical significance of p53 alterations in surgically treated prostate cancers. Mod. Pathol. 2008, 21, 1371–1379, doi:10.1038/modpathol.2008.104.
- Vergis, R.; Corbishley, C.M.; Thomas, K.; Horwich, A.; Huddart, R.; Khoo, V.; Eeles, R.; Sydes, M.R.; Cooper, C.S.; Dearnaley, D.; Parker, C. Expression of Bcl-2, p53, and MDM2 in localized prostate cancer with respect to the outcome of radical radiotherapy dose escalation. Int. J. Radiat. Oncol. Biol. Phys. 2010, 78, 35–41, doi:10.1016/j.ijrobp.2009.07.1728.
- Kudahetti, S.; Fisher, G.; Ambroisine, L.; Foster, C.; Reuter, V.; Eastham, J.; Moller, H.; Kattan, M.W.; Cooper, C.S.; Scardino, P.; Cuzick, J.; Berney, D.M. P53 immunochemistry is an independent prognostic marker for outcome in conservatively treated prostate cancer. BJU Int. 2009, 104, 20–24, doi:10.1111/j.1464-410X.2009.08407.x.
- Uzoaru, I.; Rubenstein, M.; Mirochnik, Y.; Slobodskoy, L.; Shaw, M.; Guinan, P. An evaluation of the markers p53 and Ki-67 for their predictive value in prostate cancer. J. Surg. Oncol. 1998, 67, 33–37, doi:10.1002/(SICI)1096-9098(199801)67:1<33::AID-JSO7>3.0.CO;2-N.
- Incognito, L.S.; Cazares, L.H.; Schellhammer, P.F.; Kuban, D.A.; van Dyk, E.O.; Moriarty, R.P.; Wright, G.L., Jr.; Somers, K.D. Overexpression of p53 in prostate carcinoma is associated with improved overall survival but not predictive of response to radiotherapy. Int. J. Oncol. 2000, 17, 761–769.
- Han, B.; Mehra, R.; Lonigro, R.J.; Wang, L.; Suleman, K.; Menon, A.; Palanisamy, N.; Tomlins, S.A.; Chinnaiyan, A.M.; Shah, R.B. Fluorescence in situ hybridization study shows association of PTEN deletion with erg rearrangement during prostate cancer progression. Mod. Pathol. 2009, 22, 1083–1093, doi:10.1038/modpathol.2009.69.
- McCall, P.; Witton, C.J.; Grimsley, S.; Nielsen, K.V.; Edwards, J. Is pten loss associated with clinical outcome measures in human prostate cancer? Br. J. Canc. 2008, 99, 1296–1301, doi:10.1038/sj.bjc.6604680.
- Sircar, K.; Yoshimoto, M.; Monzon, F.A.; Koumakpayi, I.H.; Katz, R.L.; Khanna, A.; Alvarez, K.; Chen, G.; Darnel, A.D.; Aprikian, A.G.; et al. TEN genomic deletion is associated with p-akt and ar signalling in poorer outcome, hormone refractory prostate cancer. J. Pathol. 2009, 218, 505–513, doi:10.1002/path.2559.
- Yoshimoto, M.; Cunha, I.W.; Coudry, R.A.; Fonseca, F.P.; Torres, C.H.; Soares, F.A.; Squire, J.A. Fish analysis of 107 prostate cancers shows that PTEN genomic deletion is associated with poor clinical outcome. Br. J. Canc. 2007, 97, 678–685, doi:10.1038/sj.bjc.6603924.
- Reid, A.H.; Attard, G.; Ambroisine, L.; Fisher, G.; Kovacs, G.; Brewer, D.; Clark, J.; Flohr, P.; Edwards, S.; Berney, D.M.; et al. Molecular characterisation of ERG, ETV1 and PTEN gene loci identifies patients at low and high risk of death from prostate cancer. Br. J. Canc. 2010, 102, 678–684, doi:10.1038/sj.bjc.6605554.
- Koumakpayi, I.H.; Le Page, C.; Mes-Masson, A.M.; Saad, F. Hierarchical clustering of immunohistochemical analysis of the activated ErbB/PI3K/Akt/NF-kappaB signalling pathway and prognostic significance in prostate cancer. Br. J. Canc. , 102, 1163–1173.
- Bedolla, R.; Prihoda, T.J.; Kreisberg, J.I.; Malik, S.N.; Krishnegowda, N.K.; Troyer, D.A.; Ghosh, P.M. Determining risk of biochemical recurrence in prostate cancer by immunohistochemical detection of PTEN expression and akt activation. Clin. Canc. Res. 2007, 13, 3860–3867, doi:10.1158/1078-0432.CCR-07-0091.
- Osman, I.; Dai, J.; Mikhail, M.; Navarro, D.; Taneja, S.S.; Lee, P.; Christos, P.; Shen, R.; Nanus, D.M. Loss of neutral endopeptidase and activation of protein kinase b (AKT) is associated with prostate cancer progression. Cancer 2006, 107, 2628–2636, doi:10.1002/cncr.22312.
- McMenamin, M.E.; Soung, P.; Perera, S.; Kaplan, I.; Loda, M.; Sellers, W.R. Loss of PTEN expression in paraffin-embedded primary prostate cancer correlates with high gleason score and advanced stage. Canc. Res. 1999, 59, 4291–4296.
- Bertram, J.; Peacock, J.W.; Fazli, L.; Mui, A.L.; Chung, S.W.; Cox, M.E.; Monia, B.; Gleave, M.E.; Ong, C.J. Loss of PTEN is associated with progression to androgen independence. Prostate 2006, 66, 895–902, doi:10.1002/pros.20411.
- Thomas, G.V.; Horvath, S.; Smith, B.L.; Crosby, K.; Lebel, L.A.; Schrage, M.; Said, J.; de Kernion, J.; Reiter, R.E.; Sawyers, C.L. Antibody-based profiling of the phosphoinositide 3-kinase pathway in clinical prostate cancer. Clin. Canc. Res. 2004, 10, 8351–8356, doi:10.1158/1078-0432.CCR-04-0130.
- Sauter, G.; Lee, J.; Bartlett, J.M.; Slamon, D.J.; Press, M.F. Guidelines for human epidermal growth factor receptor 2 testing: Biologic and methodologic considerations. J. Clin. Oncol. 2009, 27, 1323–1333, doi:10.1200/JCO.2007.14.8197.
- Kononen, J.; Bubendorf, L.; Kallioniemi, A.; Barlund, M.; Schraml, P.; Leighton, S.; Torhorst, J.; Mihatsch, M.J.; Sauter, G.; Kallioniemi, O.P. Tissue microarrays for high-throughput molecular profiling of tumor specimens. Nat. Med. 1998, 4, 844–847, doi:10.1038/nm0798-844.
- Schlomm, T.; Kirstein, P.; Lwers, L.; Daniel, B.; Steuber, T.; Walz, J.; Chun, F.H.K.; Haese, A.; Kollermann, J.; Graefen, M.; et al. Clinical significance of epidermal growth factor receptor protein overexpression and gene copy number gains in prostate cancer. Clin. Canc. Res. 2007, 13, 6579–6584, doi:10.1158/1078-0432.CCR-07-1257.
- Minner, S.; Jessen, B.; Stiedenroth, L.; Burandt, E.; Kollermann, J.; Mirlacher, M.; Erbersdobler, A.; Eichelberg, C.; Fisch, M.; Brummendorf, T.H.; et al. Low level HER2 overexpression is associated with rapid tumor cell proliferation and poor prognosis in prostate cancer. Clin. Canc. Res. 2010, 16, 1553–1560, doi:10.1158/1078-0432.CCR-09-2546.
- Kluth, M.; Harasimowicz, S.; Burkhardt, L.; Grupp, K.; Krohn, A.; Prien, K.; Gjoni, J.; Hass, T.; Galal, R.; Graefen, M.; et al. Clinical significance of different types of p53 gene alteration in surgically treated prostate cancer. Int. J. Canc. 2014, doi:10.1002/ijc.28784.
- Muller, J.; Ehlers, A.; Burkhardt, L.; Sirma, H.; Steuber, T.; Graefen, M.; Sauter, G.; Minner, S.; Simon, R.; Schlomm, T.; Michl, U. Loss of p(ser2448)-mTOR expression is linked to adverse prognosis and tumor progression in erg-fusion-positive cancers. Int. J. Canc. 2013, 132, 1333–1340, doi:10.1002/ijc.27768.
- Fleischmann, A.; Schlomm, T.; Huland, H.; Kollermann, J.; Simon, P.; Mirlacher, M.; Salomon, G.; Chun, F.H.; Steuber, T.; Simon, R.; et al. Distinct subcellular expression patterns of neutral endopeptidase (CD10) in prostate cancer predict diverging clinical courses in surgically treated patients. Clin. Canc. Res. 2008, 14, 7838–7842, doi:10.1158/1078-0432.CCR-08-1432.
- Grupp, K.; Diebel, F.; Sirma, H.; Simon, R.; Breitmeyer, K.; Steurer, S.; Hube-Magg, C.; Prien, K.; Pham, T.; Weigand, P.; et al. SPINK1 expression is tightly linked to 6q15- and 5q21-deleted erg-fusion negative prostate cancers but unrelated to psa recurrence. Prostate 2013, 73, 1690–1698.
- Grupp, K.; Habermann, M.; Sirma, H.; Simon, R.; Steurer, S.; Hube-Magg, C.; Prien, K.; Burkhardt, L.; Jedrzejewska, K.; Salomon, G.; et al. High nuclear karyopherin alpha 2 expression is a strong and independent predictor of biochemical recurrence in prostate cancer patients treated by radical prostatectomy. Mod. Pathol. 2014, 27, 96–106, doi:10.1038/modpathol.2013.127.
- Grupp, K.; Kohl, S.; Sirma, H.; Simon, R.; Steurer, S.; Becker, A.; Adam, M.; Izbicki, J.; Sauter, G.; Minner, S.; Schlomm, T.; Tsourlakis, M.C. Cysteine-rich secretory protein 3 overexpression is linked to a subset of PTEN-deleted ERG fusion-positive prostate cancers with early biochemical recurrence. Mod. Pathol. 2013, 26, 733–742, doi:10.1038/modpathol.2012.206.
- Grupp, K.; Boumesli, R.; Tsourlakis, M.C.; Koop, C.; Wilczak, W.; Adam, M.; Sauter, G.; Simon, R.; Izbicki, J.R.; Graefen, M.; et al. The prognostic impact of high nijmegen breakage syndrome (NBS1) gene expression in erg negative prostate cancers lacking PTEN deletion is driven by kpna2 expression. Int. J. Canc. 2014, doi:10.1002/ijc.28778.
- Grupp, K.; Wilking, J.; Prien, K.; Hube-Magg, C.; Sirma, H.; Simon, R.; Steurer, S.; Budaus, L.; Haese, A.; Izbicki, J.; et al. High RNA-binding motif protein 3 expression is an independent prognostic marker in operated prostate cancer and tightly linked to erg activation and pten deletions. Eur. J. Canc. 2014, 50, 852–861, doi:10.1016/j.ejca.2013.12.003.
- Grupp, K.; Sanader, S.; Sirma, H.; Simon, R.; Koop, C.; Prien, K.; Hube-Magg, C.; Salomon, G.; Graefen, M.; Heinzer, H.; et al. High lysophosphatidylcholine acyltransferase 1 expression independently predicts high risk for biochemical recurrence in prostate cancers. Mol. Oncol. 2013, 7, 1001–1011, doi:10.1016/j.molonc.2013.07.009.
- Grupp, K.; Jedrzejewska, K.; Tsourlakis, M.C.; Koop, C.; Wilczak, W.; Adam, M.; Quaas, A.; Sauter, G.; Simon, R.; Izbicki, J.R.; et al. High mitochondria content is associated with prostate cancer disease progression. Mol. Canc. 2013, 12, 145, doi:10.1186/1476-4598-12-145.
- Minner, S.; Wittmer, C.; Graefen, M.; Salomon, G.; Steuber, T.; Haese, A.; Huland, H.; Bokemeyer, C.; Yekebas, E.; Dierlamm, J.; et al. High level PSMA expression is associated with early psa recurrence in surgically treated prostate cancer. Prostate 2011, 71, 281–288, doi:10.1002/pros.21241.
- Erbersdobler, A.; Isbarn, H.; Dix, K.; Steiner, I.; Schlomm, T.; Mirlacher, M.; Sauter, G.; Haese, A. Prognostic value of microvessel density in prostate cancer: A tissue microarray study. World J. Urol. 2010, 28, 687–692, doi:10.1007/s00345-009-0471-4.
- Fleischmann, A.; Schlomm, T.; Kollermann, J.; Sekulic, N.; Huland, H.; Mirlacher, M.; Sauter, G.; Simon, R.; Erbersdobler, A. Immunological microenvironment in prostate cancer: High mast cell densities are associated with favorable tumor characteristics and good prognosis. Prostate 2009, 69, 976–981, doi:10.1002/pros.20948.
- Grupp, K.; Hohne, T.S.; Prien, K.; Hube-Magg, C.; Tsourlakis, M.C.; Sirma, H.; Pham, T.; Heinzer, H.; Graefen, M.; Michl, U.; et al. Reduced CD147 expression is linked to ERG fusion-positive prostate cancers but lacks substantial impact on psa recurrence in patients treated by radical prostatectomy. Exp. Mol. Pathol. 2013, 95, 227–234, doi:10.1016/j.yexmp.2013.08.002.
- Minner, S.; de Silva, C.; Rink, M.; Dahlem, R.; Chun, F.; Fisch, M.; Hoppner, W.; Wagner, W.; Bokemeyer, C.; Terracciano, L.; et al. Reduced CD151 expression is related to advanced tumour stage in urothelial bladder cancer. Pathology 2012, 44, 448–452, doi:10.1097/PAT.0b013e32835576ee.
- El Gammal, A.T.; Bruchmann, M.; Zustin, J.; Isbarn, H.; Hellwinkel, O.J.; Kollermann, J.; Sauter, G.; Simon, R.; Wilczak, W.; Schwarz, J.; et al. Chromosome 8p deletions and 8q gains are associated with tumor progression and poor prognosis in prostate cancer. Clin. Canc. Res. 2010, 16, 56–64, doi:10.1158/1078-0432.CCR-09-1423.
- Krohn, A.; Seidel, A.; Burkhardt, L.; Bachmann, F.; Grupp, K.; Becker, A.; Adam, M.; Graefen, M.; Huland, H.; Steurer, S.; et al. Recurrent deletion of 3p13 targets multiple tumor suppressor genes and defines a distinct subgroup of aggressive erg fusion positive prostate cancers. J. Pathol. 2013, 231, 130–141, doi:10.1002/path.4223.
- Burkhardt, L.; Fuchs, S.; Krohn, A.; Masser, S.; Mader, M.; Kluth, M.; Bachmann, F.; Huland, H.; Steuber, T.; Graefen, M.; et al. CHD1 is a 5q21 tumor suppressor required for erg rearrangement in prostate cancer. Canc. Res. 2013, 73, 2795–2805.
- Kluth, M.; Hesse, J.; Heinl, A.; Krohn, A.; Steurer, S.; Sirma, H.; Simon, R.; Schumacher, U.; Grupp, K.; Izbicki, J.; et al. Genomic deletion of MAP3K7 at 6q12–22 is associated with early PSA recurrence in prostate cancer and absence of TMPRSS2:ERG fusions. Mod. Pathol. 2013, 26, 975–983, doi:10.1038/modpathol.2012.236.
- Krohn, A.; Diedler, T.; Burkhardt, L.; Mayer, P.S.; De Silva, C.; Meyer-Kornblum, M.; Kotschau, D.; Tennstedt, P.; Huang, J.; Gerhauser, C.; et al. Genomic deletion of PTEN is associated with tumor progression and early PSA recurrence in erg fusion-positive and fusion-negative prostate cancer. Am. J. Pathol. 2012, 181, 401–412, doi:10.1016/j.ajpath.2012.04.026.
- Minner, S.; Enodien, M.; Sirma, H.; Luebke, A.M.; Krohn, A.; Mayer, P.S.; Simon, R.; Tennstedt, P.; Muller, J.; Scholz, L.; et al. ERG status is unrelated to PSA recurrence in radically operated prostate cancer in the absence of antihormonal therapy. Clin. Canc. Res. 2011, 17, 5878–5888, doi:10.1158/1078-0432.CCR-11-1251.
- Weischenfeldt, J.; Simon, R.; Feuerbach, L.; Schlangen, K.; Weichenhan, D.; Minner, S.; Wuttig, D.; Warnatz, H.J.; Stehr, H.; Rausch, T.; et al. Integrative genomic analyses reveal androgen-driven somatic alteration landscape in early-onset prostate cancer. Canc. Cell 2013, 23, 159–170, doi:10.1016/j.ccr.2013.01.002.
- Tapia, C.; Schraml, P.; Simon, R.; Al-Kuraya, K.S.; Maurer, R.; Mirlacher, M.; Novotny, H.; Spichtin, H.; Mihatsch, M.J.; Sauter, G. HER2 analysis in breast cancer: Reduced immunoreactivity in fish non-informative cancer biopsies. Int. J. Oncol. 2004, 25, 1551–1557.
© 2014 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).