Next Article in Journal
Genetic Association of Beta-Lactams-Induced Hypersensitivity Reactions: A Protocol for Systematic Review and Meta-Analysis
Next Article in Special Issue
Gene Expression Meta-Analysis of Potential Shared and Unique Pathways between Autoimmune Diseases under Anti-TNFα Therapy
Previous Article in Journal
miR-129-5p Participates in Hair Follicle Growth by Targeting HOXC13 in Rabbit
Previous Article in Special Issue
Identification of Potential Diagnostic Biomarkers and Biological Pathways in Hypertrophic Cardiomyopathy Based on Bioinformatics Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Transcriptomic Analysis of Canine Osteosarcoma from a Precision Medicine Perspective Reveals Limitations of Differential Gene Expression Studies

1
Scott-Ritchey Research Center, College of Veterinary Medicine, Auburn University, Auburn, AL 36849, USA
2
Department of Pathobiology, College of Veterinary Medicine, Auburn University, Auburn, AL 36849, USA
3
HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
4
Center for Advanced Science, Innovation, and Commerce, Alabama Agricultural Experiment Station, Auburn, AL 36849, USA
5
Department of Clinical Sciences, College of Veterinary Medicine, Auburn University, Auburn, AL 36849, USA
*
Author to whom correspondence should be addressed.
Genes 2022, 13(4), 680; https://doi.org/10.3390/genes13040680
Submission received: 2 February 2022 / Revised: 1 April 2022 / Accepted: 11 April 2022 / Published: 13 April 2022
(This article belongs to the Special Issue Transcriptomics and Bioinformatics in Precision Medicine)

Abstract

:
Despite significant advances in cancer diagnosis and treatment, osteosarcoma (OSA), an aggressive primary bone tumor, has eluded attempts at improving patient survival for many decades. The difficulty in managing OSA lies in its extreme genetic complexity, drug resistance, and heterogeneity, making it improbable that a single-target treatment would be beneficial for the majority of affected individuals. Precision medicine seeks to fill this gap by addressing the intra- and inter-tumoral heterogeneity to improve patient outcome and survival. The characterization of differentially expressed genes (DEGs) unique to the tumor provides insight into the phenotype and can be useful for informing appropriate therapies as well as the development of novel treatments. Traditional DEG analysis combines patient data to derive statistically inferred genes that are dysregulated in the group; however, the results from this approach are not necessarily consistent across individual patients, thus contradicting the basis of precision medicine. Spontaneously occurring OSA in the dog shares remarkably similar clinical, histological, and molecular characteristics to the human disease and therefore serves as an excellent model. In this study, we use transcriptomic sequencing of RNA isolated from primary OSA tumor and patient-matched normal bone from seven dogs prior to chemotherapy to identify DEGs in the group. We then evaluate the universality of these changes in transcript levels across patients to identify DEGs at the individual level. These results can be useful for reframing our perspective of transcriptomic analysis from a precision medicine perspective by identifying variations in DEGs among individuals.

1. Introduction

Osteosarcoma (OSA) is a highly aggressive and spontaneous tumor of the bone seen primarily in the appendicular skeleton of pediatric patients. Approximately 10–20% of patients exhibit macro-metastatic lesions at the time of diagnosis, while 80–90% of patients are presumed to harbor micro-metastases [1]. Metastases occur almost exclusively in the lungs and, once present, make management difficult. The 5-year survival rate for cases with detectable metastasis at the time of diagnosis is approximately 20–25% [1,2]. The current standard-of-care treatment includes surgical resection and combination chemotherapy [1]. However, 30–40% of patients will relapse within 3 years of starting treatment [3]. Metastectomy is considered the second-line treatment and improves 5-year survival rates to 40% [4]. Moreover, osteosarcoma is particularly proficient in acquiring multiple drug-resistant pathways, a major limiting factor for patient survival. Subsequent reoccurrence is exceedingly common and repeated systemic chemotherapy is frequently required; unfortunately, the eventual resistance of the metastases to treatment is inevitable [4]. Despite significant advances in the cancer therapy domain, the standard-of-care treatment and survival rates for OSA have remained essentially unchanged for 40 years [1,5,6]. For these reasons, it is imperative to explore and implement improved models to develop more effective treatment approaches.
Canine OSA has the potential to serve as an excellent model for the human disease due to similar clinical, histological, and molecular characteristics [7,8,9]. Primary tumors occur at similar sites with comparable histological presentation, response to treatment, and occurrence and distribution of metastases [10]. Further support for dogs as a valuable translational model is evidenced by a cross-species gene signature study which used oligonucleotide arrays to examine expression from a limited set of orthologous genes. The researchers found indistinguishable gene expression patterns between canine and pediatric OSA [7]. Studies utilizing whole genome and/or whole exome sequencing have reached similar conclusions [8,9]. The primary difference between canine and human OSA is the age of onset. Canine OSA has a bimodal age distribution, with peaks at 18–24 months and 7 years, though older, larger breed dogs are typically affected more often. On the other hand, human OSA primarily occurs in the second decade of life [4]. OSA prevalence is also greater in dogs, occurring 27 times more frequently than in humans, and progression occurs rapidly [11]. With amputation of the affected limb in combination with chemotherapy, the 1-year survival rate for dogs is approximately 45% [11].
New molecular tools, such as next-generation sequencing, have allowed significant improvements to be made in the identification of common genetic changes that are associated with specific types of cancer [12]. This technology has inaugurated the era of personalized or precision medicine, which utilizes an individual’s specific genetic changes to guide treatment. This approach seeks to classify individual patients into groups based on the presence of key gene mutations that directly impact the tumor’s sensitivity to specific chemotherapeutic agents. In this manner, treatments can be selected that have a higher likelihood of efficacy due to a better understanding of the relevant functioning pathways in that particular tumor.
At the molecular level, OSA, whether human or canine, is characterized by substantial genetic complexity and instability [13]. Relapsed OSA is considerably more complex, involving multiple drug-resistant pathways. The unique genetic complexity of OSA poses limitations for therapy, and precision medicine is not immune to these limitations [14]. The personalized treatment of two high risk human OSA patients based on comprehensive molecular profiling via next-generation exome sequencing showed no significant benefit to overall health or disease progression [15]. However, the targeted therapy was implemented after chemotherapy and refractory disease appeared, when drug resistance mechanisms are highly convoluted and make interpretation difficult. Fortunately, precision medicine is now entering a new phase where the transcriptomic analysis of individual patients may provide unique perspectives for treatment by addressing the intrinsic heterogeneity of gene expression in tumors, both within and between patients. A common approach to identifying targetable components of a tumor involves the analysis of differentially expressed genes (DEGs) in tumors compared to normal tissue.
Patient samples and clinical information are important factors to consider in differential gene studies and are unfortunately often overlooked. The source of normal tissue should be derived from the tumor’s cell-of-origin, and, in the case of OSA, that includes osteoblasts and osteocytes. Prior studies of differential gene expression in OSA have used RNA isolated from a canine osteoblast cell line [9]; tissues from unrelated organs such as liver, lymph node, and kidney [7]; tissue harvested adjacent to the tumor, potentially jeopardizing the normality of the sample [16]; bone tissue from unrelated patients [17]; or have not clearly described the origin of the matched normal samples [18,19]. While these data are undoubtedly useful to advancing our understanding of the disease, evaluation using primary normal bone cells harvested distally from the tumor within the same patient may yield more appropriate results. Furthermore, consistency in chemotherapy status among patients at the point of sample collection is important for drawing conclusions related to tumor status. The administration of chemotherapeutic agents provides a selective pressure that alters the tumor’s cellular population and phenotype [14,20]. Analysis of tumor gene expression prior to any chemotherapy provides a snapshot of the tumor phenotype, independent of acquired drug resistance or variation in individual drug response. Many studies in human OSA have utilized tissues harvested from individuals after the onset of chemotherapy, making interpretation of the results difficult [15,21,22].
DEG analysis combines the tumor and normal samples into two distinct groups to derive statistically meaningful DEGs that are generalizable across individuals. However, combining patients into a group implies that the samples are similar, if not identical, thereby contradicting and disregarding the intra- and inter-tumoral heterogeneity that forms the basis of precision medicine. While this type of analysis is critical for determining statistical differences in the group, individual-level analysis can provide additional insight into the differences among patients.
Given these limitations of traditional group differential gene expression studies, we supplemented this analysis with a more novel approach of evaluating individual tumors based on genes identified in the group approach. This study uses transcriptomic sequencing of RNA derived from seven primary canine osteosarcoma tumors with patient-matched normal bone to identify DEGs and pathways. Importantly, the normal bone samples, which serve as the comparator for determining baseline gene expression levels, were harvested from the phalanges of each patient. Furthermore, all samples were obtained prior to chemotherapy or evidence of macrometastatic lung disease. We explore the resulting DEGs in terms of group analysis for tumor vs. bone, as well as the discrete analysis of individual patients to identify the extent of heterogeneity in gene expression among individuals. These results highlight the phenotypic diversity of primary OSA among individuals and provide a supplemental approach to traditional methods of analyzing DEGs, particularly when the goal is an application in precision medicine.

2. Materials and Methods

2.1. Description of Data

Animals: Samples were obtained from seven dogs undergoing routine limb amputation for the clinical treatment of osteosarcoma (OSA) at the Auburn University College of Veterinary Medicine. Patient characteristics are briefly summarized in Table 1. The histopathology of tissue immediately adjacent to that used for RNA extraction was performed to confirm OSA and ensure that intact neoplastic tissue was entered into the experimental pipeline. Thoracic radiographs performed prior to amputation indicated no evidence of pulmonary macro-metastatic disease. In all cases, samples were obtained prior to chemotherapy treatment.

2.2. RNA Isolation and Sequencing

Normal bone RNA: Patient-matched samples were collected to obtain normal bone RNA to allow transcript levels to be compared with the tumors. The second phalanx was removed from the amputated leg within one hour of amputation. In all cases, the tumor was at least one joint space proximal to the phalanx. Due to the location, which was distal and distant from the primary tumor, it is unlikely that the normal bone sample had undergone any neoplastic events related to the tumor. For each dog, bone preparation and RNA extraction for the phalanx samples was performed according to Nance et al. [23].
Tumor RNA: For each dog, the tumor was dissected into approximately 3 mm by 3 mm samples, flash frozen in liquid nitrogen, and stored at −80 °C until RNA extraction. RNA isolation was accomplished using a homogenizer in combination with acid guanidinium thiocyanate-phenol-chloroform extraction. After weighing, approximately 100–200 mg of the frozen tumor samples was added to a 14 mL snap cap tube (Falcon) containing 2 mL of chilled Tri-Reagent (Molecular Research Center, Cincinnati, OH, USA). The sample was subjected to mechanical homogenization in short but frequent bursts until sufficiently homogeneized, as determined visually. The samples were divided into two microcentrifuge tubes and after 10 min incubation at room temperature, 100 μL of bromochloropropane (BCP) (Molecular Research Center) was added to each, vortexed thoroughly, and incubated for 5 min at room temperature. After centrifugation at 20,000× g for 15 min at 4 °C, the RNA-containing aqueous layer was carefully transferred to a new tube. Contaminating DNA was removed by the addition of DNase I Reaction Buffer (10% of the total volume) and 10 μL DNase I (Thermo Fisher Scientific Waltham, MA, USA), followed by a 10 min incubation at room temperature. DNase was inactivated by the addition of 10 μL of EDTA and heated at 65 °C for 10 min. Following this, 10% volume sodium acetate (3 M, pH 5.2) (VWR International, Radnor, PA, USA) was added to each tube, followed by 70% volume isopropyl alcohol, and the samples were briefly vortexed. Following a 15-min incubation at −20 °C, the samples were centrifuged at 20,000× g for 20 min at 4 °C. The supernatant was decanted, the pellet dislodged in 1 mL 70% ethanol, and centrifuged at 20,000× g for 30 min at 4 °C. After the supernatant was removed, the pellet was air dried for approximately 5–10 min and subsequently resuspended in 15 μL of RNase-free water. The RNA was quantified using a Nanodrop 2000 instrument (Thermo Fisher Scientific).
RNA Sequencing: The samples were commercially prepared and sequenced (HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA). Initial quality control analysis was performed using the Agilent Bioanalyzer 2100 (Agilent Technologies, Santa Clara, CA, USA) and an RNA integrity number (RIN) was generated. RNA sequencing libraries were produced with 500 ng of total RNA using the TruSeq PolyA library kit (Illumina) to deplete samples of any RNA aside from polyadenylated mRNA. They were pooled and sequenced on two lanes of the Illumina HiSeq v4 (PE, 50 bp, 25 M reads) yielding an average of 34 M reads per sample.

2.3. Bioinformatic Pipeline and Data Analysis

The trimming of adapters and the first leading base was performed using Trimmomatic (v0.40) [24] with a minimum length of 36 bp and raw quality was assessed using FASTQC (v0.10.1) [25]. After trimming, approximately 32–45 million sequence reads remained for each sample and FASTQC was used to ensure all bases had a Phred quality score above 28. Reads were mapped to the indexed canine reference genome (CanFam3.1) obtained from ENSEMBL (release 103) using HiSat2 (v2.2.1) [26] and a table of mapped read counts was generated with Stringtie (v1.3.3) [27]. On average, 93% of the reads mapped to the canine reference genome. The bioinformatic pipeline is summarized in Figure 1.

2.4. Group Analysis

The statistical analysis and identification of differentially expressed genes was performed using DEseq2 (v3.14) [28] to generate the traditional group analysis results (Supplementary Data S1). These results provide a list of genes that are broadly differentially expressed among samples. The DEseq2 package applies a general linear model with a negative binomial distribution and applies the Benjamini–Hochberg procedure to control for the false discovery rate (FDR). The pre-filtering of genes with less than 1 read was performed prior to statistical analysis. A multi-factor design was used for statistical analysis to include patient ID as a term in the design formula (design = ~dog + tissue source). This design has been recommended in the DEseq2 vignette for analyzing paired samples because it accounts for differences between individuals. Log2 fold-changes were calculated relative to bone. To extract significant DEGs while minimizing noise, the data were filtered using a false discovery rate (FDR) less than 0.05, base mean greater than 10, and log2 fold-change greater than 1 and less than −1 (corresponding to a fold-change of 2 and −2, respectively) (Supplementary Data S2). The variance stabilizing transformation function in DEseq2 was used to transform the data to fit an approximately homoscedastic distribution and to remove the dependence of the variance on the mean [28]. The results of this transformation were used for the visualization and clustering of the results to generate a heatmap and a principal component analysis plot. Hallmark pathway analysis and gene ontology (GO) enrichment analysis of the upregulated genes was performed using Metascape with a p-value cut-off of 0.01, minimum overlap of 3, and minimum enrichment of 1.5 [29]. Due to the constraints of gene nomenclature in Metascape, only the upregulated genes with identified human orthologs (total 670) were included in the analysis. Raw counts are available in the Supplementary Information (Supplementary Data S4).

2.5. Individual Analysis

To explore how the DEGs varied among individual patients, we first filtered the data to include only the significant genes as defined by FDR < 0.05, base mean > 10, and an FC cut-off of 2 from the classical group analysis. We then calculated log2 fold-change values for each gene by subtracting the variance-stabilized transformed counts (on the log2 scale) of bone from tumors for each dog (Supplementary Data S3). We used these results to determine the number one upregulated and downregulated gene in each patient.

3. Results

3.1. RNA Quality

RIN numbers were used to determine sequencing suitability based on RNA integrity. Samples with an RIN below 5 cannot guarantee reliable sequencing results; for all except two (patient B bone RIN = 4.5, patient A tumor RIN = 5.0), the RIN was above 5, indicating moderately intact RNA. The average RIN for tumor and bone was 6.8 and 6.5, respectively.

3.2. Group Analysis

For the traditional group analysis, we compared the gene expression from all tumor tissues to all normal tissues and found a total of 3742 differentially expressed genes with a false discovery rate (FDR) of less than 0.05. After further filtering using a base mean greater than 10 and fold-change values greater than 2 and less than −2, there were 2031 significant DEGs. Of these, 803 genes were upregulated with a fold-change greater than 2, and 1228 genes were downregulated with a fold-change less than −2 in tumor compared to normal bone (Figure 2A). Hierarchical clustering of the 2031 significant DEGs shows that the normal and tumor samples cluster together in terms of over- and under-expression (Figure 2B). This is as expected based on studies in other organisms where as many as a third of genes show evidence for altered expression in tumors compared to normal tissue.
Principal component analysis was performed to further evaluate sample clustering (Figure 2C). As expected, all bone samples clustered together, distinct from the tumor samples.
To evaluate the biologically significant DEGs, we subset the significant results based on a fold-change cut-off of 2 and sorted them based on the log2 fold-change or adjusted p-value (FDR). Ordering based on log2 fold-change allows us to evaluate the most highly over- or under-expressed genes that still meet the adjusted p-value below 0.05. On the other hand, ordering the gene list based on an adjusted p-value provides DEGs with higher confidence in shared trends among individuals. The top 10 downregulated and upregulated genes are listed in Table 2. Many of the canine ENSEMBL gene identifications did not correspond to known gene symbols or human orthologs for the top 10 genes when ordered based on smallest and largest log2 fold-change, but when ordered based on smallest adjusted p-value, all 10 genes corresponded to known gene symbols (Table 2). Based on sorting by padj, the top upregulated genes included GTSE1, HELLS, SPAG5, RAD54L, and IQGAP3. The top downregulated genes included PLIN1, CL1, FMO2, CIDEC, and ESM1.
Hallmark pathway analysis of the upregulated DEGs revealed G2M checkpoint (M5901), E2F targets (M5925), MTORC1 signaling (M5924), and epithelial mesenchymal transition (M5930) as the most enriched pathways in tumors (Figure 3A). The top pathways from gene ontology (GO) enrichment analysis included the mitotic cell cycle (GO:0000278), regulation of cell cycle process (GO:0010564), DNA metabolic process (GO:0006259), and spindle organization (GO:0007051) (Figure 3B). These results are consistent with previously published data.
The classical group analysis identifies DEGs that are broadly dysregulated among patients. As a whole, each of the tumors exhibited similar patterns in terms of DEGs and these are distinct from normal bone. However, expression was not necessarily consistent across patients for all DEGs. To observe how the top genes for the group analysis varied between individuals, we plotted the normalized counts of the number one upregulated and downregulated gene according to the smallest adjusted p-values (Figure 4B,D) and the largest log2 fold-change difference (Figure 4A,C). The top upregulated gene according to the largest fold-change difference was ENSCAFG00000044295 (L2FC = 13.5, FDR = 1.3 × 10−4), which, according to ENSEMBL, encodes an uncharacterized protein of 120 amino acids in length. The top upregulated gene with the smallest FDR was GTSE1 (L2FC = 2.9, FDR = 8.6 × 10−31), or G2 and S phase-expressed protein 1, which encodes a protein that binds the tumor suppressor protein P53 to repress its ability to induce apoptosis in response to DNA damage. The top downregulated gene according to fold-change was ARHGEF1 (L2FC = −10.7, FDR = 2.9 × 10−3), or rho guanine nucleotide exchange factor 1, which encodes a protein that may be involved in forming a complex with G proteins and the stimulation of rho-dependent signals. The top downregulated gene with the smallest FDR was PLIN1 (L2FC = −4.6, FDR = 1.9 × 10−35), or perilipin 1, which encodes a protein involved in adipocyte lipid metabolism.
Visualization of these normalized gene counts shows some variation among patients, some even showing an inverse relationship despite having an FDR < 0.05 (Figure 4C).

3.3. Individual Analysis

To supplement the classical group analysis, we sought to identify the top DEGs in each individual patient by calculating log2 fold-change values for each pair of tumor and normal samples using the significant genes from the group analysis. Figure 5 depicts the normalized counts of the top upregulated gene in each patient. While the results show a general trend shared among individuals (Figure 5C–F), the top genes for patient A and B are heavily skewed by substantial counts in the tumor of that specific transcript, and minimal to no counts in the other patients (Figure 5A,B). The top upregulated gene for patient A, ENSCAFG00000041995, encodes a predicted long non-coding RNA of 1359 bp with unknown functions (Figure 5A). Interestingly, this gene was found to be the second top downregulated gene when ordered by fold-change for the group analysis, with a log2 fold-change of −10.25 and FDR of 0.006 (Table 2). For patient B, the top upregulated gene was LOC403585, also known as serum amyloid A1, which encodes a protein that is expressed in response to inflammation (Figure 5B). In the group analysis, this gene was found to be the second top upregulated gene when ordered by fold-change, with a log2 fold-change of 12.75 and an FDR of 0.0004. The top upregulated gene for both patients C and G was TFPI2 (tissue factor pathway inhibitor 2), which encodes a serine proteinase inhibitor that has been identified as a tumor suppressor in a variety of cancers (Figure 5C). The top upregulated gene in patient D, COL11A1 (collagen type XI α 1 chain), encodes a protein component of pro-collagen type XI, a major component of bone tissue (Figure 5D). For patient E, the top upregulated gene was SFRP2 (secreted frizzled related protein 2), which encodes a protein involved in Wnt signaling (Figure 5E). The top upregulated gene in Patient F was ENSCAFG00000028460, which encodes two long non-coding RNA transcripts with unknown functions (Figure 5F). These three genes, COL11A1, SFRP2, and ENSCAFG00000028460, were not observed in the top 20 down- or up-regulated genes from the group analysis.
Similarly, the top downregulated gene for each patient was determined and more overlap among individuals was observed (Figure 6). The top downregulated gene for patient A was CYTL1 (cytokine-like 1), which encodes a protein that is expressed in bone marrow and cord blood mononuclear cells with the CD34 surface receptor [30] (Figure 6A). The top downregulated gene for patients B, E, and G was ENSCAFG00000034058, which encodes a 1501 bp long non-coding RNA with uncharacterized function (Figure 6B). The top downregulated gene for patient C was MYOC (myocilin) which encodes a protein involved in cytoskeletal function (Figure 6C). The top downregulated gene in patients D and F was MEPE (matrix extracellular phosphoglycoprotein) which encodes a protein component of the extracellular matrix of bone (Figure 6D).
To compare the individual and group analysis results, we extracted the log2 fold-change values and adjusted p-values (FDR) from the group analysis for each top gene from the individual analysis (Table 3). While the results show similarities, the top upregulated gene in patient A (ENSCAFG00000041995) shows distinct opposition in terms of the direction of fold-change from the group analysis (L2FC = −10.25). In fact, while this gene is significantly over-expressed in patient A’s tumor, it is modestly downregulated in two tumors (patients B and C) and shows no evidence of expression in tumor or normal bone for the remaining patients (Figure 5A). Furthermore, this gene was identified as the second most downregulated gene from the group results when ordered by log2 fold-change (Table 2).

4. Discussion

Osteosarcoma is a highly complex and volatile bone tumor primarily seen in pediatric patients for which treatment has remained stagnant for almost 40 years. It is well-established that canine OSA is an ideal model for studying the human disease, as well as developing and testing therapeutic alternatives, such as precision medicine concepts. Using dogs, we aim to fill gaps in the current knowledge of OSA tumorigenesis while identifying potential therapeutic targets that can be tested and evaluated in a clinical setting.
While the technology of precision medicine has rapidly advanced, the overall clinical efficacy of such therapies leaves much to be desired, due primarily to the inherent heterogeneity and rapid evolution of drug-resistant mechanisms that define cancer [31]. Though cancer is traditionally considered a disease of genotypic origin, the phenotype reflects the accumulated genetic complexity and actionable targets that may be exploited. Transcriptomic sequencing provides a snapshot of gene expression and a link between genotypic and phenotypic landscape. In the context of cancer, where aberrant transcriptional patterns are pervasive, transcriptome profiling can identify and quantify changes in gene activity that are distinctive of tumors. Using this information, we can identify differentially expressed genes associated with various cellular processes and pathways that may be broadly expressed among patients to guide current treatment options and develop new therapies.
Traditional differential gene analysis groups individuals together to derive statistically meaningful DEGs; however, this approach fails to account for intra- and inter-tumoral heterogeneity and therefore is limited in its application for precision medicine. Deriving individual fold-change values provides additional insight into the differences among patients and potential targets that may be beneficial for a subset of patients. Individual-level analyses are difficult due to the lack of statistical inference that can be derived from a single sample. To address this limitation, we selected only the significant DEGs from the group analysis to be carried forward to the individual-level analysis, thereby providing some statistical basis for the individual results. However, it must be recognized that some distinctive gene expression alterations may be so specific to one individual that they will be lost from this analysis.
Using this concept, we sequenced the transcriptomes of seven primary canine OSA tumors and patient-matched normal bone samples to derive differentially expressed genes (DEGs). In contrast to previously published studies, all samples were collected before chemotherapy and evidence of macro-metastatic lung disease to limit confounding interpretation of the results. Furthermore, normal bone samples were collected from each patient to generate suitable baseline gene expression levels. To our knowledge, this is the first study to derive normal expression for each patient using a biologically appropriate sample (bone) which is representative of the tumor’s cell-of-origin (osteoblasts and osteocytes, depleted of bone marrow) in primary osteosarcoma prior to chemotherapy. This pairwise comparison reduces biological variability, increases statistical power, and provides a more thorough perspective of differential gene expression.
The clustering of the DEGs derived from the traditional group analysis showed distinct relationships between tumor and normal samples as a group. The wide dispersion and variability in the tumor samples, as evidenced in the PCA plot (Figure 2C), is believed to be related to the intra- and inter-tumoral heterogeneity. The bone tissues were depleted of bone marrow and any cells other than those embedded within the bone matrix, whereas the tumor samples may contain many different types of cells at various stages of differentiation (for example, infiltrating immune cells and de-differentiated cells with a more stem-cell-like phenotype). Furthermore, even tumors of the same type or source exhibit heterogeneity in their molecular profile [32].
We used traditional group analysis to statistically identify the top genes that are broadly dysregulated among the patients. Significant DEGs were selected based on padj < 0.05, FC > 2 or <−2, and a base mean count > 10, and these results were ranked based on log2 fold-change as well as an adjusted p-value. The top upregulated genes from the group analysis included GTSE1 (when sorted by padj) and ENSCAFG00000044295 (when sorted by log2 fold-change). GTSE1 regulates the G1/S cell cycle transition and has been reported to be overexpressed in other human cancers [33,34,35,36]. GTSE1 has been implicated in conferring cisplatin resistance in human osteosarcoma, though more studies are necessary to confirm its role in OSA tumorigenesis [37]. ENSCAFG00000044295 encodes a 120 amino acid protein with uncharacterized functions that does not correlate to any human ENSEMBL IDs. The top downregulated genes included PLIN1 (when sorted by padj) andARHGEF1 (when sorted by log2 fold-change). The downregulation of PLIN1 mRNA has been reported in breast cancer and is considered a tumor suppressor in breast cancer progression [38,39]. The dysregulation of rho GTPases, including ARHGEF1, are commonly reported in a variety of cancers [40].
We showed that the group results can sometimes be misleading due to the heterogeneity among individuals. In some cases, a bi-directional change in expression was observed among patients. To circumvent this, we used individual-level analysis to derive log2 fold-change values for each patient using only the significant genes identified in the group analysis. We used this information to identify the top upregulated and downregulated genes in each patient and compare their expression across individuals. The top downregulated gene was shared between patients B, E, and G (ENSCAFG00000034058), as well as for patients D and F (MEPE). ENSCAFG00000034058 encodes a long non-coding RNA with unknown functions. Co-expression network analysis has revealed MEPE as a dysregulated gene in human osteosarcoma [41]. The top upregulated gene was different for all individuals (A: ENSCAFG00000041995, B: LOC403585, D: COL11A1, E: SFRP2, F: ENSCAFG00000028460), except for patients C and G (TFPI2). The upregulation of COL11A1 and SFRP2 has been reported in human osteosarcoma [22,42]. TFPI2, a tumor suppressor gene, has been shown to be upregulated in other cancers, including colorectal, gastric, and prostate [43,44,45]. Both ENSCAFG00000041995 and ENSCAFG00000028460 encode uncharacterized long non-coding RNAs that do not correlate to known human ENSEMBL gene IDs. More studies are needed to evaluate their role in OSA tumorigenesis.
The top upregulated gene in patient A (ENSCAFG00000041995) showed an inverse relationship in terms of the direction of fold-change in the group analysis. In fact, this gene was the second-most downregulated gene according to the group results sorted by log2 fold-change, despite having an adjusted p-value less than 0.05. The observed differences in gene expression among individuals may be due to (1) the effects of different mutations in combination with genetic and/or environmental effects, (2) the stage of the tumor, or (3) initial RNA quality. This example represents the hallmark conclusion of this study; despite relatively stringent filtering conditions to minimize noise and confidently identify the top DEGs, traditional group analyses can be misleading and lead to the identification of therapeutic targets that may be ineffective for most patients.
Consistent with the conventional theme of research, this study is not without limitations. With a small sample size of seven, the statistical power is limited. As with all next-generation sequencing studies, computational challenges may limit the downstream interpretation. For example, the extent of gene annotation in the canine reference genome (CanFam3.1) limited our ability to identify the appropriate human orthologs. Additionally, it is becoming increasingly evident that non-coding RNAs such as micro-RNA and long non-coding RNA (lncRNA) play a dynamic role in tumor progression [18,19]. This study utilized a sequencing approach that excluded RNAs without poly-A tails. However, some lncRNAs are produced and processed with poly-A tails and as such were included in the sequencing. Furthermore, the heterogeneity of tumors makes it difficult to understand and characterize the phenotypic landscape, its components, and how those components affect tumor progression. It is also difficult to identify the distribution of these changes; for example, a highly upregulated gene may be vastly over-expressed, but only prevalent in only a small subset of cells. The clonal hypothesis implies a spatial distribution of different tumor cell populations, and using a single small section for sequencing may impair our ability to detect other cellular subsets. With the advancement of single-cell RNA sequencing and the development of accompanying bioinformatics tools, we may be able to elucidate the details of a tumor’s phenotype more clearly. While more studies are needed to evaluate the role these dysregulated genes play in tumorigenesis, these results support the notion that traditional group DEG analysis should be supplemented by individual-level analysis in the context of precision medicine for the identification of potential therapeutic targets.

5. Conclusions

Canine OSA serves as an excellent model for determining molecular targets and developing and evaluating personalized precision treatment. The relatively ample availability of dogs with spontaneously occurring OSA provides a powerful and underused translational model. Limb amputation offers the opportunity to easily collect tumor samples for sequencing as well as appropriate normal tissue, for example, from the distal phalanges, to allow comparisons within the same patient prior to chemotherapy [23].
The goal of precision medicine is to use an individual tumor’s molecular profile, rather than tumor category or stage, to inform therapeutic decisions and design novel treatments. Group DEG analysis has been traditionally used to identify relevant genes, but these results do not account for intra- or inter-tumoral differences. Despite using a multi-factor paired design to account for differences between individuals, as well as strict filtering parameters to minimize noise, our results showed some conflicting elements and variation in DEG expression. These results suggest that identifying significantly up- or downregulated genes as potential therapeutic targets using traditional group analysis may not always be appropriate, even when stringent filtering conditions are used. While these conclusions may be well established in the bioinformatics community, they may be lesser known in the precision oncology field, where DEG analysis is pervasive. Supplementing DEG analyses with individual-level analyses provides additional insight into the intra- and inter-tumoral heterogeneity.
The novelty of this study lies in the sample set as well as the analytical workflow. In contrast to other published studies, (1) all samples were obtained prior to chemotherapy, which can alter the phenotype; (2) normal bone samples (depleted of contaminating tissue) from each patient were used as the comparator tissue for baseline gene expression; and (3) DEG analysis was supplemented by individual-level analysis and compared to group DEG analysis. This study demonstrates that the variation in DEG expression between individuals, obtained using traditional group DEG analysis, is sufficient to warrant further individual-level analysis to identify more effective targets for precision therapy.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes13040680/s1, Data S1: Results for all genes, Data S2: significant genes, Data S3: individual fold-change, Data S4: raw gene counts.

Author Contributions

Conceptualization, R.L.N., D.S., P.A. and B.F.S.; methodology, R.L.N., D.S., P.A. and B.F.S.; case accrual and sampling, B.M., S.L., A.N.S., A.A.S., N.B., M.S. and J.K.; validation, R.L.N., S.J.C. and D.S.; formal analysis, R.L.N., S.J.C., D.S. and X.W.; data curation, R.L.N., S.J.C., D.S. and X.W.; writing—original draft preparation, R.L.N.; writing—review and editing, B.F.S.; visualization, R.L.N.; supervision, B.F.S.; project administration, B.F.S.; funding acquisition, B.F.S. and X.W. All authors have read and agreed to the published version of the manuscript.

Funding

This study was generously funded by the Scott Shockley Foundation and the Auburn University Research Initiative in Cancer (AURIC). Contribution from X.W. is supported by a National Science Foundation EPSCoR RII Track-4 Research Fellowship (1928770), a USDA National Institute of Food and Agriculture, Hatch project 1018100, and a laboratory start-up fund from Auburn University College of Veterinary Medicine. S.J.C. and D.S. received support from the State of Alabama’s Cancer Research Fund.

Data Availability Statement

The data used in this study have been deposited into the NCBI GEO repository under accession number GSE199489. Data and scripts used for processing and generating figures can be found at DEseq2, https://github.com/rln0005/OSA_RNAseq (accessed on 1 February 2022).

Acknowledgments

We would like to extend our sincerest gratitude to Elizabeth Stickelmaier and the oncology team at the Auburn University College of Veterinary Medicine for coordinating patient sample acquisition. We are also very grateful for the knowledge and expertise of Tonia Schwartz in the Department of Biological Sciences at Auburn University.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Geller, D.S.; Gorlick, R. Osteosarcoma: A Review of Diagnosis, Management, and Treatment Strategies. Clin. Adv. Hematol. Oncol. 2010, 8, 705–718. [Google Scholar] [PubMed]
  2. Luetke, A.; Meyers, P.A.; Lewis, I.; Juergens, H. Osteosarcoma Treatment—Where Do We Stand? A State of the Art Review. Cancer Treat. Rev. 2014, 40, 523–532. [Google Scholar] [CrossRef] [PubMed]
  3. Longhi, A.; Errani, C.; De Paolis, M.; Mercuri, M.; Bacci, G. Primary Bone Osteosarcoma in the Pediatric Age: State of the Art. Cancer Treat. Rev. 2006, 32, 423–436. [Google Scholar] [CrossRef] [PubMed]
  4. Gorlick, R.; Khanna, C. Osteosarcoma. J. Bone Min. Res. 2010, 25, 683–691. [Google Scholar] [CrossRef] [PubMed]
  5. Mirabello, L.; Troisi, R.J.; Savage, S.A. Osteosarcoma Incidence and Survival Rates from 1973 to 2004: Data from the Surveillance, Epidemiology, and End Results Program. Cancer 2009, 115, 1531–1543. [Google Scholar] [CrossRef] [Green Version]
  6. Sayles, L.C.; Breese, M.R.; Koehne, A.L.; Leung, S.G.; Lee, A.G.; Liu, H.-Y.; Spillinger, A.; Shah, A.T.; Tanasa, B.; Straessler, K.; et al. Genome-Informed Targeted Therapy for Osteosarcoma. Cancer Discov. 2019, 9, 46–63. [Google Scholar] [CrossRef] [Green Version]
  7. Paoloni, M.; Davis, S.; Lana, S.; Withrow, S.; Sangiorgi, L.; Picci, P.; Hewitt, S.; Triche, T.; Meltzer, P.; Khanna, C. Canine Tumor Cross-Species Genomics Uncovers Targets Linked to Osteosarcoma Progression. BMC Genom. 2009, 10, 625. [Google Scholar] [CrossRef] [Green Version]
  8. Sakthikumar, S.; Elvers, I.; Kim, J.; Arendt, M.L.; Thomas, R.; Turner-Maier, J.; Swofford, R.; Johnson, J.; Schumacher, S.E.; Alföldi, J.; et al. SETD2 Is Recurrently Mutated in Whole-Exome Sequenced Canine Osteosarcoma. Cancer Res. 2018, 78, 3421–3431. [Google Scholar] [CrossRef] [Green Version]
  9. Gardner, H.L.; Sivaprakasam, K.; Briones, N.; Zismann, V.; Perdigones, N.; Drenner, K.; Facista, S.; Richholt, R.; Liang, W.; Aldrich, J.; et al. Canine Osteosarcoma Genome Sequencing Identifies Recurrent Mutations in DMD and the Histone Methyltransferase Gene SETD2. Commun. Biol. 2019, 2, 266. [Google Scholar] [CrossRef] [Green Version]
  10. Withrow, S.J. Withrow and MacEwen’s Small Animal Clinical Oncology; Elsevier Health Sciences: St. Louis, MO, USA, 2007; ISBN 978-0-7216-0558-6. [Google Scholar]
  11. Simpson, S.; Dunning, M.D.; de Brot, S.; Grau-Roma, L.; Mongan, N.P.; Rutland, C.S. Comparative Review of Human and Canine Osteosarcoma: Morphology, Epidemiology, Prognosis, Treatment and Genetics. Acta Vet. Scand. 2017, 59, 71. [Google Scholar] [CrossRef]
  12. Zhao, J.; Dean, D.C.; Hornicek, F.J.; Yu, X.; Duan, Z. Emerging Next-Generation Sequencing-Based Discoveries for Targeted Osteosarcoma Therapy. Cancer Lett. 2020, 474, 158–167. [Google Scholar] [CrossRef]
  13. Martin, J.W.; Squire, J.A.; Zielenska, M. The Genetics of Osteosarcoma. Sarcoma 2012, 2012, e627254. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Lipinski, K.A.; Barber, L.J.; Davies, M.N.; Ashenden, M.; Sottoriva, A.; Gerlinger, M. Cancer Evolution and the Limits of Predictability in Precision Cancer Medicine. Trends Cancer 2016, 2, 49–63. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Subbiah, V.; Wagner, M.J.; McGuire, M.F.; Sarwari, N.M.; Devarajan, E.; Lewis, V.O.; Westin, S.; Kato, S.; Brown, R.E.; Anderson, P. Personalized Comprehensive Molecular Profiling of High Risk Osteosarcoma: Implications and Limitations for Precision Medicine. Oncotarget 2015, 6, 40642–40654. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  16. Simpson, S.; Dunning, M.; de Brot, S.; Alibhai, A.; Bailey, C.; Woodcock, C.L.; Mestas, M.; Akhtar, S.; Jeyapalan, J.N.; Lothion-Roy, J.; et al. Molecular Characterisation of Canine Osteosarcoma in High Risk Breeds. Cancers 2020, 12, 2405. [Google Scholar] [CrossRef]
  17. Endo-Munoz, L.; Cumming, A.; Sommerville, S.; Dickinson, I.; Saunders, N.A. Osteosarcoma Is Characterised by Reduced Expression of Markers of Osteoclastogenesis and Antigen Presentation Compared with Normal Bone. Br. J. Cancer 2010, 103, 73–81. [Google Scholar] [CrossRef]
  18. Xie, L.; Yao, Z.; Zhang, Y.; Li, D.; Hu, F.; Liao, Y.; Zhou, L.; Zhou, Y.; Huang, Z.; He, Z.; et al. Deep RNA Sequencing Reveals the Dynamic Regulation of MiRNA, LncRNAs, and MRNAs in Osteosarcoma Tumorigenesis and Pulmonary Metastasis. Cell Death Dis. 2018, 9, 772. [Google Scholar] [CrossRef] [Green Version]
  19. Zhao, X.; Wang, Q.; Lin, F.; Wang, X.; Wang, Y.; Wang, J.; Wang, C. RNA Sequencing of Osteosarcoma Gene Expression Profile Revealed That MiR-214-3p Facilitates Osteosarcoma Cell Proliferation via Targeting Ubiquinol-Cytochrome c Reductase Core Protein 1 (UQCRC1). Med. Sci. Monit. 2019, 25, 4982–4991. [Google Scholar] [CrossRef]
  20. Glickman, M.S.; Sawyers, C.L. Converting Cancer Therapies into Cures: Lessons from Infectious Diseases. Cell 2012, 148, 1089–1098. [Google Scholar] [CrossRef] [Green Version]
  21. Martson, A.; Kõks, S.; Reimann, E.; Prans, E.; Erm, T.; Maasalu, K. Transcriptome Analysis of Osteosarcoma Identifies Suppression of Wnt Pathway and Up-Regulation of Adiponectin as Potential Biomarker. Genom. Discov. 2013, 1, 3. [Google Scholar] [CrossRef] [Green Version]
  22. Ho, X.D.; Phung, P.; Q Le, V.; H Nguyen, V.; Reimann, E.; Prans, E.; Kõks, G.; Maasalu, K.; Le, N.T.; H Trinh, L.; et al. Whole Transcriptome Analysis Identifies Differentially Regulated Networks between Osteosarcoma and Normal Bone Samples. Exp. Biol. Med. 2017, 242, 1802–1811. [Google Scholar] [CrossRef]
  23. Nance, R.; Agarwal, P.; Sandey, M.; Starenki, D.; Koehler, J.; Sajib, A.M.; Smith, B.F. A Method for Isolating RNA from Canine Bone. Biotechniques 2020, 68, 311–317. [Google Scholar] [CrossRef]
  24. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A Flexible Trimmer for Illumina Sequence Data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Babraham Bioinformatics—FastQC a Quality Control Tool for High Throughput Sequence Data. Available online: https://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (accessed on 26 October 2021).
  26. Kim, D.; Langmead, B.; Salzberg, S.L. HISAT: A Fast Spliced Aligner with Low Memory Requirements. Nat. Methods 2015, 12, 357–360. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Pertea, M.; Pertea, G.M.; Antonescu, C.M.; Chang, T.-C.; Mendell, J.T.; Salzberg, S.L. StringTie Enables Improved Reconstruction of a Transcriptome from RNA-Seq Reads. Nat. Biotechnol. 2015, 33, 290–295. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  28. Love, M.I.; Huber, W.; Anders, S. Moderated Estimation of Fold Change and Dispersion for RNA-Seq Data with DESeq2. Genome Biol. 2014, 15, 550. [Google Scholar] [CrossRef] [Green Version]
  29. Zhou, Y.; Zhou, B.; Pache, L.; Chang, M.; Khodabakhshi, A.H.; Tanaseichuk, O.; Benner, C.; Chanda, S.K. Metascape Provides a Biologist-Oriented Resource for the Analysis of Systems-Level Datasets. Nat. Commun. 2019, 10, 1523. [Google Scholar] [CrossRef] [PubMed]
  30. Liu, X.; Rapp, N.; Deans, R.; Cheng, L. Molecular Cloning and Chromosomal Mapping of a Candidate Cytokine Gene Selectively Expressed in Human CD34+ Cells. Genomics 2000, 65, 283–292. [Google Scholar] [CrossRef] [PubMed]
  31. Dugger, S.A.; Platt, A.; Goldstein, D.B. Drug Development in the Era of Precision Medicine. Nat. Rev. Drug Discov. 2018, 17, 183–196. [Google Scholar] [CrossRef]
  32. Marusyk, A.; Polyak, K. Tumor Heterogeneity: Causes and Consequences. Biochim. Biophys. Acta 2010, 1805, 105. [Google Scholar] [CrossRef] [Green Version]
  33. Lin, F.; Xie, Y.-J.; Zhang, X.-K.; Huang, T.-J.; Xu, H.-F.; Mei, Y.; Liang, H.; Hu, H.; Lin, S.-T.; Luo, F.-F.; et al. GTSE1 Is Involved in Breast Cancer Progression in P53 Mutation-Dependent Manner. J. Exp. Clin. Cancer Res. 2019, 38, 152. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Lai, W.; Zhu, W.; Li, X.; Han, Y.; Wang, Y.; Leng, Q.; Li, M.; Wen, X. GTSE1 Promotes Prostate Cancer Cell Proliferation via the SP1/FOXM1 Signaling Pathway. Lab. Investig. 2021, 101, 554–563. [Google Scholar] [CrossRef] [PubMed]
  35. Wu, X.; Wang, H.; Lian, Y.; Chen, L.; Gu, L.; Wang, J.; Huang, Y.; Deng, M.; Gao, Z.; Huang, Y. GTSE1 Promotes Cell Migration and Invasion by Regulating EMT in Hepatocellular Carcinoma and Is Associated with Poor Prognosis. Sci. Rep. 2017, 7, 5129. [Google Scholar] [CrossRef]
  36. Liu, A.; Zeng, S.; Lu, X.; Xiong, Q.; Xue, Y.; Tong, L.; Xu, W.; Sun, Y.; Zhang, Z.; Xu, C. Overexpression of G2 and S Phase-Expressed-1 Contributes to Cell Proliferation, Migration, and Invasion via Regulating P53/FoxM1/CCNB1 Pathway and Predicts Poor Prognosis in Bladder Cancer. Int. J. Biol. Macromol. 2019, 123, 322–334. [Google Scholar] [CrossRef]
  37. Xie, C.; Xiang, W.; Shen, H.; Shen, J. GTSE1 Is Possibly Involved in the DNA Damage Repair and Cisplatin Resistance in Osteosarcoma. J. Orthop. Surg. Res. 2021, 16, 713. [Google Scholar] [CrossRef]
  38. Zhou, C.; Wang, M.; Zhou, L.; Zhang, Y.; Liu, W.; Qin, W.; He, R.; Lu, Y.; Wang, Y.; Chen, X.-Z.; et al. Prognostic Significance of PLIN1 Expression in Human Breast Cancer. Oncotarget 2016, 7, 54488–54502. [Google Scholar] [CrossRef] [Green Version]
  39. Lu, C.; Wang, X.; Zhao, X.; Xin, Y.; Liu, C. Long Non-Coding RNA ARAP1-AS1 Accelerates Cell Proliferation and Migration in Breast Cancer through MiR-2110/HDAC2/PLIN1 Axis. Biosci. Rep. 2020, 40, BSR20191764. [Google Scholar] [CrossRef] [Green Version]
  40. Jung, H.; Yoon, S.R.; Lim, J.; Cho, H.J.; Lee, H.G. Dysregulation of Rho GTPases in Human Cancers. Cancers 2020, 12, 1179. [Google Scholar] [CrossRef] [PubMed]
  41. Wang, J.; Wang, Y.; Zhong, Y.; Li, X.; Du, S.; Xie, P.; Zheng, G.; Han, J. Identification of Co-Expression Modules and Pathways Correlated with Osteosarcoma and Its Metastasis. World J. Surg. Oncol. 2019, 17, 46. [Google Scholar] [CrossRef] [PubMed]
  42. Kim, H.; Yoo, S.; Zhou, R.; Xu, A.; Bernitz, J.M.; Yuan, Y.; Gomes, A.M.; Daniel, M.G.; Su, J.; Demicco, E.G.; et al. Oncogenic Role of SFRP2 in P53-Mutant Osteosarcoma Development via Autocrine and Paracrine Mechanism. Proc. Natl. Acad. Sci. USA 2018, 115, E11128–E11137. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  43. Sierko, E.; Wojtukiewicz, M.Z.; Kisiel, W. The Role of Tissue Factor Pathway Inhibitor-2 in Cancer Biology. Semin. Thromb. Hemost. 2007, 33, 653–659. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Zhang, W.; Yang, C.; Wang, S.; Xiang, Z.; Dou, R.; Lin, Z.; Zheng, J.; Xiong, B. SDC2 and TFPI2 Methylation in Stool Samples as an Integrated Biomarker for Early Detection of Colorectal Cancer. Cancer Manag. Res. 2021, 13, 3601–3617. [Google Scholar] [CrossRef] [PubMed]
  45. Wang, S.; Xiao, X.; Zhou, X.; Huang, T.; Du, C.; Yu, N.; Mo, Y.; Lin, L.; Zhang, J.; Ma, N.; et al. TFPI-2 Is a Putative Tumor Suppressor Gene Frequently Inactivated by Promoter Hypermethylation in Nasopharyngeal Carcinoma. BMC Cancer 2010, 10, 617. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. Overview of bioinformatic pipeline used to process and analyze data. RNA sequence data were subjected to modest trimming with Trimmomatic and quality analysis with FASTQC prior to mapping to the reference canine genome with HiSat2, counting reads with Stringtie, and DEG statistics with DEseq2 to generate significant DEGs (FDR < 0.05, FC > 2, ≤2) which were carried forward in pathway and individual analyses. Using the significant DEGs from the group analysis, fold-change values were generated for each patient to produce the individual-level data. Bioinformatic tools and packages utilized are indicated.
Figure 1. Overview of bioinformatic pipeline used to process and analyze data. RNA sequence data were subjected to modest trimming with Trimmomatic and quality analysis with FASTQC prior to mapping to the reference canine genome with HiSat2, counting reads with Stringtie, and DEG statistics with DEseq2 to generate significant DEGs (FDR < 0.05, FC > 2, ≤2) which were carried forward in pathway and individual analyses. Using the significant DEGs from the group analysis, fold-change values were generated for each patient to produce the individual-level data. Bioinformatic tools and packages utilized are indicated.
Genes 13 00680 g001
Figure 2. Analysis of differential gene expression in tumor vs normal group data reveals over 3000 significant DEGs. Volcano plot shows many genes are highly differentially expressed between tumor vs. normal (A). Genes indicated in red are significant in terms of both adjusted p-value (<0.05) and fold-change (>2 and ≤2). The sign of fold-change (positive or negative) was retained and is reported in terms of tumor compared to bone. Heatmap of the significant genes (FDR < 0.05) shows that these DEGs easily differentiate tumor from normal tissue (B). Each row represents a gene and upregulation is indicated in red, while downregulation is shown in blue. Patient and sample ID is indicated underneath the corresponding column. PCA plot shows grouping of normal bone samples (red circles) distinct from tumor samples (blue circles) as expected (C). The dispersion and variability of the tumor samples is thought to be related to intra- and inter-tumoral heterogeneity.
Figure 2. Analysis of differential gene expression in tumor vs normal group data reveals over 3000 significant DEGs. Volcano plot shows many genes are highly differentially expressed between tumor vs. normal (A). Genes indicated in red are significant in terms of both adjusted p-value (<0.05) and fold-change (>2 and ≤2). The sign of fold-change (positive or negative) was retained and is reported in terms of tumor compared to bone. Heatmap of the significant genes (FDR < 0.05) shows that these DEGs easily differentiate tumor from normal tissue (B). Each row represents a gene and upregulation is indicated in red, while downregulation is shown in blue. Patient and sample ID is indicated underneath the corresponding column. PCA plot shows grouping of normal bone samples (red circles) distinct from tumor samples (blue circles) as expected (C). The dispersion and variability of the tumor samples is thought to be related to intra- and inter-tumoral heterogeneity.
Genes 13 00680 g002
Figure 3. Pathway analysis of the upregulated DEGs using hallmark pathways (A) and gene ontology (GO) enrichment terms (B).
Figure 3. Pathway analysis of the upregulated DEGs using hallmark pathways (A) and gene ontology (GO) enrichment terms (B).
Genes 13 00680 g003
Figure 4. Plots of the normalized counts of the top up- and down-regulated genes in the classical group analysis show some variation among individual patients. The number one gene when ordered by log2 fold-change shows variation among individual patients (A,C). The top gene when ordered by smallest adjusted p-value shows less variation in terms of direction of fold-change, but some variation between individuals is still evident (B,D).
Figure 4. Plots of the normalized counts of the top up- and down-regulated genes in the classical group analysis show some variation among individual patients. The number one gene when ordered by log2 fold-change shows variation among individual patients (A,C). The top gene when ordered by smallest adjusted p-value shows less variation in terms of direction of fold-change, but some variation between individuals is still evident (B,D).
Genes 13 00680 g004
Figure 5. Plots of the top upregulated gene for each dog based on individual-level analysis reveal further variation among patients. The normalized counts of the top upregulated gene in each patient is shown. The top upregulated gene for patient A was ENSCAFG00000041995 (A). The top upregulated gene for patient B was LOC403585 (B). Patients C and G shared the same top upregulated gene, TFPI2 (C). The top upregulated gene for patient D was COL11A1 (D). The top upregulated gene for patient E was SFRP2 (E). The top upregulated gene for patient F was ENSCAFG00000028460 (F).
Figure 5. Plots of the top upregulated gene for each dog based on individual-level analysis reveal further variation among patients. The normalized counts of the top upregulated gene in each patient is shown. The top upregulated gene for patient A was ENSCAFG00000041995 (A). The top upregulated gene for patient B was LOC403585 (B). Patients C and G shared the same top upregulated gene, TFPI2 (C). The top upregulated gene for patient D was COL11A1 (D). The top upregulated gene for patient E was SFRP2 (E). The top upregulated gene for patient F was ENSCAFG00000028460 (F).
Genes 13 00680 g005
Figure 6. The top downregulated gene for each individual dog is shared among some patients. The normalized counts of the top downregulated gene in each patient are shown. The top downregulated gene in patient A was CYTL1 (A). Patients B, E, and G share the same top downregulated gene, ENSCAFG00000034058 (B). The top downregulated gene in patient C was MYOC (C). Patients D and F share the same top downregulated gene, MEPE (D).
Figure 6. The top downregulated gene for each individual dog is shared among some patients. The normalized counts of the top downregulated gene in each patient are shown. The top downregulated gene in patient A was CYTL1 (A). Patients B, E, and G share the same top downregulated gene, ENSCAFG00000034058 (B). The top downregulated gene in patient C was MYOC (C). Patients D and F share the same top downregulated gene, MEPE (D).
Genes 13 00680 g006
Table 1. Summary of canine osteosarcoma patients. Tumor and patient-matched normal bone samples were obtained from seven dogs undergoing limb amputation for osteosarcoma. Age is based on the date of limb amputation and breed is reported per the owner. (RIN = RNA integrity number, yrs = years, mos = months).
Table 1. Summary of canine osteosarcoma patients. Tumor and patient-matched normal bone samples were obtained from seven dogs undergoing limb amputation for osteosarcoma. Age is based on the date of limb amputation and breed is reported per the owner. (RIN = RNA integrity number, yrs = years, mos = months).
PatientSexNeuter StatusAgeTumor SiteBreedTumor RINBone RIN
AMCastrated10 yrs, 6 mosLeft distal radiusGolden Retriever5.06.6
BMCastrated7 yrsLeft distal femurRottweiler7.44.5
CMCastrated7 yrsLeft distal radiusDoberman Pinscher6.96.5
DMCastrated11 yrs, 9 mosLeft proximal humerusGreyhound6.46.7
EFSpayed7 yrs, 6 mosLeft distal radiusGreat Pyrenees6.96.9
FMCastrated7 yrsRight distal radiusGolden Retriever8.37.1
GFSpayed9 yrs, 9 mosRight distal tibiaGreyhound6.77.2
Table 2. List of top 10 upregulated and downregulated genes in tumor vs. normal from group analysis. When ordered by lowest and highest log2 fold-change, many of the ENSEMBL gene identifications do not correspond to known gene symbols. When ordered by smallest adjusted p-value (FDR), all ENSEMBL IDs correspond to known symbols.
Table 2. List of top 10 upregulated and downregulated genes in tumor vs. normal from group analysis. When ordered by lowest and highest log2 fold-change, many of the ENSEMBL gene identifications do not correspond to known gene symbols. When ordered by smallest adjusted p-value (FDR), all ENSEMBL IDs correspond to known symbols.
Top 10 Downregulated Genes in Tumor, Ordered by Log2 Fold-Change
ENSEMBLSYMBOLLog2FCPadjGENE NAME
ENSCAFG00000028799ARHGEF1−10.752.93 × 10−3Rho guanine nucleotide exchange factor 1
ENSCAFG00000041995NA−10.256.33 × 10−3NA
ENSCAFG00000007622NA−10.046.95 × 10−3NA
ENSCAFG00000005350NA−9.699.07 × 10−3NA
ENSCAFG00000014986FMO2−9.354.40 × 10−28flavin containing dimethylaniline monoxygenase 2
ENSCAFG00000042006NA−9.221.63 × 10−2NA
ENSCAFG00000049609NA−8.941.84 × 10−2NA
ENSCAFG00000008109NA−8.779.28 × 10−4NA
ENSCAFG00000029213LOC607979−8.672.22 × 10−2eukaryotic translation initiation factor 3, subunit L pseudogene
ENSCAFG00000011465NA−8.593.20 × 10−8NA
Top 10 Upregulated Genes in Tumor, Ordered by Log2 Fold-Change
ENSCAFG00000044295NA13.491.31 × 10−4NA
ENSCAFG00000009135LOC40358512.754.34 × 10−4serum amyloid A1
ENSCAFG00000032019NLRP1211.361.85 × 10−3NLR family pyrin domain containing 12
ENSCAFG00000013213NA10.325.31 × 10−3NA
ENSCAFG00000043115NA9.205.53 × 10−3NA
ENSCAFG00000007173NA7.951.31 × 10−3NA
ENSCAFG00000006648HOXC107.064.67 × 10−3homeobox C10
ENSCAFG00000015211APOBEC3Z16.583.81 × 10−6apolipoprotein B mRNA editing enzyme, catalytic polypeptide-like
ENSCAFG00000008986RASAL16.121.68 × 10−6RAS protein activator like 1
ENSCAFG00000035513LOC1110936516.117.53 × 10−4uncharacterized LOC111093651
Top 10 Downregulated Genes in Tumor, Ordered by Adjusted p-value
ENSCAFG00000011986PLIN1−4.611.91 × 10−35perilipin 1
ENSCAFG00000006248CHL1−6.901.11 × 10−29cell adhesion molecule L1 like
ENSCAFG00000014986FMO2−9.354.40 × 10−28flavin containing dimethylaniline monoxygenase 2
ENSCAFG00000005266CIDEC−5.106.25 × 10−28cell death inducing DFFA like effector c
ENSCAFG00000018381ESM1−3.632.38 × 10−27endothelial cell specific molecule 1
ENSCAFG00000006392ACKR4−4.416.62 × 10−27atypical chemokine receptor 4
ENSCAFG00000030764SLC25A29−1.421.31 × 10−25solute carrier family 25 member 29
ENSCAFG00000003807KLF15−3.142.28 × 10−23Kruppel-like factor 15
ENSCAFG00000001854AQP7−3.582.66 × 10−23aquaporin 7
ENSCAFG00000015323PLIN4−3.527.60 × 10−22perilipin 4
Top 10 Upregulated Genes in Tumor, Ordered by Adjusted p-value
ENSCAFG00000000782GTSE12.888.51 × 10−31G2 and S-phase expressed 1
ENSCAFG00000008090HELLS2.341.54 × 10−27helicase, lymphoid specific
ENSCAFG00000018724SPAG53.332.28 × 10−23sperm associated antigen 5
ENSCAFG00000004272RAD54L2.481.35 × 10−21RAD54 like
ENSCAFG00000016616IQGAP33.041.41 × 10−20IQ motif containing GTPase activating protein 3
ENSCAFG00000010114CIT2.463.84 × 10−20citron rho-interacting serine/threonine kinase
ENSCAFG00000006648HOXC107.064.67 × 10−20homeobox C10
ENSCAFG00000016090TOP2A3.266.47 × 10−20DNA topoisomerase II α
ENSCAFG00000013255MKI673.042.35 × 10−18marker of proliferation Ki-67
ENSCAFG00000008478MOGS1.306.47 × 10−18mannosyl-oligosaccharide glucosidase
Table 3. Comparison of individual-level analysis with group analysis results. The top up- and downregulated gene for each patient according to log2 fold-change values from individual analysis shows some similarities to group analysis results. The top upregulated gene in patient A (ENSCAFG00000041995) shows distinct opposition in terms of direction of fold-change in the group results.
Table 3. Comparison of individual-level analysis with group analysis results. The top up- and downregulated gene for each patient according to log2 fold-change values from individual analysis shows some similarities to group analysis results. The top upregulated gene in patient A (ENSCAFG00000041995) shows distinct opposition in terms of direction of fold-change in the group results.
PatientGeneLog2FC (Individual) Log2FC (Group)FDR
Individual Analysis ResultsTop Upregulated GeneAENSCAFG000000419959.03Group Analysis Results−10.256.39 × 10−3
BLOC4035856.4212.754.37 × 10−4
CTFPI25.923.233.60 × 10−6
DCOL11A13.813.181.49 × 10−9
ESFRP28.633.197.36 × 10−6
FENSCAFG000000284606.091.794.15 × 10−2
GTFPI24.683.233.60 × 10−6
Top Downregulated GeneACYTL1−6.13−6.652.34 × 10−4
BENSCAFG00000034058−8.17−5.923.31 × 10−2
CMYOC−8.04−7.972.58 × 10−13
DMEPE−7.33−7.471.06 × 10−12
EENSCAFG00000034058−6.61−5.923.31 × 10−2
FMEPE−9.05−7.471.06 × 10−12
GENSCAFG00000034058−6.86−5.923.31 × 10−2
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Nance, R.L.; Cooper, S.J.; Starenki, D.; Wang, X.; Matz, B.; Lindley, S.; Smith, A.N.; Smith, A.A.; Bergman, N.; Sandey, M.; et al. Transcriptomic Analysis of Canine Osteosarcoma from a Precision Medicine Perspective Reveals Limitations of Differential Gene Expression Studies. Genes 2022, 13, 680. https://doi.org/10.3390/genes13040680

AMA Style

Nance RL, Cooper SJ, Starenki D, Wang X, Matz B, Lindley S, Smith AN, Smith AA, Bergman N, Sandey M, et al. Transcriptomic Analysis of Canine Osteosarcoma from a Precision Medicine Perspective Reveals Limitations of Differential Gene Expression Studies. Genes. 2022; 13(4):680. https://doi.org/10.3390/genes13040680

Chicago/Turabian Style

Nance, Rebecca L., Sara J. Cooper, Dmytro Starenki, Xu Wang, Brad Matz, Stephanie Lindley, Annette N. Smith, Ashley A. Smith, Noelle Bergman, Maninder Sandey, and et al. 2022. "Transcriptomic Analysis of Canine Osteosarcoma from a Precision Medicine Perspective Reveals Limitations of Differential Gene Expression Studies" Genes 13, no. 4: 680. https://doi.org/10.3390/genes13040680

APA Style

Nance, R. L., Cooper, S. J., Starenki, D., Wang, X., Matz, B., Lindley, S., Smith, A. N., Smith, A. A., Bergman, N., Sandey, M., Koehler, J., Agarwal, P., & Smith, B. F. (2022). Transcriptomic Analysis of Canine Osteosarcoma from a Precision Medicine Perspective Reveals Limitations of Differential Gene Expression Studies. Genes, 13(4), 680. https://doi.org/10.3390/genes13040680

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop