Next Article in Journal
Differential Immunogenicity and Lung Disease-Inducing Potential of Mycobacterium immunogenum Genotypes and Impact of Co-Exposure with Pseudomonas: Optimizing a Mouse Model of Chronic Hypersensitivity Pneumonitis
Previous Article in Journal
Unraveling the Molecular Basis of Color Variation in Dioscorea alata Tubers: Integrated Transcriptome and Metabolomics Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

In-Silico Identification of Novel Pharmacological Synergisms: The Trabectedin Case

by
Laura Mannarino
1,2,*,
Nicholas Ravasio
3,
Maurizio D’Incalci
1,2,
Sergio Marchini
2,† and
Marco Masseroli
3,†
1
Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, Pieve Emanuele, 20072 Milan, Italy
2
Laboratory of Cancer Pharmacology, IRCCS Humanitas Research Hospital, Via Manzoni 56, Rozzano, 20089 Milan, Italy
3
Dipartimento di Elettronica, Informazione e Bioingegneria, Politecnico di Milano, 20133 Milan, Italy
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Int. J. Mol. Sci. 2024, 25(4), 2059; https://doi.org/10.3390/ijms25042059
Submission received: 11 January 2024 / Revised: 6 February 2024 / Accepted: 6 February 2024 / Published: 8 February 2024
(This article belongs to the Section Molecular Pharmacology)

Abstract

:
The in-silico strategy of identifying novel uses for already existing drugs, known as drug repositioning, has enhanced drug discovery. Previous studies have shown a positive correlation between expression changes induced by the anticancer agent trabectedin and those caused by irinotecan, a topoisomerase I inhibitor. Leveraging the availability of transcriptional datasets, we developed a general in-silico drug-repositioning approach that we applied to investigate novel trabectedin synergisms. We set a workflow allowing the identification of genes selectively modulated by a drug and possible novel drug interactions. To show its effectiveness, we selected trabectedin as a case-study drug. We retrieved eight transcriptional cancer datasets including controls and samples treated with trabectedin or its analog lurbinectedin. We compared gene signature associated with each dataset to the 476,251 signatures from the Connectivity Map database. The most significant connections referred to mitomycin-c, topoisomerase II inhibitors, a PKC inhibitor, a Chk1 inhibitor, an antifungal agent, and an antagonist of the glutamate receptor. Genes coherently modulated by the drugs were involved in cell cycle, PPARalpha, and Rho GTPases pathways. Our in-silico approach for drug synergism identification showed that trabectedin modulates specific pathways that are shared with other drugs, suggesting possible synergisms.

1. Introduction

The long duration of time required for a new drug to be placed on the market, with related cost-intensive investments and the high risk of failure from the development to the clinical application, pushes toward the implementation of faster strategies for the identification of novel compounds [1,2]. Information technology and data mining have made this possible through the development of in-silico approaches, like drug repositioning, which refers to drugs that have a different indication from the one they were initially designed for. The advantage of this approach is that repurposing already existing drugs reduces considerably the time to market and the costs and somehow prevents risks since it deals with compounds whose potential side effects are already known [1,2,3].
Our study is situated in this context and has two main objectives with both a methodological and a biological nature: (1) defining a general methodological workflow for the identification of genes selectively modulated by drugs of interest and for the discovery of potentially new combinations; (2) taking the drug trabectedin as an example use case to show validity and usefulness of the defined workflow, applying it to find genes that specifically characterize its mechanism, and to identify drugs inducing similar transcriptional profiles, which could be suitable for the exploration of new combinatorial therapies.
The choice of trabectedin for a case study was dictated by the long-lasting experience of our laboratory in the study of this drug of marine origin, which has an extraordinarily complex and interesting mechanism [4,5]. It acts as a transcriptional modulator, whose efficacy depends on a functional DNA repair pathway, and it also has a specific effect on the tumor microenvironment [5]. Probably due to its pleiotropic MoA, trabectedin has been demonstrated to be a good candidate for drug combinations. It has been used in combination with a topoisomerase I inhibitor, like irinotecan, for treating Ewing sarcoma [6,7] or in combination with anthracyclines for the therapy for soft-tissue sarcoma [8] and ovarian cancer [9,10]. Recently, a phase II study testing the combination with an anti-diabetic compound, known as pioglitazone, has started for patients affected with myxoid liposarcoma [11,12]. To overcome an important clinical side effect of trabectedin, i.e., liver toxicity, an analog drug called lurbinectedin has been developed and introduced in the clinic [13]. Lurbinectedin contains the same pentacyclic skeleton of the fused tetrahydroisoquinoline rings as trabectedin, but the additional tetrahydroisoquinoline of trabectedin is replaced by a tetrahydro β-carboline. Given their similar structure, the two drugs present an overlapping MoA, and, at the molecular level, they induce similar transcriptional effects [14,15]. Interestingly, in our previous work, we found and confirmed the synergy between trabectedin and irinotecan through a drug repositioning approach [16].
One attractive drug repositioning procedure is the “guilt by association” principle. This approach assumes that if two drugs elicit a similar transcriptional profile, they are likely to share the same mode of action (MoA), independently of the biological system (cell line or tissue) they have been applied to [3,17]. This strategy has proven to be effective in finding connections between drugs that have also been successful in the clinic [18]. The reasons for looking for drugs with similar transcriptional responses are multiple: (1) A compound can improve the effectiveness of another one. (2) Given an enhanced response, the administration dose can be modulated. (3) Consequently, to the first two points, dose modulation can help in lowering the side effects. One of the already available implementations for drug repositioning is the Connectivity Map (CMap) [19,20], which offers a database of 476,251 expression profiles derived from the treatment of different cell lines with 27,927 molecules. CMap aims to find connections among cells and drug treatments and allows searching for synergisms through the “guilt by association” principle.
Guided by these premises, we hypothesized that trabectedin and its analog lurbinectedin could have a selective action on specific genes in different cellular systems for which some sensitivities to the drugs can be exploited through drug repositioning approaches.

2. Results

2.1. Workflow for Drug-Specific Gene Selection

To identify genes that are selectively modulated by a drug of interest, we set up a workflow that allows determining drug-specific genes starting from public or in-house datasets. Figure 1 shows the workflow, which is divided into five main steps.
Briefly, the first step (A) consists of extracting a gene signature for each dataset of interest using up- and down-regulated differentially expressed genes. In step B, the signatures are loaded on the clue.io platform, which provides functions for signature–signature comparison using the Connectivity Map database. Given the interest in the transcriptional effects of drugs, the analysis is restricted to the compounds only. In step C, four identifiers (pert_id, pert_iname, cell_id, and gene_info) are used to extract genes and their expression values, expressed as z-score, from the CMap database in gctx format. Genes with their expression values are then compared with each signature from step A to identify common genes with the same direction of transcriptional regulation to obtain a gene list (step D). Finally (step E), the extracted gene lists are evaluated through pathway analysis, to identify their associated relevant pathways and are further investigated with correlation and clustering analysis to identify similarities between drugs. The following sections show in detail the results obtained from each step of the defined workflow when applied to the considered case of trabectedin.

2.2. Selection of Transcriptional Datasets

We applied our workflow to a specific drug of interest, namely trabectedin and its analog lurbinectedin, which share the same mechanism of action. First, we defined the whole set of data useful to investigate the transcriptional effects of trabectedin/lurbinectedin. We used data generated with the same type of technology (i.e., microarray platforms), with the availability of treated vs. untreated samples that allow for identifying transcriptionally modulated genes upon drug exposure. We selected both tumor and non-tumor datasets from the ArrayExpress [21] or Gene Expression Omnibus (GEO) [22] public databases, as listed in Table 1. They encompass cell line models of different tumors, like MV411 (myelomonocytic leukemia), JN-DSCRT1 (desmoplastic sarcoma), OCILy7 (lymphoma), U2932 (lymphoma), SHP-77 (small cell lung cancer), immune systems cells (monocytes), and two patient-derived xenograft models of myxoid liposarcoma (PDX-II and PDX-III).

2.3. Extraction of Gene Signatures

The main transcriptional effects of a drug can be best attributed to the genes that show the highest response upon treatment. In CMap, these genes can be loaded as a query gene list of the most modulated genes under the considered conditions. Thus, for each dataset, we extracted a list of the first 150 most up-regulated and the first 150 most down-regulated genes, when present, and drew a dataset-specific gene signature for each considered dataset. All signatures are reported in Table S1.

2.4. Identification of Compounds Correlated with the Drug of Interest

According to our defined workflow, the signature–signature comparison was performed using the CMap tool available on the clue.io platform. The only compound selection mode allows for selecting the compounds most connected to the gene expression profiles of the drug of interest (in our case trabectedin/lurbinectedin). In this way, in the CMap database, we found 10 compounds having high connections with trabectedin or lurbinectedin, namely the topoisomerase I inhibitor irinotecan or its active metabolite SN-38, topoisomerase II inhibitors (amsacrine and teniposide), mitomycin-c which binds to the DNA minor groove, SB-218078 which is an inhibitor of Chk1 kinase, SIB-1893 which is an antagonist of glutamate receptor, importazole which is an inhibitor of importin-Beta, bisindolylmalemide which is a protein kinase inhibitor, and pyrvinium-pamoate which is an anti-helminthync agent that inhibits Wnt pathway, as shown in the heatmap in Figure 2.
The heatmap color indicates the tau score, ranging from −100 (blue) to +100 (red), representing the value of the connection calculated between the query signature and each compound–cell line pair. For a better comprehension of the results, we reported the pre-identified signatures in the first line, depicted with colors as in the legend, while in the second line we showed the CMap cell line, again reported with colors as in the legend. Except for the pairs PDX-II-ET-24h1dose or PDX-II-ET-24h3dose and bisindolylmaleimide or SB-218078, overall, the trabectedin/lurbinectedin-induced signatures showed a high connection with all 10 compounds, independently on the cell line of origin. Heatmaps specific to each dataset are reported in Figure S1A–H.

2.5. Identification of Genes with the Same Transcriptional Modulation

Once the highly connected compounds were identified, the investigation was moved to a deeper level. Since the connections of CMap are derived from genes having the same activation or inhibition following drug treatment, we aimed at identifying those genes coherently modulated by the drug of interest (in our case, both trabectedin/lurbinectedin) and each identified compound. Thus, from the GEO GSE92742 CMap data, we selected the genes and their transcriptional values from all the cell line–compound pairs previously found. These were compared to the genes of the query signatures and only common genes with the same direction of regulation (up or down) in at least 70% of all the samples were selected (Table S2). To improve the investigation of the obtained common gene profiles, we checked whether the query signatures presented similarities among each other based on the selected genes. Thus, we made their clustering analysis (as specified in the Methods section), which identified two distinct groups: one composed of the JNDSCRT1, PDX-II, PDX-III, and MV411 datasets, and another one consisting of the U2932, OCILy7, SHP-77, and monocytes datasets (depicted in orange and green, respectively, in Figure S2A). Each pair of signatures taken from the two groups presented a high correlation within the same group and less with the other one (Figure S2B). Through this approach, we improved the investigation of the obtained common gene profiles and identified two main groups explaining the transcriptional similarities of the initial datasets.

2.6. Common Pathways

Gene lists can be meaningless if not placed in the biological context. Thus, a pathway analysis of the identified common genes was performed. As shown in Figure 3, in our case, we found 12 pathways: the Cyclin A/B1/B2 associated events during the G2/M transition pathway was associated with the activity of 8 out of 10 compounds, as expected from compounds of the same drug family (i.e., topoisomerase inhibitors); SN-38, irinotecan, and teniposide had the most pathways in common (7 out of 12), while the mitomycin-c and SB-218078 compounds showed the involvement of the PPARalpha pathway in the regulation of transcription.
The considered trabectedin use case demonstrates that the workflow we defined enables the identification of drugs inducing transcriptional effects similar to a specific drug treatment of interest. Indeed, the approach can be applied for the study of any drug.

3. Discussion

We developed a workflow for identifying drugs with similarities in their mechanism of action, taking advantage of the CMap database, and extracting and annotating coherently modulated genes. It can be easily applied to any drug of interest. As an example of a relevant application in our pipeline, we selected trabectedin and its analog lurbinectedin as candidate drugs for such a study, since these two drugs share similar and overlapping transcriptional profiles [14,15].
To work with similar data sources, we selected gene expression profiles from microarray technology, used to assess the transcriptional effect elicited by either trabectedin or lurbinectedin with respect to a control condition. Overall, we worked with datasets from monocytes of healthy donors or preclinical models of different tumors, like myxoid liposarcoma, desmoplastic sarcoma, lymphoma, and small-cell lung cancer.
The drug-induced transcriptional profiles were used to build gene signatures, i.e., lists of the most modulated genes sorted based on their differential expression; these were used as inputs for the CMap software (https://clue.io, accessed on 1 August 2020) [19] to identify drug–gene interactions. The use of these signatures of most modulated genes allowed us to consider each dataset as independent, thus overcoming any technical issue in comparing gene profiles from different sources and platforms while preserving the biological information on the drug’s effect. The method we used to select drugs with a similar mechanism of action was the so-called “guilt by association” principle as follows: if two drugs modulate the same genes in the same way, they are likely to share their MoA [1,3]. Thus, we focused on the transcriptionally modulated gene profiles rather than synergisms guided by the same molecular drug structure. Guided by this assumption, we used CMap to select the 10 most connected, e.g., correlated, drugs to trabectedin/lurbinectedin. The choice of using only 10 compounds was guided by the intent to focus on the closest drugs; however, the number of considered compounds can be freely chosen according to the specific analytical needs. Even though trabectedin and lurbinectedin are structurally very different from the other compounds for which a potential synergism is envisaged, they share the high lipophilicity of their active metabolites.
The workflow we propose in this work represents a step forward for the selection of genes that support the high connection between drugs. To this aim, we used the GSE92742 data from the CMap database. This dataset provides the genes modulated by all the compounds of the CMap database with their associated differential transcriptional value, expressed as a z-score, facilitating the identification of those genes that are up-regulated (positive z-scores) or down-regulated (negative z-scores) by a drug. Moreover, the GSE92742 data supplies useful metadata about compounds and cell lines, which are key to precisely identifying the datasets of interest. By comparing trabectedin/lurbinectedin signatures with the GSE92742 gene profiles and requiring only common genes with the same regulation direction (up or down) in at least 70% of all the samples, we were able to gather all the genes that support the high connections with trabectedin/lurbinectedin for each of the 10 most connected compounds.
Finally, we questioned whether the identified common genes could be linked to specific biological characteristics (i.e., annotations). Performing enrichment analysis, we showed that the main pathways under trabectedin/lurbinectedin MoA are related to the cell cycle, especially in the blockage of the G2/M phase. The cell cycle blockage in this phase is a hallmark of trabectedin [4,5,23]. It has been described in prostate cancer stem cells [24], and with an enhanced effect when trabectedin is administered in combination with other drugs like PARP1 inhibitors in sarcoma [25] or campthotecin in sarcoma models [26]. The arrest in the G2/M has also been connected to the other 8 out of the 10 compounds evaluated in this study. The other important biological function shared by trabectedin and the other identified compounds, especially mitomycin-c and SB-218078, is the transcriptional regulation guided by PPARalpha, a pathway that is already possible to target with PPARalpha-agonists, for example in the treatment of metabolic syndromes [27].
Among the obtained results, we found a high connection between trabectedin/lurbinectedin and the class of topoisomerase-I inhibitors. The efficacy of the combinatorial effect of trabectedin and one of these compounds, named irinotecan, was already validated in preclinical models of rhabdomyosarcoma [28] and in the clinic for the management of Ewing sarcoma [6,7]. The combinatorial administration of trabectedin and irinotecan already validated in the clinical setting further supports the potential of our approach.
Overall, we showed good results obtained by the application of our workflow. These prove the validity of the workflow we defined, which can be straightforwardly generalized to any drug of interest for which the search for novel synergisms is required. The identification of drugs with similar MoA opens new hypotheses for the exploration of novel combinatorial drug therapies.

4. Materials and Methods

4.1. Dataset

The used datasets were extracted from ArrayExpress [21] or Gene Expression Omnibus [22] according to the following criteria:
  • Availability of control samples under untreated conditions;
  • Availability of samples treated with either trabectedin or lurbinectedin;
  • At least three samples for each condition (either treated or untreated);
  • Gene expression data generated by microarray technology.
The final eight selected datasets are reported in Table 1 and further detailed in Table S3.

4.2. Microarray Data Analysis

Each dataset was analyzed independently. Microarray data analysis was performed as previously reported [16], starting from normalized data when available (all datasets, except for OCLy7 and U2932); otherwise, the R package Limma version 3.46.0 [29] was used to normalize the data with the “quantile” normalization method [30]. All datasets were annotated according to the corresponding microarray source and annotation package, as reported in Table 2. The R package Limma version 3.46.0 [29] was used to identify differentially expressed genes (DEGs) by comparing treated conditions to controls. DEGs were filtered with a q-value of 0.05.

4.3. Connectivity Map

The Connectivity Map resource was accessed through the clue.io platform (https://clue.io, accessed on 1 August 2020) to identify drugs that elicit a similar response to that of trabectedin or lurbinectedin based on differential gene expression signatures. CMap allows for finding connections between diseases, genes, and pharmacological compounds.

4.4. Definition of a Gene Signature

The identified DEGs were sorted in descending order according to their base-two logarithmic-fold change (logFC). Since CMap allows queries with a maximum of 300 tags (i.e., genes), the first 150 most up-regulated and the first 150 most down-regulated DEGs for each dataset were selected when present. Only genes belonging to the BING (Best INferred Genes) [31] list of the clue.io platform were retained. For each dataset, these lists of DEGs were defined as gene signatures using the listmaker function of clue.io.

4.5. CMap Query

The clue.io platform provides different functions for the upload and analysis of data. All eight created gene signatures were loaded through the query function and used to identify compounds with similar expression profiles, which were then visualized with the heatmap function. Given the interest in searching for drugs with a similar mechanism of action following the “guilt by association” principle [3,17], results were limited to the compounds. They were sorted from the most to the least expression–correlated ones, based on the tau score. This last one, also known as “the CMap connectivity score”, is defined as the fraction of reference gene sets (originating from the signatures present in Touchstone, the reference database of CMap) with greater similarity to the compound than that of the gene signature under query, as reported in the clue.io documentation [31]. The tau score ranges from −100 to +100; when the score is higher (or lower) than +90 (or −90), the result is considered worthy of further investigation, and the CMap query, i.e., the loaded gene signature, can be defined as “connected” to that result, i.e., compound. All results were saved in .gctx file format [32] for further processing.

4.6. CMap Query Result Selection

The results in .gctx format were parsed with iPython [33] through a GCTx parser of cmapPy [32] imported with the specific Python instruction from cmapPy.pandasGEXpress.parse import parser. The first 10 results were selected according to their tau score. Graphic representations through heatmaps were created with the seaborn version v0.101 package using the clustermap function [34].

4.7. Identification of Genes with the Same Up- or Down-Regulation Direction

The selection of genes with commonalities between each dataset and the CMap database, and with the same up or down direction of regulation by the treatment, was conducted as follows.
Gene expression data related to the CMap dataset were retrieved from the Gene Expression Omnibus—GEO (https://www.ncbi.nlm.nih.gov/geo, accessed on 20 July 2020). CMap datasets are based on Phase I data [20], which are stored in GEO as Series GSE92742; from them, we specifically used the following:
  • GSE92742_Broad_LINCS_sig_info.txt, to retrieve information on cell lines and compounds;
  • GSE92742_Broad_LINCS_gene_info.txt, to retrieve information on genes;
  • GSE92742_Broad_LINCS_Level5_COMPZ.MODZ_n473647x12328.gctx, representing the level 5 data z-score vector [19].
The Python cmapPy package, version 4.0.1 [32] was used to parse the GSE92742 gctx file. The gctx format is an evolution of the gct text file, specifically for the storage of large matrices associated with metadata annotations [32].
  • The sig_info file (GSE92742_Broad_LINCS_sig_info.txt) was used to subset the previous gctx file based on the following:
  • The pertubagen (i.e., compound) id (pert_id), using the pert_id corresponding to the first 10 results selected as previously reported in section “CMap query result selection”, as reported in Table 3;
  • The pertubagen name (pert_iname), using the pert_iname corresponding to the first 10 results selected as previously reported in section “CMap query result selection”, as reported in Table 3;
  • The cell lines of human origin of interest (cell_id) associated with the CMap database, as reported in Table 4.
The gene_info file (GSE92742_Broad_LINCS_gene_info.txt) was used to select only the 978 landmark genes defined as widely expressed across lineage, which have been determined through the L1000 assay according to the clue.io documentation [31].
The application of these filters resulted in the final file in gctx format. Then, each gene signature related to the datasets of the cell lines in Table 4 was divided into up- and down-regulated genes, which were used to subset the selected gctx file independently for each dataset. Only genes with the same up- or down-regulation direction in at least 70% of all considered samples were selected.

4.8. Correlation and Clustering

Pearson correlation between the gene signatures of each pair of datasets was conducted with the corr() function of the Python pandas package version 1.0.4 for data analysis using the default method of the Pearson correlation [35]. Dataset clustering was performed with the dendrogram and linkage functions from the sciPy library version 1.5.0 [36], using the Ward variance minimization algorithm.

4.9. Functional Analysis

Genes were annotated using the org.Hs.eg.db [37] database of human annotations. Common genes were analyzed for functional classification with the clusterProfiler version 3.18.1 R package [38], using both the groupGO function to query the gene ontology (molecular function—MF ontology and biological process—BP ontology) [39] and the enrichPathway function for enrichment analysis, based on hypergeometric test, with the Reactome database [40], using p-value cutoff = 0.05 and the multiple testing correction of p-value with the Benjamini–Hochberg procedure [41]. Pathway visualization was conducted with iPython [33] using the matplotlib version 3.2.2 [42] and the seaborn version v0.10.1 [34] packages.

4.10. Workflow Implementation

The defined workflow was implemented with an iPython notebook, publicly available at https://drive.google.com/drive/folders/1g2w2fOVH8ulAZ82PYR8SAT4fO3PPgf_3?usp=share_link, accessed on 5 February 2024.

5. Conclusions

The present work provides a workflow to investigate potential drug synergisms. The results presented here show that our workflow is useful for the discovery of drugs with a similar MoA and for the identification and pathway analysis of the genes that support the hypothesis of these pharmacological synergisms.
The merits of this work rely on (1) the definition and automation of an easily reproducible workflow applicable to the study of any drug; (2) the use of heterogeneous biological models, such as cell lines, patient-derived xenograft, and human blood cells, which help in the establishment of genes whose expression modulation is strictly drug-dependent; and (3) the identification of novel compounds that can potentially be used in synergy with trabectedin or its analog lurbinectedin, like mitomycin-c, amsacrine, bisindolylmaleimide, pyrivinium-pamoate, SIB-1893, and SB-218078, which were never proposed before as such.
Being an in-silico approach, our work has the limitation of lacking experimental validation. The strength of the results obtained through these approaches strongly relies on the following confirmation with laboratory assessments. Indeed, when computationally computed drugs advance to the clinical stage, their chances of failure are notably high [43]. The causes can be identified by the use of too-simple models, like the cell lines in the CMap database, that are far from recapitulating the complexity of biological systems and make it difficult to generalize results to the real population. On the other hand, it is worth highlighting that in-silico methods do not presume to provide ready-to-use solutions; instead, they are positioned in the initial stages of the experimental process aimed at quickly generating a hypothesis.
Given the previous considerations, as a future extension of the presented work, experimental biological testing is needed to assess and finally validate the relevant pharmacological results obtained in the example application of our workflow.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms25042059/s1.

Author Contributions

Conceptualization, L.M., S.M. and M.M.; methodology, L.M.; software, L.M.; validation, L.M., S.M. and M.M.; formal analysis, L.M. and N.R.; investigation, L.M. and N.R.; resources, L.M.; data curation, L.M.; writing—original draft preparation, L.M.; writing—review and editing, S.M., M.M. and M.D.; visualization, L.M. and N.R.; supervision, S.M. and M.M.; project administration, S.M. and M.M.; funding acquisition, M.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Fondazione AIRC per la Ricerca sul Cancro, grant number AIRC IG 2019 ID 23059 to MDI.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

At https://drive.google.com/drive/folders/1g2w2fOVH8ulAZ82PYR8SAT4fO3PPgf_3?usp=share_link, accessed on 5 February 2024, we provide the defined workflow implemented within iPython notebooks.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Pushpakom, S.; Iorio, F.; Eyers, P.A.; Escott, K.J.; Hopper, S.; Wells, A.; Doig, A.; Guilliams, T.; Latimer, J.; McNamee, C.; et al. Drug Repurposing: Progress, Challenges and Recommendations. Nat. Rev. Drug Discov. 2019, 18, 41–58. [Google Scholar] [CrossRef]
  2. Jourdan, J.-P.; Bureau, R.; Rochais, C.; Dallemagne, P. Drug Repositioning: A Brief Overview. J. Pharm. Pharmacol. 2020, 72, 1145–1151. [Google Scholar] [CrossRef]
  3. Iorio, F.; Bosotti, R.; Scacheri, E.; Belcastro, V.; Mithbaokar, P.; Ferriero, R.; Murino, L.; Tagliaferri, R.; Brunetti-Pierri, N.; Isacchi, A.; et al. Discovery of Drug Mode of Action and Drug Repositioning from Transcriptional Responses. Proc. Natl. Acad. Sci. USA 2010, 107, 14621–14626. [Google Scholar] [CrossRef] [PubMed]
  4. D’Incalci, M.; Galmarini, C.M. A Review of Trabectedin (ET-743): A Unique Mechanism of Action. Mol. Cancer Ther. 2010, 9, 2157–2163. [Google Scholar] [CrossRef] [PubMed]
  5. D’Incalci, M.; Badri, N.; Galmarini, C.M.; Allavena, P. Trabectedin, a Drug Acting on Both Cancer Cells and the Tumour Microenvironment. Br. J. Cancer 2014, 111, 646–650. [Google Scholar] [CrossRef] [PubMed]
  6. Grünewald, T.G.; Alonso, M.; Avnet, S.; Banito, A.; Burdach, S.; Cidre-Aranaz, F.; Di Pompo, G.; Distel, M.; Dorado-Garcia, H.; Garcia-Castro, J.; et al. Sarcoma Treatment in the Era of Molecular Medicine. EMBO Mol. Med. 2020, 12, e11131. [Google Scholar] [CrossRef] [PubMed]
  7. Baruchel, S.; Pappo, A.; Krailo, M.; Baker, K.S.; Wu, B.; Villaluna, D.; Lee-Scott, M.; Adamson, P.C.; Blaney, S.M. A Phase 2 Trial of Trabectedin in Children with Recurrent Rhabdomyosarcoma, Ewing Sarcoma and Non-Rhabdomyosarcoma Soft Tissue Sarcomas: A Report from the Children’s Oncology Group. Eur. J. Cancer 2012, 48, 579–585. [Google Scholar] [CrossRef] [PubMed]
  8. Demetri, G.D.; von Mehren, M.; Jones, R.L.; Hensley, M.L.; Schuetze, S.M.; Staddon, A.; Milhem, M.; Elias, A.; Ganjoo, K.; Tawbi, H.; et al. Efficacy and Safety of Trabectedin or Dacarbazine for Metastatic Liposarcoma or Leiomyosarcoma after Failure of Conventional Chemotherapy: Results of a Phase III Randomized Multicenter Clinical Trial. J. Clin. Oncol. 2016, 34, 786–793. [Google Scholar] [CrossRef] [PubMed]
  9. Pignata, S.; Scambia, G.; Villanucci, A.; Naglieri, E.; Ibarbia, M.A.; Brusa, F.; Bourgeois, H.; Sorio, R.; Casado, A.; Reichert, D.; et al. A European, Observational, Prospective Trial of Trabectedin Plus Pegylated Liposomal Doxorubicin in Patients with Platinum-Sensitive Ovarian Cancer. Oncologist 2021, 26, e658–e668. [Google Scholar] [CrossRef] [PubMed]
  10. Colombo, N.; Gadducci, A.; Sehouli, J.; Rulli, E.; Mäenpää, J.; Sessa, C.; Montes, A.; Ottevanger, N.B.; Berger, R.; Vergote, I.; et al. INOVATYON/ENGOT-Ov5 Study: Randomized Phase III International Study Comparing Trabectedin/Pegylated Liposomal Doxorubicin (PLD) Followed by Platinum at Progression vs Carboplatin/PLD in Patients with Recurrent Ovarian Cancer Progressing within 6–12 Months after Last Platinum Line. Br. J. Cancer 2023, 128, 1503–1513. [Google Scholar] [CrossRef]
  11. Mario Negri Institute for Pharmacological Research. A Phase II Study on Trabectedin in Combination with PPARg Agonist Pioglitazone in Patients with Round Cell Myxoid Liposarcomas or Dedifferentiated G1 and G2 Liposarcomas with Stable Disease after a Monotherapy with Trabectedin. (TRABEPIO); 2021. Available online: https://classic.clinicaltrials.gov/ct2/show/NCT04794127 (accessed on 7 February 2024).
  12. Frapolli, R.; Bello, E.; Ponzo, M.; Craparotta, I.; Mannarino, L.; Ballabio, S.; Marchini, S.; Carrassa, L.; Ubezio, P.; Porcu, L.; et al. Combination of PPARγ Agonist Pioglitazone and Trabectedin Induce Adipocyte Differentiation to Overcome Trabectedin Resistance in Myxoid Liposarcomas. Clin. Cancer Res. 2019, 25, 7565–7575. [Google Scholar] [CrossRef] [PubMed]
  13. Markham, A. Lurbinectedin: First Approval. Drugs 2020, 80, 1345–1353. [Google Scholar] [CrossRef] [PubMed]
  14. Santamaría Nuñez, G.; Robles, C.M.G.; Giraudon, C.; Martínez-Leal, J.F.; Compe, E.; Coin, F.; Aviles, P.; Galmarini, C.M.; Egly, J.-M. Lurbinectedin Specifically Triggers the Degradation of Phosphorylated RNA Polymerase II and the Formation of DNA Breaks in Cancer Cells. Mol. Cancer Ther. 2016, 15, 2399–2412. [Google Scholar] [CrossRef] [PubMed]
  15. Belgiovine, C.; Bello, E.; Liguori, M.; Craparotta, I.; Mannarino, L.; Paracchini, L.; Beltrame, L.; Marchini, S.; Galmarini, C.M.; Mantovani, A.; et al. Lurbinectedin Reduces Tumour-Associated Macrophages and the Inflammatory Tumour Microenvironment in Preclinical Models. Br. J. Cancer 2017, 117, 628–638. [Google Scholar] [CrossRef] [PubMed]
  16. Mannarino, L.; Paracchini, L.; Craparotta, I.; Romano, M.; Marchini, S.; Gatta, R.; Erba, E.; Clivio, L.; Romualdi, C.; D’Incalci, M.; et al. A Systems Biology Approach to Investigate the Mechanism of Action of Trabectedin in a Model of Myelomonocytic Leukemia. Pharmacogenom. J. 2018, 18, 56–63. [Google Scholar] [CrossRef] [PubMed]
  17. Iorio, F.; Shrestha, R.L.; Levin, N.; Boilot, V.; Garnett, M.J.; Saez-Rodriguez, J.; Draviam, V.M. A Semi-Supervised Approach for Refining Transcriptional Signatures of Drug Response and Repositioning Predictions. PLoS ONE 2015, 10, e0139446. [Google Scholar] [CrossRef] [PubMed]
  18. Lussier, Y.A.; Chen, J.L. The Emergence of Genome-Based Drug Repositioning. Sci. Transl. Med. 2011, 3, 96ps35. [Google Scholar] [CrossRef]
  19. Lamb, J. The Connectivity Map: A New Tool for Biomedical Research. Nat. Rev. Cancer 2007, 7, 54–60. [Google Scholar] [CrossRef]
  20. Subramanian, A.; Narayan, R.; Corsello, S.M.; Peck, D.D.; Natoli, T.E.; Lu, X.; Gould, J.; Davis, J.F.; Tubelli, A.A.; Asiedu, J.K.; et al. A Next Generation Connectivity Map: L1000 Platform and the First 1,000,000 Profiles. Cell 2017, 171, 1437–1452.e17. [Google Scholar] [CrossRef]
  21. Athar, A.; Füllgrabe, A.; George, N.; Iqbal, H.; Huerta, L.; Ali, A.; Snow, C.; Fonseca, N.A.; Petryszak, R.; Papatheodorou, I.; et al. ArrayExpress Update–from Bulk to Single-Cell Expression Data. Nucleic Acids Res. 2019, 47, D711–D715. [Google Scholar] [CrossRef]
  22. Barrett, T.; Wilhite, S.E.; Ledoux, P.; Evangelista, C.; Kim, I.F.; Tomashevsky, M.; Marshall, K.A.; Phillippy, K.H.; Sherman, P.M.; Holko, M.; et al. NCBI GEO: Archive for Functional Genomics Data Sets—Update. Nucleic Acids Res. 2013, 41, D991–D995. [Google Scholar] [CrossRef]
  23. Larsen, A.K.; Galmarini, C.M.; D’Incalci, M. Unique Features of Trabectedin Mechanism of Action. Cancer Chemother. Pharmacol. 2016, 77, 663–671. [Google Scholar] [CrossRef]
  24. Acikgoz, E.; Guven, U.; Duzagac, F.; Uslu, R.; Kara, M.; Soner, B.C.; Oktem, G. Enhanced G2/M Arrest, Caspase Related Apoptosis and Reduced E-Cadherin Dependent Intercellular Adhesion by Trabectedin in Prostate Cancer Stem Cells. PLoS ONE 2015, 10, e0141090. [Google Scholar] [CrossRef]
  25. Pignochino, Y.; Capozzi, F.; D’Ambrosio, L.; Dell’Aglio, C.; Basiricò, M.; Canta, M.; Lorenzato, A.; Vignolo Lutati, F.; Aliberti, S.; Palesandro, E.; et al. PARP1 Expression Drives the Synergistic Antitumor Activity of Trabectedin and PARP1 Inhibitors in Sarcoma Preclinical Models. Mol. Cancer 2017, 16, 86. [Google Scholar] [CrossRef] [PubMed]
  26. Martinez-Cruzado, L.; Tornin, J.; Rodriguez, A.; Santos, L.; Allonca, E.; Fernandez-Garcia, M.T.; Astudillo, A.; Garcia-Pedrero, J.M.; Rodriguez, R. Trabectedin and Campthotecin Synergistically Eliminate Cancer Stem Cells in Cell-of-Origin Sarcoma Models. Neoplasia 2017, 19, 460–470. [Google Scholar] [CrossRef] [PubMed]
  27. van Raalte, D.H.; Li, M.; Pritchard, P.H.; Wasan, K.M. Peroxisome Proliferator-Activated Receptor (PPAR)-Alpha: A Pharmacological Target with a Promising Future. Pharm. Res. 2004, 21, 1531–1538. [Google Scholar] [CrossRef]
  28. Riccardi, A.; Meco, D.; Ubezio, P.; Mazzarella, G.; Marabese, M.; Faircloth, G.T.; Jimeno, J.; D’Incalci, M.; Riccardi, R. Combination of Trabectedin and Irinotecan Is Highly Effective in a Human Rhabdomyosarcoma Xenograft. Anticancer Drugs 2005, 16, 811–815. [Google Scholar] [CrossRef] [PubMed]
  29. Ritchie, M.E.; Phipson, B.; Wu, D.; Hu, Y.; Law, C.W.; Shi, W.; Smyth, G.K. Limma Powers Differential Expression Analyses for RNA-Sequencing and Microarray Studies. Nucleic Acids Res. 2015, 43, e47. [Google Scholar] [CrossRef]
  30. Bolstad, B.M.; Irizarry, R.A.; Astrand, M.; Speed, T.P. A Comparison of Normalization Methods for High Density Oligonucleotide Array Data Based on Variance and Bias. Bioinformatics 2003, 19, 185–193. [Google Scholar] [CrossRef] [PubMed]
  31. CONNECTOPEDIA [Clue.Io]. Available online: https://clue.io/connectopedia/ (accessed on 26 September 2022).
  32. Enache, O.M.; Lahr, D.L.; Natoli, T.E.; Litichevskiy, L.; Wadden, D.; Flynn, C.; Gould, J.; Asiedu, J.K.; Narayan, R.; Subramanian, A. The GCTx Format and cmap{Py, R, M, J} Packages: Resources for Optimized Storage and Integrated Traversal of Annotated Dense Matrices. Bioinformatics 2019, 35, 1427–1429. [Google Scholar] [CrossRef] [PubMed]
  33. Perez, F.; Granger, B.E. IPython: A System for Interactive Scientific Computing. Comput. Sci. Eng. 2007, 9, 21–29. [Google Scholar] [CrossRef]
  34. Waskom, M.L. Seaborn: Statistical Data Visualization. J. Open Source Softw. 2021, 6, 3021. [Google Scholar] [CrossRef]
  35. The Pandas Development Team. Pandas-Dev/Pandas. Available online: https://zenodo.org/records/10537285 (accessed on 7 February 2024).
  36. Virtanen, P.; Gommers, R.; Oliphant, T.E.; Haberland, M.; Reddy, T.; Cournapeau, D.; Burovski, E.; Peterson, P.; Weckesser, W.; Bright, J.; et al. SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nat. Methods 2020, 17, 261–272. [Google Scholar] [CrossRef]
  37. Carlson, M. org.Hs.eg.db: Genome Wide Annotation for Human. R Package Version 3.8.2. 2019. Available online: https://doi.org/10.18129/B9.bioc.org.Hs.eg.db (accessed on 7 February 2024).
  38. Wu, T.; Hu, E.; Xu, S.; Chen, M.; Guo, P.; Dai, Z.; Feng, T.; Zhou, L.; Tang, W.; Zhan, L.; et al. clusterProfiler 4.0: A Universal Enrichment Tool for Interpreting Omics Data. Innovation 2021, 2, 100141. [Google Scholar] [CrossRef] [PubMed]
  39. Gene Ontology Consortium The Gene Ontology Resource: Enriching a GOld Mine. Nucleic Acids Res. 2021, 49, D325–D334. [CrossRef]
  40. Jassal, B.; Matthews, L.; Viteri, G.; Gong, C.; Lorente, P.; Fabregat, A.; Sidiropoulos, K.; Cook, J.; Gillespie, M.; Haw, R.; et al. The Reactome Pathway Knowledgebase. Nucleic Acids Res. 2020, 48, D498–D503. [Google Scholar] [CrossRef] [PubMed]
  41. Benjamini, Y.; Hochberg, Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J. R. Stat. Soc. Ser. B (Methodol.) 1995, 57, 289–300. [Google Scholar] [CrossRef]
  42. Hunter, J.D. Matplotlib: A 2D Graphics Environment. Comput. Sci. Eng. 2007, 9, 90–95. [Google Scholar] [CrossRef]
  43. Begley, C.G.; Ashton, M.; Baell, J.; Bettess, M.; Brown, M.P.; Carter, B.; Charman, W.N.; Davis, C.; Fisher, S.; Frazer, I.; et al. Drug Repurposing: Misconceptions, Challenges, and Opportunities for Academic Researchers. Sci. Transl. Med. 2021, 13, eabd5524. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Schematic representation of the general workflow for the identification of genes selectively modulated by a drug of interest, which are defined and used in this work.
Figure 1. Schematic representation of the general workflow for the identification of genes selectively modulated by a drug of interest, which are defined and used in this work.
Ijms 25 02059 g001
Figure 2. Heatmap showing the first 10 compounds with the highest connection with trabectedin/lurbinectedin gene expression profiles. The names of the compounds are on the right and listed top-down from the most to the least connected one. The colors of the heatmap represent the tau score, ranging from −100 (low connection) to +100 (high connection). The darker the color the higher the connection: reds for positive values, blue for negative values. Lines on the top of the heatmap: the first represents the dataset signature as reported in the legend “signature”; the second represents the cell line as reported in the legend “cell line”.
Figure 2. Heatmap showing the first 10 compounds with the highest connection with trabectedin/lurbinectedin gene expression profiles. The names of the compounds are on the right and listed top-down from the most to the least connected one. The colors of the heatmap represent the tau score, ranging from −100 (low connection) to +100 (high connection). The darker the color the higher the connection: reds for positive values, blue for negative values. Lines on the top of the heatmap: the first represents the dataset signature as reported in the legend “signature”; the second represents the cell line as reported in the legend “cell line”.
Ijms 25 02059 g002
Figure 3. The figure shows on the x-axis the compounds analyzed in this work and on the y-axis the corresponding pathways identified as significant by the pathway analysis. Colored rectangles mean the significance of the pathway for the considered compound–pathway pairs. Colors represent the number of genes in the pathway as reported in the legend on the left (N genes). White indicates no significant pathway.
Figure 3. The figure shows on the x-axis the compounds analyzed in this work and on the y-axis the corresponding pathways identified as significant by the pathway analysis. Colored rectangles mean the significance of the pathway for the considered compound–pathway pairs. Colors represent the number of genes in the pathway as reported in the legend on the left (N genes). White indicates no significant pathway.
Ijms 25 02059 g003
Table 1. List of the selected datasets.
Table 1. List of the selected datasets.
Dataset IDBrief DescriptionOriginAvailable atAccession ID
MV411Gene expression profiling of a cell line model of myelomonocytic leukemiaCell lineArrayExpressE-MTAB-2978
JN-DSCRT1Desmoplastic small round cell sarcomaCell lineArrayExpressE-MTAB-4532
PDX-IIMyxoid liposarcoma—type IIPatient-derived xenograftArrayExpressE-MTAB-8632
PDX-IIIMyxoid liposarcoma type IIIPatient-derived xenograftArrayExpressE-MTAB-8632
OCILy7Lymphoma DLBCL subtype germinal center B cell (GBC)Cell lineGene Expression OmnibusGSE104197
U2932Lymphoma DLBCL subtype activated B cell-like (ABC)Cell lineGene Expression OmnibusGSE104197
MonocytesImmune system cells from donorsBlood cellsArrayExpressE-MTAB-5366
SHP-77Small cell lung cancerCell linein-house-
Table 2. Annotation packages and microarray sources of the used datasets.
Table 2. Annotation packages and microarray sources of the used datasets.
Dataset IDAnnotation PackageSource
MV4-11HsAgilentDesign026652Agilent, Santa Clara, CA, USA
JNDSCRT1HsAgilentDesign039494Agilent, Santa Clara, CA, USA
PDX-II, PDX-IIIHsAgilentDesign026652Agilent, Santa Clara, CA, USA
OCILy7, U2932IlluminaHumanv4Illumina, San Diego, CA, USA
MonocytesHsAgilentDesign039494Agilent, Santa Clara, CA, USA
SHP-77HsAgilentDesign039494Agilent, Santa Clara, CA, USA
Table 3. The unique compound identifier (pert_id) and name (pert_iname) of each considered compound.
Table 3. The unique compound identifier (pert_id) and name (pert_iname) of each considered compound.
pert_idpart_iname
BRD-A02481876importazole
BRD-A06352508SB-218078
BRD-A35588707teniposide
BRD-A36630025SN-38
BRD-A48237631mitomycin-c
BRD-K08547377irinotecan
BRD-K31342827bisindolylmaleimide
BRD-K67439147SIB-1893
BRD-K98490050amsacrine
BRD-M86331534pyrvinium-pamoate
Table 4. Considered cell lines of the CMap database.
Table 4. Considered cell lines of the CMap database.
cell_idBrief Description
A375malignant melanoma
A549non-small cell lung carcinoma
HA1Ekidney epithelial immortalized (non-cancer cell line)
HCC515non-small cell lung adenocarcinoma
HEPG2hepatocellular carcinoma
HT29colorectal adenocarcinoma
MCF7breast adenocarcinoma
PC3prostate adenocarcinoma
VCAPmetastatic prostate cancer
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Mannarino, L.; Ravasio, N.; D’Incalci, M.; Marchini, S.; Masseroli, M. In-Silico Identification of Novel Pharmacological Synergisms: The Trabectedin Case. Int. J. Mol. Sci. 2024, 25, 2059. https://doi.org/10.3390/ijms25042059

AMA Style

Mannarino L, Ravasio N, D’Incalci M, Marchini S, Masseroli M. In-Silico Identification of Novel Pharmacological Synergisms: The Trabectedin Case. International Journal of Molecular Sciences. 2024; 25(4):2059. https://doi.org/10.3390/ijms25042059

Chicago/Turabian Style

Mannarino, Laura, Nicholas Ravasio, Maurizio D’Incalci, Sergio Marchini, and Marco Masseroli. 2024. "In-Silico Identification of Novel Pharmacological Synergisms: The Trabectedin Case" International Journal of Molecular Sciences 25, no. 4: 2059. https://doi.org/10.3390/ijms25042059

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop