miRBind: A Deep Learning Method for miRNA Binding Classification
Abstract
1. Introduction
2. Materials and Methods
2.1. Data Preparation
2.2. Independent Chimeric Read Dataset (miRNA eCLIP)
2.3. Benchmarking Approaches
2.3.1. CNN Approach
2.3.2. DNABERT
2.3.3. miRBind
2.3.4. RNAhybrid
2.3.5. RNACofold
2.3.6. RNA22
2.3.7. Seed
2.3.8. Web Interface
2.3.9. Evaluation Measures
3. Results
4. Discussion
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Bartel, D.P. Metazoan MicroRNAs. Cell 2018, 173, 20–51. [Google Scholar] [CrossRef]
- Lee, R.C.; Feinbaum, R.L.; Ambros, V. The C. Elegans Heterochronic Gene Lin-4 Encodes Small RNAs with Antisense Complementarity to Lin-14. Cell 1993, 75, 843–854. [Google Scholar] [CrossRef] [PubMed]
- Wightman, B.; Ha, I.; Ruvkun, G. Posttranscriptional Regulation of the Heterochronic Gene Lin-14 by Lin-4 Mediates Temporal Pattern Formation in C. Elegans. Cell 1993, 75, 855–862. [Google Scholar] [CrossRef] [PubMed]
- Pasquinelli, A.E.; Reinhart, B.J.; Slack, F.; Martindale, M.Q.; Kuroda, M.I.; Maller, B.; Hayward, D.C.; Ball, E.E.; Degnan, B.; Müller, P.; et al. Conservation of the Sequence and Temporal Expression of Let-7 Heterochronic Regulatory RNA. Nature 2000, 408, 86–89. [Google Scholar] [CrossRef] [PubMed]
- Kozomara, A.; Griffiths-Jones, S. MiRBase: Integrating MicroRNA Annotation and Deep-Sequencing Data. Nucleic Acids Res. 2011, 39, D152–D157. [Google Scholar] [CrossRef] [PubMed]
- Adams, L. Pri-MiRNA Processing: Structure Is Key. Nat. Rev. Genet. 2017, 18, 145. [Google Scholar] [CrossRef] [PubMed]
- Lund, E.; Güttinger, S.; Calado, A.; Dahlberg, J.E.; Kutay, U. Nuclear Export of MicroRNA Precursors. Science 2004, 303, 95–98. [Google Scholar] [CrossRef] [PubMed]
- O’Brien, J.; Hayder, H.; Zayed, Y.; Peng, C. Overview of MicroRNA Biogenesis, Mechanisms of Actions, and Circulation. Front. Endocrinol. 2018, 9, 402. [Google Scholar] [CrossRef]
- Saliminejad, K.; Khorram Khorshid, H.R.; Soleymani Fard, S.; Ghaffari, S.H. An Overview of MicroRNAs: Biology, Functions, Therapeutics, and Analysis Methods. J. Cell. Physiol. 2019, 234, 5451–5465. [Google Scholar] [CrossRef]
- Filipowicz, W.; Bhattacharyya, S.N.; Sonenberg, N. Mechanisms of Post-Transcriptional Regulation by MicroRNAs: Are the Answers in Sight? Nat. Rev. Genet. 2008, 9, 102–114. [Google Scholar] [CrossRef]
- Dueck, A.; Ziegler, C.; Eichner, A.; Berezikov, E.; Meister, G. MicroRNAs Associated with the Different Human Argonaute Proteins. Nucleic Acids Res. 2012, 40, 9850–9862. [Google Scholar] [CrossRef] [PubMed]
- Pasquinelli, A.E. MicroRNAs and Their Targets: Recognition, Regulation and an Emerging Reciprocal Relationship. Nat. Rev. Genet. 2012, 13, 271–282. [Google Scholar] [CrossRef] [PubMed]
- Kalla, R.; Ventham, N.T.; Kennedy, N.A.; Quintana, J.F.; Nimmo, E.R.; Buck, A.H.; Satsangi, J. MicroRNAs: New Players in IBD. Gut 2015, 64, 504–513. [Google Scholar] [CrossRef]
- Zealy, R.W.; Wrenn, S.P.; Davila, S.; Min, K.-W.; Yoon, J.-H. MicroRNA-Binding Proteins: Specificity and Function. WIREs RNA 2017, 8, e1414. [Google Scholar] [CrossRef] [PubMed]
- Lewis, B.P.; Shih, I.-H.; Jones-Rhoades, M.W.; Bartel, D.P.; Burge, C.B. Prediction of Mammalian MicroRNA Targets. Cell 2003, 115, 787–798. [Google Scholar] [CrossRef] [PubMed]
- Bartel, D.P. MicroRNA Target Recognition and Regulatory Functions. Cell 2009, 136, 215–233. [Google Scholar] [CrossRef]
- Broughton, J.P.; Lovci, M.T.; Huang, J.L.; Yeo, G.W.; Pasquinelli, A.E. Pairing Beyond the Seed Supports MicroRNA Targeting Specificity. Mol. Cell 2016, 64, 320–333. [Google Scholar] [CrossRef] [PubMed]
- Agarwal, V.; Bell, G.W.; Nam, J.-W.; Bartel, D.P. Predicting Effective MicroRNA Target Sites in Mammalian MRNAs. eLife 2015, 4, e05005. [Google Scholar] [CrossRef]
- Kudla, G.; Granneman, S.; Hahn, D.; Beggs, J.D.; Tollervey, D. Cross-Linking, Ligation, and Sequencing of Hybrids Reveals RNA–RNA Interactions in Yeast. Proc. Natl. Acad. Sci. USA 2011, 108, 10010–10015. [Google Scholar] [CrossRef]
- Helwak, A.; Kudla, G.; Dudnakova, T.; Tollervey, D. Mapping the Human MiRNA Interactome by CLASH Reveals Frequent Noncanonical Binding. Cell 2013, 153, 654–665. [Google Scholar] [CrossRef]
- John, B.; Enright, A.J.; Aravin, A.; Tuschl, T.; Sander, C.; Marks, D.S. Human MicroRNA Targets. PLoS Biol. 2004, 2, e363. [Google Scholar] [CrossRef] [PubMed]
- Enright, A.J.; John, B.; Gaul, U.; Tuschl, T.; Sander, C.; Marks, D.S. MicroRNA Targets in Drosophila. Genome Biol. 2004, 5, R1. [Google Scholar] [CrossRef] [PubMed]
- Kertesz, M.; Iovino, N.; Unnerstall, U.; Gaul, U.; Segal, E. The Role of Site Accessibility in MicroRNA Target Recognition. Nat. Genet. 2007, 39, 1278–1284. [Google Scholar] [CrossRef] [PubMed]
- Baek, D.; Villén, J.; Shin, C.; Camargo, F.D.; Gygi, S.P.; Bartel, D.P. The Impact of MicroRNAs on Protein Output. Nature 2008, 455, 64–71. [Google Scholar] [CrossRef] [PubMed]
- Selbach, M.; Schwanhäusser, B.; Thierfelder, N.; Fang, Z.; Khanin, R.; Rajewsky, N. Widespread Changes in Protein Synthesis Induced by MicroRNAs. Nature 2008, 455, 58–63. [Google Scholar] [CrossRef] [PubMed]
- Alexiou, P.; Maragkakis, M.; Papadopoulos, G.L.; Reczko, M.; Hatzigeorgiou, A.G. Lost in Translation: An Assessment and Perspective for Computational MicroRNA Target Identification. Bioinformatics 2009, 25, 3049–3055. [Google Scholar] [CrossRef] [PubMed]
- Ule, J.; Jensen, K.B.; Ruggiu, M.; Mele, A.; Ule, A.; Darnell, R.B. CLIP Identifies Nova-Regulated RNA Networks in the Brain. Science 2003, 302, 1212–1215. [Google Scholar] [CrossRef]
- Karagkouni, D.; Paraskevopoulou, M.D.; Chatzopoulos, S.; Vlachos, I.S.; Tastsoglou, S.; Kanellos, I.; Papadimitriou, D.; Kavakiotis, I.; Maniou, S.; Skoufos, G.; et al. DIANA-TarBase v8: A Decade-Long Collection of Experimentally Supported MiRNA–Gene Interactions. Nucleic Acids Res. 2018, 46, D239–D245. [Google Scholar] [CrossRef]
- Helwak, A.; Tollervey, D. Mapping the MiRNA Interactome by Cross-Linking Ligation and Sequencing of Hybrids (CLASH). Nat. Protoc. 2014, 9, 711–728. [Google Scholar] [CrossRef]
- Moore, M.J.; Scheel, T.K.H.; Luna, J.M.; Park, C.Y.; Fak, J.J.; Nishiuchi, E.; Rice, C.M.; Darnell, R.B. MiRNA–Target Chimeras Reveal MiRNA 3′-End Pairing as a Major Determinant of Argonaute Target Specificity. Nat. Commun. 2015, 6, 8864. [Google Scholar] [CrossRef]
- Riolo, G.; Cantara, S.; Marzocchi, C.; Ricci, C. MiRNA Targets: From Prediction Tools to Experimental Validation. Methods Protoc. 2020, 4, 1. [Google Scholar] [CrossRef] [PubMed]
- Peterson, S.M.; Thompson, J.A.; Ufkin, M.L.; Sathyanarayana, P.; Liaw, L.; Congdon, C.B. Common Features of MicroRNA Target Prediction Tools. Front. Genet. 2014, 5, 23. [Google Scholar] [CrossRef] [PubMed]
- Ekimler, S.; Sahin, K. Computational Methods for MicroRNA Target Prediction. Genes 2014, 5, 671–683. [Google Scholar] [CrossRef]
- Shaker, F.; Nikravesh, A.; Arezumand, R.; Aghaee-Bakhtiari, S.H. Web-based tools for miRNA studies analysis. Comput. Biol. Med. 2020, 127, 104060. [Google Scholar] [CrossRef]
- Betel, D.; Koppal, A.; Agius, P.; Sander, C.; Leslie, C. Comprehensive modeling of microRNA targets predicts functional non-conserved and non-canonical sites. Genome Biol. 2010, 11, R90. [Google Scholar] [CrossRef] [PubMed]
- Maragkakis, M.; Reczko, M.; Simossis, V.A.; Alexiou, P.; Papadopoulos, G.L.; Dalamagas, T.; Giannopoulos, G.; Goumas, G.; Koukis, E.; Kourtis, K.; et al. DIANA-microT web server: Elucidating microRNA functions through target prediction. Nucleic Acids Res. 2009, 37, W273–W276. [Google Scholar] [CrossRef]
- Reczko, M.; Maragkakis, M.; Alexiou, P.; Grosse, I.; Hatzigeorgiou, A.G. Functional microRNA targets in protein coding sequences. Bioinformatics 2012, 28, 771–776. [Google Scholar] [CrossRef]
- Paraskevopoulou, M.D.; Georgakilas, G.; Kostoulas, N.; Vlachos, I.S.; Vergoulis, T.; Reczko, M.; Filippidis, C.; Dalamagas, T.; Hatzigeorgiou, A.G. DIANA-microT web server v5.0: Service integration into miRNA functional analysis workflows. Nucleic Acids Res. 2013, 41, W169–W173. [Google Scholar] [CrossRef]
- Wang, X.; El Naqa, I.M. Prediction of both conserved and nonconserved microRNA targets in animals. Bioinformatics 2008, 24, 325–332. [Google Scholar] [CrossRef]
- Bandyopadhyay, S.; Mitra, R. TargetMiner: microRNA target prediction with systematic identification of tissue-specific negative examples. Bioinformatics 2009, 25, 2625–2631. [Google Scholar] [CrossRef]
- Liu, H.; Yue, D.; Chen, Y.; Gao, S.-J.; Huang, Y. Improving performance of mammalian microRNA target prediction. BMC Bioinform. 2010, 11, 476. [Google Scholar] [CrossRef] [PubMed]
- Eraslan, G.; Avsec, Ž.; Gagneur, J.; Theis, F.J. Deep Learning: New Computational Modelling Techniques for Genomics. Nat. Rev. Genet. 2019, 20, 389–403. [Google Scholar] [CrossRef] [PubMed]
- LeCun, Y.; Bengio, Y.; Hinton, G. Deep Learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef] [PubMed]
- Min, S.; Lee, B.; Yoon, S. Deep Learning in Bioinformatics. Brief. Bioinform. 2017, 18, 851–869. [Google Scholar] [CrossRef]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar] [CrossRef]
- Travis, A.J.; Moody, J.; Helwak, A.; Tollervey, D.; Kudla, G. Hyb: A Bioinformatics Pipeline for the Analysis of CLASH (Crosslinking, Ligation and Sequencing of Hybrids) Data. Methods 2014, 65, 263–273. [Google Scholar] [CrossRef] [PubMed]
- Manakov, S.A.; Shishkin, A.A.; Yee, B.A.; Shen, K.A.; Cox, D.C.; Park, S.S.; Foster, H.M.; Chapman, K.B.; Yeo, G.W.; Nostrand, E.L.V. Scalable and Deep Profiling of MRNA Targets for Individual MicroRNAs with Chimeric ECLIP. bioRxiv 2022. [Google Scholar] [CrossRef]
- Database Resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2017, 45, D12–D17. [CrossRef]
- Cunningham, F.; Allen, J.E.; Allen, J.; Alvarez-Jarreta, J.; Amode, M.R.; Armean, I.M.; Austine-Orimoloye, O.; Azov, A.G.; Barnes, I.; Bennett, R.; et al. Ensembl 2022. Nucleic Acids Res. 2022, 50, D988–D995. [Google Scholar] [CrossRef]
- Haeussler, M.; Zweig, A.S.; Tyner, C.; Speir, M.L.; Rosenbloom, K.R.; Raney, B.J.; Lee, C.M.; Lee, B.T.; Hinrichs, A.S.; Gonzalez, J.N.; et al. The UCSC Genome Browser Database: 2019 Update. Nucleic Acids Res. 2019, 47, D853–D858. [Google Scholar] [CrossRef]
- Ji, Y.; Zhou, Z.; Liu, H.; Davuluri, R.V. DNABERT: Pre-Trained Bidirectional Encoder Representations from Transformers Model for DNA-Language in Genome. Bioinformatics 2021, 37, 2112–2120. [Google Scholar] [CrossRef]
- Georgakilas, G.K.; Grioni, A.; Liakos, K.G.; Chalupova, E.; Plessas, F.C.; Alexiou, P. Multi-Branch Convolutional Neural Network for Identification of Small Non-Coding RNA Genomic Loci. Sci. Rep. 2020, 10, 9486. [Google Scholar] [CrossRef] [PubMed]
- Guo, H.; Viktor, H.L. Learning from Imbalanced Data Sets with Boosting and Data Generation: The DataBoost-IM Approach. SIGKDD Explor. Newsl. 2004, 6, 30–39. [Google Scholar] [CrossRef]
- Smith, M.R.; Martinez, T.; Giraud-Carrier, C. An Instance Level Analysis of Data Complexity. Mach Learn 2014, 95, 225–256. [Google Scholar] [CrossRef]
- Krüger, J.; Rehmsmeier, M. RNAhybrid: microRNA target prediction easy, fast and flexible. Nucleic Acids Res. 2006, 34, W451–W454. [Google Scholar] [CrossRef]
- Bernhart, S.H.; Tafer, H.; Mückstein, U.; Flamm, C.; Stadler, P.F.; Hofacker, I.L. Partition Function and Base Pairing Probabilities of RNA Heterodimers. Algorithms Mol. Biol. 2006, 1, 3. [Google Scholar] [CrossRef]
- Lorenz, R.; Bernhart, S.H.; Höner zu Siederdissen, C.; Tafer, H.; Flamm, C.; Stadler, P.F.; Hofacker, I.L. ViennaRNA Package 2.0. Algorithms Mol. Biol. 2011, 6, 26. [Google Scholar] [CrossRef]
- Saito, T.; Rehmsmeier, M. The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets. PLoS ONE 2015, 10, e0118432. [Google Scholar] [CrossRef]
- Miranda, K.C.; Huynh, T.; Tay, Y.; Ang, Y.-S.; Tam, W.-L.; Thomson, A.M.; Lim, B.; Rigoutsos, I. A Pattern-Based Method for the Identification of MicroRNA Binding Sites and Their Corresponding Heteroduplexes. Cell 2006, 126, 1203–1217. [Google Scholar] [CrossRef]
AUPRC | Test Set 1:1 | Test Set 1:10 | Test Set 1:100 |
---|---|---|---|
miRBind1 | 0.9495 | 0.7447 | 0.3079 |
miRBind10 | 0.9614 | 0.8092 | 0.4531 |
miRBind20 | 0.9689 | 0.8410 | 0.5372 |
CNN1 | 0.9602 | 0.7862 | 0.4095 |
CNN10 | 0.9634 | 0.7969 | 0.4464 |
CNN20 | 0.9590 | 0.7880 | 0.4365 |
CNN100 | 0.9599 | 0.8005 | 0.4466 |
DNABERT1 | 0.9267 | 0.6300 | 0.1923 |
DNABERT10 | 0.9250 | 0.6440 | 0.2286 |
AUPRC | Test Set 1:1 | Test Set 1:10 | Test Set 1:100 |
---|---|---|---|
miRbind | 0.9689 | 0.8410 | 0.5372 |
CNN | 0.9634 | 0.7969 | 0.4464 |
DNABERT | 0.9267 | 0.6300 | 0.1923 |
RNAhybrid | 0.8439 | 0.4539 | 0.0924 |
Cofold | 0.7784 | 0.2842 | 0.0413 |
RNA22 | 0.6203 | 0.1507 | 0.0265 |
Seed | Sens: 0.1425 Prec: 0.8796 | Sens: 0.1425 Prec: 0.4612 | Sens: 0.1425 Prec: 0.0824 |
AUROC | Test Set 1:1 | Test Set 1:10 | Test Set 1:100 |
---|---|---|---|
miRBind | 0.9643 | 0.9654 | 0.9652 |
CNN | 0.9612 | 0.9626 | 0.9628 |
DNABERT | 0.9293 | 0.9310 | 0.9310 |
RNAhybrid | 0.8351 | 0.8406 | 0.8381 |
Cofold | 0.7839 | 0.7839 | 0.7812 |
RNA22 | 0.5343 | 0.5342 | 0.5375 |
Seed | fpr: 0.0195 tpr: 0.1425 | fpr: 0.0167 tpr: 0.1425 | fpr: 0.0159 tpr: 0.1425 |
AUPRC | Test Set 1:1 | Test Set 1:10 | Test Set 1:100 |
---|---|---|---|
miRbind | 0.8413 | 0.4668 | 0.1545 |
CNN | 0.8223 | 0.4268 | 0.1147 |
DNABERT | 0.6787 | 0.1904 | 0.0238 |
RNAhybrid | 0.7615 | 0.2932 | 0.0469 |
Cofold | 0.6862 | 0.1946 | 0.0246 |
RNA22 | 0.7116 | 0.2628 | 0.0392 |
Seed | Sens: 0.3774 Prec: 0.9278 | Sens: 0.3774 Prec: 0.6020 | Sens: 0.3774 Prec: 0.1586 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Klimentová, E.; Hejret, V.; Krčmář, J.; Grešová, K.; Giassa, I.-C.; Alexiou, P. miRBind: A Deep Learning Method for miRNA Binding Classification. Genes 2022, 13, 2323. https://doi.org/10.3390/genes13122323
Klimentová E, Hejret V, Krčmář J, Grešová K, Giassa I-C, Alexiou P. miRBind: A Deep Learning Method for miRNA Binding Classification. Genes. 2022; 13(12):2323. https://doi.org/10.3390/genes13122323
Chicago/Turabian StyleKlimentová, Eva, Václav Hejret, Ján Krčmář, Katarína Grešová, Ilektra-Chara Giassa, and Panagiotis Alexiou. 2022. "miRBind: A Deep Learning Method for miRNA Binding Classification" Genes 13, no. 12: 2323. https://doi.org/10.3390/genes13122323
APA StyleKlimentová, E., Hejret, V., Krčmář, J., Grešová, K., Giassa, I.-C., & Alexiou, P. (2022). miRBind: A Deep Learning Method for miRNA Binding Classification. Genes, 13(12), 2323. https://doi.org/10.3390/genes13122323