Next Article in Journal
Leveraging 3D Model Systems to Understand Viral Interactions with the Respiratory Mucosa
Next Article in Special Issue
Characterization and Diversity of 243 Complete Human Papillomavirus Genomes in Cervical Swabs Using Next Generation Sequencing
Previous Article in Journal
Novel Broad-Spectrum Antiviral Inhibitors Targeting Host Factors Essential for Replication of Pathogenic RNA Viruses
Previous Article in Special Issue
The International Virus Bioinformatics Meeting 2020
viruses-logo
Article Menu

Article Menu

Article

NCBI’s Virus Discovery Codeathon: Building “FIVE” —The Federated Index of Viral Experiments API Index

1
Laboratory of Clinical and Epidemiological Virology, KU Leuven Department of Microbiology, Immunology and Transplantation, Rega Institute, BE3000 Leuven, Belgium
2
Integrative Molecular and Biomedical Sciences Program, Baylor College of Medicine, Houston, TX 77030, USA
3
Margaret M. and Albert B. Alkek Department of Medicine, Nephrology, Baylor College of Medicine, Houston, TX 77030, USA
4
Department of Genetics, MD Anderson Cancer Center, Houston, TX 77030, USA
5
School of Medicine, Universidad Central del Caribe, Bayamón, PR 00960, USA
6
Genetics & Molecular Biology, Millersville University, 40 Dilworth Rd, Millersville, PA 17551, USA
7
Department of Epidemiology of Microbial Diseases, Yale School of Public Health (YSPH), 60 College Street, New Haven, CT 06510, USA
8
National Center for Biotechnology Information, U.S. National Library of Medicine, National Institutes of Health, 9000 Rockville Pike, Bethesda, MD 20894, USA
9
Computational Bioscience Program, University of Colorado Anschutz, Aurora, CO 80045, USA
10
AAAS Science and Technology Policy Fellow, Office of Data Science Strategy, Division of Program Coordination, Planning, and Strategic Initiatives, Office of the Director, National Institutes of Health, 31 Center Dr., Bethesda, MD 20894, USA
11
MRC-University of Glasgow Centre for Virus Research, Glasgow G61 1QH, UK
12
Biotechnology Graduate Program, University of Maryland Global Campus, 1616 McCormick Drive, Largo, MD 20774, USA
13
Boyce Thompson Institute, Ithaca, NY 14850, USA
14
School of Animal and Comparative Biomedical Sciences, The University of Arizona, Tucson, AZ 85721, USA
15
Laboratory of Cellular Oncology, National Cancer Institute, 37 Convent Dr., Bethesda, MD 20894, USA
16
Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
17
School of Life and Environmental Sciences and School of Medical Sciences, Marie Bashir Institute for Infectious Diseases and Biosecurity, The University of Sydney, Sydney, Australia
18
Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15260, USA
19
College of Science and Engineering, Flinders University, Bedford Park, SA 5042, Australia
20
Department of Computer Science, University of Maryland, College Park, MD 20740, USA
21
Joint Institute for Food Safety and Applied Nutrition, University of Maryland, College Park, MD 20740, USA
22
Novel Microdevices, Nucleic Acids, Baltimore, MD 21202, USA
23
Institute for Advanced Computer Studies, University of Maryland, College Park, MD 20740, USA
24
AgBiome, 104 TW Alexander, Research Triangle, NC 27709, USA
25
Bioscience Division, Bikini Atoll Road, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
26
Center for Dark Energy Biosphere Investigations, University of Southern California, Los Angeles, CA 90089, USA
27
AAAS Science & Technology Policy Fellow, National Institutes of Health, Center for Information Technology, 6555 Rock Spring Drive, Bethesda, MD 20817, USA
28
Department of Marine and Environmental Biology, University of Southern California, Los Angeles, CA 90089, USA
29
DNANexus, 1975 W El Camino Real #204, Mountain View, CA 94040, USA
*
Authors to whom correspondence should be addressed.
Current affiliation: Department of Biological Sciences, Lehigh University, 111 Research Dr., Bethlehem, PA 18015, USA.
Academic Editors: Manja Marz, Bashar Ibrahim, Franziska Hufsky, Ronald Dijkman, Alban Ramette and Jenna Kelly
Viruses 2020, 12(12), 1424; https://doi.org/10.3390/v12121424
Received: 16 October 2020 / Accepted: 2 December 2020 / Published: 10 December 2020
(This article belongs to the Special Issue Virus Bioinformatics 2020)
Viruses represent important test cases for data federation due to their genome size and the rapid increase in sequence data in publicly available databases. However, some consequences of previously decentralized (unfederated) data are lack of consensus or comparisons between feature annotations. Unifying or displaying alternative annotations should be a priority both for communities with robust entry representation and for nascent communities with burgeoning data sources. To this end, during this three-day continuation of the Virus Hunting Toolkit codeathon series (VHT-2), a new integrated and federated viral index was elaborated. This Federated Index of Viral Experiments (FIVE) integrates pre-existing and novel functional and taxonomy annotations and virus–host pairings. Variability in the context of viral genomic diversity is often overlooked in virus databases. As a proof-of-concept, FIVE was the first attempt to include viral genome variation for HIV, the most well-studied human pathogen, through viral genome diversity graphs. As per the publication of this manuscript, FIVE is the first implementation of a virus-specific federated index of such scope. FIVE is coded in BigQuery for optimal access of large quantities of data and is publicly accessible. Many projects of database or index federation fail to provide easier alternatives to access or query information. To this end, a Python API query system was developed to enhance the accessibility of FIVE. View Full-Text
Keywords: data federation; CRISPR; protein domain; metagenomics; virus; genome graphs; HIV-1 data federation; CRISPR; protein domain; metagenomics; virus; genome graphs; HIV-1
Show Figures

Figure 1

MDPI and ACS Style

Martí-Carreras, J.; Gener, A.R.; Miller, S.D.; Brito, A.F.; Camacho, C.E.; Connor, R.; Deboutte, W.; Glickman, C.; Kristensen, D.M.; Meyer, W.K.; Modha, S.; Norris, A.L.; Saha, S.; Belford, A.K.; Biederstedt, E.; Brister, J.R.; Buchmann, J.P.; Cooley, N.P.; Edwards, R.A.; Javkar, K.; Muchow, M.; Muralidharan, H.S.; Pepe-Ranney, C.; Shah, N.; Shakya, M.; Tisza, M.J.; Tully, B.J.; Vanmechelen, B.; Virta, V.C.; Weissman, J.L.; Zalunin, V.; Efremov, A.; Busby, B. NCBI’s Virus Discovery Codeathon: Building “FIVE” —The Federated Index of Viral Experiments API Index. Viruses 2020, 12, 1424. https://doi.org/10.3390/v12121424

AMA Style

Martí-Carreras J, Gener AR, Miller SD, Brito AF, Camacho CE, Connor R, Deboutte W, Glickman C, Kristensen DM, Meyer WK, Modha S, Norris AL, Saha S, Belford AK, Biederstedt E, Brister JR, Buchmann JP, Cooley NP, Edwards RA, Javkar K, Muchow M, Muralidharan HS, Pepe-Ranney C, Shah N, Shakya M, Tisza MJ, Tully BJ, Vanmechelen B, Virta VC, Weissman JL, Zalunin V, Efremov A, Busby B. NCBI’s Virus Discovery Codeathon: Building “FIVE” —The Federated Index of Viral Experiments API Index. Viruses. 2020; 12(12):1424. https://doi.org/10.3390/v12121424

Chicago/Turabian Style

Martí-Carreras, Joan, Alejandro R. Gener, Sierra D. Miller, Anderson F. Brito, Christiam E. Camacho, Ryan Connor, Ward Deboutte, Cody Glickman, David M. Kristensen, Wynn K. Meyer, Sejal Modha, Alexis L. Norris, Surya Saha, Anna K. Belford, Evan Biederstedt, James R. Brister, Jan P. Buchmann, Nicholas P. Cooley, Robert A. Edwards, Kiran Javkar, Michael Muchow, Harihara S. Muralidharan, Charles Pepe-Ranney, Nidhi Shah, Migun Shakya, Michael J. Tisza, Benjamin J. Tully, Bert Vanmechelen, Valerie C. Virta, Jake L. Weissman, Vadim Zalunin, Alexandre Efremov, and Ben Busby. 2020. "NCBI’s Virus Discovery Codeathon: Building “FIVE” —The Federated Index of Viral Experiments API Index" Viruses 12, no. 12: 1424. https://doi.org/10.3390/v12121424

Find Other Styles
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

1
Back to TopTop