Open Access Article

Compression of Next-Generation Sequencing Data and of DNA Digital Files

Dipartimento di Informatica, Università di Salerno; Via Giovanni Paolo II, 132-84084 Fisciano (SA), Italy
This paper is an extended version of our paper published in the Proceedings of the 5th World Multidisciplinary Earth Sciences Symposium, WMESS 2019 (Prague, Czech Republic, 9–13 September 2019).
Algorithms 2020, 13(6), 151
Received: 9 May 2020 / Revised: 14 June 2020 / Accepted: 17 June 2020 / Published: 24 June 2020
(This article belongs to the Special Issue 2020 Selected Papers from Algorithms Editorial Board Members)
The memory and network traffic consumed by newly sequenced biological data have grown sharply in recent years. Genomic projects such as HapMap and 1000 Genomes have contributed to the rapid growth of databases and network traffic related to genomic data and to the development of new, efficient technologies. Large-scale sequencing of DNA samples has attracted new attention and produced new research, and the scientific community's interest in genomic data has greatly increased. In a very short time, researchers have developed hardware tools, analysis software, algorithms, private databases, and infrastructures to support research in genomics. In this paper, we analyze different approaches for compressing digital files generated by Next-Generation Sequencing tools containing nucleotide sequences, and we discuss and evaluate the compression performance of generic compression algorithms by comparing them with Quip, a system designed by Jones et al. specifically for genomic file compression. Moreover, we present a simple but effective technique for the compression of DNA sequences in which we consider only the relevant DNA data, and we experimentally evaluate its performance.
Keywords: data compression; Next-Generation Sequencing data; DNA; genomes
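
As a rough illustration of the kind of comparison the abstract describes, the following minimal sketch compresses a nucleotide string with three generic compressors from the Python standard library and with a naive 2-bit-per-base packing. It is not the paper's method and does not reproduce Quip: the 2-bit code, the function names `pack_2bit`, `unpack_2bit`, and `compare`, and the synthetic random sequence are all assumptions made for this example, and real sequencing files also carry identifiers, quality scores, and ambiguous bases that this sketch ignores.

```python
import bz2
import lzma
import random
import zlib

# Hypothetical 2-bit code for the four nucleotides (an assumption for
# illustration; not the encoding used in the paper).
CODE = {"A": 0, "C": 1, "G": 2, "T": 3}
BASES = "ACGT"


def pack_2bit(seq: str) -> bytes:
    """Pack a string over A/C/G/T into 2 bits per base (4 bases per byte)."""
    out = bytearray()
    for i in range(0, len(seq), 4):
        chunk = seq[i:i + 4]
        byte = 0
        for base in chunk:
            byte = (byte << 2) | CODE[base]
        # Left-align a short final chunk so unpacking reads from the top bits.
        byte <<= 2 * (4 - len(chunk))
        out.append(byte)
    return bytes(out)


def unpack_2bit(data: bytes, n_bases: int) -> str:
    """Inverse of pack_2bit; n_bases is required because the last byte may be padded."""
    bases = []
    for byte in data:
        for shift in (6, 4, 2, 0):
            bases.append(BASES[(byte >> shift) & 3])
    return "".join(bases[:n_bases])


def compare(seq: str) -> dict:
    """Return the size in bytes of seq under each encoding."""
    raw = seq.encode("ascii")
    return {
        "raw": len(raw),
        "zlib": len(zlib.compress(raw, 9)),
        "bz2": len(bz2.compress(raw, 9)),
        "lzma": len(lzma.compress(raw)),
        "2bit": len(pack_2bit(seq)),
    }


if __name__ == "__main__":
    # Synthetic, uniformly random sequence: a stand-in for real data only.
    rng = random.Random(0)
    seq = "".join(rng.choice(BASES) for _ in range(10_001))
    assert unpack_2bit(pack_2bit(seq), len(seq)) == seq  # round-trip check
    for name, size in compare(seq).items():
        print(f"{name:>5}: {size} bytes")
```

On a uniformly random sequence, the fixed 2-bit packing already reaches the 2 bits per base that the four-letter alphabet allows, so this sketch says nothing about how the compressors behave on real genomes, where repeats and skewed base frequencies can be exploited.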

MDPI and ACS Style

Carpentieri, B. Compression of Next-Generation Sequencing Data and of DNA Digital Files. Algorithms 2020, 13, 151.
