Viruses 2014, 6(6), 2259-2267; doi:10.3390/v6062259
Article

Clustering of Giant Virus-DNA Based on Variations in Local Entropy

1email, 2email and 2,3,* email
Received: 5 February 2014; in revised form: 19 May 2014 / Accepted: 21 May 2014 / Published: 30 May 2014
(This article belongs to the Special Issue Giant Viruses)
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Abstract: We present a method for clustering genomic sequences based on variations in local entropy. We have analyzed the distributions of the block entropies of viruses and plant genomes. A distinct pattern for viruses and plant genomes is observed. These distributions, which describe the local entropic variability of the genomes, are used for clustering the genomes based on the Jensen-Shannon (JS) distances. The analysis of the JS distances between all genomes that infect the chlorella algae shows the host specificity of the viruses. We illustrate the efficacy of this entropy-based clustering technique by the segregation of plant and virus genomes into separate bins.
Keywords: information theory; genomic sequences; evolution; phylogeny; virus
PDF Full-text Download PDF Full-Text [348 KB, uploaded 30 May 2014 14:49 CEST]

Export to BibTeX |
EndNote


MDPI and ACS Style

Bose, R.; Thiel, G.; Hamacher, K. Clustering of Giant Virus-DNA Based on Variations in Local Entropy. Viruses 2014, 6, 2259-2267.

AMA Style

Bose R, Thiel G, Hamacher K. Clustering of Giant Virus-DNA Based on Variations in Local Entropy. Viruses. 2014; 6(6):2259-2267.

Chicago/Turabian Style

Bose, Ranjan; Thiel, Gerhard; Hamacher, Kay. 2014. "Clustering of Giant Virus-DNA Based on Variations in Local Entropy." Viruses 6, no. 6: 2259-2267.


Viruses EISSN 1999-4915 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert