Genes 2012, 3(3), 378-390; doi:10.3390/genes3030378
Article

Clustering Rfam 10.1: Clans, Families, and Classes

1email, 2email, 2email, 3email, 1,* email and 4,5,6,7,8,9email
Received: 5 May 2012; in revised form: 4 June 2012 / Accepted: 15 June 2012 / Published: 5 July 2012
(This article belongs to the Special Issue Feature Paper 2012)
View Full-Text   |   Download PDF [3062 KB, uploaded 5 July 2012]
Abstract: The Rfam database contains information about non-coding RNAs emphasizing their secondary structures and organizing them into families of homologous RNA genes or functional RNA elements. Recently, a higher order organization of Rfam in terms of the so-called clans was proposed along with its “decimal release”. In this proposition, some of the families have been assigned to clans based on experimental and computational data in order to find related families. In the present work we investigate an alternative classification for the RNA families based on tree edit distance. The resulting clustering recovers some of the Rfam clans. The majority of clans, however, are not recovered by the structural clustering. Instead, they get dispersed into larger clusters, which correspond roughly to well-described RNA classes such as snoRNAs, miRNAs, and CRISPRs. In conclusion, a structure-based clustering can contribute to the elucidation of the relationships among the Rfam families beyond the realm of clans and classes.
Keywords: Rfam; non-coding RNA; secondary structure; clans; clusters
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Export to BibTeX |
EndNote


MDPI and ACS Style

Lessa, F.A.; Raiol, T.; Brigido, M.M.; Martins Neto, D.S.B.; Walter, M.E.M.T.; Stadler, P.F. Clustering Rfam 10.1: Clans, Families, and Classes. Genes 2012, 3, 378-390.

AMA Style

Lessa FA, Raiol T, Brigido MM, Martins Neto DSB, Walter MEMT, Stadler PF. Clustering Rfam 10.1: Clans, Families, and Classes. Genes. 2012; 3(3):378-390.

Chicago/Turabian Style

Lessa, Felipe A.; Raiol, Tainá; Brigido, Marcelo M.; Martins Neto, Daniele S. B.; Walter, Maria Emília M. T.; Stadler, Peter F. 2012. "Clustering Rfam 10.1: Clans, Families, and Classes." Genes 3, no. 3: 378-390.


Genes EISSN 2073-4425 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert