Mar. Drugs 2013, 11(3), 830-841; doi:10.3390/md11030830
Published: 12 March 2013
Abstract: Okadaic Acid (OA) constitutes the main active principle in Diarrhetic Shellfish Poisoning (DSP) toxins produced during Harmful Algal Blooms (HABs), representing a serious threat for human consumers of edible shellfish. Furthermore, OA conveys critical deleterious effects for marine organisms due to its genotoxic potential. Many efforts have been dedicated to OA biomonitoring during the last three decades. However, it is only now with the current availability of detailed molecular information on DNA organization and the mechanisms involved in the maintenance of genome integrity, that a new arena starts opening up for the study of OA contamination. In the present work we address the links between OA genotoxicity and chromatin by combining Next Generation Sequencing (NGS) technologies and bioinformatics. To this end, we introduce CHROMEVALOAdb, a public database containing the chromatin-associated transcriptome of the mussel Mytilus galloprovincialis (a sentinel model organism) in response to OA exposure. This resource constitutes a leap forward for the development of chromatin-based biomarkers, paving the road towards the generation of powerful and sensitive tests for the detection and evaluation of the genotoxic effects of OA in coastal areas.
1. Introduction
Massive algal proliferations are among the most important sources of contamination in the sea. These episodes may arise from either natural or anthropogenic causes, leading to large accumulations of algae in the marine environment. Quite often, massive algal proliferations include blooms of toxin-producing organisms known as Harmful Algal Blooms (HABs), which produce high concentrations of potentially harmful biotoxins that accumulate throughout the food chain. Among HAB biotoxins, Diarrhetic Shellfish Poisoning (DSP) toxins are especially predominant across European coasts, causing alterations in the gastrointestinal system of human consumers of contaminated shellfish [2,3]. The main active principle in DSP toxins is Okadaic Acid (OA), which is synthesized by dinoflagellates of the genera Dinophysis and Prorocentrum. OA has genotoxic potential, acting as a tumor promoter and apoptosis inducer able to cause oxidative DNA damage [6,7]. In particular, DNA Double Strand Breaks (DSBs) stand out for their severity among the genotoxic effects exerted by OA and require the prompt activation of repair mechanisms in order to avoid serious damage to the cell [8,9].
During the last 30 years, fisheries and aquaculture-based industries have experienced important economic losses due to the dramatic increase in the diversity of toxic algal species and the toxins they produce, which also constitutes a serious threat for human consumers. Consequently, considerable effort has been devoted to OA biomonitoring in estuarine areas by using sentinel organisms, most notably bivalve molluscs [9,11]. These studies have progressively transitioned from traditional biomonitoring methods (based on physicochemical and physiological parameters) to more sensitive molecular probes [12,13,14,15]. Given the role of chromosomal proteins in the modulation of chromatin structure and DNA metabolism (including DNA repair), the study of chromatin-associated biomarkers constitutes a powerful and sensitive approach for the evaluation of genotoxicity. The usefulness of chromatin-based genotoxicity tests has already been demonstrated in mammals, where histone H2A.X phosphorylation has been used to assess the extent of DNA repair following exposure of cells to DNA-damaging agents [17,18,19]. Yet, this approach is largely unexplored in those organisms where chromatin information is scarce, including bivalve molluscs. Furthermore, the lack of knowledge regarding gene and protein sequences in these organisms constitutes a major barrier for the analysis of high-throughput -omic data, especially as it pertains to data assembly and annotation of highly divergent and/or lineage-specialized genes [20,21,22,23]. Even though the genome sequence of the Pacific oyster Crassostrea gigas was recently published, the amount of information available for marine bivalves remains scarce compared to other model organisms in spite of their environmental value.
In the present work we specifically address the links between OA genotoxicity and potential chromatin-associated biomarkers by combining Next Generation Sequencing (NGS) technologies and bioinformatics. To this end, we introduce CHROMEVALOAdb, a database containing the chromatin-associated transcriptome of the mussel Mytilus galloprovincialis in response to OA exposure. The information provided in this database includes fully traceable raw ESTs assembled into consensus sequences and classified into unigenes linked to Gene Ontology (GO) information (function, process and subcellular compartment) as well as to expression information in response to OA. CHROMEVALOAdb allows for the manual browsing and keyword-based search of chromatin-associated contigs. In addition, the whole OA-specific transcriptome can be accessed by using built-in BLAST and CLUSTAL W tools. Overall, the present work constitutes a leap forward in the study of the genotoxic effect exerted by OA in these organisms, paving the road towards the development of chromatin-based tests for detecting and evaluating the genotoxic effect of OA in the marine environment.
2. Results and Discussion
2.1. Sequencing and Annotation of OA-Specific ESTs in M. galloprovincialis
Mussels (M. galloprovincialis) were sampled in an area of the Galician coast (northwest Spain) subject to a low impact of dinoflagellate blooms. Specimens were experimentally exposed to OA in the laboratory (Figure 1, see Experimental Section) using a set of conditions that were previously proven to cause significant genotoxic damage (200 cells/mL of the OA-producing dinoflagellate Prorocentrum lima, 1 day exposure) [9,26]. The accumulation of OA in digestive gland tissue was subsequently confirmed by HPLC-MS quantification (Table 1).
|Table 1. HPLC-MS quantification of OA in digestive gland tissue.|
|Experimental conditions|OA content (ng/g)|
|Control||Below detection limit (~0)|
Raw normalized libraries constructed from mussel specimens exposed and non-exposed to OA were sequenced using pyrosequencing technology at 40× depth, producing 493,440 and 491,109 raw reads for the control (NORM_MGC) and the OA-exposed (NORM_MGT) libraries, respectively. These data allowed the assembly of 16,395 consensus sequences in the case of the control library and 24,624 consensus sequences from the OA-exposed library, with average length values of 712 and 644 bp, respectively. Approximately 44% of the assembled sequences (17,952) were annotated by using BLAST (blastx) homology searches against non-redundant (nr) protein databases, including 7335 contigs in the control library and 10,617 contigs in the OA-exposed library (38% and 45%, respectively), setting an expectation (e) value of 1 × 10⁻⁶ or better (Table 2).
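As an illustration of the annotation filter described above, the following minimal Python sketch keeps, for each contig, the best blastx hit at or below the stated e-value threshold. It assumes the standard 12-column tab-separated BLAST output (e-value in column 11); the function names and example identifiers are hypothetical and are not part of the original pipeline.

```python
from typing import Dict, Iterable, Tuple

EVALUE_CUTOFF = 1e-6  # threshold stated in the text


def best_hits(rows: Iterable[str],
              cutoff: float = EVALUE_CUTOFF) -> Dict[str, Tuple[str, float]]:
    """Return the lowest e-value hit per query, ignoring hits above the cutoff.

    Assumption: each row follows the standard 12-column BLAST tabular
    layout (query id, subject id, ..., e-value in column 11, bit score).
    """
    best: Dict[str, Tuple[str, float]] = {}
    for row in rows:
        row = row.strip()
        if not row or row.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = row.split("\t")
        query, subject, evalue = fields[0], fields[1], float(fields[10])
        if evalue > cutoff:
            continue  # hit is weaker than the annotation threshold
        if query not in best or evalue < best[query][1]:
            best[query] = (subject, evalue)
    return best


def annotated_fraction(n_annotated: int, n_total: int) -> float:
    """Fraction of assembled sequences that received an annotation."""
    return n_annotated / n_total
```

With the totals reported above, `annotated_fraction(17952, 16395 + 24624)` gives roughly 0.44, in line with the approximately 44% stated in the text.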
|Table 2. Amount of data in each step of the data processing pipeline.|
2.2. Novel Chromatin-Associated Transcripts in CHROMEVALOAdb
Chromatin-associated transcripts were identified from the assembled OA-specific transcriptome from M. galloprovincialis by following two complementary strategies (see Experimental Section for details). On the one hand, a list of keywords identifying chromatin-associated components was used to screen annotated transcripts regarding sequence description and related gene ontology terms (Supplementary Figures S1 and S2). On the other hand, BLAST homology comparisons were performed against specialized chromatin databases. The combination of both strategies resulted in the identification of 14,480 chromatin-associated contigs in control and OA-exposed libraries, among which 1124 were identified as chromatin-associated unigenes (Table 2). The analysis of gene expression profiles (Supplementary Figure S3) allowed us to define groups of statistically significant unigenes upregulated and downregulated in the presence of OA (a total number of 1254), among which 90 were identified as chromatin-associated (Table 2). This information, along with gene ontology and expression profile data, constitutes the core of CHROMEVALOAdb.
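The keyword-based strategy can be sketched as follows. This is a minimal illustration only: the keyword list shown here is hypothetical (the list actually used is given in the supplementary material), and the `Annotation` record and function names are assumptions introduced for the example.

```python
import re
from typing import Iterable, List, NamedTuple, Sequence


class Annotation(NamedTuple):
    contig_id: str
    description: str      # sequence description from the top BLAST hit
    go_terms: List[str]   # names of the GO terms attached to the contig


# Hypothetical keywords, for illustration only.
KEYWORDS = ("histone", "chromatin", "nucleosome")


def screen(annotations: Iterable[Annotation],
           keywords: Sequence[str] = KEYWORDS) -> List[str]:
    """Return ids of contigs whose description or GO terms mention a keyword."""
    pattern = re.compile("|".join(re.escape(k) for k in keywords),
                         re.IGNORECASE)
    hits = []
    for ann in annotations:
        # Search the description and all GO term names as one text.
        text = " ".join([ann.description, *ann.go_terms])
        if pattern.search(text):
            hits.append(ann.contig_id)
    return hits
```

Contigs recovered in this way could then be merged (for instance as a set union) with those recovered by homology searches against the chromatin databases.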
The ontological analysis of the biological processes in which the identified chromatin-associated unigenes could be potentially involved revealed that cellular and metabolic processes are most significantly deregulated in response to OA (Figure 2). Furthermore, a significant deregulation of genes involved in chromatin remodeling (inhibited) and transmembrane transport (overexpressed) was identified through global ontological analyses based on the whole OA-specific transcriptome (Fisher’s exact test approach using topGO R-bioconductor package, Supplementary Figure S4). Even though additional experimental studies will be needed to decipher the functional role of chromatin-associated unigenes in response to OA, these results may be indicative of an activation of protective detoxifying mechanisms in mussels after one day of exposure to OA, once DNA has been repaired.
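For readers unfamiliar with this kind of enrichment analysis, the sketch below shows a plain one-sided Fisher's exact test for a single GO term, computed from the hypergeometric distribution. It is not the topGO implementation (which also offers algorithms that take the GO graph structure into account); the function name and argument names are assumptions made for the example.

```python
from math import comb


def fisher_enrichment_p(k: int, n: int, K: int, N: int) -> float:
    """One-sided Fisher's exact test for over-representation of a GO term.

    N: number of genes in the background set
    K: number of background genes annotated to the term
    n: number of genes in the deregulated set
    k: number of deregulated genes annotated to the term

    Returns P(X >= k) for X following a hypergeometric distribution.
    """
    upper = min(n, K)
    total = comb(N, n)
    # comb(a, b) is 0 when b > a, so impossible tables contribute nothing.
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return tail / total
```

For example, with a background of 10 genes of which 4 carry the term, and 2 annotated genes among 3 deregulated ones, the tail probability is (36 + 4)/120 = 1/3.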
Comparisons between OA-specific EST information from CHROMEVALOAdb and Mytilus EST information from the MytiBase EST knowledge database revealed that approximately 25% of the chromatin-associated sequences contained in CHROMEVALOAdb are redundant with MytiBase sequences. This extends also to the case of the complete OA-specific transcriptome, with 30% of the ESTs being redundant with MytiBase sequences considering no identity cutoff value (manuscript in preparation). In other words, approximately 75% of the ESTs contained in CHROMEVALOAdb constitute previously unknown transcripts in the mussel M. galloprovincialis, establishing a very important contribution not only for the study of OA chromatin-associated biomarkers, but also for the characterization of the mussel genome.
2.3. Availability, Management and Application of Data Stored in CHROMEVALOAdb
Management of data quality constitutes a basic requirement of NGS projects that is often overlooked, resulting in the loss of important information for fine sequence curation and identification of DNA polymorphisms, among other quantitative analyses. The structure of CHROMEVALOAdb strengthens this aspect by providing full access to raw reads used to assemble the consensus sequences annotated in the database. This feature facilitates the alignment of quality-filtered raw sequences, establishing links with specific expression patterns in response to OA. Furthermore, the availability of the full dataset of contigs allows users to retrieve anonymous sequences by using the BLAST tool interface and communicate new chromatin-associated findings through a standardized feedback form, contributing to the curation of the information in CHROMEVALOAdb. Processed data, on the other hand, is also downloadable as flat text files containing information that can be filtered by keywords (Figure 3).
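The traceability between raw reads, consensus sequences and unigenes described above can be pictured with a small relational sketch. The example uses Python's built-in SQLite module as a stand-in for the MySQL back end; all table names, column names and records are hypothetical and do not reproduce the actual CHROMEVALOAdb schema. Only the library names (NORM_MGC, NORM_MGT) are taken from the text.

```python
import sqlite3

# Hypothetical three-level schema: unigene -> contig -> raw read.
SCHEMA = """
CREATE TABLE unigene (
    unigene_id  TEXT PRIMARY KEY,
    description TEXT
);
CREATE TABLE contig (
    contig_id  TEXT PRIMARY KEY,
    unigene_id TEXT REFERENCES unigene(unigene_id)
);
CREATE TABLE raw_read (
    read_id   TEXT PRIMARY KEY,
    contig_id TEXT REFERENCES contig(contig_id),
    library   TEXT CHECK (library IN ('NORM_MGC', 'NORM_MGT'))
);
"""


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory database holding a few made-up records."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.execute("INSERT INTO unigene VALUES ('U1', 'example unigene')")
    conn.executemany("INSERT INTO contig VALUES (?, ?)",
                     [("C1", "U1"), ("C2", "U1")])
    conn.executemany("INSERT INTO raw_read VALUES (?, ?, ?)",
                     [("R1", "C1", "NORM_MGC"),
                      ("R2", "C1", "NORM_MGT"),
                      ("R3", "C2", "NORM_MGT")])
    return conn


def reads_for_unigene(conn: sqlite3.Connection, unigene_id: str) -> list:
    """Trace a unigene back to every raw read assembled into its contigs."""
    rows = conn.execute(
        "SELECT r.read_id FROM raw_read r "
        "JOIN contig c ON c.contig_id = r.contig_id "
        "WHERE c.unigene_id = ? ORDER BY r.read_id",
        (unigene_id,))
    return [row[0] for row in rows]


def read_counts(conn: sqlite3.Connection) -> list:
    """Count reads per unigene and library, as input for expression analysis."""
    return conn.execute(
        "SELECT c.unigene_id, r.library, COUNT(*) FROM raw_read r "
        "JOIN contig c ON c.contig_id = r.contig_id "
        "GROUP BY c.unigene_id, r.library "
        "ORDER BY c.unigene_id, r.library").fetchall()
```

Keeping reads in their own table, rather than storing only consensus sequences, is what makes both the back-tracing query and the per-library counts possible.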
The information contained in CHROMEVALOAdb serves a dual purpose. First, it helps identify previously unknown chromatin-associated transcripts in the mussel M. galloprovincialis, especially histone variants and chromatin remodeling factors (Figure 4A,B). This aim is motivated by the role of chromatin-associated proteins in the maintenance of genome integrity, most notably in the case of DNA DSB repair [20,23]. Within this context, the generation of new molecular data and its organization in CHROMEVALOAdb helps increase the knowledge about mollusc chromatin, setting up a framework for studying its role in DNA repair. The second purpose of CHROMEVALOAdb is to establish cause-effect relationships between OA exposure and specific expression patterns of chromatin-associated factors involved in the maintenance of genome integrity. This approach will help identify potentially sensitive biomarkers of OA genotoxic effect. To this end, CHROMEVALOAdb provides differential expression information for chromatin-associated unigenes, using an intuitive graphical format based on arrows (up-regulated and down-regulated transcripts, Figure 4C). The combination of the newly characterized DNA sequences together with their associated expression information in response to OA paves the road towards the development of chromatin-based tests for detecting and evaluating the genotoxic effect of OA in the marine environment.
3. Experimental Section
3.1. Synthesis of ESTs Libraries and Transcriptome Assembly
Mussel specimens (M. galloprovincialis) were sampled in Valcobo beach, Galicia (northwest coast of Spain, 43°19′02.71″N 8°21′56.35″W) and immediately transported to the laboratory, where they were maintained under controlled light/temperature conditions and fed with a standard mixture of the microalgae Isochrysis galbana and Tetraselmis suecica (Figure 1). Individuals were subsequently divided into a control group and a group exposed to OA that was additionally fed with a culture of the DSP-producing microalga P. lima (200 cells/mL for 24 h). The quantification of OA in digestive gland tissue was performed by using high performance liquid chromatography coupled to mass spectrometry (HPLC-MS). Extraction of mRNA was subsequently performed from pooled digestive gland tissue (hepatopancreas) from five individuals in each group. The choice of this tissue as mRNA source is motivated by its ability to accumulate the biotoxin in large amounts and its detoxifying role in mussel metabolism.
cDNA libraries were synthesized using the SMARTer™ PCR cDNA synthesis kit (Clontech, Mountain View, CA, USA) with an extra purification step using GeneJET™ PCR Purification Kit (Thermo Scientific, Waltham, MA, USA), and normalization was performed following the protocol of the Trimer cDNA Normalization Kit (Evrogen, Moscow, Russia). Libraries were sequenced using Roche-454 FLX+ Titanium pyrosequencing, obtaining both exposed and control datasets. Reads from both libraries were pre-processed (quality filtering and contamination removal) by combining the CD-HIT-454 and the BLAST+ software implemented in the SeqtrimNext pipeline, as well as the Cutadapt v1.0 software. Sequence assembly was carried out using the MIRA v.3.4.0 sequence assembler. The sequences described in this work are available at the Sequence Read Archive (SRA) database under the accession number SRA056210.
3.2. Database Contents, Accessibility and Tool Implementation
The relational structure of CHROMEVALOAdb was developed using MySQL, allowing full traceability of raw ESTs from consensus sequences of individual genes. Contigs are classified into unigenes to eliminate redundancy based on BLAST analysis parameters (same top blastx hit, mean similarity larger than 80% and an e-value below 1 × 10⁻¹⁰). The descriptions of the unigenes are linked to their corresponding contigs and to ontology annotations. All the information stored in CHROMEVALOAdb is freely available for browsing and downloading without login or registration requirements. The information gathered by CHROMEVALOAdb is managed through Perl-written Common Gateway Interfaces (CGIs) that communicate with the Relational Database Management System (RDBMS) MySQL using Perl’s database interface (DBI) module. Server-side tools for sequence alignment, data visualization and result formatting/retrieval are administered by built-in HTML web interfaces. BLAST results are formatted and interactively presented in HTML format including graphics, using Bioperl packages. Multiple sequence alignments are generated using CLUSTAL W and displayed with an embedded applet of the alignment editor Jalview [35,36]. Local data is linked to reference public databases such as NCBI repositories for extended homolog sequence descriptions and AmiGO for gene ontology term definitions.
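The unigene classification rule stated above can be sketched in a few lines of Python. This is an interpretation, not the actual implementation: it assumes that the similarity and e-value refer to each contig's top blastx hit, and that contigs failing the thresholds are kept as singletons. The `TopHit` record and the function name are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, NamedTuple


class TopHit(NamedTuple):
    contig_id: str
    subject_id: str         # accession of the top blastx hit
    mean_similarity: float  # percent similarity to that hit
    evalue: float


def group_unigenes(hits: Iterable[TopHit],
                   min_similarity: float = 80.0,
                   max_evalue: float = 1e-10) -> Dict[str, List[str]]:
    """Group contigs sharing the same top hit into one unigene.

    Assumption: contigs whose top hit does not pass both thresholds are
    not merged with anything and remain as single-contig groups.
    """
    groups: Dict[str, List[str]] = defaultdict(list)
    for hit in hits:
        passes = (hit.mean_similarity > min_similarity
                  and hit.evalue < max_evalue)
        key = hit.subject_id if passes else f"singleton:{hit.contig_id}"
        groups[key].append(hit.contig_id)
    return dict(groups)
```

Grouping on the top hit is a simple way to collapse redundant contigs, at the cost of possibly merging distinct paralogs that share the same best match.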
3.3. Gene Annotation and Expression Analysis
The functional annotation of the consensus read assemblies was carried out using the Blast2GO suite, combining Gene Ontology (GO), InterProScan (IPS) protein domain information and annotation enrichment using ANNEX. Additionally, full-length transcripts were identified using the Full-Lengther tool. Identification of chromatin-associated transcripts was subsequently implemented following two complementary strategies. First, a keyword-based routine was defined to identify chromatin-associated transcripts among sequence descriptions and related ontology terms (Supplementary Materials S1 and S2). Secondly, BLAST (blastn and blastx) homology searches were performed against the Histone Database, as well as against ChromDB and CREMOFAC databases, setting an e-value threshold of 1 × 10⁻¹⁰. Functionally annotated and classified sequences, along with relevant metadata, are organized and stored in CHROMEVALOAdb.
The biological processes in which the identified chromatin-associated unigenes could be potentially involved were studied by performing ontological analyses based on GO terms (Supplementary Figure S3). Expression profiles in response to OA were further studied by comparing control and OA-exposed libraries, using the edgeR package from R-Bioconductor with the False Discovery Rate (FDR) threshold set to 0.1 (Supplementary Figure S4). Read counts for each assembled sequence were obtained using SQL-based queries on the raw data contained in CHROMEVALOAdb. This approach allowed us to define groups of statistically significant unigenes upregulated and downregulated in the presence of OA.
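To make the FDR threshold concrete, the sketch below applies the Benjamini-Hochberg adjustment to a list of p-values and flags those at or below 0.1. It is a stand-alone illustration, not edgeR itself: it assumes that the FDR control was of the Benjamini-Hochberg type (which the text does not state explicitly) and that the p-values come from a separate differential expression test. The function names are hypothetical.

```python
from typing import List, Sequence


def benjamini_hochberg(pvalues: Sequence[float]) -> List[float]:
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])  # ascending p-values
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


def significant(pvalues: Sequence[float], fdr: float = 0.1) -> List[bool]:
    """Flag tests whose adjusted p-value is at or below the FDR threshold."""
    return [q <= fdr for q in benjamini_hochberg(pvalues)]
```

For instance, the p-values 0.01, 0.04, 0.03 and 0.5 yield adjusted values of about 0.04, 0.053, 0.053 and 0.5, so the first three would be retained at a threshold of 0.1.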
4. Conclusions
CHROMEVALOAdb provides a powerful resource to investigate the molecular basis underlying the genotoxic effect of OA in mussels and for understanding the chromatin-associated mechanisms that counteract the harmful effect of this toxin in these organisms (i.e., mechanisms involved in DNA repair). Furthermore, it allows the establishment of cause-effect relationships between OA and the differential expression of chromatin-associated factors involved in DNA DSB repair, helping to identify potential sensitive biomarkers for the development of chromatin-based OA genotoxicity tests. The implementation of these tests in natural populations has critical implications for the evaluation of DNA damage in commercially relevant organisms, the optimization of their harvesting and the elaboration of additional tests designed to evaluate the safety of their consumption and potential implications for consumers’ health. The design of CHROMEVALOAdb sets the basis for the future integration of model-based and semi-automated curation systems. In addition, the characterization of additional transcriptomes (e.g., at different stages of the genotoxic stress and in different tissues), together with data integration and workflow automation for interactome network development, constitute future objectives for the improvement of the database. Altogether, these approaches will help increase the knowledge of the chromatin-associated mechanisms involved in the response to the genotoxic effect of OA, by using Knowledge Discovery in Databases (KDD) techniques.
Acknowledgments
This work was supported by grants awarded to J.M.E.-L by the Spanish Ministry of Economy and Competitivity (CGL2011-24812 & Ramon y Cajal Subprogramme), by the Xunta de Galicia (10-PXIB-103-077-PR) and by a fellowship from Campus do Mar (International Campus of Excellence). Partial support was also obtained from grants by the Spanish Ministry of Economy and Competitivity (AGL2008-05346, AGL2012-30897), by the Natural Sciences and Engineering Research Council of Canada (NSERC-46399-12) and by the Xunta de Galicia (CN2012/127). V.A.-P. was supported by the “Plan I2C” (Xunta de Galicia) and the European Social Fund (ESF). C.R.-C. and R.G.-R. are recipients of FPU and Postdoctoral fellowships from the Spanish Ministry of Education, respectively. We thank the editors and two anonymous reviewers for their constructive comments, which helped us to improve the final version of the manuscript.
- Samples Availability: Available from the authors.
References
- Cardozo, K.H.; Guaratini, T.; Barros, M.P.; Falcao, V.R.; Tonon, A.P.; Lopes, N.P.; Campos, S.; Torres, M.A.; Souza, A.O.; Colepicolo, P.; et al. Metabolites from algae with economical impact. Comp. Biochem. Physiol. C 2007, 146, 60–78.
- Aune, T.; Yndestad, M. Diarrhetic Shellfish Poisoning. In Algal Toxins in Seafood and Drinking Water; Falconer, I.R., Ed.; Academic Press: London, UK, 1993; pp. 87–104.
- James, K.J.; Carey, B.; O’Halloran, J.; van Pelt, F.; Skrabakova, Z. Shellfish toxicity: Human health implications of marine algal toxins. Epidemiol. Infect. 2010, 138, 927–940, doi:10.1017/S0950268810000853.
- Vale, P. Profiles of fatty acids and 7-O-acyl okadaic acid esters in bivalves: Can bacteria be involved in acyl esterification of okadaic acid? Comp. Biochem. Physiol. C 2010, 151, 18–24.
- Yasumoto, T.; Oshima, Y.; Sugawara, W.; Fukuyo, Y.; Oguri, H.; Igarashi, T.; Fujita, N. Identification of Dinophysis fortii as the causative organism of diarrhetic shellfish poisoning. Bull. Jpn. Soc. Sci. Fish. 1980, 46, 1405–1411.
- Leira, F.; Alvarez, C.; Vieites, J.M.; Vieytes, M.R.; Botana, L.M. Study of cytoskeletal changes induced by okadaic acid in BE(2)-M17 cells by means of a quantitative fluorimetric microplate assay. Toxicol. In Vitro 2001, 15, 277–282.
- Suganuma, M.; Fujiki, H.; Suguri, H.; Yoshizawa, S.; Hirota, M.; Nakayasu, M.; Ojika, M.; Wakamatsu, K.; Yamada, K.; Sugimura, T. Okadaic acid: An additional non-phorbol-12-tetradecanoate-13-acetate-type tumor promoter. Proc. Natl. Acad. Sci. USA 1988, 85, 1768–1771.
- Valdiglesias, V.; Laffon, B.; Pasaro, E.; Mendez, J. Evaluation of okadaic acid-induced genotoxicity in human cells using the micronucleus test and gammaH2AX analysis. J. Toxicol. Environ. Health A 2012, 74, 980–992, doi:10.1080/15287394.2011.582026.
- Florez-Barros, F.; Prado-Alvarez, M.; Mendez, J.; Fernandez-Tajes, J. Evaluation of genotoxicity in gills and hemolymph of clam Ruditapes decussatus fed with the toxic dinoflagellate Prorocentrum lima. J. Toxicol. Environ. Health A 2011, 74, 971–979, doi:10.1080/15287394.2011.582025.
- Van Dolah, F.M.; Ramsdell, J.S. Okadaic acid inhibits a protein phosphatase activity involved in formation of the mitotic spindle of GH4 rat pituitary cells. J. Cell. Physiol. 1992, 151, 190–198, doi:10.1002/jcp.1041520124.
- Wells, P.G.; Depledge, M.H.; Butler, J.N.; Manock, J.J.; Knap, A.H. Rapid toxicity assessment and biomonitoring of marine contaminants—Exploiting the potential of rapid biomarker assays and microscale toxicity tests. Mar. Pollut. Bull. 2001, 42, 799–804.
- Sassolas, A.; Hayat, A.; Catanante, G.; Marty, J.-L. Detection of the marine toxin okadaic acid: Assessing seafood safety. Talanta 2012, in press.
- Marcaillou-Le Baut, C.; Amzil, Z.; Vernoux, J.P.; Pouchus, Y.F.; Bohec, M.; Simon, J.F. Studies on the detection of okadaic acid in mussels: Preliminary comparison of bioassays. Nat. Toxins 1994, 2, 312–317, doi:10.1002/nt.2620020510.
- Ledreux, A.; Serandour, A.L.; Morin, B.; Derick, S.; Lanceleur, R.; Hamlaoui, S.; Furger, C.; Bire, R.; Krys, S.; Fessard, V.; et al. Collaborative study for the detection of toxic compounds in shellfish extracts using cell-based assays. Part II: Application to shellfish extracts spiked with lipophilic marine toxins. Anal. Bioanal. Chem. 2012, 403, 1995–2007.
- Manfrin, C.; Dreos, R.; Battistella, S.; Beran, A.; Gerdol, M.; Varotto, L.; Lanfranchi, G.; Venier, P.; Pallavicini, A. Mediterranean mussel gene expression profile induced by okadaic acid exposure. Environ. Sci. Technol. 2010, 44, 8276–8283, doi:10.1021/es102213f.
- Dinant, C.; Houtsmuller, A.B.; Vermeulen, W. Chromatin structure and DNA damage repair. Epigenetics Chromatin 2008, 1, 9, doi:10.1186/1756-8935-1-9.
- Dickey, J.S.; Redon, C.E.; Nakamura, A.J.; Baird, B.J.; Sedelnikova, O.A.; Bonner, W. H2AX: Functional roles and potential applications. Chromosoma 2009, 118, 683–695, doi:10.1007/s00412-009-0234-4.
- Watters, G.P.; Smart, D.J.; Harvey, J.S.; Austin, C.A. H2AX phosphorylation as a genotoxicity endpoint. Mutat. Res. 2009, 679, 50–58, doi:10.1016/j.mrgentox.2009.07.007.
- Albino, A.P.; Jorgensen, E.D.; Rainey, P.; Gillman, G.; Clark, T.J.; Zhao, H.; Traganos, F.; Darzynkiewicz, Z. Gama-H2AX: A potential DNA damage response biomarker for assessing toxicological risk of tobacco products. Mutat. Res. 2009, 678, 43–52, doi:10.1016/j.mrgentox.2009.06.009.
- Gonzalez-Romero, R.; Rivera-Casas, C.; Fernandez-Tajes, J.; Ausio, J.; Méndez, J.; Eirín-López, J.M. Chromatin specialization in bivalve molluscs: A leap forward for the evaluation of okadaic acid genotoxicity in the marine environment. Comp. Biochem. Physiol. C 2012, 155, 175–181.
- Eirín-López, J.M.; Lewis, J.D.; Howe, L.; Ausió, J. Common phylogenetic origin of protamine-like (PL) proteins and histone H1: Evidence from bivalve PL genes. Mol. Biol. Evol. 2006, 23, 1304–1317, doi:10.1093/molbev/msk021.
- Eirín-López, J.M.; Ruiz, M.F.; González-Tizón, A.M.; Martínez, A.; Sánchez, L.; Méndez, J. Molecular evolutionary characterization of the mussel Mytilus histone multigene family: First record of a tandemly repeated unit of five histone genes containing an H1 subtype with “orphon” features. J. Mol. Evol. 2004, 58, 131–144, doi:10.1007/s00239-003-2531-5.
- González-Romero, R.; Rivera-Casas, C.; Frehlick, L.J.; Méndez, J.; Ausió, J.; Eirín-López, J.M. Histone H2A (H2A.X and H2A.Z) variants in molluscs: Molecular characterization and potential implications for chromatin dynamics. PLoS One 2012, 7, e30006.
- Zhang, G.; Fang, X.; Guo, X.; Li, L.; Luo, R.; Xu, F.; Yang, P.; Zhang, L.; Wang, X.; Qi, H.; et al. The oyster genome reveals stress adaptation and complexity of shell formation. Nature 2012, 490, 49–54.
- CHROMEVALOAdb. Available online: http://chromevaloa.udc.es (accessed on 1 September 2012).
- Pinto-Silva, C.R.; Creppy, E.E.; Matias, W.G. Micronucleus test in mussels Perna perna fed with the toxic dinoflagellate Prorocentrum lima. Arch. Toxicol. 2005, 79, 422–426.
- Venier, P.; de Pitta, C.; Bernante, F.; Varotto, L.; de Nardi, B.; Bovo, G.; Roch, P.; Novoa, B.; Figueras, A.; Pallavicini, A.; et al. MytiBase: A knowledgebase of mussel (M. galloprovincialis) transcribed sequences. BMC Genomics 2009, 10, 72.
- Svensson, S. Effects, dynamics and management of okadaic acid in blue mussels, Mytilus edulis. Ph.D Thesis, Göteborg University, Strömstad, Sweden, 2003.
- Niu, B.; Fu, L.; Sun, S.; Li, W. Artificial and natural duplicates in pyrosequencing reads of metagenomic data. BMC Bioinforma. 2010, 11, 187.
- Altschul, S.F.; Gish, W.; Miller, W.; Myers, E.W.; Lipman, D.J. Basic local alignment search tool. J. Mol. Biol. 1990, 215, 403–410.
- SeqtrimNext. Available online: http://www.scbi.uma.es/seqtrimnext (accessed on 5 June 2012).
- Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 2011, 17, 10–12. Available online: http://journal.embnet.org/index.php/embnetjournal/article/view/200 (accessed on 21 May 2012).
- Chevreux, B.; Wetter, T.; Suhai, S. Genome Sequence Assembly Using Trace Signals and Additional Sequence Information. In Proceedings of the German Conference on Bioinformatics (GCB); Computer Science and Biology: Hannover, Germany, 1999; pp. 45–56.
- Thompson, J.D.; Higgins, D.G.; Gibson, T.J. CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignments through sequence weighting, position specific gap penalties and weight matrix choice. Nucl. Acids Res. 1994, 22, 4673–4680, doi:10.1093/nar/22.22.4673.
- Waterhouse, A.M.; Procter, J.B.; Martin, D.M.; Clamp, M.; Barton, G.J. Jalview Version 2—A multiple sequence alignment editor and analysis workbench. Bioinformatics 2009, 25, 1189–1191, doi:10.1093/bioinformatics/btp033.
- Clamp, M.; Cuff, J.; Searle, S.M.; Barton, G.J. The Jalview Java alignment editor. Bioinformatics 2004, 20, 426–427, doi:10.1093/bioinformatics/btg430.
- Carbon, S.; Ireland, A.; Mungall, C.J.; Shu, S.; Marshall, B.; Lewis, S. AmiGO: Online access to ontology and annotation data. Bioinformatics 2009, 25, 288–289.
- Gotz, S.; Garcia-Gomez, J.M.; Terol, J.; Williams, T.D.; Nagaraj, S.H.; Nueda, M.J.; Robles, M.; Talon, M.; Dopazo, J.; Conesa, A. High-throughput functional annotation and data mining with the Blast2GO suite. Nucl. Acids Res. 2008, 36, 3420–3435, doi:10.1093/nar/gkn176.
- Zdobnov, E.M.; Apweiler, R. InterProScan—An integration platform for the signature-recognition methods in InterPro. Bioinformatics 2001, 17, 847–848, doi:10.1093/bioinformatics/17.9.847.
- Myhre, S.; Tveit, H.; Mollestad, T.; Laegreid, A. Additional gene ontology structure for improved biological reasoning. Bioinformatics 2006, 22, 2020–2027.
- Lara, A.J.; Perez-Trabado, G.; Villalobos, D.P.; Diaz-Moreno, S.; Canton, F.R.; Claros, M.G. A Web Tool to Discover Full-Length Sequences: Full-Lengther. In Innovations in Hybrid Intelligent Systems; Corchado, E., Corchado, J.M., Abraham, A., Eds.; Springer: Berlin, Germany, 2007.
- Marino-Ramirez, L.; Levine, K.M.; Morales, M.; Zhang, S.; Moreland, R.T.; Baxevanis, A.D.; Landsman, D. The Histone Database: An integrated resource for histones and histone fold-containing proteins. Database (Oxford) 2011, 2011, bar048.
- Gendler, K.; Paulsen, T.; Napoli, C. ChromDB: The chromatin database. Nucl. Acids Res. 2008, 36, D298–D302, doi:10.1093/nar/gkm768.
- Shipra, A.; Chetan, K.; Rao, M.R. CREMOFAC-A database of chromatin remodeling factors. Bioinformatics 2006, 22, 2940–2944, doi:10.1093/bioinformatics/btl509.
- Robinson, M.D.; McCarthy, D.J.; Smyth, G.K. edgeR: A Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 2010, 26, 139–140.
© 2013 by the authors; licensee MDPI, Basel, Switzerland. This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).