Next Article in Journal / Special Issue
ICA-Based Resting-State Networks Obtained on Large Autism fMRI Dataset ABIDE
Previous Article in Journal
Global Database for Naturally Occurring Radionuclides Associated with Offshore Oil and Gas Production
Previous Article in Special Issue
Macao-ebird: A Curated Dataset for Artificial-Intelligence-Powered Bird Surveillance and Conservation in Macao
 
 
data-logo
Article Menu

Article Menu

Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Data Descriptor

A DNA Barcode Dataset for the Aquatic Fauna of the Panama Canal: Novel Resources for Detecting Faunal Change in the Neotropics

by
Kristin Saltonstall
1,*,
Rachel Collin
1,
Celestino Aguilar
2,
Fernando Alda
1,3,
Laura M. Baldrich-Mora
1,
Victor Bravo
1,
María Fernanda Castillo
1,
Sheril Castro
1,4,
Luis F. De León
5,
Edgardo Díaz-Ferguson
6,
Humberto A. Garcés
4,
Eyda Gómez
1,
Rigoberto G. González
1,
Maribel A. González-Torres
4,
Hector M. Guzman
1,
Alexandra Hiller
1,
Roberto Ibáñez
1,
César Jaramillo
1,2,
Klara L. Kaiser
1,7,
Yulang Kam
1,
Mayra Lemus Peralta
1,
Oscar G. Lopez
1,
Maycol E. Madrid C.
1,4,
Matthew J. Miller
8,
Natalia Ossa-Hernandez
1,
Ruth G. Reina
1,†,
D. Ross Robertson
1,
Tania E. Romero-Gonzalez
1,
Milton Sandoval
1,
Oris Sanjur
1,
Carmen Schlöder
1,
Ashley E. Sharpe
1,
Diana Sharpe
9,
Jakob Siepmann
1,7,
David Strasiewsky
1,7,
Mark E. Torchin
1,
Melany Tumbaco
1,
Marta Vargas
1,
Miryam Venegas-Anaya
1,10,
Benjamin C. Victor
11,12 and
Gustavo Castellanos-Galindo
1,7,13
add Show full author list remove Hide full author list
1
Smithsonian Tropical Research Institute, Ancon 0801, Panama
2
Facultad de Medicina, Universidad de Panamá, Panama City 0824, Panama
3
Instituto de Investigación en Recursos Cinegéticos (IREC; CSIC-UCLM-JCCM), 13071 Ciudad Real, Spain
4
School of Biology, FACINET, Universidad de Panamá, Panama City 0824, Panama
5
Department of Biology, University of Massachusetts, Boston, MA 02125, USA
6
Estación Científica Coiba (Coiba AIP), Clayton 0843, Panama
7
Department of Biology, Chemistry, Pharmacy, Freie Universität, 14195 Berlin, Germany
8
OIKOS Genomics, Abu Dhabi P.O. Box 127788, United Arab Emirates
9
Environmental Science and Public Policy, Harvard University, Boston, MA 02138, USA
10
Centro de Investigaciones Hidráulicas e Hidrotécnicas, Universidad Tecnológica de Panamá, Panama City 0819, Panama
11
Ocean Science Foundation, Irvine, CA 92604, USA
12
Guy Harvey Research Institute, Nova Southeastern University, Dania Beach, FL 33004, USA
13
Leibniz Institute of Freshwater Ecology and Inland Fisheries, 12587 Berlin, Germany
*
Author to whom correspondence should be addressed.
Deceased.
Data 2025, 10(7), 108; https://doi.org/10.3390/data10070108
Submission received: 30 April 2025 / Revised: 11 June 2025 / Accepted: 12 June 2025 / Published: 2 July 2025
(This article belongs to the Special Issue Benchmarking Datasets in Bioinformatics, 2nd Edition)

Abstract

DNA metabarcoding is a powerful biodiversity monitoring tool, enabling simultaneous assessments of diverse biological communities. However, its accuracy depends on the reliability of reference databases that assign taxonomic identities to obtained sequences. Here we provide a DNA barcode dataset for aquatic fauna of the Panama Canal, a region that connects the Western Atlantic and Eastern Pacific oceans. This unique setting creates opportunities for trans-oceanic dispersal while acting as a modern physical dispersal barrier for some terrestrial organisms. We sequenced 852 specimens from a diverse array of taxa (e.g., fishes, zooplankton, mollusks, arthropods, reptiles, birds, and mammals) using COI, and in some cases, 12S and 16S barcodes. These data were collected for a variety of studies, many of which have sought to understand recent changes in aquatic communities in the Panama Canal. The DNA barcodes presented here are all from captured specimens, which confirms their presence in Panama and, in many cases, inside the Panama Canal. Both native and introduced taxa are included. This dataset represents a valuable resource for environmental DNA (eDNA) work in the Panama Canal region and across the Neotropics aimed at monitoring ecosystem health, tracking non-native and potentially invasive species, and understanding the ecology and distribution of these freshwater and euryhaline taxa.
Dataset: 10.25573/data.28899749; https://boldsystems.org, BOLD Projects: BSFFA, BSCFA, PCIF, PCPL, PCINV, INVPA, DS-BPUSNM.
Dataset License: CC-BY

1. Summary

Environmental DNA (eDNA) analysis using DNA metabarcoding provides novel opportunities in the biological, ecological, and geological sciences for both monitoring biodiversity and ecosystem health and looking at trophic interactions in terrestrial and aquatic environments [1]. DNA can be extracted from environmental samples without any obvious signs of biological source materials (e.g., soil, sediment, or water) as well as biological samples (e.g., fecal material, animal gut contents, or plant tissues). Entire communities can then be described through metabarcoding, which uses generic primers to amplify DNA for a given group of organisms, and next-generation sequencing techniques, making it possible to do comprehensive biodiversity surveys and monitoring programs for limited effort and costs [1,2,3]. Although there are constraints in the ability of this technique to detect total communities (e.g., primer mismatches or the failure to detect rare species due low DNA concentrations [4,5]), the lower cost of sample collection and the opportunity to sample more broadly have made metabarcoding an increasingly popular tool. Metabarcoding can also reduce work effort and dependency on taxonomic expertise [6], and in some cases also allows better taxonomic resolution, particularly for studies of diet description and food-webs [7].
However, taxonomically complete and reliable sequence reference databases are key to the success of metabarcoding and eDNA studies [1]. This is particularly important in species rich assemblages, such as Neotropical habitats. If the DNA reference database coverage is low or the taxonomic identifications that are available are incorrect, generated sequences get classified as unknown “Operational Taxonomic Units” (OTUs) or are mis-identified, limiting the utility of the technique and the conclusions that can be drawn from metabarcoding studies. Although diversity can still be measured without knowing taxonomy assignments for sequences, accurate identifications are often needed for monitoring biodiversity effectively, making ecological inferences, or creating conservation priorities.
The Panama Canal, opened in 1914, is an artificial waterway, about 70 km in length, that connects the Eastern Pacific and Western Atlantic Oceans (Figure 1) and provides an essential conduit for global maritime trade and transport. It also creates opportunities for trans-oceanic dispersal and a modern physical dispersal barrier between eastern and western Panama for some terrestrial organisms. There are three locks on either side, which operate by gravity flow and raise and lower ships between sea level and 27 m elevation. An expansion of the Panama Canal, completed in 2016, allows larger ships to make the transit and implements water-saving basins at the new locks.
It is well understood that the communities of organisms inhabiting Gatun Lake, a large freshwater lake created when the Canal was flooded in 1913, have undergone several major shifts over time. These shifts reflect the adaptation of previously riverine species to their new lacustrine environment, the intentional introduction of non-native species, and changing management and/or abiotic conditions [8,9,10,11,12,13,14,15,16]. Today, ongoing research has shown that, following the recent expansion of the waterway and the opening of new locks, both physical and biological changes are taking place. Salinities appear to be increasing [17] and long-term data suggest that fish communities are changing rapidly as more predatory marine fishes are found in the Canal [3,18].
One of the most commonly used loci for animal phylogenetic and DNA barcoding analysis is the mitochondrial cytochrome c oxidase I (COI) gene [19,20]. COI has been widely sequenced as it has high interspecific variation, is easy to align, and BOLD (the Barcode of Life Database) now provides an extensive reference database [21]. However, the cryptic morphology of many aquatic organisms makes them difficult to identify, particularly in the Panama Canal which may contain sister species from both oceans [8,11,12,18]. Further, taxonomic revisions, resulting in many species that were previously thought to have broad ranges being split into allopatric species, are now being undertaken, thus making former conspecific specimens from Mexico or Brazil not representative of a species from coastal Panama [22,23]. A recent study using eDNA to identify fishes in the Panama Canal found multiple genera that could not be resolved with COI alone. Seventeen taxa were only identified to genus and several others could not be resolved because congeners on either side of the Canal (e.g., Pacific and Atlantic Oceans), such as species in the genera Anchoa, Mugil, and Micropogonias, had not been previously sequenced [3].
Despite its unique geographical location and a long history of biological studies documenting the biodiversity of Panama, genetic information on many species that inhabit the Canal, or could potentially move into it, is lacking. While the COI locus database is the most complete for this region, it is still incomplete for several vertebrate and the majority of invertebrate taxa. Here we present a dataset to expand DNA barcode coverage for the aquatic fauna of the Panama Canal.

2. Data Description

The dataset consists of 792 COI, 151 12S, and 214 16S mitochondrial DNA barcodes sequenced from 852 aquatic organisms collected in Panama and, in many cases, within the Panama Canal watershed (Figure 1). Of these, 98.6% of vertebrate specimens have been identified to species level, and 68.1% of invertebrates were identified to genus and 23.1% to species, using morphological identifications confirmed by sequence similarity verification using BLAST in the NCBI database. When identification to genus or species level with high confidence was not possible (using both morphology and/or sequence variation), we report higher level taxonomy. Taxonomic coverage includes a broad range of taxa, from fishes, birds, reptiles, and mammals, to benthic and planktonic invertebrates (Table 1). Common fauna that are considered aquatic indicators in the Panama Canal, such as Macrobrachium amazonicum ((Heller, 1862); River prawn), Corbicula fluminea ((O.F. Müller, 1774); freshwater clam), Melanoides tuberculata ((O.F. Müller, 1774); Black snail), Cichla monoculus Agassiz, 1831 (Peacock bass), Megalops atlanticus Valenciennes, 1847 (Tarpon), and Gambusia nicaraguensis Günther, 1866 (Mosquitofish), are included [24]. A metadata spreadsheet with sampling information (geographic location, collector, and collection dates, etc.); GenBank Accession numbers; and BOLD project information is provided in Supplementary Table S1. Additional information regarding identification methodology, photo vouchers, and sequence chromatograms can be found for most specimens in their corresponding BOLD project records. To augment the dataset, we also include barcodes of aquatic birds and reptiles that are found in Canal habitats but were collected elsewhere in Panama. Accession numbers for barcodes of fishes that come from the surrounding oceans and have the physiological capability to enter and survive in the Canal are included in a separate worksheet in Supplementary Table S1. All sequences in fasta format and sample metadata from associated BOLD projects can also be downloaded from Figshare at: https://figshare.com/s/7509415eab90bb93825d.

3. Methods

When possible, we used DNA and/or tissues from voucher specimens preserved in the collections at the Smithsonian Tropical Research Institute (STRI; Fish, Birds, and Invertebrates) and previous projects. We supplemented these collections with novel sampling in the Panama Canal, as needed. Additional fish were collected using gill nets at sampling sites throughout Gatun Lake or rotenone treatment in contained areas, such as lock chambers. Fish species that were not obtained through field sampling were purchased at local fish markets. Fin clips or muscle tissues were sampled from fish and tissues were frozen until further processing. Plankton were collected at depths of 0–3 m at two locations (near Gamboa and at or near Miraflores Lake) with several methods, including manual pumping through a 20 uM mesh or horizontal tows with a 135 uM net. Other invertebrates were collected at Miraflores and Gatun Lock chambers during maintenance events when all water was drained from the lock chambers or opportunistically during field campaigns elsewhere in Gatun Lake and Panama Canal infrastructure. All invertebrate samples were stored in 100% ethanol until DNA was extracted.
DNA was extracted from most tissue samples with the Qiagen Blood and Tissue DNA Extraction kit using the manufacturer’s protocol. Plankton were extracted with a rapid lysis protocol [25]. A fragment of the 5′ end of the COI gene, the approximately 670 bp Folmer region [19], was amplified for all organisms, as well as the mitochondrial 12S and 16S ribosomal RNA loci for some fish and crab species [26,27]. Primers used for each taxonomic group can be found in Table 2. PCR products were cleaned using GELase enzyme (Epicentre Biotechnologies, Madison, WI, USA) or KAPA Pure magnetic beads (Roche, Basel, Switzerland) and sequenced on an ABI 3130XL or 3500XL sequencer. Sequences were aligned and analyzed, and taxonomic identifications verified using the BLAST to NCBI option in Geneious Prime v. 2025.0.3. Lab work and sequencing was carried out in the STRI Naos Molecular Laboratory in Panama. All novel tissues (aside from plankton which were destroyed during the DNA extraction process) and extracted DNA samples have been entered into the STRI Molecular Evolution Cryo-Collection and are publicly available. Voucher specimens for many of the fishes and birds are also available in the STRI collections. Photo vouchers can be found in the various BOLD projects.

4. Scientific Relevance and Considerations

This dataset serves as a key reference that provides expanded taxonomic coverage of DNA barcodes for aquatic organisms that are found in the Panama Canal Watershed. Many of the species that are represented are closely linked in the ever-changing food web of Panama Canal’s Gatun Lake and some, including mosquito fish (Gambusia nicaraguensis), caiman (Caiman crocodilus fuscus (Cope, 1868)), and river prawn (Macrobrachium amazonicum), are considered indicator species of freshwater habitats in the Panama Canal [24]. Several marine fishes, such as jacks (Caranx spp.), snooks (Centropomus spp.), ladyfish (Elops spp.), and needlefish (Strongylura spp.), are now increasingly common in Gatun Lake and are represented by multiple closely related species that originate from both oceans [3,18]. Threatened native taxa, such as the American crocodile (Crocodylus acutus Cuvier, 1807), and multiple introduced freshwater species that are now found throughout Gatun Lake, such as peacock bass (Cichla monoculus), Nile tilapia (Oreochromis niloticus (Linneaus, 1858)), Asian clam (Corbicula fluminea), and trumpet snail (Melanoides tuberculata), are also included. In addition, we report the first observation of Latonopsis australis Sars, 1888 (Cladocera) in Panama. While additional sampling at more sites and across seasons will surely yield additional taxa, particularly for plankton and other invertebrates, the dataset provides a powerful resource for future studies using DNA barcodes for species identification and long-term monitoring from environmental DNA (eDNA) and other types of biological samples, both in Panama and elsewhere in the region. In addition, advances in technology now permit the sequencing of whole mitogenomes at costs that are similar to using Sanger sequencing. We encourage future projects to generate high quality mitogenomes, rather than pursuing single locus sequencing, to provide additional options for multi-locus detections using eDNA.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/data10070108/s1, Table S1: Sample Data.

Author Contributions

Conceptualization, K.S., R.C., L.F.D.L. and G.C.-G.; Data curation, K.S., L.F.D.L., E.D.-F., R.G.G., M.A.G.-T., M.E.M.C. and D.R.R.; Formal analysis, K.S., L.F.D.L., H.A.G., A.H. and M.E.M.C.; Funding acquisition, K.S., R.C., M.J.M., T.E.R.-G. and G.C.-G.; Investigation, R.C., C.A., F.A., L.M.B.-M., V.B., M.F.C., S.C., L.F.D.L., E.D.-F., H.A.G., E.G., M.A.G.-T., H.M.G., A.H., R.I., C.J., K.L.K., Y.K., M.L.P., O.G.L., M.E.M.C., M.J.M., N.O.-H., R.G.R., T.E.R.-G., M.S., O.S., C.S., A.E.S., D.S. (Diana Sharpe), J.S., D.S. (David Strasiewsky), M.E.T., M.T., M.V., M.V.-A. and G.C.-G.; Methodology, K.S., R.C., L.F.D.L., M.E.T. and G.C.-G.; Resources, R.C. and L.F.D.L.; Software, K.S.; Supervision, K.S., R.C., O.S., M.E.T., M.V. and G.C.-G.; Writing—original draft, K.S.; Writing—review & editing, K.S., R.C., F.A., H.A.G., R.G.G., M.A.G.-T., A.H., R.I., M.L.P., M.E.M.C., D.R.R., M.E.T., M.V., B.C.V. and G.C.-G. All authors have read and agreed to the published version of the manuscript.

Funding

We thank the following funders for the support of the various projects that have been included in this dataset: Smithsonian Institution Barcoding Network (plankton, arthropods, fish, birds, and reptiles); Panamanian Secretaría Nacional De Ciencia, Tecnología e Innovación (SENACYT; Grant No. PFID-FID-2021-105; Plankton); Deutsche Forschungsgemeinschaft (DFG, German Research Foundation; CA 2261/3-1, project number 471823073; fish).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data presented in this manuscript are publicly available in BOLD and/or GenBank. GenBank Accession numbers and collection information for all sequences are provided in Supplementary Table S1. Additional information on most samples, including detailed collection information, sequences, chromatogram files, and photos of organisms, can be found in the following projects in BOLD: Fishes, BSFFA, BSCFA, PCIF, PCIFP, Invertebrates, INVPA, PCINV, PCPL. Birds, DS-BUSNM.

Acknowledgments

We thank the Autoridad del Canal de Panamá (ACP) and Ministerio de Ambiente (MiAmbiente) for permission to conduct research in the Panama Canal Watershed. This work would not have been possible without the help of the STRI small boats program. Milton Solano assisted with the preparation of Figure 1. Eldredge Bermingham and the members of his lab were integral to the collection and processing of many freshwater fish and bird samples that are included here. We thank Rafael Aguilar, Allison Burgos, Marianela Camarena, Jacqueline Delgado, and Estefania Espinosa for their help with plankton sample collection; Aydeé Cornejo for help with the identification of freshwater invertebrates; Harilaos Lessios for sharing samples; and the STRI Punta Culebra Nature Center for permission to sample in their exhibit. The frozen tissue collection at STRI is a globally important research resource that has been built up over the past decade by dedicated, visionary researchers. This collection of DNA barcodes adds value to their important contributions to future generations of researchers.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:
STRISmithsonian Tropical Research Institute
ACPAutoridad del Canal de Panamá (Panama Canal Authority)
COICytochrome c oxidase subunit I
DNADeoxyribonucleic acid
PCRPolymerase Chain Reaction

References

  1. Taberlet, P.; Coissac, E.; Pompanon, F.; Brochmann, C.; Willeslev, E. Towards next-generation biodiversity assessment using DNA metabarcoding. Mol. Ecol. 2012, 21, 2045–2050. [Google Scholar] [CrossRef] [PubMed]
  2. Zhang, S.; Zhao, J.; Yao, M. A comprehensive and comparative evaluation of primers for metabarcoding eDNA from fish. Methods Ecol. Evol. 2020, 11, 1609–1625. [Google Scholar] [CrossRef]
  3. Schreiber, L.; Castellanos-Galindo, G.A.; Robertson, D.R.; Torchin, M.; Chavarria, K.; Laakmann, S.; Saltonstall, K. Environmental DNA (eDNA) reveals potential for interoceanic fish invasions across the Panama Canal. Ecol. Evol. 2023, 13, e9675. [Google Scholar] [CrossRef]
  4. Collins, R.A.; Bakker, J.; Wangensteen, O.S.; Soto, A.Z.; Corrigan, L.; Sims, D.W.; Genner, M.J.; Mariani, S. Non-specific amplification compromises environmental DNA metabarcoding with COI. Methods Ecol. Evol. 2019, 10, 1985–2001. [Google Scholar] [CrossRef]
  5. Andres, K.J.; Lodge, D.M.; Sethi, S.A.; Andrés, J. Detecting and analysing intraspecific genetic variation with eDNA: From population genetics to species abundance. Mol. Ecol. 2023, 32, 4118–4132. [Google Scholar] [CrossRef]
  6. Rodgers, T.W.; Xu, C.C.Y.; Giacalone, J.; Kapheim, K.M.; Saltonstall, K.; Vargas, M.; Yu, D.W.; Somervuo, P.; McMillan, W.O.; Jansen, P.A. Carrion fly-derived DNA metabarcoding is an effective tool for mammal surveys: Evidence from a known tropical mammal community. Mol. Ecol. Resour. 2017, 17, e133–e145. [Google Scholar] [CrossRef]
  7. Casey, J.M.; Meyer, C.P.; Morat, F.; Brandl, S.J.; Planes, S.; Parravicini, V. Reconstructing hyperdiverse food webs: Gut content metabarcoding as a tool to disentangle trophic interactions on coral reefs. Methods Ecol. Evol. 2019, 10, 1157–1170. [Google Scholar] [CrossRef]
  8. Hildebrand, S.F. The Panama Canal as a passageway for fishes, with lists and remarks on the fishes and invertebrates observed. Zoologica 1938, 24, 15–45. [Google Scholar] [CrossRef]
  9. Rubinoff, R.W.; Rubinoff, I.R.A. Interoceanic Colonization of a Marine Goby through the Panama Canal. Nature 1968, 217, 476–478. [Google Scholar] [CrossRef]
  10. Zaret, T.M.; Paine, R.T. Species Introduction in a Tropical Lake: A newly introduced piscivore can produce population changes in a wide range of trophic levels. Science 1973, 182, 449–455. [Google Scholar] [CrossRef]
  11. McCosker, J.E.; Dawson, C.E. Biotic Passage through the Panama Canal, with Particular Reference to Fishes. Mar. Biol. 1975, 30, 343–351. [Google Scholar] [CrossRef]
  12. Gunter, G. Marine Fishes of Panama as Related to the Canal. Gulf Res. Rep. 1979, 6, 267–273. [Google Scholar] [CrossRef]
  13. Angehr, G. Rapid Long-distance Colonization of Lake Gatun, Panama, by Snail Kites. Wilson Bull. 1999, 11, 265–268. [Google Scholar]
  14. Smith, S.A.; Bell, G.; Bermingham, E. Cross–Cordillera exchange mediated by the Panama Canal increased the species richness of local freshwater fish assemblages. Proc. R. Soc. London. Ser. B Biol. Sci. 2004, 271, 1889–1896. [Google Scholar] [CrossRef] [PubMed]
  15. Sharpe, D.M.T.; De León, L.F.; González, R.; Torchin, M.E. Tropical fish community does not recover 45 years after predator introduction. Ecology 2017, 98, 412–424. [Google Scholar] [CrossRef]
  16. Guzman, H.M.; Real, C.K. Have Antillean manatees (Trichechus manatus manatus) entered the Eastern Pacific Ocean? Mar. Mammal Sci. 2023, 39, 274–280. [Google Scholar] [CrossRef]
  17. Castrellón, M.G.; Lu, C.; Domínguez, I.; Matos, R.; Anguizola, K.; Popescu, I. Spatiotemporal distribution of salinity in Gatun Lake and the Panama Canal pre- and post-expansion. J. Hydrol. Reg. Stud. 2025, 58, 102199. [Google Scholar] [CrossRef]
  18. Castellanos-Galindo, G.A.; Robertson, D.R.; Sharpe, D.M.T.; Torchin, M.E. A new wave of marine fish invasions through the Panama and Suez canals. Nat. Ecol. Evol. 2020, 4, 1444–1446. [Google Scholar] [CrossRef]
  19. Folmer, O.; Black, M.; Hoeh, W.; Lutz, R.; Vrijenhoek, R. DNA primers for amplification of mitochondrial cytochrome c oxidase subunit I from diverse metazoan invertebrates. Mol. Mar. Biol. Biotechnol. 1994, 3, 294–299. [Google Scholar]
  20. Hebert, P.D.N.; Cywinska, A.; Ball, S.L.; deWaard, J.R. Biological identifications through DNA barcodes. Proc. R. Soc. B Biol. Sci. 2003, 270, 313–321. [Google Scholar] [CrossRef]
  21. Ratnasingham, S.; Hebert, P.D. BOLD: The Barcode of Life Data System (http://www.barcodinglife.org). Mol. Ecol. Notes 2007, 7, 355–364. [Google Scholar] [CrossRef]
  22. Marceniuk, A.P.; Caires, R.A.; Rotundo, M.M.; Cerqueira, N.; Siccha-Ramirez, R.; Wosiacki, W.B.; Oliveira, C. Taxonomic revision of the Menticirrhus americanus (Linnaeus, 1758) and M. littoralis (Holbrook, 1847) (Percomorphacea: Sciaenidae) species complexes from the western Atlantic. Zootaxa 2020, 4822, 301–333. [Google Scholar] [CrossRef] [PubMed]
  23. Torres-Hernández, E.; Betancourt-Resendes, I.; Angulo, A.; Robertson, D.R.; Barraza, E.; Espinoza, E.; Díaz-Jaimes, P.; Domínguez-Domínguez, O. A multi-locus approach to elucidating the evolutionary history of the clingfish Tomicodon petersii (Gobiesocidae) in the Tropical Eastern Pacific. Mol. Phylogenetics Evol. 2022, 166, 107316. [Google Scholar] [CrossRef] [PubMed]
  24. URS Holdings, Inc. Panama Canal Expansion Project—Third Set of Locks: Category III Environmental Impact Study; URS Holdings, Inc.: San Francisco, CA, USA, 2007. [Google Scholar]
  25. Zietara, M.S.; Arndt, A.; Geets, A.; Hellemans, B.; Volckaert, F.A. The nuclear rDNA region of Gyrodactylus arcuatus and G. branchicus (Monogenea: Gyrodactylidae). J. Parasitol. 2000, 86, 1368–1373. [Google Scholar] [CrossRef]
  26. Palumbi, S.; Martin, A.; Romano, S.; McMillan, W.O.; Stice, L.; Grabowski, G. The Simple Fool’s Guide to PCR; University of Hawaii: Honolulu, HI, USA, 2002. [Google Scholar]
  27. Li, C.; Ortí, G. Molecular phylogeny of Clupeiformes (Actinopterygii) inferred from nuclear and mitochondrial DNA sequences. Mol. Phylogenetics Evol. 2007, 44, 386–398. [Google Scholar] [CrossRef]
  28. Baldwin, C.C.; Mounts, J.H.; Smith, D.G.; Weigt, L.A. Genetic identification and color descriptions of early life-history stages of Belizean Phaeoptyx and Astrapogon (Teleostei: Apogonidae) with Comments on identification of adult Phaeoptyx. Zootaxa 2009, 2008, 1–22. [Google Scholar] [CrossRef]
  29. Hebert, P.D.N.; Stoeckle, M.Y.; Zemlak, T.S.; Francis, C.M. Identification of Birds through DNA Barcodes. PLoS Biol. 2004, 2, e312. [Google Scholar] [CrossRef]
  30. Tavares, E.S.; Baker, A.J. Single mitochondrial gene barcodes reliably identify sister-species in diverse clades of birds. BMC Evol. Biol. 2008, 8, 81. [Google Scholar] [CrossRef] [PubMed]
  31. Coeur d’acier, A.; Cruaud, A.; Artige, E.; Genson, G.; Clamens, A.-L.; Pierre, E.; Hudaverdian, S.; Simon, J.-C.; Jousselin, E.; Rasplus, J.-Y. DNA Barcoding and the Associated PhylAphidB@se Website for the Identification of European Aphids (Insecta: Hemiptera: Aphididae). PLoS ONE 2014, 9, e97620. [Google Scholar] [CrossRef]
  32. Meyer, C.P. Molecular systematics of cowries (Gastropoda: Cypraeidae) and diversification patterns in the tropics. Biol. J. Linn. Soc. 2003, 79, 401–459. [Google Scholar] [CrossRef]
  33. Bowles, J.; McManus, D.P. Rapid discrimination of Echinococcus species and strains using a polymerase chain reaction-based RFLP method. Mol. Biochem. Parasitol. 1993, 57, 231–239. [Google Scholar] [CrossRef] [PubMed]
  34. Miura, O.; Kuris, A.M.; Torchin, M.E.; Hechinger, R.F.; Dunham, E.J.; Chiba, S. Molecular-genetic analyses reveal cryptic species of trematodes in the intertidal gastropod, Batillaria cumingi (Crosse). Int. J. Parasitol. 2005, 35, 793–801. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Sampling sites in the Panama Canal Watershed (as indicated with white circles). Names indicate locations where the majority of fish and invertebrate (e.g., plankton, arthropods, crustacea, and mollusks) specimens were collected. Gatun Locks: fishes, invertebrates; Limon: fishes; Escobal: fishes; Cuipo: fishes; Gamboa: fishes, plankton; La Laguna: fishes, invertebrates; Pedro Miguel and Miraflores Locks: fishes, invertebrates. BCI = Barro Colorado Island.
Figure 1. Sampling sites in the Panama Canal Watershed (as indicated with white circles). Names indicate locations where the majority of fish and invertebrate (e.g., plankton, arthropods, crustacea, and mollusks) specimens were collected. Gatun Locks: fishes, invertebrates; Limon: fishes; Escobal: fishes; Cuipo: fishes; Gamboa: fishes, plankton; La Laguna: fishes, invertebrates; Pedro Miguel and Miraflores Locks: fishes, invertebrates. BCI = Barro Colorado Island.
Data 10 00108 g001
Table 1. Taxonomic coverage of aquatic fauna in the Panama Canal Watershed and surrounding oceans for which DNA barcodes have been generated. Numbers in columns 4–6 refer to the number of barcodes available for each taxonomic group. See Supplementary Table S1 for additional taxonomic and collection details of individual samples. Photo vouchers of many specimens can be found in BOLD.
Table 1. Taxonomic coverage of aquatic fauna in the Panama Canal Watershed and surrounding oceans for which DNA barcodes have been generated. Numbers in columns 4–6 refer to the number of barcodes available for each taxonomic group. See Supplementary Table S1 for additional taxonomic and collection details of individual samples. Photo vouchers of many specimens can be found in BOLD.
Locus
PhylumClassOrderCOI12S16S
ArthropodaArachnidaTrombidiformes300
BranchiopodaAnomopoda2300
Ctenopoda400
CopepodaCalanoida200
Cyclopoida2300
InsectaColeoptera800
Diptera2100
Ephemeroptera3800
Hemiptera5400
Lepidoptera100
Megaloptera500
Neuroptera600
Odonata4700
Plecoptera700
Trichoptera1800
Unknown100
MalacostracaAmphipoda100
Decapoda64036
Ostracoda
Unknown
1400
600
ChordataActinopterygiiAcanthuriformes566
Atheriniformes322
Beloniformes644
Blenniiformes100
Carangiformes984763
Characiformes311412
Cichliformes1867
Clupeiformes533
Cyprinodontiformes27442
Elopiformes747
Gobiiformes301113
Gymnotiformes100
Holocentriformes001
Mugiliformes422
Perciformes614239
Siluriformes2843
Spariformes111
Synbranchiformes311
Syngnathiformes201
ChondrichthyesCarcharhiniformes100
AvesAnseriformes300
Charadriiformes3600
Coraciiformes400
Gruiformes200
Pelecaniformes500
MammaliaSirenia200
ReptiliaCrocodilia500
Squamata202
Testudines507
CnidariaHydrozoaAnthoathecata400
MolluscaBivalviaMytilidae100
Unionida200
Venerida200
GastropodaArchitaenioglossa300
Cerithiodea100
Lymnaeoidea100
PlatyhelminthesTrematodaOpisthorchiida300
Plagiorchiida100
Unknown100
RotiferaBdelloideaPhilodinida100
Ploima3600
Table 2. PCR amplification and sequencing primers used for each taxonomic group.
Table 2. PCR amplification and sequencing primers used for each taxonomic group.
LocusTaxonomic
Groups
Forward PrimerReverse PrimerSource
COIFishesFISH-BCL:
TCAACYAATCAYAAAGATATYGGCAC
FISH-BCH:
ACTTCYGGGTGRCCRAARAATCA
[28]
BirdsBirdF1:
TTCTCCAACCACAAAGACATTGGCAC
Ltyr:
TGTAAAAAGGWCTACAGCCTAACGC
BirdR1:
ACGTGGGAGATAATTCCAAATCCTG
COI907aH2:
GTRGCNGAYGTRAARTATGCTCG
[29,30]
Reptiles
Mammals
FISH-BCL:
TCAACYAATCAYAAAGATATYGGCAC
LCO1490:
GGTCAACAAATCATAAAGATATTGG
FISH-BCH:
ACTTCYGGGTGRCCRAARAATCA
HCO2198:
TAAACTTCAGGGTGACAAAAAATCA
[19,28]
Arthropods *
Crustacea
Mollusks
LCO1490:
GGTCAACAAATCATAAAGATATTGG
dgLCO1490:
GGTCAACAAATCATAAAGAYATYGG
HCO2198:
TAAACTTCAGGGTGACAAAAAATCA dgHCO2198:
TAAACTTCAGGGTGACCAAARAAYCA
HCO2198puc: TAAACTTCWGGRTGWCCAAARAATC
[19,31,32]
TrematodesJB3:
TTTTTTGGGCATCCTGAGGTTTAT
COI-Rtrema: CAACAAATCATGATGCAAAAGG[33,34]
12SFishes12S229F:
GYCGGTAAAAYTCGTGCCAG
12S954R:
YCCAAGYGCACCTTCCGGTA
[27]
16SFishes
Crustacea
16Sar:
CGCCTGTTTATCAAAAACAT
16Sbr:
CCGGTCTGAACTCAGATCACGT
[26]
* Multiple primers were mixed together to improve amplification success.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Saltonstall, K.; Collin, R.; Aguilar, C.; Alda, F.; Baldrich-Mora, L.M.; Bravo, V.; Castillo, M.F.; Castro, S.; De León, L.F.; Díaz-Ferguson, E.; et al. A DNA Barcode Dataset for the Aquatic Fauna of the Panama Canal: Novel Resources for Detecting Faunal Change in the Neotropics. Data 2025, 10, 108. https://doi.org/10.3390/data10070108

AMA Style

Saltonstall K, Collin R, Aguilar C, Alda F, Baldrich-Mora LM, Bravo V, Castillo MF, Castro S, De León LF, Díaz-Ferguson E, et al. A DNA Barcode Dataset for the Aquatic Fauna of the Panama Canal: Novel Resources for Detecting Faunal Change in the Neotropics. Data. 2025; 10(7):108. https://doi.org/10.3390/data10070108

Chicago/Turabian Style

Saltonstall, Kristin, Rachel Collin, Celestino Aguilar, Fernando Alda, Laura M. Baldrich-Mora, Victor Bravo, María Fernanda Castillo, Sheril Castro, Luis F. De León, Edgardo Díaz-Ferguson, and et al. 2025. "A DNA Barcode Dataset for the Aquatic Fauna of the Panama Canal: Novel Resources for Detecting Faunal Change in the Neotropics" Data 10, no. 7: 108. https://doi.org/10.3390/data10070108

APA Style

Saltonstall, K., Collin, R., Aguilar, C., Alda, F., Baldrich-Mora, L. M., Bravo, V., Castillo, M. F., Castro, S., De León, L. F., Díaz-Ferguson, E., Garcés, H. A., Gómez, E., González, R. G., González-Torres, M. A., Guzman, H. M., Hiller, A., Ibáñez, R., Jaramillo, C., Kaiser, K. L., ... Castellanos-Galindo, G. (2025). A DNA Barcode Dataset for the Aquatic Fauna of the Panama Canal: Novel Resources for Detecting Faunal Change in the Neotropics. Data, 10(7), 108. https://doi.org/10.3390/data10070108

Article Metrics

Back to TopTop