1. Introduction
The genus Entamoeba comprises several species of parasitic amoebae infecting vertebrates, most of them inhabiting the intestinal tract. Exceptions include a few species described from the oral cavity of various mammals, one species reported from the stomach of fish, and three species described from environmental samples; of these, a facultative free-living or host-associated lifestyle has been documented for one species (E. moshkovskii), whereas no evidence is currently available to assess whether a similar behaviour occurs in the other species. The vast majority of Entamoeba species are considered commensals; however, a small number are pathogenic. Among them, the human species E. histolytica has attracted the greatest research interest.
In the life cycle of intestinal
Entamoeba species, two morphological stages are recognized: the trophozoite and the cyst (in species inhabiting the oral cavity, only trophozoites have been described). Trophozoites lack a defined shape and show few conspicuous internal structures other than the nucleus and some vacuoles; consequently, the range of morphological characters available for taxonomic purposes is essentially restricted to cell size and nuclear features. In cysts, in addition to size range and nuclear morphology, other characters can be considered, such as the presence and morphology of chromatoid bodies (ribosomes arranged in a helical pattern [
1,
2]), the presence of vacuoles, and the number of nuclei in mature cysts. However, chromatoid bodies frequently disappear in mature cysts, and vacuoles occur irregularly even among cysts from the same isolate. Ultimately, the set of morphological characters that can be reliably assessed in cysts is reduced to three: cyst size, the number of nuclei in mature cysts, and nuclear morphology.
Because cysts are the forms most commonly encountered in faecal samples, cyst morphology—particularly nuclear morphology—combined with host species and, in some cases, the geographical origin of the samples, has constituted the basis for the identification of most of the proposed species. Specifically, the main morphological features considered by different authors include nuclear diameter, the arrangement of peripheral chromatin, the size and appearance of the endosome, its position, and the presence and distribution of small chromatin granules and filaments between the endosome and the nuclear membrane [
3]. However, in some instances, the consideration of such a wide array of characters appears to reflect more the need of investigators to provide differential criteria supporting species identifications or the proposal of new taxa than the existence of truly taxonomically informative features. As a result, the validity of some of these criteria has been questioned, both because of the influence of environmental factors and the possible occurrence of artefacts [
4]. An extreme example of the failure to discriminate between artefacts and genuine characters led to the proposal of
Councilmania lafleuri [
5] (see
Section 4).
In addition to morphological traits, commonly used differential criteria have included host species, possible associations with pathological conditions, and the geographical origin of the material. Nevertheless, these criteria have not always been applied consistently. In some cases, the inconsistencies went unchallenged owing to the scientific authority of the authors proposing new taxa, whereas in others (particularly those involving non-pathogenic species from animal hosts) it reflects the limited attention paid to taxa of little or no medical relevance. The outcome has been a somewhat confusing taxonomy, marked by recurrent controversies throughout the 20th century. The incorporation of immunological, physiological, biochemical, and genetic criteria, especially over the past three decades, has helped to resolve some taxonomic problems and has also generated new debates.
The aim of this review is to provide a comprehensive historical analysis of the taxonomy of the genus
Entamoeba, tracing the successive criteria that have been applied to species recognition and delimitation from the late 19th century to the present. By revisiting original descriptions, subsequent revisions, and long-standing controversies, we seek to clarify a number of taxonomic ambiguities that have persisted in the literature, often as a consequence of inconsistent or implicitly applied criteria. In particular, this review highlights how different standards have been used to delimit species depending on host group, biological relevance, or period, and how these practices have shaped the current understanding of diversity within the genus. By compiling and critically reassessing the full spectrum of species names historically proposed for
Entamoeba, including those later synonymized or overlooked, we provide the most complete and contextualized overview of species diversity, to date. We do not propose modifications to the taxonomic status of any species (e.g., validation as a distinct taxon or synonymizing it with other names); rather, we summarize and report the views expressed by previous authors, citing them in each case, and occasionally point out inconsistencies, insufficiently justified proposals, or uneven application of criteria. Only in
Section 5 (family-group name), based on a critical reading of the relevant literature, do we discuss which term should be used in accordance with the rules of nomenclature.
Beyond its historical scope, this work aims to bridge classical taxonomy with modern molecular approaches, placing recent discoveries of cryptic diversity and genetic lineages within their appropriate conceptual and nomenclatural framework. Finally, we discuss the current status of species delimitation in Entamoeba and outline the main challenges that remain, particularly in relation to integrative taxonomy, the interpretation of molecular variation, and the stable application of species names. In doing so, this review seeks to offer a coherent reference framework to support future taxonomic, evolutionary, and epidemiological studies on this genus.
2. Early Controversies: Up to the Proposal of Entamoeba histolytica
Although descriptions of intestinal disorders attributed to amoebae can be recognized in the writings of ancient authors (such as the “fluxus ventris” described by Celsus and Hippocrates) the first description of a parasitic amoeba in humans was that of
Amoeba gingivalis, reported in 1849 by G. Gros [
6] and later renamed
Entamoeba gingivalis by E. Brumpt [
7]. This species inhabits the oral cavity and is generally regarded as a commensal, although it has been associated with gingival disease [
8,
9,
10,
11,
12].
The first description of an intestinal amoeba is usually attributed to W. Lambl in 1860 [
13], who reported amoebae in a child from Prague suffering from diarrhoea who subsequently died of enteritis. However, C. Dobell [
14] argued that the amoebae described by Lambl, measuring 4–6 µm in diameter, were in fact degenerate forms of
Trichomonas hominis (
Pentatrichomonas hominis). Dobell instead credited T. R. Lewis [
15] and D. D. Cunningham [
16] with the discovery of intestinal amoebae, based on observations in Indian patients with diarrhoea, although it is possible that the symptoms were attributable to the cholera epidemic prevailing in India at that time rather than to amoebic infection.
A few years later, in 1875, the Russian physician F. Lösch proposed the name
Amoeba coli for the “amoebae of the colon” observed in a peasant suffering from dysentery who ultimately died of pneumonia [
17]. At autopsy, Lösch found numerous intestinal ulcers containing amoebae, but he did not consider the organisms to be the cause of the intestinal pathology. Instead, he attributed the disease to other aetiologies and regarded the amoebae merely as a mechanical (irritant) factor preventing ulcer healing. According to Massiutin [
18], Lösch reinforced this view after examining additional patients harbouring intestinal amoebae in the absence of any associated pathology.
In retrospect, the first case studied by Lösch most likely corresponded to the species currently known as
E. histolytica, whereas the subsequent cases probably involved
E. dispar or
E. coli. Lösch’s lack of taxonomic precision was later used by O. Casagrandi and P. Barbagallo [
19] to (re)name the human parasitic species as
E. hominis, and by Dobell [
14] to support and justify the validation of the specific epithets introduced by F. Schaudinn in 1903 [
20].
The situation became more complex when B. Grassi [
21] described intestinal amoebae in the faeces of individuals without dysentery, identifying them as
Amoeba coli sensu Lösch and considering them to be non-pathogenic. Grassi also observed cysts, but misinterpreted them as coccidia [
14]. For more than three decades thereafter, published studies failed to clarify whether human intestinal amoebae were genuinely pathogenic. In some investigations of dysenteric cases, amoebae were detected and associated with the patients’ intestinal disease [
22,
23,
24], whereas in other studies amoebae were found in asymptomatic individuals [
19,
25,
26] or were regarded as non-pathogenic [
27].
William T. Councilman and H. A. Lafleur demonstrated that dysentery may have different aetiologies, showing that amoebic dysentery is clinically and aetiologically distinct from other forms of dysentery [
24]. They proposed the name
Amoeba dysenteriae for the organism responsible for intestinal ulcerations, and
Amoeba coli Lösch for the non-pathogenic form. However, their proposal was not supported by morphological evidence and was therefore not accepted.
The studies of H. Quincke and E. Roos [
28], as well as that of Roos [
29], would have been of considerable importance had they received sufficient attention, but they went largely unnoticed. These authors reported morphological differences that allowed the distinction of two different types of amoebae: those found in patients with dysentery exhibited distinctive features (such as differences in cytoplasmic appearance, nuclear morphology, the presence of phagocytosed erythrocytes, and motility) when compared with amoebae observed in non-dysenteric individuals. They also provided a very brief description of cysts, reporting diameters of 10–12 µm for amoebae associated with dysentery and 16–17 µm for those found in asymptomatic patients. However, no information was given on the number of nuclei present in either cyst type.
In essence, two main issues prevented resolution of the debate (in addition to the aforementioned lack of recognition of certain studies, partly attributable to difficulties in the dissemination and accessibility of scientific publications at the time): first, the assumption that all intestinal amoebae, whether associated with disease or not, belonged to a single species; and second, the treatment of dysentery as a disease entity rather than as a syndrome with diverse, and sometimes coexisting, aetiologies within the same patient. From a taxonomic perspective, this meant that the potential taxonomic value of morphological differences observed in trophozoites from different cases was largely disregarded, and that host species (humans) was treated as a strict criterion, under the assumption that all morphologically similar amoebae infecting the same host and occupying the same anatomical niche (the intestine) must belong to a single species.
The situation changed when the respected German zoologist F. Schaudinn published a study proposing the existence of two species of human intestinal amoebae: a commensal species (
E. coli) and a pathogenic species causing tissue lysis, which he named
E. histolytica [
20]. Schaudinn attempted to demonstrate the innocuous or pathogenic nature of these species through self-infection experiments using material obtained from the cases he studied. Three years later, he died as a result of a liver abscess, probably caused by one of these self-infections. Schaudinn identified the octonucleate cysts of
E. coli, but he never recognized cysts of
E. histolytica, which he believed to form spores instead [
20]. Moreover, C. Huber [
30] described trophozoites and tetranucleate cysts in patients with dysentery, but following Schaudinn opinion, they were assigned to a different species,
E. tetragona [
30], a misspelling corrected to
E. tetragena [
31], one of the many junior synonyms later attributed to
E. histolytica.
Although Schaudinn’s proposal shed light albeit with some remaining ambiguities on the debate surrounding human intestinal amoebae, it also introduced a nomenclatural inconsistency. His interpretation, which would have warranted critical scrutiny, was initially accepted largely on the basis of Schaudinn’s considerable academic authority and was subsequently retained in order to avoid confusion, particularly among clinicians. It is likely that Schaudinn was influenced by the work of O. Casagrandi and P. Barbagallo, who had proposed the generic name
Entamoeba for human intestinal amoebae and considered
E. hominis to be the correct name for the organism originally described by Lösch [
19,
26]. Assuming that Lösch’s species was non-pathogenic (supported by Lösch’s own reluctance to associate amoebae with dysentery) and that
E. hominis was therefore a junior synonym of
Amoeba coli, Schaudinn applied correctly the name
E. coli to the non-pathogenic species. However, he did not take into account the name
Amoeba dysenteriae proposed earlier by Councilman and Lafleur for a pathogenic amoeba [
24]. As a consequence, the name
E. histolytica introduced by Schaudinn was in fact a junior synonym of an earlier name. There is strong evidence that the amoeba described by Lösch as
Amoeba coli was pathogenic and associated with dysentery. Therefore, Schaudinn should have applied the name
E. coli to the pathogenic species rather than to the commensal one. Had Schaudinn’s proposal been revised accordingly,
E. coli Lösch, 1875 would have been assigned to the pathogenic species, and the non-pathogenic species would have been renamed
E. hominis Casagrandi and Barbagallo, 1895, the earliest specific name (other than
E. coli) used for amoebae not associated with dysentery. Depending on the interpretation of Lösch’s work (pathogen or not pathogen amoeba), the senior synonym could have been either
A. coli Lösch, 1875, or
A. dysenteriae Councilman and Lafleur, 1891.
Proposals along these lines were indeed made [
32], but they appeared nearly fifteen years after the “birth” of
E. histolytica, at a time when a change in nomenclature would already have caused considerable confusion. To preserve Schaudinn’s proposal in accordance with the International Code of Zoological Nomenclature, Dobell suggested treating
Amoeba coli Lösch not as a valid scientific name but as a descriptive term (“amoebae of the colon”) [
14]. Under this interpretation, the first valid use of the specific epithet
coli would be that of Grassi in 1879 [
21], who applied it to non-pathogenic intestinal amoebae; consequently,
E. hominis would be unavailable as a junior synonym of
E. coli Grassi 1879.
This reasoning created a further difficulty: the earliest valid name for pathogenic amoebae would then have been E. dysenteriae. Dobell resolved this by proposing that this epithet should also be considered unavailable, arguing that Councilman and Lafleur had introduced it as a replacement for Amoeba coli Lösch and that it therefore constituted a junior synonym of E. coli. This interpretation was accepted, despite the conceptual inconsistency it concealed: Councilman and Lafleur had proposed A. dysenteriae specifically to replace Lösch’s name because they regarded it as non-diagnostic, and they clearly referred to a pathogenic amoeba. If A. coli Lösch were not a valid species name, and E. coli Grassi applied to a non-pathogenic species, then E. dysenteriae would in fact be an available name for the pathogenic taxon.
In summary, the correct names for the pathogenic and non-pathogenic species should have been either E. coli Lösch, 1875 and E. hominis Casagrandi and Barbagallo, 1895, respectively, or E. dysenteriae Councilman and Lafleur, 1891 and E. coli Grassi, 1879. Such a revision is unthinkable today, after more than a century of consistent use of the names E. histolytica and E. coli for the pathogenic and commensal human amoebae, respectively. Moreover, in accordance with the International Code of Zoological Nomenclature, prevailing usage has fixed these names, which are therefore valid in any case. One practical consequence of this historical process is that the correct authorship of E. coli should be attributed to Grassi, 1879, rather than to Lösch, 1875, as would otherwise have been expected.
3. The Genus Name: Entamoeba vs. Endamoeba
Coinciding with the period of debate and confusion at the end of the 19th century regarding the existence of one or more species of human intestinal amoebae, a parallel controversy emerged concerning the correct genus name for parasitic amoebae. Initially, parasitic amoebae were included in the genus
Amoeba together with free-living species, until J. Leidy separated the intestinal amoebae of cockroaches (
Amoeba blattae Bütschli, 1878) into a distinct genus, which he named
Endamoeba [
33]. The morphological characteristics of this amoeba differ markedly from those of species currently assigned to
Entamoeba, both in trophozoites and cysts [
34].
Several years after Leidy’s proposal, and apparently without knowledge of his work, Casagrandi and Barbagallo introduced the name
Entamoeba to include the parasitic amoebae they studied in human samples [
26]. The name
Entamoeba was subsequently adopted in later studies, most notably in the influential work of Schaudinn [
20]. In 1908, based on Schaudinn’s erroneous interpretation that
E. histolytica formed spores, M. Lühe proposed that the pathogenic species should be placed in a separate genus,
Poneramoeba [
35].
Leidy’s earlier name remained largely unnoticed until E. Chatton drew attention to the possible synonymy between
Endamoeba and
Entamoeba [
36]. Chatton and P. Lalung-Bonnaire argued that sufficient differences existed between the parasitic amoebae of cockroaches and those infecting humans to justify their placement in separate genera; moreover, considering
Entamoeba to be too similar to
Endamoeba, they proposed the new genus
Loschia (in honour of Lösch) [
37]. One month later, A. Alexeieff proposed the genus
Proctamoeba to include amoebae parasitising vertebrates, including humans [
38].
The growing interest in human intestinal amoebae led to the publication of numerous studies describing new genera and species, many of which were later shown to be based on contamination with free-living amoebae or on mixtures of different organisms [
39]. Authors’ opinions regarding the appropriate genus name were divided, although
Entamoeba was used most frequently (
Table 1). Many of the species described during this period were subsequently regarded as synonyms; nevertheless, particular attention should be paid to the work of S. von Prowazek, who described
Entamoeba butschlii [
40], and to that of C. M. Wenyon and F. W. O’Connor, who described
Entamoeba nana [
41]. Both species were later reassigned to newly established genera,
Iodamoeba (by Dobell in 1919 [
14]) and
Endolimax (by S. L. Brug in 1918 [
42]), respectively.
In 1928, the International Commission on Zoological Nomenclature (ICZN) issued Opinion 99 [
43], stating that the name
Entamoeba should be regarded both as a homonym and a synonym of
Endamoeba. For those authors who considered that the vertebrate species included in
Entamoeba (with the genera
Endolimax and
Iodamoeba excluded from this discussion) should not be placed in the same genus as the invertebrate species, the name of the new genus could not be
Entamoeba but rather
Loschia, in accordance with the principle of priority. This criterion, however, was not widely accepted, and many authors argued that the two names (
Endamoeba and
Entamoeba) should be retained as separate genera. Indeed, the majority of newly described species continued to be assigned to
Entamoeba (
Table 1).
Following the contributions of several authoritative authors, most notably Dobell [
44] and H. Kirby [
45,
46], who supported the retention of
Entamoeba as a valid name, the ICZN issued a new ruling in 1954, replacing the previous one [
47]. In this opinion,
Entamoeba was accepted as a valid genus distinct from
Endamoeba, thereby definitively resolving the issue. Adoption of
Entamoeba was rapid, and from the 1960s onwards virtually all studies employed this name.
Surprisingly, some species originally described from invertebrates (and therefore attributable to
Endamoeba) have been considered in recent studies as belonging to
Entamoeba [
48], and isolates from cockroaches have also been identified as
Entamoeba [
49]. However, no morphological data were provided to allow even a tentative assessment of the correctness of these identifications, and phylogenetically these isolates are placed in a separate sister clade [
49]. This has led some authors to include invertebrates within the host range of
Entamoeba (e.g., refs. [
50,
51]).
Spurious infections in invertebrates with access to faecal material are, however, plausible. For example, H. H. Pai and colleagues [
52] detected cysts of
E. histolytica in the intestine of cockroaches experimentally exposed to positive human faeces for up to four days post-exposure, as well as cysts of
E. histolytica/
E. dispar and
E. coli in cockroaches captured in schools in Taiwan. In addition, unpublished sequences available in GenBank assigned to
E. moshkovskii (a species originally described from wastewater [
53] and known to develop in both poikilothermic and homeothermic vertebrates [
54,
55]) have been obtained from cockroach and beetle samples.
Considering temperature as a limiting factor for trophozoite development, these findings strongly suggest spurious parasitosis, at least for E. histolytica/E. dispar and E. coli. For free-living species such as E. moshkovskii, current data do not allow a definitive conclusion as to whether their presence in invertebrates reflects spurious infection or genuine development in the intestinal tract, similar to that observed in wastewater or environments heavily contaminated with organic matter. Even if it were ultimately demonstrated that some Entamoeba species (e.g., E. moshkovskii) can truly develop in invertebrate hosts, the clear morphological differences between Entamoeba and Endamoeba support their retention as separate genera (as is also the case for Endolimax and Iodamoeba), even if host overlap were to occur in some species.
5. The Family Name: Endamoebidae vs. Entamoebidae
A further issue arising from the debate on the validity of the generic name
Entamoeba concerns the correct family name. According to Article 29 of the International Code of Zoological Nomenclature (the Code) [
69], family-group names must be formed from the name of the type genus, and Article 64 states that any genus considered valid may be designated as the type genus, not necessarily the oldest one.
It is important to note that, as discussed above, the name
Endamoeba itself was never in dispute; the debate focused exclusively on whether
Entamoeba should be regarded as a valid and independent genus. Both before and after the definitive resolution of the generic nomenclature in 1954, the family name Endamoebidae Calkins, 1926 was widely used in classic handbooks to include both
Endamoeba and
Entamoeba [
70,
71,
72,
73,
74,
75], as well as in more recent taxonomic publications and reviews [
48,
76,
77,
78,
79]. Occasionally, some authors have used Endamoebidae for the family while designating
Entamoeba as the type genus [
80,
81], a contradictory practice that does not conform to the provisions of the Code. However, since the adoption of the name Entamoebidae Chatton, 1925 by Thomas Cavalier-Smith in 1991 [
82], its use has increased steadily. This name has been employed in global taxonomic revisions [
83,
84] as well as in numerous focused studies [
85,
86,
87,
88,
89,
90,
91], and it is the family name currently used, at the time of writing, in databases such as GenBank.
During the 19th century, intestinal amoebae were generally included in the family Amoebidae, a classification usually attributed to C. G. Ehrenberg [
92]. The vast majority of studies published in the early 20th century addressing taxonomy, species validity, or the
Entamoeba vs.
Endamoeba debate made no reference to the family level and did not designate a type genus. The family-group name Entamoebidae was introduced by Chatton [
93]. However, when establishing this family, Chatton wrote:
“6° Entamoebidae, dont certaines, comme l’
Entamoeba blattae, ne se distinguent guère des précédentes. Synénergides comme elles, à gamètes non flagellés comme ceux d’
Amoeba flava. Mais l’
Entamoeba blattae n’est certainement pas la forme la plus centrale du groupe. Les Entamibes des Vertébrés, autant qu’on en peut juger par le peu qu’on en connaît, paraissent s’en écarter notablement. Leur cycle est d’ailleurs inconnu. Cependant la schizogonie d’
E. ranarum (Collin, 1913) ou celle d’
Entamoeba coli (Mathis et Mercier, 1917), à défaut de la gamétogenèse, mettent bien en évidence la condition synénergide de ces Amibes.” [
93] (p. 50).
Although apparently derived from the generic name
Entamoeba, Chatton neither designated a type genus nor applied the family name in a manner consistent with the nomenclatural rules in force at the time. The validity of
Endamoeba itself was never under discussion; Chatton’s reference to
Entamoeba blattae is a misidentification of
Endamoeba blattae, as no justification for this reassignment is provided in the original work, and the same confusion is already evident in one of his earlier publications [
36] (p. 282):
“Ce sont d’une part les Amibes parasites que l’on a réunies, provisoirement sans doute, mais bien artificiellement dans le genre Entamoeba Leidy (1879).”
Under the Règles internationales de la Nomenclature zoologique (the Règles) [
94], which governed zoological nomenclature at the time, Chatton’s family-group name Entamoebidae was not validly established, as he neither designated a type genus nor applied the name in accordance with the Règles. According to Article 31 of the Règles, a name based on an erroneous identification cannot be retained for the misidentified species and therefore cannot serve as a valid foundation for a family-group name. Furthermore, Articles 4 and 29–30 required every family name to be explicitly linked to an existing and clearly defined genus name, a condition that is not met in Chatton’s text.
If Chatton’s use of Entamoeba blattae were interpreted merely as an orthographic error (that is, an inadvertent misspelling of Endamoeba) then, under Article 25 of the Règles, the corresponding family name could be corrected to Endamoebidae. The current Code similarly allows the correction of “incorrect original spellings” when these result from evident lapsus or typographical errors (Arts. 32.4–32.5). However, if Chatton’s error is interpreted not as orthographic but as conceptual (affecting the taxonomic application of the genus name rather than its spelling) then it falls under the provisions of Article 31 of the Règles and Articles 67.13–67.14 of the Code, which state that names based on misidentified taxa cannot be retained or corrected by emendation. In this latter case, Entamoebidae cannot be regarded as a correctable spelling of Endamoebidae, but must instead be considered an invalid name established on an erroneous taxonomic concept. Under both the Règles and the modern Code, a misidentified species cannot validly fix a type genus (Arts. 61.1.3, 63.2), and a family-group name lacking a properly fixed type is therefore of uncertain application.
In 1926, G. N. Calkins introduced the family-group name Endamoebidae, stating:
“Family 3. Endamoebidae: these are parasitic amoebae … The genus generally recognized is represented by a vast number of species with ill-defined diagnostic characters (genus:
Endamoeba), while other genera (e.g.,
Endolimax?
Councilmania, etc.) are forms about which the taxonomic position is still in dispute” [
95] (p. 338).
In this case, Endamoeba is explicitly used as the type genus, thereby fulfilling the requirements of both the Règles (Art. 4) and the modern Code (Arts. 13.1, 63.2). Calkins’ action therefore provided the first validly typified family-group name for these amoebae, independently of the temporary confusion surrounding the relationship between Endamoeba and Entamoeba. Although Endamoebidae was formally published one year after Entamoebidae, it entered prevailing usage shortly after its introduction and well before 1961. In accordance with Article 40.2 of the Code, even if Entamoebidae were considered acceptable, a family-group name replaced before 1961 that has achieved prevailing usage is to be maintained.
Therefore, given the invalidity of Entamoebidae under both the historical Règles and the modern Code, and the consistent and widespread use of Endamoebidae throughout the 20th century, the reinstatement of Entamoebidae Chatton, 1925 by Cavalier-Smith [
82] cannot be supported. Depending on how Chatton’s name is interpreted (either as an incorrect spelling or as a taxonomically inconsistent usage) the correct and valid family-group name for amoebae including
Endamoeba and
Entamoeba would therefore be either Endamoebidae Chatton, 1925 emend. Calkins, 1926, or Endamoebidae Calkins, 1926 (=1925 by priority).
In our view, Chatton’s usage can be interpreted as reflecting a taxonomically inconsistent concept (treating Entamoeba as the sole genus encompassing intestinal amoebae of both vertebrates and invertebrates) rather than a simple orthographic lapse, which might reasonably occur once but be unlikely to be repeated across different publications. Accordingly, the second option, Endamoebidae Calkins, 1926, may be considered a plausible choice for the correct and valid family-group name.
8. Interpreting Species Delimitation in Entamoeba: Lessons from the Past and Consequences for Current Taxonomy and Diversity
Describing, delimiting, and naming species constitute what is traditionally referred to as alpha taxonomy. These processes are conceptually distinct and, as shown in previous sections, have not always been clearly separated in the taxonomic history of
Entamoeba. Specimen identification and species discovery are conceptually different operations, as are species description and species delimitation [
312]. Specimen identification refers to the assignment of an individual organism, or an isolate in the case of protists, to a previously described and named species based on existing diagnostic criteria and does not in itself test species boundaries but rather applies them [
313]. Species discovery, by contrast, involves the recognition that a set of organisms cannot be satisfactorily accommodated within any known species and therefore represents an unrecognized taxon; in asexual organisms such as
Entamoeba, this step is necessarily decoupled from reproductive criteria and instead relies on the detection of coherent, non-random biological variation consistent with independent evolutionary trajectories [
314]. (It is generally accepted that species of
Entamoeba reproduce exclusively asexually; although the possibility of sexual reproduction has been suggested [
315,
316], this hypothesis remains unconfirmed). Species delimitation entails the explicit formulation and evaluation of boundaries between newly identified lineages and established species, with the aim of determining whether the observed diversity reflects intraspecific variation or the existence of independently evolving entities. Under a lineage-based view of species, which is especially applicable to asexual organisms, delimitation is best understood as a hypothesis-testing process rather than the application of a single defining property [
317]. Finally, species description constitutes the formal taxonomic act that follows delimitation, whereby a species hypothesis is made explicit through a diagnostic description, the designation of type material or reference entities, and the assignment of a valid scientific name in accordance with the relevant nomenclatural code. This step enables effective communication, comparability, and the accumulation of knowledge across studies [
318,
319].
The key issues are the species concept adopted for delimitation and the criteria used both to differentiate species and to identify specimens. During the 19th century and the first half of the 20th century, the taxonomy of Entamoeba largely followed the classical framework applied by taxonomists to most organismal groups. In practice, the decision as to whether two populations belonged to the same or to different species relied overwhelmingly on a morphological–typological species concept. Species were regarded as fixed, or at least relatively stable, “types” defined, in Entamoeba, by a few diagnostic characters, and taxonomic judgments were based on the recognition of apparent morphological or biological differences among forms.
This typological approach dominated descriptive taxonomy in general, and particularly protist taxonomy, where modes of reproduction were poorly understood or inaccessible to direct observation. As a consequence, species boundaries were inferred almost exclusively from observable traits and overall similarity, rather than from population-level processes or evolutionary relationships. In Entamoeba, the scarcity of reliable and consistent morphological characters led taxonomists to complement morphology with additional criteria, including physiological traits (host species, ability to grow in vitro, results of experimental cross-infections) and biogeographical information. These supplementary criteria, however, were not applied consistently and were often the subject of debate. A striking asymmetry emerges when comparing species associated with humans and NHPs with those infecting other vertebrates. In the former case, the absence of clear morphological differences promoted a conservative taxonomic approach, whereby multiple biologically distinct entities were subsumed under a single species name. In contrast, for species infecting ungulates, reptiles, fish and other hosts, minor morphological variation, host specificity, or geographic origin alone often sufficed to justify the description of new species, frequently in the absence of comparative analyses or information on the life cycle.
Historically, three main approaches can be identified for delimiting and accepting or rejecting species within this genus:
Use of a “complete set” of traits. Under this approach, isolates were considered to represent a distinct species if they differed from related taxa in one or more of the following characters: trophozoite or cyst morphology, host species, biological traits (pathogenicity, in vitro cultivation, cross-infection experiments), biochemical or genetic markers, or geographic origin (see
Section 2,
Section 6 and
Section 7). Host species, sometimes combined with geographic origin, was frequently used as a decisive criterion. Notably, even the same authors applied this criterion inconsistently (e.g., G. A. Noble synonymized
E. debliecki from Western goats while almost simultaneously erecting
E. dilimani in Eastern goats,
E. antilocapra in antelope, and
E. bubalus in water buffalo; see
Section 7.3).
Use of a “partial set” of traits (option 1). In this framework, morphological, biological and biochemical differences were considered valid for species delimitation, whereas host species and geographic origin were regarded as insufficient on their own. These latter criteria could support a species description but were not accepted as primary delimiters.
Use of a “partial set” of traits (option 2). This approach is similar to the previous one but applies a stricter threshold: only clear and substantial biological or biochemical differences are considered taxonomically meaningful, whereas minor differences are dismissed. This option necessarily introduces a degree of subjectivity in defining what constitutes a “small” or “insufficient” difference. It was applied, for example, to propose
E. chattoni and
E. struthionis as synonyms of
E. polecki (see
Section 7.3).
A recent proposal has argued that lineage-based species concepts, such as the general or unified species concept [
317] and the pragmatic species concept [
320], are particularly appropriate for
Entamoeba [
283]. Under this framework, a species of
Entamoeba is defined as an independently evolving metapopulation lineage, delimited through the integration of morphological, ecological, genetic, and host-related evidence, in accordance with the principles of integrative taxonomy [
321]. In practice, the criteria currently applied in
Entamoeba taxonomy [
173] are largely compatible with this conceptual framework, although some modifications and refinements are required (see below).
Morphological criteria have never been fundamentally questioned and remain central to species identification. However, within each of the traditional morphological groups (bovis, histolytica, coli, gingivalis), interspecific morphological differences are often minimal or absent, making cryptic diversity a recurrent issue. By contrast, the remaining criteria historically used for species delimitation have been controversial to varying degrees.
The geographic origin of an isolate was occasionally invoked by some authors as a differentiating character (for example, for some species of the bovis group, see
Section 7.3), but it has ultimately not been accepted as a valid criterion for species delimitation, at least in humans and domestic or human-associated animals. Human migration, animal trade, and the widespread translocation of livestock and companion animals have likely facilitated the global dissemination of their associated
Entamoeba species, thereby eroding any meaningful biogeographic signal at the species level.
Most controversial is the use of the host criterion. As noted in the previous sections, some authors have considered the host–parasite relationship as rigid while others considered morphologically similar parasites in different host species as belonging to a single species. The best option is that pointed out by J. F. Kessel [
322] that each host–parasite species relationship should be considered individually. To avoid an uncontrolled proliferation of species based solely on host association, authoritative taxonomic revisions by leading specialists adopted a restricted use of host species as a differential criterion [
3,
14,
172,
270]. Host identity was considered taxonomically informative when the hosts belonged to clearly distinct zoological groups (e.g.,
E. gallinarum in birds,
E. coli in humans, and
E. muris in rodents); in contrast, when hosts were closely related or belonged to the same broad zoological group, host species alone was not regarded as sufficient for species delimitation and had to be supported by consistent morphological differences, or the nominal species were treated as synonyms (e.g.,
E. coli =
E. pitheci;
E. polecki =
E. debliecki;
E. histolytica =
E. dispar).
The inconsistent application of this criterion by different authors (sometimes even within the work of a single author) has contributed substantially to recurring taxonomic problems in the genus. An additional complication is that several
Entamoeba species are capable of infecting a broad range of hosts belonging to clearly divergent zoological groups (e.g.,
E. moshkovskii in both poikilothermic and homeothermic animals; see
Section 6). Conversely, a single host species may harbour multiple
Entamoeba species simultaneously; humans, for instance, can be infected by up to ten different species. As a consequence, species erected primarily on the basis of host specificity have frequently been questioned or rejected in later taxonomic revisions (see, for example, the discussion of rodent-associated species in
Section 7.2). Conversely, the opposite practice (synonymizing taxa solely on the basis of a shared host range) has also generated controversy and disagreement [
199].
Biochemical and genetic data have often been regarded as the solution to the problem of cryptic species. Indeed, the application of these approaches made it possible to separate and formally recognize as distinct species
E. dispar,
E. bangladeshi and
E. nuttalli from
E. histolytica (see
Section 6),
E. ecuadoriensis from
E. moshkovskii (
Section 7.1), and
E. suis from
E. polecki (
Section 7.3). However, the widespread use of genetic analyses (primarily SSU rRNA gene sequences) in epidemiological and ecological studies has revealed an unexpectedly high level of genetic diversity within
Entamoeba.
In response to this situation, recent authors have deliberately avoided proposing new species based solely on genetic divergence [
133], particularly when the detected lineages share the same host range [
196]. Instead, a non-standard subspecific nomenclature has been adopted to accommodate
Entamoeba diversity without formal taxonomic inflation [
173,
202,
210]. Under this framework, several categories have been introduced: subtypes (ST), defined as phylogenetic clusters within a formally recognized species [
173]; ribosomal lineages (RL), corresponding to isolates forming distinct phylogenetic branches [
173]; conditional lineages (CL), conceptually equivalent to RLs but based on partial SSU rRNA gene sequences obtained by Sanger sequencing [
202]; and sequence types (SQT), analogous to CLs but inferred from next-generation sequencing data [
210]. To date, eleven ribosomal lineages have been identified [
202].
This system has proven useful as an operational tool to manage the rapidly growing volume of sequence data. Nevertheless, because RLs and CLs may ultimately correspond to biological species, the criteria used for their definition warrant re-evaluation. As originally conceived, these categories are defined primarily by their position as branches in phylogenetic trees, which in turn are determined by sequence similarity, largely independent of the specific tree reconstruction method employed [
323]. Many taxonomists have criticized approaches such as DNA barcoding or algorithmic species-partitioning methods precisely because they rely on similarity thresholds rather than on diagnostic characters, whether qualitative or quantitative [
312,
321,
324,
325]. This same conceptual limitation applies to the current definitions of RLs and CLs.
To address this issue, a modification has recently been proposed [
283] based on a qualitative diagnostic criterion: the presence of compensatory base changes (CBCs; both bases of a paired site in the SSU rRNA molecule mutate while the pairing itself is maintained [
326]). This approach avoids reliance on arbitrary similarity thresholds and provides explicit diagnostic characters suitable for species delimitation. Its main limitations are that sequencing of the complete SSU rRNA gene may be required (the expansion region ES10 has been suggested [
327], but no universally applicable barcode region can be defined in this gene in
Entamoeba [
283]), and that it may fail to discriminate very recently diverged lineages (e.g.,
E. histolytica–
E. nuttalli), for which alternative loci may be more informative. Nevertheless, this framework eliminates subjectivity and meets the methodological requirements traditionally demanded by taxonomists.
This nonstandard nomenclature based on RLs and CLs should not be the definitive taxonomical option but a temporal solution while the data needed for species naming is obtained. The Code [
69] requires a description of the species using characters stated in words (Art. 13.1.1) but nothing is said about its nature: it can be a morphological description (this is considered necessary by Stensvold and colleagues [
173,
202]) but also a gene description could be valid instead. In any case, it is mandatory the species name should be anchored to a name-bearing type (Arts. 72 and 73), which can be hapantotypes but also drawings, illustrations or micrographs which have been widely used and accepted in
Entamoeba taxonomy.
There is an ongoing debate regarding the use of total DNA extracts as name-bearing types and the possibility of including them within the provisions of the different International Codes of Nomenclature [
328]. In Zoology and Protistology, Art. 72.5.1 of the Code defines eligible name-bearing types broadly as “an animal [including microeukaryotes], or any part of an animal”, which could, in principle, allow a physically deposited total DNA extract (traceable to the sampled individual and curated in a recognized repository) to be interpreted as “part of an animal”. By contrast, DNA sequences themselves, which are obtained through copying processes such as PCR prior to sequencing, cannot serve as types but are valid as a description of the organism’s DNA [
329]. However, the applicability of this provision to isolated DNA extracts remains debated, as this interpretation is not explicitly established under the Code and does not represent current standard practice. At present, modifications of the Code to allow DNA sequences to be used as name-bearing types are not under consideration [
330].
Even if the Code were interpreted to allow a total DNA extract traceable to a sampled individual to serve as type material, this requirement is difficult to meet in
Entamoeba, and the designation of new
Entamoeba species based on DNA extracts as types could generate future taxonomic controversy. Many
Entamoeba species cannot be cultivated in vitro, and DNA is typically obtained from faecal samples, which can be regarded, for the purposes of this discussion, as environmental samples. DNA extracted from such material (eDNA) represents a mixture of genetic material from multiple organisms, potentially including more than one
Entamoeba species if cases of mixed infections. Under the current Code, both environmental samples and eDNA are therefore problematic as potential type material, because it may be difficult or even impossible to unambiguously associate the DNA with the organism of interest and to exclude co-occurring taxa [
328].
In absence of the name-bearing type, the RL terminology seems to be for the moment the best alternative available. A subject of future debate will be the nomenclature of new species erected from RLs or other newly recognized lineages. In this review, we have listed a large number of
Entamoeba species that, although validly named and published, were in some cases erected without adequate comparison with other species in the genus or lacked a name-bearing type, as no image or drawing was included in the original publication. In some instances, these species were subsequently redescribed, whereas in others no additional descriptions have ever been provided. In many cases, regardless of the quality of the original description, they were later treated as synonyms. Because synonymies are taxonomic hypotheses rather than nomenclatural acts, a name previously regarded as a subjective synonym may be reinstated if new evidence supports its recognition as a distinct species, in accordance with the principle of priority (the Code, Art. 23). Such reinstatement does not constitute a new description, and the authorship and date remain those of the original publication. However, redescriptions and, where necessary, the designation of a neotype (the Code, Art. 75) may be required to stabilize the application of the name. It should be emphasized that a redescription does not in itself imply the establishment of a neotype. Moreover, the type locality and, in the case of parasites, the host species are those of the original type, and any neotype should originate “as nearly as practicable from the original type locality and, where relevant, from … the same host species as the original name-bearing type” (the Code, Art. 75.3.6). In the study of Clark and colleagues [
196], they resurrected the name
E. suis for an
Entamoeba species detected in pigs from Vietnam. In that work, the authors added a redescription based on morphological and genetic data and provided new images. While this approach supports the biological coherence and host association of the species, it does not satisfy the geographical requirements for neotype designation under the Code, because
E. suis was described in pigs from the United States [
271]; consequently, the Vietnamese isolate is best regarded as representative material supporting species recognition and the reinstatement of the name, but it cannot be considered a formal neotype. In the same study, the authors applied the name
E. equi to an
Entamoeba isolate from a horse. This species was originally described by Fantham [
175], but no name-bearing type was designated: the original description did not include drawings or photographs, and if a hapantotype was ever available, it is no longer accessible. As no images of
E. equi were provided by Clark and coworkers, the genetic data they presented contribute additional information but do not resolve the typification problem. Consequently,
E. equi currently lacks a holotype or syntypes and should be regarded as a
species inquirenda, that is, a species of doubtful identity requiring further investigation.
Another problem arises when newly detected lineages cannot be confidently assigned to any previously described species, whether synonymized or not. For example, several
Entamoeba species were historically described from different ruminant hosts and later synonymized on morphological grounds (see
Section 7.3). Genetic analyses now demonstrate that the morphospecies
E. bovis comprises multiple cryptic lineages that are not host-specific and may occur across several ruminant species. In such systems, where host specificity is broad and original descriptions were based on limited or ambiguous diagnostic information, the application of historical species names to newly delimited cryptic lineages is not straightforward. On the one hand, the occurrence of a lineage in the same general host group from which a species was originally described may support the pragmatic reuse of an available name, particularly if there is no evidence that multiple lineages co-occurred in that host at the time of the original description. On the other hand, the absence of verifiable links to name-bearing type material, combined with the recognition of multi-host cryptic diversity, introduces unavoidable uncertainty. Under these conditions, the reuse of historical names represents a taxonomic hypothesis rather than a definitive assignment, and alternative approaches, including the introduction of new names, may be equally justifiable. We therefore consider that decisions regarding name reuse should be made on a case-by-case basis, with explicit acknowledgment of the underlying assumptions and of their potential implications for nomenclatural stability.
This historically oriented review shows that the taxonomy of
Entamoeba has been shaped not only by the scarcity of robust diagnostic criteria and by differing opinions on their applicability but also by incomplete awareness of earlier literature. Several species names proposed in the late 19th and early 20th centuries subsequently fell into obscurity, occasionally leading later authors to assign newly detected isolates to a single available name simply because alternative names were unknown. In some cases, authors argued pragmatically for the continued use of an existing name in order to avoid leaving an organism unnamed (as in the case of
E. equi; see
Section 7.1). Although such decisions reflect a legitimate concern for terminological continuity, they also illustrate how the absence of a comprehensive historical framework can constrain taxonomic interpretation. By recovering and reassessing overlooked species names, the present review clarifies that more than one historical taxon may be applicable in some cases, thereby exposing the limitations of name assignment based primarily on convenience rather than explicit evidence. Even when these legacy names cannot be applied unambiguously, their recognition is essential to prevent the uncritical consolidation of distinct biological entities under a single name and to ensure that future taxonomic decisions are made within a complete, transparent, and historically informed nomenclatural context.
9. Toward an Operational Framework for Species Delimitation and Nomenclature in Entamoeba
Based on the historical analysis presented in this review and on current methodological possibilities, it is evident that future progress in the taxonomy of Entamoeba requires an explicit, coherent, and operational framework for species delimitation, identification, and naming. Such a framework must reconcile three constraints that have historically been difficult to integrate in this genus: (i) extreme morphological conservatism and cryptic diversity, (ii) frequent host sharing and incomplete host specificity, and (iii) a complex legacy of historical names of uneven diagnostic value. We propose the following general principles as a pragmatic synthesis to guide future taxonomic work on Entamoeba.
First, species delimitation should be explicitly framed within a lineage-based species concept, in which species are understood as independently evolving metapopulation lineages. Under this view, any line of evidence that supports evolutionary independence (such as molecular, morphological, ecological, or biological data) may be sufficient to support species delimitation. In Entamoeba, however, as shown in previous sections and it is indicated below, neither host species nor geographic origin should be regarded as valid criteria on their own for species delimitation, and morphology is frequently conserved across divergent lineages, such that genetic divergence often represents the only informative criterion. The use of multiple, congruent sources of evidence strengthens species hypotheses and remains desirable where feasible, but the absence of corroboration from additional data does not preclude species-level divergence based solely on genetic data in asexual protists. Rather, the decisive factor is whether the available evidence (genetic or otherwise) demonstrates evolutionary independence.
Second, morphological criteria should continue to be used as a primary tool for specimen identification and for assigning isolates to one of the traditionally recognized morphological groups (histolytica, coli, bovis, gingivalis). However, given the near absence of reliable interspecific morphological differences within these groups, morphology alone should not be considered sufficient for species delimitation except in rare cases where clear and consistent differences are demonstrable (for example, distinctive cyst structure, or clear size differences).
Third, host species and geographical origin should be regarded as contextual but not decisive evidence. Both criteria may contribute to species delimitation when supported by additional independent evidence, especially when infections are restricted to distinct zoological groups or to defined regions. Host specificity alone should not justify the erection of new species, nor should shared host range automatically lead to synonymization. The demonstrated capacity of several Entamoeba species to infect multiple, phylogenetically distant hosts, and the frequent occurrence of multiple species in a single host, preclude a simplistic host-based taxonomy.
Fourth, molecular data should play a central role in species delimitation, but their use must go beyond measures of overall sequence similarity. While SSU rRNA gene sequences have proven invaluable for revealing hidden diversity, the mere presence of divergent lineages is not, by itself, sufficient to define ribosomal lineages or species. Qualitative molecular criteria, such as the presence of compensatory base changes in conserved regions of the SSU rRNA secondary structure, provide a more robust, reproducible, and conceptually sound basis for delimiting independently evolving lineages. Qualitative criteria reduce subjectivity and align molecular evidence with diagnostic principles traditionally required in taxonomy.
Fifth, the current use of informal categories such as subtypes, ribosomal lineages, conditional lineages, and sequence types has been useful as an interim solution to manage increasing genetic data. However, because many of these entities likely correspond to biological species rather than intraspecific variants, their definitions and taxonomic status should be explicitly re-evaluated. Lineages supported by consistent qualitative molecular differences should be treated as candidate species and formally assessed under an integrative framework, rather than remaining indefinitely in a provisional nomenclatural limbo.
Finally, nomenclatural decisions must be grounded in both biological evidence and historical rigor. When a newly delimited lineage can be reasonably linked to a previously described species (based on host association, morphology, geography, and the absence of conflicting evidence) the reuse and reinstatement of an available name should be preferred, in accordance with the principle of priority. Such actions should be accompanied by detailed redescriptions and, where necessary, by the designation of neotypes following the provisions of the International Code of Zoological Nomenclature. Conversely, when historical names cannot be applied with confidence, the erection of new species names is justified and may be preferable to forced or ambiguous name reuse. In all cases, the underlying assumptions and uncertainties should be stated explicitly.
In conclusion, the long and often contentious taxonomic history of Entamoeba reflects not only methodological limitations but also shifting conceptual frameworks and incomplete engagement with earlier literature. By integrating historical analysis with modern molecular approaches and explicit species concepts, it is now possible to move toward a more stable, transparent, and biologically meaningful taxonomy. The framework outlined here does not aim to provide rigid rules but rather a coherent set of guiding principles to ensure that future taxonomic decisions in Entamoeba are reproducible, evidence-based, and consistent with both evolutionary theory and nomenclatural practice.