Next Article in Journal
Microbial Community Imbalance Drives Nitrous Oxide Emissions from Strongly Acidic Soil—Insights from a Laboratory Experiment with Microbial Inhibitors
Previous Article in Journal
Recent Advances on Chitosan-Based Thermosensitive Hydrogels for Skin Wound Treatment
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Apicortin, a Putative Apicomplexan-Specific Protein, Is Present in Deep-Branching Opisthokonts

Institute of Molecular Life Sciences, HUN-REN Research Centre for Natural Sciences, Magyar Tudósok Körútja 2, 1117 Budapest, Hungary
Biology 2025, 14(6), 620; https://doi.org/10.3390/biology14060620
Submission received: 2 April 2025 / Revised: 24 May 2025 / Accepted: 25 May 2025 / Published: 28 May 2025

Simple Summary

Apicortin is an enigmatic protein that was first described in 2009 as a protein characteristic of apicomplexan parasites. These unicellular organisms cause serious illnesses in humans and domestic animals. The most infamous apicomplexan species is Plasmodium falciparum, which causes malaria and kills over 1 million people each year. Other apicomplexans are responsible for numerous infectious diseases in wild and domesticated animals, such as coccidiosis, babesiosis, and toxoplasmosis, resulting in a significant economic burden for animal husbandry. In contrast to bacterial pathogens, these apicomplexan parasites share many metabolic pathways with their animal/human hosts. This makes drug development extremely difficult—a drug that harms an apicomplexan parasite is also likely to harm its human host. In this study, we show that apicortin also occurs in early-branching (very simple) animals and fungi, as well as in ‘Choanoflagellata’, which are the closest relatives of animals. This knowledge is important since it facilitates a better understanding of the features and functional roles of this protein, thus indirectly contributing to the fight against apicomplexan parasites.

Abstract

Apicortin, a tubulin/microtubule-binding protein, was first described in 2009 as a protein characteristic of apicomplexans; it was found to be present in all Apicomplexa genomes already sequenced. Apart from these, it was found only in Trichoplax adhaerens, the only known representative of Placozoa at the time. Subsequent analyses revealed that it is present in both closely and distantly related taxa of Apicomplexa (Chrompodellids, Squirmids, Dinoflagellates, and Perkinsids, i.e., in Myzozoa). On the other hand, it turned out that it is also present in early-branching fungi that reproduce by zoospores. Now, we have shown that apicortin is found in many deep-branching opisthokonts. In addition to these fungi and T. adhaerens, it is also present in other simple animals, including further Placozoa and Ctenophora, and another opisthokont clade, choanoflagellates. However, apicortin-homologous sequences detected in the genomes/transcriptomes of bilaterian animals are the result of contamination.

1. Introduction

Apicortin contains two tubulin/microtubule-binding domains that are connected by a linker of 30–40 amino acids. The N-terminal part contains the so-called partial p25alpha domain; the DCX (doublecortin) domain is in the C-terminal half [1]. In apicomplexan apicortins, the partial p25alpha domain is preceded by a long (70–80 aa) disordered region [2], as shown in Figure 1. The presence of the p25alpha domain (either the complete domain or its shorter forms) is characterized by a specific phylogenetic distribution: it is closely related to the existence of the eukaryotic flagellum/cilium [3,4]. In apicortin, only the C-terminal third of the complete p25alpha domain is present. Its amino acid sequence is quite conservative and usually contains the conserved motif GXGXGXXGR, often referred to as a ‘Rossmann-like motif’ [5]. The DCX domain is named after doublecortin, a brain-specific X-linked gene/protein [6]. The evolutionarily conserved DCX domain is about 80 amino acids long and usually occurs as a tandem repeat, mainly in animals [7]. It also occurs as a single domain in protists.
Apicortin was first described in 2009 as a protein characteristic of apicomplexans; it was found to be present in all Apicomplexa genomes sequenced up to that point [1]. Apart from these, it was found only in Trichoplax adhaerens, the only known representative of Placozoa at that time. Later, it turned out that it is present in both closely and distantly related taxa of Apicomplexa (Chrompodellids, Squirmids, Dinoflagellates, and Perkinsids, collectively termed Myzozoa) [8,9]. On the other hand, it became evident that it is also present in early-branching fungi, which reproduce by zoospores [4].
The presence of apicortin in Opisthokonta (T. adhaerens, flagellated fungi) raised the possibility that it can be found in further opisthokont species. Now, it will be shown that apicortin is found in other deep-branching opisthokonts. In addition to flagellated fungi and T. adhaerens, it is also present in other simple animals, including further placozoan species and Ctenophora, and in another opisthokont clade, choanoflagellates. However, apicortin-homologous sequences found in the genome/transcriptome of bilaterian animals are usually the result of contamination. Genome and transcriptome assembly data often contain DNA and RNA contaminations, originating from other organisms introduced during nucleotide sequencing, e.g., a large-scale search identified millions of contaminated entries in GenBank [10]. Similarly, various genomes and proteomes have been shown to be contaminated with genes/proteins of apicomplexan parasite origin as well [11,12,13].

2. Materials and Methods

The accession numbers of protein and nucleotide sequences refer to the National Center for Biotechnology Information (NCBI) GenBank database unless otherwise stated. NCBI Blast was used to search databases (http://www.ncbi.nlm.nih.gov/BLAST/ (accessed on 20 February 2025)) [14]. Whole sequences of various apicortins were used as queries against protein and nucleotide databases (including transcriptome shotgun assemblies [TSAs] and expressed sequenced tags [ESTs]) to find similar sequences in Opisthokonta using BLASTP and TBLASTN programs, respectively. The queries were T. adhaerens XP_002111209, Babesia bovis XP_001609847, Plasmodium falciparum XP_001351735, Toxoplasma gondii XP_002364910, Jimgerdemannia flammicorona RUS30044.1, and Spizellomyces punctatus XP_016606225.1. The hits were considered if the BLAST E-value was lower than 1 × 10−5; the query and the hit were reciprocal best hits [15], and the hit contained both the partial p25alpha and DCX domains. Then, the hits were used as queries in BLASTP and BLASTX searches to find the most similar protein for each. Genome GC ratios were retrieved from NCBI genome datasets (https://www.ncbi.nlm.nih.gov/datasets/genome/ (accessed on 8 May 2025)).
Multiple alignments of sequences were conducted by the Clustal Omega program (https://www.ebi.ac.uk/jdispatcher/msa/clustalo/ (accessed on 8 May 2025)) [16]. The N-terminal amino acids before the p25alpha domain were trimmed. Phylogenetic trees were constructed based on Bayesian and maximum-likelihood (ML) methods. Bayesian analysis, using MrBayes v3.1.2 [17], was performed to construct phylogenetic trees. The WAG model [18] with rate variation among sites, allowing for some sites to be evolutionarily invariable (WAG + G + I), and default priors were used. Two independent analyses were run with three heated and one cold chain (temperature parameter 0.2) for 1.6 × 10−6 generations, with a sampling frequency of 0.01, and the first 25% of generations were discarded as burn-in. The two runs were convergent, shown by the fact that the value of the potential scale reduction factor (PSRF) [19] approached 1. The MEGA11 program [20] was used for ML analysis. Bootstrap values were calculated by bootstrap analyses of 1000 replicates with the WAG + G + I model.

3. Results

3.1. Search for New Apicortins

Whole sequences of various apicortins (cf. Methods) were used as queries against protein and nucleotide NCBI databases to find similar sequences in Opisthokonta using BLASTP and TBLASTN analyses, respectively. The new hits were analyzed to determine whether they were real apicortins or the result of genome contamination.
The new apicortin-like sequences are listed in Table 1. Only one new (previously unknown) protein sequence was found, namely, Hylaeus volcanicus XP_053992704. H. volcanicus is an insect that undergoes complete metamorphosis and belongs to the Colletidae family. There were no new hits among the ESTs.
There were many more new hits among transcriptome (TSA) sequences (Table 1). Only hits in which both the p25alpha and DCX domains are present were considered. Most of the hits are sequences found in insect transcriptomes. Others belong to Amphibia, Crustacea, Placozoa, Ctenophora, and Choanoflagellata. Of course, a hit does not automatically mean that it is a real apicortin, as it may also indicate contamination of the given genome/transcriptome. This may especially be likely in species belonging to Bilateria.
The line in Table 1, ‘Placozoa-consensus’, refers to the sequence published in Ref. [21]. It is a consensus sequence based on several newly sequenced placozoan species, which have not yet been deposited in the NCBI database.
These hits were used as queries in BLASTP and BLASTX searches to find the most similar protein for each sequence (Table 2). In most cases, the most similar protein was found in Apicomplexa; in other cases, in Placozoa (T. adhaerens), in Chrompodellids (Vitrella brassicaformis), and in Fungi (Lobulomycetales sp., Blyttiomyces sp.). The sequence identities, in general, are high (and the E-values are low); if not, it will be discussed later. Often, in the case of longer sequences, the TSA hit corresponds to more than one protein, and only one of them is an apicortin. This happens when the TSA is an ‘insect’ sequence, and the proteins are apicomplexan, most often Gregarina niphandrodes (as listed in Table 3). Some of these proteins have been found only in this species and do not possess any ortholog at all (XP_011128896, XP_011128900, XP_011128901, and XP_011128903).

3.2. Phylogenetic Analysis

Phylogenetic trees were constructed using Bayesian (Figure 2) and ML (Figure 3) methods. In addition to the newly identified apicortin-like sequences, apicomplexan (generally one per genus), chrompodellid (two per genus), a squirmid (Digyallum oweni), and fungal and placozoan apicortins were also included in the analysis. Some of the proteins contain a long N-terminus, which was not included in the alignment on which the analyses were based. Three apicortin-like sequences, previously shown to be present as contaminants in some species, were also included in the analyses. These are Aleochara curtula GATW02017439 [8,22], Porites astreoides GEHP01467367 [22], and Rhipicephalus microplus JT844686 [8].
The phylogenetic trees are quite similar; most importantly, there is a clade in both trees that contains all apicomplexan apicortins (“Apicomplexan clade”). The Bayesian tree reveals another clade containing fungal, placozoan, choanoflagellate, and ctenophora sequences (“Opisthokonta clade”). Within the latter clade, there are two sister groups: one of them contains apicortins of flagellated fungi and those of the Choanoflagellata, and the other one contains placozoan, ctenophora, and Jimgerdamannia apicortins. J. flammicorona is the only non-flagellated fungus that possesses an apicortin [4]. The ML tree also contains these two Opisthokonta groups, but they are not in a sister position.
One pair of chrompodellid apicortins (Chromera velia Cvel_6797 and V. brassicaformis CEM12737) is related to the apicomplexan clade; the other one differs significantly (C. velia Cvel_28653 and V. brassicaformis CEM06711), as can be seen from their position on the trees in accordance with data in the literature [8]. Bilaterian animal sequences are mostly located in the apicomplexan clade, except for Loxomitra sp. and Acanthochitona fascicularis sequences. The five insect-related apicortin-like sequences listed in Table 3, which are highly similar to G. niphandrodes apicortin, are sister to this Gregarinasina apicomplexan apicortin. Together, they are sisters to Oryctes rhinoceros and A. curtula sequences, which are also similar to G. niphandrodes apicortin (Table 1 and Ref. [8]).

3.3. Domains of Apicortin

Figure 4 and Figure 5 show the multiple alignments of the two characteristic domains of apicortins.
The partial p25alpha domain, in general, is the most conservative part of the molecule; the sequences are very similar, independent of whether the protein can be found in myzozoan or opisthokont species. However, there are three exceptions. Rosella allomyces (Fungi), P. falciparum (Apicomplexa), and Ctenophora sp. (Opisthokonta) lack the final part of this domain, which includes the Rossmann-like motif (Figure 4). The reason and the possible functional consequences are not known. Interestingly, in the Plasmodium genus, only species that parasitize mammals lack the Rossmann-like sequence. Species that occur in birds or reptiles (Plasmodium gallinaceum, Plasmodium relictum) do have this part of the sequence of the molecule (Figure 4). In the DCX domain, the overall similarity is somewhat lower between the two groups than in the case of the partial p25alpha domain; however, there is no exception: the similarity occurs through the whole domain in all orthologs (Figure 5).

4. Discussion

4.1. Phylogenetic Occurrence and Possible Function of Apicortins

Apicortin was found in 2009 as a characteristic protein of Apicomplexa [1]. This finding remains consistent: it is present in all Apicomplexa genomes [9], except in the smallest one, Babesia microti [23]. The namesake of Apicomplexa is the ‘apical complex’. One of the structural components of the apical complex is the conoid, which plays an important role in the invasion of the host cells by these parasites. It is likely that apicortin has some kind of role in providing the correct structure and function of the conoid. In T. gondii, apicortin was shown to be localized exclusively at the conoid and is essential for its structure and function [24,25]. A conoid consists of tubulin-based fibers. Thus, apicortin, which contains two tubulin-binding domains, p25alpha and DCX, is an ideal protein for its stabilization. P. falciparum does not have a conoid; here, apicortin is localized at the apical end of the parasites and is involved in the formation of the apical complex [26,27]. The observation that the drug tamoxifen hinders the growth of the asexual blood stage P. falciparum, which was attributed to the disruption of the apicortin–tubulin interaction observed in vitro, provides further evidence that apicortin has a critical function in apicomplexan microtubule function [26,27]. These data suggest that the presence of apicortin is correlated with that of the conoid and apical complex. This suggestion is corroborated by the fact that apicortin is present in other Myzozoa, namely, chrompodellids, squirmids, dinoflagellates, and perkinsids, which also possess a conoid or a conoid-like structure [28]. Moreover, it was demonstrated that the presence of apicortin is important in the parasite–host interaction; downregulation of apicortin leads to impaired host cell invasion [24,25,26].
Apart from these occurrences, apicortin was found originally in T. adhaerens, the only known representative of Placozoa at the time, where it is one of the most abundant proteins [29]. Placozoans belong to the simplest animals [30]; however, their exact phylogenetic position is debated [31,32,33,34]. Later, it became evident that in early-branching fungi, which reproduce by zoospores, apicortin is also present [4].
In the present work, it has been shown that not only T. adhaerens but also other recently discovered placozoans have apicortin, as well as another deep-branching animal, a ctenophore. Trichoplax sp. H2 [35] is a close relative of T. adhaerens; the apicortins of the two species are identical. A placozoan apicortin consensus sequence based on several newly sequenced placozoan species from the orders Hoilungea, Cladhexea, and Trichoplacea was also published [21] and used to produce a phylogenetic tree. This sequence shows 86% identity and 93% similarity with that of a T. adhaerens apicortin. Ctenophores, commonly known as comb jellies that inhabit marine waters worldwide, are one of the simplest non-bilaterian animals that are in strong “competition” with sponges for the title “the sister group to all other animals” [34,36,37]. The apicortin sequence of Ctenophora sp. shares a 31.25% identity and 47.22% similarity with that of the choanoflagellate Helgoeca nana. These values are about the same as those obtained in comparison with Ctenophora sp. and T. adhaerens apicortins (26.24% and 47.27%). This identity value is the lowest (and the E-value is the highest) in comparison with the other values discussed here. However, they are still significant enough to establish orthology between molecules of different phyla; T. adhaerens XP_002111209 and Ctenophora HBZJ01045600 are reciprocal best hits of each other. The ‘reciprocal best hit’ method helps to reveal a 1:1 orthology even in cases when the BLAST E-score is higher than 1 × 10−10 [15].
In addition to apicomplexans and ctenophores, Porifera (sponges) and Cnidaria are phyla of simple animals that do not belong to Bilateria. These phyla do not contain apicortin, but they do contain another p25alpha domain-containing protein, TPPP (tubulin polymerization promoting protein) [38]. On the one hand, it indicates that the correlation between the incidence of the p25alpha domain and that of the eukaryotic flagellum remains consistent; on the other hand, it suggests that apicortin and TPPP have similar functions, namely, stabilization of tubulin polymers. (An excellent review about TPPP, including its function in animals and humans, has been published recently [39].) In a cnidarian species, Porites astreoides, an apicortin-like sequence was found; however, it was shown to be a contamination of apicomplexan origin [22]. Its GC ratio (52.3%) is much higher than either the genome-wide GC ratio of P. astreoides (40%) [40] or that of the other animal (T. adhaerens) apicortin gene (39.7%).
Additionally, three species belonging to the Choanoflagellata [41], a sister group [42] to animals (Metazoa), also possess apicortin. (Choanoflagellata and Metazoa together form the Choanozoa clade [43].) All three Choanoflagellata belong to the Acanthoecidae family of the Acanthoecida order; apicortin appears to be absent in the other order of Choanoflagellates, Craspedida. In the species of this order, TPPP is present.
Of course, these opisthokont species do not have a conoid or apical complex. However, as mentioned, there is a strong correlation between the occurrence of the p25alpha domain and that of the eukaryotic flagellum/cilium, which are tubulin/microtubule-based structures. Apicomplexans also possess flagella that can only be found in male microgametes of these parasites; moreover, an evolutionary connection between the flagellum and apical complex was proposed [28,44]. Recent results suggest that the conoid complex evolved from flagellar components [45,46] or from the flagellar root apparatus [28,47]. Flagella and cilia are present in all animals; however, most fungi lack them, except the deeper branching clades that reproduce by motile zoospores propelled by a single, posteriorly oriented flagellum [48]. It was shown in several publications that there is a strict correlation between the incidence of the p25alpha domain and the flagellum in fungi, too [4,49,50]. As their name indicates, choanoflagellates also have a single flagellum.
The opisthokont apicortins lack the long, disordered N-terminal region [2,4], which was suggested to be necessary for the formation of the proper conoid structure in apicomplexans [25]. However, opisthokont apicortins do contain the two microtubule-binding domains (partial p25alpha and DCX); thus, it can be hypothesized that they play a role in the stabilization of flagellar microtubules. As long as this is just a hypothesis and we do not know the definitive answer, we also cannot answer the interesting question of why apicortin is lost in Bilateria.

4.2. Apicortin-like Contaminations in Animals

While the new choanoflagellate, placozoan, and ctenophora sequences are genuine apicortin hits, the other animal (Bilateria) hits appear to be from contamination. This is shown by their position on the phylogenetic trees. Most of them can be found within the apicomplexan clade, not within the Opisthokonta clade. If the homology between these sequences were due to real orthology, we would expect bilaterian species to be located in the Opisthokonta clade in the tree. If the bilaterian sequence is located within the Apicomplexa clade, then contamination or horizontal (lateral) gene transfer (HGT) may have occurred. HGT does not often happen from apicomplexans to animals (at least, the author has no information about it), although it is difficult to rule it out completely; however, there are examples in the opposite direction, from animals to Apicomplexa [51,52]. Bilaterian sequences that resemble the apicortin of G. niphandrodes (which is a Gregarinasina) (Table 2) are sisters to it and probably originate from (an) unknown yet non-sequenced Gregarinasina species. The high sequence identities (and low E-values) between such distant species also exclude the possibility of real orthology. Moreover, in several cases, the contamination is corroborated by a BLASTX search using the newly found TSAs as queries. In the case of longer sequences, the TSA hits correspond to more than one protein (two to ten), and only one of them is an apicortin. The other ones are various proteins, which are, in most cases, specific to Apicomplexa or even to a single apicomplexan species, such as G. niphandrodes (Table 3). The situation is the same in some other cases not shown in Table 3 (Schistocerca gregaria and Ptilocerembia catheriane). It is highly unlikely that 5–10 genes would have been transferred to animals at once by HGT; in these cases, it is much more likely that their presence was caused by contamination. Two further sequences in this ‘Gregarinasina’ clade are from A. curtula and O. rhinoceros, which are also similar to G. niphandrodes apicortin. A. curtula GATW02017439 was shown to be of Gregarinasina origin [8,22]; O. rhinoceros GHNO01082742 seems to also be a ‘foreign body’ in the insect transcriptome. This is suggested not only by its position in the tree and its sequence similarity to G. niphandrodes apicortin but also by its GC ratio of 43.1%, which is much higher than that of the whole genome (34.9%) [53] or the flanking TSA, GHNO01082741 (30.6%). The latter nucleotide is a real insect one, similar to TSAs in the same insect family, Scarabaeidae.
The Rhipicephalus microplus JT844686 sequence is sister to B. bovis apicortin, not surprisingly, since the sequence was shown to be of B. bovis origin [8]. In other cases, the source of the contamination is not so trivial. The Spea multiplicate TSA sequence, which is highly similar to T. gondii and Neospora caninum apicortins (Table 1 and Table 2), is within a clade on the tree that contains apicortins of the Eimeriorina suborder, to which these two apicortins belong.
On the Bayesian tree, there are only two bilaterian sequences in the Opisthokonta clade. Both of them are the most similar to T. adhaerens apicortin among the proteins of the NCBI database. Loxomitra sp. GIMU01103700 shows 85% identity and 91% similarity with T. adhaerens apicortin (Table S2). In comparison with the placozoan consensus apicortin sequence [21], these values are even higher: 89% identity and 95% similarity. These values are in the same range as between T. adhaerens and the placozoan consensus sequence (86% and 93%). Thus, it seems to be obvious that the sequence found in Loxomitra sp. originated from a placozoan species. This very high sequence identity suggests contamination, as HGT would have occurred some time ago, and the sequence may have changed significantly since then.
The habitat of placozoans and that of the species of the Loxosomatidae family are similar in the coastal zones of tropical and subtropical seas on various substrates such as rocks and corals [54]; thus, it can be imagined that the Loxomitra sample was indeed contaminated. The same phenomenon may be true for A. fascicularis (velvety mail shell) as well, which also lives in similar conditions; moreover, Trichoplax spp. are known to inhabit the shells of mollusks as well [55]. However, the identity/similarity between placozoan apicortins (Trichoplax spp., placozoan consensus sequence) and A. fascicularis is significantly lower (cca. 40/60%) than in the case of Loxomitra sp. It should be noted, however, that the A. fascicularis sequence is as similar to the chrompodellid V. brassicaformis apicortin (isolated from coral reefs) as it is to placozoan apicortins. However, it is interesting to compare the GC ratio of these apicortins/apicortin-like sequences: T. adhaerens 39.7%, V. brassicaformis 59.8%, and A. fascicularis 69.7%. This extremely high GC ratio is much higher than that of the whole genome of A. fascicularis (41%), suggesting the foreign origin of the apicortin-like sequence. Based on the similarity and GC ratio data, no definite statement can be made regarding the origin of the A. fascicularis sequence at this time. However, T. adhaerens cannot be the source of either contamination (cf. relatively low sequence similarity) or horizontal gene transfer (very different GC ratio).
Finally, the Tigriopus californicus sequence has different positions in the trees; in the Bayesian tree, it is located in the Apicomplexan clade, while in the ML tree, it is located outside the Apicomplexan clade in a sister position to the Opisthokonta clade. The T. californicus sequence shows a relatively low similarity to apicortin sequences (low identity percent, high E-value), so contamination is unlikely. It might be hypothesized that it is either a relic that has been uniquely preserved in this species, or rather its presence is the result of an ancient HGT. The latter case is made slightly more likely by the fact that the GC ratio of the apicortin-like sequence (39.0%) is slightly lower than the GC ratio of the whole genome of this species (42%).
In general, the presence of apicomplexan-derived contaminants in different genomes is not surprising. This has been previously pointed out by several authors [11,12,13]. In the case of wild animals, it is not possible to avoid infection by parasites before sequencing. Computational filters applied to the draft sequences are not always able to identify sequences of foreign origin. The taxonomic distribution of known complete genomes is not uniform. In the case of apicomplexan species, the genome/transcriptome is mainly known for species of medical/veterinary importance; however, for example, in the case of Gregarinasina, there are only a few fully sequenced species, although, as this study shows, they can often be sources of contamination. The knowledge of additional genomes will facilitate the bioinformatic filtering of contaminants and the identification of the source of contamination.

4.3. More General Evolutionary Considerations

For better visualization of the phylogenetic occurrence of apicortin, its presence/absence is shown on a eukaryotic evolutionary tree (Figure 6). It is based on earlier published trees [56,57] and shows the major lineages in eukaryotic supergroups, with some lineages lumped and others extended. Summarizing, it can be established that apicortin is present in two main clades. According to the more recent view, the two major eukaryotic groups are Opimoda+ and Diphoda+ [56,58]. Apicortin occurs in both ‘supergroups’. Its presence is limited to Myzozoa in Diphoda+ and to Opisthokonta in Opimoda.
Myzozoan apicortins were discussed in detail recently [9]. Now, it is just a reminder that apicortin was originally thought to be Apicomplexa-specific, but then it slowly became clear that it also occurs in other Myzozoa [9,10]. However, it is absent in Ciliophora, the sister group of Myzozoa [9]. This can be considered a fact, as several species belonging to Ciliophora have long had their whole genomes sequenced. Instead, they have another p25alpha domain-containing protein (usually several paralogs in a species), the so-called short-type TPPP [5].
Apicortin seems to be present in several early diverging lineages of Opisthokonta: early-branching Fungi, Choanoflagellata, and early-branching Metazoa. The Opisthokonta clade is composed of two main branches: Holomycota [59] and Holozoa [60]. The first contains, in addition to other groups, Fungi; the second contains, among others, Choanozoa [43]. The occurrence of apicortin in both main branches indicates that it was present in the Opisthokonta common ancestor. However, it seems that there were a few independent losses of this gene. Terrestrial fungi, which do not have zoospores, lost both flagella and apicortin. Some smaller Ophistokonta groups, such as Filasterea or Ichthyosporea, do not possess apicortin. Although as genome/transcriptome projects go on, this view may change, the lack of apicortin in these groups is in accordance with the fact that Filasterea and Ichthyosporea also lost their flagellum [41]. In Metazoa, Porifera and the (Cnidaria + Bilateria) clade also lack apicortin; instead, they contain another p25alpha domain-containing protein, TPPP. It means two further losses in Porifera and in the common ancestor of Cnidaria and Bilateria. It is worth noting that Figure 6 represents the most accepted view that Placozoa is sister to (Cnidaria + Bilateria); however, there is also support for the case that (Placozoa + Cnidaria) is sister to Bilateria [37,61]. If Placozoa forms a clade with Cnidaria, it would imply another gene loss in the ancestor of Cnidaria.
This phylogenetic distribution suggests that apicortin was either present in the last eukaryotic common ancestor and lost in many lineages or it underwent a horizontal transfer involving an Apicomplexa/Myzozoa to an ancestral opisthokont.

5. Conclusions

Apicortin was thought to be specific to the phylum Apicomplexa [1]. This statement is still valid; it is present in almost all apicomplexan genomes [9]. However, it has become obvious that its incidence is significantly broader. On the one hand, it can be found in Myzozoa (i.e., in Alveolata phyla, except Ciliata); on the other hand, it is present in simple Opisthokonta as well. The latter includes early-branching flagellated Fungi [4], Choanoflagellate, the sister clade to Metazoa (animals), and simple animals, such as Placozoa [1] and Ctenophora. However, it is absent in bilaterian animals. Apicortin-homologous sequences in the genome/transcriptome of these animals are the result of contamination. In some cases, its source can be exactly identified; in other cases, only the genus, family, or order can be suggested.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/biology14060620/s1, Table S1: Accession numbers of proteins/TSAs/SRAs shown in Figure 2, Figure 3, Figure 4 and Figure 5 [62]. Table S2: Pairwise identity/similarity between three apicortin-(like) sequences. Figure S1: Multiple alignments of apicortins. Figure S2: Bayesian tree for Figure 2. Figure S3: ML tree for Figure 3.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available in this paper and in the Supplementary Material.

Acknowledgments

The author thanks Judit Oláh (Institute of Molecular Life Sciences) and Miklós Müller (Rockefeller University) for their careful reading of the manuscript.

Conflicts of Interest

The author declares no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
BLASTBasic Local Alignment Search Tool
BPPBayesian Posterior Probability
ESTExpressed Sequenced Tag
DCXDoublecortin
HGTHorizontal Gene Transfer
MLMaximum Likelihood
NCBINational Center for Biotechnology Information
PSRFPotential Scale Reduction Factor
SRASequence Read Archive
TSATranscriptome Shotgun Assembly
TPPPTubulin Polymerization Promoting Protein

References

  1. Orosz, F. Apicortin, a unique protein, with a putative cytoskeletal role, shared only by apicomplexan parasites and the placozoan Trichoplax adhaerens. Infect. Genet. Evol. 2009, 9, 1275–1286. [Google Scholar] [CrossRef] [PubMed]
  2. Orosz, F. Apicomplexan apicortins possess a long disordered N-terminal extension. Infect. Genet. Evol. 2011, 11, 1037–1044. [Google Scholar] [CrossRef] [PubMed]
  3. Orosz, F.; Ovádi, J. TPPP orthologs are ciliary proteins. FEBS Lett. 2008, 582, 3757–3764. [Google Scholar] [CrossRef]
  4. Orosz, F. On the TPPP-like proteins of flagellated Fungi. Fungal Biol. 2021, 125, 357–367. [Google Scholar] [CrossRef]
  5. Tirián, L.; Hlavanda, E.; Oláh, J.; Horváth, I.; Orosz, F.; Szabó, B.; Kovács, J.; Szabad, J.; Ovádi, J. TPPP/p25 promotes tubulin assemblies and blocks mitotic spindle formation. Proc. Natl. Acad. Sci. USA 2003, 100, 13976–13981. [Google Scholar] [CrossRef]
  6. Sapir, T.; Horesh, D.; Caspi, M.; Atlas, R.; Burgess, H.A.; Wolf, S.G.; Francis, F.; Chelly, J.; Elbaum, M.; Pietrokovski, S.; et al. Doublecortin mutations cluster in evolutionarily conserved functional domains. Hum. Mol. Genet. 2000, 9, 703–712. [Google Scholar] [CrossRef]
  7. Reiner, O.; Coquelle, F.M.; Peter, B.; Levy, T.; Kaplan, A.; Sapir, T.; Orr, I.; Barkai, N.; Eichele, G.; Bergmann, S. The evolving doublecortin (DCX) superfamily. BMC Genom. 2006, 7, 188. [Google Scholar] [CrossRef]
  8. Orosz, F. Wider than thought phylogenetic occurrence of apicortin, a characteristic protein of apicomplexan parasites. J. Mol. Evol. 2016, 82, 303–314. [Google Scholar] [CrossRef] [PubMed]
  9. Orosz, F. p25alpha Domain-Containing Proteins of Apicomplexans and Related Taxa. Microorganisms 2023, 11, 1528. [Google Scholar] [CrossRef]
  10. Steinegger, M.; Salzberg, S.L. Terminating contamination: Large-scale search identifies more than 2,000,000 contaminated entries in GenBank. Genome Biol. 2020, 21, 115. [Google Scholar] [CrossRef]
  11. Orosz, F. Two recently sequenced vertebrate genomes are contaminated with apicomplexan species of the Sarcocystidae family. Int. J. Parasitol. 2015, 45, 871–878. [Google Scholar] [CrossRef] [PubMed]
  12. Borner, J.; Burmester, T. Parasite infection of public databases: A data mining approach to identify apicomplexan contaminations in animal genome and transcriptome assemblies. BMC Genom. 2017, 18, 100. [Google Scholar] [CrossRef] [PubMed]
  13. Xie, J.; Tan, B.; Zhang, Y.A. Large-scale study into protist-animal interactions based on public genomic data using DNA barcodes. Animals 2023, 13, 2243. [Google Scholar] [CrossRef]
  14. Altschul, S.F.; Madden, T.L.; Schäffer, A.A.; Zhang, J.; Zhang, Z.; Miller, W.; Lipman, D.J. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 1997, 25, 3389–3402. [Google Scholar] [CrossRef]
  15. Leka, K.P.; Wideman, J.G. An introduction to comparative genomics, EukProt, and the reciprocal best hit (RBH) method for bench biologists: Ancestral phosphorylation of Tom22 in eukaryotes as a case study. Methods Enzymol. 2024, 707, 209–234. [Google Scholar] [CrossRef] [PubMed]
  16. Sievers, F.; Wilm, A.; Dineen, D.; Gibson, T.J.; Karplus, K.; Li, W.; Lopez, R.; McWilliam, H.; Remmert, M.; Söding, J.; et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol. Syst. Biol. 2011, 7, 539. [Google Scholar] [CrossRef]
  17. Ronquist, F.; Huelsenbeck, J.P. MrBayes 3: Bayesian phylogenetic inference under mixture models. Bioinformatics 2003, 19, 1572–1574. [Google Scholar] [CrossRef]
  18. Whelan, S.; Goldman, N. A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol. Biol. Evol. 2001, 18, 691–699. [Google Scholar] [CrossRef]
  19. Gelman, A.; Rubin, D.B. Inference from iterative simulation using multiple sequences. Statist. Sci. 1992, 7, 457–472. [Google Scholar] [CrossRef]
  20. Tamura, K.; Stecher, G.; Kumar, S. MEGA11: Molecular Evolutionary Genetics Analysis Version 11. Mol. Biol. Evol. 2021, 38, 3022–3027. [Google Scholar] [CrossRef]
  21. Tessler, M.; Neumann, J.S.; Kamm, K.; Osigus, H.-J.; Eshel, G.; Narechania, A.; Burns, J.A.; DeSalle, R.; Schierwater, B. Phylogenomics and the first higher taxonomy of Placozoa, an ancient and enigmatic animal phylum. Front. Ecol. Evol. 2022, 10, 1016357. [Google Scholar] [CrossRef]
  22. Orosz, F. Does apicortin, a characteristic protein of apicomplexan parasites and placozoa, occur in Eumetazoa? Acta Parasitol. 2018, 63, 617–633. [Google Scholar] [CrossRef] [PubMed]
  23. Cornillot, E.; Hadj-Kaddour, K.; Dassouli, A.; Noel, B.; Ranwez, V.; Vacherie, B.; Augagneur, Y.; Brès, V.; Duclos, A.; Randazzo, S.; et al. Sequencing of the smallest Apicomplexan genome from the human pathogen Babesia microti. Nucleic Acids Res. 2012, 40, 9102–9114. [Google Scholar] [CrossRef]
  24. Nagayasu, E.; Hwang, Y.C.; Liu, J.; Murray, J.M.; Hu, K. Loss of a doublecortin (DCX)-domain protein causes structural defects in a tubulin-based organelle of Toxoplasma gondii and impairs host-cell invasion. Mol. Biol. Cell 2017, 28, 411–428. [Google Scholar] [CrossRef]
  25. Leung, J.M.; Nagayasu, E.; Hwang, Y.C.; Liu, J.; Pierce, P.G.; Phan, I.Q.; Prentice, R.A.; Murray, J.M.; Hu, K. A doublecortin-domain protein of Toxoplasma and its orthologues bind to and modify the structure and organization of tubulin polymers. BMC Mol. Cell Biol. 2020, 21, 8. [Google Scholar] [CrossRef]
  26. Chakrabarti, M.; Garg, S.; Rajagopal, A.; Pati, S.; Singh, S. Targeted repression of Plasmodium apicortin by host microRNA impairs malaria parasite growth and invasion. Dis. Model. Mech. 2020, 13, dmm042820. [Google Scholar] [CrossRef]
  27. Chakrabarti, M.; Joshi, N.; Kumari, G.; Singh, P.; Shoaib, R.; Munjal, A.; Kumar, V.; Behl, A.; Abid, M.; Garg, S.; et al. Interaction of Plasmodium falciparum apicortin with α- and β-tubulin is critical for parasite growth and survival. Sci. Rep. 2021, 11, 4688. [Google Scholar] [CrossRef] [PubMed]
  28. Dos Santos Pacheco, N.; Tosetti, N.; Koreny, L.; Waller, R.F.; Soldati-Favre, D. Evolution, composition, assembly, and function of the conoid in Apicomplexa. Trends Parasitol. 2020, 36, 688–704. [Google Scholar] [CrossRef]
  29. Ringrose, J.H.; van den Toorn, H.W.P.; Eitel, M.; Post, H.; Neerincx, P.; Schierwater, B.; Altelaar, A.F.M.; Heck, A.J.R. Deep proteome profiling of Trichoplax adhaerens reveals remarkable features at the origin of metazoan multicellularity. Nat. Commun. 2013, 4, 1408. [Google Scholar] [CrossRef]
  30. Schierwater, B. My favorite animal Trichoplax adhaerens. Bioessays 2005, 27, 1294–1302. [Google Scholar] [CrossRef]
  31. Srivastava, M.; Begovic, E.; Chapman, J.; Putnam, N.H.; Hellsten, U.; Kawashima, T.; Kuo, A.; Mitros, T.; Salamov, A.; Carpenter, M.L.; et al. The Trichoplax genome and the nature of placozoans. Nature 2008, 454, 955–960. [Google Scholar] [CrossRef] [PubMed]
  32. Dellaporta, S.L.; Xu, A.; Sagasser, S.; Jakob, W.; Moreno, M.A.; Buss, L.W.; Schierwater, B. Mitochondrial genome of Trichoplax adhaerens supports Placozoa as the basal lower metazoan phylum. Proc. Nat. Acad. Sci. USA 2006, 103, 8751–8756. [Google Scholar] [CrossRef] [PubMed]
  33. Najle, S.R.; Grau-Bové, X.; Elek, A.; Navarrete, C.; Cianferoni, D.; Chiva, C.; Cañas-Armenteros, D.; Mallabiabarrena, A.; Kamm, K.; Sabidó, E.; et al. Stepwise emergence of the neuronal gene expression program in early animal evolution. Cell 2023, 186, 4676–4693.e29. [Google Scholar] [CrossRef]
  34. Whelan, N.V.; Kocot, K.M.; Moroz, T.P.; Mukherjee, K.; Williams, P.; Paulay, G.; Moroz, L.L.; Halanych, K.M. Ctenophore relationships and their placement as the sister group to all other animals. Nat. Ecol. Evol. 2017, 1, 1737–1746. [Google Scholar] [CrossRef]
  35. Kamm, K.; Osigus, H.-J.; Stadler, P.F.; DeSalle, R.; Schierwater, B. Trichoplax genomes reveal profound admixture and suggest stable wild populations without bisexual reproduction. Sci. Rep. 2018, 8, 11168. [Google Scholar] [CrossRef]
  36. Schultz, D.T.; Haddock, S.H.D.; Bredeson, J.V.; Green, R.E.; Simakov, O.; Rokhsar, D.S. Ancient gene linkages support ctenophores as sister to other animals. Nature 2023, 618, 110–117. [Google Scholar] [CrossRef]
  37. Pandey, A.; Braun, E.L. Phylogenetic Analyses of Sites in Different Protein Structural Environments Result in Distinct Placements of the Metazoan Root. Biology 2020, 9, 64. [Google Scholar] [CrossRef] [PubMed]
  38. Oláh, J.; Szénási, T.; Szabó, A.; Kovács, K.; Lőw, P.; Štifanić, M.; Orosz, F. Tubulin binding and polymerization promoting properties of Tubulin Polymerization Promoting Proteins are evolutionarily conserved. Biochemistry 2017, 56, 1017–1024. [Google Scholar] [CrossRef]
  39. Diaz, P.J.; Shi, Q.; McNeish, P.Y.; Banerjee, S. Tubulin Polymerization Promoting Proteins: Functional Diversity With Implications in Neurological Disorders. J. Neurosci. Res. 2025, 103, e70044. [Google Scholar] [CrossRef]
  40. Mansour, T.A.; Rosenthal, J.J.; Brown, C.T.; Roberson, L.M. Transcriptome of the Caribbean stony coral Porites astreoides from three developmental stages. GigaScience 2016, 5, 33. [Google Scholar] [CrossRef]
  41. King, N.; Westbrook, M.J.; Young, S.L.; Kuo, A.; Abedin, M.; Chapman, J.; Fairclough, S.; Hellsten, U.; Isogai, Y.; Letunic, I. The genome of the choanoflagellate Monosiga brevicollis and the origin of metazoans. Nature 2008, 451, 783–788. [Google Scholar] [CrossRef] [PubMed]
  42. Torruella, G.; de Mendoza, A.; Grau-Bové, X.; Antó, M.; Chaplin, M.A.; del Campo, J.; Eme, L.; Pérez-Cordón, G.; Whipps, C.M.; Nichols, K.M. Phylogenomics Reveals Convergent Evolution of Lifestyles in Close Relatives of Animals and Fungi. Curr. Biol. 2015, 25, 2404–2410. [Google Scholar] [CrossRef]
  43. Adl, S.M.; Bass, D.; Lane, C.E.; Lukeš, J.; Schoch, C.L.; Smirnov, A.; Agatha, S.; Berney, C.; Brown, M.W. Revisions to the Classification, Nomenclature, and Diversity of Eukaryotes. J. Eukar. Microbiol. 2018, 66, 4–119. [Google Scholar] [CrossRef]
  44. Morrissette, N.S.; Abbaali, I.; Ramakrishnan, C.; Hehl, A.B. The tubulin superfamily in apicomplexan parasites. Microorganisms 2023, 11, 706. [Google Scholar] [CrossRef]
  45. De Leon, J.C.; Scheumann, N.; Beatty, W.; Beck, J.R.; Tran, J.Q.; Yau, C.; Bradley, P.J.; Gull, K.; Wickstead, B.; Morrissette, N.S. A SAS-6-like protein suggests that the Toxoplasma conoid complex evolved from flagellar components. Eukaryot. Cell 2013, 12, 1009–1019. [Google Scholar] [CrossRef]
  46. Portman, N.; Šlapeta, J. The flagellar contribution to the apical complex: A new tool for the eukaryotic Swiss Army knife? Trends Parasitol. 2014, 30, 58–64. [Google Scholar] [CrossRef]
  47. Lévêque, M.F.; Berry, L.; Besteiro, S. An evolutionarily conserved SSNA1/DIP13 homologue is a component of both basal and apical complexes of Toxoplasma gondii. Sci. Rep. 2016, 6, 27809. [Google Scholar] [CrossRef] [PubMed]
  48. James, T.Y.; Letcher, P.M.; Longcore, J.E.; Mozley-Standridge, S.E.; Porter, D.; Powell, M.J.; Griffith, G.W.; Vilgalys, R. A molecular phylogeny of the flagellated fungi Chytridiomycota and description of a new phylum Blastocladiomycota. Mycologia 2006, 98, 860–871. [Google Scholar] [CrossRef] [PubMed]
  49. Orosz, F. On the TPPP Protein of the Enigmatic Fungus, Olpidium—Correlation between the Incidence of p25alpha Domain and That of the Eukaryotic Flagellum. Int. J. Mol. Sci. 2022, 23, 13927. [Google Scholar] [CrossRef]
  50. Orosz, F. Tubulin Polymerization Promoting Proteins (TPPPs) of Aphelidiomycota: Correlation between the Incidence of p25alpha Domain and the Eukaryotic Flagellum. J. Fungi 2023, 9, 376. [Google Scholar] [CrossRef]
  51. Templeton, T.J. Whole-genome natural histories of apicomplexan surface proteins. Trends Parasitol. 2007, 23, 205–212. [Google Scholar] [CrossRef] [PubMed]
  52. Kishore, S.P.; Stiller, J.W.; Deitsch, K.W. Horizontal gene transfer of epigenetic machinery and evolution of parasitism in the malaria parasite Plasmodium falciparum and other apicomplexans. BMC Evol. Biol. 2013, 13, 37. [Google Scholar] [CrossRef] [PubMed]
  53. Filipović, I.; Rašić, G.; Hereward, J.; Gharuka, M.; Devine, G.J.; Furlong, M.J.; Etebari, K. A high-quality de novo genome assembly based on nanopore sequencing of a wild-caught coconut rhinoceros beetle (Oryctes rhinoceros). BMC Genom. 2022, 23, 426. [Google Scholar] [CrossRef]
  54. Iseto, T. Loxocorone, a New Genus of the Family Loxosomatidae (Entoprocta: Solitaria), with Descriptions of Two New Loxomitra (sensu stricto) and a New Loxocorone from Okinawa, the Ryukyu Archipelago, Japan. Zool. Sci. 2025, 19, 359–367. [Google Scholar] [CrossRef]
  55. Pearse, V.B.; Voigt, O. Field biology of placozoans (Trichoplax): Distribution, diversity, biotic interactions. Integr. Comp. Biol. 2007, 47, 677–692.2007. [Google Scholar] [CrossRef]
  56. Williamson, K.; Eme, L.; Baños, H.; McCarthy, C.G.; Susko, E.; Kamikawa, R.; Orr, R.J.S.; Muñoz-Gómez, S.A.; Minh, B.Q.; Simpson, A.G.B.; et al. A robustly rooted tree of eukaryotes reveals their excavate ancestry. Nature 2025, 640, 974–981. [Google Scholar] [CrossRef]
  57. Janouškovec, J.; Tikhonenkov, D.V.; Burki, F.; Howe, A.T.; Kolísko, M.; Mylnikov, A.P.; Keeling, P.J. Factors mediating plastid dependency and the origins of parasitism in apicomplexans and their close relatives. Proc. Natl. Acad. Sci. USA 2015, 112, 10200–102007. [Google Scholar] [CrossRef] [PubMed]
  58. Derelle, R.; Torruella, G.; Klimeš, V.; Brinkmann, H.; Kim, E.; Vlček, Č.; Lang, B.F.; Eliáš, M. Bacterial proteins pinpoint a single eukaryotic root. Proc. Natl. Acad. Sci. USA 2015, 112, E693–E699. [Google Scholar] [CrossRef]
  59. Tedersoo, L.; Sánchez-Ramírez, S.; Kõljalg, U.; Bahram, M.; Döring, M.; Schigel, D.; May, T.; Ryberg, M.; Abarenkov, K. High-level classification of the Fungi and a tool for evolutionary ecological analyses. Fungal Divers. 2018, 90, 135–159. [Google Scholar] [CrossRef]
  60. Lang, B.F.; O’Kelly, C.; Nerad, T.; Gray, M.W.; Burger, G. The Closest Unicellular Relatives of Animals. Curr. Biol. 2002, 12, 1773–1778. [Google Scholar] [CrossRef]
  61. Laumer, C.E.; Gruber-Vodicka, H.; Hadfield, M.G.; Pearse, V.B.; Riesgo, A.; Marioni, J.C.; Giribet, G. Support for a clade of placozoa and cnidaria in genes with minimal compositional bias. eLife 2018, 7, e36278. [Google Scholar] [CrossRef] [PubMed]
  62. Heiges, M.; Wang, H.; Robinson, E.; Aurrecoechea, C.; Gao, X.; Kaluskar, N.; Rhodes, P.; Wang, S.; He, C.Z.; Su, Y.; et al. CryptoDB: A Cryptosporidium bioinformatics resource update. Nucleic Acids Res. 2006, 34, D419–D422. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Schematic view of apicortins. The partial p25alpha and the DCX domains are indicated. The black squares label the position of the Rossmann-like motifs, and the dashed line corresponds to the disordered N-terminus of Apicomplexa-type apicortins.
Figure 1. Schematic view of apicortins. The partial p25alpha and the DCX domains are indicated. The black squares label the position of the Rossmann-like motifs, and the dashed line corresponds to the disordered N-terminus of Apicomplexa-type apicortins.
Biology 14 00620 g001
Figure 2. Phylogenetic tree of apicortin-like sequences by Bayesian analysis [17]. The root of the tree was chosen arbitrarily. Full circles at a node indicate that the branch was supported by maximal Bayesian posterior probability (BPP). All the other branches were supported by BPP, as indicated at the node. The accession numbers of proteins/TSAs are listed in Table S1. Operational taxonomic units marked with bold letters represent apicortins. (Tentative) contaminations found in Arthropoda (red), Spiralia (orange), Porifera (blue), and Amphibia (green) are labeled by colored letters.
Figure 2. Phylogenetic tree of apicortin-like sequences by Bayesian analysis [17]. The root of the tree was chosen arbitrarily. Full circles at a node indicate that the branch was supported by maximal Bayesian posterior probability (BPP). All the other branches were supported by BPP, as indicated at the node. The accession numbers of proteins/TSAs are listed in Table S1. Operational taxonomic units marked with bold letters represent apicortins. (Tentative) contaminations found in Arthropoda (red), Spiralia (orange), Porifera (blue), and Amphibia (green) are labeled by colored letters.
Biology 14 00620 g002
Figure 3. Phylogenetic tree of apicortin-like sequences by ML analysis [20]. The root of the tree was chosen arbitrarily. Numbers at a node indicate bootstrap values of 1000 replicates. The accession numbers of proteins/TSAs are listed in Table S1. Operational taxonomic units marked with uppercase letters represent apicortins. Operational taxonomic units marked with bold letters represent apicortins. (Tentative) contaminations found in Arthropoda (red), Spiralia (orange), Porifera (blue), and Amphibia (green) are labeled by colored letters.
Figure 3. Phylogenetic tree of apicortin-like sequences by ML analysis [20]. The root of the tree was chosen arbitrarily. Numbers at a node indicate bootstrap values of 1000 replicates. The accession numbers of proteins/TSAs are listed in Table S1. Operational taxonomic units marked with uppercase letters represent apicortins. Operational taxonomic units marked with bold letters represent apicortins. (Tentative) contaminations found in Arthropoda (red), Spiralia (orange), Porifera (blue), and Amphibia (green) are labeled by colored letters.
Biology 14 00620 g003
Figure 4. Multiple alignments of partial p25alpha domains of several apicortins by Clustal Omega [16]. In the upper and lower parts of the figure, there are opisthokont and myzozoan apicortins, respectively. Amino acids that are identical or biochemically similar in two-thirds of the proteins are marked with a red or blue background. A yellow background indicates that more than half of the amino acids are similar biochemically. The accession numbers of proteins/TSAs/SRAs are listed in Table S1.
Figure 4. Multiple alignments of partial p25alpha domains of several apicortins by Clustal Omega [16]. In the upper and lower parts of the figure, there are opisthokont and myzozoan apicortins, respectively. Amino acids that are identical or biochemically similar in two-thirds of the proteins are marked with a red or blue background. A yellow background indicates that more than half of the amino acids are similar biochemically. The accession numbers of proteins/TSAs/SRAs are listed in Table S1.
Biology 14 00620 g004
Figure 5. Multiple alignments of DCX domains of several apicortins by Clustal Omega [16]. In the upper and lower parts of the figure, there are opisthokont and myzozoan apicortins, respectively. Amino acids that are identical or biochemically similar in two-thirds of the proteins are marked with a red or blue background. A yellow background indicates that more than half of the amino acids are similar biochemically. The accession numbers of proteins/TSAs/SRAs are listed in Table S1.
Figure 5. Multiple alignments of DCX domains of several apicortins by Clustal Omega [16]. In the upper and lower parts of the figure, there are opisthokont and myzozoan apicortins, respectively. Amino acids that are identical or biochemically similar in two-thirds of the proteins are marked with a red or blue background. A yellow background indicates that more than half of the amino acids are similar biochemically. The accession numbers of proteins/TSAs/SRAs are listed in Table S1.
Biology 14 00620 g005
Figure 6. Occurrence of apicortin. The phylogenetic tree is based on Ref. [56]; the Myzozoan part is based on Ref. [57]. Red circles and the red color of operational taxonomic units indicate the presence of apicortin. Some lineages may not have sufficient data to definitively show absence.
Figure 6. Occurrence of apicortin. The phylogenetic tree is based on Ref. [56]; the Myzozoan part is based on Ref. [57]. Red circles and the red color of operational taxonomic units indicate the presence of apicortin. Some lineages may not have sufficient data to definitively show absence.
Biology 14 00620 g006
Table 1. List of new apicortin-like sequences.
Table 1. List of new apicortin-like sequences.
SpeciesAccession
Number 1
TaxonomyE-Value 2
T. adhaerens
E-Value 3
T. gondii
Hylaeus volcanicusXP_053992704Insecta/Endopterygota6 × 10−334 × 10−90
Schistocerca gregariaGJPN010108154Insecta/Polyneoptera4 × 10−334 × 10−77
Phlebotomus sergentiGKTC01023996Insecta/Endopterygota6 × 10−269 × 10−91
Galerucella calmariensisHAMF01019916Insecta/Endopterygota3 × 10−202 × 10−62
Galerucella pusillaHAMG01049297Insecta/Endopterygota3 × 10−202 × 10−62
Diabrotica virgiferaGHNJ01033740Insecta/Endopterygota4 × 10−161 × 10−52
Oryctes rhinocerosGHNO01082742Insecta/Endopterygota2 × 10−142 × 10−48
Ptilocerembia catherinaeGDBY01042306Insecta/Polyneoptera2 × 10−136 × 10−74
Coleotroctellus burckhardtiGJXT01027847Insecta/Paraneoptera4 × 10−136 × 10−46
Thanasimus formicariusGDPC01032790Insecta/Endopterygota8 × 10−111 × 10−42
Tigriopus californicusGHUE01002433Arthropoda/Crustacea3 × 10−72 × 10−19
Diaphanosoma celebensisGGQP01033578Arthropoda/Crustacea5 × 10−134 × 10−7
Loxomitra sp. KK-2020GIMU01103700Spiralia/Entoprocta9 × 10−1048 × 10−34
Acanthochitona fascicularisGJAX01016147Spiralia/Mollusca3 × 10−252 × 10−23
Platynereis dumeriliiHBZZ01068313Spiralia/Annelida4 × 10−248 × 10−79
Spea multiplicataGKIA01122747Chordata/Amphibia3 × 10−289 × 10−115
Trichoplax sp. H2 TR738GFSF01001671Placozoa2 × 10−1192 × 10−31
PlacozoaConsensus 4Placozoa3 × 10−1134 × 10−36
Ctenophora environmental sampleHBZJ01045600Ctenophora2 × 10−5-
Acanthoeca spectabilisGGPA01011677Choanoflagellata4 × 10−182 × 10−17
Helgoeca nanaGGOR01004317Choanoflagellata3 × 10−182 × 10−17
Savillea parvaGGOL01031575Choanoflagellata5 × 10−138 × 10−15
1 The accession numbers refer to TSAs, except XP_053992704, which is a protein. 2 E-values were obtained in searches where T. adhaerens XP_002111209 was used as the query. 3 E-values were obtained in searches where T. gondii XP_002364910 was used as the query. 4 From Ref. [21].
Table 2. Proteins that are the most similar to novel apicortin-like sequences.
Table 2. Proteins that are the most similar to novel apicortin-like sequences.
SpeciesAccession
Number 1
Most Similar ProteinE-Value 4Percent
Identity 4
Species 2Accession
Number 3
Hylaeus volcanicusXP_053992704Neospora caninumXP_0038831502 × 10−8765.57%
Schistocerca gregariaGJPN010108154Besnoitia besnoitiXP_0292174372 × 10−4750.50%
Phlebotomus sergentiGKTC01023996Neospora caninumXP_0038831503 × 10−8463.55%
Galerucella calmariensisHAMF01019916Gregarina niphandrodesXP_0111288982 × 10−7773.26%
Galerucella pusillaHAMG01049297Gregarina niphandrodesXP_0111288983 × 10−7773.26%
Diabrotica virgiferaGHNJ01033740Gregarina niphandrodesXP_0111288983 × 10−6871.15%
Oryctes rhinocerosGHNO01082742Gregarina niphandrodesXP_0111288986 × 10−6059.54%
Ptilocerembia catherinaeGDBY01042306Porospora cf. gigantea BXP_0683754681 × 10−5161.98%
Coleotroctellus burckhardtiGJXT01027847Gregarina niphandrodesXP_0111288982 × 10−6267.36%
Thanasimus formicariusGDPC01032790Gregarina niphandrodesXP_0111288986 × 10−5670.68%
Tigriopus californicusGHUE01002433Plasmodium gallinaceumXP_0285307553 × 10−1035.03%
Diaphanosoma celebensisGGQP01033578Vitrella brassicaformisCEL976991 × 10−1134.59%
Loxomitra sp.GIMU01103700Trichoplax adhaerensXP_0021112097 × 10−10284.66%
Acanthochitona fascicularisGJAX01016147Trichoplax adhaerensXP_0021112099 × 10−2438.69%
Platynereis dumeriliiHBZZ01068313Sarcocystis calchasiCAL78572176 × 10−5770.48%
Spea multiplicataGKIA01122747Neospora caninumXP_0038831502 × 10−10071.43%
Trichoplax sp. H2GFSF01001671Trichoplax adhaerensXP_0021112092 × 10−119100%
PlacozoaconsensusTrichoplax adhaerensXP_0021112093 × 10−11385.80%
CtenophoraHBZJ01045600Trichoplax adhaerensXP_0021112096 × 10−326.24%
Acanthoeca spectabilisGGPA01011677Lobulomycetales sp.KAL38970025 × 10−2443.31%
Helgoeca nanaGGOR01004317Lobulomycetales sp.KAL38970023 × 10−2642.24%
Savillea parvaGGOL01031575Blyttiomyces sp.KAJ3329948.14 × 10−2138.24%
1 The accession numbers refer to TSAs, except XP_053992704, which is a protein. 2 Color code: no color: Apicomplexa; yellow: Chrompodellids; blue: Placozoa; green: Fungi. 3 The accession numbers refer to proteins. 4 E-values and identity percentages were obtained in searches where hits listed in the second column were used as queries.
Table 3. Gregarina niphandrodes proteins corresponding to TSA sequences found in insects.
Table 3. Gregarina niphandrodes proteins corresponding to TSA sequences found in insects.
TSAGalerucella calmariensis
HAMF01019916
Galerucella pusilla
HAMG01049297
Thanasimus fornicarius
GDPC01032790
Diabrotica virgifera
GHNJ01033740
Coleotroctellus burckhardti GJXT01027847
Gregarina niphandrodes proteins and E-valuesXP_011128895
1.2 × 10−100
XP_011128895
2 × 10−100
XP_011128896
5.7 × 10−7
XP_011128896
2.8 × 10−7
XP_011128897
1.6 × 10−33
XP_011128897
5.8 × 10−33
XP_011128899
1 × 10−49
XP_011128899
1.5 × 10−49
XP_011128898
1.9 × 10−82
XP_011128898
2.7 × 10−82
XP_011128898
4.9 × 10−59
XP_011128898
3 × 10−73
XP_011128898
2 × 10−67
XP_011128900
7.5 × 10−16
XP_011128900
1.1 × 10−15
XP_011128900
3.8 × 10−17
XP_011128900
8 × 10−15
XP_011128900
5 × 10−5
XP_011128901
1.6 × 10−68
XP_011128901
2.4 × 10−68
XP_011128901
8.6 × 10−74
XP_011128901
2 × 10−72
XP_011128902
1.1 × 10−147
XP_011128902
9.2 × 10−147
XP_011128902
6.8 × 10−127
XP_011128902
4 × 10−152
XP_011128903
0
XP_011128903
0
XP_011128904
1 × 10−64
XP_011128904
2.4 × 10−137
XP_011128905
0
A yellow background indicates proteins specific to G. niphandrodes. A lilac background indicates very small (≤9 × 10−100) E-values. The accession number of an apicortin is bold.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Orosz, F. Apicortin, a Putative Apicomplexan-Specific Protein, Is Present in Deep-Branching Opisthokonts. Biology 2025, 14, 620. https://doi.org/10.3390/biology14060620

AMA Style

Orosz F. Apicortin, a Putative Apicomplexan-Specific Protein, Is Present in Deep-Branching Opisthokonts. Biology. 2025; 14(6):620. https://doi.org/10.3390/biology14060620

Chicago/Turabian Style

Orosz, Ferenc. 2025. "Apicortin, a Putative Apicomplexan-Specific Protein, Is Present in Deep-Branching Opisthokonts" Biology 14, no. 6: 620. https://doi.org/10.3390/biology14060620

APA Style

Orosz, F. (2025). Apicortin, a Putative Apicomplexan-Specific Protein, Is Present in Deep-Branching Opisthokonts. Biology, 14(6), 620. https://doi.org/10.3390/biology14060620

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop