Next Article in Journal
Hypoxia in the Blue Mussel Mytilus chilensis Induces a Transcriptome Shift Associated with Endoplasmic Reticulum Stress, Metabolism, and Immune Response
Previous Article in Journal
Identification of FASN Gene Polymorphisms, Expression and Their Relationship with Body Size Traits in Guizhou White Goat (Capra hircus) with Different Genders
Previous Article in Special Issue
Human-Induced Range Expansions Result in a Recent Hybrid Zone between Sister Species of Ducks
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Orthologs at the Base of the Olfactores Clade

Silberman Institute of Life Sciences, Hebrew University, Jerusalem 91904, Israel
Genes 2024, 15(6), 657; https://doi.org/10.3390/genes15060657
Submission received: 12 April 2024 / Revised: 16 May 2024 / Accepted: 17 May 2024 / Published: 22 May 2024
(This article belongs to the Special Issue Genomics of Evolution and Adaptation in Animals)

Abstract

:
Tunicate orthologs in the human genome comprise just 84 genes of the 19,872 protein-coding genes and 23 of the 16,528 non-coding genes, yet they stand at the base of the Olfactores clade, which radiated to generate thousands of tunicate and vertebrate species. What were the powerful drivers among these genes that enabled this process? Many of these orthologs are present in gene families. We discuss the biological role of each family and the orthologs’ quantitative contribution to the family. Most important was the evolution of a second type of cadherin. This, a Type II cadherin, had the property of detaching the cell containing that cadherin from cells that expressed the Type I class. The set of such Type II cadherins could now detach and move away from their Type I neighbours, a process which would eventually evolve into the formation of the neural crest, “the fourth germ layer”, providing a wide range of possibilities for further evolutionary invention. A second important contribution were key additions to the broad development of the muscle and nerve protein and visual perception toolkits. These developments in mobility and vision provided the basis for the development of the efficient predatory capabilities of the Vertebrata.

Graphical Abstract

1. Introduction

In a 2010 paper, Domazet-Loso and Tautz [1] proposed a classification of the animal kingdom into 19 phylostrata, ranging from phylostratum 1, the first unicellular organisms, to phylostratum 19, the primates. In an alternative cladistic scheme—depicted in Figure 1–the phylostratum number is shown in parentheses after the name of the appropriate grouping:
The vertebrates and the tunicates share many of the genes that appeared over evolutionary time up to and including the emergence of the tunicates. Of these genes, those that were not present in earlier-appearing animals are the tunicate orthologs, the tunicate contributions to the vertebrate genome. By searching for orthologs, Domazet-Loso and Tautz [1] assigned many of the genes of the human genome to one or other of the 19 phylostratum levels. But finding orthologs is not an easy matter. Indeed, the Search for Orthologs Consortium comprises some dozen research groups that use various algorithms in their searches to handle the difficult problem of comparing tens of thousands of genes among thousands of living species. The results reported by these different research groups are by no means consistent. Liebeskind et al. [3] adopted a consensus approach where, for each gene, the modal value of the reports of the different research groups was chosen to represent that gene’s phylostratum level. Also reported was the step value, which is the median of the difference, for each gene, between the modal value and that reported by each ortholog research group. Litman and Stein [4] extended this work and published a list of all the protein-coding genes and many of the non-coding genes of the human genome, with each gene recorded with its phylostratum number.
The tunicates, with which this paper is concerned, fall in phylostratum 10, and the orthologs that they share with humans lie at the base of the Olfactores clade that consists of the tunicates and the vertebrates [2]. Phylostratum 10 in the Litman and Stein study [4] contained 84 protein-coding and 23 non-coding genes. These numbers are almost an order of magnitude smaller than the number of such genes found for any of the other phylostrata. We were intrigued by the fact that these small number of genes would lie at the base of the Olfactores clade, which branched out to the thousands of tunicate and vertebrate species. In what way did such a small number of genes begin that vast onward evolutionary process? But first we had to be sure that we had a valid set of tunicate orthologs and we proceeded to build a hand-curated set. Many of these orthologs are found to be present in gene families. In what follows, we will discuss these families in turn. The biological role of each family is described and the extent to which the tunicate orthologs contribute quantitatively to the family.

2. Methods

To prepare our list of tunicate-contributed orthologs, we began with the 84 protein-coding genes that Litman and Stein [4] had assigned to the tunicates, being phylostratum 10. Knowing that such an assignment was not absolute but only a consensus estimate, in that the various ortholog search groups did not agree with one another, we decided to hand-curate the list. Having only 84 genes to work with made the task of hand-curation feasible. Our searches for orthologs were performed using the protein BLAST (Basic Local Alignment Search Tool) program of the NCBI (National Center for Biotechnology Information of the National Library of Medicine): https://blast.ncbi.nlm.nih.gov/Blast.cgi (accessed on 6 September 2023) with the following search parameters: Max target sequences 1000, Expect threshold 200,000, Word size 2, Max matches in a query range 0, Matrix BLOSUM62, Gap Costs Existence: 11 Extension: 1, No compositional adjustments, No Low complexity regions filter.
To be recognised by us as an ortholog, a sequence found when searching the tunicates database had to be annotated with the same name as the probe sequence from the human genome. A sequence annotated as “-like” was rejected. In addition, the probe sequence had to be absent when the Branchiostomatidae database, or that of the Echinodermata or earlier-appearing clades were searched. In a few cases, noted explicitly in Table S1, the tunicate sequence had been annotated as “name-a” instead of “name-1”, and similarly for “b”, “c”, and ”d”. In other cases, again explicitly listed in Table S1, the tunicate sequence’s name was a more generic synonym but presented with a convincing Expect Value.
Sequence comparisons between two proteins and dot plots of the comparisons were made using the BLAST 2-sequences tool of the BLAST program, using the same search parameters as above. All one-on-one Expect values recorded in this paper were the results of a BLAST 2-sequences comparison between the two proteins.
Alignments between protein sequences were established and dot plots generated using the COBALT (Constraint-Based Alignment Tool) aligner at the NCBI https://www.ncbi.nlm.nih.gov/tools/cobalt/cobalt.cgi?CMD=Web (accessed on 6 September 2023). Phylograms were prepared using the phylogram tool of COBALT. Phylograms were rooted by explicitly determining this in the COBALT algorithm, at the gene with the lowest phylostratum number if there was such a gene, or left unrooted if there was no such case.
Properties of the orthologs listed in Table S1 were taken from the listings in GeneCards https://genealacart.genecards.org/Result (accessed on 7 August 2023). GeneCards also proved useful when the aliases of genes discussed in the literature had to be interpreted to provide HGNC symbols.
For finding the Unique Gene IDs of the tunicate-shared orthologs for Table S1, we searched the ANISEED database at:
Gene Ontology data for the genes not found to be present in gene families was extracted from The Database for Annotation, Visualization and Integrated Discovery (DAVID) at https://david.ncifcrf.gov/summary.jsp (accessed on 31 August 2023).

3. Results and Discussion

Table 1 lists the 84 genes reported in [4] as being in phylostratum 10, each recorded with its HGNC symbol together with its step value, and the number of research groups (of the 13 considered) whose reported phylostratum number agreed with the modal value (here 10):
The mean for the step value of these phylostratum 10 genes is 2.743 and its standard distribution 1.240. It would not be surprising, therefore, to find that many of those in the list of 84 were incorrectly assigned to phylostratum 10, being, in reality, one or two levels higher or lower. It was thought necessary, therefore, to curate the list by conducting a BLAST search for each gene on the list to check that it did appear annotated as such in the tunicates and did not appear as such in the branchiostomatidae nor in the earlier-appearing phylostrata. Only 25 genes of the 84 survived this curation.
Inasmuch as genes with phylostratum numbers reported as being above 10 might be incorrectly listed despite actually being in phylostratum 10, we searched the literature for genes reported as being present in tunicates. If they were not already in the list of 25 curated 10 s, they were subjected to a BLAST search as just described for their presence in tunicates and not in earlier animals. In total, 59 such genes were identified and are listed together with the original 25 in Supplementary Table S1, the total number of curated tunicate-vertebrate; the number of shared orthologs is thus 84 (coincidentally the same number as in the original list). For each gene in the list, the Expect value is listed for the 2-sequence comparison between that tunicate sequence and the corresponding human gene, as well as additional information concerning that gene.
The list of 84 orthologs in Table S1 is almost certainly not a complete accounting. Many genes that the previous estimates have identified as coming from the large contribution from the fish might be revealed to be tunicate-derived orthologs if systematically studied by our hand-curation methodology.

3.1. The Muscle Protein Genes

The Proteins of Muscle

Muscular tissue can be found as early as the Porifera, where the sponges have the capacity for coordinated whole-body contractions that enable them to expel sediments [5]. Their myocytes contain a myosin homologous to the myosins of higher organisms but the myosin of the myocytes contracts and expands in response to changes in the concentration of calcium and is not ATP-dependent, nor is it activated by any neural connection [6]. The muscular system is already well-developed in the sea anemone, where muscles can be found in the tentacles as well as in the body column [7]. The sea anemone being diploblastic, its muscles are, of course, not of mesodermal origin. In the triploblastic sea urchin, a complex muscular jaw apparatus is present that can scrape and tear the animal’s food items [8]. With the emergence of the cephalochordates such as the lancelet (amphioxus sp. of the branchiostomes), muscles line the notochord to allow the active swimming behaviour of the animal. In the tunicates, swimming is further developed, and in the Appendicularians, which retain the larval form for their entire lifespan, the tail is used to sweep food particles into the oral apparatus [9].
At each stage of evolutionary development, the range of molecules involved in muscle formation is, of course, increased. These molecules include actins, the light chain and heavy chain myosins, titin, meromyosins, myosin binding proteins, troponins, and tropomyosin. In most of these classes, tunicate orthologs contribute, in some cases, decisively, as we will now illustrate.
The muscle protein orthologs are specifically denoted as such in Table S1.
The light chain myosins have 12 representatives in the human genome with HGNC symbols beginning with MYL. Figure 2 depicts a phylogram of these proteins, rooted at the annotation of the coral ortholog of MYL6.
In addition to the coral’s MYL6, the proteins MYL2 and MYL6B also have orthologs in the corals. MYL4, MYL5, and MYL7 (in bold in the figure) have orthologs in the tunicates, while all the other proteins have orthologs either in the Echinodermata or in the Cephalochordata, leaving it to those four tunicate proteins to complete the family.
In a comprehensive analysis of the proteins in skeletal and cardiac muscles, Lindskog et al. (2015) found that MYL2 and MYL3 were present in both tissues, MYL1 only in skeletal muscle, while MYL4 and MYL7 were present only in cardiac muscle [10].
In the fourteen-membered heavy chain myosin class (having HGNC symbols beginning with MYH), the tunicate orthologs contribute five examples (Figure 3 below).
MYH14 and MYH7B (underlined) appeared only with the fish but the tunicates had contributed a substantial portion of the earlier orthologs.
Lindskog et al. (2015) found that MYH7 and MYK7b were present in both skeletal and cardiac tissues, MYH1, MHY2, MYH4, and MYH8 only in skeletal muscle, while MYH6 was only in cardiac muscle [10].
The tunicates contributed all three of the myosin binding protein C family (the MYBPCs). MYBPC1 is present in slow skeletal muscle, MYBPC2 in fast skeletal muscle, and MYBPC3 in cardiac muscle (Figure 4).
The tunicates contributed one of the three myomesin proteins (the MYOMs). The myomesins are present in the M band of striated muscles and act by stabilising the sarcomere during contraction of the muscle, acting as a molecular spring [11].
Two other proteins in the list presented by Lindskog et al. [10] are tunicate orthologs and thus appear, annotated, in Table S1. One is synaptopodin 2 (SYNPO2), the first to appear in evolution of the two synaptopodins, which are molecules that regulate actin fibres. The second is troponin T type 1 (TNTT1), the last to appear of the three TNTTs that regulate muscle contraction in response to alterations in intracellular calcium ion concentration. The tunicate orthologs of the genes involved in muscular function form an important contribution, extending the complement of muscle genes already present in earlier animals, thus providing a basis for the radiation of such genes in the evolution of the vertebrates.

3.2. The Gap Junction Proteins

The human genome contains 21 Gap Junction proteins (also known as connexins). They are divided by sequence similarity into five subgroups labelled as GJAP’s through GJEP’s or with the corresponding Greek letters, α through ε. Figure 5 shows a phylogram of these proteins:
(See also [12,13] for fuller discussions of the evolution and biology of the Gap Junction proteins). Seven of the gap junction proteins are found in Table S1 of the Orthologs at the base of the Olfactores, and these are labelled accordingly in the figure. Results of BLAST searches against the tunicate genomes are listed in Supplementary Table S1.
As can be seen in Figure 5, the A, B, C, and D subgroups all have representatives from the tunicates that, in each case, may be suggested to be the progenitors of that subgroup.
The Branchiostomatidae genomes do not contain gap Junction proteins so the gap junction proteins may be considered as momentous inventions of the founders of the Olfactores clade. Gap junction proteins are transmembrane proteins that connect one cell with its neighbour. Among many other roles in the body, gap junction proteins are found in the Schwann cells, whose development in evolution required, as we have seen, the tunicates’ invention of the first gap junction proteins. These Schwann cells form a sheath that wraps around a neuron and provides the insulating myelin coating to the nerves of the bony fishes and higher organisms. Gap junction proteins connect each Schwann cell with its neighbour in the sheath and thus provide a route that enables rapid cell to cell movement of solutes from the nerve cell interior to the extracellular surface. A build-up of potassium within the nerve cell is thus mitigated and nutrient flow from the extracellular medium to the interior of the nerve is facilitated. This transport has been estimated to be a million times faster than such movement would otherwise be in the absence of the gap junction proteins. The myelin coating that the Schwann cells provide insulates the nerve and allows electrical signals to pass from the interior to the exterior only where the Schwann cell coating is absent at the internodes. The electrical signal thus jumps (“saltatory movement”) from internode to internode, speeding up manyfold the rate of signal transmission. For example, whereas unmyelinated axon conduction velocities range from about 0.5 to 10 m/s, myelinated axons can conduct at velocities up to 150 m/s. [14]. In a giraffe, a signal that takes some 60 milliseconds to travel from the foot to the brain [15] will instead take some 6 s. It is to be noted that the cyclostomes such as the lamprey do not produce myelin, although they do possess Schwann cells [16]. Indeed, BLAST searches of the lamprey genome for some of the genes that a GeneCards search suggested as being involved in myelin production (MPZ, MBP, PMP2, and MAG) proved fruitless. Thus, myelin is an invention of the jawed fish.
Some myelin-associated genes (MYRF, PLP1, and PMP2) are present in the Tunicate genome (see also [17]) and, as we have seen, seven Gap Junction genes are present in the tunicates. Their function in the tunicates does not seem to be known, but a search of ANISEED, the ascidian database (see Methods), for the Gap Junction gene GJC1 returned:
REG00001500 (Cirobu.REG.KhC2.5771334-5773564|Msx)
In which, for this gene, a “CRM (cis-regulatory module) is active in the region of the primordial pharynx, in the sensory vesicle, and in the neck connecting the sensory vesicle, in the visceral ganglion, and in the caudal muscles. At earlier stages, it is active along the neural tube in the tailbud, along the neural folds in the neurula, in the precursors of sensory organs, in the ventral epidermis in both stages, and also in the mesoderm that will give rise to the muscles”. Presumably this gene and its congeners were co-opted by the ancestor of the Schwann cell to enable the formation of the myelinated nerves of the jawed fish and their descendants.
We searched the genome of the Branchiostomatidae for a gene that could be the ancestor of the gap junction genes of the tunicates. We found that GJAP8 had, as top hit with the Branchiostomatidae, a gene denoted as the “predicted Titin homologue” annotated as XP_019638343.1. This had an Expect Value against GJAP8 of 9 × 10−11. The protein had a length of 1894 amino acids. Figure 6 depicts the alignment of this protein against the seven tunicate Gap Junction genes; Figure 6A shows the alignment against the whole Titin sequence while Figure 6B shows the alignment against the 891 portion of the sequence that spans those of the shorter GAP Junction sequences.
Figure 7 shows a phylogram with the 891-long sequence of the Titin homologue and these seven tunicate Gap Junction proteins. The data suggest that a portion of a member of the Branchiostomatidae titins might have evolved to produce the Gap Junction proteins of the Olfactores.

3.3. The Cadherins

The human genome contains 23 cadherin (CDH) genes, with HGNC symbols CDH1 through CDH26 (three are absent). Of these twenty-three, seven (CDH1, CDH2, CDH7, CDH8, CDH11, CDH16, and CDH18) are found as orthologs in the Tunicates while another (CDH23) is an ortholog in the Branchiostomatidae. The twenty-three CDH genes are divided into two types, I and II. These CDH genes code for correspondingly-named membrane-bound CDH proteins, each containing multiple cadherin repeats of some 110 amino-acids. The Type II cadherins are missing a long sequence of amino acids in the N-terminal region. Refer to Supplementary Figures S1 and S2.
The upper two rows in Figures S2 and S3 depict cadherins of type I, while the lower rows depict those of type II. In each case, an ortholog found in the tunicates is marked with an asterisk. Figure S2 depicts the full sequences while Figure S3 shows the N-terminal sequences expanded. A cadherin from Homo sapiens and its tunicate ortholog show very close alignment. The green-shaded region of the sequences in Figure S2 marks the membrane-embedded portion of the proteins.
A phylogram of all the cadherins from H. sapiens is presented in Figure 8.
As can be seen, each of the two cadherin types contains proteins (labelled “tunicate”) that have orthologs in the tunicates and might perhaps be considered as founder members of that type. The single cadherin ortholog that derives from the Branchiostomatidae (designated by “lancelet”) lies separate from the cadherins in the two types (I and II).
The cadherins are well-named, being proteins that cause cells to adhere to one another under the influence of calcium (Ca). An x-ray crystallographic study of cadherins by Patel et al. (2005) showed that it was largely the N-terminal sequences of the molecules that determined the binding between two cadherins [18]. The N-terminal sequence of one molecule formed an anchor that fitted into a pocket in the second molecule, whose N-terminal sequence in turn inserted into a pocket on the first molecule. The process is called “swapping” of the two N-terminal sequences, although of course no chemical bonds are broken in this swapping. The different structures of the anchor (see the N-terminal sequences of the two types of cadherins in Figure 6 and pocket between type I and type II cadherins render the two types incompatible with each other. This incompatibility is of profound significance in embryological development. Cells from an epithelial sheet of cells held together by cadherins of one type (say Type I) will detach from the sheet if they are temporally programmed to express cadherins of Type II. This process, accompanied by the programmed lowering of expression of Type I cells, is of crucial significance in the formation and detachment of the neural crest [19].
Table 2 lists some frequently studied cadherins with the commonly used names that they are referred to in the literature, the tissue in the body where each protein is usually found, and the HGNC symbol for each protein.
Cells bearing cadherins of one name will readily form homotopic interactions, but less frequently heterotopic combinations, with other cadherins, and this leads to sorting-out of different cell types in embryological development.
In the animal body, the cadherins exert their adhesion function in many epithelial tissues but also, importantly, in neural tissue and the formation and detachment of the neural crest. Cells from the neural crest migrate throughout the embryo, giving rise to an array of cell types that characterize the vertebrate clade, including the peripheral sensory nervous system and most of the craniofacial skeleton.
In the cephalochordate Amphioxus, two cadherin paralogs are found [20]; one is expressed in the mesoderm and the other in the ectoderm, but both are found expressed in neural tissue, at different developmental periods. These two paralogs are, of course, of the same type and can only demonstrate adhesion and not repulsion. In Amphioxus, there are no cells that have been identified as homologous to neural crest [21]. The tunicates possess cadherins of both types so that both cohesive and repulsive interactions are now possible. In the tunicates [22], a CDH5-like protein is expressed at the tailbud stage in the nerve cord and in the peripheral neurons of the tail, while the CDH7-like protein is widely expressed in the epidermis and also in the nerve cord, the sensory vesicle, and the visceral ganglion. Neural crest-like cells are indeed found in the tunicates as migratory cells that produce pigment (as do vertebrate cells of neural crest origin) and, in addition, express many of the genes that regulate the development of vertebrate neural crest [23].
The role of the cadherins expanded greatly as the number of cadherins themselves expanded and diversified during the evolution of the vertebrates, with multiple roles in embryological morphogenesis [24].
An additional set of proteins involved in cell adhesion and, in particular, in cortical development and synapse formation [25] are the FLRTs (Fibronectin Leucine Rich Transmembrane proteins), of which FLRT2 and FLRT3 were tunicate innovations and which, in their developmental role, interact with latrophilin (HGNC symbol ADGRL3), also first found in the tunicates. Latrophilin and the FLRTs also interact with teneurin proteins (the TNFMs) to form trimeric complexes, important during synapse formation and in guiding the migration of young neurons [26].

3.4. The Claudins

The human genome contains 24 claudin genes symbolised by CLDN, enumerated up to CLDN34 with some absent members. These genes, of course, code for the correspondingly named proteins. The origin of the world claudin [27] is from the Latin claudere, meaning “to close”, very appropriate since their function is to form the Tight Junctions between adjacent cells in an epithelium or endothelium [28]. These Tight Junctions ensure that passage through them, from one side of an epithelium to the other, is tightly controlled in a selective permeability, with the different claudins having different selectivities [28]. Orthologs of the claudins are found in the tunicates (three of them) while another is found in the earlier-appearing Branchiostomatidae, but not in the even earlier Echinodermta. Figure 9 depicts a phylogram of the human claudin proteins with the earliest-appearing genes designated.
The figure suggests the close similarity of CLDN7, the first claudin to appear (in the Bran-chiostomatidae) to two of the next-appearing (in the tunicates): CLDN1 and CLDN19; the third tunicate claudin, CLDN18, has close similarity to the lamprey claudin CLDN10 and the two shark claudins CLDN11 and CLDN15.
Figure 10 shows a comparison of the sequence of the CLDN18 orthologs from the tunicate Ciona intestinalis and the lamprey Petromyzin marinus, that of the lamprey being shown as the lower sequence of the two in each row. The two sequences, analysed in a BLAST search against one comparison, returned an Expect Value of 2 × 10−9.
Defects in claudin genes are associated with congenital deafness in children. Studying the expression of claudin genes in the Zebrafish, Kollmar et al. (2001) found claudins in the otic and lateral-line placodes of the fish [29]. They suggested, on the basis of their findings, that claudins could have an additional role in vertebrate morphogenesis. Perhaps the role of the early claudins in tunicates might relate to a role in tunicate morphogenesis in addition to their function as aiding to form barriers between the external and internal environments, as indicated by electron microscopy studies.

3.5. The Ephrons and Ephrins

The human genome contains 14 ephron genes, symbolised by EPHA1 to EPHA10 (EPHA9 is absent) and EPHB to EPHB4 [30]. These code for corresponding membrane-bound proteins. Table 3 includes these genes together with the animal species where their latest shared orthologs appear.
Figure 11A displays a phylogram of the relation between the proteins of the A series, the phylogram being rooted at EPHA5, while Figure 11B displays the corresponding phylogram for the proteins of the B series, the phylogram being rooted at EPHB2. The ephron proteins are membrane-bound receptors for the eight ligand Ephrin proteins, symbolized by EFNA1 through to EFNA5 and EFNB1 through EFNB3, coded for by the corresponding EFN genes [30].
Table 3 also includes these genes together with the animal species where their latest shared orthologs appear.
Figure 12 displays a phylogram of the relation between these proteinss, the A and B series being clearly separated into their two subgroups.
Since both the ephrons and the ephrins are membrane-bound, their interactions are only cell to cell. The proteins of the A series of the receptor ephrons bind to the proteins of the A series of the ligand ephrins and repel the proteins of the B series of the ephrins. B series ephrons bind B series ephrins and repel those of the A series. These attractive and repulsive interactions between cells determine many morphogenetic processes in animal development [31]. Repulsive effects determine the boundaries between different cell types. These repulsive responses are likely to involve depolymerization of the actin cytoskeleton, leading to the collapse of the filopedia and retraction of the cells involved. Actin-cytoskeletal interactions are the probable base of the attractive responses. A combination of repulsive and attractive effects guides the movement of nerve cells during the embryonic formation of the brain and ensures appropriate movement of cells in the intestinal crypt, retaining the Paneth at the base of the crypt while guiding the movement of cells of the crypt towards the lumen of the intestine.
The ancestor of the Olfactores contributed none of the 14 Ephron genes but as many as 4 of the 8 Ephrin genes, a substantial contribution. Two Ephron genes (EPHA4 and EPHA2) were already present in the Cnidarians as well as one ephrin gene (EFNB1). In the tunicates themselves, these genes are involved in (among other processes) neural induction [32], endoderm invagination [33], asymmetric cell divisions in the vegetal hemisphere of the developing brain [34], delimiting the number of pigment cells in the central nervous system [35], and neural tube patterning [36].

3.6. The MAGE Genes

There are some 40 MAGE genes (the name derives from Melanoma Antigen Gene) in the human genome, divided into 2 subfamilies, Type I and Type II, according to their sequence and chromosome location [37]. The Type II family can again be subdivided according to their ancestry: those originating from an ortholog in the tunicates, descended from the tunicate gene Cirobu.g00007177 (also known as “melanoma-associated antigen D2 isoform X2 [C. intestinalis]”) and being named in the form MAGEDx, and those originating from an ortholog in the branchiostomata, most being named MAGEEx. The phylogram in Figure 13 shows how the Type II MAGE proteins divide into these two subfamilies, together with a separate protein NDN, necdin, also a member of the MAGE family.
The biological roles of the tunicate-contributed MAGE proteins are tabulated in Supplementary Table S1, and include a role in cell adhesion, in the regulation of ubiquitin-protein ligases, and response to stress.

3.7. The Crystallins

The Gamma Crystallins

The human genome contains 18 crystallin proteins. They are divided by sequence similarity into three subgroups, the α, β, and γ crystallins and, interacting together, they form the transparent lens of the eye. The six gamma crystallins are descended from a single ortholog in the Tunicates, namely the gene designated in the ANISEED database (see Methods) with the unique ID of Cirobu.g00014781. This gene can appear in BLAST searches of the Tunicates as one of three annotated sequences, two from C. intestinalis (XP_002126888.1 (named as gamma-crystallin S) and 2BV2_A (Named as Chain A, C. betagamma-crystallin)) and one from Styela clava (XP_039266482.1 (named as gamma-crystallin N-A-like)). In what follows, the sequence 2BV2_A has been used. The HGNC symbols for the six gamma crystallins of H. sapiens are CRYGA, CRYGB, CRYGC, CRYGD, CRYGN, and CRYGS. The three crystallin genes CRYBG1, CRYBG2, and CRYBG3, coding for proteins that also present in the vertebrate lens, but some ten-fold longer than the gamma crystallins, will not be discussed here. Supplementary Figure S3, part A depicts a COBALT-based alignment of these proteins compared with the six gamma crystallins and the single tunicate crystallin, while part B is a phylogram from these data. The gamma crystallins and the CRYBG proteins clearly separate into two families. The CRYBG genes are not found in the genome of the tunicates, although CRYBG2 is present in the Branchiostomatidae. These genes have curiously named synonyms of the form AIMx, where AIM stands for Absent In Melanoma [38].
Figure 14 depicts a COBALT-derived comparison of the six gamma crystallins together with the C. crystallin.
Figure 15 depicts the Phylogram that COBALT produced from the data, here rooted at the C. crystallin.
Note in Figure 14 that the tunicate crystallin is almost exactly half the length of the crystallins of H. sapiens, suggesting that the tunicate gene was doubled as it evolved into its vertebrate descendants. This suggestion is confirmed if the sequences of the tunicate crystallin and two vertebrate crystallins are compared, as in the dotplots of Figure 16:
The 83 residue sequence of the tunicate protein is repeated in the second half of both vertebrate proteins with a minor modification: a small break in the second half of the sequence of CRYGN (part A of Figure 16) and in the first half of the sequence for CRYGS (part B of Figure 16). This difference between the two plots suggests that the two vertebrate proteins arose by two independent gene duplications, compatible with the positions of these two vertebrate proteins in the phylogram of Figure 15. It would appear, from the phylogram, that the further speciation of the crystallins originated with CRYGS. These two crystallin proteins, CRYGN and CRYGS, are both found in our table of the orthologs at the base of the Olfactores since they appear to have evolved independently.
It was mentioned at the beginning of this section that, in vertebrates, the crystallins are found in the lens of the eye. In a tunicate, however, the crystallin protein is not found where one might expect it to be: in the ocellus or eyespot of the organism. Rather the crystallin is found in the palp, an organ in the snout of the larva of the sessile tunicates whose function is to provide, together with lectins and other components, a mucilage that fixes the tunicate to the seabed [44]. That the tunicate crystallin is not expressed in the organism’s eyespot suggested to Horie et al. (2008) “that the lens of the C. pigmented ocellus is not homologous with that of the vertebrate eye” [45].
The tunicate crystallin can bind calcium [46], thereby stabilising the protein against temperature denaturation. Vertebrate crystallins have lost these calcium-binding sites. Figure 17 shows a comparison of part of the amino-acid sequence of the lamprey’s CRYGN (upper row) and the tunicate crystallin (lower row). The wide arrows point to the sites at which calcium binds in the tunicate protein [46].
These mutations and the duplication of the tunicate’s sequence led to the dramatic evolution of the protein from a mucilage in the tunicates into the vertebrate proteins that confer upon the lens its transparency and hence the ability of the vertebrates to use their visual system in efficient predation.

3.8. The Distal-Less Genes

The human genome contains six members of the distal-less family with HGNC symbols DLX1 through DLX6. Of these, DLX1, DLX4, and DLX6 first appear in the Cnidaria, with DLX 1 and 4 present in corals and DLX6 in a sea anemone. DLX5 first appears in the Echinodermata, in a sea urchin, while DLX2 and DLX3 were the contribution of the tunicates (Figure 18)
A multiple alignment of these genes is shown as Figure 19:
A member of the gene family was first noticed in the fruit fly, D. melanogaster, as the gene Dll when a mutant lacking the distal ends of the limbs was found (reviewed in [47]), and the gene was postulated to control pattern formation along the proximal-distal axis of the limbs. A similar role in controlling pattern formation along the limb proximal-distal axis is found in the Crustacea [48]. The phenomenon of an absence of distal regions of the limbs is also found in mice, where targeted inactivation of the genes DLX5 and DLX6 led to severe malformation of the distal portion of the limbs. The human disease Split-hand/split-foot malformation (SHFM) is a human limb defect characterized by missing digits and fusion of remaining digits. Ullah et al. (2016) showed that a mutation in DLX6 is associated with the type I form of SHFM [49].
In the mice, disruption of DLX genes led to gross malformation of the craniofacial skeleton. These craniofacial defects, associated with distal-less gene mutations in the mouse, suggest that distal-less controls more than simply proximal-distal pattern information. Indeed, Dll, in its first appearance in Hydra (where, of course, no proximal-distal axis is present), is expressed in the head, battery, and the stem-like peduncle [50]. In Hydra, potent stinging cells are grouped with other neurons in what are called “battery complexes” on the hydra’s tentacles. Indeed, the mutation in the Drosophila Dll gene is also associated with defects in the antenna of the fly, a sensory organ. In the sea urchin (which too has no proximal-distal axis), ectodermal expression of distal-less is evident at the tips of spines but is not associated with other skeletal elements [51]. In the myriapod Glomeris marginata, the distal-less gene is expressed in the limb appendages but also, and to a large extent, in the mouth parts of the organism [52]. In the tunicate C. intestinalis, DLX3 is expressed in the atrial siphon and in addition in the adhesive organ, which is located in the head, at the anterior end of the body, and is used by the larvae at the beginning of metamorphosis to attach to a solid substrate [53]. Finally, in Amphioxus (the lancelet Branchiostoma floridae), a context where the proximal-distal axis is again not relevant, Holland et al. (1996) [54] showed that the animal’s single Dll gene is expressed in the anterior three fourths of the cerebral vesicle and is implicated in the “establishment of the dorsoventral axis, specification of migratory epidermal cells early in neurulation and the specification of forebrain”. They add the intriguing suggestion that “Such a multiplicity of Distal-less functions probably represents an ancestral chordate condition and, during craniate evolution, when this gene diversified into a family of six members, the original functions evidently tended to be parcelled out among the descendant genes”. Interestingly, in the parallel bilaterian evolutionary path to the insects, as we saw in the case of the Myriapod, the ancestral DLX gene again took on the function of regulating pattern formation in the proximal-distal axis and as well as specifying head development and aspects of sensory perception. Thus, the DLX genes have evolved to be interpreters that express their function by being directed to various locations in the body. In accordance with signalling systems present at this location, the DLX genes drive and direct the expression of genes that act together to form the particular organ or organs there found, noteworthy in the development of the limbs and the craniofacial skeleton.

3.9. DAVID Analysis of Individual Orthologs Not Discussed in Previous Subsections

In the list of the 84 tunicate orthologs assembled in Table S1, 36 have not yet been covered in the text so far. To tease out what might be their role in the tunicate lifestyle and perhaps in that of the vertebrates as well, we subjected the 36 to an analysis using the DAVID program (see Methods). As it describes itself, DAVID is “The Database for Annotation, Visualization and Integrated Discovery (DAVID) [that] provides a comprehensive set of functional annotation tools for investigators to understand the biological meaning behind large lists of genes”. We applied the functional annotation clustering tool of DAVID (which attempts to find common themes in groups of genes selected from a submitted list). Table 4 is extracted from the full output of the DAVID clustering.
Of these listed clusters, cluster 2 and cluster 4 continue the emphasis on adhesion between cells that was discussed previously in the sections on the cadherins and the ephrons and ephrins, while cluster 6 extends the list of genes concerned with muscular contraction. Cluster 7 contains genes concerned with embryological development while cluster 8 concerns the important zinc-finger genes with their widespread roles in gene regulation.
Information abstracted from the summaries in the GeneCards database (see Methods) is listed for each of these 36 genes in Table S1, together with all the orthologs that are assembled in that table.

3.10. Tunicate Orthologs That Are Non-Coding Genes

In their 2019 paper, Litman and Stein [4] estimated the ages of every one of the 19,651 protein-coding genes but succeeded in this endeavor for only 5981 of the 16,528 non-protein-coding genes of the human genome. In total, 23 of these 59,281 were contributed by the tunicate clade. Analysing these using the orthologs platform of GeneCards (see Methods) showed all 23 to be Ascidian orthologs to many vertebrate genomes, none of them having a representative in any animal lower than the Ascidians. The 23 are listed in Table S2, together with excerpts of the annotations found using DAVID’s Functional Analysis program (see Methods). Many of the microRNAs are listed as being implicated in the development of the muscular, neuronal, and vascular tissues, so important in the predatory behavior of vertebrates.

4. Conclusions

In the list that partitioned the 19,653 protein-coding genes of the human gene into their appropriate phylostratum level, Litman and Stein (2018) found, as stated earlier, that 84 of these genes fell into phylostratum 10, associated with the tunicates [4]. Thus, these tunicate-shared genes, which stand at the base of the Olfactores clade, are a very minor fraction of the genes of the human genome. This tunicate contribution was preceded by a large contribution of genes from the Branchiostomatidae (406) and was followed by an even larger number from the fishes (3650). The hand-curated list of phylostratum 10 genes that are listed in Table S1 of this paper again contains only 84 genes. That there are orthologs that are found shared with the vertebrates in the tunicates, and yet are not present in the Branchiostomatidae, accords with the suggestion of [2] Delsuc 2008 that the tunicates and vertebrates are sister clades of the Olfactores and succeeded the Branchiostomatidae in evolutionary time.
We were intrigued that this small number of 84 genes at the base of the Olfactores could open the vast evolutionary changes that led, on the one hand, to the tunicates themselves and, on the other, to the vertebrates. One must add in here the 23 non-coding ortholog genes that first appeared with the tunicates (Table S2) and are heavily implicated in the evolutionary development of vertebrate muscle, nerve, and vascular system, with MIR133B, in particular, being closely involved in all three of these systems.
Perhaps the most important contribution of the tunicates was the evolution of a second type of cadherin. This, a Type II cadherin, had the property of detaching the cell containing that cadherin from cells that expressed the Type I class. The set of such Type II cadherins could now detach and move away from their Type I neighbours, a process that would eventually evolve into the formation of the neural crest. Cells from the neural crest migrate throughout the embryo giving rise to an array of cell types that characterize the vertebrate clade, including the peripheral sensory nervous system and most of the craniofacial skeleton. Indeed, the neural crest can be considered as the fourth germ layer, providing a wide range of possibilities for further evolutionary invention [55]. Controlled interactions between cadherins are of prime importance in the formation of the skin appendage placodes, the cell thickenings, and budding from which arise the teeth, scales, hair, feathers, and mammary glands of the vertebrates.
A second important contribution was the broad development of the muscle and nerve protein toolkits. The tunicates contributed all three of the MYBPC genes that populate the M-band of the sarcomere, leading to a great improvement in the functioning of muscle and the tunicates contributed numerous genes to the other classes of muscle proteins. The tunicates “invented” the first gap junction genes, so important in the Schwann cells and the development of rapidly transmitting myelinated axons. The crystalline genes of the lens of the eye are again of tunicate origin being found in the tunicate palp’s mucilage that enables the larval tunicate to attach to the sea bottom. These developments in mobility and vision provided the basis for the development of the efficient predatory capabilities of the Vertebrata. The evolutionary change from the lifestyle of the Branchiostomatidae to the shared portion of the lifestyles of the Tunicata and vertebrata is perhaps not so great a leap and the 84 orthologs that lie at the base of the Olfactores clade were clearly adequate to bridge this gap.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes15060657/s1. Table S1: DAVID ANALYSIS OF TUNICATE NON-CODING GENES (Functional Annotation Table); Table S2: Tunicate orthologs of vertebrate proteins. References [56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85] are cited in the Supplementary Materials.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Acknowledgments

The help given to this research by two individuals in France (for an introduction to ANISEED), and one each in Italy (for information on the lancelet) and Denmark (for general bioinformatics issues) is gratefully acknowledged—all four are cited in this paper, as is the help in the preparation of the manuscript by a person from Israel.

Conflicts of Interest

There are no conflicts of interest between the author and any institution or company.

References

  1. Domazet-Lošo, T.; Tautz, D. Phylostratigraphic tracking of cancer genes suggests a link to the emergence of multicellularity in metazoa. BMC Biol. 2010, 8, 66. [Google Scholar] [CrossRef] [PubMed]
  2. Delsuc, F.; Brinkmann, H.; Chourrout, D.; Philippe, H. Tunicates and not cephalo-chordates are the closest living relatives of vertebrates. Nature 2006, 439, 965–968. [Google Scholar] [CrossRef] [PubMed]
  3. Liebeskind, B.J.; McWhite, D.; Marcotte, E.M. Towards consensus gene ages. Genome Biol. Evol. 2016, 8, 1812–1823. [Google Scholar] [CrossRef] [PubMed]
  4. Litman, T.; Stein, W.D. Obtaining estimates for the ages of all the protein-coding genes and most of the ontology-identified noncoding genes of the human genome, assigned to 19 phylostrata. Sem. Oncol. 2019, 46, 3–9. [Google Scholar] [CrossRef]
  5. Colgren, J.; Nichols, S.A. MRTF specifies a muscle-like contractile module in Porifera. Nat. Commun. 2022, 13, 4134. [Google Scholar] [CrossRef] [PubMed]
  6. Zullo, L.; Bozzo, M.; Daya, A.; Di Clemente, A.; Mancini, F.P.; Megighian, A.; Nesher, N.; Röttinger, E.; Shomrat, T.; Tiozzo, S.; et al. The diversity of muscle s and their regenerative potential across animals. Cells 2020, 9, 1925. [Google Scholar] [CrossRef]
  7. Jahnel, S.M.; Walz, M.; Technau, U. Development and epithelial organisation of muscle cells in the sea anemone Nematostella vectensis. Front. Zoolog. 2014, 11, 44. [Google Scholar] [CrossRef]
  8. Wilkie, I.C.; Candia Carnevali, M.D.; Andrietti, F. Mechanical properties of sea-urchin lantern muscles: A comparative investigation of intact muscle groups in Paracentrotus lividus (Lam.) and Stylocidaris affinis (Phil.) (Echinodermata, Echinoidea). J. Comp. Physiol. B 1998, 168, 204–212. [Google Scholar] [CrossRef]
  9. Hiebert, T.C.; Gemmell, B.J.; von Dassow, G.; Conley, K.R.; Sutherland, K.R. The hydrodynamics and kinematics of the appendicularian tail underpin peristaltic pumping. J. R. Soc. Interface 2023, 20, 20230404. [Google Scholar] [CrossRef]
  10. Lindskog, C.; Linné, J.; Fagerberg, L.; Hallström, B.M.; Sundberg, C.J.; Lindholm, M.; Huss, M.; Kampf, C.; Choi, H.; Liem, D.A.; et al. The human cardiac and skeletal muscle proteomes defined by transcriptomics and antibody-based profiling. BMC Genom. 2015, 16, 475. [Google Scholar] [CrossRef]
  11. Schoenauer, R.; Bertoncini, P.; Machaidze, G.; Aebi, U.; Perriard, J.C.; Hegner, M.; Agarkova, I. Myomesin is a molecular spring with adaptable elasticity. J. Mol. Biol. 2005, 349, 367–379. [Google Scholar] [CrossRef] [PubMed]
  12. Abascal, F.; Zardoya, R. Evolutionary analyses of gap junction protein families. Biochim. Biophys. Acta 2013, 1828, 4–14. [Google Scholar] [CrossRef]
  13. Lucaciu, S.A.; Leighton, S.E.; Hauser, A.; Yee, R.; Laird, D.W. Diversity in connexin biology. J. Biol. Chem. 2023, 299, 105263. [Google Scholar] [CrossRef] [PubMed]
  14. Purves, D.; Augustine, G.J.; Fitzpatrick, D.; Katz, L.C.; LaMantia, A.-S.; McNamara, J.O.; Williams, S.M. (Eds.) Neuroscience, 2nd ed.; Sinauer Associates: Sunderland, MA, USA, 2001. [Google Scholar]
  15. More, H.L.; OʼConnor, S.M.; Brøndum, E.; Wang, T.; Bertelsen, M.F.; Grøndahl, C.; Kastberg, K.; Hørlyck, A.; Funder, J.; Maxwell Donelan, J. Sensorimotor responsiveness and resolution in the giraffe. J. Exp. Biol. 2013, 216, 1003–1011. [Google Scholar] [CrossRef]
  16. Weil, M.T.; Heibeck, S.; Topperwien, M.; tom Dieck, S.; Ruhwedel, T.; Salditt, T.; Rodicio, M.C.; Morgan, J.R.; Nave, K.A.; Mobius, W.; et al. Axonal ensheathment in the nervous system of lamprey: Implications for the evolution of myelinating glia. J. Neurosci. 2018, 38, 6586–6596. [Google Scholar] [CrossRef]
  17. Gould, R.M.; Morrison, H.G.; Gilland, E.; Campbell, R.K. Myelin tetraspan family proteins but no non-tetraspan family proteins are present in the ascidian (C. intestinalis) genome. Biol. Bull. 2005, 209, 49–66. [Google Scholar] [CrossRef] [PubMed]
  18. Patel, S.D.; Ciatto, C.; Chen, C.P.; Bahna, F.; Rajebhosale, M.; Arkus, N.; Schieren, I.; Jessell, T.M.; Honig, B.; Price, S.R.; et al. Type II cadherin ectodomain structures: Implications for classical cadherin specificity. Cell 2006, 124, 1255–1268. [Google Scholar] [CrossRef]
  19. Taneyhill, L.A. To adhere or not to adhere: The role of cadherins in neural crest development. Cell Adhes. Migr. 2008, 2, 223–230. [Google Scholar] [CrossRef] [PubMed]
  20. Oda, H.; Akiyama-Oda, Y.; Zhang, S. Two classic cadherin-related molecules with no cadherin extracellular repeats in the cephalochordate amphioxus: Distinct adhesive specificities and possible involvement in the development of multicell-layered structures. J. Cell Sci. 2003, 117, 2757–2767. [Google Scholar] [CrossRef]
  21. York, J.R.; McCauley, D.W. The origin and evolution of vertebrate neural crest cells. Open Biol. 2020, 10, 190285. [Google Scholar] [CrossRef]
  22. Noda, T.; Satoh, N. A comprehensive survey of cadherin superfamily gene expression patterns in C. intestinalis. Gene Expr. Patterns 2008, 8, 349–356. [Google Scholar] [CrossRef] [PubMed]
  23. Jeffery, W.R.; Chiba, T.; Krajka, F.R.; Deyts, C.; Satoh, N.; Joly, J.S. Trunk lateral cells are neural crest-like cells in the ascidian C. intestinalis: Insights into the ancestry and evolution of the neural crest. Dev. Biol. 2008, 324, 152–160. [Google Scholar] [CrossRef] [PubMed]
  24. Niessen, C.M.; Leckband, D.; Yap, A.S. Tissue organization by cadherin adhesion molecules: Dynamic molecular and cellular mechanisms of morphogenetic regulation. Physiol. Rev. 2011, 91, 691–731. [Google Scholar] [CrossRef] [PubMed]
  25. Lu, Y.C.; Nazarko, O.V.; Sando, I.I.I.R.; Salzman, G.S.; Li, N.S.; Südhof, T.C.; Araç, D. Structural basis of latrophilin-FLRT-UNC5 interaction in cell adhesion. Structure 2015, 23, 1678–1691. [Google Scholar] [CrossRef] [PubMed]
  26. del Toro, D.; Carrasquero-Ordaz, M.A.; Chu, A.; Ruff, T.; Shahin, M.; Jackson, V.A.; Chavent, M.; Berbeira-Santana, M.; Seyit-Bremer, G.; Brignani, S.; et al. Structural basis of teneurin-latrophilin interaction in repulsive guidance of migrating neurons. Cell 2020, 180, 323–339. [Google Scholar] [CrossRef] [PubMed]
  27. Furuse, M.; Fujita, K.; Hiiragi, T.; Fujimoto, K.; Tsukita, S. Claudin-1 and -2: Novel integral membrane proteins localizing at tight junctions with no sequence similarity to occludin. J. Cell Biol. 1998, 141, 1539–1550. [Google Scholar] [CrossRef] [PubMed]
  28. Günzel, D.; Yu, A.S.L. Claudins and the modulation of tight junction permeability. Physiol. Rev. 2013, 93, 525–569. [Google Scholar] [CrossRef]
  29. Kollmar, R.; Nakamura, S.K.; Kappler, J.A.; Hudspeth, A.J. Expression and phylogeny of claudins in vertebrate primordia. Proc. Natl. Acad. Sci. USA 2001, 98, 10196–10201. [Google Scholar] [CrossRef] [PubMed]
  30. Mellott, D.O.; Burke, R.D. The molecular phylogeny of eph receptors and ephrin ligands. BMC Cell Biol. 2008, 9, 1–8. [Google Scholar] [CrossRef]
  31. Taylor, H.; Campbell, J.; Nobes, C.D. Ephs and ephrins. Curr. Biol. 2017, 27, R83–R102. [Google Scholar] [CrossRef]
  32. Williaume, G.; de Buyl, S.; Sirour, C.; Haupaix, N.; Bettoni, R.; Imai, K.S.; Satou, Y.; Dupont, G.; Hudson, C.; Yasuo, H. Cell geometry, signal dampening, and a bimodal transcriptional response underlie the spatial precision of an ERK-mediated embryonic induction. Dev. Cell 2021, 56, 2966–2979. [Google Scholar] [CrossRef] [PubMed]
  33. Fiuza, U.M.; Negishi, T.; Rouan, A.; Yasuo, H.; Lemaire, P. A Nodal/Eph signalling relay drives the transition from apical constriction to apico-basal shortening in ascidian endoderm invagination. Development 2020, 147, dev186965. [Google Scholar] [CrossRef]
  34. Negishi, T.; Nishida, H. Asymmetric and Unequal Cell Divisions in Ascidian Embryos. In Asymmetric Cell Division in Development, Differentiation and Cancer. Results and Problems in Cell Differentiation; Tassan, J.P., Kubiak, J., Eds.; Springer: Cham, Switzerland, 2017; Volume 61. [Google Scholar] [CrossRef]
  35. Haupaix, N.; Abitua, P.B.; Sirour, C.; Yasuo, H.; Levine, M.; Hudson, C. Ephrin-mediated restriction of ERK1/2 activity delimits the number of pigment cells in the C. CNS. Dev. Biol. 2014, 394, 170–180. [Google Scholar] [CrossRef] [PubMed]
  36. Stolfi, A.; Wagner, E.; Taliaferro, J.M.; Chou, S.; Levine, M. Neural tube patterning by Ephrin, FGF and Notch signaling relays. Development 2011, 138, 5429–5439. [Google Scholar] [CrossRef]
  37. Gee, R.R.F.; Chen, H.; Lee, A.K.; Daly, C.A.; Wilander, B.A.; Fon Tacer, K.; Potts, P.R. Emerging roles of the MAGE protein family in stress response pathways. J. Biol. Chem. 2020, 295, 16121–16155. [Google Scholar]
  38. Ray, M.E.; Wistow, G.; Su, Y.A.; Meltzer, P.S.; Trent, J.M. AIM1, a novel non-lens member of the bg-crystallin superfamily, is associated with the control of tumorigenicity in human malignant melanoma. Proc. Natl. Acad. Sci. USA 1997, 94, 3229–3234. [Google Scholar] [CrossRef]
  39. Shimeld Sebastian, M.; Purkiss, A.G.; Dirks, R.P.H.; Bateman, O.A.; Slingsby, C.; Lubsen, N.H. Urochordate-crystallin and the evolutionary origin of the vertebrate eye lens. Curr. Biol. 2005, 15, 1684–1689. [Google Scholar] [CrossRef]
  40. Riyahi, K.; Shimeld, S.M. Chordate βγ-crystallins and the evolutionary developmental biology of the vertebrate lens. Comp. Biochem. Physiol. Part B 2007, 147, 347–357. [Google Scholar] [CrossRef] [PubMed]
  41. Kappe, G.; Purkiss, A.G.; van Genesen, S.T.; Slingsby, C.; Lubsen, N.H. Explosive expansion of bc-crystallin genes in the ancestral vertebrate. J. Mol. Evol. 2010, 71, 219–230. [Google Scholar] [CrossRef]
  42. Slingsby, C.; Wistow, G.J.; Clark, A.R. Evolution of crystallins for a role in the vertebrate eye lens. Prot. Sci. 2013, 22, 367–380. [Google Scholar] [CrossRef]
  43. Cvekl, A.; Zhao, Y.; Mcgreal Rebecca Xie, Q.; Gu, X.; Zheng, D. Evolutionary origins of Pax6 control of crystallin genes. Genome Biol. Evol. 2017, 9, 2075–2092. [Google Scholar] [CrossRef] [PubMed]
  44. Zenga, F.; Wunderera, J.; Salvenmosera, W.; Hessb, M.W.; Ladurnera, P.; Rothbächera, U. Papillae revisited and the nature of the adhesive secreting collocytes. Dev. Biol. 2019, 448, 183–198. [Google Scholar] [CrossRef] [PubMed]
  45. Horie, T.; Sakurai, D.; Ohtsuki, H.; Terakita, A.; Shichida, Y.; Usukura, J.; Kusakabe, T.; Tsuda, M. Pigmented and nonpigmented ocelli in the brain vesicle of the ascidian larva. J. Comp. Neur. 2008, 509, 88–102. [Google Scholar] [CrossRef] [PubMed]
  46. Kozlyuk, N.; Sengupta, S.; Bierma, J.C.; Martin, R.W. Calcium binding dramatically stabilizes an ancestral crystallin fold in tunicate βγ-crystallin. Biochemistry 2016, 55, 6961–6968. [Google Scholar] [CrossRef] [PubMed]
  47. Cohen, S.M.; Bronner, G.; Kuttner, F.; Jurgens, G.; Jackie, H. Distal-less encodes a homoeodomain protein required for limb development in Drosophila. Nature 1989, 338, 432–434. [Google Scholar] [CrossRef] [PubMed]
  48. Williams, T.A.; Nulsen, C.; Nagy, L.M. A complex role for distal-less in crustacean appendage development. Dev. Biol. 2002, 241, 302–312. [Google Scholar] [CrossRef] [PubMed]
  49. Ullah, A.; Hammid, A.; Umair, M.; Ahmad, W. A novel heterozygous intragenic sequence variant in DLX6 probably underlies first case of autosomal dominant split-hand/foot malformation type 1. Mol. Syndromol. 2017, 8, 79–84. [Google Scholar] [CrossRef] [PubMed]
  50. Arendt, D. Many ways to build a polyp. Trends Gen. 2019, 35, 885–887. [Google Scholar] [CrossRef] [PubMed]
  51. Lowe, C.J.; Wray, G.A. Radical alterations in the roles of homeobox genes during echinoderm evolution. Nature 1997, 389, 718–721. [Google Scholar] [CrossRef]
  52. Prpic, N.M.; Tautz, D. The expression of the proximodistal axis patterning genes Distal-less and dachshund in the appendages of Glomeris marginata (Myriapoda: Diplopoda) suggests a special role of these genes in patterning the head appendages. Dev. Biol. 2003, 260, 97–112. [Google Scholar] [CrossRef]
  53. Caracciolo, A.; Di Gregorio, A.; Aniello, F.; Di Lauro, R.; Branno, M. Identification and developmental expression of three Distal-less homeobox containing genes in the ascidian C. intestinalis. Mech. Dev. 2000, 99, 173–176. [Google Scholar] [CrossRef]
  54. Holland, N.D.; Panganiban, G.; Henyey, E.L.; Holland, L.Z. Sequence and developmental expression of amphidll, an amphioxus Distal-less gene transcribed in the ectoderm, epidermis and nervous system: Insights into evolution of craniate forebrain and neural crest. Development 1996, 122, 2911–2920. [Google Scholar] [CrossRef]
  55. Shyamala, K.; Yanduri, S.; Girish, H.C.; Murgod, S. Neural crest: The fourth germ layer. J. Oral. Maxillofac. Pathol. 2015, 19, 221–229. [Google Scholar] [CrossRef]
  56. Wang, S.; DeLeon, C.; Sun, W.; Quae, S.R.; Roth, B.L.; Südhof, T.C. Alternative splicing of latrophilin-3 controls synapse formation. Nature 2024, 626, 128–135. [Google Scholar] [CrossRef]
  57. Huber, P.A.J. Caldesmon. Int. J. Biochem. Cell Biol. 1997, 29, 1047–1051. [Google Scholar] [CrossRef] [PubMed]
  58. Polanco, J.; Reyes-Vigil, F.; Weisberg, S.D.; Dhimitruka, I.; Brusés, J.L. differential spatiotemporal expression of type i and type ii cadherins associated with the segmentation of the central nervous system and formation of brain nuclei in the developing mouse. Front. Mol. Neurosci. 2021, 14, 2021. [Google Scholar] [CrossRef] [PubMed]
  59. Hashimoto, H.; Munro, E. Differential expression of a classic cadherin directs tissue-level contractile asymmetry during neural tube closure. Dev. Cell 2019, 51, 158–172. [Google Scholar] [CrossRef] [PubMed]
  60. Paulson, A.F.; Prasad, M.S.; Thuringer, A.H.; Manzerra, P. Regulation of cadherin expression in nervous system development. Cell Adhes. Migr. 2014, 8, 19–28. [Google Scholar] [CrossRef]
  61. Matsuoka, H.; Yamaoka, A.; Hamashima, T.; Shima, A.; Kosako, M.; Tahara, Y.; Kamishikiryo, J.; Michihara, A. EGF-dependent activation of ELK1 contributes to the induction of CLDND1 expression involved in tight junction formation. Biomedicines 2022, 10, 1792. [Google Scholar] [CrossRef]
  62. Exposito, J.Y.; Cluzel, C.; Garrone, R.; Lethias, C. Evolution of collagens. Anatom. Rec. 2002, 268, 302–316. [Google Scholar] [CrossRef]
  63. Panganiban, G.; Rubenstein, J.L.R. Developmental functions of the Distal-less/Dlx homeobox genes. Development 2002, 129, 4371–4386. [Google Scholar] [CrossRef] [PubMed]
  64. Stolfi, A.; Gainous, T.B.; Young, J.J.; Mori, A.; Levine, M.; Christiaen, L. Early chordate origins of the vertebrate second heart field. Science 2010, 329, 565–568. [Google Scholar] [CrossRef]
  65. Moser, C.; Gosselé, K.A.; Balaz, M.; Balazova, L.; Horvath, C.; Künzle, P.; Okreglicka, K.M.; Li, F.; Blüher, M.; Stierstorferf, B.; et al. FAM3D: A gut secreted protein and its potential in the regulation of glucose metabolism. Peptides 2023, 167, 171047. [Google Scholar] [CrossRef]
  66. Chen, W.; Gao, D.; Xie, L.; Wang, A.; Zhao, H.; Guo, C.; Sun, Y.; Nie, Y.; Hong, A.; Xiong, S. SCF-FBXO24 regulates cell proliferation by mediating ubiquitination and degradation of PRMT6. Biochem. Biophys. Res. Comm. 2020, 530, 75–81. [Google Scholar] [CrossRef] [PubMed]
  67. Satou, Y.; Tokuoka, M.; Oda-Ishii, I.; Tokuhiro, S.; Ishida, T.; Liu, B.; Iwamura, Y. A manually curated gene model set for an ascidian, C. robusta (C. intestinalis type A). Zool. Sci. 2022, 39, 253–260. [Google Scholar] [CrossRef] [PubMed]
  68. Smith, H.M.; Khairallah, S.M.; Nguyen, A.H.; Newman-Smith, E.; Smith, W.C. Misregulation of cell adhesion molecules in the C. neural tube closure mutant bugeye. Dev. Biol. 2021, 480, 14–24. [Google Scholar] [CrossRef]
  69. Abitua, P.B.; Gainous, T.B.; Kaczmarczyk, A.N.; Winchell, C.J.; Hudson, C.; Kamata, K.; Nakagawa, M.; Tsuda, M.; Kusakabe, T.G.; Levine, M. The pre-vertebrate origins of neurogenic placodes. Nature 2015, 524, 462–465. [Google Scholar] [CrossRef]
  70. Edvardsen, R.B.; Seo, H.C.; Jensen, M.F.; Mialon, A.; Mikhaleva, J.; Bjordal, M.; Cartry, J.; Reinhardt, R.; Weissenbach, J.; Wincker, P.; et al. Remodelling of the homeobox gene complement in the tunicate Oikopleura dioica. Curr. Biol. 2005, 15, R12–R13. [Google Scholar] [CrossRef]
  71. Labat-de-Hoz, L.; Rubio-Ramos, A.; Correas, I.; Alonso, M.A. The MAL family of proteins: Normal function, expression in cancer, and potential use as cancer biomarkers. Cancers 2023, 15, 2801. [Google Scholar] [CrossRef]
  72. Chen, Y.H.; Pai, C.W.; Huang, S.W.; Chang, S.N.; Lin, L.Y.; Chiang, F.T.; Lin, J.L.; Hwang, J.J.; Tsai, C.T. Inactivation of myosin binding protein C homolog in zebrafish as a model for human cardiac hypertrophy and diastolic dysfunction. J. Am. Heart Assoc. 2013, 2, e000231. [Google Scholar] [CrossRef]
  73. Razy-Krajka, F.; Stolf, A. Regulation and evolution of muscle development in tunicates. EvoDevo 2019, 10, 13. [Google Scholar] [CrossRef]
  74. Heissler, S.M.; Sellers, J.R. Myosin light chains: Teaching old dogs new tricks. Bioarchitecture 2014, 4, 169–188. [Google Scholar] [CrossRef] [PubMed]
  75. Ramirez, I.; Gholkar, A.A.; Velasquez, E.F.; Guo, X.; Tofig, B.; Damoiseaux, R.; Torres, J.Z. The myosin regulatory light chain Myl5 localizes to mitotic spindle poles and is required for proper cell division. Cytoskeleton 2021, 78, 23–35. [Google Scholar] [CrossRef] [PubMed]
  76. Shaffer, J.F.; Gillis, T.E. Evolution of the regulatory control of vertebrate striated muscle: The roles of troponin I and myosin binding protein-C. Physiol. Genom. 2010, 42, 406–419. [Google Scholar] [CrossRef]
  77. Langea, S.; Pinotsisc, N.; Agarkovad, I.; Ehlere, E. The M-band: The underestimated part of the sarcomere. BBA-Mol. Cell Res. 2020, 1867, 118440. [Google Scholar] [CrossRef] [PubMed]
  78. Du Pasquier, L. Speculations on the origin of the vertebrate immune system. Immunol. Lett. 2004, 92, 3–9. [Google Scholar] [CrossRef]
  79. Juriloff, D.M.; Harris, M.J. Insights into the etiology of mammalian neural tube closure defects from developmental, genetic and evolutionary studies. J. Dev. Biol. 2018, 6, 22. [Google Scholar] [CrossRef]
  80. Medina-Martinez, O.; Shah, R.; Jamrich, M. Pitx3 controls multiple aspects of lens development. Dev. Dyn. 2009, 238, 2193–2201. [Google Scholar] [CrossRef]
  81. Scales, S.J.; Hesser, B.A.; Masuda, E.S.; Scheller, R.H. Amisyn, a novel syntaxin-binding protein that may regulate snare complex assembly. J. Biol. Chem. 2002, 277, 28271–28279. [Google Scholar] [CrossRef]
  82. Wei, B.; Jin, J.P. TNNT1, TNNT2, and TNNT3: Isoform genes, regulation, and structure-function relationships. Gene 2016, 582, 1–13. [Google Scholar] [CrossRef]
  83. Satou, Y.; Imai, K.S. Ascidian Zic Genes. In Zic Family. Advances in Experimental Medicine and Biology; Aruga, J., Ed.; Springer: Singapore, 2018; Volume 1046. [Google Scholar] [CrossRef]
  84. Chen, J.S.; Pedro, M.S.; Zeller, R.W. miR-124 function during C. intestinalis neuronal development includes extensive interaction with the Notch signaling pathway. Development 2011, 138, 4943–4953. [Google Scholar] [CrossRef] [PubMed]
  85. Mercurio, S.; Cauteruccio, S.; Manenti, R.; Candiani, S.; Scarì, G.; Licandro, E.; Pennati, R. miR-7 Knockdown by Peptide Nucleic Acids in the Ascidian C. intestinalis. Int. J. Mol. Sci. 2019, 20, 5127. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Cladistic diagram of animal evolution with the tunicates as the sister clade of the vertebrates, together forming the Olfactores clade [2].
Figure 1. Cladistic diagram of animal evolution with the tunicates as the sister clade of the vertebrates, together forming the Olfactores clade [2].
Genes 15 00657 g001
Figure 2. Phylogram of the MYL proteins, rooted at the Coral ortholog of MYL6. The orthologs contributed by the tunicates are shown in bold type.
Figure 2. Phylogram of the MYL proteins, rooted at the Coral ortholog of MYL6. The orthologs contributed by the tunicates are shown in bold type.
Genes 15 00657 g002
Figure 3. Phylogram of the MYH proteins, rooted at the annotation of the Trichoplax ortholog of MYH10. Tunicate orthologs in bold type. Fish orthologs underlined.
Figure 3. Phylogram of the MYH proteins, rooted at the annotation of the Trichoplax ortholog of MYH10. Tunicate orthologs in bold type. Fish orthologs underlined.
Genes 15 00657 g003
Figure 4. Phylogram of the MYBPC and MYOM proteins rooted at the Lancelet ortholog of the giant muscle protein Titin, which appeared with the Branchiostomata. The orthologs contributed by the tunicates are shown in bold type and underlined.
Figure 4. Phylogram of the MYBPC and MYOM proteins rooted at the Lancelet ortholog of the giant muscle protein Titin, which appeared with the Branchiostomata. The orthologs contributed by the tunicates are shown in bold type and underlined.
Genes 15 00657 g004
Figure 5. Phylogram of the Gap Junction proteins. Those found with orthologs in the Tunicata are labelled on the figure.
Figure 5. Phylogram of the Gap Junction proteins. Those found with orthologs in the Tunicata are labelled on the figure.
Genes 15 00657 g005
Figure 6. (A) depicts the alignment of XP_019638343.1, the annotation for the sequence of the titin homolog of Branchiostomata belcheri against the seven tunicate Gap Junction genes listed in Table S1; (B) shows the alignment against the 891 portion of the sequence that spans those of the shorter GAP Junction sequences.
Figure 6. (A) depicts the alignment of XP_019638343.1, the annotation for the sequence of the titin homolog of Branchiostomata belcheri against the seven tunicate Gap Junction genes listed in Table S1; (B) shows the alignment against the 891 portion of the sequence that spans those of the shorter GAP Junction sequences.
Genes 15 00657 g006
Figure 7. Phylogram of the seven tunicate Gap Junction proteins rooted at the 891-long sequence of the Titin Branchiostomata belcheri homolog.
Figure 7. Phylogram of the seven tunicate Gap Junction proteins rooted at the 891-long sequence of the Titin Branchiostomata belcheri homolog.
Genes 15 00657 g007
Figure 8. Phylogram of the cadherin proteins of H. sapiens. Those with tunicate orthologs are indicated as such on the figure. The single cadherin with an ortholog in the Branchiostomatidae is indicated as “lancelet”.
Figure 8. Phylogram of the cadherin proteins of H. sapiens. Those with tunicate orthologs are indicated as such on the figure. The single cadherin with an ortholog in the Branchiostomatidae is indicated as “lancelet”.
Genes 15 00657 g008
Figure 9. Unrooted phylogram of the human claudin proteins with the three tunicate-shared orthologs indicated as such, the single Branchiostomatidae-shared ortholog indicated as “lancelet”, the single lamprey-shared, and the two shark-shared orthologs indicated as such.
Figure 9. Unrooted phylogram of the human claudin proteins with the three tunicate-shared orthologs indicated as such, the single Branchiostomatidae-shared ortholog indicated as “lancelet”, the single lamprey-shared, and the two shark-shared orthologs indicated as such.
Genes 15 00657 g009
Figure 10. Comparison of the sequence of the CLDN18 orthologs from the tunicate C. intestinalis and the lamprey P. marinus. The sequence of the lamprey ortholog is shown as the lower sequence of the two in each row.
Figure 10. Comparison of the sequence of the CLDN18 orthologs from the tunicate C. intestinalis and the lamprey P. marinus. The sequence of the lamprey ortholog is shown as the lower sequence of the two in each row.
Genes 15 00657 g010
Figure 11. (A)—Phylogram of the relation between the proteins of the EPHA series, the phylogram being rooted at EPHA5. (B)—the corresponding phylogram for the genes of the EPHB series, the phylogram being rooted at EPHB2.
Figure 11. (A)—Phylogram of the relation between the proteins of the EPHA series, the phylogram being rooted at EPHA5. (B)—the corresponding phylogram for the genes of the EPHB series, the phylogram being rooted at EPHB2.
Genes 15 00657 g011
Figure 12. Phylogram of the relation between the Ephrin proteins—the EFNA and EFNB series separate into their two subgroups.
Figure 12. Phylogram of the relation between the Ephrin proteins—the EFNA and EFNB series separate into their two subgroups.
Genes 15 00657 g012
Figure 13. Phylogram of the Type II MAGE proteins, together with a separate protein NDN, necdin, also a member of the MAGE family.
Figure 13. Phylogram of the Type II MAGE proteins, together with a separate protein NDN, necdin, also a member of the MAGE family.
Genes 15 00657 g013
Figure 14. Multiple alignment of the six gamma crystallins, compared with the single crystallin of the tunicates. The figure was generated by the COBALT program of the National Library of Medicine (see Methods). From top to bottom, the proteins depicted are:Chain A, betagamma-crystallin of C. intestinalis, CRYGA, CRYGB, CRYGC, CRYGD, CRYGN, and CRYGS of H. sapiens. The length of each sequence in amino acids is listed under the heading END.
Figure 14. Multiple alignment of the six gamma crystallins, compared with the single crystallin of the tunicates. The figure was generated by the COBALT program of the National Library of Medicine (see Methods). From top to bottom, the proteins depicted are:Chain A, betagamma-crystallin of C. intestinalis, CRYGA, CRYGB, CRYGC, CRYGD, CRYGN, and CRYGS of H. sapiens. The length of each sequence in amino acids is listed under the heading END.
Genes 15 00657 g014
Figure 15. Phylogram of the six gamma crystallins together with the Tunicate crystallin. The figure was generated by the COBALT program of the National Library of Medicine (see Methods), being rooted at the Tunicate crystallin. (see also [39,40,41,42,43] for fuller discussions of the evolution and biology of the lens and of the crystallin proteins).
Figure 15. Phylogram of the six gamma crystallins together with the Tunicate crystallin. The figure was generated by the COBALT program of the National Library of Medicine (see Methods), being rooted at the Tunicate crystallin. (see also [39,40,41,42,43] for fuller discussions of the evolution and biology of the lens and of the crystallin proteins).
Genes 15 00657 g015
Figure 16. Dotplots produced by the BLAST program (see Methods) with the tunicate sequence on the x-axis and, on the y-axis, in (A) the CRYGN and in (B) the CRYGS of the lamprey Petromyzon marinus.
Figure 16. Dotplots produced by the BLAST program (see Methods) with the tunicate sequence on the x-axis and, on the y-axis, in (A) the CRYGN and in (B) the CRYGS of the lamprey Petromyzon marinus.
Genes 15 00657 g016
Figure 17. Alignment of the amino-acid sequences of the lamprey CRYGN protein and the tunicate crystallin. The wide arrows show the mutations that occurred between the lamprey and tunicate in the sites that bind calcium in the latter organism (sequence data from [46]).
Figure 17. Alignment of the amino-acid sequences of the lamprey CRYGN protein and the tunicate crystallin. The wide arrows show the mutations that occurred between the lamprey and tunicate in the sites that bind calcium in the latter organism (sequence data from [46]).
Genes 15 00657 g017
Figure 18. Phylogram of the six DLX genes of the human genome together with the gene Dll of Drosophila melanogaster, rooted at the Dll gene of Hydra vulgaris. The organisms in which the orthologs with the human gene first appeared are indicated next to each gene. The phylogram was built using the COBALT program (see Methods).
Figure 18. Phylogram of the six DLX genes of the human genome together with the gene Dll of Drosophila melanogaster, rooted at the Dll gene of Hydra vulgaris. The organisms in which the orthologs with the human gene first appeared are indicated next to each gene. The phylogram was built using the COBALT program (see Methods).
Genes 15 00657 g018
Figure 19. Multiple alignment of the six DLX proteins of the human genome together with the gene Dll of Drosophila melanogaster. The rows from the top: Dll of Hydra, Dll of the fly, and then DLX 2, 5, 3, 1, 6, and 4 of H. sapiens. The alignment was made using the COBALT program (see Methods). The number of amino-acid residues in each protein is listed under “End”.
Figure 19. Multiple alignment of the six DLX proteins of the human genome together with the gene Dll of Drosophila melanogaster. The rows from the top: Dll of Hydra, Dll of the fly, and then DLX 2, 5, 3, 1, 6, and 4 of H. sapiens. The alignment was made using the COBALT program (see Methods). The number of amino-acid residues in each protein is listed under “End”.
Genes 15 00657 g019
Table 1. Orthologs from Phylostratum 10: Consortium’s Spread as steps (see text).
Table 1. Orthologs from Phylostratum 10: Consortium’s Spread as steps (see text).
HGNCStepsEqual to ModeHGNCStepsEqual to Mode
ADGRL33.154MGAT4D4.084
ANKRD164.774MSMP1.156
ANKRD422.388MUM12.105
ARHGEF382.674MYBPC32.086
ASB23.925MYL55.013
ATRAID0.4611NEBL2.166
BCL6B2.954NECTIN31.854
C18ORF210.629NEXN2.465
CALML62.478NPC1L14.855
CASR1.927OLFML2A1.545
CATIP3.184OTULIN1.557
CBLN11.384PDX12.774
CCDC782.624PHF21A2.464
CEP633.405PODN3.464
CEP832.467PRR142.276
CKAP22.545RBM143.613
CLDN10.837RHBDL24.544
CLDN181.255RNF44.853
CLDN190.927SCLT13.186
CLDN71.236SERINC54.385
EFNA11.924SLC35G23.314
FABP21.008SLC35G34.863
FAM155B1.545SLC35G45.133
FAM187A1.236SLC35G55.503
FAM3D2.384SLC35G64.254
FBXO242.236SLC6A143.543
FOXH12.955SLC6A63.464
GJA102.025SNRNP482.318
GJA31.167SPATA214.503
GJC22.386STXBP61.778
GNA154.003TGFB12.154
GTF2IRD23.003TGFB21.546
GTF2IRD2B4.303TLCD23.623
GVQW14.052TMEM2180.629
H1FOO3.233TNNC14.086
HMMR2.928TNNC24.084
HNF1A1.157TSPAN122.695
HSPB13.233WSB12.314
IKZF52.855ZBED52.464
IL17C1.784ZBED82.834
KIZ1.236ZC3H7B1.388
LRRC293.094ZNF914.882
Table 2. Names and tissue distributions of frequently studied Cadherins.
Table 2. Names and tissue distributions of frequently studied Cadherins.
Familiar NameTissue DistributionHGNCTypeHas Tunicate Ortholog?
Cadherin E EpithelialCDH1IYes
Cadherin N NeuralCDH2IYes
Cadherin P PlacentalCDH3I
Cadherin R RetinalCDH4I
Cadherin VE EpithelialCDH5II
Cadherin K Brain; kidneyCDH6II
Cadherin OB OsteoblastCDH11IIYes
Cadherin BR BrainCDH12II
Cadherin T and H HeartCDH13I
Cadherin M MuscleCDH15I
Table 3. Latest orthologs of Ephron and Ephrin genes.
Table 3. Latest orthologs of Ephron and Ephrin genes.
Ephron GenesEphrin Genes
HGNC SymbolLatest Ortholog in:HGNC SymbolLatest Ortholog in:
EPHA2TunicataEFNA1Tunicata
EPHA3ElasmobranchiiEFNA2Tunicata
EPHA4TunicataEFNA3Tunicata
EPHA5BranchiostomataEFNA4Tunicata
EPHA6ElasmobranchiiEFNA5Tunicata
EPHA7ElasmobranchiiEFNB1Tunicata
EPHA8ElasmobranchiiEFNB2Branchiostomato
EPHA10OsteichthyesEFNB3Elasmobranchii
EPHB2Branchiostomata
EPHB3Elasmobranchii
EPHB4Cnidaria
Table 4. Abridgement of DAVID cluster analysis of 36 genes not discussed in main text.
Table 4. Abridgement of DAVID cluster analysis of 36 genes not discussed in main text.
TermGenesFDR *
Cluster 2
heterophilic cell-cell adhesion via plasma membrane cell adhesion moleculesNECTIN3, CBLN1, NECTIN10.358759
identical protein bindingTFAP2E, COL23A1, HSPB1, NECTIN3, CBLN1, NECTIN10.671885
SynapseNECTIN3, CBLN1, NECTIN10.935728
Cluster 4
homophilic cell adhesion via plasma membrane adhesion moleculesPALLD, NECTIN3, NECTIN10.739434
DOMAIN:Ig-like C2-type 1PALLD, NECTIN3, NECTIN10.656496
DOMAIN:Ig-like C2-type 2PALLD, NECTIN3, NECTIN10.656496
Immunoglobulin domainPALLD, NECTIN3, NECTIN10.877976
Immunoglobulin-like foldPALLD, NECTIN3, NECTIN10.984375
Cluster 6
Z discPALLD, SYNPO2, HSPB10.481232
actin cytoskeletonCALD1, PALLD, SYNPO20.564483
actin bindingCALD1, PALLD, SYNPO20.614614
focal adhesionPALLD, SYNPO2, HSPB10.679141
Cluster 7
Signaling pathways regulating pluripotency of stem cellsFZD3, ZIC3, FZD60.180791
Developmental proteinFZD3, ZIC3, FZD6, PITX30.819281
Cluster 8
ZincZMAT1, ZIC3, ASXL31
Zinc-fingerZMAT1, ZIC3, ASXL30.988184
Metal-bindingZMAT1, ZIC3, ASXL31
* False Discovery Rate (maximum is 1).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Stein, W.D. Orthologs at the Base of the Olfactores Clade. Genes 2024, 15, 657. https://doi.org/10.3390/genes15060657

AMA Style

Stein WD. Orthologs at the Base of the Olfactores Clade. Genes. 2024; 15(6):657. https://doi.org/10.3390/genes15060657

Chicago/Turabian Style

Stein, Wilfred D. 2024. "Orthologs at the Base of the Olfactores Clade" Genes 15, no. 6: 657. https://doi.org/10.3390/genes15060657

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop