Next Article in Journal
Variable Selection for Sparse Data with Applications to Vaginal Microbiome and Gene Expression Data
Previous Article in Journal
Increased On-Target Rate and Risk of Concatemerization after CRISPR-Enhanced Targeting in ES Cells
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

CD99 and the Chicken Alloantigen D Blood System †

by
Janet E. Fulton
1,*,
Wiola Drobik-Czwarno
2,
Ashlee R. Lund
1,
Carl J. Schmidt
3 and
Robert L. Taylor, Jr.
4
1
Hy-Line International, Dallas Center, IA 50063, USA
2
Department of Animal Genetics and Conservation, Institute of Animal Science, Warsaw University of Life Sciences, 02-787 Warsaw, Poland
3
Department of Animal and Food Science, University of Delaware, Newark, DE 19716, USA
4
Division of Animal and Nutritional Sciences, West Virginia University, Morgantown, WV 26506, USA
*
Author to whom correspondence should be addressed.
Scientific Article No: 3451 of the West Virginia Agricultural and Forestry Experiment Station, Morgantown.
Genes 2023, 14(2), 402; https://doi.org/10.3390/genes14020402
Submission received: 11 November 2022 / Revised: 9 December 2022 / Accepted: 28 January 2023 / Published: 3 February 2023
(This article belongs to the Section Animal Genetics and Genomics)

Abstract

:
The chicken D blood system is one of 13 alloantigen systems found on chicken red blood cells. Classical recombinant studies located the D blood system on chicken chromosome 1, but the candidate gene was unknown. Multiple resources were utilized to identify the chicken D system candidate gene, including genome sequence information from both research and elite egg production lines for which D system alloantigen alleles were reported, and DNA from both pedigree and non-pedigree samples with known D alleles. Genome-wide association analyses using a 600 K or a 54 K SNP chip plus DNA from independent samples identified a strong peak on chicken chromosome 1 at 125–131 Mb (GRCg6a). Cell surface expression and the presence of exonic non-synonymous SNP were used to identify the candidate gene. The chicken CD99 gene showed the co-segregation of SNP-defined haplotypes and serologically defined D blood system alleles. The CD99 protein mediates multiple cellular processes including leukocyte migration, T-cell adhesion, and transmembrane protein transport, affecting peripheral immune responses. The corresponding human gene is found syntenic to the pseudoautosomal region 1 of human X and Y chromosomes. Phylogenetic analyses show that CD99 has a paralog, XG, that arose by duplication in the last common ancestor of the amniotes.

Graphical Abstract

1. Introduction

In humans, CD99 is a cell surface protein, involved with multiple biological processes including cell adhesion, migration, and cell death. It is essential for the transendothelial migration of immune cells. It plays an important role in lymphocyte development, is involved with the upregulation of MHC class I and II and T cell receptor expression on thymocytes, and has important roles in both T-cell activation and early B-cell lymphopoiesis [1]. The CD99 protein, together with its highly conserved homolog Xg antigen, results in the Xg blood group [2]. CD99 is expressed ubiquitously at low levels, whereas Xg is expressed solely on erythrocytes [3].
Human Xg and CD99 genes, each consisting of 10 or 11 exons (alternative splice site), encode the Xga and CD99 glycoproteins, respectively, which are separately expressed on the red blood cell (RBC) membrane. They are located adjacent to each other in the genome in the same orientation. In humans, the CD99 gene lies within pseudoautosomal region 1, while part of the Xg gene is located within the pseudoautosomal region and a portion is X-specific. Thus, in humans, an incomplete Xg is present on the Y chromosome. The expression of Xg and CD99 is coregulated and these two molecules together define the human Xga/CD99 blood group [4].
An increased understanding of how variability within genes affecting immune responses can impact survival is relevant to improved animal health. The chicken is an excellent non-mammalian model organism for comparative immunology as well as an important food source. Early immunology studies revealed the partitioning of B- and T-cell development [5,6]. Rous sarcoma virus, the first tumor virus, and SRC, the first oncogene, were isolated in the chicken [7]. An increased demand for food due to a rising population stresses the importance of the chicken’s high-quality meat and egg protein. Innate and acquired immune responses are becoming more important due to the declining availability of antibiotics for animal production systems.
There are 13 blood systems known in the chicken which were named alphabetically in their order of discovery [8]. The B blood group system is now known to be the Major Histocompatibility Complex (MHC) which contains multiple immune-related genes and has a major impact on disease resistance [9,10]. The genes responsible for the A and E blood systems have recently been identified as C4BPM and FCAMR, respectively, with both genes having significant roles in the immune response [11]. Very little information on candidate gene or genomic location is known for any of the other chicken blood systems.
The availability of DNA banks comprising multiple DNA samples from individuals serologically typed for various blood systems, plus the availability of genome sequence information for several experimental lines with known blood system alleles, provided the resources for the identification of candidate genes of chicken blood systems. The chicken D blood system was first identified by Briles as the fourth red blood cell antigen [12]. The D system was autosomal with no linkage reported to any other known alloantigen or to multiple phenotypic traits including naked neck (Na) and pea comb (P) [9]. Classical recombination studies utilizing morphological phenotypes, and a chromosome rearrangement breakpoint located the D system gene on chromosome 1 [13].
Information on the biological role of the D system in the chicken is scarce; however, the few available studies point to a role in immune response. Divergent selection for bursa of Fabricius size or antibody response against sheep red blood cells in chicken lines resulted in differences in the frequency of D alleles indicating possible relationships with the D system [14,15,16]. Lipopolysaccharide (LPS) stimulation of macrophages triggered a higher nitrite production in D1D2 chickens compared with D1D3, D1D4, D2D3, and D3D3 genotypes within specific MHC-B genotypes [9]. However, no other D system effects have been reported to date.
Little research has been carried out with the D blood system since these early studies, most likely due to lack of alloantisera for the detection of the antigens. The primary purpose of the research presented herein was to determine the gene responsible for the D blood system antigen in the chicken. Identification of the causative gene, and DNA-based detection of the variants defining the D alleles would provide the tools needed to expand our information on the D system’s relevance to the chicken immune response. The observation that the candidate gene has an immunological impact on the mammalian immune system adds considerable value to the identification of the chicken D blood system antigen.

2. Materials and Methods

2.1. Genetic Material

DNA was available from birds from multiple sources with serologically defined D system alleles and is summarized in Table 1. The Northern Illinois University (NIU) DNA bank contained a set of pedigree samples produced from 2 sires (D1D2) each mated to a single dam (D1D2), with serological information for the segregating progeny. In addition, there were NIU non-pedigree samples having known D1 and/or D2 alleles. The HAS/LAS lines are a pair of lines divergently selected for high (HAS) or low (LAS) antibody levels after immunization with sheep red blood cells. Generations 11 and 13 had previously been typed for the D system and were reported as containing D1, D2, and D4 [15]. DNA was available for generation 41. UCD001 is the inbred Red Jungle Fowl line used as the source of the chicken genome reference (up to GRCg6a), and while its D system allele is not reported, it is likely to be similar to UCD003 [17] as this is not one of the 6 blood system alleles utilized for mapping. Lines WL1–5 are elite White Leghorn (WL) lines from Hy-Line International (Dallas Center, IA, USA) for which multiple generations of individuals with serologically defined D system allele information were available, either as segregating individuals or lines identified as homozygous for a specific allele. Lines RIR1 and WPR1–2 are elite lines from Hy-Line International and are Rhode Island Red (RIR) and White Plymouth Rock (WPR) breeds, respectively.
DNA pools were produced using the two NIU pedigree families for which both sire and dam were heterozygous D1/D2. Three DNA pools were prepared for each family, containing DNA of either D1D1, D1D2, or D2D2 alleles as determined by serology (3–8 DNA per pool, total sample size of 28). An additional set of DNA pools was generated from NIU non-pedigree birds, and was comprised of either D1D1, D1D2, or D2D2 individuals. Genome sequences were available for the five experimental lines listed (courtesy of Jacqueline Smith, Roslin Institute, and Hans Cheng, USDA/ADOL), four of which also had known serologically defined D system alleles. Genome sequences were also available for elite WL, RIR, and WPR layer lines [18].

2.2. GWAS Analysis

SNP genotypes for the three sets of DNA pools were obtained using the Affymetrix Axiom 600 K chicken SNP array. Genotyping was performed by GeneSeek (Lincoln, NE, USA). Genotype calling was performed with Affymetrix Analysis Power Tools. A quality control filter was applied with minor allele frequency of 0.1 and maximum missing genotypes of 0.1 resulting in genotypes from 580,961 SNP available for the analysis.

2.3. Identification of Candidate Gene

Intensity files from the 600 K SNP Affymetrix chip were exported and processed using a custom R script. Calls were recoded as 0 for AA, 1 for AB, and 2 for BB to represent the number of copies of B allele in the individual. A regression equation was developed for each SNP. Expected allele count (based on serology information) was used as response variable and intensity was used as explanatory variable. Coefficient of determination (r2) calculated from regression analysis was used to select SNPs that segregated in accordance with expected pool genotypes. All genes on chromosome 1 were examined, with subsequent focus on region with highest r2 value.
BWA and standard GATK pipeline were used to align sequences to GrCg6a and to call variants. The programme SNPEff [22] was used for variant annotation, while the programme SNPSift [23] identified variants with HIGH and MODERATE impact on the protein function according to sequence ontology terms. Biomart [24] and Uniprot databases [25] were used to identify cellular components for all proteins. Only genes whose proteins were known to be expressed on cell surface membranes were considered. Frequencies of SNPs and INDELs with high and moderate impact in the region of interest were compared between inbred and HYL lines fixed or segregating for D alleles. The candidate protein was the only one that was present on the plasma membrane, with missense variants which matched expected frequencies.

2.4. SNP Selection and Subsequent Genotyping

SNP within the candidate gene were selected based on whether they were predictive for non-synonymous changes, as protein differences are expected to be one cause of serological differences among blood system alleles. Additional synonymous SNP variants were chosen to provide better coverage of the gene, plus one indel within a potential splice region was also included. SNP allele and indel genotypes were identified through PACE ® (PCR Allelic Competitive Extension) chemistry (3CR Bioscience Ltd., Harlow, UK) which employs one common primer with two allele-specific primers and fluorescence detection with end-point reads [26]. All functional SNP within the candidate gene, their genomic and gene location, and their putative nucleotide change with corresponding amino acid change are listed in Table 2.
Samples from individuals with serologically defined D alleles, or from lines known to contain specific D system alleles were genotyped with the SNP set identified within the candidate gene. Specific combinations of SNP alleles were identified and where possible, SNP haplotypes were compared with D serological alleles, to associate specific haplotypes with specific D alleles. These haplotypes were then assigned an identifier matched with the serological moniker. DNA from RIR and WPR lines were included to determine if additional novel haplotypes were present in these two different breeds even though no serological information was available.

2.5. Sequence Comparison Methods

Ortholog and paralog predictions were performed using BLAST [27,28,29]. Synteny analysis was performed using SynSearcher and SynBrowser available from the Synteny Portal [30]. Multiple sequence alignments were performed with COBALT [31] and phylogenetic trees were generated with the interactive Tree of Life web server [32]. Protein topology and secondary structure were predicted using PredictProtein [33] and structural predictions were obtained from AlphaFold [34,35].

3. Results and Discussion

3.1. Identification of D System Candidate Gene

Results from a GWAS analysis using 600 K chicken SNP array genotypes from the pooled samples containing DNA from either D1, D2, or heterozygous pedigree individuals from the NIU DNA bank are shown in Figure 1A. A single strong peak was observed on chromosome 1. Closer examination of chromosome 1 (Figure 1B) shows that this narrow peak encompasses the 129.7 Mbp to 131.4 Mbp region. This region contains five genes with GO terms involving integral components of the membrane: ARSD, ARSE, CD99, P2RY8, and SLC25A6. This genomic region was confirmed by GWAS analysis with an independent set of samples (HYL1; n = 50) for which the D allele was known (D2, D3) and 54 K genotypes were available.
With the genomic region identified and independently confirmed, we attempted to narrow down to the specific gene utilizing available genomic sequences. Four inbred lines were known to contain either the D1 allele (line IAH-72) or the D3 (lines UCD003, IAH-61, and IAH-RHC) (see Table 1). A comparison of the sequences in this region for the D1 vs D3 lines showed that only those SNP within the CD99 gene fit with the known phenotype pattern, i.e., all D3 lines were homozygous and identical and differed from the D1 containing line. This identified the most likely candidate gene as CD99.
A close examination of the sequences of HYL lines for which D allele segregation had been previously identified showed that lines WL1, WL3, and WL4 had SNP segregation within CD99, while lines WL2 and WL5 did not. This specific SNP segregation pattern was not seen within any of the other genes within the candidate region identified in the GWAS analyses, including ARSD and ARSE. With the SNP patterns fitting to the expectation based on serological information, these additional sequences confirmed CD99 as the most likely D blood system candidate gene.

3.2. Haplotype Identification

The SNP panel developed for the CD99 gene detected 7 SNP and one indel, encompassing exons 2 through 11. Genotypes for each assay were obtained from individuals from the HYL lines for which D system allele information was known. A total of 11 unique CD99 haplotypes were found (see Table 2). The three inbred lines that contained the D3 allele were all homozygous for the same CD99 haplotype, which was subsequently labelled CD99-H03. The inbred line with the D1 allele contained a different haplotype, labelled as CD99-H01.
Lines WL2 and WL5, both of which are D3 homozygous, contained only CD99-H03, thus confirming that D3 = CD99-H03. For line WL1, CD99 haplotypes were defined for 58 individuals previously typed as either D2 and/or D3. Two haplotypes were found, CD99-H03 and another haplotype subsequently named CD99-H02; thus, D2 = CD99-H02. There was 100% agreement with the serological D alleles previously assigned and the CD99 haplotypes.
Line WL3 had previously been reported as segregating for D1 and D3, though no individual DNA with known D alleles was available. Within the 120 individuals tested, there were three CD99 haplotypes found: CD99-H01, CD99-H03, and a rare haplotype (found in two half-sibs) which was assigned the label CD99-H06.
Line WL4 segregated for alleles D1 and D3 and contained two CD99 haplotypes, CD99-H01 and CD99-H03, as found in the inbred lines and other Hy-Line lines containing these two serological types. Of the 119 samples for which both serological D alleles and CD99 haplotypes were identified, there was agreement for 106/119 (89%). None of these discrepancies involved the misidentification of homozygotes and are likely due to reagents failing to distinguish heterozygotes accurately.
The HAS and LAS lines both contained CD99-H01 and CD99-H03 and HAS also contained the CD99-H06 haplotype. The D blood system typing of generations 10 and 13 identified both lines as containing D1 and D3, which is consistent with the presence of CD99-H01 and CD99-H03 that we found in the generation 41 samples. Haplotype CD99-H06 is identical to CD99-H03 at all non-synonymous SNP and thus is expected to produce a protein identical to D3. The D4 allele previously identified within the LAS line has likely been lost during the intervening 28 generations.
There were 104 samples from the NIU DNA bank for which both serological D allele and CD99 haplotype information was available. The D alleles identified included D1, D2, and D3, with the three corresponding CD99 haplotypes also being found. There was agreement between the D allele and CD99 haplotype for 96/104 of the samples (92%). Samples with disparate results always had one allele correct, and the second allele was either not identified, or was mis-identified.
CD99 haplotypes were also obtained from samples for which no D blood system was available. These were the different breeds Rhode Island Red (RIR) and White Plymouth Rock (WPR)) and were examined to determine if additional haplotypes were present in different breeds. Within the RIR1 line, there were three haplotypes found (CD99-H04, CD99-H07 and CD-H08), and the WPR lines contained CD99-H01, CD99-H04, and novel haplotypes (CD99-H10 and CD00-H11). None of these have any serological information and thus cannot be assigned to specific D alleles; however, these two novel types differ from another haplotype by only one synonymous SNP and thus are likely not another D serological allele. All haplotypes found are reported in Table 2. A summary of the D system alleles included in this study and their corresponding CD99 haplotypes is given in Table 3. The source of each allele and haplotype is also provided.

3.3. Protein Structure and Antigenicity

Figure 2 shows the predicted structure of the chicken CD99. The signal peptide and transmembrane domains are predicted to be α helices, but the bulk of the molecule is predicted as a random coil. The location of the SNP used to define the chicken CD99 haplotypes is indicated on the figure. Except for the first SNP (V27M) which is on the signal peptide, they are located on the random coil. While these variants are sufficient to induce an antibody response, whether they have any impact on the functionality of the protein is unknown. The extracellular domain likely extends from the end of the signal peptide to the beginning of the transmembrane domain, encompassing the three amino acid variants V37I, G48D, and E124G. The intracellular domain likely extends from the end of the transmembrane domain to the carboxy terminus, and contains no amino acid variants.

3.4. Synteny Analysis

Blast (28, 29), multiple sequence alignments, and synteny analysis were used to further explore the relationships between CD99 and XG. Blast was used to identify CD99 and XG orthologs and a multiple sequence alignment indicates that XG is a paralog of CD99 (Figure 3). CD99 orthologs were identified throughout the Vertebrata subphylum of the phylum Chordata, but absent from the subphylums Tunicata and Cephalochordata, suggesting that CD99 arose in the last common ancestor of the vertebrata. XG was absent from fishes and amphibians and only identified in members of the amniote clade. This suggests that XG arose by the duplication of the CD99 gene in the last common ancestor of the amniotes.
Synteny analysis shows conservation of the gene order between the CD99 chromosomal regions of zebrafish, chicken, and humans (Figure 4). The region has been enlarged in the chicken and human chromosome by the insertion of other genes (Figure 5). This analysis also reveals that CD99 and XG are on the X chromosome in eutherians but on autosomes in other organisms including monotremes and marsupials. In humans, XG is located at the boundary between pseudoautosomal region 1 and X-specific sequences, while in other eutherians, XG is completely within the pseudoautosomal region [36]. It appears as if the CD99 and XG genes were within a large region that remained autosomal in non-eutherians but were incorporated into sex-chromosomes within the last common ancestor of eutherians.

4. Conclusions

Research with human CD99 shows that this molecule is involved with multiple critical biological processes, though the mechanism of action is still not clear. It mediates diverse cellular processes which impact immune function, inflammation, and cancer metastasis [1]. The impact of CD99 (alloantigen D) on traits of chickens is unknown. However, significant CD99 allele frequency differences are found in two sets of chicken lines that have been divergently selected for the antibody response to immunization with sheep red blood cells which include the HAS/LAS pair of lines mentioned herein and the “high” and “low” antibody lines described by Parmentier [37]. The HAS/LAS lines are known to have different responses against certain pathogens. The HAS line was more resistant to Newcastle disease virus, marble spleen disease virus, Marek’s disease virus, Eimeria tenella, Mycoplasma gallisepticum, as well as feather mites. Conversely, LAS chickens were more resistant to Staphylococcus aureus and Escherichia coli compared with HAS chickens [16,38,39]. The observation that two sets of divergently selected lines also showed differences in CD99 haplotype frequencies implies an association with antibody response. Furthermore, certain growth traits favored the LAS line. Females from LAS had higher weight early in life, began egg production earlier, and produced more eggs compared with HAS birds. Again, the alloantigen D system differences may have contributed to these effects [15]. With the development of rapid SNP-based tests to identify variants within CD99, the impact of these variants on various traits can now be studied.
It is worth noting that neither of the GWAS analyses identified the correct candidate gene. For the 600 K SNP data set, the top SNP (p = 2.08 × 10−11) is located 325 Kbp away from the start of CD99. The top SNP within the 54 K SNP data set (p = 8.24 × 10−11) was within the ARSD gene, located 104 Kbp away from CD99. Thus, while the GWAS was able to greatly narrow down the genomic region, it could not identify the actual gene. This is due to insufficient SNP in the region of interest, lack of segregation of those SNP in lines of interest, or lack of association of the SNP (i.e., within non-coding region) with the phenotype. The information from the sequence samples, and use of the candidate gene SNP panel were necessary to unequivocally identify the candidate gene. This shows the SNP chip strength in identifying specific regions, but also shows its limitation in the identification of the correct candidate gene without having additional SNP information from other sources.
In the human, the CD99 protein is known to be critical for the leukocyte transendothelial migration signaling pathway [40], thus playing an important role in leukocyte recruitment into inflamed areas [41]. Significantly different CD99 frequencies within two independent sets of chicken lines divergently selected for antibody response suggests that CD99 is also an important component of the chicken immune response. Further studies specifically examining the impact of CD99 variants are needed to confirm the impact on immune response and the health of the birds. Specific allelic combinations may generate positive or negative consequences which can be modulated through selection.
The thirteen known chicken blood systems were all initially identified with alloantisera, which was often very specific to the lines within which it was produced. The chicken B blood system identifies variations within the chicken MHC, which contains genes with major impacts on the immune system and subsequent response to multiple diseases [10]. It was not until the availability of the genome sequence and the development of DNA-based tests that extensive MHC variability could be detected in multiple chicken breeds, including indigenous populations and wild birds [42,43,44,45]. The genes responsible for the chicken A and E blood systems have been recently identified as C4BPM and FCAMR, which are either within or very close to the Regulators of Complement Activation (RCA) region [11]. Variation within genes of the RCA was initially identified as human blood types (Cromer and Knopps blood systems) [46,47,48], which are known to impact disease outcomes [49]. The gene responsible for a fourth chicken blood system has now been identified as CD99. In humans, the CD99 molecule, in combination with the Xg molecule, defines the human CD99/Xga blood system [50]. Thus, the four chicken blood groups for which the responsible genes have been identified all have an equivalent blood system identified in humans. It will be interesting to determine if this trend will continue when the other nine chicken blood systems are defined.

Author Contributions

Conceptualization, J.E.F. and R.L.T.J.; methodology, J.E.F., R.L.T.J., W.D.-C. and C.J.S.; validation, J.E.F. and A.R.L.; formal analysis, J.E.F. and A.R.L.; resources, J.E.F., W.D.-C. and R.L.T.J.; data curation, J.E.F.; writing—original draft preparation, J.E.F.; writing—review & editing, J.E.F., W.D.-C., C.J.S. and R.L.T.J.; visualization, J.E.F. and C.J.S.; supervision, J.E.F.; funding acquisition, J.E.F. and R.L.T.J. All authors have read and agreed to the published version of the manuscript.

Funding

Funding for this research was provided by the National Institute of Food and Agriculture, US Department of Agriculture National Research Support Program-8 (NRSP-8) co-ordinators, Hatch/Multistate project NE1834 The genetic bases for resistance and immunity to avian disease under accession number 1022063, Hy-Line International, and the West Virginia University Elwood and Ruth Briles Avian Alloantigen Research Fund.

Institutional Review Board Statement

Blood samples were obtained from Hy-Line flocks during routine blood sample collections under the care of the Institutional Animal Care and Use Committee.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to proprietary content from Hy-Line International.

Acknowledgments

The authors are indebted to Rene Kopulos and Linda Srygler who isolated, cataloged, and supplied the DNA of known blood system reactivities for this study (NIU). Molecular graphics and analyses were performed with UCSF Chimera, developed by the Resource for Biocomputing, Visualization, and Informatics at the University of California, San Francisco, with support from NIH P41-GM103311.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Pasello, M.; Manara, M.C.; Scotlandi, K. CD99 at the crossroads of physiology and pathology. J. Cell Commun. Signal. 2018, 12, 55–68. [Google Scholar] [CrossRef]
  2. Johnson, N.C. XG: The forgotten blood group system. Immunohematology 2011, 27, 68–71. [Google Scholar] [CrossRef] [PubMed]
  3. Zimmerman, P.A. XG blood group puzzle solved: What’s next? Blood 2018, 132, 243. [Google Scholar] [CrossRef]
  4. Yeh, C.-C.; Chang, C.-J.; Twu, Y.-C.; Chu, C.-C.; Liu, B.-S.; Huang, J.-T.; Hung, S.-T.; Chan, Y.-S.; Tsai, Y.-J.; Lin, S.-W.; et al. The molecular genetic background leading to the formation of the human erythroid-specific Xg(a)/CD99 blood groups. Blood Adv. 2018, 2, 1854–1864. [Google Scholar] [CrossRef]
  5. Glick, G.; Chang, T.S.; Jaap, R.G. The bursa of Fabricius and antibody production. Poult. Sci. 1956, 35, 224–234. [Google Scholar] [CrossRef]
  6. Cooper, M.D.; Raymond, D.A.; Peterson, R.D.; South, M.A.; Good, R.A. The functions of the thymus system and the bursa system in the chicken. J. Exp. Med. 1966, 123, 75–102. [Google Scholar] [CrossRef]
  7. Weiss, R.A.; Vogt, P.K. 100 years of Rous sarcoma virus. J. Exp. Med. 2011, 208, 2351–2355. [Google Scholar] [CrossRef] [PubMed]
  8. Taylor, R.L., Jr.; Medarova, Z.; Briles, W.E. Immune effects of chicken non-MHC alloantigens. Poult. Sci. 2016, 95, 447–457. [Google Scholar] [CrossRef]
  9. Schierman, L.W.; Nordskog, A.W. Relationship of blood type to histocompatibility in chickens. Science 1961, 134, 1008–1009. [Google Scholar] [CrossRef] [PubMed]
  10. Miller, M.M.; Taylor, R.L., Jr. Brief review of the chicken Major Histocompatibility Complex: The genes, their distribution on chromosome 16, and their contributions to disease resistance. Poult. Sci. 2016, 95, 375–392. [Google Scholar] [CrossRef]
  11. Fulton, J.E.; Drobik-Czwarno, W.; Wolc, A.; McCarron, A.M.; Lund, A.R.; Schmidt, C.J.; Taylor, J.R.L. The Chicken A and E Blood Systems Arise from Genetic Variation in and around the Regulators of Complement Activation Region. J. Immunol. 2022, 209, 1128–1137. [Google Scholar] [CrossRef]
  12. Briles, R.W.; Gilmour, D.G. (Eds.) Erythrcyte Alloantigen Characterisitics; FASEB: Bethesda, MD, USA, 1980. [Google Scholar]
  13. Bitgood, J.J.; Kendall, R.L.; Briles, R.W.; Briles, W.E. Erythrocyte alloantigen loci Ea-D and Ea-I map to chromosome 1 in the chicken. Anim. Genet. 1991, 22, 449–454. [Google Scholar] [CrossRef]
  14. Scott, T.R.; Oduho, G.W.; Glick, B.; Hagan, F.; Briles, W.E.; Yamamoto, Y. Erythrocyte alloantigen diversity and some immunological effects of the B system in related New Hampshire strains. Poult. Sci. 1988, 67, 1210–1217. [Google Scholar] [CrossRef]
  15. Martin, A.; Dunnington, E.A.; Gross, W.B.; Briles, W.E.; Briles, R.W.; Siegel, P.B. Production traits and alloantigen systems in lines of chickens selected for high or low antibody responses to sheep erythrocytes. Poult. Sci. 1990, 69, 871–878. [Google Scholar] [CrossRef]
  16. Dunnington, E.A.; Briles, R.W.; Briles, W.E.; Gross, W.B.; Siegel, P.B. Allelic frequencies in eight alloantigen systems of chickens selected for high and low antibody response to sheep red blood cells. Poult. Sci. 1984, 63, 1470–1472. [Google Scholar] [CrossRef]
  17. Crittenden, L.B.; Provencher, L.; Santangelo, L.; Levin, I.; Abplanalp, H.; Briles, R.W.; Briles, W.E.; Dodgson, J.B. Characterization of a Red Jungle Fowl by White Leghorn Backcross Reference Population for Molecular Mapping of the Chicken Genome. Poult. Sci. 1993, 72, 334–348. [Google Scholar] [CrossRef]
  18. Kranis, A.; Gheyas, A.A.; Boschiero, C.; Turner, F.; Yu, L.; Smith, S.; Talbot, R.; Pirani, A.; Brew, F.; Kaiser, P.; et al. Development of a high density 600K SNP genotyping array for chicken. BMC Genom. 2013, 14, 59. [Google Scholar] [CrossRef]
  19. Abplanalp, H.; Briles, W.E.; Stone, H. (Eds.) Blood Group Systems Part II; FASEB: Bethesda, MD, USA, 1980. [Google Scholar]
  20. Hala, K. Inbred Lines of Avian Species; Toivanen, A., Toivanen, P., Eds.; CRC Press: Boca Raton, FL, USA, 1987. [Google Scholar]
  21. International Chicken Genome Sequencing C. Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution. Nature 2004, 432, 695–716. [Google Scholar] [CrossRef]
  22. Cingolani, P.; Platts, A.; Wang, L.L.; Coon, M.; Nguyen, T.; Wang, L.; Land, S.J.; Lu, X.; Ruden, D.M. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 2012, 6, 80–92. [Google Scholar] [CrossRef] [PubMed]
  23. Cingolani, P.; Patel, V.M.; Coon, M.; Nguyen, T.; Land, S.J.; Ruden, D.M.; Lu, X. Using Drosophila melanogaster as a Model for Genotoxic Chemical Mutational Studies with a New Program, SnpSift. Front. Genet. 2012, 3, 35. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Cunningham, F.; Allen, J.E.; Allen, J.; Alvarez-Jarreta, J.; Amode, M.R.; Armean, I.M.; Austine-Orimoloye, O.; Azov, A.G.; Barnes, I.F.; Bennet, R.; et al. Ensembl 2022. Nucleic Acids Res. 2022, 50, D988–D995. [Google Scholar] [CrossRef] [PubMed]
  25. UniProt C. UniProt: The universal protein knowledgebase in 2021. Nucleic Acids Res. 2021, 49, D480–D489. [Google Scholar]
  26. Somyong, S.; Phetchawang, P.; Bihi, A.K.; Sonthirod, C.; Kongkachana, W.; Sangsrakru, D.; Jomchai, N.; Pootakham, W.; Tangphatsornruang, S. A SNP variation in an expansin (EgExp4) gene affects height in oil palm. PeerJ 2022, 10, e13046. [Google Scholar] [CrossRef] [PubMed]
  27. Ward, N.; Moreno-Hagelsieb, G. Quickly finding orthologs as reciprocal best hits with BLAT, LAST, and UBLAST: How much do we miss? PLoS ONE 2014, 9, e101850. [Google Scholar] [CrossRef] [PubMed]
  28. States, D.J.; Gish, W. Combined use of sequence similarity and codon bias for coding region identification. J. Comput. Biol. 1994, 1, 39–50. [Google Scholar] [CrossRef] [PubMed]
  29. Altschul, S.F.; Gish, W.; Miller, W.; Myers, E.W.; Lipman, D.J. Basic local alignment search tool. J. Mol. Biol. 1990, 215, 403–410. [Google Scholar] [CrossRef]
  30. Lee, J.; Hong, W.Y.; Cho, M.; Sim, M.; Lee, D.; Ko, Y.; Kim, J. Synteny Portal: A web-based application portal for synteny block analysis. Nucleic Acids Res. 2016, 44, W35–W40. [Google Scholar] [CrossRef]
  31. Papadopoulos, J.S.; Agarwala, R. COBALT: Constraint-based alignment tool for multiple protein sequences. Bioinformatics 2007, 23, 1073–1079. [Google Scholar] [CrossRef]
  32. Letunic, I.; Bork, P. Interactive Tree Of Life (iTOL) v5: An online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 2021, 49, W293–W296. [Google Scholar] [CrossRef]
  33. Bernhofer, M.; Dallago, C.; Karl, T.; Satagopam, V.; Heinzinger, M.; Littmann, M.; Olenyi, T.; Qiu, J.; Schütze, K.; Yachdav, G.; et al. PredictProtein-Predicting Protein Structure and Function for 29 Years. Nucleic Acids Res. 2021, 49, W535–W540. [Google Scholar] [CrossRef]
  34. Varadi, M.; Anyango, S.; Deshpande, M.; Nair, S.; Natassia, C.; Yordanova, G.; Yuan, D.; Stroe, O.; Wood, G.; Laydon, A.; et al. AlphaFold Protein Structure Database: Massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res. 2022, 50, D439–D444. [Google Scholar] [CrossRef]
  35. Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Žídek, A.; Potapenko, A.; et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar] [CrossRef] [PubMed]
  36. Raudsepp, T.; Chowdhary, B.P. The Eutherian Pseudoautosomal Region. Cytogenet. Genome Res. 2015, 147, 81–94. [Google Scholar] [CrossRef]
  37. Parmentier, H.K.; Lammers, A.; Hoekman, J.J.; De Vries Reilingh, G.; Zaanen, I.T.; Savelkoul, H.F. Different levels of natural antibodies in chickens divergently selected for specific antibody responses. Dev. Comp. Immunol. 2004, 28, 39–49. [Google Scholar] [CrossRef] [PubMed]
  38. Gross, W.B.; Siegel, P.B. Effects of early environmental stresses on chicken body weight, antibody response to RBC antigens, feed efficiency, and response to fasting. Avian Dis. 1980, 24, 569–579. [Google Scholar] [CrossRef] [PubMed]
  39. Boa-Amponsem, K.; Dunnington, E.A.; Baker, K.S.; Siegel, P.B. Diet and immunological memory of lines of White Leghorn chickens divergently selected for antibody response to sheep red blood cells. Poult. Sci. 1999, 78, 165–170. [Google Scholar] [CrossRef] [PubMed]
  40. Watson, R.L.; Buck, J.; Levin, L.R.; Winger, R.C.; Wang, J.; Arase, H.; Muller, W.A. Endothelial CD99 signals through soluble adenylyl cyclase and PKA to regulate leukocyte transendothelial migration. J. Exp. Med. 2015, 212, 1021–1041. [Google Scholar] [CrossRef]
  41. Bixel, G.; Kloep, S.; Butz, S.; Petri, B.; Engelhardt, B.; Vestweber, D. Mouse CD99 participates in T-cell recruitment into inflamed skin. Blood 2004, 104, 3205–3213. [Google Scholar] [CrossRef]
  42. Manjula, P.; Fulton, J.E.; Seo, D.; Lee, J.H. Major histocompatibility complex B variability in Korean native chicken breeds. Poult. Sci. 2020, 99, 4704–4713. [Google Scholar] [CrossRef]
  43. Tarrant, K.J.; Lopez, R.; Loper, M.; Fulton, J.E. Assessing MHC-B diversity in Silkie chickens. Poult. Sci. 2020, 99, 2337–2341. [Google Scholar] [CrossRef]
  44. Nguyen-Phuc, H.; Fulton, J.E.; Berres, M.E. Genetic variation of major histocompatibility complex (MHC) in wild Red Junglefowl (Gallus gallus). Poult. Sci. 2016, 95, 400–411. [Google Scholar] [CrossRef] [PubMed]
  45. Fulton, J.E.; Lund, A.R.; McCarron, A.M.; Pinegar, K.N.; Korver, D.R.; Classen, H.L.; Aggrey, S.; Utterbach, C.; Anthony, N.B.; Berres, M.E. MHC variability in heritage breeds of chickens. Poult. Sci. 2016, 95, 393–399. [Google Scholar] [CrossRef]
  46. Storry, J.R.; Reid, M.E.; Yazer, M.H. The Cromer blood group system: A review. Immunohematology 2010, 26, 109–118. [Google Scholar] [CrossRef] [PubMed]
  47. Petty, A.C.; Green, C.A.; Poole, J.; Daniels, G.L. Analysis of Knops blood group antigens on CR1 (CD35) by the MAIEA test and by immunoblotting. Transfus. Med. 1997, 7, 55–62. [Google Scholar] [CrossRef]
  48. Moulds, J.M. The Knops blood-group system: A review. Immunohematology 2010, 26, 2–7. [Google Scholar] [CrossRef] [PubMed]
  49. Cooling, L. Blood Groups in Infection and Host Susceptibility. Clin. Microbiol. Rev. 2015, 28, 801–870. [Google Scholar] [CrossRef] [PubMed]
  50. Moller, M.; Lee, Y.Q.; Vidovic, K.; Kjellstrom, S.; Bjorkman, L.; Storry, J.R.; Olsson, M.L. Disruption of a GATA1-binding motif upstream of XG/PBDX abolishes Xg(a) expression and resolves the Xg blood group system. Blood 2018, 132, 334–338. [Google Scholar] [CrossRef] [PubMed]
Figure 1. (A) GWAS obtained from 600 K SNP chip genotypes of pooled DNA from NIU pedigree samples segregating for D1 and D2 alleles. (B) Magnification of significant region on chromosome 1.
Figure 1. (A) GWAS obtained from 600 K SNP chip genotypes of pooled DNA from NIU pedigree samples segregating for D1 and D2 alleles. (B) Magnification of significant region on chromosome 1.
Genes 14 00402 g001
Figure 2. AlphaFold prediction for the structure of chicken CD99 (Uniprot E1CRC1). The transmembrane domains and signal peptide are predicted to be α helices. Other than two short α helices in the extracellular domain, the bulk of the structure is predicted to be random coil.
Figure 2. AlphaFold prediction for the structure of chicken CD99 (Uniprot E1CRC1). The transmembrane domains and signal peptide are predicted to be α helices. Other than two short α helices in the extracellular domain, the bulk of the structure is predicted to be random coil.
Genes 14 00402 g002
Figure 3. Phylogenetic trees for (A) CD99 and XG combined, (B) CD99 only, and (C) XG only. CD99 and XG are paralogous and XG appears to have arisen in the last common ancestor to amniotes (see text).
Figure 3. Phylogenetic trees for (A) CD99 and XG combined, (B) CD99 only, and (C) XG only. CD99 and XG are paralogous and XG appears to have arisen in the last common ancestor to amniotes (see text).
Genes 14 00402 g003
Figure 4. Syntenic relationships between zebrafish (Chr: 1, build: GRCz11), chicken (Chr: 1, build: GRCg7b), and human (Chr: X, build GRCh38.p14) for the CD99 chromosomal region.
Figure 4. Syntenic relationships between zebrafish (Chr: 1, build: GRCz11), chicken (Chr: 1, build: GRCg7b), and human (Chr: X, build GRCh38.p14) for the CD99 chromosomal region.
Genes 14 00402 g004
Figure 5. Syntenic relationship between Chicken chromosome 1 and the Human X chromosome. The brown and gray colors spanning between the Chicken to the Human chromosomes depict a region of synteny between the two species. The red bars indicate the division between the chicken and human chromosomes. In chickens, the syntenic regions span from approximately 110,900,000 to 130,000,000 on chromosome 1. In humans, the region spans from approximately 333,000 to 45,100,000 on chromosome X. The yellow bars indicate the approximate position of the CD99 and XG genes on the respective chromosomes.
Figure 5. Syntenic relationship between Chicken chromosome 1 and the Human X chromosome. The brown and gray colors spanning between the Chicken to the Human chromosomes depict a region of synteny between the two species. The red bars indicate the division between the chicken and human chromosomes. In chickens, the syntenic regions span from approximately 110,900,000 to 130,000,000 on chromosome 1. In humans, the region spans from approximately 333,000 to 45,100,000 on chromosome X. The yellow bars indicate the approximate position of the CD99 and XG genes on the respective chromosomes.
Genes 14 00402 g005
Table 1. Sources of alloantigen system D serologically defined samples used for candidate gene identification.
Table 1. Sources of alloantigen system D serologically defined samples used for candidate gene identification.
SourceBreed 1D alleles presentSamples (n)Reference 2
NIU DNA Bank; PedigreeWL, AnconaD1, D253none
NIU DNA BankWL, AnconaD1, D2, D351none
HAS/LASWLD1, D3, D444[15]
UCD001RJFunk10[17] 3
UCD003WLD320[19,20]
WL1WLD2, D358none
WL2WLD390none
WL3WLD1, D3120none
WL4WLD1, D3119none
WL5WLD3120none
RIR1RIRunk64none
WPR1WPRunk320none
WPR1WPRunk272none
Sequence only Avg. coverageSource
UCD001RJFunk6.6[21]
UCD003WLD318.7Hans Cheng, pers. comm.
IAH-61WLD315.2[18]
IAH-72WLD117.6[18]
IAH-RHCWLD315.9[18]
1 WL = White Leghorn, RJF = Red Jungle Fowl, RIR = Rhode Island Red, WPR = White Plymouth Rock. 2 Reference source for D system type information. 3 UCD 001 types not identified but described as different from the types in UCD 003 (D3). unk = no serology data.
Table 2. Title SNP within CD99 gene used to define haplotypes, the SNP genomic and exon location, their predicted amino acid changes, and all SNP combinations (haplotypes) found. The D blood group serological allele found for each haplotype is given, where known.
Table 2. Title SNP within CD99 gene used to define haplotypes, the SNP genomic and exon location, their predicted amino acid changes, and all SNP combinations (haplotypes) found. The D blood group serological allele found for each haplotype is given, where known.
Haplotype
SNP NameLocation (bp) *Exon Gene LocationCodon
Change
Nucleotide
Change
aaRefAltH01H02H03H04H05H06 AH07H08H09H10H11
rs74153692129,875,0942GTG > ATGG > AV27MGAGGGGGGAGGGG
rs314496839129,871,0243GTT > ATTG > AV37IGAAGGGAGGGAGA
rs312634736129,870,4894GGC > GACG > AG48DGAGAGGAGGGAGG
rs316308207129,867,6347GGT > GGCT > CG95GTCCCCCCTCTTTC
rs10730300129,867,6257CCA > CCGA > GP98PAGAAGAAAAAAAA
rs735363747129,866,1888GAG > GGGA > GE124GAGAAAAAAAAAAA
rs13623448129,866,175intron splice regionrefdeldeldelrefrefrefrefdeldeldelrefdel
rs735519530129,864,26611TTG > CTGT > CL185LTCTTTTTTTTTTC
D blood system serological allele D1D2D3D3unkD3unkunkunkunkunk
Exons and amino acid numbering as defined by ENSGALT00000041401; * location based on GRCg6a; Ref = reference sequence; Del = deleted; A haplotype found in the chicken Junglefowl reference genome; unk = unknown.
Table 3. Alloantigen system D alleles, their corresponding CD99 haplotypes, and sources of each allele.
Table 3. Alloantigen system D alleles, their corresponding CD99 haplotypes, and sources of each allele.
A AlleleHaplotypeSource(s)
D1CD99-H01NIU DNA Bank and NIU pedigree, IAH-72, HAS, LAS, WL4, WL3
D2CD99-H02NIU DNA Bank and NIU pedigree, WL1
D3CD99-H03NIU DNA Bank, IAH-61, IAH-RHC, UCD003, HAS, LAS, WL1, WL2, WL3, WL4, WL5
D3CD99-H04NIU DNA Bank
D3CD99-H06HAS, WL3
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Fulton, J.E.; Drobik-Czwarno, W.; Lund, A.R.; Schmidt, C.J.; Taylor, R.L., Jr. CD99 and the Chicken Alloantigen D Blood System. Genes 2023, 14, 402. https://doi.org/10.3390/genes14020402

AMA Style

Fulton JE, Drobik-Czwarno W, Lund AR, Schmidt CJ, Taylor RL Jr. CD99 and the Chicken Alloantigen D Blood System. Genes. 2023; 14(2):402. https://doi.org/10.3390/genes14020402

Chicago/Turabian Style

Fulton, Janet E., Wiola Drobik-Czwarno, Ashlee R. Lund, Carl J. Schmidt, and Robert L. Taylor, Jr. 2023. "CD99 and the Chicken Alloantigen D Blood System" Genes 14, no. 2: 402. https://doi.org/10.3390/genes14020402

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop