<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="review-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Genes</journal-id>
<journal-title>Genes</journal-title>
<issn pub-type="epub">2073-4425</issn>
<publisher>
<publisher-name>Molecular Diversity Preservation International (MDPI)</publisher-name></publisher></journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3390/genes2010036</article-id>
<article-id pub-id-type="publisher-id">genes-02-00036</article-id>
<article-categories>
<subj-group>
<subject>Review</subject></subj-group></article-categories>
<title-group>
<article-title>An Exceptional Gene: Evolution of the <italic>TSPY</italic> Gene Family in Humans and Other Great Apes</article-title></title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Xue</surname><given-names>Yali</given-names></name></contrib>
<contrib contrib-type="author">
<name><surname>Tyler-Smith</surname><given-names>Chris</given-names></name><xref ref-type="corresp" rid="c1-genes-02-00036"><sup>*</sup></xref></contrib>
<aff id="af1-genes-02-00036">The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambs. CB10 1SA, UK; E-Mail: <email>ylx@sanger.ac.uk</email></aff></contrib-group>
<author-notes>
<corresp id="c1-genes-02-00036">
<label>*</label> Author to whom correspondence should be addressed; E-Mail: <email>cts@sanger.ac.uk</email>; Tel.: +44-1223-495376; Fax: +44-1223-494919.</corresp></author-notes>
<pub-date pub-type="collection">
<year>2011</year></pub-date>
<pub-date pub-type="epub">
<day>10</day>
<month>01</month>
<year>2011</year></pub-date>
<volume>2</volume>
<issue>1</issue>
<fpage>36</fpage>
<lpage>47</lpage>
<history>
<date date-type="received">
<day>24</day>
<month>11</month>
<year>2010</year></date>
<date date-type="rev-recd">
<day>24</day>
<month>12</month>
<year>2010</year></date>
<date date-type="accepted">
<day>28</day>
<month>12</month>
<year>2010</year></date></history>
<permissions>
<copyright-statement>© 2011 by the authors; licensee MDPI, Basel, Switzerland.</copyright-statement>
<copyright-year>2011</copyright-year>
<license>
<p>This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p></license></permissions>
<abstract>
<p>The <italic>TSPY</italic> gene stands out from all other human protein-coding genes because of its high copy number and tandemly-repeated organization. Here, we review its evolutionary history in great apes in order to assess whether these unusual properties are more likely to result from a relaxation of constraint or an unusual functional role. Detailed comparisons with chimpanzee are possible because a finished sequence of the chimpanzee Y chromosome is available, together with more limited data from other apes. These comparisons suggest that the human-chimpanzee ancestral Y chromosome carried a tandem array of <italic>TSPY</italic> genes which expanded on the human lineage while undergoing multiple duplication events followed by pseudogene formation on the chimpanzee lineage. The protein coding region is the most highly conserved of the multi-copy Y genes in human-chimpanzee comparisons, and the analysis of the d<italic>N</italic>/d<italic>S</italic> ratio indicates that <italic>TSPY</italic> is evolutionarily highly constrained, but may have experienced positive selection after the human-chimpanzee split. We therefore conclude that the exceptionally high copy number in humans is most likely due to a human-specific but unknown functional role, possibly involving rapid production of a large amount of <italic>TSPY</italic> protein at some stage during spermatogenesis.</p></abstract>
<kwd-group>
<kwd><italic>TSPY</italic></kwd>
<kwd>Y chromosome</kwd>
<kwd>great ape</kwd>
<kwd>human</kwd>
<kwd>evolution</kwd></kwd-group></article-meta></front>
<body>
<sec sec-type="intro">
<label>1.</label>
<title>Introduction</title>
<p>In 2010, we can view the <italic>TSPY</italic> gene family with the perspective provided by a high-quality finished sequence of the human genome [<xref ref-type="bibr" rid="b1-genes-02-00036">1</xref>], whole-genome sequences from multiple humans generated by the 1000 Genomes Project [<xref ref-type="bibr" rid="b2-genes-02-00036">2</xref>], published draft chimpanzee [<xref ref-type="bibr" rid="b3-genes-02-00036">3</xref>] and macaque [<xref ref-type="bibr" rid="b4-genes-02-00036">4</xref>] genome sequences, and unpublished draft sequences of the gorilla and orangutan which, according to general practice in genomics, are already freely available (<ext-link xlink:href="http://www.ensembl.org/index.html" ext-link-type="uri">http://www.ensembl.org/index.html</ext-link>). When we do this, we see that in humans <italic>TSPY</italic> has the unusual and striking characteristic of being carried at high copy number in a tandemly-repeated array of around 20–40 copies [<xref ref-type="bibr" rid="b5-genes-02-00036">5</xref>]. Most human genes are present in a single copy per haploid genome, and while duplicated genes with two copies are not unusual, protein-coding genes carried in higher copy numbers become progressively rarer [<xref ref-type="bibr" rid="b6-genes-02-00036">6</xref>]. No other protein-coding gene has so many copies. Why should humans have 20–40 copies of just one of their genes, <italic>TSPY</italic>? Many approaches to addressing this question could be taken, including studies of its biochemistry and expression or genetics, and the accompanying articles in this special issue of <italic>Genes</italic> explore several of these areas. Here, we take an evolutionary-genetic approach and review the history of the <italic>TSPY</italic> genes over the last few million years within the great apes. Have the unusual copy number and genomic organization seen in humans been a long-term characteristic of ape genomes, or are they specific to humans or a subset of apes? How rapidly has the <italic>TSPY</italic> amino acid sequence evolved?</p>
<p>We next summarize the steps that have led to our current understanding, and the resources available to address these questions. In the following sections, we present some additional evolutionary analyses based on the datasets available, and then discuss their implications for our understanding of the unusual properties of <italic>TSPY</italic>.</p>
<sec>
<label>1.1.</label>
<title>Human TSPY Genes</title>
<p><italic>TSPY</italic> genes, as implied by their name, are located on the Y chromosome. <italic>TSPY</italic> was in fact one of the first human Y-chromosomal genes to be identified, in 1987 when Arnemann <italic>et al.</italic> published the results of a survey in which they used 18 DNA fragments enriched for Y-chromosomal sequences to search for transcripts, and found evidence for an abundant testis-specific mRNA [<xref ref-type="bibr" rid="b7-genes-02-00036">7</xref>]. The next year, a study of Y-chromosomal repeated sequences identified a tandemly-repeated array, <italic>DYZ5</italic>, on Yp, consisting of multiple homogeneous 20.3 kb units [<xref ref-type="bibr" rid="b5-genes-02-00036">5</xref>], and it soon became apparent that each unit carried a copy of <italic>TSPY</italic>. Several <italic>TSPY</italic>-related sequences were present elsewhere on the chromosome [<xref ref-type="bibr" rid="b8-genes-02-00036">8</xref>], the most substantial being designated <italic>TSPY</italic> minor [<xref ref-type="bibr" rid="b5-genes-02-00036">5</xref>]. Fuller details of the genomic structure, showing the presence of one intact <italic>TSPY</italic> gene at <italic>TSPY</italic> minor, and a more accurate unit size for the major array of 20.4 kb, were revealed by the finished sequence of the euchromatic part of the Y chromosome [<xref ref-type="bibr" rid="b9-genes-02-00036">9</xref>]. Nevertheless, only the edges of the array could be assembled, and a gap corresponding to most of the array remains even within the current human genome reference assembly (GRCh37/hg19; <ext-link xlink:href="http://genome.ucsc.edu/cgi-bin/hgGateway" ext-link-type="uri">http://genome.ucsc.edu/cgi-bin/hgGateway</ext-link>; see <xref ref-type="fig" rid="f1-genes-02-00036">Figure 1</xref>).</p>
<p>There is substantial variation in the number and organization of human <italic>TSPY</italic> genes between individuals, and this can be understood as a consequence of the complex repeated structures of the regions in which they lie. As indicated above, the number of copies in the tandem array varies, most likely as a result of homologous but unequal exchange events between sister chromatids leading to expansion and contraction of the array. Copy numbers reported in population samples range from 27–40 (<italic>n</italic> = 17 [<xref ref-type="bibr" rid="b5-genes-02-00036">5</xref>]), 18–40 (<italic>n</italic> = 42 [<xref ref-type="bibr" rid="b10-genes-02-00036">10</xref>]), 18–48 (<italic>n</italic> = 93 [<xref ref-type="bibr" rid="b11-genes-02-00036">11</xref>]) or 23–64 (<italic>n</italic> = 47 [<xref ref-type="bibr" rid="b12-genes-02-00036">12</xref>]), revealing the presence of greater than three-fold variation. In addition, a ∼4 Mb section of Yp containing the <italic>TSPY</italic> genes can be found in either orientation and has apparently undergone ≥12 inversion events mediated by flanking IR3 repeats during the evolutionary history of extant Y chromosomes [<xref ref-type="bibr" rid="b12-genes-02-00036">12</xref>], around 100,000 years. In one of these orientations, recombination can occur between DNA including the genes <italic>AMELY, TBL1Y</italic> and <italic>PRKY</italic> as well as some <italic>TSPY</italic> copies. Deletion carriers show no overt phenotypic effects, and the deletion is present at a frequency of ∼2% in the Indian subcontinent [<xref ref-type="bibr" rid="b14-genes-02-00036">14</xref>,<xref ref-type="bibr" rid="b15-genes-02-00036">15</xref>].</p></sec>
<sec>
<label>1.2.</label>
<title>Ape TSPY Genes</title>
<p>Two factors limit comparisons between human <italic>TSPY</italic> genes and those of other apes. First, some reference sequences, such as that of the gorilla, have been derived from females and thus provide no information about Y-specific genes such as <italic>TSPY.</italic> Second, although early studies had revealed the likely presence of multiple Y-specific <italic>TSPY</italic> genes in other apes [<xref ref-type="bibr" rid="b7-genes-02-00036">7</xref>,<xref ref-type="bibr" rid="b16-genes-02-00036">16</xref>,<xref ref-type="bibr" rid="b17-genes-02-00036">17</xref>], the complexity of the <italic>TSPY</italic> genomic structure meant that finished sequence was necessary for detailed comparison. Fortunately, a finished Y sequence is available for the chimpanzee [<xref ref-type="bibr" rid="b18-genes-02-00036">18</xref>]. The title of the paper presenting this work was “<italic>Chimpanzee and human Y chromosomes are remarkably divergent in structure and gene content</italic>”. The authors indeed documented 41 differences in gene content between the two Y chromosomes, far in excess of findings on other chromosomes. An earlier genomewide comparison had, for example, identified just 134 gene increase and six decrease events on the human lineage [<xref ref-type="bibr" rid="b19-genes-02-00036">19</xref>]. But even more remarkably, 70% of the Y-chromosomal gene copy differences (involving 29 gene copies) were due to different numbers of <italic>TSPY</italic> genes. Although <italic>TSPY</italic> was the most highly repeated gene on the chimpanzee Y chromosome, with six copies (equal to <italic>RBMY</italic>), the uniquely high copy number of <italic>TSPY</italic> in humans is matched only by the uniquely large differences in copy number between humans and chimpanzees. A more detailed comparison of <italic>TSPY</italic> organization in the two species is provided below in Section 2.</p></sec>
<sec>
<label>1.3.</label>
<title>Why Has TSPY Gene Copy Number Changed Rapidly in the Apes?</title>
<p>Two broad classes of evolutionary explanation could be considered for the extensive differences in <italic>TSPY</italic> copy number between humans and chimpanzees. <italic>TSPY</italic> might be evolving neutrally in one or both species, in which case the difference between six and ∼35 copies would have no functional significance and would be a consequence of neutral genetic drift. The alternative is that <italic>TSPY</italic> retains an important functional role in both species, and the different copy numbers have biological significance. The conservation of multiple <italic>TSPY</italic> copies on both Y chromosomes provides some evidence for functional relevance [<xref ref-type="bibr" rid="b20-genes-02-00036">20</xref>], and studies in humans tend to support the hypothesis of a functional role. <italic>TSPY</italic> is expressed specifically in spermatogonia in the testis [<xref ref-type="bibr" rid="b21-genes-02-00036">21</xref>], and decreased copy number has been associated with impaired spermatogenesis [<xref ref-type="bibr" rid="b22-genes-02-00036">22</xref>], although not in all studies [<xref ref-type="bibr" rid="b23-genes-02-00036">23</xref>,<xref ref-type="bibr" rid="b24-genes-02-00036">24</xref>]. Evolutionary analyses can also address the issue of the likely functional importance of the <italic>TSPY</italic> genes, and are considered below in Section 2.</p></sec></sec>
<sec>
<label>2.</label>
<title>Comparison of Human and Chimpanzee <italic>TSPY</italic> Gene Organization</title>
<sec>
<label>2.1.</label>
<title>TSPY Gene and Pseudogene Copy Numbers</title>
<p>Human <italic>TSPY</italic> genes, as noted above, are located in two regions of the Y: a large tandem array of ∼35 <italic>TSPY</italic> copies, and a single separate gene. Five unprocessed pseudogenes are also annotated in the reference sequence ([<xref ref-type="bibr" rid="b9-genes-02-00036">9</xref>], <xref ref-type="table" rid="t1-genes-02-00036">Table 1</xref>). The chimpanzee Y reference sequence, in contrast, has six active genes divided among three clusters containing four, one and one respectively, but 21 annotated unprocessed pseudogenes ([<xref ref-type="bibr" rid="b18-genes-02-00036">18</xref>], <xref ref-type="table" rid="t1-genes-02-00036">Table 1</xref>). Thus while the numbers of intact genes differ six-fold, the total numbers of genes plus pseudogenes are more similar, particularly when the variation within humans is taken into account.</p></sec>
<sec>
<label>2.2.</label>
<title>TSPY Cluster Relationships</title>
<p>In order to investigate the relationships between the two human and three chimpanzee <italic>TSPY</italic> clusters, we compared them and their flanking sequences using the program DOTTER [<xref ref-type="bibr" rid="b25-genes-02-00036">25</xref>]. Single-copy sequences with high similarity produce a diagonal line in such an analysis; tandem repeats produce a series of diagonal lines offset by the size of the repeating unit. The results of this analysis are shown in <xref ref-type="fig" rid="f1-genes-02-00036">Figure 1</xref>. We can draw four main conclusions. First, the three panels representing the three chimpanzee <italic>TSPY</italic> clusters are broadly similar, apart from the different orientation of the cluster at 4.3 Mb. Second, the similarity between all of the different gene copies in the two species extends far outside the coding region to the entire repeat unit. This is expected from the locations of each chimpanzee <italic>TSPY</italic> cluster within one of the “pink” amplicons [<xref ref-type="bibr" rid="b18-genes-02-00036">18</xref>]. Third, the chimpanzee <italic>TSPY</italic> genes are organized into tandem arrays, but these are much smaller than the major human array, consisting of around four repeat units, and all have substantial deletions disrupting their regular structures. This is consistent with the inactive nature of the majority of the chimpanzee <italic>TSPY</italic> copies. Fourth, there is similarity between the sequences flanking the human major array on both sides and the sequences flanking the chimpanzee arrays. The sequences flanking the minor human array are less closely related.</p>
<p>To examine the phylogenetic relationships between the five <italic>TSPY</italic> clusters in more detail, we identified a flanking region with high sequence similarity (within the red circles in <xref ref-type="fig" rid="f1-genes-02-00036">Figure 1</xref>), aligned the five sequences using ClustalW2 (<ext-link xlink:href="http://www.ebi.ac.uk/Tools/msa/clustalw2/" ext-link-type="uri">http://www.ebi.ac.uk/Tools/msa/clustalw2/</ext-link>) [<xref ref-type="bibr" rid="b26-genes-02-00036">26</xref>] and corrected the alignment manually. The alignment spanned 6277 bp and came from two nearby segments (human 6,147,115–6,148,562 and 6,150,328–6,155,088; 9,400,312–9,401,761 and 9,403,547–9,408,316; chimpanzee 4,275,601–4,277,071 and 4,269,058–4,273,824; 9,990,626–9,992,099 and 9,993,868–9,998,636; 13,631,075–13,632,544 and 13,634,296–13,639,063). After excluding length variations in mononucleotide runs, variable positions consisting of base substitutions or insertions/deletions were identified and, if immediately adjacent, assigned to a single mutational event. In this way, 515 mutational events were identified, 514 of which were consistent with a simple parsimony tree. This unrooted tree is shown in <xref ref-type="fig" rid="f2-genes-02-00036">Figure 2</xref>.</p>
<p>From these two analyses, we can see that the human minor array is distinct from all the other arrays because most of its flanking regions do not align with them (<xref ref-type="fig" rid="f1-genes-02-00036">Figure 1</xref>), and even where there is good alignment, it is the most divergent sequence (<xref ref-type="fig" rid="f2-genes-02-00036">Figure 2</xref>). In contrast, the human major array and its flanking sequences show strong sequence similarity to the three chimpanzee arrays (<xref ref-type="fig" rid="f1-genes-02-00036">Figure 1</xref>). The phylogenetic reconstruction, which in this part of the tree can be rooted using the human minor array as an outgroup, shows that the chimpanzee arrays were all derived from a shared common ancestor on the chimpanzee lineage after the human-chimpanzee split, and that the arrays at 4.3 and 9.9 Mb are the most closely related. Since homologous clusters of multiple <italic>TSPY</italic> genes are found in both species, the common ancestor is likely to have carried such a cluster, and if this is the case, the multiple <italic>TSPY</italic> pseudogenes in the chimpanzee arrays are likely to have been generated independently from functional genes on the chimpanzee lineage. Since the large number of structural differences between the human and chimpanzee Y chromosomes, and the extensive structural polymorphism even within humans, suggest that large numbers of rearrangements have occurred during the descent from this ancestral structure, the intermediate steps cannot be reconstructed in more detail using the current data. The lack of an outgroup, such as a finished gorilla Y chromosome sequence, also limits the deductions we can make about the ancestral structure, although cytogenetic analyses show one signal of intermediate intensity in gorillas [<xref ref-type="bibr" rid="b16-genes-02-00036">16</xref>]. Nevertheless, a simple model consistent with the data available would propose a moderate-sized ancestral tandem array of <italic>TSPY</italic> genes in the human-chimpanzee ancestor, with expansion on the human lineage contrasted with array duplications and pseudogenization on the chimpanzee lineage.</p></sec></sec>
<sec>
<label>3.</label>
<title>Evolution of the <italic>TSPY</italic> Amino acid Sequence</title>
<sec>
<label>3.1.</label>
<title>Divergence of the Coding Sequence between Humans and Chimpanzees</title>
<p>Here we return to the question of whether the rapid evolution of the <italic>TSPY</italic> genes is more likely to represent relaxation of constraint or selection for different functional configurations in humans and chimpanzees. If the <italic>TSPY</italic> genes were functionally unimportant in either species, we would expect to see an increase in the rate of nucleotide substitutions in the coding region, which might approach the rate in predominantly neutral regions of the genome such as introns. Hughes and colleagues [<xref ref-type="bibr" rid="b18-genes-02-00036">18</xref>] tabulated a comparison of the extent of divergence of the ampliconic gene sequences, divided into coding and intronic sequences (<xref ref-type="table" rid="t2-genes-02-00036">Table 2</xref>).</p>
<p><xref ref-type="table" rid="t2-genes-02-00036">Table 2</xref> shows that introns have diverged by 1.4 to 2.9%, similar to the overall average figure 1.7% for the X-degenerate regions of the Y chromosomes. In contrast, the <italic>TSPY</italic> coding region shows 0.8% divergence, substantially lower. These conclusions are consistent with an earlier study that examined part of exon 1, exon 2 and intron 1, and estimated 1.9% divergence for the intron-exon segment as a whole [<xref ref-type="bibr" rid="b27-genes-02-00036">27</xref>]. <italic>TSPY</italic> shows the lowest coding divergence of any of the ampliconic genes, but the highest intronic divergence. It therefore seems unlikely that ampliconic organization or a reduced mutation rate could account for the low coding divergence: functional constraint provides the best explanation.</p></sec>
<sec>
<label>3.2.</label>
<title>The Ratio of Synonymous and Non-Synonymous Changes</title>
<p>To investigate this coding divergence in more detail, we compared the rates of synonymous substitutions, which do not change the coding sequence and are usually neutral, with those of non-synonymous, which alter the coding sequence, using the d<italic>N</italic>/d<italic>S</italic> statistic implemented in DNaSP (<ext-link xlink:href="http://www.ub.edu/dnasp/" ext-link-type="uri">http://www.ub.edu/dnasp/</ext-link>). For these comparisons, we chose the two alternative CCDS annotations of <italic>TSPY</italic> as the human sequences (CCDS48204, CCDS48205), the chimpanzee sequence (ENSPTRT00000055849 with manual annotation according to information from the Page lab and new alignment id), and used the <italic>TSPY</italic> sequence from a new world monkey, the marmoset, as an outgroup (ENSCJAG00000034791). The results are shown in <xref ref-type="table" rid="t3-genes-02-00036">Table 3</xref>.</p>
<p>A d<italic>N</italic>/d<italic>S</italic> value below 1 indicates purifying selection, a value of 1 neutrality, and a value above 1 positive selection. The marmoset provides a relatively distant outgroup, and this has the advantage that many differences between the marmoset-great ape sequences have accumulated. The d<italic>N</italic>/d<italic>S</italic> values between marmoset and either human or chimpanzee are similar, and less than 1, showing that selection has acted to reduce the number of amino acid changes: purifying selection is seen here, as is expected to predominate in protein-coding regions. This supports the idea that these <italic>TSPY</italic> sequences are functional, and have been for most of their evolutionary history. The human-chimpanzee value, in contrast, is positive for CCDS 48204 and undetermined for CCDS 48205 since there are no synonymous differences in this comparison. If the number of synonymous differences were conservatively set to 1 instead of zero, the d<italic>N</italic>/d<italic>S</italic> value would be 2.11. These positive values could indicate positive selection for change in the amino acid sequence of <italic>TSPY</italic> since the human-chimpanzee split. However, this interpretation needs to be considered cautiously: if there had been two synonymous differences in each case, d<italic>N</italic>/d<italic>S</italic> would be close to 1, consistent with neutrality. However, the low number of human-chimpanzee differences in the <italic>TSPY</italic> coding region, compared with introns and other genes (<xref ref-type="table" rid="t2-genes-02-00036">Table 2</xref>) is not consistent with neutrality. In all, the analyses of the <italic>TSPY</italic> coding region support the idea of purifying selection on the <italic>TSPY</italic> sequence for most of its history, and reveal likely positive selection in the last few million years.</p></sec></sec>
<sec sec-type="conclusions">
<label>4.</label>
<title>Conclusions</title>
<p>We wished to understand why humans have so many copies of one of their genes, <italic>TSPY</italic>. Within mammals, <italic>TSPY</italic> copy number is quite variable: mice, for example, have only an inactive copy [<xref ref-type="bibr" rid="b28-genes-02-00036">28</xref>,<xref ref-type="bibr" rid="b29-genes-02-00036">29</xref>], although rats appear to have one functional copy [<xref ref-type="bibr" rid="b28-genes-02-00036">28</xref>], and the copy number within cattle has been reported to vary between about 50 and 200 copies [<xref ref-type="bibr" rid="b30-genes-02-00036">30</xref>,<xref ref-type="bibr" rid="b31-genes-02-00036">31</xref>], although it remains unclear how many of these copies are functional. Within the great apes, <italic>TSPY</italic> copies appear from <italic>in situ</italic> hybridization studies to metaphase chromosomes (which do not distinguish between genes and pseudogenes) to be present at moderate levels in gorillas and bonobos, and at higher levels in orangutans, chimpanzees and humans [<xref ref-type="bibr" rid="b16-genes-02-00036">16</xref>]. <italic>TSPY</italic> is clearly not essential in all mammals, and varies substantially in copy number between even closely related species. Yet the conservation of its protein sequence, both as recognizable homologs in many mammals, and more specifically within the great apes as shown in Section 3 above, points to a functional role. <italic>TSPY</italic> is a member of the broad NAP/SET protein family [<xref ref-type="bibr" rid="b32-genes-02-00036">32</xref>], so a function as a protein chaperone might be sought.</p>
<p>Expression is predominantly in the tests, in spermatogonia, where the protein is seen mainly in the cytoplasm, and varies in extent between cells, being highest in adjacent pairs of cells involved in spermatogonial proliferation, together with a little staining in spermatocytes [<xref ref-type="bibr" rid="b21-genes-02-00036">21</xref>]. This suggests that its role may lie in male reproduction. It is notable that genes involved in reproductive processes are often found to evolve rapidly, showing signals of positive selection in genome-wide surveys. For example, the GO category “gametogenesis” showed evidence for unusually high levels of positive selection in a survey of human protein-coding genes [<xref ref-type="bibr" rid="b33-genes-02-00036">33</xref>], and the categories “gametogenesis”, “spermatogenesis and motility” and “fertilization” all showed similar evidence in a survey detecting positive selection in the last ∼30,000 years [<xref ref-type="bibr" rid="b34-genes-02-00036">34</xref>]. In addition, other differences between humans and chimpanzees in Y-encoded genes implicated in spermatogenesis have been identified [<xref ref-type="bibr" rid="b35-genes-02-00036">35</xref>]: inactivation of USP9Y in humans is usually [<xref ref-type="bibr" rid="b36-genes-02-00036">36</xref>] but not invariably [<xref ref-type="bibr" rid="b37-genes-02-00036">37</xref>] associated with spermatogenic failure, while chimpanzees and bonobos carry an inactive copy inherited from their common ancestor [<xref ref-type="bibr" rid="b38-genes-02-00036">38</xref>] who may have lived more than one million years ago, without manifesting defects in spermatogenesis. <italic>TSPY</italic> may therefore be a member of a class of genes involved in spermatogenesis that continue to evolve rapidly in a variety of ways.</p>
<p>We conclude that <italic>TSPY</italic> is likely to be present at such high copy number in humans because multiple copies have conferred a selective advantage on the human lineage. Although high copy numbers are unusual for protein-coding genes, RNA-coding genes are often highly repeated [<xref ref-type="bibr" rid="b39-genes-02-00036">39</xref>]. Here, no increase in product level by regulation of translation or protein turnover is possible, so more genes are needed to produce more product. Among protein-coding genes, a rare informative example of high copy number is provided by the sea urchin histone genes [<xref ref-type="bibr" rid="b40-genes-02-00036">40</xref>]. Histones are the major structural proteins in chromatin, and high levels are needed for rapid cell division during early development. These examples, together with the expression data, suggest that large amounts of <italic>TSPY</italic> protein are advantageous at some stage during spermatogenesis, and that this advantage is particularly marked in humans. Investigation of male Neanderthal or Denisovan genomes might provide some information on when the <italic>TSPY</italic> copy number increased and how specific this increase was to modern humans. Thus an evolutionary consideration of the <italic>TSPY</italic> gene family suggests multiple further directions for future research.</p></sec></body>
<back>
<sec sec-type="display-objects">
<title>Figures and Tables</title>
<fig id="f1-genes-02-00036" position="float">
<label>Figure 1.</label>
<caption>
<p>Comparison of human and chimpanzee gene clusters. Left panels, human <italic>TSPY</italic> minor cluster at 6.1 Mb; right panels, human <italic>TSPY</italic> major array at 9.1–9.4 Mb including assembly gap shown as a blank region (GRCh37). Top panels, chimpanzee <italic>TSPY</italic> cluster 1 at 4.3 Mb (4 genes); middle panels, chimpanzee <italic>TSPY</italic> cluster 2 at 9.9 Mb (1 gene); bottom panels, chimpanzee <italic>TSPY</italic> cluster 3 at 13.6 Mb (1 gene) (panTro2). Red circles indicate the region analyzed in <xref ref-type="fig" rid="f2-genes-02-00036">Figure 2</xref>.</p></caption>
<graphic xlink:href="genes-02-00036f1.gif"/></fig>
<fig id="f2-genes-02-00036" position="float">
<label>Figure 2.</label>
<caption>
<p>Phylogenetic relationships of the two human and three chimpanzee <italic>TSPY</italic> arrays deduced from comparisons of the flanking sequences circled in <xref ref-type="fig" rid="f1-genes-02-00036">Figure 1</xref> (coordinates in text). The arrays, named according to their approximate genomic coordinates, are shown at the bottom, with active <italic>TSPY</italic> genes indicated by black arrowheads and the flanking region by a red rectangle. The topology of the tree is shown and the number of mutational events on each branch indicated, but the branches are not drawn to scale. Note that no outgroup sequence is available and the tree is unrooted.</p></caption>
<graphic xlink:href="genes-02-00036f2.gif"/></fig>
<table-wrap id="t1-genes-02-00036" position="float">
<label>Table 1.</label>
<caption>
<p>Numbers of <italic>TSPY</italic> gene and pseudogene copies in the human and chimpanzee reference sequences.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top"/>
<th align="left" valign="top"><bold><italic>TSPY</italic> Genes</bold></th>
<th align="left" valign="top"><bold><italic>TSPY</italic> Pseudogenes</bold></th>
<th align="left" valign="top"><bold>Total</bold></th>
<th align="left" valign="top"><bold>Reference</bold></th></tr></thead>
<tbody>
<tr content-type="background-color:#C0C0C0">
<td align="left" valign="top"><bold>Human</bold></td>
<td align="left" valign="top">∼35 + 1</td>
<td align="left" valign="top">5</td>
<td align="left" valign="top">∼41</td>
<td align="left" valign="top">[<xref ref-type="bibr" rid="b9-genes-02-00036">9</xref>]</td></tr>
<tr>
<td align="left" valign="top"><bold>Chimpanzee</bold></td>
<td align="left" valign="top">4 + 1 + 1</td>
<td align="left" valign="top">21</td>
<td align="left" valign="top">27</td>
<td align="left" valign="top">[<xref ref-type="bibr" rid="b18-genes-02-00036">18</xref>]</td></tr></tbody></table></table-wrap>
<table-wrap id="t2-genes-02-00036" position="float">
<label>Table 2.</label>
<caption>
<p>Divergence of human and chimpanzee ampliconic gene sequences. All data are from [<xref ref-type="bibr" rid="b18-genes-02-00036">18</xref>]; NA = not applicable.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top"/>
<th align="left" valign="top"><bold>% Coding Divergence</bold></th>
<th align="left" valign="top"><bold>% Intron Divergence</bold></th></tr></thead>
<tbody>
<tr content-type="background-color:#C0C0C0">
<td align="left" valign="top"><bold><italic>BPY2</italic></bold></td>
<td align="left" valign="top">0.935</td>
<td align="left" valign="top">1.504</td></tr>
<tr>
<td align="left" valign="top"><bold><italic>CDY</italic></bold></td>
<td align="left" valign="top">1.705</td>
<td align="left" valign="top">NA</td></tr>
<tr content-type="background-color:#C0C0C0">
<td align="left" valign="top"><bold><italic>DAZ</italic></bold></td>
<td align="left" valign="top">2.188</td>
<td align="left" valign="top">1.422</td></tr>
<tr>
<td align="left" valign="top"><bold><italic>RBMY</italic></bold></td>
<td align="left" valign="top">2.728</td>
<td align="left" valign="top">2.511</td></tr>
<tr content-type="background-color:#C0C0C0">
<td align="left" valign="top"><bold><italic>TSPY</italic></bold></td>
<td align="left" valign="top">0.794</td>
<td align="left" valign="top">2.888</td></tr>
<tr>
<td align="left" valign="top"><bold><italic>VCY</italic></bold></td>
<td align="left" valign="top">4.298</td>
<td align="left" valign="top">1.579</td></tr>
<tr content-type="background-color:#C0C0C0">
<td align="left" valign="top"><bold><italic>XKRY</italic></bold></td>
<td align="left" valign="top">1.626</td>
<td align="left" valign="top">NA</td></tr></tbody></table></table-wrap>
<table-wrap id="t3-genes-02-00036" position="float">
<label>Table 3.</label>
<caption>
<p>Selective forces on the <italic>TSPY</italic> amino acid sequence assessed using the d<italic>N</italic>/d<italic>S</italic> statistic.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="middle"><bold>Comparison</bold></th>
<th align="left" valign="middle"><bold>Non-Synonymous Differences</bold></th>
<th align="left" valign="middle"><bold>Synonymous Differences</bold></th>
<th align="left" valign="middle"><bold>d<italic>N</italic>/d<italic>S</italic></bold></th></tr></thead>
<tbody>
<tr content-type="background-color:#C0C0C0">
<td colspan="4" align="left" valign="top"><bold><underline>CCDS 48204</underline></bold></td></tr>
<tr>
<td align="left" valign="top"><bold>Marmoset-human</bold></td>
<td align="left" valign="top">71</td>
<td align="left" valign="top">35</td>
<td align="left" valign="top">0.58</td></tr>
<tr content-type="background-color:#C0C0C0">
<td align="left" valign="top"><bold>Marmoset-chimpanzee</bold></td>
<td align="left" valign="top">73</td>
<td align="left" valign="top">34</td>
<td align="left" valign="top">0.61</td></tr>
<tr>
<td align="left" valign="top"><bold>Human-chimpanzee</bold></td>
<td align="left" valign="top">5</td>
<td align="left" valign="top">1</td>
<td align="left" valign="top">1.49</td></tr>
<tr content-type="background-color:#C0C0C0">
<td colspan="4" align="left" valign="top"><bold><underline>CCDS 48205</underline></bold></td></tr>
<tr>
<td align="left" valign="top"><bold>Marmoset-human</bold></td>
<td align="left" valign="top">73</td>
<td align="left" valign="top">34</td>
<td align="left" valign="top">0.62</td></tr>
<tr content-type="background-color:#C0C0C0">
<td align="left" valign="top"><bold>Marmoset-chimpanzee</bold></td>
<td align="left" valign="top">73</td>
<td align="left" valign="top">35</td>
<td align="left" valign="top">0.61</td></tr>
<tr>
<td align="left" valign="top"><bold>Human-chimpanzee</bold></td>
<td align="left" valign="top">7</td>
<td align="left" valign="top">0</td>
<td align="left" valign="top">NA</td></tr></tbody></table></table-wrap></sec>
<ack>
<p>Our work is funded by The Wellcome Trust.</p></ack>
<ref-list>
<title>References</title>
<ref id="b1-genes-02-00036"><label>1.</label><citation citation-type="journal"><person-group person-group-type="author"><collab>International Human Genome Sequencing Consortium</collab></person-group><article-title>Finishing the euchromatic sequence of the human genome</article-title><source>Nature</source><year>2004</year><volume>431</volume><fpage>931</fpage><lpage>945</lpage><pub-id pub-id-type="doi">10.1038/nature03001</pub-id><pub-id pub-id-type="pmid">15496913</pub-id></citation></ref>
<ref id="b2-genes-02-00036"><label>2.</label><citation citation-type="journal"><person-group person-group-type="author"><collab>The 1000 Genomes Consortium</collab></person-group><article-title>A map of human genome variation from population-scale sequencing</article-title><source>Nature</source><year>2010</year><volume>467</volume><fpage>1061</fpage><lpage>1073</lpage><pub-id pub-id-type="doi">10.1038/nature09534</pub-id><pub-id pub-id-type="pmid">20981092</pub-id></citation></ref>
<ref id="b3-genes-02-00036"><label>3.</label><citation citation-type="journal"><person-group person-group-type="author"><collab>Chimpanzee Sequencing and Analysis Consortium</collab></person-group><article-title>Initial sequence of the chimpanzee genome and comparison with the human genome</article-title><source>Nature</source><year>2005</year><volume>437</volume><fpage>69</fpage><lpage>87</lpage><pub-id pub-id-type="doi">10.1038/nature04072</pub-id><pub-id pub-id-type="pmid">16136131</pub-id></citation></ref>
<ref id="b4-genes-02-00036"><label>4.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gibbs</surname><given-names>R.A.</given-names></name><name><surname>Rogers</surname><given-names>J.</given-names></name><name><surname>Katze</surname><given-names>M.G.</given-names></name><name><surname>Bumgarner</surname><given-names>R.</given-names></name><name><surname>Weinstock</surname><given-names>G.M.</given-names></name><name><surname>Mardis</surname><given-names>E.R.</given-names></name><name><surname>Remington</surname><given-names>K.A.</given-names></name><name><surname>Strausberg</surname><given-names>R.L.</given-names></name><name><surname>Venter</surname><given-names>J.C.</given-names></name><name><surname>Wilson</surname><given-names>R.K.</given-names></name><etal/></person-group><article-title>Evolutionary and biomedical insights from the rhesus macaque genome</article-title><source>Science</source><year>2007</year><volume>316</volume><fpage>222</fpage><lpage>234</lpage><pub-id pub-id-type="doi">10.1126/science.1139247</pub-id><pub-id pub-id-type="pmid">17431167</pub-id></citation></ref>
<ref id="b5-genes-02-00036"><label>5.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tyler-Smith</surname><given-names>C.</given-names></name><name><surname>Taylor</surname><given-names>L.</given-names></name><name><surname>Müller</surname><given-names>U.</given-names></name></person-group><article-title>Structure of a hypervariable tandemly repeated DNA sequence on the short arm of the human Y chromosome</article-title><source>J. Mol. Biol.</source><year>1988</year><volume>203</volume><fpage>837</fpage><lpage>848</lpage><pub-id pub-id-type="doi">10.1016/0022-2836(88)90110-6</pub-id><pub-id pub-id-type="pmid">3210241</pub-id></citation></ref>
<ref id="b6-genes-02-00036"><label>6.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Warburton</surname><given-names>P.E.</given-names></name><name><surname>Hasson</surname><given-names>D.</given-names></name><name><surname>Guillem</surname><given-names>F.</given-names></name><name><surname>Lescale</surname><given-names>C.</given-names></name><name><surname>Jin</surname><given-names>X.</given-names></name><name><surname>Abrusan</surname><given-names>G.</given-names></name></person-group><article-title>Analysis of the largest tandemly repeated DNA families in the human genome</article-title><source>BMC Genomics</source><year>2008</year><volume>9</volume><fpage>533</fpage><pub-id pub-id-type="doi">10.1186/1471-2164-9-533</pub-id><pub-id pub-id-type="pmid">18992157</pub-id></citation></ref>
<ref id="b7-genes-02-00036"><label>7.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arnemann</surname><given-names>J.</given-names></name><name><surname>Epplen</surname><given-names>J.T.</given-names></name><name><surname>Cooke</surname><given-names>H.J.</given-names></name><name><surname>Sauermann</surname><given-names>U.</given-names></name><name><surname>Engel</surname><given-names>W.</given-names></name><name><surname>Schmidtke</surname><given-names>J.</given-names></name></person-group><article-title>A human Y-chromosomal DNA sequence expressed in testicular tissue</article-title><source>Nucleic Acids Res.</source><year>1987</year><volume>15</volume><fpage>8713</fpage><lpage>8724</lpage><pub-id pub-id-type="doi">10.1093/nar/15.21.8713</pub-id><pub-id pub-id-type="pmid">3479749</pub-id></citation></ref>
<ref id="b8-genes-02-00036"><label>8.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dechend</surname><given-names>F.</given-names></name><name><surname>Williams</surname><given-names>G.</given-names></name><name><surname>Skawran</surname><given-names>B.</given-names></name><name><surname>Schubert</surname><given-names>S.</given-names></name><name><surname>Krawczak</surname><given-names>M.</given-names></name><name><surname>Tyler-Smith</surname><given-names>C.</given-names></name><name><surname>Schmidtke</surname><given-names>J.</given-names></name></person-group><article-title><italic>TSPY</italic> variants in six loci on the human Y chromosome</article-title><source>Cytogenet. Cell Genet.</source><year>2000</year><volume>91</volume><fpage>67</fpage><lpage>71</lpage><pub-id pub-id-type="doi">10.1159/000056821</pub-id><pub-id pub-id-type="pmid">11173833</pub-id></citation></ref>
<ref id="b9-genes-02-00036"><label>9.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Skaletsky</surname><given-names>H.</given-names></name><name><surname>Kuroda-Kawaguchi</surname><given-names>T.</given-names></name><name><surname>Minx</surname><given-names>P.J.</given-names></name><name><surname>Cordum</surname><given-names>H.S.</given-names></name><name><surname>Hillier</surname><given-names>L.</given-names></name><name><surname>Brown</surname><given-names>L.G.</given-names></name><name><surname>Repping</surname><given-names>S.</given-names></name><name><surname>Pyntikova</surname><given-names>T.</given-names></name><name><surname>Ali</surname><given-names>J.</given-names></name><name><surname>Bieri</surname><given-names>T.</given-names></name><etal/></person-group><article-title>The male-specific region of the human Y chromosome is a mosaic of discrete sequence classes</article-title><source>Nature</source><year>2003</year><volume>423</volume><fpage>825</fpage><lpage>837</lpage><pub-id pub-id-type="doi">10.1038/nature01722</pub-id><pub-id pub-id-type="pmid">12815422</pub-id></citation></ref>
<ref id="b10-genes-02-00036"><label>10.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Oakey</surname><given-names>R.</given-names></name><name><surname>Tyler-Smith</surname><given-names>C.</given-names></name></person-group><article-title>Y chromosome DNA haplotyping suggests that most European and Asian men are descended from one of two males</article-title><source>Genomics</source><year>1990</year><volume>7</volume><fpage>325</fpage><lpage>330</lpage><pub-id pub-id-type="doi">10.1016/0888-7543(90)90165-Q</pub-id><pub-id pub-id-type="pmid">1973137</pub-id></citation></ref>
<ref id="b11-genes-02-00036"><label>11.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mathias</surname><given-names>N.</given-names></name><name><surname>Bayes</surname><given-names>M.</given-names></name><name><surname>Tyler-Smith</surname><given-names>C.</given-names></name></person-group><article-title>Highly informative compound haplotypes for the human Y chromosome</article-title><source>Hum. Mol. Genet.</source><year>1994</year><volume>3</volume><fpage>115</fpage><lpage>123</lpage><pub-id pub-id-type="doi">10.1093/hmg/3.1.115</pub-id><pub-id pub-id-type="pmid">7909247</pub-id></citation></ref>
<ref id="b12-genes-02-00036"><label>12.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Repping</surname><given-names>S.</given-names></name><name><surname>van Daalen</surname><given-names>S.K.</given-names></name><name><surname>Brown</surname><given-names>L.G.</given-names></name><name><surname>Korver</surname><given-names>C.M.</given-names></name><name><surname>Lange</surname><given-names>J.</given-names></name><name><surname>Marszalek</surname><given-names>J.D.</given-names></name><name><surname>Pyntikova</surname><given-names>T.</given-names></name><name><surname>van der Veen</surname><given-names>F.</given-names></name><name><surname>Skaletsky</surname><given-names>H.</given-names></name><name><surname>Page</surname><given-names>D.C.</given-names></name><etal/></person-group><article-title>High mutation rates have driven extensive structural polymorphism among human Y chromosomes</article-title><source>Nat. Genet.</source><year>2006</year><volume>38</volume><fpage>463</fpage><lpage>467</lpage><pub-id pub-id-type="doi">10.1038/ng1754</pub-id><pub-id pub-id-type="pmid">16501575</pub-id></citation></ref>
<ref id="b13-genes-02-00036"><label>13.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Murphy</surname><given-names>K.M.</given-names></name><name><surname>Cohen</surname><given-names>J.S.</given-names></name><name><surname>Goodrich</surname><given-names>A.</given-names></name><name><surname>Long</surname><given-names>P.P.</given-names></name><name><surname>Griffin</surname><given-names>C.A.</given-names></name></person-group><article-title>Constitutional duplication of a region of chromosome Yp encoding <italic>AMELY</italic>, <italic>PRKY</italic>, and <italic>TBL1Y</italic>: implications for sex chromosome analysis and bone marrow engraftment analysis</article-title><source>J. Mol. Diagn.</source><year>2007</year><volume>9</volume><fpage>408</fpage><lpage>413</lpage><pub-id pub-id-type="doi">10.2353/jmoldx.2007.060198</pub-id><pub-id pub-id-type="pmid">17591941</pub-id></citation></ref>
<ref id="b14-genes-02-00036"><label>14.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Santos</surname><given-names>F.R.</given-names></name><name><surname>Pandya</surname><given-names>A.</given-names></name><name><surname>Tyler-Smith</surname><given-names>C.</given-names></name></person-group><article-title>Reliability of DNA-based sex tests</article-title><source>Nat. Genet.</source><year>1998</year><volume>18</volume><fpage>103</fpage><pub-id pub-id-type="doi">10.1038/ng0298-103</pub-id><pub-id pub-id-type="pmid">9462733</pub-id></citation></ref>
<ref id="b15-genes-02-00036"><label>15.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jobling</surname><given-names>M.A.</given-names></name><name><surname>Lo</surname><given-names>I.C.</given-names></name><name><surname>Turner</surname><given-names>D.J.</given-names></name><name><surname>Bowden</surname><given-names>G.R.</given-names></name><name><surname>Lee</surname><given-names>A.C.</given-names></name><name><surname>Xue</surname><given-names>Y.</given-names></name><name><surname>Carvalho-Silva</surname><given-names>D.</given-names></name><name><surname>Hurles</surname><given-names>M.E.</given-names></name><name><surname>Adams</surname><given-names>S.M.</given-names></name><name><surname>Chang</surname><given-names>Y.M.</given-names></name><etal/></person-group><article-title>Structural variation on the short arm of the human Y chromosome: recurrent multigene deletions encompassing Amelogenin Y</article-title><source>Hum. Mol. Genet.</source><year>2007</year><volume>16</volume><fpage>307</fpage><lpage>316</lpage><pub-id pub-id-type="pmid">17189292</pub-id></citation></ref>
<ref id="b16-genes-02-00036"><label>16.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schempp</surname><given-names>W.</given-names></name><name><surname>Binkele</surname><given-names>A.</given-names></name><name><surname>Arnemann</surname><given-names>J.</given-names></name><name><surname>Glaser</surname><given-names>B.</given-names></name><name><surname>Ma</surname><given-names>K.</given-names></name><name><surname>Taylor</surname><given-names>K.</given-names></name><name><surname>Toder</surname><given-names>R.</given-names></name><name><surname>Wolfe</surname><given-names>J.</given-names></name><name><surname>Zeitler</surname><given-names>S.</given-names></name><name><surname>Chandley</surname><given-names>A.C.</given-names></name></person-group><article-title>Comparative mapping of YRRM- and <italic>TSPY</italic>-related cosmids in man and hominoid apes</article-title><source>Chromosome Res.</source><year>1995</year><volume>3</volume><fpage>227</fpage><lpage>234</lpage><pub-id pub-id-type="doi">10.1007/BF00713047</pub-id><pub-id pub-id-type="pmid">7606360</pub-id></citation></ref>
<ref id="b17-genes-02-00036"><label>17.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>J.S.</given-names></name><name><surname>Yang-Feng</surname><given-names>T.L.</given-names></name><name><surname>Muller</surname><given-names>U.</given-names></name><name><surname>Mohandas</surname><given-names>T.K.</given-names></name><name><surname>de Jong</surname><given-names>P.J.</given-names></name><name><surname>Lau</surname><given-names>Y.F.</given-names></name></person-group><article-title>Molecular isolation and characterization of an expressed gene from the human Y chromosome</article-title><source>Hum. Mol. Genet.</source><year>1992</year><volume>1</volume><fpage>717</fpage><lpage>726</lpage><pub-id pub-id-type="doi">10.1093/hmg/1.9.717</pub-id><pub-id pub-id-type="pmid">1284595</pub-id></citation></ref>
<ref id="b18-genes-02-00036"><label>18.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hughes</surname><given-names>J.F.</given-names></name><name><surname>Skaletsky</surname><given-names>H.</given-names></name><name><surname>Pyntikova</surname><given-names>T.</given-names></name><name><surname>Graves</surname><given-names>T.A.</given-names></name><name><surname>van Daalen</surname><given-names>S.K.</given-names></name><name><surname>Minx</surname><given-names>P.J.</given-names></name><name><surname>Fulton</surname><given-names>R.S.</given-names></name><name><surname>McGrath</surname><given-names>S.D.</given-names></name><name><surname>Locke</surname><given-names>D.P.</given-names></name><name><surname>Friedman</surname><given-names>C.</given-names></name><etal/></person-group><article-title>Chimpanzee and human Y chromosomes are remarkably divergent in structure and gene content</article-title><source>Nature</source><year>2010</year><volume>463</volume><fpage>536</fpage><lpage>539</lpage><pub-id pub-id-type="doi">10.1038/nature08700</pub-id><pub-id pub-id-type="pmid">20072128</pub-id></citation></ref>
<ref id="b19-genes-02-00036"><label>19.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fortna</surname><given-names>A.</given-names></name><name><surname>Kim</surname><given-names>Y.</given-names></name><name><surname>MacLaren</surname><given-names>E.</given-names></name><name><surname>Marshall</surname><given-names>K.</given-names></name><name><surname>Hahn</surname><given-names>G.</given-names></name><name><surname>Meltesen</surname><given-names>L.</given-names></name><name><surname>Brenton</surname><given-names>M.</given-names></name><name><surname>Hink</surname><given-names>R.</given-names></name><name><surname>Burgers</surname><given-names>S.</given-names></name><name><surname>Hernandez-Boussard</surname><given-names>T.</given-names></name></person-group><article-title>Lineage-specific gene duplication and loss in human and great ape evolution</article-title><source>PLoS Biol.</source><year>2004</year><volume>2</volume><fpage>E207</fpage><pub-id pub-id-type="doi">10.1371/journal.pbio.0020207</pub-id><pub-id pub-id-type="pmid">15252450</pub-id></citation></ref>
<ref id="b20-genes-02-00036"><label>20.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tyler-Smith</surname><given-names>C.</given-names></name></person-group><article-title>An evolutionary perspective on Y-chromosomal variation and male infertility</article-title><source>Int. J. Androl.</source><year>2008</year><volume>31</volume><fpage>376</fpage><lpage>382</lpage><pub-id pub-id-type="doi">10.1111/j.1365-2605.2008.00889.x</pub-id><pub-id pub-id-type="pmid">18399979</pub-id></citation></ref>
<ref id="b21-genes-02-00036"><label>21.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schnieders</surname><given-names>F.</given-names></name><name><surname>Dörk</surname><given-names>T.</given-names></name><name><surname>Arnemann</surname><given-names>J.</given-names></name><name><surname>Vogel</surname><given-names>T.</given-names></name><name><surname>Werner</surname><given-names>M.</given-names></name><name><surname>Schmidtke</surname><given-names>J.</given-names></name></person-group><article-title>Testis-specific protein, Y-encoded (<italic>TSPY</italic>) expression in testicular tissues</article-title><source>Hum. Mol. Genet.</source><year>1996</year><volume>5</volume><fpage>1801</fpage><lpage>1807</lpage><pub-id pub-id-type="doi">10.1093/hmg/5.11.1801</pub-id><pub-id pub-id-type="pmid">8923009</pub-id></citation></ref>
<ref id="b22-genes-02-00036"><label>22.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Giachini</surname><given-names>C.</given-names></name><name><surname>Nuti</surname><given-names>F.</given-names></name><name><surname>Turner</surname><given-names>D.J.</given-names></name><name><surname>Laface</surname><given-names>I.</given-names></name><name><surname>Xue</surname><given-names>Y.</given-names></name><name><surname>Daguin</surname><given-names>F.</given-names></name><name><surname>Forti</surname><given-names>G.</given-names></name><name><surname>Tyler-Smith</surname><given-names>C.</given-names></name><name><surname>Krausz</surname><given-names>C.</given-names></name></person-group><article-title><italic>TSPY1</italic> copy number variation influences spermatogenesis and shows differences among Y lineages</article-title><source>J. Clin. Endocrinol. Metab.</source><year>2009</year><volume>94</volume><fpage>4016</fpage><lpage>4022</lpage><pub-id pub-id-type="doi">10.1210/jc.2009-1029</pub-id><pub-id pub-id-type="pmid">19773397</pub-id></citation></ref>
<ref id="b23-genes-02-00036"><label>23.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nickkholgh</surname><given-names>B.</given-names></name><name><surname>Noordam</surname><given-names>M.J.</given-names></name><name><surname>Hovingh</surname><given-names>S.E.</given-names></name><name><surname>van Pelt</surname><given-names>A.M.</given-names></name><name><surname>van der Veen</surname><given-names>F.</given-names></name><name><surname>Repping</surname><given-names>S.</given-names></name></person-group><article-title>Y chromosome <italic>TSPY</italic> copy numbers and semen quality</article-title><source>Fertil. Steril.</source><year>2010</year><volume>94</volume><fpage>1744</fpage><lpage>1747</lpage><pub-id pub-id-type="doi">10.1016/j.fertnstert.2009.09.051</pub-id><pub-id pub-id-type="pmid">19917507</pub-id></citation></ref>
<ref id="b24-genes-02-00036"><label>24.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vodicka</surname><given-names>R.</given-names></name><name><surname>Vrtel</surname><given-names>R.</given-names></name><name><surname>Dusek</surname><given-names>L.</given-names></name><name><surname>Singh</surname><given-names>A.R.</given-names></name><name><surname>Krizova</surname><given-names>K.</given-names></name><name><surname>Svacinova</surname><given-names>V.</given-names></name><name><surname>Horinova</surname><given-names>V.</given-names></name><name><surname>Dostal</surname><given-names>J.</given-names></name><name><surname>Oborna</surname><given-names>I.</given-names></name><name><surname>Brezinova</surname><given-names>J.</given-names></name><etal/></person-group><article-title><italic>TSPY</italic> gene copy number as a potential new risk factor for male infertility</article-title><source>Reprod. Biomed. Online</source><year>2007</year><volume>14</volume><fpage>579</fpage><lpage>587</lpage><pub-id pub-id-type="doi">10.1016/S1472-6483(10)61049-8</pub-id><pub-id pub-id-type="pmid">17509197</pub-id></citation></ref>
<ref id="b25-genes-02-00036"><label>25.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sonnhammer</surname><given-names>E.L.</given-names></name><name><surname>Durbin</surname><given-names>R.</given-names></name></person-group><article-title>A dot-matrix program with dynamic threshold control suited for genomic DNA and protein sequence analysis</article-title><source>Gene</source><year>1995</year><volume>167</volume><fpage>GC1</fpage><lpage>10</lpage><pub-id pub-id-type="doi">10.1016/0378-1119(95)00714-8</pub-id><pub-id pub-id-type="pmid">8566757</pub-id></citation></ref>
<ref id="b26-genes-02-00036"><label>26.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chenna</surname><given-names>R.</given-names></name><name><surname>Sugawara</surname><given-names>H.</given-names></name><name><surname>Koike</surname><given-names>T.</given-names></name><name><surname>Lopez</surname><given-names>R.</given-names></name><name><surname>Gibson</surname><given-names>T.J.</given-names></name><name><surname>Higgins</surname><given-names>D.G.</given-names></name><name><surname>Thompson</surname><given-names>J.D.</given-names></name></person-group><article-title>Multiple sequence alignment with the Clustal series of programs</article-title><source>Nucleic Acids Res.</source><year>2003</year><volume>31</volume><fpage>3497</fpage><lpage>3500</lpage><pub-id pub-id-type="doi">10.1093/nar/gkg500</pub-id><pub-id pub-id-type="pmid">12824352</pub-id></citation></ref>
<ref id="b27-genes-02-00036"><label>27.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname><given-names>H.S.</given-names></name><name><surname>Takenaka</surname><given-names>O.</given-names></name></person-group><article-title>A comparison of <italic>TSPY</italic> genes from Y-chromosomal DNA of the great apes and humans: sequence, evolution, and phylogeny</article-title><source>Am. J. Phys Anthropol.</source><year>1996</year><volume>100</volume><fpage>301</fpage><lpage>309</lpage><pub-id pub-id-type="doi">10.1002/(SICI)1096-8644(199607)100:3&lt;301::AID-AJPA1&gt;3.0.CO;2-X</pub-id><pub-id pub-id-type="pmid">8798990</pub-id></citation></ref>
<ref id="b28-genes-02-00036"><label>28.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mazeyrat</surname><given-names>S.</given-names></name><name><surname>Mitchell</surname><given-names>M.J.</given-names></name></person-group><article-title>Rodent Y chromosome <italic>TSPY</italic> gene is functional in rat and non-functional in mouse</article-title><source>Hum. Mol. Genet.</source><year>1998</year><volume>7</volume><fpage>557</fpage><lpage>562</lpage><pub-id pub-id-type="doi">10.1093/hmg/7.3.557</pub-id><pub-id pub-id-type="pmid">9467017</pub-id></citation></ref>
<ref id="b29-genes-02-00036"><label>29.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vogel</surname><given-names>T.</given-names></name><name><surname>Boettger-Tong</surname><given-names>H.</given-names></name><name><surname>Nanda</surname><given-names>I.</given-names></name><name><surname>Dechend</surname><given-names>F.</given-names></name><name><surname>Agulnik</surname><given-names>A.I.</given-names></name><name><surname>Bishop</surname><given-names>C.E.</given-names></name><name><surname>Schmid</surname><given-names>M.</given-names></name><name><surname>Schmidtke</surname><given-names>J.</given-names></name></person-group><article-title>A murine <italic>TSPY</italic></article-title><source>Chromosome Res.</source><year>1998</year><volume>6</volume><fpage>35</fpage><lpage>40</lpage><pub-id pub-id-type="doi">10.1023/A:1009214307764</pub-id><pub-id pub-id-type="pmid">9510508</pub-id></citation></ref>
<ref id="b30-genes-02-00036"><label>30.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hamilton</surname><given-names>C.K.</given-names></name><name><surname>Favetta</surname><given-names>L.A.</given-names></name><name><surname>Di Meo</surname><given-names>G.P.</given-names></name><name><surname>Floriot</surname><given-names>S.</given-names></name><name><surname>Perucatti</surname><given-names>A.</given-names></name><name><surname>Peippo</surname><given-names>J.</given-names></name><name><surname>Kantanen</surname><given-names>J.</given-names></name><name><surname>Eggen</surname><given-names>A.</given-names></name><name><surname>Iannuzzi</surname><given-names>L.</given-names></name><name><surname>King</surname><given-names>W.A.</given-names></name></person-group><article-title>Copy number variation of testis-specific protein, Y-encoded (<italic>TSPY</italic>) in 14 different breeds of cattle (Bos taurus)</article-title><source>Sex Dev.</source><year>2009</year><volume>3</volume><fpage>205</fpage><lpage>213</lpage><pub-id pub-id-type="doi">10.1159/000228721</pub-id><pub-id pub-id-type="pmid">19752600</pub-id></citation></ref>
<ref id="b31-genes-02-00036"><label>31.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jakubiczka</surname><given-names>S.</given-names></name><name><surname>Schnieders</surname><given-names>F.</given-names></name><name><surname>Schmidtke</surname><given-names>J.</given-names></name></person-group><article-title>A bovine homologue of the human <italic>TSPY</italic> gene</article-title><source>Genomics</source><year>1993</year><volume>17</volume><fpage>732</fpage><lpage>735</lpage><pub-id pub-id-type="doi">10.1006/geno.1993.1394</pub-id><pub-id pub-id-type="pmid">8244389</pub-id></citation></ref>
<ref id="b32-genes-02-00036"><label>32.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vogel</surname><given-names>T.</given-names></name><name><surname>Schmidtke</surname><given-names>J.</given-names></name></person-group><article-title>Structure and function of <italic>TSPY</italic>, the Y-chromosome gene coding for the “testis-specific protein”</article-title><source>Cytogenet. Cell Genet.</source><year>1998</year><volume>80</volume><fpage>209</fpage><lpage>213</lpage><pub-id pub-id-type="doi">10.1159/000014982</pub-id><pub-id pub-id-type="pmid">9678360</pub-id></citation></ref>
<ref id="b33-genes-02-00036"><label>33.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bustamante</surname><given-names>C.D.</given-names></name><name><surname>Fledel-Alon</surname><given-names>A.</given-names></name><name><surname>Williamson</surname><given-names>S.</given-names></name><name><surname>Nielsen</surname><given-names>R.</given-names></name><name><surname>Hubisz</surname><given-names>M.T.</given-names></name><name><surname>Glanowski</surname><given-names>S.</given-names></name><name><surname>Tanenbaum</surname><given-names>D.M.</given-names></name><name><surname>White</surname><given-names>T.J.</given-names></name><name><surname>Sninsky</surname><given-names>J.J.</given-names></name><name><surname>Hernandez</surname><given-names>R.D.</given-names></name><etal/></person-group><article-title>Natural selection on protein-coding genes in the human genome</article-title><source>Nature</source><year>2005</year><volume>437</volume><fpage>1153</fpage><lpage>1157</lpage><pub-id pub-id-type="doi">10.1038/nature04240</pub-id><pub-id pub-id-type="pmid">16237444</pub-id></citation></ref>
<ref id="b34-genes-02-00036"><label>34.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Voight</surname><given-names>B.F.</given-names></name><name><surname>Kudaravalli</surname><given-names>S.</given-names></name><name><surname>Wen</surname><given-names>X.</given-names></name><name><surname>Pritchard</surname><given-names>J.K.</given-names></name></person-group><article-title>A map of recent positive selection in the human genome</article-title><source>PLoS Biol.</source><year>2006</year><volume>4</volume><fpage>e72</fpage><pub-id pub-id-type="doi">10.1371/journal.pbio.0040072</pub-id><pub-id pub-id-type="pmid">16494531</pub-id></citation></ref>
<ref id="b35-genes-02-00036"><label>35.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tyler-Smith</surname><given-names>C.</given-names></name><name><surname>Krausz</surname><given-names>C.</given-names></name></person-group><article-title>The will-o'-the-wisp of genetics-hunting for the azoospermia factor gene</article-title><source>N Engl. J. Med.</source><year>2009</year><volume>360</volume><fpage>925</fpage><lpage>927</lpage><pub-id pub-id-type="doi">10.1056/NEJMe0900301</pub-id><pub-id pub-id-type="pmid">19246366</pub-id></citation></ref>
<ref id="b36-genes-02-00036"><label>36.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sun</surname><given-names>C.</given-names></name><name><surname>Skaletsky</surname><given-names>H.</given-names></name><name><surname>Birren</surname><given-names>B.</given-names></name><name><surname>Devon</surname><given-names>K.</given-names></name><name><surname>Tang</surname><given-names>Z.</given-names></name><name><surname>Silber</surname><given-names>S.</given-names></name><name><surname>Oates</surname><given-names>R.</given-names></name><name><surname>Page</surname><given-names>D.C.</given-names></name></person-group><article-title>An azoospermic man with a <italic>de novo</italic> point mutation in the Y-chromosomal gene USP9Y</article-title><source>Nat. Genet.</source><year>1999</year><volume>23</volume><fpage>429</fpage><lpage>432</lpage><pub-id pub-id-type="doi">10.1038/70539</pub-id><pub-id pub-id-type="pmid">10581029</pub-id></citation></ref>
<ref id="b37-genes-02-00036"><label>37.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Luddi</surname><given-names>A.</given-names></name><name><surname>Margollicci</surname><given-names>M.</given-names></name><name><surname>Gambera</surname><given-names>L.</given-names></name><name><surname>Serafini</surname><given-names>F.</given-names></name><name><surname>Cioni</surname><given-names>M.</given-names></name><name><surname>De Leo</surname><given-names>V.</given-names></name><name><surname>Balestri</surname><given-names>P.</given-names></name><name><surname>Piomboni</surname><given-names>P.</given-names></name></person-group><article-title>Spermatogenesis in a man with complete deletion of USP9Y</article-title><source>N Engl. J. Med.</source><year>2009</year><volume>360</volume><fpage>881</fpage><lpage>885</lpage><pub-id pub-id-type="doi">10.1056/NEJMoa0806218</pub-id><pub-id pub-id-type="pmid">19246359</pub-id></citation></ref>
<ref id="b38-genes-02-00036"><label>38.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Perry</surname><given-names>G.H.</given-names></name><name><surname>Tito</surname><given-names>R.Y.</given-names></name><name><surname>Verrelli</surname><given-names>B.C.</given-names></name></person-group><article-title>The evolutionary history of human and chimpanzee Y-chromosome gene loss</article-title><source>Mol. Biol. Evol.</source><year>2007</year><volume>24</volume><fpage>853</fpage><lpage>859</lpage><pub-id pub-id-type="pmid">17218643</pub-id></citation></ref>
<ref id="b39-genes-02-00036"><label>39.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wallace</surname><given-names>H.</given-names></name><name><surname>Birnstiel</surname><given-names>M.L.</given-names></name></person-group><article-title>Ribosomal cistrons and the nucleolar organizer</article-title><source>Biochim. Biophys. Acta</source><year>1966</year><volume>114</volume><fpage>296</fpage><lpage>310</lpage><pub-id pub-id-type="doi">10.1016/0005-2787(66)90311-X</pub-id><pub-id pub-id-type="pmid">5943882</pub-id></citation></ref>
<ref id="b40-genes-02-00036"><label>40.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kedes</surname><given-names>L.H.</given-names></name><name><surname>Birnstiel</surname><given-names>L.H.</given-names></name></person-group><article-title>Reiteration and clustering of DNA sequences complementary to histone messenger RNA</article-title><source>Nat. New Biol.</source><year>1971</year><volume>230</volume><fpage>165</fpage><lpage>169</lpage><pub-id pub-id-type="pmid">5279989</pub-id></citation></ref></ref-list></back></article>
