You are currently viewing a new version of our website. To view the old version click .
Genes
  • Article
  • Open Access

15 October 2025

Investigating the “Dark” Genome: First Report of Partington Syndrome in Cyprus

,
,
,
,
,
,
,
,
and
1
Department of Clinical Genetics and Genomics, The Cyprus Institute of Neurology and Genetics, 2371 Nicosia, Cyprus
2
Department of Cytogenetics and Genomics, The Cyprus Institute of Neurology and Genetics, 2371 Nicosia, Cyprus
3
Waltham Petcare Science Institute, Melton Mowbray, Leicestershire LE14 4RT, UK
4
Neurogenetics Department, The Cyprus Institute of Neurology and Genetics, 2371 Nicosia, Cyprus
Genes2025, 16(10), 1224;https://doi.org/10.3390/genes16101224 
(registering DOI)
This article belongs to the Special Issue Molecular Basis and Genetics of Intellectual Disability

Abstract

Background/Objectives: X-linked intellectual disability (XLID) is a highly heterogeneous disorder accounting for ~10% of all males with ID. Next-generation sequencing (NGS) has revolutionized the discovery of causal XLID genes and variants; however, many cases remain unresolved. We present a four-generation syndromic XLID family with multiple males exhibiting variable degree of ID, focal dystonia and epilepsy. Methods: Extensive cytogenetic and targeted genetic testing was initially performed, followed by whole-exome sequencing (WES) and short-read whole-genome sequencing (WGS). Apart from the routine NGS analysis pipelines, sequencing data was revisited by focusing on poorly covered/mapped regions on chromosome X (chrX), to potentially reveal unidentified clinically relevant variants. Candidate variant validation and family segregation analysis were performed with Sanger sequencing. Results: All initial diagnostic testing was negative. Subsequently, 300 previously reported “dark” chrX coding DNA sequences, overlapping 97 genes, were cross-checked against 29 chrX genes highly associated (p < 0.05) with ID and focal dystonia, according to Phenomizer. Manual inspection of the existing NGS data in two low-coverage regions, chrX:25013469-25013696 and chrX:111744737-111744820 (hg38), revealed a recurrent pathogenic ARX variant NM_139058.3:c.441_464dup p.(Ala148_Ala155dup) (ARXdup24) associated with non-syndromic or syndromic XLID, including Partington syndrome. Sanger sequencing confirmed ARXdup24 in all affected males, with carrier status in their unaffected mothers, and absence in other unaffected relatives. Conclusions: After several years of diagnostic odyssey, the pathogenic ARXdup24 variant was unmasked, supporting a genotype–phenotype correlation in the first Partington syndrome family in Cyprus. This study highlights that re-examining underrepresented genomic regions and using phenotype-driven tools can provide critical diagnostic insights in unresolved XLID cases.

1. Introduction

Intellectual disability (ID) is a genetically heterogeneous neurodevelopmental disorder characterized by significant limitations in both cognitive functioning and adaptive behaviour. X-linked intellectual disability (XLID) is caused by mutations in genes located on chromosome X (chrX) and accounts for ~10% of ID in males [,]. Based on the clinical presentation, XLID can be categorised as non-syndromic when isolated, or as syndromic when it presents with one or multiple clinical features or co-morbidities. Fragile X syndrome (FXS), typically caused by an expansion of the CGG triplet repeat (>200) in the fragile X messenger ribonucleoprotein 1 (FMR1) gene on Xq27.3, is the most common inherited form of intellectual disability, and the leading form of XLID []. Apart from FXS, around 200 XLID syndromes have been characterized to date, while thus far, 172 XLID genes have been identified, accounting for 157 of these syndromes [,].
The conventional strategies for the identification of XLID and/or associated genes/conditions include family pedigree analysis [], linkage analysis [], chromosomal analysis often coupled with fluorescence in situ hybridization (FISH) [], as well as array-based methods [,,]. Especially in carriers of balanced X-autosome translocations, fine-mapping of the breakpoints has been instrumental for the characterization of candidate genes implicated in XLID []. Next-generation sequencing (NGS) revolutionized the discovery of XLID genes and gene variants by assessing the chrX, exome or genome of patients in an efficient, cost-effective and high-throughput manner [,,,,,,]. Despite the finer resolution and deeper mechanistic insight offered by the evolving XLID research methods over time, many patients with syndromic or non-syndromic XLID remain without a genetic diagnosis. This diagnostic pitfall could be due to the involvement of yet to be discovered novel genes, or the presence of non-coding or regulatory causative variants, such as deep intronic splice variants, that cannot be captured by whole-exome sequencing (WES) or are not confidently supported even if identified by whole-genome sequencing (WGS) []. The presence of single-nucleotide variants (SNVs), structural variants or complex rearrangements often missed by short-read NGS methods, due to coverage and mapping limitations, could also be another explanation [,,].
In 2019, Ebbert et al. demonstrated the existence of approximately 37 thousand “dark” regions throughout the genome using short-read NGS technologies; that is, genomic regions with few or no mappable reads (dark-by-depth) or genomic regions with aligned reads but with low mapping quality (dark-by-mapping quality (MAPQ)) []. Such “dark” genomic regions span protein-coding exons from ~800 genes involved in many biological pathways relevant to human health and development, while ~10% of these genes are associated with known human diseases []. For example, the Survival of motor neuron 1 (SMN1) and Survival of motor neuron 2 (SMN2) genes (“dark” by 94% and 88%, respectively), both associated with spinal muscular atrophy (OMIM * 600354 and * 601627), the Aristaless-related homeobox (ARX) gene (“dark” by 12.8%) associated with a spectrum of X-linked developmental disorders including Partington syndrome (OMIM * 300382), and the T-box transcription factor 1 (TBX1) gene (“dark” by 10.6%) associated with the 22q11.2 deletion syndrome (OMIM * 602054), are all located within “dark” genomic regions []. Since disease-causing variants in such regions often escape the detection capabilities of standard short-read NGS methods, focused re-evaluation of existing NGS data in unresolved cases could be useful before proceeding with alternative diagnostic approaches.
We present a four-generation family with a suspected syndromic XLID disorder that remained undiagnosed despite extensive cytogenetic and molecular testing, including WES and short-read WGS. This study aims to perform an NGS data reanalysis, focusing on “dark” genomic regions on chrX, to potentially unmask any hidden variants in underrepresented genomic regions implicated in XLID.

2. Materials and Methods

2.1. Family Description and Clinical Evaluation

The family presented here included nine affected males across four generations; however, a thorough clinical evaluation was initially performed in four of the affected males (III:3, III:4, IV:18 and IV:25) (Figure 1). Three male family members were deceased (II:3, II:5 and II:10), while a detailed clinical description was not available for the remaining affected individuals (II:14 and IV:15). All four clinically evaluated patients had variable ID, epilepsy and movement abnormalities that included focal (hand) dystonia (Figure 1). The high incidence of affected male individuals in the family, the absence of phenotype in the mothers of affected males, and the absence of male-to-male transmission suggested an underlying X-linked disorder.
Figure 1. Schematic pedigree representation of the four-generation family with a suspected X-linked disorder. Affected males are indicated with colour-filled squares, while non-affected carrier females are marked with a dot inside a circle.

2.2. Cytogenetic and Initial Molecular Testing

Chromosomal analysis was performed from heparinized peripheral blood samples using the conventional G-banding technique at an average resolution of 550 bands. Peripheral blood in EDTA tubes was subjected to DNA extraction using the QIAamp DNA Blood Midi kit (Qiagen, Hilden, Germany) according to the manufacturer’s protocol.
In order to investigate the possibility of small copy number changes (deletions/duplications) that are beyond the resolution of chromosomal analysis, array-CGH was also performed with two types of arrays. An upgraded locally-designed chrX exon specific array was used first, with over 90,000 oligonucleotide probes on chrX (Oxford Gene Technology, Kidlington, UK) [], followed by a genome-wide CytoChip Oligo Array with 105,000 oligonucleotides (version 1.0—BlueGnome, Cambridge, UK). Array images were acquired using the Agilent microarray scanner G2565B (Agilent, Santa Clara, CA, USA), while fluorescent ratios were calculated using either the CytoSure Interpret Software (Oxford Gene Technology) or the BlueFuse Multi software (version 4.2—BlueGnome).
Fifty nanograms of the extracted DNA was used for Fragile X testing. Briefly, the number of the tandem (CGG) repeats at the FMR1 gene was determined using the appropriate forward (5′-GCTCAGCTCCGTTTCGGTTTCACTTCCGGT-3′—FAM labeled) and reverse (5′-AGCCCCGCACTTCCACCACCAGCTCCTCCA-3′) primers and the HotStarTaq Plus DNA Polymerase (Qiagen), following the manufacturer’s recommendations. The fluorescently labelled amplicon (281bp) was then mixed with Hi-Di™ Formamide (Applied Biosystems, Thermo Fisher Scientific, Waltham, MA, USA), and the GeneScan™ 500 LIZ™ dye Size Standard (Applied Biosystems), followed by brief denaturation and fragment analysis on the 3130xl Genetic Analyzer (Applied Biosystems).

2.3. Next-Generation Sequencing

WES was performed on samples from individuals III:3 and III:4 (affected siblings), as a service by Oxford Gene Technology, using the SureSelect DNA—Human All Exon Kit (version 1.2) (Agilent), according to the manufacturer’s recommendations. The enriched libraries for each sample were quantified using the qPCR NGS Library Quantification Kit (Agilent), followed by pooling and 100bp paired-end sequencing (TruSeq v3 mode) on the HiSeq2000 platform (Illumina Inc., San Diego, CA, USA). WES data was analysed using an in-house bioinformatics exome analysis pipeline as described previously [,]. WES data analysis was initially focused on high/medium impact variants (i.e., splice site, frameshift, nonsense, stop-loss, missense) (>20× coverage) overlapping XLID genes, as found in the OMIM® database (https://www.omim.org/), GeneCards database (https://www.genecards.org/), and XLID gene list by the Greenwood Genetic Centre (https://ggc.org/). A CNV analysis using the WES datasets from the affected siblings was performed using CoNIFER (version 0.2.2) [].
Short-read WGS was performed on samples from individuals III:3, III:4, II:1 and II:2, as a service by Macrogen Europe, using the TruSeq DNA PCR-free kit (Illumina), according to the manufacturer’s recommendations. Paired-end sequences (2 × 150 bp) generated by the NovaSeq6000 platform (Illumina) were aligned to the human genome assembly GRCh38 (hg38) using the Isaac aligner (version 04.18.11.09) []. SNVs and short insertions and deletions (indels), structural variants and large indels, and CNVs were identified using Strelka (version 2.9.10) [], Manta (version 1.5.0) [], and Control-FREEC (version 11.5) [], respectively. Variant annotation and effect prediction were performed using SnpEff (version 4.3t) [] and an in-house program by Macrogen Europe. WGS data analysis was initially focused on chrX variants commonly found between the affected male siblings tested and inherited in an X-linked manner.

2.4. Reanalysis of “Dark”Genomic Regions

In order to potentially reveal unidentified clinically relevant variants in “dark” genomic regions, the existing NGS data was reanalysed focusing on poorly covered (dark-by-depth) or poorly mapped (dark-by-MAPQ) regions previously published by Ebbert et al., 2019 []. Specifically, one of the authors’ publicly available datasets was downloaded (additional file 3 in ref. []), which includes “dark” genomic regions either by depth or by mapping quality based on whole-genome Illumina sequencing data (100bp read lengths). These were initially filtered to retain only “dark” coding DNA sequences (CDS) on chrX (File S1).
The Phenomizer web-based application (https://hpo.jax.org/tools/phenomizer) of the Human Phenotype Ontology (HPO) database (https://hpo.jax.org/) was also used in order to identify candidate diseases and implicated genes that best explain common clinical features present in multiple affected members of the family [,]. Briefly, the HPO terms intellectual disability (HP:0001249) and focal dystonia (HP:0004373) were entered in Phenomizer and an X-linked inheritance (HP:0001417) was selected, as suspected from the family pedigree. Only candidate diseases and implicated genes highly associated (p < 0.05) with the submitted HPO terms were taken into consideration for our focused short-read NGS data reanalysis (highlighted in red in File S1).
The resulting “dark” chrX CDS were then cross-checked with the list of genes highly associated with intellectual disability and focal dystonia. To do so, simple formulas were used to retain single values in each of the two datasets, followed by a comparison of the resulting data to identify common genes between the two (columns C and E in “Dark x Phenomizer” tab of File S1). That is, genes on chrX highly associated with intellectual disability and focal dystonia that fully or partially overlap “dark” CDS. The resulting targeted regions were manually inspected in the existing NGS data of the patients, using the Integrative Genomics Viewer (IGV), to reveal clinically relevant variants that may have remained undetected.

2.5. Sanger Sequencing Validation and Family Segregation Analysis

In order to validate the NGS reanalysis findings, PCR primer pairs (Metabion, Planegg, Germany) were designed flanking the region of interest using the Primer3 web interface tool (version 4.1.0) (https://primer3.ut.ee/) []. PCR amplification was then performed using the appropriate forward (5′-GTTCGAGGCCGAGCTGCAC-3′) and reverse (5′-CTCCTCCGGGTGCGTGAC-3′) primers, as well as the HotStarTaq Plus DNA Polymerase (Qiagen), following the manufacturer’s recommendations. Amplified products were visualized on a 2% agarose gel, which was stained with 1× GelRed® Nucleic Acid Gel Stain (Biotium, Fremont, CA, USA). PCR products with the expected fragment size were purified using ExoSAP-IT® (Affymetrix, Santa Clara, CA, USA), followed by cycle sequencing using the BigDye™ Terminator v1.1 Cycle Sequencing Kit (Applied Biosystems) and following the manufacturer’s recommendations. Cycle sequencing reaction clean-up was performed using Performa® DTR Gel Filtration Cartridges (EdgeBio, Gaithersburg, MD, USA), and purified sequencing reactions were processed on the 3130xl Genetic Analyzer (Applied Biosystems). Sanger sequencing electropherograms were visually inspected using the ABI Sequencing Analysis Software (version 5.4), while resulting sequences were aligned to the human genome assembly GRCh38 (hg38) using the BLAT tool from the UCSC Genome Browser (https://genome.ucsc.edu/cgi-bin/hgBlat) [].

2.6. Gene and Variant Nomenclature

Genes are described according to the HUGO Gene Nomenclature Committee (HGNC) guidelines []. All candidate variants are described according to the latest Human Genome Variation Society (HGVS) recommendations (https://hgvs-nomenclature.org/stable/) []. Finally, unless specified otherwise, all genomic coordinates in the present manuscript are based on the human genome assembly GRCh38 (hg38).

3. Results

3.1. Clinical Findings and Initial Testing Results

The family reported here includes nine affected males with a suspected X-linked disorder (Figure 1). The presented results are focused on the affected siblings III:3 and III:4, since they have gone through detailed clinical evaluation and comprehensive cytogenetic/molecular investigation in this study (Table 1).
Table 1. Clinical description and initial diagnostic testing results of the affected siblings III:3 and III:4.
Chromosomal G banding analysis in the two siblings (III:3 and III:4) revealed an apparently normal male karyotype. Subsequent array-CGH did not reveal any copy number changes other than CNVs in known polymorphic regions or CNVs inherited from a healthy parent, suggesting an apparently normal constitution. Genomic DNA analysis using PCR, determined no expansion of the tandem CGG repeats at the FMR1 gene; these were within normal limits (<55 CGG) in both male siblings (Table 1). Finally, both WES and WGS analyses were negative; no candidate SNVs, CNVs, or SVs were detected, by the standard analysis pipelines, that could follow the expected inheritance pattern and/or explain the patients’ phenotypes.

3.2. Unmasking Clinically Relevant “Dark” Genomic Regions

Since the family remained undiagnosed despite extensive cytogenetic and molecular testing, the NGS data was reanalysed focusing on poorly covered (dark-by-depth) and poorly mapped (dark-by-MAPQ) regions on chromosome X, given the suspected X-linked disorder. The publicly available dataset by Ebbert et al., 2019, includes in total 36,794 “dark” genomic regions based on whole-genome Illumina sequencing data (100bp read lengths) []. Filtering for coding DNA sequences, present only on chrX, reduced this number to 300 “dark” regions, partially overlapping 97 genes.
A crosscheck performed between these genes and the Phenomizer list of 29 X-linked genes highly associated (p < 0.05) with intellectual disability (HP:0001249) and focal dystonia (HP:0004373), resulted in two “dark” regions (highlighted in green in File S1). The first is chrX:25013469-25013696 (hg38) overlapping exon 2 of the Aristaless-related homeobox (ARX) gene (OMIM *300382) (NM_139058.3), while the second is chrX:111744737-111744820 (hg38) overlapping exon 24 of the ALG13 UDP-N-acetylglucosaminyltransferase subunit (ALG13) gene (OMIM *300776) (NM_001099922.3). Manual inspection of the resulting regions in the existing short-read WGS data from the affected male siblings III:3 and III:4, and their non-affected parents II:1 and II:2, did not reveal any candidate variants disrupting ALG13; however, a possible 24bp duplication overlapping ARX (referred here as ARXdup24) was selected for further investigation (Figure 2a). Remarkably, there was no indication of the ARXdup24 variant when retrospectively reviewing the WES data of the affected siblings (III:3 and III:4) at the corresponding genomic region in hg19 (chrX:25031586-25031813).
Figure 2. (a) IGV screenshot of the dark-by-depth region chrX:25013469-25013696 indicating the presence of a 24bp duplication (highlighted as an insertion in IGV) overlapping the ARX gene (III:3 = affected male; III:4 = affected sibling; II:1 = non-affected father; II:2 = non-affected mother). (b) Partial electropherograms confirming the ARXdup24 variant (red box) in the affected siblings (III:3 and III:4). The non-affected mother (II:2) is a carrier of ARXdup24, whereas ARXdup24 was not detected in the non-affected father (II:1). The electropherogram sequences displayed are from the reverse PCR reaction (plus strand) in a 5′3′ orientation. (c) The NP_620689.1:p.(Ala148_Ala155dup) variant (red box) is predicted to result in an expansion of the ARX polyalanine track by adding eight alanine amino acids (bold letters).
PCR using primers flanking ARXdup24 confirmed the identified variant in a hemizygous state in the affected male siblings (III:3 and III:4) (Figure 2b). The non-affected mother (II:2) was a carrier of ARXdup24, whereas ARXdup24 was not detected in the non-affected father (II:1) (Figure 2b). Additional family members were subsequently tested, demonstrating presence of the ARXdup24 variant in three other affected males (IV:15, IV:18, IV:25), heterozygosity in their non-affected mothers (III:12, III:15, III:19) and a number of non-affected female relatives (II:9, II:13, III:10, IV:14), and absence in the remaining non-affected male or female individuals tested (III:14, III:24, IV:1, IV:2, IV:3).
The ARXdup24 variant, reported here as NM_139058.3:c.441_464dup according to the HGVS nomenclature recommendations [], is predicted to result in an expansion of the ARX polyalanine track adding eight more alanine amino acids in the ARX protein NP_620689.1:p.(Ala148_Ala155dup) (Figure 2c).

4. Discussion

XLID is a highly heterogeneous neurodevelopmental disorder caused by direct or indirect disruptions of genes on chromosome X. Despite various technological advancements for detecting such causal chrX variants, many patients remain undiagnosed after routine genetic testing, even when short-read high-throughput sequencing methods are employed [,,,]. One reason for this may be the presence of “dark” regions in the genome, which cannot be successfully assembled or aligned using short-read NGS approaches, thus hindering the detection of clinically relevant gene variants within these underrepresented regions [,]. In the present study, a short-read NGS data reanalysis was performed in an undiagnosed XLID family focusing on such “dark” genomic regions. This enabled the unmasking of NM_139058.3:c.441_464dup p.(Ala148_Ala155dup) (ARXdup24), a recurrent 24bp in-frame duplication overlapping exon 2 of the ARX gene (NM_139058.3).
ARX, mapping on chromosome Xp21.3, is composed of 5 protein coding exons and is predominantly expressed in fetal brain, as well as adult skeletal muscle, heart, pancreas and liver [,]. It encodes the 562-amino acid Aristaless-related homeobox protein (NP_620689.1), which is composed of an N-terminal octapeptide domain, three nuclear localization sequences, four polyalanine tracts, an acidic domain, a homeobox domain, and a C-terminal aristaless domain []. ARX acts as a bifunctional transcriptional repressor and activator involved in the development of the brain as well as differentiation and maintenance of neurons in the cerebral cortex [,,,].
ARX is one of the most frequently mutated XLID genes, with at least 44 different variants reported to date (summarised in [,,]). These range from point mutations (splice, nonsense, and missense variants) to indels and duplications, and cluster mainly within exon 2 of ARX. ARXdup24 specifically maps in a mutation hot spot in the second ARX polyalanine tract and is the most common variant, representing >40% of all reported ARX variants, followed by NM_139058.3: c.315_335dup p.(Ala109_Ala115dup), a similar duplication variant mapping in the first ARX polyalanine tract [,]. Despite its high frequency, the detection of ARXdup24 can be easily missed using a non-targeted approach such as short-read high-throughput sequencing. This diagnostic pitfall could be due to ARX having an unusually high GC content (~73%) and 12.8% “dark” CDS [], particularly overlapping the first part of exon 2 where the recurrent ARXdup24 variant maps. The use of long-read sequencing technologies can substantially reduce the number of “dark” genomic regions [,], including those overlapping the ARX gene. However, validation by additional studies is required to support this further.
The identified ARXdup24 variant is classified here as pathogenic based on the Association for Clinical Genomic Science (ACGS) guidelines for variant classification in rare disease 2024 []. The criteria PS3_strong, PS4_strong, PM2_moderate and PP1_supporting were used to support this classification (ClinVar accession number: SCV006327456). The impact of ARXdup24 has been demonstrated in various functional studies (PS3_strong) supporting a partial loss of function of ARX. Specifically, a less striking reduction was observed in the transactivation and binding of ARX to the regulatory region of lysine-specific demethylase 5C (KDM5C) (OMIM * 314690), an XLID protein directly regulated by ARX and involved in chromatin remodeling and transcriptional repression []. Altered repression activity of the mutant ARX protein affecting the expression of transcriptional targets of ARX has also been demonstrated in an in vitro study after introducing the ARXdup24 variant []. Furthermore, the prevalence of ARXdup24 in affected individuals is significantly increased compared to the prevalence in controls (PS4_strong), as it has been identified in several affected individuals reported both in ClinVar (VCV000096455.23) and in the literature. It also has an extremely low frequency in controls (0.000002457) according to the Genome Aggregation Database (gnomAD v4.1.0) (PM2_moderate). Finally, it has been shown that ARXdup24 segregates with disease in related individuals, as also demonstrated in this study (PP1_supporting).
ARX variants, including ARXdup24, are associated with remarkable pleiotropy and a diverse spectrum of neurological phenotypes [,]. The majority of patients with ARXdup24 display non-syndromic ID (OMIM # 300419) [], while the remaining present with West syndrome (OMIM # 308350) characterised by developmental and epileptic encephalopathy [], and Partington syndrome (OMIM # 309510) characterised by syndromic XLID in conjunction with dystonic movement of the hands [,]. Partington syndrome is an X-linked neurodevelopmental disorder first described in a large Australian family by Partington et al., 1988 []. All affected males had mild to moderate ID and hand dystonia, inherited in an X-linked recessive pattern, while other phenotypes such as dysarthria, abnormal gait, autistic features and epilepsy were variable between individuals. The disease locus was initially mapped to the Xp21-Xpter chromosomal region, but this was later refined to Xp22.1 [,]. Fine mapping of the Partington syndrome locus, as well as identification of the causal ARX gene and ARXdup24 variant was accomplished in 2002 [,].
The patients in the family reported here (III:3, III:4, IV:18 and IV:25), in which ARXdup24 was found in a hemizygous state, commonly presented with mild ID and the characteristic dystonic movement of the hands, all of which are typical of Partington syndrome [,,,,]. In addition, similarly to previous published cases, variable intra-familial phenotypic expressivity was also observed; for example, only two patients (III:3 and IV:25) were reported to have epilepsy. Finally, all female ARXdup24 carriers tested in the present study (II:2, II:9, II:13, III:10, III:12, III:15, III:19, and IV:14) were asymptomatic. This is in concordance with previous reports supporting the rarity of clinically affected ARXdup24 female carriers, whereas females carrying either truncating or missense ARX variants have been described presenting with a variable clinical spectrum (reviewed in []).

5. Conclusions

After several years of diagnostic odyssey, the pathogenic ARXdup24 variant was unmasked in an XLID family by performing a focused short-read NGS data reanalysis and identifying “dark” genomic regions overlapping chrX genes highly associated with clinical features shared between affected family members. The ARXdup24 variant, validated in all patients tested so far, supports genotype–phenotype correlation in the first Partington syndrome family in Cyprus. This study highlights that examining underrepresented genomic regions and using phenotype-driven tools can provide critical diagnostic insights in unresolved XLID cases. We anticipate that a similar focused reanalysis approach can be successfully applied in additional unresolved cases of XLID or other conditions, to end the long and challenging diagnostic journey of patients, without the need for additional experimental procedures, leading to accurate diagnosis, management and counselling. These studies will collectively shed more light and potentially unravel the role of the “dark” genome in human diseases.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes16101224/s1, File S1: Dark x Phenomizer.

Author Contributions

Conceptualization, C.A., G.A.T. and C.S.; methodology, C.A., G.A.T. and C.S.; formal analysis, C.A., P.A. and A.T.; investigation, C.A., A.T., P.A., A.A., I.P., L.K., P.K., A.G., T.D., E.S., N.S., P.E., K.C., A.V., V.C.-A., G.A.T. and C.S.; resources, V.C.-A., G.A.T. and C.S.; data curation, C.A., A.T., P.A., A.A. and I.P.; writing—original draft preparation, C.A.; writing—review and editing, C.A., A.T., P.A., A.A., I.P., L.K., P.K., A.G., T.D., E.S., N.S., P.E., K.C., A.V., V.C.-A., G.A.T. and C.S.; visualization, C.A.; supervision, C.S.; project administration, C.S.; funding acquisition, G.A.T. and C.S.; (G.A.T. and C.S. contributed equally to this work and should be considered co-last authors). All authors have read and agreed to the published version of the manuscript.

Funding

This study received no external funding.

Institutional Review Board Statement

Ethical review and approval were waived for this study because the results of the clinical and genetic analysis presented in the submitted paper were part of the medical assessment of the patients. DNA from the patients and other family members was obtained and analysed as part of the routine genetic testing.

Data Availability Statement

The original contributions presented in this study are included in the article/supplementary material. Further inquiries can be directed to the corresponding author. The reported pathogenic ARX duplication NM_139058.3:c.441_464dup p.(Ala148_Ala155dup) was submitted to the freely accessible, public database of human genetic variants ClinVar (www.ncbi.nlm.nih.gov/clinvar/) (ClinVar accession number: SCV006327456) [].

Acknowledgments

The authors of this publication Constantia Aristidou, Türem Delikurt, Elena Spanou, Alain Verloes and George A. Tanteles are members of the European Reference Network on Rare Congenital Malformations and Rare Intellectual Disability (ERN-ITHACA). ERN-ITHACA is funded by the European Union, under the grant agreement N°101156387.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
ALG13ALG13 UDP-N-acetylglucosaminyltransferase subunit
array-CGHArray-Comparative Genomic Hybridization
ARXAristaless-related homeobox
CDSCoding DNA sequences
chrchromosome
CNVCopy number variant
FISHFluorescence in situ hybridization
FMR1Fragile X Messenger Ribonucleoprotein 1
FXSFragile X syndrome
HGVSHuman genome variation society
HPOHuman phenotype ontology
IDIntellectual disability
IGVIntegrative genomics viewer
indelInsertion/deletion
KDM5CLysine-specific demethylase 5C
NGSNext-generation sequencing
SMN1Survival of motor neuron 1
SMN2Survival of motor neuron 2
SNVSingle-nucleotide variant
TBX1T-box transcription factor 1
WESWhole-exome sequencing
WGSWhole-genome sequencing
XLIDX-linked intellectual disability

References

  1. Stevenson, R.E.; Schwartz, C.E. X-linked intellectual disability: Unique vulnerability of the male genome. Dev. Disabil. Res. Rev. 2009, 15, 361–368. [Google Scholar] [CrossRef] [PubMed]
  2. Lubs, H.A.; Stevenson, R.E.; Schwartz, C.E. Fragile X and X-linked intellectual disability: Four decades of discovery. Am. J. Hum. Genet. 2012, 90, 579–590. [Google Scholar] [CrossRef] [PubMed]
  3. Schwartz, C.E.; Louie, R.J.; Toutain, A.; Skinner, C.; Friez, M.J.; Stevenson, R.E. X-Linked intellectual disability update 2022. Am. J. Med. Genet. A 2023, 191, 144–159. [Google Scholar] [CrossRef] [PubMed]
  4. Greenwood Genetic Center. XLID Genetic Research. Available online: https://ggc.org/xlid-genetic-research (accessed on 22 August 2025).
  5. Martin, J.P.; Bell, J. A Pedigree of Mental Defect Showing Sex-Linkage. J. Neurol. Psychiatry 1943, 6, 154–157. [Google Scholar] [CrossRef]
  6. Arveiler, B.; Alembik, Y.; Hanauer, A.; Jacobs, P.; Tranebjaerg, L.; Mikkelsen, M.; Puissant, H.; Piet, L.L.; Mandel, J.L. Linkage analysis suggests at least two loci for X-linked non-specific mental retardation. Am. J. Med. Genet. 1988, 30, 473–483. [Google Scholar] [CrossRef]
  7. Verkerk, A.J.; Pieretti, M.; Sutcliffe, J.S.; Fu, Y.H.; Kuhl, D.P.; Pizzuti, A.; Reiner, O.; Richards, S.; Victoria, M.F.; Zhang, F.P.; et al. Identification of a gene (FMR-1) containing a CGG repeat coincident with a breakpoint cluster region exhibiting length variation in fragile X syndrome. Cell 1991, 65, 905–914. [Google Scholar] [CrossRef]
  8. Bauters, M.; Van Esch, H.; Marynen, P.; Froyen, G. X chromosome array-CGH for the identification of novel X-linked mental retardation genes. Eur. J. Med. Genet. 2005, 48, 263–275. [Google Scholar] [CrossRef]
  9. Bashiardes, S.; Kousoulidou, L.; van Bokhoven, H.; Ropers, H.H.; Chelly, J.; Moraine, C.; de Brouwer, A.P.; Van Esch, H.; Froyen, G.; Patsalis, P.C. A new chromosome x exon-specific microarray platform for screening of patients with X-linked disorders. J. Mol. Diagn. 2009, 11, 562–568. [Google Scholar] [CrossRef]
  10. Whibley, A.C.; Plagnol, V.; Tarpey, P.S.; Abidi, F.; Fullston, T.; Choma, M.K.; Boucher, C.A.; Shepherd, L.; Willatt, L.; Parkin, G.; et al. Fine-scale survey of X chromosome copy number variants and indels underlying intellectual disability. Am. J. Hum. Genet. 2010, 87, 173–188. [Google Scholar] [CrossRef]
  11. Moyses-Oliveira, M.; Guilherme, R.S.; Meloni, V.A.; Di Battista, A.; de Mello, C.B.; Bragagnolo, S.; Moretti-Ferreira, D.; Kosyakova, N.; Liehr, T.; Carvalheira, G.M.; et al. X-linked intellectual disability related genes disrupted by balanced X-autosome translocations. Am. J. Med. Genet. B Neuropsychiatr. Genet. 2015, 168, 669–677. [Google Scholar] [CrossRef]
  12. Neri, G.; Schwartz, C.E.; Lubs, H.A.; Stevenson, R.E. X-linked intellectual disability update 2017. Am. J. Med. Genet A 2018, 176, 1375–1388. [Google Scholar] [CrossRef]
  13. de Ligt, J.; Willemsen, M.H.; van Bon, B.W.; Kleefstra, T.; Yntema, H.G.; Kroes, T.; Vulto-van Silfhout, A.T.; Koolen, D.A.; de Vries, P.; Gilissen, C.; et al. Diagnostic exome sequencing in persons with severe intellectual disability. N. Engl. J. Med. 2012, 367, 1921–1929. [Google Scholar] [CrossRef]
  14. Tzschach, A.; Grasshoff, U.; Beck-Woedl, S.; Dufke, C.; Bauer, C.; Kehrer, M.; Evers, C.; Moog, U.; Oehl-Jaschkowitz, B.; Di Donato, N.; et al. Next-generation sequencing in X-linked intellectual disability. Eur. J. Hum. Genet. 2015, 23, 1513–1518. [Google Scholar] [CrossRef] [PubMed]
  15. Hu, H.; Haas, S.A.; Chelly, J.; Van Esch, H.; Raynaud, M.; de Brouwer, A.P.; Weinert, S.; Froyen, G.; Frints, S.G.; Laumonnier, F.; et al. X-exome sequencing of 405 unresolved families identifies seven novel intellectual disability genes. Mol. Psychiatry 2016, 21, 133–148. [Google Scholar] [CrossRef] [PubMed]
  16. Field, M.J.; Kumar, R.; Hackett, A.; Kayumi, S.; Shoubridge, C.A.; Ewans, L.J.; Ivancevic, A.M.; Dudding-Byth, T.; Carroll, R.; Kroes, T.; et al. Different types of disease-causing noncoding variants revealed by genomic and gene expression analyses in families with X-linked intellectual disability. Hum. Mutat. 2021, 42, 835–847. [Google Scholar] [CrossRef] [PubMed]
  17. Wojcik, M.H.; Reuter, C.M.; Marwaha, S.; Mahmoud, M.; Duyzend, M.H.; Barseghyan, H.; Yuan, B.; Boone, P.M.; Groopman, E.E.; Delot, E.C.; et al. Beyond the exome: What’s next in diagnostic testing for Mendelian conditions. Am. J. Hum. Genet. 2023, 110, 1229–1248. [Google Scholar] [CrossRef]
  18. Mantere, T.; Kersten, S.; Hoischen, A. Long-Read Sequencing Emerging in Medical Genetics. Front. Genet. 2019, 10, 426. [Google Scholar] [CrossRef]
  19. Mantere, T.; Neveling, K.; Pebrel-Richard, C.; Benoist, M.; van der Zande, G.; Kater-Baats, E.; Baatout, I.; van Beek, R.; Yammine, T.; Oorsprong, M.; et al. Optical genome mapping enables constitutional chromosomal aberration detection. Am. J. Hum. Genet. 2021, 108, 1409–1422. [Google Scholar] [CrossRef]
  20. Ebbert, M.T.W.; Jensen, T.D.; Jansen-West, K.; Sens, J.P.; Reddy, J.S.; Ridge, P.G.; Kauwe, J.S.K.; Belzil, V.; Pregent, L.; Carrasquillo, M.M.; et al. Systematic analysis of dark and camouflaged genes reveals disease-relevant genes hiding in plain sight. Genome Biol. 2019, 20, 97. [Google Scholar] [CrossRef]
  21. Kritioti, E.; Theodosiou, A.; Parpaite, T.; Alexandrou, A.; Nicolaou, N.; Papaevripidou, I.; Séjourné, N.; Coste, B.; Christophidou-Anastasiadou, V.; Tanteles, G.A.; et al. Unravelling the genetic causes of multiple malformation syndromes: A whole exome sequencing study of the Cypriot population. PLoS ONE 2021, 16, e0253562. [Google Scholar] [CrossRef]
  22. Aristidou, C.; Theodosiou, A.; Alexandrou, A.; Papaevripidou, I.; Evangelidou, P.; Kosmaidou-Aravidou, Z.; Behjati, F.; Christophidou-Anastasiadou, V.; Tanteles, G.A.; Sismani, C. Exploring the Genetic Causality of Discordant Phenotypes in Familial Apparently Balanced Translocation Cases Using Whole Exome Sequencing. Genes 2022, 14, 82. [Google Scholar] [CrossRef]
  23. Krumm, N.; Sudmant, P.H.; Ko, A.; O’Roak, B.J.; Malig, M.; Coe, B.P.; Project, N.E.S.; Quinlan, A.R.; Nickerson, D.A.; Eichler, E.E. Copy number variation detection and genotyping from exome sequence data. Genome Res. 2012, 22, 1525–1532. [Google Scholar] [CrossRef]
  24. Raczy, C.; Petrovski, R.; Saunders, C.T.; Chorny, I.; Kruglyak, S.; Margulies, E.H.; Chuang, H.Y.; Kallberg, M.; Kumar, S.A.; Liao, A.; et al. Isaac: Ultra-fast whole-genome secondary analysis on Illumina sequencing platforms. Bioinformatics 2013, 29, 2041–2043. [Google Scholar] [CrossRef]
  25. Kim, S.; Scheffler, K.; Halpern, A.L.; Bekritsky, M.A.; Noh, E.; Kallberg, M.; Chen, X.; Kim, Y.; Beyter, D.; Krusche, P.; et al. Strelka2: Fast and accurate calling of germline and somatic variants. Nat. Methods 2018, 15, 591–594. [Google Scholar] [CrossRef] [PubMed]
  26. Chen, X.; Schulz-Trieglaff, O.; Shaw, R.; Barnes, B.; Schlesinger, F.; Kallberg, M.; Cox, A.J.; Kruglyak, S.; Saunders, C.T. Manta: Rapid detection of structural variants and indels for germline and cancer sequencing applications. Bioinformatics 2016, 32, 1220–1222. [Google Scholar] [CrossRef] [PubMed]
  27. Boeva, V.; Popova, T.; Bleakley, K.; Chiche, P.; Cappo, J.; Schleiermacher, G.; Janoueix-Lerosey, I.; Delattre, O.; Barillot, E. Control-FREEC: A tool for assessing copy number and allelic content using next-generation sequencing data. Bioinformatics 2012, 28, 423–425. [Google Scholar] [CrossRef] [PubMed]
  28. Cingolani, P.; Platts, A.; Wang, L.L.; Coon, M.; Nguyen, T.; Wang, L.; Land, S.J.; Lu, X.; Ruden, D.M. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 2012, 6, 80–92. [Google Scholar] [CrossRef]
  29. Köhler, S.; Schulz, M.H.; Krawitz, P.; Bauer, S.; Dölken, S.; Ott, C.E.; Mundlos, C.; Horn, D.; Mundlos, S.; Robinson, P.N. Clinical Diagnostics in Human Genetics with Semantic Similarity Searches in Ontologies. Am. J. Hum. Genet. 2009, 85, 457–464. [Google Scholar] [CrossRef]
  30. Gargano, M.A.; Matentzoglu, N.; Coleman, B.; Addo-Lartey, E.B.; Anagnostopoulos, A.V.; Anderton, J.; Avillach, P.; Bagley, A.M.; Bakstein, E.; Balhoff, J.P.; et al. The Human Phenotype Ontology in 2024: Phenotypes around the world. Nucleic Acids Res. 2024, 52, D1333–D1346. [Google Scholar] [CrossRef]
  31. Untergasser, A.; Cutcutache, I.; Koressaar, T.; Ye, J.; Faircloth, B.C.; Remm, M.; Rozen, S.G. Primer3—New capabilities and interfaces. Nucleic Acids Res. 2012, 40, e115. [Google Scholar] [CrossRef]
  32. Perez, G.; Barber, G.P.; Benet-Pages, A.; Casper, J.; Clawson, H.; Diekhans, M.; Fischer, C.; Gonzalez, J.N.; Hinrichs, A.S.; Lee, C.M.; et al. The UCSC Genome Browser database: 2025 update. Nucleic Acids Res. 2025, 53, D1243–D1249. [Google Scholar] [CrossRef]
  33. Seal, R.L.; Braschi, B.; Gray, K.; Jones, T.E.M.; Tweedie, S.; Haim-Vilmovsky, L.; Bruford, E.A. Genenames.org: The HGNC resources in 2023. Nucleic Acids Res. 2023, 51, D1003–D1009. [Google Scholar] [CrossRef] [PubMed]
  34. Hart, R.K.; Fokkema, I.; DiStefano, M.; Hastings, R.; Laros, J.F.J.; Taylor, R.; Wagner, A.H.; den Dunnen, J.T. HGVS Nomenclature 2024: Improvements to community engagement, usability, and computability. Genome Med. 2024, 16, 149. [Google Scholar] [CrossRef] [PubMed]
  35. Ryan, N.M.; Corvin, A. Investigating the dark-side of the genome: A barrier to human disease variant discovery? Biol. Res. 2023, 56, 42. [Google Scholar] [CrossRef] [PubMed]
  36. Strømme, P.; Mangelsdorf, M.E.; Shaw, M.A.; Lower, K.M.; Lewis, S.M.E.; Bruyere, H.; Lütcherath, V.; Gedeon, Á.K.; Wallace, R.H.; Scheffer, I.E.; et al. Mutations in the human ortholog of Aristaless cause X-linked mental retardation and epilepsy. Nat. Genet. 2002, 30, 441–445. [Google Scholar] [CrossRef]
  37. Ohira, R.; Zhang, Y.H.; Guo, W.; Dipple, K.; Shih, S.L.; Doerr, J.; Huang, B.L.; Fu, L.J.; Abu-Khalil, A.; Geschwind, D.; et al. Human ARX gene: Genomic characterization and expression. Mol. Genet. Metab. 2002, 77, 179–188. [Google Scholar] [CrossRef]
  38. Bienvenu, T.; Poirier, K.; Friocourt, G.; Bahi, N.; Beaumont, D.; Fauchereau, F.; Jeema, L.B.; Zemni, R.; Vinet, M.C.; Francis, F.; et al. ARX, a novel Prd-class-homeobox gene highly expressed in the telencephalon, is mutated in X-linked mental retardation. Hum. Mol. Genet. 2002, 11, 981–991. [Google Scholar] [CrossRef]
  39. Seufert, D.W.; Prescott, N.L.; El-Hodiri, H.M. Xenopus aristaless-related homeobox (xARX) gene product functions as both a transcriptional activator and repressor in forebrain development. Dev. Dyn. 2005, 232, 313–324. [Google Scholar] [CrossRef]
  40. McKenzie, O.; Ponte, I.; Mangelsdorf, M.; Finnis, M.; Colasante, G.; Shoubridge, C.; Stifani, S.; Gecz, J.; Broccoli, V. Aristaless-related homeobox gene, the gene responsible for West syndrome and related disorders, is a Groucho/transducin-like enhancer of split dependent transcriptional repressor. Neuroscience 2007, 146, 236–247. [Google Scholar] [CrossRef]
  41. Gecz, J.; Cloosterman, D.; Partington, M. ARX: A gene for all seasons. Curr. Opin. Genet. Dev. 2006, 16, 308–316. [Google Scholar] [CrossRef]
  42. Poirier, K.; Lacombe, D.; Gilbert-Dussardier, B.; Raynaud, M.; Desportes, V.; de Brouwer, A.P.; Moraine, C.; Fryns, J.P.; Ropers, H.H.; Beldjord, C.; et al. Screening of ARX in mental retardation families: Consequences for the strategy of molecular diagnosis. Neurogenetics 2006, 7, 39–46. [Google Scholar] [CrossRef]
  43. Shoubridge, C.; Fullston, T.; Gecz, J. ARX spectrum disorders: Making inroads into the molecular pathology. Hum. Mutat. 2010, 31, 889–900. [Google Scholar] [CrossRef]
  44. Mahmoud, M.; Harting, J.; Corbitt, H.; Chen, X.; Jhangiani, S.N.; Doddapaneni, H.; Meng, Q.; Han, T.; Lambert, C.; Zhang, S.; et al. Closing the gap: Solving complex medically relevant genes at scale. medRxiv 2024. [Google Scholar] [CrossRef] [PubMed]
  45. ACGS Best Practice Guidelines for Variant Classification in Rare Disease 2024. Available online: https://www.genomicseducation.hee.nhs.uk/wp-content/uploads/2024/08/ACGS-2024_UK-practice-guidelines-for-variant-classification.pdf (accessed on 30 July 2025).
  46. Poeta, L.; Fusco, F.; Drongitis, D.; Shoubridge, C.; Manganelli, G.; Filosa, S.; Paciolla, M.; Courtney, M.; Collombat, P.; Lioi, M.B.; et al. A regulatory path associated with X-linked intellectual disability and epilepsy links KDM5C to the polyalanine expansions in ARX. Am. J. Hum. Genet. 2013, 92, 114–125. [Google Scholar] [CrossRef] [PubMed]
  47. Stromme, P.; Mangelsdorf, M.E.; Scheffer, I.E.; Gecz, J. Infantile spasms, dystonia, and other X-linked phenotypes caused by mutations in Aristaless related homeobox gene, ARX. Brain Dev. 2002, 24, 266–268. [Google Scholar] [CrossRef] [PubMed]
  48. Kato, M.; Das, S.; Petras, K.; Kitamura, K.; Morohashi, K.I.; Abuelo, D.N.; Barr, M.; Bonneau, D.; Brady, A.F.; Carpenter, N.J.; et al. Mutations of ARX Are Associated with Striking Pleiotropy and Consistent Genotype-Phenotype Correlation. Hum. Mutat. 2004, 23, 147–159. [Google Scholar] [CrossRef]
  49. Stepp, M.L.; Cason, A.L.; Finnis, M.; Mangelsdorf, M.; Holinski-Feder, E.; Macgregor, D.; MacMillan, A.; Holden, J.J.; Gecz, J.; Stevenson, R.E.; et al. XLMR in MRX families 29, 32, 33 and 38 results from the dup24 mutation in the ARX (Aristaless related homeobox) gene. BMC Med. Genet. 2005, 6, 16. [Google Scholar] [CrossRef]
  50. Kato, M.; Das, S.; Petras, K.; Sawaishi, Y.; Dobyns, W.B. Polyalanine expansion of ARX associated with cryptogenic West syndrome. Neurology 2003, 61, 267–276. [Google Scholar] [CrossRef]
  51. Partington, M.W.; Mulley, J.C.; Sutherland, G.R.; Hockey, A.; Thode, A.; Turner, G. X-linked mental retardation with dystonic movements of the hands. Am. J. Med. Genet. 1988, 30, 251–262. [Google Scholar] [CrossRef]
  52. Partington, M.W.; Turner, G.; Boyle, J.; Gecz, J. Three new families with X-linked mental retardation caused by the 428–451dup(24bp) mutation in ARX. Clin. Genet. 2004, 66, 39–45. [Google Scholar] [CrossRef]
  53. Gedeon, A.; Partington, M.; Mulley, J. X-linked mental retardation with dystonic movements of the hands (PRTS): Revisited. Am. J. Med. Genet. 1994, 51, 565–568. [Google Scholar] [CrossRef]
  54. Turner, G.; Partington, M.; Kerr, B.; Mangelsdorf, M.; Gecz, J. Variable expression of mental retardation, autism, seizures, and dystonic hand movements in two families with an identical ARX gene mutation. Am. J. Med. Genet. 2002, 112, 405–411. [Google Scholar] [CrossRef]
  55. Frints, S.G.; Froyen, G.; Marynen, P.; Willekens, D.; Legius, E.; Fryns, J.P. Re-evaluation of MRX36 family after discovery of an ARX gene mutation reveals mild neurological features of Partington syndrome. Am. J. Med. Genet. 2002, 112, 427–428. [Google Scholar] [CrossRef]
  56. Frints, S.G.; Borghgraef, M.; Froyen, G.; Marynen, P.; Fryns, J.P. Clinical study and haplotype analysis in two brothers with Partington syndrome. Am. J. Med. Genet. 2002, 112, 361–368. [Google Scholar] [CrossRef]
  57. Gras, M.; Heide, S.; Keren, B.; Valence, S.; Garel, C.; Whalen, S.; Jansen, A.C.; Keymolen, K.; Stouffs, K.; Jennesson, M.; et al. Further characterisation of ARX-related disorders in females due to inherited or de novo variants. J. Med. Genet. 2024, 61, 103–108. [Google Scholar] [CrossRef]
  58. Landrum, M.J.; Chitipiralla, S.; Brown, G.R.; Chen, C.; Gu, B.; Hart, J.; Hoffman, D.; Jang, W.; Kaur, K.; Liu, C.; et al. ClinVar: Improvements to accessing data. Nucleic Acids Res. 2020, 48, D835–D844. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.