Next Article in Journal
M-GNN: A Graph Neural Network Framework for Lung Cancer Detection Using Metabolomics and Heterogeneous Graph Modeling
Previous Article in Journal
Correction: Al Subait et al. Discovery of PPAR Alpha Lipid Pathway Modulators That Do Not Bind Directly to the Receptor as Potential Anti-Cancer Compounds. Int. J. Mol. Sci. 2025, 26, 736
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genotype–Phenotype Associations in Phelan–McDermid Syndrome: Insights into Novel Genes Beyond SHANK3

by
Julian Nevado
1,2,3,*,
Blanca Escalada
1,
Yolanda Muñoz-GªPorrero
1,
Carmen Adan
1,
Jair Tenorio-Castaño
1,2,3 and
Pablo Daniel Lapunzina
1,2,3
1
Instituto de Genética Médica y Molecular (INGEMM)-IdiPAZ, Hospital Universitario La Paz, Paseo de la Castellana 261, 28046 Madrid, Spain
2
CIBERER—Centro de Investigación Biomédica en Red de Enfermedades Raras, ISCIII, 28029 Madrid, Spain
3
ITHACA-European Reference Network-Hospital La Paz, 28046 Madrid, Spain
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2025, 26(10), 4653; https://doi.org/10.3390/ijms26104653
Submission received: 17 March 2025 / Revised: 24 April 2025 / Accepted: 6 May 2025 / Published: 13 May 2025
(This article belongs to the Section Molecular Genetics and Genomics)

Abstract

:
Phelan–McDermid syndrome (PMS; #MIM: 606232) is a rare neurodevelopmental disorder primarily caused by the haploinsufficiency of the SHANK3 gene, most often due to deletions encompassing the gene or single nucleotide variants within it. Individuals with PMS display a wide range of clinical abnormalities and considerable genetic heterogeneity. This study aims to investigate genotype–phenotype correlations in a cohort of 213 individuals with PMS and to identify novel candidate genes, beyond SHANK3, that may contribute to the syndrome’s diverse clinical manifestations. Unsupervised clustering based on deletion size and Global Functional Assessment of the Patient (GFAP, previously described and developed by our group), along with additional analytical approaches, were employed to explore genotype–phenotype relationships. Deletion size within the 22q13.3 region emerged as a major determinant of phenotype, with larger deletions associated with more severe global functional impairment. Furthermore, CERK, TBC1D22A, CELSR1, and GRAMD4 were identified as candidate genes within 22q13.3, potentially contributing to core PMS phenotypes, and their putative interactions were explored. Our findings support the central role of SHANK3 in PMS, while also indicating that it does not account for the full phenotypic spectrum. This study underscores the variable impact of distinct genetic alterations in PMS and proposes additional loci implicated in its pathogenesis. These insights may inform future therapeutic strategies, emphasizing the importance of patient stratification and precision medicine.

1. Introduction

Phelan–McDermid syndrome (PMS; #MIM: 606232) is a rare genetic disorder. In Europe, a disease is classified as rare when its prevalence is ≤1 in 2000 individuals. Based on local data, we estimate a national prevalence of 4 × 10−4 per 10,000 inhabitants [1], which is lower than the previously reported, 1 in 20,000 [2], suggesting underdiagnosis. PMS is a neurodevelopmental disorder caused by haploinsufficiency of the SHANK3 gene, resulting either from deletions at 22q13.33 [3,4] or from pathogenic or likely pathogenic variants [3,5]. SHANK3 encodes a scaffolding protein essential to the postsynaptic density of excitatory neurons, playing a critical role in synaptic signaling, plasticity, and neural circuit formation [6].
PMS manifests with a broad spectrum of neurological, behavioral, and physical anomalies [7,8], including autism spectrum disorder (ASD) [9,10], intellectual disability [4], and schizophrenia [11]. Additional features include hypotonia, high pain tolerance, impulsivity, repetitive behaviors, developmental regression, and continuous crying [12]. Structural brain abnormalities, gastrointestinal disturbances, and renal malformations are also commonly observed [3,4,13]. Although SHANK3 haploinsufficiency is central to the disorder, other genes have been previously implicated in the PMS phenotype. For instance, TFC20 has been associated with intellectual disability and ASD [14], and CELSR1 has been linked to lymphedema [15,16,17].
Patients with deletions are more common than those carrying pathogenic SHANK3 single nucleotide variants (SNVs) (International PMS Registry; [3,4]). These deletions typically arise through non-allelic homologous recombination, unbalanced translocations, or other genomic rearrangements, such as ring chromosome 22 [3,8]. Most deletions occur de novo, although some cases result from parental mosaicism or balanced rearrangements [3]. Deletion sizes range from 12 kb to 10.3 Mb [3] and can be detected using CGH or SNP arrays, FISH, MLPA [3], or karyotyping for larger deletions (>4–5 Mb) [18].

2. Results

2.1. “Functional (GFAP)” Analyses Between the Different Groups of Patients with PMS

The global functional assessment of the patients (GFAP), as previously described [3], was applied in this cohort to stratify individuals into distinct groups: small deletions (≤0.25 Mb), deletions > 0.25 Mb, SHANK3 SNVs, isolated deletions, deletions with additional genomic rearrangements, and interstitial deletions. ANOVA analysis revealed statistically significant functional differences among these groups (F = 7.133; df = 5; p < 0.001), as illustrated in Figure 1.
The mean GFAP score for the entire cohort was 94.5 ± 41.3 arbitrary units. Patients with smaller deletions (≤0.25 Mb) had a mean GFAP score of 73.98 ± 45.7, compared to 108.34 ± 40.67 in those with deletions >0.25 Mb, demonstrating a highly significant statistical difference (p < 0.0001). As expected, individuals with smaller deletions exhibited a functional profile closely resembling that of patients with SHANK3 SNVs, suggesting overlapping pathogenic mechanisms. Overall, patients with smaller deletions presented with fewer comorbidities and lower GFAP scores, indicating better functional outcomes compared to those with larger deletions. In contrast, individuals with larger deletions were associated with higher GFAP scores, reflecting increased clinical severity, including profound intellectual disability, pronounced hypotonia, and significant behavioral disturbances. These findings support the consideration of deletion size as a key determinant of clinical severity, with larger deletions consistently correlating with higher GFAP scores. Figure 2 illustrates the average GFAP scores across different deletion size categories, highlighting that deletions exceeding 2.7 Mb are unequivocally associated with the highest GFAP scores, indicative of more severe functional impairment.
Interestingly, interstitial deletions involving SHANK3 exhibited statistically significant differences in GFAP scores when compared to cases with additional genomic rearrangements, isolated deletions, or deletions larger than 0.25 Mb. In contrast, interstitial deletions within 22q13.3 that do not include SHANK3 (n = 3) also yielded GFAP scores, with a mean value of 45.3 ± 16.8—the lowest among the groups analyzed. Notably, some of these individuals, despite the absence of SHANK3 involvement, presented clinical features reminiscent of PMS.

2.2. Genotype–Phenotype Analyses

2.2.1. Pearson’s Correlation Analyses (Continuous Variables)

Pearson’s correlation coefficient was used to assess the strength and direction of relationships between continuous variables, including deletion size, GFAP scores, and the number of affected genes. The results are summarized in Table 1. Larger deletions were positively correlated with higher GFAP scores, indicating more severe “functional” outcomes. As anticipated, an increase in the number of deleted genes was also associated with elevated GFAP scores, suggesting that the loss of multiple functional genes across various regions contributes to increased phenotypic severity. Consistent with previous observations [3], larger deletions tended to be diagnosed earlier, likely due to their more overt clinical presentation. Furthermore, deletion size showed a strong correlation with the number of OMIM genes lost, supporting the hypothesis that PMS with large deletions may represent a true contiguous gene syndrome, wherein the cumulative effect of multiple gene losses contributes to the clinical phenotype (Table 1).

2.2.2. Spearman’s Rank Correlation Analyses (Categorical Variables)

Spearman’s correlation was employed to assess associations between ordinal phenotypic traits and both deletion size and GFAP scores. Notably, larger deletions were inversely correlated with expressive language abilities (r = −0.421; p < 0.0001), comprehension skills (r = −0.266; p < 0.0001), and sphincter control (r = −0.277; p < 0.0001). Conversely, deletion size was positively correlated with the prevalence and severity of hypotonia (r = 0.304; p < 0.0001), growth anomalies (r = 0.227; p = 0.002), toe syndactyly (r = 0.223; p = 0.002), renal–urological abnormalities (r = 0.197; p = 0.007), lymphedema (r = 0.277; p < 0.0001), and large/fleshy hands (r = 0.266; p < 0.001). Additional traits—including seizures, strabismus (inversely), prominent ears, full/puffy eyelids, and hair-pulling—demonstrated unidirectional statistical significance (see Table S1, Supplemental Data).
Regarding associations with GFAP scores, we observed significant positive correlations with seizures (r = 0.207; p < 0.0001), developmental regression (r = 0.424; p < 0.0001), behavioral anomalies (r = 0.384; p < 0.0001), and high pain threshold (r = 0.358; p < 0.0001). Other notable correlations included core dysmorphic traits (e.g., long eyelashes, full brow, bulbous nose, wide nasal bridge, prominent ears) and various comorbidities such as renal abnormalities, gastrointestinal disturbances, hearing impairments, sleep disorders, and ophthalmological anomalies (see Table S1, Supplemental Data).

2.2.3. Analysis of Unsupervised Clusters by Deletion Size (Ward’s Method)

Unsupervised clustering was conducted using Ward’s hierarchical method, based on deletion size and associated phenotypic features (see Figure 3). The optimal number of clusters (n = 4) was determined using the Bayesian Information Criterion (BIC) and Akaike Information Criterion (AIC). Four primary clusters were identified, and a fifth cluster (Cluster 5, SHANK3 SNVs) was added to complete the comparative analysis. These clusters revealed distinct genetic and phenotypic profiles: Cluster 1 (CL1), mean deletion size = 3.054 ± 1.01 Mb (81 individuals); mean GFAP = 101.52 ± 39.8; Cluster 2 (CL2), mean deletion size = 6.29 ± 0.78 Mb (34 individuals); mean GFAP = 111.80 ± 37.96; Cluster 3 (CL3), mean deletion size = 0.246 ± 0.40 Mb (51 individuals); mean GFAP = 78.59 ± 46.93; Cluster 4 (CL4), mean deletion size = 8.48 ± 0.66 Mb (24 individuals); mean GFAP = 133.75 ± 32.60 and Cluster 5 (CL5, SHANK3 SNVs), no deletion (21 individuals); mean GFAP = 87.52 ± 22.8.
Frequency data for the various clinical variables across clusters are presented in Table S2 of the Supplemental Data. In brief, Cluster 1 (moderate deletions) comprises individuals with moderate functional impairments, characterized by a combination of developmental delays and behavioral features. Cluster 2 (large deletions) includes individuals with severe phenotypes, notably marked by profound hypotonia and significant cognitive delays. Cluster 3 (small deletions) consists of individuals with the mildest functional impairments, reflected by lower GFAP scores and a higher frequency of early independent walking and preserved language abilities.
As anticipated, Cluster 4 (very large deletions) represents individuals with the most severe clinical presentations, as indicated by the highest GFAP scores and elevated frequencies of seizures, renal anomalies, and macrocephaly. Cluster 5 (SHANK3 SNVs) includes individuals with comparatively milder phenotypes, functionally similar to those in Cluster 3. Overall, deletion size emerges as a key determinant of phenotypic severity in PMS, influencing a broad range of clinical features including non-fluent language, seizures, macrocephaly, dolichocephaly, strabismus, deep-set eyes, full/puffy eyelids, full eyebrows, wide nasal bridge, bulbous nose, auditory abnormalities, full/puffy cheeks, syndactyly, dysplastic nails, large/fleshy hands, ophthalmological anomalies, nephrourological abnormalities, obesity, lymphedema, uncontrolled laughter, hair-pulling, and altered growth trajectories. The frequency of these features increases with deletion size, which in turn correlates with both elevated GFAP scores and the number of deleted genes, including OMIM-annotated genes.

2.2.4. ANOVA Analysis-Comparison of GFAP Scores Across Clusters

A one-way ANOVA was conducted to compare GFAP scores across the five clusters, revealing statistically significant differences (F = 7.061; degrees of freedom = 5; p < 0.001). Specifically, individuals in Cluster 3 (small deletions) exhibited significantly better functional outcomes compared to those in Clusters 2 and 4 (larger deletions). No statistically significant difference was observed between Clusters 3 and 5 (p = 0.2), indicating that the phenotype associated with small deletions may closely resemble that of individuals with SHANK3 SNVs. Similarly, no significant difference was found between Clusters 2 and 4 (p = 0.4), suggesting that larger deletions consistently confer poor functional outcomes.

2.2.5. Chi-Square Test (For Categorical Variables)

Chi-square analyses were performed to assess associations between categorical phenotypic traits and the different genetic groups represented by the clusters described above. In addition, the application of χ2 tests followed by post hoc analyses revealed several statistically significant differences (p < 0.001), as detailed in Table 2 and Table 3.
  • By Cluster. Table 2 presents statistically significant associations between clusters and various categorical variables. In most cases, the observed data corroborated previously reported frequency patterns across clusters. Notable differences were identified in cognitive and motor domains, as well as in specific comorbidities and dysmorphic features (see Table 2). Pairwise comparisons between clusters are further detailed in Table S3 of the Supplemental Data. For example, seizures were significantly more prevalent in Clusters 2 and 4 (χ2 = 21.42, p < 0.001). Regarding language ability, Clusters 3 and 5 showed better outcomes, with a significantly higher proportion of individuals demonstrating preserved language skills (χ2 = 14.89, p = 0.001). Additionally, cranial size abnormalities—namely microcephaly and macrocephaly—were differentially associated with cluster assignment based on deletion size, with larger deletions linked to a higher prevalence of macrocephaly and a lower prevalence of microcephaly. Table 2 presents only those variables showing statistically significant differences among the clusters.
  • By Genetic Groups (Table 3). As shown in Table 3, significant differences in language abilities were observed between individuals with large deletions (>0.25 Mb) and those carrying SHANK3 SNVs (χ2 = 26.345; degrees of freedom = 2; p < 0.001). Additionally, renal abnormalities and lymphedema were significantly more prevalent among patients with deletions encompassing the CELSR1 and GRAMD4 genes (χ2 = 18.734; degrees of freedom = 2; p < 0.001). Hypotonia and seizures were also significantly more frequent in individuals with larger deletions, particularly those affecting TBC1D22A and CELSR12 = 18.734; degrees of freedom = 2; p < 0.001). Comparisons among other groups are further detailed in Table S4 of Supplemental Data.

2.3. Logistic Regression Analysis

To investigate the predictive value of genetic variables on clinical outcomes, several logistic regression models were constructed and evaluated. Two key models are highlighted below.
(i)
Prediction of speech delay (defined as the inability to form sentences): Independent variables included deletion size, presence of SHANK3 SNVs, and the number of deleted OMIM-annotated genes. The analysis revealed that for every 1 Mb increase in deletion size, the likelihood of significant speech delay increased by 85%. The odds ratio (OR) for deletion size was 1.85 (95% CI: 1.40–2.43; p < 0.001).
(ii)
Prediction of hypotonia: Independent variables included deletion size and GFAP score. The results indicated that larger deletions were significantly associated with an increased risk of hypotonia. The odds ratio for deletion size was 2.31 (95% CI: 1.75–3.12; p < 0.001).

2.4. Candidate Gene Analysis: GFAP Comparison Based on Clinical Status

To explore the potential involvement of candidate genes, we compared GFAP scores between individuals with and without specific clinical features. Binary variables (“1/Yes” vs. “0/No”) were analyzed using independent-samples Student’s t-tests, following Levene’s test for equality of variances. For variables with more than two categories, ANOVA was used. Variables showing statistically significant differences in GFAP scores based on clinical status are presented in Table 4.
Figure 4 and Figure 5 illustrate, as examples, two clinical variables that demonstrated statistically significant differences in GFAP scores. For each, we visualized the data using the UCSC Genome Browser (https://genome.ucsc.edu/ (accessed on 17 February 2025)) by grouping individuals based on the presence or absence of the clinical feature and plotting the corresponding median deletion sizes. This schematic approach facilitates the comparison of genomic intervals associated with differential clinical outcomes.

2.4.1. Nephro-Urological Alterations

We identified a set of genes that differ most prominently between individuals with and without nephro-urological alterations. These include TBC1D22A, CERK, GRAMD4, CELSR1, TRMU, CDPF1, PKDREJ, TTC38, and GTS1.

2.4.2. Lymphedema

We identified a set of genes that differ most prominently between individuals with and without lymphedema. These include TBC1D22A, CERK, GRAMD4, CELSR1, TRMU, CTSE1, CDPF1, TTC38, PKDREJ, PPARA, WNT7B, ATXN10, FBLN1, RIBC2, SMC1B, FAM118A, UPK3A, NUP50, KIAA0930, PHF21B, ARHGAP8, PRR5-ARHGAP8, PRR5, RTL6, PARVG, SHISAL1, PARVB, SAMM50, PNPLA3, PNPLA5, SULT4A1, EFCAB6, MPPED1, SCUBE1, TTLL12, TSPO, MCAT, and BIK.
The remaining clinical features that showed statistically significant associations in Table 4 are depicted in Figures S1–S9 of the Supplemental Data. Across the comparisons of all positive clinical conditions, we observed a recurrent and consistent haploinsufficient region in individuals exhibiting the different clinical traits. This shared region encompasses four key genes: TBC1D22A, CERK, GRAMD4, and CELSR1.

2.5. Analysis of Haploinsufficiency in Deleted Genes Within 22q13.31–22q13.33

To identify and characterize potential haploinsufficient regions within the 22q13.31–22q13.33 interval (8.3 Mb), we developed a composite haploinsufficiency score based on the sum of outputs from five established haploinsufficiency prediction algorithms (see Section 4). This yielded an overall score for each gene, visualized using a color-coded scheme and detailed in Table 5 and Table 6 below, as well as Table S5 of the Supplemental Data. These tables include the gene name, genomic coordinates, and calculated haploinsufficiency score. Gene functions, annotated through Gene Ontology (GO), are further illustrated. Based on the scores, genes were categorized as follows: Red: Score > 30, classified as highly haploinsufficient (Table 5); Yellow: Score > 20, classified as moderately haploinsufficient (Table 6); Orange: Score 10–20, classified as weakly haploinsufficient (Table S5a, Supplemental Data); Green: Score < 10, classified as very weakly haploinsufficient (Table S5b, Supplemental Data).
Based on the analysis of the previous tables, a customized track was created in the UCSC Genome Browser by entering the genomic coordinates of genes classified as highly and moderately haploinsufficient—highlighted in red and yellow, respectively. This visualization was used to identify potential haploinsufficient regions relevant to Phelan–McDermid syndrome within our cohort. As a result, three distinct regions of significant haploinsufficiency were identified and are illustrated in Figure 6.
The first haploinsufficient region (R1) comprises the following genes: G9 = BRD1, G10 = PIM3, G11 = PLXNB2, G12 = SBF1, G13 = MAPK8IP2, G14 = SHANK3, G20 = ZBED4, G21 = ALG12, and G22 = RABL2B. The second region (R2) includes G3 = PHF21B, G4 = NUP50, G5 = FBLN1, G6 = CELSR1, G7 = GRAMD4, G8 = TBC1D22A, G17 = PRR5, G18 = WNT7B, and G19 = PPARA. A third region (R3) is composed of G1 = SCUBE1, G2 = SULT4A1, G15 = TTLL12, and G16 = MPPED1. Region 1 includes the SHANK3 gene—considered the key gene in PMS—along with several neighboring genes. Region 2 aligns with the set of genes previously identified through regional comparisons (Section 4.3), particularly TBC1D22A, CERK, GRAMD4, and CELSR1. Interestingly, the mean haploinsufficiency score of genes in Region 2 (n = 9) is equivalent to that in Region 1 (n = 9), and notably higher than that of Region 3 (n = 4).

Analysis of Interaction Pathways Among Identified Haploinsufficient Genes

To examine the potential functional interactions among the haploinsufficient genes within Regions R1–R3, we analyzed known and predicted pathways using the STRING database (https://string-db.org/ (accessed on 17 February 2025)). Interactions were identified based on experimental data, text mining, and gene co-expression. The resulting interaction networks are illustrated in Figure 7A–C.
Gene–gene associations were identified both within and between the defined regions. Notably, inter-regional connections between genes at Regions R1 and R2 were observed through GRAMD4, TBC1D22A, and RABL2B (Figure 7A), while intra-regional associations within Region 1 were identified among PIM3, SHANK3, RABL2B, and ALG12 (Figure 7B). Additionally, connections were observed involving ZBED4, BRD1, and TBC1D22A, linking Regions R1 and R2, as well (Figure 7C).
The principal functions, metabolic pathways, and biological processes associated with the genes highly susceptible to haploinsufficiency in Regions R1 to R3 are summarized schematically in Figure 8.

3. Discussion

This retrospective study tests the hypothesis that genes beyond SHANK3 contribute to specific clinical features of Phelan–McDermid syndrome (PMS). The findings reinforce the concept of PMS as a contiguous gene syndrome, aligning with recent work by [19], while challenging the notion that SHANK3 alone accounts for the disorder’s phenotypic diversity. Although SHANK3 haploinsufficiency remains a central pathogenic mechanism, additional genes within the 22q13.31–22q13.33 region appear to influence key PMS features.

3.1. SHANK3 as the Core Driver, but Not the Sole Contributor

SHANK3 plays a pivotal role in synaptic development and function, explaining its association with hallmark neurological and behavioral traits of PMS, such as intellectual disability, ASD, and hypotonia. However, interstitial deletions that exclude SHANK3 but still present with PMS-like features suggest the involvement of additional genes, particularly in cases involving deletions larger than 0.25 Mb.
This study systematically defines three critical haploinsufficient regions (R1–R3) within 22q13.31–q13.33, emphasizing Region 2 (hg38, chr22: 44,702,204–47,175,693) as consistently associated with functional impairments reflected in GFAP scores. Interestingly, several genes within Region 2, including CERK, TBC1D22A, CELSR1, and GRAMD4 were found to be implicated in seizures, renal abnormalities, lymphedema, macrocephaly, and developmental delay in PMS [8,16,20,21,22].
Finally, we also highlight that the mean haploinsufficiency score of genes in Region 1 (which includes SHANK3) matches that of Region 2, it could suggest a comparable putative contribution to PMS pathogenesis of Region 2.

3.2. Deletion Size as a Key Determinant

GFAP analyses confirm that phenotypic severity correlates with deletion size, reinforcing the idea that additional genes modulate the clinical spectrum of PMS. Larger deletions are associated with poorer functional outcomes [3,23,24], whereas smaller deletions show clinical profiles similar to those observed in individuals with SHANK3 SNVs [3,4]. Even interstitial deletions not involving SHANK3 may present with PMS-like traits [7], supporting their inclusion in future PMS research.

3.3. Clusters and Phenotypic Variability

Phenotypic heterogeneity was evident across deletion size-based clusters. Cluster 3 (small deletions) exhibited milder phenotypes, while Cluster 4 (large deletions) was associated with more severe impairments, consistent with a cumulative effect of gene loss. Interestingly, the SHANK3 SNV (“Cluster 5”) showed functional overlap with Cluster 3, particularly in speech and motor outcomes, reinforcing the functional importance of SHANK3. Altogether also support the participation of genes beyond SHANK3.

3.4. Specific Phenotypes and Genetic Correlations

This study refines genotype–phenotype correlations for seizures, renal anomalies, and lymphedema, particularly in relation to deletions involving CELSR1 and GRAMD4 [22,25]. Larger deletions also strongly correlate with hypotonia and expressive language impairments [3]. Interestingly, the prevalence of ASD remained constant across deletion sizes, supporting SHANK3’s central role in this phenotype [2]. Conflicting findings regarding deletion size and ASD [19,23] may reflect diagnostic challenges in more severely affected individuals, where profound impairments may mask features of ASD or ADHD. Deletion size also influences cranial morphology (macrocephaly) associated with deletions > 3.5 Mb and microcephaly with smaller deletions [8,21]. A threshold of 2.7 Mb was associated with more severe developmental impairments, in agreement with prior breakpoint mapping studies [24].

3.5. Implications of Additional Genes

Candidate genes in Region-2 (including CERK, TBC1D22A, CELSR1, and GRAMD4) are likely contributors to PMS pathophysiology. CERK (ceramide kinase) is involved in lipid metabolism and potentially related to hypotonia and immune dysregulation. TBC1D22A participates in intracellular trafficking, associated with seizures and renal anomalies. CELSR1 is implicated in craniofacial and renal abnormalities, consistent with its role in planar cell polarity. GRAMD4 regulates apoptosis and is linked to seizures, lymphedema, and sensory processing dysfunction. These genes play crucial roles in the regulation of cell signaling, cytoskeleton organization, apoptosis, and immune response, participating in biological pathways essential for the maintenance of cellular homeostasis and embryonic development. Indeed, a few interactions among these genes (CERK, TBC1D22A, CELSR1, GRAMD4) have been identified in the available protein interaction databases. Interactions between CERK and TBC1D22A proteins have been reported in HeLa cells, detected by affinity capture and mass spectrometry techniques (BioGRID; http//:www.thebiogrid.org (accessed on 17 February 2025)). In addition, GRAMD4 is predicted to interact with other proteins, such as TBC1D22A and RABL2B (https://string-db.org/ (accessed on 16 February 2025)). Those interactions were based on evidence of gene co-expressions (with medium confidence), which suggests that both proteins may be involved in related cellular processes (Figure 7A). In the case of RABL2B interactions, these were also experimentally established (https://string-db.org/ (accessed on 16 February 2025)).
Regarding some functionality of these genes (TBC1D22A, CELSR1, and GRAMD4), we propose previously a region of 4.5 to 8 Mb from the telomere, strongly associated with macrocephaly in PMS [3], also supported by this study, and aligned with other works [21], suggesting to GRAMD4 associated with macrocephaly. TBC1D22A has been associated with structural or functional abnormalities of the head or the central nervous system (CNS), such as seizures, schizophrenia, or bipolar disorder [26,27]. In addition, CELSR1-deficient mouse has found to have microcephaly and cortical hypoplasia [28], but also with defects in neural tube and caudal agenesis [29], lymphedema [17], or several kidney disorders [22].
Additional genes from Regions 2 and 3 (PHF21B, SULT4A1, SCUBE1, and NUP50) we speculate that they may further contribute to cognitive deficits, neurodevelopmental features, and growth abnormalities.

3.6. Genetic Interactions and Future Directions

We hypothesize that interactions among these candidate genes and SHANK3 may account for the broad phenotypic variability observed in PMS. Currently, no direct interactions between the CERK, TBC1D22A, CELSR1, GRAMD4, and SHANK3 genes have been identified in the available protein interaction databases. Although their roles in neuronal development, cell signaling, and synaptic organization suggest that they could be involved in broader interaction networks that affect neuronal function, behavior, and beyond. On the other hand, we do know that SHANK3 is predicted to interact with at least six other genes located in 22q13.3—PLXNB2, BRD1, CELSR1, PHF21B, SULT4A1, and TFC20—underscoring the genetic complexity of the syndrome [20]. In this sense, in Figure 8 we describe a schematic pathway diagram illustrating the interactions between genes and their associated contributions to phenotypic manifestations in Phelan–McDermid syndrome.
In summary, our findings confirm significant functional variation among PMS patients based on deletion size, further supporting the classification of PMS as a contiguous gene syndrome. Future research should investigate whether these genes modulate SHANK3 expression or function, potentially revealing novel therapeutic targets.

4. Materials and Methods

4.1. PMS Spanish-Based Registry

This registry comprises 245 patients previously recruited through INGEMM (Institute of Medical and Molecular Genetics, Hospital Universitario La Paz, Madrid), in collaboration with the Spanish Association of PMS and the Argentine PMS Group [3]. Clinical data were obtained via two structured questionnaires completed by physicians, clinical geneticists, and caregivers, following informed consent. The study was approved by the Ethics Committee of Hospital Universitario La Paz (PI: 2735 HULP, Madrid, Spain). Parents or guardians provided informed consent.
In addition to clinical records, the registry includes curated genetic data derived from MLPA, karyotyping, FISH, and chromosomal microarray analysis (including high-resolution CMA in some cases). Cognitive behavioral evaluations and quality-of-life assessments were also conducted in a subset of 30 individuals. For this study, only data from the two primary questionnaires were analyzed, and hierarchical criteria were applied (e.g., nephro-urogenital anomalies as an umbrella category).

Generation of the Database from the “Spanish PMS Registry”

A comprehensive database was constructed using SPSS (version 29.0.2), with patient data pseudo-anonymized and coded from 1 to 213. The database contains both continuous and categorical variables. Key continuous variables include age at diagnosis, deletion size, GFAP score, number of deleted genes, and number of OMIM-annotated genes (with corresponding inheritance types, based on OMIM, updated as of 21 November 2024). Haploinsufficiency scores were also assigned. Sixty-four categorical variables representing various phenotypic features were incorporated.
The cohort includes individuals with confirmed diagnosis of PMS (mostly not related) from across Spain (n = 181) and South America (n = 32), and they were recruited between years 2008–2022. Twenty-seven of these individuals had incomplete clinical or molecular data and were not included in this study, but they form part of the registry. All cases analyzed are deposited in the DECIPHER Genomics repository under accession numbers 432868–433079 (IMMGPMS1–IMMGPMS211). The female-to-male ratio was 1.108:1 (112/101), with ages ranging from birth to 62 years. The majority of individuals with PMS in our cohort are of pediatric age (between 0 and 16 years old, 149 patients; 69.95%). The mean age at diagnosis was around 6 years old for deletions and around 8 years for the group with sequence variants in SHANK3. The mean age at evaluation were around 12.5 years and 11 years for deletions and SHANK3 sequence variants, respectively. Full demographic and complete clinical descriptions are available in [3].

4.2. The Study

This retrospective, experimental study was conducted using data from the Spanish PMS Registry, processed in SPSS (v. 29.0.2). Inclusion criteria were based primarily on the availability of high-resolution microarray genetic data. Chromosomal coordinates were standardized to the latest genome build, GRCh38/hg38, using the UCSC LiftOver tool (https://genome.ucsc.edu/ (accessed on 15 January 2025)).

4.3. Genetic Analysis

Of the 213 individuals in the cohort, 192 (90.14%) had deletions in the 22q13.3 region, while the remaining 21 patients (9.86%) harbored pathogenic or likely pathogenic single nucleotide variants (SNVs) in the SHANK3 gene. Deletion sizes ranged from ~0.01 Mb to over 10 Mb. Among those with deletions, 176 (91.66%) had terminal deletions, of which 13 (7.38%) were associated with translocations (de novo or inherited) and 20 (11.36%) with ring chromosome 22. Interstitial deletions including SHANK3 were identified in 13 individuals (6.77%), with an additional three interstitial deletions not involving SHANK3 (1.57%) [3]. Postzygotic mosaicism was identified in 17 cases (9.98%), and germline mosaicism was suspected in two others (0.94%).
The mean deletion size was 3.52 ± 2.54 Mb. Based on deletion size, individuals were classified into two groups: small deletions (≤0.25 Mb; 21.7%) and large deletions (>0.25 Mb; 78.3%). The 0.25 Mb threshold was selected because SHANK3 is located approximately 0.1–0.2 Mb from the telomere. The SNV group included pathogenic or likely pathogenic variants: 17 frameshift, one splice-site, one nonsense, and two missense variants of uncertain significance/likely pathogenic [3].

4.4. Patients’ Global Functional Assessment (GFAP) Construct

The GFAP (Global Functional Assessment of the Patients) test is a scoring system developed to quantify the overall clinical severity of individuals with Phelan–McDermid syndrome. It was designed using weighted clinical features from the Human Phenotype Ontology (HPO), based on their observed frequency in the studied cohort. Clinical traits occurring in more than 70% of patients were scored 10 points, those between 35 and 70% received 5 points, 20–35% achieved 2 points, and those under 20% were scored 1 point. The total GFAP score thus reflects the accumulation of core clinical manifestations, including developmental delay, speech impairment, hypotonia, and behavior anomalies. Its validity was confirmed through principal component analysis, ensuring that the score accurately reflects severity across diverse genetic subtypes. This allows comparisons among patients and subgroups, revealing, for instance, that larger deletions tend to be associated with higher GFAP scores, i.e., greater severity (for more information see [3]).

4.5. Global Haploinsufficiency Score of Deleted Genes in 22q13.31–q13.33

To identify regions of potential haploinsufficiency within 22q13.31–q13.33 (8.3 Mb), a composite score was calculated using five published haploinsufficiency prediction algorithms. For each gene, the following criteria were applied: (a) pLI [30]: <0.1 = 0 pts; 0.11–0.89 = 2 pts; >0.9 = 15 pts; (b) LOEUF [31]: <0.268 = 15 pts; >0.268 = 1 pt; (c) sHet [32]: >0.1 = 15 pts; <0.1 = 1 pt; (d) pHaplo [31]: >0.86 = 15 pts; 0.56–0.85 = 10 pts; <0.55 = 0 pts; (e) pTriplo [31]: >0.94 = 15 pts; 0.69–0.93 = 10 pts; <0.68 = 0 pts. These scores were summed to generate a global haploinsufficiency index for each gene.

4.6. Statistical Analysis

Statistical analyses were performed using SPSS Statistics v.29.0.2 (IBM Corporation, Chicago, IL, USA). Both univariate and multivariate methods were employed. Descriptive statistics (means, standard deviations) were calculated for continuous variables, while frequency distributions were used for categorical variables (coded as “1” for presence and “0” for absence of a feature).
Inferential statistics included Pearson and Spearman correlation coefficients, Chi-square tests, and ANOVA with Bonferroni and T3 Dunnett post hoc tests. Unsupervised clustering (Ward’s method) was applied to group individuals based on deletion size and/or GFAP score, facilitating comparative phenotypic analyses. Logistic and linear regression models were used to evaluate the predictive value of genetic variables for specific clinical outcomes. Statistical significance was set at p < 0.05, with p < 0.01 considered highly significant.

4.7. Study Limitations

While this study provides substantial evidence for genotype–phenotype correlations in PMS, several limitations should be acknowledged. (a) Sample Size: The relatively small number of patients with interstitial deletions limits the generalizability of findings related to genes outside SHANK3. (b) Lack of Longitudinal Data: As the data are cross-sectional, they do not capture changes in phenotypes over time. Longitudinal follow-up is needed to assess developmental trajectories and progression. (c) Non-Genetic Influences: The potential roles of epigenetic and environmental factors in shaping phenotypes remain underexplored. Integrating multi-omics data (e.g., transcriptomics, proteomics) could help elucidate pathways mediating gene dosage effects.

5. Conclusions

The findings of this study have important implications for the diagnosis and management of Phelan–McDermid syndrome (PMS), particularly in the context of comprehensive genomic profiling. Routine diagnostic workflows should incorporate high-resolution genomic technologies to accurately delineate deletion breakpoints and identify genes affected beyond SHANK3.
Future Personalized Interventions: Understanding the impact of deletion size and the contribution of specific gene losses may inform personalized therapeutic strategies. For example, individuals with large deletions may benefit from earlier and more intensive interventions focused on speech and motor development.
Prognostic Value of GFAP Scores: The GFAP score provides a quantitative, functional assessment of clinical severity and may serve as a valuable prognostic tool to prioritize therapeutic planning.
In conclusion, this study reaffirms the central role of SHANK3 in PMS while underscoring the relevance of additional genes in contributing to the disorder’s phenotypic complexity. By characterizing PMS as a contiguous gene syndrome, these findings support the implementation of more precise diagnostic practices and the development of tailored, gene-informed interventions.

Supplementary Materials

The following supporting information can be accessed online at: https://www.mdpi.com/article/10.3390/ijms26104653/s1.

Author Contributions

All authors contributed to the revision of the manuscript. The study was designed by J.N. and supervised by P.D.L. The manuscript was written by J.N., P.D.L. and J.T.-C. Statistical analyses were performed by J.N. and B.E.; microarray analyses were conducted by C.A. and Y.M.-G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was approved by the Ethical Scientific Committee of Hospital Universitario La Paz (HULP, Madrid, Spain; Protocol ID: PI 2735, FIBHULP), and was conducted in accordance with national regulations for human research. It complies with the principles outlined in the Declaration of Helsinki (2000).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Additional supporting information is available in the Supplementary Materials section online. All patient data have been submitted to the DECIPHER database (http://www.deciphergenomics.org (accessed on 12 April 2022)).

Acknowledgments

We gratefully acknowledge the support of the Phelan–McDermid España patient association. We thank all participating families for their invaluable collaboration.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Gómez Taylor, B.; Moreno Sancho, M.L.; Drehmer Rieger, E.; Carrera Juliá, S.; Nevado, J.; Sempere Ferre, F. Prevalence of the Phelan-McDermid Syndrome in Spain. Rev. Esp. Salud Pública 2020, 94, e202012121. [Google Scholar]
  2. Schön, M.; Lapunzina, P.; Nevado, J.; Mattina, T.; Gunnarsson, C.; Hadzsiev, K.; Verpelli, C.; Bourgeron, T.; Jesse, S.; van Ravenswaaij-Arts, C.M.A.; et al. Definition and clinical variability of SHANK3-related Phelan-McDermid syndrome. Eur. J. Med. Genet. 2023, 66, 104754. [Google Scholar] [CrossRef] [PubMed]
  3. Nevado, J.; García-Miñaúr, S.; Palomares-Bralo, M.; Vallespín, E.; Guillén-Navarro, E.; Rosell, J.; Bel-Fenellós, C.; Mori, M.Á.; Milá, M.; Campo, M.D.; et al. Variability in Phelan-McDermid Syndrome in a Cohort of 210 Individuals. Front. Genet. 2022, 13, 652454. [Google Scholar] [CrossRef] [PubMed]
  4. Levy, T.; Foss-Feig, J.H.; Betancur, C.; Siper, P.M.; Trelles-Thorne, M.d.P.; Halpern, D.; Frank, Y.; Lozano, R.; Layton, C.; Britvan, B.; et al. Strong evidence for genotype–Phenotype correlations in Phelan-McDermid syndrome: Results from the developmental synaptopathies consortium. Hum. Mol. Genet. 2022, 31, 625–637. [Google Scholar] [CrossRef]
  5. De Rubeis, S.; Siper, P.M.; Durkin, A.; Weissman, J.; Muratet, F.; Halpern, D.; Trelles, M.P.; Frank, Y.; Lozano, R.; Ting Wang, A.; et al. Delineation of the Genetic and Clinical Spectrum of Phelan-McDermid Syndrome Caused by SHANK3 point Mutations. Mol. Autism. 2018, 9, 31. [Google Scholar] [CrossRef]
  6. Durand, C.M.; Betancur, C.; Boeckers, T.M.; Bockmann, J.; Chaste, P.; Fauchereau, F.; Nygren, G.; Rastam, M.; Gillberg, I.C.; Anckarsäter, H.; et al. Mutations in the gene encoding the synaptic scaffolding protein SHANK3 are associated with autism spectrum disorders. Nat. Genet. 2006, 39, 25–27. [Google Scholar]
  7. Ricciardello, A.; Tomaiuolo, P.; Persico, A.M. Genotype–Phenotype correlation in Phelan-McDermid syndrome: A comprehensive review of chromosome 22q13 deleted genes. Am. J. Med. Genet. Part A 2021, 185, 2211–2233. [Google Scholar]
  8. Vitrac, A.; Leblond, C.S.; Rolland, T.; Cliquet, F.; Mathieu, A.; Maruani, A.; Delorme, R.; Schön, M.; Grabrucker, A.M.; van Ravenswaaij-Arts, C.; et al. Dissecting the 22q13 region to explore the genetic and phenotypic diversity of patients with Phelan-McDermid syndrome. Eur. J. Med. Genet. 2023, 66, 104732. [Google Scholar]
  9. Gergoudis, K.; Weinberg, A.; Templin, J.; Farmer, C.; Durkin, A.; Weissman, J.; Siper, P.; Foss-Feig, J.; del Pilar Trelles, M.; Bernstein, J.A.; et al. Psychometric Study of the Social Responsiveness Scale in Phelan–McDermid Syndrome. Autism Res. 2020, 13, 1383–1396. [Google Scholar]
  10. Burdeus-Olavarrieta, M.; San José-Cáceres, A.; García-Alcón, A.; González-Peñas, J.; Hernández-Jusdado, P.; Parellada-Redondo, M. Characterisation of the Clinical Phenotype in Phelan-McDermid Syndrome. J. Neurodev. Disord. 2021, 13, 26–40. [Google Scholar]
  11. Mitz, A.R.; Philyaw, T.J.; Boccuto, L.; Shcheglovitov, A.; Sarasua, S.M.; Kaufmann, W.E.; Thurm, A. Identification of 22q13 genes most likely to contribute to Phelan McDermid syndrome. Eur. J. Hum. Genet. 2018, 26, 293–302. [Google Scholar] [CrossRef] [PubMed]
  12. Xu, N.; Lv, H.; Yang, T.; Du, X.; Sun, Y.; Xiao, B.; Fan, Y.; Luo, X.; Zhan, Y.; Wang, L.; et al. A 29 Mainland Chinese cohort of patients with Phelan–McDermid syndrome: Genotype–phenotype correlations and the role of SHANK3 haploinsufficiency in the important phenotypes. Orphanet J. Rare Dis. 2020, 15, 335. [Google Scholar] [CrossRef] [PubMed]
  13. McCoy, M.D.; Sarasua, S.M.; DeLuca, J.M.; Davis, S.; Phelan, K.; Rogers, R.C.; Boccuto, L. State of the Science for Kidney Disorders in Phelan-McDermid Syndrome: UPK3A, FBLN1, WNT7B, and CELSR1 as Candidate Genes. Genes 2022, 13, 1042. [Google Scholar] [CrossRef]
  14. Vetrini, F.; McKee, S.; Rosenfeld, J.A.; Suri, M.; Lewis, A.M.; Nugent, K.M.; Roeder, E.; Littlejohn, R.O.; Holder, S.; Zhu, W.; et al. De novo and inherited TCF20 pathogenic variants are associated with intellectual disability, dysmorphic features, hypotonia, and neurological impairments with similarities to Smith-Magenis syndrome. Genome Med. 2019, 11, 12. [Google Scholar] [CrossRef]
  15. Gonzalez-Garay, M.L.; Aldrich, M.B.; Rasmussen, J.C.; Guilliod, R.; Lapinski, P.E.; King, P.D.; Sevick-Muraca, E.M. A novel mutation in CELSR1 is associated with hereditary lymphedema. Vasc Cell. 2016, 8, 1. [Google Scholar] [CrossRef]
  16. Smith, M.S.; Sarasua, S.M.; Rogers, C.; Phelan, K.; Boccuto, L. Lymphedema is associated with CELSR1 in Phelan-McDermid syndrome. Clin. Genet. 2023, 104, 472–478. [Google Scholar] [CrossRef]
  17. Maltese, P.E.; Michelini, S.; Ricci, M.; Maitz, S.; Fiorentino, A.; Serrani, R.; Lazzerotti, A.; Bruson, A.; Paolacci, S.; Benedetti, S.; et al. Increasing evidence of hereditary lymphedema caused by CELSR1 loss-of-function variants. Am. J. Med. Genet A. 2019, 179, 1718–1724. [Google Scholar] [CrossRef]
  18. Mellado, S.C. Síndromes por Microdeleción. Rev. Chil. Pediatría 2004, 75, 473–482. [Google Scholar] [CrossRef]
  19. Yin, R.; Wack, M.; Hassen-Kodja, C.; Mc Duffie, M.T.; Bliss, G.; Horn, E.J.; Kothari, C.; McLearny, B.; Davis, R.; Hanson, K.; et al. Phenome-wide profiling identifies genorype-phenotype associations in Phelan-McDermid syndrome using family-sourced data from an international registry. Mol. Autism 2024, 15, 40. [Google Scholar] [CrossRef]
  20. Mitz, A.R.; Boccuto, L.; Thurm, A. Evidence for common mechanisms of pathology between SHANK3 and other genes of Phelan-McDermid syndrome. Clin. Genet. 2024, 105, 459–469. [Google Scholar] [CrossRef]
  21. Sarasua, S.M.; DeLuca, J.M.; Rogers, C.; Phelan, K.; Rennert, L.; Powder, K.E.; Weisensee, K.; Boccuto, L. Head Size in Phelan-McDermid Syndrome: A Literature Review and Pooled Analysis of 198 Patients Identifies Candidate Genes on 22q13. Genes 2023, 14, 540. [Google Scholar] [CrossRef] [PubMed]
  22. McCoy, M.D.; Sarasua, S.M.; DeLuca, J.M.; Davis, S.; Rogers, R.C.; Phelan, K.; Boccuto, L. Genetics of kidney disorders in Phelan-McDermid syndrome: Evidence from 357 registry participants. Pediatr. Nephrol. 2024, 39, 749–760. [Google Scholar] [CrossRef] [PubMed]
  23. Sarasua, S.M.; Dwivedi, A.; Boccuto, L.; Rollins, J.D.; Chen, C.-F.; Rogers, R.C.; Phelan, K.; DuPont, B.R.; Collins, J.S. Association between deletion size and important phenotypes expands the genomic region of interest in Phelan-McDermid syndrome (22q13 deletion syndrome). J. Med. Genet. 2011, 48, 761–766. [Google Scholar] [CrossRef] [PubMed]
  24. Tabet, A.-C.; Rolland, T.; Ducloy, M.; Lévy, J.; Buratti, J.; Mathieu, A.; Haye, D.; Perrin, L.; Dupont, C.; Passemard, S.; et al. A framework to identify contributing genes in patients with Phelan-McDermid syndrome. NPJ Genom. Med. 2017, 2, 32. [Google Scholar] [CrossRef]
  25. Samogy-Costa, C.I.; Varella-Branco, E.; Monfardini, F.; Ferraz, H.; Fock, R.A.; Barbosa, R.H.A.; Pessoa, A.L.S.; Perez, A.B.A.; Lourenço, N.; Vibranovski, M.; et al. A Brazilian cohort of individuals with Phelan-McDermid syndrome: Genotype-phenotype correlation and identification of an atypical case. J. Neurodev. Disord. 2019, 11, 13. [Google Scholar] [CrossRef]
  26. Belhedi, N.; Bena, F.; Mrabet, A.; Guipponi, M.; Souissi, C.B.; Mrabet, H.K.; Elgaaied, A.B.; Malafosse, A.; Salzmann, A. A New Locus on Chromosome 22q13.31 Linked to Recessive Genetic Epilepsy with Febrile Seizures Plus (GEFS+) in a Tunisian Consanguineous Family. BMC Genet. 2013, 14, 93. [Google Scholar] [CrossRef]
  27. Jain, L.; Oberman, L.M.; Beamer, L.; Cascio, L.; May, M.; Srikanth, S.; Skinner, C.; Jones, K.; Allen, B.; Rogers, C.; et al. Genetic and Metabolic Profiling of individuals with Phelan-McDermid Syndrome Presenting with Seizures. Clin. Genet. 2022, 101, 87–100. [Google Scholar] [CrossRef]
  28. Boucherie, C.; Boutin, C.; Jossin, Y.; Schakman, O.; Goffinet, A.M.; Ris, L.; Gailly, P.; Tissir, F. Neural Progenitor Fate Decision Defects, Cortical Hypoplasia and Behavioral Impairment in Celsr1-Deficient Mice. Mol. Psychiatry 2018, 23, 723–734. [Google Scholar] [CrossRef]
  29. Allache, R.; De Marco, P.; Merello, E.; Capra, V.; Kibar, Z. Role of the planar cell polarity gene CELSR1 in neural tube defects and caudal agenesis. Birth Defects Res. A Clin. Mol. Teratol. 2012, 94, 176–181. [Google Scholar] [CrossRef]
  30. Samocha, K.E.; Robinson, E.B.; Sanders, S.J.; Stevens, C.; Sabo, A.; McGrath, L.M.; Kosmicki, J.A.; Rehnström, K.; Mallick, S.; Kirby, A.; et al. A framework for the interpretation of de novo mutation in human disease. Nat. Genet. 2014, 46, 944–950. [Google Scholar] [CrossRef]
  31. Karczewski, K.J.; Francioli, L.C.; Tiao, G.; Cummings, B.B.; Alföldi, J.; Wang, Q.; Collins, R.L.; Laricchia, K.M.; Ganna, A.; Birnbaum, D.P.; et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 2020, 581, 434–443. [Google Scholar] [CrossRef]
  32. Lek, M.; Karczewski, K.J.; Minikel, E.V.; Samocha, K.E.; Banks, E.; Fennell, T.; O’Donnell-Luria, A.H.; Ware, J.S.; Hill, A.J.; Cummings, B.B.; et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature 2016, 536, 285–291. [Google Scholar] [CrossRef]
Figure 1. Comparison of the different patient subgroups based on their mean GFAP values (arbitrary units) and standard deviations. Asterisks (*) indicate statistically significant differences (p ≤ 0.01). Numbers within the black bars represent the mean ± standard deviation of the deletion size for each corresponding group.
Figure 1. Comparison of the different patient subgroups based on their mean GFAP values (arbitrary units) and standard deviations. Asterisks (*) indicate statistically significant differences (p ≤ 0.01). Numbers within the black bars represent the mean ± standard deviation of the deletion size for each corresponding group.
Ijms 26 04653 g001
Figure 2. Correlation between deletion size and GFAP score. The scatter plot illustrates a positive correlation between deletion size and GFAP scores, with larger deletions corresponding to more severe phenotypes and higher GFAP values. The red trend line highlights the overall linear relationship. The arrow denotes a potential inflection point beyond which functional outcomes appear to worsen more markedly.
Figure 2. Correlation between deletion size and GFAP score. The scatter plot illustrates a positive correlation between deletion size and GFAP scores, with larger deletions corresponding to more severe phenotypes and higher GFAP values. The red trend line highlights the overall linear relationship. The arrow denotes a potential inflection point beyond which functional outcomes appear to worsen more markedly.
Ijms 26 04653 g002
Figure 3. Representation of deletion-based clusters and SHANK3 variants as a function of GFAP scores. Asterisks (*) indicate statistically significant differences (p < 0.05). The ampersand symbol (&) denotes a trend toward significance that does not reach the conventional threshold of statistical significance (p ≥ 0.05).
Figure 3. Representation of deletion-based clusters and SHANK3 variants as a function of GFAP scores. Asterisks (*) indicate statistically significant differences (p < 0.05). The ampersand symbol (&) denotes a trend toward significance that does not reach the conventional threshold of statistical significance (p ≥ 0.05).
Ijms 26 04653 g003
Figure 4. Graphical representation using the UCSC Genome Browser of patients with and without nephro-urological alterations. (A) Patients without nephro-urological alterations; (B) patients with nephro-urological alterations; (C) comparison of the haploinsufficiency regions between groups A and B. Genomic coordinates are based on the GRCh38 (hg38) reference assembly.
Figure 4. Graphical representation using the UCSC Genome Browser of patients with and without nephro-urological alterations. (A) Patients without nephro-urological alterations; (B) patients with nephro-urological alterations; (C) comparison of the haploinsufficiency regions between groups A and B. Genomic coordinates are based on the GRCh38 (hg38) reference assembly.
Ijms 26 04653 g004
Figure 5. Graphical representation using the UCSC Genome Browser of patients with and without lymphedema. (A) Patients without lymphedema; (B) patients with lymphedema; (C) comparison of the haploinsufficiency regions between groups A and B. Genomic coordinates are based on the GRCh38 (hg38) reference genome.
Figure 5. Graphical representation using the UCSC Genome Browser of patients with and without lymphedema. (A) Patients without lymphedema; (B) patients with lymphedema; (C) comparison of the haploinsufficiency regions between groups A and B. Genomic coordinates are based on the GRCh38 (hg38) reference genome.
Ijms 26 04653 g005
Figure 6. Visualization of genes and genomic regions within 22q13.31–q13.33 identified as highly or moderately susceptible to haploinsufficiency. Genomic coordinates are based on the GRCh38 (hg38) reference assembly. Based on the scores, genes were categorized as follows: Red: Score > 30, classified as highly haploinsufficient; Yellow: Score > 20, classified as moderately haploinsufficient.
Figure 6. Visualization of genes and genomic regions within 22q13.31–q13.33 identified as highly or moderately susceptible to haploinsufficiency. Genomic coordinates are based on the GRCh38 (hg38) reference assembly. Based on the scores, genes were categorized as follows: Red: Score > 30, classified as highly haploinsufficient; Yellow: Score > 20, classified as moderately haploinsufficient.
Ijms 26 04653 g006
Figure 7. Interaction networks of selected haploinsufficient genes generated using STRING (https://string-db.org/ (accessed on 16 February 2015)). (A) Proteins interacting with GRAMD4; (B) Proteins interacting with PIM3; (C) Proteins interacting with ZBED4.
Figure 7. Interaction networks of selected haploinsufficient genes generated using STRING (https://string-db.org/ (accessed on 16 February 2015)). (A) Proteins interacting with GRAMD4; (B) Proteins interacting with PIM3; (C) Proteins interacting with ZBED4.
Ijms 26 04653 g007
Figure 8. Schematic pathway diagram illustrating the interactions between genes and their associated contributions to phenotypic manifestations in Phelan–McDermid syndrome. Blue nodes represent genes (e.g., SHANK3, GRAMD4, CELSR1), while boxes denote phenotypes and functional disruptions, including seizures, cranial abnormalities, and lipid metabolism dysfunctions, among others. Connecting lines indicate putative interactions/connections, illustrating how gene disruptions may lead to specific clinical outcomes in PMS. ASD; Autism spectrum disorder.
Figure 8. Schematic pathway diagram illustrating the interactions between genes and their associated contributions to phenotypic manifestations in Phelan–McDermid syndrome. Blue nodes represent genes (e.g., SHANK3, GRAMD4, CELSR1), while boxes denote phenotypes and functional disruptions, including seizures, cranial abnormalities, and lipid metabolism dysfunctions, among others. Connecting lines indicate putative interactions/connections, illustrating how gene disruptions may lead to specific clinical outcomes in PMS. ASD; Autism spectrum disorder.
Ijms 26 04653 g008
Table 1. Pearson’s correlations between continuous variables.
Table 1. Pearson’s correlations between continuous variables.
Variable-1Variable-2Pearson’s rp-Value
(Bilateral)
Interpretation
Deletion size GFAP * score0.359<0.0001Positive correlation: larger deletions linked to higher GFAP scores.
Number of genes0.949<0.0001Strong positive correlation.
Number of OMIM genes0.894<0.0001Strong positive correlation.
Number of OMIM genes with AD0.844<0.0001Strong positive correlation.
Number of OMIM genes with AR0.819<0.0001Strong positive correlation.
GFAP scoreNumber of genes0.377<0.0001Positive correlation: more genes lost lead to higher GFAP scores.
Number of OMIM genes0.378<0.001Strong positive correlation.
Number of OMIM genes with AD0.280<0.001Strong positive correlation.
Number of OMIM genes with AR0.376<0.001Strong positive correlation.
Age at diagnosisDeletion size−0.2490.001Inverse correlation: larger deletions diagnosed earlier.
GFAP score−0.1830.009Inverse correlation: higher GFAP scores diagnosed earlier.
* GFAP, global functional assessment of the patient; OMIM, online mendelian inheritance in man; AD, autosomal dominant inheritance; AR, autosomal recessive inheritance.
Table 2. Relationship between categorical variables and genetic clusters, showing statistically significant associations as determined by the Chi-square test.
Table 2. Relationship between categorical variables and genetic clusters, showing statistically significant associations as determined by the Chi-square test.
Chi-Square Test Among ClustersClusters with Statistical AssociationsValuedfSignificance (p) (Bilateral)
Ability to make Sentences CL1&3; CL2&3; CL3&435.5984<0.0001 **
Walk independent before 15 monthsCL1&3; CL2&3; CL3&442.8894<0.0001 **
Alteration of Growth PC > 95% or PC < 3%CL4&3; CL4&518.04110.021 *
HypotoniaCL1&3; CL2&3; CL3&425.2044<0.0001 **
Deep set eyes CL2&511.76440.02 *
Full browCL1&5; CL2&511.76440.019 *
Full/puffy eyelidsCL1&3; CL2&311.76440.004 *
Full/puffy cheeksCL1&310.34240.035 *
Toe syndactyly CL3&421.4894<0.0001 **
Large and fleshly handsCL2&314.57240.006 *
Sphincter controlCL1&2; CL1&3; CL1&5; CL2&5; CL4&529.7894<0.0001 **
Nephro-urological anomaliesCL1&4; CL3&4; CL4&516.50940.002
ObesityNo cases in CL3, CL4, CL510.41040.034 FeT *
LymphedemaCL1&2; CL1&4; no cases in CL522.0554<0.0001 **
High sensitive (touch, caress, skim)CL3&511.13310.025 *
V: test statistic value; gl: degrees of freedom; p: p-value. The table presents only those variables demonstrating significant differences across clusters, highlighting their interrelation. CL, Cluster; FeT, Fisher’s exact Test. ** A p-value less than 0.001 on Student’s t-test indicates a highly significant difference, while a p-value less than 0.05 * indicates a significant difference.
Table 3. Relationship between categorical variables and genetic groups defined by deletion size and SHANK3 variants, showing statistically significant associations as determined by the Chi-square test.
Table 3. Relationship between categorical variables and genetic groups defined by deletion size and SHANK3 variants, showing statistically significant associations as determined by the Chi-square test.
Chi-Square TestGroup 1Group 3
(a)Deletions
<0.25 Mb (N = 37)
SHANK3
SNVs (N = 18)
ValuedfSignificance (p) (Bilateral)Interpretation
Ability to make sentences 2454.10210.038 *Group 3 has a significantly lower chance of having the ability to make sentences compared to Group 1
Teeth anomalies1625.8210.028 FeT *Group 3 has a significantly lower chance of having teeth anomalies compared to Group 1
Sphincter control8144.7110.030 *Group 3 has a significantly higher chance of having sphincter control compared to Group 1
Sleeping problems795.8310.016 *Group 3 has a significantly higher chance of having sleeping anomalies compared to Group 1
Highly sensitive (touch, caress, skim)11127.1110.008 *Group 3 has a significantly higher chance of being very sensitive to touch compared to Group 1
Chi-square testGroup 2Group 3
(b)Deletions
>0.25 Mb (N = 150)
SHANK3
SNVs (N = 18)
ValuedfSignificance (p) (bilateral)
Walk independently before 15 months30138.84310.004 FeT *Group 3 (SNVs) have a much higher chance of walking before 15 months compared to Group 2
Alteration of Growth PC > 95% or PC < 3%75225.4691<0.0001 FeT *Group 3 is significantly less likely to exhibit growth anomalies compared to Group 2
Hypotonia121104.30810.038 *Group 3 has a significantly lower chance of having hypotonia compared to Group 2
Epicanthal folds4314.03410.045 FeT *Group 2 has a significantly higher chance of having epicanthus compared to Group 3
Deep set eyes 3805.57510.014 FeT *Group 3 has no chance of having deep set eyes compared to Group 2
Full brow6418.69110.003 FeT *Group 3 is significantly less likely to exhibit full brown compared to Group 2
Chi-square testGroup 1Group 2
(c)Deletions
<0.25 Mb (N = 37)
Deletions
>0.25 Mb (N = 150)
ValuedfSignificance (p) (bilateral)Interpretation
Ability to make Sentences 242233.0951<0.0001 *Group 2 has a very significantly lower chance of having the ability to make sentences compared to Group 1
Walk independently before 15 months193015.0861<0.0001 *Group 1 has a much higher chance of walking before 15 months compared to Group 2
Alteration of Growth PC > 95% or PC < 3%9757.7231<0.005 *Group 1 is significantly less likely to exhibit growth anomalies compared to Group 2
Hypotonia211219.28510.002 *Group 2 has a significantly higher chance of having hypotonia compared to Group 1
Strabismus4455.65210.021 FeT *Group 2 has a very higher chance of having strabismus compared to Group 1
Full/puffy eyelids2418.05910.004 FeT *Group 2 has a very higher chance of having full/puffy eyelids compared to Group 1
Full/puffy cheeks3395.45510.026 FeT *Group 2 has a very higher chance of having full/puffy cheeks compared to Group 1
Toe syndactyly 3488.54110.003 FeT *Group 2 has a very higher chance of having toe syndactyly compared to Group 1
Large/fleshly hands119015.948 0.001 *Group 2 has a very higher chance of having large/fleshly hands compared to Group 1
Sphincter control9174.18410.041 *Group 1 has a significantly higher chance of having sphincter control compared to Group 2
Pulling the hair (to others or themself)3395.45510.026 FeT *Group 2 has a significantly higher chance of having pulling the hair compared to Group 1
V: test statistic value; gl: degrees of freedom; p: p-value. The table displays only those variables with significant differences, emphasizing their interrelations across genetic subgroups. FeT, Fisher’s exact test. * p-value less than 0.05 indicates a significant difference.
Table 4. Statistical analysis of GFAP scores in individuals with deletions (N = 189), comparing those who present the clinical feature (coded as “1”) versus those who do not (coded as “0”).
Table 4. Statistical analysis of GFAP scores in individuals with deletions (N = 189), comparing those who present the clinical feature (coded as “1”) versus those who do not (coded as “0”).
Descriptive and Frequencies Statistics
Median
Mb
Median
GFAP
Mean
GFAP
FrequencyPercentage
GenderMale---8544.97%
Female---10455.02%
Total---189100%
& Developmental delay (walk before/after 15 months)>15 months3.47112110.10 ± 38.26 **13873.79%
Normal-----
<15 months1.397270.26 ± 38.994926.20%
Total---187100%
Ability to make sentencesNot3.90136132.21 ± 30.9914376.47%
Yes0,286560.56 ± 37.78 **4423.52%
Total---187100%
HypotoniaNot0,775253.66 ± 34,314524.06%
Yes30.85115.5114.23 ± 33.02 **14275.93%
Total---187100%
SeizuresNot2.8894.3494.55 ± 41.8912968.98%
Yes4.14115.5111.03 ± 40.91 *5831.01%
Total---187100%
$ Craneal anomaliesNormal3.03107100.58 ± 44.0310254.25%
Microcephaly2.559590.05 ± 38.703719.68%
Macrocephaly5107106.63 ± 41.274926.06%
Total---188100%
SyndactilyNot2.859794.34 ± 43.0613672.72%
Yes4.17115113.84 ± 36.47 **5127.27%
Total---187100%
Ophtalmological alterationsNot2.919793.39 ± 40.1914678.07%
Yes3.94126121.97 ± 41.99 **4121.92%
Total---187100%
Nephro-urological alterationsNot2.9510195.49 ± 40.6114677.65%
Yes4.25122116.07 ± 45.34 **4222.34%
Total --188100%
Dermatological anomaliesNot2.9510196.52 ± 42.5814677.65%
Yes4.07112.5112.5 ± 40.15 *4222.34%
Total---188100%
ObesityNot3.1610299.92 ± 42.7018497.87%
Yes6.91112.5107.5 ± 33.2742.12%
Total---188100%
LymphedemaNot2.9410197.23 ± 42.5016990.37%
Yes7.35125.5122.5 ± 31.69 **18962%
Total---187100%
The table includes only variables that showed statistically significant differences, highlighting associations between specific clinical traits and functional impairment as measured by GFAP. ** A p-value less than 0.001 on Student’s t-test indicates a highly significant difference, while a p-value less than 0.05 * indicates a significant difference. & (F: 12.609; mean square: 97.792; df < 1; p < 0.001); $ (F: 7.634; mean squared: 58.520; df: 2; p < 0.001).
Table 5. Gene clustering with a score > 30. Highly haploinsufficient genes. Genomic coordinates in GCRh38 (hg38).
Table 5. Gene clustering with a score > 30. Highly haploinsufficient genes. Genomic coordinates in GCRh38 (hg38).
GENCOORDINATESSCORE (AU)
SCUBE1chr22: 43197280–4334337232
SULT4A1chr22: 43824509–4386251350
PHF21Bchr22: 44881162–4501000546
NUP50chr22: 45163925–4518801734
FBLN1chr22: 45502238–4560113546
CELSR1chr22: 46360834–4653762056
GRAMD4chr22: 46576012–4667979070
TBC1D22Achr22: 46762617–4717569934
BRD1chr22: 49773283–4982787366
PIM3chr22: 49960768–4996407255
PLXNB2chr22: 50274979–5030764632
SBF1chr22: 50443219–5048392332
MAPK8IP2chr22: 50600793–5061398160
SHANK3chr22: 50674408–5073321245
Average *
* The average score value in genes >30 is 47. AU, arbitrary units.
Table 6. Gene clustering with GFAP score > 20. Moderately haploinsufficient genes. Genomic coordinates in GCRh38 (hg38).
Table 6. Gene clustering with GFAP score > 20. Moderately haploinsufficient genes. Genomic coordinates in GCRh38 (hg38).
GENCOORDINATESSCORE (AU)
TTLL12chr22: 43197280–4318713422
MPPED1chr22: 43411196–4450784826
PRR5chr22: 44668547–4473768128
WNT7Bchr22: 45920366–4597716227
PPARAchr22: 46150521–4624375527
ZBED4chr22: 49853844–4989008028
ALG12chr22: 49900229–4991843827
RABL2Bchr22: 50767501–5078366726
Average *
* The mean value of the score in genes > 20 is 26.375. AU, arbitrary units.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Nevado, J.; Escalada, B.; Muñoz-GªPorrero, Y.; Adan, C.; Tenorio-Castaño, J.; Lapunzina, P.D. Genotype–Phenotype Associations in Phelan–McDermid Syndrome: Insights into Novel Genes Beyond SHANK3. Int. J. Mol. Sci. 2025, 26, 4653. https://doi.org/10.3390/ijms26104653

AMA Style

Nevado J, Escalada B, Muñoz-GªPorrero Y, Adan C, Tenorio-Castaño J, Lapunzina PD. Genotype–Phenotype Associations in Phelan–McDermid Syndrome: Insights into Novel Genes Beyond SHANK3. International Journal of Molecular Sciences. 2025; 26(10):4653. https://doi.org/10.3390/ijms26104653

Chicago/Turabian Style

Nevado, Julian, Blanca Escalada, Yolanda Muñoz-GªPorrero, Carmen Adan, Jair Tenorio-Castaño, and Pablo Daniel Lapunzina. 2025. "Genotype–Phenotype Associations in Phelan–McDermid Syndrome: Insights into Novel Genes Beyond SHANK3" International Journal of Molecular Sciences 26, no. 10: 4653. https://doi.org/10.3390/ijms26104653

APA Style

Nevado, J., Escalada, B., Muñoz-GªPorrero, Y., Adan, C., Tenorio-Castaño, J., & Lapunzina, P. D. (2025). Genotype–Phenotype Associations in Phelan–McDermid Syndrome: Insights into Novel Genes Beyond SHANK3. International Journal of Molecular Sciences, 26(10), 4653. https://doi.org/10.3390/ijms26104653

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop