Abstract
Background: Autism spectrum disorder (ASD) is a common condition with substantial personal and financial burdens of lifelong implication. Multiple twin studies have confirmed a genetic or inherited component at ~80%, higher than any other common condition. However, ASD’s rapidly accelerating prevalence, now at 1 in 31 in the USA, appears to defy a predominantly genetic basis and implicate our rapidly changing environment. A potential explanation for this paradox is a recent increase in de novo variants (DNVs), which are “new” mutations present in the patient yet absent in both parents. The present authors recently reported using trio whole-genome sequencing (trio-WGS) that DNVs highly likely to be highly disease-associated (“Principal Diagnostic Variants”, PDVs), mostly missense variants, were present in (25/50) 50% of the ASD patients clinically evaluated by our team. Methods: The current study was designed to support this observation with trio-WGS in 100 additional unrelated ASD patients. Results: De novo PDVs were identified in 47/100 (47%) of cases, in close approximation to our previous work. Using non-transcribed (up and downstream) variants for all genes as a control group, these DNV-PDVs were far more likely (p < 0.0001, OR 5.8, 95% C.I. 2.9–11) to be in SFARI-listed genes associated with ASD. Consistent with the emerging polygenic model, using the same analyses, inherited missense variants were also associated with ASD (p < 0.0001). Highly unexpectedly, silent variants, both inherited (p < 0.0001) and de novo (p < 0.007), were also statistically associated with ASD, and, among inherited variants, silent variants were more associated with ASD than were missense variants (p < 0.0001). Adding silent DNVs as PDVs increases the proportion of our subjects with at least one DNV-PDV to 55% of the subjects. Conclusions: Our proposed model for ASD, with prominent DNVs in most that are genetic yet not inherited, predicts the known predominant genetic pathogenesis and the accelerating prevalence of ASD, possibly from environmental factors, including insufficient nutrients and toxicant exposures, and/or the disrupted folate metabolism known to be associated with ASD. Limitations to this study include predominant inclusion of severely affected individuals and the lack of an unaffected control group and functional validation of variant pathogenicity.
1. Introduction
Autism spectrum disorder (ASD) is a neurodevelopmental disorder present in early life with a core dual deficit in social communication and repetitive and/or restricted interests or behaviors [1]. Diagnosis is currently based on the observable behavioral phenotype with little convincing evidence of consistent biomarkers, suggesting biological heterogeneity [2]. Multiple studies have demonstrated that the heritability component in ASD is about 80% [3,4,5], which is the highest reported among all common (prevalence > 1%) disorders. However, frequent episodes of acute/sub-acute onset of severe ASD symptoms following environmental physiological stressors (e.g., infections) suggest the addition of critical environmental components [6]. Thus, in terms of pathogenesis, ASD is oftentimes similar to other common disorders (e.g., diabetes, asthma), in that there are underlying genetic factors resulting in biological vulnerability and environmental factors that may trigger disease onset or exacerbation.
ASD has also increased dramatically in incidence and prevalence, being a rare disorder (<1 in 1000) only 30 years ago, yet most recently found to be present in 1 out of 31 American children [7], which translates to over 10 million currently affected individuals in the USA. Some sources state that the explosion in ASD incidence is a result of better recognition and diagnosis [8]. However, a prevailing view of many of us “in the trenches” (e.g., pediatricians, educators) is that we were not seeing this magnitude of affected children previously under any label, be that intellectual disability, learning disability, or psychiatric disease [9], and that the accelerating disease increase is real. ASD results in great personal costs to the individuals affected and their family members, at least for the more severely affected among the vast spectrum of severity. The economic and societal burden of ASD is substantial, with lifetime care costs exceeding USD 2.4 million per individual (Autism Speaks, 2025) [8]. This provides an estimated, lifetime national cost of USD 25 trillion, very near the amount of the publicly held US national debt. This estimate presumes that the incidence of ASD will not continue to increase, a major assumption that unfortunately may not come to pass.
While the pathogeneses of most common disorders have recently given way to advances in the biological sciences, and this knowledge has generally resulted in improved clinical outcomes, ASD has been somewhat of a hold out. Given the overwhelming burden of the disease, why do we not better understand the pathogenesis of ASD? One of the main reasons is the extreme genotypic heterogeneity of ASD, which includes several hundred genes already so identified [10,11]. As genetic studies in ASD routinely identify multiple additional disease-associated genes, it appears that we have only identified a small minority of those genes to date, and that there are likely thousands of genes associated with ASD. In multiplex families (with two or more ASD-affected first-degree relatives), marked variable expressivity is common among the affected relatives, both in terms of disease severity and the presence and type of comorbid disease manifestations. In many cases, close relatives of people with ASD are themselves affected with another neurodevelopmental disorder or a forme fruste (incomplete or mild) phenotype (e.g., attention deficit hyperactivity disorder (ADHD), learning disabilities). Marked variable expressivity and forme fruste phenotypes are common even when a specific gene variant segregating in a family meets multiple criteria to be considered as a principal cause of disease [12], and this is highly suggestive of polygenic factors (e.g., genetic modifiers, genetic background) and/or environmental factors.
The key role of rare, highly penetrant variants (disease is likely in the presence of the variant), either inherited or non-inherited (de novo), in the development of ASD has been established by many studies (reviewed in [13,14]). Highly penetrant variants predisposing toward ASD often can be identified using DNA sequencing, with inheritance patterns revealed being either Mendelian (e.g., autosomal recessive or dominant, X-linked) or non-Mendelian (e.g., polygenic, maternal, de novo). Recent papers have tended to focus on de novo variants (DNVs, new mutations, absent from both parents) as being of particular importance in ASD. In recent years, these variants are often identified by whole-genome sequencing (WGS, covering over 99% of the entire DNA) in samples collected from trios (affected individual plus biological parents).
The yield of DNVs in ASD has been measured in trio-WGS studies at 20% [15], 21% [16], 31% [17], 41% [18], and 50% [14]. The methodology varied somewhat among these studies, although in general they did not comprehensively query for genes not previously identified in ASD. A study on 50 consecutive, unrelated ASD trios from the practice of the first author (RGB) [14] revealed a DNV diagnostic yield of 20%, if based solely on DNVs listed in the official report from the commercial laboratory (Variantyx, Framingham, MA, USA). However, the diagnostic yield for a de novo Principal Diagnostic Variant (PDV) was 50% (25/50) when trio-WGS was followed by comprehensive reanalysis of the raw DNA sequence data [14]. We defined a PDV using strict criteria (see the Subjects and Methods section) to ensure that each variant so designated is highly likely to be disease-related in that patient, and not an incidental finding. Of interest, the vast majority (15/18, 83%) of all DNVs not listed in the official laboratory report were in genes not previously reported in ASD (13 DNVs) or in those reported only in one to four individuals each (2 DNVs), and thus not expected to be listed in the report by any commercial diagnostic laboratory. This highlights both the likelihood that only a minority of ASD genes have so far been identified and the need to go beyond the commercial laboratory report in ASD diagnostics.
How does the autism community reconcile the conundrum of a disorder that is highly genetic in etiology with its rapidly expanding prevalence? For the most part, it does not, with various aspects of the community denying either the genetic basis or a truly expanding incidence. Assuming that both statements are correct, how can a genetic disorder increase rapidly in the population? One possible explanation involves DNVs, which are genetic but not inherited, with the DNVs themselves accelerated over time by environmental, likely chemical, mutagenesis, nutritional insufficiencies, or the known folate metabolism abnormalities associated with ASD. In the current study, we expand on our previous study to analyze trio-WGS data from Variantyx in 100 consecutive, unrelated subjects with ASD from the practice of the senior author (REF), with a focus on DNVs. Our data confirm earlier studies that DNVs are a major component of the genetic predisposition toward ASD and that they can be identified by trio-WGS followed by raw-data analysis. Is this an answer to the conundrum?
2. Subjects and Methods
2.1. Subjects
Our subjects consist of the 100 most recently evaluated, sequential, unrelated patients with a clinical diagnosis of ASD in which trio-WGS was performed at Variantyx® (Framingham, MA, USA). Each subject was evaluated clinically by the senior author, who is a child neurologist known for conducting clinical care and research in ASD. At a minimum, the evaluation in all subjects included a detailed history and a physical examination, either in person or via video-teleconferencing. The diagnosis of ASD in each case was confirmed by appropriate neuropsychiatric testing (e.g., ADOS-2). Subjects with additional neurodevelopmental diseases (NDDs) or non-NDD diagnoses were not excluded. In the few cases where more than one family member met the study criteria, the subject was assigned to be the proband (person first presenting as a patient). In cases of affected siblings presenting simultaneously, the elder was assigned. Thus, all study subjects have no known genetic relationships with each other. This study was approved by the Advarra institutional review board (IRB, human subjects committee, cirbi@advarra.com) as a retrospective chart review of available clinical records. No additional testing was performed for the purpose of this study. Trio-WGS in our 100 subjects was performed from January 2022 to July 2024, with our analysis of the raw individual sequence data completed from January through July 2024.
2.2. Sequencing and Data Analysis
Available clinical notes from all subjects were reviewed for phenotypic data. WGS analyses from Variantyx® included genome-wide sequence analysis (for single-nucleotide variants and small deletions/insertions), genome-wide structural variant analysis (for copy number variants (CNVs), including large duplications/deletions/inversions, mobile inserions, and aneuploidy), and mitochondrial genome sequence analysis (for heteroplasmy ≥ 5% and large deletion analysis). See our previous study for details of our DNA sequence data analysis, including Figure 1 of that paper for the Variantyx analysis pipeline [14]. Additional information is available at variantyx.com [19]. Raw genomic data from each subject were evaluated on the Variantyx® bioinformatics platform accessible to laboratory personnel in order to tabulate all de novo variants predicted to alter the amino acid code of any protein (“coding” variants). Analyses included the Integrative Genomics Viewer (IGV) of all small de novo variants and SVPlots of all large de novo variants to verify the presence of that variant and exclude artifacts. Inherited sequence variants were tabulated by the same software. In order to compare only protein-coding genes among the various variant types, all non-coding genes were manually removed, including RNA genes (e.g., gene symbols starting with LCA, LINC, LINP, LNC, Metazoa, MIR, MIRNA, PIRNA, PIWIL, RN7, RNA, RNU, RNV, RPL, SNOR, SNRNA, TRNA, U#, or YRNA), antisense genes (gene symbols ending with AS#), and pseudogenes (gene symbols that end with P#), where # is any number. These analyses were conducted genome-wide (on all genes) and were highly laborious; thus, they were completed only on a randomized subset of 50 subjects (25 for non-transcribed variants). Intronic variants were not tabulated as they are extremely numerous and continuously resulted in error messages from the Variantyx software.
2.3. Gene Categorization
In the determination of diagnostic yield, we sought to be conservative in that each variant determined to be disease-causal (PDVs) has a high probability of being so. In our previous study [14], we restricted PDV annotation to genes published with direct association with ASD, designated as A1 (the highest direct association) through A3 (the lowest direct association), in particular using SFARI rankings [11] as per Table 5 in our previous work [14]. Genes without a direct association with ASD were designated as B1 (indirect association) through B3 (highly unlikely to be ASD-associated). With the understanding that our prior B1 category was too broad, in the present study, we designated those genes with a published, one-degree-indirect, association to ASD as B0. This category contains genes with a direct association with other conditions associated with ASD (e.g., AD/HD, intellectual disability, schizophrenia, bipolar) and genes with a direct association with another gene that is itself directly related (A1–3) to ASD. The remainder of the prior B1 category comprises our current B1 category. Overall, B1–3 genes are most likely not associated with ASD, but association cannot be excluded.
2.4. Variant Categorization
In essence, we used standard American College of Medical Genetics and Genomics (ACMGG) criteria with some modifications designed to allow these criteria to apply to novel disorders. Details on the modifications are listed below, discussed in our previous study [14], including a figure delineating the Variantyx variant annotation pipeline, and further elaborated upon in the Discussion section. Variants were assigned as PDVs if they are real (verified using IGV or SVPlots), coding (changing the amino acid code), rare (allelic prevalence < 1/10,000, population prevalence < ~2/10,000), and evolutionarily conserved (at least moderate, conserved through mammals) in a gene published as directly (A1–A3) or single-step-indirectly (B0) associated with ASD. De novo mitochondrial DNA (mtDNA) variants were eligible for PDV status if coding and the subject had ≥40% and ≥2 times the heteroplasmic % as the mother (a presumed de novo event in the grandmother). Large deletions were counted as evolutionarily conserved when any conserved nucleotide was deleted. Characteristics of different types of coding variants (e.g., missense, frameshift, deletion), and the importance of prevalence and conservation to variant annotation, can be found in a recent review (Tables 2–4 of [20]). Moderate conservation was assumed present if both PhyloP and PhastCons were >0.7 and assumed absent if both were <0.4. Otherwise, conservation was manually determined using the University of California Santa Cruz (UCSC) Genome Browser [21] using a threshold of 80% of listed mammalian species. Splice-site variants were included if >0.6 on SpliceRF or SpliceADA. Thus, the focus of this study was on rare, high-penetrance variants. Statistical analyses were performed using a two-tailed Fisher Exact Test [22] and/or MedCalc® odds ratio calculator [23]. Based on our data analysis, silent DNVs were reclassified as PDVs (see the Results and Discussion sections). Note that CNVs widely considered to be Pathogenic/disease-associated were designated as PDVs regardless of other parameters. Those DNVs that met our criteria for being highly likely to be a genetic factor in that individual’s predisposition to develop ASD we refer to by a separate name (“Principal Diagnostic Variant” (PDV)) instead of “Pathogenic” or “Likely Pathogenic” to underscore that modifications were made from the dominant ACMGG criteria.
3. Results
3.1. Subject Characteristics
Among our 100 unrelated subjects, the age at the time of sequence review ranged from 4 to 40 years, with a median of 9 years. Mean maternal and paternal ages at the subject’s birth were 33.1 and 35.4 years, respectively. The race of 23 subjects was not recorded. Among the 77 subjects whose race or ethnicity were recorded, 43/77 (56%) were Caucasian, 30/77 (39%) were of other backgrounds (14 South Asians, 3 East Asians, 4 African Americans, and 9 Latinos), and 4/77 (5%) subjects were of mixed race or ethnicity. Twenty subjects (20%) were female. Intellectual disability (ID) was moderate or more in severity in 85/95 (89%) subjects. Twenty-nine (29%) subjects were nonverbal; 31 (31%) had epilepsy; and 57 (57%) experienced at least one episode of substantial developmental regression. Nine (9%) had tics, a potential sign of autoimmune encephalopathy. Additional clinical information is shown in Table S1.
3.2. De Novo Variants Identified and Their Characteristics
A total of 151 de novo variants (DNVs) were identified genome-wide that alter the amino acid code of any protein among the 100 subjects (mean 1.5 per subject, range 0–6, Table 1). Among these 151 DNVs, only 17 (present in 15 of the 100 subjects) were reported in the Variantyx laboratory report, and all 17 met the criteria for Principal Diagnostic Variants (PDVs) by our algorithm (Table 2, light blue background in column 1). Only 6 of those 17 variants (in six different subjects) were reported as “Positive” (Pathogenic, determined to be highly likely to be disease-related/causal) by the laboratory, and half (3) of those were large CNV deletions. Adding in an additional nine DNVs (in 7 subjects) with indeterminate designations by the laboratory (labeled as “Other variants of interest”, “Uncertain”, or “Supplementary”), the yield of genetic testing for DNVs related to disease in our cohort was 15/100 (15%). Two additional laboratory-report-listed DNVs labeled as “Negative” and “Likely Negative” were not counted, but if they are counted the yield increases to 17%. Note that this is not the “laboratory yield” as this analysis is limited to DNVs, and there were subjects with results indicating positive laboratory results for inherited variants.
Table 1.
All coding de novo variants identified in our 100 subjects with ASD.
Table 2.
Primary Diagnostic Variants (PDVs) that were and were not listed in the official laboratory report, with information regarding protein function.
Following our comprehensive sequence reanalysis (as per the Subjects and Methods section and [14]), we identified an additional 41 DNVs as PDVs. At least one DNV-PDV was identified in 47/100 subjects (47%). After adding the additional 19 silent DNVs we identified (based upon our analyses discussed later in this section), a total of 79 DNVs met our criteria for PDVs (Table 2, yellow background in column 1), with at least one DNV-PDV in 55 subjects. Thus, the overall yield for having at least one DNV labeled as a PDV was 55/100 (55%) subjects. Among the 79 DNV-PDVs, there were 43 missense (one on the X chromosome), 19 silent (one on the mtDNA), 4 frameshift, 3 nonsense (stop codon gain), 2 splice site, and 7 large copy number variants (4 duplications and 3 deletions). One, two, three, and four PDVs were identified in 37, 15, 2, and 1 subject(s), respectively (Table 1).
An additional 48 DNVs (none listed in the laboratory reports) were excluded as PDVs (17 for no/inadequate published link to ASD (B1–3 genes), 8 for inadequate evolutionary conservation, 10 for both gene association and conservation, 6 for being in genes in which autosomal recessive (AR) inheritance is well-established, but not autosomal dominant inheritance (likely indicating carrier status), 2 for AR plus gene association, 2 for AR, gene, and conservation, and 1 for prevalence). Regarding the latter, variants below an allelic prevalence of > 1/10,000 were excluded by the computer software, but one borderline case was manually excluded from our analyses (yet shown in Table 1). In subject 32, we labeled one DNV as a PDV in the KDM5B gene, which is known to have autosomal recessive inheritance, because of the presence of an additional inherited, rare, and highly conserved missense variant, although the phase is unknown.
A total of 73 DNV-PDVs involving only a single gene (72 single-nucleotide variants (SNVs) and one smaller CNV) were identified in 50 subjects. Among these 73 DNVs, 30 (41%) were in genes with ≥10 individual cases reported with clinical phenotypes (see Table 2 legend, #4 for details), and, thus, were labeled as “Known” disorders (Table 2, column 4). Another 19 DNV-PDVs (26%) were in genes with one to nine cases so reported, and, thus, labeled as “Very rare” disorders. Finally, 24 (33%) were in genes without any cases so reported, and, thus, labeled as “Novel” disorders. With the sole exception of two subjects (49 and 50) with DNV-PDVs in the titan (TTN) gene, a Known disorder, every gene is listed only once. Among the 24 Novel disorders, 10 have at least one case listed on the Human Genome Mutation Database (HGMD). Most of these HGMD listings are indexed from studies where over 1000 individuals were sequenced, and no phenotypic (if ASD criteria were truly met) or genotypic (variant parameters differentiating apparent pathogenic from benign) details are available. Thus, other cases may have been identified, although this is unclear in the absence of published phenotypic or genotypic elaboration. Fourteen of the Novel disorders have no case reports and no HGMD listings. For all 24 Novel disorders, the information in the present Table S1 (phenotypic), Table 1 (genotypic), and Table 2 (putative mechanistic) constitute the first true report.
Comparison of clinical data with the presence or absence of a DNV-PDV is fraught with low numbers for many parameters. However, finding such a variant was statistically more likely in the vast majority of the subjects with at least moderate intellectual disability (51/85, 60% versus 2/10, 20%, p = 0.02). There were trends for an increased likelihood of identifying a DNV-PDV in those more clinically affected regarding verbal ability (20/29, 69% versus 35/71, 49%, p = 0.08), epilepsy (18/36, 50% versus 13/36, 36%, p = 0.3), and a history of regression (36/60, 60% versus 19/40, 48%, p = 0.2). There were no significant differences or trends for adult age (>18 versus <18 years, p = 1.0), female sex (p = 0.8), or the presence of tics (p = 0.7).
3.3. Protein Functions and Pathways Related to the Identified DNV-PDVs
A synopsis of the known functions of each protein encoded by a DNV-PDV is shown in Table 2. Among the 73 of these variants involving only a single gene, known functions were tabulated for selected pathways (ion transport (13 PDVs, red in in Table 2), mitochondrial redox potential/energy metabolism/cell death responses (5, orange), immune system manifestations (11, yellow), ubiquitin-related protein degradation pathway (5, green), synapse/neurotransmission-related (12, blue), gene expression (17, purple), neurogenesis/brain development (14, pink), cytoskeleton-related (10, light grey) cell–cell interactions including adhesion (6, dark grey), signaling pathways other than synaptic transmission (14, black), and cell danger responses (5, brown)). As the causal gene is unclear for CNVs encompassing more than one gene, these data are not included in the above numbers, but the pathways related to the best candidate genes are shown in Table 2.
3.4. Tallying Inherited and De Novo Variants in Our Subjects
As ASD is often considered to be polygenic even within affected individuals [14,24], the total number of inherited variants, among all 20,000–23,000 genes, was tallied in a randomized group of our subjects for specific variant types (missense, silent, UTR (untranslated regions 5′ and 3′ added together), and up/downstream (~1 kb adjacent to each gene in each direction, added together)) and compared to de novo missense and silent variants (Table 3). The average number per subject for each variant type is shown in row 5 of Table 3. Each variant was queried as to whether the gene is listed in SFARI or not, and the average number of variants in SFARI genes for each variant type per subject is shown in row 7.
Table 3.
Small variants identified genome-wide in randomly selected subjects.
As shown in column B, among our 100 subjects, 11 of 43 (27%) small (not CNV), nuclear (not mtDNA), missense de novo PDVs are present in SFARI-listed genes. Among the same subjects (column D), 696 of 7838 (9.0%) inherited, small, nuclear, missense variants, genome-wide, are present in SFARI-listed genes (p = 0.0004, odds ratio 3.4, 95% confidence interval (CI) 1.7–6.8; cell B11, yellow background). Thus, de novo PDV missense variants are about 3½ times more likely to be SFARI-listed than are inherited missense variants among our subjects.
While non-transcribed variants in the vicinity of the gene (e.g., up/downstream) can affect protein function, it is widely believed that the vast majority of them do not, and thus these variants were chosen to be our controls. Being conservative and estimating the total number of genes adequately sequenced by Variantyx to be 20,000, the 1114 ASD-related genes listed by SFARI comprise 5.57% of all genes. This number is remarkably similar to the 5.43% figure in Table 3 (G8) regarding the proportion of up/downstream variants, genome-wide, that are in SFARI-listed genes, validating our choice of using these variants as controls. If we errored and some proportion of the up/downstream variants in our subjects indeed are disease/ASD-related, that would skew our findings toward the null hypothesis. Thus, we may have mildly underestimated the importance of de novo (and silent) variants in this study. All variant types we queried, both de novo and inherited, were found to be statistically more likely to be in SFARI-listed genes than are control (upstream/downstream) variants among our subjects (Table 3, row 14, pink background).
Our data demonstrate that inherited silent variants are highly more likely than control (inherited up/downstream) variants to be in SFARI-listed genes (p < 0.0001, Table 3, E14). Unexpectedly, these inherited silent variants are also highly more likely than inherited missense variants to be in SFARI-listed genes (p < 0.0001, D12, orange background). Additionally, despite small numbers, de novo silent variants are increased relative to control variants and have similar odds ratios as those for de novo missense variants (compare cells B14 and C14).
4. Discussion
4.1. Phenotypes in ASD
Overall, the current and previous study [14] cohorts are quite clinically similar in terms of the proportion that is female (20% versus 22%), nonverbal (29% vs. 26%), epileptic (27% vs. 30%), and post developmental regression (57% vs. 54%), respectively. As explained in [14], these parameters are rather typical for people with autism seen by tertiary care specialists. Tics are less common in the current cohort (9% vs. 26%, p = 0.013), although this is difficult to assess from chart review as parents often confuse tics with other conditions, such as repetitive autistic behavior, and may reflect practice methodology differences between the two physicians. However, the current cohort demonstrates an overall significantly higher severity of intellectual disability (ID), in that at least moderate ID is present in 85/95 (89% with 5 subjects not recorded) versus in 25/50 (50%) of the subjects we reported previously [14] (p < 0.0001). Although the numbers are small, individuals who were less affected in terms of intellectual disability, nonverbal status, epilepsy, or past developmental regression appear to have fewer important (PDVs) DNVs identified. Thus, the results of our study should be interpreted as applying to a cohort of predominantly children and young adults with autism on the more severe end of the spectrum that are referred to a tertiary care specialist. Lower genomic yields are possible, and likely expected, in individuals less clinically affected.
The polygenic nature of ASD [24] and the large number of involved genes preclude genotype–phenotype correlations in a study of this size. To address these correlations, phenotypic and genotypic information at least as detailed as tabulated in the present study in very large ASD cohorts will need to be reviewed, likely through a meta-analysis. The information presented in Table S1 and Table 1 can be used in such analyses. In addition, this information might be useful when additional cases are identified with DNVs in genes corresponding to the 24 Novel (no cases reported) and 19 Very rare (one to nine cases reported) disorders (Table 2) briefly characterized herein.
4.2. Genotypes in ASD
Physicians are aware of monogenic disorders, in which one to two variants in a single gene are predominately causal for disease (e.g., cystic fibrosis, sickle cell), and highly polygenic disorders, in which multiple common variants each contribute only a small degree of the genetic susceptibility (e.g., asthma, diabetes). Of course, in real life, there are numerous shades of grey between these models, and that is where ASD apparently oftentimes lies [24]. An inherited genetic variant in an unaffected or minimally-affected parent is unlikely to be a substantial risk factor for severe disease in their child (disease-causal or major risk factor), unless bi-allelic/recessive, but certainly could be a less substantial (intermediate or minor) risk factor, or unrelated. On the other hand, a DNV in that setting could be a risk factor with any degree of disease association, from disease-causal to unrelated.
Another approach would be to use a control group matched to important characteristics of the individuals with ASD in unaffected individuals without any first-degree relatives with a neurodevelopmental disorder. However, given the wide range of comorbidities seen in people with ASD, matching controls to many of the important comorbidities and other characteristics would mostly likely be incomplete if not impossible. Using the individuals as their own controls provides a tight matching to these characteristics.
Our best option was to look at SFARI status for each gene with an identified variant in the subjects. Listing genes with high degrees of certainty to be ASD-related, the SFARI database only lists a relatively small fraction of genes related to ASD based on a detailed literature search (Table 2, compare the fourth column to the fifth column). However, the latter analysis is extremely labor-intensive and thus not possible to use to score the thousands of genes in which inherited variants were identified. However, the SFARI status of the gene could be, and was, automated for every variant found. If missense variants throughout all genes are indeed part of the background genetic predisposition toward ASD (e.g., minor risk factors), the genes in which these variants are found should be weighed toward having more ASD-related genes, relative to controls. This is indeed what we are reporting. The comparison between de novo PDVs versus control (up/downstream) variants (Table 3, cell B14) is the primary test of the main point of this paper, and reflected in its title, that de novo variants are “predominant” in ASD. This is one of only two pre-analysis (conceived prior to data analyses) comparisons in this study. These data are highly statistically significant (p < 0.0001) and remain so following Bonferroni correction.
The number of variants and genes comprising this background/minor inherited risk among our cohort is large. For example, we identified an average of 157 inherited missense variants per subject, 14 (9%) of which are in SFARI-listed genes (Table 3, column D). Given the odds ratio of 1.7 versus controls (Table 3, cell D14), this means that missense variants in the cohort are 70% more likely to be in a SFARI gene relative to controls (50% increased odds if using the lower figure in the 95% confidence interval). The comparison of inherited missense to inherited control variants is the second and final pre-analysis comparison in this study. This comparison is highly statistically significant (p< 0.0001) and remains so following Bonferroni correction. For missense DNVs, we identified an average of 0.43 variants per subject, 0.11 (27%) of which are in SFARI-listed genes, with an odds ratio of 5.8 (Table 3, column B). This is an almost six-fold increased likelihood, suggesting that the majority (6:1) of the DNV-PDVs we present in the current tables are predicted to be ASD-related in those subjects. Again, and as per [24], we assert that DNVs in ASD can be benign, minor risk factors, major risk factors, or disease-causal, but that when strict criteria are imposed (like those we use to define PDVs), most of those identified are disease-related to various degrees. As a comparison, 1.5 DNVs per subject are present in our current ASD cohort, versus 0.2–0.3 per person in unaffected people (discussed in [14]).
4.3. Silent Variants in Autism
Silent, also known as synonymous, variants occur when a mutation in the third nucleotide of a codon does not change the amino acid code. Silent variants can affect the gene expression of proteins through various mechanisms, including changes to mRNA binding, microRNA, RNA splicing, and codon efficacy, among others. However, these effects are generally difficult to measure, and silent variants are often overlooked, particularly in clinical medicine. In our previous study, we used silent variants as controls (sic), and intended to do the same in the current study until we analyzed the data. However, as per the Results section and Table 3, our data reveal that silent variants, both inherited and de novo, are strongly associated with ASD in our cohort. Among inherited variants, silent variants are more likely to be in SFARI genes than are control (up/downstream) variants (p < 0.0001, OR 1.7, 95% C.I. 1.5–1.9). Inherited silent variants are also more likely to be in SFARI genes than are inherited missense variants (p < 0.0001, 1.3, 1.2–1.5). Among de novo variants, silent and missense variants have similar odds ratios, relative to controls, for being in SFARI genes (OR, 95% C.I.: 5.8, 2.9–11 versus 4.6, 1.5–14). The three comparisons in the Abstract section regarding silent variants are all post-analysis; thus, these should be considered to be putative despite remaining highly significant following Bonferroni corrections.
Takata et al., 2016 [25] “found that near-splice site de novo synonymous mutations are almost twice as frequent in ASD than controls” (p = 0.0003, OR 1.96), identifying “101 mutations in 1043 ASD cases and 37 mutations in 731 controls”. The estimated contributions of de novo silent variants were “comparable to that of” de novo loss-of-function variants (1.3%), “and much higher than that of” de novo missense variants (0.1%). Per Jaganathan et al., 2019 [26], “(d)e novo mutations that are predicted to disrupt splicing are enriched 1.51-fold in intellectual disability (p = 0.000416) and 1.30-fold in autism spectrum disorder (p = 0.0203) compared to healthy controls”. Rhine et al., 2022 [27] wrote that “(e)xonic splicing mutants were enriched in probands relative to unaffected siblings—especially synonymous variants (7.5% vs. 3.5%, respectively)”. An increase in silent postzygotic mosaic mutations was published in one study [28]. In addition, silent DNVs were published as being causal for ASD in at least three case reports [29,30,31].
The literature and the current data suggest that silent variants are important in ASD pathogenesis, perhaps with a higher disease association than missense variants, with odds ratios relative to controls varying from 1.3 to 2.0 (1.7 in the present study; the higher end of 2.0 was for near-splice site variants). Thus, silent DNVs perhaps should not be dismissed categorically when evaluating the DNA sequence of someone with ASD. Based on this information, 17 silent DNVs were re-scored as PNVs, which increased the number of our 100 subjects that have at least one DNV-PNV by 8, from 47 to 55.
4.4. ACMGG Criteria, near Misses, and Low Laboratory Yield
The main limitation of this study is the difficulty of determining if a variant alters protein function or is disease-related, which is also the main limitation in clinical genetic testing. In our determination of a DNV as a potential PDV, we aimed for a higher specificity (identifying less false positives) such that variants so designated are highly likely to be disease-related. As in our previous work [14], we used American College of Medical Genetics and Genomics (ACMGG) standards as much as possible and clearly stated any deviations. In particular, ACMGG guidelines are not designed for research on novel disorders, but for clinical analysis regarding known disorders. Indeed, the vast majority of the listed DNV-PDVs we report (Table 1 and Table 2) at least meet ACMGG criteria for Likely Pathogenic (based on PS2/DNV and PM2/not present in control individuals), with the only caveat that we applied PM2 for prevalence <0.00001 (<1 in 10,000 alleles). The ACMGG guidelines were published in 2015 when control sequences were limited, while current databases constitute about one million individuals. We added the requirement for at least moderate evolutionary conservation, which is beyond the ACMGG guidelines. Furthermore, the ACMGG guidelines are designed for known disorders, while we also wished to identify variants in very rare and novel disorders. Since we cannot rely on phenotypic matches for these disorder types, in order to increase specificity, we only scored as PDVs variants in genes published to be related to ASD, either directly or one-degree indirectly, as per the Subjects and Methods section. We believe that our modifications preserve the spirit and intent of the ACMGG guidelines, allow their translation into novel disorders, and require evolutionary conservation to increase specificity.
In the process of strengthening specificity, sensitivity is compromised. There are many “near-misses”, or DNVs that might be disease-related, but were not labeled as such due to failure to meet a single criterion. Two such examples include the frameshift variants in subjects #24 (CD101) and 38 (ZNF300), excluded for no known connection of the genes to ASD even though both are in pathways (immune, gene expression) known to be important in ASD pathogenesis (B1 genes). In particular, new pathways related to ASD will be missed by our, or any (e.g., ACMGG), methodology that requires a published connection to ASD for each gene in variant classification. A variant in another zinc finger gene (ZNF516) in subject #31 was excluded on grounds of conservation only. A variant in TWF2 in subject #19 was excluded for a combined prevalence figure just barely >1/10,000. A review of Table 1 reveals many other similar near-misses.
Based alone on a laboratory report of indeterminate or better, the yield for at least one DNV is only 15/100 (15%) in our cohort. However, this figure jumps to 55/100 (55%) for having at least one DNV-PDV following our methodology, providing additional DNVs highly likely to be disease-related in 40/100 (40%) additional subjects. There are many reasons for this discrepancy, but mostly the present methodology did not exclude the following: (1) very rare and, especially, novel disorders, which are beyond the preview of clinical laboratories, (2) very rare variants that are nonetheless still listed over zero prevalence in the over-one-million-person gnomAD control database, and (3) silent DNVs. Our methodology requires expertise in genomics and the pathophysiology of ASD; thus, it is not suitable for widespread adoption, although it is the clinical practice for all ASD patients seen by the first author.
4.5. Mechanistic Pathways and Clinical Utility
There are a variety of mechanistic pathways known to be involved in ASD, many of which are shown in Table 2 corresponding to the known functions of genes in which we identified a DNV-PDV. Most of these pathways are either neuron/brain-specific (synapse-related, neurogenesis) or ubiquitous to all cell types but of particular importance to neurons/the brain (ion transport, energy metabolism, immune system, ubiquitin-related protein turnover, cytoskeleton, cell–cell interactions, and cell danger response). The remaining pathways, gene expression and cell signaling, are highly complex as tissue specificity varies from case to case. Together, these pathways are highly fundamental to biology, in general. Considering the issue of multiple factors leading toward ASD from another direction, despite over 5% of all genes being listed on SFARI, that database likely only includes a small fraction of genes in which variants can predispose toward autism. Thus, a very sizable proportion of all genes is likely involved in ASD pathogenesis. How do so many genes in a variety of pathways fundamental to life predispose toward one entity—ASD? A parsimonious hypothesis is that social communication and executive functioning (e.g., ADHD, which is extremely common in ASD [32]) are highly vulnerable pathways that are oftentimes the main sequalae of generalized cellular insults (e.g., hypoxia, ethanol) and, thus, are oftentimes the main sequalae of a DNV in a very large number of genes in which the variant severely compromises general cellular homeostasis. Phenotypic targeting (e.g., the development of ASD versus intellectual disability, epilepsy, schizophrenia, etc.) may be in large part due to inherited genetic modifying variants and/or environmental factors. Future studies are needed to continue exploring the pathways that contribute to ASD to find additional actionable clinical targets.
Of particular clinical importance is that four of the pathways shown (ion transport, energy metabolism, synapse-related, and immune system) are at least partially treatable. In the clinical practice of the first and last authors, identifying variants, both de novo and inherited, in these pathways among our ASD patients frequently leads to treatment options with anecdotal clinical improvements.
4.6. Limitations
As explained above, the main limitation of this study is the difficulty in variant classification in terms of disease relationships, including the difficulty of determining if a gene is ASD-related. Our strict criteria likely led to an under-ascertainment of DNVs associated with ASD. Our cohort is small in terms of a sequencing study in ASD, but large for a study that correlates phenotype, genotype, and mechanisms. Hopefully, future studies will include more subjects as well as this information. Modifications to the ACMGG standards were made to better adapt to the analysis of new genes, and, in doing so, this study varies from that in many other reports. Computational-modeling evidence is provided for missense variants, but not for silent variants, and functional validation of variant pathogenicity is absent for both variant classes. Thus, conclusions are premature, especially regarding silent variants. Finally, our cohort represents individuals on the more severe end of the broad ASD spectrum and is likely not applicable to those with lesser degrees of clinical severity.
4.7. Potential Implications for the Increasing Prevalence of ASD
At the time of this publication, the autism community is bitterly divided among those that believe that ASD is genetic and not increasing in frequency (e.g., better recognition, altered diagnostic practices) and those that believe that the frequency is increasing dramatically, which can only be due to environmental toxicity. DNVs, which are genetic yet not inherited, occur in both monozygotic twins but only one dizygotic twin, and, thus, would be determined to be genetic in twin studies. DNVs generally occur in spermatogenesis (small variants) or oogenesis (CNVs), while a relatively small proportion of DNVs are postzygotic in early embryology. Most of the DNVs found in autism are small variants, usually single-nucleotide, thus likely occurring in spermatogenesis, possibly years or even decades prior to conception. DNVs are not new; they have always occurred. Indeed, they are the drivers of evolution as well as new genetic disorders. However, and herein the authors speculate, the rapidly increasing incidence of ASD might be due to an increasing rate of DNVs caused by mutagenesis secondary to multiple and dramatic environmental changes occurring in recent decades. In particular, heavy metals, chemicals such as dibenzodioxins and alkylating agents, and multiple metabolites from bacteria and fungi are known to be mutagenic.
In addition, insufficient folate during gestation or gametogenesis can result in DNVs (mutations) [33,34,35]. Insufficient folate during early gestation can cause postzygotic DNVs, while prezygotic maternally derived DNVs occur in the maternal grandmother during gestation of the mother. Additionally, prezygotic paternally derived DNVs can continue to occur during spermatogenesis, which commences in adolescence and continues throughout life. One of the unique characteristics of ASD is the relationship between paternal age and increasing ASD risk. Advanced paternal age provides more time for DNVs to occur. Toxicant exposure and poor folate intake throughout life could certainly result in a cumulative mutation load resulting in poorer sperm quality in age. Interestingly, folate is protective for environmental toxicants, so suboptimal folate intake itself may not cause DNVs but could increase the risk of toxicants causing DNVs. On the other hand, excessive folic acid may increase the DNV rate [36].
The authors assert that future studies are extremely important to answer the questions posed by our work. Are DNVs in humans increasing over time? Are they more numerous in people with ASD, or are those people simply unlucky as to where the DNVs occurred? What environmental factors are driving any increase in DNVs? Finally, perhaps a question that all aspects of the ASD community can agree on: What environmental epigenetic factors are contributing to ASD pathophysiology, whether the targeted genetic variants are de novo or inherited, and regardless of whether DNVs are (or ASD is) truly increasing in prevalence over time?
5. Conclusions
DNVs, including missense and silent, are likely related to disease pathogenesis in about one half of individuals with moderate-to-severe forms of ASD, likely as significant factors in disease pathogenesis. This statement is caveated by the lack of an unaffected control group. Numerous inherited variants, including missense and (provisionally) silent, are ASD-associated, likely each as minor factors in disease pathogenesis. DNVs can explain how a predominately genetic disorder could rapidly increase in true incidence and themselves can oftentimes suggest therapeutic options. However, knowledge in this area is still preliminary, and future studies are desperately needed.
Supplementary Materials
The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes16091099/s1, Table S1: Clinical manifestations in our subjects.
Author Contributions
Conceptualization, R.G.B., O.B., P.T.B. and R.E.F.; methodology, R.G.B., O.B., P.T.B., Z.R.H. and R.E.F.; software, R.G.B., O.B., P.T.B. and Z.R.H.; validation, R.G.B.; formal analysis, R.G.B.; investigation, R.G.B. and O.B.; resources, R.G.B. and R.E.F.; data curation, R.G.B., O.B., P.T.B. and Z.R.H.; writing—original draft preparation R.G.B.; writing—review and editing R.G.B., O.B., P.T.B., Z.R.H. and R.E.F.; visualization, R.G.B. and R.E.F.; supervision, R.G.B. and R.E.F.; project administration, R.G.B. and R.E.F.; funding acquisition, R.G.B. and R.E.F. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by the XEL Foundations (Pittsburgh, PA) to R.E.F.
Institutional Review Board Statement
This study was conducted in accordance with the Declaration of Helsinki and approved as “Exempt” by the Institutional Review Board of Advarra®. An exemption for this study was approved based on adherence to a retrospective chart-review format in accordance with national legislation and institutional requirements.
Informed Consent Statement
Informed consent was waived by the Institutional Review Board based on the retrospective study format and applicable law.
Data Availability Statement
The original contributions presented in this study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.
Conflicts of Interest
R.G.B. is an officer and receives equity from NeuroNeeds®, a company that produces dietary supplements for neurological conditions. Otherwise, all authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
References
- American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders; American Psychiatric Association Publishing: Washington, DC, USA, 2022. [Google Scholar] [CrossRef]
- Kim, J.Y.; Son, M.J.; Son, C.Y.; Radua, J.; Eisenhut, M.; Gressier, F.; Koyanagi, A.; Carvalho, A.F.; Stubbs, B.; Solmi, M.; et al. Environmental risk factors and biomarkers for autism spectrum disorder: An umbrella review of the evidence. Lancet Psychiatry 2019, 6, 590–600. [Google Scholar] [CrossRef]
- Brown, W.T.; Friedman, E.; Jenkins, E.; Brooks, J.; Wisniewski, K.; Raguthu, S.; French, J. ASSOCIATION OF FRAGILE X SYNDROME WITH AUTISM. Lancet 1982, 319, 100. [Google Scholar] [CrossRef]
- Steffenburg, S.; Gillberg, C.; Hellgren, L.; Andersson, L.; Gillberg, I.C.; Jakobsson, G.; Bohman, M. A Twin Study of Autism in Denmark, Finland, Iceland, Norway and Sweden. J. Child Psychol. Psychiatry 1989, 30, 405–416. [Google Scholar] [CrossRef]
- Bailey, A.; Le Couteur, A.; Gottesman, I.; Bolton, P.; Simonoff, E.; Yuzda, E.; Rutter, M. Autism as a strongly genetic disorder: Evidence from a British twin study. Psychol. Med. 1995, 25, 63–77. [Google Scholar] [CrossRef] [PubMed]
- Dietert, R.R.; Dietert, J.M.; Dewitt, J.C. Environmental risk factors for autism. Emerg. Health Threat. J. 2011, 4, 7111. [Google Scholar] [CrossRef] [PubMed]
- Shaw, K.A.; Williams, S.; E Patrick, M.; Valencia-Prado, M.; Durkin, M.S.; Howerton, E.M.; Ladd-Acosta, C.M.; Pas, E.T.; Bakian, A.V.; Bartholomew, P.; et al. Prevalence and Early Identification of Autism Spectrum Disorder Among Children Aged 4 and 8 Years—Autism and Developmental Disabilities Monitoring Network, 16 Sites, United States, 2022. MMWR. Surveill. Summ. 2025, 74, 1–22. [Google Scholar] [CrossRef] [PubMed]
- “Autism Diagnosis on the Rise, According to Trends Study,” Autism Speaks. Available online: https://www.autismspeaks.org/science-news/why-autism-increasing (accessed on 28 May 2025).
- Leeb, R.T.; Danielson, M.L.; Claussen, A.H.; Robinson, L.R.; Lebrun-Harris, L.A.; Ghandour, R.; Bitsko, R.H.; Katz, S.M.; Kaminski, J.W.; Brown, J. Trends in Mental, Behavioral, and Developmental Disorders Among Children and Adolescents in the US, 2016–2021. Prev. Chronic Dis. 2024, 21, 240142. [Google Scholar] [CrossRef]
- AutDB. Available online: http://www.mindspec.org/autdb.html (accessed on 29 May 2025).
- SFARI Gene. Available online: https://gene.sfari.org (accessed on 29 May 2025).
- De Rubeis, S.; Buxbaum, J.D. Genetics and genomics of autism spectrum disorder: Embracing complexity. Hum. Mol. Genet. 2015, 24, R24–R31. [Google Scholar] [CrossRef]
- Kreiman, B.L.; Boles, R.G. State of the Art of Genetic Testing for Patients with Autism: A Practical Guide for Clinicians. Semin. Pediatr. Neurol. 2020, 34, 100804. [Google Scholar] [CrossRef]
- Bar, O.; Vahey, E.; Mintz, M.; Frye, R.E.; Boles, R.G. Reanalysis of Trio Whole-Genome Sequencing Data Doubles the Yield in Autism Spectrum Disorder: De Novo Variants Present in Half. Int. J. Mol. Sci. 2024, 25, 1192. [Google Scholar] [CrossRef]
- Sheth, F.; Shah, J.; Jain, D.; Shah, S.; Patel, H.; Patel, K.; I Solanki, D.; Iyer, A.S.; Menghani, B.; Mhatre, P.; et al. Comparative yield of molecular diagnostic algorithms for autism spectrum disorder diagnosis in India: Evidence supporting whole exome sequencing as first tier test. BMC Neurol. 2023, 23, 292. [Google Scholar] [CrossRef]
- Furukawa, S.; Kushima, I.; Kato, H.; Kimura, H.; Nawa, Y.; Aleksic, B.; Banno, M.; Yamamoto, M.; Uematsu, M.; Nagasaki, Y.; et al. Whole-genome sequencing analysis of Japanese autism spectrum disorder trios. Psychiatry Clin. Neurosci. 2024, 79, 87–97. [Google Scholar] [CrossRef]
- Jiang, Y.; Yuen, R.K.; Jin, X.; Wang, M.; Chen, N.; Wu, X.; Ju, J.; Mei, J.; Shi, Y.; He, M.; et al. Detection of clinically relevant genetic variants in autism spectrum disorder by whole-genome sequencing. Am. J. Hum. Genet. 2013, 93, 249–263. [Google Scholar] [CrossRef]
- Abdi, M.; Aliyev, E.; Trost, B.; Kohailan, M.; Aamer, W.; Syed, N.; Shaath, R.; Gandhi, G.D.; Engchuan, W.; Howe, J.; et al. Genomic architecture of autism spectrum disorder in Qatar: The BARAKA-Qatar Study. Genome Med. 2023, 15, 81. [Google Scholar] [CrossRef]
- Neerman, N.; Faust, G.; Meeks, N.; Modai, S.; Kalfon, L.; Falik-Zaccai, T.; Kaplun, A. A clinically validated whole genome pipeline for structural variant detection and analysis. BMC Genom. 2019, 20, 545. [Google Scholar] [CrossRef]
- Bar, O.; Ebenau, L.; Weiner, K.; Mintz, M.; Boles, R.G. Whole exome/genome sequencing in cyclic vomiting syndrome reveals multiple candidate genes, suggesting a model of elevated intracellular cations and mitochondrial dysfunction. Front. Neurol. 2023, 14, 1151835. [Google Scholar] [CrossRef]
- University of California Santa Cruz Genomic Institute UCSC Genome Browser. Available online: https://genome.ucsc.edu/ (accessed on 29 May 2025).
- GraphPad by Dotmatics. Available online: https://www.graphpad.com/quickcalcs/contingency1.cfm (accessed on 29 May 2025).
- MedCalc®. Available online: https://www.medcalc.org/calc/odds_ratio.php (accessed on 29 May 2025).
- Leblond, C.S.; Rolland, T.; Barthome, E.; Mougin, Z.; Fleury, M.; Ecker, C.; Bonnot-Briey, S.; Cliquet, F.; Tabet, A.-C.; Maruani, A.; et al. A Genetic Bridge Between Medicine and Neurodiversity for Autism. Annu. Rev. Genet. 2024, 58, 487–512. [Google Scholar] [CrossRef] [PubMed]
- Takata, A.; Ionita-Laza, I.; Gogos, J.A.; Xu, B.; Karayiorgou, M. De Novo Synonymous Mutations in Regulatory Elements Contribute to the Genetic Etiology of Autism and Schizophrenia. Neuron 2016, 89, 940–947. [Google Scholar] [CrossRef] [PubMed]
- Jaganathan, K.; Panagiotopoulou, S.K.; McRae, J.F.; Darbandi, S.F.; Knowles, D.; Li, Y.I.; Kosmicki, J.A.; Arbelaez, J.; Cui, W.; Schwartz, G.B.; et al. Predicting Splicing from Primary Sequence with Deep Learning. Cell 2019, 176, 535–548.e24. [Google Scholar] [CrossRef]
- Rhine, C.L.; Neil, C.; Wang, J.; Maguire, S.; Buerer, L.; Salomon, M.; Meremikwu, I.C.; Kim, J.; Strande, N.T.; Fairbrother, W.G.; et al. Massively parallel reporter assays discover de novo exonic splicing mutants in paralogs of Autism genes. PLoS Genet. 2022, 18, e1009884. [Google Scholar] [CrossRef] [PubMed]
- Krupp, D.R.; Barnard, R.A.; Duffourd, Y.; Evans, S.A.; Mulqueen, R.M.; Bernier, R.; Rivière, J.-B.; Fombonne, E.; O’rOak, B.J. Exonic Mosaic Mutations Contribute Risk for Autism Spectrum Disorder. Am. J. Hum. Genet. 2017, 101, 369–390. [Google Scholar] [CrossRef]
- Yi, S.; Chen, F.; Qin, Z.; Yi, S.; Huang, L.; Huang, L.; Feng, Y.; Wei, H.; Yang, Q.; Zhang, Q.; et al. Novel Synonymous and Frameshift Variants in the TRIP12 Gene Identified in 2 Chinese Patients with Intellectual Disability. Neurol. Genet. 2022, 8, e200025. [Google Scholar] [CrossRef]
- Benvenuto, M.; Cesarini, S.; Severi, G.; Ambrosini, E.; Russo, A.; Seri, M.; Palumbo, P.; Palumbo, O.; Castori, M.; Panza, E.; et al. Phenotypic Description of A Patient with ODLURO Syndrome and Functional Characterization of the Pathogenetic Role of A Synonymous Variant c.186G>A in KMT2E Gene. Genes 2024, 15, 430. [Google Scholar] [CrossRef]
- Zhou, F.; Zhong, H.; Wu, B.; Cui, Y.; Li, J.; Jia, X.; Yu, C.; Li, D.; Shu, J.; Cai, C. Identification of the synonymous variant c.3141G > A in TNRC6B gene that altered RNA splicing by minigene assay. Mol. Biol. Rep. 2024, 51, 899. [Google Scholar] [CrossRef] [PubMed]
- Hours, C.; Recasens, C.; Baleyte, J.-M. ASD and ADHD Comorbidity: What Are We Talking About? Front. Psychiatry 2022, 13, 837424. [Google Scholar] [CrossRef] [PubMed]
- LeBlanc, D.P.; Behan, N.A.; O’Brien, J.M.; Marchetti, F.; MacFarlane, A.J.; Gollapudi, B. Folate deficiency increases chromosomal damage and mutations in hematopoietic cells in the transgenic mutamouse model. Environ. Mol. Mutagen. 2018, 59, 366–374. [Google Scholar] [CrossRef] [PubMed]
- Rai, S.; Leydier, L.; Sharma, S.; Katwala, J.; Sahu, A. A quest for genetic causes underlying signaling pathways associated with neural tube defects. Front. Pediatr. 2023, 11, 1126209. [Google Scholar] [CrossRef]
- Copp, A.J.; DE Greene, N. Genetics and development of neural tube defects. J. Pathol. 2009, 220, 217–230. [Google Scholar] [CrossRef]
- Cao, X.; Xu, J.; Lin, Y.L.; Cabrera, R.M.; Chen, Q.; Zhang, C.; Steele, J.W.; Han, X.; Gross, S.S.; Wlodarczyk, B.J.; et al. Excess folic acid intake increases DNA de novo point mutations. Cell Discov. 2023, 9, 22. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).