Next Article in Journal
Epigenetic m6A RNA Modification Reader YTHDF1 in Merkel Cell Carcinoma
Previous Article in Journal
Understanding Breast-Cancer-Associated Fibroblasts and Their Epigenetic Activation to Unveil Novel Targets for Breast Cancer Therapy
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Variant Characterization of a Representative Large Pedigree Suggests “Variant Risk Clusters” Convey Varying Predisposition of Risk to Lynch Syndrome

by
Mouadh Barbirou
1,2,3,*,
Amanda A. Miller
1,2,
Amel Mezlini
4,
Balkiss Bouhaouala-Zahar
3,5 and
Peter J. Tonellato
2
1
Circulating Tumor Cell Core Laboratory, Population Science Division, Medical Oncology Department, Medical College, Sidney Kimmel Cancer Center, Thomas Jefferson University, Philadelphia, PA 19107, USA
2
Center for Biomedical Informatics, Department of Health Management and Informatics, School of Medicine, University of Missouri, Columbia, MI 65211, USA
3
Medical School, University of Tunis El Manar, Tunis 1068, Tunisia
4
Medical Oncology Division, Salah Azeiz Oncology Institute, University of Tunis El Manar, Tunis 1068, Tunisia
5
Laboratory of Venoms and Therapeutic Biomolecules, LR16IPT08 Institute Pasteur of Tunis, University of Tunis El Manar, Tunis 1068, Tunisia
*
Author to whom correspondence should be addressed.
Cancers 2023, 15(16), 4074; https://doi.org/10.3390/cancers15164074
Submission received: 15 June 2023 / Revised: 9 August 2023 / Accepted: 10 August 2023 / Published: 12 August 2023
(This article belongs to the Section Cancer Causes, Screening and Diagnosis)

Abstract

:

Simple Summary

Approximately 20% of colorectal cancer (CRC) cases are diagnosed in individuals under 40, with a severe prognosis due to germline variant accumulation. Many of these variants have been classified as hereditary cancer causative, while others remain poorly researched. The identification of germline variants across different populations is critical for accurate prognosis, treatment, and follow-up. We aimed to identify and predict the functional implications of germline variants using whole-genome sequencing of a Tunisian pedigree with Lynch syndrome CRC risk. Two SNPs/indels were identified in genes with CRC association (MLH1 and PRH1-TAS2R14) and four in genes with non-CRC cancer association (PPP1R13B, LAMA5, FTO, and NLRP14). Three structural variants overlapped genes associated with non-CRC digestive cancer (RELN, IRS2, and FOXP1) and one overlapped RRAS2 with non-CRC cancer associations. This study provides further evidence of genetic predisposition according to the risk clustering of variants based on their functional implications and risk mechanisms within the same pedigree.

Abstract

Recently, worldwide incidences of young adult aggressive colorectal cancer (CRC) have rapidly increased. Of these incidences diagnosed as familial Lynch syndrome (LS) CRC, outcomes are extremely poor. In this study, we seek novel familial germline variants from a large pedigree Tunisian family with 12 LS-affected individuals to identify putative germline variants associated with varying risk of LS. Whole-genome sequencing analysis was performed to identify known and novel germline variants shared between affected and non-affected pedigree members. SNPs, indels, and structural variants (SVs) were computationally identified, and their oncological influence was predicted using the Genetic Association of Complex Diseases and Disorders, OncoKB, and My Cancer Genome databases. Of 94 germline familial variants identified with predicted functional impact, 37 SNPs/indels were detected in 28 genes, 2 of which (MLH1 and PRH1-TAS2R14) have known association with CRC and 4 others (PPP1R13B, LAMA5, FTO, and NLRP14) have known association with non-CRC cancers. In addition, 48 of 57 identified SVs overlap with 43 genes. Three of these genes (RELN, IRS2, and FOXP1) have a known association with non-CRC digestive cancers and one (RRAS2) has a known association with non-CRC cancer. Our study identified 83 novel, predicted functionally impactful germline variants grouped in three “variant risk clusters” shared in three familiarly associated LS groups (high, intermediate and low risk). This variant characterization study demonstrates that large pedigree investigations provide important evidence supporting the hypothesis that different “variant risk clusters” can convey different mechanisms of risk and oncogenesis of LS-CRC even within the same pedigree.

1. Introduction

Colorectal cancer (CRC) is the most frequent neoplasm worldwide, accounting for 8% of cancer-related deaths [1,2]. Pathogenetic variants in known high-penetrance cancer-risk-associated genes have been implicated in up to 8% of all CRC cases, where one in five (20%) cases of this type were diagnosed in under 40 year olds [3,4,5] compared to the average worldwide age at diagnosis of 65 [6]. Familial association (defined to be families with at least two affected members) appears in an estimated 35% of all CRC cases [7]. The most common familial CRC is Lynch syndrome (LS) [8,9]. The familial forms of CRC genetic predisposition have been correlated with germline mutations or epimutations in mismatch repair (MMR) genes such as MLH1, MSH2, MSH6, and PMS2 for nonpolyposis cases and in APC and MUTYH for Adenomatous colonic polyposis with recessive inheritance [4,5]. Despite the current knowledge of genetic predisposition in these hereditary forms of CRC (such as LS, Gardner syndrome, Juvenile polyposis coli, and others), much of the familial relationships and mechanisms of risk remain unexplained [10].
Genome-wide association via high throughput sequencing (HTS) technologies and the wide collection of variant functional predictive analytics, clouded applications, and databases are critical to the success of identifying new germline likely causative variants implicated in cancer predisposition. These powerful emerging tools help detect deleterious genomic changes in part responsible for the hereditary CRC development, diagnosis, predicted optimal treatment, and therefore long-term prognosis. Approximately 40% of patients with an inherited tumor syndrome exhibit a variant of uncertain significance, as revealed through sequencing analyses that examine germline variants involved in the production of truncated proteins and associated with alterations caused by hereditary pathologies at the germinal level [11]. These variants typically involve a single amino acid substitution, which cannot a priori be definitively classified as pathogenic or benign [12]. Conversely, synonymous nucleotide substitutions, which generally do not cause alterations in protein structure, have been found to be pathogenic in some instances, depending on their genomic location [13]. Additionally, variants appearing together in the same gene or different genes may coexist and co-segregated with the disease phenotype within a single family, potentially explaining the correlated predisposition risk of the family. In some cases, these variants contribute more significantly to cancer risk than classic pathogenic Mendelian variants, and when implicated in tumor predisposition, can cooperatively contribute to an increased risk of cancer development as low-risk alleles [14,15]. However, HTS studies have not covered all such cancers and therefore, it is highly likely that functionally pertinent variants and mutations and genes conveying predisposition to CRC and LS are yet to be discovered. This gap in the current knowledge of familial forms of CRC such as LS requires further clinical evaluation of hereditary CRCs supported by germline studies of familial cases [16].
In this study, we aim to identify, annotate, and computationally predict the functional implication of previously known and novel germline SNPs, indels, and structural variants using a whole-genome sequencing approach of a Tunisian large pedigree with three familiarly grouped members affected or at-risk to LS-CRC.

2. Materials and Methods

2.1. Sample/Data Collection

An LS-affected, large-pedigree Tunisian family with 37 total and 12 known affected members was recruited for this study (Figure 1). Peripheral blood from 11 members (oval circled) was collected. Individual subjects’ clinical, environmental, and behavioral data were collected from medical records and personal interviews based on an interrogatory form conducted by the study personnel.
Lynch syndrome status within this family was tested and confirmed during routine clinical work. The three patients collected from in the HRLS group were classified as meeting both the Amsterdam criteria II that have been established by the International Collaborative Group on HNPCC for assistance in identifying Lynch syndrome [17,18] and the original Bethesda guidelines [19]. Subsequent blood germline testing using PCR amplification and direct sequencing of the entire coding region and the exon–intron boundaries for MMR genes (MLH1, MSH2, and MSH6) [20,21] revealed a single deleterious germline alteration affecting the MLH1 gene (mutation c.-168_c.116 + 713 del). This corresponds to a 997 bp deletion that encompasses the entirety of exon 1, a portion of intron 1, and a section of the MLH1 promoter, and was observed in all subjects within the HRLS group who underwent germline testing.
Data summarized in Table 1 were recorded in a study database and maintained in a secure, private manner consistent with the Declaration of Helsinki and the permission of Salah Azaiz Institute Ethics Committee registration number: ISA/2016/02. All subjects were informed about the purposes of the study and consented in writing to participate in the study. The 11 subjects were stratified into three groups based on familial cancer status. The high risk to LS group (HRLS, Figure 1, red ovals) are affected subjects of the pedigree; intermediate risk to LS (IRLS, Figure 1, green ovals) includes CRC-free subjects that have at least one affected parent, and low risk to LS (LRLS, Figure 1, blue ovals) are those with no relatives affected in the subject’s immediate triplet (subject and both parents).

2.2. DNA Extraction and Quality Assessment

Genomic DNA was extracted from the 11 blood samples according to the manufacturer’s recommendation using a Flexigene DNA Whole Blood Kit (Qiagen, Hilden, Germany). DNA quality and quantity were assessed using a Qubit fluorometer (Invitrogen, Carlsbad, CA, USA) and electrophoresis migration in agarose gel 1%. The genomic DNA with good quality was subjected to library preparation prior to sequencing.

2.3. Whole Genome Sequencing (WGS)

Libraries were prepared using Nextera XT kit (Illumina, San Diego, CA) and pair-end sequencing (2 × 300 base pairs) with the Miseq Reagent V3 kit (Illumina) following the manufacturer’s instructions. The Nextera enzyme mix was used to simultaneously fragment input DNA and tag with universal adapters in a single tube reaction. Library purification was performed by Agincourt AMPure XP beads (Beckman Coulter, IN, USA) and Bioanalyzer (Agilent, Wilmington, DE, USA) was used for quantification and quality checking [22]. Libraries were sequenced using the Illumina NextSeq500 platform (Illumina Inc., San Diego, CA, USA). A total of 1564.4 GB with 142.22 GB on average per sample of raw data was generated on the sequencer, resulting in a mean sequence coverage depth of 45.88-fold (range of 37.71- to 59.17-fold).

2.4. Bioinformatic Variant Analysis (BVA)

The full bioinformatics variant analysis (BVA) is described in Figure 2 and the Supplemental Material. We used a novel functional implicated variant pipeline created in our previous work on breast cancer [23], modified to account for the pedigree relationship between subjects and familial and thus the risk-related nature of the detected variants. Briefly, for each subject’s high-throughput sequencing (HTS) sequence, alignment, poor-quality read filtering, single nucleotide polymorphisms (SNPs), insertions/deletions (indels), and structural variants (SVs) were called and variant quality score recalibration and filtering, annotation and removal of common and likely non-functional variants, and assessment of cancer-associated genes were performed.

3. Results

3.1. Clinical Characterization of the Pedigree Members

The large Tunisian pedigree (Figure 1) includes 39 members across four generations; 12 of them were diagnosed with LS CRC. In this study, 11 members’ (3 HRLS, 4 IRLS, and 6 LRLS) germline genomes were fully sequenced and their medical, environmental, and behavioral data were carefully analyzed. Individual and familial characterizations are described in Table 1. The HRLSs presented an average age of 48 ± 12.16 and BMI of 26.08 ± 3.11, IRLSs had an average age and BMI of 38.25 ± 17.63 and 26.77 ± 3.47, respectively, and the LRLSs had an average age and BMI of 35.50 ± 24.11 and 27.52 ± 2.64, respectively. Male sex distribution was 33.34% (1/3) HRLS, 0% (0/4) IRLS, and 50% (3/6) LRLS. Concerning lifestyle, 66.66% of HRLS recorded high brine and meat consumption, 100% of IRLS had a high fat consumption, and 75% of LRLS presented high meat and fat consumption. A total of 66.66% of HRLS were tobacco and alcohol users, 25% of IRLS were tobacco users, and 50% of LRLS were tobacco and alcohol users. A total of 66% of HRLS had hyperglycemia and hypertension, 75% of IRLS had hypertension and 50% had hyperglycemia, and 25% of LRLS had hypertension. An analysis of the clinical characteristics of each group found no statistically significant associations.

3.2. Variant Characterization of the Pedigree

3.2.1. SNPs and Indels

Before filtering, 7,836,438 SNPs and 2,039,903 indels were detected across the 11 genomes. A total of 23.63% (2,334,312 of 9,876,341) of all variants satisfied the low-frequency threshold (<0.01 AF 1000 G ALL and non-TCGA ExAC ALL) and 3.46% (80,905 of 2,334,312) of the variants demonstrated a probable deleterious function (CADD scaled score > 10). Subsequent annotation stratified the likely functionally implicated variants into coding (2824) and non-coding (78,081) variants. Lastly, filtering for variants predicted deleterious by having at least three of MutationTaster, PolyPhen V2, Provean, and SIFT resulted in 1961 coding and 9826 non-coding predicted deleterious variants. From the 1961 low-frequency, predicted functionally deleterious coding variants, 79 were found in all 11 sequenced individuals: 37 in all HRLS members, 27 in the IRLS group, and 15 in the LRLS. Of the 9826 non-coding filtered variants, all 9826 were found in members of at least one group; 4171 of the 9826 were found in each HRLS member, 2820 were found in every IRLS member, and 2835 in every LRLS member. In addition, 19 non-coding variants were exclusively detected only in a particular group of the pedigree: 2 variants exclusive to members in HRLS, 3 exclusives to IRLS, and 14 exclusives to LRLS, and 4 of these had a RegulomeDB Score < 4. Most of the detected non-coding variants were Intergenic (37,412) and found in all samples, whereas 30344 Intronic variants were found in all samples, and only 7 were found in samples in the same groups (Table 2).

3.2.2. SNP and Indels with Evidence of Familial and Risk-Implicated Genes

Of the 83 variants identified in the cohort, 39 genes were identified containing at least one variant. PABPC3 had four variants (rs79397892, rs78826513, rs78552667, and rs80261016) found in all samples and one variant (rs201411821) found in all high and intermediate risk samples (HRLS and IRLS). Three of these five variants (rs78826513, rs201411821, and rs80261016) were classified as oncogenic driver variants according to SNPnexus (“Driver” as defined in the oncogenic classification by Cancer Genome Interpreter). KRT18 had four variants found in all samples, though none satisfied the oncogenic (or “Driver” predicted as tumor driver according to Cancer Genome Interpreter) threshold. One additional variant (rs201602708) located in MACF1 did not satisfy the oncogenic threshold. Five of the detected variants were in different genes that were previously described as common mutations in a collection of cancers. rs63750539 in the MLH1 gene has been described in several types of cancer including CRC and LS, rs373141354 in the PPP1R13B gene is associated with melanoma according to the Genetic Association of Complex Diseases database, rs551763507 in the LAMA5 gene is correlated to neuroblastoma, rs76670455 in NLRP14 gene to leukemia, and rs763119571 in TAS2R19 is also described in CRC according to the Cancer Genome Interpreter database. Concerning non-coding variants, four variants belonging to different genes (rs544153916 in PODN, rs116197074 in SCP2, and rs116526711 in MAML3) were noted only in samples from the LRLS group, and one of these variants rs115378978 in the FTO gene was previously associated with prostate cancer according to Disorders and SNPnexus databases (Table 3).

3.2.3. Structural Variants (SVs)

A total of 11,171 SVs were found via smoove. Of the total SVs detected, their classifications were 4120 deletions, 194 duplications, 6560 breakends, and 297 inversions. Filtering for low-frequency novel variants resulted in 1122 deletions, 105 duplications, 5492 breakends, and 137 inversions. Filtering by highly likely functional (AnnotSV ranking > 3) resulted in 164 deletions, 17 duplications, 274 breakends, and 18 inversions. Finally, filtering for total length (>= 50 bp) resulted in 140 (of the 164) deletions. A familial analysis was performed to test sharing across study groups found, finding 154 breakends, 133 deletions, 4 duplications, and 13 inversions in each of the 11 subjects. A total of 35 shared breakends (23 shared within HRLS, 4 in IRLS, and 8 in LRLS) and 18 shared deletions (7 in HRLS, 4 in IRLS, and 7 in LRLS). Two duplications were shared by all members of LRLS and one inversion in IRLS and one inversion in LRLS. AnnotSV analysis for the SVs genomic location showed that 26 breakends and 17 deletions were intronic variants, but the 2 noted inversions were Transcript Start-Transcript End variants and the only detected duplication was an intronic variant (Table 4).

3.2.4. SVs with Risk-Implicated Genes

The 48 identified SVs were evaluated for functional prediction; the results showed that 43 SVs overlapped with genes with a probable functional impact, and 9 SVs overlapped with genes with an unlikely functional impact. Of these 43 SVs of probable impactful, 5 SVs overlapped with 4 genes with likely impact, 4 in the HRLS group (2 SVs in RELN, 1 each for FOXP1 and RRAS2 genes), and 1 in the IRLS group for the IRS2 gene previously associated with multiple cancers, including CRC according to OncoKB Cancer Genes list database. Two of the forty-three breakend SVs found in all members of the LRLS group had a potential impact on RELN, a gene correlated to several cancers including gastric cancer according to OncoKB Cancer Genes list database. One of the forty-three duplication SVs in all members of the IRLS group contains IRS2, a gene associated with esophageal, intestinal, stomach, and CRC cancers according to the My Cancer Genome database. One deletion SV in all members of LRLS impacts FOXP1, a gene correlated to several cancers including esophagogastric, gastrointestinal, and CRC in the My Cancer Genome database. Finally, one breakend SV found in all members of HRLS affects RRAS2, a gene associated with both breast and ovarian cancers according to the My Cancer Genome database (Table 5).

4. Discussion

Several studies have examined the complex molecular heterogeneity of LS CRC in large families. The literature suggests that LS is caused by genetic and epigenetic variants sporadically found in genes, such as MMR (MLH1, MSH2, MSH6, PMS2, and EPCAM) associated with flat intra-mucosal neoplastic lesions [24,25]. However, a recent study by Binder et al. defined a third pathway for LS and showed the existence of two distinct genetic subtypes of the LS CRC [26]. These recent findings suggest that the risk and genesis of LS CRC may be caused by various multiply expressed functionally important “variant risk clusters” of germline mutations, each cluster independently associated with various pathways to carcinogenesis, and in a similar manner, each cluster may define both the type and degree of risk to LS CRC even within one large pedigree. The evidence of our results coupled with the broader collection of results in the literature supports such a working hypothesis.
These observations lead to the hypothesis that, in addition to the relatively well-characterized MMR deficiency in LS, other germline mutations or groups of mutations may contribute to the disruption of previously unassociated pathways, thus being associated with varying risks and oncogenesis of LS CRC. In this study, we investigated the germline mutational profiles of a large pedigree Tunisian family with Lynch syndrome-associated colorectal cancer using high throughput whole-genome sequencing. Subjects were grouped by familial cancer status into high, intermediate, and low risk to LS. Overall, we identified 94 germline variants, including 11 novel and rare cancer pathogenic variants previously described in cancer. Then, we clustered the identified germline variants according to the pedigree risk status into LRLS, IRLS, and HRLS groups.
The high-risk LS patients shared a missense mutation (rs63750539, p.Ala111Val) in MLH1 which is unlikely to be the cause of LS-CRC predisposition.
The MLH1 gene has been established as a causative gene for LS and presents the highest risk of CRC among individuals over 75 (46.6% of women and 51.4% of men) who are affected by the MLH1 variation. Rates range from 0% (at age 30) to 48.3% (at age 75) in females, and from 4.5% (at age 30) to 57.1% (at age 75) in males [22,23]. In another study, MLH1 variants were correlated with the highest risk of developing CRC in both heredity and sporadic cases [27]. Furthermore, two MLH1 5′UTR variants (c.-28A > G and c.-7C > T) were associated with early-onset CRC [28]. In addition, a cohort study reported that MLH1 is the most frequently mutated gene in early-onset sporadic CRC patients, exhibiting four pathogenic variations: c.C793T (p.R265C), c.C1029A (p.Y343X), c.C793T (p.R265C), and c.C1029A (p.Y343X) [29]. On the other hand, recent studies have found that promoter methylation of the MLH1 gene is prone to be silenced in CRC carcinogenesis pathways, and around 50% of MLH1-deficient tumors exhibit MLH1 promoter methylation [30]. Moreover, MLH1 promoter methylation in CRC cases was highly correlated with a BRAF V600E somatic mutation [31]. This variant was clinically classified as a hereditary sequence variant identified in disease-related genes directly affecting the clinical management of patients with LS-CRC [32]. To better understand the high risk effect on the pedigree, we investigated known driver mutations that are likely related to MMR deficiency. First, we investigated mutations in genes that play a key role in the adenoma–carcinoma model of CRC such as APC, KRAS, TP53, and binding/transactivated genes. We found the rs373141354 variant in the PPP1R13B gene (p.Gly866Arg), which assists TP53 activation during the cell apoptosis and lowers their ability shared by all subjects HRLS group [33]. Although TP53 is known to be rarely mutated in LS [34], our findings can be attributed to the high efficiency of TP53 in maintaining genomic integrity by arresting cells with mutated or damaged DNA in the G1 phase of the cell cycle to enable the repair mechanism or induce the apoptosis pathway [35]. The balance between cell cycle arrest and induced apoptosis depends on TP53 efficiency, which is related to the PPP1R13B activity identified in our research. Hence, we suggest that the variant rs373141354 (p.Gly866Arg) is associated with an increased risk of LS-CRC due to its low efficiency in cell cycle arrest.
Concerning the detected SVs overlapped with genes that have been previously correlated with different types of cancers, including digestive cancers, four shared SVs were found among HRLS subjects. Two SVs (7_103463079_103463080_BND_1 and 7_103463462_103463463_BND_1) were found to be related to the intronic region of the RELN gene, which has been correlated to several cancer risks including gastric cancer [36]. The large CpG islands are located at RELN promoter sites, and their transcriptional silence has been shown to be strongly controlled by promoter hypermethylation [37]. Consequently, a relationship between SVs and DNA methylation in cancer is speculated. Recent studies suggest that somatic copy number alterations in cancer are associated with DNA methylation [38], and numerous studies demonstrate that SVs may have a causal role in regulating CpG methylation [39,40]. Conversely, it is also possible that methylation could lead to SV imbalance by increasing DNA breaking [41,42]. Our observation enhances our growing understanding of the relationship between genetic (SVs) and epigenetic variation in cellular phenotype and the mechanism of gene regulation as well as the traits underlying the evolution of cancer with the presence of SVs in specific genome sites. The 71242366_71242638_DEL_1 deletion was noted in the FOXP1 gene. A large amount of substantial evidence has demonstrated that the tumor microenvironment is closely linked to the initiation, promotion, and progression of CRC through various mechanisms, such as immune suppression and the angiogenesis process [43]. Variation in FOXP3, an intracellular key molecule for Treg development and function, has been associated with a dysregulation in subverting antitumor immune responses and promoting tumor progression. The last SV detected, 11_14348706_14348707_BND_1, was related to the intronic region of the RRAS2 gene, which has previously been described in breast and ovarian cancers [44]. The role of LS in ovarian cancer was established and widely accepted, but the long-standing question of whether breast cancer should also be included under the umbrella of LS is still debated [45,46]. A recent study, consistent with other studies, shows that carriers of LS mutations tend to have earlier manifestations of breast cancer [47].
Regarding the IRLS group, a unique missense variant rs551763507 (p.Gly3688Glu) in the LAMA5 Laminin gene was identified in all subjects of the group. Based on Laminin’s function, these variants are not the most probable candidates to play a role in CRC susceptibility [48]. However, recent studies have identified LAMA5 in orthotropic metastases. The expression of Laminin 511 was associated with the upregulation of a set of genes regulating angiogenesis in TCGA data [48,49]. The Gordon group has demonstrated that the Laminin chains are localized within the vascular basement membrane on the basolateral surface of cancer cells, while colonic epithelial cells normally do not express vascular basement membrane Laminins. Consequently, the profound effect on vascular morphology and function upon LAMA5 mutation and inhibition of LAMA5 expression specifically by colon cancer cells indicates that cancer cell Laminin 511 deposition is important for colonic cells, promoting angiogenesis. Our results may suggest that inhibiting the production of vascular basement Laminins by tumor cells may serve as an efficient approach to prevent growth and the ability of tumor cells to regulate angiogenesis. Another novel variant was identified as an IRLS member as rs76670455 in the NLRP14 gene. NLRP14 has been described as a negative regulator of IFN responses. Interestingly, as inflammatory signaling pathways contribute to B cell lymphoma transformation, it is tempting to speculate that NLRP14 might contribute to cancer [50]. Another interesting aspect of NLR proteins is their expression in a panel of immune cells, notably myeloid cells and B cells, and their function as a negative regulator of inflammatory responses [51]. This differential expression of NLRP14 might be involved in the malignant transformation process. For the noted SVs in this HRLS pedigree group, only one variant was detected, the 13_110418067_110419056_DEL_1 in the intronic region of the IRS2 gene, which was shared by all IRLS subjects and has been described as related to several digestive cancers. Over-expression of IRS2 increases CRC cell adhesion to a similar extent as IGF-1 stimulation. Changes in adhesion, both increasing and decreasing, are important properties of metastasizing cancer cells and are involved in the invasion process, migration, and distant seeding of tumors [52]. In addition, it has been proven that the PI3K pathway is frequently dysregulated during CRC progression [53]. The TCGA Network demonstrated that high levels of IRS2 expression are mutually exclusive with IGF2 over-expression and with other mutations in the PI3K pathways in CRC. This suggests that the over-expression of IRS2 may be one mechanism by which the PI3K pathway could be dysregulated in CRC [54]. In summary, IRS2 appears to be a potential candidate as an oncogene driver, and the IGF1R-IRS2-PI3K axis could be an important therapeutic target in CRC.
In connection with variants specific to LRLS, we noted rs763119571 in the TAS2R19 gene with a deleterious function. Previous research has shown the association of this variant with CRC risk [24]. Genetic variants in type 2 bitter taste receptors (TAS2R) may influence health-related outcomes and are expressed within the oral cavity [55,56], the gastrointestinal mucosa [57], and the lungs [58]. TAS2R variants are hypothesized to play roles in an individual’s food preferences [59] and the neutralization and expulsion of toxins from the colon/rectum [60], thereby influencing cancer risk. The last noted variant with the deleterious function was rs115378978 in the FTO gene. Different polymorphisms of the FTO gene have been consistently associated with obesity. However, recent genome studies reveal that genetic variants in this gene are associated not only with human adiposity and metabolic disorders but also with several cancers, including colorectal cancer, since they can activate several signaling and hormonal pathways to increase cancer incidence. The hormones included in this carcinogenesis process could be ghrelin, oxytocin, and Leptin. Hence, FTO polymorphisms could exert an influence on the hormonal balance and physiologic factors and might increase cancer risk [60]. The variants identified in FTO, TAS2R, and NLRP14 genes were correlated with lifestyle-related factors for cancer installation, which are expected to be found in cases belonging to the LRLS group who are CRC-free. No oncogenic impactful SVs were detected in the LRLS group, which may explain the low risk of these subjects developing LS CRC and the crucial role of these variants in heredity cancer development.
Our current findings clearly illustrate that subtle, familiarly grouped genetic factors underlying risk to LS CRC extend beyond the well-documented familial CRC syndrome genes. Our data suggest that comprehensive germline testing in all LS CRC patients will provide comprehensive results to identify substantially more opportunities for robust and accurate genetically driven cancer characterization and subsequent prevention than established in current practice. Moreover, the magnitude of developing secondary cancer (breast or ovarian) is still unknown. Hence, our study sheds light on the importance of risked grouped germline screening to identify secondary risks and even predict the site of metastases during CRC progression.
This study’s limitations include the lack of testing on all family members. On the other hand, given the limited studies on SV identification, the impact of the SVs found in our study was predicted based on the hypothesis and recent functional studies. Thus, further studies in many LS-CRC families are needed to confirm the effect of the 57 SVs identified in our study. In addition, despite the extensive advantages of whole-genome sequencing and data processing, there remain several gaps in the technology and analysis. For example, the restricted resolution is caused by limitations in the read depth related to the quality of aligned sequences; a higher depth provides a greater power to call and identify new variants [61]. However, our mixed pipeline generated for our WGS-specific study allows us to detect new and known coding and non-coding variants across the whole genome of the 11 sequenced individuals, greatly facilitating the germline profile evolution of LS-CRC pedigrees and opening new opportunities for cancer pedigree studies. The introduction of several commercially available multigene panels has tremendous promise for clinical use but simultaneously raises weaknesses such as clear criteria for selecting above average risk patients to undergo such clinical panel testing [62] and the optimal choice of the prescribed panel [63]. Several multigene panels are available for hereditary CRC and the National Comprehensive Cancer Network (NCCN) has provided a useful protocol for predicting the appropriate panel according to the family history of patients to better optimize the patient care [64]. A limitation of our study is that not all members of the family were tested for the MLH1 gene variant (c.-168_c.116 + 713del) detected in the family. This may lead to the study being considered a population-based WGS investigation in the context of MLH1 carriers and non-carriers, potentially weakening our conclusions about genetic modifiers in LS.

5. Conclusions

In this study, we analyzed the germline landscape of a Tunisian family with a predisposition to LS CRC and identified a total of 94 germline variants affecting 39 genes, 6 of which have been previously described in cancer, and 57 SVs, 48 of which were related to 43 genes, with 4 of these genes categorized as oncogenes. According to the familial definition of LS risk in the pedigree members, we identified three “variant risk clusters” associated with the high, intermediate, and low LS CRC risk groups in the pedigree. The results showed that variants related to high-risk LS members may be causative of the disease, while other variants present in intermediate-risk members may develop LS through very different mechanistic disruptions and low-risk members with these variants may not develop LS at all. The application of HTS technology with such variant clustering for germline screening will efficiently provide further insights into the etiology of hereditary cancer and a huge opportunity to improve LS clinical suspicion. The significance of germline variants in cancer predisposition is still poorly explored, and this study contributes to filling this knowledge gap.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/cancers15164074/s1, Section S1: Data Processing and Whole-Genome Mapping; Section S2: Variant Calling and Annotation; Section S2.1: SNPs and Indels; Section S2.2: Structural Variants; Section S2.3: Segregation in Pedigrees; Section S3: Variant Filtering; Section S3.1: SNPs and Indels; Section S3.2: Structural Variants; Section S4: Filtered Variants According to Cancer Relationship; Section S4.1: SNPs and Indels; Section S4.2: Structural Variants. References [65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92] are cited in the supplementary materials (References 1–28).

Author Contributions

M.B.: participated in study concept and design, carried out the study, and managed all project study participants who aided with experiments, patient consenting and charts, data review, statistical analysis, and manuscript preparation. A.A.M.: data analysis and processing, sequencing alignment, variant calling, and manuscript editing. A.M.: patient recruitment and clinical data acquisition. B.B.-Z.: original idea and study concept. P.J.T.: project principal investigation, original idea, study concept and design, guided overall study analysis, bioinformatics and statistical analysis, discussion of results, and review and editing of the final manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by funding provided by the Center for Biomedical Informatics, School of Medicine, University of Missouri, Columbia.

Institutional Review Board Statement

All subject investigations conformed to the principles outlined in the Declaration of Helsinki and have been performed with permission of the study protocol approved by the ethics committee of Salah Azaiz Institute (SAI), under the same’s Ethics Committee registration number (#ISA/2019/04).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The datasets used and analyzed during the current study are available from the corresponding author upon reasonable request.

Acknowledgments

We thank all blood donors who participated in the present study. We express our thanks to Ikram Sghaier for reviewing the manuscript and Hushan Yang and Chun Wang for their excellent assistance with experiments, discussion of results, and suggested ideas for consideration.

Conflicts of Interest

All authors have declared that they have no conflict of interest.

References

  1. Guillén-Ponce, C.; Lastra, E.; Lorenzo-Lorenzo, I.; Martín Gómez, T.; Morales Chamorro, R.; Sánchez-Heras, A.B.; Serrano, R.; Soriano Rodríguez, M.C.; Soto, J.L.; Robles, L. SEOM Clinical Guideline on Hereditary Colorectal Cancer (2019). Clin. Transl. Oncol. 2020, 22, 201–212. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Estimating the Global Cancer Incidence and Mortality in 2018: GLOBOCAN Sources and Methods—Ferlay—2019—International Journal of Cancer—Wiley Online Library. Available online: https://onlinelibrary.wiley.com/doi/full/10.1002/ijc.31937 (accessed on 17 April 2023).
  3. Ahnen, D.J.; Wade, S.W.; Jones, W.F.; Sifri, R.; Mendoza Silveiras, J.; Greenamyer, J.; Guiffre, S.; Axilbund, J.; Spiegel, A.; You, Y.N. The Increasing Incidence of Young-Onset Colorectal Cancer: A Call to Action. Mayo Clin. Proc. 2014, 89, 216–224. [Google Scholar] [CrossRef] [PubMed]
  4. Yurgelun, M.B.; Kulke, M.H.; Fuchs, C.S.; Allen, B.A.; Uno, H.; Hornick, J.L.; Ukaegbu, C.I.; Brais, L.K.; McNamara, P.G.; Mayer, R.J.; et al. Cancer Susceptibility Gene Mutations in Individuals With Colorectal Cancer. J. Clin. Oncol. 2017, 35, 1086–1095. [Google Scholar] [CrossRef] [PubMed]
  5. AlDubayan, S.H.; Giannakis, M.; Moore, N.D.; Han, G.C.; Reardon, B.; Hamada, T.; Mu, X.J.; Nishihara, R.; Qian, Z.; Liu, L.; et al. Inherited DNA-Repair Defects in Colorectal Cancer. Am. J. Hum. Genet. 2018, 102, 401–414. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  6. Colorectal Cancer—Risk Factors and Prevention. Available online: https://www.cancer.net/cancer-types/colorectal-cancer/risk-factors-and-prevention (accessed on 17 April 2023).
  7. Concordant and Discordant Familial Cancer: Familial Risks, Proportions and Population Impact—Frank—2017—International Journal of Cancer—Wiley Online Library. Available online: https://onlinelibrary.wiley.com/doi/full/10.1002/ijc.30583 (accessed on 17 April 2023).
  8. Umar, A.; Boland, C.R.; Terdiman, J.P.; Syngal, S.; de la Chapelle, A.; Rüschoff, J.; Fishel, R.; Lindor, N.M.; Burgart, L.J.; Hamelin, R.; et al. Revised Bethesda Guidelines for Hereditary Nonpolyposis Colorectal Cancer (Lynch Syndrome) and Microsatellite Instability. JNCI J. Natl. Cancer Inst. 2004, 96, 261–268. [Google Scholar] [CrossRef]
  9. Nagtegaal, I.D.; Odze, R.D.; Klimstra, D.; Paradis, V.; Rugge, M.; Schirmacher, P.; Washington, K.M.; Carneiro, F.; Cree, I.A. The 2019 WHO Classification of Tumours of the Digestive System. Histopathology 2020, 76, 182–188. [Google Scholar] [CrossRef] [Green Version]
  10. Medina Pabón, M.A.; Babiker, H.M. A Review Of Hereditary Colorectal Cancers. In StatPearls; StatPearls Publishing: Treasure Island, FL, USA, 2023. [Google Scholar]
  11. Dominguez-Valentin, M.; Nakken, S.; Tubeuf, H.; Vodak, D.; Ekstrøm, P.O.; Nissen, A.M.; Morak, M.; Holinski-Feder, E.; Martins, A.; Møller, P.; et al. Identification of Genetic Variants for Clinical Management of Familial Colorectal Tumors. BMC Med. Genet. 2018, 19, 26. [Google Scholar] [CrossRef]
  12. Syngal, S.; Fox, E.A.; Li, C.; Dovidio, M.; Eng, C.; Kolodner, R.D.; Garber, J.E. Interpretation of Genetic Test Results for Hereditary Nonpolyposis Colorectal CancerImplications for Clinical Predisposition Testing. JAMA 1999, 282, 247–253. [Google Scholar] [CrossRef] [Green Version]
  13. Familial Adenomatous Polyposis: Aberrant Splicing Due to Missense or Silent Mutations in the APC Gene—Aretz—2004—Human Mutation—Wiley Online Library. Available online: https://onlinelibrary.wiley.com/doi/abs/10.1002/humu.20087 (accessed on 17 April 2023).
  14. Nolano, A.; Medugno, A.; Trombetti, S.; Liccardo, R.; De Rosa, M.; Izzo, P.; Duraturo, F. Hereditary Colorectal Cancer: State of the Art in Lynch Syndrome. Cancers 2023, 15, 75. [Google Scholar] [CrossRef]
  15. Association of Low-Risk MSH3 and MSH2 Variant Alleles with Lynch Syndrome: Probability of Synergistic Effects—Duraturo—2011—International Journal of Cancer—Wiley Online Library. Available online: https://onlinelibrary.wiley.com/doi/full/10.1002/ijc.25824 (accessed on 17 April 2023).
  16. Rahman, N. Realizing the Promise of Cancer Predisposition Genes. Nature 2014, 505, 302–308. [Google Scholar] [CrossRef] [Green Version]
  17. Vasen, H.F.A.; Watson, P.; Mecklin, J.; Lynch, H.T. New Clinical Criteria for Hereditary Nonpolyposis Colorectal Cancer (HNPCC, Lynch Syndrome) Proposed by the International Collaborative Group on HNPCC. Gastroenterology 1999, 116, 1453–1456. [Google Scholar] [CrossRef]
  18. Vasen, H.F.A.; Mecklin, J.-P.; Meera Khan, P.; Lynch, H.T. The International Collaborative Group on Hereditary Non-Polyposis Colorectal Cancer (ICG-HNPCC). Dis. Colon Rectum 1991, 34, 424. [Google Scholar] [CrossRef] [PubMed]
  19. Rodriguez-Bigas, M.A.; Boland, C.R.; Hamilton, S.R.; Henson, D.E.; Srivastava, S.; Jass, J.R.; Khan, P.M.; Lynch, H.; Smyrk, T.; Perucho, M.; et al. A National Cancer Institute Workshop on Hereditary Nonpolyposis Colorectal Cancer Syndrome: Meeting Highlights and Bethesda Guidelines. JNCI J. Natl. Cancer Inst. 1997, 89, 1758–1762. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  20. Hampel, H.; Frankel, W.; Panescu, J.; Lockman, J.; Sotamaa, K.; Fix, D.; Comeras, I.; La Jeunesse, J.; Nakagawa, H.; Westman, J.A.; et al. Screening for Lynch Syndrome (Hereditary Nonpolyposis Colorectal Cancer) among Endometrial Cancer Patients. Cancer Res. 2006, 66, 7810–7817. [Google Scholar] [CrossRef] [Green Version]
  21. Piñol, V.; Castells, A.; Andreu, M.; Castellví-Bel, S.; Alenda, C.; Llor, X.; Xicola, R.M.; Rodríguez-Moranta, F.; Payá, A.; Jover, R.; et al. Accuracy of Revised Bethesda Guidelines, Microsatellite Instability, and Immunohistochemistry for the Identification of Patients With Hereditary Nonpolyposis Colorectal Cancer. JAMA 2005, 293, 1986–1994. [Google Scholar] [CrossRef] [PubMed]
  22. Kamau, E.; Agoti, C.N.; Ngoi, J.M.; de Laurent, Z.R.; Gitonga, J.; Cotten, M.; Phan, M.V.T.; Nokes, D.J.; Delwart, E.; Sanders, E.; et al. Complete Genome Sequences of Dengue Virus Type 2 Strains from Kilifi, Kenya. Microbiol. Resour. Announc. 2019, 8, e01566-18. [Google Scholar] [CrossRef] [Green Version]
  23. Barbirou, M.; Miller, A.A.; Gafni, E.; Mezlini, A.; Zidi, A.; Boley, N.; Tonellato, P.J. Evaluation of CfDNA as an Early Detection Assay for Dense Tissue Breast Cancer. Sci. Rep. 2022, 12, 8458. [Google Scholar] [CrossRef]
  24. Ahadova, A.; von Knebel Doeberitz, M.; Bläker, H.; Kloor, M. CTNNB1-Mutant Colorectal Carcinomas with Immediate Invasive Growth: A Model of Interval Cancers in Lynch Syndrome. Fam. Cancer 2016, 15, 579–586. [Google Scholar] [CrossRef]
  25. Three Molecular Pathways Model Colorectal Carcinogenesis in Lynch Syndrome—Ahadova—2018—International Journal of Cancer—Wiley Online Library. Available online: https://onlinelibrary.wiley.com/doi/full/10.1002/ijc.31300 (accessed on 17 April 2023).
  26. Binder, H.; Hopp, L.; Schweiger, M.R.; Hoffmann, S.; Jühling, F.; Kerick, M.; Timmermann, B.; Siebert, S.; Grimm, C.; Nersisyan, L.; et al. Genomic and Transcriptomic Heterogeneity of Colorectal Tumours Arising in Lynch Syndrome. J. Pathol. 2017, 243, 242–254. [Google Scholar] [CrossRef] [Green Version]
  27. Win, A.K.; Jenkins, M.A.; Dowty, J.G.; Antoniou, A.C.; Lee, A.; Giles, G.G.; Buchanan, D.D.; Clendenning, M.; Rosty, C.; Ahnen, D.J.; et al. Prevalence and Penetrance of Major Genes and Polygenes for Colorectal Cancer. Cancer Epidemiol Biomark Prev. 2017, 26, 404–412. [Google Scholar] [CrossRef] [Green Version]
  28. Hesson, L.B.; Packham, D.; Kwok, C.-T.; Nunez, A.C.; Ng, B.; Schmidt, C.; Fields, M.; Wong, J.W.H.; Sloane, M.A.; Ward, R.L. Lynch Syndrome Associated with Two MLH1 Promoter Variants and Allelic Imbalance of MLH1 Expression. Hum. Mutat. 2015, 36, 622–630. [Google Scholar] [CrossRef] [Green Version]
  29. Chang, P.-Y.; Chang, S.-C.; Wang, M.-C.; Chen, J.-S.; Tsai, W.-S.; You, J.-F.; Chen, C.-C.; Liu, H.-L.; Chiang, J.-M. Pathogenic Germline Mutations of DNA Repair Pathway Components in Early-Onset Sporadic Colorectal Polyp and Cancer Patients. Cancers 2020, 12, 3560. [Google Scholar] [CrossRef]
  30. Eikenboom, E.L.; van der Werf–’t Lam, A.-S.; Rodríguez-Girondo, M.; Van Asperen, C.J.; Dinjens, W.N.M.; Hofstra, R.M.W.; Van Leerdam, M.E.; Morreau, H.; Spaander, M.C.W.; Wagner, A.; et al. Universal Immunohistochemistry for Lynch Syndrome: A Systematic Review and Meta-Analysis of 58,580 Colorectal Carcinomas. Clin. Gastroenterol. Hepatol. 2022, 20, e496–e507. [Google Scholar] [CrossRef]
  31. Da Silva, S.I.O.; Domingos, T.A.; Kupper, B.E.C.; De Brot, L.; Aguiar Junior, S.; Carraro, D.M.; Torrezan, G.T. Amplicon-Based NGS Test for Assessing MLH1 Promoter Methylation and Its Correlation with BRAF Mutation in Colorectal Cancer Patients. Exp. Mol. Pathol. 2023, 130, 104855. [Google Scholar] [CrossRef] [PubMed]
  32. Thompson, B.A.; Spurdle, A.B.; Plazzer, J.-P.; Greenblatt, M.S.; Akagi, K.; Al-Mulla, F.; Bapat, B.; Bernstein, I.; Capellá, G.; den Dunnen, J.T.; et al. Application of a 5-Tiered Scheme for Standardized Classification of 2,360 Unique Mismatch Repair Gene Variants in the InSiGHT Locus-Specific Database. Nat. Genet. 2014, 46, 107–115. [Google Scholar] [CrossRef] [Green Version]
  33. Prognostic Values of Apoptosis-Stimulating P53-Binding Protein 1 and 2 and Their Relationships with Clinical Characteristics of Esophageal Squamous Cell Carcinoma Patients: A Retrospective Study|SpringerLink. Available online: https://link.springer.com/article/10.1186/s40880-016-0169-0 (accessed on 17 April 2023).
  34. Kloth, M.; Ruesseler, V.; Engel, C.; Koenig, K.; Peifer, M.; Mariotti, E.; Kuenstlinger, H.; Florin, A.; Rommerscheidt-Fuss, U.; Koitzsch, U.; et al. Activating ERBB2/HER2 Mutations Indicate Susceptibility to Pan-HER Inhibitors in Lynch and Lynch-like Colorectal Cancer. Gut 2016, 65, 1296–1305. [Google Scholar] [CrossRef]
  35. P53 Polymorphic Variants at Codon 72 Exert Different Effects on Cell Cycle Progression—Pim—2004—International Journal of Cancer—Wiley Online Library. Available online: https://onlinelibrary.wiley.com/doi/full/10.1002/ijc.11548 (accessed on 17 April 2023).
  36. Jamshidi, M.; Mohammadi Pour, S.; Mahmoudian-Sani, M.-R. Single Nucleotide Variants Associated with Colorectal Cancer Among Iranian Patients: A Narrative Review. Pharmacogenomics Pers. Med. 2020, 13, 167–180. [Google Scholar] [CrossRef] [PubMed]
  37. Chen, Y.; Sharma, R.P.; Costa, R.H.; Costa, E.; Grayson, D.R. On the Epigenetic Regulation of the Human Reelin Promoter. Nucleic Acids Res. 2002, 30, 2930–2939. [Google Scholar] [CrossRef] [Green Version]
  38. Sun, W.; Bunn, P.; Jin, C.; Little, P.; Zhabotynsky, V.; Perou, C.M.; Hayes, D.N.; Chen, M.; Lin, D.-Y. The Association between Copy Number Aberration, DNA Methylation and Gene Expression in Tumor Samples. Nucleic Acids Res. 2018, 46, 3009–3018. [Google Scholar] [CrossRef] [PubMed]
  39. Gutierrez-Arcelus, M.; Lappalainen, T.; Montgomery, S.B.; Buil, A.; Ongen, H.; Yurovsky, A.; Bryois, J.; Giger, T.; Romano, L.; Planchon, A.; et al. Passive and Active DNA Methylation and the Interplay with Genetic Variation in Gene Regulation. eLife 2013, 2, e00523. [Google Scholar] [CrossRef] [PubMed]
  40. Shi, X.; Radhakrishnan, S.; Wen, J.; Chen, J.Y.; Chen, J.; Lam, B.A.; Mills, R.E.; Stranger, B.E.; Lee, C.; Setlur, S.R. Association of CNVs with Methylation Variation. npj Genom. Med. 2020, 5, 1–10. [Google Scholar] [CrossRef] [PubMed]
  41. Li, J.; Harris, R.A.; Cheung, S.W.; Coarfa, C.; Jeong, M.; Goodell, M.A.; White, L.D.; Patel, A.; Kang, S.-H.; Shaw, C.; et al. Genomic Hypomethylation in the Human Germline Associates with Selective Structural Mutability in the Human Genome. PLoS Genet. 2012, 8, e1002692. [Google Scholar] [CrossRef] [PubMed]
  42. Zhang, Y.; Yang, L.; Kucherlapati, M.; Hadjipanayis, A.; Pantazi, A.; Bristow, C.A.; Lee, E.A.; Mahadeshwar, H.S.; Tang, J.; Zhang, J.; et al. Global Impact of Somatic Structural Variation on the DNA Methylome of Human Cancers. Genome Biol. 2019, 20, 209. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  43. Motz, G.T.; Coukos, G. The Parallel Lives of Angiogenesis and Immunosuppression: Cancer and Other Tales. Nat. Rev. Immunol. 2011, 11, 702–711. [Google Scholar] [CrossRef] [PubMed]
  44. Gordiev, M.; Shigapova, L.H.; Brovkina, O.; Enikeev, R.F.; Druzhkov, M.; Nikitin, A.; Shagimardanova, E.; Gusev, O. Lynch Syndrome-Associated Hereditary Mutations Cause Breast and Ovarian Cancer: Results from Russian Heredetary Oncogenomics Project. Ann. Oncol. 2018, 29, viii77. [Google Scholar] [CrossRef]
  45. Brovkina, O.I.; Shigapova, L.; Chudakova, D.A.; Gordiev, M.G.; Enikeev, R.F.; Druzhkov, M.O.; Khodyrev, D.S.; Shagimardanova, E.I.; Nikitin, A.G.; Gusev, O.A. The Ethnic-Specific Spectrum of Germline Nucleotide Variants in DNA Damage Response and Repair Genes in Hereditary Breast and Ovarian Cancer Patients of Tatar Descent. Front. Oncol. 2018, 8, 421. [Google Scholar] [CrossRef]
  46. Is Breast Cancer a Part of Lynch Syndrome?|SpringerLink. Available online: https://link.springer.com/article/10.1186/bcr3241 (accessed on 17 April 2023).
  47. Nikitin, A.G.; Chudakova, D.A.; Enikeev, R.F.; Sakaeva, D.; Druzhkov, M.; Shigapova, L.H.; Brovkina, O.I.; Shagimardanova, E.I.; Gusev, O.A.; Gordiev, M.G. Lynch Syndrome Germline Mutations in Breast Cancer: Next Generation Sequencing Case-Control Study of 1,263 Participants. Front. Oncol. 2020, 10, 666. [Google Scholar] [CrossRef]
  48. Use of Multigene-Panel Identifies Pathogenic Variants in Several CRC-Predisposing Genes in Patients Previously Tested for Lynch Syndrome—Hansen—2017—Clinical Genetics—Wiley Online Library. Available online: https://onlinelibrary.wiley.com/doi/full/10.1111/cge.12994 (accessed on 17 April 2023).
  49. Gordon-Weeks, A.; Lim, S.Y.; Yuzhalin, A.; Lucotti, S.; Vermeer, J.A.F.; Jones, K.; Chen, J.; Muschel, R.J. Tumour-Derived Laminin A5 (LAMA5) Promotes Colorectal Liver Metastasis Growth, Branching Angiogenesis and Notch Pathway Inhibition. Cancers 2019, 11, 630. [Google Scholar] [CrossRef] [Green Version]
  50. Abe, T.; Lee, A.; Sitharam, R.; Kesner, J.; Rabadan, R.; Shapira, S.D. Germ-Cell-Specific Inflammasome Component NLRP14 Negatively Regulates Cytosolic Nucleic Acid Sensing to Promote Fertilization. Immunity 2017, 46, 621–634. [Google Scholar] [CrossRef] [Green Version]
  51. Ellwanger, K.; Becker, E.; Kienes, I.; Sowa, A.; Postma, Y.; Gloria, Y.C.; Weber, A.N.R.; Kufer, T.A. The NLR Family Pyrin Domain–Containing 11 Protein Contributes to the Regulation of Inflammatory Signaling. J. Biol. Chem. 2018, 293, 2701–2710. [Google Scholar] [CrossRef] [Green Version]
  52. Schlüter, K.; Gassmann, P.; Enns, A.; Korb, T.; Hemping-Bovenkerk, A.; Hölzen, J.; Haier, J. Organ-Specific Metastatic Tumor Cell Adhesion and Extravasation of Colon Carcinoma Cells with Different Metastatic Potential. Am. J. Pathol. 2006, 169, 1064–1073. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  53. Parsons, D.W.; Wang, T.-L.; Samuels, Y.; Bardelli, A.; Cummins, J.M.; DeLong, L.; Silliman, N.; Ptak, J.; Szabo, S.; Willson, J.K.V.; et al. Mutations in a Signalling Pathway. Nature 2005, 436, 792. [Google Scholar] [CrossRef]
  54. IRS2 Is a Candidate Driver Oncogene on 13q34 in Colorectal Cancer—Day—2013—International Journal of Experimental Pathology—Wiley Online Library. Available online: https://onlinelibrary.wiley.com/doi/full/10.1111/iep.12021 (accessed on 17 April 2023).
  55. Diószegi, J.; Llanaj, E.; Ádány, R. Genetic Background of Taste Perception, Taste Preferences, and Its Nutritional Implications: A Systematic Review. Front. Genet. 2019, 10, 1272. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  56. Bachmanov, A.A.; Beauchamp, G.K. Taste Receptor Genes. Annu. Rev. Nutr. 2007, 27, 389–414. [Google Scholar] [CrossRef] [Green Version]
  57. Sternini, C. Taste Receptors in the Gastrointestinal Tract. IV. Functional Implications of Bitter Taste Receptors in Gastrointestinal Chemosensing. Am. J. Physiol.-Gastrointest. Liver Physiol. 2007, 292, G457–G461. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  58. Deshpande, D.A.; Wang, W.C.H.; McIlmoyle, E.L.; Robinett, K.S.; Schillinger, R.M.; An, S.S.; Sham, J.S.K.; Liggett, S.B. Bitter Taste Receptors on Airway Smooth Muscle Bronchodilate by Localized Calcium Signaling and Reverse Obstruction. Nat. Med. 2010, 16, 1299–1304. [Google Scholar] [CrossRef] [Green Version]
  59. Drewnowski, A.; Rock, C.L. The Influence of Genetic Taste Markers on Food Acceptance. Am. J. Clin. Nutr. 1995, 62, 506–511. [Google Scholar] [CrossRef]
  60. Rozengurt, E. Taste Receptors in the Gastrointestinal Tract. I. Bitter Taste Receptors and α-Gustducin in the Mammalian Gut. Am. J. Physiol.-Gastrointest. Liver Physiol. 2006, 291, G171–G177. [Google Scholar] [CrossRef] [Green Version]
  61. Whole-Genome Sequencing of Synchronous Thyroid Carcinomas Identifies Aberrant DNA Repair in Thyroid Cancer Dedifferentiation—Paulsson—2020—The Journal of Pathology—Wiley Online Library. Available online: https://pathsocjournals.onlinelibrary.wiley.com/doi/full/10.1002/path.5359 (accessed on 17 April 2023).
  62. NCCN Guidelines Insights: Genetic/Familial High-Risk Assessment: Colorectal, Version 2.2019 in: Journal of the National Comprehensive Cancer Network Volume 17 Issue 9 (2019). Available online: https://jnccn.org/view/journals/jnccn/17/9/article-p1032.xml (accessed on 12 June 2023).
  63. Syngal, S.; Brand, R.E.; Church, J.M.; Giardiello, F.M.; Hampel, H.L.; Burt, R.W. ACG Clinical Guideline: Genetic Testing and Management of Hereditary Gastrointestinal Cancer Syndromes. Am. J. Gastroenterol. 2015, 110, 223–263. [Google Scholar] [CrossRef] [Green Version]
  64. Hampel, H.; Kalady, M.F.; Pearlman, R.; Stanich, P.P. Hereditary Colorectal Cancer. Hematol. Oncol. Clin. 2022, 36, 429–447. [Google Scholar] [CrossRef]
  65. Li, H.; Durbin, R. Fast and Accurate Long-Read Alignment with Burrows–Wheeler Transform. Bioinformatics 2010, 26, 589–595. [Google Scholar] [CrossRef] [Green Version]
  66. Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R. 1000 Genome Project Data Processing Subgroup The Sequence Alignment/Map Format and SAMtools. Bioinformatics 2009, 25, 2078–2079. [Google Scholar] [CrossRef] [Green Version]
  67. McKenna, A.; Hanna, M.; Banks, E.; Sivachenko, A.; Cibulskis, K.; Kernytsky, A.; Garimella, K.; Altshuler, D.; Gabriel, S.; Daly, M.; et al. The Genome Analysis Toolkit: A MapReduce Framework for Analyzing next-Generation DNA Sequencing Data. Genome Res. 2010, 20, 1297–1303. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  68. Wang, K.; Li, M.; Hakonarson, H. ANNOVAR: Functional Annotation of Genetic Variants from High-Throughput Sequencing Data. Nucleic Acids Res. 2010, 38, e164. [Google Scholar] [CrossRef]
  69. 1000 Genomes Project Consortium; Auton, A.; Brooks, L.D.; Durbin, R.M.; Garrison, E.P.; Kang, H.M.; Korbel, J.O.; Marchini, J.L.; McCarthy, S.; McVean, G.A.; et al. A Global Reference for Human Genetic Variation. Nature 2015, 526, 68–74. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  70. Smigielski, E.M.; Sirotkin, K.; Ward, M.; Sherry, S.T. DbSNP: A Database of Single Nucleotide Polymorphisms. Nucleic Acids Res. 2000, 28, 352–355. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  71. Lek, M.; Karczewski, K.J.; Minikel, E.V.; Samocha, K.E.; Banks, E.; Fennell, T.; O’Donnell-Luria, A.H.; Ware, J.S.; Hill, A.J.; Cummings, B.B.; et al. Analysis of Protein-Coding Genetic Variation in 60,706 Humans. Nature 2016, 536, 285–291. [Google Scholar] [CrossRef] [Green Version]
  72. Linderman, M.D.; Paudyal, C.; Shakeel, M.; Kelley, W.; Bashir, A.; Gelb, B.D. NPSV: A Simulation-Driven Approach to Genotyping Structural Variants in Whole-Genome Sequencing Data. GigaScience 2021, 10, giab046. [Google Scholar] [CrossRef]
  73. Layer, R.M.; Chiang, C.; Quinlan, A.R.; Hall, I.M. LUMPY: A Probabilistic Framework for Structural Variant Discovery. Genome Biol. 2014, 15, R84. [Google Scholar] [CrossRef] [Green Version]
  74. Chiang, C.; Layer, R.M.; Faust, G.G.; Lindberg, M.R.; Rose, D.B.; Garrison, E.P.; Marth, G.T.; Quinlan, A.R.; Hall, I.M. SpeedSeq: Ultra-Fast Personal Genome Analysis and Interpretation. Nat. Methods 2015, 12, 966–968. [Google Scholar] [CrossRef] [Green Version]
  75. Pedersen, B.S.; Quinlan, A.R. Duphold: Scalable, Depth-Based Annotation and Curation of High-Confidence Structural Variant Calls. GigaScience 2019, 8, giz040. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  76. Geoffroy, V.; Herenger, Y.; Kress, A.; Stoetzel, C.; Piton, A.; Dollfus, H.; Muller, J. AnnotSV: An Integrated Tool for Structural Variations Annotation. Bioinformatics 2018, 34, 3572–3574. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  77. Kircher, M.; Witten, D.M.; Jain, P.; O’Roak, B.J.; Cooper, G.M.; Shendure, J. A General Framework for Estimating the Relative Pathogenicity of Human Genetic Variants. Nat. Genet. 2014, 46, 310–315. [Google Scholar] [CrossRef] [Green Version]
  78. Schwarz, J.M.; Rödelsperger, C.; Schuelke, M.; Seelow, D. MutationTaster Evaluates Disease-Causing Potential of Sequence Alterations. Nat. Methods 2010, 7, 575–576. [Google Scholar] [CrossRef]
  79. Adzhubei, I.A.; Schmidt, S.; Peshkin, L.; Ramensky, V.E.; Gerasimova, A.; Bork, P.; Kondrashov, A.S.; Sunyaev, S.R. A Method and Server for Predicting Damaging Missense Mutations. Nat. Methods 2010, 7, 248–249. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  80. Choi, Y.; Chan, A.P. PROVEAN Web Server: A Tool to Predict the Functional Effect of Amino Acid Substitutions and Indels. Bioinformatics 2015, 31, 2745–2747. [Google Scholar] [CrossRef] [Green Version]
  81. Ng, P.C.; Henikoff, S. SIFT: Predicting Amino Acid Changes That Affect Protein Function. Nucleic Acids Res. 2003, 31, 3812–3814. [Google Scholar] [CrossRef] [Green Version]
  82. DbNSFP v3.0: A One-Stop Database of Functional Predictions and Annotations for Human Nonsynonymous and Splice-Site SNVs—Liu—2016—Human Mutation—Wiley Online Library. Available online: https://onlinelibrary.wiley.com/doi/full/10.1002/humu.22932 (accessed on 17 April 2023).
  83. Ward, L.D.; Kellis, M. HaploReg v4: Systematic Mining of Putative Causal Variants, Cell Types, Regulators and Target Genes for Human Complex Traits and Disease. Nucleic Acids Res. 2016, 44, D877–D881. [Google Scholar] [CrossRef]
  84. Boyle, A.P.; Hong, E.L.; Hariharan, M.; Cheng, Y.; Schaub, M.A.; Kasowski, M.; Karczewski, K.J.; Park, J.; Hitz, B.C.; Weng, S.; et al. Annotation of Functional Variation in Personal Genomes Using RegulomeDB. Genome Res. 2012, 22, 1790–1797. [Google Scholar] [CrossRef] [Green Version]
  85. Birney, E.; Stamatoyannopoulos, J.A.; Dutta, A.; Guigó, R.; Gingeras, T.R.; Margulies, E.H.; Weng, Z.; Snyder, M.; Dermitzakis, E.T.; Stamatoyannopoulos, J.A.; et al. Identification and Analysis of Functional Elements in 1% of the Human Genome by the ENCODE Pilot Project. Nature 2007, 447, 799–816. [Google Scholar] [CrossRef] [Green Version]
  86. Collins, R.L.; Brand, H.; Karczewski, K.J.; Zhao, X.; Alföldi, J.; Francioli, L.C.; Khera, A.V.; Lowther, C.; Gauthier, L.D.; Wang, H.; et al. A Structural Variation Reference for Medical and Population Genetics. Nature 2020, 581, 444–451. [Google Scholar] [CrossRef] [PubMed]
  87. Richards, S.; Aziz, N.; Bale, S.; Bick, D.; Das, S.; Gastier-Foster, J.; Grody, W.W.; Hegde, M.; Lyon, E.; Spector, E.; et al. Standards and Guidelines for the Interpretation of Sequence Variants: A Joint Consensus Recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet. Med. 2015, 17, 405–423. [Google Scholar] [CrossRef] [Green Version]
  88. Oscanoa, J.; Sivapalan, L.; Gadaleta, E.; Dayem Ullah, A.Z.; Lemoine, N.R.; Chelala, C. SNPnexus: A Web Server for Functional Annotation of Human Genome Sequence Variation (2020 Update). Nucleic Acids Res. 2020, 48, W185–W192. [Google Scholar] [CrossRef] [PubMed]
  89. Becker, K.G.; Barnes, K.C.; Bright, T.J.; Wang, S.A. The Genetic Association Database. Nat. Genet. 2004, 36, 431–432. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  90. Tamborero, D.; Rubio-Perez, C.; Deu-Pons, J.; Schroeder, M.P.; Vivancos, A.; Rovira, A.; Tusquets, I.; Albanell, J.; Rodon, J.; Tabernero, J.; et al. Cancer Genome Interpreter Annotates the Biological and Clinical Relevance of Tumor Alterations. Genome Med. 2018, 10, 25. [Google Scholar] [CrossRef] [Green Version]
  91. Chakravarty, D.; Gao, J.; Phillips, S.; Kundra, R.; Zhang, H.; Wang, J.; Rudolph, J.E.; Yaeger, R.; Soumerai, T.; Nissan, M.H.; et al. OncoKB: A Precision Oncology Knowledge Base. JCO Precis. Oncol. 2017, 2017, PO.17.00011. [Google Scholar] [CrossRef]
  92. Home—My Cancer Genome. Available online: https://www.mycancergenome.org/ (accessed on 12 June 2023).
Figure 1. A pedigree of predisposition to colorectal cancer Lynch syndrome. I, II, III, IV the number of generations. * Sequenced subjects; Group 1 (high risk to LS group): subject and parent affected by LS-CRC, red circles; Group 2 (intermediate risk to LS): subject LS-CRC free and one of the parents is affected, green circles; Group 3 (low risk to LS): subject and both parents are LS-CRC free, blue circles.
Figure 1. A pedigree of predisposition to colorectal cancer Lynch syndrome. I, II, III, IV the number of generations. * Sequenced subjects; Group 1 (high risk to LS group): subject and parent affected by LS-CRC, red circles; Group 2 (intermediate risk to LS): subject LS-CRC free and one of the parents is affected, green circles; Group 3 (low risk to LS): subject and both parents are LS-CRC free, blue circles.
Cancers 15 04074 g001
Figure 2. Schematic representation of the germline variant prioritization workflow. WGS: whole genome sequencing, QC: quality control, SNPs: single nucleotide polymorphisms, indels: insertions/deletions, SVs: structural variants, CADD: combined annotation dependent depletion, OncoKB: Oncology Knowledge Base.
Figure 2. Schematic representation of the germline variant prioritization workflow. WGS: whole genome sequencing, QC: quality control, SNPs: single nucleotide polymorphisms, indels: insertions/deletions, SVs: structural variants, CADD: combined annotation dependent depletion, OncoKB: Oncology Knowledge Base.
Cancers 15 04074 g002
Table 1. Demographic and clinical characteristics of the included family members.
Table 1. Demographic and clinical characteristics of the included family members.
ParametersHRLS N = 3 (%)IRLS N = 4 (%)LRLS N = 4 (%)
Gender (Males/Females)1/20/42/2
Age (years) 148 ± 12.1638.25 ± 17.6335.50 ± 24.11
BMI 126.08 ± 3.1126.77 ± 3.4727.52 ± 2.64
Vegetable consumption
High0 (0)3 (75)4 (100)
Low3 (100)1(25)0 (0)
Brine consumption
High2 (66.66)0 (0)1 (25)
Low1 (33.34)4 (100)3 (75)
Meat consumption
High2 (66.66)2 (50)3 (75)
Low1 (33.34)2 (50)1 (25)
Fat consumption
High0 (0)4 (100)3 (75)
Low3 (100)0 (0)1 (25)
Smoking
Never used1 (33.34)3 (75)2 (50)
Tobacco users2 (66.66)1 (25)2 (50)
Alcohol
Never drink1 (33.34)4 (100)2 (50)
Alcohol users2 (66.66)0 (0)2 (50)
Physical activity level
High0 (0)2 (50)4 (100)
Low3 (100)2 (50)0 (0)
Medical History
Hypertension2 (66.66)3(75)1(25)
Hyperglycemia2 (66.66)2(50)0 (0)
Anemia0 (0)0 (0)0 (0)
BMI: Body mass index; tobacco users: former users and current users; alcohol users: former drinkers and current drinkers, 1: mean ± standard deviation; high risk to LS group (HRLS); intermediate risk to LS (IRLS); low risk to LS (LRLS).
Table 2. Variants count for the whole pedigree subjects and shared variants by subjects under the same pedigree group with the functional annotation of noncoding variants according to ANNOVAR.
Table 2. Variants count for the whole pedigree subjects and shared variants by subjects under the same pedigree group with the functional annotation of noncoding variants according to ANNOVAR.
Variants Filtering Variant Count
VQSR9,876,341 (7,836,438 SNPs and 2,039,903 Indels)
<0.01 AF 1000 G ALL and non-TCGA ExAC ALL2,334,312
CADD (SNPs) or CADD Indel (Indels) Scaled Phred Score > 1080,905
Variant stratification Coding VariantsNon-Coding Variants
Total count2824 78,081
Predicted deleterious by having at least three of MutationTaster, PolyPhen V2, Provean, and SIFT19619826
Shared by all samples in a group79 (HRLS: 37; IRLS: 27; LRLS: 15)9826 (HRLS: 4171; IRLS: 2820; LRLS: 2835)
Exclusive to a particular group019 (HRLS: 2; IRLS: 3; LRLS: 14)
RegulomeDB Score < 4NALRLS: 4
Functional annotation of noncoding variants according to ANNOVAR
Variants annotation according to region hit from RefSeqVariants shared by all subjects Variants shared by subjects in the same group
Intergenic37,41210
Intronic30,3447
ncRNA_intronic50720
3′UTR21690
Upstream and Downstream18252
5′UTR57680
ncRNA_exonic4470
RefSeq: reference sequence database; ncRNA: non-coding transcript variant; NA: not applicable; VQSR: Variant Quality Score Recalibration; ExAC: exome aggregation consortium; AF: allele frequency; 1000 G: 1000 genomes project for all individual in this release; CADD: combined annotation dependent depletion; SNPs: single nucleotide polymorphisms; indels: insertions/deletions; PolyPhen V2: PolyPhen Version 2; high risk to LS group (HRLS); intermediate risk to LS (IRLS) low risk to LS (LRLS); TCGA: The Cancer Genome Atlas Program; SIFT: sorting intolerant from tolerant; PROVEAN: Protein Variation Effect Analyzer.
Table 3. Classification of detected variants by gene and cancer impact according to pedigree groups.
Table 3. Classification of detected variants by gene and cancer impact according to pedigree groups.
Coding Variants
GenesDetected Variants Stratified by Pedigree GroupsAFFunctional AnnotationCancer RelatedCRC RelatedOncogenic ClassificationTumor Driver
SNP IDHRLSIRLSLRLS
PABPC3Variants Count554
rs79397892YesYesYes0.0054EX--PassengerTrue
rs78826513YesYesYesNAEX--DriverTrue
rs78552667YesYesYesNAEX--PassengerTrue
rs201411821YesYes-NAEX--DriverTrue
rs80261016YesYesYesNAEX--DriverTrue
KRT18Variants Count444
rs78343594YesYesYesNAEX--PassengerFalse
rs77999286YesYesYesNAEX--PassengerFalse
rs75441140YesYesYesNAEX--PassengerFalse
NAYesYesYesNAEX--NANA
CNN2Variants Count221
rs77830704YesYesYesNAEX--PassengerFalse
rs75676484YesYes-NAEX--PassengerFalse
SLC25A5Variants Count121
rs753913830YesYesYesNAEX--PassengerFalse
rs199707714-Yes-NAEX--PassengerFalse
MYH13Variants Count111
rs186137259YesYesYes0.0016EX--PassengerFalse
DNAH2Variants Count111
rs140035206YesYesYes0.0022EX--PassengerFalse
ANP32BVariants Count11-
rs76167314YesYes-NAEX--PassengerFalse
CNKSR1Variants Count11-
rs140685957YesYes-NAEX--PassengerFalse
CTNNBIP1Variants Count11-
rs138271667YesYes-0.0018EX--PassengerFalse
DNAH3Variants Count11-
rs147732992YesYes-0.0034EX--PassengerFalse
KDM1AVariants Count11-
rs144822945YesYes-0.0015EX--PassengerFalse
PRSS3Variants Count211
rs141382822YesYesYesNAEX--PassengerFalse
rs751456445Yes--NAEX--PassengerFalse
AATKVariants Count1--
rs61738829Yes--0.0088EX--PassengerFalse
ALPK3Variants Count1--
NAYes--NAEX--NANA
ANKRD34BVariants Count1--
rs145614517Yes--0.0030EX--PassengerFalse
ATXN2Variants Count1--
rs374319477Yes--NAEX--PassengerFalse
CUX2Variants Count1--
rs202242120Yes--0.0040EX--PassengerFalse
ERAP2Variants Count1--
rs145045143Yes--0.0010EX--PassengerFalse
FAM136AVariants Count1--
rs80277652Yes--NAEX--PassengerFalse
MACF1Variants Count1--
rs201602708Yes--0.0002EX--DriverTrue
MLH1Variants Count1--
rs63750539Yes--NAEXLeukemia; Lymphoma;
Melanoma; Pancreatic;
Liver; Fallopian; Endometrial; Ovarian; Breast; Prostate; Bladder; Thyroid; Esophageal; Lung
Lynch syndrome; Colorectal melanoma; Microsatellite Instability; Hereditary non-polyposis; Colon; Gastric;
Stomach
DriverTrue
PCSK5Variants Count1--
rs372055352Yes--NAEX--PassengerFalse
PLA2G6Variants Count1--
NAYes--NAEX--NANA
PPP1R13BVariants Count1--
rs373141354Yes--NAEXMelanoma-PassengerFalse
RIMKLAVariants Count1--
rs34142209Yes--0.0082EX--PassengerFalse
RNF207Variants Count1--
NAYes--NAEX--NANA
SIPA1L3Variants Count1--
rs201766021Yes--0.0002EX--PassengerFalse
XIRP1Variants Count1--
rs147417919Yes--0.0032EX--PassengerFalse
CD8AVariants Count-1-
rs200750291-Yes-0.0012EX--PassengerFalse
LAMA5Variants Count-1-
rs551763507-Yes-NAEXNeuroblastoma-PassengerFalse
MRPL24Variants Count-1-
rs561581574-Yes-NAEX--PassengerFalse
NLRP14Variants Count-1-
rs76670455-Yes-0.0058EXLeukemia-PassengerFalse
TAAR5Variants Count-1-
rs9493386-Yes-0.0026EX--PassengerFalse
PRH1-TAS2R14Variants Count--1
rs763119571--YesNAEX-ColorectalPassengerFalse
TMEM8BVariants Count--1
rs148540551--Yes0.0014EX--PassengerFalse
Noncoding variants
PODNVariants Count--1
rs544153916--Yes0.0002NA--Not protein-affectingFalse
SCP2Variants Count--1
rs116197074--Yes0.0056INT--Not protein-affectingFalse
MAML3Variants Count--1
rs116526711--Yes0.007INT--Not protein-affectingFalse
FTOVariants Count--1
rs115378978--Yes0.0078INTProstate-Not protein-affectingFalse
AF: 1000 G Phase 3 all population allele frequency; row in bold: variant previously described as associated with cancer; CRC: colorectal cancer; SNP: single nucleotide polymorphism; ID: identification; rs: reference SNP; INT: intronic; EX: exonic; NA: not applicable; high risk to LS group (HRLS); intermediate Risk to LS (IRLS); low risk to LS (LRLS).
Table 4. Variants count for the whole pedigree subjects and shared variants by subjects under the same pedigree group with variants location according to AnnotSV.
Table 4. Variants count for the whole pedigree subjects and shared variants by subjects under the same pedigree group with variants location according to AnnotSV.
Variants Filtering Variant Count
Deletions and duplications: duphold depth based4314 (4120 DEL, 194 DUP)
Breakends and inversions: NA6857 (6560 BND, 297 INV)
Total countBNDDELDUPINV
<0.01 AF 1000G ALL and <0.01 gnomAD54921122105137
AnnotSV ranking > 32741641718
Total length >= 50 bpNA1401718
Shared by all samples in a group154 (HRLS: 66; IRLS: 39; LRLS: 49)133 (HRLS: 48; IRLS: 46; LRLS: 39)4 (HRLS: 1; IRLS: 0; LRLS: 3)13 (HRLS: 4; IRLS: 4; LRLS: 5)
Shared by all samples under a particular group35 (HRLS: 23; IRLS: 4; LRLS: 8)18 (HRLS: 7; IRLS: 4; LRLS: 7)2 (HRLS: 0; IRLS: 0; LRLS: 2)2 (HRLS: 0; IRLS: 1; LRLS: 1)
Variant’s location according to AnnotSV
Variants annotation according to hit from RefSeqBNDDELDUPINV
Intronic261710
Exonic2000
txStart-txEnd0002
NA7110
RefSeq: reference sequence database; NA: not applicable; AF: allele frequency; 1000 G: 1000 genomes project for all individuals in this release; high risk to LS group (HRLS); intermediate risk to LS (IRLS); low risk to LS (LRLS); bp: base pair; BND: breakend; DEL: deletion; DUP: duplication; INV: inversion; gnomAD: Genome Aggregation Database; txStart-txEnd: Transcript Start-Transcript End.
Table 5. Classification of structural variants by gene and cancer impact according to pedigree groups.
Table 5. Classification of structural variants by gene and cancer impact according to pedigree groups.
Structural Variants
GenesDetected Variants Stratified by Pedigree GroupsAFLocationCancer RelatedCRC Related
AnnotSV IDLRLSIRLSHRLS
ADAM10Variants Count2--
15_58912864_58912865_BND_1Yes--NAINT--
15_58913242_58913243_BND_1Yes--NAINT
ATP11AVariants Count1-1
13_113518019_113519165_DEL_1Yes--NAINT--
13_113499584_113500055_DEL_1--YesNAINT
GIGYF2Variants Count2--
2_233668385_233668386_BND_1Yes--NAINT--
2_233668758_233668759_BND_1Yes--NAINT
LINC01137Variants Count2--
1_37938257_37938258_BND_1Yes--NAINT--
1_37938376_37938377_BND_1Yes--NAINT
RELNVariants Count--1
7_103463079_103463080_BND_1--Yes0.0001INTLeukemia; Lung; HepatocellularGastric
7_103463462_103463463_BND_1--Yes0.0001INT
DIP2CVariants Count1--
10_523437_523544_DEL_1Yes--NAINT--
WDR37Variants Count1--
10_1164005_1164234_DEL_1Yes--NAINT--
DCAKDVariants Count1--
17_43129091_43129429_DEL_1Yes--NAINT--
CDH4Variants Count1--
20_60216779_60217071_DEL_1Yes--0.0003INT--
MOV10L1Variants Count1--
22_50585735_50585941_DEL_1Yes--NAINT--
DNA2Variants Count1--
10_70222778_70222779_BND_1Yes--NAINT--
SBF2Variants Count1--
11_10293916_10293917_BND_1Yes--NAINT--
ANO5Variants Count1--
11_22214883_22214884_BND_1Yes--NAEX--
CHPT1Variants Count1--
12_102107161_102107162_BND_1Yes--NAINT--
MYO5BVariants Count1--
18_47698458_47698459_BND_1Yes--NAINT--
PLCB1Variants Count1--
20_8414039_8414040_BND_1Yes--0.0005INT--
APOL1Variants Count1--
22_36651344_36651345_BND_1Yes--0.0015INT--
NOP14Variants Count1--
4_2941531_2941532_BND_1Yes--NAINT--
MRPS18AVariants Count1--
6_43655533_43655534_BND_1Yes--NAEX--
CNTNAP2Variants Count1--
7_147571596_147571597_BND_1Yes--NAINT--
DPP6Variants Count1--
7_153760690_153760691_BND_1Yes--NAINT--
B4GALT1Variants Count1--
9_33130549_33130550_BND_1Yes--NAINT--
IRS2Variants Count-1-
13_110418067_110419056_DEL_1-Yes-NAINTEsophagealCRC; Intestinal; Stomach
RAP1GAP2Variants Count-1-
17_2904534_2904873_DEL_1-Yes-NAINT--
ASIC2Variants Count-1-
17_31596693_31596759_DEL_1-Yes-NAINT--
MMP20Variants Count-1-
11_102472245_102472246_BND_1-Yes-NAINT--
KCNIP4Variants Count-1-
4_20933624_20933792_DEL_1-Yes-NAINT--
ADGRE4PVariants Count-1-
19_6987869_6987870_BND_1-Yes-0.0007INT--
DYNLRB1Variants Count-1-
20_33116231_33116232_BND_1-Yes-NAINT--
DTX2Variants Count-1-
7_76128462_76128463_BND_1-Yes-NAINT--
FIRREVariants Count-1-
X_130813255_130974327_INV_1-Yes-NATX--
BCCIPVariants Count--1
10_127513335_127513754_DUP_1--YesNAINT--
MGAT5Variants Count--1
2_134966704_134970130_DEL_1--YesNAINT--
COL18A1Variants Count--1
21_46930863_46930934_DEL_1--YesNAINT--
FOXP1Variants Count--1
3_71242366_71242638_DEL_1--YesNAINTBladder; Endometrial; Lung; Salivary Gland; Breast; Skin; Hepatobiliary; Prostate; GliomaEsophagogastric; Gastrointestinal; CRC
CTNND2Variants Count--1
5_11816774_11817546_DEL_1--YesNAINT--
FSTL4Variants Count--1
5_132918980_132924990_DEL_1--YesNAINT--
DLGAP2Variants Count--1
8_1047119_1047802_DEL_1--Yes0.0001INT--
PPP2R5AVariants Count--1
1_212472702_212472703_BND_1--YesNAINT--
BMS1P4Variants Count--1
10_75489277_75489278_BND_1--YesNAINT--
IFITM3Variants Count--1
11_308031_320995_INV_1--YesNATX--
RRAS2Variants Count--1
11_14348706_14348707_BND_1--YesNAINTBreast; Ovarian-
PRKRAVariants Count--1
2_179314967_179314968_BND_1--YesNAINT--
Structural variants with no overlapping genes
NAVariants Count603
17_41438043_41440177_DEL_1Yes--NANANANA
11_1961293_1961294_BND_1Yes--NANA
17_38679442_38679443_BND_1Yes--NANA
19_17459958_17459959_BND_1Yes--NANA
6_28863601_28863602_BND_1Yes--0.0002NA
6_28863898_28863899_BND_1Yes--0.0002NA
17_41436517_41442619_DUP_1--YesNANA
9_107816642_107816643_BND_1--YesNANA
9_107817348_107817349_BND_1--YesNANA
AF: 1000 G Phase 3 all population allele frequency; row in bold: gene previously described as associated with cancer; CRC: colorectal cancer; ID: identification; INT: intronic; EX: exonic; NA: not applicable; txStart-txEnd: Transcript Start-Transcript End; high risk to LS group (HRLS); intermediate risk to LS (IRLS); low risk to LS (LRLS).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Barbirou, M.; Miller, A.A.; Mezlini, A.; Bouhaouala-Zahar, B.; Tonellato, P.J. Variant Characterization of a Representative Large Pedigree Suggests “Variant Risk Clusters” Convey Varying Predisposition of Risk to Lynch Syndrome. Cancers 2023, 15, 4074. https://doi.org/10.3390/cancers15164074

AMA Style

Barbirou M, Miller AA, Mezlini A, Bouhaouala-Zahar B, Tonellato PJ. Variant Characterization of a Representative Large Pedigree Suggests “Variant Risk Clusters” Convey Varying Predisposition of Risk to Lynch Syndrome. Cancers. 2023; 15(16):4074. https://doi.org/10.3390/cancers15164074

Chicago/Turabian Style

Barbirou, Mouadh, Amanda A. Miller, Amel Mezlini, Balkiss Bouhaouala-Zahar, and Peter J. Tonellato. 2023. "Variant Characterization of a Representative Large Pedigree Suggests “Variant Risk Clusters” Convey Varying Predisposition of Risk to Lynch Syndrome" Cancers 15, no. 16: 4074. https://doi.org/10.3390/cancers15164074

APA Style

Barbirou, M., Miller, A. A., Mezlini, A., Bouhaouala-Zahar, B., & Tonellato, P. J. (2023). Variant Characterization of a Representative Large Pedigree Suggests “Variant Risk Clusters” Convey Varying Predisposition of Risk to Lynch Syndrome. Cancers, 15(16), 4074. https://doi.org/10.3390/cancers15164074

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop