Next Article in Journal
Challenges in Humoral Immune Response to Adeno-Associated Viruses Determination
Next Article in Special Issue
Current Research Trends in Glioblastoma: Focus on Receptor Tyrosine Kinases
Previous Article in Journal
Environmental Exposure to Bisphenol A Enhances Invasiveness in Papillary Thyroid Cancer
Previous Article in Special Issue
Synergistic Effects of the Combination of Alpelisib (PI3K Inhibitor) and Ribociclib (CDK4/6 Inhibitor) in Preclinical Colorectal Cancer Models
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Haplotype GWAS in Syndromic Familial Colorectal Cancer

1
Department of Molecular Medicine and Surgery, Karolinska Institutet, 17176 Stockholm, Sweden
2
Division of Surgery, Department of Clinical Science Intervention and Technology (CLINTEC), Karolinska Institutet, 17177 Stockholm, Sweden
3
Department of Upper Abdominal Diseases, Karolinska University Hospital, 17177 Stockholm, Sweden
4
Department of Neuroscience, Uppsala University, 75237 Uppsala, Sweden
5
Department of Clinical Genetics and Genomics, Karolinska University Hospital, 17176 Stockholm, Sweden
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Int. J. Mol. Sci. 2025, 26(2), 817; https://doi.org/10.3390/ijms26020817
Submission received: 23 December 2024 / Revised: 14 January 2025 / Accepted: 16 January 2025 / Published: 19 January 2025

Abstract

:
A previous genome-wide association study (GWAS) in colorectal cancer (CRC) patients with gastric and/or prostate cancer in their families suggested genetic loci with a shared risk for these three cancers. A second haplotype GWAS was undertaken in the same colorectal cancer patients and different controls with the aim of confirming the result and finding novel loci. The haplotype GWAS analysis involved 685 patients with colorectal cancer cases and 1642 healthy controls from Sweden. A logistic regression model was used with a sliding window haplotype approach. Whole-genome and exome sequencing datawere used to find candidate SNPs to be tested in a nested case-control study. In the analysis of 685 colorectal cancer cases and 1642 controls, all ten candidate loci from the previous study were confirmed. Fifty candidate loci were suggested with a p-value < 5 × 10−6 and odds ratios between 1.35–6.52. Two of the 50 loci, on 13q33.3 and 16q23.3, were the same as in the previous study. Whole-genome or exome data from 122 colorectal cancer patients was used to search for candidate variants in these 50 loci. A nested case-control study was performed to test genetic variants at 11 loci in a cohort of 827 familial colorectal cancer and a sub-cohort of 293 familial CRC cases with colorectal, gastric, and/or prostate cancer within their families and 1530 healthy controls. One SNP, rs115943733 on 10q11.21, reached statistical significance (OR = 3.26, p = 0.009). Seven SNPs in 4 loci had a higher OR in the smaller cohort compared to the larger study CRC cases. The results in this GWAS gave support for suggested loci with an increased shared risk of CRC, gastric, and/or prostate cancer. Further studies are needed to confirm the shared risk to be able to use this information in cancer prevention.

1. Introduction

Colorectal cancer is the third most common cancer and the second most common cause of cancer death worldwide [1]. Approximately 75% of patients diagnosed with CRC exhibit sporadic disease, showing no apparent signs of inheriting the disorder. In contrast, 10% to 30% of patients have a family history of CRC, indicating a potential hereditary component, shared environmental exposures, common risk factors among family members, or a combination thereof [2]. Pathogenic variants in high-penetrance genes have been identified as the underlying cause of inherited cancer risk in certain families predisposed to CRC. However, these variants are estimated to account for only around 5% of CRC cases overall [3]. A known CRC syndrome, Lynch syndrome, with a genetic predisposition to non-polyposis CRC and other cancers, is caused by germline disease-causing variants in the DNA mismatch repair (MMR) genes MLH1, MSH2, MSH6, PMS2, or EPCAM [4]. Familial Adenomatous Polyposis (FAP) is caused by pathogenic variants in APC or MUTYH (MAP) [4]. However, despite these advances, a significant proportion of familial cases remain genetically uncharacterized [5].
Today, colorectal cancer and many other cancer types and common diseases are suggested as complex diseases caused by a combination of predisposing genetic variants in low-risk genes and environmental factors [6]. Thus, instead of searching for single high-penetrant genes, Genome-Wide Association Studies (GWAS) have been used to identify common genetic loci associated with an increased risk of and other cancers, as well as several common disorders. The findings have been validated using studies of thousands of individuals [7,8]. Many GWAS for risk in CRC have been performed to date and have identified more than 205 independent risk associations [8]. Still, these cannot explain most of the familial increased CRC risk. Studies examining pleiotropy across multiple cancer-associated loci have been done to identify common mechanisms of cancer development and progression [9].
A new CRC syndrome was suggested based on previous findings in families with colorectal and other cancers, most importantly, gastric and prostate cancers [10]. Linkage analysis in families with colorectal, gastric, and prostate cancer was undertaken to study the pattern of inheritance in the underlying hypothetical syndrome [11]. This analysis did not suggest any high penetrant disease-associated locus; instead, a complex disease for this putative syndrome was suggested [11]. To test this hypothesis, a haplotype genome-wide association study (GWAS) was first conducted, involving CRC patients who had gastric and/or prostate cancer among their relatives. The study used healthy, unrelated Swedish twins as controls [12]. The results suggested 10 risk loci associated with increased risk of colorectal, possibly also gastric, and prostate cancer [12]. Since another cohort of controls was available, it was possible to test the hypothesis further in a new study using the same CRC cases, but with this smaller cohort of geographically better-matched control persons recruited at the same time as cases and genotyped with the same chip [10,13,14]. The aim of the new study was to replicate the ten loci found in the previous study and to search for additional new loci for the same suggested syndrome.

2. Results

2.1. Replication of the Loci Found in Previous Haplotype GWAS

A test utilizing the 10 loci found in the previous study was undertaken. The positions for each of the ten statistically significant haplotype loci were searched for among the results from the new analysis. Two identical haplotypes, on loci 1q32.2 and 3q29, were found (Table 1). The exact same haplotypes could not be identified for any of the other eight loci because the controls used in this study were genotyped with a different chip than controls in the previous study, thus leading to somewhat different haplotypes in the new analysis. However, statistically significant findings of haplotypes at the same loci were still found (Table 1). Nine of the ten loci were confirmed with ORs > 1 and statistical significance p < 0.005). One locus, 16q23.3, had an OR = 3.6 and p = 0.06 (Table 1). The odds ratio (OR) in this new study using few but better-matched controls was higher in four and lower in six of the tested loci as compared to the previous findings (Table 1).

2.2. Haplotype GWAS with Correct Cases and Better-Matched Controls

Next, a sliding window (1–25) haplotype GWAS was conducted using 685 CRC cases and 1642 healthy controls (Supplemental Tables S1–S23). As the cohort of controls was smaller than the one used in the previous study, loci reaching p < 5 × 10−6 were considered for follow-up in this study. Only one locus on chromosome 9p24.3 reached a p-value generally considered statically significant in GWAS (OR = 2.3, p = 2.41 × 10−8) [15]. A total of fifty-five haplotypes at 50 loci had p < 5 × 10−6 and ORs from 1.35 to 6.52 (Table 2). Thirty-six loci had 1–3 coding genes in the haplotype region, and many had non-coding genes, RNA genes, pseudogenes, or other regulatory elements between the first and last position at the locus. Since loci with other elements except coding genes could be equally important, a search for candidate SNPs was undertaken in all fifty-five haplotype regions. Two loci among the 55, one on 13q33.3 and one on 16q23.3, were found also in the first study [12]. The OR for locus 13q33.3 (MYO16) was higher in the present study (Table 2). The other locus found in both studies was 16q23.3 (CDH13). However, the CDH13 locus involved two different regions. The region identified in the previous study was 82871769–82899877 (Table 1), and the region in this study was 82673410–82691564 (Table 2). Both haplotypes were within the gene CDH13 and suggested two unique risk variants in this gene.

2.3. Selection of Candidate Variants for SNP Analysis

Next, an attempt was made to find gene variants in each of the 55 haplotypes at all 50 loci. Variants in each locus are candidates to act as causative of an increased risk at that locus. In total, 122 familial CRC cases with gastric and/or prostate cancer in their families were sequenced using WES or WGS, and the data was searched for candidate SNPs in all 55 haplotype regions. Altogether, 33 variants at 14 loci (2p23.1, 2q33.1, 4q31.1, 4q31.3, 4q35.1, 6q25.3, 6q27, 10q11.21, 11q14.2, 13q12.11, 13q12.3, 14q32.11, 18q21.2, and 20q13.33) were selected and set up for testing. The variants were genotyped in MALDI-TOF analysis. As some SNPs were not compatible with each other in the primer design, they were eliminated, and after further QC, 17 variants in 11 loci were subject to testing. The 11 loci were 2q33.1, 4q31.1, 4q31.3, 4q35.1, 6q27, 10q11.21, 11q14.2, 13q12.11, 13q12.3, 14q32.11, and 20q13.33, and the 17 SNPs were rs114046582, rs115872046, rs115943733, rs1213626528, rs139227237, rs140852957, rs141501417, rs2427307, rs34803482, rs34851370, rs35516773, rs358314, rs358326, rs5743708, rs61735304, rs7150480, and rs72716373.
Finally, a case-control study tested all 17 candidate genetic variants in 827 familial CRC cases and 1530 healthy control persons. Among the cases, 293 also had a family history of gastric and/or prostate cancer. None of the loci tested in the association analysis showed a statistically significant result, although OR was >1 for nine of the tested SNPs (Table 3). A second association study was performed using a sub-cohort of only the 293 familial CRC cases with colorectal, gastric, and/or prostate cancer within their families and the same 1530 controls. In this smaller cohort, one SNP, rs115943733 on 10q11.21 (TMEM72), reached statistical significance (OR = 3.26, p = 0.009) (Table 3). Seven SNPs in 4 loci had a higher OR as compared with the previous analysis of 827 familial CRC cases, giving some support to the hypothesis of a higher risk of cancer in the smaller cohort of CRC cases selected because of their family history also including gastric and/or prostate cancer (Table 3).

3. Discussion

This haplotype GWAS study on the hereditary risk of colorectal, gastric, and prostate cancer replicates and confirms ten previously found candidate loci and suggests 50 novel candidate loci. A previous GWAS from the group on familial CRC cases with gastric and/or prostate cancer in their families used healthy controls from the Swedish twin registry. That study suggested ten risk loci with OR 1.71–3.62 and p-values of 2.56 × 10−10–4.47 × 10−8 [12]. The present study used a control population from the same geographical region as the cases, hoping to confirm and improve the results with better-matched controls. Cases and controls were in this study genotyped with the same chip, while, in the previous study, the controls were genotyped with a different chip, which could affect the results. The number of the new controls was smaller, because this time, the focus was on the odds ratio rather than the p-value to compare the results. Because different chips were used for controls in this and the previous study, generated haplotypes could be different because exact haplotypes could not always be compared. Instead, the positions for the chromosomal loci were considered, regardless of the actual haplotypes. The use of different SNPs in a GWAS means that a locus might be missed, but new loci could also be detected. Thus, the lack of confirmation of results from a previous study when replication used different SNPs for analysis does not rule out previous findings. Similar to the previous study using haplotype analysis, ORs were much higher compared to what has been seen in single-variant GWASs. This is probably because haplotype GWAS identifies rare loci, while single-variant GWAS finds common loci, and rare loci will only be found if they are associated with a higher OR and risk. Which p-values will be used can be discussed. In this study, a less stringent p-value was used; it could be argued that this is not correct. However, the finding of all these genes in the loci certainly gives strength to the results.
A first comparison of the 10 loci identified in the previous study, with findings in this study, noted that all 10 were confirmed with statistically significant positive ORs and statistically significant results, except the locus on chromosome 16q23.3 and the gene CDH13, which was only borderline significant at the exact positions (Table 1). However, another haplotype region in the same gene was observed with OR = 6.35, p = 3.18 × 10−6, and the same locus/gene (Table 2). This means that two risk variants in the same gene were suggested from our combined results, one between base pair position 82871769 and 82899877 in the previous study (Table 1) and one between the positions 82673410 and 82691564 in this new study with other controls (Table 2). The ORs were different. In the first study, the OR was 3.60 for the first haplotype in CDH13, which was 1.54 for the same positions in this study. The other sub-locus in the CDH13 had, in this study, an OR of 6.35, suggesting that these two variants were possibly associated with different risks. However, it could also be explained by the fact that the two different control populations used different SNPs, different numbers, and came from different geographical regions. Once the targeting variant at these two sub-loci is identified, this will be explained. The other locus, confirmed among the best 10 loci in this study, was MYO16 on 13q33.3 (OR = 2.54, p = 1.86 × 10−6). Here, this locus had a higher OR, 2.54, vs 1.71 in the previous study. The haplotype region in this study (109796718–109897922) was bigger than the one found in the first study (109832287–109897922). One candidate variant in MYO16 was tested in the previous study and had a positive OR = 1.24, supporting an increased risk of CRC and possibly also gastric and/or prostate cancer. The finding of a positive association between a certain variant and risk does not prove that the variant is causative—but it is a candidate for further testing in bigger association studies. The genes involved in these ten loci and their possible role in CRC were previously discussed [12].
This study found one locus with a p-value generally considered statistically significant for GWAS on 9p24.3 (OR = 2.3, p = 2.41 × 10−8). No gene was suggested at this locus. We still searched for SNPs at all 50 loci, which reached a p-value < 5 × 10−6 for further study. The ORs observed in our haplotype GWAS were higher (ranging from 1.35 to 6.52) compared to typical GWAS findings, which mostly stay below OR = 2 [16,17,18,19,20]. This is the same as we have found also in previous haplotype GWAS [14]. In the 50 loci, 46 genes were found, almost all implicated in cancer already (Table 2). Many of the genes, E2F6, TLR2, SFRP2, PIK3R1, EYA4, THBS2, ELMO1, RELN, GAS1, FZD4, KLF12, TDP1, SCG5, CDH13, and TCF4, have been published in relation to cancer more than 50 times, and some have been published in hundreds or even thousands of publications in relation to cancer and/or CRC (Table 2). A detailed report of all putative genes involved is outside the scope of this paper. However, some of the most studied genes in relation to cancer, and in particular CRC, will be mentioned briefly. One paper found that RBM15, by regulating E2F2, boosted malignant processes in CRC [21]. Inflammation is a known risk factor in cancer, including CRC. Toll-like receptors (TLRs) are known players in inflammatory response. A low expression of TLR2 in normal mucosa suggested that normal mucosa might contribute to a systemic inflammatory response [22]. Abnormal methylation of secreted frizzled-related proteins (SFRPs), specifically SFRP2, has been associated with cancer risk, especially in hepatocellular and colorectal carcinoma [23]. The same gene, SFRP2, as well as another gene suggested in Table 1, EYA4, were mentioned in relation to CRC risk and the effect of microbiota composition in methylation [24]. The phosphatidylinositol 3-kinase signaling pathway is known to be involved in cancer, and PIK3R1 has shown association with risk in CRC [25]. Hub genes are known to be involved in signal transduction and metabolic pathways in cancer. The gene THBS2 was shown to be implicated in metastatic CRC [26]. ELMO1 was shown to mediate tumor progression in CRC [27]. Reelin, an extracellular matrix protein, is known to reduce the susceptibility to dextran sulfate sodium (DSS)-colitis. The gene has also been suggested to possibly protect against CRC by maintaining intestinal epithelial cell homeostasis and providing resistance against colon pathology [28]. The TCF family of genes is known to regulate Wnt/B-catenin target gene expression. The TCF7L1 gene was shown to regulate CRC cell migration by repressing GAS1 expression [29]. One study showed that overexpression of FZD4 in CRC modified wnt/B-catenin signaling in CRC cells [30]. Krüppel-like factors have been suggested in malignancy by regulating various cellular functions. One study of these functions found a dramatic downregulation of KLF genes, including KLF12, in CRC [31]. Top1/Tdp1 are mediators of a DNA damage repair pathway and can influence the risk of CRC [32]. Polygenic modeling identified 10 SNP in the SCL22A3, SCG5, GREM1, and STXBP5-AS1 genes in an Indonesian CRC GWAS [33]. CDH13 is known to be downregulated in various cancers, and CDH13 promotor methylation has been suggested to act in CRC initiation and progression [34]. Mutations in the Wnt components, specifically TCF4, were found in microsatellite-unstable colon cancers [35].
The aim of this study was to replicate the loci from our previous study and to possibly find more loci with an increased risk of CRC, gastric cancer, and prostate cancer. A search of putative candidate SNPs in these loci was done in the same way as for the ten loci in the previous study. The same sequenced data from 122 patients was used to find markers for testing. In total, 33 SNPs were found available for genotyping and, finally, 17 SNPs could be used in two association studies. The first used 827 familial CRC cases and 1530 controls, and the second used a sub-cohort of 293 familial CRC cases with a family history also including gastric and/or prostate cancer in close family members and the same controls. Only 11 of the 50 loci could be tested for one to four markers and only one variant was found to be statistically significant (OR = 3.26, p = 0.009). The first analysis of 827 familial CRC cases versus 1530 healthy controls found support with ORs > 1 for seven of the 11 tested loci. When the analysis was redone in the smaller cohort of 293 familial CRC selected for the colon-gastric-prostate familiality, six loci had OR > 1, with three of them having higher OR and one having statistically significant results. This is interpreted as a support of those loci. However, too few of the loci could be tested for specific gene variants and none is therefore ruled out. The fact that so many of the suggested 50 haplotype regions involved coding genes, and that most of them already had been implicated in cancer, including CRC, can also be seen as supportive of the regions suggested as risk loci in this study.
The major limitation of both the previous and this study was that too few patients were sequenced. Since almost all suggested haplotype regions (loci) had an estimated sample frequency below 10%, the chance of finding the putative haplotype target gene variants in only 122 patients was small. Besides, even if ORs in haplotype GWAS are higher compared to those in single-variant GWAS, the numbers of cases and controls were too small to give statistically significant results. Another limitation was that the same cases were used. However, after careful consideration, it was decided to undertake this study since the extra control persons were available, although other cases were not. Perhaps the choice of patients for sequencing was not the best either, since familial CRC cases were used, while in the first GWAS, mostly sporadic cases (with a family history of gastric and/or prostate cancer) were used. Still, the study could replicate the ten loci from the first study, finding new candidate loci with high OR, although not statistically significant in this smaller study. Thus, both this and the previous study found risk loci were suggested to harbor genetic variants associated with a risk of colon, and possibly also gastric and prostate, cancer. Moreover, most of the genes found in our studies were already implicated in various cancers. The current study suggested that geographically better-matched controls resulted in better risk stratification as the odds ratio for the respective loci was higher, even though the cohort was smaller, and the results did not reach statistical significance. The study can serve as a proof of principle to find candidate risk markers in loci from haplotype GWAS and demonstrate the importance of the selection of population and size of the experiment in numbers to be able to give statistically significant results.

4. Materials and Methods

4.1. Cases and Controls for GWAS

Cases were selected as a part of a multi-center study, the Colorectal Cancer Low-risk study, with newly diagnosed CRC patients from the middle of Sweden between 2004 and 2009 [10]. More than 90% were estimated to be of Swedish origin, and most of the remaining 10% had Finnish background. No information was obtained for controls who were assumed to represent a similar proportion at the time of recruitment 2004–2009. Written informed consent was taken from all the patients. The study was approved by the regional research ethics committees in Stockholm 2002 (Stockholms Regionala Etikprövningsnämnd) and Uppsala (Uppsalas Regionala Etikprövningsnämnd, Dnr: 02-489 and 03-114). Detailed information regarding cancer occurrences in the family comprising of first- and second-degree relatives and cousins was recorded. Based on family history and the pathology and molecular testing for MSI, known cases of FAP and Lynch syndrome were excluded from the study. The selection criteria for the patients to be included in this study was having at least one case of gastric or prostate cancer among close relatives. In total, 685 cases fulfilling these criteria were genotyped and included as cases in this study. Of these 685 CRC cases, 510 were sporadic and 175 were familial. The relatives of these 685 individual CRC cases had various cancers in different locations, with gastric cancer (523) as the most common malignancy, followed by prostate cancer (468) and other various types of cancer, not unique for this sub-cohort of cases. The details regarding the different malignancies among the relatives have been reported previously [12]. Controls were also selected from the Low-risk study consisting of 1642 individuals, of which 536 were spouses of the cases, and 1106 were healthy blood donors from the same geographical region.

4.2. Cases for Whole-Genome Sequencing (WGS)

To search for candidate variants in all candidate loci, 89 familial CRC cases with gastric or prostate cancer in their families were used for WGS. The cases were selected based on the maximum number of cases of gastric and/or prostate cancer within the families. The families were described in detail in the previous GWAS with the same cases and different controls [12].

4.3. Cases for Whole-Exome Sequencing (WES)

From a previous study on WES in CRC patients, 33 familial CRC cases with gastric and/or prostate cancer in their families were also included to search for candidate variants. Analysis of WES data to search for candidate variants was performed as described [36,37].

4.4. Cases and Controls for the Association Studies

For the case-control study of candidate variants, 827 familial CRC cases and 1530 healthy controls were used, as already described. Out of the cases, 691 were from the Low-risk Colorectal cancer study and 136 cases were recruited from the Department of Clinical Genetics at the Karolinska University Hospital, Stockholm [12]. Controls were 990 blood donors from the Stockholm region and 540 healthy spouses from the Low-risk study [14]. In the other case-control study of the same candidate variants, a sub-cohort was selected from the 827 in the first study. In total, 293 cases were selected because they had relatives with gastric and/or prostate cancer. Controls were the same as in the first case-control study.

4.5. Genotyping, Quality Control, and Haplotype GWAS

DNA was extracted from blood using the standard protocol in our lab from 2004–2009. The genotyping for both cases and controls was performed at the Center for Inherited Disease Research at Johns Hopkins University, US, using the Illumina Infinium® OncoArray-500K (Illumina, Inc., San Diego, CA, USA) [38]. The first quality control (QC) was done within the CORECT study [13], and a second QC was done at Karolinska Institutet [14]. For the haplotype GWAS analysis, to examine the association between one SNP or haplotype and cancer risk, a logistic regression model was employed. The computational program PLINK V1.07 was used for the analysis and for the calculation of corresponding OR, standard errors, and 95% confidence intervals (CI) [39]. The following parameters were applied while using PLINK v1.07: “hap-logistic” (haplotype logistic regression analysis), “hap-window 1–25” (sliding window sizes 1 to 25), and default settings, which included haplotype phasing with the E-M algorithm, omnibus association test, and minor haplotype frequency of 0.01. Because the number of control persons was smaller than in the previous study, less stringent criteria for p-value were used. To find out what p-value to use, a hypothetical GWAS using 200 healthy persons was selected from the blood donors and used in a haplotype GWAS where 100 were used as “cases” and 100 as “controls”. No p-value was found below 5 × 10−6; therefore, a p-value criterion of p < 5 × 10−6 was used for this study. Different sizes of windows were previously tested and windows with more than 25 SNPs rarely gave any results. Thus, a sliding window haplotype analysis of window sizes 1–25 was used. In total, more than eight million sliding windows were tested. In this analysis, each SNP was tested several times, involving 24 SNPs upstream and 24 SNPs downstream. No adjustments were made for gender or age.

4.6. Algorithm for Selection of Candidate SNPs Using the Sequencing Data

WGS or WES data from 122 CRC samples was available to search for candidate variants in the candidate risk loci. To be able to search for candidate variants in the sequencing data, GRCh37 base pair positions were converted to GRCh38, as the haplotype data was in genome built 37 and the sequencing data was in genome built 38 (Ensembl genome browser 110). The haplotype region was searched in the Ensembl genome browser to look for genes, non-coding regions, pseudogenes, and regulatory elements. Initially, each chromosome was sorted by base pair positions (smallest to largest). Filters applied were: Synonymous variants, variants without known or rare (<0.005), or common (>0.15) allele frequency were not used. Only single-nucleotide variants (SNV) could be selected for testing with MALDI-TOF (Matrix-Assisted Laser Desorption-Ionization-Time of Flight) in a MassARRAY Platform [40]. The database gnomAD (version v.3.1.2) and the SweGen Variant Frequency database were used for extracting allele frequencies for the variants.

4.7. WGS and WES

The Illumina TruSeq PCR-free kit was used for preparing the sequencing libraries, with an average coverage of 30×, according to the manufacturer’s protocol (Illumina, Inc., San Diego, CA, USA). To summarize, Covaris was used for the fragmentation of genomic DNA extracted from 89 peripheral blood samples, and the fragmented DNA was subjected to library preparation involving the end-repair method. This was followed by A-tailing and adaptor ligation. Illumina sequencer NovaSeq6000 was used for the sequencing. Sarek germline pipeline was used to analyze the data [41].

4.8. Association Study Using MALDI-TOF

The MALDI-TOF-based MassARRAY platform was used for SNP genotyping analysis. For genotyping, the Core facility at Translational Analysis in Molecular Medicine (TAMM) at the Karolinska University Hospital was utilized [40]. In short, the various steps included primer designing for the various SNPs using a software package from Agena (Agena Bioscience, San Diego, CA, USA). This was followed by PCR amplification of the respective SNP loci, SAP enzyme clean up, extension of the PCR amplicons, and fragment analysis on the Agena MassARRAY analyzer. For automated allele calling, Agena’s SpectroTyper software (https://www.pubcompare.ai/product/uCviCZIBPBHhf-iFVzf3/ (accessed on 14 January 2025)) was used, and validation was done using human DNA from the CEU population genotyped by the HapMap consortium (CEU panel). Positive and negative controls were used in all steps. To ensure the reproducibility of the assay, a few of the samples were repeated. For each individual SNP, association testing was performed separately. Manual calculation was performed for OR using the genotype count in cases and controls; OR > 1 was associated with an increased risk of CRC.

5. Conclusions

The findings gave further support to the hypothesis of shared risk between CRC, gastric cancer, and prostate cancer. The ten loci suggested in a previous GWAS were replicated, and 50 new loci, already associated with cancer, were suggested. The results imply potential common genetic pathways involving susceptibility loci across these cancer types. The findings gave further support to the hypothesis of shared risk between CRC, gastric cancer, and prostate cancer. There is still a need for follow-up with new studies to confirm this syndrome and to find other associated tumors. It will be important to find the absolute risks related to these loci before they can be considered for implementation in cancer prevention.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/1422-0067/26/2/817/s1.

Author Contributions

Conceptualization, A.L.; methodology, L.V., J.S.W., W.L. and A.L.; software, L.V. and W.L.; validation, L.V., J.S.W., W.L. and A.L.; formal analysis, L.V., J.S.W., W.L., V.S. and A.L.; investigation, L.V., J.S.W., W.L. and A.L.; resources, A.L.; data curation, L.V., W.L., V.S., T.S. and A.L.; writing—original draft preparation, L.V., J.S.W. and A.L.; writing—review and editing, L.V., J.S.W., W.L., V.S., T.S., M.L. and A.L.; visualization, L.V., J.S.W. and A.L.; supervision, M.L. and A.L.; project administration, A.L.; funding acquisition, A.L. All authors have read and agreed to the published version of the manuscript.

Funding

Open access funding provided by Karolinska Institute. This research was funded by grants from the Swedish Research Council, the Swedish Cancer Society, and the Cancer Research Funds of Radiumhemmet. Financial support was also provided by the regional agreement on medical training and clinical research (ALF) between the Stockholm County Council and Karolinska Institutet.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki. The study was approved by the regional research ethics committees in Stockholm 2002 (Stockholms Regionala Etikprövningsnämnd) and Uppsala (Uppsalas Regionala Etikprövningsnämnd), Dnr: 02-489 and 03-114.

Informed Consent Statement

Written informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Access to the data is controlled. Variants that fulfilled our selection criteria can be found in the Supplemental Tables. However, Swedish laws and regulations prohibit the release of individual and personally identifying data. Therefore, the whole dataset cannot be made publicly available. The data that support the findings of this study are available from the corresponding authors upon a reasonable request.

Acknowledgments

Colorectal Transdisciplinary Study (CORECT): The content of this manuscript does not necessarily reflect the views or policies of the National Cancer Institute or any of the collaborating centers in the CORECT Consortium, nor does mention of trade names, commercial products, or organizations imply endorsement by the US Government or the CORECT Consortium. We acknowledge all patients, and Richard Rosenquist, Hassan Foroughi Asl, Jessada Thutkawkorapin and Hafdis T. Helgadottir for their valuable contributions. The authors acknowledge support from the National Genomics Infrastructure in Stockholm funded by Science for Life Laboratory, the Knut and Alice Wallenberg Foundation, the Swedish Research Council, and SNIC/Uppsala Multidisciplinary Center for Advanced Computational Science for assistance with massively parallel sequencing and access to the UPPMAX computational infrastructure. We thank the Translational Analysis in Molecular Medicine (TAMM) at the Karolinska University Hospital for their support with the SNP genotyping work.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
CRCColorectal Cancer
GWASGenome-Wide Association Study
SNVSingle-Nucleotide Variant
WGSWhole-Genome Sequencing

References

  1. Sung, H.; Ferlay, J.; Siegel, R.L.; Laversanne, M.; Soerjomataram, I.; Jemal, A.; Bray, F. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 2021, 71, 209–249. [Google Scholar] [CrossRef] [PubMed]
  2. Kanth, P.; Grimmett, J.; Champine, M.; Burt, R.; Samadder, N.J. Hereditary colorectal polyposis and cancer syndromes: A primer on diagnosis and management. Am. J. Gastroenterol. 2017, 112, 1509–1525. [Google Scholar] [CrossRef] [PubMed]
  3. Jackson, L.; Weedon, M.N.; Green, H.D.; Mallabar-Rimmer, B.; Harrison, J.W.; Wood, A.R.; Ruth, K.S.; Tyrrell, J.; Wright, C.F. Influence of family history on penetrance of hereditary cancers in a population setting. eClinicalMedicine 2023, 64, 102159. [Google Scholar] [CrossRef] [PubMed]
  4. Valle, L.; de Voer, R.M.; Goldberg, Y.; Sjursen, W.; Försti, A.; Ruiz-Ponte, C.; Caldés, T.; Garré, P.; Olsen, M.F.; Nordling, M.; et al. Update on genetic predisposition to colorectal cancer and polyposis. Mol. Asp. Med. 2019, 69, 10–26. [Google Scholar] [CrossRef]
  5. Olkinuora, A.P.; Peltomäki, P.T.; Aaltonen, L.A.; Rajamäki, K. From APC to the genetics of hereditary and familial colon cancer syndromes. Hum. Mol. Genet. 2021, 30, R206–R224. [Google Scholar] [CrossRef]
  6. Lindblom, A.; Zhou, X.L.; Liu, T.; Liljegren, A.; Skoglund, J.; Djureinovic, T. Colorectal cancer as a complex disease: Defining at-risk subjects in the general population—A preventive strategy. Expert. Rev. Anticancer Ther. 2004, 4, 377–385. [Google Scholar] [CrossRef]
  7. Tenesa, A.; Dunlop, M. New insights into the aetiology of colorectal cancer from genome-wide association studies. Nat. Rev. Genet. 2009, 10, 353–358. [Google Scholar] [CrossRef]
  8. Fernandez-Rozadilla, C.; Timofeeva, M.; Chen, Z.; Law, P.; Thomas, M.; Schmit, S.; Díez-Obrero, V.; Hsu, L.; Fernandez-Tajes, J.; Palles, C.; et al. Deciphering colorectal cancer genetics through multi-omic analysis of 100,204 cases and 154,587 controls of European and East Asian ancestries. Nat. Genet. 2023, 55, 89–99. [Google Scholar] [CrossRef]
  9. Fehringer, G.; Kraft, P.; Pharoah, P.D.; Eeles, R.A.; Chatterjee, N.; Schumacher, F.R.; Schildkraut, J.M.; Lindström, S.; Brennan, P.; Bickeböller, H.; et al. Cross-cancer genome-wide analysis of lung, ovary, breast, prostate, and colorectal cancer reveals novel pleiotropic associations. Cancer Res. 2016, 76, 5103–5114. [Google Scholar] [CrossRef]
  10. Forsberg, A.; Keranen, A.; von Holst, S.; Picelli, S.; Papadogiannakis, N.; Ghazi, S.; Lindblom, A. Defining new colorectal cancer syndromes in a population-based cohort of the disease. Anticancer. Res. 2017, 37, 1831–1835. [Google Scholar] [CrossRef]
  11. Wallander, K.; Liu, W.; von Holst, S.; Thutkawkorapin, J.; Kontham, V.; Forsberg, A.; Lindblom, A.; Lagerstedt-Robinson, K. Genetic analyses supporting colorectal, gastric, and prostate cancer syndromes. Genes Chromosomes Cancer 2019, 58, 775–782. [Google Scholar] [CrossRef] [PubMed]
  12. Winnberg, J.S.; Vermani, L.; Liu, W.; Soller, V.; Thutkawkorapin, J.; Lindblad, M.; Lindblom, A. A genome-wide association study in Swedish colorectal cancer patients with gastric- and prostate cancer in relatives. Hered. Cancer Clin. Pract. 2024, 22, 25. [Google Scholar] [CrossRef] [PubMed]
  13. Schmit, S.L.; Edlund, C.K.; Schumacher, F.R.; Gong, J.; A Harrison, T.; Huyghe, J.R.; Qu, C.; Melas, M.; Van Den Berg, D.J.; Wang, H.; et al. Novel common genetic susceptibility loci for colorectal cancer. J. Natl. Cancer Inst. 2019, 111, 146–157. [Google Scholar] [CrossRef] [PubMed]
  14. Liu, W.; Mahdessian, H.; Helgadottir, H.; Zhou, X.; Thutkawkorapin, J.; Jiao, X. Colorectal cancer risk susceptibility loci in a Swedish population. Mol. Carcinog. 2021, 61, 288–300. [Google Scholar] [CrossRef]
  15. Dudbridge, F.; Gusnanto, A. Estimation of significance thresholds for genomewide association scans. Genet. Epidemiol. 2008, 32, 227–234. [Google Scholar] [CrossRef]
  16. Michailidou, K.; Lindstrom, S.; Dennis, J.; Beesley, J.; Hui, S.; Kar, S.; Lemacon, A.; Soucy, P.; Glubb, D.; Rostamianfar, A.; et al. Association analysis identifies 65 new breast cancer risk loci. Nature 2017, 551, 924. [Google Scholar] [CrossRef]
  17. Law, P.J.; Timofeeva, M.; Fernandez-Rozadilla, C.; Broderick, P.; Studd, J.; Fernandez-Tajes, J.; Farrington, S.; Svinti, V.; Palles, C.; Orlando, G.; et al. Association analyses identify 31 new risk loci for colorectal cancer susceptibility. Nat. Commun. 2019, 10, 2154. [Google Scholar] [CrossRef]
  18. Dadaev, T.; Saunders, E.J.; Newcombe, P.J.; Anokian, E.; Leongamornlert, D.A.; Brook, M.N.; Cieza-Borrella, C.; Mijuskovic, M.; Wakerell, S.; Al Olama, A.A.; et al. Fine-mapping of prostate cancer susceptibility loci in a large meta-analysis identifies candidate causal variants. Nat. Commun. 2018, 9, 2256. [Google Scholar] [CrossRef]
  19. Kuchenbaecker, K.B.; Ramus, S.J.; Tyrer, J.; Lee, A.; Shen, H.C.; Beesley, J.; Lawrenson, K.; McGuffog, L.; Healey, S.; Lee, J.M.; et al. Identification of six new susceptibility loci for invasive epithelial ovarian cancer. Nat. Genet. 2015, 47, 164–171. [Google Scholar] [CrossRef]
  20. Shrine, N.; Guyatt, A.L.; Erzurumluoglu, A.M.; Jackson, V.E.; Hobbs, B.D.; Melbourne, C.A.; Batini, C.; Fawcett, K.A.; Song, K.; Sakornsakolpat, P.; et al. New genetic signals for lung function highlight pathways and chronic obstructive pulmonary disease associations across multiple ancestries. Nat. Genet. 2019, 51, 481–493. [Google Scholar] [CrossRef]
  21. Zhang, H.; Li, Y.; Zhou, Y.; Xu, Q.; Liao, B.; Qiu, X.; Liu, J. RNA methylase RBM15 facilitates malignant progression of colorectal cancer through regulating E2F2 in an m6A modification-dependent manner. J. Biochem. Mol. Toxicol. 2024, 38, e70014. [Google Scholar] [CrossRef] [PubMed]
  22. Paarnio, K.; Väyrynen, J.P.; A Väyrynen, S.; Kantola, T.; Karhu, T.; Tervahartiala, T.; Klintrup, K.; Sorsa, T.; Salo, T.; Mäkelä, J.; et al. TLR2 and TLR4 in colorectal cancer: Relationship to tumor necrosis and markers of systemic inflammation. Neoplasma 2022, 69, 1418–1424. [Google Scholar] [CrossRef] [PubMed]
  23. Yu, J.; Xie, Y.; Li, M.; Zhou, F.; Zhong, Z.; Liu, Y.; Wang, F.; Qi, J. Association between SFRP promoter hypermethylation and different types of cancer: A systematic review and meta-analysis. Oncol. Lett. 2019, 18, 3481–3492. [Google Scholar] [CrossRef] [PubMed]
  24. Gutierrez-Angulo, M.; Ayala-Madrigal, M.L.; Moreno-Ortiz, J.M.; Peregrina-Sandoval, J.; Garcia-Ayala, F.D. Microbiota composition and its impact on DNA methylation in colorectal cancer. Front. Genet. 2023, 14, 1037406. [Google Scholar] [CrossRef]
  25. Li, L.; Plummer, S.J.; Thompson, C.L.; Tucker, T.C.; Casey, G. Association between phosphatidylinositol 3-kinase regulatory subunit p85alpha Met326Ile genetic polymorphism and colon cancer risk. Clin. Cancer Res. 2008, 14, 633–637. [Google Scholar] [CrossRef]
  26. Ershov, P.; Poyarkov, S.; Konstantinova, Y.; Veselovsky, E.; Makarova, A. Transcriptomic signatures in colorectal cancer progression. Curr. Mol. Med. 2023, 23, 239–249. [Google Scholar] [CrossRef]
  27. Park, Y.-L.; Cho, S.-B.; Park, S.-Y.; Oh, H.-H.; Myung, E.; Im, C.-M.; Son, S.; Kim, S.; Cho, S.-Y.; Chung, M.-W.; et al. Engulfment and Cell Motility 1 (ELMO1) regulates tumor cell behavior and predicts prognosis in colorectal cancer. Anticancer Res. 2022, 42, 5343–5355. [Google Scholar] [CrossRef]
  28. Carvajal, A.E.; Serrano-Morales, J.M.; Vázquez-Carretero, M.D.; García-Miranda, P.; Calonge, M.L.; Peral, M.J.; Ilundain, A.A. Reelin protects from colon pathology by maintaining the intestinal barrier integrity and repressing tumorigenic genes. Biochim. Biophys. Acta Mol. Basis Dis. 2017, 1863, 2126–2134. [Google Scholar] [CrossRef]
  29. King, C.M.; Ding, W.; Eshelman, M.A.; Yochum, G.S. TCF7L1 regulates colorectal cancer cell migration by repressing GAS1 expression. Sci. Rep. 2024, 14, 12477. [Google Scholar] [CrossRef]
  30. Zhang, G.; Liu, Z.; Zhong, J.; Lin, L. Circ-ACAP2 facilitates the progression of colorectal cancer through mediating miR-143-3p/FZD4 axis. Eur. J. Clin. Investig. 2021, 51, e13607. [Google Scholar] [CrossRef]
  31. Huang, Z.; He, H.; Qiu, F.; Qian, H. Expression and prognosis value of the KLF family members in colorectal cancer. J. Oncol. 2022, 2022, 6571272. [Google Scholar] [CrossRef] [PubMed]
  32. Zhang, Y.; Li, Y.; Sun, C.; Chen, X.; Han, L.; Wang, T.; Liu, J.; Chen, X.; Zhao, D. Effect of Pterostilbene, a natural derivative of resveratrol, in the treatment of colorectal cancer through Top1/Tdp1-mediated DNA repair pathway. Cancers 2021, 13, 4002. [Google Scholar] [CrossRef] [PubMed]
  33. Yusuf, I.; Pardamean, B.; Baurley, J.W.; Budiarto, A.; Miskad, U.A.; Lusikooy, R.E.; Arsyad, A.; Irwan, A.; Mathew, G.; Suriapranata, I.; et al. Genetic risk factors for colorectal cancer in multiethnic Indonesians. Sci. Rep. 2021, 11, 9988. [Google Scholar] [CrossRef] [PubMed]
  34. Ye, M.; Huang, T.; Li, J.; Zhou, C.; Yang, P.; Ni, C.; Chen, S. Role of CDH13 promoter methylation in the carcinogenesis, progression, and prognosis of colorectal cancer: A systematic meta-analysis under PRISMA guidelines. Medicine 2017, 96, e5956. [Google Scholar] [CrossRef]
  35. Segditsas, S.; Tomlinson, I. Colorectal cancer and genetic alterations in the Wnt pathway. Oncogene 2006, 25, 7531–7537. [Google Scholar] [CrossRef]
  36. Helgadottir, H.T.; Thutkawkorapin, J.; Rohlin, A.; Nordling, M.; Lagerstedt-Robinson, K.; Lindblom, A. Identification of known and novel familial cancer genes in Swedish colorectal cancer families. Int. J. Cancer 2021, 149, 627–634. [Google Scholar] [CrossRef]
  37. DePristo, M.A.; Banks, E.; Poplin, R.; Garimella, K.V.; Maguire, J.R.; Hartl, C.; Philippakis, A.A.; Del Angel, G.; Rivas, M.A.; Hanna, M.; et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 2011, 43, 491–498. [Google Scholar] [CrossRef]
  38. Illumina. Infinium OncoArray-500K [Internet]. Available online: http://www.illumina.com/products/by-type/microarray-kits/infinium-oncoarray-500k.html (accessed on 10 November 2024).
  39. Purcell, S.; Neale, B.; Todd-Brown, K.; Thomas, L.; Ferreira, M.A.R.; Bender, D.; Maller, J.; Sklar, P.; de Bakker, P.I.W.; Daly, M.J.; et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 2007, 81, 559–575. [Google Scholar] [CrossRef]
  40. Medicine TAiM. SNP Genotyping Service with Agena, Based on MALDI-TOF Analysis 2023. Available online: https://www.maf.ki.se/snp-genotyping-agena/ (accessed on 20 November 2024).
  41. Garcia, M.; Juhos, S.; Larsson, M.; Olason, P.I.; Martin, M.; Eisfeldt, J.; DiLorenzo, S.; Sandgren, J.; De Ståhl, T.D.; Ewels, P.; et al. Sarek: A portable workflow for whole-genome sequencing analysis of germline and somatic variants. F1000Research 2020, 9, 63. [Google Scholar] [CrossRef]
Table 1. Comparison of ten loci found in previous GWAS study [12] (first analysis) with current GWAS (second analysis).
Table 1. Comparison of ten loci found in previous GWAS study [12] (first analysis) with current GWAS (second analysis).
LocusFirst Analysis
BP1–BP2 (GRCh37)
HFORp-ValueGenesSecond Analysis
BP1–BP2 (GRCh37)
HFORp-Value
1q32.2208968409–2090836090.032.083.17 × 10−8No gene208968409–2090836090.013.350.0002
3q29195750742–1959732440.022.992.32 × 10−8TFRC, SLC51A, ZDHHC19, PCYT1A195750742–1959732440.022.140.004
4q35.1185088648–1852528180.012.841.30 × 10−8ENPP6185170812–1852491470.071.460.001
4q26119506139–1198351480.013.621.59 × 10−8METTL14, SEC24D, SYNPO2119552849–1198095720.012.280.00001
4p15.3120852244–211120460.013.252.72 × 10−8KCNIP420864229–210008670.014.074.00 × 10−4
8p23.111236975–113558210.013.044.47 × 10−8FAM167A, BLK11303011–113615520.012.580.0004
13q33.3109832287–1098979220.111.719.20 × 10−9MYO16109796718–1098979220.032.5421.86 × 10−6
13q13.337374156–374606480.071.864.15 × 10−8RFXAP, SMAD937366006–374881310.052.058.00 × 10−6
16q23.382871769–828998770.013.601.38 × 10−8CDH1382866767–829125710.021.540.06
22q11.2119872009–199301210.032.302.56 × 10−10TXNRD2, COMT19889825–199340250.091.549.00 × 10−5
HF = Haplotype frequency in samples, OR = Odds ratio. BP1–BP2, first and last position in the haplotype; GRCh37, build 37 of the “human genome”.
Table 2. The 55 haplotypes in 50 loci suggested from haplotype GWAS.
Table 2. The 55 haplotypes in 50 loci suggested from haplotype GWAS.
LocusBP1–BP2 (GRCh 37)HFORp-ValueGene* PubMed Articles
2p25.25593978–57037840.023.43.86 × 10−6no gene
2p25.111503455–116527950.071.823.09 × 10−6E2F6, GREB1E2F6:153
2q33.1202688907–2028397680.023.453.64 × 10−7CDK1514
2q36.3226183385–2264477350.023.234.32 × 10−6NYAP21
2q37.3241894333–2419576770.042.232.74 × 10−6SNED17
4p1440625135–406774620.022.44.31 × 10−6RBM4746
4q13.163992411–641748760.032.713.83 × 10−6no gene
4q13.376049479–760838200.022.645.30 × 10−7no gene
4q21.2282401332–824754760.013.552.60 × 10−6RASGEF1B5
4q31.1141234930–1413721580.032.192.81 × 10−6SCOC, CLGN, MGAT4DSCOC:16
4q31.3154605745–1547870900.023.023.97 × 10−6TLR2, RNF175, SFRP2TLR2:1675; SFRP2:575
4q35.1183942308–1842156750.015.33.27 × 10−6WWC225
5q13.167529191–675536360.023.381.05 × 10−6PIK3R1643
6p21.145705079–457939720.023.171.09 × 10−6no gene
6q23.2133702841–1337415580.023.294.54 × 10−6EYA476
6q25.3158732602–1589734750.042.038.93 × 10−7TMEM181, TULP4TMEM181:2
6q27rs2093524-rs109454050.014.022.31 × 10−6THBS2258
7p14.236210485–362522930.024.021.62 × 10−6EEPD119
7p14.137298800–373825200.013.841.90 × 10−6ELMO181
7q22.1103119863–1031304030.032.723.86 ×10−6RELN167
7q35146559474–1467674040.015.413.08 × 10−7CNTNAP2
8p23.25430697–55051880.023.31.84 × 10−6no gene
8q24.22131997225–1321147620.022.594.11 × 10−6no gene
9p24.31124087–11994480.052.32.41 × 10−8no gene
9p24.31876496–19808190.023.377.11 × 10−7no gene
9q21.3182433182–824331820.611.382.20 × 10−6no gene
9q21.3182445976–824463940.541.353.54 × 10−6no gene
9q21.3389168700–893369870.013.929.69 × 10−7no gene
9q21.3389481996–895749020.051.824.99 × 10−6GAS1132
9q21.3389669067–898212190.071.743.83 × 10−6LINC028721
9q34.2136822827–1368669250.111.751.51 × 10−6VAV2178
10q11.2145270373–454511780.023.331.33 × 10−6TMEM72
10q26.3132676113–1327425850.014.772.33 × 10−6no gene
11q14.286657520–868366480.014.341.51 × 10−6FZD4, TMEM135, PRSS23FZD4:147; TMEM135:1
12q24.32128767084–1289167190.042.463.95 × 10−6TMEM132C8
13q12.1120407151–206862720.032.529.14 × 10−7ZMYM5, ZMYM2ZMYM5:3; ZMYM2:58
13q12.329989466–300756860.091.771.92 × 10−6MTUS29
13q22.174316318–743476730.071.742.29 × 10−6KLF1298
13q33.3109796718–1098979220.032.541.86 × 10−6MYO1612
13q33.3109886427–1099721820.042.034.22 × 10−6no gene
14q31.385180203–853776590.022.582.33 × 10−6no gene
14q32.1190422664–906217700.042.43.08 × 10−6TDP1, KCNK13TDP1:219
15q13.332962642–329760550.013.524.54 × 10−6SCG565
16p13.35420304–55254900.013.943.84 × 10−6RBFOX1
16p13.28120348–82779320.014.33.60 × 10−6no gene
16q23.382673410–826915640.016.353.18 × 10−6CDH13308
17p1214942734–150576910.042.194.31 × 10−6no gene
17q1231790250–318882050.016.523.40 × 10−6ASIC229
17q21.3246348384–463555500.022.762.46 × 10−6SKAP140
17q25.377925723–779582540.013.994.41 × 10−6TBC1D1611
18q21.253179419–533799920.032.592.53 × 10−7TCF41595
19q1220411543–206633140.022.894.94 × 10−6no gene
20q13.3360966686–609666860.711.472.84 × 10−7CABLES28
21q21.224419862–245556710.022.863.69 × 10−6no gene
22q12.231392606–314389310.032.414.97 × 10−6no gene
BP1–BP2, first and last position in the haplotype, GRCh37, build 37 of the “human genome”; HF, haplotype frequency in the population; * PubMed articles, the number of publications resulting from a PubMed search using “cancer and gene”.
Table 3. Candidate SNPs tested in association analysis of all familial CRC (case-control study) and a sub-cohort of families with colorectal, gastric, and prostate cancer (case-control study).
Table 3. Candidate SNPs tested in association analysis of all familial CRC (case-control study) and a sub-cohort of families with colorectal, gastric, and prostate cancer (case-control study).
LocusSNPPosition (GRCh37)GeneTypeRef
Allele
Alt
Allele
No of
Cases (827)
No
Ctrls
(1530)
Odds
Ratio
p-ValueNo of
Cases
(293)
Odds
Ratio
p-Value
2q33.1rs34851370201835676CDK15missenseCT81615241.31ns2931.44ns
4q31.1rs358314140381160SCOC3’UTRCT73013411.09ns2631.009ns
rs358326140388548CLGN3’UTRAG81715251.01ns2920.86ns
rs72716373140385611SCOC3’UTRTC81915160.82ns2930.86ns
rs114046582140398927CLGNmissenseGA80815120.87ns2901.08ns
4q31.3rs139227237153703557TLR2missenseTC81915270.95ns2931.20ns
rs34803482153748734RNF175missenseCG80514341.21ns2891.23ns
rs115872046153723390RNF175missenseGA81615261.10ns2931.43ns
rs5743708153705165TLR2missenseGA81915250.97ns2931.16ns
4q35.1rs141501417 183260972 WWC2missenseGA81415230.77ns2920.53ns
6q27rs140852957169221505THBS2missenseGA81815220.97ns2930.87ns
10q11.21rs11594373344936215TMEM723’UTRGC81715261.88ns2933.260.009
11q14.2rs6173530486954989FZD4missenseGA81815221.50ns2921.19ns
13q12.11rs3551677319993526ZMYM2missenseGC81915221.51ns2930.32ns
13q12.3rs121362652829505477MTUS23’UTRGT81915230ns2930ns
14q32.11rs715048090043536TDP13’UTRTC81315271.23ns2931.16ns
20q13.33rs242730762391630CABLES2IntronGA81915180.710.00012930.670.002
SNP, single nucleotide polymorphism, Position, the exact base pair for the SNP, Positions (GRCh37), build 37 of the “human genome”, Type, type of gene variant, Ref, reference allele, Alt, alternative allele.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Vermani, L.; Samola Winnberg, J.; Liu, W.; Soller, V.; Sjödin, T.; Lindblad, M.; Lindblom, A. A Haplotype GWAS in Syndromic Familial Colorectal Cancer. Int. J. Mol. Sci. 2025, 26, 817. https://doi.org/10.3390/ijms26020817

AMA Style

Vermani L, Samola Winnberg J, Liu W, Soller V, Sjödin T, Lindblad M, Lindblom A. A Haplotype GWAS in Syndromic Familial Colorectal Cancer. International Journal of Molecular Sciences. 2025; 26(2):817. https://doi.org/10.3390/ijms26020817

Chicago/Turabian Style

Vermani, Litika, Johanna Samola Winnberg, Wen Liu, Veronika Soller, Tilde Sjödin, Mats Lindblad, and Annika Lindblom. 2025. "A Haplotype GWAS in Syndromic Familial Colorectal Cancer" International Journal of Molecular Sciences 26, no. 2: 817. https://doi.org/10.3390/ijms26020817

APA Style

Vermani, L., Samola Winnberg, J., Liu, W., Soller, V., Sjödin, T., Lindblad, M., & Lindblom, A. (2025). A Haplotype GWAS in Syndromic Familial Colorectal Cancer. International Journal of Molecular Sciences, 26(2), 817. https://doi.org/10.3390/ijms26020817

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop