Next Article in Journal
Genetic Effects of GA-Responsive Dwarfing Gene Rht13 on Plant Height, Peduncle Length, Internodal Length and Grain Yield of Wheat under Drought Stress
Next Article in Special Issue
Evaluation of Library Preparation Workflows and Applications to Different Sample Types Using the PowerSeq® 46GY System with Massively Parallel Sequencing
Previous Article in Journal
DNA Barcoding, Phylogenetic Analysis and Secondary Structure Predictions of Nepenthes ampullaria, Nepenthes gracilis and Nepenthes rafflesiana
Previous Article in Special Issue
NIPAT as Non-Invasive Prenatal Paternity Testing Using a Panel of 861 SNVs
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Association between Variants in the OCA2-HERC2 Region and Blue Eye Colour in HERC2 rs12913832 AA and AG Individuals

by
Nina Mjølsnes Salvo
1,*,
Jeppe Dyrberg Andersen
2,
Kirstin Janssen
1,
Olivia Luxford Meyer
2,
Thomas Berg
1,
Claus Børsting
2 and
Gunn-Hege Olsen
1
1
Centre for Forensic Genetics, Department of Medical Biology, Faculty of Health Sciences, UiT The Arctic University of Norway, 9037 Tromsø, Norway
2
Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, 2100 Copenhagen, Denmark
*
Author to whom correspondence should be addressed.
Genes 2023, 14(3), 698; https://doi.org/10.3390/genes14030698
Submission received: 9 February 2023 / Revised: 1 March 2023 / Accepted: 9 March 2023 / Published: 11 March 2023
(This article belongs to the Special Issue State-of-the-Art in Forensic Genetics)

Abstract

:
The OCA2-HERC2 region is strongly associated with human pigmentation, especially eye colour. The HERC2 SNP rs12913832 is currently the best-known predictor for blue and brown eye colour. However, in a previous study we found that 43 of 166 Norwegians with the brown eye colour genotype rs12913832:AA or AG, did not have the expected brown eye colour. In this study, we carried out massively parallel sequencing of a ~500 kbp HERC2-OCA2 region in 94 rs12913832:AA and AG Norwegians (43 blue-eyed and 51 brown-eyed) to search for novel blue eye colour variants. The new candidate variants were subsequently typed in a Norwegian biobank population (total n = 519) for population specific association analysis. We identified five new variants, rs74409036:A, rs78544415:T, rs72714116:T, rs191109490:C and rs551217952:C, to be the most promising candidates for explaining blue eye colour in individuals with the rs12913832:AA and AG genotype. Additionally, we confirmed the association of the missense variants rs74653330:T and rs121918166:T with blue eye colour, and observed lighter skin colour in rs74653330:T individuals. In total, 37 (86%) of the 43 blue-eyed rs12913832:AA and AG Norwegians could potentially be explained by these seven variants, and we suggest including them in future prediction models.

1. Introduction

Many pigmentation genes have been extensively studied to explain human eye colour variation [1]. Oculocutaneous albinism type 2 (OCA2) and its neighbouring gene the HECT domain and RCC1-like domain 2 (HERC2) are of special interest because of their strong genetic influence on human pigmentation, especially eye colour variation [1,2]. OCA2 expression is regulated by the intronic SNP rs12913832, which is situated in a conserved enhancer region in HERC2 [3,4]. The ancestral A-allele in rs12913832 allows transcription factors to modulate long-range chromatin looping that leads to contact between the OCA2 promotor and the enhancer. This enhances OCA2 expression and thereby, melanin production. In contrast, the derived G-allele reduces chromatin looping and thereby, OCA2 expression and melanin production [3]. European populations have the largest variation in eye colour, and recent selection for the G-allele in Europeans has been documented [5]. The strong association between genotypes of rs12913832 and blue and brown eye colour allows accurate eye colour prediction, which has an important application in forensic genetics [4,6,7]. According to the dominant hypothesis, the genotypes rs12913832:AA or AG lead to brown eye colour, whereas the genotype rs12913832:GG is found in individuals with blue eye colour. This is correct for most individuals, but discrepancies in European populations are documented, where individuals with the genotype rs12913832:AA or AG have blue eye colour, and individuals with the genotype rs12913832:GG have brown eye colour [8,9,10]. Other polymorphisms in OCA2, both in regulatory and coding regions, have shown associations with eye colour, albeit with minor additive effects [11,12]. The missense OCA2 SNP rs1800407 was included in the IrisPlex prediction tool and improved the prediction accuracy of eye colour in a European population [6,13,14], whereas the missense variants rs74653330 and rs121918166 have been suggested to explain blue eye colour in Scandinavians [8]. Additionally, rs74653330 and rs121918166 are found to be associated with human skin colour variation [8,15]. In a previous study, we showed that the eye colour prediction accuracy in a Norwegian study population increased when rs74653330 and rs121918166 were included in the prediction model [16]. However, the eye colour of some blue-eyed Norwegians were still not accurately predicted, suggesting that other variants in or around OCA2 could affect eye colour formation in individuals with the rs12913832:AA and AG genotypes. Therefore, in this study, we sequenced the OCA2-HERC2 region in a Norwegian biobank population in order to find new candidate variants that may explain blue eye colour in individuals with the rs12913832:AA and AG genotype.

2. Materials and Methods

2.1. Study Population and Selection of Individuals for Genotyping

A Norwegian biobank population of 540 volunteers (presumably unrelated) residing in northern Norway from 2015 to 2017, was used to search for candidate variants that may explain blue eye colour [10]. The OCA2-HERC2 region was sequenced in a selected study cohort of 94 individuals, previously genotyped as HERC2 rs12913832 AA or AG [10]. For the sequenced study cohort, 43 individuals were selected because they did not follow the dominant hypothesis and had blue eye colour (study group, Table 1). Additionally, 41 of the 43 individuals were previously predicted incorrectly to have brown eye colour using the IrisPlex model [10,13] (Table S1). The remaining 51 individuals had the expected brown eye colour phenotype and were used as controls (control group, Table 1). The new candidate variants were subsequently typed in the remaining Norwegian samples (n = 446). All samples were collected with informed consent and subsequently anonymised. The project was approved by the Faculty of Health Sciences, UiT-The Arctic University of Norway (reference number 2021/2034).

2.2. Eye Colour Categorisation and Quantitative Eye Colour Measurements

High resolution photographs were taken of the participants’ eyes, and categorisation of eye colours was performed in the recent study from Salvo et al. [10]. In short, nine untrained individuals were asked to intuitively assign each photograph to one of four categories: blue, intermediate-blue, intermediate-brown, or brown eyes. Herein, a two-category system (blue and brown) was used. Therefore, blue and intermediate-blue were grouped into the blue category (n = 43), and brown and intermediate-brown were grouped into the brown category (n = 51). A quantitative eye colour score (Pixel Index of the Eye (PIE)—score) was calculated for each individual’s eye photograph using the custom-made Digital Analysis of Iris Tool software (DIAT) v.1 [9]. All the photographs from individuals categorised with blue eye colour had PIE-scores ranging from −0.3 to 1, and all the photographs from individuals categorised with brown eye colour had PIE-scores from −1 to −0.07 (Table S1). Only two individuals with blue eyes and six individuals with brown eyes had PIE-scores in the overlapping region (−0.3 to −0.07). The categorised eye colour highly correlated with the PIE-scores (Spearman’s correlation coefficient: −0.85, p < 0.001). For further details, see Salvo et al. [10].

2.3. Quantitative Skin Colour Measurements

Quantitative measurements of skin colour were performed using a NCS (Natural Colour System) Colour Scan 2.0 on the inside of the upper arm on 507 individuals in the Norwegian biobank population. The output metrics consisted of L* a* b* (L* = lightness, a* = red, b* = yellow). The L* value measures skin lightness, ranging from 0 to 100, where 0 is the darkest and 100 is the lightest. For a consensus measurement, at least three measurements were made, avoiding pigmentation spots and hair.

2.4. Probe Design and Sequencing the OCA2-HERC2 Region

The software SureDesign (Agilent Technologies, Santa Clara, CA, USA) was used to design capture-probes for a ~500 kbp region on chromosome 15 (GRCh38, chr15: 27,754,870–28,254,863) for SureSelectXT HS2 Target Enrichment System (Agilent Technologies, Santa Clara, CA, USA). The design included 15,627 probes targeting 427,954 bp. The library preparation was carried out according to the SureSelextXT HS2 DNA System protocol, version D0. All samples were sequenced on an Illumina MiSeq (Illumina, San Diego, CA, USA) according to the manufacturer’s instructions with paired-end sequencing (2 × 150 bp) using the MiSeq Reagent Kit V2 (300 cycles).

2.5. Analysis of Sequencing Data

The sequencing output was automatically converted to FASTQ files with the MiSeq Reporter Software. Agilent’s molecular barcode (MBC) sequences were extracted using the Agilent Genomics NextGen Toolkit (AGeNT), and the FASTQ files were trimmed using AdapterRemoval version 2.1.3 [17,18] with a minimum read length of 30 bp and Phred quality score of Q = 30. The files were subsequently aligned to the human reference sequence assembly GRCh38/hg38 with the Burrows–Wheeler Aligner, BWA-MEM algorithm [19,20]. Sequence alignment map (SAM) files were converted into binary alignment map (BAM) files using SAMtools [21]. The Genome Analysis Toolkit (GATK) with HaplotypeCaller version 4.0.0.0 [22] was used to create Variant Call Format (VCF) files. Variants located in the regions of interest were extracted using BEDTools version 2.22.1 [23]. Genotypes were accepted if the read depth was ≥10 and the heterozygote balance (Hb = read depth of allele/read depth of nucleotide position) was 0.20 < Hb < 0.80.
The fisher.test and Kruskal.test commands in R: A Language and Environment for Statistical Computing version 4.2.1 (R Foundation for Statistical Computing, Vienna, Austria) [24] were used to perform Fisher’s exact test and Kruskal–Wallis test of the statistical association between genotype data, and eye colour categories and PIE-scores, respectively. Mann–Whitney tests for two independent samples and linear regression analysis were performed using the Real Statistics Resource Pack software (release 6.3; copyright (2013–2021) Charles Zaiontz, www.real-statistics.com, accessed on 16 January 2022). Hardy–Weinberg equilibrium (HWE) and pairwise r2 values for linkage disequilibrium (LD) testing were calculated using Haploview 4.2 [25]. For predictions of regulatory elements, the positions of the candidate variants were overlapped with datasets from the SCREEN: Search Candidate cis-Regulatory Elements by ENCODE platform, Registry of cCREs V3 [26]. Haplotypes were estimated using PHASE version 2.1 (University of Washington, Seattle, WA, USA) [27,28], using default settings.

2.6. Variant Typing

Eight candidate variants for blue eye colour, rs551217952, rs543971307, rs191109490, rs78544415, rs74409036, rs62008729, rs72714116 and rs12913832, were typed in the remaining 446 individuals in the Norwegian biobank population (total n = 540) using the SNaPshotTM multiplex system (Thermo Fisher Scientific, Waltham, MA, USA). The SNPs rs543971307 and rs191109490 were in complete LD. Qiagen Multiplex PCR kit (Qiagen, Hilden, Germany) was used to amplify DNA in one multiplex reaction in a final volume of 10 µL. The following PCR conditions were used: 95 °C for 15 min, 35 cycles of 94 °C for 30 s, 58 °C for 30 s and 72 °C for 30 s, and final extension at 72 °C for 10 min. A total of 5 µL amplified DNA was treated with 2 µL ExoSAP-ITTM PCR product clean-up reagent (Thermo Fisher Scientific, Waltham, MA, USA) at 37 °C for 60 min and 75 °C for 15 min. Single base extension (SBE) was performed with 1 µL purified PCR product, 1 µL MilliQ-water, 2 µL SNaPshot™ Multiplex Ready Reaction Mix (Thermo Fisher Scientific, Waltham, MA, USA) and 1 µL SBE primer mix (Table S2). The following cycle conditions were used: 30 cycles of 96 °C for 10 s, 55 °C for 5 s and 60 °C for 30 s. The SBE PCR products were then treated with 1 µL SAP (Thermo Fisher Scientific, Waltham, MA, USA) at 37 °C for 30 min and 75 °C for 15 min. Separation and detection of treated SBE products were carried out on a 3500xL Genetic Analyzer (Thermo Fisher Scientific, Waltham, MA, USA) using FragmentAnalysis with POP-4™ polymer, a 36 cm capillary and Dye Set E5. The analysis was performed with 1 µL SAP-treated SBE products, 20 µL Hi-Di formamide and GeneScan™-120 LIZ® Size Standard (200:1). The results were analysed with GeneMapper® ID-X v.1.5 (Thermo Fisher Scientific, Waltham, MA, USA), with a minimum peak height threshold of 50 RFU. In total, 425 individuals were successfully typed for all variants and were, together with the 94 sequenced individuals, used for population-specific analysis (total n = 519).

3. Results

3.1. Sequencing of the HERC2-OCA2 Region and Discovery of Candidate Blue Eye Colour Variants

Approximately 500 kbp of the HERC2-OCA2 region was sequenced in a study cohort of 94 individuals (43 in study group and 51 in control group). Across the target region, the median read depth ranged from 116 to 498 reads. In the captured 427,954 bp, a total of 2571 variants were identified after excluding 16 variant positions with less than 10 reads in more than 5% of the samples. Of the 2571 variants, 146 showed significant deviations from HWE (p-value < 0.05), and many variants were in strong or complete LD (pairwise r2 ≥ 0.8). A total of 419 independent variants were identified using the Tagger function in Haploview. Among the rs12913832 AA individuals, 24 variants were statistically significantly associated with both the PIE-score and eye colour categories (Kruskal–Wallis test and Fisher’s exact test, raw p-values < 0.05; Table S3). Among the rs12913832 AG individuals, 47 variants were statistically significantly associated with eye colour using both tests (Table S3). None of the variants were statistically associated with eye colour after Bonferroni correction for multiple testing (p-value < 0.00012 with m = 419 independent loci).
Six missense OCA2 variants were observed in the study cohort (rs1800414, rs74653330, rs121918166, rs1800407, rs1800401 and rs33929465). The SNPs rs74653330:T and rs121918166:T were observed in 24 individuals (56%) from the study group, whereas they were only observed in five individuals (9.8%) from the control group. These variants were statistically significantly associated with both quantitative and categorical eye colour (Kruskal–Wallis/Mann–Whitney test and Fisher’s exact test, raw p-values < 0.05). Thus, rs74653330:T and rs121918166:T could potentially explain blue eye colour (Figure S1). These two variants were previously typed in the same biobank population [16]. The other four missense variants (rs1800414, rs1800407, rs1800401 and rs33929465) were not associated with blue eye colour.
When searching among the individuals who did not carry the two missense variants, rs74653330:T and rs121918166:T (n = 65), we selected the six OCA2-HERC2 candidate variants rs74409036, rs78544415, rs72714116, rs191109490, rs551217952 and rs62008729 as the most promising candidates to explain blue eye colour (Table 2). These variants had a significant association with the PIE-score (Mann–Whitney test, raw p-value < 0.05), and the variant (minor) alleles were primarily observed in the study group and not in the control group (Figure 1 and Table 2). For population-specific association analysis, these candidate variants were typed in the remaining samples of the Norwegian biobank population.

3.2. Candidate Blue Eye Colour Variants and Their Association with Eye Colour in the Norwegian Population

The six candidate variants were all intronic, either in OCA2 or HERC2 (Figure S2). Only one variant, rs191109490, was predicted by SCREEN [26] to be in a candidate cis-regulatory element (cCRE) in foreskin melanocytes (EH38E1749549). Interestingly, the locus is located in HERC2, only 69bp from rs12913832 (r2 = 0). Notably, rs191109490 was in complete LD with rs543971307 (r2 = 1), which was also found to be in the cCRE in HERC2, close to rs12913832.
When typed in the Norwegian biobank population (n = 519), the six candidate variants had low frequencies (0.002–0.082) and were mainly observed in blue eyed individuals (Table 3 and Figure S3). Of the six variants, the SNP rs62008729:T showed the highest frequency of 0.082 (Table 1). However, five of eleven (45%) rs12913832 AG individuals carrying this variant had brown eyes (Figure S3). Therefore, this SNP was excluded from further analysis.
For statistical association analysis with eye colour variation in the Norwegian biobank population, haplotypes were estimated based on the genotypes of the five candidate variants, rs74409036:A, rs78544415:T, rs72714116:T, rs191109490:C and rs551217952:C, and with the missense variants, rs74653330:T and rs121918166:T. In total, 11 haplotypes were estimated (Table 4). The best estimated haplotype pairs (by PHASE) and PIE-scores for all 519 individuals are shown in Figure 2. Despite some of the haplotypes being observed in only a few individuals, the total correlation of the haplotypes to the PIE-score was 0.8 (linear regression, adjusted R2 = 0.80). All haplotype pairs, except AA, AG, AA3 and AG1, had a median PIE-score higher than −0.3, indicating blue eye colour (Figure 2A). The median PIE-scores of the haplotype pair AA and AG were −1 (Q1 = −1 and Q3 = −0.95) and −0.92 (Q1 = −0.97 and Q3 = −0.74), respectively. This indicates that most rs12913832 AA and AG individuals with blue eye colour did not carry one of these two haplotype pairs. The haplotype AG1 had the largest variation, with PIE-score ranging from −0.94 (brown) to 1 (blue), but this haplotype was only observed in five individuals. The haplotype A5G had a smaller variation (PIE-score ranging from −1 to 0.42), but the median PIE-score was 0.10. All, except one brown-eyed individual carrying this haplotype, had blue eyes with a pupillary ring of brown, resulting in perception of green elements. Interestingly, AA6 and A6G, with the rs191109490:C variant that was predicted to be in a cCRE, both showed high PIE-scores of 0.51 (blue) and 1 (blue), respectively. All rs12913832 GG haplotype pairs had median PIE-scores of 0.98 to 1 (blue) (Figure 2B).

3.3. HERC2-OCA2 Haplotypes and Their Association with Skin Colour in the Norwegian Population

Because SNPs in HERC2-OCA2 are also associated with skin pigmentation, the haplotypes in Table 4 were assessed for association with skin shade, from dark to light, in the Norwegian biobank population (Figure 3). The lightness values (L*) of 507 individuals were measured and used as a metric for skin colour. The L* value in the population varied from 46.92 to 77.58, with a median of 68.51. No overall significant associations were observed between the haplotypes and L* values. However, a trend of lighter skin colour was observed in individuals with rs74653330:T on a background of rs12913832:AA and AG genotypes. In total, 21 (91%) of 23 individuals with rs74653330:T (Haplotypes A3 and G3) had L* values higher or equal to the median skin colour in the population (L* = 68.51), indicating light skin colour. Furthermore, individuals with A3G had significantly lighter skin colour than the rest of the population (raw p-value < 0.02).

4. Discussion

In the present study, we sequenced a ~500 kbp region of OCA2-HERC2 to search for candidate blue eye colour variants in individuals carrying the rs12913832:AA or AG genotype. In the Norwegian biobank population, 43 individuals (study group) did not have the expected brown eye colour based on rs12913832 [10]. Because of the strong selection pressure on OCA2-HERC2 in blue-eyed Europeans [5], it was hypothesised that other variants in this region may have an effect on OCA2 function, either by regulating the protein expression or by altering the protein.
Assessment of the sequencing quality showed that the captured region was successfully sequenced with good quality (median read depth between samples ranged from 116 to 498 reads across the target regions). Only 16 variant positions had less than 10 reads in more than 5% of the samples and were excluded. Four of these variants were in OCA2, and one of them overlapped with a deletion that was previously identified in 35% (15/43) of the study samples, which could potentially explain the low read depth [29].
In addition, to confirm the association of the missense variants rs74653330 and rs121918166 with eye colour, we identified five new candidate variants (rs74409036:A, rs78544415:T, rs72714116:T, rs191109490:C and rs551217952:C) that may potentially explain blue eye colour in individuals with the rs12913832:AA and AG genotype. Independently, they were all associated with quantitative eye colour (PIE-score) in the Norwegian biobank population, and 37 (86%) of the 43 blue-eyed individuals in the study group had at least one of the seven variants (including rs74653330:T and rs121918166:T). To our knowledge, this is the first time that the variants rs74409036, rs78544415, rs72714116, rs191109490 and rs551217952 are reported to be associated with human eye colour variation. These variants have very low frequencies and are mainly observed in Europeans, and some of them also in Americans (https://gnomad.broadinstitute.org/, accessed on 8 December 2022). The SNP rs74653330:T is also moderately frequent in East Asia, where it is associated with lighter skin colour [15].
The candidate variants rs74409036, rs78544415, rs72714116, rs551217952 and rs191109490 are all intronic. With the currently available data, rs191109490 was the only variant found by SCREEN to be in a cCRE in melanocytes [26]. Interestingly, rs191109490 is contained in a distal enhancer-like signature in HERC2 in foreskin melanocytes, indicating a regulatory effect on OCA2 expression and thereby, pigmentation. However, the specific role of rs191109490 in the cCRE is yet to be identified. Despite rs191109490:C only being observed in two individuals (haplotype pairs: AA6 and A6G), these had higher PIE-scores (perceived as blue) than individuals with the AA and AG haplotype pairs. Notably, one individual carried the rs12913832:AA genotype and was incorrectly predicted to have brown eyes with a high probability (p-value = 0.96) by the IrisPlex tool (Table S1). Additionally, the individual with A6G showed lighter skin colour than most GG haplotypes, supporting the hypothesis that rs191109490:C influences OCA2 expression.
Individuals with rs72714116:T had the same haplotype pair, A5G, and four of the five rs72714116:T individuals had PIE-scores close to the mid-scale (0.05–0.4). These individuals had intermediate-blue eye colour that might be perceived as green or hazel. As discussed in many studies, eye colour is on a continuous scale from dark brown to light blue, and perception of colour varies from person to person (e.g., [10,30,31]). The most popular prediction tool for eye colour in forensic genetics, the IrisPlex, uses three eye colour categories, blue, intermediate and brown [6,13,14]. However, prediction of intermediate eye colour has proved to be difficult because of the lack of markers associated with the intermediate eye colour and because of the individual perception of eye colour [6,10,30,31]. Thus, several authors have suggested using a two-category system (blue and brown) [16,32,33] and others a quantitative system [9,34]. Our results indicate that the variant rs72714116:T may be an example of a marker for non-blue/non-brown eye colours in the Norwegian biobank population. However, this was observed in only four individuals, and the findings needs to be replicated in more samples.
Individuals estimated with the rs74409036:A rs12913832:G haplotype (AG1) had large variations in PIE-scores, and therefore it may not be easy to predict the eye colour of AG1 individuals. However, when rs74409036:A was observed together with other variants in the haplotype pairs A1G, A3G1 and A4G1, the individuals’ PIE-scores were significantly higher (perceived as blue) than AG1 individuals. Individuals with A3G1 and A4G1 haplotypes also carried the missense mutations rs74653330:T and rs121918166:T. These two variants are associated with blue eye colour in Scandinavians [8]. We have previously demonstrated that inclusion of these variants in the EC11 prediction model led to more correct predictions in the Norwegian biobank population than with the IrisPlex model [16], which supports their importance for prediction of eye colour. Interestingly, three blue-eyed individuals in the study group had the genotype rs12913832:AA, but were predicted to have brown eye colour by the IrisPlex model with high p-values (0.96, 0.97 and 0.98). In addition to the one individual with rs191109490:C (already mentioned), two had the rs74653330:T variant, potentially explaining blue eye colour in all these rs12913832:AA individuals.
In the present study, we also observed lighter skin colour in individuals with the rs74653330:T rs12913832:A haplotype (haplotype pairs: AA3, A3A3, A3G3 and A3G). This is in line with previous findings that this variant is associated with skin pigmentation variation in East Asians and Scandinavians [8,15]. Andersen et al. [8] observed lighter skin colour in individuals with the rs121918166:T rs12913832:A haplotype compared to rs12913832:GG individuals. However, we did not observe this trend in the Norwegian biobank population.
Here, we demonstrate high correlation between variants in the HERC2-OCA2 region and eye colour in the Norwegian biobank population. Their independent effect on human eye colour variation may be smaller than the HERC2 SNP rs12913832. However, including them in prediction models improves the precision for prediction of blue-eyed individuals with rs12913832:AA and AG genotypes. It would be interesting to assess the associations in similar populations, e.g., other Scandinavian or Northern European populations. While the HERC2-OCA2 region explains most of the blue and brown eye colour variability in Europeans, SNPs in other pigmentation genes, such as TYR, TYRP1, SLC24A4, SLC45A2, ASIP and IRF4, are also found to be associated with eye colour [35,36,37,38,39], albeit with varying population-specific effects [32,40]. We previously assessed the association between the IrisPlex SNPs (rs12913832 in HERC2, rs1800407 in OCA2, rs12896399 in SLC24A4, rs16891982 in SLC45A2, rs1393350 in TYR and rs12203592 in IRF4) and PIE-score in the Norwegian biobank population, and only observed rs16891982 in SLC45A2 to be significantly associated with blue eye colour in rs12913832:AA and AG individuals [10]. The protein SLC45A2 might have a similar role in melanosome maturation as OCA2 [41]. Thus, SLC45A2 may also be a target of interest to search for new blue eye colour variants. A recent GWAS identified 50 novel loci associated with eye colour, including pigmentation genes and genes involved in iris morphology and structure [42]. This demonstrates that human eye colour is a genetically highly complex trait, and the molecular mechanisms behind eye colour formation are yet to be fully understood. In the current study, we identify intronic variants to be associated with blue eye colour. Although the functional role of these variants in eye colour formation is still unknown, we suggest including them in future prediction models.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes14030698/s1, Figure S1: PIE-scores and genotypes of rs121918166 and rs74653330 in the study cohort (n = 94); Figure S2: Chromosomal position of the six blue eye colour candidate variants (red), the two missense variants rs74653330 and rs121918166, and the main eye colour predictor SNP rs12913832. Pos = chromosomal position (GRCh38); Figure S3: Genotypes and PIE-scores for the candidate blue eye colour variants typed in rs12913832 AA and AG individuals (n = 165) from the Norwegian biobank population; Table S1: Genotype and eye colour information of the study population; Table S2: Information on primers used for SBE typing of candidate variants; Table S3: Variants that were statistically significantly associated with both PIE-score and eye colour categories (Kruskal–Wallis test and Fisher’s exact test, raw p-values ≤0.05) in rs12913832 AA and AG individuals.

Author Contributions

Conceptualisation, N.M.S., J.D.A., K.J., T.B., C.B. and G.-H.O.; formal analysis, N.M.S. and J.D.A.; investigation, N.M.S. and K.J.; writing—original draft preparation, N.M.S.; writing—review and editing, N.M.S., J.D.A., K.J., O.L.M., T.B., C.B. and G.-H.O.; visualisation, N.M.S.; supervision, K.J., T.B. and G.-H.O. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Faculty of Health Sciences, UiT—The Arctic University of Norway (2021/2034).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data generated in the present study are included within the manuscript and supplementary files.

Acknowledgments

A special thanks to all participants. Thanks to Anthony Mathelier, from the Computational Biology & Gene Regulation Group at the Centre for Molecular Medicine Norway (NCMM) for helping with intersecting SNP data with cCRE data from SCREEN and for helpful discussions.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Sturm, R.A.; Larsson, M. Genetics of human iris colour and patterns. Pigment. Cell Melanoma Res. 2009, 22, 544–562. [Google Scholar] [CrossRef] [PubMed]
  2. Liu, F.; Wen, B.; Kayser, M. Colorful DNA polymorphisms in humans. Semin. Cell Dev. Biol. 2013, 24, 562–575. [Google Scholar] [CrossRef] [PubMed]
  3. Visser, M.; Kayser, M.; Palstra, R.-J. HERC2 rs12913832 modulates human pigmentation by attenuating chromatin-loop formation between a long-range enhancer and the OCA2 promoter. Genome Res. 2012, 22, 446–455. [Google Scholar] [CrossRef]
  4. Sturm, R.A.; Duffy, D.L.; Zhao, Z.Z.; Leite, F.P.; Stark, M.S.; Hayward, N.K.; Martin, N.G.; Montgomery, G.W. A Single SNP in an Evolutionary Conserved Region within Intron 86 of the HERC2 Gene Determines Human Blue-Brown Eye Color. Am. J. Hum. Genet. 2008, 82, 424–431. [Google Scholar] [CrossRef] [PubMed]
  5. Donnelly, M.P.; Paschou, P.; Grigorenko, E.; Gurwitz, D.; Barta, C.; Lu, R.-B.; Zhukova, O.V.; Kim, J.-J.; Siniscalco, M.; New, M.; et al. A global view of the OCA2-HERC2 region and pigmentation. Hum. Genet. 2011, 131, 683–696. [Google Scholar] [CrossRef] [PubMed]
  6. Walsh, S.; Wollstein, A.; Liu, F.; Chakravarthy, U.; Rahu, M.; Seland, J.H.; Soubrane, G.; Tomazzoli, L.; Topouzis, F.; Vingerling, J.R.; et al. DNA-based eye colour prediction across Europe with the IrisPlex system. Forensic Sci. Int. Genet. 2012, 6, 330–340. [Google Scholar] [CrossRef]
  7. Eiberg, H.; Troelsen, J.; Nielsen, M.; Mikkelsen, A.; Mengel-From, J.; Kjaer, K.W.; Hansen, L. Blue eye color in humans may be caused by a perfectly associated founder mutation in a regulatory element located within the HERC2 gene inhibiting OCA2 expression. Hum. Genet. 2008, 123, 177–187. [Google Scholar] [CrossRef] [PubMed]
  8. Andersen, J.D.; Pietroni, C.; Johansen, P.; Andersen, M.M.; Pereira, V.; Børsting, C.; Morling, N. Importance of nonsynonymous OCA 2 variants in human eye color prediction. Mol. Genet. Genom. Med. 2016, 4, 420–430. [Google Scholar] [CrossRef]
  9. Andersen, J.D.; Johansen, P.; Harder, S.; Christoffersen, S.R.; Delgado, M.C.; Henriksen, S.T.; Nielsen, M.M.; Sørensen, E.; Ullum, H.; Hansen, T.; et al. Genetic analyses of the human eye colours using a novel objective method for eye colour classification. Forensic Sci. Int. Genet. 2013, 7, 508–515. [Google Scholar] [CrossRef]
  10. Salvo, N.M.; Janssen, K.; Kirsebom, M.K.; Meyer, O.S.; Berg, T.; Olsen, G.-H. Predicting eye and hair colour in a Norwegian population using Verogen’s ForenSeq™ DNA signature prep kit. Forensic Sci. Int. Genet. 2021, 56, 102620. [Google Scholar] [CrossRef]
  11. Liu, F.; van Duijn, K.; Vingerling, J.R.; Hofman, A.; Uitterlinden, A.G.; Janssens, A.C.J.; Kayser, M. Eye color and the prediction of complex phenotypes from genotypes. Curr. Biol. 2009, 19, R192–R193. [Google Scholar] [CrossRef] [PubMed]
  12. Duffy, D.; Montgomery, G.; Chen, W.; Zhao, Z.Z.; Le, L.; James, M.R.; Hayward, N.K.; Martin, N.; Sturm, R.A. A Three–Single-Nucleotide Polymorphism Haplotype in Intron 1 of OCA2 Explains Most Human Eye-Color Variation. Am. J. Hum. Genet. 2007, 80, 241–252. [Google Scholar] [CrossRef] [PubMed]
  13. Walsh, S.; Liu, F.; Ballantyne, K.N.; van Oven, M.; Lao, O.; Kayser, M. IrisPlex: A sensitive DNA tool for accurate prediction of blue and brown eye colour in the absence of ancestry information. Forensic Sci. Int. Genet. 2011, 5, 170–180. [Google Scholar] [CrossRef] [PubMed]
  14. Walsh, S.; Liu, F.; Wollstein, A.; Kovatsi, L.; Ralf, A.; Kosiniak-Kamysz, A.; Branicki, W.; Kayser, M. The HIrisPlex system for simultaneous prediction of hair and eye colour from DNA. Forensic Sci. Int. Genet. 2012, 7, 98–115. [Google Scholar] [CrossRef]
  15. Eaton, K.; Edwards, M.; Krithika, S.; Cook, G.; Norton, H.; Parra, E.J. Association study confirms the role of two OCA2 polymorphisms in normal skin pigmentation variation in East Asian populations. Am. J. Hum. Biol. 2015, 27, 520–525. [Google Scholar] [CrossRef]
  16. Meyer, O.; Salvo, N.; Kjærbye, A.; Kjersem, M.; Andersen, M.; Sørensen, E.; Ullum, H.; Janssen, K.; Morling, N.; Børsting, C.; et al. Prediction of Eye Colour in Scandinavians Using the EyeColour 11 (EC11) SNP Set. Genes 2021, 12, 821. [Google Scholar] [CrossRef]
  17. Schubert, M.; Lindgreen, S.; Orlando, L. AdapterRemoval v2: Rapid adapter trimming, identification, and read merging. BMC Res. Notes 2016, 9, 88. [Google Scholar] [CrossRef]
  18. Lindgreen, S. AdapterRemoval: Easy cleaning of next-generation sequencing reads. BMC Res. Notes 2012, 5, 337. [Google Scholar] [CrossRef]
  19. Li, H.; Durbin, R. Fast and accurate long-read alignment with Burrows–Wheeler transform. Bioinformatics 2010, 26, 589–595. [Google Scholar] [CrossRef] [PubMed]
  20. Li, H. Aligning Sequence Reads, Clone Sequences and Assembly Contigs with BWA-MEM. 2013. Available online: http://github.com/lh3/bwa (accessed on 31 January 2022).
  21. Li, H.; Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 2009, 25, 1754–1760. [Google Scholar] [CrossRef]
  22. McKenna, A.; Hanna, M.; Banks, E.; Sivachenko, A.; Cibulskis, K.; Kernytsky, A.; Garimella, K.; Altshuler, D.; Gabriel, S.; Daly, M.; et al. The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010, 20, 1297–1303. [Google Scholar] [CrossRef] [PubMed]
  23. Quinlan, A.R.; Hall, I.M. BEDTools: A flexible suite of utilities for comparing genomic features. Bioinformatics 2010, 26, 841–842. [Google Scholar] [CrossRef] [PubMed]
  24. R Core Team. R: A Language and Environment for Statistical Computing. 2019. Available online: https://www.r-project.org/ (accessed on 10 March 2023).
  25. Barrett, J.C.; Fry, B.; Maller, J.; Daly, M.J. Haploview: Analysis and visualization of LD and haplotype maps. Bioinformatics 2005, 21, 263–265. [Google Scholar] [CrossRef] [PubMed]
  26. The ENCODE Project Consortium; Moore, J.E.; Purcaro, M.J.; Pratt, H.E.; Epstein, C.B.; Shoresh, N.; Adrian, J.; Kawli, T.; Davis, C.A.; Dobin, A.; et al. Expanded encyclopaedias of DNA elements in the human and mouse genomes. Nature 2020, 583, 699–710. [Google Scholar] [CrossRef] [PubMed]
  27. Stephens, M.; Smith, N.J.; Donnelly, P. A New Statistical Method for Haplotype Reconstruction from Population Data. Am. J. Hum. Genet. 2001, 68, 978–989. [Google Scholar] [CrossRef]
  28. Stephens, M.; Scheet, P. Accounting for Decay of Linkage Disequilibrium in Haplotype Inference and Missing-Data Imputation. Am. J. Hum. Genet. 2005, 76, 449–462. [Google Scholar] [CrossRef] [PubMed]
  29. Salvo, N.M.; Janssen, K.; Olsen, G.-H.; Berg, T.; Andersen, J.D. Association between copy number variations in the OCA2-HERC2 locus and human eye colour. Forensic Sci. Int. Genet. Suppl. Ser. 2022, 8, 82–84. [Google Scholar] [CrossRef]
  30. Meyer, O.S.; Børsting, C.; Andersen, J.D. Perception of blue and brown eye colours for forensic DNA phenotyping. Forensic Sci. Int. Genet. Suppl. Ser. 2019, 7, 476–477. [Google Scholar] [CrossRef]
  31. Ruiz, Y.; Phillips, C.; Gomez-Tato, A.; Alvarez-Dios, J.; de Cal, M.C.; Cruz, R.; Maroñas, O.; Söchtig, J.; Fondevila, M.; Rodriguez-Cid, M.; et al. Further development of forensic eye color predictive tests. Forensic Sci. Int. Genet. 2012, 7, 28–40. [Google Scholar] [CrossRef]
  32. Pietroni, C.; Andersen, J.D.; Johansen, P.; Andersen, M.M.; Harder, S.; Paulsen, R.R.; Børsting, C.; Morling, N. The effect of gender on eye colour variation in European populations and an evaluation of the IrisPlex prediction model. Forensic Sci. Int. Genet. 2014, 11, 1–6. [Google Scholar] [CrossRef]
  33. Hart, K.L.; Kimura, S.L.; Mushailov, V.; Budimlija, Z.M.; Prinz, M.; Wurmbach, E. Improved eye- and skin-color prediction based on 8 SNPs. Croat. Med. J. 2013, 54, 248–256. [Google Scholar] [CrossRef] [PubMed]
  34. Paparazzo, E.; Gozalishvili, A.; Lagani, V.; Geracitano, S.; Bauleo, A.; Falcone, E.; Passarino, G.; Montesanto, A. A new approach to broaden the range of eye colour identifiable by IrisPlex in DNA phenotyping. Sci. Rep. 2022, 12, 12803. [Google Scholar] [CrossRef] [PubMed]
  35. Sulem, P.; Gudbjartsson, D.; Stacey, S.N.; Helgason, A.; Rafnar, T.; Magnusson, K.P.; Manolescu, A.; Karason, A.; Palsson, A.; Thorleifsson, G.; et al. Genetic determinants of hair, eye and skin pigmentation in Europeans. Nat. Genet. 2007, 39, 1443–1452. [Google Scholar] [CrossRef] [PubMed]
  36. Han, J.; Kraft, P.; Nan, H.; Guo, Q.; Chen, C.; Qureshi, A.; Hankinson, S.E.; Hu, F.B.; Duffy, D.; Zhao, Z.Z.; et al. A Genome-Wide Association Study Identifies Novel Alleles Associated with Hair Color and Skin Pigmentation. PLoS Genet. 2008, 4, e1000074. [Google Scholar] [CrossRef]
  37. Sulem, P.; Gudbjartsson, D.F.; Stacey, S.N.; Helgason, A.; Rafnar, T.; Jakobsdottir, M.; Steinberg, S.; Gudjonsson, S.A.; Palsson, A.; Thorleifsson, G.; et al. Two newly identified genetic determinants of pigmentation in Europeans. Nat. Genet. 2008, 40, 835–837. [Google Scholar] [CrossRef] [PubMed]
  38. Kanetsky, P.A.; Swoyer, J.; Panossian, S.; Holmes, R.; Guerry, D.; Rebbeck, T.R. A Polymorphism in the Agouti Signaling Protein Gene Is Associated with Human Pigmentation. Am. J. Hum. Genet. 2002, 70, 770–775. [Google Scholar] [CrossRef]
  39. Frudakis, T.; Thomas, M.; Gaskin, Z.; Venkateswarlu, K.; Chandra, K.S.; Ginjupalli, S.; Gunturi, S.; Natrajan, S.; Ponnuswamy, V.K.; Ponnuswamy, K.N. Sequences Associated with Human Iris Pigmentation. Genetics 2003, 165, 2071–2083. [Google Scholar] [CrossRef]
  40. Pośpiech, E.; Karłowska-Pik, J.; Ziemkiewicz, B.; Kukla, M.; Skowron, M.; Wojas-Pelc, A.; Branicki, W. Further evidence for population specific differences in the effect of DNA markers and gender on eye colour prediction in forensics. Int. J. Leg. Med. 2016, 130, 923–934. [Google Scholar] [CrossRef]
  41. Le, L.; Escobar, I.E.; Ho, T.; Lefkovith, A.J.; Latteri, E.; Haltaufderhyde, K.D.; Dennis, M.K.; Plowright, L.; Sviderskaya, E.V.; Bennett, D.C.; et al. SLC45A2 protein stability and regulation of melanosome pH determine melanocyte pigmentation. Mol. Biol. Cell 2020, 31, 2687–2702. [Google Scholar] [CrossRef]
  42. Simcoe, M.; Valdes, A.; Liu, F.; Furlotte, N.A.; Evans, D.M.; Hemani, G.; Ring, S.M.; Smith, G.D.; Duffy, D.L.; Zhu, G.; et al. Genome-wide association study in almost 195,000 individuals identifies 50 previously unidentified genetic loci for eye color. Sci. Adv. 2021, 7, eabd1239. [Google Scholar] [CrossRef]
Figure 1. Genotypes and PIE-scores for the candidate blue eye colour variants rs74409036, rs78544415, rs72714116, rs191109490, rs551217952 and rs62008729 in 65 rs12913832 AA and AG individuals that did not have the missense variants rs74653330:T and rs121918166:T. Statistical significance (Mann–Whitney test): * p < 0.05, ** p < 0.01.
Figure 1. Genotypes and PIE-scores for the candidate blue eye colour variants rs74409036, rs78544415, rs72714116, rs191109490, rs551217952 and rs62008729 in 65 rs12913832 AA and AG individuals that did not have the missense variants rs74653330:T and rs121918166:T. Statistical significance (Mann–Whitney test): * p < 0.05, ** p < 0.01.
Genes 14 00698 g001
Figure 2. Estimated haplotype pairs by PHASE and PIE-scores in the Norwegian biobank population (n = 519) comprised of the haplotypes in Table 4. (A) Shows rs12913832 AA and AG haplotype pairs. All rs12913832 GG haplotype pairs are grouped together as G*G* and are shown in more detail in (B) (n = 363). G* = rs12913832:G genotype regardless of haplotype. The number of individuals carrying the haplotype pair is indicated in red.
Figure 2. Estimated haplotype pairs by PHASE and PIE-scores in the Norwegian biobank population (n = 519) comprised of the haplotypes in Table 4. (A) Shows rs12913832 AA and AG haplotype pairs. All rs12913832 GG haplotype pairs are grouped together as G*G* and are shown in more detail in (B) (n = 363). G* = rs12913832:G genotype regardless of haplotype. The number of individuals carrying the haplotype pair is indicated in red.
Genes 14 00698 g002
Figure 3. Estimated haplotype pairs by PHASE and lightness value (L*) of skin colour in the Norwegian biobank population (n = 507), comprised of the rs12913832 AA and AG haplotype pairs (Table 4). All rs12913832 GG haplotype pairs are grouped together as G*G*. G* = rs12913832:G genotype regardless of haplotype. The number of individuals carrying each haplotype pair is indicated in red.
Figure 3. Estimated haplotype pairs by PHASE and lightness value (L*) of skin colour in the Norwegian biobank population (n = 507), comprised of the rs12913832 AA and AG haplotype pairs (Table 4). All rs12913832 GG haplotype pairs are grouped together as G*G*. G* = rs12913832:G genotype regardless of haplotype. The number of individuals carrying each haplotype pair is indicated in red.
Genes 14 00698 g003
Table 1. Number of rs12913832 AA and AG individuals with blue or brown eye colour selected for sequencing of the OCA2-HERC2 region.
Table 1. Number of rs12913832 AA and AG individuals with blue or brown eye colour selected for sequencing of the OCA2-HERC2 region.
rs12913832Brown
(Control Group)
Blue
(Study Group)
AA213
AG3040
Total5143
Table 2. The six OCA2-HERC2 candidate blue eye colour variants.
Table 2. The six OCA2-HERC2 candidate blue eye colour variants.
Candidate VariantChromosomal Position (GRCh38)Frequency Brown (n = 46) 1Frequency Blue (n = 19) 1Variants in Strong LD 2 (r2 ≥ 0.8)
rs7440903627,818,6060.030.11rs78114576
rs7854441527,842,6340.000.05rs79071800, rs28565430, rs28423991, rs74474678 and rs80177321
rs7271411628,083,0610.000.08rs72714116
rs19110949028,120,4030.000.05rs543971307 and rs145048438
rs55121795228,139,3870.010.05rs182498200 and rs184120129
rs6200872928,173,1400.010.08rs62008729
1 Frequency of variant allele in 65 rs12913832 AA and AG individuals that did not have the missense variants rs74653330:T and rs121918166:T; 2 LD = Linkage disequilibrium.
Table 3. The frequencies of the six candidate variant alleles in the Norwegian biobank population.
Table 3. The frequencies of the six candidate variant alleles in the Norwegian biobank population.
Variant Allele Frequency
rs IDReference AlleleVariant AlleleFrequency Norway (n = 519)Frequency Europe (GnomAD)
rs74409036GA0.0310.052
rs78544415CT0.0260.035
rs72714116CT0.0040.020
rs191109490AC0.0020.002
rs551217952AC0.0030.003
rs62008729CT0.0820.024
Table 4. HERC2-OCA2 haplotypes estimated by PHASE.
Table 4. HERC2-OCA2 haplotypes estimated by PHASE.
Haplotypes a,crs74409036:Ars78544415:Trs74653330:Trs121918166:Trs72714116:Trs191109490:Crs551217952:CFrequency NOR b (n = 519)
A-------0.135
G-------0.772
G1A------0.028
G2-T-----0.025
G3-TT----0.001
G4---T---0.001
A1A-----C0.003
A3--T----0.024
A4---T---0.006
A5----T--0.004
A6-----C-0.002
a Haplotype A and G indicates the A-allele and G-allele in rs12913832, respectively; b NOR = Norwegian biobank population; c Reference allele.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Salvo, N.M.; Andersen, J.D.; Janssen, K.; Meyer, O.L.; Berg, T.; Børsting, C.; Olsen, G.-H. Association between Variants in the OCA2-HERC2 Region and Blue Eye Colour in HERC2 rs12913832 AA and AG Individuals. Genes 2023, 14, 698. https://doi.org/10.3390/genes14030698

AMA Style

Salvo NM, Andersen JD, Janssen K, Meyer OL, Berg T, Børsting C, Olsen G-H. Association between Variants in the OCA2-HERC2 Region and Blue Eye Colour in HERC2 rs12913832 AA and AG Individuals. Genes. 2023; 14(3):698. https://doi.org/10.3390/genes14030698

Chicago/Turabian Style

Salvo, Nina Mjølsnes, Jeppe Dyrberg Andersen, Kirstin Janssen, Olivia Luxford Meyer, Thomas Berg, Claus Børsting, and Gunn-Hege Olsen. 2023. "Association between Variants in the OCA2-HERC2 Region and Blue Eye Colour in HERC2 rs12913832 AA and AG Individuals" Genes 14, no. 3: 698. https://doi.org/10.3390/genes14030698

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop