Next Article in Journal
An Integrative Approach to Identifying Neuroprotective Natural Compounds for Neurodevelopmental Disorders
Previous Article in Journal
Antifibrotic Effects of an α7 Nicotinic Acetylcholine Receptor Agonist in Keloid Fibroblasts and a Rat Scar Model
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genotype-Phenotype Correlation of Seven Known and Novel β-Globin Gene Variants

1
Centre for Research and Development of Medical Diagnostic Laboratories, Faculty of Associated Medical Sciences, Khon Kaen University, Khon Kaen 40002, Thailand
2
Biomedical Science Research Unit, Faculty of Medicine, Mahasarakham University, Mahasarakham 44000, Thailand
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2025, 26(18), 8872; https://doi.org/10.3390/ijms26188872
Submission received: 4 August 2025 / Revised: 31 August 2025 / Accepted: 8 September 2025 / Published: 12 September 2025
(This article belongs to the Section Molecular Biology)

Abstract

Variants of uncertain significance (VUS) are often challenging for genetic counseling and require additional data for accurate variant classification. This study aims to describe the genotype-phenotype correlation of the seven β-globin gene variants found in Thailand. Retrospective data in a total of 45,914 subjects encountered at our diagnostic laboratory from January 2012 to December 2024 were reviewed. A total of 33 leftover EDTA blood specimens, suspected of having β-globin gene defects, were included. Eighty-nine normal subjects were also analyzed to confirm phenotypic expression of the variants. The whole β-globin and Krüppel-like factor 1 (KLF1) genes were examined using PCR-based methods. Seven nucleotide variants were identified among 33 suspected subjects, including a novel (β−206(C>G)), four hitherto undescribed in Thailand [β−198(A>G), βIVSII−180(T>C), βIVSII−337(A>G), and β*233(G>C)], and two known variants [β−50(G>A) and βIVSII−258(G>A)]. The β−198(A>G) and β*233(G>C) variants were also identified in 1.69% of normal subjects, indicating neutral DNA polymorphisms. All subjects of β−198(A>G), βIVSII−180(T>C), βIVSII−258(G>A), and βIVSII−337(A>G) with borderline Hb A2 levels had KLF1 mutations. Compound heterozygous β−206(C>G) and known β+-thalassemia trait revealed β-thalassemia trait phenotype. In silico pathogenicity prediction showed that the β−206(C>G), β−198(A>G), βIVSII−180(T>C), βIVSII−258(G>A), βIVSII−337(A>G), and β*233(G>C) were associated with benign variants. It was found that heterozygous β−50(G>A) had elevated Hb A2 levels resembling those of β-thalassemia trait. However, the association of the β−50(G>A) and Hb E or β-thalassemia revealed a phenotype of Hb E or β-thalassemia trait. Most prediction tools indicate that the β−50(G>A) is associated with benign variants; however, PromoterAI revealed that the β−50(G>A) is associated with under-expression of the β-globin gene with high sensitivity. Based on these findings, the β−50(G>A) is most likely a very mild β+-thalassemia allele. This study described the genotype-phenotype correlation of known and novel β-globin gene variants found in Thailand. The data should prove useful for accurate variant classification, genetic counseling, and a prevention and control program of severe thalassemia diseases in Thailand.

1. Introduction

β-Thalassemia is a group of genetic diseases characterized by defects of β-globin chain production. Two main types, β0- and β+-thalassemia, are, respectively, defined based on the absence or reduction in β-globin chain synthesis. While no β-globin chain production is β0-thalasemia, a reduction in β-globin chain synthesis is β+-thalassemia. A very mild β+-thalassemia allele is sometimes referred to as β++-thalassemia [1,2]. This depends mainly on the β-globin gene defect, which has implications for genetic counseling and prenatal diagnosis of the disease. Heterozygous β-thalassemia is associated with raised Hb A2 level and reduced mean corpuscular volume (MCV) and mean corpuscular hemoglobin (MCH), whereas the homozygote and compound heterozygote with other β-hemoglobinopathies can express as various clinical symptoms, ranging from mild to severe hemolytic anemia, jaundice, hepatosplenomegaly, and iron overload [1]. In Thailand, the average frequency of α-thalassemia is 20–30%, β-thalassemia is 3–9%, and that of Hb E is 30–50%. The interaction of these genes could lead to several thalassemia syndromes with variable phenotypic expression ranging from mild thalassemia intermedia to severe transfusion-dependent thalassemia [3].
Normal human hemoglobin (Hb) is a heterotetramer that contains two pairs of different globin chains, including the α-like (ζ- or α-chains) and β-like (ε-, γ-, δ-, or β-chains) globin chains. Adult Hb (Hb A), consisting of two α- and two β-globin chains (α2β2), is the major human Hb, followed by 2–3% of Hb A22δ2) and less than 1% of Hb F (α2γ2), after one year of birth. Many nucleotide variants of globin genes causing thalassemia and hemoglobinopathies have been described worldwide [1]. Most known thalassemia and hemoglobinopathies defects have been found in the β-globin and the α-globin genes due to the major adult Hb components [2]. Although most of them are associated with known phenotypic expressions, some variants lack sufficient information to support a more definitive classification and are therefore classified as variants of uncertain significance (VUS). VUS is often challenging for genetic counseling and requires additional data for accurate variant classification [4,5]. At our routine thalassemia diagnostics, many subjects were identified as having known and novel VUS in the β-globin gene, leading to inconsistencies between genotype and phenotype and a difficulty in providing genetic counseling. This study aims to describe the genotype-phenotype correlation of the seven β-globin gene variants, including β−206(C>G), β−198(A>G), β−50(G>A), βIVSII−180(T>C), βIVSII−258(G>A), βIVSII−337(A>G), and β*233(G>C), found in Thailand.

2. Results

Seven β-globin gene variants were identified during routine thalassemia investigation of 33 referral subjects to our laboratory, including a novel one [β−206(C>G) (HBB:c.-256C>G, rs376005360, IthaID: 4153)], four hitherto undescribed in Thailand [β−198(A>G), βIVSII−180(T>C), βIVSII−337(A>G), and β*233(G>C)], and two known variants [β−50(G>A) and βIVSII−258(G>A)] [2,6,7,8,9,10,11,12]. The β−198(A>G), βIVSII−180(T>C), βIVSII−337(A>G), and β*233(G>C) have been described in various ethnic groups, such as Azerbaijani, Chinese, Malay, Ibany, and Palestinian [2,6,7,8,9,10,11,12]. The corresponding DNA sequencing profiles are shown in Figure 1. Table 1 lists data of the subjects, including Hb and hematological parameters, α- and β-globin genotypes, and KLF1 mutations. The first three subjects (Table 1, no. 1–3) carried novel variant β−206(C>G) in different combinations. All of them had elevated Hb A2 levels (4.4–5.1%). β-Globin gene analysis identified a compound heterozygote state in all of them, i.e., [β−206(C>G)−31(A>G)] in subjects no. 1 and 2 and [β−206(C>G)/ β−50(G>A)] in subject no. 3. Allele-specific amplification of the β−206(C>G) followed by DNA sequencing confirmed that the β−206(C>G) and β-thalassemia mutations are located in trans (Figure 2). The representative DNA sequencing profiles are shown in Supplementary Figure S1. α-Globin genotyping revealed that subject no. 1 also carried a heterozygous Hb Constant Spring (αCSα) gene. As compared to those of subjects with corresponding mutations in our series, we did not observe that co-inheritance of this β−206(C>G) with the β−31(A>G) β+-thalassemia has worsened the hematological phenotype as compared to the patients with β−31(A>G) β+-thalassemia alone.
Six subjects (no. 4–9) were found to be heterozygous for β−198(A>G), involving three subjects with normal Hb A2 levels (2.8–2.9%), and another three subjects with borderline Hb A2 levels (3.6–3.9%) (Table 2). Most of them had relatively normal hematological parameters, although three of them (no. 4, 7, and 9) also carried α+-thalassemia (3.7 kb deletion). Notably, it was found that three of them with borderline Hb A2 levels (no. 7–9) were carriers of three different KLF1 mutations (NC_000019.10), c.892G>C (A298P), c.525_526insGCGCCGG (G176AfsX179), and c.895C>G (H299D), previously described in Thailand [13,14].
As shown in Table 1, β−50(G>A) is the most common variant found in this study. It was detected as a heterozygous state in 9 subjects (no. 10–18), a compound heterozygote with β-thalassemia in 4 subjects (no. 19–22), and in compound heterozygosis with β-thalassemia and Hb E mutations in 5 subjects (no. 23–27). The β−50(G>A) and Hb E mutations are located in trans, as confirmed by allele-specific amplification and DNA sequencing (Figure 2). Most of the subjects with heterozygous β−50(G>A) had borderline Hb A2 levels (3.6–4.1%). An exception was noted in one case (no.10) with Hb A2 2.0% who was a carrier of α+-thalassemia (3.7 kb deletion). In this group of subjects, two of them (no. 15 and 16) also carried heterozygous KLF1 mutations, i.e., c.525_526insGCGCCGG and c.983G>A (R328H). It is expected that the four subjects with compound β−50(G>A)/β-thalassemia (no. 19–22) had a β-thalassemia trait phenotype with slightly decreased mean corpuscular volume (MCV) and mean corpuscular hemoglobin (MCH) levels. All (no. 24–27) but one case (no. 23) of compound β−50(G>A)E exhibited the phenotype of Hb E trait. This case, no. 23 with more severe anemia (Hb 8.0 g/dL), was in fact the patient with Hb H-ConSp disease in combination with the β−50(G>A)E genotype, known as the ConSpEABart’s disease, commonly encountered in Thai populations [15,16]. As for the β−206(C>G), we found that co-inheritance of this β−50(G>A) with β-thalassemia or Hb E has not worsened the hematological phenotype as compared to the patients with β-thalassemia or Hb E alone. In addition, when the β−50(G>A)E genotype (n = 4) was compared to the β−28(G>A)E in our series (n = 143), it was apparent that the latter had a more severe hematological phenotype, i.e., lower Hb, MCV, MCH levels, and higher red cell distribution width (RDW), Hb E, and Hb F levels significantly. The β−50(G>A)E subjects (n = 4) had no significant difference in hematological parameters from those of subjects with Hb E traits in our series (βEA) (n = 112), except for the higher Hb F level (Table 2). Some insignificant statistical differences in the two groups may be due to the small sample size.
Subjects no. 28–30 represented cases with rarer β-variants in this study, including βIVSII−180(T>C), βIVSII−258(G>A), and βIVSII−337(A>G). Although hematological parameters are not completely available, they all had borderline Hb A2 levels (3.6–3.8%). Further KLF1 gene analysis identified that all of them were carriers of KLF1 mutations, the c.892G>C (n = 2) and c.983G>A (n = 1). Molecular analysis of the last group (case no. 31–33), who had normal hematological features and Hb A2 levels (2.6–2.8%), identified that all of them were carriers of the β*233(G>C) variant [233 bp after stop codon of β-globin gene or TTS+99 (G>C)]. Only case no. 31 was found additionally to be a carrier of α+-thalassemia (3.7 kb deletion). In this study, we did not detect α-globin gene triplication (αααanti−3.7) in any of the subjects. Among 89 normal control subjects, three were found to be carriers of the β−198(A>G) and the β*233(G>C) variants, the data indicating neutral DNA polymorphisms of these two SNPs.
Table 3 summarizes allele frequencies of neutral and seven β-globin gene variants identified in this study, as compared globally using reference SNP (rs) reported on the National Center for Biotechnology Information (NCBI) (https://www.ncbi.nlm.nih.gov/). As shown in the table, high allele frequencies include βCD2(CAT>CAC), βIVSII−16(G>C), βIVSII−74(T>G), βIVSII−81(C>T), βIVSII−666(C>T), and β*233(G>C), whereas β−206(C>G), β−198(A>G), β−50(G>A), βIVSII−180(T>C), βIVSII−258(G>A), and βIVSII−337(A>G) represent those with low allele frequencies. In our study on normal Thai population, β−198(A>G), βCD2(CAT>CAC), βIVS II−16(G>C), βIVSII−74(T>G), βIVSII−81(C>T), βIVSII−666(C>T), and β*233(G>C) were detected at the allele frequencies of more than 1%, whereas those of β−206(C>G), β−50(G>A), βIVSII−180(T>C), βIVSII−258(G>A), and βIVSII−337(A>G) were not observed. It is most likely that these variants have very low frequencies in general populations. Supplementary Table S1 reveals no association between Hb A2 levels and variants found in the Thai population.
Table 4 summarizes the results of in silico functional prediction for the seven β-globin gene variants identified using SpliceAI, CADD, PhyloP, and PromoterAI. These analyses predict that β−206(C>G), β−198(A>G), βIVSII−180(T>C), βIVSII−258(G>A), βIVSII−337(A>G), and β*233(G>C) are associated with benign variants. The β−50(G>A) is also associated with benign variants by the SpliceAI, CADD, and PhyloP programs, but is found to be under-expressed with high sensitivity using the PromoterAI program. For the online TFBIND, many transcription factor binding sites were found to be altered in the promoter β-globin variants, particularly the CCAAT box in β−206(C>G), GATA-1 in β−198(A>G), and Sp1, GATA-1, and CP1 in β−50(G>A) (Supplementary Table S2). For splice site prediction, no difference was observed using the Neural Network in NNSPLICE 0.9 version and FGENESH for the βIVSII−180(T>C), βIVSII−258(G>A), βIVSII−337(A>G), and wild-type sequences.

3. Discussion

For globin genes, numerous single-nucleotide polymorphisms (SNPs) have been identified, particularly in the β-globin gene. The βCD2(CAT>CAC), βIVSII−16(G>C), βIVSII−74(T>G), βIVSII−81(C>T), and βIVSII−666 (C>T) are the common SNPs within the β-globin gene. During our routine thalassemia investigation, a novel and six known variants in the β-globin gene, including β−206(C>G), β−198(A>G), β−50(G>A), βIVSII−180(T>C), βIVSII−258(G>A), βIVSII−337(A>G), and β*233(G>C), were identified with genotype-phenotype inconsistency and need to be proven for appropriate variant classification. They are distributed and located in the promoter, intervening sequence, and 3ʹ end of the β-globin gene. Of these, the β−206(C>G) is a novel one. This novel β−206(C>G) and five known variants, including β−198(A>G), βIVSII−180(T>C), βIVSII−258(G>A), βIVSII−337(A>G), and β*233(G>C), should be considered as benign or likely benign variants due to the following reasons. First, the combination of the β−206(C>G) in trans to known β+-thalassemia mutations results in a β-thalassemia trait phenotype, which is distinct from the homozygous β-thalassemia phenotype, as compared in Table 2. Second, allele frequencies of the β−198(A>G) and the β*233(G>C) were more than 1% in normal control subjects, whereas no β−206(C>G), β−50(G>A), βIVSII−180(T>C), βIVSII−258(G>A), and βIVSII−337(A>G) were observed. It is most likely that the variants have very low frequencies in the general population. Third, all subjects with β−198(A>G), βIVSII−180(T>C), βIVSII−258(G>A), and βIVSII−337(A>G), who had borderline Hb A2 levels, carried KLF1 mutations known to be associated with increased Hb A2 levels [13,14]. Fourth, no difference in human splicing sites was predicted among the wild-type, βIVSII−180(T>C), βIVSII−258(G>A), and βIVSII−337(A>G). Fifth, previous studies have identified that β−198(A>G), βIVSII−180(T>C), βIVSII−258(G>A), βIVSII−337(A>G), and β*233(G>C) are associated with benign phenotypes, characterized by normal MCV, MCH, and Hb A2 levels in the heterozygotes [6,7,9,10,11,12]. Lastly, all functional prediction programs indicated that these six variants are benign or likely benign variants. However, according to the nucleotide changes, it is conceivable that the β−206(C>G) and β−198(A>G) may alter erythroid-specific transcription factor binding sites, especially the CCAAT box (CCAAG to CCAAC) for β−206(C>G) and homologous GATA-1 binding motif (AGATAT to AGATGT) for β−198(A>G), respectively (Supplementary Table S1).
This contrasts with the β−50(G>A), which was first reported in Chinese and later in Thai and Malaysian patients [8,11,17,18,19]. It is also the most common β-globin variant found in this study, being detected in 18 of 33 subjects (Table 1). This β−50(G>A) variant is located within the highly conserved direct repeat element (DRE), AGGGCAGGAGCCAGGGCTGGGC, between the CCAAT and TATA elements in the human β-globin promoter. It has been documented that heterozygous β−50(G>A) had microcytic hypochromic RBCs with increased Hb A2 (4.6%) and Hb F (1.2%) levels, and compound heterozygous of β−50(G>A) and β0-thalassemia revealed moderate microcytic hypochromic anemia with high Hb F levels (27.7%), and a β0+ thalassemia phenotype. This finding suggests that the β−50(G>A) is associated with a β+-thalassemia [8]. However, there is a study showing that most cases of heterozygous β−50(G>A) are associated with normochromic normocytic RBCs and normal Hb A2 levels [17]. Another study reported that, as compared to the wild-type, heterozygous β−50(G>A) had decreased MCV and MCH values and slightly elevated Hb A2 levels [18]. Moreover, compound heterozygous β−50(G>A) and β0-thalassemia revealed a β-thalassemia trait phenotype [18]. Our study favors the latter. As shown in Table 1, most heterozygous β−50(G>A) revealed microcytic hypochromic RBCs with borderline Hb A2 levels (3.6–4.1%), no matter with or without KLF1 mutation. However, in the β−50(G>A)0 and β−50(G>A)+ genotypes, all patients revealed a β-thalassemia trait phenotype, rather than a severe homozygous β-thalassemia phenotype. Similarly, our subjects with β−50(G>A)E had a very similar phenotype to that of the Hb E trait, rather than a β+E thalassemia phenotype (Table 2). It is noteworthy that no β−50(G>A) was identified among normal control subjects. In addition, most prediction tools indicate that the β−50(G>A) is associated with benign variants; however, PromoterAI revealed that the β−50(G>A) is associated with under-expression with high sensitivity. Moreover, altered transcription factor binding sites, particularly those of Sp1, GATA-1, and CP1, were identified. Accordingly, based on these data, our data support that the β−50(G>A) should be classified as a very mild β+-thalassemia allele, corresponding to the β++ or β+++ phenotype.
In conclusion, we have identified a novel and six known β-globin gene variants in Thai individuals. Based on genotype-phenotype correlation, in silico functional analysis, frequency of each variant in normal control group, as well as accumulated data from previous studies, it is concluded that six of them including β−206(C>G), β−198(A>G), βIVSII−180(T>C), βIVSII−258(G>A), βIVSII−337(A>G), and β*233(G>C) are benign β-globin gene variants or normal phenotype (non-β-thalassemia), whereas β−50(G>A) is a β++ or β+++ thalassemia allele. These data should prove useful for accurate variant pathogenicity classification, genetic counselling, and the prevention and control program of severe thalassemia diseases in the region.

4. Materials and Methods

4.1. Subjects and Hematological Analyses

Ethical approval of the study protocol on human research was obtained by the Institutional Review Board (IRB) of Khon Kaen University, Thailand (HE682074). Retrospective data in a total of 45,914 subjects encountered at our center for thalassemia diagnosis from January 2012 to December 2024 were reviewed. A total of 33 leftover DNA specimens with inconsistencies between genotype and phenotype were selectively recruited. Eighty-nine normal control subjects (178 alleles) with Hb type A2A, normal Hb A2 levels (2.2–3.2%), and normal hematological parameters (non-anemia, MCV of 80–100 fL, and MCH of 27–33 pg) were also analyzed to confirm phenotypic expression of the variants. The β-thalassemia allele should not be identified in this group of normal control subjects. Hematological parameters were recorded on standard blood cell counters. Hb analysis was carried out by using capillary electrophoresis (Capillarys II Flex Piercing, Sebia, France) or high-performance liquid chromatography (HPLC) (VariantTM, Bio-Rad Laboratory, Hercules, CA, USA).

4.2. Molecular Analysis

Common β-thalassemia genes, α-thalassemia mutations (--SEA, --THAI, −α3.7, −α4.2, Hb Constant Spring [HBA2:c.427T>C], and Hb Paksé [HBA2:c.429A>T]), α-globin gene triplication (αααanti−3.7), and KLF1 mutations previously described in Thailand were characterized by PCR-based methods as described previously [3,13,14,20,21]. DNA sequencing of the whole β-globin gene was done using an ABI PRISMTM 3730 XL analyzer (Applied Biosystems, Foster City, CA, USA) and Barcode-tagged sequencing (BTSeqTM, Celemics, Korea).

4.3. Molecular Confirmation of the β−206(C>G)) and β−50(G>A)

To confirm the configurations (in cis or in trans location) of the β−206(C>G) and the β−50(G>A), found in the subject, allele-specific amplification was performed. The forward primers G220 (5′-TGTACTGATGGTATGGGGCG-3′) for β−206(C>G) specific allele and G118 (5′-ACTCCCAGGAGCAGGGAGGA-3′) for β−50(G>A) specific allele were amplified separately in two allele-specific PCR conditions with the common reverse primer S3 (5′-TCCCATAGACTCACCCTGAA-3′) to produce specific fragments of 734 bp and 580 bp, respectively. Normal subjects (βAA) were used as a negative control. Allele-specific DNA sequencing of the amplified fragment was then carried out.

4.4. Transcription Factor Binding Site Prediction for β−206(C>G), β−198(A>G), and β−50(G>A)

Transcription factor binding site prediction for β−206(C>G), β−198(A>G), and β−50(G>A) was examined by using online TFBIND (https://tfbind.hgc.jp/, accessed 24 April 2025) [22]. The similarity score related to the transcription factor binding affinity of the variant sequences was compared to the wild-type sequences (Supplementary Table S2).

4.5. Splice Site Prediction of βIVSII−180(T>C), βIVSII−258(G>A), and βIVSII−337(A>G)

Human splice sites and the potential genes and exons in the human genomic DNA of βIVSII−180(T>C), βIVSII−258(G>A), and βIVSII−337(A>G) as compared to wild-type were predicted by Neural Network in NNSPLICE 0.9 version (https://www.fruitfly.org/seq_tools/splice.html, accessed 24 April 2025) [23] and the FGENESH 2.6 program (http://www.softberry.com/berry.phtml, accessed 24 April 2025) [24].

4.6. In Silico Functional Study Prediction for the Seven β-Globin Variants

In silico functional study prediction for pathogenicity classification of the seven β-globin variants found was analyzed by SpliceAI, Combined Annotations Dependent Depletion (CADD), PhyloP, and PromoterAI (https://spliceailookup.broadinstitute.org/, accessed 24 April 2025) [25,26,27].

4.7. Statistical Analysis

Hematological data were analyzed using the STATATM version 18 (Stata Corp, College Station, TX, USA). Non-parametric statistic was selected due to the non-normal distribution of data. The difference between the two independent groups was compared by the Mann–Whitney U test, with statistical significance defined as p-value < 0.05.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms26188872/s1.

Author Contributions

Conceptualization, K.S. and S.F.; Methodology, K.S., A.P., G.F. and S.F.; Validation, G.F. and S.F.; Formal Analysis, K.S., A.P., G.F. and S.F.; Investigation, K.S., A.P., G.F. and S.F.; Resources, G.F. and S.F.; Data Curation, K.S. and S.F; Writing—Original Draft Preparation, K.S.; Writing—Review and Editing, K.S., A.P., G.F. and S.F.; Visualization, K.S., A.P., G.F. and S.F.; Supervision, S.F.; Project Administration, G.F. and S.F.; Funding Acquisition, K.S. and S.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Khon Kaen University, Thailand, to S.F. (Contract ID: RP68-Research Center KKU), and Genomics Thailand, the Health System Research Institute (Contract ID: HSRI 68-049), and Faculty of Medicine, Mahasarakham University, Thailand, to K.S.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki and approved by the Institutional Review Board (IRB) of Khon Kaen University, Thailand (HE682074). The approval date was 1 April 2025.

Informed Consent Statement

Patient consent was waived by the IRB of Khon Kaen University because this is a retrospective study of existing data and biological specimens without further prospective data collection from or direct interactions with the subjects. Based on the CIOMS guidelines, the IRB of Khon Kaen University has considered that this research would not be feasible to carry out without the waiver and that this study involves no more than minimal risks to participants. Waiving the informed consent does not affect the rights and welfare of the participants.

Data Availability Statement

All relevant data are within the paper and its Supplementary Materials. Further inquiries can be directed to the corresponding authors.

Conflicts of Interest

The authors have declared that no competing interests exist.

References

  1. Taher, A.T.; Weatherall, D.J.; Cappellini, M.D. Thalassaemia. Lancet 2018, 391, 155–167. [Google Scholar] [CrossRef] [PubMed]
  2. Kountouris, P.; Lederer, C.W.; Fanis, P.; Feleki, X.; Old, J.; Kleanthous, M. IthaGenes: An interactive database for haemoglobin variations and epidemiology. PLoS ONE 2014, 9, e103020. [Google Scholar] [CrossRef] [PubMed]
  3. Yamsri, S.; Sanchaisuriya, K.; Fucharoen, G.; Sae-Ung, N.; Ratanasiri, T.; Fucharoen, S. Prevention of severe thalassemia in northeast Thailand: 16 years of experience at a single university center. Prenat. Diagn. 2010, 30, 540–546. [Google Scholar] [CrossRef]
  4. Kountouris, P.; Stephanou, C.; Lederer, C.W.; Traeger-Synodinos, J.; Bento, C.; Harteveld, C.L.; Fylaktou, E.; Koopmann, T.T.; Halim-Fikri, H.; Michailidou, K.; et al. Adapting the ACMG/AMP variant classification framework: A perspective from the ClinGen Hemoglobinopathy Variant Curation Expert Panel. Hum. Mutat. 2022, 43, 1089–1096. [Google Scholar] [CrossRef]
  5. Tamana, S.; Xenophontos, M.; Minaidou, A.; Stephanou, C.; Harteveld, C.L.; Bento, C.; Traeger-Synodinos, J.; Fylaktou, I.; Yasin, N.M.; Abdul Hamid, F.S.; et al. Evaluation of in silico predictors on short nucleotide variants in HBA1, HBA2, and HBB associated with haemoglobinopathies. eLife 2022, 11, e79713. [Google Scholar] [CrossRef]
  6. Bayramov, B.; Aliyeva, G.; Asadov, C.; Mammadova, T.; Karimova, N.; Eynullazadeh, K.; Gafarova, S.; Akbarov, S.; Farhadova, S.; Safarzadeh, Z.; et al. A novel frameshift mutation at codon 2 (-T) (HBB: C.9delT) and first report of three new β-globin mutations from Azerbaijan. Hemoglobin 2019, 43, 280–282. [Google Scholar] [CrossRef]
  7. Luo, S.; Chen, X.; Zeng, D.; Tang, N.; Yuan, D.; Zhong, Q.; Mao, A.; Xu, R.; Yan, T. The value of single-molecule real-time technology in the diagnosis of rare thalassemia variants and analysis of phenotype-genotype correlation. J. Hum. Genet. 2022, 67, 183–195. [Google Scholar] [CrossRef]
  8. Li, D.Z.; Liao, C.; Xie, X.M.; Zhou, J.Y. A novel mutation of −50 (G-->A) in the direct repeat element of the beta-globin gene identified in a patient with severe beta-thalassemia. Ann. Hematol. 2009, 88, 1149–1150. [Google Scholar] [CrossRef]
  9. Sirdah, M.M.; Sievertsen, J.; Al-Yazji, M.S.; Tarazi, I.S.; Al-Haddad, R.M.; Horstmann, R.D.; Timmann, C. The spectrum of β-thalassemia mutations in Gaza Strip, Palestine. Blood Cells Mol. Dis. 2013, 50, 247–251. [Google Scholar] [CrossRef]
  10. Smith, D.L.; Mitui, M.; Park, J.Y.; Luu, H.S.; Timmons, C.F. Characterization of the HBB: C.*233G > C variant: No evidence of a β-thalassemic phenotype. Hemoglobin 2016, 40, 25–28. [Google Scholar] [CrossRef] [PubMed]
  11. Mahabhol, C.; Yothindamrongkul, L.; Nuntanajaroenkul, N.; Nawasod, P.; Tepakhan, W.; Karnpean, R.; Sowithayasakul, P.; Makruasi, N.; Trongwongsa, T.; Jomoui, W. The validation of whole β-globin gene sequencing for detecting β-thalassemia mutations found in Thailand using Next-Generation Sequencing (NGS). Ann. Hematol. 2025, 104, 2653–2661. [Google Scholar] [CrossRef] [PubMed]
  12. Liu, Q.; Chen, Q.; Zhang, Z.; Peng, S.; Liu, J.; Pang, J.; Jia, Z.; Xi, H.; Li, J.; Chen, L.; et al. Identification of rare thalassemia variants using third-generation sequencing. Front. Genet. 2023, 13, 1076035. [Google Scholar] [CrossRef]
  13. Srivorakun, H.; Thawinan, W.; Fucharoen, G.; Sanchaisuriya, K.; Fucharoen, S. Thalassemia and erythroid transcription factor KLF1 mutations associated with borderline hemoglobin A2 in the Thai population. Arch. Med. Sci. 2020, 18, 112–120. [Google Scholar] [CrossRef]
  14. Singha, K.; Teawtrakul, N.; Fucharoen, G.; Fucharoen, S. Molecular and haematological characterisation of haemolytic anaemia associated with biallelic KLF1 mutations: A case series. J. Clin. Pathol. 2024, 77, 783–789. [Google Scholar] [CrossRef]
  15. Songdej, D.; Teawtrakul, N.; Laoaroon, N.; Komvilaisak, P.; Sripornsawan, P.; Surapolchai, P.; Hantaweepant, C.; Tantiworawit, A.; Hantrakool, S.; Lauhasurayotin, S.; et al. Impact of HbE mutation on the clinical severity of HbH disease: A multicentre study from Thailand. Br. J. Haematol. 2025, 206, 703–712. [Google Scholar] [CrossRef]
  16. Jomoui, W.; Tepakhan, W.; Satthakarn, S.; Panyasai, S. Molecular spectrum of Hb H disease and characterization of rare deletional α-thalassemia found in Thailand. Scand. J. Clin. Lab. Investig. 2020, 80, 528–535. [Google Scholar] [CrossRef]
  17. Zhao, Y.; Jiang, F.; Li, D.Z. Hematological characteristics of β-globin gene mutation −50 (G>A) (HBB: C.-100G>A) carriers in mainland China. Hemoglobin 2020, 44, 240–243. [Google Scholar] [CrossRef]
  18. Long, J.; Liu, E. Identification of the β thalassemia allele β−50 and analysis of the hematology data of carriers in a southern Chinese population. Ann. Hum. Genet. 2022, 86, 63–70. [Google Scholar] [CrossRef]
  19. Mohd Yasin, N.; Abdul Hamid, F.S.; Hassan, S.; Mat Yusoff, Y.; Mohd Sahid, E.N.; Esa, E. An insight of −50 (G>A) mutation in the direct repeat element of the β-globin gene: From Malaysian perspective. Malays. J. Pathol. 2022, 44, 301–302. [Google Scholar] [PubMed]
  20. Charoenwijitkul, T.; Singha, K.; Fucharoen, G.; Sanchaisuriya, K.; Thepphitak, P.; Wintachai, P.; Karnpean, R.; Fucharoen, S. Molecular characteristics of α+-thalassemia (3.7 kb deletion) in Southeast Asia: Molecular subtypes; haplotypic heterogeneity; multiple founder effects and laboratory diagnostics. Clin. Biochem. 2019, 71, 31–37. [Google Scholar] [CrossRef] [PubMed]
  21. Chaibunruang, A.; Prommetta, S.; Yamsri, S.; Fucharoen, G.; Sae-Ung, N.; Sanchaisuriya, K.; Fucharoen, S. Molecular and hematological studies in a large cohort of α(0)-thalassemia in northeast Thailand: Data from a single referral center. Blood Cells Mol. Dis. 2013, 51, 89–93. [Google Scholar] [CrossRef]
  22. Tsunoda, T.; Takagi, T. Estimating transcription factor bindability on DNA. Bioinformatics 1999, 15, 622–630. [Google Scholar] [CrossRef] [PubMed]
  23. Reese, M.G.; Eeckman, F.H.; Kulp, D.; Haussler, D. Improved splice site detection in Genie. J. Comput. Biol. 1997, 4, 311–323. [Google Scholar] [CrossRef] [PubMed]
  24. Solovyev, V.; Kosarev, P.; Seledsov, I.; Vorobyev, D. Automatic annotation of eukaryotic genes, pseudogenes and promoters. Genome. Biol. 2006, 7 (Suppl. 1), S10. [Google Scholar] [CrossRef]
  25. de Sainte Agathe, J.M.; Filser, M.; Isidor, B.; Besnard, T.; Gueguen, P.; Perrin, A.; Van Goethem, C.; Verebi, C.; Masingue, M.; Rendu, J.; et al. SpliceAI-visual: A free online tool to improve SpliceAI splicing variant interpretation. Hum. Genom. 2023, 17, 7. [Google Scholar] [CrossRef] [PubMed]
  26. Pejaver, V.; Byrne, A.B.; Feng, B.J.; Pagel, K.A.; Mooney, S.D.; Karchin, R.; O’Donnell-Luria, A.; Harrison, S.M.; Tavtigian, S.V.; Greenblatt, M.S.; et al. Calibration of computational tools for missense variant pathogenicity classification and ClinGen recommendations for PP3/BP4 criteria. Am. J. Hum. Genet. 2022, 109, 2163–2177. [Google Scholar] [CrossRef]
  27. Bergquist, T.; Stenton, S.L.; Nadeau, E.A.W.; Byrne, A.B.; Greenblatt, M.S.; Harrison, S.M.; Tavtigian, S.V.; O’Donnell-Luria, A.; Biesecker, L.G.; Radivojac, P.; et al. Calibration of additional computational tools expands ClinGen recommendation options for variant classification with PP3/BP4 criteria. Genet. Med. 2025, 27, 101402. [Google Scholar] [CrossRef]
Figure 1. DNA sequencing profiles of a novel and six known β-globin variants, demonstrating β−206(C>G), β−198(A>G), β−50(G>A), βIVSII−180(T>C), βIVSII−258(G>A), βIVSII−337(A>G), and β*233(G>C). Abbreviation: HBB, hemoglobin subunit beta.
Figure 1. DNA sequencing profiles of a novel and six known β-globin variants, demonstrating β−206(C>G), β−198(A>G), β−50(G>A), βIVSII−180(T>C), βIVSII−258(G>A), βIVSII−337(A>G), and β*233(G>C). Abbreviation: HBB, hemoglobin subunit beta.
Ijms 26 08872 g001
Figure 2. Allele-specific amplification of β−206(C>G) (A) and β−50(G>A) (B). The specific β−206(C>G) allele was generated using primers G220 and S3 (A), and the β−50(G>A) allele was generated using primers S118 and S3 (B), to produce specific fragments of 734 bp and 580 bp, respectively. Normal subjects (βAA) were used as a negative control for the two conditions. M represents the GeneRulerTM 50 bp DNA ladder. Lanes 1, 2, 5, and 6: normal control subjects (βAA) in the two conditions, lanes 3 and 4: β−206(C>G) positive subjects, and lanes 7 and 8: β−50(G>A) positive subjects.
Figure 2. Allele-specific amplification of β−206(C>G) (A) and β−50(G>A) (B). The specific β−206(C>G) allele was generated using primers G220 and S3 (A), and the β−50(G>A) allele was generated using primers S118 and S3 (B), to produce specific fragments of 734 bp and 580 bp, respectively. Normal subjects (βAA) were used as a negative control for the two conditions. M represents the GeneRulerTM 50 bp DNA ladder. Lanes 1, 2, 5, and 6: normal control subjects (βAA) in the two conditions, lanes 3 and 4: β−206(C>G) positive subjects, and lanes 7 and 8: β−50(G>A) positive subjects.
Ijms 26 08872 g002
Table 1. Hb analysis, hematological parameters, and molecular analysis of subjects with seven β-globin gene variants.
Table 1. Hb analysis, hematological parameters, and molecular analysis of subjects with seven β-globin gene variants.
NoSex aHb TypeHb A2
(%)
Hb E + A2
(%)
Hb F
(%)
RBC
(1012/L)
Hb
(g/dL)
Hct
(%)
MCV
(fL)
MCH
(pg)
MCHC
(g/dL)
RDW
(%)
β-Globin Genotypeα-Globin GenotypeKLF1 Mutation
Identified b
1MA2A4.4-0.66.016.246.277.427.135.114.8β−206(C>G)−31(A>G) αCSα/αα-
2MA2A4.5-06.113.339.164.521.934.018.3β−206(C>G)−31(A>G) αα/αα-
3FA2A5.1-0.75.310.934.765.220.431.318.6β−206(C>G)−50(G>A)αα/αα-
4FA2A2.9-04.712.938.783.027.533.212.1β−198(A>G)A−α3.7/αα-
5MA2A2.9-04.914.242.186.829.233.613.2β−198(A>G)Aαα/αα-
6MA2A2.8-04.815.143.890.831.434.612.4β−198(A>G)Aαα/αα-
7MA2A3.6-0NaNaNa78.425.532.5Naβ−198(A>G)A−α3.7/ααA298P
8MA2A3.6-2.2NaNaNa86.628.833.3Naβ−198(A>G)Aαα/ααG176AfsX179
9FA2A3.9-2.45.813.642.272.823.432.2Naβ−198(A>G)A−α3.7/ααH299D
10FA2(F)A2.0-8.64.711.235.575.123.731.617.3β−50(G>A)A−α3.7/αα-
11MA2A3.6-0.3NaNaNaNaNaNaNaβ−50(G>A)ANa-
12FA2A3.6-1.5NaNaNa70.322.732.3Naβ−50(G>A)Aαα/αα-
13FA2A3.7-0.6NaNaNa80.726.933.3Naβ−50(G>A)A−α3.7/αα-
14MA2A3.8-0.35.214.041.079.027.034.214.4β−50(G>A)Aαα/αα-
15FA2A3.9-1.14.812.136.075.025.233.6Naβ−50(G>A)A−α3.7/ααG176AfsX179
16FA2A3.9-5.25.412.540.073.723.131.315.1β−50(G>A)A−α3.7/ααR328H
17FA2A4.1-0NaNaNa74.023.031.1Naβ−50(G>A)ANa-
18MA2A4.1-2.35.514.342.677.526.033.6Naβ−50(G>A)ANa-
19FA2A5.0-1.25.512.037.066.221.732.815.4β−50(G>A)Malayαα/αα-
20FA2A5.5-05.211.234.065.121.633.214.1β−50(G>A)41/42(-TTCT)αα/αα-
21FA2A6.1-1.74.48.326.961.018.931.017.6β−50(G>A)IVSI−1(G>T)αα/αα-
22MA2A6.2-0.65.912.940.067.420.632.4Naβ−50(G>A)IVSI−1(G>T)−α3.7/αα-
23MCSEABart’s2.316.31.75.18.029.257.715.827.423.5β−50(G>A)E--SEACSα-
24ME(F)A-27.76.86.214.244.772.022.731.518.2β−50(G>A)E−α4.2/αα-
25FEA-28.23.35.013.340.079.526.533.315.0β−50(G>A)Eαα/αα-
26FE(F)A-29.76.2NaNaNaNaNaNaNaβ−50(G>A)Eαα/αα-
27MEA3.431.51.1NaNaNa75.0NaNaNaβ−50(G>A)ENa-
28FA2A3.6-0NaNaNa74.924.532.7NaβIVSII−180(T>C)Aαα/ααA298P
29FA2A3.8-0NaNaNaNaNaNaNaβIVSII−258(G>A)Aαα/ααA298P
30FA2A3.6-5.0NaNaNa78.5NaNaNaβIVSII−337(A>G)A−α3.7/ααR328H
31FA2A2.8-04.012.338.194.330.632.413.7β*233(G>C)A−α3.7/αα-
32MA2A2.6-05.415.645.083.328.934.712.8β*233(G>C)Aαα/αα-
33FA2A2.7-04.212.436.386.329.434.113.3β*233(G>C)Aαα/αα-
a All subjects are adults, except subject no. 26, who is 1.5 years old. b KLF1 mutation found in a heterozygous state. Na: not available due to the lack of the indicated hematological data. Abbreviation: RBC, red blood cell; Hct, hematocrit; MCHC, mean corpuscular hemoglobin concentration; RDW, red cell distribution width.
Table 2. Hematological parameters and Hb analysis of β−206(C>G) and β−50(G>A) in heterozygotes and their combinations compared to other β-hemoglobinopathies.
Table 2. Hematological parameters and Hb analysis of β−206(C>G) and β−50(G>A) in heterozygotes and their combinations compared to other β-hemoglobinopathies.
β-Globin GenotypenHb A2
(%)
Hb E + A2
(%)
Hb F
(%)
RBC
(1012/L)
Hb
(g/dL)
MCV
(fL)
MCH
(pg)
RDW
(%)
β−206(C>G)−31(A>G) 24.4, 4.5-0.6, 06.0, 6.1 a16.2, 13.377.4, 64.527.1, 21.914.8, 18.3
βA−31(A>G) 414.7 ± 0.4-1.2 ± 0.15.0 ± 0.7 a12.9 ± 1.577.4 ± 4.525.4 ± 1.314.7 ± 1.2
β−50(G>A)A93.6 ± 0.3 b-2.2 ± 2.9 b5.1 ± 0.412.8 ± 1.3 b75.7 ± 3.3 b24.7 ± 1.8 b15.6 ± 1.5 b
βAA892.8 ± 0.2 b-0.1 ± 0.2 b5.0 ± 0.514.8 ± 1.6 b88.8 ± 3.9 b29.8 ± 1.4 b12.9 ± 0.9 b
β−50(G>A)Malay15.0-1.25.5312.066.221.715.4
βAMalay 294.4 ± 0.4-1.3 ± 1.45.2 ± 0.812.0 ± 1.670.2 ± 5.123.4 ± 2.515.6 ± 1.8
β−50(G>A)035.9 ± 0.4-0.8 ± 0.95.2 ± 0.810.8 ± 2.364.5 ± 3.220.4 ± 1.415.9 ± 2.5
βA0 3095.7 ± 0.7-1.4 ± 1.15.5 ± 0.911.4 ± 1.963.6 ± 4.020.6 ± 1.817.2 ± 2.0
β−50(G>A)E4-29.3 ± 1.7 d4.4 ± 2.7 c,d5.6 ± 0.813.8 ± 0.6 d75.5 ± 3.8 d24.6 ± 2.7 d16.6 ± 2.3 d
βAE 112-29.1 ± 3.01.4 ± 1.3 c5.1 ± 0.913.0 ± 1.777.3 ± 5.225.8 ± 2.214.7 ± 1.2
β−28(G>A)E 143-56.2 ± 6.7 d15.0 ± 8.2 d5.1 ± 0.99.7 ± 1.4 d60.8 ± 5.8 d19.2 ± 1.7 d21.9 ± 3.3 d
a Significant difference between β−206(C>G)−31(A>G) and βA−31(A>G). b Significant difference between β−50(G>A)A and βAA. c Significant difference between β−50(G>A)E and βAE. d Significant difference between β−50(G>A)E and β−28(G>A)E.
Table 3. β-globin variants in Thai normal control and global subjects.
Table 3. β-globin variants in Thai normal control and global subjects.
VariantsHGVS Name
(HBB)
RS NumberNormal Thai SubjectsGlobal (gnomAD v4-Genomes)
NReference AlleleAlternative AlleleNReference AlleleAlternative Allele
−206(C>G)c.−256C>Grs3760053601781.00 (n = 178)0.00 (n = 0)149,2100.9999060.000094
−198(A>G)c.−248A>Grs763063581780.9831 (n = 175)0.0169 (n = 3)149,2760.9995710.000429
−50(G>A)c.−100G>Ars2818645241781.00 (n = 178)0.00 (n = 0)149,2760.9999670.000033
CD 2 (CAT>CAC)c.9T>Crs7130401780.4213 (n = 75)0.5787 (n = 103)149,1340.1847940.815206
IVS II-16 (G>C)c.315 + 16G>Crs107686831780.4213 (n = 75)0.5787 (n = 103)149,0860.1825190.817481
IVS II-74 (T>G)c.315 + 74T>Grs74805261780.7921 (n = 141)0.2079 (n = 37)149,0200.5605890.439411
IVS II-81 (C>T)c.315 + 81C>Trs79467481780.9831 (n = 175)0.0169 (n = 3)149,1680.8938910.106109
IVS II-180 (T>C)c.315 + 180T>Crs5299311341781.00 (n = 178)0.00 (n = 0)149,3040.9999060.000094
IVS II-258 (G>A)c.315 + 258G>Ars10294102901781.00 (n = 178)0.00 (n = 0)145,5360.9999930.000007
IVS II-337 (A>G)c.315 + 337A>Grs5612585711781.00 (n = 178)0.00 (n = 0)149,1460.9999800.000020
IVS II-666 (C>T)c.316 − 185C>Trs16098121780.4157 (n = 74)0.5843 (n = 104)149,1280.1850560.814944
*233(G>C)c.*233G>Crs127880131780.9831 (n = 175)0.0169 (n = 3)149,1500.8939790.106021
Table 4. The online functional predictions, including SpliceAI, CADD, Phylop, and PromoterAI, of the seven β-globin variants.
Table 4. The online functional predictions, including SpliceAI, CADD, Phylop, and PromoterAI, of the seven β-globin variants.
VariantsSpliceAI aCADD bPhyloP bPromoterAI c
−206(C>G)0.022.08 (Moderate Benign)−0.029 (Moderate Benign)−0.03
−198(A>G)0.012.75 (Moderate Benign)0.427 (Supporting Benign)−0.03
−50(G>A)0.0111.9 (Moderate Benign)1.74 (Supporting Benign)−0.13
IVS II-180 (T>C)0.015.22 (Moderate Benign)0.7 (Supporting Benign)-
IVS II-258 (G>A)00.234 (Moderate Benign)−0.341 (Moderate Benign)-
IVS II-337 (A>G)00.607 (Moderate Benign)0.002 (Moderate Benign)-
TTS +99 (G>C)0.117.73 (Moderate Benign)−0.836 (Moderate Benign)-
a Delta scores range from 0 to 1 and can be interpreted as the probability that the variant affects splicing at any position within a window around it (+/−500bp by default). The SpliceAI score is provided for 0.2 (high recall), 0.5 (recommended), and 0.8 (high precision) cutoffs. b CADD and PhyloP scores are based on thresholds and points to predict clinical variant classification. c PromoterAI scores range from −1 to 1, with 0 meaning no activity. Negative values represent under-expression, and positive values represent over-expression. A threshold of +/−0.1 is used for high sensitivity, and +/−0.5 for high precision.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Singha, K.; Pansuwan, A.; Fucharoen, G.; Fucharoen, S. Genotype-Phenotype Correlation of Seven Known and Novel β-Globin Gene Variants. Int. J. Mol. Sci. 2025, 26, 8872. https://doi.org/10.3390/ijms26188872

AMA Style

Singha K, Pansuwan A, Fucharoen G, Fucharoen S. Genotype-Phenotype Correlation of Seven Known and Novel β-Globin Gene Variants. International Journal of Molecular Sciences. 2025; 26(18):8872. https://doi.org/10.3390/ijms26188872

Chicago/Turabian Style

Singha, Kritsada, Anupong Pansuwan, Goonnapa Fucharoen, and Supan Fucharoen. 2025. "Genotype-Phenotype Correlation of Seven Known and Novel β-Globin Gene Variants" International Journal of Molecular Sciences 26, no. 18: 8872. https://doi.org/10.3390/ijms26188872

APA Style

Singha, K., Pansuwan, A., Fucharoen, G., & Fucharoen, S. (2025). Genotype-Phenotype Correlation of Seven Known and Novel β-Globin Gene Variants. International Journal of Molecular Sciences, 26(18), 8872. https://doi.org/10.3390/ijms26188872

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop