Next Article in Journal
Radiomic-Based Machine Learning Classifiers for HPV Status Prediction in Oropharyngeal Cancer: A Systematic Review and Meta-Analysis
Previous Article in Journal
Diagnostic Accuracy of an Offline CNN Framework Utilizing Multi-View Chest X-Rays for Screening 14 Co-Occurring Communicable and Non-Communicable Diseases
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A New Method for Screening Thalassemia Patients Using Mid-Infrared Spectroscopy

by
Kanjana Thumanu
1,*,
Tanaporn Khamgasem
1,
Somsamorn Sukpong
2,
Rungrueang Phatthanakun
1,
Rawiwan Puangplruk
3,
Waraporn Tanthanuch
1,
Buabarn Kuaprasert
1,
Sukanya Tastub
1,
Roengrut Rujanakraikarn
1,
Saitip Tun
3,
Tassanee Saovana
4,
Thongperm Munkongdee
5 and
Sujintana Wongthong
6,*
1
Synchrotron Light Research Institute (Public Organization), 111 University Avenue, Mueang District, Nakhon Ratchasima 30000, Thailand
2
Blood Testing Section, Regional Blood Centre 5th, The Thai Red Cross Society, Nakhon Ratchasima 30000, Thailand
3
Regional Health Promotion Center 9 Nakhon Ratchasima, 177 Moo 6, Khok Kruat Subdistrict, Mueang District, Nakhon Ratchasima 30000, Thailand
4
Medical Technology, Faculty of Allied Health Sciences, Nakhon Ratchasima College, 290 Moo 2, Mittraphap Road, Ban Mai, Mueang District, Nakhon Ratchasima 30000, Thailand
5
Thalassemia Research Center, Institute of Molecular Biosciences, Mahidol University, Nakhon Pathom 73170, Thailand
6
Faculty of Public Health, Mahasarakham University, Khamriang Subdistrict, Kantarawichai District, Mahasarakham 44150, Thailand
*
Authors to whom correspondence should be addressed.
Diagnostics 2026, 16(1), 67; https://doi.org/10.3390/diagnostics16010067
Submission received: 30 October 2025 / Revised: 25 November 2025 / Accepted: 16 December 2025 / Published: 24 December 2025
(This article belongs to the Section Clinical Laboratory Medicine)

Abstract

Background/Objectives: Thalassemia is an inherited blood disorder caused by defects in hemoglobin production, where an imbalance or reduction in globin-chain synthesis impairs normal red cell development and results in anemia of varying severity. The disease is classified into α-thalassemia and β-thalassemia according to the affected globin genes. In recent years, infrared (IR) Microspectroscopy has gained increasing attention for blood analysis because it is rapid, label-free, and capable of detecting subtle biochemical alterations. Method: In this study, we analyzed hemoglobin lysate collected from clinically characterized normal, carriers, and thalassemia patients (n = 333) using IR Microspectroscopy combined with multivariate statistical methods, including Principal Component Analysis (PCA) and Partial Least Squares Discriminant Analysis (PLS-DA). This approach enabled us to investigate how spectral features correspond to disease status across a range of genotypes commonly encountered in clinical practice. Results: Clear intergroup spectral differences were observed, and the classification models demonstrated diagnostic performance with sensitivity and specificity of approximately 80–90%. Because the technique is non-destructive, requires no chemical reagents, and allows direct biochemical profiling of hemoglobin, it offers practical advantages over conventional hematologic or molecular assays. Conclusions: These findings support the potential of IR-based spectral analysis as a complementary tool for thalassemia screening. Looking ahead, incorporating advanced machine learning algorithms with IR Microspectroscopy may further enhance early detection, improve risk stratification, and strengthen prevention and management strategies in routine clinical workflows.

Graphical Abstract

1. Introduction

Thalassemia is an inherited hematologic disorder characterized by impaired synthesis of globin chains, the protein components of hemoglobin responsible for oxygen transport throughout the body. The disease follows an autosomal recessive inheritance pattern and is highly prevalent in Thailand. Clinically, thalassemia can be categorized into two principal groups: the Disease group, comprising affected patients, and the Carrier group, consisting of individuals who harbor thalassemia mutations without manifesting symptoms. These groups display varying degrees of severity and distinct clinical characteristics. Severe forms may lead to fetal hydrops and perinatal death, whereas moderate and mild cases commonly present with mild anemia and hepatosplenomegaly, which may worsen during febrile episodes. Although asymptomatic, individuals in the Carrier group can transmit the mutated gene to their offspring [1,2,3,4,5].
In Thailand, the combined prevalence of the Disease and Carrier groups is estimated to reach up to 40% of the screened population, posing a significant public health challenge. Current screening programs primarily target married couples; however, the diagnostic workflow remains costly and labor-intensive. The development of more accessible, rapid, and cost-effective screening methods is therefore essential to strengthen national healthcare capacity [6,7,8,9].
The aim of this study is to establish a practical and cost-efficient diagnostic strategy for thalassemia screening by identifying and classifying biochemical alterations in hemoglobin lysate samples from normal, carrier, and disease + symptom groups using infrared (IR) Microspectroscopy. This vibrational spectroscopic technique generates a distinctive molecular fingerprint that reflects the chemical composition and structural organization of biomolecules such as proteins, lipids, and nucleic acids [10,11]. IR Microspectroscopy offers multiple advantages: it is non-destructive, label-free, inexpensive, rapid, and requires minimal sample preparation without chemical reagents [12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27]. The acquired spectral data will be analyzed using artificial intelligence (AI)and multivariate statistical techniques to develop predictive classification models and an IR spectral database capable of accurately differentiating thalassemia types [6,28,29,30]. Thalassemia comprises several clinically relevant subtypes including β-thalassemia (β0 and β+), α-thalassemia (α0 and α+), hemoglobin H disease (HbH), and hemoglobin E (HbE) and reliable classification is crucial for genetic counseling, family planning, and clinical management, especially among carrier couples [31,32,33,34].
In this study, IR Microspectroscopy combined with Principal Component Analysis (PCA) and Partial Least Squares Discriminant Analysis (PLS-DA) will be used to establish a comprehensive spectral database and construct mathematical models capable of distinguishing biochemical severity between carrier and disease + symptom groups. Model performance will be assessed through sensitivity, specificity, and predictive accuracy, providing a solid foundation for integrating spectroscopic-based diagnostics into clinical thalassemia screening programs.

2. Materials and Methods

The study participants were categorized into three primary groups—Normal, Carrier, and Disease + Symptom—based on integrated hematological and genotypic assessments. Each primary group was subsequently divided into more specific genotypic or phenotypic subgroups, determined by molecular characteristics verified through hemoglobin typing and/or DNA analysis. A comprehensive overview of all subgroup classifications is provided in Table 1, along with the corresponding clinical classification illustrated in Figure 1, Table 2 and Supplementary Tables S1–S12.

2.1. Analysis of Blood Samples Using IR Microspectroscopy

Blood samples were analyzed using a Fourier Transform Infrared (FTIR) spectrometer (Tensor 27, Bruker Optics, Ettlingen, Germany) coupled with an IR Microscope (Hyperion 3000, Bruker Optics) equipped with an MCT detector. Spectral acquisition and instrument control were performed using OPUS 7.5 software (Bruker Optics, Germany). Measurements were conducted in transmission mode using a 15X objective lens (Bruker Optics, Ettlingen, Germany,) with a spectral resolution of 4 cm−1 and a background scan of 64 scans. The spectral range was recorded between 4000 and 600 cm−1, and each group of samples was prepared by placing three drops of the sample on the IR window for three replications, resulting in a total of about 45 spectra/sample to represent each patient. After that, spectra were analyzed to remove outliers within the groups by performing the same data preprocessing protocol using the Unscrambler 10.1

2.2. Spectral Preprocessing and Multivariate Data Analysis

All spectra were subjected to data preprocessing prior to analysis. The data preprocessing was conducted using baseline corrections and then normalized using EMSC within the spectral regions from 3171 to 2788 cm−1 and from 1750 to 800 cm−1. The preprocessed spectra were then analyzed using Principal Component Analysis (PCA) to identify and compare biochemical variations among different types of thalassemia. The preprocessing and PCA were performed using The Unscrambler X version 10.3 (CAMO Software, Oslo, Norway).

2.3. Construction of a Spectral Database for Thalassemia Classification

The processed data were compiled into a database to classify the IR spectral characteristics of different thalassemia subtypes, including Normal, Carrier and Disease + Symptom groups. Further classification and predictive modeling were carried out using Partial Least Squares Discriminant Analysis (PLS-DA) with The Unscrambler X version 10.3 (CAMO Software, Norway). Samples were categorized based on disease severity into Disease + Symptom, Carrier, and Normal groups, with at least 30 samples per phenotype. PLS-DA was then employed to analyze and validate the classification performance and model discrimination capability among these groups [35,36,37,38,39,40,41,42,43,44,45,46,47].
Table 2. Clinical and Hematological Criteria for Thalassemia Classification.
Table 2. Clinical and Hematological Criteria for Thalassemia Classification.
ParameterCriterion (Screening/Diagnostic Cut-Off)References
RBC count (×106/µL)>5.0 indicates microcytic erythrocytosis (suggestive of thalassemia trait)[48,49]
Hemoglobin (Hb, g/dL)<12.0 indicates anemia; <10.0 suggests moderate to severe anemia[48,50]
Hematocrit (Hct, %)<36% considered anemic threshold[48,49]
Mean Corpuscular Volume (MCV, fL)<80 fL indicates microcytosis[48,51,52]
Mean Corpuscular Hemoglobin (MCH, pg)<27 pg indicates hypochromia[48,50]
Red Cell Distribution Width (RDW, %)>14% indicates anisocytosis; may support thalassemia or IDA[49,51]
HbA2 (%)>3.5% diagnostic for β-thalassemia trait[50,51]
HbE (%)25–30% diagnostic for HbE trait[50,51]
HbF (%)>1% suggests β-thalassemia intermedia or major[49,51]

3. Results

Within the broader “Normal” category, three distinct clusters were observed: true normal samples (Group 10), α-thalassemia 2 heterozygotes (Group 22), and individuals carrying Hb Constant Spring or Hb Paksae’ heterozygotes (Group 25). Groups 10 and 22 cluster closely on the dendrogram, reflecting their minimal differences in globin synthesis and overall hemoglobin composition; as a result, their IR spectra remain largely similar, particularly in the Amide I and II regions. In contrast, Group 25 forms a clearly separated branch, consistent with the structural instability and altered folding caused by non-deletional α-globin variants. These molecular effects produce more pronounced spectral deviations, explaining why their IR profiles diverge from the other two subgroups.

3.1. IR Microspectroscopy and Functional Group Analysis

FTIR spectra of hemoglobin lysate were collected across all subgroups, with roughly 45 spectra recorded per sample. After removing obvious outliers, the data were explored using Principal Component Analysis (PCA) and Figure 2 presents the average FTIR spectra of hemoglobin (Hb) lysate red blood cell samples. The dataset itself covered a fairly wide range of clinically encountered thalassemia genotypes, such as the true Normal group (Group 10), α-thalassemia 2 heterozygote (Group 22), Hb Constant Spring or Hb Paksae’ heterozygotes (Group 25), other α-thalassemia genotypes (Groups 21, 24), several types of β-thalassemia heterozygote (Groups 31, 34), Hb_E heterozygotes (Group 32), compound heterozygotes such as Hb_E with α-thalassemia 1 heterozygote (Group 44), Hb_E homozygotes with α-thalassemia 1 heterozygote (Group 47), Hb_E homozygotes (Group 33), and additional less common categories (Groups 39, 51, 89). Most groups contained between 15 and 30 samples. For comparison in the PCA plots, three groups were selected to represent the broader clinical categories: Disease + Symptom (Group 51), Carrier (Group 21), and Normal (Group 10).
Across the dataset, the FTIR spectra showed the expected absorption bands associated with major biochemical components of hemoglobin (Table 3). Clear protein-related features were present, including Amide A and B, as well as CH2 and CH3 stretching signals linked to protein side chains. The Amide I region (around 1600–1700 cm−1) captured variations in secondary structure content, while the Amide II region (1500–1600 cm−1) reflected peptide-bond vibrations. Some differences between the Disease + Symptom and Carrier groups could also be seen in the mid-spectral region, particularly near 1450 and 1350 cm−1, where side-chain deformation modes appear.
Taken together, the spectra displayed consistent signatures from both the peptide backbone and the side-chain environments of hemoglobin. The variation observed among groups reflects real biochemical diversity and provides a reasonable basis for distinguishing thalassemia categories in relation to disease severity.

3.2. Cluster Analysis of FTIR Spectra of Hb Lysate

The dendrogram (Figure 3a) shows that Others-1 (Group 25) forms a clearly separate branch, indicating strong spectral and biochemical differences due to its non-deletional α-globin variants. In contrast, Alpha-thalassemia 2 heterozygotes (Group 22) cluster closely with the Normal group (Group 10), reflecting their similar hemoglobin composition and minimal structural alterations. Overall, Group 25 is distinctly separated, while Groups 22 and 10 show high similarity and form a joint cluster.
The dendrogram (Figure 3b) separates the subgroups into two major clusters. Others-2 (Group 39) and Hb_E heterozygotes (Group 32, A2) form Cluster A because Hb_E is a structural hemoglobin variant, producing characteristic changes in the IR spectra that differ from simple reductions in globin synthesis. Cluster B contains the α and β thalassemia subtypes, where the primary defect is reduced globin-chain production, leading to a different biochemical and spectral pattern. Within Cluster B, HbE heterozygotes with α-thalassemia (Group 44, B1) show an intermediate profile, as their spectra reflect the combined effects of both a structural HbE variant and α-globin deficiency.
The dendrogram (Figure 2c) divides the subgroups into two main clusters. Cluster A includes Hb_H disease (Group 51) and Hb_E homozygotes (Group 33), which cluster together because both conditions cause major imbalances in globin-chain production, leading to similar biochemical and spectral profiles. Cluster B consists of Others-3 (Group 89) and Hb_E homozygotes with α-thalassemia heterozygotes (Group 44); these groups separate from Cluster A due to their more complex or compound genetic defects, which generate distinct IR spectral patterns. Overall, the clustering reflects the underlying molecular differences between severe globin-chain imbalance (Cluster A) and mixed or atypical mutations (Cluster B).
When integrating all datasets (Figure 3d), two major clusters were observed: Cluster A (Normal) and Cluster B (Carrier + Disease). Cluster B could be further divided into B1 (Carrier) and B2 (Disease + Symptom), clearly demonstrating the ability of FTIR spectral profiles to distinguish between severity levels of thalassemia. When correlated with FTIR spectral features, these group separations were associated with specific functional group variations, particularly within the Amide I (1600–1700 cm−1) and Amide II (1500–1600 cm−1) regions related to protein secondary structure. These bands played a key role in differentiating Normal samples from Carrier and Disease + Symptom groups. In addition, signals corresponding to side-chain vibrations (CH2, CH3) and C–O stretching modes (1400–1000 cm−1) contributed to the discrimination of Disease + Symptom subtypes with greater specificity.
Overall, FTIR Microspectroscopy combined with hierarchical cluster analysis (HCA) proved effective for molecular-level screening and classification of thalassemia. The technique shows strong potential as a biomarker-based approach for predicting disease severity and supporting early diagnosis in clinical practice.

3.3. Principal Component Analysis (PCA) and Partial Least Squares Discriminant Analysis (PLS-DA) of FTIR Spectra of Hb Lysate

The PCA of the FTIR spectra from the hemoglobin lysate (Figure 4a) showed that most of the variation in the dataset could be explained by the first three components, which together captured roughly 76% of the total variance (PC-1: 64%, PC-3: 12%). When the scores were plotted, the Normal samples tended to cluster separately from the Carrier and Disease + Symptom groups along PC-1. The second and third components added some additional separation, mainly helping distinguish the Carrier group from the Disease + Symptom group.
From the loading patterns, PC-1 (Figure 4b) mainly reflected differences in several broad spectral regions, including the Amide A zone, CH2/CH3 stretching, and the Amide I–II regions. These areas often relate to variations in secondary-structure features of hemoglobin, which explains why PC-1 was effective at separating normal from thalassemic profiles. PC-3 (Figure 4c) included features within the Amide I and II regions as well, along with peaks in the lower wavenumber region linked to Amide III and C–O stretching, which tend to reflect more detailed changes in secondary structure or side-chain environments. Taken together, the PCA results suggest that the spectral differences associated with protein backbone structure and side-chain chemistry are sufficient to differentiate Normal, Carrier, and Disease + Symptom groups, even before applying a supervised method.

3.4. Correlation Loadings Analysis of PCA from FTIR Spectra of Hb Lysate

The PCA results showed that most of the variation across the FTIR spectra was captured by PC-1, which alone explained about 79% of the total variance. This component largely drove the separation between the Normal samples and those from the Carrier and Disease + Symptom groups, which matches what was seen in the score plot (Figure 5 and Figure 6). When looking at the loadings, the more positive contributions on PC-1 were mainly from the higher-wavenumber regions around 3243 and 3422 cm−1 and from the CH2/CH3 stretching range between 2873 and 2958 cm−1. These patterns generally relate to differences in hydrogen bonding and the chemical environment of protein side chains, which appear to shift noticeably in thalassemic samples compared with normal samples. On the opposite side, the signal at 1658 cm−1 (Amide I) loaded strongly in the negative direction and was more characteristic of the Normal group, suggesting a more stable arrangement of secondary-structure elements in these samples. Peaks commonly linked to Amide II, such as those near 1536 and 1587 cm−1, were more associated with the Carrier and Disease + Symptom samples, which is consistent with the altered peptide-bond vibrations expected from structural changes in thalassemic hemoglobin.
PC-3, although accounting for only 7% of the variance, still contributed to distinguishing the Carrier group from the Disease + Symptom group. The loadings at approximately 1081 and 1187 cm−1, typically linked to Amide III or C–O stretching and certain glycoprotein-related motions, indicated shifts in side-chain behaviour and other secondary-structure features that differ between these two categories.
Overall, the combined loading patterns point to changes in both peptide-backbone vibrations and the chemical environments of protein side chains. These spectral differences are in line with the underlying biochemical alterations seen in varying severities of thalassemia and help explain why the three groups can be separated effectively in PCA.
Factor-1 and Factor-4 provide a comprehensive molecular fingerprint of thalassemia severity. Factor-1 (Figure 7b) reflects primary disruptions of the protein backbone; the loading plot for Factor-1 highlights significant contributions at Amide I 1660 cm−1, as well as Amide II at 1587 and 1533 cm−1 [56]. Factor-4 (Figure 7c) captures secondary structural perturbations, including glycoprotein and side-chain alterations. Factor-4 provides finer resolution between groups, particularly between the Carrier and Disease + Symptom categories. The positive displacement in the score plot aligns with inverse features observed in the loading plot for Factor-4, highlighting significant contributions in the higher-wavenumber regions around 3270 cm−1, from the CH2/CH3 stretching range between 2873 and 2960 cm−1, the Amide III region (1240 cm−1),C–O stretching and glycoprotein-related regions (1170 and 1124 cm−1), as well as at 981 cm−1, a marker associated with carbohydrate or aromatic side-chain vibrations [57]. Elevated intensities in these regions in Disease + Symptom samples indicate perturbations in secondary structure stability, modifications of glycoprotein-associated residues, and changes in aromatic amino-acid environments. Together, this integrated interpretation demonstrates the effectiveness of FTIR-based PLS-DA in resolving both major and subtle biochemical differences among Normal, Carrier, and Disease + Symptom groups, supporting its potential as a diagnostic tool for thalassemia classification [59,61].
These findings underscore the sensitivity of FTIR spectroscopy in capturing nuanced conformational changes within hemoglobin molecules, which are often imperceptible through conventional hematological assays. By integrating multivariate PLS-DA analysis, subtle variations in protein secondary structure, glycoprotein modifications, and side-chain microenvironments can be quantitatively distinguished across phenotypic groups. The approach not only facilitates accurate classification of thalassemia subtypes but also provides mechanistic insights into the molecular underpinnings of disease progression. Consequently, this methodology holds promise for enhancing diagnostic precision, guiding therapeutic decisions, and informing future research into structure-function relationships in hemoglobinopathies.
Based on the experimental results, further analysis was conducted using Partial Least Squares Discriminant Analysis (PLS-DA) to classify the severity levels among the Disease + Symptom, Carrier, and Normal groups. For the classification model, the Normal group was represented by samples of Normal (Group 10), the Carrier group by Alpha-thalassemia 1 heterozygotes (Group 21), and the Disease + Symptom group by Hb_H Disease (Group 51) samples. The model performance was evaluated in terms of the R2 coefficient, where the Disease + Symptom group achieved an R2 of 0.82, the Carrier group achieved R2 = 0.80, and the Normal group achieved R2 = 0.53, as shown in Figure 8 and The PLS-DA model demonstrated high diagnostic accuracy across all categories. Specifically, it achieved 98% sensitivity and 99% specificity for the Normal group, and 99% sensitivity and 100% specificity for the Disease + Symptom group. The Carrier group was classified with a sensitivity of 81% and a specificity of 91% shown in Figure 9 and Table 4.
The analysis showed that, as illustrated in the heatmap (Figure 10), The Normal group exhibited the highest absorbance intensity in the Amide I region (1600–1700 cm−1), particularly near ~1650 cm−1, reflecting a well-preserved α-helix–rich secondary structure in hemoglobin. Higher absorbance was also observed in the Amide II region (1500–1600 cm−1) and in amino-acid side-chain bands at 1452 and 1387 cm−1, indicating stable peptide-bond vibrations and intact side-chain environments typical of healthy blood samples.
The Carrier group displayed moderately reduced absorbance in the Amide I and Amide II regions relative to the Normal group, yet remained higher than the Disease + Symptom group across much of the spectrum. These patterns suggest partial but not severe alterations in protein structure associated with carrier status. Slightly lower absorbance in the protein side-chain regions (3000 and 2875 cm−1) further indicates mild perturbations to lateral-chain environments.
In contrast, the Disease + Symptom group exhibited the most pronounced deviations. Absorbance in both Amide I and Amide II decreased substantially compared with the Normal and Carrier groups, signaling a loss of secondary-structure stability. Conversely, this group showed the highest absorbance in the Amide A region (3265–3329 cm−1, N–H stretching), consistent with hydrogen-bond disruption and potential protein unfolding or aggregation. Elevated signals in the Amide III region (1240–1220 cm−1) and in aromatic-residue markers (830–750 cm−1) further indicate secondary-structure alterations and changes involving aromatic amino acids such as tyrosine and phenylalanine [56].
In summary, the Normal group demonstrated strong absorbance in spectral features associated with stable protein backbone and side-chain structures (Amide I, Amide II, and amino-acid side chains). The Disease + Symptom group showed elevated absorbance in regions indicative of structural destabilization (Amide A, Amide III, and aromatic-residue bands), while the Carrier group exhibited intermediate characteristics, consistent with its mild and subclinical phenotype.

4. Conclusions

This study utilized infrared microspectroscopy to differentiate various forms of thalassemia and to develop a spectral database capable of identifying blood samples at risk of thalassemia. By establishing a comprehensive IR spectral library from blood samples categorized according to disease severity—Normal, Carrier, and Disease + Symptom—and applying Principal Component Analysis (PCA) and Partial Least Squares Discriminant Analysis (PLS-DA), we successfully distinguished the groups based on their underlying biochemical signatures. The analysis revealed clear molecular differences, particularly in protein secondary-structure features (Amide I region) and amino-acid side-chain vibrations, demonstrating that structural alterations in hemoglobin closely correlate with thalassemia severity.
In terms of diagnostic performance, the classification models achieved sensitivity and specificity values in the range of 80–90%, confirming the reliability and practical potential of this approach. Future work will involve expanding the dataset through additional sample collection from each thalassemia subtype, with particular emphasis on PCR-based genotyping of α-thalassemia 1 heterozygotes and α-thalassemia 2 heterozygotes. This will improve the resolution of Normal individuals without latent α-thalassemia mutations and enhance model robustness through AI-assisted spectral analysis. A petty patent titled “Process for Preparation and Measurement of Blood Samples” (Application No. 2503002000) has also been officially registered.
The next phase of this research will focus on integrating the developed system into a pilot thalassemia screening program at the Thai Red Cross Society, with the aim of establishing a practical, rapid, and cost-effective diagnostic tool for clinical implementation.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/diagnostics16010067/s1, Table S1: Genotypic and Phenotypic Classification of Thalassemia of Normal Group: Normal (Group 10); Table S2: Genotypic and Phenotypic Classification of Thalassemia of Normal Group: Alpha thalassemia 2 heterozygote (Group 22); Table S3: Genotypic and Phenotypic Classification of Thalassemia of Normal Group: Others-1 (Group 25); Table S4: Genotypic and Phenotypic Classification of Thalassemia of Carrier Group: Alpha (Group 21, 24 and 45); Table S5: Genotypic and Phenotypic Classification of Thalassemia of Carrier Group: Beta (Group 31 and 34); Table S6: Genotypic and Phenotypic Classification of Thalassemia of Carrier Group: HbE Heterozygote (Group 32); Table S7: Genotypic and Phenotypic Classification of Thalassemia of Carrier Group: Hb E heterozygote with Alpha-thalassemia 1 heterozygote (Group 44); Table S8: Genotypic and Phenotypic Classification of Thalassemia of Carrier Group: Others-2 (Group 39); Table S9: Genotypic and Phenotypic Classification of Thalassemia of Disease + Symptom Group: Hb E homozygote (Group 33); Table S10: Genotypic and Phenotypic Classification of Thalassemia of Disease + Symptom Group: Hb E homozygote with Alpha-thalassemia 1 heterozygote (Group 47); Table S11: Genotypic and Phenotypic Classification of Thalassemia of Disease + Symptom Group: Hb H disease (Group 51); Table S12: Genotypic and Phenotypic Classification of Thalassemia of Disease + Symptom Group: Others-3 (Group 89).

Author Contributions

Conceptualization, K.T.; Methodology, T.K., S.T. (Saitip Tun), R.P. (Rawiwan Puangplruk) and T.M.; Software, R.P. (Rungrueang Phatthanakun) and R.R.; Validation, K.T. and R.P. (Rungrueang Phatthanakun); Formal analysis, T.K., S.S., S.T. (Saitip Tun), T.M.; Investigation, S.W.; Resources, R.P. (Rungrueang Phatthanakun); Data curation, B.K., S.T. (Sukanya Tastub), S.W. and W.T.; Writing—original draft, K.T. and T.S.; Writing—review and editing, B.K., S.T. (Sukanya Tastub), S.W. and W.T. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Fundamental fund research, grant number 4779133. The study was conducted in accordance with the Declaration of Helsinki and approved by the Human Research Ethics Committee of Nakhonratchasima College (NMC), as detailed in the attached document (NMCEC-0020/2566).

Institutional Review Board Statement

Faculty of Allied Health Sciences, Nakhon Ratchasima College, Certificate of Approval NMCEC-0020/2566, 10 April 2025.

Informed Consent Statement

Written informed consent was obtained from all participants involved in this study.

Data Availability Statement

The datasets used and/or analyzed during the current study are available from the corresponding a uthor upon reasonable request.

Acknowledgments

We thank the Synchrotron Light Research Institute (public organization), the Blood Testing Section, Regional Blood Centre, Medical Technology, the Faculty of Allied Health Sciences, Regional health promotion center 9, and the Faculty of Public Health, Mahasarakham University, who provided general support for their support and collaboration throughout the study.

Conflicts of Interest

The authors declare that they have no competing interests.

References

  1. Baker, M.J.; Trevisan, J.; Bassan, P.; Bhargava, R.; Butler, H.J.; Dorling, K.M.; Fielden, P.R.; Fogarty, S.W.; Fullwood, N.J.; Heys, K.A.; et al. Using Fourier transform infrared spectroscopy to analyze biological materials. Nat. Protoc. 2014, 9, 1771–1791. [Google Scholar] [CrossRef] [PubMed]
  2. Fucharoen, S.; Winichagoon, P. Haemoglobinopathies in Southeast Asia. Indian J. Med. Res. 2011, 134, 498–506. [Google Scholar]
  3. Harnsajarupant, P. Thalassemia. Jetanin Acad. J. 2023, 10. Available online: https://jetanin.com/th (accessed on 2 November 2025).
  4. MedlinePlus. Alpha Thalassemia. 2023. Available online: https://3billion.io/blog/rare-disease-series-3-thalassemia/ (accessed on 2 December 2022).
  5. Wasi, P.; Pootrakul, S.; Pootrakul, P.; Pravatmuang, P.; Initiator, P.; Fucharoen, S. Thalassemia in Thailand. Ann. N. Y. Acad. Sci. 1980, 344, 352–363. [Google Scholar] [CrossRef] [PubMed]
  6. Berzal, F. Redes Neuronales & Deep Learning; Universidad de Granada: Granada, Spain, 2018. [Google Scholar]
  7. Chonat, S.; Quinn, C.T. Current standards of care and long-term outcomes for thalassemia and sickle cell disease. In Gene and Cell Therapies for Beta-Globinopathies. Advances in Experimental Medicine and Biology; Malik, P., Tisdale, J., Eds.; Springer: New York, NY, USA, 2017; Volume 1013. [Google Scholar]
  8. Sanchaisuriya, K.; Fucharoen, G.; Sae-Ung, N.; Sae-Ue, N.; Baisungneon, R.; Jetsrisuparb, A.; Fucharoen, S. Molecular and hematological characterization of HbE heterozygote with alpha-thalassemia determinant. Southeast Asian J. Trop. Med. Public Health 1997, 28, 100–103. [Google Scholar]
  9. Thumanu, K.; Tanthanuch, W.; Ye, D.; Sangmalee, A.; Lorthongpanich, C.; Heraud, P.; Parnpai, R. Spectroscopic signature of mouse embryonic stem-cell–derived hepatocytes using synchrotron FTIR microspectroscopy. J. Biomed. Opt. 2011, 16, 057005. [Google Scholar] [CrossRef]
  10. Bajwa, H.; Basit, H. Thalassemia. In StatPearls; StatPearls Publishing: Treasure Island, FL, USA, 2022. [Google Scholar]
  11. Movasaghi, Z.; Rehman, S.; Rehman, I.U. Fourier transform infrared (FTIR) spectroscopy of biological tissues. Appl. Spectrosc. Rev. 2008, 43, 134–179. [Google Scholar] [CrossRef]
  12. Butler, H.J.; Cameron, J.M.; Jenkins, C.A.; Hithell, G.; Hume, S.; Hunt, N.T.; Baker, M.J. Shining a light on clinical spectroscopy: Translation of diagnostic IR, 2D-IR and Raman spectroscopy towards the clinic. Clin. Spectrosc. 2019, 1, 100003. [Google Scholar] [CrossRef]
  13. Chukiatsiri, S.; Siriwong, S.; Thumanu, K. Pupae protein extracts exert anticancer effects by down-regulating IL-6, IL-1β, and TNF-α via biomolecular changes in human breast cancer cells. Biomed. Pharmacother. 2020, 128, 110278. [Google Scholar] [CrossRef]
  14. De Bruyne, S.; Speeckaert, M.M.; Delanghe, J.R. Applications of mid-infrared spectroscopy in the clinical laboratory setting. Crit. Rev. Clin. Lab. Sci. 2017, 55, 1–20. [Google Scholar] [CrossRef]
  15. Dunkhunthod, B.; Chira-atthakit, B.; Chitsomboon, B.; Kiatsongchai, R.; Thumanu, K.; Musika, S.; Sittisart, P. Apoptotic induction of the water fraction of Pseuderanthemum palatiferum ethanol-extract powder in Jurkat cells monitored by FTIR microspectroscopy. ScienceAsia 2021, 47, 556–566. [Google Scholar] [CrossRef]
  16. Dybas, J.; Alcicek, F.C.; Wajda, A.; Kaczmarska, M.; Zimna, A.; Bulat, K.; Marzec, K.M. Trends in biomedical analysis of red blood cells—Raman spectroscopy versus other spectroscopic, microscopic and classical techniques. TrAC Trends Anal. Chem. 2022, 146, 116481. [Google Scholar] [CrossRef]
  17. Fadlelmoula, A.; Pinho, D.; Carvalho, V.H.; Catarino, S.O.; Minas, G. Fourier Transform Infrared (FTIR) spectroscopy to analyse human blood over the last 20 years: A review towards lab-on-a-chip devices. Micromachines 2022, 13, 187. [Google Scholar] [CrossRef]
  18. Finlayson, D.; Rinaldi, C.; Baker, M.J. Is infrared spectroscopy ready for the clinic? Anal. Chem. 2019, 91, 12117–12128. [Google Scholar] [CrossRef]
  19. Guo, S.; Wei, G.; Chen, W.; Lei, C.; Xu, C.; Guan, Y.; Liu, H. Fast and deep diagnosis using blood-based ATR-FTIR spectroscopy for digestive tract cancers. Biomolecules 2022, 12, 1815. [Google Scholar] [CrossRef]
  20. Kaimuangpak, K.; Tamprasit, K.; Thumanu, K.; Weerapreeyakul, N. Extracellular vesicles derived from microgreens of Raphanus sativus L. var. caudatus Alef contain bioactive macromolecules and inhibit HCT116 cell proliferation. Sci. Rep. 2022, 12, 15686. [Google Scholar] [CrossRef]
  21. Kochan, K.; Bedolla, D.E.; Perez-Guaita, D.; Adegoke, J.A.; Veettil, T.C.P.; Martin, M.; Wood, B.R. Infrared spectroscopy of blood. Appl. Spectrosc. 2020, 75, 611–646. [Google Scholar] [CrossRef] [PubMed]
  22. Mistek-Morabito, E.; Lednev, I.K. FT-IR spectroscopy for identification of biological stains for forensic purposes. Spectroscopy 2018, 33, 8–19. [Google Scholar]
  23. Mostaço-Guidolin, L.B.; Bachmann, L. Application of FTIR spectroscopy for identification of blood and leukemia biomarkers: A review over the past 15 years. Appl. Spectrosc. Rev. 2011, 46, 388–404. [Google Scholar] [CrossRef]
  24. Remanan, S.; Rajeev, P.; Sukumaran, K.; Savithri, S. Robustness of FTIR-based ultrarapid COVID-19 diagnosis using PLS-DA. ACS Omega 2022, 7, 47357–47371. [Google Scholar] [CrossRef]
  25. Spoto, F.; Martimort, P.; Drusch, M. Selecting and interpreting measures of thematic classification accuracy. Remote Sens. Environ. 1997, 62, 77–89. [Google Scholar]
  26. Thumanu, K.; Sangrajrang, S.; Khuhaprema, T.; Kalalak, A.; Tanthanuch, W.; Pongpiachan, S.; Heraud, P. Diagnosis of liver cancer from blood sera using FTIR microspectroscopy: A preliminary study. J. Biophotonics 2014, 7, 222–231. [Google Scholar] [CrossRef] [PubMed]
  27. Wang, R.; Wang, Y. Fourier transform infrared spectroscopy in oral cancer diagnosis. Int. J. Mol. Sci. 2021, 22, 1206. [Google Scholar] [CrossRef] [PubMed]
  28. Breiman, L. Bagging predictors. Mach. Learn. 1996, 24, 123–140. [Google Scholar] [CrossRef]
  29. Ferih, K.; Elsayed, B.; Elshoeibi, A.M.; Elsabagh, A.A.; Elhadary, M.; Soliman, A.; Abdalgayoom, M.; Yassin, M. Applications of artificial intelligence in thalassemia: A comprehensive review. Diagnostics 2023, 13, 1551. [Google Scholar] [CrossRef]
  30. Sammut, C.; Webb, G.I. (Eds.) Encyclopedia of Machine Learning; Springer: Boston, MA, USA, 2010; pp. 1–1059. [Google Scholar]
  31. Aksoy, C.; Uckan, D.; Severcan, F. FTIR spectroscopic imaging of mesenchymal stem cells in beta thalassemia major disease state. Biomed. Spectrosc. Imaging 2012, 1, 67–78. [Google Scholar] [CrossRef]
  32. Liu, K.-Z.; Tsang, K.; Li, C.; Shaw, A.; Mantsch, H. Infrared spectroscopic identification of beta-thalassemia. Clin. Chem. 2003, 49, 1125–1132. [Google Scholar] [CrossRef]
  33. Peng, L.X.; Wang, G.W.; Yao, H.L.; Huang, S.S.; Wang, Y.B.; Tao, Z.H.; Li, Y.Q. FTIR-HATR to identify beta-thalassemia and its mechanism study. Guang Pu Xue Yu Guang Pu Fen Xi 2009, 29, 1232–1236. [Google Scholar]
  34. Sukpong, S.; Thumanu, K.; Saovana, T.; Siriwong, S.; Changlek, S. Application of infrared spectroscopy for classification of thalassemia and hemoglobin E. Suranaree J. Sci. Technol. 2018, 25, 191–200. [Google Scholar]
  35. Derczynski, L. Complementarity, F-score, and NLP evaluation. In Proceedings of the Language Resources and Evaluation Conference, Portorož, Slovenia, 23–28 May 2016; Volume 2016, pp. 261–266. [Google Scholar]
  36. Devore, J.L. Probability and Statistics for Engineering and the Sciences; Cengage Learning: Boston, MA, USA, 2010. [Google Scholar]
  37. Efron, B. Bootstrap methods: Another look at the jackknife. Ann. Stat. 1979, 7, 1–26. [Google Scholar] [CrossRef]
  38. Fawcett, T. An introduction to ROC analysis. Pattern Recognit. Lett. 2006, 27, 861–874. [Google Scholar] [CrossRef]
  39. Kohavi, R. A study of cross-validation and bootstrap for accuracy estimation and model selection. In Proceedings of the Joint Conference on Artificial Intelligence, Montreal, QC, Canada, 20–25 August 1995. [Google Scholar]
  40. Metz, C.E. Basic principles of ROC analysis. Semin. Nucl. Med. 1978, 8, 283–298. [Google Scholar] [CrossRef]
  41. Powers, D.M.W. Evaluation: From precision, recall and F-factor to ROC, informed ness, markedness & correlation. J. Mach. Learn. Technol. 2007, 2, 37–63. [Google Scholar]
  42. Raschka, S. Model evaluation, model selection, and algorithm selection in machine learning. arXiv 2018, arXiv:1811.12808. [Google Scholar]
  43. Saito, T.; Rehmsmeier, M. The precision-recall plot is more informative than the ROC plot for imbalanced datasets. PLoS ONE 2015, 10, e0118432. [Google Scholar] [CrossRef] [PubMed]
  44. Sasaki, Y. The Truth of the F-Measure; School of Computer Science, University of Manchester: Manchester, UK, 2007; pp. 1–5. [Google Scholar]
  45. Srisongkram, T.; Weerapreeyakul, N.; Thumanu, K. Evaluation of melanoma (SK-MEL-2) cell growth between 3D and 2D cell cultures with FTIR microspectroscopy. Int. J. Mol. Sci. 2020, 21, 4141. [Google Scholar] [CrossRef]
  46. Taylor, J.R. An Introduction to Error Analysis; University Science Books: Sausalito, CA, USA, 1997. [Google Scholar]
  47. Tharwat, A. Classification assessment methods. Appl. Comput. Inform. 2018, 17, 168–192; pp. 1–327. [Google Scholar] [CrossRef]
  48. Weatherall, D.J.; Clegg, J.B. The Thalassemia Syndromes, 4th ed.; Blackwell Science: Bognor Regis, UK, 2001. [Google Scholar]
  49. World Health Organization. Guidelines for the Control of Haemoglobin Disorders; World Health Organization: Geneva, Switzerland, 2011. [Google Scholar]
  50. Tantiworawit, A.; Winichakoon, P.; Rattanathammethee, T.; Hantrakool, S.; Chai-Adisaksopha, C.; Rattarittamrong, E.; Norasetthada, L.; Charoenkwan, P.; Fanhchaksai, K.; Srichairatanakool, S.; et al. Molecular characteristics of thalassemia and hemoglobin variants in prenatal diagnosis program in Northern Thailand. Int. J. Hematol. 2019, 110, 474–481. [Google Scholar] [CrossRef]
  51. Tongprasert, F.; Sirichotiyakul, S.; Piyamongkol, W.; Tongsong, T. Sensitivity and specificity of simple erythrocyte osmotic fragility test for screening of alpha-thalassemia-1 and beta-thalassemia trait in pregnant women. Gynecol. Obstet. Investig. 2010, 69, 217–220. [Google Scholar] [CrossRef] [PubMed]
  52. Xu, Y.; Goodacre, R. On splitting training and validation set: A comparative study of cross-validation, bootstrap and systematic sampling for estimating generalization. J. Anal. Test. 2018, 2, 249–262. [Google Scholar] [CrossRef] [PubMed]
  53. Krimm, S.; Bandekar, J. Vibrational spectroscopy and conformation of peptides, polypeptides, and proteins. Adv. Protein Chem. 1986, 38, 181–364. [Google Scholar]
  54. Surewicz, W.K.; Mantsch, H.H.; Chapman, D. Determination of protein secondary structure by Fourier transform infrared spectroscopy: A critical assessment. Biochemistry 1993, 32, 389–394. [Google Scholar] [CrossRef] [PubMed]
  55. Walsh, M.J.; Fellous, T.G.; Hammiche, A.; Lin, W.R.; Fullwood, N.J.; Grillo-Hill, B.K.; Jimenez, B.; Polyak, K.; Martin-Hirsch, P.L.; Nicholson, J.K.; et al. Fourier transform infrared microspectroscopy identifies symmetric PO2-modifications as a marker of the putative stem cell region of human intestinal crypts. Stem Cells 2008, 26, 108–118. [Google Scholar] [CrossRef]
  56. Barth, A. Infrared spectroscopy of proteins. Biochim. Biophys. Acta (BBA)–Bioenerg 2007, 1767, 1073–1101. [Google Scholar] [CrossRef]
  57. De Meutter, J.; Goormaghtigh, E. Evaluation of Protein Secondary Structure from FTIR Spectra Improved after Partial Deuteration. Anal. Biochem. 2021, 625, 114206. [Google Scholar] [CrossRef]
  58. Joyce, E.J.; Krimm, S.; Miller, W.G. Infrared spectra and assignments for the amide I band in polypeptides. Biopolymers 1993, 33, 1741–1752. [Google Scholar]
  59. Kong, J.; Yu, S. Fourier Transform Infrared Spectroscopic Analysis of Protein Secondary Structures. Acta Biochim. Biophys. Sin. 2007, 39, 549–559. [Google Scholar] [CrossRef] [PubMed]
  60. James, G.; Witten, D.; Hastie, T.; Tibshirani, R. An Introduction to Statistical Learning; Springer: Berlin/Heidelberg, Germany, 2013. [Google Scholar]
  61. Yang, S.; Zhang, Q.; Yang, H.; Shi, H.; Dong, A.; Wang, L.; Yu, S. Progress in Infrared Spectroscopy as an Efficient Tool for Predicting Protein Secondary Structure. Spectrochim. Acta A Mol. Biomol. Spectrosc. 2022, 268, 120604. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Clinical Criteria for Genotypic and Phenotypic Classification of Thalassemia Study Groups.
Figure 1. Clinical Criteria for Genotypic and Phenotypic Classification of Thalassemia Study Groups.
Diagnostics 16 00067 g001
Figure 2. Average FTIR spectra of Hb lysate red blood samples from the (a) Normal group (n = 100), (b) Carrier group (n = 145), (c) Disease + Symptom group (n = 88) and (d) Comparison of the 3 groups (n = 93) recorded in the 4000–900 cm−1 range using IR microscopy (100 × 100 µm, 4 cm−1 resolution, 64 scans). Spectra were preprocessed using baseline correction, smoothing, and Extended Multiplicative Signal Correction (EMSC), followed by PCA.
Figure 2. Average FTIR spectra of Hb lysate red blood samples from the (a) Normal group (n = 100), (b) Carrier group (n = 145), (c) Disease + Symptom group (n = 88) and (d) Comparison of the 3 groups (n = 93) recorded in the 4000–900 cm−1 range using IR microscopy (100 × 100 µm, 4 cm−1 resolution, 64 scans). Spectra were preprocessed using baseline correction, smoothing, and Extended Multiplicative Signal Correction (EMSC), followed by PCA.
Diagnostics 16 00067 g002
Figure 3. Dendrogram generated from Unsupervised Hierarchical Cluster Analysis (UHCA) of FTIR spectra from the (a) Normal (n = 100), (b) Carrier (n = 145), (c) Disease + Symptom (n = 88) and (d) Comparison of the 3 groups (n = 93) using Ward’s algorithm, covering spectral regions 781–1751 cm−1 and 2788–3712 cm−1.
Figure 3. Dendrogram generated from Unsupervised Hierarchical Cluster Analysis (UHCA) of FTIR spectra from the (a) Normal (n = 100), (b) Carrier (n = 145), (c) Disease + Symptom (n = 88) and (d) Comparison of the 3 groups (n = 93) using Ward’s algorithm, covering spectral regions 781–1751 cm−1 and 2788–3712 cm−1.
Diagnostics 16 00067 g003aDiagnostics 16 00067 g003b
Figure 4. Principal Component Analysis (PCA) of Hb lysate samples classified by thalassemia severity. (a) PCA score plot of Normal (Group 10), Carrier (Alpha-thalassemia 1 heterozygotes, Group 21), and Disease + Symptom (Hb_H Disease, Group 51) groups (n = 31). (b) PCA loading plot for PC-1 (64% explained variance) of FTIR spectra. (c) PCA loading plot for PC-3 (12% explained variance) of FTIR spectra.
Figure 4. Principal Component Analysis (PCA) of Hb lysate samples classified by thalassemia severity. (a) PCA score plot of Normal (Group 10), Carrier (Alpha-thalassemia 1 heterozygotes, Group 21), and Disease + Symptom (Hb_H Disease, Group 51) groups (n = 31). (b) PCA loading plot for PC-1 (64% explained variance) of FTIR spectra. (c) PCA loading plot for PC-3 (12% explained variance) of FTIR spectra.
Diagnostics 16 00067 g004
Figure 5. PCA score plot (PC-1 vs. PC-3) and correlation loading plot of FTIR spectra of Hb lysate samples, demonstrating clear separation among Normal, Carrier, and Disease + Symptom groups.
Figure 5. PCA score plot (PC-1 vs. PC-3) and correlation loading plot of FTIR spectra of Hb lysate samples, demonstrating clear separation among Normal, Carrier, and Disease + Symptom groups.
Diagnostics 16 00067 g005
Figure 6. Second-derivative FTIR spectra of Hb lysate samples from Normal, Carrier, and Disease + Symptom groups, highlighting the characteristic absorption bands associated with specific molecular vibrations.
Figure 6. Second-derivative FTIR spectra of Hb lysate samples from Normal, Carrier, and Disease + Symptom groups, highlighting the characteristic absorption bands associated with specific molecular vibrations.
Diagnostics 16 00067 g006
Figure 7. (a) Three-dimensional PLS-DA score plot of Factor-1 and Factor-4, showing clear separation among Normal (Group 10), Carrier (Group 21), and Disease + Symptom (Group 51). (b) Factor-1 loading plot (37–39% variance), highlighting major spectral differences between Normal and other groups. (c) Factor-4 loading plot (6–9% variance), showing subtle variations that refine separation between Carrier and Disease + Symptom.
Figure 7. (a) Three-dimensional PLS-DA score plot of Factor-1 and Factor-4, showing clear separation among Normal (Group 10), Carrier (Group 21), and Disease + Symptom (Group 51). (b) Factor-1 loading plot (37–39% variance), highlighting major spectral differences between Normal and other groups. (c) Factor-4 loading plot (6–9% variance), showing subtle variations that refine separation between Carrier and Disease + Symptom.
Diagnostics 16 00067 g007
Figure 8. (a) PLS-DA analysis of Hb lysate samples classified into three severity levels: Normal, Carrier, and Disease + Symptom. (bd) Calibration curves for Disease + Symptom, Carrier, and Normal groups, respectively. Each group contained 31 samples (Normal: Normal; Carrier: Alpha-thalassemia 1 heterozygotes (Group 21); Disease + Symptom: Hb_H Disease (Group 51), with a total of 3021 spectra analyzed).
Figure 8. (a) PLS-DA analysis of Hb lysate samples classified into three severity levels: Normal, Carrier, and Disease + Symptom. (bd) Calibration curves for Disease + Symptom, Carrier, and Normal groups, respectively. Each group contained 31 samples (Normal: Normal; Carrier: Alpha-thalassemia 1 heterozygotes (Group 21); Disease + Symptom: Hb_H Disease (Group 51), with a total of 3021 spectra analyzed).
Diagnostics 16 00067 g008
Figure 9. PLS-DA prediction model for Hb lysate samples, demonstrating classification among Normal (Normal, Group 10), Carrier (Alpha-thalassemia 1 heterozygotes, Group 21) and Disease + Symptom (Hb_H Disease, Group 51) groups. Each group contained 31 samples (n = 31), with a total of 3021 spectra analyzed.
Figure 9. PLS-DA prediction model for Hb lysate samples, demonstrating classification among Normal (Normal, Group 10), Carrier (Alpha-thalassemia 1 heterozygotes, Group 21) and Disease + Symptom (Hb_H Disease, Group 51) groups. Each group contained 31 samples (n = 31), with a total of 3021 spectra analyzed.
Diagnostics 16 00067 g009
Figure 10. Heatmap of FTIR spectra of Hb lysate samples comparing absorbance values across functional group regions among Normal (N), Carrier (C), and Disease + Symptom (D) groups. The analysis was conducted using Orange (Quasar) with normalization and Min–Max scaling applied per functional group.
Figure 10. Heatmap of FTIR spectra of Hb lysate samples comparing absorbance values across functional group regions among Normal (N), Carrier (C), and Disease + Symptom (D) groups. The analysis was conducted using Orange (Quasar) with normalization and Min–Max scaling applied per functional group.
Diagnostics 16 00067 g010
Table 1. Genotypic and Phenotypic Classification of Thalassemia Study Groups.
Table 1. Genotypic and Phenotypic Classification of Thalassemia Study Groups.
GroupPhenotype
NormalNormal10Normal
Alpha-thal 2 hetero22Alpha-thalassemia 2 heterozygote
Others-125Hb Constant Spring or Hb Paksaé heterozygote
CarrierAlpha21Alpha-thalassemia 1 heterozygote
24Compound Alpha-thalassemia 2 heterozygote
45Hb E heterozygote with Alpha-thal 2 heterozygote
Beta31Beta (0)-thalassemia heterozygote
34Beta (+)-thalassemia heterozygote
HbE hetero32Hb E heterozygote
HbE hetero with Alpha44Hb E heterozygote with Alpha-thalassemia 1 heterozygote
Others-239Hb E heterozygote with Alpha-thalassemia 2 heterozygote
DiseaseHbE homo33Hb E homozygote
HbE homo with Alpha47Hb E homozygote with Alpha-thalassemia 1 heterozygote
HbH51Hb H Disease
Others-389EA Bart’s Disease
Table 3. Band assignments of the chemical structure and composition of hemoglobin.
Table 3. Band assignments of the chemical structure and composition of hemoglobin.
Wavenumber (cm−1)AssignmentsMolecular Vibration of Functional GroupReferences
3290Amide AN-H Stretching[53]
3050Amide BN-H Stretching[53]
3000Protein side chainsSymmetric and asymmetric of CH2[54]
2875Protein side chainsstretching vibrations of CH3[54]
1600–1700Amide IC=O Stretching[55]
1500–1600Amide IIN-H Bending[55,56,57,58,59]
1452Amino acids in the protein side chainsBending vibrations of CH2: δ (CH2)[60]
1387Amino acids in the protein side chainsBending vibrations of CH3: δ(CH3)[60]
1240–1220Amide III Protein secondary structureC–N stretch + N–H bending[56]
1084–1090protein side chains/glycoprotein-related vibrationC–O stretching[23,56,61]
830–750Aromatic residues markerAromatic ring breathing (Tyr, Phe)[56]
Table 4. Sensitivity and specificity values calculated from the PLS-DA prediction analysis for model performance evaluation.
Table 4. Sensitivity and specificity values calculated from the PLS-DA prediction analysis for model performance evaluation.
GroupStatisticsNo. VigorSensitivitySpecificity
NormalTrue positive3780.980.99
False Positive6
CarrierTrue positive2720.810.91
False Positive62
Disease + SymptomTrue positive2870.991.00
False Positive3
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Thumanu, K.; Khamgasem, T.; Sukpong, S.; Phatthanakun, R.; Puangplruk, R.; Tanthanuch, W.; Kuaprasert, B.; Tastub, S.; Rujanakraikarn, R.; Tun, S.; et al. A New Method for Screening Thalassemia Patients Using Mid-Infrared Spectroscopy. Diagnostics 2026, 16, 67. https://doi.org/10.3390/diagnostics16010067

AMA Style

Thumanu K, Khamgasem T, Sukpong S, Phatthanakun R, Puangplruk R, Tanthanuch W, Kuaprasert B, Tastub S, Rujanakraikarn R, Tun S, et al. A New Method for Screening Thalassemia Patients Using Mid-Infrared Spectroscopy. Diagnostics. 2026; 16(1):67. https://doi.org/10.3390/diagnostics16010067

Chicago/Turabian Style

Thumanu, Kanjana, Tanaporn Khamgasem, Somsamorn Sukpong, Rungrueang Phatthanakun, Rawiwan Puangplruk, Waraporn Tanthanuch, Buabarn Kuaprasert, Sukanya Tastub, Roengrut Rujanakraikarn, Saitip Tun, and et al. 2026. "A New Method for Screening Thalassemia Patients Using Mid-Infrared Spectroscopy" Diagnostics 16, no. 1: 67. https://doi.org/10.3390/diagnostics16010067

APA Style

Thumanu, K., Khamgasem, T., Sukpong, S., Phatthanakun, R., Puangplruk, R., Tanthanuch, W., Kuaprasert, B., Tastub, S., Rujanakraikarn, R., Tun, S., Saovana, T., Munkongdee, T., & Wongthong, S. (2026). A New Method for Screening Thalassemia Patients Using Mid-Infrared Spectroscopy. Diagnostics, 16(1), 67. https://doi.org/10.3390/diagnostics16010067

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop