Automated diagnostic platforms are increasingly utilised across diverse types of medical data, ranging from physiological signals to imaging [
7,
8,
9,
10]. ANFIS has been successfully employed in numerous medical-related applications where medical signals and images serve as the primary inputs to the decision models. Typical examples include detection and diagnosis of diabetes, blood pH imbalances, valvular and rheumatic heart conditions, epileptic seizures, prostate malignancies, and various cancers (colon, leukaemia, and lymphoma) via microarray data. ANFIS has also been applied to ophthalmic and optic nerve disorders, analysis of Doppler ultrasound signals (including internal carotid assessments), interpretation of electroencephalogram (EEG) recordings, and identification of arterial abnormalities in the eye. These studies collectively demonstrated the versatility and effectiveness of ANFIS in medical decision-support systems [
11,
12,
13]. ANFIS models were widely used in the literature for classification of medical images mainly due to its fast convergence and effective classification abilities even with smaller datasets. For example, the study by Hemanth and colleagues [
14] applied an ANFIS to classify four types of abnormal magnetic resonance imaging brain-tumour images which included metastases, meningioma, glioma, and astrocytoma, using six texture features derived from grey-level co-occurrence matrices. Each input feature was fuzzified with two generalised bell-shaped membership functions, yielding 64 fuzzy IF-THEN rules. The model’s training combined least-squares estimation (for rule consequents) with gradient-descent tuning (for membership parameters) over 200 iterations. On a 460–image dataset (120 training, 340 testing), ANFIS achieved an overall classification accuracy of 93.3% which was higher than a fuzzy-nearest-centre classifier (88.6%) and a back-propagation neural network (85.7%). Moreover, the ANFIS model converged in just 1540 CPU cycles, roughly one-tenth the time required by the other methods, while producing low mean-square errors (training ≈ 0.001, testing ≈ 0.15). Kumar and colleagues [
15] presented an ANFIS-based approach to predict COVID-19 epidemic peaks and infection counts in India. They combined nationwide case data sourced from cloud repositories with local demographic and health indicators, which include population density, age distribution, comorbidities, and infrastructure metrics, to construct a two-input, single-output Sugeno model. Using nine trapezoidal membership functions per input and 81 fuzzy rules, the model was trained via a hybrid least-squares and gradient-descent algorithm. Validation against unseen data yielded a low mean square error (MSE = 1.184 × 10
−3) and an overall predictive accuracy of 86%, outperforming linear and multiple-regression baselines (≈ 83%) both in accuracy and computational efficiency (438 s versus 540–720 s). A hybrid ANFIS framework optimised by Adam and Particle Swarm Optimisation (PSO) was proposed to improve Parkinson’s disease (PD) diagnosis accuracy [
16]. Using a public UCI dataset of 756 voice recordings (755 features), they first employed an Extra Trees ensemble to select the top five predictive features. Two ANFIS models were then trained separately: one with Adam (gradient—based) and one with PSO (swarm—based) optimizers. The PSO—tuned ANFIS achieved lower training loss and higher precision, while the Adam—tuned model yielded superior accuracy, F1—score, and recall. Across varying epochs, membership functions, and PSO particle counts, both models converged efficiently, with PSO requiring fewer iterations but Adam delivering slightly faster convergence. The best configuration (1000 epochs, 50 particles, four rules per feature) produced test accuracies above 84%, precision up to 91%, and F1—scores near 84%.
Studies have developed artificial intelligence techniques applied to infrared thermal (IRT) images to screen wrist fractures. For example, Shobayo et al. [
17] developed and evaluated a convolutional neural network (CNN) to distinguish paediatric wrist fractures from sprains using infrared thermal (IRT) images. The data for each participant were fast Fourier transformed magnitude spectra of the wrist infrared thermal images. These images were recorded from 19 participants with wrist fractures and 21 participants with wrist sprains (i.e., wrist injury did not result in bone fracture). The confirmation of the diagnosis was by X-ray radiography. The image augmentation was employed to minimise overfitting during the training of the CNN. The CNN model achieved 88% sensitivity and 76% overall accuracy (AUC = 0.82). The same authors [
18] also used a multilayer perceptron (MLP) model with the same dataset to predict wrist fracture. The optimised MLP achieved a mean sensitivity of 84.2% and specificity of 71.4%, with an overall accuracy of 77.5%. The results obtained demonstrated that IRT imaging analysed by MLP can distinguish fractures from sprains, potentially reducing unnecessary X-rays in emergency settings. The authors suggest further validation with larger cohorts, adult patients, and other fracture types.
The literature also demonstrate that ANFIS can effectively leverage both neural learning and fuzzy interpretability to deliver rapid, accurate tumour classification in MRI imaging [
14], has the capacity to deliver timely, interpretable guidance for public-health policy and sustainable strategic resource allocation and planning [
11,
12,
13], and when combined with adaptive optimizers, it can produce a better interpretable strategy for early PD detection in clinical settings [
16]. Other studies that have used ANFIS for the classification of medical images are presented in
Table 1.
This article develops an ANFIS for the first time to interpret IRT images for paediatric wrist fracture screening, where we compared the results with our previous models (MLP and CNN). ANFIS delivers transparent decision making, whereby predictions are made from human-readable IF–THEN rules applied to linguistic membership functions (e.g., Std = high, IQR = low). Each rule’s firing strength and its linear Takagi–Sugeno consequent contribute to the final score, enabling clinicians to trace how specific temperature-distribution features raised or lowered fracture likelihood. Interpretability is further supported by visualisation of membership functions, rule weights, and output surfaces, so suspect outputs can be audited and refined. Unlike CNNs that involve millions of opaque parameters and extensive tuning, ANFIS uses a compact rule base and a small set of premise/consequent parameters, reducing overfitting risk, easing hyperparameter selection, and accelerating training while preserving clinical explainability.