Next Article in Journal
COVID-19 Infection in Academic Dental Hospital Personnel; A Cross-Sectional Survey in Saudi Arabia
Next Article in Special Issue
Validity of a Screening Tool for Patients with a Sub-Threshold Level of Lumbar Instability: A Cross-Sectional Study
Previous Article in Journal
The Relationship between Sustainable Built Environment, Art Therapy and Therapeutic Design in Promoting Health and Well-Being
Previous Article in Special Issue
Fraction and Number of Unemployed Associated with Self-Reported Low Back Pain: A Nation-Wide Cross-Sectional Study in Japan
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Artificial Intelligence and Computer Vision in Low Back Pain: A Systematic Review

1
Unit of Computer Systems and Bioinformatics, Università Campus Bio-Medico di Roma, Via Alvaro Del Portillo 21, 00128 Rome, Italy
2
Department of Orthopaedic Surgery, Università Campus Bio-Medico di Roma, Via Alvaro Del Portillo 200, 00128 Rome, Italy
*
Authors to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2021, 18(20), 10909; https://doi.org/10.3390/ijerph182010909
Submission received: 7 September 2021 / Revised: 4 October 2021 / Accepted: 9 October 2021 / Published: 17 October 2021
(This article belongs to the Special Issue Low Back Pain (LBP))

Abstract

:
Chronic Low Back Pain (LBP) is a symptom that may be caused by several diseases, and it is currently the leading cause of disability worldwide. The increased amount of digital images in orthopaedics has led to the development of methods related to artificial intelligence, and to computer vision in particular, which aim to improve diagnosis and treatment of LBP. In this manuscript, we have systematically reviewed the available literature on the use of computer vision in the diagnosis and treatment of LBP. A systematic research of PubMed electronic database was performed. The search strategy was set as the combinations of the following keywords: “Artificial Intelligence”, “Feature Extraction”, “Segmentation”, “Computer Vision”, “Machine Learning”, “Deep Learning”, “Neural Network”, “Low Back Pain”, “Lumbar”. Results: The search returned a total of 558 articles. After careful evaluation of the abstracts, 358 were excluded, whereas 124 papers were excluded after full-text examination, taking the number of eligible articles to 76. The main applications of computer vision in LBP include feature extraction and segmentation, which are usually followed by further tasks. Most recent methods use deep learning models rather than digital image processing techniques. The best performing methods for segmentation of vertebrae, intervertebral discs, spinal canal and lumbar muscles achieve Sørensen–Dice scores greater than 90%, whereas studies focusing on localization and identification of structures collectively showed an accuracy greater than 80%. Future advances in artificial intelligence are expected to increase systems’ autonomy and reliability, thus providing even more effective tools for the diagnosis and treatment of LBP.

1. Introduction

In the last decade, a significant increase in the use of Artificial Intelligence (AI) has been experienced in the most disparate fields, ranging from vocal assistants commonly employed during our daily life to self-driving cars. Thanks to the unique ability of intelligent machines to be trained and automatically acquire new tasks based on previous experience or provided data, the use of AI is being increasingly investigated for applications in medical research [1]. Indeed, AI-based computers have already shown to potentially revolutionize drug design and discovery [2,3], automatic segmentation and relevant data extraction from radiological datasets [4] as well as the formulation of diagnosis, outcome prediction and treatment planning in different medical fields [5,6,7]. The adoption of this ground-breaking technology is being explored in spine surgery as well [1]. Indeed, thanks to its interdisciplinary nature and the wide utilization of radiological images to inspect the anatomical structures of the spine, the use of AI may be of particular value in determining, for example, which are the pathological discs [8], classifying a scoliotic curve [9] and predict its progression [10]. In this study, we have systematically reviewed the available literature on the use of AI, and more specifically computer vision, in the prevention, diagnosis, and treatment of chronic Low Back Pain (LBP).
LBP is mainly caused by intervertebral disc degeneration, and it is currently the leading cause of disability worldwide, as well as the most common reason for workers’ compensation claims [11]. AI has improved the clinical practice with regards to the treatment, prevention and outcome prediction of subjects suffering from LBP. This is mainly due to the ever-growing amount of clinical data available to practitioners, which allow to train and develop increasingly sophisticated AI methodologies. With particular regards to LBP, a huge amount of digital clinical images are gathered daily in order to detect signs of disease in the spinal structures. For this reason, several machine learning algorithms have been developed in recent years in order to speed-up the diagnostic process and to optimize patients’ recovery. The latest AI improvements were accompanied by the outbreak of deep learning and by an increase of computing capacity, which allow to develop models that are getting more and more autonomous and accurate. In particular, computer vision techniques applied to clinical images allow to detect some image features that are invisible to the human eye. The importance of computer vision in relation to LBP is multi-faceted: it allows to perform a plethora of tasks that may improve the clinical practice, such as automatically localizing and detecting lumbar structures with segmentation. Moreover, it allows to extract a set of features from the image that can be used as an input for further machine learning algorithms in order to provide a decision support to the physician or, in other cases, directly suggest the most appropriate diagnosis. For this reason, we have systematically reviewed the available literature on the application of computer vision on the diagnosis and treatment of LBP in order to describe the state of the art of such technology and its potential applications.

2. Materials and Methods

In order to perform an exhaustive research of AI articles related to LBP, we performed a query research on PubMed (Query research used: (((Artificial intelligence [Title/Abstract]) OR ((feature extraction[Title/Abstract]) OR ((segmentation[Title/Abstract]) OR (Computer Vision[Title/Abstract]) OR (Machine learning[Title/Abstract])) OR (deep learning[Title/Abstract]) OR (neural network[Title/Abstract]))) AND ((Low Back Pain [Title/Abstract]) OR (lumbar[Title/Abstract]))). All the search words had to be included in the title or in the abstract of the articles: the terms “low back pain” and “lumbar” were considered for the pathological part, and the terms “artificial intelligence”, “feature extraction”, “segmentation”, “computer vision”, “machine learning”, “deep learning” and “neural network” were considered for the AI part. We selected all the articles that included at least one term of the pathological part and at least one term of the artificial intelligence part in their title or abstract.

2.1. Inclusion and Exclusion Criteria

The aim of this work was to gather all the works concerning the utilization of AI, and particularly of computer vision, in the diagnosis, prevention, and treatment of chronic LBP and related diseases. Straightforwardly, all the selected articles had to meet all the following inclusion criteria:
Chronic LBP or lumbar diseases must have been among the main topics of the article. We included works on the prevention, diagnosis or treatment of chronic LBP and treating at least one of the structures involved in LBP (i.e., vertebrae, discs, muscles);
AI must have been used in the work with application to clinical images. We included articles exploiting AI methods falling in the areas of computer vision, machine learning and artificial Neural Networks (NNs);
Subjects of the study: all the articles must have been based on studies of human low back and related pathology, regardless of the age or employment of the subjects included in the study;
Language: all articles must have been written in English.
Conversely, articles that were excluded did not meet the inclusion criteria for one of the following reasons:
A different medical problem was considered: we excluded articles which did not consider chronic LBP and its related physical structures and medical data. For example, we excluded studies that considered only cervical or thoracic vertebrae, or that focused on osteoporosis, metastases, traumatic LBP, and other causes of non-discogenic LBP;
AI was not considered: some articles in the search results proposed definitions and practice for LBP based only on medical observation without utilization of AI;
Computer vision and clinical images were not considered in the study, regardless of whether AI was utilized for developing diagnosis or support systems;
Animal studies: we excluded studies based on vertebral structures of animals;
Embryonal studies: we excluded studies performed on embryos and concerning the embryogenesis of spinal structures.
A preliminary screening of the article selection allowed us to define three main categories in which the utilization of AI in LBP might be split, namely computer vision, computer aided diagnosis, and decision support systems (DSSs) (Figure 1). Computer vision is the field of AI that deals with how computers can gain high-level understanding from digital images or videos. With regards to LBP, its main applications concern feature extraction and image segmentation. Feature extraction is a dimensionality reduction process which is applied to images obtained using Magnetic Resonance Imaging (MRI), ultrasound, X-rays, and Computed Tomography (CT). The main goal of feature extraction is to retrieve a restricted number of relevant features from an image without losing important information, in order to facilitate subsequent tasks such as classification or regression.
Image segmentation is the task of dividing an image into subregions corresponding to different elements of the image. More in depth, the goal of image segmentation is the labeling of each pixel of an image with a corresponding class, e.g., foreground or background, in order to detect the relevant elements of an image. It mainly resorts to two principal techniques: deep learning, in which the image is directly given as input to an artificial NN which is trained on other images to automatically identify subregions, and digital image processing (DIP) techniques, which process digital images to find the edges of different regions based on semantic characteristics, exploiting methods such as gradient thresholding or statistical shape models.
Computer aided diagnosis is a group of techniques which help medical practitioners in identifying a pathology or in quantifying the grade of a disease. It can be split into classification and regression, in which machine or deep learning models are used to assign a predefined label or to generate a numeric output, respectively. In practice, classification is used to identify or categorize a pathology, whereas regression is used to produce a quantitative evaluation of some measure.
Decision support systems (DSSs) are software systems that allow medical practitioners to enhance the decision making and improve the outcome of patients suffering from a specific disease. The goal of the vast majority of DSSs is the outcome prediction, i.e., the prediction of the improvement that a patient would experience after exposure to a defined therapy. By predicting the extent to which a patient would benefit from a specific treatment, DSSs provide the physician with practical tools to assess whether or not surgery may be preferable to conservative treatment. Finally, DSSs can be used for prevention, e.g., by providing the user with recommendations or correct practice for preventing the onset of a disease. It is worth noting that computer vision techniques can be used as preprocessing for developing a DSS, as well as a for computer aided diagnosis.

2.2. Evaluation Metrics

Different tasks use different metrics to evaluate the performance of AI systems. However, considering the large amount of works reported in this review, different metrics were also considered within the same task. With regards to the feature extraction task, no specific evaluation metric was considered. This is because, in most cases, feature extraction is exploited as a preliminary step for further tasks such as classification and regression, and most papers only report the performance for the latter.
With regards to the classification task, we reported the results in terms of accuracy (Acc), where available. For brevity purposes, let us consider a binary classification task, e.g., positive vs. negative. Given a test set composed of N samples, defined the True Positives ( T P ) as the number of positive samples correctly classified, and the True Negatives ( T N ) as the number of negative samples correctly classified, then accuracy is defined as:
A c c % = T P + T N N × 100
Thus, greater values correspond to a better performance. For each class, recall and precision can be computed as well. Defined the False Positives ( F P ) and False Negatives ( F N ) as the number of misclassified positive/negative samples, then recall and precision are computed as:
R e c a l l = T P T P + F N P r e c i s i o n = T P T P + F P
In binary problems, recall is also called True Positive Rate and corresponds to sensitivity, whereas the True Negative Rate is also called specificity. In the case of multi-class problems, accuracy is computed by considering the T P for each class, and recall and precision per class can be computed. For imbalanced datasets, the F1-Score can be computed for each class. The F1-Score for class c is defined as:
F 1 - S c o r e c = 2 · R e c a l l c · P r e c i s i o n c R e c a l l c + P r e c i s i o n c
and takes into account both recall and precision of the class. Another widely used evaluation metric is the Area Under the Curve (AUC), which corresponds to the area under the Receiver Operating Characteristic (ROC) curve showing the performance of a classification model at all classification thresholds, which is plotted considering the True Positive Rate against the False Positive Rate. Its values range from 0 to 1 (the closer to 1, the better the performance).
With regards to the regression task, let us consider a sequence of original values x ( t ) and a sequence of predicted values x ˜ ( t ) . The Mean Absolute Error (MAE) for a sequence of N timestamps is defined as:
M A E = t = 1 N | x ( t ) x ˜ ( t ) | N
Thus, the closer to 0 the value, the better the performance. In some cases, percentage error values are used to evaluate performance, the meaning of which varies with the investigated task.
With regards to the segmentation task, two main percent performance indices are used which evaluate to what extent the segmentation result is close to the desired segmentation. As stated, segmentation consists in labeling each pixel of an image. Given two sets of data A and B, corresponding to the desired and the effective segmented areas, the Sørensen–Dice coefficient (DICE) is defined as:
D I C E ( A , B ) = 2 · | A B | | A | + | B |
where | A | and | B | are the cardinalities of the two sets. It divides the number of common elements of the two sets by the total number of elements of the two sets. When applied to binary data, it is equivalent to the F1-Score. Differently, the Jaccard index is defined as:
J a c c a r d ( A , B ) = | A B | | A B |
and is also known as Intersection Over Union. For both indices, the closer to 100 % the value, the better the performance. It is worth noting that D I C E ( A , B ) J a c c a r d ( A , B ) for any couple of sets ( A , B ) , and the relation J a c c a r d = D I C E / ( 2 D I C E ) exists to compute one value from the other.

3. Quality of Evidence

The methodological quality of the included studies was graded independently by two reviewers (L.A. and F.R.), and any disagreement was resolved by the intervention of a third reviewer (G.V.) The risks of bias and applicability of the included studies were assessed by using customized assessment criteria based on the Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) [12]. This tool is based on 4 domains: patient selection, index test, reference standard, and flow and timing. Each domain is evaluated in terms of risk of bias, and the first 3 domains are also assessed in terms of concerns regarding applicability. Sixty-eight studies were rated on a 3-point scale, reflecting concerns about risk of bias and applicability as low, unclear or high, as shown in Figure 2 (the details of analysis are presented in Tables S1 and S2).

4. Results

The search was performed on 18 March 2021, and resulted in 558 articles. Nonetheless, many of these articles focused on a different topic from that of this review, so after a first screening based on the article titles and abstracts we reduced the number of eligible articles to 200. A second screening phase was performed after having read the full text of each article, which led the total amount of included articles to 76. We created a flow-chart diagram according to the PRISMA protocol that shows the selection process of the studies (Figure 3). The articles were screened by two independent reviewers and, in the event of discrepancies regarding the inclusion or exclusion of an article, they discussed together until consensus was reached.
It is worth noting how the amount of published work is increasing year by year, and that the number of articles published in 2020 is almost double that of 2019. This may be due to two main reasons: first, the ever-increasing amount of clinical images and data available to researchers and, secondly, the improvement of computing capacity observed in recent years. The final results of the search also include five reviews. One of them, published in 2020 by Tagliaferri et al. [13], is specifically focused on LBP, but considers only the diagnosis and prognosis capability of AI in comparison with the McKenzie and the STarT Back methods, and without taking into account works that exploit clinical images. The other four reviews do not focus specifically on LBP. In detail, in 2019 Tack [14] focused on musculoskeletal medicine in general, and determined in which fields AI had reached human prediction levels; in 2020, Azimi et al. [15] focused on the use of NNs for the treatment of the whole spine; in 2019, Galbusera et al. [1] described the application of AI to problems related to the whole spine; finally, in 2016 Yao et al. [16] performed a multi-center milestone comparative study for vertebral segmentation methods based on CT images. Two articles presenting databases were also found: LUMINOUS, which is a database of ultrasound images from 109 patients for multifidus muscle segmentation [17], and MyoSegmentum, which includes MRI images of 54 patients for the segmentation of lumbar muscles and vertebral bodies [18].
The remainder of this section reports the results of the search that include works concerning computer vision. In particular, we have listed manuscripts that performed a feature extraction task or that performed semantic segmentation, and we have described papers that used DIP/NN approaches in two different subsections.

4.1. Feature Extraction

Feature extraction is a dimensionality reduction process aimed at identifying a restricted set of relevant features in order to improve the predictive capability of a system. In this review, we identified a total of 8 papers, whose main characteristics are reported in Table 1, aiming to extract relevant features from several types of LBP-related images. In detail, we included:
six articles on MRI (1 of which considers 3D MRI);
one article on 3D images of the back surface;
one article on X-ray imaging.
Intervertebral discs (IVDs) are the most investigated lumbar structures (five papers), followed by vertebrae (three papers), whereas one paper evaluated LBP without focusing on a specific structure. It is worth noting that only two out of eight articles have exclusively focused on feature extraction, i.e., the work of Raudner et al. [21] in which the GRAPPATINI method is presented for IVD feature extraction from MRI, and the work of Abdollah et al. [22] in which a Random Forest and a Texture analysis are exploited on MRI for feature extraction from IVDs and vertebrae, respectively. The remaining six articles described the performance of further tasks after feature extraction. In detail, four of them performed classification, one performed regression, and one performed segmentation tasks.
All the works that performed further tasks following feature extraction exploited machine learning techniques rather than deep learning: this is one of the advantages of feature extraction, as it allows to achieve results using much faster and less computationally-expansive methods. With regards to classification, Adankon et al. [19] were the only ones to use 3D images of the surface of the human back: they extracted features for 165 patients using local geometric descriptors, and fed them to a least-squares Support Vector Machine (SVM) for the classification of scoliosis curve types, achieving 95% accuracy. Yang et al. [8] used a Gabor wavelet transform to extract features from MRI of 109 subjects, and a Kanade–Lucas–Tomasi (KLT) feature tracker to identify lumbar degenerative changes with an accuracy of 88.3%. Ruiz-España et al. [23] extracted features from MRI of 67 patients using Gradient Vector Flow, and tested several machine learning models to classify degenerated IVDs achieving accuracies greater than 90%. Ketola et al. [24] performed texture feature extraction from 518 MRI and used Logistic Regression to discriminate between symptomatic and asymptomatic LBP with an accuracy of 83%.
With regards to the regression task, Garcia-Cano et al. [10] extracted features from X-ray images of 150 patients through the medium of Independent Component Analysis, and used Random Forest Regression to predict the spinal curve progression in adolescents with idiopathic scoliosis, achieving a MAE of 4.79° for the Cobb angle.
With regards to the segmentation task, Castro-Mateos et al. [20] extracted features from 3D MRI of 59 subjects and performed IVDs segmentation using statistical shape model space and B-Spline space, achieving an average DICE score of 88.4%.

4.2. Segmentation

Image segmentation is the task of dividing an image into sub-regions corresponding to different elements of the image, with the aim of accurately identifying the borders of different elements in the image. This approach usually exploits manually-segmented images to train an AI model. Several manuscripts included in the reviewed performed a segmentation task, and some used segmentation as a preliminary step for further tasks. For this reason, in the next sections we report, where applicable, not only the segmentation results, but also those of the successive tasks for which segmentation is used with the aim of localizing and/or identifying structures. In this review, we refer to the task of detecting specific components (e.g., vertebrae) as “localization”, whereas we refer to the task of assigning a label to specific components (e.g., L1, L2, etc.) as “identification”. Moreover, we have differentiated included papers based on whether they exploited DIP techniques or NNs. In this review, we identified 38 manuscripts using DIP techniques, and 23 using NNs. However, it is worth noting how most recent research efforts are moving towards deep learning techniques: taking into account the articles published in the last 5 years (2016-2021), this review includes 16 papers using DIP, and 23 using NNs.

4.2.1. Digital Image Processing

DIP segmentation techniques process digital images to find the edges of different regions based on semantic characteristics, exploiting methods such as gradient thresholding or statistical shape models. In this review, we identified a total of 38 papers that performed DIP segmentation on different types of images (Table 2):
15 articles on MRI (2 of which considered 3D MRI);
15 articles on CT images;
1 articles on both MRI and CT images;
3 articles on fluoroscopic images;
2 articles on ultrasound images;
2 articles on X-ray images.
Vertebrae are the most investigated lumbar structures (26 papers), followed by IVDs (10 papers) and muscles (6 papers). It is worth noting that only one [25] out of the 21 works using CT, X-ray or fluoroscopic images did not involve segmentation of vertebral structures. In total, 20 articles focused only on segmentation without further tasks. Among the others, 12 performed successive structure localization, 6 conducted successive structure identification (4 of which performed both localization and identification), whereas regression, tracking, and 3D reconstruction were investigated by 1 manuscript for each task, respectively.
With regards to the papers that focused exclusively on segmentation, Haq et al. [26] used shape-aware models on 3D MRI of 21 patients for the segmentation of IVDs, achieving an average DICE of 91.7%. In addition, in a successive article Haq et al. [25] utilized a shape statistical deformable model for the segmentation of IVDs on CT images of 18 subjects from the SpineWeb dataset, achieving DICE scores ranging from 91.7 to 95.4%. Li et al. [28] applied a threshold to the results of a Gaussian Mixture Model for segmenting vertebrae on a total of 115 CT images from the SpineWeb and the Microsoft Research datasets, with an average DICE of 92.1%. Ibragimov et al. [29] used landmark detection and deformable models for segmenting 30 vertebrae on CT images, with a DICE of 84.7%. Yu et al. [30] utilized bone-sheet assisted grid cut to segment vertebrae from 21 CT images, achieving an average DICE of 93.9%. Korez et al. [31] applied a shape-constrained deformable model for vertebrae segmentation from CT images of 220 patients, with a DICE of 94.6%. Al-Helo et al. [32] combined Active-shape models and GVF-snake for the segmentation of vertebrae from CT images of 50 subjects, assessing the segmentation quality by visual evaluation. Ruiz-España et al. [33] used a Selective Binary Gaussian Filtering Regularized Level Set to segment vertebrae on CT images of 10 subjects, achieving an average DICE of 95%. Huang et al. [34] exploited Otsu thresholding, Edge- and Region-based level sets to segment vertebrae on CT images of 56 subjects, with a 94% DICE. Mastmeyer et al. [38] utilized volume growing and morphological operations to segment vertebrae on CT images of 41 subjects, achieving DICE scores greater than 98.6%. Zhang et al. [53] applied Hough transform and Fourier descriptors for vertebrae segmentation on one fluoroscopic image, assessing the segmentation quality by visual evaluation. Michopoulou et al. [45] used an Atlas-robust-fuzzy C-Means for segmenting IVDs on MRI of 34 subjects, achieving a 90% DICE. Fallah et al. [46] exploited Hierarchical Conditional Random Fields and a Random Forest for the segmentation of IVDs and vertebrae, respectively, on MRI of 34 subjects, achieving a DICE of 92.5 and 91.4%, respectively. Ghosh et al. [47] combined Random Forest and context features for the segmentation of IVDs and vertebrae, respectively, on MRI of 212 subjects, achieving a DICE of 87 and 84%, respectively. Kim et al. [48] used graph-based and line-based segmentation algorithms for segmenting vertebrae on MRI of 19 patients, achieving a 90% DICE. Gaonkar et al. [49] applied a multi-parametric ensemble to segment vertebrae on MRI of 63 subjects, with an average DICE of 83%. Gawel et al. [50] combined a cascade classifier and an Active Appearance Model to segment vertebrae on 50 MRI, achieving a DICE of 91.4%. Engstrom et al. [51] used a Statistical Shape model for the segmentation of the quadratus lumborum muscle on MRI of 20 patients, achieving a DICE of 87%. Baum et al. [52] exploited an Average Shape model and a Dual Feature model for paraspinal muscle segmentation on MRI of 10 subjects, with a DICE of 83%. Jurcak et al. [54] applied Probabilistic atlases and Geodesic Active Contours for the segmentation of quadratus lumborum muscle on MRI of 20 subjects with a 77% DICE. Ribeiro et al. [61] used Gabor Filters and an ANN to segment vertebrae on X-ray images of 41 patients, achieving a DICE of 91.7%.
With regards to the articles that performed localization following segmentation, Mahdy et al. [35] used a threshold method followed by an adaptive K-Means for the segmentation and localization of lumbar vertebrae on CT images of 10 subjects in order to identify degenerated IVDs, and evaluated the performance by visual evaluation. Courbot et al. [36] exploited a Hidden Markov Chain for semi-automated segmentation of vertebrae on CT images of 15 subjects, achieving a localization accuracy of 89.4%. Rasoulian et al. [37] developed a multi-object shape model for vertebrae localization on 32 CT images, correctly localizing the centers of mass with a MAE of 2 mm with the aim of identifying the optimal location for spinal needle injection. Štern et al. [42] performed an analysis of the geometry of the spinal structures to localize the centers of IVDs and vertebrae on 13 MRI and 29 CT images, respectively, with a localization error of 2.8 and 1.8 mm, respectively. Neubert et al. [56] used an Active Shape model to segment IVDs on MRI of 44 subjects achieving a DICE of 92.3%, and an AUC of 0.98 for localization of degenerated IVDs using Linear Discriminant Analysis and SVM. Kim et al. [59] exploited Fuzzy C-Means Clustering for the localization of lumbar multifidus muscle on ultrasound images of 50 subjects, with a 2 mm localization discrepancy. Lui et al. [60] utilized Decoupled Active Contour for the localization of lumbar multifidus muscle on ultrasound images of 10 subjects, achieving an F1-Score of 90.9%. Sa et al. [62] used Gradient Vector Flow Snake and SVM for the localization of vertebrae on X-ray images of 30 subjects, achieving a True Positive Rate of 75%.
With regards to the papers that performed identification following segmentation, Neubert et al. [27] used a Statistical Shape model on 3D MRI of 28 subjects to segment and identify IVDs and vertebrae, achieving segmentation DICE of 89 and 91%, respectively, and 98.3% specificity and 100% sensitivity for the identification of degenerated IVDs. Castro-Mateos et al. [58] described an Active Contour Model for the segmentation and a Feedforward NN for the identification and classification of IVDs on MRI of 48 subjects, achieving 87% Sensitivity.
With regards to the papers that performed both localization and identification, Jimenez-Pastor et al. [39] used a Decision Forest and morphological image processing to localize and identify vertebrae on 272 CT images, achieving a localization error of 13.7 mm and an accuracy of 74.8%. Lee et al. [40] exploited threshold and thinning-based integrated cost on CT images of 19 subjects, for the localization and identification of lumbar pedicles in order to increase accuracy and safety during transpedicular screw placement, with a localization error of 0.14 mm and 93.2% accuracy. Klinder et al. [41] used a Triangulated Shape model on CT images of 64 subjects, achieving a vertebrae localization error of 1.1 mm and 92% accuracy. Oktay et al. [57] combined a Probabilistic model with an SVM to localize and detect IVDs on MRI of 40 subjects, achieving a localization rate of 95.4% and an accuracy of 97%.
In addition, Wong et al. [43] used Wavelets and a Shape-Active Contour-Based model for vertebrae segmentation and Tracking on 2 videos of fluoroscopic images, evaluating the performance by visual evaluation. Zheng et al. [44] utilized Statistical Shape models for vertebrae segmentation and 3D reconstruction on 4 fluoroscopic images, achieving a mean reconstruction error of less than 1.6 mm. Finally, Fortin et al. [55] used a threshold algorithm for segmentation and quantification of paraspinal muscle composition with a reliability coefficient ranging between 97 and 99%.

4.2.2. Deep Learning

Deep learning is a class of AI algorithms based on Artificial Neural Networks. More in detail, an NN is said to be “deep” if it is composed of more than 2 hidden layers. Deep learning techniques for segmentation take as an input the whole original image, and perform feature extraction, feature selection, segmentation and any further step (e.g., classification, regression) in one single model. In this review, we identified a total of 23 papers that performed deep learning segmentation, and their main characteristics are reported in Table 3. In detail:
13 articles on MRI (2 of which considered 3D MRI and 1 with the addition of clinical notes);
5 articles on CT images;
4 articles on X-ray images (1 of which in combination with Moire images);
1 article on ultrasound images.
Vertebrae were the most investigated lumbar structures (16 papers), followed by IVDs (11 papers), spinal canal (7 papers), and muscles (5 papers). In total, 9 articles focused exclusively on segmentation without further tasks. Among the others, 5 manuscripts performed successive structure identification, 3 carried out a regression task, 3 performed successive structure reconstruction, 1 work performed classification, 1 performed structure localization, and 1 carried out both structure localization and identification. It is worth noting that the vast majority of the works included in this section exploited Convolutional Neural Networks (CNNs) or models that derive from them.
With regards to the articles that focused exclusively on segmentation, Iriondo et al. [63] used a Coarse-to-fine context memory NN to segment IVDs on 3D MRI of 31 subjects, achieving a DICE greater than 85%. Malinda et al. [67] utilized Generative Adversarial Networks (GANs) for vertebrae segmentation on CT images of 120 subjects, achieving a DICE of 94.2%. Kim et al. [71] exploited a BSU-net for IVDs segmentation on 20 MRI from the SpineWeb dataset, achieving a DICE of 89.4%. Shen et al. [72] used a Feedforward NN on MRI of 120 subjects, achieving a Jaccard index for the segmentation of IVDs, spinal canal and muscles of 87, 82 and 85%, respectively. Gaonkar et al. [73] applied a U-net to segment IVDs on 39295 MRI images, achieving an 88% DICE; they also combined an SVM with a Regression Tree to segment the spinal canal with a DICE of 87%. Huang et al. [74] used a U-net to segment IVDs and vertebrae on 100 MRI achieving a Jaccard index of 92.6 and 94.7%, respectively. Li et al. [75] utilized a CNN to segment vertebrae and spinal canal on MRI of 120 patients achieving an overall DICE of 92.5%. Moreover, they used a deformed U-net [76] for the segmentation of paraspinal muscles on 120 MRI achieving an overall DICE greater than 91.3%. Zhou et al. [77] utilized a U-net for vertebrae segmentation on MRI of 57 subjects, achieving a DICE of 84.9%.
With regards to the papers that performed structure identification following segmentation, Siemionow et al. [68] used a CNN to identify vertebrae on CT images of 45 subjects, with an overall accuracy ranging from 96 to 99%. Zhou et al. [80] combined a CNN and similarity with a beforehand lumbar image for vertebrae identification on MRI images of 1318 healthy and unhealthy subjects, achieving an accuracy of 98.9%. Forsberg et al. [81] combined a CNN and graph-based graphical models on MRI enriched with clinical notes to identify vertebrae of 475 patients, achieving an accuracy of 97%. Baka et al. [82] utilized a CNN and a matching strategy for vertebrae identification on ultrasound images from 19 datasets, achieving an accuracy of 92%. Li et al. [84] were the only to perform vertebrae identification on X-ray images. They applied a CNN on 110 images, achieving an 80.4% accuracy.
With regards to the articles that performed a regression task, Watanabe et al. [70] used a CNN to estimate spinal alignment on 1996 Moire images, with a Cobb angle MAE of 3.42°. Natalia et al. [79] combined a SegNet and a Contour Evolution Algorithm to measure anteroposterior diameter and foraminal widths on MRI of 515 patients suffering from lumbar spinal stenosis with a mean error of 0.9 mm. Cho et al. [83] used a U-net for the automated segmentation and measurement of lumbar lordosis on X-ray images of 629 patients, achieving a DICE of 82.1% and a MAE of 8.06°.
With regards to the articles performing a Reconstruction task, Staartjes et al. [64] developed a CNN to segment and reconstruct the lumbar structures from 3D MRI of 3 patients, evaluating the performance by visual evaluation. Lee et al. [65] used GANs to generate synthetic spine lumbar structures MRI from 280 CT images, with a MAE of 21 pixels. Fan et al. [66] axploited a U-net to reconstruct lumbar structures from CT images of 108 subjects, with a Kambin triangle of 161 mm 2 .
With regards to the articles performing a classification task, Jamuladin et al. [78] used a CNN for classification of IVDs and vertebrae on MRI of 2009 subjects achieving an accuracy of 95.6%.
In addition, Sa et al. [85] fine-tuned a Faster Region-based CNN (R-CNN) for IVD localization on 1081 X-ray images with a 90.5% precision. Finally, Netherton et al. [69] used an X-net ensemble to localize and identify vertebrae on 330 CT images, achieving a localization error of 2.2 mm and an accuracy of 94%.

5. Discussion

Due to the extensive use of advanced imaging modalities and the complexity of anatomical structures involved in the development of LBP and its sequelae, a vast body of research has been investigating the utilization of AI in the elaboration of digital images for different purposes. The vast majority of the works in the literature exploit MRI or CT imaging, whereas a minority of works exploit X-ray, fluoroscopic or ultrasound imaging. It is worth noting that vertebral structures are the main focus of articles performing segmentation, both with DIP and deep learning techniques; conversely, articles performing feature extraction are mainly focused on IVDs.
With regards to feature extraction, which is the capacity of a system to recognize a specific set of relevant features, all included studies collectively showed an accuracy > 80% in identifying the location of vertebrae [24] and IVDs [8,20,21,22,23,24], with the ability to even detect annular tears and lumbar disc herniation [21,22,23]. Although the majority of the studies were conducted on MRI images [8,20,21,22,23,24], one study utilized X-ray imaging [10] and another study built a 3D model of patients’ backs using a noninvasive surface acquisition technology [19]. Moreover, some of these studies also reported the capacity of the described systems to perform classification and regression tasks on extracted data, such as estimating the degree of IVD degeneration [8,22,23,24], scoliosis curve type classification [19] and prediction of curve progression [10], the presence of spinal stenosis [23] and to explore the correlation between degenerative changes and the presence of LBP [24].
However, most studies focused on segmentation, which is the differentiation of specific subregions of an image based on distinct parameters. Traditionally, segmentation tasks have been performed by DIP systems via subdivision of elements within an image based on gradient thresholding or statistical shape models, which fall under the definition of semantic segmentation [86]. However, recent research has been exploring the use of deep learning-based AI systems which are able to perform multiple tasks at the basic and advanced level in a single model [1]. Vertebrae are by far the most investigated structure, with AI systems reaching > 90% DICE and > 90% accuracy in the majority of studies included in our review, both using DIP [28,29,30,31,32,33,34,35,36,37,38,39,40,41,43,44,48,49,50,53,61,62] and deep learning models [67,69,77,80,81,82,83,84]. In particular, a study from Lee et al. [40] proposed a model to obtain an automated segmentation of lumbar pedicles from CT images in order to increase accuracy and safety during transpedicular screw placement. On the other hand, a study from Watanabe and colleagues [70] described a CNN able to estimate spinal alignment, vertebral rotation and Cobb angle with a mean absolute error of 3.6 pixels for vertebral position, 2.9° for vertebral rotation and 3.42° with regards to the estimated Cobb angle. Similarly, Cho et al. [83] presented a CNN capable of segmenting lumbar vertebrae and subsequently calculate lumbar lordosis, with a mean absolute error of 8.055°. In this manuscript, Several AI systems for automated segmentation of IVDs have been described as well [25,26,45,48,56,57,58,63] with a reported DICE > 90% in nearly all studies. Besides, performance of systems developed for the segmentation of paraspinal muscles have reported a higher variability compared to other structures [51,52,54,55,60,71], with higher DICE values for systems based on deep learning models [76]. In addition, some studies evaluated the simultaneous segmentation of multiple structures, in particular IVDs and vertebrae [27,42,46,56,74,78], with a DICE > 90% in DIP-based systems [27,42,46,56] and a reported accuracy > 95% in most deep learning-based systems [68,71,74,78,85]. Furthermore, some of the latter have been used in order to synthesize CT images from MRI and vice versa. For example, Staartjes et al. [64] introduced a CNN-based system able to generate synthetic CT images from spine MRI, so as to acquire more precise information about osseous structures compared to traditional MRI without the need to expose patients to additional radiation. On the other hand, Lee and colleagues [65] presented a model based on GANs capable of producing a synthetic MRI from spine CT scans, which resulted in a mean overall similarity with real MRI scans of 80.2%. This study demonstrated the possibility to extract accurate information about soft tissues from spine CT without the necessity to order an MRI, which is often expensive and time-consuming. Other studies have also shown the possibility to automatically calculate the spinal canal area [73] as well as segmenting and reconstructing multiple structures at the same time [47,66,72,75,79] with an elevate degree of accuracy.
Figure 4 shows a boxplot that summarizes the results for the segmentation of IVDs, vertebrae and lumbar muscles, and the identification accuracy for different lumbar structures. With regards to the segmentation of IVDs and vertebrae, it is worth noting that DIP and deep learning techniques achieve very similar results, with DIP methods performing slightly better. This is mainly due to the regular and homogeneous surface of such structures, whose well-defined edges can be effectively identified using DIP techniques such as threshold and region-growing methods. Conversely, lumbar muscle segmentation performance of deep learning techniques is sensitively better than that of DIP methods. Indeed, the structure of muscles is irregular and more challenging to detect properly, and deep NNs provide a better tool for such a task. With regards to the identification accuracy, deep learning provides generally better results; nonetheless, DIP methods followed by machine learning techniques are typically faster and less computationally expensive, and, in some cases, provide similar performance.
Although the application of computer vision to the elaboration of radiological images of the spine is continuously increasing, some concerns still exist. Indeed, system validation still largely depends on multiple user interventions and cannot replace the human counterpart for obvious reasons, from both clinical and ethical perspectives. Furthermore, the best performing methods are based on the application of NNs, which usually require a large amount of images and computational capacity for training, which are not available to all researchers. However, some DIP techniques provide equal or better performance in the segmentation of regular-shaped structures such as vertebrae and IVDs, while requiring a smaller amount of data for training and limiting the computational burden. Moreover, some methods already exist for the automatic detection and grading of conditions such as spondylolisthesis, disc herniation and scoliosis.

6. Conclusions

In the last decade, the utilization of AI has increased considerably in all fields, and medical research made no exception. Indeed, AI-based computers have already shown the potential to revolutionize the medical field, including spine surgery. In this study, we have systematically reviewed the available literature on the use of AI, and more specifically computer vision, in the prevention, diagnosis, and treatment of LBP. In conclusion, computer vision techniques bear promises for effectively improving clinical practice in coming years, thanks to the availability of public datasets and to the natural upcoming increase of the computational capacity. Furthermore, steps are being taken towards the interpretability of AI and, in particular, of deep learning models. Such improvements will lead to the development of systems that will not require multiple user interventions, thus providing a valid assessment tool for physicians. LBP diagnosis and treatment often require the utilization and integration of advanced imaging modalities. In addition, several structural alterations, often subtle and nonunivocal to interpret, concur to define the clinical scenario. In this picture, the use of AI and computer vision may effectively assist and implement the diagnostic process, thus possibly improving clinical outcomes and diagnostic accuracy.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/ijerph182010909/s1, Table S1: Summary of the methodological quality of included studies regarding the 4 domains assessing the risk of bias of the QUADAS-2 score; Table S2: Summary of the methodological quality of included studies regarding the 3 domains assessing applicability concerns of the QUADAS-2 score.

Author Contributions

Conceptualization, F.D., F.R., M.M.; methodology, F.D., F.R., L.A., L.V., G.V., M.M.; writing—original draft preparation, F.D., F.R., L.A., M.M.;investigation, F.D.; writing—review and editing, F.D., F.R., L.A., L.V., G.V., M.M., R.P., V.D.; visualization, F.D.; supervision, F.R., L.V., G.V., M.M., R.P., V.D.; funding acquisition, F.R., G.V., R.P., V.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Research Grant (BRiC-2018 ID3) of the Italian Workers’ Compensation Authority (INAIL).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Galbusera, F.; Casaroli, G.; Bassani, T. Artificial intelligence and machine learning in spine research. JOR Spine 2019, 2, e1044. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Chan, H.S.; Shan, H.; Dahoun, T.; Vogel, H.; Yuan, S. Advancing drug discovery via artificial intelligence. Trends Pharmacol. Sci. 2019, 40, 592–604. [Google Scholar] [CrossRef] [PubMed]
  3. Rashid, M.B.M.A.; Chow, E.K.H. Artificial intelligence-driven designer drug combinations: From drug development to personalized medicine. SLAS Technol. Transl. Life Sci. Innov. 2019, 24, 124–125. [Google Scholar] [CrossRef] [PubMed]
  4. Hosny, A.; Parmar, C.; Quackenbush, J.; Schwartz, L.H.; Aerts, H.J. Artificial intelligence in radiology. Nat. Rev. Cancer 2018, 18, 500–510. [Google Scholar] [CrossRef]
  5. Bur, A.M.; Shew, M.; New, J. Artificial intelligence for the otolaryngologist: A state of the art review. Otolaryngol.-Neck Surg. 2019, 160, 603–611. [Google Scholar] [CrossRef]
  6. Boon, I.S.; Au Yong, T.; Boon, C.S. Assessing the role of artificial intelligence (AI) in clinical oncology: Utility of machine learning in radiotherapy target volume delineation. Medicines 2018, 5, 131. [Google Scholar] [CrossRef] [Green Version]
  7. Loftus, T.J.; Tighe, P.J.; Filiberto, A.C.; Efron, P.A.; Brakenridge, S.C.; Mohr, A.M.; Rashidi, P.; Upchurch, G.R.; Bihorac, A. Artificial intelligence and surgical decision-making. JAMA Surg. 2020, 155, 148–158. [Google Scholar] [CrossRef]
  8. Yang, T.; Li, R.; Liang, N.; Li, J.; Yang, Y.; Huang, Q.; Li, Y.; Cao, W.; Wang, Q.; Zhang, H. The application of key feature extraction algorithm based on Gabor wavelet transformation in the diagnosis of lumbar intervertebral disc degenerative changes. PLoS ONE 2020, 15, e0227894. [Google Scholar] [CrossRef] [Green Version]
  9. Thong, W.; Parent, S.; Wu, J.; Aubin, C.E.; Labelle, H.; Kadoury, S. Three-dimensional morphology study of surgical adolescent idiopathic scoliosis patient from encoded geometric models. Eur. Spine J. 2016, 25, 3104–3113. [Google Scholar] [CrossRef]
  10. Garcia-Cano, E.; Cosío, F.A.; Duong, L.; Bellefleur, C.; Roy-Beaudry, M.; Joncas, J.; Parent, S.; Labelle, H. Prediction of spinal curve progression in adolescent idiopathic scoliosis using random forest regression. Comput. Biol. Med. 2018, 103, 34–43. [Google Scholar] [CrossRef]
  11. Franklin, G.M.; Wickizer, T.M.; Coe, N.B.; Fulton-Kehoe, D. Workers’ compensation: Poor quality health care and the growing disability problem in the United States. Am. J. Ind. Med. 2015, 58, 245–251. [Google Scholar] [CrossRef]
  12. Whiting, P.F.; Rutjes, A.W.; Westwood, M.E.; Mallett, S.; Deeks, J.J.; Reitsma, J.B.; Leeflang, M.M.; Sterne, J.A.; Bossuyt, P.M. QUADAS-2: A revised tool for the quality assessment of diagnostic accuracy studies. Ann. Intern. Med. 2011, 155, 529–536. [Google Scholar] [CrossRef]
  13. Tagliaferri, S.D.; Angelova, M.; Zhao, X.; Owen, P.J.; Miller, C.T.; Wilkin, T.; Belavy, D.L. Artificial intelligence to improve back pain outcomes and lessons learnt from clinical classification approaches: Three systematic reviews. NPJ Digit. Med. 2020, 3, 1–16. [Google Scholar] [CrossRef]
  14. Tack, C. Artificial intelligence and machine learning| applications in musculoskeletal physiotherapy. Musculoskelet. Sci. Pract. 2019, 39, 164–169. [Google Scholar] [CrossRef] [PubMed]
  15. Azimi, P.; Yazdanian, T.; Benzel, E.C.; Aghaei, H.N.; Azhari, S.; Sadeghi, S.; Montazeri, A. A Review on the Use of Artificial Intelligence in Spinal Diseases. Asian Spine J. 2020, 14, 543. [Google Scholar] [CrossRef] [PubMed]
  16. Yao, J.; Burns, J.E.; Forsberg, D.; Seitel, A.; Rasoulian, A.; Abolmaesumi, P.; Hammernik, K.; Urschler, M.; Ibragimov, B.; Korez, R.; et al. A multi-center milestone study of clinical vertebral CT segmentation. Comput. Med. Imaging Graph. 2016, 49, 16–28. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Belasso, C.J.; Behboodi, B.; Benali, H.; Boily, M.; Rivaz, H.; Fortin, M. LUMINOUS database: Lumbar multifidus muscle segmentation from ultrasound images. BMC Musculoskelet. Disord. 2020, 21, 1–11. [Google Scholar] [CrossRef]
  18. Burian, E.; Rohrmeier, A.; Schlaeger, S.; Dieckmeyer, M.; Diefenbach, M.N.; Syväri, J.; Klupp, E.; Weidlich, D.; Zimmer, C.; Rummeny, E.J.; et al. Lumbar muscle and vertebral bodies segmentation of chemical shift encoding-based water-fat MRI: The reference database myosegmentum spine. BMC Musculoskelet. Disord. 2019, 20, 1–7. [Google Scholar] [CrossRef]
  19. Adankon, M.M.; Dansereau, J.; Labelle, H.; Cheriet, F. Non invasive classification system of scoliosis curve types using least-squares support vector machines. Artif. Intell. Med. 2012, 56, 99–107. [Google Scholar] [CrossRef] [Green Version]
  20. Castro-Mateos, I.; Pozo, J.M.; Eltes, P.E.; Del Rio, L.; Lazary, A.; Frangi, A.F. 3D segmentation of annulus fibrosus and nucleus pulposus from T2-weighted magnetic resonance images. Phys. Med. Biol. 2014, 59, 7847. [Google Scholar] [CrossRef]
  21. Raudner, M.; Schreiner, M.M.; Hilbert, T.; Kober, T.; Weber, M.; Szelényi, A.; Windhager, R.; Juras, V.; Trattnig, S. Clinical implementation of accelerated T 2 mapping: Quantitative magnetic resonance imaging as a biomarker for annular tear and lumbar disc herniation. Eur. Radiol. 2020, 31, 3590–3599. [Google Scholar] [CrossRef] [PubMed]
  22. Abdollah, V.; Parent, E.C.; Dolatabadi, S.; Marr, E.; Croutze, R.; Wachowicz, K.; Kawchuk, G. Texture Analysis in the Classification of T2 Weighted Magnetic Resonance Images in Persons with and without Low Back Pain. J. Orthop. Res.® 2020, 39, 2187–2196. [Google Scholar] [CrossRef] [PubMed]
  23. Ruiz-España, S.; Arana, E.; Moratal, D. Semiautomatic computer-aided classification of degenerative lumbar spine disease in magnetic resonance imaging. Comput. Biol. Med. 2015, 62, 196–205. [Google Scholar] [CrossRef]
  24. Ketola, J.H.; Inkinen, S.I.; Karppinen, J.; Niinimäki, J.; Tervonen, O.; Nieminen, M.T. T 2-weighted magnetic resonance imaging texture as predictor of low back pain: A texture analysis-based classification pipeline to symptomatic and asymptomatic cases. J. Orthop. Res.® 2020. [Google Scholar] [CrossRef]
  25. Haq, R.; Schmid, J.; Borgie, R.; Cates, J.; Audette, M.A. Deformable multisurface segmentation of the spine for orthopedic surgery planning and simulation. J. Med. Imaging 2020, 7, 015002. [Google Scholar] [CrossRef]
  26. Haq, R.; Aras, R.; Besachio, D.A.; Borgie, R.C.; Audette, M.A. 3D lumbar spine intervertebral disc segmentation and compression simulation from MRI using shape-aware models. Int. J. Comput. Assist. Radiol. Surg. 2015, 10, 45–54. [Google Scholar] [CrossRef]
  27. Neubert, A.; Fripp, J.; Engstrom, C.; Schwarz, R.; Lauer, L.; Salvado, O.; Crozier, S. Automated detection, 3D segmentation and analysis of high resolution spine MR images using statistical shape models. Phys. Med. Biol. 2012, 57, 8357. [Google Scholar] [CrossRef]
  28. Li, Y.; Liang, W.; Zhang, Y.; Tan, J. Automatic global level set approach for lumbar vertebrae CT image segmentation. BioMed Res. Int. 2018, 2018, 6319879. [Google Scholar] [CrossRef]
  29. Ibragimov, B.; Korez, R.; Likar, B.; Pernuš, F.; Xing, L.; Vrtovec, T. Segmentation of pathological structures by landmark-assisted deformable models. IEEE Trans. Med. Imaging 2017, 36, 1457–1469. [Google Scholar] [CrossRef]
  30. Yu, W.; Liu, W.; Tan, L.; Zhang, S.; Zheng, G. Multi-object Model-Based Multi-atlas Segmentation Constrained Grid Cut for Automatic Segmentation of Lumbar Vertebrae from CT Images. In Intelligent Orthopaedics; Springer: Singapore, 2018; pp. 65–71. [Google Scholar] [CrossRef]
  31. Korez, R.; Ibragimov, B.; Likar, B.; Pernuš, F.; Vrtovec, T. A framework for automated spine and vertebrae interpolation-based detection and model-based segmentation. IEEE Trans. Med. Imaging 2015, 34, 1649–1662. [Google Scholar] [CrossRef] [PubMed]
  32. Al-Helo, S.; Raja’S, A.; Chaudhary, V.; Al-Zoubi, M. Segmentation of lumbar vertebrae from clinical CT using active shape models and GVF-snake. In Proceedings of the 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Boston, MA, USA, 30 August–3 September 2011; pp. 8033–8036. [Google Scholar] [CrossRef]
  33. Ruiz-España, S.; Díaz-Parra, A.; Arana, E.; Moratal, D. A fully automated level-set based segmentation method of thoracic and lumbar vertebral bodies in Computed Tomography images. In Proceedings of the 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Milan, Italy, 25–29 August 2015; pp. 3049–3052. [Google Scholar] [CrossRef]
  34. Huang, J.; Jian, F.; Wu, H.; Li, H. An improved level set method for vertebra CT image segmentation. Biomed. Eng. Online 2013, 12, 1–16. [Google Scholar] [CrossRef] [Green Version]
  35. Mahdy, L.N.; Ezzat, K.A.; Hassanien, A.E. Automatic detection System for Degenerative Disk and simulation for artificial disc replacement surgery in the spine. ISA Trans. 2018, 81, 244–258. [Google Scholar] [CrossRef]
  36. Courbot, J.B.; Rust, E.; Monfrini, E.; Collet, C. Vertebra segmentation based on two-step refinement. J. Comput. Surg. 2016, 4, 1–22. [Google Scholar] [CrossRef] [Green Version]
  37. Rasoulian, A.; Rohling, R.; Abolmaesumi, P. Lumbar spine segmentation using a statistical multi-vertebrae anatomical shape+ pose model. IEEE Trans. Med. Imaging 2013, 32, 1890–1900. [Google Scholar] [CrossRef]
  38. Mastmeyer, A.; Engelke, K.; Fuchs, C.; Kalender, W.A. A hierarchical 3D segmentation method and the definition of vertebral body coordinate systems for QCT of the lumbar spine. Med. Image Anal. 2006, 10, 560–577. [Google Scholar] [CrossRef]
  39. Jimenez-Pastor, A.; Alberich-Bayarri, A.; Fos-Guarinos, B.; Garcia-Castro, F.; Garcia-Juan, D.; Glocker, B.; Marti-Bonmati, L. Automated vertebrae localization and identification by decision forests and image-based refinement on real-world CT data. La Radiol. Medica 2020, 125, 48–56. [Google Scholar] [CrossRef] [PubMed]
  40. Lee, J.; Kim, S.; Kim, Y.S.; Chung, W.K. Automated segmentation of the lumbar pedicle in CT images for spinal fusion surgery. IEEE Trans. Biomed. Eng. 2011, 58, 2051–2063. [Google Scholar] [CrossRef]
  41. Klinder, T.; Ostermann, J.; Ehm, M.; Franz, A.; Kneser, R.; Lorenz, C. Automated model-based vertebra detection, identification, and segmentation in CT images. Med. Image Anal. 2009, 13, 471–482. [Google Scholar] [CrossRef] [PubMed]
  42. Štern, D.; Likar, B.; Pernuš, F.; Vrtovec, T. Automated detection of spinal centrelines, vertebral bodies and intervertebral discs in CT and MR images of lumbar spine. Phys. Med. Biol. 2009, 55, 247. [Google Scholar] [CrossRef] [PubMed]
  43. Wong, A.; Mishra, A.; Fieguth, P.; Clausi, D.; Dunk, N.M.; Callaghan, J.P. Shape-guided active contour based segmentation and tracking of lumbar vertebrae in video fluoroscopy using complex wavelets. In Proceedings of the 2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Vancouver, BC, Canada, 20–25 August 2008; pp. 863–866. [Google Scholar] [CrossRef] [Green Version]
  44. Zheng, G.; Nolte, L.P.; Ferguson, S.J. Scaled, patient-specific 3D vertebral model reconstruction based on 2D lateral fluoroscopy. Int. J. Comput. Assist. Radiol. Surg. 2011, 6, 351–366. [Google Scholar] [CrossRef]
  45. Michopoulou, S.K.; Costaridou, L.; Panagiotopoulos, E.; Speller, R.; Panayiotakis, G.; Todd-Pokropek, A. Atlas-based segmentation of degenerated lumbar intervertebral discs from MR images of the spine. IEEE Trans. Biomed. Eng. 2009, 56, 2225–2231. [Google Scholar] [CrossRef] [PubMed]
  46. Fallah, F.; Walter, S.S.; Bamberg, F.; Yang, B. Simultaneous volumetric segmentation of vertebral bodies and intervertebral discs on fat-water MR images. IEEE J. Biomed. Health Inform. 2018, 23, 1692–1701. [Google Scholar] [CrossRef] [PubMed]
  47. Ghosh, S.; Chaudhary, V. Supervised methods for detection and segmentation of tissues in clinical lumbar MRI. Comput. Med. Imaging Graph. 2014, 38, 639–649. [Google Scholar] [CrossRef]
  48. Kim, S.; Bae, W.C.; Masuda, K.; Chung, C.B.; Hwang, D. Semi-automatic segmentation of vertebral bodies in MR images of human lumbar spines. Appl. Sci. 2018, 8, 1586. [Google Scholar] [CrossRef] [Green Version]
  49. Gaonkar, B.; Xia, Y.; Villaroman, D.S.; Ko, A.; Attiah, M.; Beckett, J.S.; Macyszyn, L. Multi-parameter ensemble learning for automated vertebral body segmentation in heterogeneously acquired clinical MR images. IEEE J. Transl. Eng. Health Med. 2017, 5, 1–12. [Google Scholar] [CrossRef]
  50. Gaweł, D.; Główka, P.; Kotwicki, T.; Nowak, M. Automatic spine tissue segmentation from MRI data based on cascade of boosted classifiers and active appearance model. BioMed Res. Int. 2018, 2018, 7952946. [Google Scholar] [CrossRef]
  51. Engstrom, C.M.; Fripp, J.; Jurcak, V.; Walker, D.G.; Salvado, O.; Crozier, S. Segmentation of the quadratus lumborum muscle using statistical shape modeling. J. Magn. Reson. Imaging 2011, 33, 1422–1429. [Google Scholar] [CrossRef] [PubMed]
  52. Baum, T.; Lorenz, C.; Buerger, C.; Freitag, F.; Dieckmeyer, M.; Eggers, H.; Zimmer, C.; Karampinos, D.C.; Kirschke, J.S. Automated assessment of paraspinal muscle fat composition based on the segmentation of chemical shift encoding-based water/fat-separated images. Eur. Radiol. Exp. 2018, 2, 1–5. [Google Scholar] [CrossRef]
  53. Zheng, Y.; Nixon, M.S.; Allen, R. Automated segmentation of lumbar vertebrae in digital videofluoroscopic images. IEEE Trans. Med. Imaging 2004, 23, 45–52. [Google Scholar] [CrossRef]
  54. Jurcak, V.; Fripp, J.; Engstrom, C.; Walker, D.; Salvado, O.; Ourselin, S.; Crozier, S. Automated segmentation of the quadratus lumborum muscle from magnetic resonance images using a hybrid atlas based-geodesic active contour scheme. In Proceedings of the 2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Vancouver, BC, Canada, 20–25 August 2008; pp. 867–870. [Google Scholar] [CrossRef]
  55. Fortin, M.; Omidyeganeh, M.; Battié, M.C.; Ahmad, O.; Rivaz, H. Evaluation of an automated thresholding algorithm for the quantification of paraspinal muscle composition from MRI images. Biomed. Eng. Online 2017, 16, 1–12. [Google Scholar] [CrossRef] [Green Version]
  56. Neubert, A.; Fripp, J.; Engstrom, C.; Walker, D.; Weber, M.; Schwarz, R.; Crozier, S. Three-dimensional morphological and signal intensity features for detection of intervertebral disc degeneration from magnetic resonance images. J. Am. Med. Inform. Assoc. 2013, 20, 1082–1090. [Google Scholar] [CrossRef] [Green Version]
  57. Oktay, A.B.; Akgul, Y.S. Localization of the Lumbar Discs Using Machine Learning and Exact Probabilistic Inference. In Medical Image Computing and Computer-Assisted Intervention—MICCAI 2011; Fichtinger, G., Martel, A., Peters, T., Eds.; Springer: Berlin/Heidelberg, Germany, 2011; pp. 158–165. [Google Scholar] [CrossRef] [Green Version]
  58. Castro-Mateos, I.; Hua, R.; Pozo, J.M.; Lazary, A.; Frangi, A.F. Intervertebral disc classification by its degree of degeneration from T2-weighted magnetic resonance images. Eur. Spine J. 2016, 25, 2721–2727. [Google Scholar] [CrossRef]
  59. Kim, K.B.; Park, H.J.; Song, D.H. Automatic Characterizations of Lumbar Multifidus Muscle and Intramuscular Fat with Fuzzy C-means based Quantization from Ultrasound Images. Curr. Med. Imaging 2020, 16, 592–600. [Google Scholar] [CrossRef]
  60. Lui, D.; Scharfenberger, C.; De Carvalho, D.E.; Callaghan, J.P.; Wong, A. Semi-automatic Fisher-Tippett guided active contour for lumbar multifidus muscle segmentation. In Proceedings of the 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Chicago, IL, USA, 26–30 August 2014; pp. 5530–5533. [Google Scholar] [CrossRef]
  61. Ribeiro, E.A.; Nogueira-Barbosa, M.H.; Rangayyan, R.M.; Azevedo-Marques, P.M. Detection of vertebral plateaus in lateral lumbar spinal X-ray images with Gabor filters. In Proceedings of the 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology, Buenos Aires, Argentina, 31 August–4 September 2010; pp. 4052–4055. [Google Scholar] [CrossRef]
  62. Sa, R.; Owens, W.; Wiegand, R.; Chaudhary, V. Fast scale-invariant lateral lumbar vertebrae detection and segmentation in X-ray images. In Proceedings of the 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Orlando, FL, USA, 16–20 August 2016; pp. 1054–1057. [Google Scholar] [CrossRef]
  63. Iriondo, C.; Pedoia, V.; Majumdar, S. Lumbar intervertebral disc characterization through quantitative MRI analysis: An automatic voxel-based relaxometry approach. Magn. Reson. Med. 2020, 84, 1376–1390. [Google Scholar] [CrossRef] [PubMed]
  64. Staartjes, V.E.; Seevinck, P.R.; Vandertop, W.P.; van Stralen, M.; Schröder, M.L. Magnetic resonance imaging–based synthetic computed tomography of the lumbar spine for surgical planning: A clinical proof-of-concept. Neurosurg. Focus 2021, 50, E13. [Google Scholar] [CrossRef] [PubMed]
  65. Lee, J.H.; Han, I.H.; Kim, D.H.; Yu, S.; Lee, I.S.; Song, Y.S.; Joo, S.; Jin, C.B.; Kim, H. Spine computed tomography to magnetic resonance image synthesis using generative adversarial networks: A preliminary study. J. Korean Neurosurg. Soc. 2020, 63, 386. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  66. Fan, G.; Liu, H.; Wang, D.; Feng, C.; Li, Y.; Yin, B.; Zhou, Z.; Gu, X.; Zhang, H.; Lu, Y.; et al. Deep learning-based lumbosacral reconstruction for difficulty prediction of percutaneous endoscopic transforaminal discectomy at L5/S1 level: A retrospective cohort study. Int. J. Surg. 2020, 82, 162–169. [Google Scholar] [CrossRef] [PubMed]
  67. Malinda, V.; Lee, D. Lumbar Vertebrae Synthetic Segmentation in Computed Tomography Images Using Hybrid Deep Generative Adversarial Networks. In Proceedings of the 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Montreal, QC, Canada, 20–24 July 2020; pp. 1327–1330. [Google Scholar] [CrossRef]
  68. Siemionow, K.; Luciano, C.; Forsthoefel, C.; Aydogmus, S. Autonomous image segmentation and identification of anatomical landmarks from lumbar spine intraoperative computed tomography scans using machine learning: A validation study. J. Craniovertebral Junction Spine 2020, 11, 99. [Google Scholar] [CrossRef] [PubMed]
  69. Netherton, T.J.; Rhee, D.J.; Cardenas, C.E.; Chung, C.; Klopp, A.H.; Peterson, C.B.; Howell, R.M.; Balter, P.A.; Court, L.E. Evaluation of a multiview architecture for automatic vertebral labeling of palliative radiotherapy simulation CT images. Med. Phys. 2020, 47, 5592. [Google Scholar] [CrossRef]
  70. Watanabe, K.; Aoki, Y.; Matsumoto, M. An application of artificial intelligence to diagnostic imaging of spine disease: Estimating spinal alignment from moire images. Neurospine 2019, 16, 697. [Google Scholar] [CrossRef]
  71. Kim, S.; Bae, W.C.; Masuda, K.; Chung, C.B.; Hwang, D. Fine-grain segmentation of the intervertebral discs from MR spine images using deep convolutional neural networks: BSU-Net. Appl. Sci. 2018, 8, 1656. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  72. Shen, H.; Huang, J.; Zheng, Q.; Zhu, Z.; Lv, X.; Liu, Y.; Wang, Y. A Deep-Learning–Based, Fully Automated Program to Segment and Quantify Major Spinal Components on Axial Lumbar Spine Magnetic Resonance Imaging. Phys. Ther. 2021, 101, pzab041. [Google Scholar] [CrossRef]
  73. Gaonkar, B.; Villaroman, D.; Beckett, J.; Ahn, C.; Attiah, M.; Babayan, D.; Villablanca, J.; Salamon, N.; Bui, A.; Macyszyn, L. Quantitative Analysis of Spinal Canal Areas in the Lumbar Spine: An Imaging Informatics and Machine Learning Study. Am. J. Neuroradiol. 2019, 40, 1586–1591. [Google Scholar] [CrossRef] [PubMed]
  74. Huang, J.; Shen, H.; Wu, J.; Hu, X.; Zhu, Z.; Lv, X.; Liu, Y.; Wang, Y. Spine Explorer: A deep learning based fully automated program for efficient and reliable quantifications of the vertebrae and discs on sagittal lumbar spine MR images. Spine J. 2020, 20, 590–599. [Google Scholar] [CrossRef]
  75. Li, H.; Luo, H.; Huan, W.; Shi, Z.; Yan, C.; Wang, L.; Mu, Y.; Liu, Y. Automatic lumbar spinal MRI image segmentation with a multi-scale attention network. Neural Comput. Appl. 2021, 1–14. [Google Scholar] [CrossRef]
  76. Li, H.; Luo, H.; Liu, Y. Paraspinal muscle segmentation based on deep neural network. Sensors 2019, 19, 2650. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  77. Zhou, J.; Damasceno, P.F.; Chachad, R.; Cheung, J.R.; Ballatori, A.; Lotz, J.C.; Lazar, A.A.; Link, T.M.; Fields, A.J.; Krug, R. Automatic Vertebral Body Segmentation Based on Deep Learning of Dixon Images for Bone Marrow Fat Fraction Quantification. Front. Endocrinol. 2020, 11, 612. [Google Scholar] [CrossRef]
  78. Jamaludin, A.; Lootus, M.; Kadir, T.; Zisserman, A.; Urban, J.; Battié, M.C.; Fairbank, J.; McCall, I. ISSLS PRIZE IN BIOENGINEERING SCIENCE 2017: Automation of reading of radiological features from magnetic resonance images (MRIs) of the lumbar spine without human intervention is comparable with an expert radiologist. Eur. Spine J. 2017, 26, 1374–1383. [Google Scholar] [CrossRef] [Green Version]
  79. Natalia, F.; Meidia, H.; Afriliana, N.; Young, J.C.; Yunus, R.E.; Al-Jumaily, M.; Al-Kafri, A.; Sudirman, S. Automated measurement of anteroposterior diameter and foraminal widths in MRI images for lumbar spinal stenosis diagnosis. PLoS ONE 2020, 15, e0241309. [Google Scholar] [CrossRef]
  80. Zhou, Y.; Liu, Y.; Chen, Q.; Gu, G.; Sui, X. Automatic lumbar MRI detection and identification based on deep learning. J. Digit. Imaging 2019, 32, 513–520. [Google Scholar] [CrossRef]
  81. Forsberg, D.; Sjöblom, E.; Sunshine, J.L. Detection and labeling of vertebrae in MR images using deep learning with clinical annotations as training data. J. Digit. Imaging 2017, 30, 406–412. [Google Scholar] [CrossRef] [PubMed]
  82. Baka, N.; Leenstra, S.; van Walsum, T. Ultrasound aided vertebral level localization for lumbar surgery. IEEE Trans. Med. Imaging 2017, 36, 2138–2147. [Google Scholar] [CrossRef] [PubMed]
  83. Cho, B.H.; Kaji, D.; Cheung, Z.B.; Ye, I.B.; Tang, R.; Ahn, A.; Carrillo, O.; Schwartz, J.T.; Valliani, A.A.; Oermann, E.K.; et al. Automated measurement of lumbar lordosis on radiographs using machine learning and computer vision. Glob. Spine J. 2020, 10, 611–618. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  84. Li, Y.; Liang, W.; Zhang, Y.; An, H.; Tan, J. Automatic lumbar vertebrae detection based on feature fusion deep learning for partial occluded C-arm X-ray images. In Proceedings of the 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Orlando, FL, USA, 16–20 August 2016; pp. 647–650. [Google Scholar] [CrossRef]
  85. Sa, R.; Owens, W.; Wiegand, R.; Studin, M.; Capoferri, D.; Barooha, K.; Greaux, A.; Rattray, R.; Hutton, A.; Cintineo, J.; et al. Intervertebral disc detection in X-ray images using faster R-CNN. In Proceedings of the 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Jeju, Korea, 11–15 July 2017; pp. 564–567. [Google Scholar] [CrossRef]
  86. Liu, X.; Deng, Z.; Yang, Y. Recent progress in semantic image segmentation. Artif. Intell. Rev. 2019, 52, 1089–1106. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Schematic partitioning of the works concerning the application of AI in LBP.
Figure 1. Schematic partitioning of the works concerning the application of AI in LBP.
Ijerph 18 10909 g001
Figure 2. Summary of the methodological quality of included studies regarding the 4 domains assessing the risk of bias (A) and the 3 domains assessing applicability concerns (B) of the QUADAS-2 score. The portion of studies with a low risk of bias are highlighted in green, studies with an unclear risk of bias are depicted in blue and studies with a high risk of bias are represented in orange.
Figure 2. Summary of the methodological quality of included studies regarding the 4 domains assessing the risk of bias (A) and the 3 domains assessing applicability concerns (B) of the QUADAS-2 score. The portion of studies with a low risk of bias are highlighted in green, studies with an unclear risk of bias are depicted in blue and studies with a high risk of bias are represented in orange.
Ijerph 18 10909 g002
Figure 3. Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) flow diagram.
Figure 3. Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) flow diagram.
Ijerph 18 10909 g003
Figure 4. Boxplot summarizing the results for different structures and tasks. The three left columns refer to the DICE scores for the segmentation of IVDs, vertebrae and muscles; the right column refers to the identification accuracy for different structures.
Figure 4. Boxplot summarizing the results for different structures and tasks. The three left columns refer to the DICE scores for the segmentation of IVDs, vertebrae and muscles; the right column refers to the identification accuracy for different structures.
Ijerph 18 10909 g004
Table 1. Feature extraction. For each work, it is reported whether or not other tasks are performed following feature extraction. The reported results are related to the task following feature extraction. Abbreviations are used for Magnetic Resonance Imaging (MRI), Low Back Pain (LBP), Accuracy (Acc), Mean Absolute Error (MAE), Machine Learning (ML), Support Vector Machine (SVM).
Table 1. Feature extraction. For each work, it is reported whether or not other tasks are performed following feature extraction. The reported results are related to the task following feature extraction. Abbreviations are used for Magnetic Resonance Imaging (MRI), Low Back Pain (LBP), Accuracy (Acc), Mean Absolute Error (MAE), Machine Learning (ML), Support Vector Machine (SVM).
Author/YearMain TaskData Type# PatientsStructures InvolvedResultsModel
Adankon, 2012 [19]Feature Extraction and Classification3D image of the back surface165VertebraeAcc = 95%Local Geometric Descriptors and SVM
Castro-Mateos, 2014 [20]Feature Extraction and Segmentation3D MRI59DiscsDICE = 88.4%Statistical shape model space and B-Spline space
Raudner, 2020 [21]Feature ExtractionMRI58Discs/GRAPPATINI
Abdollah, 2020 [22]Feature ExtractionMRI28Discs, Vertebrae/Random Forest and texture analysis
Yang 2020 [8]Feature Extraction and ClassificationMRI109DiscsAcc = 88.3%Gabor wavelet transformation and KLT feature tracker
Ruiz-España, 2015 [23]Feature Extraction and ClassificationMRI67DiscsAcc > 90%Gradient Vector Flow, several ML models
Ketola, 2020 [24]Feature Extraction and ClassificationMRI518LBPAcc = 83%Texture feature extraction and Logistic Regression
Garcia-Cano, 2018 [10]Feature Extraction and RegressionX-rays150VertebraeCobb angle MAE = 4.79°Independent component analysis and Random Forest
Table 2. Segmentation—Digital Image Processing. For each work, the main task is reported, whether it concerns only the segmentation of lumbar components, or if it aims to localize specific parts (e.g., the center of mass) of the components, or if it aims to identify each component (e.g., differentiating vertebrae between each other). If more structures are investigated, the correspondent results are reported in the same order in which structures are presented in the column “Structures involved”. Abbreviations are used for Magnetic Resonance Imaging (MRI), Computed Tomography (CT), Accuracy (Acc), Sensitivity (Sen), Specificity (Spec), Area Under the Curve (AUC), Gradient Vector Flow (GVF), Support Vector Machine (SVM).
Table 2. Segmentation—Digital Image Processing. For each work, the main task is reported, whether it concerns only the segmentation of lumbar components, or if it aims to localize specific parts (e.g., the center of mass) of the components, or if it aims to identify each component (e.g., differentiating vertebrae between each other). If more structures are investigated, the correspondent results are reported in the same order in which structures are presented in the column “Structures involved”. Abbreviations are used for Magnetic Resonance Imaging (MRI), Computed Tomography (CT), Accuracy (Acc), Sensitivity (Sen), Specificity (Spec), Area Under the Curve (AUC), Gradient Vector Flow (GVF), Support Vector Machine (SVM).
Author/YearMain TaskData Type# PatientsStructures InvolvedResultsModel
Haq, 2015 [26]Segmentation3D MRI21DiscsDICE = 91.7%Shape-aware models
Neubert, 2012 [27]Segmentation and Identification3D MRI28Discs and VertebraeDICE = 89 and 91%, Sen = 100%, Spec = 98%Statistical shape model
Haq, 2020 [25]SegmentationCT images18 SpineWebDiscsDICE = from 91,7 to 95,4%Shape statistics deformable model
Li, 2018 [28]SegmentationCT images115 (Microsoft R.+ SpineWeb)VertebraeDICE = 92.1%Gaussian Mixture Model + threshold
Ibragimov, 2017 [29]SegmentationCT images30 vertebraeVertebraeDICE = 84.7%Landmark detection and deformable models
Yu, 2018 [30]SegmentationCT images21 imagesVertebraeDICE = 93.9%Bone-sheetness assisted grid cut
Korez, 2015 [31]SegmentationCT images220VertebraeDICE = 94.6%Shape-constrained deformable model
Al-Helo, 2011 [32]SegmentationCT images50VertebraeVisual evaluationActive shape models and GVF-snake
Ruiz-España, 2015 [33]SegmentationCT images10VertebraeDICE = 95%Selective Binary Gaussian Filtering Regularized Level Set
Huang, 2013 [34]SegmentationCT images56VertebraeDICE = 94%Otsu thresholding, edge- and region-based level set
Mahdy, 2018 [35]Segmentation and LocalizationCT images10VertebraeVisual evaluationThreshold and adaptive K-Means
Courbot, 2016 [36]LocalizationCT images15VertebraeVisual evaluation, Acc = 89.4%Hidden Markov Chain segmentation
Rasoulian, 2013 [37]LocalizationCT images32VertebraeVisual evaluation, Center of mass MAE = 2mmMulti-object shape model
Mastmeyer, 2006 [38]SegmentationCT images41VertebraeDICE > 98.6%Volume growing and morphological operations
Jimenez-Pastor, 2020 [39]Localization and IdentificationCT images272 imagesVertebraeLocalization error = 13.7mm, Acc = 74,8%Decision forest + morphological image processing
Lee, 2011 [40]Localization and IdentificationCT images19VertebraeLocalization error = 0.14mm, Acc = 93.2%Threshold and thinning-based integrated cost
Klinder, 2009 [41]Localization and IdentificationCT images64VertebraeLocalization error = 1.1mm, Acc = 92%Triangulated shape models
Štern, 2009 [42]LocalizationMRI and CT images13 and 29 imagesDiscs and VertebraeLocalization error = 2.8 and 1.8 mmAnalysis of the geometry of spinal structures
Wong, 2008 [43]Segmentation and TrackingFluoroscopic images2 videosVertebraeVisual evaluationWavelet and shape-active contour based
Zheng, 2011 [44]Segmentation and 3D reconstructionFluoroscopic images4VertebraeMean reconstruction error<1.6mmStatistical shape models
Michopoulou, 2009 [45]SegmentationMRI34DiscsDICE = 90%Atlas-robust-fuzzy C-Means
Fallah, 2018 [46]SegmentationMRI50Discs and VertebraeDICE = 92.5 and 91.4%Hierarchical conditional random field and Random Forest
Ghosh, 2014 [47]SegmentationMRI212Discs and VertebraeDICE = 87 and 84%Random Forest and context features
Kim, 2018 [48]SegmentationMRI19VertebraeDICE = 90%Graph-based and line-based segmentation algorithms
Gaonkar, 2017 [49]SegmentationMRI63VertebraeDICE = 83%Multi-parametric ensemble
Gawel, 2018 [50]SegmentationMRI50VertebraeDICE = 91.4%Cascade classifier and Active Appearance Model
Engstrom, 2011 [51]SegmentationMRI20MusclesDICE = 87%Statistical shape model
Baum, 2018 [52]SegmentationMRI10MusclesDICE = 83%Average shape model and dual feature model
Zheng, 2004 [53]SegmentationFluoroscopic images1VertebraeVisual evaluationHough transform and Fourier descriptors
Jurcak, 2008 [54]SegmentationMRI20MusclesDICE = 77%Probabilistic atlases and geodesic active contours
Fortin, 2017 [55]Segmentation and RegressionMRI30MusclesReliability coefficient = 97-99%Threshold
Neubert, 2013 [56]Segmentation and LocalizationMRI44DiscsDICE = 92.3%, AUC = 0.98Active shape model, Linear Discriminant Analysis, SVM
Oktay, 2011 [57]Localization and IdentificationMRI40DiscsLocalization rate = 95.4%, Acc = 97%Probabilistic model and SVM
Castro-Mateos, 2016 [58]IdentificationMRI48DiscsSensitivity = 87%Active contour model and Feedforward NN
Kim, 2020 [59]LocalizationUltrasound50Muscles2mm discrepancyFuzzy C-Means Clustering
Lui, 2014 [60]LocalizationUltrasound10MusclesF1-Score = 90.9%Decoupled Active Contour
Ribeiro, 2010 [61]SegmentationX-rays41VertebraeDICE = 91.7%Gabor Filters and NN
Sa, 2016 [62]LocalizationX-rays30VertebraeTrue Positive Rate = 75%GVF-snake and SVM
Table 3. Segmentation—Deep Learning. For each work, the main task is reported, whether it concerns only the segmentation of lumbar components, or if it aims to localize specific parts (e.g., the center of mass) of the components, or if it aims to identify each component (e.g., differentiating vertebrae between each other). If more structures are investigated, the correspondent results are reported following the same order by which structures are presented in the “Structures involved” column. Abbreviations are used for Magnetic Resonance Imaging (MRI), Computed Tomography (CT), Mean Absolute Error (MAE), Accuracy (Acc), Convolutional Neural Network (CNN), Support Vector Machine (SVM), Regression Trees (RT).
Table 3. Segmentation—Deep Learning. For each work, the main task is reported, whether it concerns only the segmentation of lumbar components, or if it aims to localize specific parts (e.g., the center of mass) of the components, or if it aims to identify each component (e.g., differentiating vertebrae between each other). If more structures are investigated, the correspondent results are reported following the same order by which structures are presented in the “Structures involved” column. Abbreviations are used for Magnetic Resonance Imaging (MRI), Computed Tomography (CT), Mean Absolute Error (MAE), Accuracy (Acc), Convolutional Neural Network (CNN), Support Vector Machine (SVM), Regression Trees (RT).
Author/YearMain TaskData Type# PatientsStructures InvolvedResultsModel
Iriondo, 2020 [63]Segmentation3D MRI31DiscsDICE > 85%Coarse-to-fine context memory NN
Staartjes, 2021 [64]Segmentation and Reconstruction3D MRI3All structuresVisual evaluationCNN
Lee, 2020 [65]Segmentation and ReconstructionCT images280 imagesAll structuresMAE = 21 pixelsGenerative Adversarial Networks
Fan, 2020 [66]Segmentation and ReconstructionCT images108All structuresKambin triangle = 161 mm 2 U-net
Malinda, 2020[67]SegmentationCT images120VertebraeDICE = 94.2%Generative Adversarial Networks
Siemionow, 2020 [68]IdentificationCT images45VertebraeAcc = 96 to 99%CNN
Netherton, 2020 [69]Localization and IdentificationCT images330 imagesVertebraeLocalization error = 2.2 mm, Acc = 94%X-net ensemble
Watanabe 2019 [70]RegressionMoire images + X-rays1996VertebraeCobb angle MAE = 3.42°CNN
Kim, 2018 [71]SegmentationMRISpineWeb 20DiscsDICE = 89.4%CNN (BSU-net)
Shen, 2021 [72]SegmentationMRI120Discs, Spinal canal and MusclesJaccard: 87, 82 and 85%Feedforward NN
Gaonkar, 2019 [73]SegmentationMRI39295Discs and Spinal canalDICE = 88 and 87%Discs: U-net, Canal: SVM and RT
Huang, 2020 [74]SegmentationMRI100Discs and VertebraeJaccard = 92.6 and 94.7%U-net
Li, 2021 [75]SegmentationMRI120Vertebrae and Spinal canalDICE = 92.5%CNN
Li, 2019 [76]SegmentationMRI120MusclesDICE > 91.3%Deformed U-net
Zhou, 2020 [77]SegmentationMRI57VertebraeDICE = 84.9%U-net
Jamaludin, 2017 [78]ClassificationMRI2009Discs and VertebraeAcc = 95.6%CNN
Natalia, 2020 [79]RegressionMRI515Discs and Spinal canalMean error: 0.9 mmSegNet and Contour Evolution Algorithm
Zhou, 2019 [80]IdentificationMRI1318VertebraeAcc = 98.9%CNN
Forsberg, 2017 [81]IdentificationMRI with clinical notes475VertebraeAcc = 97%CNN and parts-based graphical models
Baka, 2017 [82]IdentificationUltrasound19 data setsVertebraeAcc = 92%CNN and matching strategy
Cho, 2020 [83]Segmentation and RegressionX-rays629VertebraeDICE = 82.1%, MAE = 8,055°U-net
Li, 2016 [84]IdentificationX-rays110VertebraeAcc = 80.4%CNN
Sa, 2017 [85]LocalizationX-rays1081 imagesDiscsPrecision = 90.5%Faster R-CNN
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

D’Antoni, F.; Russo, F.; Ambrosio, L.; Vollero, L.; Vadalà, G.; Merone, M.; Papalia, R.; Denaro, V. Artificial Intelligence and Computer Vision in Low Back Pain: A Systematic Review. Int. J. Environ. Res. Public Health 2021, 18, 10909. https://doi.org/10.3390/ijerph182010909

AMA Style

D’Antoni F, Russo F, Ambrosio L, Vollero L, Vadalà G, Merone M, Papalia R, Denaro V. Artificial Intelligence and Computer Vision in Low Back Pain: A Systematic Review. International Journal of Environmental Research and Public Health. 2021; 18(20):10909. https://doi.org/10.3390/ijerph182010909

Chicago/Turabian Style

D’Antoni, Federico, Fabrizio Russo, Luca Ambrosio, Luca Vollero, Gianluca Vadalà, Mario Merone, Rocco Papalia, and Vincenzo Denaro. 2021. "Artificial Intelligence and Computer Vision in Low Back Pain: A Systematic Review" International Journal of Environmental Research and Public Health 18, no. 20: 10909. https://doi.org/10.3390/ijerph182010909

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop