Artificial Intelligence and Its Impact on the Management of Lumbar Degenerative Pathology: A Narrative Review

Trento, Alessandro; Rapisarda, Salvatore; Bresolin, Nicola; Valenti, Andrea; Giordan, Enrico

doi:10.3390/medicina61081400

Open AccessReview

Artificial Intelligence and Its Impact on the Management of Lumbar Degenerative Pathology: A Narrative Review

by

Alessandro Trento

¹

,

Salvatore Rapisarda

²,

Nicola Bresolin

²,

Andrea Valenti

² and

Enrico Giordan

^3,*

¹

Department of Neuroscience, University of Verona, 37126 Verona, Italy

²

Department of Neuroscience, University of Padua, 35128 Padua, Italy

³

Neurosurgical Department, Aulss2 Marca Trevigiana, 31100 Treviso, Italy

^*

Author to whom correspondence should be addressed.

Medicina 2025, 61(8), 1400; https://doi.org/10.3390/medicina61081400

Submission received: 9 June 2025 / Revised: 20 July 2025 / Accepted: 23 July 2025 / Published: 1 August 2025

(This article belongs to the Special Issue Clinical Applications of Modern Technologies in Neurosurgery and Spine Surgery)

Download

Browse Figure

Review Reports Versions Notes

Abstract

In this narrative review, we explore the role of artificial intelligence (AI) in managing lumbar degenerative conditions, a topic that has recently garnered significant interest. The use of AI-based solutions in spine surgery is particularly appealing due to its potential applications in preoperative planning and outcome prediction. This study aims to clarify the impact of artificial intelligence models on the diagnosis and prognosis of common types of degenerative conditions: lumbar disc herniation, spinal stenosis, and eventually spinal fusion. Additionally, the study seeks to identify predictive factors for lumbar fusion surgery based on a review of the literature from the past 10 years. From the literature search, 96 articles were examined. The literature on this topic appears to be consistent, describing various models that show promising results, particularly in predicting outcomes. However, most studies adopt a retrospective approach and often lack detailed information about imaging features, intraoperative findings, and postoperative functional metrics. Additionally, the predictive performance of these models varies significantly, and few studies include external validation. The application of artificial intelligence in treating degenerative spine conditions, while valid and promising, is still in a developmental phase. However, over the last decade, there has been an exponential growth in studies related to this subject, which is beginning to pave the way for its systematic use in clinical practice.

Keywords:

artificial intelligence; AI; lumbar; surgery; degenerative; spine

1. Introduction

Degenerative conditions of the lumbar spine are among the leading causes of pain and disability in the adult population. Approximately 266 million people (3.6% of the global population) suffer from low back pain associated with arthritic changes in the lumbar spine [1]. These conditions negatively affect quality of life by causing significant functional limitations in daily activities. They primarily include degenerative disc disease, spinal canal stenosis, and herniated discs. They are often associated with radiculopathy, back pain, loss of function, and reduction in lumbar lordosis.

Nonoperative strategies, such as percutaneous steroid injections and physical therapy, are typically effective as first-line treatments and are often combined with physiotherapy. In cases of persistent pain or neurological deficits, surgical interventions—such as discectomy, laminectomy, or spinal fusion—are preferred and considered definitive.

In recent years, applications of artificial intelligence (AI) have expanded rapidly in the healthcare sector, offering the potential to revolutionize the field by enhancing diagnostic accuracy and predicting clinical outcomes [2,3]. This trend extends to spine-related pathologies, both in diagnosis and treatment. The objective of this review is to collect, analyze, and highlight the role of AI in managing degenerative lumbar spine conditions, with a focus on the diagnosis and prognosis of lumbar disc herniation and spinal stenosis, as well as predictive factors for lumbar fusion surgery.

2. Results and Discussion

2.1. Study Characteristics

The PubMed, Scopus, and Web of Science databases were searched for articles published from May 2015 to May 2025 using the following keywords: “lumbar disc herniation AND artificial intelligence”, “lumbar stenosis AND artificial intelligence”, and “lumbar fusion surgery AND artificial intelligence”. Eligible studies included English-language articles.

Four authors (N.B., A.V., S.R., and A.T.) conducted the bibliographic and review search. A total of 331 articles were initially identified. Following a screening process based on titles and abstracts and subsequent full-text evaluation, 96 studies were deemed eligible for inclusion.

A summary of AI’s evolution over the past decade, based on our research, is shown in Figure 1.

To avoid confusion, the authors defined several terms related to the broad umbrella of artificial intelligence (AI) prior to conducting the search.

Machine learning (ML) is a subset of AI that focuses on optimization. When implemented correctly, it enables predictions that minimize errors compared to simple guessing. For example, companies use ML to recommend products to customers based on their prior browsing and purchasing behavior.

Deep learning (DL), a further subset of ML, differs primarily in how it learns and the volume of data it requires. DL automates much of the feature extraction process, reducing the need for manual human input. It also leverages large datasets, enhancing its predictive capabilities.

To effectively compare different predictive models, it is essential to understand that the area under the curve (AUC) serves as a key metric for evaluating model performance. AUC quantifies how well an AI model distinguishes between patients with specific characteristics and those without, using a probabilistic framework. An AUC score between 0.7 and 0.8 indicates acceptable performance; scores from 0.8 to 0.9 are considered good; and scores above 0.9 are excellent. In contrast, scores near 0.5 suggest performance equivalent to random chance.

2.2. AI for Lumbar Spinal Stenosis

Lumbar spinal stenosis (LSS) is one of the most common spinal conditions affecting adults, particularly the elderly. The clinical presentation typically includes low back pain radiating to the lower extremities, often accompanied by numbness. Symptoms usually worsen with walking, leading to significant limitations in daily activities [4].

2.2.1. Diagnosis

In recent years, ML and DL models have been extensively developed for the diagnosis and classification of lumbar spinal stenosis. Most studies have focused on magnetic resonance imaging (MRI), the gold-standard diagnostic modality due to its detailed assessment of the central canal, lateral recesses, and foramina, along with its excellent soft tissue contrast [5].

Although MRI provides valuable diagnostic information, executing and interpreting scans is time-consuming and heavily dependent on the radiologist’s expertise. Currently, no automatic quantitative criteria exist for diagnosing lumbar spinal stenosis [6]. Therefore, automated grading systems are warranted to reduce radiologists’ workloads and improve diagnostic accuracy.

One of the first AI models, developed by Jamaludin et al. [7], used a multitask architecture to classify degenerative conditions of the lumbar spine, including central canal stenosis. The condition was assessed in a binary fashion (present or absent), and only sagittal images were used.

Another notable early development was Deep Spine, created by Lu and colleagues [8], which evaluated and classified central canal and foraminal stenosis using both axial and sagittal images. This system was based on a weakly supervised interpretation of radiology reports. More recently, Hallinan et al. [9] developed a DL model to automatically detect and classify central canal, lateral recess, and neural foramina stenosis. Their study involved 446 patients and employed 12,403 axial T2-weighted images along with 6161 sagittal T1-weighted images for training, validation, and testing. The model showed strong agreement with radiologists in the dichotomous classification of central canal and lateral recess stenosis (normal or mild vs. moderate or severe). Nonetheless, the level of agreement was slightly lower for neural foraminal stenosis.

However, these studies did not provide quantitative measurements for stenosis classification. A model developed by Bharadwaj and colleagues [10] aimed to address this limitation by integrating a decision tree classifier that used quantitative measurements of the cross-sectional areas of the dural sac and intervertebral discs. This model also assessed facet arthropathy, an important contributor to lower back pain. While the binary classification of central canal stenosis, neural foramina stenosis, and facet arthropathy demonstrated accuracy, the study had notable limitations, including a small sample size (200 patients) and the lack of an external model of validation.

Furthermore, Van der Graaf et al. [11] developed an AI-based model for classifying central canal stenosis that automatically extracted the cross-sectional area, anteroposterior diameter of the dural sac, and cerebrospinal fluid (CSF) signal loss [12]. Their algorithm, based on the Lee classification [13] and using only sagittal images, achieved a sensitivity of 93% and a specificity of 91%. These results were comparable to assessments made by two expert radiologists.

Computed tomography (CT) imaging is generally not preferred for evaluating lumbar spinal stenosis due to increased noise from the bony structures surrounding the spinal canal. However, CT provides better delineation of the ligamentum flavum compared to MRI [14]. Based on this observation, Miyo et al. [15] applied a deep learning reconstruction method to 30 lumbar CT scans and compared the results to those obtained using hybrid iterative reconstruction, an older technique for enhancing CT image quality. The authors reported improved quantitative image noise and better interobserver agreement regarding the degree of lumbar spinal stenosis.

Artificial intelligence solutions can also be applied to simpler diagnostic tools. For example, Kim et al. [16] developed a deep learning algorithm to diagnose central lumbar spinal stenosis using radiographs. The study included 2303 patients with severe central canal stenosis confirmed by MRI and 2341 controls. Lateral lumbar radiographs in neutral, flexion, and extension positions—comprising 6325 images in the stenosis group and 6117 in the control group—were analyzed for disc height, intervertebral foramen height, pedicle length, and facet joint hypertrophy. Among the models trained, one achieved an area under the ROC curve (AUC) of 90%, with an accuracy of 81.8%, sensitivity of 85.9%, and specificity of 77.8% in the neutral position. These findings suggest that artificial intelligence may enable the use of simple and cost-effective diagnostic tools, such as a radiograph, to identify lumbar spinal stenosis.

It is important to note that the clinical presentation of lumbar spinal stenosis may not always align with radiological findings [17]. Therefore, self-reported questionnaires have been validated to assist in the symptomatic diagnosis of lumbar spinal stenosis [18]. Abel and colleagues [19] developed several ML models to identify lumbar spinal stenosis based on a 26-question survey that assessed pain severity and type, activities limited by pain, motor impairment, and overall physical and mental health. The best-performing model achieved an area under the curve (AUC) of 96%, with a sensitivity of 94% and a specificity of 88% in classifying patients with or without lumbar spinal stenosis. These results demonstrate the potential of AI to support diagnosis using a simple clinical assessment tool.

2.2.2. Treatment

Machine learning algorithms can also enhance the surgical management of lumbar spinal stenosis by facilitating the preoperative identification of patients who may benefit from surgery. In 2019, Siccoli et al. [20] developed a model based on 15 variables, including outcome measures such as the Numeric Rating Scale (NRS) for back and leg pain and the Oswestry Disability Index (ODI), as well as demographic characteristics such as age, sex, and body mass index (BMI). These data were extracted from 635 patients who underwent lumbar decompression. Clinical success was defined as achieving the minimum clinically important difference (MCID), characterized by an improvement in ODI or NRS of ≥30%. The model demonstrated the feasibility of predicting MCID at both 6 weeks and 12 months postoperatively, with prediction accuracies for NRS and ODI ranging from 62% to 85% and AUC values as high as 0.92 (for back pain NRS at 6 weeks).

More recently, Wilson and colleagues [21] developed an ML model to predict the need for surgery using axial T2-weighted MRI scans. Their study included 80 patients who underwent decompression surgery and 60 controls. As a measure of spinal stenosis, the authors considered the maximum percentage reduction in the spinal canal area on lumbar MRI, corresponding to the most compressed level. The model demonstrated high predictive accuracy, with an AUC greater than 0.88, in identifying patients who would undergo subsequent spinal decompression.

One potential drawback of current machine learning approaches is the high discrepancy between clinical symptoms and radiological findings of degenerative changes [22]. To address this limitation and predict the need for lumbar decompression based on both clinical and radiological data, Mourad et al. [23] developed a novel hybrid model using 500 medical vignettes. Each vignette included 36 variables representing clinical symptoms, MRI features, and demographic factors. The model was constructed using a weighted average between a Bayesian network, which incorporated expert opinion, and a machine learning model trained on the same 36-variable dataset. In a dichotomous classification framework (weak recommendation vs. strong recommendation for surgery), the hybrid model outperformed the recommendations of five individual experts, achieving an AUC of 0.92 compared to 0.84. The authors also developed a separate ML model using the same dataset, which achieved comparable accuracy [24].

2.2.3. Prognosis

AI models have applications in the postoperative care of patients undergoing lumbar decompression surgery. Patients often experience prolonged hospital stays due to difficulties with mobilization, which can negatively impact both healthcare costs and patient autonomy [25]. Addressing this issue, Ogink et al. [26] evaluated four ML algorithms to predict discharge destinations—either rehabilitation or nursing facilities—after lumbar decompression. Their dataset included 28,600 patients from the American College of Surgeons National Surgical Quality Improvement Program. The variables analyzed were age, body mass index (BMI), American Society of Anesthesiologists (ASA) classification, functional status, number of surgical levels, fusion, diabetes, preoperative hematocrit, and serum creatinine. The neural network model demonstrated promising accuracy (AUC = 0.75) in distinguishing patients discharged home from those requiring rehabilitation. Similarly, Saravi and colleagues [27] applied various algorithms to a retrospective cohort, considering clinical, demographic, and surgical variables such as sex, age, BMI, operation time, and C-reactive protein (CRP) levels, to predict hospital length of stay after lumbar decompression. One of the deep learning algorithms they tested achieved excellent accuracy (AUC = 0.87) in predicting prolonged hospital stays.

2.3. AI for Lumbar Disc Herniation

Lumbar disc herniation with radiculopathy is a common and debilitating spinal disorder that requires a multidisciplinary approach, in which artificial intelligence (AI) shows significant potential.

2.3.1. Diagnosis

Several DL models have demonstrated high performance in identifying and classifying disc pathologies using MRI and computed tomography (CT), thereby reducing diagnostic subjectivity [28,29,30,31,32]. Notably, Xu et al. [33] developed a DL model capable of simultaneously localizing and classifying lumbar disc herniation. The model analyzed axial T2-weighted MR sequences from 1496 patients, and its performance was compared with that of three spinal surgeons. The algorithm achieved diagnostic performance comparable to that of the experts, demonstrating reasonable accuracy (sensitivity: 87.0% for grading and 84.0% for localization; specificity: 95.5–94.4%, respectively). Despite its promise, the model’s performance declined significantly during external testing.

However, surgical selection for lumbar disc herniation relies not solely on imaging findings but primarily on clinical–radiological correlation. Staartjes et al. [34] developed a machine learning classification system that correlates radiological features with specific pain generators in lumbar degenerative pathology. Their study included 262 surgical candidates diagnosed with lumbar disc herniation, lumbar spinal stenosis, or discogenic chronic low back pain, all of whom underwent the five-repetition sit-to-stand test (5R-STS). By incorporating the test results alongside the patient’s age, gender, height, and weight, the model distinguished among the three pathologies with an accuracy of 96.2%, demonstrating excellent discrimination. Nevertheless, the model’s clinical applicability remains limited due to a lack of external validation.

2.3.2. Treatment

Beyond diagnosis, AI technologies have also demonstrated significant potential to enhance preoperative planning, intraoperative support, and procedural precision in spinal surgery. Zhu et al. [35] developed an AI-based three-dimensional (3D) model of the lumbosacral region to improve personalized trajectory planning for percutaneous lumbar endoscopic discectomy at the L5–S1 level. This reconstruction achieved accuracy comparable to that of a manual-segmentation-based 3D model in depicting the L5 and S1 vertebrae, the L5–S1 disc, the lumbosacral nerve roots, the iliac bone, and the skin. This region is of particular interest due to the anatomical challenge posed by the iliac crest [36].

Similarly, Yamada and colleagues [37] applied an AI-enhanced MRI 3D model to identify patients with an L5–S1 disc herniation suitable for endoscopic transforaminal removal. Among the 52 cases analyzed, 13 were deemed operable. All endoscopic surgeries proceeded without complications and resulted in positive outcomes, notably pain reduction.

AI solutions can also be applied to intraoperative imaging. Cui et al. [38] collected 65 videos of endoscopic transforaminal discectomy, extracting over 10,000 images. They then trained an ML detection system to identify the dura mater and nerve roots from this dataset. The algorithm demonstrated excellent performance, achieving a sensitivity of 90.9%, specificity of 93.7%, and accuracy of 92.3%. This performance matched that of an expert spinal endoscopist and significantly exceeded that of less experienced surgeons.

2.3.3. Prognosis

AI algorithms show promise in predicting surgical outcomes. ML models have demonstrated high accuracy in forecasting patient-specific results, thereby enabling more personalized treatment planning. Yen et al. [39] developed a model to predict prolonged opioid use following lumbar discectomy based on data from 1316 patients. This model achieved acceptable accuracy with an area under the curve (AUC) of 0.76. Similarly, Wang et al. [40] used a DL MRI model combined with clinical features to predict one-year outcomes after tubular microdiscectomy. This study involved 548 patients and incorporated sagittal and axial T2 sequences alongside preoperative clinical characteristics. One of the tested DL models yielded optimal results, with an AUC of 0.86 for internal validation and 0.83 for external validation.

2.4. AI for Lumbar Fusion Surgery

Lumbar spinal fusion has become one of the most commonly performed procedures in modern spine surgery. However, the rapid increase in procedure volume has been accompanied by a growing burden of perioperative complications and hospital readmissions, particularly among aging populations with complex comorbidities [41,42,43]. As healthcare systems shift toward value-based care, stratifying risk and personalizing perioperative management have become essential. Artificial intelligence (AI), particularly machine learning (ML) techniques, is increasingly recognized as a promising tool for addressing these challenges. Traditional statistical methods, such as logistic regression (LR), have played a crucial role in identifying key perioperative risk factors for lumbar fusion [44,45,46,47,48,49]. However, LR’s assumptions of linearity and limited ability to handle complex interactions have driven the adoption of more sophisticated ML models [50,51]. ML offers advantages in processing large datasets and detecting non-linear relationships. Several studies have demonstrated that ML models frequently outperform LR models in predictive accuracy for surgical outcomes [45,46,47,48].

2.4.1. Outcome Prediction

ML approaches have been explored across a wide spectrum of spinal pathologies [44,49,52,53,54]. Although several studies have shown promise, their scope, design, and methodology vary considerably. Berjano et al. [55] developed an ML model to predict favorable outcomes after lumbar arthrodesis. The study included 1243 patients and considered demographic, clinical, and surgical features. A good outcome at six-month follow-up was defined as an improvement of more than 12.7 points on the Oswestry Disability Index (ODI) [56]. The model demonstrated promising results, with a sensitivity of 74.3%, specificity of 79.4%, and accuracy of 75.8%. However, the algorithm lacked external validation, and the surgical approach was not specified.

Schönnagel et al. [57] employed XGBoost, a type of ML algorithm, to predict persistent lower back pain after lumbar fusion in patients with degenerative spondylolisthesis, achieving an AUC of 0.81. Nevertheless, the single-center design and small sample size (135 patients) raised concerns about overfitting and limited generalizability [58,59]. In contrast, Fatima et al. [60] applied ML to a larger dataset comprising over 80,000 patients with lumbar degenerative spondylolisthesis to predict 30-day adverse events. Their LR-based model achieved an AUC of 0.70 and outperformed traditional frailty indices [61]. They also developed a web-based calculator to facilitate clinical translation.

2.4.2. Complication Prediction

Several recent studies have sought to leverage machine learning’s capabilities to predict complications, readmissions, discharge destinations, and persistent pain in patients undergoing lumbar fusion. Dong et al. [62] utilized a support vector machine to predict poor outcomes following lumbar interbody fusion, demonstrating how ML can incorporate imaging-derived variables. Their results highlighted the predictive value of parameters, such as disc height and spinal alignment, which closely correlate with clinical outcomes [63,64,65,66,67,68,69].

Bui et al. [70] developed an ML pipeline to predict interbody cage height and the degree of pelvic mismatch after L4–L5 transforaminal lumbar interbody fusion (TLIF) surgery using preoperative X-ray images. The automated pipeline consisted of two stages: first, a DL model extracted essential features from the X-ray images; second, five ML algorithms were trained to identify optimal models for predicting interbody cage height and postoperative pelvic mismatch. Although the accuracy of the tested models was moderate, this study represents an important initial step toward developing tools to predict changes in sagittal balance following interbody fusion surgery.

Shah et al. [44] developed ML models using a national discharge database of nearly 39,000 patients to predict major complications and readmission risk. Their models outperformed LR and identified novel risk factors. These insights highlight modifiable factors, such as diabetes and cardiovascular disease, which may benefit from optimization prior to surgery, thereby helping clinicians provide more effective patient counseling. Janssen et al. [71] incorporated preoperative physical tests into their ML framework and identified aerobic capacity, muscle endurance, and flexibility as strong predictors of recovery. Their findings suggest that functional metrics—often overlooked in large datasets—hold significant clinical value. This perspective aligns with the existing literature emphasizing the importance of prehabilitation and physical conditioning in improving surgical outcomes [72,73,74,75,76].

Fusion surgery can be challenging, with prolonged operative time representing a significant risk factor for complications, such as infections [77]. Li and colleagues [78] recently developed a machine learning model to predict extended operation times in patients undergoing posterior lumbar interbody fusion. Their study included 3233 patients from 22 Chinese institutions. The model incorporated demographic variables, perioperative details, and laboratory results, achieving good predictive performance with an AUC of 0.82 in identifying patients at high risk of prolonged surgery (>240 min).

Xiong et al. [79] developed a model to predict postoperative infection by evaluating 584 patients who underwent fusion surgery. Their model achieved an AUC of 0.87 by considering preoperative variables, such as albumin level, diabetes status, intraoperative dural tears, and history of rheumatic disease.

AI has also been applied to predict cage subsidence [80]. Zou et al. [81] proposed a predictive model based on data from 305 patients across three centers undergoing lumbar fusion surgery. By integrating radiological features from CT and MR scans with clinical variables—including sex, age, number of surgical segments fused, body mass index (BMI), and presence of osteoporosis—the model achieved high accuracy, with AUC values up to 0.94. This model holds promise for improving clinical decision-making and reducing the need for revision surgeries.

The application of deep learning (DL) in lumbar fusion surgery presents both opportunities and challenges. Kuris et al. [82] applied an artificial neural network (ANN) to predict readmissions after various types of lumbar fusion in over 63,000 patients. Although the model achieved high classification accuracy (~94%), its AUC values were modest (0.64–0.65), indicating limited discriminative ability and underscoring the importance of selecting appropriate performance metrics [83,84]. Similarly, Hopkins et al. [49] demonstrated that DL neural networks can outperform traditional models in predicting 30-day readmission following lumbar fusion. However, their study revealed that strong predictions often depend on intra- and postoperative variables, limiting the model’s utility for early risk stratification. This work highlights both the potential and constraints of deep neural networks (DNNs) in informing hospital policy and emphasizes the need for AI systems that integrate data across the entire perioperative timeline [72,75,85]. Kim et al. [86] showed that ANN models outperformed LR in predicting complications—including mortality and wound issues—using data from over 22,000 posterior lumbar fusion cases. These findings align with those of Dreiseitl and Ohno-Machado [87], reinforcing the expanding role of AI in clinical prognostication and its potential for implementation in practice.

To address the challenge of AI model interpretability, some researchers have adopted explainable AI techniques. Guo et al. [88] developed a model to predict cerebrospinal fluid leakage in 3505 lumbar fusion cases, achieving an AUC of 0.87. This model not only demonstrated high accuracy but also highlighted clinically relevant imaging and clinical variables—such as ligamentum flavum thickness and facet joint degeneration—that are both measurable and potentially modifiable.

2.4.3. Cost Prediction

Other researchers have focused on discharge planning and resource allocation. Jain et al. [89] applied various ML models to predict discharge to a facility, 90-day readmission, and medical complications in nearly 38,000 patients undergoing long-segment lumbar fusion [90,91]. Similarly, Cabrera et al. [92] stratified ML models by age group to predict non-home discharge across 39,254 patients, enabling more precise perioperative planning compared to static models. These studies suggest that future AI applications must balance scalability with the inclusion of richer, patient-centered features to support individualized care planning [93,94,95].

Several investigations have also addressed cost and policy implications. Karnuta et al. [96] developed a classifier to predict inpatient cost, length of stay, and discharge disposition in over 38,000 non-scoliosis lumbar fusion patients. Achieving AUC values exceeding 0.88, the model facilitated the development of patient-specific payment models. This represents a critical step toward more equitable reimbursement structures that account for patient complexity—an increasingly important consideration within bundled-payment environments [97,98,99].

2.5. Limitations

The primary limitation of this review is its narrative nature in describing the AI landscape within common degenerative lumbar spine conditions. The study search did not follow a predefined protocol, and the data were neither systematically abstracted nor quantitatively analyzed. Instead, the findings were reported descriptively by the authors. Nonetheless, this review aims to capture and illustrate the current potential of AI and its deep learning models as they relate to the most prevalent degenerative lumbar spine pathologies.

3. Conclusions

The application of artificial intelligence (AI) and machine learning (ML) in the treatment of spinal pathologies and surgical procedures is advancing rapidly. Numerous models show strong potential to enhance diagnostic accuracy, optimize surgical planning, and predict adverse events, readmissions, recovery times, and associated costs. The most successful models are typically those trained on large, well-structured datasets and enriched with clinically relevant modifiable variables. Despite these promising results, the majority of current studies rely on retrospective data, often derived from administrative records that lack critical intraoperative variables, postoperative functional assessments, and detailed imaging data. Moreover, model performance varies considerably depending on cohort size, event rates, feature engineering, and validation methodologies. Few studies report external validation, raising concerns that many high-performing models may be overfitted to specific datasets or institutions.

To realize the full potential of AI in lumbar degenerative pathology management, models must be interpretable, transparent, and seamlessly integrated into electronic health records to facilitate tailored healthcare delivery. Additionally, ethical considerations—including data privacy, biases within training datasets, and risks of overreliance on automation—require careful attention. Future research is essential to ensure that AI becomes a safe, effective, and trustworthy tool in the management of lumbar degenerative pathology.

Author Contributions

Conception and design, A.T. and N.B.; administrative support, E.G.; provision of study materials or patients, A.T., N.B., S.R. and A.V.; collection and assembly of data, A.T., N.B., S.R. and A.V.; data analysis and interpretation, E.G.; manuscript writing, A.T., N.B., S.R., A.V. and E.G.; critical revision of the article, E.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research did not receive any specific grants from funding agencies in the public, commercial, or not-for-profit sectors.

Institutional Review Board Statement

The study did not require an ethics committee’s approval.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors have nothing to disclose.

References

Ravindra, V.M.; Senglaub, S.S.; Rattani, A.; Dewan, M.C.; Härtl, R.; Bisson, E.; Park, K.B.; Shrime, M.G. Degenerative Lumbar Spine Disease: Estimating Global Incidence and Worldwide Volume. Glob. Spine J. 2018, 8, 784–794. [Google Scholar] [CrossRef]
Yasaka, K.; Akai, H.; Kunimatsu, A.; Kiryu, S.; Abe, O. Deep learning with convolutional neural network in radiology. Jpn. J. Radiol. 2018, 36, 257–272. [Google Scholar] [CrossRef]
Helm, J.M.; Swiergosz, A.M.; Haeberle, H.S.; Karnuta, J.M.; Schaffer, J.L.; Krebs, V.E.; Spitzer, A.I.; Ramkumar, P.N. Machine Learning and Artificial Intelligence: Definitions, Applications, and Future Directions. Curr. Rev. Musculoskelet. Med. 2020, 13, 69–76. [Google Scholar] [CrossRef]
Webb, C.W.; Aguirre, K.; Seidenberg, P.H. Lumbar Spinal Stenosis: Diagnosis and Management. Am. Fam. Physician 2024, 109, 350–359. [Google Scholar]
Malfair, D.; Beall, D.P. Imaging the degenerative diseases of the lumbar spine. Magn. Reson. Imaging Clin. N. Am. 2007, 15, 221–238, vi. [Google Scholar] [CrossRef]
Steurer, J.; Roner, S.; Gnannt, R.; Hodler, J. LumbSten Research Collaboration. Quantitative radiologic criteria for the diagnosis of lumbar spinal stenosis: A systematic literature review. BMC Musculoskelet. Disord. 2011, 12, 175. [Google Scholar] [CrossRef]
Jamaludin, A.; Kadir, T.; Zisserman, A. SpineNet: Automated classification and evidence visualization in spinal MRIs. Med. Image Anal. 2017, 41, 63–73. [Google Scholar] [CrossRef]
Lu, J.; Pedemonte, S.; Bizzo, B.; Doyle, S.; Andriole, K.P.; Michalski, M.H.; Gonzalez, R.G.; Pomerantz, S.R. Deep Spine: Automated Lumbar Vertebral Segmentation, Disc-Level Designation, and Spinal Stenosis Grading using Deep Learning. In Proceedings of the 3rd Machine Learning for Healthcare Conference, Palo Alto, CA, USA, 17–18 August 2018; PMLR: Cambridge, MA, USA, 2018; Volume 85, pp. 403–419. [Google Scholar]
Hallinan, J.T.P.D.; Zhu, L.; Yang, K.; Makmur, A.; Algazwi, D.A.R.; Thian, Y.L.; Lau, S.; Choo, Y.S.; Eide, S.E.; Yap, Q.V.; et al. Deep Learning Model for Automated Detection and Classification of Central Canal, Lateral Recess, and Neural Foraminal Stenosis at Lumbar Spine MRI. Radiology 2021, 300, 130–138. [Google Scholar] [CrossRef] [PubMed]
Bharadwaj, U.U.; Christine, M.; Li, S.; Chou, D.; Pedoia, V.; Link, T.M.; Chin, C.T.; Majumdar, S. Deep learning for automated, interpretable classification of lumbar spinal stenosis and facet arthropathy from axial MRI. Eur. Radiol. 2023, 33, 3435–3443. [Google Scholar] [CrossRef] [PubMed]
van der Graaf, J.W.; Brundel, L.; van Hooff, M.L.; de Kleuver, M.; Lessmann, N.; Maresch, B.J.; Vestering, M.M.; Spermon, J.; van Ginneken, B.; Rutten, M.J.C.M. AI-based lumbar central canal stenosis classification on sagittal MR images is comparable to experienced radiologists using axial images. Eur. Radiol. 2025, 35, 2298–2306. [Google Scholar] [CrossRef]
Hızal, M.; Özdemir, F.; Kalaycıoğlu, O.; Işık, C. Cerebrospinal fluid signal loss sign: Assessment of a new radiological sign in lumbar spinal stenosis. Eur. Spine J. 2021, 30, 3297–3306. [Google Scholar] [CrossRef]
Lee, G.Y.; Lee, J.W.; Choi, H.S.; Oh, K.-J.; Kang, H.S. A new grading system of lumbar central canal stenosis on MRI: An easy and reliable method. Skelet. Radiol. 2011, 40, 1033–1039. [Google Scholar] [CrossRef]
Eun, S.S.; Lee, H.-Y.; Lee, S.-H.; Kim, K.H.; Liu, W.C. MRI versus CT for the diagnosis of lumbar spinal stenosis. J. Neuroradiol. 2012, 39, 104–109. [Google Scholar] [CrossRef] [PubMed]
Miyo, R.; Yasaka, K.; Hamada, A.; Sakamoto, N.; Hosoi, R.; Mizuki, M.; Abe, O. Deep-learning reconstruction for the evaluation of lumbar spinal stenosis in computed tomography. Medicine 2023, 102, e33910. [Google Scholar] [CrossRef] [PubMed]
Kim, T.; Kim, Y.G.; Park, S.; Lee, J.K.; Lee, C.H.; Hyun, S.J.; Kim, C.H.; Kim, K.J.; Chung, C.K. Diagnostic triage in patients with central lumbar spinal stenosis using a deep learning system of radiographs. J. Neurosurg. Spine 2022, 37, 104–111. [Google Scholar] [CrossRef]
Andrasinova, T.; Adamova, B.; Buskova, J.; Kerkovsky, M.; Jarkovsky, J.; Bednarik, J. Is there a Correlation Between Degree of Radiologic Lumbar Spinal Stenosis and its Clinical Manifestation? Clin. Spine Surg. 2018, 31, E403–E408. [Google Scholar] [CrossRef]
Tominaga, R.; Kurita, N.; Sekiguchi, M.; Yonemoto, K.; Kakuma, T.; Konno, S.-I. Diagnostic accuracy of the lumbar spinal stenosis-diagnosis support tool and the lumbar spinal stenosis-self-administered, self-reported history questionnaire. PLoS ONE 2022, 17, e0267892. [Google Scholar] [CrossRef]
Abel, F.; Garcia, E.; Andreeva, V.; Nikolaev, N.S.; Kolisnyk, S.; Sarbaev, R.; Novikov, I.; Kozinchenko, E.; Kim, J.; Rusakov, A.; et al. An Artificial Intelligence-Based Support Tool for Lumbar Spinal Stenosis Diagnosis from Self-Reported History Questionnaire. World Neurosurg. 2024, 181, e953–e962. [Google Scholar] [CrossRef]
Siccoli, A.; de Wispelaere, M.P.; Schröder, M.L.; Staartjes, V.E. Machine learning-based preoperative predictive analytics for lumbar spinal stenosis. Neurosurg. Focus 2019, 46, E5. [Google Scholar] [CrossRef]
Wilson, B.; Gaonkar, B.; Yoo, B.; Salehi, B.; Attiah, M.; Villaroman, D.; Ahn, C.; Edwards, M.; Laiwalla, A.; Ratnaparkhi, A.; et al. Predicting Spinal Surgery Candidacy from Imaging Data Using Machine Learning. Neurosurgery 2021, 89, 116–121. [Google Scholar] [CrossRef]
Boden, S.D.; O Davis, D.; Dina, T.S.; Patronas, N.J.; Wiesel, S.W. Abnormal magnetic-resonance scans of the lumbar spine in asymptomatic subjects. A prospective investigation. J. Bone Jt. Surg. Am. 1990, 72, 403–408. [Google Scholar] [CrossRef]
Mourad, R.; Kolisnyk, S.; Baiun, Y.; Falk, A.; Yuriy, T.; Valerii, F.; Kopeev, A.; Suldina, O.; Pospelov, A.; Kim, J.; et al. Performance of hybrid artificial intelligence in determining candidacy for lumbar stenosis surgery. Eur. Spine J. 2022, 31, 2149–2155. [Google Scholar] [CrossRef] [PubMed]
De Barros, A.; Abel, F.; Kolisnyk, S.; Geraci, G.C.; Hill, F.; Engrav, M.; Samavedi, S.; Suldina, O.; Kim, J.; Rusakov, A.; et al. Determining Prior Authorization Approval for Lumbar Stenosis Surgery with Machine Learning. Glob. Spine J. 2024, 14, 1753–1759. [Google Scholar] [CrossRef] [PubMed]
Hagan, M.J.; Sastry, R.A.; Feler, J.; Abdulrazeq, H.; Sullivan, P.Z.; Abinader, J.F.; Camara, J.Q.; Niu, T.; Fridley, J.S.; Oyelese, A.A.; et al. Neighborhood-level socioeconomic status, extended length of stay, and discharge disposition following elective lumbar spine surgery. N. Am. Spine Soc. J. 2022, 12, 100187. [Google Scholar] [CrossRef]
Ogink, P.T.; Karhade, A.V.; Thio, Q.C.B.S.; Gormley, W.B.; Oner, F.C.; Verlaan, J.J.; Schwab, J.H. Predicting discharge placement after elective surgery for lumbar spinal stenosis using machine learning methods. Eur. Spine J. 2019, 28, 1433–1440. [Google Scholar] [CrossRef]
Saravi, B.; Zink, A.; Ülkümen, S.; Couillard-Despres, S.; Hassel, F.; Lang, G. Performance of Artificial Intelligence-Based Algorithms to Predict Prolonged Length of Stay after Lumbar Decompression Surgery. J. Clin. Med. 2022, 11, 4050. [Google Scholar] [CrossRef]
Wan, L.; Su, X.; Xiong, Z.; Cui, Z.; Tang, G.; Zhang, H.; Zhang, L. Development and application of AI assisted automatic reconstruction of axial lumbar disc CT images and diagnosis of lumbar disc herniation. Eur. J. Radiol. 2025, 185, 112003. [Google Scholar] [CrossRef]
Liawrungrueang, W.; Park, J.B.; Cholamjiak, W.; Sarasombath, P.; Riew, K.D. Artificial Intelligence-Assisted MRI Diagnosis in Lumbar Degenerative Disc Disease: A Systematic Review. Glob. Spine J. 2024, 15, 1405–1418. [Google Scholar] [CrossRef]
He, Y.; He, Z.; Qiu, Y.; Liu, Z.; Huang, A.; Chen, C.; Bian, J. Deep Learning for Lumbar Disc Herniation Diagnosis and Treatment Decision-Making Using Magnetic Resonance Imagings: A Retrospective Study. World Neurosurg. 2025, 195, 123728. [Google Scholar] [CrossRef]
Duan, X.; Xiong, H.; Liu, R.; Duan, X.; Yu, H. Enhanced deep leaning model for detection and grading of lumbar disc herniation from MRI. Med. Biol. Eng. Comput. 2024, 62, 3709–3719. [Google Scholar] [CrossRef]
Fan, X.; Qiao, X.; Wang, Z.; Jiang, L.; Liu, Y.; Sun, Q.; Bhardwaj, A. Artificial Intelligence-Based CT Imaging on Diagnosis of Patients with Lumbar Disc Herniation by Scalpel Treatment. Comput. Intell. Neurosci. 2022, 2022, 1–8. [Google Scholar] [CrossRef]
Xu, Y.; Zheng, S.; Tian, Q.; Kou, Z.; Li, W.; Xie, X.; Wu, X. Deep Learning Model for Grading and Localization of Lumbar Disc Herniation on Magnetic Resonance Imaging. J. Magn. Reson. Imaging 2024, 61, 364–375. [Google Scholar] [CrossRef] [PubMed]
Staartjes, V.E.; Quddusi, A.; Klukowska, A.M.; Schröder, M.L. Initial classification of low back and leg pain based on objective functional testing: A pilot study of machine learning applied to diagnostics. Eur. Spine J. 2020, 29, 1702–1708. [Google Scholar] [CrossRef] [PubMed]
Zhu, Z.; Liu, E.; Su, Z.; Chen, W.; Liu, Z.; Chen, T.; Lu, H.; Zhou, J.; Li, Q.; Pang, S. Three-Dimensional Lumbosacral Reconstruction by An Artificial Intelligence-Based Automated MR Image Segmentation for Selecting the Approach of Percutaneous Endoscopic Lumbar Discectomy. Pain Physician J. 2024, 27, E245–E254. [Google Scholar]
Tezuka, F.; Sakai, T.; Abe, M.; Yamashita, K.; Takata, Y.; Higashino, K.; Chikawa, T.; Nagamachi, A.; Sairyo, K. Anatomical considerations of the iliac crest on percutaneous endoscopic discectomy using a transforaminal approach. Spine J. 2017, 17, 1875–1880. [Google Scholar] [CrossRef]
Yamada, K.; Nagahama, K.; Abe, Y.; Hyugaji, Y.; Ukeba, D.; Endo, T.; Ohnishi, T.; Ura, K.; Sudo, H.; Iwasaki, N.; et al. Evaluation of Surgical Indications for Full Endoscopic Discectomy at Lumbosacral Disc Levels Using Three-Dimensional Magnetic Resonance/Computed Tomography Fusion Images Created with Artificial Intelligence. Medicina 2023, 59, 860. [Google Scholar] [CrossRef]
Cui, P.; Shu, T.; Lei, J.; Chen, W. Nerve recognition in percutaneous transforaminal endoscopic discectomy using convolutional neural network. Med. Phys. 2021, 48, 2279–2288. [Google Scholar] [CrossRef]
Yen, H.-K.; Ogink, P.T.; Huang, C.-C.; Groot, O.Q.; Su, C.-C.; Chen, S.-F.; Chen, C.-W.; Karhade, A.V.; Peng, K.-P.; Lin, W.-H.; et al. A machine learning algorithm for predicting prolonged postoperative opioid prescription after lumbar disc herniation surgery. An external validation study using 1316 patients from a Taiwanese cohort. Spine J. 2022, 22, 1119–1130. [Google Scholar] [CrossRef]
Wang, K.; Lin, F.; Liao, Z.; Wang, Y.; Zhang, T.; Wang, R. Development of a Dual-Plane MRI-Based Deep Learning Model to Assess the 1-Year Postoperative Outcomes in Lumbar Disc Herniation After Tubular Microdiscectomy. J. Magn. Reson. Imaging 2025, 61, 2294–2307. [Google Scholar] [CrossRef]
Fingar, K.R.; Stocks, C.; Weiss, A.J.; Steiner, C.A. Most frequent operating room procedures performeed in U.S. hospitals, 2003–2012. In Healthcare Cost and Utilization Project (HCUP) Statistical Briefs; Agency for Healthcare Research and Quality (US): Rockville, MD, USA, 2014. [Google Scholar]
Goz, V.; Weinreb, J.H.; McCarthy, I.; Schwab, F.; Lafage, V.; Errico, T.J. Perioperative complications and mortality after spinal fusions: Analysis of trends and risk factors. Spine 2013, 38, 1970–1976. [Google Scholar] [CrossRef]
Goyal, A.; Ngufor, C.; Kerezoudis, P.; McCutcheon, B.; Storlie, C.; Mohamad, B. Can machine learning algorithms accurately predict discharge to nonhome facility and early unplanned readmissions following spinal fusion? Analysis of a national surgical registry. J. Neurosurg. Spine 2019, 31, 568–578. [Google Scholar] [CrossRef]
Shah, A.A.; Devana, S.K.; Lee, C.; Bugarin, A.; Lord, E.L.; Shamie, A.N.; Park, D.Y.; van der Schaar, M.; SooHoo, N.F. Prediction of major complications and readmission after lumbar spinal fusion: A machine learning-driven approach. World Neurosurg. 2021, 152, e227–e234. [Google Scholar] [CrossRef] [PubMed]
Cabitza, F.; Locoro, A.; Banfi, G. Machine learning in orthopedics: A literature review. Front. Bioeng. Biotechnol. 2018, 6, 75. [Google Scholar] [CrossRef] [PubMed]
Esteva, A.; Kuprel, B.; Novoa, R.A.; Ko, J.; Swetter, S.M.; Blau, H.M.; Thrun, S. Dermatologist-level classification of skin cancer with deep neural networks. Nature 2017, 542, 686. [Google Scholar] [CrossRef] [PubMed]
Gulshan, V.; Peng, L.; Coram, M.; Stumpe, M.C.; Wu, D.; Narayanaswamy, A.; Venugopalan, S.; Widner, K.; Madams, T.; Cuadros, J.; et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. J. Am. Med. Assoc. 2016, 316, 2402–2410. [Google Scholar] [CrossRef]
Menden, M.P.; Iorio, F.; Garnett, M.; McDermott, U.; Benes, C.H.; Ballester, P.J.; Saez-Rodriguez, J.; Raghava, G.P.S. Machine learning prediction of cancer cell sensitivity to drugs based on genomic and chemical properties. PLoS ONE 2013, 8, e61318. [Google Scholar] [CrossRef]
Hopkins, B.S.; Yamaguchi, J.T.; Garcia, R.; Kesavabhotla, K.; Weiss, H.; Hsu, W.K.; Smith, Z.A.; Dahdaleh, N.S. Using machine learning to predict 30-day readmissions after posterior lumbar fusion: An NSQIP study involving 23,264 patients. J. Neurosurg. Spine 2020, 32, 399–406. [Google Scholar] [CrossRef]
Chen, J.H.; Asch, S.M. Machine learning and predition in medicine—Beyond the peak of inflated expectations. N. Engl. J. Med. 2017, 376, 2507–2509. [Google Scholar] [CrossRef]
Hashimoto, D.A.; Rosman, G.; Rus, D.; Meireles, O.R.M. Artifical intelligence in surgery: Promises and perils. Ann. Surg. 2018, 268, 70–76. [Google Scholar] [CrossRef]
Huguet, A.; Hayden, J.A.; Stinson, J.; McGrath, P.J.; Chambers, C.T.; Tougas, M.E.; Wozney, L. Judging the quality of evidence in reviews of prognostic factor research: Adapting the GRADE framework. Syst. Rev. 2013, 2, 71. [Google Scholar] [CrossRef]
Bredow, J.; Meyer, C.; Oikonomidis, S.; Kernich, C.; Kernich, N.; Hofstetter, C.P.; Heck, V.J.; Eysel, P.; Prasse, T. Long-term radiological and clinical outcome after lumbar spinal fusion surgery in patients with degenerative spondylolisthesis: A prospective 6-year follow-up study. Orthop. Surg. 2022, 14, 1607–1614. [Google Scholar] [CrossRef]
Inose, H.; Kato, T.; Onuma, H.; Morishita, S.; Matsukura, Y.; Yuasa, M.; Hirai, T.; Yoshii, T.; Okawa, A. Predictive factors affecting surgical outcomes in patients with degenerative lumbar spondylolisthesis. Spine 2021, 46, 610–616. [Google Scholar] [CrossRef] [PubMed]
Berjano, P.; Langella, F.; Ventriglia, L.; Compagnone, D.; Barletta, P.; Huber, D.; Mangili, F.; Licandro, G.; Galbusera, F.; Cina, A.; et al. The Influence of Baseline Clinical Status and Surgical Strategy on Early Good to Excellent Result in Spinal Lumbar Arthrodesis: A Machine Learning Approach. J. Pers. Med. 2021, 11, 1377. [Google Scholar] [CrossRef] [PubMed]
Monticone, M.; Baiardi, P.; Vanti, C.; Ferrari, S.; Pillastrini, P.; Mugnai, R.; Foti, C. Responsiveness of the Oswestry Disability Index and the Roland Morris Disability Questionnaire in Italian subjects with sub-acute and chronic low back pain. Eur. Spine J. 2011, 21, 122–129. [Google Scholar] [CrossRef] [PubMed]
Schönnagel, L.; Caffard, T.; Vu-Han, T.-L.; Zhu, J.; Nathoo, I.; Finos, K.; Camino-Willhuber, G.; Tani, S.; Guven, A.E.; Haffer, H.; et al. Predicting postoperative outcomes in lumbar spinal fusion: Development of a machine learning model. Spine J. 2024, 24, 239–249. [Google Scholar] [CrossRef]
Hajian-Tilaki, K. Receiver operating characteristic (ROC) curve analysis for medical diagnostic test evaluation. Casp. J. Intern. Med. 2013, 4, 627–635. [Google Scholar]
Cook, J.A.; Ranstam, J. Overfitting. Br. J. Surg. 2016, 103, 1814. [Google Scholar] [CrossRef]
Fatima, N.; Zheng, H.; Massaad, E.; Hadzipasic, M.; Shankar, G.M.; Shin, J.H. Development and Validation of Machine Learning Algorithms for Predicting Adverse Events After Surgery for Lumbar Degenerative Spondylolisthesis. World Neurosurg. 2020, 140, 627–641. [Google Scholar] [CrossRef]
Leven, D.M.; Lee, N.J.; Kothari, P.; Steinberger, J.; Guzman, J.; Skovrlj, B.; Shin, J.I.; Caridi, J.M.; Cho, S.K. Frailty index is a significant predictor of complications and mortality after surgery for adult spinal deformity. Spine 2016, 41, E1394–E1401. [Google Scholar] [CrossRef]
Dong, S.; Zhu, Y.; Yang, H.; Tang, N.; Huang, G.; Li, J.; Tian, K. Evaluation of the Predictors for Unfavorable Clinical Outcomes of Degenerative Lumbar Spondylolisthesis After Lumbar Interbody Fusion Using Machine Learning. Front. Public Health 2022, 10, 835938. [Google Scholar] [CrossRef]
Ghogawala, Z.; Dziura, J.; Butler, W.E.; Dai, F.; Terrin, N.; Magge, S.N.; Coumans, J.-V.C.; Harrington, J.F.; Amin-Hanjani, S.; Schwartz, J.S.; et al. Laminectomy plus fusion versus laminectomy alone for lumbar spondylolisthesis. N. Engl. J. Med. 2016, 374, 1424–1434. [Google Scholar] [CrossRef]
Chan, A.K.; Bisson, E.F.; Bydon, M.; Glassman, S.D.; Foley, K.T.; Potts, E.A.; Shaffrey, C.I.; Shaffrey, M.E.; Coric, D.; Knightly, J.J.; et al. Laminectomy alone versus fusion for grade 1 lumbar spondylolisthesis in 426 patients from the prospective quality outcomes database. J. Neurosurg. Spine 2018, 30, 234–241. [Google Scholar] [CrossRef]
Sato, S.; Yagi, M.; Machida, M.; Yasuda, A.; Konomi, T.; Miyake, A.; Fujiyoshi, K.; Kaneko, S.; Takemitsu, M.; Yato, Y.; et al. Reoperation rate and risk factors of elective spinal surgery for degenerative spondylolisthesis: Minimum 5-year follow-up. Spine J. 2015, 15, 1536–1544. [Google Scholar] [CrossRef] [PubMed]
Yen, C.-P.; Beckman, J.M.; Vivas, A.C.; Bach, K.; Uribe, J.S. Effects of intradiscal vacuum phenomenon on surgical outcome of lateral interbody fusion for degenerative lumbar disease. Eur. Spine J. 2017, 26, 419–425. [Google Scholar] [CrossRef] [PubMed]
Deyo, R.A.; Martin, B.I.; Kreuter, W.; Jarvik, J.G.; Angier, H.; Mirza, S.K. Revision surgery following operations for lumbar stenosis. J. Bone Jt. Surg. Am. 2011, 93, 1979–1986. [Google Scholar] [CrossRef] [PubMed]
Watkins, R.G., 4th; Hanna, R.; Chang, D.; Watkins, R.G., 3rd. Sagittal alignment after lumbar interbody fusion: Comparing anterior, lateral, and transforaminal approaches. J. Spinal Disord. Tech. 2014, 27, 253–256. [Google Scholar] [CrossRef]
Yamasaki, K.; Hoshino, M.; Omori, K.; Igarashi, H.; Nemoto, Y.; Tsuruta, T.; Matsumoto, K.; Iriuchishima, T.; Ajiro, Y.; Matsuzaki, H. Risk Factors of Adjacent Segment Disease After Transforaminal Inter-Body Fusion for Degenerative Lumbar Disease. Spine 2017, 42, E86–E92. [Google Scholar] [CrossRef]
Bui, A.T.; Le, H.; Hoang, T.T.; Trinh, G.M.; Shao, H.-C.; Tsai, P.-I.; Chen, K.-J.; Hsieh, K.L.-C.; Huang, E.-W.; Hsu, C.-C.; et al. Development of End-to-End Artificial Intelligence Models for Surgical Planning in Transforaminal Lumbar Interbody Fusion. Bioengineering 2024, 11, 164. [Google Scholar] [CrossRef]
Janssen, E.R.; Osong, B.; van Soest, J.; Dekker, A.; van Meeteren, N.L.; Willems, P.C.; Punt, I.M. Exploring Associations of Preoperative Physical Performance with Postoperative Outcomes After Lumbar Spinal Fusion: A Machine Learning Approach. Arch. Phys. Med. Rehabil. 2021, 102, 1324–1330.e3. [Google Scholar] [CrossRef]
McGirt, M.J.; Sivaganesan, A.; Asher, A.L.; Devin, C.J. Prediction model for outcome after low-back surgery: Individualized likelihood of complication, hospital readmission, return to work, and 12-month improvement in functional disability. Neurosurg. Focus 2015, 39, E13. [Google Scholar] [CrossRef]
Janssen, E.R.C.; Punt, I.M.; van Kuijk, S.M.J.; Hoebink, E.A.; van Meeteren, N.L.U.; Willems, P.C. Development and validation of a prediction tool for pain reduction in adult patients undergoing elective lumbar spinal fusion: A multicentre cohort study. Eur. Spine J. 2020, 29, 1909–1916. [Google Scholar] [CrossRef]
Verbunt, J.; A Seelen, H.; Vlaeyen, J.W.; van de Heijden, G.J.; Heuts, P.H.; Pons, K.; Knottnerus, J.A. Disuse and deconditioning in chronic low back pain: Concepts and hypotheses on contributing mechanisms. Eur. J. Pain 2003, 7, 9–21. [Google Scholar] [CrossRef]
Ali, A.M.; Gibbons, C.E.R. Predictors of 30-day hospital readmission after hip fracture: A systematic review. Injury 2017, 48, 243–252. [Google Scholar] [CrossRef]
Wijeysundera, D.N.; Pearse, R.M.; Shulman, M.A.; Abbott, T.E.F.; Torres, E.; Ambosta, A.; Croal, B.L.; Granton, J.T.; Thorpe, K.E.; Grocott, M.P.W.; et al. Assessment of functional capacity before major non-cardiac surgery: An international prospective cohort study. Lancet 2018, 391, 2631–2640. [Google Scholar] [CrossRef]
Cheng, H.; Clymer, J.W.; Chen, B.P.-H.; Sadeghirad, B.; Ferko, N.C.; Cameron, C.G.; Hinoul, P. Prolonged operative duration is associated with complications: A systematic review and meta-analysis. J. Surg. Res. 2018, 229, 134–144. [Google Scholar] [CrossRef]
Li, R.; Wang, L.; Wang, X.; Grzegorzek, M.; Chen, A.-T.; Quan, X.; Hu, Z.; Liu, X.; Zhang, Y.; Xiang, T.; et al. Development of machine learning model for predicting prolonged operation time in lumbar stenosis undergoing posterior lumbar interbody fusion: A multicenter study. Spine J. 2025, 25, 460–473. [Google Scholar] [CrossRef]
Xiong, C.; Zhao, R.; Xu, J.; Liang, H.; Zhang, C.; Zhao, Z.; Huang, T.; Luo, X.; Chen, H. Construct and Validate a Predictive Model for Surgical Site Infection after Posterior Lumbar Interbody Fusion Based on Machine Learning Algorithm. Comput. Math. Methods Med. 2022, 2022, 2697841. [Google Scholar] [CrossRef] [PubMed]
Rao, P.J.; Phan, K.; Giang, G.; Maharaj, M.M.; Phan, S.; Mobbs, R.J. Subsidence following anterior lumbar interbody fusion (Alif): A prospective study. J. Spine Surg. 2017, 3, 168–175. [Google Scholar] [CrossRef] [PubMed]
Zou, C.; Chen, R.; Wang, B.; Fei, Q.; Song, H.; Zang, L. Development of a deep learning radiomics model combining lumbar CT, multi-sequence MRI, and clinical data to predict high-risk cage subsidence after lumbar fusion: A retrospective multicenter study. Biomed. Eng. Online 2025, 24, 27. [Google Scholar] [CrossRef] [PubMed]
Kuris, E.O.; Veeramani, A.; McDonald, C.L.; DiSilvestro, K.J.; Zhang, A.S.; Cohen, E.M.; Daniels, A.H. Predicting Readmission After Anterior, Posterior, and Posterior Interbody Lumbar Spinal Fusion: A Neural Network Machine Learning Approach. World Neurosurg. 2021, 151, e19–e27. [Google Scholar] [CrossRef]
Pugely, A.J.; Martin, C.T.; Gao, Y.; Mendoza-Lattes, S. Causes and risk factors for 30-day unplanned readmissions after lumbar spine surgery. Spine 2014, 39, 761–768. [Google Scholar] [CrossRef] [PubMed]
Puvanesarajah, V.; Nourbakhsh, A.; Hassanzadeh, H.; Shimer, A.L.; Shen, F.H.; Singla, A. Readmission rates, reasons, and risk factors in elderly patients treated with lumbar fusion for degenerative pathology. Spine 2016, 41, 1933–1938. [Google Scholar] [CrossRef]
Bernatz, J.T.; Anderson, P.A. Thirty-day readmission rates in spine surgery: Meta-analysis. Neurosurg. Focus 2015, 39, E7. [Google Scholar] [CrossRef] [PubMed]
Kim, B.D.; Smith, T.R.; Lim, S.; Cybulski, G.R.; Kim, J.Y. Predictors of unplanned readmission after lumbar spinal surgery. J. Neurosurg. Spine 2017, 26, 144–152. [Google Scholar]
Dreiseitl, S.; Ohno-Machado, L. Logistic regression and artificial neural network classification models: A methodology review. J. Biomed. Inform. 2002, 35, 352–359. [Google Scholar] [CrossRef]
Guo, Z.; Wang, P.; Ye, S.; Li, H.; Bao, J.; Shi, R.; Yang, S.; Yin, R.; Wu, X. Interpretable Machine Learning Models Based on Shapley Additive Explanations for Predicting the Risk of Cerebrospinal Fluid Leakage in Lumbar Fusion Surgery. Spine 2024, 49, 1281–1293. [Google Scholar] [CrossRef]
Jain, D.; Durand, W.B.; Burch, S.; Daniels, A.; Berven, S. Machine Learning for Predictive Modeling of 90-day Readmission, Major Medical Complication, and Discharge to a Facility in Patients Undergoing Long Segment Posterior Lumbar Spine Fusion. Spine 2020, 45, 1151–1160. [Google Scholar] [CrossRef]
Passias, P.G.; Poorman, G.W.; Bortz, C.A.; Qureshi, R.; Diebo, B.G.; Paul, J.C.; Horn, S.R.; Segreto, F.A.; Pyne, A.; Jalai, C.M.; et al. Predictors of adverse discharge disposition in adult spinal deformity and associated costs. Spine J. 2018, 18, 1845–1852. [Google Scholar] [CrossRef]
Kim, J.S.; Arvind, V.; Oermann, E.K.; Kaji, D.; Ranson, W.; Ukogu, C.; Hussain, A.K.; Caridi, J.; Cho, S.K. Predicting surgical complications in patients undergoing elective adult spinal deformity procedures using machine learning. Spine Deform. 2018, 6, 762–770. [Google Scholar] [CrossRef]
Cabrera, A.; Bouterse, A.; Nelson, M.; Razzouk, J.; Ramos, O.; Bono, C.M.; Cheng, W.; Danisa, O. Accounting for age in prediction of discharge destination following elective lumbar fusion: A supervised machine learning approach. Spine J. 2023, 23, 997–1006. [Google Scholar] [CrossRef]
Arrighi-Allisan, A.E.; Neifert, S.N.; Gal, J.S.; Deutsch, B.C.; Caridi, J.M. Discharge destination as a predictor of postoperative outcomes and readmission following posterior lumbar fusion. World Neurosurg. 2019, 122, e139–e146. [Google Scholar] [CrossRef] [PubMed]
Aldebeyan, S.; Aoude, A.; Fortin, M.; Nooh, A.; Jarzem, P.; Ouellet, J.; Weber, M.H. Predictors of discharge destination after lumbar spine fusion surgery. Spine 2016, 41, 1535–1541. [Google Scholar] [CrossRef]
Ogura, Y.; Gum, J.L.; Steele, P.; Crawford, C.H.; Djurasovic, M.; Owens, R.K.; Laratta, J.L.; Brown, M.; Daniels, C.; Dimar, J.R.; et al. Drivers for nonhome discharge in a consecutive series of 1502 patients undergoing 1- or 2-level lumbar fusion. J. Neurosurg. Spine 2020, 33, 766–771. [Google Scholar] [CrossRef]
Karnuta, J.M.; Golubovsky, J.L.; Haeberle, H.S.; Rajan, P.V.; Navarro, S.M.; Kamath, A.F.; Schaffer, J.L.; Krebs, V.E.; Pelle, D.W.; Ramkumar, P.N. Can a machine learning model accurately predict patient resource utilization following lumbar spinal fusion? Spine J. 2020, 20, 329–336. [Google Scholar] [CrossRef]
Bohl, D.D.; Haws, B.E.; Khechen, B.; Patel, D.V.; Mayo, B.C.; Ahn, J.; Louie, P.K.; Cardinal, K.L.; Guntin, J.A.; Singh, K. Impact of the number of levels on adverse events and length of stay following posterior lumbar fusion procedures. Clin. Spine Surg. 2019, 32, 120–124. [Google Scholar] [CrossRef]
Kalakoti, P.; Gao, Y.; Hendrickson, N.R.; Pugely, A.J. Preparing for bundled payments in cervical spine surgery. Spine 2019, 44, 334–345. [Google Scholar] [CrossRef]
Gruskay, J.A.; Fu, M.; Bohl, D.D.; Webb, M.L.; Grauer, J.N. Factors affecting length of stay after elective posterior lumbar spine surgery: A multivariate analysis. Spine J. 2015, 15, 1188–1195. [Google Scholar] [CrossRef] [PubMed]

Figure 1. The annual number of articles related to lumbar degenerative pathology and artificial intelligence.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Published by MDPI on behalf of the Lithuanian University of Health Sciences. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Trento, A.; Rapisarda, S.; Bresolin, N.; Valenti, A.; Giordan, E. Artificial Intelligence and Its Impact on the Management of Lumbar Degenerative Pathology: A Narrative Review. Medicina 2025, 61, 1400. https://doi.org/10.3390/medicina61081400

AMA Style

Trento A, Rapisarda S, Bresolin N, Valenti A, Giordan E. Artificial Intelligence and Its Impact on the Management of Lumbar Degenerative Pathology: A Narrative Review. Medicina. 2025; 61(8):1400. https://doi.org/10.3390/medicina61081400

Chicago/Turabian Style

Trento, Alessandro, Salvatore Rapisarda, Nicola Bresolin, Andrea Valenti, and Enrico Giordan. 2025. "Artificial Intelligence and Its Impact on the Management of Lumbar Degenerative Pathology: A Narrative Review" Medicina 61, no. 8: 1400. https://doi.org/10.3390/medicina61081400

APA Style

Trento, A., Rapisarda, S., Bresolin, N., Valenti, A., & Giordan, E. (2025). Artificial Intelligence and Its Impact on the Management of Lumbar Degenerative Pathology: A Narrative Review. Medicina, 61(8), 1400. https://doi.org/10.3390/medicina61081400

Article Menu

Artificial Intelligence and Its Impact on the Management of Lumbar Degenerative Pathology: A Narrative Review

Abstract

1. Introduction

2. Results and Discussion

2.1. Study Characteristics

2.2. AI for Lumbar Spinal Stenosis

2.2.1. Diagnosis

2.2.2. Treatment

2.2.3. Prognosis

2.3. AI for Lumbar Disc Herniation

2.3.1. Diagnosis

2.3.2. Treatment

2.3.3. Prognosis

2.4. AI for Lumbar Fusion Surgery

2.4.1. Outcome Prediction

2.4.2. Complication Prediction

2.4.3. Cost Prediction

2.5. Limitations

3. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI