Next Article in Journal
Has the Expected Shift in HIV-Related Cancers Occurred? Findings from a Long-Term HIV Cohort in Turkey
Previous Article in Journal
Robotic-Assisted Kinematically Aligned Total Knee Arthroplasty Demonstrated Early Rehabilitation and Select Mental Health-Related Quality of Life Improvements Compared to Conventional MA-TKA
Previous Article in Special Issue
Mapping the Prevalence and Risk Factors of Low Back Pain Among University Populations in Saudi Arabia: A Systematic Review and Meta-Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Systematic Review

Comparative Effects of Therapeutic Exercise and Manual Therapy Techniques on Self-Reported Disability in Chronic Non-Specific Low Back Pain: A Network Meta-Analysis

by
Miguel Robles-García
1,
Juan Luis Sánchez González
2,*,
José Luis Sánchez-Sánchez
3,
Laura Calderón-Díez
1,
Miguel Santos Del Rey
1 and
Javier Martín-Vallejo
4
1
Departamento de Anatomia e Histología, Facultad de Medicina, Campus Miguel de Unamuno, Universidad de Salamanca, s/n, 37007 Salamanca, Spain
2
Departamento de Medicina, Universidad de Salamanca, Instituto de Investigación Biomédica de Salamanca (IBSAL), Campus Miguel de Unamuno s/n, 37007 Salamanca, Spain
3
Departamento de Enfermería y Fisioterapia, Universidad de Salamanca, Instituto de Investigación Biomédica de Salamanca (IBSAL), Campus Miguel de Unamuno s/n, 37007 Salamanca, Spain
4
Departamento de Estadística, Universidad de Salamanca, Instituto de Investigación Biomédica de Salamanca (IBSAL), Campus Miguel de Unamuno s/n, 37007 Salamanca, Spain
*
Author to whom correspondence should be addressed.
J. Clin. Med. 2026, 15(12), 4809; https://doi.org/10.3390/jcm15124809 (registering DOI)
Submission received: 27 May 2026 / Revised: 17 June 2026 / Accepted: 18 June 2026 / Published: 21 June 2026
(This article belongs to the Special Issue Evidence-Based Diagnosis and Clinical Management of Low Back Pain)

Abstract

Background/Objectives: Chronic non-specific low back pain is a leading cause of disability. Although therapeutic exercise and manual therapy are commonly recommended, their relative effects are often interpreted using broad therapeutic categories. This network meta-analysis aimed to compare the relative effectiveness of specific therapeutic exercise and manual therapy techniques on post-treatment self-reported disability in adults with chronic non-specific low back pain. Methods: A systematic review and frequentist random-effects network meta-analysis were conducted according to Cochrane recommendations and PRISMA-NMA guidance. The protocol was registered in PROSPERO (CRD42022331411). Randomized controlled trials including adults aged 18–65 years with chronic non-specific low back pain were searched in CENTRAL, PubMed, PEDro, Google Scholar, and SciELO up to 31 March 2026. Disability was assessed using the Roland–Morris Disability Questionnaire or Oswestry Disability Index. Effects were synthesized as standardized mean differences. Risk of bias was assessed with RoB 2, and confidence in network estimates was evaluated using CINeMA. Results: Forty-five studies were included. Compared with control/placebo, the largest favorable estimates were observed for equipment-based Pilates, stabilization with motor control, stabilization exercise, soft tissue manipulation, and Pilates Mat. Equipment-based Pilates showed the largest favorable estimate with moderate-confidence evidence, and soft tissue manipulation also showed moderate-confidence evidence. However, heterogeneity was substantial, and confidence in most favorable exercise estimates was low. Conclusions: Specific exercise and manual therapy techniques may reduce post-treatment disability in adults with chronic non-specific low back pain. Equipment-based Pilates and soft tissue manipulation showed favorable signals supported by moderate-confidence evidence. However, the findings do not support a definitive hierarchy of efficacy or categorical superiority of therapeutic exercise over manual therapy.

1. Introduction

Low back pain is the leading cause of disability worldwide and one of the conditions with the greatest potential need for rehabilitation [1]. Its burden shows an increasing trajectory. In 2020, low back pain affected 619 million people worldwide, and this figure is projected to rise to 843 million by 2050 [2]. Approximately 90% of cases are classified as non-specific [3]. Low back pain is considered chronic when symptoms persist for more than 12 weeks from onset [4]. Beyond its clinical consequences, non-specific chronic low back pain (NSCLBP) generates a substantial economic, healthcare, and social burden in both high-income and low- and middle-income countries [5,6].
Because NSCLBP has a particularly relevant impact on functional limitation, self-reported disability is a central outcome for assessing the clinical effects of interventions. In this context, current clinical guidelines recommend non-surgical, person-centered care, with therapeutic exercise as one of its main pillars. Certain manual therapy techniques are also considered, especially when they are integrated into multimodal strategies adapted to patient characteristics and preferences [4,7,8,9,10,11,12].
Randomized controlled trials (RCTs) evaluating different forms of therapeutic exercise and manual therapy techniques in patients with NSCLBP have increased substantially in recent years. However, this body of evidence remains difficult to interpret due to variability in the interventions, heterogeneity in dosage and supervision, diversity of comparators, and the use of different instruments to assess disability. Pairwise meta-analysis can quantitatively synthesize direct comparisons, but it is limited when multiple potentially relevant interventions have not been compared directly with each other. In this context, network meta-analysis (NMA) provides a methodological framework for combining direct and indirect evidence and estimating the relative effectiveness of multiple interventions within the same analysis [13,14].
Several previous NMAs have examined the comparative efficacy of different therapeutic exercise modalities in patients with chronic low back pain. Pilates, strengthening exercises, core-based exercises, and motor control or stabilization programs have repeatedly been identified among the interventions with the most favorable relative performance for pain and disability [15,16,17,18,19,20]. However, these NMAs have mainly focused on comparisons between exercise modalities. Manual therapy has usually been considered as a broad comparator, an undifferentiated conservative treatment, or a component of more heterogeneous categories. This approach limits the possibility of drawing specific clinical inferences about concrete manual techniques, such as spinal manipulation or soft tissue manipulation. It also makes it more difficult to compare specific exercise and manual therapy modalities in a clinically meaningful way.
Consistent with this issue, the Cochrane review on exercise for chronic low back pain compared different exercise modalities not only with control but also with several non-invasive physiotherapy treatments. It was observed that exercise was probably more effective than advice or education alone, and also more effective than electrotherapy. However, no clear differences were identified when exercise was compared with manual therapy treatments [21]. As these comparisons were based on broad therapeutic categories, an important clinical question remains unresolved: whether a more specific classification of therapeutic exercise and manual therapy techniques could allow a more precise interpretation of their relative efficacy on self-reported disability.
Broad therapeutic labels such as “exercise therapy” or “manual therapy” may group interventions that differ substantially in therapeutic rationale, delivery requirements, progression, supervision and expected clinical use. For example, equipment-based Pilates and Mat Pilates share general principles but differ in external support, load progression and sensorimotor feedback; similarly, stabilization exercise with explicit motor-control retraining is not equivalent to non-specific trunk stabilization, and soft tissue manipulation should not be interpreted as clinically interchangeable with spinal manipulation. This lack of specificity may limit the usefulness of previous evidence syntheses for individualized rehabilitation decisions. Therefore, a more specific classification of therapeutic exercise and manual therapy techniques may help clinicians interpret which interventions show the most favorable disability outcomes, while also identifying where uncertainty remains. Moreover, treatment effects may also depend on dose, session duration, supervision, progression strategy, adherence and home practice, which may contribute to heterogeneity and limit the direct clinical transferability of broad therapeutic categories.
The protocol for the present study already identified the need to compare selected therapeutic exercise and manual therapy techniques using NMA. This approach was intended to reduce the heterogeneity produced by grouping clinically different interventions within the same category [13,14]. Since protocol registration, González-Gómez et al. reported a small advantage of therapeutic exercise over manual therapy for long-term disability. Nevertheless, the certainty of the evidence was very low, and the authors could not establish conclusive differences between the two modalities when analyzed as isolated treatments [22]. Because their study used pairwise meta-analysis, it could not simultaneously explore multiple specific modalities that had not been directly compared with each other [13,14,22]. Therefore, the objective of the present NMA was to compare the relative efficacy of specific therapeutic exercise and manual therapy techniques on self-reported disability in adults with NSCLBP at the end of the interventions. To address this objective, we applied a nodal classification based on the categories prespecified in the protocol and refined it using clinical and methodological criteria. This was done to improve the reproducibility of the classification and the clinical interpretability of the network.

2. Materials and Methods

This NMA was conducted following Cochrane methodological recommendations. It was reported according to the PRISMA extension statement for systematic reviews incorporating network meta-analyses of healthcare interventions: PRISMA-NMA [23]. The protocol was registered in PROSPERO on 20 May 2022 (registration ID: CRD42022331411).

2.1. Deviations from the Protocol

The present manuscript evaluates the outcome of self-reported disability at the end of the intervention. The pain outcome was not analyzed or interpreted in this manuscript.
The intervention nodes were defined according to the protocol. During the classification process and before quantitative analysis, operational refinements were introduced to improve reproducibility and clinical interpretability. Specifically, classical stabilization exercises (ST) were separated from stabilization exercises with an explicit motor control component (STMC). Mat Pilates programs (Pilates MAT) were also separated from programs in which apparatus use was a structural component of the intervention (Pilates APP). In addition, some active comparators could not be clearly assigned to the main nodes or to the control/placebo node. These were therefore included as accessory connectivity nodes: general exercise (GE), back school (BS), stretching (STRET), and usual care (UC).
Risk of bias was assessed using the Cochrane RoB 2 tool. The PEDro <6 exclusion criterion was prespecified in the registered protocol and was applied before quantitative synthesis as a minimum methodological adequacy filter to reduce the likelihood that trials with very limited methodological reporting or conduct contributed unstable or poorly interpretable data to the disability network. However, excluding trials on the basis of a summary methodological score may introduce quality-related selection bias and may obscure domain-specific sources of bias. This threshold was not used as a substitute for domain-level risk-of-bias assessment, nor was it used to weight studies, rank interventions, or determine the certainty of the evidence. Because PEDro summary scores and RoB 2 assess overlapping but non-equivalent constructs, the implications of this eligibility criterion are acknowledged in the limitations.
Although the protocol specified a fixed-effect model, the analysis was performed using a random-effects model. This decision was made after data extraction and before any quantitative network analysis, based on the clinical and methodological variability identified across eligible trials, including differences in intervention content, dose, duration, supervision, comparator characteristics and disability instruments.
The protocol was refined to assess the confidence in the network estimates for disability using CINeMA, a GRADE-based framework specifically developed for network meta-analysis. This addition was made to improve the interpretability and transparency of the network estimates. The assessment was restricted to the post-treatment disability outcome, which was the outcome addressed in the present manuscript and did not modify the eligibility criteria, intervention nodes, statistical model, or network effect estimates.
A detailed comparison between the registered PROSPERO protocol and the final review methods is provided in Supplementary Table S9.

2.2. Search Strategy

Two independent reviewers (FJMV and MRG) conducted a systematic search in CENTRAL, PubMed, PEDro, Google Scholar, and SciELO databases, up to 31 March 2026. The search aimed to identify randomized controlled trials (RCTs) assessing the effectiveness of manual therapy and therapeutic exercise techniques for reducing disability in people with NSCLBP.
The search strategy combined terms and Boolean operators. The specific details of the search strategies used in each database are provided in the Supplementaries Table S1a,b.
No language restrictions were applied. The Rayyan tool was used to identify duplicates and organize references. Disagreements were resolved by consensus or by a third independent reviewer (MSDR).

2.3. Eligibility Criteria

The inclusion criteria were defined according to the PICOS framework: participants, interventions, comparators, outcomes, and study design.
Participants: We included RCTs conducted in adults aged 18 to 65 years with chronic non-specific low back pain, defined as low back pain lasting 12 weeks or longer [4].
Type of intervention: We included trials in which the main intervention corresponded to one of the therapeutic exercise or manual therapy techniques prespecified in the protocol: progressive strengthening/resistance exercise (STR), stabilization exercises (ST), Pilates, McKenzie method (MDT), spinal manipulation (SM), soft tissue manipulation (STM), or dry needling (DN). Interventions were considered regardless of their frequency, intensity, or duration.
Comparator: The main comparator was pure control/placebo. This was defined as minimal intervention, waiting list, placebo, or isolated minimal education without a structured therapeutic exercise or manual therapy program.
Outcomes: The outcome was self-reported disability measured with the Roland–Morris Disability Questionnaire (RMDQ) or the Oswestry Disability Index (ODI). Both instruments are widely validated for assessing disability related to low back pain [24,25].

2.4. Study Design: RCTs

Studies were excluded if they specifically recruited participants with low back pain of identifiable origin, including diagnosed diskogenic or disk-related low back pain, radiculopathy, nerve-root involvement, post-surgical low back pain, recurrent low back pain, or other specific spinal pathology. Imaging-based exclusion of disc-related degenerative findings was not required when trials classified participants as having chronic non-specific low back pain. Studies were also excluded when follow-up was shorter than four weeks, when the PEDro score was <6, or when the intervention could not be assigned to a defined node. Additive designs were accepted only when both groups received an equivalent background co-intervention. This ensured that the differential effect could be attributed to the specific intervention under evaluation or to its placebo. Studies with unbalanced co-interventions or non-divisible multimodal comparators were excluded.

2.5. Classification of Interventions

Intervention nodes were established according to the protocol. During the classification process, and before quantitative analysis, the divisions described in the deviations section were applied. The final nodes were defined as follows:
Exercise-related interventions: Stabilization Exercises (ST), Stabilization Exercises With Motor Control (STMC), Strengthening Exercises (STR), Pilates Mat (PMAT), Equipment-based Pilates (PAPP), McKenzie technique (MDT).
Manual therapy-related interventions: Lumbar Spinal Manipulation (SM), Soft Tissue Manipulation (STM), Dry Needling (DN).
Control intervention: Control/Placebo (CTRL).
Accessory comparators: Back School (BS), Stretching (STRET), Usual Care (UC), General Exercise (GE).
The complete operational definitions for all intervention nodes are provided in the Supplementary Material (Table S2). Briefly, soft tissue manipulation was operationalised as physiotherapy techniques involving direct manual treatment of muscular or fascial tissues, including massage, myofascial techniques, or other soft tissue procedures.
Intervention-node classification was performed independently by two reviewers (FJMV and MRG), using the protocol-defined categories and the operational definitions reported in Supplementary Table S2. Disagreements were resolved by consensus; when consensus could not be reached, a third independent reviewer (MSDR) adjudicated the final node assignment.

2.6. Data Extraction

Two reviewers (FJMV and MRG) independently extracted the following data: author, country, year of publication, randomisation method, type of blinding, population characteristics, age, percentage of women, body mass index, sample size, intervention, intervention duration, losses to follow-up, post-treatment outcomes, and disability scale used. When discrepancies were found in the extracted outcomes, a third reviewer (MSDR) assessed the corresponding studies.
For disability outcomes, baseline and post-treatment values were extracted from the RMDQ and ODI scales. When both instruments were available for the same sample, RMDQ values were selected because of their superior construct validity [26].

2.7. Risk-of-Bias Assessment

Two reviewers (FJMV and MRG) independently assessed the risk of bias of the RCTs included in the analysis using the Cochrane Risk of Bias 2 (RoB 2) tool. Disagreements were resolved by consensus or by a third independent reviewer (MSDR).

2.8. Data Analysis

The analysis was conducted using a random-effects model. The network analysis was carried out using the frequentist perspective. The unbiased estimator of the standardized mean difference was used as the effect size because studies included different disability instruments and scales. Effect sizes were interpreted according to Cohen’s criteria, where values below 0.20 were classified as small, those between 0.20 and 0.80 as moderate, and values exceeding 0.80 as large. Transitivity was explored descriptively by tabulating available potential effect modifiers across direct comparisons. Available quantitative variables included baseline disability, disability instrument, treatment duration, age, sex distribution, body mass index, PEDro score and RoB 2 overall judgment. These variables were not incorporated as covariates in the network meta-analysis model, and no formal subgroup, meta-regression or adjusted network analyses were performed. Intervention-delivery variables, including frequency, supervision, progression, protocol fidelity/adherence and home practice, were extracted separately because they were heterogeneously described across trials and could not be consistently summarized quantitatively. The Q test and I2 were calculated to measure heterogeneity within designs (QW) and inconsistency between (QI) designs. Because QI is a global measure of inconsistency that cannot identify specific sources of discrepancy and may be inflated by substantial heterogeneity [27], the overall Q-test for the design-by-treatment interaction model was also computed. This model accounts for two distinct sources of discrepancy within the network. The first is loop inconsistency, which arises when direct evidence for a given comparison conflicts with the indirect evidence derived from closed treatment loops. The second is design inconsistency, which pertains to differences in effect estimates across studies evaluating different sets of treatments [28]. Inconsistency was also explored by aggregate mean values of the design and quality variables recorded, and by comparing the direct and indirect estimates of the different comparisons. The mean path length, minimal Parallelism and Net Heat Plot [29] procedures were used to analyze the validity of the results. The first coefficients indicate the consistency of the comparisons, and the heatplot indicates by the size of the squares and the background color how important the estimate of the comparison between a treatment in the rest of the comparisons and the inconsistency of the designs attributable to the rest of the designs, respectively. The p-score [30] was used as a procedure to rank the treatments involved according to their effect. Publication bias was explored using the Comparison-adjusted funnel plot [31]. Confidence intervals were calculated at 95% confidence, and a significance level of 5% was set. The netmeta (Netmeta 3.2.0) and dmetar libraries (dmetar 0.1.0) of the R statistics program were used to perform the analyses.

2.9. Certainty of the Evidence

Confidence in the network estimates for post-treatment disability was assessed using the CINeMA framework, a GRADE-based approach specifically developed for network meta-analysis [32]. The assessment considered six domains: within-study bias, reporting bias, indirectness, imprecision, heterogeneity, and incoherence.
Because disability was synthesized using standardized mean differences across different instruments, no single instrument-specific minimal clinically important difference could be applied consistently across the network. Therefore, a ±0.20 SMD decision threshold was defined before final CINeMA confidence adjudication and applied uniformly to imprecision, heterogeneity, and incoherence judgments. This value was selected as a conservative boundary for a small standardized effect, consistent with the conventional interpretation of Cohen-type standardized mean differences, and was used only as a methodological decision aid for judging whether uncertainty crossed a potentially relevant effect boundary. It should not be interpreted as an ODI- or RMDQ-specific minimal clinically important difference, and the threshold-dependent nature of the CINeMA ratings is acknowledged in the limitations [32,33].
Automatic CINeMA domain-level judgments were reviewed against additional network diagnostics defined before final confidence adjudication, including prediction intervals, contribution matrix, direct–indirect comparisons, design-level sensitivity analyses, minimal parallelism, mean path length, and the net heat plot. Manual adjudication was conservative and downgrade-only (manual adjudication fully documented in the Supplementary Material Text S1). Final confidence ratings were classified as high, moderate, low, or very low and reported in a CINeMA-based Summary of Findings table.

3. Results

Up to 31 March 2026, a total of 2024 records were identified: 1905 through database searches and 119 through other methods. 45 studies were included in the NMA [34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78]. The flow diagram is shown in Figure 1. Because no studies met the eligibility criteria for the dry needling intervention group, this node was removed from the analysis. Excluded studies and reasons for exclusion are reported in Supplementary Table S3. According to the PRISMA flow diagram, 57 full-text records were excluded with a PEDro score <6 as the primary reason recorded for PRISMA reporting. This category should be interpreted as the primary recorded reason rather than as an exclusive reason for exclusion, because some records could also have met other exclusion criteria.

3.1. Study Characteristics

The characteristics of the included studies are provided in the Supplementary Table S4a, together with the distribution of intervention nodes in the analytical network, Supplementary Table S4b, and the detailed description of the interventions, Supplementary Table S5.

3.2. Risk-of-Bias Assessment

A total of 48 comparisons from 45 studies were assessed using the Cochrane Risk of Bias 2 (RoB 2) tool. 15 comparisons (31.3%) were judged as low risk of bias, 29 (60.4%) as having some concerns, and 4 (8.3%) as high risk of bias (Figure 2). The risk-of-bias assessment of included intervention comparisons for disability outcome is presented in the Supplementary Figure S1.
Study characteristics and potential effect modifiers across direct comparisons, including sample sizes, sociodemographic data, baseline disability and PEDro quality scores, are detailed in Supplementary Table S6. The mean treatment duration was 6.5 weeks (range: 4–10). Mean age was balanced between groups (42.74 vs. 42.86 years), although participants in the STRET vs. CTRL and STMC vs. STR designs were younger on average. Most study designs favored a higher female-to-male ratio, except for four specific comparisons (STR vs. CTRL, MDT vs. UC, STMC vs. ST, and STMC vs. STR). Specific gender imbalances were identified in the MDT vs. UC design. Furthermore, lower PEDro scale scores within the eligible range (scores of 6–7) were observed for the STMC vs. ST, STR vs. GE, PILATES MAT vs. GE, ST vs. CTRL, and STMC vs. STR comparisons. Intervention-delivery variables, including frequency, supervision, progression, protocol fidelity/adherence and home practice, are reported in Supplementary Table S8. These variables were used for descriptive characterization of the evidence base and were not included as adjustment variables in the network model.
The value of the heterogeneity test QT is 576.14 (p < 0.001 and I2 = 93.8%). The test of heterogeneity within designs was significant (QW = 576.14; df = 36; p < 0.001), and the QI test for inconsistency was statistically significant, indicating relevant between-design variability (QI = 63.01; df = 15; p < 0.001). However, the global inconsistency Q-test for the design-by-treatment interaction model was non-significant (Q = 3.74; df = 15, p = 0.998). This discrepancy arises because the QI test measures global inconsistency, which can be confounded by high between-study heterogeneity. Furthermore, it evaluates general inconsistency without partitioning the specific discrepancies stemming from the interaction between treatment effects and study designs, which are explicitly accounted for in the Q-test of the design-by-treatment interaction model. The network graph appears in Figure 3, where comparisons between the most common interventions can be visualized.
The sensitivity analysis, conducted by systematically excluding individual treatment designs, revealed that ST vs. STR, Pilates Mat vs. CTRL, STMC vs. CTRL, and ST vs. STR vs. CTRL were identified as localized sources of network inconsistency. Specifically, the isolation and removal of each of these designs yielded a substantial decrease of more than 7 points in the global Q-test score. To further contextualize these localized instabilities within the broader network architecture, Supplementary Figure S2 illustrates the contribution of direct evidence alongside coefficients for minimal parallelism and mean path length. Notably, several comparisons are characterized by high mean path lengths and low minimal parallelism; these architectural patterns reinforce the primary role of indirect evidence and highlight a limited redundancy within those specific network nodes. The Net Heat Plot (Figure 4) confirms these localized discrepancies; it shows that while the ST vs. STR design itself exhibits high internal consistency, it contributed to the inconsistency observed in the multi-arm designs in which it is embedded. Additionally, the Pilates Mat vs. CTRL design displays a high level of internal inconsistency. Ultimately, these combined diagnostic findings supported a conservative interpretation of the corresponding network estimates and supported the conservative adjudication of CINeMA confidence ratings where appropriate.
Supplementary Table S7 presents direct, indirect, and network estimates. No statistically significant differences were observed between direct and indirect estimates; however, the largest discrepancies were noted in the comparisons of Pilates Mat vs. GE, Pilates Mat vs. CTRL, ST vs. CTRL.
Compared with CTRL, the largest favorable point estimates for post-treatment disability were observed for Pilates APP (SMD = −0.879, 95% CI −1.156 to −0.601; 95% PrI −1.434 to −0.323; P-score = 0.92; moderate confidence), STMC (SMD = −0.856, 95% CI −1.044 to −0.667; 95% PrI −1.370 to −0.342; P-score = 0.92; low confidence), ST (SMD = −0.757, 95% CI −0.988 to −0.527; 95% PrI −1.290 to −0.225; P-score = 0.83; low confidence), STM (SMD = −0.681, 95% CI −0.998 to −0.365; 95% PrI −1.259 to −0.104; P-score = 0.75; moderate confidence), and Pilates MAT (SMD = −0.643, 95% CI −0.851 to −0.436; 95% PrI −1.165 to −0.121; P-score = 0.72; very low confidence). GE, SM, STR, MDT, STRET, UC, and BS showed smaller point estimates, wider uncertainty, and/or lower confidence versus CTRL. Negative SMD values favor the intervention. For clinical interpretation, the largest favorable estimates versus control ranged from approximately −0.64 to −0.88 SMD. Based on conventional Cohen-type benchmarks for standardized effects, these values correspond to moderate-to-large standardized effects [33]. However, because disability was synthesized using SMDs across ODI and RMDQ, these values cannot be translated into a single instrument-specific minimal clinically important difference. Therefore, clinical interpretation was based on the joint consideration of effect size, 95% confidence intervals, 95% prediction intervals, P-score values and CINeMA confidence ratings, rather than on statistical significance or ranking position alone, as summarized in Table 1 and Figure 5.
Moderate-confidence evidence supported Pilates APP and STM, whereas low-confidence evidence supported STMC and ST. Pilates MAT and most remaining comparisons were supported by very-low-confidence evidence. Thus, confidence ratings did not always parallel ranking values, and interventions with similar P-score positions should not be interpreted as having equivalent credibility.
The funnel plot reveals an asymmetrical distribution, with a lack of studies in the lower-left quadrant and several outliers on the right, potentially indicating large effect sizes in small-sample studies; see Supplementary Figure S3. Although Egger’s test was non-significant, the association between large standard errors and large effect sizes may suggest a potential risk of publication bias. However, it is worth noting that this pattern is driven by only a small subset of the included studies, indicating that this potential asymmetry is localized rather than widespread across the network. Reporting bias was therefore judged as undetected/no downgrade in CINeMA.

4. Discussion

This NMA was designed to compare the relative efficacy of specific therapeutic exercise and manual therapy techniques on self-reported disability in adults with non-specific chronic low back pain at the end of the intervention. Pilates APP, STMC, ST, STM, and Pilates MAT showed the most favorable signals compared with control. However, the high heterogeneity of the network and the limited certainty of the evidence require cautious interpretation. These findings should be viewed as signals of potential benefit rather than as a definitive hierarchy of clinical efficacy. Although the most favorable point estimates were of moderate-to-large standardized magnitude, their clinical interpretation should remain cautious. The use of SMDs allowed ODI and RMDQ outcomes to be analyzed within the same network, but it also prevents a direct and uniform translation of the results into scale-specific clinically important changes. Accordingly, clinical interpretation should consider effect size, confidence intervals, prediction intervals, risk of bias, heterogeneity, indirectness, P-score values and CINeMA confidence ratings jointly, rather than treating ranking metrics as evidence of clinical superiority [30,79,80].
In our network, Pilates APP showed the largest favorable estimate and shared the highest-ranking position, with moderate confidence, which supports its cautious clinical consideration compared with interventions supported by lower-certainty evidence. STMC and ST also showed favorable estimates and high-ranking positions, but both comparisons were supported by low-confidence evidence. By contrast, STM showed a favorable signal with moderate confidence, supporting its cautious clinical consideration despite its lower position in the P-score ranking. Given the small differences in relative efficacy between the best-ranked interventions and the presence of moderate-confidence evidence for both exercise-based and manual therapy approaches, our findings do not support a categorical superiority of therapeutic exercise over manual therapy.
Previous literature has highlighted that the terms therapeutic exercise and manual therapy include highly heterogeneous interventions. This makes both clinical interpretation and estimation of relative effects difficult when pairwise comparisons or overly broad categories are used [13,14,21,22]. The contribution of the present study was to explore whether a more specific division of therapeutic exercise and manual therapy techniques could improve clinical interpretation.
This nodal division is particularly relevant for exercise interventions. Mat Pilates and apparatus-based Pilates share general principles, such as control, precision, breathing, and body awareness. However, apparatus use may modify movement assistance, sensorimotor feedback, and load progression, making their separation clinically justifiable [81]. Similarly, stabilization exercises with motor control are not equivalent to any lumbar stabilization program, because they include specific instructions, feedback, or retraining strategies aimed at lumbopelvic and trunk control [70,82,83]. Moreover, distinguishing stabilization/motor control from progressive strengthening was intended to avoid combining motor learning-oriented interventions with programs mainly focused on load, strength, or muscular endurance [18,84]. This separation was intended to improve clinical interpretability, although it also reduces the size of some nodes and requires cautious interpretation of the estimates.
The present findings are partly consistent with previous NMAs focused on therapeutic exercise. Fernández-Rodríguez et al. identified Pilates, strength training, core-based exercises, and mind–body exercise among the most favorable options for pain and disability in adults with chronic low back pain [16]. Hayden et al. also showed that some exercise modalities appear to be more effective than others, with Pilates and McKenzie ranking favorably for pain and functional limitation. However, they also emphasized the importance of dose, delivery format, and cointerventions [20]. Owen et al. and Li et al. reported similar findings, showing that different active exercise modalities may behave differently within a treatment network for chronic low back pain [15,17]. Zhao et al. further suggested that the effects of exercise depend not only on the type of intervention, but also on prescription parameters such as frequency, session duration, and total program duration [84].
Nevertheless, comparisons with these NMAs should be made with caution. In previous reviews, categories such as core-based exercise, stabilization, motor control, strengthening, or Pilates may have been partially grouped or overlapped. In our study, these interventions were separated into more specific nodes [16,17,18,19,20]. This methodological difference may explain why some interventions, such as STR, occupied a less prominent position in our network. This does not necessarily contradict previous evidence; rather, it may reflect the loss of the pooled effect produced when clinically related techniques are grouped together. Regarding MDT, its position in our network is consistent with Fernández-Rodríguez et al., where it did not show a clear advantage over control for this outcome [16]. In the NMA by Hayden et al., MDT ranked highest for disability, although its advantage was reduced after adjustment for dose and cointerventions [20]. Similarly, the low position of the STRET node is consistent with previous literature [16,17,20], although the limited number of comparisons in our network should also be considered.
The comparison with evidence directly contrasting therapeutic exercise and manual therapy should also be interpreted from this perspective. The Cochrane review on exercise for chronic low back pain showed that exercise is probably more effective than education or advice alone, and more effective than electrotherapy. However, it found no clear differences compared with manual therapy when these treatments were analyzed as broad comparator categories [21]. González-Gómez et al. found a small advantage of therapeutic exercise over manual therapy for long-term disability but concluded that the evidence was insufficient to establish conclusive differences between both modalities as isolated treatments. This was due to heterogeneity, the small number of studies, and the very low certainty of the evidence [22]. Our findings are consistent with this cautious interpretation, but they add a different perspective: the absence of categorical superiority between therapeutic families does not mean that all techniques behave similarly, nor that manual therapy should be interpreted as a homogeneous block.
The behavior of manual therapy within our network reinforces this interpretation. Soft tissue manipulation was among the interventions with a favorable signal compared with control and was the node with the highest relative certainty among the best-positioned interventions. By contrast, spinal manipulation showed a favorable signal of smaller magnitude and very low certainty. This difference suggests that the general category “manual therapy” may conceal different effects across specific manual techniques. External evidence also supports a cautious interpretation. Coulter et al. and de Zoete et al. showed that spinal manipulation or mobilization may improve pain and function in chronic low back pain, but these reviews do not clarify which specific manual technique provides greater relative value compared with specific exercise modalities [85,86]. Dos Santos et al. observed that adding manual therapy to exercise may improve disability in certain contexts. However, the evidence was limited and does not clearly establish which patients benefit, or which manual technique provides greater incremental value [87].
Dry needling was prespecified as a manual therapy node but could not be analyzed because no eligible RCT fulfilled the PICO and operational eligibility criteria required to populate this node in the disability network. This should not be interpreted as evidence that dry needling is ineffective for NSCLBP. Rather, it indicates a lack of eligible comparative evidence for the specific question addressed in this review: post-treatment self-reported disability measured with ODI or RMDQ, using an intervention contrast that allowed the effect of dry needling to be isolated or attributed to a balanced additive design. This distinction is clinically relevant because previous systematic evidence in chronic low back pain suggests that dry needling may have short-term effects on pain, particularly when combined with other therapies, but does not provide clear evidence of improvement in disability. Future trials should evaluate dry needling using clearly defined comparators, balanced co-interventions, adequate follow-up and standardized disability outcomes to allow its integration into future comparative evidence networks [88].
Variability in dose and intervention delivery may explain part of the heterogeneity observed in the network. Descriptive inspection of potential effect modifiers also showed variability across direct comparisons in baseline disability, age, sex distribution, BMI availability and methodological quality (Supplementary Table S6). The included programs differed in duration, frequency, supervision, progression, protocol fidelity, and use of home exercise (Supplementary Table S8). These differences are clinically relevant because exercise-prescription parameters, including frequency, session duration and total program duration, may influence treatment effects in chronic low back pain [84]. Similarly, therapeutic exercise progression in non-specific low back pain is heterogeneous and often insufficiently described, which may limit reproducibility and translation into clinical practice [89]. Recent meta-research using the i-CONTENT tool also showed that dosage and adherence are among the exercise-intervention domains with greater uncertainty in chronic low back pain trials [90]. Home-exercise delivery may be another relevant source of variability, as recent evidence suggests that unsupervised home exercise is less effective than supervised in-person exercise for short-term pain and functional disability in NSCLBP [91]. Finally, adherence should be interpreted as a complex, context-dependent implementation factor rather than as a simple binary variable; recent systematic evidence indicates that adherence to exercise therapy in NSCLBP is influenced by internal, external and intervention-related factors, including beliefs, motivation, supervision, feedback, program design, confidence and home-exercise support [92,93]. Therefore, two interventions classified within the same node may differ substantially in their real-world application. The nodes should be interpreted as clinically reasonable categories, but not as fully homogeneous interventions.
From a clinical perspective, self-reported disability is a broad outcome. This focus on disability is also supported by observational evidence showing that spinal pain is associated with functional decline and limitations in daily activities, although such evidence is contextual and not directly comparable to the intervention trials included in the present NMA [94]. It is influenced by pain, movement tolerance, perceived threat, self-efficacy, motor control, and functional participation [3,4]. This may explain why different interventions show similar improvements in disability at the end of the intervention, because of convergence through partially shared mechanisms, even if they are not equivalent. Pilates and stabilization/motor control programs may act through control, coordination, and progressive exposure to movement [81,83]. Manual techniques are currently explained through neurophysiological, contextual, and symptom-modulation mechanisms, rather than through an exclusively biomechanical interpretation. They may therefore influence pain, mechanical sensitivity, perceived threat, and movement tolerance [95,96]. However, this explanation should be considered exploratory, because the present NMA did not assess clinical mediators or physiological mechanisms.

4.1. Clinical Implications

From a clinical standpoint, the results are consistent with careful clinical consideration of structured, supervised interventions with potential for progression, particularly those based on Pilates APP, STMC, ST, and Pilates MAT. However, the favorable position of STM and the absence of clear statistically significant differences between these interventions prevent us from concluding that therapeutic exercise is globally superior to manual therapy for reducing self-reported disability at the end of the intervention. The findings are consistent with current recommendations, which favor non-surgical, person-centered care. In this approach, exercise is a therapeutic pillar, while manual therapy may be considered as part of a multimodal strategy when clinically justified [4,97,98].
In routine clinical care, the applicability of these findings will also be conditioned by the therapist’s familiarity with each technique, the patient’s expectations and preferences, and the feasibility of providing supervised or equipment-based programs in a given care setting. Therefore, treatment selection should integrate the overall certainty and applicability of the evidence with clinical reasoning and shared decision-making, rather than relying on the network estimates in isolation.
In patients with low initial tolerance to movement, high perceived threat, or symptoms that limit active participation, manual therapy may have a complementary role as a symptom-modulation strategy and as a means of improving movement tolerance. This may be particularly relevant when it is integrated with education and active progression. However, profile-based considerations should be understood as hypotheses for clinical reasoning and shared decision-making rather than as subgroup-specific treatment effects established by this NMA. Patient-level treatment-effect modifiers have been explored in persistent low back pain exercise trials using individual participant data meta-analysis, and baseline individual factors associated with outcomes after manual therapy have also been reviewed; however, these data do not establish subgroup-specific effects for the intervention nodes evaluated in the present network [99,100]. Available evidence also appears to support manual therapy primarily as a component of multimodal strategies rather than as a substitute for exercise or self-management [22,87,96,101].

4.2. Implications for Research

The results suggest several priorities for future research. In the short term, adequately powered head-to-head RCTs are needed between the nodes showing the most favorable signals, especially Pilates APP, STMC, ST, STM, and Pilates MAT. These trials should include adequate sample sizes, harmonized disability outcomes, predefined post-treatment and follow-up time points, and transparent reporting of dose, frequency, session duration, progression, supervision, adherence, protocol fidelity and home exercise. This is particularly relevant because exercise-prescription parameters may influence treatment effects in chronic low back pain [84], exercise progression is often heterogeneous and insufficiently described [89], and adherence is influenced by patient-, intervention- and context-related factors [92,93]. Improving intervention reporting and therapeutic standardization is therefore important to distinguish whether observed differences are attributable to the intervention type itself or to how the intervention is delivered.
In the longer term, future research should investigate treatment-effect modifiers and response subgroups using prespecified subgroup analyses or individual participant data approaches. Individual participant data meta-analysis has specifically examined exercise treatment-effect modifiers in persistent low back pain [99], and baseline individual factors associated with outcomes after manual therapy have recently been reviewed [100]; however, these approaches have not established subgroup-specific effects for the intervention nodes evaluated in the present NMA. Variables such as baseline disability, movement tolerance, kinesiophobia, symptom duration, expectations, self-efficacy, sensitisation, motor-control impairments, age, sex and BMI may help explain heterogeneity in response to exercise or manual therapy techniques. Future trials should also evaluate combined or sequential strategies, particularly those in which manual therapy is used as initial support for active exposure or to improve movement tolerance, without assuming in advance that its effect is independent of, or superior to, exercise [87].

4.3. Limitations

This study has several limitations that affect the interpretation of its findings. First, although the protocol included pain and function/disability, the present manuscript focused on self-reported disability at the end of the intervention. Therefore, no conclusions can be drawn about pain or about medium- or long-term effects. Second, the exclusion of diskogenic pain was based on the diagnostic labels and eligibility criteria reported in the original trials. Therefore, although studies explicitly recruiting participants with diskogenic or disc-related low back pain were excluded, some participants classified as having NSCLBP may have had unidentified disc-related changes, as most trials did not systematically use imaging to rule them out. Third, the network showed high heterogeneity, probably related to differences in population, intervention definitions, dose, supervision, outcome measures, and comparator characteristics. Potential effect modifiers and intervention-delivery characteristics were tabulated descriptively in Supplementary Tables S6 and S8, but they were not incorporated into adjusted network models because reporting was incomplete, heterogeneous and sparse across several comparisons. Therefore, residual intransitivity or confounding by intervention delivery cannot be excluded. Fourth, greater nodal specificity improves clinical interpretability, but it also reduces the size of some nodes, increases reliance on indirect evidence, and may make the analysis more sensitive to incomplete descriptions of intervention protocols. Fifth, dry needling could not be analyzed because no studies met the PICO and operational eligibility criteria required to populate this node in the disability network. As a result, the representation of manual therapy was limited to spinal manipulation and soft tissue manipulation. Sixth, the use of a PEDro score threshold as an eligibility criterion is another limitation of this review. Although the PEDro <6 exclusion criterion was prespecified in the registered protocol and was applied before quantitative synthesis, excluding trials on the basis of a summary methodological score may introduce quality-related selection bias. Finally, the CINeMA assessment and the ±0.20 SMD decision threshold were not specified in the original PROSPERO protocol. They were introduced after registration as a methodological refinement to improve transparency in the interpretation of network estimates. This refinement did not modify eligibility criteria, intervention nodes, the statistical model, or effect estimates. Because several automatic CINeMA judgments may not fully capture network-level instability identified by design-level sensitivity analyses, contribution profiles, and the net heat plot, final confidence ratings were conservatively adjudicated. Alternative thresholds or adjudication rules could have led to different confidence ratings; therefore, the CINeMA ratings should be interpreted as conservative and threshold-dependent. Reporting bias was not downgraded because no comparison-specific missing-evidence signal was identified, although small-study effects or missing evidence cannot be completely excluded. These limitations require the findings to be interpreted as a more precise reading of the available evidence rather than as a definitive hierarchy of efficacy.

5. Conclusions

This network meta-analysis suggests that several specific therapeutic exercise and manual therapy techniques may reduce post-treatment self-reported disability in adults with chronic non-specific low back pain when compared with control. Among the interventions with the most favorable point estimates, equipment-based Pilates and soft tissue manipulation showed favorable estimates supported by moderate-confidence evidence, whereas stabilization exercises with motor control and stabilization exercises were supported by low-confidence evidence, and Pilates Mat by very low-confidence evidence. However, the high heterogeneity of the network, the absence of clear statistically significant differences between the interventions with the most favorable estimates, and the threshold-dependent CINeMA adjudication require cautious interpretation. Overall, the results should be considered signals of potential post-treatment benefit rather than a definitive hierarchy of efficacy, and they do not support a categorical superiority of therapeutic exercise over manual therapy.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/jcm15124809/s1; Table S1a,b, database search strategies and Google Scholar/gray-literature searches; Table S2, operational definitions of intervention nodes; Table S3, excluded full-text records and reasons for exclusion; Table S4a,b, general study characteristics and distribution of intervention nodes; Table S5, characteristics of included comparisons and intervention extraction; Table S6, potential effect modifiers and transitivity assessment across direct comparisons; Table S7, direct–indirect decomposition of network estimates; Table S8, intervention delivery characteristics; Table S9, comparison between the registered PROSPERO protocol and final review methods; Figure S1, risk-of-bias assessment; Figure S2, evidence-flow and network-architecture diagnostics; Figure S3, comparison-adjusted funnel plot; Supplementary Material Text S1, CINeMA adjudication rules and rationale; Supplementary Checklist S1, PRISMA 2020 Checklist. Reference [102] are cited in the supplementary materials.

Author Contributions

Conceptualization, M.R.-G., J.L.S.G. and J.M.-V.; methodology, M.R.-G., J.L.S.G. and J.M.-V.; software, J.M.-V.; validation, M.R.-G., M.S.D.R. and J.M.-V.; formal analysis, J.M.-V.; investigation, M.R.-G., L.C.-D., M.S.D.R. and J.M.-V.; resources, J.L.S.G. and J.L.S.-S.; data curation, M.R.-G. and J.M.-V.; writing—original draft preparation, M.R.-G. and J.L.S.G.; writing—review and editing, M.R.-G., J.L.S.G., J.L.S.-S., L.C.-D., M.S.D.R. and J.M.-V.; visualization, M.R.-G. and J.M.-V.; supervision, J.L.S.G. and J.M.-V.; project administration, J.L.S.G.; funding acquisition, not applicable. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data supporting the findings of this study are available within the article and its Supplementary Materials. Additional extracted datasets and analysis files can be made available from the corresponding author upon reasonable request.

Acknowledgments

During the preparation of this manuscript, the authors used the Rayyan Citation tool for the purposes of structuring the systematic review conducted. The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
NSCLBPNon-Specific Chronic Low Back Pain
LBPLow Back Pain
NMANetwork Meta-Analysis
STStabilization Exercises
STMCStabilization Exercises with Motor Control
BSBack School
GEGeneral Exercise
STRStrengthening Exercises
STRETStretching
UCUsual Care
CTRLControl
STMSoft Tissue Manipulation
SMSpinal Lumbar Manipulation
DNDry Needling
PMATPilates Mat
PAPPPilates Apparatus
MDTMcKenzie Method
ODIOswestry Disability Index
RMDQRoland–Morris Disability Questionnaire

References

  1. Low Back Pain. Available online: https://www.who.int/news-room/fact-sheets/detail/low-back-pain (accessed on 9 May 2026).
  2. GBD 2021 Low Back Pain Collaborators. Global, Regional, and National Burden of Low Back Pain, 1990–2020, Its Attributable Risk Factors, and Projections to 2050: A Systematic Analysis of the Global Burden of Disease Study 2021. Lancet Rheumatol. 2023, 5, e316–e329. [CrossRef] [PubMed]
  3. Hartvigsen, J.; Hancock, M.; Kongsted, A.; Louw, Q.; Ferreira, M.L.; Genevay, S.; Hoy, D.; Karppinen, J.; Pransky, G.; Sieper, J.; et al. What Low Back Pain Is and Why We Need to Pay Attention. Lancet 2018, 391, 2356–2367. [Google Scholar] [CrossRef] [PubMed]
  4. World Health Organization. WHO Guideline for Non-Surgical Management of Chronic Primary Low Back Pain in Adults in Primary and Community Care Settings; World Health Organization: Geneva, Switzerland, 2023. [Google Scholar]
  5. Fatoye, F.; Gebrye, T.; Ryan, C.G.; Useh, U.; Mbada, C.E. Global and Regional Estimates of Clinical and Economic Burden of Low Back Pain in High-Income Countries: A Systematic Review and Meta-Analysis. Front. Public Health 2023, 11, 1098100. [Google Scholar] [CrossRef] [PubMed]
  6. Fatoye, F.; Gebrye, T.; Mbada, C.E.; Useh, U. Clinical and Economic Burden of Low Back Pain in Low- and Middle-Income Countries: A Systematic Review. BMJ Open 2023, 13, e064119. [Google Scholar] [CrossRef] [PubMed]
  7. Qaseem, A.; Wilt, T.; McLean, R.; Forciea, M. Clinical Guidelines Committee of the American College of Physicians Noninvasive Treatments for Acute, Subacute, and Chronic Low Back Pain: A Clinical Practice Guideline. Ann. Intern. Med. 2017, 166, 514–530. [Google Scholar] [CrossRef] [PubMed]
  8. Maher, C.; Underwood, M.; Buchbinder, R. Non-Specific Low Back Pain. Lancet 2017, 389, 736–747. [Google Scholar] [CrossRef] [PubMed]
  9. The Diagnosis and Treatment of Low Back Pain Work Group; Department of Veterans Affairs; Department of Defense. VA/DoD Clinical Practice Guideline for the Diagnosis and Treatment of Low Back Pain. Version 3.0; Department of Veterans Affairs: Washington, DC, USA; Department of Defense: Washington, DC, USA, 2022.
  10. Corp, N.; Mansell, G.; Stynes, S.; Wynne-Jones, G.; Morsø, L.; Hill, J.C.; van der Windt, D.A. Evidence-Based Treatment Recommendations for Neck and Low Back Pain across Europe: A Systematic Review of Guidelines. Eur. J. Pain 2021, 25, 275–295. [Google Scholar] [CrossRef] [PubMed]
  11. McKenzie, B.J.; Haas, R.; Ferreira, G.E.; Gorelik, A.; Cyril, S.; Han, J.X.; Gilbert, S.; Grobler, L.; Maher, C.G.; Buchbinder, R. Agreement between High-Quality Clinical Practice Guidelines in Their Treatment Recommendations for Low Back Pain: A Systematic Review. Spine J. 2025, 25, 2587–2604. [Google Scholar] [CrossRef] [PubMed]
  12. Zhou, T.; Salman, D.; McGregor, A.H. Recent Clinical Practice Guidelines for the Management of Low Back Pain: A Global Comparison. BMC Musculoskelet. Disord. 2024, 25, 344. [Google Scholar] [CrossRef] [PubMed]
  13. Chapter 10: Analysing Data and Undertaking Meta-Analyses|Cochrane. Available online: https://www.cochrane.org/authors/handbooks-and-manuals/handbook/current/chapter-10 (accessed on 9 May 2026).
  14. Chapter 11: Undertaking Network Meta-Analyses|Cochrane. Available online: https://www.cochrane.org/authors/handbooks-and-manuals/handbook/current/chapter-11 (accessed on 9 May 2026).
  15. Li, Y.; Yan, L.; Hou, L.; Zhang, X.; Zhao, H.; Yan, C.; Li, X.; Li, Y.; Chen, X.; Ding, X. Exercise Intervention for Patients with Chronic Low Back Pain: A Systematic Review and Network Meta-Analysis. Front. Public Health 2023, 11, 1155225. [Google Scholar] [CrossRef] [PubMed]
  16. Fernández-Rodríguez, R.; Álvarez-Bueno, C.; Cavero-Redondo, I.; Torres-Costoso, A.; Pozuelo-Carrascosa, D.P.; Reina-Gutiérrez, S.; Pascual-Morena, C.; Martínez-Vizcaíno, V. Best Exercise Options for Reducing Pain and Disability in Adults With Chronic Low Back Pain: Pilates, Strength, Core-Based, and Mind-Body. A Network Meta-Analysis. J. Orthop. Sports Phys. Ther. 2022, 52, 505–521. [Google Scholar] [CrossRef] [PubMed]
  17. Owen, P.J.; Miller, C.T.; Mundell, N.L.; Verswijveren, S.J.J.M.; Tagliaferri, S.D.; Brisby, H.; Bowe, S.J.; Belavy, D.L. Which Specific Modes of Exercise Training Are Most Effective for Treating Low Back Pain? Network Meta-Analysis. Br. J. Sports Med. 2020, 54, 1279–1287. [Google Scholar] [CrossRef] [PubMed]
  18. Guo, X. Effects of Different Types of Core Training on Pain and Functional Status in Patients with Chronic Nonspecific Low Back Pain: A Systematic Review and Meta-Analysis. Front. Physiol. 2025, 16, 1672010. [Google Scholar] [CrossRef] [PubMed]
  19. Baroncini, A.; Maffulli, N.; Schäfer, L.; Manocchio, N.; Bossa, M.; Foti, C.; Klimuch, A.; Migliorini, F. Physiotherapeutic and Non-Conventional Approaches in Patients with Chronic Low-Back Pain: A Level I Bayesian Network Meta-Analysis. Sci. Rep. 2024, 14, 11546. [Google Scholar] [CrossRef] [PubMed]
  20. Hayden, J.A.; Ellis, J.; Ogilvie, R.; Stewart, S.A.; Bagg, M.K.; Stanojevic, S.; Yamato, T.P.; Saragiotto, B.T. Some Types of Exercise Are More Effective than Others in People with Chronic Low Back Pain: A Network Meta-Analysis. J. Physiother. 2021, 67, 252–262. [Google Scholar] [CrossRef] [PubMed]
  21. Hayden, J.A.; Ellis, J.; Ogilvie, R.; Malmivaara, A.; Tulder, M.W. van Exercise Therapy for Chronic Low Back Pain. Cochrane Database Syst. Rev. 2021, 24, 193–204. [Google Scholar] [CrossRef]
  22. González-Gómez, L.; Moral-Munoz, J.A.; Rosales-Tristancho, A.; Cuevas-Moreno, A.; Cardellat-González, M.; Rodríguez-Domínguez, Á.-J. Exercise Therapy Versus Manual Therapy for the Management of Pain Intensity, Disability, and Physical Function in People With Chronic Low Back Pain: A Systematic Review With Meta-Analysis and Meta-Regression. Eur. J. Pain 2025, 29, e70090. [Google Scholar] [CrossRef] [PubMed]
  23. Hutton, B.; Salanti, G.; Caldwell, D.M.; Chaimani, A.; Schmid, C.H.; Cameron, C.; Ioannidis, J.P.A.; Straus, S.; Thorlund, K.; Jansen, J.P.; et al. The PRISMA Extension Statement for Reporting of Systematic Reviews Incorporating Network Meta-Analyses of Health Care Interventions: Checklist and Explanations. Ann. Intern. Med. 2015, 162, 777–784. [Google Scholar] [CrossRef] [PubMed]
  24. Schmidt, G.J.; Ferreira, A.P.B.; Pietrobon, R.; Vissoci, J.R.N.; Santos, R.A.A.D.; Lauffer, R.F.; Maçaneiro, C.H. Comparative Evaluation of the Psychometric Properties of Orthopedic Scales for Low Back Pain. Coluna/Columna 2019, 18, 308–312. [Google Scholar] [CrossRef]
  25. Stevens, M.L.; Lin, C.C.-W.; Maher, C.G. The Roland Morris Disability Questionnaire. J. Physiother. 2016, 62, 116. [Google Scholar] [CrossRef] [PubMed]
  26. Chiarotto, A.; Maxwell, L.J.; Terwee, C.B.; Wells, G.A.; Tugwell, P.; Ostelo, R.W. Roland-Morris Disability Questionnaire and Oswestry Disability Index: Which Has Better Measurement Properties for Measuring Physical Functioning in Nonspecific Low Back Pain? Systematic Review and Meta-Analysis. Phys. Ther. 2016, 96, 1620–1637. [Google Scholar] [CrossRef] [PubMed]
  27. Ades, A.E.; Welton, N.J.; Dias, S.; Phillippo, D.M.; Caldwell, D.M. Twenty Years of Network Meta-Analysis: Continuing Controversies and Recent Developments. Res. Synth. Methods 2024, 15, 702–727. [Google Scholar] [CrossRef] [PubMed]
  28. Jackson, D.; Boddington, P.; White, I.R. The Design-by-Treatment Interaction Model: A Unifying Framework for Modelling Loop Inconsistency in Network Meta-Analysis. Res. Synth. Methods 2016, 7, 329–332. [Google Scholar] [CrossRef] [PubMed]
  29. König, J.; Krahn, U.; Binder, H. Visualizing the Flow of Evidence in Network Meta-analysis and Characterizing Mixed Treatment Comparisons. Stat. Med. 2013, 32, 5414–5429. [Google Scholar] [CrossRef] [PubMed]
  30. Rücker, G.; Schwarzer, G. Ranking Treatments in Frequentist Network Meta-Analysis Works without Resampling Methods. BMC Med. Res. Methodol. 2015, 15, 58. [Google Scholar] [CrossRef] [PubMed]
  31. Chaimani, A.; Higgins, J.P.T.; Mavridis, D.; Spyridonos, P.; Salanti, G. Graphical Tools for Network Meta-Analysis in STATA. PLoS ONE 2013, 8, e76654. [Google Scholar] [CrossRef] [PubMed]
  32. Nikolakopoulou, A.; Higgins, J.; Papakonstantinou, T.; Chaimani, A.; Del Giovane, C.; Egger, M. CINeMA: An Approach for Assessing Confidence in the Results of a Network Meta-Analysis. PLoS Med. 2020, 17, e1003082. [Google Scholar] [CrossRef] [PubMed]
  33. Kim, H.-Y. Statistical Notes for Clinical Researchers: Effect Size. Restor. Dent. Endod. 2015, 40, 328–331. [Google Scholar] [CrossRef] [PubMed]
  34. Elleuch, M.-H.; Ghroubi, S. Vertebral Manipulation in Chronic Low Back Pain: A Prospective Randomised Study of 85 Cases. Int. Musculoskelet. Med. 2009, 31, 57–62. [Google Scholar] [CrossRef]
  35. Waseem, M.; Karimi, H.; Gilani, S.A.; Hassan, D. Treatment of Disability Associated with Chronic Non-Specific Low Back Pain Using Core Stabilization Exercises in Pakistani Population. J. Back. Musculoskelet. Rehabil. 2019, 32, 149–154. [Google Scholar] [CrossRef] [PubMed]
  36. Noormohammadpour, P.; Kordi, M.; Mansournia, M.A.; Akbari-Fakhrabadi, M.; Kordi, R. The Role of a Multi-Step Core Stability Exercise Program in the Treatment of Nurses with Chronic Low Back Pain: A Single-Blinded Randomized Controlled Trial. Asian Spine J. 2018, 12, 490–502. [Google Scholar] [CrossRef] [PubMed]
  37. França, F.R.; Burke, T.N.; Hanada, E.S.; Marques, A.P. Segmental Stabilization and Muscular Strengthening in Chronic Low Back Pain—A Comparative Study. Clinics 2010, 65, 1013–1017. [Google Scholar] [CrossRef] [PubMed]
  38. Valenza, M.; Rodríguez-Torres, J.; Cabrera-Martos, I.; Díaz-Pelegrina, A.; Aguilar-Ferrándiz, M.; Castellote-Caballero, Y. Results of a Pilates Exercise Program in Patients with Chronic Non-Specific Low Back Pain: A Randomized Controlled Trial. Clin. Rehabil. 2016, 31, 753–760. [Google Scholar] [CrossRef] [PubMed]
  39. Unsgaard-Tøndel, M.; Fladmark, A.M.; Salvesen, Ø.; Vasseljen, O. Motor Control Exercises, Sling Exercises, and General Exercises for Patients with Chronic Low Back Pain: A Randomized Controlled Trial with 1-Year Follow-up. Phys. Ther. 2010, 90, 1426–1440. [Google Scholar] [CrossRef] [PubMed]
  40. Costa, L.O.; Maher, C.G.; Latimer, J.; Hodges, P.W.; Herbert, R.D.; Refshauge, K.M.; McAuley, J.H.; Jennings, M.D. Motor Control Exercise for Chronic Low Back Pain: A Randomized Placebo-Controlled Trial. Phys. Ther. 2009, 89, 1275–1286. [Google Scholar] [CrossRef] [PubMed]
  41. Garcia, A.N.; Costa, L.D.C.M.; Hancock, M.J.; Souza, F.S.D.; Gomes, G.V.F.D.O.; Almeida, M.O.D.; Costa, L.O.P. McKenzie Method of Mechanical Diagnosis and Therapy Was Slightly More Effective than Placebo for Pain, but Not for Disability, in Patients with Chronic Non-Specific Low Back Pain: A Randomised Placebo Controlled Trial with Short and Longer Term Follow-up. Br. J. Sports Med. 2018, 52, 594–600. [Google Scholar] [PubMed]
  42. Balthazard, P.; de Goumoens, P.; Rivier, G.; Demeulenaere, P.; Ballabeni, P.; Dériaz, O. Manual Therapy Followed by Specific Active Exercises versus a Placebo Followed by Specific Active Exercises on the Improvement of Functional Disability in Patients with Chronic Non Specific Low Back Pain: A Randomized Controlled Trial. BMC Musculoskelet. Disord. 2012, 13, 162. [Google Scholar] [CrossRef] [PubMed]
  43. Michaelson, P.; Holmberg, D.; Aasa, B.; Aasa, U. High Load Lifting Exercise and Low Load Motor Control Exercises as Interventions for Patients with Mechanical Low Back Pain: A Randomized Controlled Trial with 24-Month Follow-Up. J. Rehabil. Med. 2016, 48, 456–463. [Google Scholar] [CrossRef] [PubMed]
  44. Gatti, R.; Faccendini, S.; Tettamanti, A.; Barbero, M.; Balestri, A.; Calori, G. Efficacy of Trunk Balance Exercises for Individuals with Chronic Low Back Pain: A Randomized Clinical Trial. J. Orthop. Sports Phys. Ther. 2011, 41, 542–552. [Google Scholar] [CrossRef] [PubMed]
  45. Miyamoto, G.C.; Costa, L.O.P.; Galvanin, T.; Cabral, C.M.N. Efficacy of the Addition of Modified Pilates Exercises to a Minimal Intervention in Patients with Chronic Low Back Pain: A Randomized Controlled Trial. Phys. Ther. 2013, 93, 310–320. [Google Scholar] [CrossRef] [PubMed]
  46. Salik Sengul, Y.; Yilmaz, A.; Kirmizi, M.; Kahraman, T.; Kalemci, O. Effects of Stabilization Exercises on Disability, Pain and Core Stability in Patients with Non-Specific Low Back Pain: A Randomized Controlled Trial. Work 2021, 70, 99–107. [Google Scholar] [CrossRef] [PubMed]
  47. Arguisuelas, M.D.; Lisón, J.F.; Sánchez-Zuriaga, D.; Martínez-Hurtado, I.; Doménech-Fernández, J. Effects of Myofascial Release in Non-Specific Chronic Low Back Pain: A Randomized Clinical Trial. Spine 2017, 42, 627–634. [Google Scholar] [CrossRef] [PubMed]
  48. Bello, B.; Adeniyi, A.F. Effects of Lumbar Stabilisation and Treadmill Exercise on Function in Patients with Chronic Mechanical Low Back Pain. Int. J. Ther. Rehabil. 2018, 25, 493–499. [Google Scholar] [CrossRef]
  49. da Luz, M.A.; Costa, L.O.P.; Fuhro, F.F.; Manzoni, A.C.T.; Oliveira, N.T.B.; Cabral, C.M.N. Effectiveness of Mat Pilates or Equipment-Based Pilates Exercises in Patients with Chronic Nonspecific Low Back Pain: A Randomized Controlled Trial. Phys. Ther. 2014, 94, 623–631. [Google Scholar] [CrossRef] [PubMed]
  50. Garcia, A.N.; Costa, L.d.C.M.; da Silva, T.M.; Gondo, F.L.B.; Cyrillo, F.N.; Costa, R.A.; Costa, L.O.P. Effectiveness of Back School versus McKenzie Exercises in Patients with Chronic Nonspecific Low Back Pain: A Randomized Controlled Trial. Phys. Ther. 2013, 93, 729–747. [Google Scholar] [CrossRef] [PubMed]
  51. Batıbay, S.; Külcü, D.G.; Kaleoğlu, Ö.; Mesci, N. Effect of Pilates Mat Exercise and Home Exercise Programs on Pain, Functional Level, and Core Muscle Thickness in Women with Chronic Low Back Pain. J. Orthop. Sci. 2021, 26, 979–985. [Google Scholar] [CrossRef] [PubMed]
  52. van Dillen, L.; Lanier, V.; Steger-May, K.; Wallendorf, M.; Norton, B.; Civello, J.; Czuppon, S.; Francois, S.; Roles, K.; Lang, C. Effect of Motor Skill Training in Functional Activities vs Strength and Flexibility Exercise on Function in People With Chronic Low Back Pain: A Randomized Clinical Trial. JAMA Neurol. 2021, 78, 385–395. [Google Scholar] [CrossRef] [PubMed]
  53. Macedo, L.G.; Latimer, J.; Maher, C.G.; Hodges, P.W.; McAuley, J.H.; Nicholas, M.K.; Tonkin, L.; Stanton, C.J.; Stanton, T.R.; Stafford, R. Effect of Motor Control Exercises versus Graded Activity in Patients with Chronic Nonspecific Low Back Pain: A Randomized Controlled Trial. Phys. Ther. 2012, 92, 363–377. [Google Scholar] [CrossRef] [PubMed]
  54. Moon, H.J.; Choi, K.H.; Kim, D.H.; Kim, H.J.; Cho, Y.K.; Lee, K.H.; Kim, J.H.; Choi, Y.J. Effect of Lumbar Stabilization and Dynamic Lumbar Strengthening Exercises in Patients with Chronic Low Back Pain. Ann. Rehabil. Med. 2013, 37, 110–117. [Google Scholar] [CrossRef] [PubMed]
  55. Wang, X.; Zheng, J.; Bi, X.; Liu, J. Effect of Core Stability Training on Patients with Chronic Low Back Pain. HealthMED 2012, 6, 754–759. [Google Scholar]
  56. Senna, M.K.; Machaly, S.A. Does Maintained Spinal Manipulation Therapy for Chronic Nonspecific Low Back Pain Result in Better Long-Term Outcome? Spine 2011, 36, 1427–1437. [Google Scholar] [CrossRef] [PubMed]
  57. Miyamoto, G.C.; Franco, K.F.M.; van Dongen, J.M.; Franco, Y.R.D.S.; de Oliveira, N.T.B.; Amaral, D.D.V.; Branco, A.N.C.; da Silva, M.L.; van Tulder, M.W.; Cabral, C.M.N. Different Doses of Pilates-Based Exercise Therapy for Chronic Low Back Pain: A Randomised Controlled Trial with Economic Evaluation. Br. J. Sports Med. 2018, 52, 859–868. [Google Scholar] [CrossRef] [PubMed]
  58. Sertpoyraz, F.; Eyigor, S.; Karapolat, H.; Capaci, K.; Kirazli, Y. Comparison of Isokinetic Exercise versus Standard Exercise Training in Patients with Chronic Low Back Pain: A Randomized Controlled Study. Clin. Rehabil. 2009, 23, 238–247. [Google Scholar] [CrossRef] [PubMed]
  59. Ferreira, M.L.; Ferreira, P.H.; Latimer, J.; Herbert, R.D.; Hodges, P.W.; Jennings, M.D.; Maher, C.G.; Refshauge, K.M. Comparison of General Exercise, Motor Control Exercise and Spinal Manipulative Therapy for Chronic Low Back Pain: A Randomized Trial. Pain 2007, 131, 31–37. [Google Scholar] [CrossRef] [PubMed]
  60. Cruz-Díaz, D.; Bergamin, M.; Gobbo, S.; Martínez-Amat, A.; Hita-Contreras, F. Comparative Effects of 12 Weeks of Equipment Based and Mat Pilates in Patients with Chronic Low Back Pain on Pain, Function and Transversus Abdominis Activation. A Randomized Controlled Trial. Complement. Ther. Med. 2017, 33, 72–77. [Google Scholar] [CrossRef] [PubMed]
  61. Natour, J.; Cazotti, L.d.A.; Ribeiro, L.H.; Baptista, A.S.; Jones, A. Pilates Improves Pain, Function and Quality of Life in Patients with Chronic Low Back Pain: A Randomized Controlled Trial. Clin. Rehabil. 2015, 29, 59–68. [Google Scholar] [PubMed]
  62. Arora, L.; Arora, R.; Singh, J.; Kaur, H. A Single (Investigator) Blind Randomized Controlled Trial Comparing McKenzie Exercises and Lumbar Stabilization Exercises in Chronic Low Back Pain. Eur. J. Exp. Biol. 2012, 2, 2219–2228. [Google Scholar]
  63. Harts, C.C.; Helmhout, P.H.; de Bie, R.A.; Staal, J.B. A High-Intensity Lumbar Extensor Strengthening Program Is Little Better than a Low-Intensity Program or a Waiting List Control Group for Chronic Low Back Pain: A Randomised Clinical Trial. Aust. J. Physiother. 2008, 54, 23–31. [Google Scholar] [CrossRef] [PubMed][Green Version]
  64. Cherkin, D.C.; Sherman, K.J.; Kahn, J.; Wellman, R.; Cook, A.J.; Johnson, E.; Erro, J.; Delaney, K.; Deyo, R.A. A Comparison of the Effects of 2 Types of Massage and Usual Care on Chronic Low Back Pain: A Randomized, Controlled Trial. Ann. Intern. Med. 2011, 155, 1–9. [Google Scholar] [CrossRef] [PubMed]
  65. Murtezani, A.; Govori, V.; Meka, V.S.; Ibraimi, Z.; Rrecaj, S.; Gashi, S. A Comparison of McKenzie Therapy with Electrophysical Agents for the Treatment of Work Related Low Back Pain: A Randomized Controlled Trial. J. Back. Musculoskelet. Rehabil. 2015, 28, 247–253. [Google Scholar] [CrossRef] [PubMed]
  66. Yalfani, A.; Khani, M.M.; Ahmadi, M.; Asgarpoor, A. The Effect of Core Stability Exercises Combined with Abdominal Hollowing on Postural Balance in Patients with Non-Specific Chronic Low Back Pain: A Randomized Controlled Trial. Phys. Treat. 2023, 13, 165–174. [Google Scholar] [CrossRef]
  67. Gevers-Montoro, C.; Romero-Santiago, B.; Medina-García, I.; Larranaga-Arzamendi, B.; Álvarez-Gálovich, L.; Mues, A.O.-D.; Piché, M. Reduction of Chronic Primary Low Back Pain by Spinal Manipulative Therapy Is Accompanied by Decreases in Segmental Mechanical Hyperalgesia and Pain Catastrophizing: A Randomized Placebo-Controlled Dual-Blind Mixed Experimental Trial. J. Pain 2024, 25, 104500. [Google Scholar] [CrossRef] [PubMed]
  68. Raoufi, Z.; Tabatabaei, A.; Dadgoo, M.; Takamjani, E.E.; Salehi, R.; Sanjari, M.A. The Additional Effects of Dual-Task Training with Core Stability Exercises versus General Exercises on Disability and Pain in People with Nonspecific Chronic Low Back Pain: A Randomized Controlled Trial. Musculoskelet. Sci. Pract. 2026, 82, 103501. [Google Scholar] [CrossRef] [PubMed]
  69. Khaledi, A.; Gheitasi, M. Isometric versus Isotonic Core Stabilization Exercises to Improve Pain and Disability in Patients with Non-Specific Chronic Low Back Pain: A Randomized Controlled Trial. Anesthesiol. Pain Med. 2024, 14, e144046. [Google Scholar] [CrossRef] [PubMed]
  70. Xu, S.; Wang, R.; Ma, S.; He, B. Interventional Effect of Core Stability Training on Pain and Muscle Function of Youth with Chronic Non-Specific Lower Back Pain: A Randomized Controlled Trial. Heliyon 2024, 10, e32818. [Google Scholar] [CrossRef] [PubMed]
  71. Alqhtani, R.S.; Ahmed, H.; Ghulam, H.S.H.; Alyami, A.M.; Al Sharyah, Y.H.H.; Ahmed, R.; Khan, A.; Khan, A.R. Efficacy of Core-Strengthening and Intensive Dynamic Back Exercises on Pain, Core Muscle Endurance, and Functional Disability in Patients with Chronic Non-Specific Low Back Pain: A Randomized Comparative Study. J. Clin. Med. 2024, 13, 475. [Google Scholar] [CrossRef] [PubMed]
  72. Wongcharoen, C.; Sakulsriprasert, P.; Bunprajun, T.; Jensen, M.P.; Sornkaew, K. Effects of Flossing Technique and Core Stabilization Exercise on the Treatment of Chronic Non-Specific Low Back Pain—A Randomized Controlled Trial. J. Bodyw. Mov. Ther. 2025, 45, 680–687. [Google Scholar] [CrossRef] [PubMed]
  73. Huang, H.; Xie, H.; Zhang, G.; Xiao, W.; Ge, L.; Chen, S.; Zeng, Y.; Wang, C.; Li, H. Effects of Dynamic Neuromuscular Stabilization Training on the Core Muscle Contractility and Standing Postural Control in Patients with Chronic Low Back Pain: A Randomized Controlled Trial. BMC Musculoskelet. Disord. 2025, 26, 213. [Google Scholar] [CrossRef] [PubMed]
  74. Siglan, U.; Colak, S. Effects of Diaphragmatic and Iliopsoas Myofascial Release in Patients with Chronic Low Back Pain: A Randomized Controlled Study. J. Bodyw. Mov. Ther. 2022, 33, 120–127. [Google Scholar] [CrossRef] [PubMed]
  75. Coutinho, C.; Sampaio, D.; Pereira, N.; Pássaro, A.; Oliveira, V.; Casarotto, R. Effects of Combined Trunk Stretching and Lumbar Stabilization Exercises for Chronic Non-Specific Low Back Pain: A Randomized Clinical Trial. Braz. J. Med. Biol. Res. Rev. Bras. Pesqui. Medicas E Biol. 2026, 59, e14863. [Google Scholar] [CrossRef] [PubMed]
  76. Alkhathami, K.; Alshehre, Y.; Brizzolara, K.; Weber, M.; Wang-Price, S. Effectiveness of Spinal Stabilization Exercises on Movement Performance in Adults with Chronic Low Back Pain. Int. J. Sports Phys. Ther. 2023, 18, 169–172. [Google Scholar] [CrossRef] [PubMed]
  77. Rubí-Carnacea, F.; Masbernat-Almenara, M.; Climent-Sanz, C.; Soler-González, J.; García-Escudero, M.; Martínez-Navarro, O.; Valenzuela-Pascual, F. Effectiveness of an Exercise Intervention Based on Preactivation of the Abdominal Transverse Muscle in Patients with Chronic Nonspecific Low Back Pain in Primary Care: A Randomized Control Trial. BMC Prim. Care 2023, 24, 180. [Google Scholar] [CrossRef] [PubMed]
  78. Güler, M.A.; Demirdel, E.; Gezer, I.A.; Nayman, A.; Tezcan, E.A. Effect of Core Stabilization Exercises on Lumbar Multifidus Morphology and Functional Outcomes in Chronic Non-Specific Low Back Pain: A Randomized Controlled Trial. BMC Musculoskelet. Disord. 2025, 27, 83. [Google Scholar] [CrossRef] [PubMed]
  79. Brignardello-Petersen, R.; Tomlinson, G.; Flórez, I.; Rind, D.; Chu, D.K.; Morgan, R.; Mustafa, R.; Schünemann, H.; Guyatt, G. GRADE Concept Article 5: Addressing Intransitivity in a Network Meta-Analysis. J. Clin. Epidemiol. 2023, 160, 151–159. [Google Scholar] [CrossRef] [PubMed]
  80. Brignardello-Petersen, R.; Bonner, A.; Alexander, P.E.; Siemieniuk, R.A.; Furukawa, T.A.; Rochwerg, B.; Hazlewood, G.S.; Alhazzani, W.; Mustafa, R.A.; Murad, M.H.; et al. Advances in the GRADE Approach to Rate the Certainty in Estimates from a Network Meta-Analysis. J. Clin. Epidemiol. 2018, 93, 36–44. [Google Scholar] [CrossRef] [PubMed]
  81. Wells, C.; Kolt, G.S.; Bialocerkowski, A. Defining Pilates Exercise: A Systematic Review. Complement. Ther. Med. 2012, 20, 253–262. [Google Scholar] [CrossRef] [PubMed]
  82. Saragiotto, B.T.; Maher, C.G.; Yamato, T.P.; Costa, L.O.; Menezes Costa, L.C.; Ostelo, R.W.; Macedo, L.G. Motor Control Exercise for Chronic Non-specific Low-back Pain. Cochrane Database Syst. Rev. 2016, 2016, CD012004. [Google Scholar] [CrossRef] [PubMed]
  83. Massé-Alarie, H.; Shraim, M.; Hodges, P. Sensorimotor Integration in Chronic Low Back Pain. Neuroscience 2024, 552, 29–38. [Google Scholar] [CrossRef] [PubMed]
  84. Zhao, K.; Zhang, P.; Li, H.; Li, L. Exercise Prescription for Improving Chronic Low Back Pain in Adults: A Network Meta-Analysis. Front. Public Health 2025, 13, 1512450. [Google Scholar] [CrossRef] [PubMed]
  85. de Zoete, A.; Innocenti, T.; Petrozzi, M.; van Middelkoop, M.; Assendelft, W.; de Boer, M. Spinal Manipulative Therapy for Adults with Chronic Low Back Pain. Cochrane Database Syst. Rev. 2026, 2026, CD008112. [Google Scholar] [CrossRef] [PubMed]
  86. Coulter, I.; Crawford, C.; Hurwitz, E.; Vernon, H.; Khorsan, R.; Booth, M.; Herman, P. Manipulation and Mobilization for Treating Chronic Low Back Pain: A Systematic Review and Meta-Analysis. Spine J. 2018, 18, 866–879. [Google Scholar] [CrossRef] [PubMed]
  87. Dos Santos, E.C.S.; Dos Santos, A.T.; Da Silva, N.A.; Carneiro, S.R.; Rampazo, É.P.; Magalhães, M.O. Effectiveness of Adding Manual Therapy to Exercise for Pain and Disability in Chronic Non-Specific Low Back Pain: A Systematic Review and Meta-Analysis. Musculoskelet. Sci. Pract. 2026, 82, 103508. [Google Scholar] [CrossRef] [PubMed]
  88. Palomo, I.; Gil-Martínez, E.; López-Fernández, M.; González, L.; Querol-Zaldívar, M.; Castro-Sánchez, A. Efficacy of Dry Needling for Chronic Low Back Pain: A Systematic Review and Meta-Analysis of Randomized Controlled Trials. Altern. Ther. Health Med. 2022, 29, 110–120. [Google Scholar]
  89. Tuninetti, A.; Barbari, V.; Storari, L.; Bisconti, M.; Piano, L.; Dunning, J.; Mourad, F.; Maselli, F. Therapeutic Exercise Progression in Patients with Nonspecific Low Back Pain: A Systematic Review. J. Pain Res. 2025, 18, 6397–6407. [Google Scholar] [CrossRef] [PubMed]
  90. Geraci, I.; Bargeri, S.; Basso, G.; Castellini, G.; Chiarotto, A.; Gianola, S.; Raymond, O.; Testa, M.; Innocenti, T. Therapeutic Quality of Exercise Interventions for Chronic Low Back Pain: A Meta-Research Study Using i-CONTENT Tool. BMJ Evid.-Based Med. 2025, 30, 194–201. [Google Scholar] [CrossRef] [PubMed]
  91. Lapuente-Hernández, D.; Gil-Calvo, M.; Cuenca-Zaldívar, J.N.; Carcasona-Otal, A.; Herrero, P.; Matute-Llorente, Á. Home Physical Exercise Interventions in Chronic Non-Specific Low Back Pain: Systematic Review and Multivariate Meta-Analysis. Healthcare 2025, 13, 2094. [Google Scholar] [CrossRef] [PubMed]
  92. Meuwissen, I.; Vanderstraeten, R.; Roussel, N.; Meeus, M.; Van Eetvelde, J.; Meus, T. Contributors to Adherence to Exercise Therapy in Non-Specific Chronic Low Back Pain: A Systematic Review of Qualitative and Quantitative Research. J. Clin. Med. 2025, 14, 6251. [Google Scholar] [CrossRef] [PubMed]
  93. Gilanyi, Y.L.; Shah, B.; Cashin, A.; Gibbs, M.; Bellamy, J.; Day, R.; McAuley, J.H.; Jones, M.D. Barriers and Enablers to Exercise Adherence in People with Nonspecific Chronic Low Back Pain: A Systematic Review of Qualitative Evidence. Pain 2024, 165, 2200–2214. [Google Scholar] [CrossRef] [PubMed]
  94. Kowalewska, P.; Wójcik, M.; Banaszak, A.; Bernatowicz, K.; Motyl, M.; Wołodźko, P.; Sauermann, M.; Wołczek, M.; Pietruszak, E.; Aniśko, B. Physical Function, Daily Activities, and Spinal Pain in the Elderly: A Cross-Sectional Study. PLoS ONE 2026, 21, e0344292. [Google Scholar] [CrossRef] [PubMed]
  95. McDevitt, A.W.; O’Halloran, B.; Cook, C.E. Cracking the Code: Unveiling the Specific and Shared Mechanisms behind Musculoskeletal Interventions. Arch. Physiother. 2023, 13, 14. [Google Scholar] [CrossRef] [PubMed]
  96. Keter, D.L.; Bialosky, J.E.; Brochetti, K.; Courtney, C.A.; Funabashi, M.; Karas, S. The Mechanisms of Manual Therapy: A Living Review of Systematic, Narrative, and Scoping Reviews. PLoS ONE 2025, 20, e0319586. [Google Scholar] [CrossRef] [PubMed]
  97. George, S.Z.; Fritz, J.M.; Silfies, S.P.; Schneider, M.J.; Beneciuk, J.M.; Lentz, T.A.; Gilliam, J.R.; Hendren, S.; Norman, K.S. Interventions for the Management of Acute and Chronic Low Back Pain: Revision 2021. J. Orthop. Sports Phys. Ther. 2021, 51, CPG1–CPG60. [Google Scholar] [CrossRef] [PubMed]
  98. Briggs, A.M.; Sumi, Y.; Banerjee, A. The World Health Organization Guideline for Non-Surgical Management of Chronic Primary Low Back Pain in Adults: Implications for Equitable Care and Strengthening Health Systems Globally. Glob. Health Res. Policy 2025, 10, 26. [Google Scholar] [CrossRef] [PubMed]
  99. Hayden, J.; Wilson, M.; Stewart, S.; Cartwright, J.; Smith, A.; Riley, R.; van Tulder, M. Exercise Treatment Effect Modifiers in Persistent Low Back Pain: An Individual Participant Data Meta-Analysis of 3514 Participants from 27 Randomised Controlled Trials. Br. J. Sports Med. 2020, 54, 1277–1278. [Google Scholar] [CrossRef] [PubMed]
  100. Barbier, G.; Picchiottino, M.; Delafontaine, A.; Goncalves, G.; Bussières, A.; Cottin, F.; Lardon, A. Baseline Individual Factors Associated with Clinical Outcomes in Adults with Non-Specific Low Back Pain Following Manual Therapy: A Systematic Review. BMC Complement. Med. Ther. 2025, 25, 330. [Google Scholar] [CrossRef] [PubMed]
  101. Jupin, C.; Aibar, V.B.; Sarhan, F. Short-Term Effects of Spinal Manual Therapy on the Nervous System in Managing Musculoskeletal Pain: A Systematic Review. J. Clin. Med. 2025, 14, 3830. [Google Scholar] [CrossRef] [PubMed]
  102. Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ 2021, 372, n71. [Google Scholar] [CrossRef] [PubMed]
Figure 1. PRISMA flow diagram for the systematic review and network meta-analysis. Note. Adapted from PRISMA 2020 and PRISMA NMA.
Figure 1. PRISMA flow diagram for the systematic review and network meta-analysis. Note. Adapted from PRISMA 2020 and PRISMA NMA.
Jcm 15 04809 g001
Figure 2. Risk of bias assessment across domains using the RoB 2 tool (intention-to-treat analysis). Note. Percentages indicate the distribution of studies across risk-of-bias judgments (low risk, some concerns, high risk) for each domain of the RoB 2 tool, based on intention-to-treat analyses.
Figure 2. Risk of bias assessment across domains using the RoB 2 tool (intention-to-treat analysis). Note. Percentages indicate the distribution of studies across risk-of-bias judgments (low risk, some concerns, high risk) for each domain of the RoB 2 tool, based on intention-to-treat analyses.
Jcm 15 04809 g002
Figure 3. Network graph. Note. The diameter of the nodes is proportional to the number of studies (k) assigned to the intervention and the thickness of the line to the number of studies of the direct comparison.
Figure 3. Network graph. Note. The diameter of the nodes is proportional to the number of studies (k) assigned to the intervention and the thickness of the line to the number of studies of the direct comparison.
Jcm 15 04809 g003
Figure 4. The heat map of the network of designs included in the network meta-analysis. Note. Heatmap of the design-by-treatment interaction model displaying the contribution matrix and standardized discrepancies. The size of the inner gray squares represents the leverage (statistical influence) of the treatment design on the horizontal axis on the network estimate on the vertical axis, with larger squares indicating greater contribution. The background color gradient maps the standardized residuals (Z-scores) of the discrepancies, centered at zero (white), which represents perfect consistency. Values deviating from zero—both warm colors (positive residuals) and cool colors (negative residuals)—indicate the magnitude and direction of network inconsistency. The intense coloration along the main diagonal highlights the specific designs that act as the primary drivers of the overall network inconsistency. PILM = Pilates Mat; PILA = Pilates Apparatus.
Figure 4. The heat map of the network of designs included in the network meta-analysis. Note. Heatmap of the design-by-treatment interaction model displaying the contribution matrix and standardized discrepancies. The size of the inner gray squares represents the leverage (statistical influence) of the treatment design on the horizontal axis on the network estimate on the vertical axis, with larger squares indicating greater contribution. The background color gradient maps the standardized residuals (Z-scores) of the discrepancies, centered at zero (white), which represents perfect consistency. Values deviating from zero—both warm colors (positive residuals) and cool colors (negative residuals)—indicate the magnitude and direction of network inconsistency. The intense coloration along the main diagonal highlights the specific designs that act as the primary drivers of the overall network inconsistency. PILM = Pilates Mat; PILA = Pilates Apparatus.
Jcm 15 04809 g004
Figure 5. Network estimates versus control with P-score and CINeMA confidence for post-treatment disability. Note. Negative standardized mean difference values favor the intervention over control. Solid horizontal lines represent 95% confidence intervals, whereas dashed horizontal lines represent 95% prediction intervals. The vertical line at 0 indicates no effect, and the dotted vertical lines at ±0.20 indicate the SMD decision threshold used for CINeMA judgments of imprecision, heterogeneity and incoherence. This threshold should not be interpreted as an ODI- or RMDQ-specific minimal clinically important difference. Interventions are ordered according to P-score values; however, P-scores should be interpreted as descriptive ranking metrics only and not as a stand-alone hierarchy of treatment efficacy. Evidence type, number of direct studies and final CINeMA confidence ratings are displayed alongside each estimate to support interpretation according to effect size, uncertainty, heterogeneity, directness of evidence and confidence in the network estimate.
Figure 5. Network estimates versus control with P-score and CINeMA confidence for post-treatment disability. Note. Negative standardized mean difference values favor the intervention over control. Solid horizontal lines represent 95% confidence intervals, whereas dashed horizontal lines represent 95% prediction intervals. The vertical line at 0 indicates no effect, and the dotted vertical lines at ±0.20 indicate the SMD decision threshold used for CINeMA judgments of imprecision, heterogeneity and incoherence. This threshold should not be interpreted as an ODI- or RMDQ-specific minimal clinically important difference. Interventions are ordered according to P-score values; however, P-scores should be interpreted as descriptive ranking metrics only and not as a stand-alone hierarchy of treatment efficacy. Evidence type, number of direct studies and final CINeMA confidence ratings are displayed alongside each estimate to support interpretation according to effect size, uncertainty, heterogeneity, directness of evidence and confidence in the network estimate.
Jcm 15 04809 g005
Table 1. CINeMA-Based Summary of Findings and Final Confidence Ratings for Post-Treatment Disability Versus CTRL.
Table 1. CINeMA-Based Summary of Findings and Final Confidence Ratings for Post-Treatment Disability Versus CTRL.
Intervention vs. CTRLEvidenceDirect StudiesNMA SMD [95% CI]95% PrIP-ScoreFinal
Confidence
Pilates APP vs. CTRLMixed2−0.879 [−1.156, −0.601][−1.434, −0.323]0.92Moderate
STMC vs. CTRLMixed4−0.856 [−1.044, −0.667][−1.370, −0.342]0.92Low
ST vs. CTRLMixed2−0.757 [−0.988, −0.527][−1.290, −0.225]0.83Low
STM vs. CTRLMixed2−0.681 [−0.998, −0.365][−1.259, −0.104]0.75Moderate
Pilates MAT vs. CTRLMixed4−0.643 [−0.851, −0.436][−1.165, −0.121]0.72Very low
GE vs. CTRLIndirect0−0.392 [−0.619, −0.164][−0.922, 0.139]0.52Very low
SM vs. CTRLDirect4−0.399 [−0.639, −0.160][−0.935, 0.137]0.51Very low
STR vs. CTRLMixed2−0.312 [−0.522, −0.101][−0.835, 0.211]0.44Very low
MDT vs. CTRLMixed1−0.138 [−0.461, 0.186][−0.719, 0.444]0.32Low
STRET vs. CTRLDirect10.047 [−0.467, 0.561][−0.667, 0.760]0.19Very low
UC vs. CTRLIndirect00.197 [−0.214, 0.608][−0.441, 0.835]0.10Very low
BS vs. CTRLIndirect00.269 [−0.296, 0.833][−0.485, 1.022]0.08Very low
Note. SMD = standardized mean difference; CI = confidence interval; PrI = prediction interval; CTRL = control; APP = apparatus-based Pilates; MAT = mat Pilates; STMC = stabilization exercises with motor control; ST = stabilization exercises; STM = soft tissue manipulation; SM = spinal manipulation; STR = strengthening or resistance exercise; MDT = McKenzie/Mechanical Diagnosis and Therapy; STRET = stretching; UC = usual care; BS = back school; GE = general exercise; Final confidence ratings were conservatively adjudicated from automatic CINeMA domain-level judgments and additional network diagnostics using a downgrade-only GRADE-style approach within the CINeMA framework. The ±0.20 SMD threshold was defined before final confidence adjudication and should not be interpreted as an ODI- or RMDQ-specific minimal clinically important difference. P-score values describe ranking only. The adjudication criteria are reported in Supplementary Material Text S1.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Robles-García, M.; González, J.L.S.; Sánchez-Sánchez, J.L.; Calderón-Díez, L.; Santos Del Rey, M.; Martín-Vallejo, J. Comparative Effects of Therapeutic Exercise and Manual Therapy Techniques on Self-Reported Disability in Chronic Non-Specific Low Back Pain: A Network Meta-Analysis. J. Clin. Med. 2026, 15, 4809. https://doi.org/10.3390/jcm15124809

AMA Style

Robles-García M, González JLS, Sánchez-Sánchez JL, Calderón-Díez L, Santos Del Rey M, Martín-Vallejo J. Comparative Effects of Therapeutic Exercise and Manual Therapy Techniques on Self-Reported Disability in Chronic Non-Specific Low Back Pain: A Network Meta-Analysis. Journal of Clinical Medicine. 2026; 15(12):4809. https://doi.org/10.3390/jcm15124809

Chicago/Turabian Style

Robles-García, Miguel, Juan Luis Sánchez González, José Luis Sánchez-Sánchez, Laura Calderón-Díez, Miguel Santos Del Rey, and Javier Martín-Vallejo. 2026. "Comparative Effects of Therapeutic Exercise and Manual Therapy Techniques on Self-Reported Disability in Chronic Non-Specific Low Back Pain: A Network Meta-Analysis" Journal of Clinical Medicine 15, no. 12: 4809. https://doi.org/10.3390/jcm15124809

APA Style

Robles-García, M., González, J. L. S., Sánchez-Sánchez, J. L., Calderón-Díez, L., Santos Del Rey, M., & Martín-Vallejo, J. (2026). Comparative Effects of Therapeutic Exercise and Manual Therapy Techniques on Self-Reported Disability in Chronic Non-Specific Low Back Pain: A Network Meta-Analysis. Journal of Clinical Medicine, 15(12), 4809. https://doi.org/10.3390/jcm15124809

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop