1. Introduction
Degenerative cervical myelopathy represents the most frequent cause of functional impairment from medullary origin in adult patients over 50 years of age, may result in a major disability like tetraparesia and is the consequence of degenerative changes in the cervical spine [
1]. The clinical presentation of DCM is variable [
2]. In order to harmonize the assessment of disability due to DCM, the modified-JOA (mJOA) is now commonly used [
2,
3,
4,
5].
MRI represents the gold standard imaging modality for the diagnosis of the disease. Different MRI signs have been reported with variable clinical significance [
5,
6,
7,
8,
9,
10,
11].
According to the literature, it seems that one MRI sign alone remains poorly correlated with the clinical presentation. The absence of a clear correlation between the symptoms and MRI findings poses some difficulties to precisely evaluating the disease’s severity and can thus make the decision-making uncomfortable for the clinicians.
In order to help practitioners to accurately assess the severity of the disease on MRI and therefore choose the best treatment option, the objective of the present study was to establish a multi-parametric weighted scoring system that is easy to use in daily practice, is based on the most significant MRI signs and correlates as strongly as possible with the clinical presentation (mJOA)—we call this system the SIMS or Severity on Imaging Myelopathy Score.
2. Materials and Methods
2.1. Study Design and Population
This is a single-center retrospective analysis of retrospectively collected data from 99 surgical patients with a diagnosis of cervical degenerative stenosis on MRI and who had a clinical evaluation with mJOA score established by at least one of the senior neurosurgeons of the Spine and Cord Unit of the Neurological Hospital of the Hospices Civils de Lyon (HCL), between January 2015 and March 2021, and who benefited from a cervical MRI with axial and sagittal slices, both within an interval of less than 1 year. The indication for surgery was left to the surgeon’s discretion and could be based on the patient’s clinical and functional assessment, radiological evolution or abnormalities found on evoked potentials.
Inclusion criteria were as follows: (1) age ≥18 years; (2) mJOA score found in the patient’s medical record; (3) cervical MRI with T2-weighted sequences including axial and sagittal slices.
The following patients were excluded from the study: patients with (1) age < 18 years; (2) a history of surgery and/or trauma and/or infection and/or neoplasia and/or congenital deformity of the cervical spine; (3) the presence of severe and/or advanced neurological or systemic disease that could influence the clinical or electrophysiological evaluation.
Informed consent was obtained from the participants. The study was approved by the Ethical Committee of French College of Neurosurgery (IRB00011687). All methods were carried out in accordance with relevant guidelines and regulations.
2.2. Selection of Variables
The selection of the parameters composing the SIMS was made on the basis of literature searches, on the PubMed platform, with the aim of retaining the variables best correlated with clinical symptoms. These parameters had to be measurable on MRI images using the HCL visualization software (PACS) at the most stenotic level.
The measurement of the different variables for each patient was performed jointly by 2 neurosurgeons specialized in spine surgery (CB and AM). The determination of the highest stenotic level was made according to a consensus between the two practitioners.
Initially, 8 variables were selected on the basis of the literature data. Due to the lack of a clear correlation with the clinical presentation in the statistical analysis of 2 of the criteria, it was decided to keep 6 criteria: the Fujiwara ratio; T2-weighted intramedullary hyperintensity; cerebrospinal fluid cisterns; the Torg–Pavlov ratio; local kyphosis; and the number of stenotic levels.
2.3. Categorization for Each SIMS Criteria
2.3.1. Fujiwara Ratio
The FR is defined as the ratio of the anteroposterior diameter of the spinal cord to the transverse diameter on an axial slice at maximum compression on T2-weighted sequences (
Figure 1).
Four grades were selected:
(A) FR ≥ 0.5;
(B) 0.4 ≤ FR < 0.5;
(C) 0.3 ≤ FR < 0.4;
(D) FR < 0.3.
2.3.2. T2-Weight Intramedullary Hyperintensity
Three grades were selected on a sagittal slice (
Figure 2):
(A) No T2HI;
(B) Focal T2HI (limited to one intervertebral space and adjacent vertebral bodies);
(C) Multi-segmental T2HI (Extending beyond two intervertebral spaces).
2.3.3. Cerebrospinal Fluid Cisterns
Four grades were selected on an axial slice (
Figure 3)
(A) CSF visible anteriorly and posteriorly;
(B) CSF erased anteriorly or posteriorly;
(C) CSF erased anteriorly and posteriorly but root cisterns still visible;
(D) Totally erased cisterns—no CSF visible on the slice.
2.3.4. Torg–Pavlov Ratio
Three grades were selected (based on T2-weighted sequence on median sagittal sections at the middle vertebral level overlying the compression) (
Figure 4). The Torg–Pavlov ratio is calculated using the sagittal canal-to-vertebral body ratio:
(A) TPR ≥ 0.8;
(B) 0.6 ≤ TPR < 0.8;
(C) TPR < 0.6.
2.3.5. Local Kyphosis
Two grades were selected (based on the focal angle measured on sagittal T2-weighted MRI sequences at the level of maximum compression) (
Figure 5):
(A) Lordosis or LK <4°
(B) LK ≥ 4°
2.3.6. Number of Stenotic Level(s)
Three grades were selected (any level (in addition to the maximum compression level) for which the FR in the axial section on T2-weighted sequences is < 0.40 is considered as an additional compression level):
(A) One stenotic level;
(B) Two stenotic levels;
(C) ≥Three stenotic levels.
2.4. Intraobserver and Interobserver Reliability
Intraobserver reliability was measured by comparing the measurements in 2 cases obtained by two observers (GP and DC) 10 days apart. Conversely, the interobserver reliability was measured by comparing the measurements in 2 cases obtained by the two observers on the same day.
2.5. General Statistical Analysis
Quantitative data were described using the mean, standard deviation, median, and first and third quartiles, as well as the minimum and maximum values.
Qualitative data were described using numbers and proportions (percentages).
For the mJOA score, the following 3 categories were used [
1]: mild (score ≥ 15), moderate (12 ≤ score ≤ 14), severe (score ≤ 11).
Comparisons of mean mJOA scores for each subgroup of each variable were performed using Student t tests.
Associations between quantitative variables were quantified by Spearman’s correlation coefficient, with the associated 95% confidence interval.
Linear regression models were used to assess the association between each of the categorized criteria and the mJOA score. The association was quantified by the coefficient of determination (R2).
Univariate analyses were followed by multivariable regression analyses.
The final choice of criteria to be included in the SIMS score was based on the values of the R2 coefficients obtained in the univariate analysis, on the results of the different multivariate models and on clinical considerations. Conditional on explanatory variables, the mJOA score was normally distributed.
The final association between the selected SIMS and the mJOA score was quantified by the Spearman correlation coefficient (95% confidence interval), and by describing the SIMS score values according to the 3 mJOA categories.
The analyses were performed with the R software.
The intraclass correlation coefficient (ICC) was utilized to measure the intra- and interobserver agreement for different scores, with a confidence interval (CI) of 95%. ICC values of 0.00 to 0.20 were considered to be in slight agreement; 0.21 to 0.40, fair agreement; 0.41 to 0.60, moderate agreement; 0.61 to 0.80, substantial agreement; and 0.81 to 1.00, almost perfect agreement. The analyses were performed with the JASP software ((Version 0.19.1) [Computer software]/
https://jasp-stats.org/download/ accessed on 18 September 2024).
4. Discussion
At the present time, the most commonly used parameters to assess spinal cord compression in cervical myelopathy on MRI are the appearance of T2HI and the disappearance of peri-medullary fluid spaces. Although T2HI is a marker of disease severity and poor prognosis, it usually occurs at a late stage of the disease [
12,
13,
14,
15,
16,
17].
Some classification systems have been published, such as those of Nagata et al., Muhle et al. or Kang et al., based on obliteration of the subarachnoid spaces, the degree of spinal cord compression or the presence of a change in spinal cord intensity. But these criteria are subjective, evaluate in sagittal plane alone and a correlation with clinical involvement has not been clearly established [
18,
19,
20].
Finally, Wang et al. have also established a new MRI score for assessing compression in patients with posterior longitudinal ligament ossification, following the same model as ours, but without weighting the variables according to their relative importance, and there was no evaluation of the correlation with the mJOA [
21].
The choice to retain the T2HI, the CSF and the NSL was natural since they were significantly associated with the JOA score (
p < 0.001) and are the most useful parameters for routine practice. The results are consistent with those in the literature. For example, Watabe et al. showed that the decrease in CSF flow was significantly associated with the severity of the myelopathy assessed by the JOA [
21,
22] and Wang et al. showed a significant correlation between different grades of CSF obliteration and the JOA score. Fehlings et al., for their part, found in their prospective surgical cohort an average of 3.86 decompressed levels and showed that the mJOA was inversely proportional to the number of levels affected [
2].
In order to be used in clinical routine, MRI parameters must be easily and quickly measurable.
The study of the FR in function of the SIMS showed a significant difference between the different subscores in the univariate modeling but not in the multivariate analysis. The small number of patients with a FR < 0.3 may explain the lack of power for detecting differences between categories. Nevertheless, this variable is easily measurable in routine clinical practice with parameters that have a good interobserver and intraobserver correlation, so it seemed useful to keep in the score [
23].
In the same way, the TPR has been integrated into the initial SIMS. Chrispin and Lee found that a ratio of less than 0.85 was a risk factor, which is consistent with the results of Ehni et al. and Pavlov et al. who found ratios of 0.80 and 0.82, respectively [
24]. Yue et al. found a limit of 0.72 in their retrospective cohort of 88 patients [
25]. Moreover, they demonstrated that in patients over 50 years of age, the relative risk of developing myelopathy was 96% when the Torg–Pavlov ratio was below 0.60. Unfortunately, the small number of patients with a TPR < 0.6 (
n = 6) in our series does not allow statistical analyses to be performed with correct power, and the fact that TPR was initially described on plain radiographs has to be kept in mind during the interpretation of our results.
Finally, Wu et al. showed that the JOA score had a large and significant negative correlation with focal kyphosis [
26]. Oshima et al. found a correlation between segmental kyphosis at the most compressed level and the risk of neurological deterioration requiring surgical conversion in patients with mild degenerative cervical myelopathy [
27]. From a mechanical and histological perspective, cervical kyphotic deformity increases the traction force exerted on the spinal cord through stretching of the dentate ligament and nerve roots, as well as causing the spinal cord to be compressed against the vertebral bodies and intervertebral disks. It has been shown that spinal cord flattening, demyelination and atrophy are significantly correlated with the degree of kyphosis and are more pronounced at the apex of the deformity [
28,
29,
30]. The results in this study showed an inverse correlation between the degree of kyphosis and the mJOA, but it was weak and not significant. Nevertheless, given the data in the literature, which show an association of cervical sagittal alignment with the severity of myelopathy and the need to take into account the kyphotic deformity of the cervical spine in the surgical consideration, we decided to keep this parameter in the final score in order to study its relevance more widely in future studies.
The intra- and interobserver reliability of the SIMS are almost perfect, demonstrating that its use could be implemented in routine clinical practice.
The SIMS was strongly correlated with the clinical presentation (evaluated by mJOA) supporting the interest to use a multiparametric score to assess the severity of DCM in clinical practice. Although very encouraging, these results need to be consolidated on a larger, prospective cohort. In addition, interobserver and intraobserver reliability must be analyzed, both between surgeons specializing in the spine and spinal cord, and with other practitioners involved in this pathology (general practitioners, neurologists, rehabilitation specialists, etc.), as this score is intended to be used widely. The second benefit of the prospective evaluation would be to demonstrate the potential prognostic value of this score.
This study had some limitations. First, it was a retrospective study, with some missing data. Then, even if already large with a series of 99 patients, the number of patients could be greater to improve the power of the study, especially concerning moderate and severe myelopathy. Finally, regarding the constitution of the score itself, the maintenance of some variables that were not significantly correlated to the mJOA score on multivariate analyses can be criticized, and their relevance should be confirmed.