Next Article in Journal
Arteriovenous Length Ratio: A Novel Method for Evaluating Retinal Vasculature Morphology and Its Diagnostic Potential in Eye-Related Diseases
Previous Article in Journal
Digital Grading the Color Fastness to Rubbing of Fabrics Based on Spectral Reconstruction and BP Neural Network
Previous Article in Special Issue
Assessing Acetabular Index Angle in Infants: A Deep Learning-Based Novel Approach
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Radiomics Texture Analysis of Bone Marrow Alterations in MRI Knee Examinations

1
Medical Image and Signal Processing Laboratory, Department of Biomedical Engineering, University of West Attica, 12241 Athens, Greece
2
Department of Biomedical Sciences, University of West Attica, 12241 Athens, Greece
3
Department of Radiology, Animus Kyanous Stavros, 57014 Larissa, Greece
4
Department of Orthopedic, Animus Kyanous Stavros, 57014 Larissa, Greece
5
Department of Radiology, General University Hospital of Larissa, 41334 Larissa, Greece
*
Author to whom correspondence should be addressed.
J. Imaging 2023, 9(11), 252; https://doi.org/10.3390/jimaging9110252
Submission received: 27 September 2023 / Revised: 1 November 2023 / Accepted: 16 November 2023 / Published: 20 November 2023
(This article belongs to the Special Issue Advances in Image Analysis: Shapes, Textures and Multifractals)

Abstract

:
Accurate diagnosis and timely intervention are key to addressing common knee conditions effectively. In this work, we aim to identify textural changes in knee lesions based on bone marrow edema (BME), injury (INJ), and osteoarthritis (OST). One hundred and twenty-one MRI knee examinations were selected. Cases were divided into three groups based on radiological findings: forty-one in the BME, thirty-seven in the INJ, and forty-three in the OST groups. From each ROI, eighty-one radiomic descriptors were calculated, encoding texture information. The results suggested differences in the texture characteristics of regions of interest (ROIs) extracted from PD-FSE and STIR sequences. We observed that the ROIs associated with BME exhibited greater local contrast and a wider range of structural diversity compared to the ROIs corresponding to OST. When it comes to STIR sequences, the ROIs related to BME showed higher uniformity in terms of both signal intensity and the variability of local structures compared to the INJ ROIs. A combined radiomic descriptor managed to achieve a high separation ability, with AUC of 0.93 ± 0.02 in the test set. Radiomics analysis may provide a non-invasive and quantitative means to assess the spatial distribution and heterogeneity of bone marrow edema, aiding in its early detection and characterization.

1. Introduction

It is thought to be possible to predict pain, disability, and the structural progression of knee osteoarthritis using bone marrow lesions (BMLs). However, the relationship between knee loading and BMLs is not yet completely understood [1]. Moreover, the pathophysiology of osteoarthritis (OA) is also incompletely understood [2]. For example, the pathologic processes, primarily located at the level of the articular cartilage, do not completely characterize numerous OA findings. This includes bone marrow edema (BME) in the subchondral bone, detected via magnetic resonance imaging (MRI) in the affected joints [3]. Felson et al. defined a relationship between knee pain and bone marrow lesions using MRI [4], where bone marrow lesions on MRI were present in 78% of patients with painful OA knee compared to only 30% of patients with painless OA knee. Specifically, the degree of bone edema appeared to be associated with the severity of angular deformity at the knee [5,6].
There is evidence that subchondral BMLs imaged using MRI are involved in the pathogenesis of OA [4,5,7,8]. BME pattern is defined as an area of high signal intensity in T2- weighted, fat-saturated MRI or in short-inversion-time inversion recovery (STIR) images [9,10,11]. This increase in signal has been attributed to several factors, including abnormal trabeculae, bone marrow necrosis, swelling of fat cells, and marrow bleeding. It indicates a so-called bone bruise or impression of fracture due to translational injury, where the anterolateral femur impacts the posterolateral tibia (kissing contusions) when the ACL is ruptured [10]. Regarding healthy subjects with no OA, traumatic injuries of the knee can result in subchondral bone edema lesions [12,13]. However, the causal relationship between these lesions and the disease’s progression is unclear [8]. A better understanding of the interrelation of cartilage and the bone immediately under is necessary [14].
MRI-based texture analysis has been previously proposed for quantification of bone related abnormalities. Fritz et al. [15] and Cilengir et al. [16] have highlighted the diagnostic performance of texture analysis for cartilaginous bone tumors. Fritz et al. found 92.9% accuracy in differentiating benign from malignant cartilaginous lesions when employing both MRI and textural predictors. Cilengir et al. found an AUC of 0.84 in differentiating chondrosarcomas from encondromas employing the logistic regression classifier. Chuah et al. [17] performed texture analysis in the femur for the detection of cartilage-related bone marrow edema. They reported an AUC of 0.91 in differentiating normal from affected bone marrows lesions based on co-occurrence matrix features. Mackay et al. [18] used texture analysis for tibial subchondral bone assessment in OA. They reported 53% accuracy in differentiating between normal and OA cases based on statistical texture analysis. Recently, Li et al. used texture analysis of the infrapatellar fat pad for OA prediction [19]. They reported an AUC ≥0.75 in predicting OA progression employing textural features.
To the best of our knowledge, the utilization of texture analysis to differentiate between bone marrow edema, injury, and osteoarthritis lesions has not yet been reported. We chose to study these three clinical conditions, because they are very common in the knee joint and their radiological findings are not easy to distinguish. Sometimes, even when attempted in combination with clinical findings in patients with OA and injury, or in young patients with injury, it is difficult to make a specific diagnosis. However, such clinical cases require different clinical management, so it is important to differentiate, i.e., while bone marrow edema cases require no treatment, the injured and osteoarthritis cases require different clinical management. Hence, the present study is focused on studying the variations in those descriptors in the different knee abnormalities caused by bone marrow edema, injury, and osteoarthritis. For this, we considered a large pool of radiomics descriptors encoding texture information. These descriptors can express significant properties in the bone marrow edema, like lesions that might be useful in diagnosis, prognosis, or the treatment of individual patients [20].

2. Materials and Methods

2.1. Clinical Material

In the present study, one hundred and twenty-one individuals (121) were routinely scanned for knee examination from 2015 to 2017. This amounted to seventy females and fifty-one males with average age of 41 years. Non-selection criteria were subjects likely to have bone edema (multiple myeloma, Pazet disease, history of sarcoma, or known cancer showing secondary bone metastases).
The subjects were divided into three groups according to radiological findings [21]. The most dominant radiological findings are those which are related with the placement of the edema and its presentation (single-focal or multifocal). In combination with the clinical findings, the three groups were formed.
In the first group, namely, the bone marrow edema (“BME”) class, we placed forty-one (41) subjects (average age 25.5 y) diagnosed with erythropoiesis of bone marrow. These patients were young people and they had more than one focal point of high signal intensity (multifocal) in the presented bones (femur, tibia). They did not have findings of OA or injury of the knee joint.
In the second group, namely, the injured (“INJ”) class, we assigned thirty-seven (37) patients (average age 35.4 y) that had suffered recent injury and bone edema-like lesions, but which did not have findings of OA.
In the third group, namely, the osteoarthritis (“OST”) class, we placed forty-three (43) subjects (average age 61.8 y) with no history of recent injury, but with clinical findings (i.e., chondropathy, mediastinal stenosis, and osteophytes) of OA. In the case of recent injury, MRI examination repetition was conducted to confirm diagnosis.
Differentiation among the three groups is critical to providing accurate diagnosis, appropriate care, optimizing outcomes, preventing complications, and ensuring the efficient use of healthcare resources. Misdiagnosis can lead to inappropriate treatment, delayed management, and potentially worse outcomes. Precise differentiation can prevent excessive medication usage. It allows for tailored, patient-centered care that addresses the specific needs and characteristics of each condition.

2.2. MRI Imaging Techniques and ROIs Delineation

All knee examinations were performed using a 1.5 T scanner (SIGNA HDX Twin Speed, GE Healthcare, Chicago, IL, USA) and the General Electric four-channel matrix knee coil. Two sequences were employed: the fast-spin echo (FSE) proton density (PD-FSE) with fat suppression (FS) and the short tau inversion recovery (STIR) sequences in sagittal plane.
PD-FSE is a standard MRI sequence that provides detailed anatomical information by exploiting the differences in the density of protons in various tissues within the body. It is particularly useful for evaluating bone marrow because it highlights the contrast between fat and water. STIR is another MRI sequence that is highly effective in evaluating bone marrow. STIR imaging is a fat-suppressed sequence that is specifically designed to suppress the signal from fat and highlight the signal from water and other soft tissues. The parameters for PD-FSE and STIR sequences are presented in Table 1.
For each case and for each sequence, a region of interest (ROI) was manually delineated by an expert radiologist (A.T.) and an experienced radiographer (E.L.) employing a software program developed for the purposes of the present study. The software provided the radiologist with a series of DICOM MR images and the capability to select appropriate images and to delineate and eventually segment ROIs. Figure 1 shows a snapshot of delineation (bone marrow edema case) in the PD-FSE sequence and the corresponding ROI.
Finally, we extracted two ROIs for each case, one corresponding to STIR and one to the PD-FSE sequences accordingly. Figure 2 presents instances from the acquired ROIs according to pathological condition and MRI sequence protocol.

2.3. Extraction of Radiomic Descriptors

We computed eighty-one (81) descriptors from a single ROI, encoding texture information and employing a large spectrum of textural features as follows:
This included seventeen (17) descriptors from first-order statistics [22], describing the shape of distribution of pixel intensities in the ROI. These were histogram-based features (such as mean intensity, standard deviation, skewness, kurtosis, median, percentiles, range, interquartile range, and intensity-based entropy).
Fifty-four (54) descriptors were calculated from the second-order statistics, describing the statistical inter-relationships between neighboring pixels.
  • Twenty-two (22) of these were calculated based on the gray-level co-occurrence matrix (GLCM) [22,23,24,25,26] (angular second moment, contrast, correlation, sum of squares, inverse difference moment, sum average, sum variance, sum entropy, entropy, difference variance, difference entropy, information measure correlation 1, information measure correlation 2, maximal correlation coefficient, joint maximum, joint average, difference average, inverse difference, normalized inverse difference, autocorrelation, cluster shade, and cluster prominence).
  • Sixteen (16) were computed based on the gray-level run-length matrix (GLRLM) [22,27,28,29,30] (short-run emphasis, long-run emphasis, gray-level non-uniformity, run-length non-uniformity, run percentage, low-gray run emphasis, high-gray run emphasis, short-run low-gray emphasis, short-run high-gray emphasis, long-run low-gray emphasis, long-run high-gray emphasis, normalized gray-level non-uniformity, normalized run-length non-uniformity, gray-level run variance, and run-length variance, run entropy). For GLCM and GLRLM descriptors, the average across four orientations (N, S, E, W) was determined (using 1 pixel offset).
  • Sixteen (16) were considered based on the gray-level size zone matrix (GLSZM) [22,31] (small-zone emphasis, large-zone emphasis, low-gray-level zone emphasis, high-gray-level zone emphasis, small-zone low-grey emphasis, small-zone high-grey emphasis, large-zone low-grey emphasis, large-zone high-grey emphasis, gray-level non-uniformity, normalized gray-level non-uniformity, zone-size non-uniformity, normalized zone-size non-uniformity, zone percentage, grey-level zone variance, zone-size variance, zone-size entropy)
Six (6) descriptors were calculated based on Tamura descriptors [32] (Tamura coarseness 1, 2, 3, 4, contrast and roughness).
Four (4) descriptors were computed from the local binary pattern (LBP) [33] (mean, median, number of peaks, uniformity)

2.4. Statistical and Discriminant Analysis

We studied plausible alterations of textural descriptors amongst pathological conditions based on the statistical comparisons amongst the three groups (as BME vs INJ vs OST).
Initially, we assessed the normality distribution of each one of the features by means of the Jarque–Bera statistical test [34]. The Jarque–Bera test is a statistical test used to assess whether a dataset follows a normal distribution. Next, for those features that followed normal distribution, we employed the ANOVA test to evaluate the existence of statistically significant differences. ANOVA tests the null hypothesis that all group means are equal against the alternative hypothesis that at least one group mean is different. For the features that did not follow normal distribution, we utilized the Kruskal–Wallis test [35] to identify statistically significant differences, and we reported the median (IQR). The Kruskal–Wallis test is a non-parametric statistical test used to determine whether there are statistically significant differences between the medians of three or more independent groups or samples.
Moreover, we performed the multiple comparison test [36] as a post hoc test. Multiple comparison tests, also known as post hoc tests or post hoc analyses, are a group of statistical tests used after ANOVA, Kruskal–Wallis, or other omnibus tests to determine which specific group means are significantly different from each other. These tests are essential in situations where you have conducted ANOVA, Kruskal–Wallis or a similar test and found a significant difference among groups. The goal of multiple comparison tests is to identify which pairs of groups are responsible for this significant difference. To correct the significance level discrepancy that emerged on account of multiple tests, we employed the Benjamini and Hochberg false discovery rate (BH-FDR) [37,38]. Benjamini and Hochberg testing is a statistical method used to control the false discovery rate in multiple hypothesis testing.
Moreover, for the purposes of applying discriminant analysis on the data we employed the algorithms of the python’s Scikit-Learn (sklearn) open-source machine learning library [39]). First, we ordered the calculated radiomics’ descriptors (features) from the PD-FSE and STIR protocols in accordance with their classification importance by employing the sklearn’s feature_importances_ function. Next, we chose the first twenty (20) most highly important features and we formed all the plausible features’ combinations, with up to a maximum of five features per combination. We used each feature combination to design the classification model and we evaluated its classification accuracy. We tested sixteen sklearn classification models [39]. To test the generalization performance of each designed classification model, data were first augmented by applying the sklearn’s resample function and next the three classes were equalized by applying the python’s imbalanced-learn’s SMOTE (Synthetic Minority Over-sampling Technique) function.
For evaluation purposes, data were randomly split into train–validation (70%) datasets and test (30%) datasets using the sklearn’s train_test_split function to have an estimation of error variation. The train–validation (70%) dataset was used to design the classification model, based on the stratified 10-fold cross-validation method. The latter was employed for tuning hyperparameters and assessing the model’s performance during training. The trained model, so far fine-tuned, was used to classify the independent test set (30%). We use the test set to determine how well the model will perform on unseen data. The above process was repeated 10 times; each time, the evaluation metrics of the test set were recorded. Finally, the means of the evaluation metrics were recorded to provide an unbiased estimate of the model’s generalization performance on unseen data. The whole procedure was used to assess the performance for each one of the sixteen classification models.
In this way we found the best classification model providing the highest classification accuracy to the test dataset data. Three basic evaluation metrics were employed on the classified test datasets:
The overall accuracy as A c c = T P + T N T P + F P + F N + T N , the true positive rate as T P R = T P T P + F N   , and the area under the ROC curve as AUC.

3. Results and Discussion

In total, two hundred and forty-two (242) ROIs were extracted, eighty-two (82) accounting for the BME group (forty-one for each MRI sequence), seventy-four (74) accounting for the INJ group (thirty-seven for each MRI sequence) and eighty-six (86) accounting for the OST group (forty-three for each MRI sequence).
Concerning the PD-FSE sequence, of the eighty-one (81) descriptors, fifty-one (51) descriptors appeared to show statistically significant differences (p < 0.05). We found thirty-four (34) of the 81 descriptors to follow normal distribution according to the Jarque–Bera test, while the forty-seven (47) remaining did not (p < 0.05). Six (6) out of the thirty-four descriptors showed statistically significant differences (p < 0.001) using the ANOVA test after correcting using the BH-FDR test. Five (5) out of the forty-seven descriptors sustained statistically significant differences (p < 0.001) when employing the Kruskal–Wallis nonparametric test after correcting for significance using the BH-FDR test.
Regarding the STIR sequence, of the eighty-one (81) descriptors, thirty-nine (39) descriptors sustained statistically significant differences (p < 0.05). We found that thirty (30) of the 81 descriptors followed normal distribution according to the Jarque–Bera test, while the remaining fifty-one (51) did not (p < 0.05). Three (3) out of thirty descriptors showed statistically significant differences (p < 0.001) when assessed using the ANOVA test after correcting via BH-FDR testing. Eleven (11) out of the fifty-one descriptors showed statistically significant differences (p < 0.001) employing the Kruskal–Wallis nonparametric test, after correcting using the BH-FDR test.
Table 2 and Table 3 present the values for the descriptors extracted from PD-FSE and STIR image-ROIs, respectively, that sustained statistically significant differences, along with the post hoc explanation of pairwise comparison. The values reported as the mean ± standard deviation for features were examined using ANOVA testing, and those as the median (Interquartile range—IQR) were examined via the Kruskal–Wallis test.
Figure 3 and Figure 4 show the boxplots of the features distribution from BME, INJ and OST groups that present statistically significant differences (p < 0.001) concerning the PD-FSE and STIR sequences, respectively.
Table 2 and Table 3 show that there are features with statistically significant differences for one or two pairs of groups, but not for all three pairs of groups. For example, we found statistically significant difference between BME and INJ, as well as between BME and OST with respect to the feature ASM, but we did not find a difference between INJ and OST using the aforementioned feature. In the process of finding some descriptors that, when suitably combined in a regression model as a composite descriptor, may provide a good prediction amongst the group, we tested many different classification models from the Scikit-learn library [39]. The ensemble random forest (ERF) classifier achieved the best performance in terms of Acc, TPR and AUC.
We found five descriptors that appeared in the majority of the descriptors’ combinations for the different evaluation repetitions.
Table 4 presents the metrics values for different validation repetitions. The average metrics scores are Acc: 0.92 ± 0.02, TPRBME: 0.94 ± 0.03, TPRINJ: 0.91 ± 0.02, TPROST: 0.92 ± 0.03, and AUC: 0.93 ± 0.02.
Figure 5a,b present the prediction performance of the five combined features (composite descriptor) in the three categories. These descriptors encode information about first- and second-order statistics, three of them were calculated from the PD-FSE sequence protocol (JMX, LRLGE, RLV), and two were calculated from the STIR sequence protocol (MAD, GLZV).
In particular, the joint maximum (JMX) descriptor is a measure of orderliness in the ROI, considering spatial interrelationships between pixels. High values of the descriptor imply that many identical grey-level neighboring pairs will be located in the ROI. We found higher values in the BME group and smaller values in the OST group when the descriptor was calculated using the PD-FSE protocol ROIs.
The long-run low-gray emphasis (LRLGE) descriptor highlights the existence of large structures along four orientations (N, S, E, W) with low signal intensity, and thus an increased local contrast [29]. We found higher LRLGE values in the BME group and smaller values in the OST group for the ROIs extracted from the PD-FSE protocol images.
Run-length variance (RLV) is a gray-level run-length matrix (GLRLM)-based descriptor that estimates the diversity of structures in the ROI [22]. High values of the descriptor imply a variety in structures in the ROI. We found higher values in the BME group, and smaller values in the OST group when the descriptor was calculated on the basis of PD-FSE protocol ROIs. The importance of structural information extraction has been highlighted in previous work [40].
Mean absolute deviation (MAD) is an intensity-based statistical descriptor and measures the dispersion from the mean [22]. We found smaller deviations from the mean intensity in the BME group, and higher deviations in the INJ group when the descriptor was calculated on the STIR protocol ROIs.
Grey-level zone variance (GLZV) is a grey-level size zone matrix (GLSZM)-based descriptor that quantifies the variance in the number of zones over the grey levels [22]. We found smaller values for the BME group, and higher values for the INJ group when the descriptor is calculated on the STIR protocol ROIs.
Those findings implied deviations in ROI texture as extracted from PD-FSE and STIR sequences. In relation to PD-FSE sequences, we discovered that the BME ROIs exhibit greater local contrast and diversity in structures compared to the OST ROIs, while the INJ ROIs lie in between. Regarding the STIR sequences, the BME ROIs exhibit enhanced homogeneity in signal intensity levels and reduced variation in the local pixel zones compared to the INJ ROIs. On the other hand, the OST ROIs demonstrate intermediate results. Texture variations may originate from the necrosis, fibrosis, and trabeculae differentiations [3] and local subchondral changes in BMLs [8,41].
The division of data into groups was based on radiological findings and this could be considered a limitation of the method. For a more robust approach, arthroscopic findings would be helpful.
Future work may involve identifying and evaluating any textural changes following bone defect fillings. In particular, the integration of biomaterials with the surrounding bone can be analyzed through MRI texture analysis [42]. A successful filling should exhibit a texture that closely resembles the adjacent healthy bone tissue. Furthermore, MRI texture analysis may facilitate the monitoring of integration progress. This might be valuable in order to comprehend how well the filling material acclimates to the mechanical stresses and biological environment of the bone [43]. Changes in texture over time may suggest whether the integration is getting better or whether there are complications such as fibrosis or inflammation. Lastly, MRI texture analysis could be used to compare the integration of diverse filling materials. This might help the selection of appropriate materials for an individual patient and the optimization of treatment protocols.

4. Conclusions

In conclusion, under the concept that changes in the subchondral bone microenvironment may be captured by studying textural image properties, we found that the study of bone marrow edema in MRI knee images through texture analysis and machine learning methods holds significant promise in the field of musculoskeletal medicine. Radiomics analysis may provide a non-invasive and quantitative means of assessing the spatial distribution and heterogeneity of bone marrow edema, aiding in its early detection and characterization. As technology and methodologies continue to advance, the integration of radiomic analysis and machine learning methods into clinical practice is likely to play an increasingly vital role in the management of knee joint conditions.

Author Contributions

Conceptualization, E.L.; methodology, S.K., N.B. and D.C.; software, S.K. and D.C.; validation, S.K., D.C. and E.L.; formal analysis, S.K. and D.C.; investigation, E.L.; resources, N.B., A.T. (Antonios Tsagkalis), M.P. and A.T.(Alexandra Tsikrika); data curation, N.B., A.T. (Antonios Tsagkalis), M.P. and A.T. (Alexandra Tsikrika); writing—original draft preparation, S.K., and N.B.; writing—review and editing, D.C., D.G., P.A. and E.L.; visualization, S.K. and D.C.; supervision, E.L.; project administration, E.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Ethical review and approval were waived for this study due to the reason that the Ethics Committee of the University of West Attica has no jurisdiction in cases where the data collection has been carried out before its establishment (2019). However, the study was conducted according to the guidelines of the Declaration of Helsinki.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Beckwee, D.; Vaes, P.; Shahabpour, M.; Muyldermans, R.; Rommers, N.; Bautmans, I. The Influence of Joint Loading on Bone Marrow Lesions in the Knee: A Systematic Review With Meta-analysis. Am. J. Sports Med. 2015, 43, 3093–3107. [Google Scholar] [CrossRef] [PubMed]
  2. Hunter, D.J.; Gerstenfeld, L.; Bishop, G.; Davis, A.D.; Mason, Z.D.; Einhorn, T.A.; Maciewicz, R.A.; Newham, P.; Foster, M.; Jackson, S.; et al. Bone marrow lesions from osteoarthritis knees are characterized by sclerotic bone that is less well mineralized. Arthritis Res. Ther. 2009, 11, R11. [Google Scholar] [CrossRef] [PubMed]
  3. Zanetti, M.; Bruder, E.; Romero, J.; Hodler, J. Bone marrow edema pattern in osteoarthritic knees: Correlation between MR imaging and histologic findings. Radiology 2000, 215, 835–840. [Google Scholar] [CrossRef] [PubMed]
  4. Felson, D.T.; Chaisson, C.E.; Hill, C.L.; Totterman, S.M.; Gale, M.E.; Skinner, K.M.; Kazis, L.; Gale, D.R. The association of bone marrow lesions with pain in knee osteoarthritis. Ann. Intern. Med. 2001, 134, 541–549. [Google Scholar] [CrossRef]
  5. Felson, D.T.; McLaughlin, S.; Goggins, J.; LaValley, M.P.; Gale, M.E.; Totterman, S.; Li, W.; Hill, C.; Gale, D. Bone marrow edema and its relation to progression of knee osteoarthritis. Ann. Intern. Med. 2003, 139, 330–336. [Google Scholar] [CrossRef]
  6. Sowers, M.F.; Hayes, C.; Jamadar, D.; Capul, D.; Lachance, L.; Jannausch, M.; Welch, G. Magnetic resonance-detected subchondral bone marrow and cartilage defect characteristics associated with pain and X-ray-defined knee osteoarthritis. Osteoarthr. Cartil. 2003, 11, 387–393. [Google Scholar] [CrossRef]
  7. McAlindon, T.E.; Watt, I.; McCrae, F.; Goddard, P.; Dieppe, P.A. Magnetic resonance imaging in osteoarthritis of the knee: Correlation with radiographic and scintigraphic findings. Ann. Rheum. Dis. 1991, 50, 14–19. [Google Scholar] [CrossRef]
  8. Shi, X.; Mai, Y.; Fang, X.; Wang, Z.; Xue, S.; Chen, H.; Dang, Q.; Wang, X.; Tang, S.A.; Ding, C.; et al. Bone marrow lesions in osteoarthritis: From basic science to clinical implications. Bone Rep. 2023, 18, 101667. [Google Scholar] [CrossRef]
  9. Costa-Paz, M.; Muscolo, D.L.; Ayerza, M.; Makino, A.; Aponte-Tinao, L. Magnetic resonance imaging follow-up study of bone bruises associated with anterior cruciate ligament ruptures. Arthrosc. J. Arthrosc. Relat. Surg. Off. Publ. Arthrosc. Assoc. N. Am. Int. Arthrosc. Assoc. 2001, 17, 445–449. [Google Scholar] [CrossRef]
  10. Bretlau, T.; Tuxoe, J.; Larsen, L.; Jorgensen, U.; Thomsen, H.S.; Lausten, G.S. Bone bruise in the acutely injured knee. Knee Surg. Sports Traumatol. Arthrosc. Off. J. ESSKA 2002, 10, 96–101. [Google Scholar] [CrossRef]
  11. Roemer, F.W.; Bohndorf, K. Long-term osseous sequelae after acute trauma of the knee joint evaluated by MRI. Skelet. Radiol. 2002, 31, 615–623. [Google Scholar]
  12. Palmer, W.E.; Levine, S.M.; Dupuy, D.E. Knee and shoulder fractures: Association of fracture detection and marrow edema on MR images with mechanism of injury. Radiology 1997, 204, 395–401. [Google Scholar] [CrossRef] [PubMed]
  13. Adams, J.G.; McAlindon, T.; Dimasi, M.; Carey, J.; Eustace, S. Contribution of meniscal extrusion and cartilage loss to joint space narrowing in osteoarthritis. Clin. Radiol. 1999, 54, 502–506. [Google Scholar] [CrossRef] [PubMed]
  14. Li, X.; Ma, B.C.; Bolbos, R.I.; Stahl, R.; Lozano, J.; Zuo, J.; Lin, K.; Link, T.M.; Safran, M.; Majumdar, S. Quantitative assessment of bone marrow edema-like lesion and overlying cartilage in knees with osteoarthritis and anterior cruciate ligament tear using MR imaging and spectroscopic imaging at 3 Tesla. J. Magn. Reson. Imaging JMRI 2008, 28, 453–461. [Google Scholar] [CrossRef] [PubMed]
  15. Fritz, B.; Muller, D.A.; Sutter, R.; Wurnig, M.C.; Wagner, M.W.; Pfirrmann, C.W.A.; Fischer, M.A. Magnetic Resonance Imaging-Based Grading of Cartilaginous Bone Tumors: Added Value of Quantitative Texture Analysis. Investig. Radiol. 2018, 53, 663–672. [Google Scholar] [CrossRef]
  16. Cilengir, A.H.; Evrimler, S.; Serel, T.A.; Uluc, E.; Tosun, O. The diagnostic value of magnetic resonance imaging-based texture analysis in differentiating enchondroma and chondrosarcoma. Skelet. Radiol. 2023, 52, 1039–1049. [Google Scholar] [CrossRef]
  17. Chuah, T.K.; Van Reeth, E.; Sheah, K.; Poh, C.L. Texture analysis of bone marrow in knee MRI for classification of subjects with bone marrow lesion—Data from the Osteoarthritis Initiative. Magn. Reson. Imaging 2013, 31, 930–938. [Google Scholar] [CrossRef]
  18. MacKay, J.W.; Murray, P.J.; Low, S.B.L.; Kasmai, B.; Johnson, G.; Donell, S.T.; Toms, A.P. Quantitative analysis of tibial subchondral bone: Texture analysis outperforms conventional trabecular microarchitecture analysis. J. Magn. Reson. Imaging 2016, 43, 1159–1170. [Google Scholar] [CrossRef]
  19. Li, J.; Fu, S.; Gong, Z.; Zhu, Z.; Zeng, D.; Cao, P.; Lin, T.; Chen, T.; Wang, X.; Lartey, R.; et al. MRI-based Texture Analysis of Infrapatellar Fat Pad to Predict Knee Osteoarthritis Incidence. Radiology 2022, 304, 611–621. [Google Scholar] [CrossRef]
  20. Rizzo, S.; Botta, F.; Raimondi, S.; Origgi, D.; Fanciullo, C.; Morganti, A.G.; Bellomi, M. Radiomics: The facts and the challenges of image analysis. Eur. Radiol. Exp. 2018, 2, 36. [Google Scholar] [CrossRef]
  21. Kornaat, P.R.; Ceulemans, R.Y.; Kroon, H.M.; Riyazi, N.; Kloppenburg, M.; Carter, W.O.; Woodworth, T.G.; Bloem, J.L. MRI assessment of knee osteoarthritis: Knee Osteoarthritis Scoring System (KOSS)—Inter-observer and intra-observer reproducibility of a compartment-based scoring system. Skelet. Radiol. 2005, 34, 95–102. [Google Scholar] [CrossRef] [PubMed]
  22. Zwanenburg, A.; Leger, S.; Vallières, M.; Löck, S. Image biomarker standardisation initiative. arXiv 2016, arXiv:1612.07003. [Google Scholar]
  23. Haralick, R.; Shanmugam, K.; Dinstein, I. Textural features for image classification. IEEE Trans. Syst. Man Cybern. 1973, 3, 610–621. [Google Scholar] [CrossRef]
  24. Unser, M. Sum and difference histograms for texture classification. IEEE Trans. Pattern Anal. Mach. Intell. 1986, 8, 118–125. [Google Scholar] [CrossRef]
  25. Clausi, D.A. An analysis of co-occurrence texture statistics as a function of grey level quantization. Can. J. Remote Sens. 2002, 28, 45–62. [Google Scholar] [CrossRef]
  26. van Griethuysen, J.J.M.; Fedorov, A.; Parmar, C.; Hosny, A.; Aucoin, N.; Narayan, V.; Beets-Tan, R.G.H.; Fillion-Robin, J.C.; Pieper, S.; Aerts, H. Computational Radiomics System to Decode the Radiographic Phenotype. Cancer Res. 2017, 77, e104–e107. [Google Scholar] [CrossRef]
  27. Galloway, M.M. Texture Analysis Using Grey Level Run Lengths. Comput. Graph. Image Process. 1975, 4, 172–179. [Google Scholar] [CrossRef]
  28. Chu, A.; Sehgal, C.M.; Greenleaf, J.F. Use of gray value distribution of run lengths for texture analysis. Pattern Recognit. Lett. 1990, 11, 415–419. [Google Scholar] [CrossRef]
  29. Dasarathy, B.V.; Holder, E.B. Image characterizations based on joint gray level—Run length distributions. Pattern Recognit. Lett. 1991, 12, 497–502. [Google Scholar] [CrossRef]
  30. Albregtsen, F.; Nielsen, B.; Danielsen, H.E. Adaptive gray level run length features from class distance matrices. In Proceedings of the 15th International Conference on Pattern Recognition, ICPR-2000, Barcelona, Spain, 3–7 September 2000; Volume 733, pp. 738–741. [Google Scholar]
  31. Thibault, G.; Angulo, J.; Meyer, F. Advanced Statistical Matrices for Texture Characterization: Application to Cell Classification. IEEE Trans. Biomed. Eng. 2014, 61, 630–637. [Google Scholar] [CrossRef]
  32. Tamura, H.; Mori, S.; Yamawaki, T. Textural Features Corresponding to Visual Perception. IEEE Trans. Syst. Man Cybern. 1978, SMC-8, 460–472. [Google Scholar] [CrossRef]
  33. El Merabet, Y.; Ruichek, Y.; El Idrissi, A. Attractive-and-repulsive center-symmetric local binary patterns for texture classification. Eng. Appl. Artif. Intell. 2019, 78, 158–172. [Google Scholar] [CrossRef]
  34. Jarque, C.M.; Bera, A.K. A Test for Normality of Observations and Regression Residuals. Int. Stat. Rev. Rev. Int. De Stat. 1987, 55, 163–172. [Google Scholar] [CrossRef]
  35. McKight, P.E.; Najab, J. Kruskal-Wallis Test. In The Corsini Encyclopedia of Psychology; John Wiley & Sons: New York, NY, USA, 2010; p. 1. [Google Scholar]
  36. Hochberg, Y.; Tamhane, A.C. Multiple Comparison Procedures; John Wiley & Sons: Hoboken, NJ, USA, 1987. [Google Scholar]
  37. Storey, J.D. A direct approach to false discovery rates. J. R. Stat. Soc. Ser. B 2002, 64, 479–498. [Google Scholar] [CrossRef]
  38. Benjamini, Y.; Hochberg, Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J. R. Stat. Soc. Ser. B 1995, 57, 289–300. [Google Scholar] [CrossRef]
  39. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
  40. Wehrli, F.W. Structural and functional assessment of trabecular and cortical bone by micro magnetic resonance imaging. J. Magn. Reson. Imaging 2007, 25, 390–409. [Google Scholar] [CrossRef]
  41. Hu, Y.; Chen, X.; Wang, S.; Jing, Y.; Su, J. Subchondral bone microenvironment in osteoarthritis and pain. Bone Res. 2021, 9, 20. [Google Scholar] [CrossRef]
  42. Pop, T.S.; Miron, A.D.T.; Pop, A.M.; Brinzaniuc, K.; Trambitas, C. Magnetic Resonance Imaging Assessment of Bone Regeneration in Osseous Defects Filled with Different Biomaterials. An experimental in vivo study. Mater. Plast. 2019, 56, 235–238. [Google Scholar] [CrossRef]
  43. Kühn, J.P.; Hernando, D.; Meffert, P.J.; Reeder, S.; Hosten, N.; Laqua, R.; Steveling, A.; Ender, S.; Schröder, H.; Pillich, D.T. Proton-density fat fraction and simultaneous R2* estimation as an MRI tool for assessment of osteoporosis. Eur. Radiol. 2013, 23, 3432–3439. [Google Scholar] [CrossRef]
Figure 1. An instance of ROI delineation from bone marrow edema like lesion.
Figure 1. An instance of ROI delineation from bone marrow edema like lesion.
Jimaging 09 00252 g001
Figure 2. Samples of region of interest for the three categories and two MRI sequences; bone marrow edema (first row), injured (middle row), and osteoarthritis (last row); PD-FSE (left column) and STIR (right column).
Figure 2. Samples of region of interest for the three categories and two MRI sequences; bone marrow edema (first row), injured (middle row), and osteoarthritis (last row); PD-FSE (left column) and STIR (right column).
Jimaging 09 00252 g002
Figure 3. Boxplots of the eleven features that present statistically significant differences amongst the groups (BME vs INJ vs OST) for the ROIs from PD-FSE sequence.
Figure 3. Boxplots of the eleven features that present statistically significant differences amongst the groups (BME vs INJ vs OST) for the ROIs from PD-FSE sequence.
Jimaging 09 00252 g003
Figure 4. Boxplots of the eleven features that present statistically significant differences amongst the groups (BME vs INJ vs OST) for the ROIs from STIR sequence.
Figure 4. Boxplots of the eleven features that present statistically significant differences amongst the groups (BME vs INJ vs OST) for the ROIs from STIR sequence.
Jimaging 09 00252 g004
Figure 5. (a) ROC curves by the ensemble random forest classifier in test set; (b) boxplot for the predicted values of the five combined composite descriptor.
Figure 5. (a) ROC curves by the ensemble random forest classifier in test set; (b) boxplot for the predicted values of the five combined composite descriptor.
Jimaging 09 00252 g005
Table 1. Sequences parameters for proton density (PD-FSE) fast-spin echo (FSE) with fat suppression and short tau inversion recovery (STIR).
Table 1. Sequences parameters for proton density (PD-FSE) fast-spin echo (FSE) with fat suppression and short tau inversion recovery (STIR).
SequencesPD-FSESTIR
Parameters
ΤR (ms)22005675
TE (ms)3050
ETL816
Band Width (kHz)31.2520.83
Freq384 × 224256 × 192
Freq DIRA/PA/P
Slice Thickness (mm)4.04.0
Spacing (mm)0.80.8
FOV (mm)1818
Slices1918
Acquisition Time (min)2′.17″2′.22″
Table 2. Features sustained statistically significant differences amongst the categories concerning the PD-FSE sequence, after BH-FDR correction.
Table 2. Features sustained statistically significant differences amongst the categories concerning the PD-FSE sequence, after BH-FDR correction.
FeatureBME
Mean ± Std or Median (IQR)
INJ
Mean ± Std or
Median (IQR)
OST
Mean ± Std or
Median (IQR)
Description of Post Hoc Pairwise Comparison (p < 0.001)
Entropy0.95 (0.09)0.90 (0.11)0.87 (0.08)The mean ranks of groups BME and OST are significantly different
ASM0.22 (0.07)0.18 (0.05)0.18 (0.03)Two groups have mean ranks significantly different from BME
SOQ4.86 (1.63)3.74 (1.39)4.32 (0.97)The mean ranks of groups BME and INJ are significantly different
SVAR18.73 (6.17)14.45 (5.48)16.71 (3.65)The mean ranks of groups BME and INJ are significantly different
SENTR2.59 (0.27)2.77 (0.28)2.77 (0.22)Two groups have mean ranks significantly different from BME
ENTR2.83( 0.32)3.00 (0.35)3.05 (0.23)Two groups have mean ranks significantly different from BME
JMX0.38± 0.080.32± 0.060.30± 0.05The mean ranks of groups BME and OST are significantly different
LRE97.74 ± 49.1253.51 ± 29.6151.62 ± 19.99Two groups have mean ranks significantly different from BME
LRLGE77.56 ± 47.4036.40 ± 25.9033.93 ± 17.51Two groups have mean ranks significantly different from BME
LRHGE727.09 ± 497.94478.65 ± 179.35553.00 ± 229.49The mean ranks of groups BME and INJ are significantly different
RLV70.55 ± 41.6033.38 ± 21.9632.01 ± 14.96Two groups have mean ranks significantly different from BME
Table 3. Features sustained statistically significant differences amongst the categories concerning the STIR sequence, after BH-FDR correction.
Table 3. Features sustained statistically significant differences amongst the categories concerning the STIR sequence, after BH-FDR correction.
FeatureBME
Mean ± Std or Median (IQR)
INJ
Mean ± Std or
Median (IQR)
OST
Mean ± Std or
Median (IQR)
Description of Post Hoc Pairwise Comparison (p < 0.001)
Std29.10 (7.87)36.58 (8.57)33.29 (7.71)Two groups have means significantly different from BME
MAD23.64 (6.66)29.40 (8.18)27.09 (6.30)Two groups have means significantly different from BME
Entropy0.96± 0.050.90± 0.070.92± 0.06The mean ranks of groups BME and INJ are significantly different
ASM0.20± 0.050.15± 0.030.15 ± 0.04Two groups have mean ranks significantly different from BME
SENTR2.72 ± 0.233.02 ± 0.183.00 ± 0.23Two groups have mean ranks significantly different from BME
ENTR3.12 ± 0.303.50 ± 0.253.47 ± 0.32Two groups have mean ranks significantly different from BME
JMX0.37 ± 0.080.28 ± 0.060.30 ± 0.06Two groups have mean ranks significantly different from BME
LRE43.60 ± 26.9822.57 ± 12.0625.47 ± 16.69Two groups have mean ranks significantly different from BME
LRLGE33.96 ± 24.8814.74 ± 10.2817.26 ± 12.92Two groups have mean ranks significantly different from BME
LRHGE277.32 ± 122.49195.16 ± 80.30217.82 ± 132.22The mean ranks of groups BME and INJ are significantly different
NGLNU0.23 ± 0.020.21 ± 0.020.22 ± 0.02Two groups have mean ranks significantly different from BME
RLV29.22 ± 20.7412.80 ± 8.4914.52 ± 10.82Two groups have mean ranks significantly different from BME
GLNUsz21.48 ± 10.2714.00 ± 8.3512.23 ± 5.76Two groups have mean ranks significantly different from BME
GLZV1.7 4 0.50)2.27 (0.54)2.08 (0.51)The means of groups BME and INJ are significantly different
Table 4. Classification metrics for different evaluation repetitions in the Test sets.
Table 4. Classification metrics for different evaluation repetitions in the Test sets.
k-foldAccTPRAUC
BMEINJOST
10.890.890.890.890.92
20.960.990.940.940.98
30.910.930.900.910.91
40.920.900.900.960.91
50.910.910.930.890.91
60.920.940.870.950.95
70.930.940.920.930.93
80.930.930.910.940.93
90.910.980.900.860.94
100.930.960.920.910.96
Average (std)0.92 (0.02)0.94 (0.03)0.91 (0.02)0.92 (0.03)0.93 (0.02)
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kostopoulos, S.; Boci, N.; Cavouras, D.; Tsagkalis, A.; Papaioannou, M.; Tsikrika, A.; Glotsos, D.; Asvestas, P.; Lavdas, E. Radiomics Texture Analysis of Bone Marrow Alterations in MRI Knee Examinations. J. Imaging 2023, 9, 252. https://doi.org/10.3390/jimaging9110252

AMA Style

Kostopoulos S, Boci N, Cavouras D, Tsagkalis A, Papaioannou M, Tsikrika A, Glotsos D, Asvestas P, Lavdas E. Radiomics Texture Analysis of Bone Marrow Alterations in MRI Knee Examinations. Journal of Imaging. 2023; 9(11):252. https://doi.org/10.3390/jimaging9110252

Chicago/Turabian Style

Kostopoulos, Spiros, Nada Boci, Dionisis Cavouras, Antonios Tsagkalis, Maria Papaioannou, Alexandra Tsikrika, Dimitris Glotsos, Pantelis Asvestas, and Eleftherios Lavdas. 2023. "Radiomics Texture Analysis of Bone Marrow Alterations in MRI Knee Examinations" Journal of Imaging 9, no. 11: 252. https://doi.org/10.3390/jimaging9110252

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop