Next Article in Journal
Understanding Surgical Management and Outcomes in Mitral Valve Endocarditis
Previous Article in Journal
Investigating Peri-Ictal MRI Abnormalities: A Prospective Neuroimaging Study on Status Epilepticus, Seizure Clusters, and Single Seizures
Previous Article in Special Issue
Patient Recruitment Characteristics for Wearable-Sensor-Based Outcome Assessment in Trauma Surgery
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Power of Heuristics in Predicting Fracture Nonunion

1
BG Klinik Ludwigshafen, Department for Orthopaedics and Trauma Surgery, Heidelberg University, Ludwig-Guttmann-Str. 13, 67071 Ludwigshafen, Germany
2
Department of Trauma, Hand and Reconstructive Surgery, University Hospital Essen, Hufelandstr. 55, 45147 Essen, Germany
3
Department of Orthopedic and Trauma Surgery, BG Klinikum Duisburg, University of Duisburg-Essen, 47249 Duisburg, Germany
4
Department of Trauma Surgery, BG Unfallklinik Murnau, Professor-Küntscher-Str. 8, 82418 Murnau, Germany
5
Department for Trauma and Reconstructive Surgery, Eberhard Karls University Tuebingen, BG Klinik Tuebingen, Schnarrenbergstr. 95, 72076 Tuebingen, Germany
6
BG Kliniken—Klinikverbund der Gesetzlichen Unfallversicherung gGmbH, Leipziger Pl. 1, 10117 Berlin, Germany
7
OSORA Medical GmbH, Industriestr. 10, 89231 Neu-Ulm, Germany
*
Author to whom correspondence should be addressed.
J. Clin. Med. 2025, 14(8), 2713; https://doi.org/10.3390/jcm14082713
Submission received: 4 March 2025 / Revised: 7 April 2025 / Accepted: 11 April 2025 / Published: 15 April 2025

Abstract

:
Background/Objectives: Although extensive research on risk factors for nonunion development has been published, clinicians frequently rely on heuristic reasoning—intuitive, experience-based decision-making—to predict nonunions. However, the accuracy of these intuitive assessments and the influence of clinician experience remain uncertain. This study aims to assess clinicians’ diagnostic accuracy in predicting nonunion, investigate the impact of experience on predictive performance, and identify patient-specific factors contributing to diagnostic errors. Methods: This retrospective, multi-center cohort study included 98 patients with surgically treated tibial shaft fractures between 2018 and 2023 from four level-one trauma centers in Germany. Fracture outcomes were classified as either nonunion (n = 20) or regular fracture healing (n = 78). Patient cases were presented to 24 clinicians. Each clinician independently assessed preoperative and postoperative biplanar X-rays and patient histories to predict fracture healing. Results: Clinicians’ sensitivity significantly improved from 50.4% to 60.2%, while specificity declined (74.0% to 70.7%) with the addition of postoperative information. No significant differences in predictive performance were observed across different levels of clinician experience. Changes in assessment after reviewing postoperative information were equally likely to be beneficial or detrimental. Certain patient factors, including obesity and smoking, influenced prediction errors. Conclusions: This study is the first to assess heuristic reasoning in nonunion prediction. The findings suggest that clinician experience does not significantly enhance diagnostic accuracy under limited-information conditions. Patients should be informed that predicting individual nonunion risk remains challenging. Larger studies are needed to explore the role of patient-specific factors and refine clinical decision-making in fracture healing prognosis.

1. Introduction

Nonunions (NUs) are recognized as one of the most severe and challenging complications in orthopedics and traumatology [1]. An NU is defined as the failure of a fracture to consolidate without additional surgical intervention 6 to 9 months after the index event [2]. Treatment often requires multiple surgical procedures, the use of autologous or allogenic bone grafts to fill bone defects, and, in some serious cases, even amputation [3]. NUs are associated with prolonged pain and functional deficits, contributing to a considerable decline in quality of life comparable to advanced heart disease or stroke [4,5,6]. The socioeconomic burden of NUs is substantial, with costs far exceeding those of regular fracture healing [7,8].
NUs are commonly observed after fractures of the long bones, predominantly the tibia. Incidence rates of up to 30% have been reported, depending on risk factors such as smoking, diabetes mellitus, soft tissue damage, open wounds, and subsequent infection [7,9]. Certain fracture patterns significantly influence the risk of NU as well, with multi-fragmentary fractures (classified as type C in the AO/OTA system) exhibiting higher NU rates compared to simpler fractures (Type A or B) [9].
Optimal primary management of fractures is crucial in preventing NUs [10]. Given that infection is a major contributing factor, adherence to surgical asepsis and meticulous debridement of infected or necrotic tissue are essential [11]. Additionally, addressing patient-related variables associated with impaired bone metabolism further reduces the risk of NU [12,13]. From a surgical perspective, fracture gaps must be minimized to facilitate direct bone healing, and implants providing proper construct stability are critical. While too-rigid interfaces may hinder micromovements essential for fracture healing, insufficient firmness can result in excessive shear at the fracture site, predisposing patients to NU [14].
As these influencing factors are well documented in the literature, highlighted by a recent meta-analysis encompassing 111 studies and over 40,000 patients [9], clinicians often intuitively understand whether a fracture will heal or result in an NU. In scientific terms, this intuition is named heuristic reasoning. In psychology, heuristics are mental shortcuts that enable individuals to make decisions based on limited information [15]. While heuristic approaches may overlook certain details, they can often yield highly accurate results [15,16]. This is particularly valuable if experienced clinicians are consulted, whereas caution is necessary among younger clinicians or medical students [16,17,18]. Furthermore, clinicians of different clinical backgrounds may assign varying levels of importance to relevant information, leading to divergent heuristic decisions based on the same data [19]. Regarding experience, one study found no evidence of age-related differences in the use of common heuristics. However, it did reveal that younger adults were more susceptible to the sunk-cost bias—the tendency to continue an endeavor due to prior investments of time, effort, or money, even when the costs outweigh the benefits—suggesting a greater likelihood of persisting in suboptimal decisions compared to older adults [20].
Little is known about the role of heuristics in predicting fracture outcomes or diagnosing NUs. It remains unclear whether more experienced surgeons achieve greater diagnostic accuracy when relying on limited information. Additionally, questions persist regarding whether clinically known patient factors contribute to diagnostic bias and in which way. This study aims to address these gaps in the literature.

2. Materials and Methods

The Ethics Committee of the Medical Association of Rhineland-Palatinate, Germany, approved this study (approval number: 2023-17066-retrospektiv).
This multicenter, retrospective cohort study was conducted at four German level-one trauma centers and included anonymized data from patients treated for tibial shaft fractures between January 2018 and December 2023. The inclusion and exclusion criteria are outlined in Table 1.
We enrolled 98 patients, 20 of whom were diagnosed with NU, while 78 subjects showed normal fracture healing. Figure 1 illustrates a representative case.
Relevant demographic variables and medical history, including obesity, hypertension, and smoking, were documented and displayed alongside pre- and postoperative biplanar X-rays to the participating clinicians. Table 2 provides a summary of patient information.
According to the definition outlined in the Introduction Section (Section 1), fractures that did not progress to healing within a timeframe of 6 to 9 months after initial surgical treatment were considered NUs.
A total of twenty-four raters (six from each hospital) were recruited, representing a diverse range of experience and professional roles. These included the head of the surgical department, a consultant, a specialist, a resident, and an intern, all with backgrounds in orthopedic or trauma surgery. Additionally, one radiologist from each site was included, given their expertise in interpreting biplanar radiographs. Each rater independently evaluated each case to determine whether the fracture would heal or progress to an NU. This assessment was made in two stages. Initially, preoperative information and X-ray images obtained at the time of injury were presented, and clinicians were asked to predict the outcome. Subsequently, postoperative radiological results were provided alongside surgical information, and raters then reassessed the outcomes.
Statistical analysis: Data were collected using Microsoft Excel (Microsoft Corporation, Redmond, WA, USA) and analyzed using JASP v0.19.1 [22], pandas v2.2.3 [23], and seaborn v0.13.2 [24]. Fracture classification adhered to the Arbeitsgemeinschaft für Osteosynthesefragen (AO; Association for the Study of Internal Fixation) and the Orthopaedic Trauma Association (OTA). To quantify the predictive power of the experience-based risk assessment conducted by the clinical raters, a binary confusion matrix was computed for each rater, both for preoperative and postoperative assessments. A fracture union outcome y or prediction y ^ was coded as 0 (“negative”), while an NU was coded as 1 (“positive”). False positives (FPs) were cases in which raters incorrectly predicted an NU. In contrast, false negatives (FNs) represented fractures expected to heal resulting in NUs. Based on these confusion matrices, the binary classification metrics—sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV)—were calculated to identify the factors most frequently leading to incorrect predictions; the prediction error ε was defined as ε = y ^ y , where false positives (FP) corresponded to an error value of 1, and false negatives (FN) correspond to an error value of −1. A multiple linear regression model was applied to estimate the association between patient characteristics and ε .
To test the statistical significance of differences between n > 2 parametric variables, a one-way analysis of variance (ANOVA) followed by Bonferroni’s multiple comparisons test was performed. Two parametric variables, like the same group’s preoperative and postoperative performance metrics, were compared using Student’s paired t-test. Equal variances were assumed, depending on Levene’s test. A two-sided p-value < 0.05 was considered to show statistical significance, i.e., indicating that the results were not explainable by chance alone.

3. Results

3.1. Patient Characteristics

Ninety-eight patient cases were included in this study. Table 3 summarizes their main characteristics. Most tibial shaft fractures were classified as 42-B (n = 50), followed by 42-A (n = 39) and 42-C (n = 9). Upon further review, one patient was reclassified as having a distal tibia fracture (AO 43), as an intra-articular fissure was not identified during the initial assessment. Figure 2 shows the distribution of fracture types according to the AO classification.

3.2. Rater Characteristics

A total of 24 raters participated in this study. As expected, higher professional positions were associated with more years of experience. Table 4 provides a detailed summary of the characteristics of the raters.

3.3. Clinicians’ Performance Metrics in Predicting Nonunion Development

While sensitivity increased significantly with the addition of postoperative information from 50.4 to 60.2%, specificity decreased from 74.0% to 70.7% (p = 0.08). The positive predictive value (PPV) decreased marginally between preoperative and postoperative evaluations, whereas the negative predictive value (NPV) increased slightly (84.3% to 86.3%) when postoperative information was incorporated. Figure 3 presents the results.

3.4. No Significant Difference in Prediction Metrics Across Clinician Roles

To assess whether a clinician’s position influences predictive performance, the results were categorized based on the clinician’s stage of training. There were no significant differences in sensitivity, specificity, PPV, or NPV across different experience levels. Figure 4 presents the results.

3.5. Greater Experience Does Not Correlate with Improved Intuition

To further examine the impact of experience on clinicians’ ratings, performance metrics were analyzed in relation to years of clinical experience. PPV tended to correlate with experience, but only in the preoperative evaluation. No significant correlation was observed. Figure 5 presents the results.

3.6. Changing Initial Ratings After Receiving Postoperative Information Did Not Improve Clinicians’ Accuracy

When clinicians revised their evaluations after receiving additional postoperative information, these changes were equally likely to be beneficial or detrimental. Figure 6 presents the results.

3.7. Clinically Relevant Patient Characteristics Influence Prediction Error of Postoperative Evaluation

Patient characteristics—including obesity, smoking, and a postoperative fracture gap greater than 4 mm—were evaluated during the processing of case vignettes. Clinicians noticeably overestimated the risk of NU in cases of compartment syndrome, open fractures, and a postoperative fracture gap >4 mm. In contrast, polytrauma and obesity did not affect the prediction error, while the impact of smoking was underestimated. Due to the small case numbers for some of these factors (see Table 3), individual effects were not statistically analyzed. Figure 7 displays the mean prediction error of clinicians when these characteristics were present versus absent.

4. Discussion

4.1. Principal Findings

To our knowledge, this is the first study to investigate the role of heuristics in predicting NU after tibia fractures. This research project analyzed a cohort of 98 case vignettes, in which 20 tibial fractures resulted in an NU (20.4%), while 78 (79.6%) cases progressed with physiological fracture healing (see Table 3). This distribution aligns with reported NU rates in the literature, which range from 0% to 42.7% [9]. The cohort investigated is representative, comprising a predominantly male population (69%) with a mean age of 40 years [25]. Various risk factors were present, including open fractures (30%), obesity (7%), and smoking (9%). Additionally, compartment syndrome was diagnosed in 9% of cases, closely matching the reported incidence of 8% in the literature [26]. Soft tissue damage was documented in 29% of cases, which is somewhat lower than the rate suggested in the literature [27].
The overall sensitivity of clinician raters in excluding the progress to NUs was 50% when only preoperative information was provided. It improved to 60% with the inclusion of additional postoperative data (see Figure 3). The specificity declined from 74% to 70%. No significant differences were observed when performance metrics were analyzed according to clinician roles or correlated with years of experience (see Figure 4a,b and Figure 5a,b). Sensitivity significantly increased between preoperative and postoperative evaluations amongst specialists and residents. However, specificity significantly decreased in specialists. This trend was not observed among consultants and heads of surgical departments, suggesting that these groups were less likely to overestimate the risk of NU after reviewing postoperative information (see Figure 4c).
An analysis of changes between preoperative and postoperative evaluations did not reveal a consistent pattern. In some cases, postoperative information improved accuracy; in others, it had a contradicting effect (see Figure 6). Furthermore, the present data provided no evidence of a sunk cost fallacy, as reported by Taylor et al. [20], among clinician raters when diagnosing NU. Clinicians with more years of experience did not differ significantly in their total, beneficial, or detrimental switch counts compared to those with less experience (data not shown).

4.2. Prediction Errors of Clinician Raters Are Associated with Clinically Relevant Patient Factors

Patient-, injury-, and management-related features have been recognized as key determinants of post-surgical fracture outcomes [7,9,28,29]. Tian et al. systematically reviewed these variables in a meta-analysis encompassing over 40,000 patients [9]. They identified 15 items significantly influencing the likelihood of fracture union: age > 60 years, male gender, smoking, body mass index > 40, diabetes mellitus, intake of nonsteroidal anti-inflammatory drugs (NSAIDs) and/or opioids, fractures of the middle and distal tibia, high-energy and/or open fractures, Gustilo–Anderson grade IIIB or IIIC soft tissue injury, AO-type C fractures, open reduction, and surgical site infection. This spectrum compares well to that previously shown by Zura et al. in a nationwide U.S. claims database of approximately 90.1 million participants [25]. Due to the limited sample size, we were unable to investigate the contribution of all factors to clinical prediction thoroughly (see Table 3).
Clinicians overestimated the risk of NU if there was compartment syndrome, an open fracture, or a postoperative fracture gap > 4 mm. Although previous studies have demonstrated that these factors significantly affect fracture healing [9,25], their influence in the present cohort was less pronounced than anticipated by clinicians. Polytrauma and obesity did not affect accuracy, while the impact of smoking was underestimated. These findings highlight the challenges of accurately assessing the influence of individual patient factors on fracture healing outcomes. A potential solution may be to train machine learning algorithms using this and future larger datasets, as such algorithms are well suited for recognizing recurring patterns. Further studies are required to assess the performance of machine learning models in this context.

4.3. Heuristic Biases of Tibial Nonunion Diagnosis in Context

Cognitive bias can negatively affect surgical performance and patient outcomes at any stage of care [30]. Edelstein et al. showed that heuristic-based decision-making may lead to erroneous assumptions and suboptimal choices between total hip arthroplasty and hemiarthroplasty in patients with femoral neck fractures [31]. Similarly, Janssen et al. demonstrated that orthopedic surgeons are prone to common cognitive biases, including confirmation bias and anchoring [32].
The present study revealed some confirmation bias where participants tend to favor or recall information that aligns with their existing beliefs while discounting contradictory evidence. Some clinicians did not revise their evaluations at all after reviewing the postoperative information, while others made only minor adjustments. This pattern suggests that additional data were sometimes underutilized, despite their potential influence on the likelihood of nonunion. Notably, the total number of evaluation changes did not correlate with clinical experience or professional role, suggesting that more experienced physicians may just be as vulnerable to confirmation bias as their less experienced colleagues.
Anchoring describes a phenomenon in which individuals rely too heavily on the first piece of information (the “anchor”) when making subsequent decisions, even when new or more relevant data become available. Failing to adjust an assessment after reviewing postoperative findings may reflect anchoring bias, confirmation bias, or both. In practice, these two biases exhibit similar tendencies—a reluctance to update beliefs based on new information—so a lack of “postoperative switches” might be driven by overlapping cognitive factors.
A potential solution to suboptimal heuristic reasoning is to use objective scoring systems. Such tools can support clinical decision-making in orthopedics, traumatology, and other specialties; however, they have also been subject to scrutiny [33,34,35]. Regarding orthopedic research, recent work evaluating three recognized nonunion scores (FRACTING, NURD, LEG-NUI) in patients with tibial shaft fractures demonstrated high specificity (86.5–96.6%) but low sensitivity (14.6–63.4%) in predicting NU [36]. Compared with the present findings, these scoring methods could improve specificity for diagnosing nonunion but are unlikely to increase sensitivity, which is essential for identifying high-risk patients at an early stage. Consequently, some authors propose integrating artificial intelligence into clinical decision-making [37], which could represent a promising avenue for refining the diagnostic process in nonunion detection, as mentioned above.

4.4. Limitations

This study has several limitations. First, clinician raters could not access all the information typically available in a hospital setting. For example, while CT scans often provide additional details about fracture patterns, clinicians in this study were limited to evaluating X-ray images. Additionally, soft tissue injuries may have been underreported, as indicated by the discrepancy between detected injuries (see Table 3) and those reported in the literature (see Section 4.1). As clinicians know that most fractures have some degree of surrounding soft tissue injury, clinicians are likely to include this knowledge into their evaluation regardless of whether low-grade soft tissue injuries are reported or not.
Furthermore, this study included only cases from level 1 trauma centers in a developed country, where surgical process quality and outcomes are expected to be uniformly high. This uniformity made distinguishing between “good” and “poor” surgical treatments challenging, potentially leading to an underestimation of clinician heuristics.
Postoperative evaluation was based solely on radiographic imaging and implant information, but no data on postoperative weight-bearing were provided. Moreover, data on postoperative infections—an important factor known to impact patient outcomes—were not included. However, since this study aimed to investigate heuristics based on perioperative information, this limitation was an inherent aspect of the study design.
Finally, the number of patients in certain subgroups was too small to draw definitive statistical conclusions regarding the effect of specific patient factors. Larger studies are needed to investigate these associations.

5. Conclusions

This study represents the first analysis of clinician heuristics in predicting NUs, highlighting the inherent complexity of forecasting fracture outcomes. Notably, within this cohort and under limited information, neither clinician position nor experience significantly influenced diagnostic metrics. Clinical assessment of the risk of NU following fractures remains challenging. Future research should explore the potential of machine learning algorithms for early NU risk estimation. Furthermore, larger studies are needed to validate these findings and assess the specific impact of patient factors through subgroup analyses.

Author Contributions

Conceptualization, J.A., E.S., S.H., M.K.R., D.S., G.R., P.A.G. and H.F.; methodology, J.A., D.S. and H.F.; formal analysis, J.A., E.S., S.H., M.K.R., D.S., G.R. and H.F.; investigation, J.A., E.S., S.H., M.K.R. and H.F.; resources, J.A., E.S., S.H., M.K.R., D.S., G.R., P.A.G. and H.F.; data curation, J.A., D.S. and H.F.; writing—original draft preparation, J.A.; writing—review and editing, J.A., E.S., S.H., M.K.R., D.S., G.R., P.A.G. and H.F.; visualization, J.A., D.S. and F.N.; supervision, J.A., D.S. and H.F.; project administration, J.A. and H.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

This study was conducted in accordance with the Declaration of Helsinki and was approved by the Ethics Committee of the State Chamber of Physicians of Rhineland-Palatinate, Germany (protocol code: 2023-17066-retrospektiv; date of approval: 17 July 2023).

Informed Consent Statement

Patient consent was waived, as the study used anonymized patient data, which was approved by the ethics committee.

Data Availability Statement

The data presented in this study are available on request from the corresponding author due to data privacy and ethical restrictions.

Acknowledgments

The use of LLMs, specifically ChatGPT 4o (OpenAI, San Francisco, CA, USA), in this study was limited to enhancing the readability and refining the language by improving the vocabulary and structure of pre-generated text by the authors.

Conflicts of Interest

The author F.N. is an employee of OSORA medical GmbH. F.N. provided technical support for this study by illustrating the results. F.N. had no influence on the design of the study, in conducting the experiment, or in the analysis or interpretation of the research results. The conclusions of this study are the result of an independent scientific investigation and reflect exclusively the views of the researchers. The authors hereby declare that no conflicts of interest exist.

Abbreviations

AOArbeitsgemeinschaft für Osteosynthesefragen (Association for the Study of Internal Fixation)
BMIBody mass index
CTComputed Tomography
FNFalse negative
FPFalse positive
LEG-NUILower Extremity Grading of Nonunion Index
NSAIDNonsteroidal anti-inflammatory drug
NUNonunion
NURDNonunion Risk Determination Score
NPVNegative predictive value
ORIFOpen Reduction and Internal Fixation
OTAOrthopaedic Trauma Association
PPVPositive predictive value
TENTitanium Elastic Nail

References

  1. Moghaddam, A.; Zietzschmann, S.; Bruckner, T.; Schmidmaier, G. Treatment of Atrophic Tibia Non-Unions According to “Diamond Concept”: Results of One- and Two-Step Treatment. Injury 2015, 46 (Suppl. S4), S39–S50. [Google Scholar] [CrossRef]
  2. Everding, J.; Roßlenbroich, S.; Raschke, M.J. Pseudarthrosen der langen Röhrenknochen. Chirurgie 2018, 89, 73–88. [Google Scholar] [CrossRef] [PubMed]
  3. Armbruster, J.; Bussmann, F.; Freischmidt, H.; Reiter, G.; Gruetzner, P.A.; El Barbari, J.S. Treatment of High-Grade Chronic Osteomyelitis and Nonunions with PerOssal®: A Retrospective Analysis of Clinical Efficacy and Patient Perspectives. J. Clin. Med. 2024, 13, 7764. [Google Scholar] [CrossRef] [PubMed]
  4. Brinker, M.R.; Hanus, B.D.; Sen, M.; O’Connor, D.P. The Devastating Effects of Tibial Nonunion on Health-Related Quality of Life. J. Bone Jt. Surg. Am. 2013, 95, 2170–2176. [Google Scholar] [CrossRef]
  5. Freischmidt, H.; Guehring, T.; Thomé, P.; Armbruster, J.; Reiter, G.; Grützner, P.A.; Nolte, P.-C. Treatment of Large Femoral and Tibial Bone Defects with Plate-Assisted Bone Segment Transport. J. Orthop. Trauma 2024, 38, 285–290. [Google Scholar] [CrossRef]
  6. Freischmidt, H.; Reiter, G.; Grützner, P.A.; Armbruster, J. Augmentation in surgical sepsis: Chances and limitations in the treatment of osteitis with calcium hydroxyapatite containing antibiotics. Unfallchirurgie 2022, 125, 452–459. [Google Scholar] [CrossRef]
  7. Reeh, F.M.; Sachse, S.; Wedekind, L.; Hofmann, G.O.; Lenz, M. Nonunions and Their Operative Treatment—A DRG-Based Epidemiological Analysis for the Years 2007–2019 in Germany. Dtsch. Arztebl. Int. 2022, 119, 869–875. [Google Scholar] [CrossRef]
  8. Walter, N.; Hierl, K.; Brochhausen, C.; Alt, V.; Rupp, M. The Epidemiology and Direct Healthcare Costs of Aseptic Nonunions in Germany—A Descriptive Report. Bone Jt. Res. 2022, 11, 541–547. [Google Scholar] [CrossRef]
  9. Tian, R.; Zheng, F.; Zhao, W.; Zhang, Y.; Yuan, J.; Zhang, B.; Li, L. Prevalence and Influencing Factors of Nonunion in Patients with Tibial Fracture: Systematic Review and Meta-Analysis. J. Orthop. Surg. Res. 2020, 15, 377. [Google Scholar] [CrossRef]
  10. Andrzejowski, P.; Giannoudis, P.V. The ‘Diamond Concept’ for Long Bone Non-Union Management. J. Orthop. Traumatol. 2019, 20, 21. [Google Scholar] [CrossRef]
  11. Kanakaris, N.K.; Tosounidis, T.H.; Giannoudis, P.V. Surgical Management of Infected Non-Unions: An Update. Injury 2015, 46 (Suppl. S5), S25–S32. [Google Scholar] [CrossRef]
  12. Jiao, H.; Xiao, E.; Graves, D.T. Diabetes and Its Effect on Bone and Fracture Healing. Curr. Osteoporos. Rep. 2015, 13, 327–335. [Google Scholar] [CrossRef]
  13. Lazzarini, L.; Mader, J.T.; Calhoun, J.H. Osteomyelitis in Long Bones. J. Bone Jt. Surg. Am. 2004, 86, 2305–2318. [Google Scholar] [CrossRef]
  14. Beltran, M.J.; Collinge, C.A.; Gardner, M.J. Stress Modulation of Fracture Fixation Implants. J. Am. Acad. Orthop. Surg. 2016, 24, 711–719. [Google Scholar] [CrossRef] [PubMed]
  15. Gigerenzer, G.; Brighton, H. Homo Heuristicus: Why Biased Minds Make Better Inferences. Top. Cogn. Sci. 2009, 1, 107–143. [Google Scholar] [CrossRef] [PubMed]
  16. Whelehan, D.F.; Conlon, K.C.; Ridgway, P.F. Medicine and Heuristics: Cognitive Biases and Medical Decision-Making. Ir. J. Med. Sci. 2020, 189, 1477–1484. [Google Scholar] [CrossRef]
  17. McDonald, C.J. Medical Heuristics: The Silent Adjudicators of Clinical Practice. Ann. Intern. Med. 1996, 124, 56–62. [Google Scholar] [CrossRef]
  18. Kempainen, R.R.; Migeon, M.B.; Wolf, F.M. Understanding Our Mistakes: A Primer on Errors in Clinical Reasoning. Med. Teach. 2003, 25, 177–181. [Google Scholar] [CrossRef]
  19. Kalf, A.J.; Spruijt-Metz, D. Variation in Diagnoses: Influence of Specialists’ Training on Selecting and Ranking Relevant Information in Geriatric Case Vignettes. Soc. Sci. Med. 1996, 42, 705–712. [Google Scholar] [CrossRef]
  20. Taylor, M.K.; Marsh, E.J.; Samanez-Larkin, G.R. Heuristic Decision-Making across Adulthood. Psychol. Aging 2023, 38, 508–518. [Google Scholar] [CrossRef]
  21. Oestern, H.J.; Tscherne, H. Pathophysiology and classification of soft tissue damage in fractures. Orthopade 1983, 12, 2–8. [Google Scholar] [PubMed]
  22. JASP Team. JASP, version 0.19.3; JASP Team: Amsterdam, The Netherlands, 2024. Available online: https://jasp-stats.org/ (accessed on 21 January 2025).
  23. McKinney, W. Data Structures for Statistical Computing in Python. SciPy 2010, 445, 51–56. [Google Scholar] [CrossRef]
  24. Waskom, M.L. Seaborn: Statistical Data Visualization. J. Open Source Softw. 2021, 6, 3021. [Google Scholar] [CrossRef]
  25. Zura, R.; Watson, J.T.; Einhorn, T.; Mehta, S.; Rocca, G.J.D.; Xiong, Z.; Wang, Z.; Jones, J.; Steen, R.G. An Inception Cohort Analysis to Predict Nonunion in Tibia and 17 Other Fracture Locations. Injury 2017, 48, 1194–1203. [Google Scholar] [CrossRef] [PubMed]
  26. Park, S.; Ahn, J.; Gee, A.O.; Kuntz, A.F.; Esterhai, J.L. Compartment Syndrome in Tibial Fractures. J. Orthop. Trauma 2009, 23, 514. [Google Scholar] [CrossRef]
  27. Joo, Y.-B.; Kim, Y.-M.; Park, Y.-C.; Chae, S.-H.; Kim, D.-H. Evaluating Meniscus, Ligament and Soft Tissue Injury Using MRI in Tibial Plateau Fractures: A Tscherne Classification Approach. Medicina 2024, 60, 2073. [Google Scholar] [CrossRef]
  28. Dickson, K.; Katzman, S.; Delgado, E.; Contreras, D. Delayed Unions and Nonunions of Open Tibial Fractures: Correlation with Arteriography Results. Clin. Orthop. Relat. Res. 1994, 302, 189–193. [Google Scholar] [CrossRef]
  29. Burrus, M.T.; Werner, B.C.; Yarboro, S.R. Obesity Is Associated with Increased Postoperative Complications After Operative Management of Tibial Shaft Fractures. Injury 2016, 47, 465–470. [Google Scholar] [CrossRef]
  30. Armstrong, B.A.; Dutescu, I.A.; Tung, A.; Carter, D.N.; Trbovich, P.L.; Wong, S.; Saposnik, G.; Grantcharov, T. Cognitive Biases in Surgery: Systematic Review. Br. J. Surg. 2023, 110, 645–654. [Google Scholar] [CrossRef]
  31. Edelstein, A.I.; Tanenbaum, J.T.; McGinley, E.L.; Dillingham, T.R.; Pezzin, L.E. Age-Based Heuristics Bias Treatment of Displaced Femoral Neck Fractures in the Elderly. Arthroplast. Today 2024, 27, 101356. [Google Scholar] [CrossRef]
  32. Janssen, S.J.; Teunis, T.; Ring, D.; Parisien, R.C. Cognitive Biases in Orthopaedic Surgery. J. Am. Acad. Orthop. Surg. 2021, 29, 624–633. [Google Scholar] [CrossRef] [PubMed]
  33. Chokshi, N.B.K.; Karmakar, B.; Pathan, S.K.; Joshi, V.; Gohel, D.M.; Coulshed, D.S.; Negishi, K.; Pathan, F.K. A Systematic Review of Frailty Scores Used in Heart Failure Patients. Heart Lung Circ. 2023, 32, 441–453. [Google Scholar] [CrossRef] [PubMed]
  34. Kabiri, A.; Gharin, P.; Forouzannia, S.A.; Ahmadzadeh, K.; Miri, R.; Yousefifard, M. HEART versus GRACE Score in Predicting the Outcomes of Patients with Acute Coronary Syndrome; a Systematic Review and Meta-Analysis. Arch. Acad. Emerg. Med. 2023, 11, e50. [Google Scholar] [CrossRef] [PubMed]
  35. Liu, X.; Wang, S.; He, W.; Guo, L. HAS-BLED vs. ORBIT Scores in Anticoagulated Patients with Atrial Fibrillation: A Systematic Review and Meta-Analysis. Front. Cardiovasc. Med. 2022, 9, 1042763. [Google Scholar] [CrossRef]
  36. Quarta, D.; Grassi, M.; Lattanzi, G.; Gigante, A.P.; D’Anca, A.; Potena, D. Three Predictive Scores Compared in a Retrospective Multicenter Study of Nonunion Tibial Shaft Fracture. World J. Orthop. 2024, 15, 560–569. [Google Scholar] [CrossRef]
  37. Brown, C.; Nazeer, R.; Gibbs, A.; Le Page, P.; Mitchell, A.R. Breaking Bias: The Role of Artificial Intelligence in Improving Clinical Decision-Making. Cureus 2023, 15, e36415. [Google Scholar] [CrossRef]
Figure 1. Demonstration of a patient case as presented to the clinical raters for healing prediction: (a) preoperative biplanar X-rays and patient background information; (b) postoperative biplanar X-rays (original text in German). R = right side.
Figure 1. Demonstration of a patient case as presented to the clinical raters for healing prediction: (a) preoperative biplanar X-rays and patient background information; (b) postoperative biplanar X-rays (original text in German). R = right side.
Jcm 14 02713 g001
Figure 2. Fracture types according to the AO classification.
Figure 2. Fracture types according to the AO classification.
Jcm 14 02713 g002
Figure 3. Performance ratings of all raters combined: Postoperative information like surgical reduction and implant information significantly increases the sensitivity and reduces clinician raters’ specificity. Sensitivitypreop = 50.4% vs. Sensitivitypostop = 60.2%, p < 0.001; Specificitypreop = 74.0% vs. Specificitypostop = 70.7%, p = 0.08; PPVpreop = 38.1% vs. PPVpostop = 35.5%, p = 0.40; NPVpreop = 85.8% vs. NPVpostop = 87.7%, p < 0.001. *** = p < 0.001. Error bars: bootstrapped 95% confidence intervals.
Figure 3. Performance ratings of all raters combined: Postoperative information like surgical reduction and implant information significantly increases the sensitivity and reduces clinician raters’ specificity. Sensitivitypreop = 50.4% vs. Sensitivitypostop = 60.2%, p < 0.001; Specificitypreop = 74.0% vs. Specificitypostop = 70.7%, p = 0.08; PPVpreop = 38.1% vs. PPVpostop = 35.5%, p = 0.40; NPVpreop = 85.8% vs. NPVpostop = 87.7%, p < 0.001. *** = p < 0.001. Error bars: bootstrapped 95% confidence intervals.
Jcm 14 02713 g003
Figure 4. Performance ratings of raters categorized by professional position: (a) Preoperative evaluation. No significant differences were observed between positions. (b) Postoperative evaluation. No significant differences were observed between positions. (c) Delta between preoperative and postoperative evaluation. Postoperative information, such as surgical reduction and implant details, led to a significant increase in sensitivity, particularly among specialists and residents, but also reduced their specificity. Additionally, the NPV of specialists showed a significant but marginal increase. However, no significant differences were observed in the delta between preoperative and postoperative evaluations across the categorized positions. ++ = p < 0.01, + = p < 0.05, preoperative to postoperative metric. Error bars: bootstrapped 95% confidence intervals.
Figure 4. Performance ratings of raters categorized by professional position: (a) Preoperative evaluation. No significant differences were observed between positions. (b) Postoperative evaluation. No significant differences were observed between positions. (c) Delta between preoperative and postoperative evaluation. Postoperative information, such as surgical reduction and implant details, led to a significant increase in sensitivity, particularly among specialists and residents, but also reduced their specificity. Additionally, the NPV of specialists showed a significant but marginal increase. However, no significant differences were observed in the delta between preoperative and postoperative evaluations across the categorized positions. ++ = p < 0.01, + = p < 0.05, preoperative to postoperative metric. Error bars: bootstrapped 95% confidence intervals.
Jcm 14 02713 g004
Figure 5. Correlation between raters’ performance metrics and experience. No significant correlation was found between performance metrics and rater experience. Preoperative PPV showed the highest correlation with experience but remained insignificant (Pearson’s r = 0.386, p = 0.062): (a) preoperative evaluation; (b) postoperative evaluation. Shaded areas indicate 95% confidence intervals.
Figure 5. Correlation between raters’ performance metrics and experience. No significant correlation was found between performance metrics and rater experience. Preoperative PPV showed the highest correlation with experience but remained insignificant (Pearson’s r = 0.386, p = 0.062): (a) preoperative evaluation; (b) postoperative evaluation. Shaded areas indicate 95% confidence intervals.
Jcm 14 02713 g005
Figure 6. Violin plots illustrating changes from preoperative to postoperative ratings after receiving additional information. The plots display the total number of changes and the counts of beneficial and detrimental changes for each individual clinician (black dots).
Figure 6. Violin plots illustrating changes from preoperative to postoperative ratings after receiving additional information. The plots display the total number of changes and the counts of beneficial and detrimental changes for each individual clinician (black dots).
Jcm 14 02713 g006
Figure 7. Mean prediction error by clinicians, stratified by the presence (1) or absence (0) of clinically relevant patient factors. Positive prediction errors reflect a tendency to overestimate the factor’s impact (false positives), whereas negative prediction errors indicate underestimation (false negatives): (a) compartment syndrome; (b) open fracture; (c) polytrauma; (d) post-surgical fracture gap (>4 mm); (e) obesity; (f) smoking. Error bars: bootstrapped 95% confidence intervals. Note the adjusted y-axis scales in (e,f) to better fit the data.
Figure 7. Mean prediction error by clinicians, stratified by the presence (1) or absence (0) of clinically relevant patient factors. Positive prediction errors reflect a tendency to overestimate the factor’s impact (false positives), whereas negative prediction errors indicate underestimation (false negatives): (a) compartment syndrome; (b) open fracture; (c) polytrauma; (d) post-surgical fracture gap (>4 mm); (e) obesity; (f) smoking. Error bars: bootstrapped 95% confidence intervals. Note the adjusted y-axis scales in (e,f) to better fit the data.
Jcm 14 02713 g007
Table 1. Inclusion and exclusion criteria for patient selection.
Table 1. Inclusion and exclusion criteria for patient selection.
InclusionExclusion
  • Tibial shaft fractures classified as AO Type 42, including all morphological subcategories
  • Surgical management (plating or intramedullary nailing) performed between January 2018 and December 2023
  • Insufficient follow-up (less than 9 months) or until osseous consolidation
  • No biplanar radiographs taken at the date of injury and immediately postoperatively available
Table 2. Patient information in addition to pre- and postoperative biplanar X-rays.
Table 2. Patient information in addition to pre- and postoperative biplanar X-rays.
Parameter GroupParameter [Unit]
Preoperative
  • Age [years]
  • Sex [male/female]
  • Weight [kg]
  • Past medical history
  • Compartment syndrome [yes/no]
  • Soft tissue damage, according to Tscherne und Oestern [21]:
    Closed fractures [Grade I–III]
    Open fractures [Grade I–IV]
  • One-step or two-step surgical protocol
  • Polytrauma
  • Previous infection [yes/no]
  • Plastic soft tissue coverage [yes/no]
  • Use of additional allograft or autograft
  • Allergies
Postoperative
  • Name and size of orthopedic implant
Table 3. Patient characteristics. Percentage rounded to whole numbers. n/a = not assessable; TEN = Titanium Elastic Nail.
Table 3. Patient characteristics. Percentage rounded to whole numbers. n/a = not assessable; TEN = Titanium Elastic Nail.
Patient
Characteristics
FrequencyPercent [%]Mean ± SD
Age [years]Total9810039.9 ± 16.5
Male686939.8 ± 16.7
Female303140.1 ± 16.0
TreatmentNail7071
Plate2728
TEN11
Open fractureYes2930
No6970
ObesityYes77
No9193
SmokingYes99
No8991
Compartment syndromeYes99
No8991
PolytraumaYes66
No9294
Plastic soft tissue coverageYes33
no9597
Fibula fracturedYes7678
No2222
Soft tissue damage0° or n/a6970
1818
77
44
Table 4. Rater characteristics.
Table 4. Rater characteristics.
PositionFrequency [n]Mean Experience ± SD [Years]
Heads of Surgical Departments428.8 ± 6.5
Consultants418.8 ± 6.5
Specialists48.8 ± 3.1
Residents43.8 ± 1.3
Interns40.0 ± 0.0
Radiologists410.0 ± 4.1
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Armbruster, J.; Steinhausen, E.; Hackl, S.; Reumann, M.K.; Stengel, D.; Niemeyer, F.; Reiter, G.; Gruetzner, P.A.; Freischmidt, H. The Power of Heuristics in Predicting Fracture Nonunion. J. Clin. Med. 2025, 14, 2713. https://doi.org/10.3390/jcm14082713

AMA Style

Armbruster J, Steinhausen E, Hackl S, Reumann MK, Stengel D, Niemeyer F, Reiter G, Gruetzner PA, Freischmidt H. The Power of Heuristics in Predicting Fracture Nonunion. Journal of Clinical Medicine. 2025; 14(8):2713. https://doi.org/10.3390/jcm14082713

Chicago/Turabian Style

Armbruster, Jonas, Eva Steinhausen, Simon Hackl, Marie K. Reumann, Dirk Stengel, Frank Niemeyer, Gregor Reiter, Paul Alfred Gruetzner, and Holger Freischmidt. 2025. "The Power of Heuristics in Predicting Fracture Nonunion" Journal of Clinical Medicine 14, no. 8: 2713. https://doi.org/10.3390/jcm14082713

APA Style

Armbruster, J., Steinhausen, E., Hackl, S., Reumann, M. K., Stengel, D., Niemeyer, F., Reiter, G., Gruetzner, P. A., & Freischmidt, H. (2025). The Power of Heuristics in Predicting Fracture Nonunion. Journal of Clinical Medicine, 14(8), 2713. https://doi.org/10.3390/jcm14082713

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop