1. Introduction
Adolescent idiopathic scoliosis (AIS) is the most common form of structural spinal deformity in adolescents, with surgical treatment indicated in patients with progressive curves or significant deformity. Posterior spinal fusion remains the standard surgical approach for thoracic AIS, aiming to achieve durable coronal and sagittal correction while preserving spinal balance and optimizing patient-reported outcomes. Among surgically treated patients, those with Lenke type 1 (main thoracic) curves represent a relatively homogeneous subgroup, yet important variability exists in surgical strategy, particularly with respect to fusion level selection [
1,
2].
Selection of the upper instrumented vertebra (UIV) in Lenke type 1 AIS has long been recognized as a critical determinant of postoperative shoulder and proximal coronal balance. Previous studies have emphasized the importance of radiographic parameters such as T1 tilt, trunk shift, clavicular or shoulder height asymmetry, and upper thoracic vertebral orientation when determining whether fusion should be extended proximally [
3,
4,
5,
6,
7]. While extending the fusion to higher thoracic levels (e.g., T2 or T3) may improve proximal balance in selected patients, it also increases fusion length and may theoretically influence postoperative function and perceived stiffness. Conversely, limiting the fusion to lower thoracic levels (e.g., T4) allows for shorter constructs but may carry a risk of residual or postoperative shoulder imbalance [
5,
8,
9].
Despite growing interest in radiographic predictors of postoperative balance, the clinical relevance of residual proximal imbalance remains controversial. Several recent investigations have suggested that radiographic measures of shoulder or coronal imbalance do not consistently correlate with patient-reported outcomes following AIS surgery [
10,
11,
12,
13,
14]. Patient-reported outcome measures such as the Scoliosis Research Society—22 revised questionnaire (SRS-22r) and the Pediatric Quality of Life Inventory (PedsQL) capture broader dimensions of recovery, including pain, self-image, mental health, and overall quality of life, which may not directly mirror radiographic alignment alone [
13,
14,
15,
16]. Consequently, the extent to which differences in UIV selection translate into meaningful differences in long-term clinical outcomes remains unclear.
Importantly, most previous studies evaluating UIV selection have focused either on radiographic endpoints or on short- to mid-term follow-up, and many have included heterogeneous curve types or mixed fusion strategies. There remains a lack of long-term data specifically addressing whether radiological differences associated with UIV selection in Lenke type 1 AIS persist over time and, more importantly, whether such differences have a sustained impact on patient-reported clinical outcomes. This represents a relevant knowledge gap, particularly given the emphasis on individualized surgical decision-making and long-term functional recovery in contemporary AIS management.
Therefore, the purpose of the present study was to evaluate radiological and clinical outcomes in a homogeneous cohort of Lenke type 1 AIS patients undergoing posterior spinal fusion with the upper instrumented vertebra selected at either T2 or T4, with a minimum follow-up of five years. We hypothesized that although UIV selection would be associated with differences in proximal balance-related radiological parameters, these differences would not translate into clinically meaningful differences in patient-reported outcomes. Specifically, we aimed to test the hypothesis that radiological variations in parameters such as T1 tilt, shoulder imbalance, and trunk shift may be observed between fusion strategies, while overall clinical recovery patterns remain comparable.
2. Methods
2.1. Study Design and Patient Selection
This retrospective cohort study included patients with AIS who underwent posterior spinal fusion at a single tertiary spine center between January 2015 and December 2020. Only patients with Lenke type 1 (main thoracic) curves were included to ensure a homogeneous study population. During the study period, a total of 120 consecutive patients with Lenke type 1 AIS underwent posterior spinal fusion (T2:
n = 68; T4:
n = 52). Twelve patients (T2:
n = 7; T4:
n = 5) who developed major postoperative complications were excluded from the final radiological and clinical outcome analyses. Patients who developed major postoperative complications were excluded from the longitudinal radiological and patient-reported outcome analyses in order to preserve surgical and recovery homogeneity. Major complications requiring revision surgery may substantially alter long-term recovery trajectories and quality-of-life patterns, independent of UIV selection. Complication rates were comparable between groups (T2: 7/68 vs. T4: 5/52; Fisher’s exact test,
p = 1.00). The remaining 108 patients constituted the final study cohort (T2:
n = 61; T4:
n = 47). Surgical complications observed in the initially treated cohort are summarized in
Supplementary Table S1. Patients with non-idiopathic scoliosis, additional structural curves requiring fusion, prior spinal surgery, or incomplete radiographic or clinical follow-up were excluded. Patients requiring additional posterior column osteotomies (Ponte osteotomies) were excluded to maintain surgical homogeneity, as these procedures may influence correction magnitude, postoperative pain, recovery trajectory, and complication profile. All included patients had a minimum follow-up of 5 years. Demographic, radiological, and clinical data were extracted from institutional medical records and imaging archives. The study protocol was approved by the local institutional ethics committee, and informed consent was waived due to the retrospective study design and the use of anonymized patient data.
2.2. Radiological Evaluation
Standing full-length posteroanterior and lateral spine radiographs were obtained preoperatively, postoperatively, and at the final follow-up. Coronal curve magnitude, thoracic kyphosis, lumbar lordosis, trunk shift, shoulder imbalance, and T1 tilt angle were measured using standard radiographic techniques. Proximal balance was primarily assessed using trunk shift, shoulder imbalance, and T1 tilt angle, as these parameters are commonly used to evaluate coronal and shoulder balance in thoracic scoliosis and are closely related to UIV selection. All radiographic measurements were performed by a single experienced observer using a digital PACS system (Sectra AB, Linköping, Sweden) standardized measurement protocols. Each parameter was measured twice at separate time points, and mean values were used for analysis.
2.3. Assessment of Curve Flexibility
Preoperative curve flexibility was assessed using standing posteroanterior radiographs and preoperative bending radiographs, in accordance with standard practice in AIS surgery. Bending radiographs were obtained to evaluate the reducibility of the main thoracic curve and to support surgical planning. In addition, intraoperative assessment of curve flexibility was performed under general anesthesia using fluoroscopic imaging, allowing evaluation of spontaneous curve correction and segmental alignment in a relaxed state.
Information derived from preoperative and intraoperative flexibility assessment was used to support the determination of the extent of instrumentation, including the selection of fusion levels, with particular attention to overall coronal balance and segmental alignment. These assessments were integrated with established radiographic parameters and surgeon judgment rather than applied as isolated decision criteria.
2.4. Upper Instrumented Vertebra Selection
Selection of the UIV was based on preoperative radiographic characteristics and proximal balance considerations, in accordance with commonly accepted principles for Lenke type 1 scoliosis. Fusion was extended to T2 in patients demonstrating preoperative features associated with an increased risk of postoperative proximal imbalance, including clinically evident shoulder asymmetry in conjunction with increased T1 tilt and coronal trunk shift. In contrast, T4 was selected as the UIV in patients with relatively balanced shoulders, lower preoperative T1 tilt, and satisfactory proximal coronal alignment, allowing for a shorter fusion construct while preserving overall balance. UIV selection was individualized for each patient and reflected the surgeon’s judgment aimed at optimizing postoperative proximal balance rather than random assignment. Consequently, differences between the T2 and T4 groups were considered inherent to the clinical decision-making process.
2.5. Surgical Technique
All patients underwent posterior spinal fusion using a standard midline posterior approach. Segmental pedicle screw instrumentation (Tasarımmed Medical Devices, Istanbul, Turkey) was applied across the fused levels. Deformity correction was achieved using rod derotation, translation techniques, and selective compression and distraction to restore coronal alignment and thoracic kyphosis. Care was taken to avoid excessive proximal overcorrection in order to maintain shoulder and coronal balance. Local autograft bone obtained during posterior element decortication was used for fusion. No anterior procedures or osteotomies beyond standard posterior releases were performed in this cohort.
All constructs utilized 6.5 mm titanium rods. Pedicle screws of 5.5 mm and 6.5 mm in diameter were used according to patient age and pedicle anatomy at thoracic and lumbar levels, without systematic differences between cohorts. Screw density strategy and correction maneuvers were applied consistently across groups, and the only intended structural difference between cohorts was the selected UIV level. Particular attention was paid to proximal rod contouring and controlled correction maneuvers in order to avoid excessive proximal overcorrection. Intraoperative assessment of shoulder balance and coronal alignment was performed following rod insertion and correction maneuvers, with adjustments made as necessary to maintain harmonious proximal alignment. Fusion levels were determined based on preoperative radiographic characteristics, curve flexibility, and overall coronal balance rather than applying a fixed algorithm.
2.6. Postoperative Management and Rehabilitation
All patients followed a standardized postoperative care protocol. Early mobilization was initiated on the first postoperative day under supervision. A thoracolumbosacral orthosis (TLSO) brace was routinely prescribed for approximately 45 days following surgery. The brace was intended to provide temporary external support during the early postoperative phase, facilitating patient confidence during mobilization and daily activities rather than serving as a primary stabilizing measure.
Patients were encouraged to resume daily activities gradually, with avoidance of high-impact sports during the early recovery period. Return to non-contact sports was generally permitted after radiographic evidence of progressive fusion, whereas unrestricted activity was allowed following clinical and radiographic confirmation of stable fusion. Postoperative care and rehabilitation guidance were consistent across cohorts.
2.7. Clinical Outcome Assessment
Clinical outcomes were assessed preoperatively and at final follow-up using validated Turkish versions of the PedsQL and SRS-22r questionnaires [
17,
18]. Both instruments have undergone formal linguistic and cultural validation for Turkish-speaking populations. The questionnaires were completed by the patients in the presence of a parent, who provided assistance when clarification of specific items was required. PedsQL domain scores were calculated according to standard scoring guidelines, with higher scores indicating better health-related quality of life. SRS-22r domain scores were calculated as the mean of the corresponding item responses. The Satisfaction domain of the SRS-22r was not included in longitudinal analyses because it is assessed at a single postoperative time point and therefore is not appropriate for preoperative–postoperative comparison.
2.8. Statistical Analysis
Statistical analyses were performed to evaluate changes over time within each cohort and to compare outcomes between UIV strategies. Continuous variables are presented as mean ± standard deviation, and categorical variables are presented as counts and percentages. Longitudinal within-group changes from the preoperative period to the final follow-up were analyzed using paired statistical tests, as appropriate. Formal between-group comparisons of final follow-up values and change-from-baseline (Δ) scores were performed using independent-sample t-tests. Effect sizes were calculated using Cohen’s d to quantify the magnitude of between-group differences. A two-sided p value < 0.05 was considered statistically significant. All statistical analyses were conducted using SPSS Statistics (version 26.0, IBM Corp., Armonk, NY, USA).
4. Discussion
The present study evaluated the long-term radiological and clinical outcomes of posterior spinal fusion in a homogeneous cohort of Lenke type 1 AIS patients, with particular emphasis on UIV selection at T2 versus T4. The principal finding is that, although UIV selection was associated with expected differences in proximal balance-related radiological parameters, these differences did not translate into clinically meaningful differences in patient-reported outcomes at a minimum follow-up of five years.
Consistent with prior reports, substantial and durable coronal correction was achieved postoperatively and maintained over time within each cohort, with preservation of global sagittal alignment [
1,
2,
8]. Restoration and maintenance of thoracic kyphosis and lumbar lordosis indicate that neither proximal fusion strategy adversely affected sagittal balance in Lenke type 1 curves [
2,
5]. These observations support the concept that contemporary posterior correction techniques provide reliable long-term radiographic stability across different fusion-level strategies when applied using established principles [
8,
9].
As anticipated, parameters related to proximal coronal and shoulder balance demonstrated time-dependent improvement within each cohort, with the magnitude of residual trunk shift, shoulder imbalance, and T1 tilt varying according to the selected UIV. These findings align with prior work emphasizing the relevance of proximal balance parameters in guiding fusion-level selection in thoracic AIS [
3,
4,
5,
6,
7]. Extending fusion proximally has been advocated in patients with preoperative features associated with an increased risk of postoperative shoulder imbalance, such as elevated shoulders, increased T1 tilt, or significant coronal trunk deviation [
5,
6]. The present data demonstrate that differences in proximal balance metrics persist at long-term follow-up, reinforcing the durable radiographic impact of the UIV strategy.
Preoperative bending radiographs demonstrated substantial flexibility of the main thoracic curve, and the calculated flexibility index indicated preserved reducibility within each cohort. This observation is clinically relevant, as curve flexibility is routinely integrated into fusion-level planning in Lenke type 1 AIS to optimize balance while avoiding unnecessary extension of instrumentation [
1,
2]. In this context, the observed long-term differences in proximal balance parameters are more plausibly attributable to UIV selection and proximal alignment considerations rather than baseline curve rigidity.
Despite variability in proximal balance-related radiological parameters, longitudinal patterns of clinical recovery were favorable within each cohort. Improvements over time were observed in multiple domains of health-related quality of life, pain, self-image, and mental health, as assessed by the PedsQL and SRS-22r instruments. Prior studies have similarly reported a dissociation between radiographic alignment and patient-reported outcomes, suggesting that objective measures of balance do not consistently predict patient-perceived surgical success [
11,
12,
13,
14].
This dissociation underscores the multifactorial nature of postoperative recovery in AIS. While radiographic parameters provide an objective assessment of deformity correction and alignment, patient-reported outcomes reflect broader physical, psychological, and social dimensions of health [
13,
14,
15]. In this setting, modest residual differences in proximal balance may be well tolerated and may not meaningfully influence long-term quality of life. The present findings add long-term evidence supporting this concept in a strictly defined Lenke type 1 population [
15,
16].
An additional observation concerns the trajectory of physical functioning. Although overall recovery patterns were favorable, physical functioning scores demonstrated a decline over time within each cohort. This domain-specific pattern is consistent with reduced spinal mobility after fusion, long-term activity modification, and adaptive movement behaviors, rather than inadequate deformity correction or surgical failure [
16]. Importantly, between-group comparisons demonstrated similar change-from-baseline trajectories and small effect sizes, suggesting that this decline does not represent a strategy-specific disadvantage. Furthermore, previously reported minimal clinically important difference (MCID) thresholds for pediatric quality-of-life instruments indicate that numerical score changes do not necessarily equate to clinically meaningful deterioration. This decline occurred alongside significant improvements in psychosocial and pain-related domains, suggesting that long-term recovery in AIS reflects adaptation and recalibration of physical expectations rather than a global deterioration in health status. The standardized use of a temporary TLSO brace during the early postoperative period may also have contributed to adaptive movement behaviors and activity modification in the short term. However, given the relatively brief duration of brace use, the observed long-term trajectory in physical functioning is more likely attributable to fusion-related stiffness and biomechanical adaptation rather than external immobilization alone.
Beyond quantitative radiographic metrics, assessment of shoulder balance itself is heterogeneous and depends on the measurement technique and observer perspective. Recent studies have shown that cosmetic shoulder balance may vary when evaluated from different viewpoints, highlighting that radiographic symmetry does not always capture patient-perceived appearance or satisfaction [
19]. This consideration is particularly relevant for UIV selection, where surgical planning often targets measurable indices such as T1 tilt or shoulder height difference, yet patient-facing outcomes may be influenced more by global appearance and psychosocial adaptation than by a single radiographic parameter.
Interpretation of patient-reported outcomes should likewise consider instrument-specific characteristics and limitations. Although the SRS-22r is widely used and endorsed for AIS, it may not fully capture nuanced cosmetic concerns or activity-specific limitations, particularly in long-term follow-up [
17]. Similarly, while the PedsQL provides a validated and culturally adapted assessment of general health-related quality of life in adolescents [
18], it is not deformity-specific and may be less sensitive to subtle changes in radiographic balance parameters. These considerations offer a plausible explanation for the present findings: UIV selection can modify proximal balance metrics in a measurable manner, whereas the downstream effect on patient-reported recovery may be moderated by patient adaptation and by the scope and sensitivity of the instruments employed.
From a broader perspective, postoperative shoulder balance has long been described as an “elusive” endpoint in AIS surgery, reflecting the complex interaction among curve pattern, correction magnitude, proximal thoracic alignment, and fusion-level selection [
20,
21]. Prior investigations examining fusion to T2, T3, or T4 have reported radiographic differences related to shoulder balance, yet consistent clinical superiority of one level over another has been difficult to demonstrate across heterogeneous cohorts [
22]. By focusing on a homogeneous Lenke type 1 population with long-term follow-up, the present study supports contemporary guidance favoring individualized fusion-level selection rather than a uniform strategy [
23]. This approach is best contextualized within the Lenke classification framework, which remains foundational for understanding curve behavior and tailoring surgical planning in AIS [
10,
24].
This study has several notable strengths. The analysis was restricted to a homogeneous Lenke type 1 cohort, minimizing confounding related to curve heterogeneity and enabling focused evaluation of the UIV strategy. Surgical technique and postoperative management were standardized, and patients requiring posterior column osteotomies or major revision procedures were excluded to preserve procedural consistency. The minimum follow-up of five years allowed assessment of the durability of radiographic alignment and longitudinal patient-reported outcomes. Finally, integration of objective radiological parameters with validated, culturally adapted outcome measures provides a comprehensive perspective on both structural correction and patient-perceived recovery.
Several limitations warrant consideration. First, the retrospective, single-center design may limit generalizability and introduce inherent selection bias. In addition, UIV selection was not randomized but based on preoperative radiographic characteristics and surgeon judgment. Furthermore, patients requiring posterior column osteotomies or those who developed major postoperative complications were excluded to preserve surgical and recovery homogeneity. Major complications, particularly those requiring revision surgery, may substantially alter long-term recovery trajectories and quality-of-life patterns independent of UIV selection. Although complication distribution was comparable between cohorts, exclusion of these cases may limit generalizability to more complex recovery pathways. In addition, validated outcome measures were used, and therefore, subtle cosmetic perceptions and activity-specific limitations could not be fully captured. Although the sample size was moderate, effect size estimates were generally small across patient-reported outcome domains, suggesting that the absence of statistically significant between-group differences is unlikely to reflect a masked clinically meaningful effect. In addition, radiographic measurements were performed by a single observer. Although each parameter was measured twice and mean values were used for analysis to enhance internal consistency, formal interobserver reliability analysis was not conducted. This represents a methodological limitation.